Research Article |
|
Corresponding author: Juan David Ramírez ( juan.ramirezgonzalez@mountsinai.org ) Academic editor: Alexander Probst
© 2023 Luisa Paez-Triana, Giovanny Herrera, Laura Vega, Diego García-Corredor, Martin Orlando Pulido Medellín, Alberto Paniz-Mondolfi, Marina Muñoz, Juan David Ramírez.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Paez-Triana L, Herrera G, Vega L, Garcia-Corredor D, Pulido Medellín MO, Paniz-Mondolfi A, Muñoz M, Ramírez JD (2023) Metagenomic exploration of endosymbionts and pathogens in the tropical lineage of Rhipicephalus sanguineus sensu lato (s.l.) ticks in Colombia. Metabarcoding and Metagenomics 7: e109085. https://doi.org/10.3897/mbmg.7.109085
|
Rhipicephalus sanguineus sensu lato (s.l.), commonly known as the domestic dog tick, is a globally distributed tick. This species plays a significant role in human and animal health, as well as in economy due to its ability to infest livestock. The use of shotgun metagenomics has proven valuable in identifying tick-borne pathogens and key members of the tick microbiome, including endosymbionts. However, the application of shotgun metagenomics in R. sanguineus s.l. ticks in Latin America remains largely unexplored. Therefore, our objective aimed to explore and further analyze the metagenome of the tropical lineage of R. sanguineus s.l. ticks in Colombia. Through our analysis, we identified the three most prevalent pathogens harbored by these ticks, namely: Anaplasma phagocytophilum, Francisella tularensis, and Theileria equi. However, the most abundant microorganism detected was not a pathogen but the endosymbiont Coxiella mudrowiae. Interestingly, Coxiella mudrowiae exhibited significant negative correlations with several pathogens and other endosymbionts. Furthermore, we successfully constructed 27 medium-quality metagenome-assembled genomes (MAGs) for this microorganism, enabling us to conduct a pangenome analysis by comparing them with available genomes and identifying proteins of interest, such as those involved in vitamin B synthesis. This study represents the first implementation of shotgun metagenomics as a methodology to expand our understanding of pathogens and endosymbionts in the circulating tropical lineage of R. sanguineus s.l. ticks in Colombia. The findings of this research serve as a foundation for the development of prevention and mitigation strategies against pathogens transmitted by this tick species. Information gained from this study can contribute to the improvement of public health measures and veterinary practices aimed at controlling the impact of tick-borne diseases.
Metagenome, microbiome, R. sanguineus s.l, shotgun-metagenomics, tick
Ticks are blood-feeding arthropods that serve as obligate vectors, capable of transmitting a wide range of pathogens, surpassing other types of vectors in terms of diversity (
It is widely recognized that ticks have the ability to harbor and transmit a diverse range of microorganisms, making them important vectors for diseases (
Moreover, comprehensive understanding of the tick microbiota is crucial for unraveling the complex interactions within this community and their implications for tick biology and disease transmission. Traditional methods used to identify microorganisms in ticks have limitations in terms of sensitivity, as they only capture a fraction of the tick’s microbiota (
The study of R. sanguineus s.l. ticks in South America, particularly in Colombia, is limited despite its significance in developing and tropical countries. A study by
A total of 38 adult ticks, collected from dogs, were obtained from the Casanare and Santander departments in Colombia. In the municipality of Yopal, Casanare (5°19'50"N, 72°23'26"W), we collected nine females and eleven males. Similarly, in the municipality of Puente Nacional, Santander (5°52'38"N, 73°40'43"W), we collected ten males and eight females (Suppl. materials
A graphical summary for all the methods is in Suppl. material
To identify pathogens and endosymbionts, a custom database (Number of genomes (N): 2153, Number of genomes remove (NR): 524) was created by compiling relevant information from the literature. Genomes of Anaplasma (N: 50, NR: 2), Rickettsia (N: 146, NR: 73), Francisella (N: 1075, NR: 161), Ehrlichia (N: 44, NR: 2), Borrelia (Lyme Disease) (N: 379, NR: 220), Coxiella (N: 158, NR: 0), Babesia (N: 5, NR: 2), and Theileria (N: 4, NR: 3) available in NCBI (RefSeq Annotated) were downloaded (Available genomes until July 2023). The database was then refined by excluding species that are not transmitted by ticks and those that are not known to circulate in the Americas. For the identification of endosymbionts, genomes of the Wolbachia (N: 292, NR: 62) were included in the genomes of endosymbionts belonging to the genus Rickettsia and Coxiella. Only the genera identified as endosymbionts of insects or arachnids were retained. The information of the genomes used are in Suppl. material
To explore differences in read abundance between sexes and departments, Wilcoxon tests were performed. The significance level was set at 0.05, and the analysis was conducted using RStudio software. This statistical test helped identify any significant variations in read abundance, providing valuable insights into potential differences associated with sex and geographical location. Additionally, to examine the relationships within and between different ecological communities, such as endosymbionts and pathogens, the Spearman’s nonparametric rank-order correlation with the Benjamini-Hochberg correction was employed. This statistical analysis was conducted using the “psych” package in RStudio. Significant correlations were identified when the p-value was less than 0.05. Strong correlations were defined as having a correlation coefficient (ρ) below -0.75 or above 0.75. The resulting correlations were then visualized using Cytoscape 3.9.0 (
To identify virulence factors, the Basic Local Alignment Search Tool (BLAST) was utilized. The host-filtered reads, obtained after aligning the reads to the Virulence Factors Database (VFDB), were subjected to BLASTn analysis, considering a minimum of 95% identity and at least an e-value of 7.04E-07 as a match. This approach allowed for the identification of genetic elements associated with the virulence of microorganisms (
The clean reads were subjected to assembly using MetaSpades v3.15.3 (
A step of dereplication was perform for all the MAGs, using the program deRep (
Furthermore, a comparative genomics analysis was conducted. Genomes identified as endosymbionts of ticks from the Coxiella genus and Coxiella burnetii RSA (GCF_000007765.2) were downloaded from NCBI. Amino acid trees of 40 proteins were constructed using the Uprot program in Taxxo (https://github.com/giraola/taxxo/wiki/Taxxo-wiki), which was executed within RStudio. These 40 proteins are universal phylogenetic markers that are single-copy and have been successfully used in various studies. ANIb (Average Nucleotide Identity using Blast) values were calculated between the complete genomes using the Taxxo tool. Genome pairs with an ANI score higher than 95.0% were considered to belong to the same species (
The annotation outputs from PROKKA were utilized to identify Clusters of Orthologous Groups (COG) with eggNOG-mapper v2 (
A total of 18.2 to 38.1 Mb of data per sample was obtained. However, the reads related to the host represented 88.5% of the reads (SD: ± 6). After the filter, only a small percentage of the sequences were successfully classified, with a mean of 19% (SD: ± 5). This left an average of 81% of the sequences unclassified in each sample. Among the classified sequences (Suppl. materials
Relative abundance of the top 10 A bacteria B archaea classified reads per tick, categorized by department and sexes. Families are indicated on the left side of the corresponding panel, while genera are presented on the right. Coxiellaceae *: For the bacterial genus (right panel), we excluded the reads from the Coxiellaceae family as they represent most of the relative abundance of the genus and were all assigned to Coxiella endosymbionts, which hindered the assessment of the abundance of other genera.
Although the overall composition remained consistent across all departments and sexes, significant differences were found in the most abundant genera (Suppl. material
Regarding pathogens (Suppl. material
Pathogens and endosymbionts present in each tick. A top 10 pathogens B top 10 Endosymbionts (excluding C. mudrowiae) C differences in relative abundance between departments and sexes for the top 3 pathogens D differences in relative abundance between departments and sexes for the top 3 endosymbionts E networks of positive (blue) and negative (red) correlations among the identified species. The significances are ranked as follows: The significances are ranked as follows: *: < 0.05 **: < 0.01 ***: < 0.001. WE: Wolbachia endosymbiont. RE: Rickettsia endosymbiont. CLE: Coxiella like endosymbiont.
Although no noticeable differences between sex and location were observed in the heatmaps (Fig.
The ticks showed a low abundance of reads (>52 reads per sample) related to virulence factors (Suppl. material
A total of 27 medium-quality MAGs belonging to the species Ca. Coxiella mudrowiae were successfully assembled (Fig.
Assembled MAGs belonging to C. mudrowiae species. A example (Sample An171) of the assembled MAGs and its components B shared genes between a representative of normal length MAG, a representative of longer length MAG, C. mudrowiae RSt, and C. mudrowiae RSA-CAT. This is a representation utilizing a reference MAG for each of the groups (Longer and Normal length). The red boxes highlight the Longer MAG, its contained gene count, and the substantial number of unique genes associated with it. C. Evolutionary insights between samples (without the Longer MAGs) based on the core genome (left panel) of the MAGs and available genomes, compared with a matrix where the core and accessory genes were either present or absent, which was graphically represented (right panel).
The phylogenetic analysis based on the core genome (Fig.
Comparative genomics and phylogenetic analysis of the obtained MAGs in relation to those available for C. mudrowiae. A genetic distance of SNPs between the available genomes and the obtained MAGs B phylogenetic network representing the relationships among the MAGs and available genomes C phylogenetic tree (bootstrap 1000 replicates, and jModelTest was used to select the best substitution method) based on 40 highly conserved proteins across various CLE, Coxiella burnetii, and MAGs (left panel). This phylogenetic tree is complemented by an ANIb analysis (right panel) for the entire genome conducted on CLE, Coxiella burnetii, and MAGs, with a threshold set at 95%.
Previous studies have suggested that the putative role of this endosymbiont involves synthesizing essential nutrients that are not readily available in the blood, considering the obligate hematophagy of this vector (
A genes involved in A. Vitamin B synthesis and B. cofactors identified in MAGs, C. mudrowiae RSt, C. mudrowiae RSA-CAT, and C. burnetii, along with the identity percentages of the proteins produced by them B phylogenetic tree (Bootstrap 1000 replicates) of BioB protein of all avaible genomes for CLE, C. burnetii and MAGs C phylogenetic tree (Bootstrap 1000 replicates) of coaE protein of all available genomes for CLE, C. burnetii and MAGs.
Ticks are recognized as the second most important vectors of diseases after mosquitoes (
Here we employed shotgun metagenomics to characterize the microbiome of 38 R. sanguineus s.l. specimens. The predominant bacterial composition in our samples was found to be from the Coxiellaceae family (Fig.
The presence of archaea and viruses in the samples was minimal in terms of abundance, but a diverse range of genera was identified within these groups (Fig.
To identify the pathogens and endosymbionts present in the samples and explore their interactions, a database was created using available genomes. As a result, three major pathogens were detected in the samples: A. phagocytophilum, which causes human granulocytic anaplasmosis (HGA) and anaplasmosis in animals (
On the other hand, a significant abundance of reads corresponded to various endosymbionts (Fig.
It is well-established that endosymbionts engage in ongoing interactions with the transmitted pathogens, which can either facilitate, restrict, or impede their transmission (
Similarly, considering the prominence of this endosymbiont in our samples, we assembled multiple MAGs and compared them to the genomes constructed thus far. Currently, two genomes have been characterized, one derived from R. sanguineus s.l and another from R. turanicus (
Furthermore, the MAGs formed a distinct monophyletic group separate from the assembled genomes. However, when analyzing average nucleotide identity (ANI) percentages and constructing a phylogenetic tree based on 40 informative proteins, the MAGs clustered with known Coxiella species and indicated a low degree of diversity within the country. This aligns with previous research that has attributed the limited intrapopulational diversity of C. mudrowiae to the restricted number of hosts, as represented by the two Coxiella strains derived from different tick species (
Finally, we conducted an assessment of specific genes that have been previously reported in this endosymbiont (Fig.
Despite certain limitations, such as competition for genomic resources between ticks and their pathogens, which can limit the number of assigned reads, and challenges associated with taxonomic assignment due to the scarcity of research on vector microbiomes, particularly in ticks, understanding the microbiome of these vectors is of utmost importance. It not only allows us to identify the pathogens that the tropical lineage of R. sanguineus s.l. could transmit or carry to a wide range of animals, including humans due to their close contact with domestic dogs, but it is also essential for the control of diseases transmitted by these ticks. This is particularly crucial considering the increasing emergence of resistance markers in pathogens worldwide. Understanding the dynamics of these microorganisms and their interaction with ticks is essential to effectively address the challenges related to veterinary and animal health. Additionally, it can provide valuable information for implementing more effective prevention and control strategies, thereby minimizing the impact of diseases transmitted by R. sanguineus s.l. on public health and the livestock industry. To gain a comprehensive understanding, it is imperative that future investigations focus on exploring the prevalence and circulation of the different pathogens identified in this study and implement active surveillance measures within the country.
Furthermore, conducting in-depth studies on the endosymbiont C. mudrowiae is crucial. This involves not only comprehending its behavior within the country through complete genome analysis but also assessing potential strategies that could pave the way for future vector control approaches utilizing this endosymbiont. Moreover, to further enhance our understanding of pathogen transmission in R. sanguineus s.l., it is imperative to expand this study to encompass the diverse lineages of this tick species worldwide. This expansion would allow for the identification of distinct pathogens that may be transmitted by different lineages. Additionally, considering the knowledge generated regarding endosymbionts in these ticks, it becomes important to explore their prevalence and potential interactions within each lineage. By doing so, we can develop comprehensive surveillance strategies that encompass the entire R. sanguineus s.l. species complex, thereby enabling a more thorough assessment of pathogen diversity and transmission dynamics. Such an approach would provide valuable insights for the implementation of effective surveillance and control measures targeting this complex species.
Our study has yielded a comprehensive understanding of the pathogens and endosymbionts present in R. sanguineus s.l ticks in Colombia. We have successfully identified three significant pathogens, namely A. phagocytophilum, F. tularensis, and T. equi, which have implications for both animal and human health. Implications ranging from the occurrence of mild symptoms to the death of both humans and animals. This information is invaluable for guiding tick-borne disease surveillance efforts in the country. Furthermore, we have made notable discoveries regarding the presence of highly relevant endosymbionts, including Coxiella mudrowiae, among others, within this tick species. One intriguing finding is the negative correlations observed between the presence of pathogens and endosymbionts. This suggests complex interactions within the tick microbiome that warrant further investigation. In addition, our study represents a milestone as we have successfully assembled multiple MAGs for C. mudrowiae, making them the first MAGs of this endosymbiont reported in Colombia. These MAGs have provided valuable insights into the phylogenetic relationships and functional characteristics of C. mudrowiae in the country. Overall, our findings contribute significantly to the understanding of pathogen and endosymbiont dynamics in R. sanguineus s.l ticks in Colombia. This knowledge serves as a crucial resource for guiding tick-borne disease surveillance efforts and has implications for public health and animal welfare.
We thank the personnel who helped in the collection of the specimens in the field.
The authors have declared that no competing interests exist.
No ethical statement was reported.
This work was funded by Dirección de Investigación e Innovación from Universidad del Rosario.
Conceptualization: MM, JDR, LPT, APM. Data curation: MM, JDR, LPT. Formal analysis: JDR, LV, GH, LPT, MM. Funding acquisition: JDR. Investigation: JDR, GH, LV, APM, MM. Methodology: DGC, LPT, LV, APM, JDR, GH, MPM. Project administration: JDR. Resources: LPT, DGC, MPM, JDR. Software: LPT. Supervision: JDR, MM. Validation: JDR. Visualization: JDR, LPT, GH. Writing – original draft: JDR, LPT. Writing – review and editing: DGC, MPM, APM, GH, MM, LV.
Diego Garcia-Corredor https://orcid.org/0000-0001-5122-5435
Juan David Ramírez https://orcid.org/0000-0002-1344-9312
The raw sequences for each sample will be available under the project number PRJEB64029 in The European Nucleotide Archive (ENA) repository. The BioSampleID is specified in Suppl. material
Geographic locations where samples were collected
Data type: pdf
Visual representation illustrating the methods utilized in the analysis discussed in the article
Data type: pdf
Significant differences in the relative abundances of assigned reads in Bacteria and Archaea between departments and sexes of the various genera found in the samples
Data type: pdf
Pathogens and endosymbionts present in each tick
Data type: pdf
Results of dereplication, as well as phylogenomics analysis and pangenome exploration of all samples (including longer MAGs)
Data type: pdf
Metadata and metagenomic information of collected samples
Data type: xlsx
Metadata and information regarding the genomes included in the analysis for endosymbiont and pathogen identification
Data type: xlsx
Abundance of taxonomy at the genus level within the domains of Bacteria and Archaea
Data type: xlsx
Abundance of taxonomy at the Family level within the domains of Bacteria and Archaea
Data type: xlsx
p-values obtained between departments and sexes within each department using the Wilcoxon test on the relative abundances of each genus (Bacteria and Archaea)
Data type: xlsx
Abundance of taxonomy of endosymbionts and pathogens
Data type: xlsx
p-values obtained between departments and sexes within each department using the Wilcoxon test on the relative abundances of pathogens and endosymbionts
Data type: xlsx
Relative abundance by sample of virulence factors
Data type: xlsx
Relative abundance by sample of resistence marker
Data type: xlsx
Characteristics of each of the metagenomes found and their comparison with available genomes
Data type: xlsx
Gene quantity analysis by COG clusters in MAGs, available genomes, and C. burnetii
Data type: xlsx