Data Paper |
Corresponding author: Antoniya Hubancheva ( antonia.hubancheva@gmail.com ) Academic editor: Alexander Weigand
© 2023 Antoniya Hubancheva, Vedran Bozicevic, Jérôme Morinière, Holger R. Goerlitz.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Hubancheva A, Bozicevic V, Morinière J, Goerlitz HR (2023) DNA metabarcoding data from faecal samples of the lesser (Myotis blythii) and the greater (Myotis myotis) mouse-eared bats from Bulgaria. Metabarcoding and Metagenomics 7: e106844. https://doi.org/10.3897/mbmg.7.106844
|
A comprehensive understanding of trophic interactions in terrestrial ecosystems is crucial for ecological research and conservation. Recent advances in non-invasive methods, such as environmental DNA (eDNA) metabarcoding, have enabled researchers to collect vast amounts of data on wild animal diets. However, sharing this data and metadata effectively and transparently presents new challenges. To address this, a new type of scholarly journal publication has emerged that aims to describe datasets rather than report research investigations. In this paper, we present a dataset of consumed prey species and parasites based on the metabarcoding of 113 faecal samples from the greater and lesser mouse-eared bats (Myotis myotis and Myotis blythii), along with a detailed description of the data sampling, laboratory analysis, and bioinformatics pipeline. Our dataset comprises 1018 unique Barcode Index Numbers (BINs) from 12 Classes and 43 Orders. In addition, we provide interactive Krona charts to visually summarize the taxonomic relationships and relative read abundance of the consumed prey species and parasites. This data can be used for meta-analysis, exploring new predator-prey and host-parasite interactions, studying inter- and intraspecific ecological interactions, and informing protected area management, among other applications. By sharing this dataset, we hope to encourage other researchers to use it to answer additional ecological questions and advance our understanding of trophic interactions in terrestrial ecosystems.
Bats (Chiroptera), metabarcoding, Myotis myotis, Myotis blythii, parasite-host interactions, predator-prey interactions
Bats play a crucial role in terrestrial ecosystems worldwide by occupying various ecological niches and exploiting a range of food sources including insects, vertebrates, blood, nectar, pollen and fruit (
Myotis myotis (Borkhausen, 1797) and Myotis blythii s.l. (Tomes, 1857; for summary and discussion on taxonomy and phylogeny of the species see
The diet of M. myotis is extensively studied throughout its range with morphological methods (
Here, we provide a detailed description of the metabarcoding analysis of the faeces and of the diet of 113 individual bats (60 M. myotis and 53 M. blythii) collected in an area with high biodiversity value (
In summary, the dataset we present in this paper is a valuable resource that can aid in advancing ecological research and conservation efforts. We hope that by sharing our data, we can contribute to a more collaborative and transparent research environment that will lead to more effective conservation and management of terrestrial ecosystems.
Geographic coverage
Faecal samples were collected from individual bats at the entrance of the Orlova Chucka cave, Pepelina, Dve Mogili District, Bulgaria (43.593240, 25.960108). The cave is inhabited by 15 bat species all year round. In summer, however, it is predominantly occupied by mixed maternity colonies of M. myotis and M. blythii, as well as Rhinolophus euryale and Rhinolophus mehelyi (
Geographic coverage, sampling area and typical hunting habitats A topographical map showing the study area with the sampling site marked by a star and the approximate foraging range of the bats represented by a circle B–D representative hunting habitats of the lesser and greater mouse-eared bats, including small-scale agricultural fields, forests, karstic areas, and riverine habitats (modified after
Samples were collected from June to August in 2017 and 2018. Our period covers the lactation and post-lactation period of the female bats, during which they have to forage more actively to provide enough nutrition to both themselves and the pup.
Bats were captured in the morning (when returning to the roost after foraging) with a harp trap placed in front of the cave entrance. We emptied the trap every 5 to 10 minutes to minimize defecation in the trap, and thus potential cross-contamination between individuals by faeces attached to the fur. However, we could not fully prevent bats from defecating in the trap, therefore, a small proportion of cross-contamination between the different individuals might have occurred. After being removed from the trap, bats were placed in individual cotton bags until they defecated. Prior to data collection, the bags were brushed from previous guano and washed at 90 °C with bleach. After the bats had defecated in the bags, they were measured, sexed and identified to species level following
DNA metabarcoding was conducted at the AIM Lab (AIM—Advanced Identification Methods GmbH, Leipzig, Germany). Genomic data was extracted using the Quick-DNA Fecal/Soil Microbe 96 Kit (Zymo Research Corporation, Irvine CA, USA) and following the manufacturer’s instructions. To control for artifacts arising from lab contamination, we ran 6 empty vials as negative control samples through the lab procedure: 2 before extraction, and 2 before each of the two rounds of PCR. These negative control samples were processed in the same way as the faecal samples. Further laboratory analyses were carried out as per the methods described in
From each sample, paired-end reads were merged using the -fastq_mergepairs utility of USEARCH v11.0.667 (
Quality filtering was performed using --fastq_filter, allowing a maximum of 1 expected error along the length of the sequence and a minimum read length of 300 bases (parameters: --fastq_maxee 1, --minlen 300). This was followed by dereplication on the sample level using --derep_fulllength, keeping only a single copy of each unique sequence (parameters: --sizeout, --relabel Uniq). Cleaned and dereplicated sample files were concatenated into one large FASTA file, which was then dereplicated again, and also filtered for sequences occurring only once in the entire dataset (singletons) with the parameters --minuniquesize 2, --sizein, --sizeout, --fasta_width 0.
To save processing power, a clustering step (at 98% identity) was employed before chimera filtering using the VSEARCH utility --cluster_size and the centroids algorithm (parameters: --id 0.98, --strand plus, --sizein, -- sizeout, --fasta_width 0, --centroids). Chimeric sequences were then detected and filtered out from the resulting file using the VSEARCH -- uchime_denovo utility (parameters: --sizein, --sizeout, --fasta_width 0, -- nonchimeras). Next, a perl script obtained from the authors of VSEARCH (https://github.com/torognes/vsearch/wiki/VSEARCH-pipeline) was used to regenerate the concatenated FASTA file, but without the subsequently detected chimeric sequences. The resulting chimera-filtered file was then used to cluster the reads into operational taxonomic units (OTUs) using SWARM v.3.1.0 (
To reduce the risk of false positives, a cleaning step was employed that excluded read counts in the OTU table constituting <0.01% of the total number of reads in the sample. OTUs were additionally removed from the results based on negative control samples. If the number of reads for the OTU in any sample was less than the maximum for that OTU among negative controls, those reads were excluded from further analysis.
OTU representative sequences were blasted with the program Megablast (parameters: maximum hits: 1; scoring (match mismatch): 1–2; gap cost (open extend): linear; max E-value: 10; word size: 28; max target seqs 100) against (1) a custom database downloaded from GenBank (a local copy of the NCBI nucleotide database downloaded from ftp://ftp.ncbi.nlm.nih.gov/blast/db/), and (2) a custom database built from data downloaded from BOLD (www.boldsystems.org) (
All available Animalia data was downloaded from the BOLD database on 29 July 2022 using the available public data API (http://www.boldsystems.org/index.php/resources/api) in a combined TSV file format. The combined TSV file was then filtered to keep only the records that: (1) had a sequence (field 72, “nucleotides”); (2) had a sequence that did not hold exclusively one or more “-” (hyphens); had a sequence that did not contain non-IUPAC characters; (3) belonged to COI (the pattern “COI-5P” in either field 70 (“markercode”) or field 80 (“marker_codes”)); 5) had an available BIN (field 8, “bin_uri”). In (5), an exception was made in cases where the species belonging to that record did not occur with a BIN elsewhere in the dataset. In other words, “BIN-less” records were kept if their species were also completely BIN-less in the dataset. The dataset was then filtered to include only records from a custom European BOLD BLAST database.
Finally, a FASTA file annotated with (1) a Process ID (field 1, “processid”), (2) BIN (field 8), (3) taxonomy (fields 10, 12, 14, 16, 18, 20, 22 – “phylum_name”, “class_name”, “order_name”, “family_name”, “subfamily_name”, “genus_name”, “species_name”), (4) geolocation data (fields 47, 48, 55), and (5) GenBank ID (field 71, “genbank_accession”) was created from the filtered combined TSV file. This FASTA file was then converted into a BLAST database using Geneious v10.2.6 (Biomatters, Auckland, New Zealand). The results were exported and further processed according to methods described by
Briefly, the resulting CSV files containing BLAST results were exported from Geneious and combined with the OTU table generated by the bioinformatic pre-processing pipeline. The CSVs included: (1) OTU ID; (2) BOLD Process ID; (3) BIN; (4) Hit-%-ID value (the percentage of identical base pairs of the OTU query sequence with its closest counterpart in the reference database); (5) Grade-%-ID value (a value that combines query coverage, E-value and Hit-%-ID with weights of 0.5, 0.25 and 0.25 respectively); (6) length of the top BLAST hit sequence; (7) phylum, class, order, family, genus and species for each detected OTU.
As an additional measure of control other than BLAST, the OTUs were classified into taxa using the Ribosomal Database Project (RDP) naïve Bayesian classifier (
BOLD taxonomy was then used to create Krona charts (Fig.
Taxonomic relationships and relative read abundance of prey and parasite species in faecal samples collected from A 60 individuals of the greater mouse-eared bat, Myotis myotis, and B 53 individuals of the lesser mouse-eared bat, Myotis blythii. The Krona charts presented in this figure exclude the reads from the two bat species. However, Krona charts with included bat reads can be found in the Supplementary Information. An interactive graph is also available in the online version of this publication, offering a more in-depth analysis of the data.
The presented dataset is a comprehensive collection of 1018 BIN species belonging to 12 Classes and 43 Orders. The interactive Krona charts, based on BOLD taxonomy, provide a useful tool for visualizing the dataset (Fig.
Insects made up the largest proportion of the detected species. The observed differences in the RRA from carabid beetles between M. myotis (46%) and M. blythii (7%) aligns with the differences in diet, foraging style and habitat of these species (
Importantly, since the RRA does not accurately represent actual biomass abundance, it is crucial to complement this data with other research techniques, such as biologging (
Notably, in addition to the bats’ prey species, the provided dataset also includes reads from various ecto- and endo-parasites, such as ticks (Ixodes), mites (Mesostigmata and Sarcoptiformes), roundworms (Strongylida and Rhabditida), and other parasite species. Furthermore, we identified molluscs (Gastropoda) and worms (Annelida) in the samples, including Pomatias rivulare, Lumbricus rubellus, and Eisenia fetida, which were likely consumed by predatory carabid beetles or other arthropods that were then consumed by the bats. Moreover, the presence of species from the roundworm genus Steinernema, which are known to parasitize mole crickets and other bat prey, suggests that the dataset also contains parasites of the bats’ prey species. This comprehensive dataset thus offers valuable insights into the diversity and abundance of the parasites, the prey and their associated species of the greater and the lesser mouse-eared bats.
We thank Martin Georgiev, Theresa Hügel, Kathrin Dimitrova, and the entire field team of the Siemers Bat Research Station (field seasons 2017 and 2018) for their invaluable assistance with sample collection, methodological, logistical, and mental support. We acknowledge the Max Planck Institute for Ornithology and the National Museum of Natural History, Sofia for infrastructure and support.
The authors have declared that no competing interests exist.
No ethical statement was reported.
This research was supported by the Bulgarian Academy of Sciences to A.H (Grant No. DFNP-631 17-71/28.07.2017) and an Emmy Noether grant to H.R.G. by the Deutsche Forschungsgemeinschaft (grant no. 241711556).
A.H. – Conceptualization, A.H. V.B. – Data curation, A.H., V.B. – Formal Analysis, A.H., H.R.G. – Funding acquisition, A.H., V.B., J.M. – Methodology, A.H., H.R.G – Project administration, H.R.G. J.M. – Resources, V.B. – Software, H.R.G., J.M. – Supervision, A.H., V.B. – Validation, A.H., V.B. – Visualization, A.H., V.B. – Writing original draft, A.H., V.B., J.M., H.R.G. – Writing review & editing.
Antoniya Hubancheva https://orcid.org/0000-0001-8362-1301
Vedran Bozicevic https://orcid.org/0000-0002-8778-3399
Jérôme Morinière https://orcid.org/0000-0001-9167-6409
Holger R. Goerlitz https://orcid.org/0000-0002-9677-8073
The dataset described in this data paper is available in the Supplementary materials and has been deposited in Dryad under https://doi.org/10.5061/dryad.4tmpg4fgz. The Dryad mirror also contains the raw metabarcoding reads from Illumina.
Metabarcoding data from M. myotis and M. blythii from Bulgaria
Data type: metabarcoding in tabular format
Explanation note: An Excel file, titled "mmyotis-mblythii-metabarcoding-data-bulgaria.xlsx", includes DNA metabarcoding data from the faecal samples of the lesser (Myotis blythii) and greater (Myotis myotis) mouse-eared bats from Bulgaria and a second sheet that contains descriptions of the columns in the metabarcoding dataset, along with their meanings.
Taxonomic relationships and relative abundance of prey and parasite species in faecal samples from M. myotis and M. blythii from Bulgaria
Data type: interactive chart
Explanation note: An interactive Krona chart that provides a visual representation of the metabarcoding data.