Research Article |
Corresponding author: Janina Schenk ( janina.schenk@uni-bielefeld.de ) Academic editor: Michael T. Monaghan
© 2019 Janina Schenk, Stefan Geisen, Nils Kleinboelting, Walter Traunspurger.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Schenk J, Geisen S, Kleinboelting N, Traunspurger W (2019) Metabarcoding data allow for reliable biomass estimates in the most abundant animals on earth. Metabarcoding and Metagenomics 3: e46704. https://doi.org/10.3897/mbmg.3.46704
|
Microscopic organisms are the dominant and most diverse organisms on Earth. Nematodes, as part of this microscopic diversity, are by far the most abundant animals and their diversity is equally high. Molecular metabarcoding is often applied to study the diversity of microorganisms, but has yet to become the standard to determine nematode communities. As such, the information metabarcoding provides, such as in terms of species coverage, taxonomic resolution and especially if sequence reads can be linked to the abundance or biomass of nematodes in a sample, has yet to be determined. Here, we applied metabarcoding using three primer sets located within ribosomal rRNA gene regions to target assembled mock-communities consisting of 18 different nematode species that we established in 9 different compositions. We determined abundances and biomass of all species added to examine if relative sequence abundance or biomass can be linked to relative sequence reads. We found that nematode communities are not equally represented by the three different primer sets and we found that relative read abundances almost perfectly correlated positively with relative species biomass for two of the primer sets. This strong biomass-read number correlation suggests that metabarcoding reads can reveal biomass information even amongst more complex nematode communities as present in the environment and possibly can be transferred to better study other groups of organisms. This biomass-read link is of particular importance for more reliably assessing nutrient flow through food-webs, as well as adjusting biogeochemical models through user-friendly and easily obtainable metabarcoding data.
Nematodes, quantification, community analysis, Illumina MiSeq, species diversity
Over the last centuries, researchers aimed at capturing the planet’s biodiversity that consists of about 1.9 million described species – a fraction compared to the diversity of undescribed species (
Nematode communities (and those resulting from nematode abundance and diversity estimates) are characterised by morphological taxon identification using microscopy. This is in contrast to comparable studies focusing on microorganisms, which are entirely based on molecular approaches, particularly metabarcoding (
Several studies investigated the performance of metabarcoding for community analyses, but mostly focusing on samples without an a priori knowledge of species composition initially present in the sample as reported for ciliates and microbial communities (
In this study, we used mock communities, consisting of 18 different nematode species and applied Illumina MiSeq sequencing, targeting three commonly used barcoding regions within the rRNA gene region. We changed the composition of those species, such as by adding them in equal abundances, by compositions as found in nature and by replacing larger adult with smaller juvenile specimens to assess and calibrate abundance depiction efficiency. We hypothesised (1) that relative taxon composition and revealed diversity depend on the primer pair, due to differences in resolution and PCR-induced differential amplification, especially for communities. Additionally, we hypothesised (2) that relative read abundance is best reflected by relative biomass rather than abundance data due to the differences in ribosomal copy number variation and the resulting increases in barcoding gene numbers.
Nematode species were raised on cultured agar plates (1.7%) dosed with cholesterol and E. coli and synchronised regarding life stages. In total, 18 species were used and organisms were individually removed with needles from plates and rinsed in a water drop in order to remove excess agar parts, bacteria and fungi. Organisms were then transferred into an Eppendorf tube containing lysis buffer (Machery & Nagel, Nucleo Spin Tissue XS Kit), according to predefined mock-community ratios (Table
Mock-community composition of 18 nematode species with a total of 198–200 individuals present in each assembled community. Nine different communities were created, all with at least one of each species present. One species (Rhomborhabditis regina) was added either as adults or juveniles, while only adults of all other species were added. Furthermore, the wet weight (ww) of the 18 species is given in µg, together with the range of biomass. Each sample was replicated 3 times, resulting in 27 mock-communities.
Species | Number of individuals | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Sample | Biomass ww | Biomass range | |||||||||
1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | (µg) | (µg) | |
Rhomborhabditis regina adult (Schulte & Poinar, 1991) | 11 | 0 | 6 | 2 | 5 | 10 | 5 | 10 | 2 | 20.29 | 18.82–34.59 |
Rhomborhabditis regina juvenile | 0 | 11 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 6.09 | 3.25–16.24 |
Caenorhabditis elegans Maupas 1900 | 11 | 11 | 11 | 166 | 41 | 15 | 115 | 70 | 21 | 3.74 | 2.22–5.59 |
Panagrellus redivivus (Goodey, 1943) | 11 | 11 | 11 | 2 | 4 | 10 | 5 | 10 | 23 | 3.83 | 3.44–5.21 |
Pristionchus pacificus Sommer et al. 1996 | 11 | 11 | 11 | 2 | 5 | 10 | 5 | 10 | 13 | 3.10 | 2.62–3.83 |
Pristionchus entomophagus (Steiner, 1929) | 11 | 11 | 11 | 2 | 15 | 15 | 5 | 10 | 7 | 4.02 | 3.45–4.71 |
Plectus velox Bastian, 1865 | 11 | 11 | 11 | 2 | 5 | 10 | 5 | 5 | 2 | 4.62 | 3.39–7.79 |
Plectus cf. acuminatus Bastian, 1865 | 11 | 11 | 11 | 2 | 3 | 10 | 5 | 5 | 3 | 1.98 | 1.26–3.16 |
Plectus aquatilis Andrassy, 1985 | 11 | 11 | 11 | 2 | 2 | 5 | 5 | 5 | 2 | 1.50 | 1.15–2.06 |
Paroigolaimella bernensis (Steiner, 1914) | 11 | 11 | 11 | 2 | 20 | 15 | 5 | 5 | 32 | 1.37 | 1.16–1-67 |
Panagrolaimus thienemanni Hirschmann, 1952 | 11 | 11 | 11 | 2 | 5 | 5 | 5 | 10 | 10 | 0.31 | 0.28–0.53 |
Acrostichus sp. | 11 | 11 | 11 | 2 | 5 | 10 | 5 | 5 | 2 | 0.79 | 0.45–1.18 |
Acrostichus nudicapitatus (Steiner, 1914) | 11 | 11 | 11 | 2 | 5 | 25 | 5 | 5 | 2 | 0.92 | 0.68–1.44 |
Poikilolaimus regenfussi Sudhaus, 1980 | 11 | 11 | 11 | 2 | 15 | 5 | 5 | 10 | 11 | 0.65 | 0.44–0.91 |
Poikilolaimus oxycerca (de Man, 1895) | 11 | 11 | 11 | 2 | 5 | 5 | 5 | 5 | 2 | 0.73 | 0.70–0.98 |
Acrobeloides tricornus (Thorne, 1925) | 11 | 11 | 11 | 2 | 25 | 10 | 5 | 10 | 9 | 0.82 | 0.73–1.13 |
Acrobeloides cf. nanus (de Man,1880) | 11 | 11 | 11 | 2 | 10 | 15 | 5 | 10 | 23 | 0.77 | 0.62–1.04 |
Diploscapter coronatus (Cobb, 1893) | 11 | 11 | 11 | 2 | 5 | 15 | 5 | 5 | 2 | 0.24 | 0.20–0.38 |
Aphelenchoides parietinus Steiner, 1932 | 11 | 11 | 11 | 2 | 25 | 10 | 5 | 10 | 34 | 0.14 | 0.09–0.22 |
Sum | 198 | 198 | 199 | 200 | 200 | 200 | 200 | 200 | 200 |
Individual species were amplified, Sanger-sequenced and used as reference in the bioinformatic pipeline (see below). PCR conditions followed
Mean biomass (wet weight) of every species was calculated following (Andrássy 1956) for several individuals (n = 10–15 adults) that were heat-fixated and measured (Table
Except for the taxonomic classification and primer removal, the MiSeq standard operation procedure using mothur (
For each primer pair, read numbers for each species in each replicate were averaged to decrease variation and communities were compared to relative abundance and relative biomass, based on the number of inoculated specimens. For statistical analyses of communities, Bray Curtis similarity was applied with non-transformed data. Non-metric multidimensional scaling (NMDS) plots were created with MathLab (
The three primer pairs had a different amplification success (Table
Species recovered by the three primer pairs. Given are the species added in the mock-communities and the performance of each marker. An “x” indicates that the species was found, an “n” means the species was missing and an (x) shows that the nucleotide sequence of this species was identical to another species, indicated in bold and, therefore, could not be experimentally verified at the species level in the metabarcoding approach. Furthermore, the total number of OTUs that could be recovered from the mock community species, including species with identical sequences, as well as without genetically indistinguishable species (in parentheses), are given, together with the average read length (bp) and the contamination (in %).
Species | 18S_V1 | 18S_V4 | 28S_D3 |
---|---|---|---|
Rhomborhabditis regina | x | x | x |
Panagrellus redivivus | x | x | |
Caenorhabditis elegans | x | x | x |
Pristionchus pacificus | x | x | x |
Pristionchus entomophagus | x | x | x |
Aphelenchoides parietinus | x | x | |
Acrobeloides cf. nanus | x | x | x |
Acrobeloides tricornus | (x) | (x) | x |
Panagrolaimus thienemanni | x | x | |
Poikilolaimus cf. regenfussi | x | x | x |
Poikilolaimus oxycerca | x | x | |
Paroigolaimella bernensis | x | x | x |
Acrostichus sp. | x | x | |
Acrostichus nudicapitatus | (x) | (x) | |
Diploscapter coronatus | |||
Plectus aquatilis | x | x | x |
Plectus cf. acuminatus | (x) | (x) | (x) |
Plectus velox | (x) | (x) | x |
Number of OTUs | 13(10) | 16 (12) | 16 (14) |
Avg. read length (bp) | 405 | 583 | 532 |
Avg. contamination (%) | 0.53 | 0.81 | 2.14 |
The community structure differed between the primer pairs, with 28S_D3 resulting in the most similar community depiction to the initial inoculated communities, while depiction for the 18S_V1 primer pair substantially differed. Nematode communities amplified with the 28S_D3 primer pair grouped together with nematode mock communities presented as biomass data in the NMDS plot (Fig.
Relative biomass positively correlated with relative read numbers for the primer pair 28S_D3 (R2 = 0.90539, p < 0.001) and 18S_V4 (R2 = 0.81396, p < 0.001, Fig.
Relative species abundances were positively correlating with read numbers for the primer pairs 28S_D3 (R2 = 0.53694, p < 0.001) and 18S_V4 (R2 = 0.33077, p < 0.001), but less strong than for relative biomass-read number correlations (Fig.
Correlations of relative biomass and relative abundance against relative read proportions. a–b Correlation of 239 relative biomass proportions in μg (x-axis) against the relative number of generated reads (y-axis). Given is the slope of the 240 correlation and the according R2-value for the 28S_D3 and the 18S_V4 marker. c–d Correlation of relative abundance 241 proportions in μg (x-axis) against the relative number of generated reads (y-axis). Given is the slope of the correlation and 242 the according R2-value for the 28S_D3 and the 18S_V4 marker. 24
In this study, we show according to our hypothesis (1) that taxon composition and revealed diversity depend on the applied primer pairs. As furthermore hypothesised, we could show (2) that the relative sequence abundance strongly correlates with the relative taxon biomass for two of the three primer pairs tested.
A total of 78% of species diversity present in the samples was recovered for the 28S_D3 primer pair, while, for the other primers, resolution was lower (18S_V1:55%; 18S_V4: 67%). A higher diversity was recovered considering indistinguishable taxa (28S_D3 and 18S_V4: 89%; 18S_V1: 78%). Those that could not be distinguished shared the same marker gene sequence and, as such, cannot be distinguished, based on sequencing (Table
The community depiction for this study varied due to primer-induced differences. The 18S_V1 marker was, in our case, not suitable for distinguishing communities due to a low species recovery and an extreme over-amplification of mainly two species (Rhomborhabditis regina and Plectus aquatilis) and is, therefore, not discussed further. In turn and in line with most accurate diversity representations, the 28S_D3 primer pair could reliably distinguish between communities, followed by the 18S_V4 marker. The difference for the V4-marker is likely introduced by higher length-variation in this marker (Suppl. material
Relative biomass and read proportions for almost all mock-community species at the species level were not different for the 28S_D3 primer pair, suggesting that relative biomass can reliably be depicted by relative read numbers obtained by our nematode metabarcoding approach. In support, species that were missing or amplified at low proportions for the 28S_D3 and 18S_V4 markers in the dataset were those that also had a low overall biomass (Panagrolaimus thienemanni and Diploscapter coronatus, Table
We thank Steffi Gehner for raising nematodes on cultured plates at synchronised life stages and thank Professor Oliver Krüger for proving laboratory space for the molecular work. We also thank the German Federal Institute of Hydrology (BfG) for funding our research. Bioinformatic work was furthermore supported in parts by grants of the German Federal Ministry of Education and Research (BMBF) of the project “Bielefeld-Gießen Center for Microbial Bioinformatics – BiGi” (Grant-number 031A533) within the German Network for Bioinformatics Infrastructure (de.NBI). S. G. was supported by the NWO-VENI grant from the Netherlands Organisation for Scientific Research (016.Veni.181.078). We acknowledge the constructive comments by one anonymous referee and the editor.
Tables S1–S7
Data type: species data
Figures S1–S4
Data type: species data