Corresponding author: Andreas Kolter ( andreas.kolter@ruhr-uni-bochum.de ) Academic editor: Hugo de Boer
© 2021 Andreas Kolter, Birgit Gemeinholzer.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Kolter A, Gemeinholzer B (2021) Internal transcribed spacer primer evaluation for vascular plant metabarcoding. Metabarcoding and Metagenomics 5: e68155. https://doi.org/10.3897/mbmg.5.68155
|
The unprecedented ongoing biodiversity decline necessitates scalable means of monitoring in order to fully understand the underlying causes. DNA metabarcoding has the potential to provide a powerful tool for accurate and rapid biodiversity monitoring. Unfortunately, in many cases, a lack of universal standards undermines the widespread application of metabarcoding. One of the most important considerations in metabarcoding of plants, aside from selecting a potent barcode marker, is primer choice. Our study evaluates published ITS primers in silico and in vitro, through mock communities and presents newly designed primers. We were able to show that a large proportion of previously available ITS primers have unfavourable attributes. Our combined results support the recommendation of the introduced primers ITS-3p62plF1 and ITS-4unR1 as the best current universal plant specific ITS2 primer combination. We also found that PCR optimisation, such as the addition of 5% DMSO, is essential to obtain meaningful results in ITS2 metabarcoding. Finally, we conclude that continuous quality assurance is indispensable for reliable metabarcoding results.
barcoding, DMSO, internal transcribed spacer, Spermatophyta, mock community, PCR, Tracheophyta
Globally, one million species are threatened by extinction in the near future and 68% of monitored populations are declining (
Aside from marker choice, primer choice has repeatedly been identified as one of the key factors to facilitate accurate recovery of taxa in a sample (
Our study evaluates ITS primers, based on an in silico and in vitro analysis. The in vitro analysis, performed by using two mock communities, aims to compare the uniformity of amplification achieved with different primers and identify primer-specific amplification biases. The in silico analysis identified mismatches of common ITS primers in Spermatophyta (Cycadopsida, Gnetopsida, Pinopsida, Liliopsida and Magnoliopsida) and led to the design of five new ITS primers with improved universality. Our primer design was focused towards Spermatophyta, as this plant taxon is well represented in public sequence repositories compared to other plant taxa. However, we also reported mismatches in Bryophyta, Fungi, Polypodiopsida and Lycopodiopsida if adequate taxonomic representation were available. Furthermore, due to the high guanine-cytosine (GC) content in a substantial number of ITS2 sequences, we investigated the impact of dimethyl sulphoxide (DMSO) on mock community species retrieval success.
Spermatophyta sequences containing the ITS region, used as a template to generate primers, were downloaded from GenBank in April 2018 as described in
To screen for potential primer sequences in the 5.8S nrDNA region, a consensus sequence of all Spermatophyta plant sequences was established (Suppl. material
Primer statistics were calculated using the online tool OligoAnalyzer 3.1 (
We selected 22 frequently used ITS primers from literature to be analysed alongside the five newly designed primers (Table
Primer name | Orientation | nrDNA primer position | Distance to ITS region from primer 3’ [bp] | Primer sequence (5’→3’) | Publication | in silico / in vitro evaluation |
---|---|---|---|---|---|---|
ITS1 | forward | SSU | 11 | TCCGTAGGTGAACCTGCGG |
|
✓ / X |
ITS5 | forward | SSU | 32 | GGAAGTAAAAGTCGTAACAAGG |
|
✓ / X |
ITS-A | forward | SSU | 32 | GGAAGGAGAAGTCGTAACAAGG |
|
✓ / X |
ITS-u1 | forward | SSU | 32 | GGAAGKARAAGTCGTAACAAGG |
|
✓ / X |
ITS-p5 | forward | SSU | 46 | CCTTATCAYTTAGAGGAAGGAG |
|
✓ / X |
ITS2 | reverse | 5.8S | 30 | GCTGCGTTCTTCATCGATGC |
|
✓ / X |
ITS-C | reverse | 5.8S | 55 | GCAATTCACACCAAGTATCGC |
|
✓ / X |
ITS-u2 | reverse | 5.8S | 86 | GCGTTCAAAGAYTCGATGRTTC |
|
✓ / X |
ITS-p2 | reverse | 5.8S | 5 | GCCRAGATATCCGTTGCCGAG |
|
✓ / X |
ITS-2plR1 | reverse | 5.8S | 3 | GCCDAGATATCCRTTGYCRRGAG | this work | ✓ / X |
ITS3 | forward | 5.8S | 110 | GCATCGATGAAGAACGCAGC |
|
✓ / ✓ |
ITS-D | forward | 5.8S | 136 | CTCTCGGCAACGGATATCTCG |
|
✓ / X |
ITS-S2F | forward | 5.8S | 87 | ATGCGATACTTGGTGTGAAT |
|
✓ / ✓ |
ITS-u3 | forward | 5.8S | 110 | CAWCGATGAAGAACGYAGC |
|
✓ / ✓ |
ITS-p3 | forward | 5.8S | 141 | YGACTCTCGGCAACGGATA |
|
✓ / ✓ |
UniPlantF | forward | 5.8S | 75 | TGTGAATTGCARRATYCMG |
|
✓ / ✓ |
ITS-3p34unF1 | forward | 5.8S | 104 | CGATGAAGAAYGYAGYRAAMTG | this work | ✓ / X |
ITS-3p53plF1 | forward | 5.8S | 84 | AMTGCGAYACBTRGTGTGAATTGC | this work | ✓ / X |
ITS-3p62plF1 | forward | 5.8S | 78 | ACBTRGTGTGAATTGCAGRATC | this work | ✓ / ✓ |
58SPL | forward | 5.8S | 42 | TTTGAACGCAAGTTGCGCC | M.-J. Côté, published: |
✓ / X |
ITS4 | reverse | LSU | 40 | TCCTCCGCTTATTGATATGC |
|
✓ / ✓ |
ITS-B | reverse | LSU | 41 | CTTTTCCTCCGCTTATTGATATG |
|
✓ / X |
BEL-3 | reverse | LSU | 144 | GACGCTTCTCCAGACTACAAT |
|
X / ✓ |
ITS-u4 | reverse | LSU | 49 | RGTTTCTTTTCCTCCGCTTA |
|
✓ / ✓ |
ITS-p4 | reverse | LSU | 35 | CCGCTTAKTGATATGCTTAAA |
|
✓ / ✓ |
UniPlantR | reverse | LSU | 2 | CCCGHYTGAYYTGRGGTCDC |
|
✓ / ✓ |
ITS-4unR1 | reverse | LSU | 40 | TCCTCCGCTTATTKATATGC | this work | ✓ / ✓ |
Primers were compared to the DNA sequences using the R packages ShortRead and Biostrings (
We extracted DNA from 58 herbarium specimen by the use of silica-coated ferric beads and the tissue protocol by
Primers contained a part of the Illumina TruSeq read primer in addition to the target primer to act as a linker between PCR number one (target amplification) and PCR number two (Illumina indexed adapter being added), which results in the following forward primer: 5’-CAGACGTGTGCTCTTCCGATCT [optional spacer] [target primer]-3’ (reverse: 5’-CTACACGACGCTCTTCCGATCT [optional spacer] [target primer]-3’). A spacer that is non-complementary to the target sequence was added to: 1) prevent more than three identical consecutive nucleotides, 2) stop the TruSeq sequence from interfering with primer binding if it showed the potential to be partially complementary to the target sequence and, 3) to act as a mini-barcode with a length of 3 bp to facilitate pooling of samples with otherwise identical primers after the first round of PCR (termed internal index by
PCRs were conducted in a 12.5 µl reaction mix containing: 3.125 µl Trehalose (20%), 1.25 µl reaction buffer (10×), 0.625 µl MgCl2 (50 mM), 1.25 µl DMSO (50%), 0.3 µl bovine serum albumin (BSA) (0.01 mg/ml), 0.25 µl each forward and reverse primer (5 µM), 0.3125 mM dNTP (2 mM), 0.06 µl Platinum Taq (Invitrogen) polymerase (5 U/µl), 1 µl DNA template and 4.0775 µl ddH2O (modified from
Due to the relatively high GC content of ITS2 amplicons, we optimised PCR conditions in a pre-trial and concluded that the additive DMSO at a concentration of 5% enables amplification of ITS2 from genomic plant templates (Suppl. material
Sequencing data was processed with R (Suppl. material
To assess the successful detection of taxa in the mock communities, we calculated its read abundance for each taxon and primer combination. We define the read abundance for each taxon in each replicate as the proportion of reads for a given taxon relative to 1000 reads. If the median read abundance of all replicates of one taxon of a specific primer combination was above 0.1 and the taxon was detected (≥ one read) in all replicates of the respective primer combination, the taxon was classified as present (Table
The required read depth for each primer combination to detect all but one taxon with a confidence of 95% within a mock community was calculated separately for mix 1 and mix 2 in multiple steps. First, instead of assigning an arbitrary penalty score to missing values (taxon not detected in some replicates), we limited the calculation to taxa which could be detected in all replicates in all primer combinations (Appendices
Due to the sample size of the mock communities, the 95th percentile was roughly equivalent to the average value of the two taxa requiring the most reads. As the removal of singletons is desirable in some experimental setups, we cloned the workflow mentioned before with the exception that two reads of a specific taxon had to be detected in 95 out of 100 subsamples per primer combination (Table
We tested a total of 26 primers (Table
Primer name | Spermatophyta families with mismatches (total no. of families tested) | Average# number of mismatches in fungi sequences | Melting temperature [°C] (min / mean / max) | GC content [%] (mean/max) | GC terminal 3’ stretch [bp] | Primer length [bp] | Max ∆G [kcal/mole] | Self-dimer ∆G [kcal/mole] | Hairpin melting temperature [°C] mean/max | Max repeats [n]* |
---|---|---|---|---|---|---|---|---|---|---|
ITS1 | 5 (149) | < 1 ‒ 2 | 65.8 | 63 | 4 | 19 | -39.83 | -6.68 | 66.4 | 2 |
ITS5 | 113 (116) | < 1 | 59.3 | 41 | 2 | 22 | -38.61 | -3.61 | 29.5 | 4 |
ITS-A | 5 (116) | 1 ‒ 2 | 62.3 | 50 | 2 | 22 | -40.23 | -3.61 | 29.5 | 2 |
ITS-u1 | 0 (116) | < 1 | 59.4 / 60.8 / 62.3 | 45 / 50 | 2 | 22 | -39.42 | -3.61 | 29.5 | 4 |
ITS-p5 | 1 (95) | > 3 | 57.8 / 58.5 / 59.2 | 43 | 1 | 22 | -37.86 | -6.61 | 44.1 | 3 |
ITS2 | 26 (210) | < 1 | 64.0 | 55 | 2 | 20 | -40.18 | -13.62 | 37.8 | 2 |
ITS-C | 33 (210) | > 3 | 62.2 | 48 | 3 | 21 | -39.29 | -5.36 | 25.2 | 2 |
ITS-u2 | 71 (210) | < 1 | 60.5 / 62.1 / 63.7 | 45 / 50 | 1 | 22 | -41.25 | 37.8 | 50.5 | 3 |
ITS-p2 | 44 (210) | > 3 | 65.1 / 65.8 / 66.5 | 60 / 62 | 1 | 21 | -44.32 | -7.06 | 42.8 | 2 |
ITS-2plR1 | 8 (210) | > 3 | 60.5 / 65.0 / 69.9 | 54 / 65 | 1 | 23 | -44.15 | -7.15 | 26.7 / 56.3 | 3 |
ITS3 | 26 (210) | < 1 | 64.0 | 55 | 2 | 20 | -40.18 | -13.62 | 37.3 | 2 |
ITS-D | 56 (210) | > 3 | 63.9 | 57 | 2 | 21 | -41.93 | -7.06 | 64.1 | 2 |
ITS-S2F | 36 (210) | > 3 | 60.1 | 40 | 0 | 20 | -35.68 | -3.61 | 34.7 | 2 |
ITS-u3 | 22 (210) | < 1 | 58.2 / 59.9 / 61.8 | 50 / 53 | 2 | 19 | -35.76 | -13.74 | 32.7 / 47.0 | 2 |
ITS-p3 | 30 (210) | > 3 | 62.8 / 63.1 / 63.4 | 55 / 58 | 0 | 19 | -37.80 | -5.19 | 64.1 | 2 |
UniPlantF | 8 (210) | < 1 | 55.4 / 58.7 / 62.5 | 42 / 53 | 4 | 19 | -35.49 | -10.76 | 29.9 / 65.1 | 4 |
ITS-3p34unF1 | 5 (210) | < 1 | 55.9 / 61.0 / 65.8 | 43 / 55 | 1 | 22 | -39.94 | -6.48 | 34.0 / 57.3 | 4 |
ITS-3p53plF1 | 9 (210) | < 1 ‒ 2 | 62.3 / 66.1 / 69.5 | 47 / 54 | 2 | 24 | -44.47 | -8.65 | 48.5 / 77.4 | 2 |
ITS-3p62plF1 | 9 (210) | 2 ‒ 3 | 59.4 / 62.5 / 65.3 | 44 / 50 | 1 | 22 | -38.83 | -7.05 | 29.1 / 58.4 | 2 |
58SPL | 11 (210) | 1 ‒ 2 | 64.7 | 53 | 5 | 19 | -41.15 | -10.65 | 65.6 | 3 |
ITS4 | 14 (130) | < 1 | 59.8 | 45 | 2 | 20 | -38.09 | -3.91 | 28.8 | 2 |
ITS-B | 33 (123) | < 1 | 59.8 | 39 | 1 | 23 | -42.38 | -3.91 | 9.9 | 4 |
BEL-3 | NA (NA) | NA | 61.7 | 48 | 0 | 21 | -36.77 | -3.61 | 29.1 | 2 |
ITS-u4 | 28 (109) | < 1 | 59.7 / 60.0 / 60.4 | 42 / 45 | 0 | 20 | -38.71 | -3.61 | 17.3 | 4 |
ITS-p4 | 18 (140) | 1 | 57.1 / 57.8 / 58.6 | 36 / 38 | 0 | 21 | -38.43 | -4.85 | 34.8 / 40.6 | 3 |
UniPlantR | 33 (176) | > 3 | 60.3 / 65.8 / 73.0 | 63 / 80 | 3 | 20 | -41.13 | -10.71 | 42.9 / 56.0 | 3 |
ITS-4unR1 | 5 (130) | < 1 | 57.5 / 58.7 / 59.8 | 42 /45 | 2 | 20 | -37.78 | -3.91 | 20.3 | 3 |
GC primer content ranged from 36% to 80% with a 3’ terminal GC stretch of zero to five (Table
The total amplicon length of each primer combination can be calculated by adding the primer lengths, the distance to the ITS region of interest and the length of the ITS marker (Table
Position of previously published and newly introduced ITS primers in the 5.8S nrDNA region (5’ → 3’). Primer positions may be shifted by ± 2 bp in comparison to previously published alignments due to different annotation software being used to identify the 5.8S region. Primers usually used to amplify the ITS1 region (Table
Regardless of their position, the primers located in the SSU region, flanking the ITS1 region (ITS1, ITS-A, ITS-u1, ITS-p5), have five or fewer mismatches with the exception of ITS5 (Tables
The ITS-2plR1 primer displays the lowest number of mismatches (Table
The primers located in the 5.8S nrDNA region that are usually used to serve as forward primers to amplify the ITS2 region can be split into two major groups by the number of Spermatophyta families with mismatches (Table
Primers located in the LSU region, except for UniPlantR, are overlapping each other at a distance of 35–49 bp to the ITS2 region (Table
The sequencing yielded 3,196,249 paired raw reads (NCBI BioProject PRJNA740294). The filtering step that removed most reads on average (~30%) was to require an exact primer fit at the start of the sequence. A total of 6.5% of all reads failed to merge. As the overall read quality was very high, other filters (combined) removed, on average, less than 5% of the total reads. This results in a total of 2,128,039 merged reads to enter the analysis. There was no contamination detected in the blanks that could have affected the results. Rarefaction curves show a flat slope, except for the primer combination ITS-p3 + ITS-p4, as it yielded less reads than the other primer combinations (Suppl. material
Philodendron angustisectum could not be detected by any primer combination targeting the ITS2 region in mix 1 and Sassafras albidum could not be detected in mix 2, most probably due to DNA degradation. Both will be excluded from the following analysis. The number of missed taxa per mix ranges from 3 to 9 out of 21 for mix 1 and from 3 to 6 out of 18 for mix 2 (Table
Primer combination | Mix 1 (n = 21) | Mix 2 (n = 18) | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Forward | Reverse | fungi reads [%] | Lycopodiopsida reads [%] | missed taxa (n) | retrieved taxa score [%] | min. required read depth (incl. singletons) | min. required read depth (excl. singletons) | fungi reads [%] | missed taxa (n) | retrieved taxa score [%] | min. required read depth (incl. singletons) | min. required read depth (excl. singletons) |
ITS-3p62plF1 | ITS-4unR1 | < 1 | 5 | 3 | 92 | 2,000 | 3,300 | 1 | 3 | 94 | 1,300 | 2,000 |
UniPlantF | UniplantR | < 1 | 0 | 7 | 81 | 5,500 | 9,600 | < 1 | 5 | 81 | 2,400 | 3,400 |
UniPlantF | ITS-4unR1 | 41 | 3 | 6 | 85 | 3,200 | 5,100 | 47 | 5 | 86 | 2,400 | 5,600 |
ITS-3p62plF1 | UniPlantR | < 1 | 0 | 5 | 88 | 5,300 | 9,400 | < 1 | 4 | 83 | 1,900 | 2,700 |
58SPL | ITS-4unR1 | 10 | < 1 | 6 | 81 | 4,800 | 7,600 | 16 | 5 | 80 | 1,300 | 2,100 |
ITS-u3 | ITS-u4 | 57 | 13 | 6 | 90 | 6,600 | 10,800 | 74 | 4 | 87 | 6,600 | 9,800 |
ITS-p3 | ITS-p4 | < 1 | 34 | 6 | 83 | 2,400 | 4,000 | < 1 | 5 | 83 | 1,800 | 2,700 |
ITS-S2F | BEL-3 | < 1 | 26 | 7 | 83 | 6,800 | 11,200 | < 1 | 3 | 89 | 5,200 | 7,600 |
ITS3 | ITS4 | 80 | 6 | 9 | 75 | 8,800 | 14,300 | 87 | 6 | 77 | 9,100 | 14,000 |
The 58SPL primer has a similar number of mismatches compared to 3p62plF1 (Table
The number of fungi sequences varies from < 1% to 80% of all reads in mix 1 and from < 1% to 88% in mix 2 (Table
Addition of 5% DMSO to the PCR mix has a threefold effect: 1) In most cases, it reduces the number of reads necessary to detect the species with a GC content of ≥ 62% (Fig.
Impact of DMSO on mock community representation. The detection chance (colour) per genus (black lines) was tracked per replicate (horizontal coloured lines) by subsampling the reads 100 times randomly in steps of 100 from 100 to 10000 (x-axis). The detection chance was defined as the number of subsamples where the respective genus could be detected by at least one read.
Although universal ITS primers have been proposed in the past (
A plant-specific primer with a low number of mismatches, weak secondary structures and a uniform amplification of a complex DNA mixture has the highest chance to deliver representative results when used in combination with an unknown eDNA sample (Tables
Finding the balance between the elimination of primer mismatches and the number of total primer variants was one of the main design goals of this study. The five primers we generated (ITS-2plR1, ITS-3p* and ITS-4unR1) achieved less mismatches than most of the previously published ITS primers (Table
Although the study by
An additional advantage of our method is that it reveals mismatch patterns more easily. If a mismatch occurs consistently in a large number of families and if some of these families are only represented by few sequences, the underlying pattern becomes very hard to catch, if the analysis is based on sequence level only. This can be seen in the ITS-p4 and ITS-u2 primer (Suppl. material
The threshold we report of a 30% frequency of mismatch in the corresponding plant family prevents outdated taxonomic assignments (i.e. a genus is placed in the wrong family) and misidentified sequences from introducing noise, which would result in primer design with an unnecessarily high number of ambiguous nucleotides. However, there are rare cases of mismatches of the same nucleotide and at the same primer position that occur at very low frequency within a noticeable number of plant families (Suppl. material
Strictly eliminating all mismatches by replacing each mismatch by an ambiguous nucleotide increases the count of total primer variations exponentially. Every introduced ambiguous nucleotide comes with a risk of negatively impacting the primer performance (e.g. primer dimers). An alternative for primers with ambiguous nucleotides that have already been optimised as far as possible, considering a reasonable number of total primer variations and still have mismatches in a few plant families, would be to add only those primer variations that lead to the correction of the respective mismatch. Our results allow researchers to supplement existing primers with fixed (without ambiguous nucleotides) primers, targeting specific mismatches of plant families to tailor the primer mixture to their individual needs (Suppl. material
A modification of the ITS-4unR1 primer (11T>D) allows flexibility to either be truly universal or exclude certain taxa, at least partially wind-pollinated plant families. This could be useful in insect pollination studies to reduce the number of reads generated by NGS from some Pinus species (e.g. Pinus sylvestris).
Like previous studies (
Due to PCR stochasticity (
Having these aforementioned restrictions in mind, selection of the best suited primers is, in general, the most important tool in minimising additional biases during PCR and library preparation (
One possible explanation for the mixed performance of the 58SPL primer could be that the relatively high primer hairpin melting temperature makes a large proportion of the 58SPL primer unavailable to anneal to its intended template sequence, disfavouring amplicons with a rare prevalence (Table
The ITS3 + ITS4 primer combination, included in this study, was originally designed to amplify fungi (
The ITS-u3 + ITS-u4, ITS-p3 + ITS-p4 and ITS-S2F + BEL-3 primer combinations have some performance metrics in their favour (Table
To our knowledge, this is the first mock community-based primer comparison in the context of metabarcoding in the plant kingdom. Although our mock communities only reflect a fraction of the genetic diversity within plants, we have demonstrated that there are differences between different ITS primer combinations and that these differences are not necessarily based on primer mismatches. The differences not originating from primer mismatches cannot be detected by in silico analyses only, further illustrating the need for mock community studies to verify the results of metabarcoding programmes. In contrast, the universality of the primers can be better assessed by the more comprehensive evaluations made during the in silico analysis. Should the need arise, in essence, we recommend an integrative approach to evaluate primers by combining in silico and mock community analyses. The composition of the mock community should ideally be connected to the respective study area. Considering that we thoroughly screened the whole 5.8S nrDNA region for potential primer sequences, we advise using one of the primers presented in this paper as a starting point for further refinement, if needed.
In metabarcoding, regardless of the marker used, we point out that it is strongly recommended to integrate mock communities into the workflow to provide additional quality control (
The past has shown that ITS-based studies have struggled with amplification success (
This work has been funded by the DFG SPP1991 priority programme “Taxon-Omics: New approaches for discovering and naming biodiversity”.
The authors declare no conflict of interest.
We thank Volker Wissemann for access to the lab and Martin de Jong for helping with the acquisition of plant material.
Taxonomic breakdown of sequences used for the respective in silico primer evaluation.
nrDNA region | sequences (min/max) | species (min/max) | families (min/max) | order (min/max) |
---|---|---|---|---|
SSU | 22,574 / 63,024 | 15,180 / 35,113 | 95 / 149 | 38 /44 |
5.8S* | 187,522 | 85,362 | 210 | 53 |
LSU | 25,845 / 90,319 | 14,663 / 43,795 | 109 / 176 | 42 / 49 |
location | orientation | unmodified primer name | modified primer name | primer tail | spacer | primer sequence |
---|---|---|---|---|---|---|
5.8S | forward | ITS3 | ITS-3T73F1 | CAGACGTGTGCTCTTCCGATCT | GCATCGATGAAGAACGCAGC | |
ITS-S2F | ITS-S2F-T7 | CAGACGTGTGCTCTTCCGATCT | ATGCGATACTTGGTGTGAAT | |||
ITS-u3 | ITS-u3-T7 | CAGACGTGTGCTCTTCCGATCT | CAWCGATGAAGAACGYAGC | |||
ITS-p3 | ITS-p3-T7 | CAGACGTGTGCTCTTCCGATCT | YGACTCTCGGCAACGGATA | |||
UniPlantF | UniPlantF-T7 | CAGACGTGTGCTCTTCCGATCT | TGTGAATTGCARRATYCMG | |||
ITS-3p62plF1 | ITS-3T7p62plr1F1 | CAGACGTGTGCTCTTCCGATCT | AGA | ACBTRGTGTGAATTGCAGRATC | ||
ITS-3T7p62plr2F1 | CAGACGTGTGCTCTTCCGATCT | ATC | ACBTRGTGTGAATTGCAGRATC | |||
ITS-3T7p62plr3F1 | CAGACGTGTGCTCTTCCGATCT | GCG | ACBTRGTGTGAATTGCAGRATC | |||
58SPL | ITS-2Cote-T7 | CAGACGTGTGCTCTTCCGATCT | AG | TTTGAACGCAAGTTGCGCC | ||
LSU | reverse | UniPlantR | UniPlantR-T5 | CTACACGACGCTCTTCCGATCT | CCCGHYTGAYYTGRGGTCDC | |
ITS-4unR1 | ITS-4T5unR1 | CTACACGACGCTCTTCCGATCT | AG | TCCTCCGCTTATTKATATGC | ||
ITS4 | ITS-4T5unR2 | CTACACGACGCTCTTCCGATCT | AG | TCCTCCGCTTATTGATATGC | ||
ITS-u4 | ITS-u4-T5 | CTACACGACGCTCTTCCGATCT | RGTTTCTTTTCCTCCGCTTA | |||
ITS-p4 | ITS-p4-T5 | CTACACGACGCTCTTCCGATCT | CCGCTTAKTGATATGCTTAAA | |||
BEL-3 | S3R-T5 | CTACACGACGCTCTTCCGATCT | GACGCTTCTCCAGACTACAAT |
Sequencing primer and two-step PCR layout. Note: The template specific primers used in this example are for demonstration purposes only and vary in each unique PCR setup. Additional template (x) strands bending away from the primer sequence in the 1st PCR round demonstrate their non-complementarity.
classification | GC content [%] | read abundances (per 1.000 reads, median of replicates) per primer combination and presence / absence per replicate | |||||||||||||||||||
class | order family | genus species | ITS-3p62plF1 + ITS-4unR1 | UniPlantF + UniPlantR | UniPlantF + ITS-4unR1 | ITS-3p62plF1 + UniPlantR | 58SPL + ITS-4unR1 | ITS-u3 + ITS-u4 | ITS-p3 + ITS-p4 | ITS-S2F + BEL-3 | ITS3 + ITS4 | ||||||||||
Liliopsida | Asparagales Asparagaceae | Bellevalia trifoliata | 70 | 3.85 | 0.30 | 1.75 | 0.73 | 0.91 | 1.00 | 1.26 | 1.67 | 0.75 | |||||||||
Liliales Liliaceae | Gagea graeca | 65 | 182 | 0.57 | 71.1 | 0.64 | 43.3 | 61.5 | 169 | 25.3 | 36.9 | ||||||||||
Liliales Smilacaceae | Smilax aspera | 71 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.09 | 0.00 | ||||||||||
Poales Cyperaceae | Cyperus esculentus | 76 | 0.32 | 0.00 | 0.11 | 0.04 | 0.00 | 0.11 | 0.21 | 0.09 | 0.00 | ||||||||||
Schoenus nigricans | 70 | 2.47 | 0.16 | 1.55 | 0.17 | 0.50 | 0.65 | 1.53 | 0.24 | 0.28 | |||||||||||
Poales Poaceae | Briza maxima | 64 | 0.39 | 0.08 | 0.20 | 0.08 | 0.06 | 0.30 | 0.38 | 0.40 | 0.08 | ||||||||||
Magnolipsida | Apiales Apiaceae | Tordylium apulum | 56 | 22.5 | 7.85 | 9.32 | 9.09 | 5.18 | 6.06 | 10.8 | 6.05 | 2.83 | |||||||||
Brassicales Brassicaceae | Arabis verna | 52 | 163 | 429 | 95.5 | 435 | 166 | 23.1 | 115 | 91.7 | 17.7 | ||||||||||
Dipsacales Caprifoliaceae | Valerianella discoidea | 72 | 19.5 | 2.89 | 7.84 | 3.56 | 11.6 | 5.86 | 12.4 | 24.9 | 2.16 | ||||||||||
Ericales Ericaceae | Arbutus spec. # | 59 | 222 | 413 | 134 | 410 | 305 | 59.0 | 135 | 208 | 35.3 | ||||||||||
Ericales Ericaceae | Erica arborea | 56 | 1.19 | 4.31 | 0.91 | 4.96 | 1.50 | 0.29 | 1.15 | 0.18 | 0.16 | ||||||||||
Fabales Fabaceae | Anthyllis circinnata | 55 | 232 | 53.9 | 163 | 53.5 | 300 | 60.4 | 136 | 336 | 25.8 | ||||||||||
Gentianales Rubiaceae | Sherardia arvensis | 62 | 2.93 | 4.73 | 1.21 | 5.94 | 0.36 | 1.28 | 2.20 | 0.00 | 0.57 | ||||||||||
Laurales Lauraceae | Cinnamomum aromaticum | 76 | 0.06 | 0.00 | 0.00 | 0.05 | 0.00 | 0.06 | 0.00 | 0.09 | 0.00 | ||||||||||
Lindera obtusiloba | 77 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |||||||||||
Malpighiales Euphorbiaceae | Mercurialis annua | 58 | 46.0 | 8.22 | 27.1 | 7.82 | 34.4 | 12.1 | 23.1 | 16.5 | 13.8 | ||||||||||
Piperales Aristolochiaceae | Aristolochia guichardii | 76 | 0.85 | 0.41 | 0.50 | 0.36 | 0.38 | 0.18 | 0.69 | 0.19 | 0.22 | ||||||||||
Rosales Rosaceae | Prunus dulcis | 65 | 9.60 | 11.2 | 3.92 | 12.6 | 3.45 | 3.01 | 8.30 | 5.35 | 1.43 | ||||||||||
Sapindales Anacardiaceae | Pistacia lentiscus | 55 | 32.2 | 57.0 | 18.6 | 58.8 | 34.7 | 6.84 | 19.0 | 12.8 | 2.81 | ||||||||||
Solanales Solanaceae | Solanum citrullifolium | 77 | 0.56 | 0.19 | 0.21 | 0.42 | 0.26 | 0.16 | 0.18 | 0.09 | 0.07 | ||||||||||
Pinopsida | Pinales Pinaceae | Pinus strobus | 59 | 0.22 | 0.54 | 0.10 | 0.63 | 0.08 | 0.29 | 0.16 | 0.00 | 0.00 | |||||||||
sum of missed taxa (read abundance ≤ 0.1 or missing in any replicates) | |||||||||||||||||||||
3 | 7 | 6 | 5 | 6 | 6 | 6 | 7 | 9 | |||||||||||||
classification | GC content [%] | ITS-3p62plF1 + ITS-4unR1 | UniPlantF + UniPlantR | UniPlantF + ITS-4unR1 | ITS-3p62plF1 + UniPlantR | 58SPL + ITS-4unR1 | ITS-u3 + ITS-u4 | ITS-p3 + ITS-p4 | ITS-S2F + BEL-3 | ITS3 + ITS4 | |||||||||||
Lycopodiopsida | 67* | 48.7 | 0.00 | 32.8 | 0.00 | 2.37 | 139 | 360 | 257 | 53.3 | |||||||||||
Fungi | 35‒72 | 6.79 | 1.72 | 443 | 0.04 | 103 | 619 | 0.00 | 0.00 | 800 |
classification | GC content [%] | read abundances (per 1.000 reads, median) per primer combination and presence / absence per replicate |
|||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
class | order family | genus species | ITS-3p62plF1 + ITS-4unR1 | UniPlantF + UniPlantR | UniPlantF + ITS-4unR1 | ITS-3p62plF1 + UniPlantR | 58SPL + ITS-4unR1 | ITS-u3 + ITS-u4 | ITS-p3 + ITS-p4 | ITS-S2F + BEL-3 | ITS3 + ITS4 | ||||||||||
Liliopsida | Asparagales Asparagaceae | Maianthemum bifolium | 74 | 0.07 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.07 | 0.00 | |||||||||
Liliales Liliaceae | Gagea graeca | 64 | 183 | 0.62 | 64.3 | 0.92 | 45.6 | 64.4 | 234 | 34.7 | 32.1 | ||||||||||
Poales Cyperaceae | Schoenus nigricans | 70 | 2.48 | 0.16 | 1.52 | 0.16 | 0.49 | 0.46 | 2.70 | 0.38 | 0.23 | ||||||||||
Magnoliopsida | Apiales Apiaceae | Tordylium apulum | 56 | 28.9 | 9.94 | 10.1 | 10.8 | 6.67 | 7.39 | 19.4 | 8.99 | 3.03 | |||||||||
Asterales Asteraceae | Geropogon hybridus | 56 | 86.0 | 171 | 40.2 | 173 | 32.6 | 19.2 | 98.2 | 64.8 | 8.99 | ||||||||||
Brassicales Brassicaceae | Arabis verna | 52 | 201 | 560 | 101 | 535 | 210 | 25.2 | 187 | 158 | 18.0 | ||||||||||
Caryophyllales Caryophyllaceae | Silene colorata | 60 | 39.3 | 67.1 | 12.8 | 83.5 | 4.56 | 11.7 | 55.6 | 15.0 | 6.96 | ||||||||||
Caryophyllales Polygonaceae | Polygonum arenastrum | 77 | 4.75 | 16.5 | 2.10 | 23.8 | 3.47 | 0.96 | 4.27 | 2.19 | 0.51 | ||||||||||
Ericales Ericaceae | Calluna vulgaris | 62 | 1.32 | 3.94 | 1.04 | 3.70 | 2.15 | 0.33 | 0.71 | 0.38 | 0.14 | ||||||||||
Erica spec. | 56 | 1.39 | 5.39 | 0.73 | 5.33 | 1.38 | 0.28 | 0.89 | 0.38 | 0.23 | |||||||||||
Fabales Fabaceae | Anthyllis circinnata | 55 | 356 | 103 | 224 | 98.2 | 472 | 101 | 308 | 653 | 35.4 | ||||||||||
Fagales Fagaceae | Quercus ithaburensis | 68 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | ||||||||||
Laurales Lauraceae | Laurus nobilis | 75 | 0.29 | 0.00 | 0.09 | 0.05 | 0.00 | 0.04 | 0.18 | 0.22 | 0.00 | ||||||||||
Magnoliales Annonaceae | Asimina triloba | 79 | 0.15 | 0.00 | 0.00 | 0.00 | 0.00 | 0.06 | 0.00 | 0.00 | 0.00 | ||||||||||
Malpighiales Euphorbiaceae | Mercurialis annua | 58 | 51.8 | 11.0 | 26.3 | 8.83 | 43.6 | 14.2 | 36.6 | 30.8 | 13.8 | ||||||||||
Rosales Rosaceae | Prunus dulcis | 65 | 11.0 | 14.9 | 4.00 | 16.7 | 4.45 | 3.09 | 12.4 | 9.70 | 1.32 | ||||||||||
Solanales Convolvulaceae | Convolvulus siculus | 58 | 25.7 | 30.3 | 8.60 | 37.8 | 4.29 | 7.06 | 35.3 | 17.2 | 3.84 | ||||||||||
Pinopsida | Pinales Pinaceae | Pinus strobus | 59 | 0.35 | 0.65 | 0.21 | 0.74 | 0.05 | 0.27 | 0.34 | 0.15 | 0.00 | |||||||||
sum of missed taxa (read abundance ≤ 0.1 or missing in any replicates) | |||||||||||||||||||||
3 | 5 | 5 | 4 | 5 | 4 | 5 | 3 | 6 | |||||||||||||
classification | GC content [%] | ITS-3p62plF1 + ITS-4unR1 | UniPlantF + UniPlantR | UniPlantF + ITS-4unR1 | ITS-3p62plF1 + UniPlantR | 58SPL + ITS-4unR1 | ITS-u3 + ITS-u4 | ITS-p3 + ITS-p4 | ITS-S2F + BEL-3 | ITS3 + ITS4 | |||||||||||
Fungi | 35‒72 | 10.4 | 2.4 | 508 | 0.04 | 167 | 743 | 0.00 | 0.00 | 876 |
classification | GC [%] | required minimal read depth to achieve a detection with 95% probability (no singletons) [thousands] | |||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
class | order family | genus species | ITS-3p62plF1 + ITS-4unR1 | UniPlantF + UniPlantR | UniPlantF + ITS-4unR1 | ITS-3p62plF1 + UniPlantR | 58SPL + ITS-4unR1 | ITS-u3 + ITS-u4 | ITS-p3 + ITS-p4 | ITS-S2F + BEL-3 | ITS3 + ITS4 | ||||||||||
Liliopsida | Asparagales Asparagaceae | Bellevalia trifoliata | 70 | 1.3 | NA | 2.8 | 7 | 5 | 5.2 | 3.7 | 2.5 | 6 | |||||||||
Liliales Liliaceae | Gagea graeca | 65 | 0.1 | 7.5 | 0.1 | 6 | 0.2 | 0.1 | 0.1 | 0.2 | 0.2 | ||||||||||
Liliales Smilacaceae | Smilax aspera | 71 | NA | NA | NA | NA | NA | NA | NA | NA | NA | ||||||||||
Poales Cyperaceae | Cyperus esculentus | 76 | 13.5 | NA | NA | NA | NA | NA | NA | NA | NA | ||||||||||
Schoenus nigricans | 70 | 1.7 | NA | 3.7 | 22.2 | 8.3 | 8.1 | 2.9 | 16.9 | 15.1 | |||||||||||
Poales Poaceae | Briza maxima | 64 | 12.2 | NA | 21.7 | NA | NA | 13.1 | 10.5 | 12.6 | NA | ||||||||||
Magnolipsida | Apiales Apiaceae | Tordylium apulum | 56 | 0.3 | 0.6 | 0.5 | 0.5 | 0.9 | 0.7 | 0.5 | 0.7 | 1.7 | |||||||||
Brassicales Brassicaceae | Arabis verna | 52 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.2 | 0.1 | 0.1 | 0.3 | ||||||||||
Dipsacales Caprifoliaceae | Valerianella discoidea | 72 | 0.3 | 1.5 | 0.6 | 1.2 | 0.4 | 0.8 | 0.4 | 0.2 | 2.2 | ||||||||||
Ericales Ericaceae | Arbutus spec. # | 59 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.2 | ||||||||||
Ericales Ericaceae | Erica arborea | 56 | 4 | 1.1 | 5.5 | 0.9 | 2.9 | 14.5 | 4.6 | 28.9 | NA | ||||||||||
Fabales Fabaceae | Anthyllis circinnata | 55 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.2 | ||||||||||
Gentianales Rubiaceae | Sherardia arvensis | 62 | 1.5 | 1.1 | 3.7 | 0.8 | 20.7 | 3.4 | 2.1 | NA | 8.7 | ||||||||||
Laurales Lauraceae | Cinnamomum aromaticum | 76 | NA | NA | NA | NA | NA | NA | NA | NA | NA | ||||||||||
Lindera obtusiloba | 77 | NA | NA | NA | NA | NA | NA | NA | NA | NA | |||||||||||
Malpighiales Euphorbiaceae | Mercurialis annua | 58 | 0.1 | 0.6 | 0.2 | 0.6 | 0.2 | 0.4 | 0.3 | 0.4 | 0.4 | ||||||||||
Piperales Aristolochiaceae | Aristolochia guichardii | 76 | 5.5 | 11.2 | 8.1 | 11.9 | 12.4 | 17.7 | 6.6 | 18.9 | 22.4 | ||||||||||
Rosales Rosaceae | Prunus dulcis | 65 | 0.5 | 0.5 | 1.2 | 0.4 | 1.3 | 1.5 | 0.6 | 0.9 | 3.4 | ||||||||||
Sapindales Anacardiaceae | Pistacia lentiscus | 55 | 0.2 | 0.1 | 0.3 | 0.1 | 0.2 | 0.7 | 0.3 | 0.4 | 1.8 | ||||||||||
Solanales Solanaceae | Solanum citrullifolium | 77 | 8.3 | 22.1 | NA | 12.1 | 20.3 | NA | NA | 33.9 | NA | ||||||||||
Pinopsida | Pinales Pinaceae | Pinus strobus | 59 | 16.4 | 8.4 | NA | 6.6 | NA | NA | NA | NA | NA |
classification | GC [%] | required minimal read depth to achieve a detection with 95% probability (no singletons) [thousands] | |||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
class | order family | genus species | ITS-3p62plF1 + ITS-4unR1 | UniPlantF + UniPlantR | UniPlantF + ITS-4unR1 | ITS-3p62plF1 + UniPlantR | 58SPL + ITS-4unR1 | ITS-u3 + ITS-u4 | ITS-p3 + ITS-p4 | ITS-S2F + BEL-3 | ITS3 + ITS4 | ||||||||||
Liliopsida | Asparagales Asparagaceae | Maianthemum bifolium | 74 | NA | NA | NA | NA | NA | NA | NA | NA | NA | |||||||||
Liliales Liliaceae | Gagea graeca | 64 | 0.1 | 6.5 | 0.1 | 5 | 0.1 | 0.1 | 0.1 | 0.2 | 0.2 | ||||||||||
Poales Cyperaceae | Schoenus nigricans | 70 | 2 | NA | 2.9 | 22.3 | 9.6 | 9.4 | 1.7 | 11.7 | 16.8 | ||||||||||
Magnoliopsida | Apiales Apiaceae | Tordylium apulum | 56 | 0.2 | 0.5 | 0.5 | 0.5 | 0.8 | 0.6 | 0.3 | 0.5 | 1.4 | |||||||||
Asterales Asteraceae | Geropogon hybridus | 56 | 0.1 | 0.1 | 0.1 | 0.1 | 0.2 | 0.3 | 0.1 | 0.1 | 0.5 | ||||||||||
Brassicales Brassicaceae | Arabis verna | 52 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.2 | 0.1 | 0.1 | 0.3 | ||||||||||
Caryophyllales Caryophyllaceae | Silene colorata | 60 | 0.2 | 0.1 | 0.4 | 0.1 | 1.1 | 0.4 | 0.1 | 0.4 | 0.7 | ||||||||||
Caryophyllales Polygonaceae | Polygonum arenastrum | 77 | 1 | 0.3 | 2.1 | 0.2 | 1.4 | 4.2 | 1.2 | 2 | 9.3 | ||||||||||
Ericales Ericaceae | Calluna vulgaris | 62 | 3.3 | 1.2 | 4.4 | 1.2 | 2.2 | 13.9 | 7.1 | 11.4 | NA | ||||||||||
Erica spec. | 56 | 3.2 | 0.9 | 5.4 | 0.9 | 3 | 16.9 | 4.5 | 14.6 | 19.8 | |||||||||||
Fabales Fabaceae | Anthyllis circinnata | 55 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.2 | ||||||||||
Fagales Fagaceae | Quercus ithaburensis | 68 | NA | NA | NA | NA | NA | NA | NA | NA | NA | ||||||||||
Laurales Lauraceae | Laurus nobilis | 75 | 19 | NA | NA | NA | NA | NA | NA | 25.8 | NA | ||||||||||
Magnoliales Annonaceae | Asimina triloba | 79 | NA | NA | NA | NA | NA | NA | NA | NA | NA | ||||||||||
Malpighiales Euphorbiaceae | Mercurialis annua | 58 | 0.1 | 0.5 | 0.2 | 0.5 | 0.1 | 0.4 | 0.2 | 0.2 | 0.4 | ||||||||||
Rosales Rosaceae | Prunus dulcis | 65 | 0.5 | 0.4 | 1.2 | 0.3 | 1.1 | 1.5 | 0.4 | 0.5 | 3.5 | ||||||||||
Solanales Convolvulaceae | Convolvulus siculus | 58 | 0.2 | 0.2 | 0.6 | 0.2 | 1.1 | 0.6 | 0.2 | 0.3 | 1.3 | ||||||||||
Pinopsida | Pinales Pinaceae | Pinus strobus | 59 | 16.2 | 6.8 | NA | 6.1 | NA | 15.7 | NA | 31.2 | NA |