Research Article |
Corresponding author: Markus Majaneva ( markus.majaneva@ntnu.no ) Academic editor: Michal Grabowski
© 2018 Markus Majaneva, Ola H. Diserud, Shannon H.C. Eagle, Mehrdad Hajibabaei, Torbjørn Ekrem.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Majaneva M, Diserud OH, Eagle SHC, Hajibabaei M, Ekrem T (2018) Choice of DNA extraction method affects DNA metabarcoding of unsorted invertebrate bulk samples. Metabarcoding and Metagenomics 2: e26664. https://doi.org/10.3897/mbmg.2.26664
|
Characterisation of freshwater benthic biodiversity using DNA metabarcoding may allow more cost-effective environmental assessments than the current morphological-based assessment methods. DNA metabarcoding methods where sorting or pre-sorting of samples are avoided altogether are especially interesting, since the time between sampling and taxonomic identification is reduced. Due to the presence of non-target material like plants and sediments in crude samples, DNA extraction protocols become important for maximising DNA recovery and sample replicability. We sampled freshwater invertebrates from six river and lake sites and extracted DNA from homogenised bulk samples in quadruplicate subsamples, using a published method and two commercially available kits: HotSHOT approach, Qiagen DNeasy Blood & Tissue Kit and Qiagen DNeasy PowerPlant Pro Kit. The performance of the selected extraction methods was evaluated by measuring DNA yield and applying DNA metabarcoding to see if the choice of DNA extraction method affects DNA yield and metazoan diversity results. The PowerPlant Kit extractions resulted in the highest DNA yield and a strong significant correlation between sample weight and DNA yield, while the DNA yields of the Blood & Tissue Kit and HotSHOT method did not correlate with the sample weights. Metazoan diversity measures were more repeatable in samples extracted with the PowerPlant Kit compared to those extracted with the HotSHOT method or the Blood & Tissue Kit. Subsampling using Blood & Tissue Kit and HotSHOT extraction failed to describe the same community in the lake samples. Our study exemplifies that the choice of DNA extraction protocol influences the DNA yield as well as the subsequent community analysis. Based on our results, low specimen abundance samples will likely provide more stable results if specimens are sorted prior to DNA extraction and DNA metabarcoding, but the repeatability of the DNA extraction and DNA metabarcoding results was close to ideal in high specimen abundance samples.
Macroinvertebrates, Biomonitoring, Freshwater
Modern biomonitoring and water quality assessments of freshwater ecosystems rely on standardised sampling and identification of benthic macroinvertebrates (
In DNA metabarcoding, short homologous DNA fragments – e.g. part of the cytochrome c oxidase I (COI) gene – from bulk samples are sequenced in parallel and the resulting reads are matched against a reference library to identify the taxa present in the samples (
Efficient sample homogenisation and DNA extraction is a prerequisite for any successful DNA-based study. Conventionally, DNA has been extracted using desalting and organic solvents like phenol-chloroform (
Extensive scientific literature on comparisons of DNA extraction methods exists, each piece of literature concentrating on different target taxa and environments and showing differences in DNA yield and PCR success amongst methods (e.g.
Most freshwater DNA metabarcoding studies rely on separating biomass of organisms from the non-target organic and inorganic material and on grinding the organisms before adding lysis buffer (e.g.
In various soil studies, DNA has been successfully extracted directly, without first separating organisms (
Here, we compared the success of DNA metabarcoding for assessing a set of unsorted freshwater benthic macroinvertebrate samples, processed using different DNA extraction methods. Rather than being an exhaustive comparison, we chose to compare methods that have low toxicity, are easy to use and require little handling time. Thus, no conventional time-consuming or toxic desalting and organic solvent-based methods were included.
Samples were collected from three sites in Norway: River Atna at Skranglehaugen (10 August 2015, 61.98186°N, 09.80454°E, 1117 m above sea level) and at Vollen (10 August 2015, 61.98471°N, 10.02823°E, 720 m above sea level), as well as Lake Jonsvatn at Jonsborg (28 September 2015, 63.39569°N, 10.55370°E, 150 m above sea level). River Atna originates in the Rondane National Park in Central Norway and is a well-documented Nordic freshwater ecosystem (
Four minute benthic kick samples were collected from the River Atna (
Experimental setup. The 4-minute kick samples were collected from two sites along the River Atna and the Van Veen grab samples were collected from a depth gradient in the Lake Jonsvatn. From each sample, 12 subsamples were taken and DNA was extracted using three extraction methods (four subsamples per extraction method); HotSHOT extraction, Qiagen DNeasy Blood & Tissue Kit and MO BIO PowerPlant Pro Kit. An extraction blank was done for each extraction method.
Subsample sizes (a) and DNA yields (b). Two-way ANOVA followed by Tukey’s HSD was used to test differences amongst the extraction methods and F-statistic value (F) and significance (p) are given. The small letters denote significantly different extraction methods at each sampling site based on site and extraction interaction factors.
The subsamples were randomly divided into three groups exposed to three different DNA extraction protocols (Fig.
We amplified three fragments of the mitochondrial COI gene. The full barcode region (approximately 648 bp) was amplified, using the standard
The first-round PCRs (Suppl. material
The resulting raw amplicon reads (available in the ENA SRA repository with study name PRJEB26589) were processed, using mothur v.1.36.1 (
The OTUs were assigned taxonomically in two steps. First, they were searched against the NCBI non-redundant nucleotide database, using the BLAST 2.3.0+ (
To find statistically significant differences (p < 0.05) in DNA yield and in metazoan diversity amongst sites, one-way Kruskal-Wallis followed by pair-wise comparisons was used. To find significant differences in subsample size, in DNA yield and in metazoan diversity amongst different DNA extraction methods, two-way ANOVA followed by Tukey’s pair-wise comparisons was used. Partitioned pair-wise betadiversity values (βsim and βsne,
Total DNA was extracted from 28–131 mg subsamples, four subsamples/extraction method from each site (Figs
The DNA yield was lower in 15 m, 7.5 m and 2 m samples (42–495 ng) than in 0.25 m, Skranglehaugen and Vollen samples (169–4280 ng; Kruskal-Wallis χ2=50.99, p < 0.001, pair-wise Mann-Whitney with sequentially Bonferroni corrected p < 0.05; Fig.
A total of 10.3, 9.8 and 1.3 million reads were generated from the F230, BE and full barcode fragment, respectively (Suppl. material
The number of DNA species was the highest using PowerPlant Kit in the 15 m and 2 m samples, using Blood & Tissue Kit in the 7.5 m and Vollen samples and using HotSHOT extraction in the Skranglehaugen samples (Fig.
Number of DNA species in each subsample. Two-way ANOVA followed by Tukey’s HSD was used to test differences amongst the extraction methods and F-statistic value (F) and significance (p) are given. The small letters denote significantly different extraction methods at each sampling site based on site and extraction interaction factors.
The portion of shared DNA species amongst the extraction methods was lower for the 15 m, 7.5 m and 2 m samples (Fig.
The number of shared DNA species amongst extraction methods illustrated using Venn diagrams (a–c; g–i) and pairwise beta diversity values within extraction methods at each site measured as Sørensen dissimilarity (βsor, which is partitioned to nestedness, βsne and species turnover, βsim) (d–f; j–l), lines denote group means and 95% confidence intervals for βsor and βsne. The small letters denote significantly different extraction methods at each sampling site based on non-overlapping 95% confidence intervals for βsor and βsne.
When evaluating repeatability of extraction methods in more detail, based on the bivariate Poisson-lognormal correlations, the PowerPlant Kit outperformed both Blood & Tissue Kit and HotSHOT extraction in most sites (Fig.
Bivariate correlation values, using the full dataset. Circles denote values for 6 pairs of samples within each extraction method, lines denote group means and 95% confidence intervals and all 66 bivariate correlation values within the sampling site are summarised in the box plots. The small letters denote significantly different extraction methods at each sampling site based on non-overlapping 95% confidence intervals.
In this study, we compared three DNA extraction methods on unsorted bulk samples of lentic and lotic macroinvertebrates. DNA metabarcoding, based on unsorted samples, holds great potential for large scale monitoring of freshwaters because of reduced sample processing time, especially for samples with a high invertebrate to debris ratio. Our results show that the DNA extraction method employed influences the success of DNA metabarcoding for assessing macroinvertebrate biodiversity and is dependent on environmental factors such as the benthic substrate, type of vegetation and specimen abundance.
Our sampling sites differed markedly in their physical and biological composition. The Vollen and Skranglehaugen sites are characterised by abundant boreal and alpine species living in variable lotic microhabitats (
All DNA extraction methods tested might have been affected by PCR inhibition because of the presence of non-target organic matter in our bulk samples. However, the lotic samples suffered less than the lentic samples from inhibition, probably because the plant material present in the lotic samples was mainly bryophytes that have a relatively low amount of phenolic compounds (
Some samples were particularly difficult in terms of PCR inhibitors. In the 15 m lake sample, where also the PowerPlant Kit failed to produce DNA extracts unaffected by PCR inhibition, each DNA extraction method resulted in some close to zero and negative bivariate correlation values (Fig.
Often only presence/absence data are used for community studies based on DNA metabarcoding due to the low reliability of abundance data (
Based on our results, all tested DNA extraction methods appear suitable for large, high-abundance unsorted samples, particularly from lotic systems. Smaller samples with fewer specimens are less time-consuming to sort and likely provide more stable results if PCR-inhibiting substances are physically removed prior to DNA extraction, although DNA extraction with the inhibitor removal step performed significantly better in the case of low-abundance samples. In conclusion, we urge for continued testing of homogenised unsorted bulk samples and comparison with results from presorted samples since time spent on sorting samples might be more efficiently used for monitoring more localities using DNA metabarcoding.
Thanks to Elisabeth Stur and Karstein Hårsaker for field assistance and Rachel Smith for laboratory assistance. The paper is part of the project ‘Environmental Barcoding of Aquatic Invertebrates (EBAI)’ funded by the Research Council of Norway (project no. 243791/E50) and the Norwegian Environment Agency (project no. 15040013). This publication contributes to the EU COST Action DNAqua-Net (EU COST Action CA15219).
Fragment and working PCR conditions.
Final species table.
Number of reads per sample.
Diversity analyses.
Methods (Scripts)