Academic editor: Dirk Steinke
© 2019 Jonas Bylemans, Dianne M. Gleeson, Mark Lintermans, Christopher M. Hardy, Matthew Beitzel, Dean M. Gilligan, Elise M. Furlan.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Bylemans J, Gleeson DM, Lintermans M, Hardy CM, Beitzel M, Gilligan DM, Furlan EM (2018) Monitoring riverine fish communities through eDNA metabarcoding: determining optimal sampling strategies along an altitudinal and biodiversity gradient. Metabarcoding and Metagenomics 2: e30457. https://doi.org/10.3897/mbmg.2.30457
|
Monitoring aquatic biodiversity through DNA extracted from environmental samples (eDNA) combined with high-throughput sequencing, commonly referred to as eDNA metabarcoding, is increasing in popularity within the scientific community. However, sampling strategies, laboratory protocols and analytical pipelines can influence the results of eDNA metabarcoding surveys. While the impact of laboratory protocols and analytical pipelines have been extensively studied, the importance of sampling strategies on eDNA metabarcoding surveys has not received the same attention. To avoid underestimating local biodiversity, adequate sampling strategies (i.e. sampling intensity and spatial sampling replication) need to be implemented. This study evaluated the impact of sampling strategies along an altitudinal and biodiversity gradient in the upper section of the Murrumbidgee River (Murray-Darling Basin, Australia). An eDNA metabarcoding survey was used to determine the local fish biodiversity and evaluate the influence of sampling intensity and spatial sampling replication on the biodiversity estimates. The results show that optimal eDNA sampling strategies varied between sites and indicate that river morphology, species richness and species abundance affect the optimal sampling intensity and spatial sampling replication needed to accurately assess the fish biodiversity. While the generality of the patterns will need to be confirmed through future studies, these findings provide a basis to guide future eDNA metabarcoding surveys in river systems.
Environmental DNA, eDNA, Metabarcoding, Sampling, Fish
Robust methods for monitoring species biodiversity are the fundamental basis for ecological research and environmental management. High-Throughput Sequencing (HTS) of PCR amplicons derived from environmental DNA (eDNA), commonly referred to as eDNA metabarcoding, is becoming an increasingly popular tool for such monitoring surveys (
The popularity of eDNA-based monitoring of aquatic biodiversity has increased dramatically since the first published study (
Aquatic biodiversity estimates obtained from conventional monitoring surveys are known to be influenced by the type of sampling method used (
It can be expected that in freshwater lotic systems, the importance of eDNA sampling strategies (i.e. sampling intensity and spatial sampling replication) will increase in higher order streams. Firstly, it is well known that in lotic systems the fish species richness changes along an altitudinal gradient with higher order streams (i.e. lower altitude) supporting a higher species richness due to an increased habitat size and/or habitat diversity (
Here, we hypothesize that both the species richness and the river channel morphology (e.g. river width, depth and water flow) will influence the optimal sampling strategies (i.e. sampling intensity and spatial sampling replication) for eDNA metabarcoding surveys. To test this, we used eDNA metabarcoding to assess the fish biodiversity at five sites along an altitudinal and biodiversity gradient within a single catchment. A systematic sampling approach was used at each site to evaluate the impact of sampling strategies on the observed species richness derived from eDNA metabarcoding data.
This study examines the fish biodiversity within a single catchment of the Murray-Darling Basin (MDB) (Australia). River morphology, abiotic water conditions and fish biodiversity vary along an altitudinal gradient in the MDB (
To obtain a measure of expected fish biodiversity, which is required to evaluate the overall performance of the eDNA metabarcoding survey, an expert opinion survey was conducted. Additionally, conventional boat electrofishing data was available for the three most downstream sites. A detailed description of the expert opinion survey data and the electrofishing data is available in the supplementary materials (Suppl. material
Environmental DNA samples (i.e. twelve 2 L water samples) were collected from each site between the 3rd and 8th of November 2016 (only eleven samples were available from the MR01 site as one of the sampling bottles broke). Potential contaminating DNA was removed from sampling equipment prior to collecting water samples, using a 20% bleach solution and thoroughly rinsing with UV-sterilized tap water. One blank field control (BFC) was included for each site and consisted of a 2 L sampling bottle filled with UV-sterilized water which was opened on site, closed and submerged in the water. At each site, samples were collected over four transects across the river width spanning a 100 m river section (i.e. ca. 33 m distance between transects). At the MR01 site samples were collected by wading in the river while for the remaining sites all samples were collected from a canoe. Along each transect surface water samples were collected from both the left and right river banks (i.e. within 1m of the river bank) and the mid-channel. Samples were stored on ice, transported to the University of Canberra (ACT, Australia) and eDNA was captured within 12 h using a 1.2 µm glass fibre filter (Sartorius, Göttingen, Germany). Filtering equipment was cleaned between samples as described above and negative equipment controls (NEC) were obtained by filtering 500 mL of UV-sterilized tap water prior to processing water samples. Filters were stored at -20 °C until eDNA extractions were performed at the trace DNA laboratory (University of Canberra) using the PowerWater DNA Extraction Kit (MoBio Laboratories, Carlsbad, USA). For each sampling site one BFC and two NEC were included in the batch DNA extractions to monitor contamination and all eDNA extracts were stored at -20 °C for further analyses.
PCR amplification and the construction of the HTS libraries was done using the MiFish-U universal fish primers (
Trimmomatic v.0.36 was used to trim technical sequences (i.e. sequencing adaptors and primers) from the sequencing reads (
Further filtering and analyses of the metabarcoding data was done using the packages tidyverse (
The overall performance of the eDNA metabarcoding survey was evaluated by comparing the observed species richness with the estimated species richness obtained from both the expert opinion survey and the electrofishing data (Suppl. material
To determine the impact of sampling intensity and spatial sampling replication the eDNA metabarcoding data was used to construct species accumulation curves (SAC), the total estimates species richness (Sest) and the number of samples required to detect 95% of Sest was determined. The data obtained from the MR01 site was excluded from the analyses as no variation in species detections were observed between samples. For the remaining sites, two sampling strategies were evaluated with the analyses using all available sample replicates or only those samples collected from the river-banks (i.e. LB and RB). The community data was first converted to presence/absence data and species accumulation curves (SAC) were constructed for each site and sampling strategy using the inext function (R package inext) (
As sequencing depth (i.e. the number of reads per sample) could also influence the detection of species from metabarcoding analyses, the metabarcoding data with absolute read counts was used to determine the impact of sequencing depth. A custom R script relying on the R package vegan was used to first rarefy the community data to simulate different sequencing depths (i.e. 10,000; 20,000 and 40,000 reads per sample) (
A permutational multivariate analysis of variance was used for a more in-depth evaluation of the impact of spatial sampling replication. Metabarcoding results were transformed to presence/absence data and community level differences between sampling transects and locations (i.e. left bank, mid-channel and right bank) were assessed using the adonis function within the R package vegan (
After assigning the sequence reads to their respective samples an average of 31,365 sequence reads were obtained per sample with a minimum and maximum sequencing depth of 6,907 and 52,742 reads per sample, respectively. The overall quality of run was high (PhredQ30 score ≥ 91.17). The effect of the bioinformatics filtering processes on the number of sequencing reads for each sample is shown in the supplementary materials (Suppl. material
The estimated/observed species richness for each sampling site and each survey method (i.e. expert opinion survey, electrofishing data and eDNA metabarcoding survey) is given in Figure
The SAC constructed for each site using all samples and the river-bank samples only show that the number of samples needed to characterize the species community generally increases with decreasing altitude and an increase in species richness (Figure
Species accumulation curves (SAC) (A–B) and rank abundance curves (RAC) (C) for the four most downstream sampling sites in the Murrumbidgee River. SAC were constructed to compare two different sampling strategies (A) (i.e. using all available samples and only samples collected from the river-banks) and different sequencing depths (B). The vertical dashed lines in panel A show the number of samples needed to detect ≥ 95% of the estimated species richness (i.e. based on the Chao2 estimates) for each site and sampling strategy.
The results from the SAC derived from the data with different simulated sequencing depths show that, for most sites, increasing sequencing depth only moderately improves species detections (Figure
The results of the analysis of variance of the community data reveal the need for spatial sampling replication at low altitude sites with a high species richness and is consistent with the results of the SAC. The analysis of variance of the community data showed that sampling location had a significant effect on the community data (R2 = 0.01931, p-value = 0.043) while sampling transect did not (R2 = 0.00618, p-value = 0.876). The results show that the overall community dissimilarity between sampling locations is relatively low for the MR02 site compared to the three most downstream sites (Figure
Overall dissimilarity between the fish community data obtained from the samples collected from the left bank (LB), mid-channel (MC) and right bank (RB) for the different sampling sites (A). The heat map shows the average contribution of each species to the overall community dissimilarity (B).
Data obtained here shows that optimal sampling strategies vary along altitudinal and biodiversity gradients. These findings are unlikely to be exclusive to our study system as the increase in fish species richness with decreasing altitude is a global pattern (
Comparing the observed species richness from both electrofishing data and eDNA metabarcoding data clearly shows that standard electrofishing surveys are likely to underestimate the true species richness. Overall, the results show that eDNA metabarcoding detects approximately double the species richness compared to the electrofishing data (Figure
While previous studies have found that sampling replication will improve the species richness estimates of eDNA metabarcoding surveys (
The current study has shown that the sampling design can have a profound effect on the performance of eDNA metabarcoding survey in riverine systems. However, more research is needed to determine the universality of the patterns described here. Additionally, temporal sampling strategies will also need to be considered in future studies as eDNA concentrations undergo seasonal fluctuations (
The summarized data of the electrofishing, expert opinion and eDNA metabarcoding surveys are available in the Data directory of the Suppl. material
We wish to thank the Invasive Animals Cooperative Research Centre (grant/award number: 1.W.2) and the Holsworth Wildlife Research Endowment (grant/award number: 164) for funding this research. Finally, we wish to acknowledge Dr. Owen S. Wangensteen and Dr. Dirk Steinke for the valuable feedback on the previous version of the manuscript.