Research Article |
Corresponding author: Yiyuan Li ( yli19@nd.edu ) Academic editor: Michael T. Monaghan
© 2018 Yiyuan Li, Nathan T. Evans, Mark A. Renshaw, Christopher L. Jerde, Brett P. Olds, Arial J. Shogren, Kristy Deiner, David M. Lodge, Gary A. Lamberti, Michael E. Pfrender.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Li Y, Evans NT, Renshaw MA, Jerde CL, Olds BP, Shogren AJ, Deiner K, Lodge DM, Lamberti GA, Pfrender ME (2018) Estimating fish alpha- and beta-diversity along a small stream with environmental DNA metabarcoding. Metabarcoding and Metagenomics 2: e24262. https://doi.org/10.3897/mbmg.2.24262
|
Environmental DNA (eDNA) metabarcoding has been increasingly applied to biodiversity surveys in stream ecosystems. In stream networks, the accuracy of eDNA-based biodiversity assessment depends on whether the upstream eDNA influx affects downstream detection. Biodiversity assessment in low-discharge streams should be less influenced by eDNA transport than in high-discharge streams. We estimated α- and β-diversity of the fish community from eDNA samples collected in a small Michigan (USA) stream from its headwaters to its confluence with a larger river. We found that α-diversity increased from upstream to downstream and, as predicted, we found a significant positive correlation between β-diversity and physical distance (stream length) between locations indicating species turnover along the longitudinal stream gradient. Sample replicates and different genetic markers showed similar species composition, supporting the consistency of the eDNA metabarcoding approach to estimate α- and β-diversity of fishes in low-discharge streams.
eDNA, biodiversity assessment, biomonitoring, α-diversity, β-diversity, high throughput sequencing
Freshwater fishes provide vital ecosystem services to humans including protein, recreation (
Fish diversity in rivers is traditionally assessed using a variety of census methods including netting, trapping and electrofishing. The accuracy of these methods relies heavily on the intensity of sampling efforts and the susceptibility of species and individuals to capture (
An alternative approach to traditional capture-based survey methods is the use of environmental DNA (eDNA) metabarcoding to estimate fish biodiversity. Environmental DNA is the DNA that organisms shed into the environment through mechanisms such as sloughed tissue, faeces, exudates and decay of carcasses (
In eDNA metabarcoding studies, α-diversity (i.e. the number of species within a defined location) and β-diversity (i.e. the species turnover between locations) are often used to quantify biodiversity patterns. Due to the longitudinal nature of river ecosystems and the flow-dependent movement of eDNA, a reliable estimation of α- and β-diversity in rivers requires an understanding of the eDNA transport along the longitudinal stream gradient of the catchment. In a lotic environment, the eDNA transport distance can vary with discharge (Q, volume water per time), rate of release of DNA into the environment, characteristics of eDNA, entrapment of eDNA particles within benthic substrates and biofilms (
To test the relationship between β-diversity and longitudinal distance and the utility of eDNA to describe α- and β-diversity in systems with relatively low discharge, we surveyed freshwater fish diversity using eDNA along the course of a small stream in Michigan, USA from its headwaters to its confluence with a larger river. We hypothesised that β-diversity between sampling sites would correlate with the longitudinal distance between sites.
We collected water samples for eDNA isolation from Eagle Creek, a small first-order stream (Q = 0.1-324 L/s) in the Kalamazoo River watershed that flows approximately 12 km from its source in the Fort Custer Training Center (Michigan Army National Guard) to its confluence with the Kalamazoo River (42.3374N, 85.3377W) near the town of Augusta, Michigan. We sampled six locations in Eagle Creek and two sites in the Kalamazoo River (Figure
Samples were collected on 29 April 2015, between 0940 and 1230 h. We sampled from the furthest downstream site (Location 8) to the most upstream site (Location 1) (Figure
After collecting the water samples, we recorded the latitude and longitude coordinates for each sampling location using a handheld global positioning system (Garmin, Olathe, Kansas, USA). We then measured turbidity, dissolved oxygen (D.O.), pH, water temperature and flow velocity at each site using the cross-sectional approach (
Map of sampling locations along Eagle Creek. Location 1 is the most upstream sampling location in Eagle Creek. Location 6 is the most downstream sampling location in Eagle Creek. Locations 7 and 8 are in the Kalamazoo River. Location 7 is upstream of the confluence of Eagle Creek whereas Location 8 is downstream of the confluence.
In the laboratory within 6 hours of sample collection, water samples were vacuum-filtered on to 47 mm, 1.2-µm pore size polycarbonate membrane filters (EMD Millipore, Billerica, Massachusetts, USA). The filters were placed in 2.0 ml microcentrifuge tubes containing 700 µl of CTAB and stored at -20 °C as described by
To check for and quantify contamination during the experimental process, we included a series of field controls, mock communities and PCR negative controls (Suppl. material
The 3 replicates from each sampling location (24 samples in total), 4 mock communities and a single PCR negative control were PCR amplified for 3 mitochondrial gene regions. The three PCR primer sets were used in independent reactions to amplify segments of the mitochondrial 16S rRNA gene (L1865/H2195, hereafter Ac16s), 12S rRNA gene (L909/H1155, hereafter Am12s) and Cytochrome B gene (L14912/H15149c, hereafter CytB) (
Amplified DNA from the first PCR were electrophoresed through a 2% agarose gel, stained with ethidium bromide and then visualised on a UV light platform. Each amplified product was manually excised from the gels using single-use razor blades, cleaned with a QIAquick Gel Extraction Kit (Qiagen, Venlo, Netherlands) and eluted from spin columns with 30µl of Buffer EB (Qiagen, Venlo, Netherlands). Regardless of visual confirmation of amplification, for both the mock communities and PCR negative control, we excised a band from the agarose gel at the expected amplicon size. For each sample, the gel extracted PCR products from the three gene regions were combined in proportions determined from previous work with these markers to approximate equal read production across genes (Am12S = 2.5 ng; Ac16S = 3.75 ng; CytB = 18.75 ng). In a second PCR, the Illumina adaptor sequence was added, including individualised library barcodes per sample. The DNA concentration of each library was quantified via Qubit dsDNA HS Assay. Libraries were pooled in equal molar concentrations along with 25% PhiX DNA (v3, Illumina, San Diego, California, USA) and then 300-bp paired-end sequenced on an Illumina MiSeq in two MiSeq flow cells in the University of Notre Dame’s Genomics and Bioinformatics Core Facility (http://genomics.nd.edu/) using a MiSeq Reagent Kit v3 (600-cycle; Illumina, San Diego, California, USA).
Bioinformatics analysis followed
No observable bands (PCR products) were detected on agarose gels in the expected size ranges for the eight field negative controls. Therefore, the eight field negative controls were removed from further analysis. After species assignment, potential contamination was removed based on the threshold correction method (
To test whether the biodiversity patterns varied significantly between gene regions, an unfiltered and a filtered dataset were used for each region. A species (OTU) was considered detected if its DNA was found in any sample replicate or any marker from a site. This single occurrence criterion was used in the comparison of different gene regions since the “at least two markers” criterion in the moderate bioinformatics stringency (
Based on the sampling effort (three replicates for each site), the potential biodiversity in a relatively small environment and detection thresholds from previous studies, we utilised a moderate bioinformatics stringency (
All analyses of α- and β-diversity were conducted in R version 3.3.2 (
The correlation between β-diversity matrices and the longitudinal distance matrix was tested with a Mantel test using “mantel.rtest” command in the R package “ade4” version 1.7-6 (Dray et al. 2007). Since Location 7 in the Kalamazoo River was upstream of the Eagle Creek confluence, it was not included in the correlation analysis between biodiversity and linear stream distance.
To evaluate the biodiversity detected by each replicate, the β-diversity between replicates was also calculated with “vegdist” based on Jaccard distance. Differences in β-diversities within and between the sample locations were tested using the Multi-Response Permutation Procedure (MRPP) in the R package “vegan” version 2.4-2 (
The physical properties, including discharge and longitudinal distance from the headwaters of each sampling location along Eagle Creek are summarised in Suppl. materials
Illumina MiSeq sequencing generated 20.9 million raw reads. After filtering, merging and quality control, 2.1 million, 3.9 million and 3.0 million reads were used for downstream analysis for Ac16S, Am12S and CytB markers, across the total dataset (Table
The identifications of fish species were refined based on their known geographic distributions. For example, the Ac16S and CytB markers were not diagnostic between Umbra limi and Umbra pygmaea. Since only U. limi is known from Michigan, all OTUs with an assignment to an Umbra spp. were assigned to Umbra limi. Similarly, the three gene regions were not diagnostic for the two closely related species, Ambloplites ariommus and Ambloplites rupestris. As only A. rupestri is found in Michigan, Ambloplites spp. was assigned to A. rupestri. The Am12S marker could also not distinguish between Catostomus commersonii and Catostomus catostomus. Since both are found in Michigan, the identification of these species was grouped at the generic level as Catostomus spp. Notropis topeka were removed as these species are not found in Michigan or nearby regions (
Three gene regions were used for estimating α-diversity. When including all the OTUs (i.e. unfiltered dataset), the Am12S region detected the highest number of OTUs. When including only species-level assignments (i.e. filtered dataset), Ac16S detected the highest number of OTUs (Table
Venn diagram of number of species detected by each genetic marker based on filtered datasets.
Ac16s | Am12s | Cytb | |
---|---|---|---|
Number of reads after quality control | 2,137,148 | 3,861,079 | 3,010,080 |
Number of unique reads | 208,072 | 131,601 | 201,606 |
Number of unique reads without singletons | 53,645 | 46,396 | 43,363 |
Number of OTUs | 54 | 93 | 36 |
Number of OTUs after HMM filtering | 43 | 79 | 36 |
Number of OTUs after contamination removal (unfiltered dataset) |
29 | 33 | 15 |
Number of OTUs with species-level assignment (filtered dataset) |
20 | 15 | 9 |
Number of species (all three genetic markers combined) | 23 |
Different gene regions showed similar β-diversity patterns. For both the filtered and unfiltered data sets, β-diversity estimates, based on single gene regions and three markers, were positively correlated with each other. All estimates of β-diversity and longitudinal distance were positively correlated except between the Ac16S-filtered estimates and distance (all the r statistics and p-values are available in Suppl. material
Based on the combined signal of all three genetic markers, 23 fish species were considered detected in the 6 sampling locations in Eagle Creek and 2 sampling locations in Kalamazoo River (Table
Fish species presence along Eagle Creek and in the Kalamazoo River detected using three genetic markers (Ac16S, Am12S and Cytb). Each column is a sampling location. Each row is the presence (x) or absence of each species.
Sampling locations along Eagle Creek (1-6) and Kalamazoo River (7-8) | ||||||||
---|---|---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | |
Ambloplites rupestris | x | x | x | x | ||||
Ameiurus natalis | x | x | x | x | x | x | ||
Amia calva | x | x | ||||||
Cyprinella spiloptera | x | |||||||
Erimyzon sucetta | x | |||||||
Esox americanus | x | |||||||
Etheostoma caeruleum | x | x | x | |||||
Etheostoma exile | x | x | ||||||
Etheostoma nigrum | x | x | x | x | ||||
Hypentelium nigricans | x | x | ||||||
Lepomis cyanellus | x | |||||||
Lepomis gibbosus | x | x | x | x | x | |||
Lepomis gulosus | x | |||||||
Lepomis macrochirus | x | x | x | x | x | x | x | |
Micropterus dolomieu | x | |||||||
Micropterus salmoides | x | x | x | x | ||||
Moxostoma erythrurum | x | x | x | |||||
Notemigonus crysoleucas | x | x | ||||||
Noturus flavus | x | |||||||
Perca flavescens | x | x | ||||||
Percina maculata | x | |||||||
Semotilus atromaculatus | x | x | x | x | ||||
Umbra limi | x | x | ||||||
Total number of species | 2 | 7 | 5 | 8 | 8 | 10 | 9 | 11 |
Based on PCoA (Figure
PCoA of the β-diversity of sampling locations with Jaccard distance based on all three genetic markers.
In this study, we used eDNA metabarcoding to describe fish diversity along the course of a small stream. The local fish assemblage varied spatially from upstream to downstream, consistent with previous studies (
Fish species assemblages also showed absence, additions and turnovers (as measured using β-diversity) along the stream gradient. Samples collected from adjacent locations often showed similar species compositions and dissimilarity increased with distance between sites. The differences in species composition along the stream could be due to changes in habitat structure and availability, as well as changes in temperature, pH and dissolved oxygen along the stream course (
We collected three replicate eDNA samples in each location to improve species inference following our practice (
Understanding the spatial distribution of eDNA in flowing environments is critical for accurate detection of species in stream ecosystems (
Based on previous studies, the average transport distance of eDNA can be highly variable, depending on stream discharge (
Our goal was not to mechanistically describe why β-diversity changes in the river, but to test whether transport of eDNA obscures the signal of species turnover in lower discharge streams. The influence that environmental variables (turbidity, D.O., temperature etc.) have on the transport and detection of eDNA is not well understood, although, increasingly, studies are finding evidence that environmental variables can affect how long eDNA remains detectable in a system (
In this study, we estimated the longitudinal fish biodiversity within a stream using a multi-marker eDNA metabarcoding approach. We found that the α-diversity of Eagle Creek increased from upstream to downstream and that β-diversity was positively correlated with the longitudinal distance between locations. We also found that the observed biodiversity patterns were similar amongst different genetic markers and replicate samples. Our results suggest that eDNA can be transported hundreds of metres in a small stream, consistent with previous studies. Additional studies are needed to better quantify and understand eDNA transport dynamics in streams of varying discharge.
Raw sequence data is available on the NCBI repository under the BioProject PRJNA317862 with accession numbers: SRX3626969–SRX3626997; OTU sequences and R scripts used in this study are available on figshare (https://doi.org/10.6084/m9.figshare.5860851).
Sampling: NTE, BPO;
Laboratory work: MAR;
Data analyses: YL, NTE, MAR, CLJ, BPO, AJS, KD;
Study design and supervision: DML, GAL, MEP;
Manuscript writing: all authors edited and commented on the manuscript draft.
Funding for this research was provided by United States Department of Defense Strategic Environmental Research and Development Program Grant W912Hq-12-C-0073 (RC-2240). We thank Stuart E. Jones for help with statistical analyses, Crysta A. Gantz for assistance with field sampling and eDNA sample processing, Cameron R. Turner for help with sampling and experimental design and Jacqueline Lopez in the Notre Dame Genomics and Bioinformatics Core Facility for assistance with library QC and sequencing. We thank Michele Richards and other personnel of Fort Custer Training Center (Michigan Army National Guard) for site access and logistical support. We thank David C. Molik (University of Notre Dame), Paul Czechowski (Cornell University) for advice and helpful discussions.
Control samples that were used to test for contamination during each step of the experiment.
GPS and environmental variables of each site.
The longitudinal distance (upper triangular) and β- diversity (lower triangular) between sampling locations along Eagle Creek.
Mantel r and p-values for all the pairwise comparisons between single marker, three markers and longitudinal distance.
The correlation between environmental variables and β-diversity and longitudinal distance using Mantel test
PCoA plot based on all the replicates from Eagle Creek
Profiling table for all the replicates.