Research Article |
Corresponding author: Agnès Bouchez ( agnes.bouchez@inrae.fr ) Academic editor: Kristy Deiner
© 2022 Ana Baricevic, Cécile Chardon, Maria Kahlert, Satu Maaria Karjalainen, Daniela Maric Pfannkuchen, Martin Pfannkuchen, Frédéric Rimet, Mirta Smodlaka Tankovic, Rosa Trobajo, Valentin Vasselon, Jonas Zimmermann, Agnès Bouchez.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Baricevic A, Chardon C, Kahlert M, Karjalainen SM, Pfannkuchen DM, Pfannkuchen M, Rimet F, Tankovic MS, Trobajo R, Vasselon V, Zimmermann J, Bouchez A (2022) Recommendations for the preservation of environmental samples in diatom metabarcoding studies. Metabarcoding and Metagenomics 6: e85844. https://doi.org/10.3897/mbmg.6.85844
|
Implementation of DNA metabarcoding for diatoms for environmental monitoring is now moving from a research to an operational phase, requiring rigorous guidelines and standards. In particular, the first steps of the diatom metabarcoding process, which consist of sampling and storage, have been addressed in various ways in scientific and pilot studies and now need to be rationalised. The objective of this study was to compare three currently applied preservation protocols through different storage durations (ranging from one day to one year) for phytobenthos and phytoplankton samples intended for diatom DNA metabarcoding analysis. The experimental design used samples from four freshwater and two marine sites of diverse ecological characteristics. The impact of the sample preservation and storage duration was assessed through diatom metabarcoding endpoints: DNA quality and quantity, diversity and richness, diatom assemblage composition and ecological index values (for freshwater samples). The yield and quality of extracted DNA only decreased for freshwater phytobenthos samples preserved with ethanol. Diatom diversity was not affected and their taxonomic composition predominantly reflected the site origin. Only rare taxa (< 100 reads) differed among preservation methods and storage durations. For biomonitoring purposes, freshwater ecological index values were not affected by the preservation method and storage duration tested (including ethanol preservation), all treatments returning the same ecological status for a site. This study contributes to consolidating diatom metabarcoding. Thus, accompanied by operational standards, the method will be ready to be confidently deployed and prescribed in future regulatory monitoring.
biomonitoring, diatom assemblages, DNA metabarcoding, European Water Framework Directive, methods, sample preservation
Aquatic ecosystems provide many ecosystem services and functions, such as fishing, water provisioning and recreation, and are hosts to considerable biodiversity (
Until now, standard methods used to count and identify diatoms to species level are based on morphological criteria visible by light microscopy (
The diatom metabarcoding process involves five steps: 1) sampling and storage, 2) DNA extraction, 3) PCR amplification, 4) amplicon library preparation and sequencing and 5) bioinformatics treatment. All these steps can show variations among studies with a large range of protocols that are used. To date, only a few studies have compared different protocols;
Standardisation efforts at CEN (European Committee for Standardisation) have accompanied the application of the European Directives although standardisation of genomic methods for biomonitoring is still in its infancy. In 2018, CEN published two technical reports dealing with the management of diatom DNA barcodes (
Genomic methods for environmental monitoring are moving from research to operational applications. The choice of a preservation method by the end-users depends on sampling and shipment operational constraints. For example, during a field sampling day, including the visit to several potentially remote sites, deep-freezing may be difficult. If a sample shipment is required, it is cheaper and safer to use a preservative that is free of hazardous compounds (e.g. formaldehyde). Moreover, while several hundred samples can be processed in a single sequencing run, the time to collect all these samples in the field can last weeks to months. So, in order to derive best-practices for developing standards, it is important to know if the preservation protocol and/or the storage duration have an impact on the final assessment of the diatom assemblage.
The aim of our study is to highlight best practices for preserving phytobenthos and phytoplankton samples for DNA-based applications involving diatoms. With that aim, we compared different preservation methods and storage durations. The recommendations obtained will be useful in the context of subsequent standardisation.
We compared three preservation protocols through different storage durations. These are based on those proposed in
The experimental design is summarised in Fig.
Workflow of the study presenting the three preservation methods (FR: deep-frozen, RL: nucleic acid preservative solution and ET: ethanol). The blue box is detailed in Suppl. material
Six contrasting European sites (two Mediterranean marine sites – Spain, Croatia; four European river sites – France, Spain, Germany, Finland) were selected for sampling, based on differences in water quality and typology (Table
Description of the sampling sites: site code, location, geographic references according to WGS84 system, site characteristic, aquatic ecosystem and biotic compartment are indicated.
Site Code | Location | GPS Coordinates (Latitude, Longitude) | Trophic state | Aquatic ecosystem | Biotic compartment |
---|---|---|---|---|---|
LC | Lim bay - Croatia | 45,132529, 13,66059 | mesotrophic | marine | phytoplankton |
ES | Ebro bay - Spain | 40,816710, 0,73077 | mesotrophic | marine | phytoplankton |
OF | Edian river - France | 46,255750, 6,72342 | oligotrophic | freshwater | phytobenthos |
MS | Ebro river - Spain | 40,815005, 0,51997 | mesotrophic | freshwater | phytobenthos |
EG | Teltow channel - Germany | 52,437615, 13,32039 | eutrophic | freshwater | phytobenthos |
HF | Kalimenoja river - Finland | 65,169722, 25,86889 | humic | freshwater | phytobenthos |
Freshwater phytobenthos was sampled from biofilms following the European standard (
Marine phytoplankton was sampled by one vertical net haul at both marine sites (LC and ES) with a phytoplankton net (50 μm mesh size) from 15 m deep to the surface. Each net sample was suspended and evenly filtered until complete filter saturation (30 ml per filter for station LC and 60 ml per filter for station ES), on 1.2 µm cellulose (Millipore) (LC site) or GF/F glass microfibre filters (Whatman) (ES site) (Fig.
Three preservation methods were applied to phytoplankton and phytobenthos samples (Table
Description of the three preservation methods: method code, biotic compartment, storage conditions and material used for extraction are indicated.
Preservation method name | Preservation method code | Biotic compartment | Fixative solution | Storage temperature | Material used for preservation | Material used for extraction |
---|---|---|---|---|---|---|
Cryopreservation | FR | Phytobenthos | no | -20 °C | Pellet | Pellet |
Phytoplankton | no | -80 °C | Filter | Filter | ||
DNA stabilization solution preservation | RL | Phytobenthos | home-made nucleic acid preservative | -20 °C | Suspended biofilm with fixative solution | Pellet |
Phytoplankton | home-made nucleic acid preservative | -20 °C | Filter with fixative solution | Filter | ||
Ethanol preservation | ET | Phytobenthos | Ethanol (final conc. ~70%) | +4 °C | Suspended biofilm with fixative solution | Pellet |
Phytoplankton | Ethanol (final conc.~96%) | +4 °C | Filter with fixative solution | Filter |
FR preservation method
: For freshwater samples, twelve 2 ml subsamples of the biofilm suspension were obtained from one 300 ml bottle under agitation for each site. Subsamples were then centrifuged, supernatant was discarded and pellets were frozen and stored at -20 °C (Suppl. material
RL preservation method
: A nucleic acid preservation solution was home-made with 3.5 M ammonium sulphate, 17 mM sodium citrate and 13 mM ethylene-diamine-tetra-acetic acid (EDTA). pH was adjusted to 5.2 using 1 M H2SO4 and the solution was sterilised by filtration with 0.2 µm filter. For freshwater samples, one volume of the nucleic acid preservative solution was added to one volume of sampled biofilm, for one 300 ml bottle under agitation. 24 × 2 ml subsamples of the preserved biofilm suspension were then stored for each site (Suppl. material
ET preservation method
: For freshwater samples, three volumes of 96% ethanol were added to one volume of biofilm, in order to obtain a final ethanol concentration of 70%. This was applied to one 300 ml bottle under agitation. Six 17 ml subsamples of the preserved biofilm suspension were then stored for each site (Suppl. material
In all subsampling phases for freshwater biofilm samples (Suppl. material
The samples, preserved with the three methods, were further processed at six different storage durations (1 day, 1 week, 1 month, 3 months, 6 months and 1 year) during one year (Fig.
For freshwater samples, DNA extraction was performed on biofilm pellets, either those directly preserved (FR samples, Table
DNA extractions were performed using a commercial kit (Macherey–Nagel NucleoSpin Soil kit, Düren Germany) with purification columns following
At the end of the 1-year storage period, DNA quality and quantity were assessed on all 216 DNA extracts (Fig.
A 312 bp fragment of the rbcL chloroplastic gene was amplified from DNA extracts using Takara LA Taq polymerase and an equimolar mix of the forward primers Diat_rbcL_708F_1, 708F_2, 708F_3 and the reverse primers R3_1, R3_2 (
Demultiplexing and a quality check (FastQC,
Statistical analyses, as well as graphical presentations of the results, were performed using the R software version 3.6.0 (
For freshwater river sites, we assessed their ecological quality using the Specific Pollution-sensitivity Index (SPI) (
Fastq files are available at https://doi.org/10.57745/M3PESU.
Spectrophotometry measurements confirmed good DNA quality with 260/280 nm ratios between 1.8 and 2 for all samples. Measured DNA concentrations differed among samples and ranged from 1 to 160 ng/µl (Fig.
High-throughput sequencing of the 216 samples resulted in a total of 7.9 million reads. Only one sample (Ebro bay-RL-1 month) could not be sequenced successfully.
After bioinformatics processing, a total of 3.9 million (49.4%) reads were conserved with an average of 36,621 reads per sample. Read clustering (95% sequence similarity threshold) resulted in an average of 357 OTUs per sample and classification (85% bootstrap confidence score threshold) identified an average of 97 taxa per sample. Rarefaction curves indicated sufficient sequencing depth for most of the samples (Suppl. material
When all preservation methods and storage durations were considered, freshwater sites were, on average, characterised with higher number of reads (41,222 reads/sample), higher OTU and taxa richness (433 OTUs/sample and 111 taxa/sample) and higher diversity index (Shannon) values compared to marine sites (27,155 reads/sample, 199 OTUs/sample and 67 taxa/sample) (Fig.
Diversity parameters of diatom assemblages: box plots for (a) read numbers, (b) Shannon index, (c) number of OTUs and (d) number of taxa for the three preservation methods (blue: ET, yellow: FR, grey: RL) for marine (LC, ES) and freshwater sites (OF, MS, EG, HF). Boxes represent the interquartile range, with the median indicated with a line and whiskers extending to the highest and lowest values.
Using the Diat.barcode reference library, 289 OTUs were assigned at species level, 77 at genus level, 21 at family level, nine at order level and two at class level. Overall, 102 different diatom genera were detected in the dataset. The diatom assemblage composition differed among sites (Suppl. material
Do preservation methods and/or storage duration affect assemblage structure ?
Community ordination analyses taking read abundance into account (Bray-Curtis distance) showed that the samples differed primarily according to sampling sites (PERMANOVA, pseudoF5,101 = 573.08, R2 = 0.96, p = 0.001) (Fig.
Non-metric multidimensional scaling (NMDS) ordinations for all sites (above) and for each site (next page), based on Bray-Curtis distances, taking read abundance into account. Samples are marked according to the preservation method (colour) and storage duration (shape). The three preservation methods (ET, FR and RL) are visualised by ellipses for the six sites (next page).
Differences in assemblage composition between different methods can be seen in NMDS plots for each site (Fig.
Results of PERMANOVA analysis (adonis function) of OTUs, indicating the percentage of variance (R2) explained by preservation method and storage duration and associated probability (p).
site | preservation method | storage duration | ||
---|---|---|---|---|
R2 (%) | p | R2 (%) | p | |
LC | 0.546 | 0.0001 | 0.181 | 0.25 |
ES | 0.574 | 0.0011 | 0.116 | 0.75 |
OF | 0.605 | 0.0001 | 0.149 | 0.37 |
MS | 0.726 | 0.0002 | 0.082 | 0.57 |
HF | 0.849 | 0.0001 | 0.047 | 0.54 |
EG | 0.683 | 0.0001 | 0.105 | 0.47 |
Pairwise comparisons of preservation method pairs (ET-ethanol, FR-deep-frozen, RL-RNA-later) using pairwise.adonis function with 9999 permutations and P value adjustment method: Bonferroni.
site | pair | R2 | p.adjusted |
---|---|---|---|
OF | ET-FR | 0.576 | 0.015 |
ET-RL | 0.606 | 0.009 | |
FR-RL | 0.324 | 0.012 | |
MS | ET-FR | 0.675 | 0.012 |
ET-RL | 0.705 | 0.003 | |
FR-RL | 0.230 | 0.075 | |
HF | ET-FR | 0.860 | 0.015 |
ET-RL | 0.845 | 0.018 | |
FR-RL | 0.173 | 0.096 | |
EG | ET-FR | 0.663 | 0.006 |
ET-RL | 0.415 | 0.018 | |
FR-RL | 0.684 | 0.012 | |
LC | ET-FR | 0.592 | 0.012 |
ET-RL | 0.127 | 0.609 | |
FR-RL | 0.518 | 0.012 | |
ES | ET-FR | 0.457 | 0.012 |
ET-RL | 0.416 | 0.045 | |
FR-RL | 0.690 | 0.009 |
Are some taxa differentially detected ?
Assemblage changes are mainly due to changes in relative abundances for abundant taxa (Suppl. material
SPI scores were calculated for freshwater sites, based on OTUs assigned at species (73%) or genus (19%) levels and their read abundances. They ranged from 14.2–18.9 (Fig.
Identification of diatom assemblage in environmental samples through DNA metabarcoding has proved to be a reliable approach that has been successfully tested in many ecological contexts through numerous pilot studies for freshwater biomonitoring (
When metabarcoding is used to assess biodiversity or ecological quality indices, based on diatom assemblages, our results show an overall robustness of the approach that is only slightly affected by the method used to preserve the samples or by the storage duration. Overall, diatom assemblage composition differed among sampling sites, rather than due to preservation methods or storage duration. Detecting an important impact of sampling site on assemblage composition is not surprising since sites were chosen to represent very diverse environments with various trophic status. Diatoms are known to have specific ecological preferences; thus, their assemblages are shaped by local environmental properties. This is the reason why these assemblages are used as proxies of phytobenthos when monitoring the ecological status of waterbodies for the WFD (
In most cases, the preservation methods we explored did not affect the quantity and quality of the DNA extracted from preserved samples. The exception is the ET method applied to freshwater samples. Preservation with ethanol seems to lead to lower DNA yield than other methods. Ethanol acts both as a killing and a preservative agent, replacing water molecules in biological tissues (
Recent studies on macroinvertebrate have shown that organismal DNA is released from cells into the ethanol used for preservation during sample storage (
Sample storage duration, from one day to one year, does not affect the quantity and quality of the DNA extracted from preserved samples, except when preserving freshwater samples with ethanol. In that case, a decrease of DNA concentration was observed that was marked during the three first months of preservation, but this trend did not continue over the subsequent nine months. We can hypothesise that this decrease is linked to the release of DNA from the cells to the ethanol solution. This could be further evaluated by extracting DNA from the pellets and DNA from the ethanol in parallel.
For all methods and dates, even in the “worst case” of ethanol preservation for the samples that have been stored the longest, the final end-points were not affected. Indeed, the assemblage composition is largely homogeneous at each site, whatever the method and the storage duration. The small percentage of taxa that differ from one method to the other or from one date to the other are among those that are rare (< 100 reads). When diatom metabarcoding is dedicated to the evaluation of ecological status to compare changes in assemblage structure through time and space, which is currently its main application, it is definitely an approach that is not affected by the sample conservation. If diatom metabarcoding is dedicated to the detection of rare species (e.g. invasive, endangered or toxic species), then the choice of the sample preservation method may be more critical. However, in our study, we could not identify a specific trend and derive best practice which is in line with previous observations that rare OTUs may be random and are poorly reproducible (e.g.
Following results from numerous pilot studies, we can be confident that diatom metabarcoding is robust and can replace or complement the current approach, based on morphotaxonomy. To do so, stakeholders call for guidelines and/or standards to accompany the deployment of the method for biomonitoring purposes (
However, methods have to remain operational and, as far as possible, user-friendly. Concerning sample conservation, depending on the context, one method or the other maybe more adapted. Methods requiring freezing or deep-freezing conservation (FR and RL) imply immediate storage and have to avoid multiple freeze-thaw cycles. To be usable, they require fast access to -80 °C or -20 °C frozen facilities. For organisations that have to organise extensive field campaigns or to access adverse environments, without access to laboratory facilities for several days, the more practicable process is the addition of a preservative solution (nucleic acid preservative or ethanol) directly in the field. In this study, we did not test the impact of the conservation time of biofilms in nucleic acid preservative, prior to centrifugation and -20 °C storage, which may be an interesting alternative. Field collections become compromised when sample processing cannot be completed within short critical time periods when essential capabilities are unavailable. However, we assume that the conservation of samples in preservative is little affected by storage temperature (frozen or room temperature) in the first week following sampling (
A first attempt for standardisation was done in 2018, with the publication of a technical report (
Such robustness has been already observed for other steps of diatom metabarcoding: DNA extraction methods (
This study has shown that preservation method and storage duration have little effect on DNA metabarcoding results, especially when assessing diatom assemblage structure and ecological quality. Even the decrease in yield and quality of extracted DNA observed only for freshwater phytobenthos samples, preserved with ethanol, did not affect the final index values. Only low abundant taxa differed among methods and durations. Thus, preservation method choice may be important for low-density species. However, for biomonitoring purposes, freshwater ecological index values were not affected whatever the preservation method and storage duration considered (including ethanol preservation), well reflecting the site ecological status.
Diatom metabarcoding has shown to be robust enough to replace or complement the current approach, based on morphotaxonomy, paving the way to new possibilities for biomonitoring (
Thus, once accompanied by operational standards, the method will be ready to be deployed with confidence and prescribed for future regulatory monitoring. Since 2020, CEN has dedicated one of its working group (EN/TC 230/WG 28 - DNA and eDNA methods) to the development of new standards for genomic approaches applied to the biomonitoring of aquatic ecosystems. The results of this study will facilitate the emergence of a new standard, building on the initial technical report (
This work was initiated and supported by the DNAqua-Net COST Action CA15219 ‘Developing new genetic tools for bioassessment of aquatic ecosystems in Europe’ funded by the European Union. DNAqua-Net funded the lab exchanges of the two lead authors A. Baricevic and C. Chardon through Short-Term Scientific Missions in 2017 and 2019, respectively. This work largely benefited from the discussions with all participants to the “diatom workshop” held in Limassol (Cyprus) on 1–2 October 2019, organised by CUT and supported by DNAqua-Net, especially S. Derycke (Belgium), T. Elersek (Slovenia), S. Fazi (Italy), M. Kelly (UK), M. Kelly-Quinn (Ireland), Z. Ljubesic (Croatia), S. Theroux (USA), G. Varbiro (Hungary), M. Vasquez (Cyprus). INRAE funded the DNA sequencing at INRAE Genomics (GeT-PlaGe, Auzeville, France). RT acknowledges support of the CERCA Programme/Generalitat de Catalunya and help from IRTA technicians (D. Mateu, J.L. Costa and M. Rey) for sampling. JZ acknowledges support by the Federal Ministry of Education and Research [German Barcode of Life 2 Diatoms (GBOL2), grant number 01LI1501E]. MP acknowledges support of the CMR research vessel “Burin” crew and of the Croatian Science Foundation project: Life strategies of phytoplankton in the northern Adriatic (UIP-2014-09-6563). MK acknowledges support by the Swedish Agency for Marine and Water Management. CC, FR, VV and AB acknowledge support by the Office Français de la Biodiversité (OFB). All authors thank three reviewers for fruitful comments and Martyn Kelly for proofreading the English.
We acknowledge support by the OpenAccess Publication Fund of Freie Universität Berlin.
Data 1
Data type: png file
Explanation note: Detailed workflow of the study for phytobenthos samples. See Material and Methods for detailed explanations.
Data 2
Data type: png file
Explanation note: ASVs rarefaction curves for all 216 samples.
Data 3
Data type: tif file
Explanation note: Diatom assemblage compositions for all preservation methods and durations at each of the 6 sites.