Research Article |
Corresponding author: Kevin K. Beentjes ( kevin.beentjes@naturalis.nl ) Academic editor: Florian Leese
© 2018 Kevin K. Beentjes, Arjen G. C. L. Speksnijder, Menno Schilthuizen, Bartholomeus E.M. Schaub, Berry B. van der Hoorn.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Beentjes KK, Speksnijder AGCL, Schilthuizen M, Schaub BEM, van der Hoorn BB (2018) The influence of macroinvertebrate abundance on the assessment of freshwater quality in The Netherlands. Metabarcoding and Metagenomics 2: e26744. https://doi.org/10.3897/mbmg.2.26744
|
The use of molecular tools for the detection and identification of invertebrate species enables the development of more easily standardisable inventories of biological elements for water quality assessments, as it circumvents human-based bias and errors in species identifications. Current Ecological Quality Ratio (EQR) assessments methods, however, often rely on abundance data. Translating metabarcoding sequence data into biomass or specimen abundances has proven difficult, as PCR amplification bias due to primer mismatching often provides skewed proportions of read abundances. While some potential solutions have been proposed in previous research, we instead looked at the necessity of abundance data in EQR assessments. In this study, we used historical monitoring data from natural (lakes, rivers and streams) and artificial (ditches and canals) water bodies to assess the impact of species abundances on the EQR scores for macroinvertebrates in the Water Framework Directive (WFD) monitoring programme of The Netherlands. By removing all the abundance data from the taxon observations, we simulated presence/absence-based monitoring, for which EQRs were calculated according to traditional methods. Our results showed a strong correlation between abundance-based and presence/absence-based EQRs. EQR scores were generally higher without abundances (75.8% of all samples), which resulted in 9.1% of samples being assigned to a higher quality class. The majority of the samples (89.7%) were assigned to the same quality class in both cases. These results are valuable for the incorporation of presence/absence metabarcoding data into water quality assessment methodology, potentially eliminating the need to translate metabarcoding data into biomass or absolute specimen counts for EQR assessments.
Water Framework Directive, biological quality elements, abundance, indicator species, macrofauna, lakes, streams
Quality monitoring of freshwater ecosystems is prescribed under the European Union Water Framework Directive of 2000 (EU WFD; Directive 2000/60/EC) and focuses on monitoring of biological quality elements (BQEs). In Europe, benthic invertebrates are one of the most prevalently monitored BQE (
The incorporation of DNA barcodes (
Now that the actual identification of species in bulk samples with high throughput sequencing (HTS) has shown its efficacy, the focus seems to shift towards solving some of the other issues concerning these novel strategies, especially the relationship between input biomass or specimen counts and output HTS sequence abundances (
The discussion, regarding the use of HTS read counts as an approximation of biomass or specimen abundances, is important for the biological components of the WFD as well. Abundance of (indicator) species or species groups is used in many European assessment metrics (albeit regularly as abundance classes) and is often part of multi-metric approaches (
In this study, therefore, we assessed the influence of species abundances on the Ecological Quality Ratio (EQR) scores for macroinvertebrates in the WFD monitoring programme of The Netherlands. The Dutch system uses abundance data (in the form of abundance classes) for macroinvertebrates, where each species is scored as either a positive indicator, a negative indicator, a characteristic species or none of the aforementioned, depending on the type of water body (
EQR scores for macrofauna were calculated on historical monitoring data from four Dutch water authorities (Hoogheemraadschap van Rijnland, Waterschap Aa en Maas, Waterschap Brabantse Delta and Waterschap Rivierenland), using morphological macroinvertebrate records from 2009 to 2017. These records are based on traditional macrofauna monitoring using kick-net sampling and morphological identification. The dataset included 877 monitoring locations spanning 23 different water types according to the Dutch classification system. Most locations were monitored more than once (some even annually), creating a total of 1780 macrofauna samples. An overview of the samples is provided in Table
Overview of samples. Distribution of samples used in this study, per water authority (includes survey time span), divided into the three categories defined by the EQR calculation: artificial ditches and canals, natural lentic (lakes) waters and natural lotic (rivers and streams) waters. No monitoring sites are present in rivers and streams for Hoogheemraadschap van Rijnland.
Natural waters | Artificial waters | Total | ||
---|---|---|---|---|
Lakes (type M12 – M32) | Rivers and streams (type R04 – R18) | Ditches and canals (type M01 – M10) | ||
Hoogheemraadschap Rijnland (2009–2014) | 198 | n/a | 173 | 371 |
Waterschap Aa en Maas (2011–2017) | 9 | 221 | 150 | 380 |
Waterschap Brabantse Delta (2011–2016) | 139 | 230 | 62 | 431 |
Waterschap Rivierenland (2011–2017) | 8 | 56 | 534 | 598 |
Total | 354 | 507 | 919 | 1780 |
EQR macrofauna scores were calculated for all samples. The scoring system is based on the presence and/or abundance of positive indicator (DP), negative indicator (DN) and characteristic (KM) taxa. Most taxa are identified to species level in the Dutch macrofauna metrics, although for some “harder to identify” groups, species aggregates or higher-level taxonomic assignments are used (
where KM% is the percentage of characteristic taxa, KMmax is a constant value representing the expected fraction of characteristic taxa depending on the specific water type, DN% is the percentage of negative indicator individuals and KMDP% is the percentage of characteristic and positive indicator individuals (
For artificial water bodies, such as ditches and canals, the calculation is performed according to the following formula:
where PT is the absolute number of positive indicator taxa, PTmax is the absolute number expected positive indicator taxa, DN% is the percentage of negative indicator individuals and DNmax is a constant value depending on the specific water type (
For each of the 1780 samples, EQR scores were calculated using both original data with abundance classes and a manipulated dataset, converted to a presence/absence monitoring scheme by setting all specimen counts to 1. Any duplicate taxa in a given sample (e.g. where both adult and juvenile specimens were recorded separately) were removed to avoid aggregation into abundance classes other than 1 (abundance class 1 indicates a single specimen was found). QBWat software version 5.33 (
The investigated macrofauna samples had an average of 72.1 ± 0.8 (mean ± SEM) species (minimum 1, maximum 217) recorded, with an average of 1221.5 ± 25.8 specimens (minimum 1, maximum 11767). Mean EQRs, calculated with presence/absence-based data, were highly correlated to original EQRs based on abundance class data, for natural lentic sites (Pearson correlation r = 0.993, p <0.001) (Figure
EQR (presence/absence) versus EQR (abundance classes). Comparison of macroinvertebrate EQR scores in standard assessment using abundance classes and EQR scores in simulated scenarios with presence/absence data for (A) natural lentic waters (lakes, n=354), (B) natural lotic waters (streams and rivers, n=507) and (C) artificial waters (ditches and canals, n=919). Coloured boxes indicate EQR quality classes: “bad” (red), “poor” (orange), “moderate” (yellow), “good” (green) and “high” (blue). For artificial water bodies, there are only four quality classes, with “good” representing scores between 0.6 and 1.0. For all comparisons, the EQR scores of abundance class data and presence/absence data was significantly correlated (Pearson correlation, p <0.001). Pearson correlation values are provided in the panels.
Factors influencing ΔEQR. Comparison of differences in EQR between assessment using abundance classes and using presence/absence data, (A) split by water type and EQR calculation method and (B) split per original assessment quality class (“high” was omitted, with only two samples in this data set). On average, classifications without abundance are higher than original assessments (ΔEQR positive). Removal of abundance resulted in significantly lower differences in natural lentic waters compared to natural lotic and artificial waters (Dunn’s test, p <0.001). There was no significant difference between lotic and artificial. Removal of abundance data has significantly more impact on EQR assessments for samples originally classified as “bad” compared to all other classes (Dunn’s test, p <0.001). ΔEQR was also significantly higher in “poor” samples compared to “moderate” and “good”.
When assigning quality classes to the EQRs based on presence/absence data, 1596 (89.7%) of all samples were assigned to the same class, 22 (1.2%) were scored lower and 162 (9.1%) were scored higher. The change was most profound in the samples originally assigned to “bad”, where 51 out of 117 (43.6%) were assigned to “poor”, the class above. Results were comparable for the different water types: 95.2% of natural lentic samples, 89.9% of natural lotic samples and 87.4% of artificial samples were assigned to the same class. Samples assigned to a different quality class were assigned to a class either directly below or directly above its previous classification.
Our results show there is a strong correlation between traditional EQR based on freshwater macrofauna using abundance data and EQRs calculated without abundance data in the Dutch system. For most samples, scores were comparable between the abundance- and presence/absence-based methods, with the majority (89.7%) being assigned to the same quality class in both cases. The difference seems to be largest in samples at the lower end of the EQR score spectrum, with almost half (43.6%) ending up in a higher quality class (“poor” instead of “bad”).
Based on the formulae used for the calculation of the EQRs, it can already be deduced that abundance is not a consideration for all components that determine the final score. For natural lakes, half the score is represented by the fraction of characteristic taxa, which does not take individual specimen counts into account. The fraction of the score defined by factors that use abundance data is slightly lower for natural streams and rivers (two fifths) and for artificial ditches and canals abundances are not used for two thirds of the final score. (
The parts of the EQR score that do rely on abundance data in the Dutch system use abundance classes rather than actual specimen abundances. This may be a major factor in why the removal of abundances has only a limited impact on the EQR scores. Abundance classes were introduced into the Dutch metrics to reduce the effect of extremely high abundances of a single species on the EQR. The abundance class system uses a total of nine classes, where class “1” represents a single specimen and class “9” represents abundances over 1808 specimens. When applied to the monitoring data, this means that an abundance of 20 specimens is translated to class “4”, whereas an abundance of 200 specimens is assigned to class “6”. Thus, whilst the actual abundance difference might be tenfold, in the calculation it would be only 1.5-fold, already reducing the effect of absolute abundances on the final EQR (
These observations are important when considering the incorporation of molecular techniques into WFD quality monitoring methodology. Given that techniques, such as metabarcoding, are proving their efficacy in the process of identification of species in bulk samples, incorporation of such techniques into the actual monitoring is only a matter of time (
While our results imply that the technically difficult DNA-based quantifications might be avoided when calculating EQR scores, being able to measure species-abundance relationships from DNA data would nonetheless be desirable, since such relationships play an important role in understanding community composition and dynamics (
One of the main advantages of DNA-based identifications over traditional taxonomy is the ability to reliably identify larval stages and complicated taxonomical groups, for example in cryptic species, showing contrasting reactions to stressors (
Taking these considerations into account, together with the fact that expanding the DNA barcode repository for freshwater macroinvertebrates is one of the main focal points of the European DNAqua-Net collaboration (
We demonstrated the viability of adopting presence/absence data instead of specimen abundance data in a WFD water quality assessment programme. Given all obstacles hampering the translation of HTS read data into biomass or absolute specimen counts, this paves the way for incorporating metabarcoding workflows into future assessment methodology. While species abundances are still valuable for a thorough ecological understanding of natural systems, the EQRs have been used more as a relatively quick assessment of ecological status of water bodies compared to reference situations. The EQR methodology used in this paper applies to the quality monitoring in The Netherlands and results may vary for other nations, based on the methods of EQR calculation. We urge researchers to look into the actual influence of abundance data on their WFD programmes and in studies using metabarcoding data. With molecular techniques, such as metabarcoding of environmental DNA or bulk samples, proving to be successful, it is imperative that developments in routine EQR assessments, be they recalibrations or entirely new systems, strive to be more compatible with the potential lack of abundance data.
This study was part of the DNA Waterscan project funded by the Gieskes-Strijbis Fonds. Preliminary simulations of abundance data and EQR-scores were performed by Royal HaskoningDHV. We thank the Hoogheemraadschap van Rijnland and Aquon for providing data and instructions for the QBWat software programme and Waterschap Aa en Maas, Waterschap Brabantse Delta and Waterschap Rivierenland for additional monitoring data.
Monitoring event details and EQR scores