Research Article |
Corresponding author: Laura Biessy ( laura.biessy@cawthron.org.nz ) Corresponding author: John K. Pearman ( john.pearman@cawthron.org.nz ) Academic editor: Alexander Probst
© 2022 Laura Biessy, John K. Pearman, Sean Waters, Marcus J. Vandergoes, Susanna A. Wood.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Biessy L, Pearman JK, Waters S, Vandergoes MJ, Wood SA (2022) Metagenomic insights to the functional potential of sediment microbial communities in freshwater lakes. Metabarcoding and Metagenomics 6: e79265. https://doi.org/10.3897/mbmg.6.79265
|
Molecular-based techniques offer considerable potential to provide new insights into the impact of anthropogenic stressors on lake ecosystems. Microbial communities are involved in many geochemical cycling processes in lakes and a greater understanding of their functions could assist in guiding more targeted remedial actions. Recent advances in metagenomics now make it possible to determine the functional potential of entire microbial communities. The present study investigated microbial communities and their functional potential in surface sediments collected from three lakes with differing trophic states and characteristics. Surface sediments were analysed for their nutrient and elemental contents and metagenomics and metabarcoding analysis undertaken. The nutrients content of the surface sediments did not show as distinct a gradient as water chemistry monitoring data, likely reflecting effects of other lake characteristics, in particular, depth. Metabarcoding and metagenomics revealed differing bacterial community composition and functional potential amongst lakes. Amongst the differentially abundant metabolic pathways, the most prominent were clusters in the energy and xenobiotics pathways. Differences in the energy metabolism paths of photosynthesis and oxidative phosphorylation were observed. These were most likely related to changes in the community composition and especially the presence of cyanobacteria in two of the three lakes. Xenobiotic pathways, such as those involving polycyclic aromatic hydrocarbons, were highest in the lakes with the greatest agricultural land-use in their catchment. These results highlight how microbial metagenomics can be used to gain insights into the causes of differences in trophic status amongst lakes.
bacteria, cyanobacteria, New Zealand, nutrients, surface sediments, water quality
Lakes are particularly appealing for ecological study because they essentially function as isolated island-like systems, inherently connected to their surrounding terrestrial habitats, but each lake and its catchment is usually isolated from others (
Within lakes, bacteria are abundant and involved in the cycling of nutrients, playing a significant role in the breakdown of organic and inorganic compounds (
Soil and sediment contain highly diverse microbial communities (
Metagenomics studies have highlighted the importance of microbial communities in nutrient cycling and biochemical degradation within aquatic ecosystems (
Metagenomic studies have shown functional differences in communities across land-use, eutrophication or pollution gradients. For example, research on mangrove sediments showed reduced diazotroph abundance and nitrogen fixing capability and enhanced metabolism via increased methanogenesis and sulphate reduction in contaminated mangroves (
In this study, metagenomics was used to investigate the microbial communities and their functional potential in surface sediments collected from three lakes of differing trophic states (mesotrophic, eutrophic and supertrophic) and characteristics (e.g. size, depth, land use). The study lakes were in the Ō Tū Wharekai (Ashburton Basin, New Zealand/Aotearoa) which is an area of high cultural significance to Māori (the indigenous people of New Zealand) with high conservation values (
The aim of this research was to describe the microbial diversity and their functional profiles in these three distinct Lakes within a small geographic area. We hypothesised that: 1) the bacterial community composition of surface sediments would differ between these Lakes; 2) KEGG IDs belonging to metabolic pathways would be differentially abundant amongst the Lakes; and 3) key differences in the functional potential of pathways involved in nutrient cycling would be observed (e.g. nitrate reduction, nitrogen fixation or denitrification, methanogenesis or sulphate reducing pathways) amongst the Lakes and these would be related to the nutrient and geochemistry of the Lakes (
Three high-country (> 400 m above sea level) lakes in the Canterbury Region (New Zealand; Fig.
Physical characteristic of each lake. Data collected from
Lake type | Max. depth (m) | Altitude (m) | Area (km2) | Res. time (y) | Seasonal stratification | Trophic level (TLI) | TP (mg/m3) | TN (mg/m3) | Chl-a (mg/m3) | |
---|---|---|---|---|---|---|---|---|---|---|
Lake Denny | Glacial | 2.1 | 678 | 0.05 | 0.04 | Polymictic | Supertrophic (5.52) | 73 | 760 | 8.0 |
Kirihonuhonu/ Lake Emma | Glacial | 3 | 640 | 1.67 | 0.11 | Polymictic | Eutrophic (4.52) | 22 | 560 | 7.2 |
Ōtūroto/Lake Heron | Glacial | 37 | 692 | 6.95 | 0.97 | Seasonal stratification | Mesotrophic (3.19) | 6 | 145 | 3.9 |
Locations and images of the three Lakes sampled in the Ashburton Basin (South Island, New Zealand). Inset map of New Zealand shows the location of the Lakes.
Ōtūroto/Lake Heron (Fig.
Kirihonuhonu/Lake Emma (Fig.
Lake Denny (Fig.
Sampling was undertaken between 4 and 9 November 2019. Prior to sampling, a side scan sonar survey was undertaken to determine the deepest part of each Lake, where subsequent sampling was undertaken. Samples for sediment geochemistry were collected by Uwitech gravity corer (90 mm diameter core). Five cores were taken and the top 2 cm of sediment removed using core cutter and combined in a 500 ml container. These were stored chilled (4 °C) and shipped to the laboratory within 48 hrs for nutrient and elemental characterisation. Ponar grab samples were taken in triplicate at each site. Using sterile spatulas, ca. 4 g of the undisturbed surface sediment layer (ca. top 1–2 mm) was placed in sterile tubes and stored in liquid nitrogen for one week before being transferred to a -80 °C freezer in the laboratory for later DNA extraction.
Nutrient and elemental samples were homogenised, centrifuged (3000× g, 40 min, 4 °C) and the pore water decanted. The sediments were analysed for metals using the same method as described in
DNA for metabarcoding was extracted from 0.25 g of sediment in triplicate using the DNeasy PowerSoil Kit (Qiagen, Hilden, Germany). The V3-V4 region of the bacterial 16S rRNA gene was amplified by polymerase chain reaction (PCR) using the primers: 341F: 5-CCT ACG GGN GGC WGC AG-3 and 805R: 5-GAC TAC HVG GGT ATC TAA TCC-3 (
DNA was extracted from ca. 2 g of sediment using the RNeasy PowerSoil Total RNA kit (Qiagen) combined with the RNeasy PowerSoil DNA Elution kit (Qiagen) following the manufacturer’s instructions. Three samples were extracted per lake. DNA was diluted to 25 µg, transferred to a DNA stable tube (GENEWIZ, Suzhou, China) and dried using a vacuum desiccator following the manufacturer’s instructions before being shipped to the GENEWIZ facilities. Paired-end sequencing libraries were prepared using the VAHTS Universal DNA library kit for Illumina following the protocols of the manufacturer with amplification of the adapter-ligated DNA occurring via 8 cycles of PCR. Products were cleaned and validated using an Agilent 2100 Bioanalyzer and subsequently sequenced on an Illumina HiSeq X Ten (2ξ150 base pairs (bp)).
The raw reads from the sequencing were trimmed for adapters and removal of low quality sequences using Trimmomatic (quality thresholds: LEADING:20; TRAILING:20; SLIDINGWINDOW:40:30; MINLEN:70; (
Satellite imagery available in the Land Cover Database Version 5 (LCDBv5, Landcare Research New Zealand Ltd; https://lris.scinfo.org.nz/layer/104400-lcdb-v50-land-cover-database-version-50-mainland-new-zealand/) was used to derive seven land cover variables: (1) Native vegetation; (2) Urban; (3) Non-native vegetation; (4) Forestry; (5) High Production Grassland (HPG); (6) Low Production Grassland (LPG); and (7) Other. Suppl. material
Taxonomic composition of the bacterial communities was assessed by computing the relative abundance of phyla per lake with three different methods: i) 16S rRNA gene metabarcoding; ii) 16S rRNA from the metagenomic assembly and iii) taxonomic classification of the ORFs. To assess the functional aspects of the metagenome, ORFs were amalgamated to the level of KEGG ID or COG ID. The number of shared and unique identifiers (IDs) across the Lakes were calculated and visualised using the R package VennDiagram (
Multivariate analysis of the functional data was undertaken using PERMANOVA with the R package vegan (
Differentially abundant KEGG IDs were calculated, based on the raw read numbers using the DESeq2 package in R (
KEGG ID abundances were summed to the pathway level for each Lake to investigate changes with environmental variables. Significant differences were assessed with Kruskal Wallis in R.
The Lake sediments were of moderate to high density with Lake Denny being substantially higher in density than the other two Lakes. Organic matter and total organic carbon contents were relatively low and reasonably uniform across the Lakes on a per dry mass basis (Table
Sediment chemistry from samples of the top 0–2 cm of the lakebed sediments. Elemental results are total recoverable contents. TOC = Total organic carbon; dw = dry weight; Fe = Iron; Mn = Manganese.
Lakes | Bulk Density | Organic matter | TOC | Nitrogen | Phosphorus | Sulphur | Fe | Mn |
---|---|---|---|---|---|---|---|---|
kg.m3 | g. 100 g-1 dw | mg.kg-1 dw | ||||||
Denny | 284 | 12.9 | 4.90 | 0.57 | 1,040 | 1,200 | 38,100 | 921 |
Kirihonuhonu | 131 | 18.6 | 5.80 | 0.60 | 730 | 1,800 | 19,100 | 80 |
Ōtūroto | 141 | 12.8 | 4.70 | 0.64 | 1,900 | 1,100 | 41,600 | 2,370 |
Aluminium | Calcium | Zinc | Lead | Copper | Cadmium | Mn:Fe | ||
Denny | 38,500 | 9,070 | 122 | 34.6 | 61.7 | 0.097 | 0.0246 | |
Kirihonuhonu | 17,500 | 18,500 | 49.6 | 11.8 | 18.2 | 0.039 | 0.0362 | |
Ōtūroto | 31,000 | 10,500 | 92.7 | 25.9 | 21.5 | 0.074 | 0.0579 |
In total, across the three Lakes, 275,630 reads passed the quality filtration with total prokaryotic diversity in the Lakes reaching 3,664 ASVs.
Selected sediment chemistry parameters expressed in mass per area. Elemental results are total recoverable contents.
Lakes | Areal dry mass | Organic matter | Nitrogen | Phosphorus | Sulphur |
kg.m2 | kg.m2 | ||||
Denny | 5.67 | 0.73 | 0.032 | 0.006 | 0.007 |
Kirihonuhonu | 2.63 | 0.49 | 0.016 | 0.002 | 0.005 |
Ōtūroto | 2.81 | 0.36 | 0.018 | 0.005 | 0.003 |
The 16S rRNA metabarcoding revealed that the prokaryotic component of the community in all three Lakes was dominated by proteobacteria (27.6 ± 8.9% [mean ± standard deviation]; Fig.
Taxonomic composition of the three Lakes at the phyla level for; A) 16S rRNA gene metabarcoding, B) 16S rRNA genes from the metagenomics and C) coding sequences from the metagenomics data. Only the top 20 phyla (based on mean abundance) are plotted for each method.
Multivariate analysis showed that the taxonomic structure of the prokaryotic community in the surface sediments was significantly different, based on the 16S rRNA gene metabarcoding (adonis: F = 12.10; p = 0.005; Suppl. material
A total of 1,662 (mean = 1,230; sd = 49) 16S rRNA sequences were identified. As with the 16S rRNA metabarcoding, the proteobacteria and Bacteroidota were the most abundant taxa account for 24.8 ± 5.3% and 17.9 ± 2.6% of the relative abundance, respectively (Fig.
A total of 9,603,406 ORFs were predicted using Prodigal from the three Lakes with an average of 4,797,666 (sd = 208,687) coding sequence ORFs per sample. Of these, 7,627,694 were assigned a taxonomy at domain level. bacteria were the dominant component in each lake (mean 96.6%) followed by archaea (mean 3.0%) with Eukaryota (mean 0.25%) and Viruses (mean 0.12%) accounting for only a small proportion, based on taxonomic annotation of the ORFs. As with the rRNA approaches, the bacterial component was dominated by proteobacteria (40.5 ± 4.7%); however, Bacteroidota contributed to a substantially lower proportion of the community than with rRNA (9.0 ± 1.3%; Fig.
ORFs from Lake Denny had a higher average GC percentage (mean 60.6%) compared to those from Kirihonuhonu (58.3%) and Ōtūroto (58.3%). A total of 4,779,674 ORFs were annotated with KEGG (mean = 2,437,348; sd = 108,726) and were attributed to 10,867 KEGG IDs. Despite the vast majority of KEGG IDs being shared amongst the Lakes (Fig.
The shared and unique annotated Kyoto Encyclopaedia of Genes and Genomes (KEGG) IDs amongst the study three Lakes, based on Bray-Curtis dissimilarity matrix (A) and a Principal Coordinate plot of KEGG IDs, based on Bray-Curtis dissimilarity matrix.
In general, shared KEGG IDs in all three Lakes had similar proportions at the pathway level with carbohydrate metabolism, followed by genetic information processing and signalling and cellular processes most prominent (Suppl. material
In total, there were 1,326 KEGG IDs that were differentially abundant amongst the Lakes. The majority of these belonged to the KEGG pathway Metabolism with smaller numbers from the pathways Environmental Information Processing and Genetic Information Processing (Suppl. material
Energy metabolism had a high number of differentially abundant KEGG IDs with both Lakes Denny and Kirihonuhonu having more than Ōtūroto (Fig.
Number of differentially abundant Kyoto Encyclopaedia of Genes and Genomes (KEGG) IDs related to metabolism in pairwise comparisons between Lakes grouped by metabolic process.
Heatmap of the differentially expressed genes relating to: A) the photosynthesis pathway and photosynthetic antenna proteins and B) oxidative phosphorylation. RPM = Reads per million, KEGG = Kyoto Encyclopaedia of Genes and Genomes.
Xenobiotics pathways were abundant in Ōtūroto and Lake Denny and were largely absent from Kirihonuhonu (Suppl. material
Differences in the concentration of nutrients (nitrogen and phosphorus) per area in the Lakes were associated with changes in abundance of several metabolic pathways (Fig.
Abundance of metabolic pathways (reads per million; the point (mean) with standard deviation bars) per Lake and corresponding nitrogen concentrations (height of the bar plots). Nitrogen is the total nitrogen present in the sediment. Alongside is the proportional abundance of taxa for the pathway at class level. Other accounts for unclassified taxa at the class level and taxa that accounted for less than 1% of the community.
Abundance of metabolic pathways (reads per million; the point with standard deviation bars) per Lake and corresponding organic matter concentrations (height of the bar plots). Alongside is the proportional abundance of taxa for the pathway at class level. Other accounts for unclassified taxa at the class level and taxa that accounted for less than 1% of the community.
The present study investigated the diversity and functional potential of surface sediment communities in three New Zealand lakes and related this to differences in the lake sediment chemistry and biogeochemical conditions. Water quality monitoring data indicated that Ōtūroto had the lowest trophic status and hence was less nutrient enriched than Kirihonuhonu and Lake Denny, respectively. In contrast, the dry-mass nutrient contents of the surface sediments showed a differing pattern to the water column nutrients with Ōtūroto having the highest phosphorus and nitrogen contents, likely reflecting the substantially higher iron content in the sediment of Ōtūroto, which provides strong nutrient binding capacity (
We investigated the taxonomic composition of the surface sediments of three Lakes using a combination of metabarcoding and metagenomics. Regardless of the methodology, the phylum proteobacteria was the dominant component of the bacterial community with Bacteroidota also contributing substantially to the community. The dominance of these groups is in agreement with other lake sediment studies (
Comparisons between the three methodologies to assess the phylogenetic composition of the Lakes showed distinct differences. Firstly, the taxonomic annotation of the ORFs had a higher relative proportion of unclassified reads at the phylum level. This is most likely due to biases in the reference databases and the lack of reference genomes present for a variety of taxa, which would limit the annotation of functional genes. This would explain why Desulfobacterota had a significant contribution to the community based on both 16S rRNA methodologies, but was absent from the community composition, based on the annotation of the ORFs. It should also be noted that the composition of a community, based on ribosomal RNA, could be biased due to the variation in copy numbers of the 16S rRNA gene amongst taxa (
This study demonstrated that the majority of the annotated KEGG IDs (over 9,000) were shared amongst the three Lakes. This was expected as the majority of these KEGG IDs represent the core functions and metabolism of bacteria. These mainly included protein processing, carbohydrate and amino acid metabolisms, corresponding to the minimal metabolic machinery necessary for bacteria to survive (
We also hypothesised that the overall functional differences of the bacterial communities would vary between Lakes and that key differences in functional potential of genes involved in nutrient cycling would be observed. Multivariate analysis showed significant differences in the functional structure of the bacteria amongst the three Lakes, based on the relative abundance of KEGG IDs. While only between 1.5 and 2.4% of KEGG IDs were unique to a particular Lake, there were approximately 12% of the KEGG IDs which were differentially abundant in pair-wise comparisons of the Lakes. The majority of these differentially abundant KEGG IDs belonged to Metabolism pathways and, to a lesser extent, to Environmental Information and Genetic Information Processing pathways. Of note, were several differentially-abundant KEGG IDs clusters within the Metabolism pathways, especially within energy pathways, xenobiotics and secondary metabolites pathways. One of the main clusters of KEGG IDs abundant in Kirihonuhonu was attributed energy-converting hydrogenase and biosynthesis of nitrogenase protein. Previous studies have suggested that nitrogen fixation enzymes are negatively correlated to high nitrate concentrations (
Most of the KEGG IDs attributed to Xenobiotics pathways were absent or in very low relative abundance in Kirihonuhonu. Xenobiotics are synthetic chemicals from anthropogenic sources that do not or rarely exist as natural products (
Nitrogen metabolic pathways were, in general, associated with changes in the nutrient conditions of the sediments. Dissimilatory nitrate reduction, denitrification and nitrification were all observed to increase with increasing nitrogen concentrations within the sediment. Dissimilatory nitrate reduction is in direct competition with denitrification for free nitrate (
This small-scale metagenomic study of lake surface sediments showed the potential for such methodologies to be used to investigate functional processes occurring within lakes. The Lakes were chosen, based on monitoring data to be along a gradient of trophic state, with the microbial community related to the normalised sediment chemistry and Lake characteristics (e.g. depth). No strong differential abundance patterns of KEGG IDs were observed in key metabolic pathways (such as nitrogen reduction, sulphate reduction, methanogenesis); however, differences in photosynthetic and oxidative phosphorylation genes were recorded. These were most likely related to changes in the community and especially the presence of cyanobacteria. The overall relative abundance of metabolic pathways was shown to correlate with changes in the sediment chemistry suggesting that environmental factors are driving changes in the functional potential of the sediment (Suppl. material
The authors declare that they have no competing interests.
This research was funded by the New Zealand Ministry of Business, Innovation and Employment research programme - Our lakes’ health: past, present, future (C05X1707) and a Queen Elizabeth’s Technician Award fund attributed to Laura Biessy. We acknowledge Te Rūnanga o Arowhenua, Te Ngāi Tūāhuriri Rūnanga, Te Rūnanga o Ngāi Tahu, Te Taumutu Rūnanga, Environment Canterbury, the Department of Conservation and landowners for their support of this project and assistance with sampling and accessing the sites. We thank Jonathan Puddick (Cawthron Institute), Chris Moy (University of Otago), Julia Short (University of Adelaide) and Riki Ellison (Waka Taurua Ltd) for field assistance and Chris Weisener and Nick Falk (University of Windsor) for advice during project planning. The Department of Conversation is acknowledged for assistance with permitting. We would like to thank the editor and two anonymous reviewers for their input in improving the manuscript.
Figures S1–S6
Data type: images
Explanation note: Fig. S1. Principal Coordinate plot of the 16S rRNA gene metabarcoding results based on Bray-Curtis distances. Fig. S2. Heatmap of the differentially abundant genes. Fig. S3. Number of differentially abundant Kyoto Encyclopaedia of Genes and Genomes (KEGG). Fig. S4. Ternary plots illustrating the Kyoto Encyclopaedia of Genes and Genomes (KEGG). Fig. S5. Abundance of metabolic pathways. Fig. S6. Review figure showing the responses of metabolic pathways and nutrient concentrations in the three lakes.
Tables S1, S2
Data type: docx
Explanation note: Table S1. Land cover variables used and the categories included, and percentage of the catchment in each land use cover for the study lakes. Table S2. The reads per million (RPM) with standard deviation (S.D) for different metabolic pathways per lake association with different environmental variables.