Research Article |
Corresponding author: Danielle Botha ( daniellebotha3@gmail.com ) Academic editor: Dirk Steinke
© 2024 Danielle Botha, Sandra Barnard, Morné Du Plessis, Mushal Allam, Kai Behn, Arshad Ismail, Anja Linstädter, Malesela Vincent Mokoka, Zamantungwa Thobeka Happiness Mnisi, Frances Siebert.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Botha D, Barnard S, Du Plessis M, Allam M, Behn K, Ismail A, Linstädter A, Mokoka MV, Mnisi ZTH, Siebert F (2024) Application of a dual-locus metabarcoding approach for a more comprehensive account of cattle dietary items in a semi-arid African savanna with special reference to forbs. Metabarcoding and Metagenomics 8: e127959. https://doi.org/10.3897/mbmg.8.127959
|
Increasing livestock densities and more severe drought events challenge sustainable management in South Africa’s semi-arid savannas. Effective mitigation strategies require accurate assessments of livestock foraging behaviour. By utilising high-throughput sequencing technology, this study evaluated the use of a dual-locus metabarcoding approach (trnL and rbcL) together with study-area-specific reference libraries, to analyse cattle diets in two bioregions of the eastern semi-arid South African savanna. Both markers demonstrated the ability to identify various plant families, but trnL exhibited a higher diversity in terms of family and genus identification at both sampling sites. Forbs, although comprising a diverse component of savanna plant communities, have relatively small above-ground biomass, but can still serve as crucial forage items, especially during dry periods. Our study underscores the significant role of forbs in cattle diets, demonstrating a shift in cattle foraging preferences from grass-based diets to higher inclusions of forbs and woody taxa during the drier season. Although grasses, such as Setaria, were still prevalent, forbs, belonging to the genera Malvastrum, Asparagus, Pollichia and Ipomoea were also important food items for cattle as well as woody taxa belonging to Fabaceae, Combretaceae, Ebenaceae, and Malvaceae with a selection of food items from trees and shrubs from genera Albizia, Combretum, Euclea and Vachellia. Furthermore, our study highlights the value of a dual-locus metabarcoding approach for understanding herbivorous diets. Using trnL and rbcL markers, with study-area-specific reference libraries, improves taxonomic resolution for accurately reconstructing cattle diets in semi-arid savannas. This study may improve biodiversity estimates and inform sustainable rangeland management strategies in semi-arid African savanna ecosystems.
African savanna, dietary analysis, forbs, semi-arid, rbcL, sustainable land-use, trnL
African savannas cover almost half of the continent (
Although semi-arid African savannas co-evolved together with large mammalian herbivores, the replacement of functionally diverse native African herbivore communities by single-species livestock has led to substantial changes in savanna vegetation structure and composition across the African continent (
Environmental factors affecting current forage availability and seasonal variation in a grazer’s diet interact with individual animal traits in influencing diet composition (
However, selecting which genomic region, or DNA barcode, to use for species identification in a metabarcoding study is crucial to retrieving and identifying target taxa from an environmental sample. These barcodes are chosen for their sequence recovery, annotation, and sequence resolution among taxa (
This study aims to evaluate the use of a dual-barcode approach together with study-area-specific reference libraries (
The study sites that were used for the estimation of dietary profiles of domesticated cattle were located within the eastern semi-arid savanna in the Limpopo Province of South Africa (SA) across two bioregions, namely the Central Bushveld- and the Lowveld Bioregion (
DNA was extracted from the faeces of 24 individual cattle from Welverdiend and 24 individual cattle from the Syferkuil Experimental farm during the end of the dry season, specifically directly before the first rains of the new season when forage supply from the grass layer is particularly scarce (
The rbcL and trnL(UAA) intron chloroplast gene regions from 48 faecal samples as well as the extraction blank control, sample C2, were amplified using the barcode’s metagenomic sequencing library protocol (Illumina). Two PCR reactions were completed for rbcLa and trnL(UAA) library preparations and incorporated the Illumina overhang adapters onto the primers: rbcLa-F (
The PCR amplification of these reactions, as well as negative control reaction without any template DNA, were carried out simultaneously as follows: heated lid 110 °C, 95 °C for 30 seconds, 25 cycles of 95 °C for 30 seconds, 55 °C for 30 seconds, 72 °C for 30 seconds, then 72 °C for 5 minutes and held at 4 °C. Gel electrophoresis (1X TAE buffer, 1.5% agarose, 100 V) was used to visualise the PCR results and successful PCR products were purified using AMPure XP magnetic bead-based purification (Beckman Coulter, IN). Multiplexing was made possible by the Illumina Nextera XT Index kit (Illumina, San Diego, CA) using two 8-base indices. Two indexing primers per sample were utilised and PCR reactions with the purified DNA template (5 µl DNA template, 5 µl × 2 index primer, 25 µl 2x KAPA HiFi HotStart ReadyMix and 10 µl PCR-grade water) were performed on a Bio-Rad C1000 Thermal Cycler (Bio-Rad, Hercules, CA). The cycling conditions consisted of an initial denaturation cycle of 95 °C for 3 minutes, followed by eight cycles of denaturation (95 °C for 30 seconds), annealing (55 °C for 30 seconds), and extension (72 °C for 30 seconds). The cycle was completed with a final extension cycle of 72 °C for 5 minutes. The indexed libraries were purified with Ampure XP beads before library pooling, and their quantities were determined using the Qubit dsDNA HS Assay Kit from Thermo Fisher Scientific (Waltham, MA). Prior to sequencing, the quality of the purified amplicons was evaluated using the Agilent Bioanalyzer (Agilent Technologies, Santa Clara, CA). Following instructions from Illumina, the sample pool (2 nM) was denatured with 0.2 N NaOH, diluted to 7 pM, and mixed with 10% (v/v) denatured 20 pM PhiX. Samples were sequenced on the Illumina MiSeq platform (2 × 300 bp) by the NICD Sequencing Core Facility service, in Pretoria, South Africa.
Cutadapt (v. 4.1:
Frequency-based filtering was applied to both the rbcL and trnL datasets, which aided in removing ASVs only present in a single sample and an additional abundance filter, identified as necessary when tested with a simulated mock community (
As part of an additional quality control step, the unassigned ASVs resulting from taxonomic assignment by the compiled reference libraries by
To determine the sum of branch lengths (SBL) of the trnL and rbcL datasets, the ASVs were aligned in the Multiple alignment program for amino acid or nucleotide sequences (MAFFT, v.7:
Alpha-diversity was determined using the Shannon index and was calculated based on the abundance matrices obtained for each primer dataset for both assigned and unassigned reads sampled from each bioregion using the estimate-richness function of the "phyloseq" (v.1.48:
Taxonomic assignments of the rbcL and trnL ASVs retrieved from the Syferkuil and Welverdiend cattle faecal samples were visualised via read abundance, frequency of occurrence (FOO), also known as absolute data, as well as the relative read abundance (RRA) of ASVs assigned to family and genera per sampled individual. Each taxon was grouped according to plant life forms, i.e., woody taxa (trees/shrubs), woody taxa/forbs (shrubs/forbs), forbs, and graminoids (grasses). All statistical analyses were carried out in R (v.4.3.1:
The Welverdiend rbcL dataset comprised of 10 209 509 sequences and 4 989 550 ASVs, of which 494 were left after filtering and post-clustering curation. For the trnL dataset, 3 894 691 sequences resulted in 1 852 442 ASVs of which 1 217 were left after filtering and post-clustering curation. The Syferkuil rbcL dataset consisted of 1 858 096 sequences and resulted in the creation of 1 444 146 ASVs of which 1 403 were left after filtering and curation. Of the 1 985 791 sequences that were input for the Syferkuil trnL dataset, 1 234 380 ASVs were created and 3 320 were left after filtering and trimming. ASVs that were assigned to non-plant references and were subsequently removed from further analysis included 17 from the Welverdiend trnL dataset, 51 from the Syferkuil rbcL dataset and 18 from the Syferkuil trnL dataset. Furthermore, the blast results indicated that ASVs classified as unassigned by the compiled reference libraries could also not be matched with any plant species in NCBI’s non-redundant database within the specified parameters.
To compare the phylogenetic distances of the species detected with the trnL and rbcL at each of the foraged communities, Welverdiend and Syferkuil, NJ trees from each treatment for the trnL and the rbcL ASVs were compared. The phylogenetic distances or SBL for the trnL ASVs were higher than the rbcL ASVs at both sites. Specifically, at Welverdiend, the SBL for trnL (295.51) exceeded that for rbcL (137.58), while at Syferkuil, trnL (934.78) also exceeded that of rbcL (358.77). The SBL for both markers was higher at Syferkuil than at Welverdiend. The alpha diversity metrics (Shannon index) revealed that the highest diversity and richness of consumed plant species (both assigned and unassigned) was detected by the trnL marker at the Syferkuil sampling site (Fig.
For Welverdiend, the total abundance of unassigned reads was more prevalent in the trnL dataset than in the rbcL dataset (Suppl. material
The total abundance of unassigned ASVs for the Syferkuil rbcL dataset was much more prevalent than those that could be linked to a taxonomy and the dataset only consisted of 24 samples since the control sample was lost during filtering. The Syferkuil trnL dataset was also represented by only 24 samples since all reads from sample 43 were lost during filtering and this dataset also included the highest abundances of unassigned reads relative to the other three datasets. In the case of the Syferkuil dataset (Suppl. material
The use of rbcL and trnL as a dual-locus barcoding approach revealed complementary capabilities in the reconstruction of cattle dietary profiles. Both markers could identify a range of plant families, with trnL exhibiting higher family diversity compared to rbcL in both Welverdiend and Syferkuil study areas. While trnL could uniquely identify families like Apocynaceae, Asteraceae, and Euphorbiaceae, rbcL could exclusively identify families such as Combretaceae and Annonaceae. Genus level identifications varied between the markers, with rbcL consistently identifying genera across all families whereas trnL had fewer representative genera of common families identified or no genera representative of unique families identified. Table
A summary of the ability of the rbcL and trnL barcodes to recover plant life forms from cattle diets on family and genus levels.
Family | Genus | rbcL | trnL | Trees/ Shrubs | Forbs | Graminoids | Study area | |
---|---|---|---|---|---|---|---|---|
WV | SK | |||||||
Acanthaceae | ✓ | ✓ | ✓ | |||||
Thunbergia | ✓ | ✓ | ✓ | |||||
Anacardiaceae | ✓ | ✓ | ✓ | ✓ | ||||
Rhus | ✓ | ✓ | ✓ | |||||
Annonaceae | ✓ | ✓ | ✓ | |||||
Hexalobus | ✓ | ✓ | ✓ | |||||
Apocynaceae | ✓ | ✓ | ✓ | ✓ | ✓ | |||
Adenium | ✓ | ✓ | ✓ | |||||
Asparagaceae | ✓ | ✓ | ✓ | |||||
Asparagus | ✓ | ✓ | ✓ | |||||
Asteraceae | ✓ | ✓ | ✓ | |||||
Capparaceae | ✓ | ✓ | ✓ | ✓ | ||||
Caryophyllaceae | ✓ | ✓ | ✓ | ✓ | ||||
Pollichia | ✓ | ✓ | ✓ | ✓ | ||||
Cleomaceae | ✓ | ✓ | ✓ | |||||
Cleome | ✓ | ✓ | ✓ | |||||
Combretaceae | ✓ | ✓ | ✓ | |||||
Combretum | ✓ | ✓ | ✓ | |||||
Convolvulaceae | ✓ | ✓ | ✓ | |||||
Ipomoea | ✓ | ✓ | ✓ | |||||
Cyperaceae | ✓ | ✓ | ✓ | |||||
Ebenaceae | ✓ | ✓ | ✓ | ✓ | ✓ | |||
Diospyros | ✓ | ✓ | ✓ | ✓ | ||||
Euclea | ✓ | ✓ | ✓ | ✓ | ✓ | |||
Ehretiaceae | ✓ | ✓ | ✓ | |||||
Ehretia | ✓ | ✓ | ✓ | |||||
Euphorbiaceae | ✓ | ✓ | ✓ | ✓ | ✓ | |||
Fabaceae | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||
Dichrostachys | ✓ | ✓ | ✓ | ✓ | ||||
Vachellia | ✓ | ✓ | ✓ | ✓ | ✓ | |||
Albizia | ✓ | ✓ | ✓ | |||||
Philenoptera | ✓ | ✓ | ✓ | |||||
Indigofera | ✓ | ✓ | ✓ | |||||
Senegalia | ✓ | ✓ | ✓ | ✓ | ||||
Senna | ✓ | ✓ | ✓ | |||||
Geraniaceae | ✓ | ✓ | ✓ | |||||
Monsonia | ✓ | ✓ | ✓ | |||||
Lamiaceae | ✓ | ✓ | ✓ | |||||
Leucas | ✓ | ✓ | ✓ | |||||
Malvaceae | ✓ | ✓ | ✓ | ✓ | ✓ | |||
Malvastrum | ✓ | ✓ | ✓ | ✓ | ||||
Moraceae | ✓ | ✓ | ✓ | |||||
Ficus | ✓ | ✓ | ✓ | |||||
Phyllanthaceae | ✓ | ✓ | ✓ | |||||
Flueggea | ✓ | ✓ | ✓ | |||||
Poaceae | ✓ | ✓ | ✓ | ✓ | ✓ | |||
Cenchrus | ✓ | ✓ | ✓ | |||||
Setaria | ✓ | ✓ | ✓ | ✓ | ✓ | |||
Tricholaena | ✓ | ✓ | ✓ | |||||
Enteropogon | ✓ | ✓ | ✓ | |||||
Trachypogon | ✓ | ✓ | ✓ | |||||
Eragrostis | ✓ | ✓ | ✓ | |||||
Rhamnaceae | ✓ | ✓ | ✓ | |||||
Berchemia | ✓ | ✓ | ✓ | |||||
Ziziphus | ✓ | ✓ | ✓ | |||||
Rubiaceae | ✓ | ✓ | ✓ | |||||
Solanaceae | ✓ | ✓ | ✓ | ✓ | ||||
Solanum | ✓ | ✓ | ✓ | ✓ |
The FOO analysis of both the rbcL and trnL datasets from Welverdiend (Fig.
Bar chart showing the percentage of frequency of occurrence (FOO%) of the plant (a) families and (b) genera, identified as part of the diets of Welverdiend cattle, faceted according to their life form. White bars represent taxa identified with rbcL and grey bars represent those identified with trnL.
In terms of forb abundance, the Welverdiend rbcL dataset showed lower abundance for genera like Malvastrum and Thunbergia compared to the dominance of woody taxa, but trnL identifications highlight the importance of forbs with Malvaceaea being the dominant family. This marked difference in datasets underscores the need for a dual-locus approach to fully capture the richness and abundance of forbs in cattle diets. The RRA calculations for the Welverdiend rbcL dataset (Fig.
Relative read abundance (RRA) of families (a, b) and genera (c, d) identified with the rbcL (a, c) and trnL (b, d) barcoding marker from the Welverdiend dataset. The stacked bar plots the distribution of plant taxa in individual samples arranged according to their dominant life forms relative to the number of unassigned reads, with codes indicating graminoids (G), forbs (F), forbs/woody taxa (F/W) and woody taxa (W). For the trnL charts, the sample with no families identified (C2) and samples with no genera identified (4 and 16) were excluded from the visualisations.
Forbs/woody taxa were again identified as the dominant life form in most samples in the Welverdiend trnL dataset followed by woody taxa (Fig.
At Syferkuil, a different pattern emerged, with graminoids and woody taxa dominating both marker datasets, but particularly so in rbcL (Fig.
Bar chart showing the percentage of frequency of occurrence (FOO%) of the plant (a) families and (b) genera, identified as part of the diets of Syferkuil cattle, faceted according to their life form. White bars represent taxa identified with rbcL and grey bars represent those identified with trnL.
Reinforcing the dominance of grasses in the diets of cattle at Syferkuil, the rbcL marker (Fig.
Relative read abundance (RRA) of families (a, b) and genera (c, d) identified with the rbcL (a, c) and trnL (b, d) barcoding marker from the Syferkuil dataset. The stacked bar plots the distribution of plant taxa in individual samples arranged according to their dominant life forms relative to the number of unassigned reads, with codes indicating graminoids (G), forbs (F), forbs/woody taxa (F/W) and woody taxa (W). For the trnL charts, sample C2 had no family or genus identifications and was excluded from the visualisations.
In summary, the RRA analysis across both sites showed a predominant reliance on graminoids and woody taxa, with forbs contributing to the diets, but in smaller relative amounts. The cattle diets at Welverdiend showed a higher overall dominance of forb and woody taxa compared to those of Syferkuil, which relied more heavily on grasses. The use of both the rbcL and trnL markers further highlighted the differences in the identification of plant families and genera, with trnL providing more detailed insights into forb consumption. Despite these differences in resolution between loci, there was consistency across both datasets in the identification of key families such as Fabaceae, Ebenaceaea, and Poaceae. These families were prominent in both Welverdiend and Syferkuil. However, the broader diversity captured by trnL, especially forbs, emphasises the importance of using multiple markers to obtain a comprehensive view of the dietary composition of the cattle foraging in east semi-arid African savannas.
In this study, we determined the dietary composition of cattle foraging in two bioregions of the eastern semi-arid South African savanna and demonstrated that forbs and woody taxa make a substantial contribution to the overall diet of cattle during the dry periods. The taxonomies described using the different barcoding regions highlight the need for the use of a dual-locus identification system for plants (
The high number of trnL ASVs left after filtering, combined with its greater phylogenetic distances and alpha diversity metrics, indicates that trnL is more robust than rbcL for capturing the diversity and phylogenetic structure of food items contained in herbivorous diets. Syferkuil exhibited the highest alpha diversity (Shannon index) and species richness for both markers, especially for trnL. This suggests that plant communities contained in cattle diets are more diverse compared to those at Welverdiend (Kruskal-Wallis test: p < 0.05). A substantial number of ASVs remained unassigned after comparison with reference libraries and could also not be assigned to plant species in the NCBI nt database. This indicates potential gaps in these reference sequence libraries or the presence of unknown or poorly characterised plant species in eastern semi-arid South African savannas, further emphasising the need for expanded reference databases for accurate metabarcoding analyses that will ideally facilitate multi-barcode approaches for plant identifications (
Except for the differences in families identified, the rbcL and trnL barcoding markers have different affinities for amplifying certain genera from a common family identified in the diets of cattle foraging in each study site. The taxa under consideration, as well as the marker used, have a significant impact on the success of species identification using DNA metabarcoding (
Identified food items across the studied sites were representatives from 25 different plant families, of which 16 included forbs as a potentially foraged life form (Table
According to the results, forbs are foraged by a noticeable percentage of individuals in the sampled dataset, especially species from the genera Asparagus and Ipomoea at Welverdiend and Adenium, Pollichia and Indigofera at Syferkuil. Different forb community compositions and diversity across these two study sites (
The interaction between availability and selection determines the actual diet consumed by animals (
Recognising and understanding these dietary shifts in cattle diets can guide adaptive management practices and will ensure the sustainable use of forage in semi-arid landscapes while sustaining livestock productivity. To mitigate the impacts of nutrient stress imposed by increasing temperature and decreasing precipitation, managers should consider implementing rotational grazing, providing supplemental feeding when necessary, and conserving the diversity of not only grasses but of shrubs and forbs in grazing areas to support flexible foraging strategies during periods of scarcity in dryland landscapes (
While the analysis provides valuable insights into the foraging behaviour of these herbivores, it’s important to consider potential biases inherent to the methodology. PCR bias may result in selective amplification of certain taxa, leading to an over- or underrepresentation of specific plant taxa. Furthermore, not all taxa that are present in the samples may have been successfully amplified and identified, potentially skewing the perceived diversity and richness of the cattle diets. Moreover, the faecal samples that were collected represent a “snapshot” of what an individual consumed immediately before sampling, rather than providing an overview of their long-term dietary habits. The authors recommend exploring methods to mitigate PCR bias and improve the representativeness of sampling by incorporating multiple sampling points over time. Additionally, integrating dietary observations or tracking methods alongside genetic analyses could offer a more holistic understanding of cattle diets and foraging behaviour.
The food items occurring in the diets of herbivores foraging in two bioregions of an eastern semi-arid South African savanna could be recovered and successfully taxonomically assigned using a dual primer approach and study-area specific reference libraries. The large number of unassigned reads retrieved by both markers is largely indicative of incomplete reference libraries and highlights the need for more complete and accurate multi-barcode records from South African savannas; not just to enhance the reconstruction of dietary profiles but to support other metabarcoding studies dealing with plant identifications from this region. This study shows the success of a dual-locus barcoding approach for the identification of plants in that the rbcL marker supplemented missing taxonomies for trnL and vice versa. The use of two plant barcoding regions was therefore necessary to retrieve a more accurate taxonomic resolution to reconstruct the diets of herbivores, especially important nutritional food items such as forbs that are seasonally selected for, but not abundant in savanna ecosystems.
The importance of forbs was showcased via RRA and FOO calculations of the diets of Welverdiend and Syferkuil cattle where they represented 64% of the identified plant families, although in lower abundance and occurring less frequently compared to trees, shrubs and graminoids. Key forb genera in Welverdiend included Malvastrum, Asparagus, and Ipomoea while Adenium and Pollichia were important for cattle at Syferkuil. Although grasses dominated the diets at Syferkuil, these cattle also consumed woody taxa and forbs, which were heavily foraged upon when encountered. In contrast, cattle in the communal rangeland at Welverdiend displayed a greater proportion of woody taxa and forbs in their diets compared to grasses. While these forbs and woody taxa were foraged upon by most members of the sampled community, they did not dominate individual diets as seen for Syferkuil cattle. This indicates a shift in dietary reliance at the end of the dry season, where forbs and woody taxa become crucial for Welverdiend cattle due to the scarcity of grasses, while forbs and woody taxa are actively foraged on at Syferkuil experimental farm for their nutritional benefits.
This study emphasises the flexibility in cattle foraging behaviour in the dry season when more forbs and woody-taxa are included in their grass-based diets, allowing them to adapt to maintain nutritional balances despite changing environmental conditions and resource availability. Such adaptability gives rise to the need for adaptive management strategies in semi-arid ecosystems to maintain livestock productivity. Furthermore, the preferences exhibited by cattle for different plant growth forms and taxa in semi-arid African savannas present an opportunity for future work to better understand the flexible forging behaviours of other large mammalian herbivores under the influence of drier climates. Further investigations of these flexible foraging strategies can provide valuable insights into the protection of wild herbivores, a better understanding of forb ecology, and the overall maintenance of ecosystem health.
We wish to thank the South African Environmental Observation Network (SAEON), Ndlovu Node and the University of Limpopo for providing logistical support during field data collection. We are grateful to NICD for the metabarcoding of samples.
The authors have declared that no competing interests exist.
The North-West University Animal Production Sciences Research Ethics Committee approved the study as zero-risk and granted written permission for its completion. The ethics number is NWU-00177-18-A5.
We thank the North-West University for its support of this research. The research was also supported by the German-South African research project “South African Limpopo Landscapes Network” (SALLnet) funded by the German Federal Ministry of Education and Research (BMBF; Grant number: 01LL1802C).
Conceptualization: SB, AL, FS. Data curation: FS, SB, DB. Formal analysis: DB. Funding acquisition: AL, FS. Investigation: AL, FS, DB, KB, MVM, SB. Methodology: DB, MP, SB. Project administration: FS, KB, SB, AL. Resources: ZTHM, SB, FS, MA, AI, AL. Software: DB, MP. Supervision: SB, FS. Validation: SB, DB, FS, AL. Visualization: DB. Writing - original draft: DB, FS, SB. Writing - review and editing: AI, MVM, KB, AL, MA, MP, ZTHM.
Danielle Botha https://orcid.org/0000-0002-7321-5382
Sandra Barnard https://orcid.org/0000-0002-9299-2769
Morné du Plessis https://orcid.org/0000-0003-4154-7830
Mushal Allam https://orcid.org/0000-0002-9875-6716
Kai Behn https://orcid.org/0000-0003-2598-9575
Arshad Ismail https://orcid.org/0000-0003-4672-5915
Anja Linstädter https://orcid.org/0000-0003-0038-9557
Malesela Vincent Mokoka https://orcid.org/0000-0002-1026-2982
Zamantungwa Thobeka Happiness Mnisi https://orcid.org/0000-0002-7923-5077
Frances Siebert https://orcid.org/0000-0001-5549-8211
All of the data that support the findings of this study are available in the main text or Supplementary Information.
The sequences of the forward and reverse primers of the barcodes with their Illumina overhang adapters amplified from herbivorous faeces in this study
Data type: xlsx
Overview of the main steps followed in the materials and methods section for the mock community analysis conducted in
Data type: png
Explanation note: * An external quality filter was added to the VSEARCH approach since this workflow does not incorporate one in its various steps. ** “OTU” in this diagram may refer to operational taxonomic units as retrieved from the clustering of sequences based on 97% similarity in the VSEARCH approach, amplicon sequence variants retrieved by the inference of an error model in the DADA2 approach or grouping of strictly identical sequences as in the OBITOOLS approach.
Bar plots generated from the taxonomic assignments of the Welverdiend rbcL (rbcL_WV) and trnL dataset (trnL_WV)
Data type: png
Explanation note: Bar plots generated from the taxonomic assignments of the Welverdiend rbcL (rbcL_WV) and trnL dataset (trnL_WV) showing the absolute abundance of reads assigned to families (A and B) and genera (C and D) identified as well as the proportions of unassigned reads.
Bar plots generated from the taxonomic assignments of the Syferkuil rbcL (rbcL_SK) and trnL dataset (trnL_SK)
Data type: png
Explanation note: Bar plots generated from the taxonomic assignments of the Syferkuil rbcL (rbcL_SK) and trnL dataset (trnL_SK) showing the absolute abundance of reads assigned to families (A and B) and genera (C and D) identified as well as the proportions of unassigned reads.