Methods |
Corresponding author: Catarina Magalhães ( cmagalhaes@ciimar.up.pt ) Corresponding author: Maria Paola Tomasino ( maripa.tomasino@gmail.com ) Academic editor: Chloe Robinson
© 2024 Beatriz Santos, Luís Afonso, Filipe Alves, Ana Dinis, Rita Ferreira, Ana M. Correia, Raul Valente, Ágatha Gil, Luis Filipe C. Castro, Isabel Sousa-Pinto, Massimiliano Rosso, Cinzia Centelleghe, Sandro Mazzariol, Catarina Magalhães, Maria Paola Tomasino.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Santos B, Afonso L, Alves F, Dinis A, Ferreira R, Correia AM, Valente R, Gil Ágatha, Castro LFC, Sousa-Pinto I, Rosso M, Centelleghe C, Mazzariol S, Magalhães C, Tomasino MP (2024) Hidden in the blow - a matrix to characterise cetaceans’ respiratory microbiome: short-finned pilot whale as case study. Metabarcoding and Metagenomics 8: e121060. https://doi.org/10.3897/mbmg.8.121060
|
Cetaceans are key sentinel species of marine ecosystems and ocean health, being a strategic taxonomic group that evaluates the well-being of aquatic habitats and detects harmful environmental trends. Respiratory diseases are amongst the main causes of death in these animals, so identifying the microbiome community in their exhaled breath condensates (EBC), i.e. blow, has been proposed as a key biomarker for assessing respiratory health. Yet, to characterise microbiomes related to these animals’ respiratory tract and use them as a proxy for health status, it is necessary to develop baseline data on the microorganisms associated with cetaceans. Here, the short-finned pilot whale (SFPW, Globicephala macrorhynchus) was used as a case study to validate the most suitable primer set to explore the prokaryotic diversity of the cetaceans’ respiratory tract. DNA extracted from blow samples (n = 12) of animals off Madeira Island was sequenced to amplify both V3-V4 and V4-V5 hypervariable regions of the 16S rRNA gene, using the same sequencing platform (Illumina MiSeq). Independently of the primer set used, all blows shared Actinobacteria, Bacteroidetes, Firmicutes and Proteobacteria phyla in their composition. V3-V4 resulted in a higher diversity of taxa with relative abundance above 1%, whereas the V4-V5 primers captured a higher number of microbial Amplicon Sequence Variants, detecting the rare microbial biosphere with pathogen potential. Additionally, it captured the core microbiome more efficiently. Thus, this study provides a detailed characterisation of SFPW respiratory-associated microbial communities, strengthening the idea of sociality influencing microbiome composition in the respiratory tract. Moreover, it supports the use of blow as a relevant biomarker for the physiological state of the airways in free-ranging cetaceans.
blow sampling, exhale breath condensate, Globicephala macrorhynchus, health status, metabarcoding, respiratory disease
As keystone species, cetaceans play vital ecological functions, considering their role as nutrient vectors, position in the food chains and use as bioindicators of environmental health (
Respiratory tract infections make a substantial contribution amongst the primary causes of death observed in these marine mammals (
Previous research on the airway microbiota of cetaceans resorted to the analysis of blow samples, using different sampling methodologies, DNA extraction and amplification and targeting different hypervariable gene regions, some of these using a metabarcoding approach. Nevertheless, the majority of data accessible on cetacean-associated microbiome, namely pathogens, diseases and parasites, come from captive, stranded, sick or injured individuals, which cannot be considered representative of the free-ranging populations (
To the best of our knowledge, there are no published studies on the SFPW respiratory microbiome. In addition, no previous studies on other cetacean species have compared the amplification targeting different hypervariable gene regions using the same methodology on the same samples. In light of this, the present study addresses the respiratory microbiome of free-ranging cetaceans, using SFPW as a model species. The main goal is to provide a network of consistent microbiome core taxa of the SFPW blow, by comparing V3-V4/V4-V5 hypervariable regions of 16S rRNA gene. The V4–V5 regions are currently recommended to target marine microbes (including both bacteria and archaea) by the Earth Microbiome Project (
Blow sampling was conducted in September and October 2018, during at-sea research monitoring campaigns in the southern waters of Madeira Island, Portugal, targeting SFPW (Fig.
Blow sampling locations and dates in the Madeira Archipelago, with an illustration of the short-finned pilot whale (Globicephala macrorhynchus; © E. Berninsone / ARDITI).
Sample collection was carried out using a PERFORMAgeneTM PG-100 swab collection kit (DNA Genotek®). This kit was attached to an extendable 5-metre aluminium pole and used as a sampling device to collect the blow (Suppl. material
The samples were kept refrigerated after collection, during transportation and until storage in the laboratory. Under lab conditions, samples were kept frozen until extraction. The maximum time between collection and extraction was approximately one year.
DNA extraction from blow samples was performed using the QIAmp® DNA Mini Kit (QIAGEN), following the manufacturer’s instructions. An additional sample concentration step in an Eppendorf Concentrator Plus™ was added to increase the concentration of the extracted DNA from blow samples. A Qubit™ 3 Fluorometer with a Qubit™ dsDNA High Sensitivity (HS) assay kit (Invitrogen™) was used for DNA quantification, after running negative (0 ng/µl dsDNA) and positive (10 ng/µl dsDNA) controls.
Samples were prepared for the amplification of the 16S rRNA gene hypervariable V4-V5 region (≈ 400 base pairs (bp)), using specific primers 515F-Y/926R (
Amplification and sequencing of both regions were performed through a dual-step PCR protocol followed by high-throughput sequencing. The PCR reactions included 2.5 μl of template DNA in a total volume of 25 μl. PCR conditions involved a 3 min denaturation step at 95 °C, followed by 35 cycles of 98 °C for 20 s, 60 °C for 30 s and 72 °C for 30 s and, finally, an extension stage at 72 °C for 5 min. A second PCR reaction was performed to add indexes and sequencing adapters to the target region, according to manufacturer’s recommendations (
The “DADA2” software package (v.1.20) (
The ASV counts and taxonomy tables from the upstream analysis, together with the metadata table containing the sample information (Sample ID, sampling day and geographical coordinates and residency pattern), were used as input for the “phyloseq” R package (v.1.36) for downstream analysis (
The distribution and diversity of the prokaryotic community across the different used primer sets were investigated. Alpha diversity was analysed for different subsets of samples by calculating four different indexes, also using “phyloseq” package: Observed and Chao 1 for the estimation of unique ASVs abundance and Shannon and Inverse Simpson as species diversity measures. Differences in alpha diversity were tested using the Wilcoxon signed rank test, considering a level of significance of 0.05. Moreover, the beta diversity was analysed using the “vegan” package and a Non-metric MultiDimensional Scaling (NMDS) plot, based on the Bray-Curtis dissimilarity (“phyloseq” package). This measure is a statistical index used to quantify the compositional dissimilarity between two different sites, depending on the two communities’ counts of shared and non-shared taxa (
The microbial communities´ taxonomic composition was evaluated by creating taxonomy bar plots representing the relative abundance of each identified taxa (number of reads per taxa per total read count in each sample). The distribution of prokaryotes taxa across the samples was analysed at four taxonomic levels: phylum, class, family and genus. Taxonomic distribution plots were generated using several R-packages, namely, “phyloseq” (v.1.36) (
To identify the common set of microbial taxa that originated for each of the datasets, the core microbiome was calculated using the “phyloseq” package “microbiome” (
A total of 12 blow samples were collected from six sampling events (Fig.
Summarised metadata of blow samples collected and analysed. * Samples gma_01a and gma_01b correspond to the same individual, as samples gma_07a and gma_07b; ** Samples gma_P1, gma_P2, gma_P3 were collected from a pool of individuals within the same group.
Sample ID | Sampling date | Class age | Type of sample |
---|---|---|---|
gma_01a * | 28/9/2018 | Adult | Isolated individual |
gma_01b * | 28/9/2018 | Adult | Isolated individual |
gma_02 | 29/9/2018 | Adult | Isolated individual |
gma_03 | 29/9/2018 | Adult | Isolated individual |
gma_04 | 2/10/2018 | Adult | Isolated individual |
gma_05 | 2/10/2018 | Adult | Isolated individual |
gma_06 | 5/10/2018 | Adult | Isolated individual |
gma_07a * | 6/10/2018 | Adult | Isolated individual |
gma_07b * | 6/10/2018 | Adult | Isolated individual |
gma_P1 ** | 6/10/2018 | Adult | Pool |
gma_P2 ** | 6/10/2018 | Adult | Pool |
gma_P3 ** | 10/10/2018 | Adult | Pool |
After the sequencing process, a total of 685,692 reads (with a mean of 57,141 ± 7,920.8 reads per sample) was obtained for the V3-V4 dataset; and 862,763 for the V4-V5 (with a mean of 71,896.9 ± 12,990.5 reads per sample). After the quality filter steps, the V3-V4 dataset showed a lower decrease in the number of sequences throughout the workflow compared to V4-V5, with 60.4 ± 5.7% and 48.5 ± 7.7% of sequences retained per sample, respectively. Therefore, the non-target sequences were eliminated. Consequently, the final output was 415,340 sequences in the V3-V4 dataset that were assigned to 2,764 ASVs (with a mean of 34,611.7 ± 6,228.9 reads per sample) and 417,733 sequences in the V4-V5 dataset that were assigned to 3,665 ASVs (with a mean of 34,811.1 ± 7,554.7 reads per sample).
Alpha diversity metrics varied with the set of primers used (Fig.
Box plot diagrams representing the median, first quartile and third quartile of observed richness, Chao1, Shannon and Inverse Simpson alpha diversity indices of the blow microbiome. Different primer sets were represented by different colours (blue for V3-V4 and orange for V4-V5).
The NMDS plot for beta diversity of microbial communities depicted a separation of the samples, based on the used primer set, with four distinct main clusters, two for each primer set (Fig.
Non-metric multidimensional scaling (NMDS) of the microbiota found in short-finned pilot whale blow samples, in merged V3-V4 and V4-V5 datasets, based on Bray-Curtis dissimilarity. The primer sets used are represented by different shapes (circles for V3-V4 and triangles for V4-V5) and the sampling days by different colours.
Regarding the PERMANOVA test results, the primer set influenced the prokaryotic communities’ distribution (p = 0.006). Specifically, approximately 25% (R2 = 0.249) of the prokaryotic community distribution variation was explained by using different primers (Suppl. material
The composition of the blow prokaryotic communities was investigated on the two datasets (V3-V4 and V4-V5) to analyse the differences and commonalities between using the two primer sets. Considering the richness, 1,890 bacterial and 38 archaeal taxa were recorded in the V3-V4 dataset, whereas V4-V5 recorded 2,327 bacterial and 30 archaeal taxa. Taxonomic analysis at the phylum level (Fig.
Relative abundance of prokaryotic phyla (A) and top 10 genera (B) identified across the SFPW blow samples, by V3-V4 (on the top) and V4-V5 (on the bottom) datasets.
At the Class level (Suppl. material
Regarding the family level (Suppl. material
Concerning the genus level (Fig.
At the family and genus levels, it is possible to observe two clusters of samples in terms of composition: Gma_01a, 02, 03, 04, 05, 06 and Gma_01b, 07a, 07b, P1, P2, P3 in V3-V4; Gma_01 to 05 and Gma_06 to P3 in V4-V5 (Suppl. material
The NMDS for the core microbiome analysis demonstrated an obvious separation of the samples based on the primer set used, making it possible to differentiate three distinct main clusters: two originated from the V3-V4 dataset and a unique cluster for the V4-V5 dataset (Fig.
Non-metric multidimensional scaling (NMDS) of the core microbiota found in short-finned pilot whale blow samples (Genus Level), in merged V3-V4 and V4-V5 datasets, based on Bray-Curtis dissimilarity. The primer sets used are represented by different shapes (circles for V3-V4 and triangles for V4-V5) and the sampling days by different colours.
The main goal of non-invasive sampling techniques for cetaceans is to avoid disturbing, injuring or negatively influencing the sampled individual during sample collection (
The sampling kit used (PERFORMAgene) has been employed in previous studies and applied to different animals, such as livestock (
In this study, the prokaryotic community harboured in the respiratory tract of SFPW was described for the first time and a comprehensive comparison of the performance of two different 16S primer sets’ (V3-V4 and V4-V5) was conducted. Both hypervariable 16S regions are recommended in literature for assessing marine microbial diversity (
Using different hypervariable regions, as well as different types of samples, storage, methods of DNA extraction and 16S databases, can influence the obtained data and the interpretation of the results. In
Alpha diversity measures, captured by each primer set, differed significantly. Our results showed that the V4-V5 dataset captured more abundance in unique ASVs (higher values in the Observed ASVs and Chao1 measures) and, subsequently, more identified taxa. On the other hand, V3-V4 resulted in higher values for the applied alpha diversity indexes (higher values in Shannon and Inverse Simpson measures). This is probably explained by the fact that these used indexes only consider unique ASVs in relative abundance above 1%. Beta diversity revealed that samples were grouped according to the primer sets used. Despite separating the samples in different clusters in relation to the primer set used, the V4-V5 dataset appeared to better represent the distribution of prokaryotic communities. In this dataset, individuals that were travelling together when they were sampled (namely Gma_2/Gma_3, Gma_4/Gma_5 and Gma_7/Gma_P1/Gma_P2) appeared within the same cluster and had more approximate values in the NMDS when compared to the V3-V4 dataset. There is evidence that sociality affects microbes in the respiratory tract (
The analysis of the blow core microbiome could provide useful features for the health monitoring of cetaceans worldwide. All samples from both datasets shared a main core microbiota in their blow, composed of Actinobacteria, Bacteroidetes, Firmicutes and Proteobacteria phyla. Nevertheless, the dominant ASVs were not the same between the results obtained from the amplification of V3-V4 and V4-V5 regions.
In this study, at a phylum level, the most dominant taxa recovered from both datasets were Proteobacteria. This aligns with previous studies of other species of cetaceans, either of baleen whales (
Regarding the genera level, Cutibacterium (dominant in the V4-V5 dataset) is a typical dominant microbial community in the nasal microbiota (
Regarding the comparison of the core microbiome between the different datasets used, our results show that V4-V5 provides less variation in the data obtained from all the samples, with all the blow microbiomes showing similarity between them. Furthermore, it is clear that, within the same differentiated cluster for this dataset, the NMDS values for the sampled individuals which travelled together (Gma_2/Gma_3, Gma_4/Gma_5 and Gma_7/Gma_P1/Gma_P2) are tendentiously close. This reinforces the influence of sociality in the microbiome composition, similar to what was inferred in the beta-diversity analysis. Therefore, our results suggest that the V4-V5 dataset could be more consistent in determining the core microbiome present in the respiratory tract of free-ranging SFPW, showing less variation in this parameter of tested samples. Nevertheless, the higher variation documented by V3-V4 may also be relevant since it provides a broader level of representation of the detected taxa within the blow. In this regard, a combination between the two primer sets could be complementary and represent a more robust way of characterising the microbiome. This is a widely used approach for metabarcoding studies as the combination of different primers have shown to improve taxa coverage, while also helping to reduce diversity bias (
The comparison between the primer sets showed that all samples from both datasets (V3-V4 and V4-V5) shared the main taxa composed of Actinobacteria, Bacteroidetes, Firmicutes and Proteobacteria phyla. This study provides a detailed characterisation of the microbial richness present in the blow of SFPW across multiple taxonomic levels. Following this work’s results, it is concluded that the primer set selection for the microbiome assessment in cetacean blow samples should depend mainly on the goal of the analyses. If the main goal is to capture more diversity present in higher relative abundance, the V3-V4 primer set is demonstrated to have a better performance; whereas, if the purpose is to gather more information in the form of unique ASVs and to identify the microbial rare biosphere, we propose the use of the primer set targeting the hypervariable regions V4-V5. Despite the V4-V5 dataset detecting a higher number of unique ASVs and taxa, most had a relative abundance of < 1%. The V4-V5 dataset showed more consistent results in determining the core microbiome in the blow samples, while V3-V4 had higher variation. In this regard, a combination of primers may prove to be the more robust way to characterise the blow microbiome. This study also offers evidence that social behaviour influences the microbiome composition of the respiratory tract in cetacean species.
Nevertheless, several other aspects require consideration and future development to advance the blow microbiome as a health monitoring tool for cetaceans. Besides optimisation of the sampling and processing protocols, it is also relevant to test different methodologies to enhance sequencing efficiency and downstream procedures. Moreover, crossing this type of data with photogrammetry datasets to assess body condition is relevant to properly infer the pathogenic potential of these microbial communities in cetacean species.
In conclusion, this study represents an important contribution towards understanding the microbiome present in the respiratory tract of free-ranging cetaceans and marks the initial step in characterising the blow microbiome of SPFW. Finally, this study further underscores the potential of the blow microbiome as a future biomarker for assessing the health status and physiological state of the airways in free-ranging cetaceans.
The authors have declared that no competing interests exist.
No ethical statement was reported.
This study was conducted under the project MarInfo (NORTE-01-0145-FEDER-000031) supported by NORTE 2020 under the PORTUGAL 2020 Partnership Agreement through the European Regional Development Fund (ERDF) and the project EMPHATIC funded by Biodiversa+, the European Biodiversity Partnership, under the joint call 2022 – 2023 BiodivMon for research proposals, co-funded by the European Commission and with the following funding organisations: Fundación Biodiversidad (FB, Spain), Fundação para a Ciência e Tecnologia (FCT, Portugal), Agence Nationale de la Recherche (ANR, France) and the Ministry of Universities and Research (MUR, Italy). It also had the support of the project MARCET (MAC/1.1b/149), co-financed by the MAC 2014–2020 program under the Interreg fund, of the project Oceanic Observatory of Madeira (M1420-01-0145-FEDER-000001-OOM), and of the Portuguese Foundation for Science and Technology (FCT) throughout the strategic projects UIDB/04292/2020 and UIDP/04292/2020 granted to MARE and throughout the LA/P/0069/2020 granted to the Associate Laboratory ARNET. RF was partially supported by the FCT grant SFRH/BD/147225/2019.
Conceptualization: SM, ÁG, FA, AMC, MR, AD, RV, CC, MPT, RF, CM. Data curation: BS, MPT. Formal analysis: MPT, BS. Funding acquisition: MR, RF, AMC, CM, ISP, LFCC, FA, AD. Investigation: MPT, ÁG, RV, BS, LA, AMC. Methodology: AMC, BS, FA, AD, RF, RV, ÁG, MR, MPT. Resources: FA, AD, MPT, CM, MR, ISP, RF, LFCC. Supervision: MPT, AMC, ISP, LFCC, CM. Visualization: MPT, BS. Writing - original draft: BS, LA. Writing - review and editing: SM, CM, BS, MR, ISP, MPT, AMC, FA, LFCC, ÁG, RF, RV, CC, AD, LA.
Luís Afonso https://orcid.org/0000-0003-3978-1388
Filipe Alves https://orcid.org/0000-0003-3752-2745
Ana Dinis https://orcid.org/0000-0002-1508-9197
Rita Ferreira https://orcid.org/0000-0002-1383-9054
Ana M. Correia https://orcid.org/0000-0003-4781-6894
Raul Valente https://orcid.org/0000-0001-8544-373X
Ágatha Gil https://orcid.org/0000-0003-4589-2201
Luis Filipe C. Castro https://orcid.org/0000-0001-7697-386X
Isabel Sousa-Pinto https://orcid.org/0000-0002-9231-0553
Massimiliano Rosso https://orcid.org/0000-0002-5161-6767
Sandro Mazzariol https://orcid.org/0000-0002-4756-1871
Maria Paola Tomasino https://orcid.org/0000-0003-3255-0982
The raw sequencing data from this study are openly available in the European Nucleotide Archive (ENA) with the reference number PRJEB72700 (https://www.ebi.ac.uk/ena/browser/home.
Supplementary information
Data type: pdf
table S1. Primers used for the blow microbiome analysis, targeting the 16S rRNA gene hypervariable region. table S2. Metadata of the blow samples collected and analysed: Sample ID, sampling date, geographical coordinates (DD), behaviour, age class, number of individuals sampled (1/pool) and residency pattern. Residency pattern was confirmed via Photo-ID, comparing with (OOM/MARE-ARDITI) Madeira’s photographic-ID catalogue of the species following
Upstream Bioinformatic Analysis
Data type: R file
Explanation note: R script used for the upstream bioinformatic analysis (DADA2 software package) of Illumina MiSeq originated reads.
Downstream Bioinformatic Analysis
Data type: R file
Explanation note: R script used for the downstream bioinformatic analysis (taxonomic identification, alpha and beta diversity) of quality-filtered Illumina MiSeq originated reads.
Bioinformatic Analysis: core microbiome
Data type: R file
Explanation note: R script used for the calculation of the core microbiome for each sequenced dataset.