Research Article
Print
Research Article
A DNA metabarcoding workflow for the detection of terrestrial gastropods from ingested DNA
expand article infoMathis Ventura, Nicolas Mazuras, Shanèze Noël, Lise Dupont, Virginie Roy
‡ Institute of Ecology and Environmental Science (iEES), Créteil, France
Open Access

Abstract

DNA metabarcoding is a powerful approach for assessing biodiversity and unravelling trophic interactions, particularly in the context of biological invasions where introduced predators may impact native communities. However, the analysis of predator digestive contents with DNA metabarcoding poses technical challenges including the specificity of primers used to amplify the target DNA, amplification bias and contamination issues, which must be carefully considered when working on new taxa. This study addresses these challenges by developing and validating a methodology for assessing the diversity of terrestrial gastropods predated by the exotic flatworm Obama nungara. Specifically, an optimized primer pair providing taxonomic resolution for at least 40 gastropod species and genetic lineages, including taxonomically complex and diverse slug genera such as Arion and Deroceras is proposed. The primer pair was optimized through in silico predictions, laboratory controls, and field testing. Mock communities, laboratory-fed individuals, and technical replicates from the field were used as metabarcoding controls to calibrate filter thresholds for Operational Taxonomic Units. Preliminary field results revealed significant dietary diversity in O. nungara, suggesting a broad feeding spectrum. Beyond O. nungara, the primers and the assay developed here can also be applied to study the diets of other terrestrial predators or to detect gastropods in environmental DNA samples, providing a useful tool for ecological and conservation research.

Key words:

Diet analysis, field controls, laboratory controls, primer design, soil macrofauna, taxonomic resolution

Introduction

Developed over a decade ago, DNA metabarcoding is a powerful tool for understanding biodiversity and interactions between organisms using environmental samples such as soil, water, feces or digestive contents (Taberlet et al. 2012; Bohmann et al. 2014; Deiner et al. 2017). It can address ecological questions, including the impact of predators on ecosystem functioning, by revealing diet composition and food preferences (Sow et al. 2020). In particular, the study of the diet of invasive alien species can reveal crucial information on prey types—such as endemic, endangered, or species of ecological, agricultural, economic, or health importance—as well as reveal unknown prey diversity co-existing in the predator’s environment (Martins et al. 2022).

Two main DNA metabarcoding strategies exist: a generalist approach using broadly conserved primer pairs, and a targeted one using primers designed for specific taxa or narrower groups, often covering the same marker region (i.e., COI or 16S) (Da Silva et al. 2019). The targeted approach has the advantage of using primers that allow a finer level of taxonomic identification than a generalist approach (Baetscher et al. 2023). This approach seems appropriate for studying the effects of introduced predators at a fine scale, as it provides detailed information on the species consumed and therefore the consequences of their predation pressure on prey communities (Ficetola and Taberlet 2023).

When initiating a dietary study on new predator or prey taxa using digestive DNA, several issues and technical limitations widely mentioned in the literature need to be considered, such as: 1) the species coverage of the selected markers for the target taxa and their taxonomic resolution (Aizpurua et al. 2018), 2) detection biases related to DNA conservation (Pompanon et al. 2012; Ando et al. 2020) or preferential amplification of certain species (De Barba et al. 2014; Nichols et al. 2018) and 3) strategies to be applied during the analyses to eliminate erroneous sequences (De Barba et al. 2014; Alberdi et al. 2018). The choice of primers and analysis parameters strongly affects results, determining the rate of undetected species (false negatives), over- or under-representation relative to actual environmental presence, or detection due to contamination (false positives) (Ficetola et al. 2015; Alberdi et al. 2019).

The present study is dedicated to the development and validation of a DNA metabarcoding methodology for the detection of terrestrial gastropod species (Gastropoda, Stylommatophora) in the digestive contents of a predator, the exotic flatworm Obama nungara (Platyhelminthes, Geoplanidae). This flatworm, native to South America, feeds on earthworms, slugs, snails and other flatworms according to laboratory experiments (Boll and Leal-Zanchet 2016; Noël et al. 2025a). Citizen science has reported its presence in several European countries, and it is now widespread in France (covering 75% of the territory, Justine et al. 2020), where it is considered a potential threat to soil biodiversity (Noël et al. 2025b). Detailed studies of its population dynamics, predation and consequent impact on invaded ecosystems are particularly valuable in providing valid recommendations for managing its invasion. To date, only earthworms as potential prey items have been analyzed using DNA metabarcoding of the digestive contents of O. nungara (Roy et al. 2022). The authors took advantage of primers already available in the literature and specifically targeting earthworms (Bienert et al. 2012). Terrestrial gastropods, which are widely consumed by O. nungara, have not been the subject of similar analyses until now.

In France, 422 species of terrestrial gastropods have been recorded in the Red List of Continental Molluscs of France (https://uicn.fr/liste-rouge-mollusques-continentaux/). However, identification of these species remains a major challenge due to the taxonomic complexity of certain genera. In the slug genus Arion, for instance, misidentification cases may arise due to complex hybridization patterns. These concerns notably involve Arion vulgaris and Arion ater sensu lato (i.e., Arion ater sensu stricto and the two morphotypes of A. rufus as described by Reise et al. (2020)). Moreover, some species exhibit significant intraspecific genetic diversity, particularly in mitochondrial DNA lineages, which suggests the presence of highly differentiated genetic lineages within nominal species, pointing to species complexes or cryptic species. This phenomenon is exemplified by Arion subfuscus, which, according to Pinceel et al. (2005), is subdivided into five highly divergent mtDNA lineages. Similarly, genetic divergence is observed within other species complexes, such as the snails Trochulus hispidus (Kruckenhauser et al. 2014) and the subspecies of Theba pisana (Greve et al. 2010). Designing reliable PCR primers for terrestrial gastropods therefore requires careful consideration of their complex taxonomy and genetic diversity.

The aim of the present work was to develop primers and suitable filtering parameters for estimating the diversity of terrestrial gastropod species consumed by O. nungara individuals collected in the field. The validation of this methodology was of particular interest to understand the technical limitations of the approach in terms of species coverage, taxonomic resolution, detection bias, and potential contamination. To achieve this, a multistep methodology was proposed: 1) primer design based on an alignment of reference sequences of the ribosomal 16S rRNA gene, enriched with sequences obtained from gastropods collected from different sites infested by O. nungara, 2) validation of primers in silico and by conventional PCR on a selected panel of gastropod species, 3) metabarcoding analysis of mock controls (DNA mixtures of established specific composition and concentration), dietary controls (DNA from gut contents of O. nungara fed sequentially with known gastropod species), and DNA from gut contents of O. nungara collected from sites with known terrestrial gastropod diversity. The results allowed the selection of a pair of primers and analysis parameters that met the validation criteria established for the detection of terrestrial gastropods in O. nungara gut contents. Although not validated for this purpose, these primers may be of interest for studying other types of samples (e.g. digestive DNA from other terrestrial gastropod predators or environmental DNA (eDNA) from soil).

Methods

Specimen collection and species identification

Gastropods were sampled twice a year from autumn 2021 to spring 2024 in anthropized green spaces invaded by O. nungara, across four regions of mainland France (Paris region, La Rochelle, Bordeaux region and Brittany). Sampling was carried out over a standardized 2-hour survey in all sheltering microhabitats within a 100 m2 area at each site. Samples were stored in absolute ethanol at 4 °C prior to DNA extraction. In a preliminary identification step, samples were assigned to morphospecies using identification keys from Gargominy (2020) and from Mc Donnell et al. (2009). A total of 469 individuals were collected (ranging from 1 to 61 per morphospecies, depending on availability in the field and rarity of the species), and DNA barcoding was then used to assign specimens to species or genetic lineages. Total DNA was extracted from about 4 mm2 of the mantle with the DNeasy Blood and Tissue kit (Qiagen, France) following the manufacturer instructions. The standard barcode fragment of 650 pb of the cytochrome oxidase I gene (COI) was amplified and sequenced using LCO1490/HCO2198 primer pair from Folmer et al. (1994). In total COI PCRs were performed on 400 individuals, yielding 349 successful COI barcodes. Using these barcodes, 38 species or genetic lineages of terrestrial snails and slugs were identified using BOLD Identification System (IDS) and BLAST from NCBI (BOLD dataset: http://dx.doi.org/10.5883/DS-MOLPWP1).

Gastropod reference database

To develop and validate specific primers for the amplification and sequencing of DNA from terrestrial gastropods in predator gut contents, available primers and sequences for several genes were collected from the literature and the National Center for Biotechnology Information (NCBI) database. To select the most appropriate gene for this study, several criteria were applied: the gene had to be well documented, with sufficient sequences available to build a reference database, and suitable for targeting taxa within terrestrial gastropods, which encompass more than ten families. Although COI is commonly used, preliminary in silico and PCR tests did not reveal regions suitable for designing exclusive primers for gastropods. Moreover, previous studies have shown that primer bias in COI can lead to the underrepresentation or non-detection of some taxa, even within closely related groups (Deagle et al. 2014; Elbrecht and Leese 2015; Piñol et al. 2015). The mitochondrial 16S rRNA gene was therefore selected, as it combines conserved regions for primer design with sufficient variability for species-level discrimination. This choice is supported by several studies that have highlighted 16S as a suitable alternative marker for metabarcoding in other taxa (Clarke et al. 2014; Deagle et al. 2014; Elbrecht et al. 2016).

In order to build the mitochondrial 16S rDNA database, the primer pair 16sar-L (5’-CGCCTGTTTATCAAAAACAT-3’) (Palumbi 1996) and modified 16sbr-H (5’-CCGGTYTGAACTCAGATCAYGT-3’) (Palumbi 1996, modified in Bossuyt and Milinkovitch (2000)) was used to amplify a 550 bp 16S fragment (hereafter referred to as “large fragment”) from one to three samples per species/lineage when available, giving priority to maximizing genetic diversity (by covering distinct COI clades when applicable) and geographic spread when possible, in order to avoid redundancy and to better represent intraspecific diversity for a total of 110 individuals. Reactions were carried out in a volume of 25 µL containing 2.5 µL of DNA extract. Amplification mix contained 0.08 U of GoTaq® G2 DNA Polymerase (Promega, France), 1X Colorless GoTaq® Reaction Buffer, 2 mM of MgCl2, 0.5 mM of dNTPS, 0.2 µM of each primer and 0.25 mg/mL of bovine serum albumin. The following cycling conditions were used: an initial denaturing step at 94 °C for 3 min; 36 cycles of denaturation at 94 °C for 30 s; annealing at 51 °C for 30 s, extension at 72 °C for 1 min and final extension step at 72 °C for 10 min. PCR products were sent to Eurofins Genomics for sequencing using the 16sar-L primer. Sequences were manually corrected using Chromas 2.6.6 software (https://technelysium.com.au) and submitted to BOLD and GenBank (all references and data available in BOLD dataset: http://dx.doi.org/10.5883/DS-MOLPWP1).

To improve species coverage for the primer design, 244 sequences were retrieved from NCBI and added to the database. These sequences were selected to represent both species recorded at our study sites and those likely to occur in French habitats potentially colonized by O. nungara. This addition represents 20 new species and lineages, bringing the total to 58, which corresponds to 13% of the French terrestrial gastropod fauna, considering species that could potentially be encountered based on their geographical distribution. Sequences were then aligned using the MUSCLE algorithm in SeaView4 (Gouy et al. 2010).

Design of specific primer pairs and evaluation of minibarcodes

215 earthworm 16S sequences from Barraux et al. (2024) and from our laboratory (not published), and the only available 16S sequence closely related to O. nungara from NCBI (AN: KP208777, listed as Obama sp.) were added to the gastropod alignment. Highly conserved regions surrounding variable regions were visually identified, and specific primer pairs (i.e. expected to amplify only gastropod DNA and not earthworm or O. nungara DNA) targeting ~100 bp variable sequences were manually designed. Standard primer design checks (e.g. annealing temperature balance, absence of strong secondary structures such as hairpins or primer–dimers, assessed with the Multiple Primer Analyzer web tool from Thermo Fisher Scientific) were performed before empirical validation.

To assess the overall coverage and specificity of the designed primer pairs for terrestrial gastropods, in silico PCR were first performed using the ecoPCR program (Ficetola et al. 2010). Each primer pair was tested against 18,210 mitochondrial reference sequences for Eukaryota (18,209 plus Obama sp.) retrieved from the NCBI Reference Sequence Database (RefSeq) (https://ftp.ncbi.nlm.nih.gov/genomes/refseq/mitochondrion/), and against 26,578 bacterial 16S rRNA sequences from the Bacterial 16S Ribosomal RNA RefSeq Targeted Loci Project (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA33175). Up to two mismatches per primer were allowed, with a single mismatch within the last three bases at the 3’ end considered a PCR failure.

PCR specificity cross-tests were also conducted on DNA extracts (see below CTAB extraction procedure) from three gastropod species, three earthworm species and eleven samples of O. nungara that were fed differently (with an empty stomach, previously fed with terrestrial gastropods or earthworms, or with gastropods and earthworms). The PCR reaction was carried out using the same reagent composition and volumes as previously described. The following cycling conditions were used: an initial denaturing step at 94 °C for 3 min; 40 cycles of denaturation at 94 °C for 30 s; annealing at variable temperature depending on primer pairs for 30 s, extension at 72 °C for 1 min and final extension step at 72 °C for 10 min.

To ensure broad coverage of terrestrial gastropod species, promising primers were tested in silico using the PRIMERMINER 0.21 R package (Elbrecht and Leese 2017). The penalty score, which quantifies mismatches between primer and target sequences—including mismatch type, position, and adjacency—was calculated for each primer. Sequences with a score below 120 were considered likely to be successfully amplified, whereas scores above 120 indicated low amplification probability, with 120 serving as the practical threshold for primer selection. PCR amplifications on 49 DNA samples from the 38 gastropod COI species/lineages were also conducted with the protocol described above.

Finally, a Neighbor-Joining tree was reconstructed with the software MEGA (Kimura 2-parameter model, 1000 Bootstrap replicates) based on the short DNA sequences (i.e. 16S metabarcoding region) targeted by the retained primers, to verify whether the different species or lineages occur in distinct clades.

To discuss the potential of using the primer pair in future studies studying terrestrial gastropods, its coverage in silico was also tested on available 16S sequences of terrestrial gastropod species from France (sequences available for 206 species out of 422 described, across 43 families) (list extracted from https://uicn.fr/liste-rouge-mollusques-continentaux/) and from Europe (sequences available for 698 species out of 2480 described, across 53 families and including sequences from this study) (list extracted from https://portals.iucn.org/library/node/48439) using the PRIMERMINER package.

Metabarcoding controls and field samples

Two types of mock community controls were set up (See Suppl. material 1). Three mock community controls (“POS-G1 to G3”) were each composed of DNA from five different gastropod species randomly introduced at the same concentration (21 ng/µL). Three other mock community controls (“NREP-G1 to G3”) were each composed of 27 species among the most abundant in anthropized sites. DNA volumes were diluted to the same concentration as above and randomly graduated from 10 µL to 1 µL depending on species (See Suppl. material 1). Both types of mock communities were designed to ensure that all introduced DNA from species were retrieved in proportional sequence counts, to assess the contamination level and to set a filtering threshold for clusters (i.e., sets of similar sequences grouped based on a predefined similarity threshold) (De Barba et al. 2014; Ficetola et al. 2015; Port et al. 2016; Thomas et al. 2016; Andruszkiewicz et al. 2017; Braukmann et al. 2019).

Six “controlled-feeding” controls were DNA from digestive contents of O. nungara individuals fed for three weeks with one prey per week. Three individuals (“UNI-G1 to G3”) received the same species, Cornu aspersum (non-differentiated genetic lineages, as they could not be recognized based on the external morphology). The other three individuals (“MIX-G1 to G3”) received a different species every week in the following order: Hygromia cinctella, Cornu aspersum and Arion hortensis L1. Consumption was checked after three days, and prey was always eaten. After three weeks, O. nungara individuals were killed and stored in absolute ethanol for further DNA extraction. Both types of “controlled-feeding” controls were used to test if all given prey would be detected by sequencing digestive contents after three weeks of experimentation and if there were potential amplification biases between DNA from different species. They were also used to assess contamination levels and to set a filtering threshold for clusters (De Barba et al. 2014; Port et al. 2016; Andruszkiewicz et al. 2017).

Three negative controls (“BLANK 1 to 3”), from three independent PCR amplifications without added DNA, were set up to check for possible contamination during PCR and to set the threshold of stochasticity for technical replicates. A pairwise distance matrix between negative replicates was created using the Bray-Curtis dissimilarity index (Bray and Curtis 1957) based on sequence frequency data. The distance value ranged from 0, indicating identical relative abundance profiles, to 1, representing completely distinct assemblages (no shared sequences). Since negative controls are not expected to have repeatable sequence frequencies, this allowed a threshold of stochasticity to be set for technical replicates, with samples exceeding a maximum value being discarded (De Barba et al. 2014; Corse et al. 2017).

The last three controls were DNA from digestive contents of three O. nungara individuals collected on the field during sampling campaigns (V6, V29 and V53 samples) and tested positive for gastropod DNA. Each of them was replicated three times (i.e. three independent PCR amplifications and sequencing per DNA to create technical replicates). Replicated field controls were used to apply parameters and thresholds (DNA quantity, distance threshold between technical replicates, threshold for clusters filtering) calibrated with other controls mentioned above, to ensure their reliability on real digestive contents from the field (De Barba et al. 2014; Pansu et al. 2015; Piñol et al. 2015; Beentjes et al. 2019; Mata et al. 2019).

Finally, 16 DNA extracts from O. nungara individuals (5 from Limeil-Brevannes, Paris region, 3 from Saint-Medard-en-Jalles, Bordeaux region, 3 from Cleguer, Brittany and 5 from Concarneau, Brittany) that tested positive for gastropod DNA were included in the experiment. By estimating the diversity of terrestrial gastropod species at these particularly rich collection sites, it was possible to experiment in vivo the retained primer pair and the filtering threshold for clusters calibrated from metabarcoding controls.

DNA from about 20 mg of gut content was extracted from the region corresponding to the opening of the mouth of O. nungara according to Roy et al. (2022). DNA extraction was performed with CTAB method adapted from Sambrook and Russell (2001). Samples were incubated overnight at 57 °C and DNA was extracted twice with an equal volume of chloroform: isoamyl alcohol (24:1).

DNA amplifications were then conducted on all 40 samples by using the validated primer pair and the corresponding PCR protocol (see Results). For each sample and replicate, a separate library was prepared with a target sequencing depth of 10,000 reads. Each library was assigned a unique index, following GenoScreen’s protocol for multiplexing. Sequencing was performed on the Illumina MiSeq platform (2 × 150 bp paired-end chemistry) with 15% PhiX.

Bioinformatic analysis

The Migale platform (https://galaxy.migale.inrae.fr/) was used to trim, merge, filter and cluster sequences. Pair-end reads were first trimmed with FROGS SICKLE program, based on a Phred quality score Q < 30. Merging of pair-end reads followed by first filtering were realized with FROGS PRE-PROCESS program with the following parameters: maximum lengths of 127 bp for forward reads (R1) and 134 bp for reverse reads (R2) were selected for merging, based on reads length distribution reports from FASTQC and MULTIQC programs; >30 bp coverage zone and 97% nucleotide identity were required to assemble paired-end reads and merged amplicons not comprised between 119 and 135 bp were discarded.

Clustering was then performed using FROGS CLUSTERING SWARM program with d = 1 (a small local linking threshold, representing the maximum number of differences between sequences in each aggregation swarm step) and the fastidious option selected. Clustered sequences will be referred to and identified hereafter as Operational Taxonomic Units (OTUs) in this study. Sequences identified as chimera in all samples were discarded using FROGS REMOVE CHIMERA program. Finally, clusters were filtered using the FROGS CLUSTER FILTERS program: only singletons were discarded, and the OTU filtering threshold established from control analyses was applied to both the technical replicates and the sixteen samples from the field.

For the taxonomic assignment of OTUs, a double BLAST approach was used: first with the BLASTn program from NCBI and second, with the reference database. Only BLAST matches with at least 95% identity and 95% sequence coverage were used to assign a gastropod species or lineage to each OTU. Unassigned OTUs, including those under the threshold and those unknown from any reference database, were then discarded from the analysis to maintain clean assignments and real detections. To further validate the taxonomic assignments, a Neighbor-Joining phylogenetic tree was reconstructed including OTU sequences with the corresponding reference sequences. This tree was then inspected to confirm that OTUs assigned to the same nominal species were clustered together and that the reference sequences of each species were included within the same clade.

Sequencing results were finally analyzed in R 4.4.2 (R Core Team 2024) with the PHYLOSEQ 1.46.0 R package (McMurdie and Holmes 2013) for each type of metabarcoding controls and field data, including data transformation for qualitative interpretation and graphical visualizations. Following the methodology of De Barba et al. (2014), an approach based on the relative read abundance (RRA) (i.e. proportional summaries of count reads for each detected prey) was chosen at the individual scale to drive the analysis (Casey et al. 2019; Deagle et al. 2019; Ando et al. 2020).

Results

Gastropod reference database

Ninety-six 16S sequences were obtained from DNA extracts of gastropods, representing 38 species and lineages described by morphology and/or the COI gene from French sites invaded by O. nungara. Including sequences from NCBI, the final database contained 340 sequences from 58 species/lineages (1 to 40 sequences per species or lineage) (See Suppl. material 2) and the alignment spanned 489 bp, including gaps.

Primer design and efficiency evaluation

Six primers, ranging from 14 to 20 bp, were designed by visualizing the 16S alignment of sequences from terrestrial gastropods, earthworm and O. nungara (See Suppl. material 3). These primers were complemented by three primers specific to gastropods obtained from Taberlet et al. (2018), resulting in eight primer pairs targeting short DNA fragments (98–217 bp) (Fig. 1).

Figure 1.

Schema of 16S DNA sequence in terrestrial gastropods showing variable (white) regions, conserved (green) regions and positions of the primers designed and retrieved for this study, the lengths indicated include the primers; and table detailing names, sequence lengths, 5’-3’ sequences and references for each potential primer pair.

In silico PCR results indicated that none of the primer pairs amplified bacterial 16S sequences, with amplification restricted to eukaryotic taxa. Among them, three primer pairs (the two including the MVF primer and Gast01R-Gast01F) produced the highest number of amplicons. Gast01R-Gast01F amplified exclusively Animalia, while the MVF-containing pairs retrieved mostly Animalia but also some sequences from Fungi and a small number of sequences from Chromista, Plantae, Protozoa, and an unassigned Eukaryota taxon (NC_086778). The five remaining primer pairs amplified fewer sequences overall, all belonging to Animalia (Fig. 2A). Within Mollusca, the performance of primer pairs has been examined on Stylommatophora, an order that includes most of terrestrial gastropods. The reference dataset contained 45 sequences from 45 Stylommatophora species. In silico PCR showed that Gast01R-Gast01F could theoretically amplify all 45 species, while the other primer pairs amplified between 26 and 40 species, except for GastD-Gast01R-F, which retrieved only about 10 species. Apart from primer pairs involving MVF, all amplified exclusively terrestrial gastropods, with no co-amplification of Obama sp. or Lumbricidae (See Suppl. material 4). Based on these results, Gast01R-Gast01F, VRF-VRR, and VRF-MVR appeared to be the most promising primer pairs for broad coverage and specificity of terrestrial gastropods (Fig. 2B). Additional details on amplification across kingdoms, phyla, and other taxonomic levels are provided in Suppl. material 4.

Figure 2.

In silico evaluation of primer pairs across eukaryotic taxa. A Global amplification per primer pair. Each bar represents the number of species amplified per primer pair, colored by kingdom. B Amplification specifically within terrestrial gastropods (order Stylommatophora). Bars indicate the number of Stylommatophora species amplified per primer pair. Dashed vertical line indicates the maximum number of target species available (n = 45).

Specificity cross-tests showed that, contrary to in silico predictions, two primer pairs (VRF-VRR and MVF-VRR) amplified only the targeted gastropod taxa and O. nungara samples fed with gastropods. Two primer pairs (VRF-MVR and GastD-GastE) amplified some of the targeted gastropods but also produced unspecific amplification. Four other primer pairs showed broader cross-amplification: MVF-MVR co-amplified targeted gastropods together with O. nungara from empty or earthworm-fed stomachs and earthworm DNA; GastD-Gast01F co-amplified targeted gastropods and O. nungara from empty and earthworm-fed stomachs; and GastD-Gast01R-F amplified the targeted gastropods but also co-amplified O. nungara from empty stomachs and produced non-specific amplicons (Fig. 3). The primer pair Gast01R-Gast01F, although initially promising in terms of taxonomic coverage in silico (Fig. 2) and in the literature (Taberlet et al. 2018), showed limited resolution for some key genera when tested in our phylogenetic alignments (poor taxonomic resolution for Deroceras species, Theba and Cornu lineages). In contrast, VRF-VRR consistently yielded reproducible amplification across different DNA samples. Given its better performance and broader utility, VRF-VRR was selected for all subsequent analyses, and Gast01R-Gast01F was not pursued further.

Figure 3.

PCR specificity cross-tests results for the three taxonomic groups (O. nungara, earthworms, and gastropods) using different potential primer pairs for minibarcode design. Bars indicate the number of species and samples showing each amplification outcome. “Targeted amplification” includes the three gastropod species (A. parvipenis, C. aspersum L1, D. invadens), the three O. nungara fed with gastropod prey (g. prey) and the two fed with both gastropod and earthworm prey (both prey); “Co-amplification” includes non-targeted amplifications either with the three O. nungara with empty stomach (e.s.), the three O. nungara fed with earthworm prey (e. prey) or with earthworm species (A. icterica, L. terrestris, L. castaneus L1). Colors correspond to amplification categories as indicated in the legend, annealing temperatures are indicated for each primer pair.

In silico amplifications using PRIMERMINER 0.21 showed that DNA sequences from 30 out of the 38 lineages from the field could theoretically be amplified with the VRF-VRR primer pair (Table 1) based on the standard penalty score of 120. Twenty-six of these lineages had a penalty score of zero and were therefore perfectly matched to the primer pair, while four species had a penalty score between 36 and 51. Two lineages showed haplotype-dependent results (Arion hortensis L2: 0 to 152, and Theba pisana L1: 71 to 325), and PCR amplification was predicted to fail for each sequence of six species (Table 1). Except for the family Pomatiidae, represented only by Pomatias elegans, primers could amplify at least one species for each of the other eleven families (all species from this study for most of them). Otherwise, the results of the in silico amplifications extended to French and European species showed that the VRF-VRR primer pair could theoretically be able to amplify DNA from at least 178 species found in France across 37 families, and 607 species found in Europe across 47 families. All detailed results for France and Europe are available (See Suppl. materials 5, 6).

Table 1.

Results of in silico (penalty score) and in vitro analysis (PCR amplification) for DNA sequences from each of the 38 terrestrial gastropod species found during sampling. A value of “0” means that the sequences perfectly match the primer pair, sequences with score below 120 are supposed to amplify, sequences above 120 are supposed to fail, and H-D (haplotype-dependent) means that results were variable between haplotypes from the same species. Abbreviations: “+” indicates amplification (PCR band on a gel), “-” indicates no amplification (no PCR band), and “unspecific” denotes an unspecific amplification with multiple bands.

Family Genus and Species Lineage Number of samples tested in silico penalty score PCR amplification
Agriolimacidae Deroceras agreste 3 152 +
D. invadens 3 0 +
D. panormitanum 2 0 +
D. reticulatum 3 152 +
Arionidae Arion ater rufus 5 0 +
Ar. distinctus 3 0 +
Ar. hortensis L1 3 0 +
Ar. hortensis L2 3 H-D +
Ar. intermedius 3 0 +
Ar. subfuscus S1 2 0 +
Ar. vulgaris 3 0 +
Boettgerillidae Boettgerilla pallens 3 0 +
Cochlicopidae Cochlicopa lubrica 1 0 +
Geomitridae Backeljaia gigaxii 1 0 +
Cochlicella barbara 3 0 +
Xeroplexa intersecta 3 0 +
Helicidae Cepaea hortensis 2 0 +
Ce. nemoralis 3 0 +
Cornu aspersum L1 4 0 +
Co. aspersum L2 2 0 +
Helix lucorum 1 0 +
Theba pisana L1 2 H-D unspecific
Th. pisana L2 1 325 unspecific
Hygromiidae Hygromia cinctella 3 254 +
Monacha cartusiana 1 0 +
Trochulus hispidus 2 249 +
Limacidae Ambigolimax parvipenis 5 0 +
Am. valentianus 1 0 +
Limacus flavus 3 0 +
Limax maximus 3 0 +
Milacidae Milax gagates 2 36 +
Mi. nigricans 2 36 +
Tandonia budapestensis 3 36 +
Oxychilidae Oxychilus draparnaudi 4 0 +
Pomatiidae Pomatias elegans 1 211 -
Testacellidae Testacella haliotidea 3 51 +
Te. maugei 2 0 +
Te. scutulum 3 0 +

The results of PCR assays on gastropod DNA showed that 35 out of the 38 species and lineages tested showed the expected amplification band. Pomatias elegans was the only species that did not amplify, and two other lineages from a morphospecies (Theba pisana L1 and L2) showed unspecific results (Table 1).

Fifty-six out of 58 species and lineages were monophyletic in the NJ tree reconstructed based on the short fragment: they grouped into distinct clades or were distinct singletons (See Suppl. material 7). This is particularly relevant for the very close species of the genus Deroceras (Deroceras invadens and Deroceras panormitanum, and Deroceras agreste and Deroceras reticulatum) (Ventura et al. 2025) and the lineages of Arion hortensis (2 lineages never described) and Arion subfuscus (5 lineages) described in the literature (Pinceel et al. 2004, 2005) that are differentiated with the short 16S metabarcoding region. Furthermore, recently differentiated subspecies of Arion ater (Reise et al. 2020), Arion ater ater, Arion ater ruber and Arion ater rufus could also be distinguished. Finally, two species, Ambigolimax parvipenis and Ambigolimax valentianus, were indistinguishable, as they merged in the tree reconstructed based on the short 16S metabarcoding region.

Metabarcoding controls assessment and field samples

High-throughput sequencing resulted in 714,602 paired-end reads for the 40 samples (metabarcoding controls and field samples). Details of reads and OTU counts through the bioinformatic process before analysis of the clustering filter threshold are shown in Table 2, and raw reads have been deposited in the European Nucleotide Archive (ENA) at EMBL-EBI under accession number PRJEB100783 (https://www.ebi.ac.uk/ena/browser/view/PRJEB100783). As expected, the three negative controls (“BLANK 1 to 3”) retained few sequences (5 to 8 per sample) after trimming and merging procedures. In these controls, the vast majority of reads were discarded because they corresponded to very short sequences (less than 25 bp). The mock communities (“POS-G1 to G3” and “NREP-G1 to G3”) and the three replicate samples (V6, V29 and V53, from digestive contents of O. nungara collected in the field) showed an average of 5,000 analyzable sequences. The “controlled-feeding” controls showed a considerable decrease after the trimming and merging of paired-end reads, with an average of 911 ± 498 sequences for “UNI-G1 to G3” and 1870 ± 540 for “MIX-G1 to G3” before cluster filtering. Finally, the sixteen field samples yielded an average of 17,000 analyzable sequences (Table 2).

Table 2.

Overview of the consecutive reduction of analyzed reads, sequences and OTUs through the bioinformatic pipeline before cluster filtering; from sequencing results to singletons removal for each type of metabarcoding controls and field samples (Ntot = 40 DNA or gut samples). Indicated values are means and standard deviations across samples.

Sample type (n) Sequencing (N reads) Trimming (N reads) Merging/ Pre-processing (N sequences) Clustering (N sequences; N OTUs) Chimera removal (N sequences; N OTUs) Singletons removal (N sequences; N OTUs)
Negative (3) 23342 ± 1331 18621 ± 2046 7 ± 2 7 ± 2; 3 ± 1 7 ± 2; 3 ± 1 7 ± 2; 3 ± 1
NREP mocks (3) 6963 ± 213 5847 ± 230 4417 ± 203 4417 ± 203; 84 ± 2 4382 ± 202; 49 ± 1 4301 ± 183; 40
POS mocks (3) 6747 ± 168 5633 ± 138 4287 ± 147 4287 ± 147; 38 ± 8 4266 ± 151; 20 ± 4 4259 ± 147; 11 ± 2
UNI feeding (3) 7533 ± 247 3708 ± 316 927 ± 516 927 ± 516; 20 ± 2 916 ± 503; 17 ± 2 911 ± 498; 13 ± 2
MIX feeding (3) 7843 ± 334 4134 ± 439 1878 ± 541 1878 ± 541; 10 ± 1 1876 ± 540; 9 ± 1 1870 ± 540; 7 ± 2
Technical replicates (9) 14632 ± 1066 11578 ± 852 6301 ± 809 6301 ± 809; 40 ± 5 6287 ± 793; 30 ± 5 6254 ± 818; 20 ± 3
Field samples (16) 26566 ± 925 21561 ± 776 17561 ± 679 17561 ± 679; 20 ± 3 17502 ± 667; 15 ± 2 17339 ± 801; 13 ± 2

In all controls, OTUs were either assigned to terrestrial gastropod species (based on more than 95% identity and 95% coverage with NCBI sequences and confirmed by alignment of OTU sequences with the reference database), or showed “no significant similarity” in both BLAST analysis and the database (i.e. likely artefactual sequences). The number of OTUs per nominal species or lineage varied from 1 to 6 OTUs across all samples. Some species showed high OTU richness, such as Cochlicella barbara (6 OTUs) and Cornu aspersum L1 and L2 (4 OTUs each), but phylogenetic reconstruction confirmed the clustering of OTUs assigned to the same nominal species with reference sequences (data not shown). As expected, the 16S region amplified by the VRF-VRR primer pair only failed to distinguish Ambigolimax parvipenis from Ambigolimax valentianus.

In POS-G1 and POS-G3 mocks, all five expected gastropod species were recovered, with more than 99% of sequences from these species on average, followed by unassigned clusters and artefactual sequences (Fig. 4A-C). DNA sequences from four of the five expected species from POS-G2 were also detected, representing 87% of the sequences. However, DNA sequences from the fifth species, Deroceras invadens, were not detected in POS-G2 or in any of the NREP mock communities, and instead sequences from Deroceras panormitanum were detected, suggesting a laboratory error (Fig. 4).

Figure 4.

Comparison of expected and observed sequencing results for both types of mock community: A–C (POS) and D–F (NREP). Results are displayed as relative read abundance (RRA, % of sequences) after applying a 0.1% threshold filter for unexpected and non-assigned OTUs (representing 99.9% of the target gastropod taxa) and clustering reads from the same expected species. For D–F (NREP), each point represents the observed abundance of a given taxon in one sample; the red dotted line indicates the theoretical 1:1 relationship between observed and expected abundances, and the gray dashed line shows the regression across all points, representing the observed trend. The color legend applies for both type of mock community.

The VRF-VRR primer pair amplified and discriminated 16S DNA sequences from 24 out of the 27 species and lineages included in the “NREP” mock communities (Fig. 4D-F). The 27 most abundant OTUs, representing on average more than 99% of the sequences in these samples, were successfully assigned to the 24 expected species. Indeed, Cornu aspersum L1 was represented by 4 OTUs and Pomatias elegans, Theba pisana L1 and Theba pisana L2 were not detected.

In both “POS” and “NREP” mock types, Cornu aspersum L1 showed the highest disparity between expected and observed sequence counts, as the results showed that it was overrepresented in the mock communities: +29% in POS-G1 and +10% on average in NREP mocks. Conversely, in the POS-G1 and POS-G2 mocks, Testacella haliotidea (-16% in POS-G1 and -3% in NREP mocks on average) and Hygromia cinctella (-18% in POS-G2 and -4% in NREP mocks on average) sequences were the most underrepresented in the observed communities (Fig. 4).

In the “controlled-feeding” controls UNI-G1 to G3 (fed only with Cornu aspersum), on average more than 99% of the sequences were assigned to Cornu aspersum L1 (one OTU for UNI-G1), to Cornu aspersum L2 (one OTU for UNI-G3) or to Cornu aspersum L1 and Cornu aspersum L2 (one OTU each for UNI-G2). In all three “controlled-feeding” controls MIX-G1 to G3, sequences from the last prey given (Arion hortensis L1, 1 OTU) were detected in up to 99% of the sequences in MIX-G3 and 54% on average in MIX-G1 and MIX-G2. MIX-G1 and MIX-G2 also showed sequences from the second prey (Cornu aspersum L1), whereas sequences from the first prey (Hygromia cinctella) were not detected in any sample. Again, sequences from the second prey Cornu aspersum L1 were highly abundant in the “MIX” samples, yielding 40% (MIX-G1) and 50% (MIX-G2) of the sequences compared to sequences from the last prey Arion hortensis L1, consumed and digested one week later (See Suppl. material 8).

The results obtained for controls were used to set the filtering threshold for OTUs. The mock communities showed that unexpected or artefactual OTUs (i.e. assigned to unexpected species or returned as “no significant similarity”) had an average occurrence of 0.1% across samples. On the other hand, unexpected OTUs from non-given prey for the “controlled-feeding” controls averaged 0.4%. These two potential thresholds were tested on PCR technical replicates from the field (V6, V29 and V53 samples). Of the 70 OTUs detected in these samples (see Suppl. material 8), 33 were assigned to 14 terrestrial gastropod species, representing on average 95.8% of the sequences. Two OTUs matched a chromosome 8 gene from Homo sapiens and Pan troglodytes, but accounted for only 0.016% of the sequences. Thirty-five OTUs, corresponding on average to 4.2% of the sequences, could not be assigned to any taxon, as no significant BLAST matches were retrieved. When applied, the 0.1% filter was not strict enough, leaving several unassigned OTUs and artefactual sequences in the samples. Conversely, the 0.4% filter allowed to discard each of these from the replicates – except for one, OTU 7 – while preserving all correctly identified OTUs. To ensure that any potential contamination was discarded and because this work was based on digestive contents, such as the “controlled-feeding” controls, the 0.4% filter was retained for further analyses.

Negative controls (BLANK 1–3) amplified very few sequences (5–8 reads in total, scattered across six species after discarding non-assigned OTUs), indicating minimal background noise and no evidence of systematic contamination. Bray-Curtis distances were calculated among these controls to provide an estimate of stochastic variation (d = 0.7), consistent with previous work using similar approaches (De Barba et al. 2014; Lopes et al. 2015; Zepeda-Mendoza et al. 2016). This threshold was used as a reference to evaluate technical replicates: any replicate with a Bray-Curtis distance greater than 0.7 would have been discarded. Replicates for field-collected digestive content samples (V6, V29 and V53) showed much lower distances (≤ 0.1) and were therefore all retained for merging. For the merging step, a conservative criterion was applied: only OTUs detected in all three technical replicates from the same sample were retained and their read counts averaged to generate consensus results (Alberdi et al. 2018; Beentjes et al. 2019) (See Suppl. material 9). Final retained OTUs were assigned to species: V6 sample, from La Rochelle, showed sequences from Boettgerilla pallens, Deroceras invadens and Oxychilus draparnaudi. V29, in Cleguer, Brittany, showed sequences from Arion distinctus, Boettgerilla pallens, Cornu aspersum L1 and the unassigned OTU 7. Finally, sequences from Ambigolimax parvipenis/valentianus, Cochlicella barbara, Cornu aspersum L1 and Deroceras invadens were recovered from the digestive contents of V53, sampled in Lampaul-Plouarzel, Brittany. A single unassigned OTU, but no possible artefactual sequences or intriguing assignments demonstrate the consistency of this filtering threshold for digestive contents from the field.

Gut contents from the 16 field samples showed 1 to 6 prey species per sample after applying the 0.4% OTU filtering threshold (See Suppl. material 8). Between 5 (in Saint-Medard-en-Jalles, region of Bordeaux) and 8 species (in Concarneau, Brittany) were recorded at the 4 sites studied (Fig. 5). 32 of the 43 (74%) detections in gut contents were species also observed at the sites during sampling phases. Deroceras invadens and Arion hortensis L1 were detected in all four sites and in the digestive contents of all sites except Cleguer, Brittany, for Arion hortensis L1. At the Cleguer site, two species not found in the field (Oxyloma elegans and Lauria cylindracea) were detected in the gut contents of two samples, and one OTU was assigned as Deroceras sp. due to a lack of taxonomic precision and the ambiguity between two potential species (Deroceras laeve and Deroceras lasithionense, the latter being endemic to the island of Crete in Greece). Finally, an OTU representing 2–4% of the sequences in two samples from Saint-Medard-en-Jalles, but below the assignment criteria, was named as “Unassigned OTU” (Fig. 5).

Figure 5.

Number of digestive contents in which each prey species was detected for the sixteen field samples, shown per sampling site. Site titles follow the nomenclature “Region, Site”. Each point corresponds to one prey species: red symbols indicate species also recorded during field sampling, whereas grey symbols indicate species not detected in the field but found in gut contents.

Discussion

In this study, new primers were developed to amplify and sequence DNA of terrestrial gastropods from digestive samples, and a detailed procedure was proposed using metabarcoding controls to ensure their specificity, resolution, and coverage for the target taxa. The first results from field data were also obtained, by sequencing the gut contents of the flatworm O. nungara, an exotic predator of terrestrial gastropods that is widespread in Europe.

Specificity, coverage and resolution of the newly designed primers

Universal primers detect a wide range of taxa but often lack specificity, leading to co-amplification of non-target organisms or contaminants. Their broad taxonomic range can also reduce resolution, sometimes limiting identification to family or genus level (Garwood et al. 2023). On the contrary, a prey-targeted approach maximizes the sequencing of gastropod DNA, allowing for a finer resolution of the predator’s diet. It also avoids the need for predator DNA masking with blocking primers, which, despite their effectiveness, introduce biases. Indeed, these primers can unintentionally inhibit the amplification of certain prey species or cause amplifications to fail altogether, distorting dietary analysis (Shehzad et al. 2012; Piñol et al. 2015; Robeson et al. 2018).

This study introduces a new primer pair specifically targeting short 16S DNA fragments of about 162 bp from terrestrial gastropods, representing a key advancement for the effective implementation of metabarcoding in trophic interaction and biodiversity studies. To date, only one other primer pair, developed by Taberlet et al. (2018) and already used in several studies (Mulero et al. 2021; Zhou et al. 2024), has been designed specifically for gastropods but it has not been validated in our cross-tests implying O. nungara, earthworms and gastropods DNA.

A key step in primer development is ensuring broad taxonomic coverage, i.e. the ability to amplify as many species or lineages as possible within the target taxa. Two complementary approaches, in silico amplifications based on sequence alignments and PCR amplifications on DNA from a variety of identified samples, were performed in this study and the results from both methods were only slightly divergent: among the 38 gastropod species and lineages tested, DNA amplification was expected for 32 referring to in silico tests, and PCR results showed successful amplification in 35 cases.

DNA from Pomatias elegans, Theba pisana L1, Theba pisana L2, Deroceras agreste, Deroceras reticulatum, Hygromia cinctella, and Trochulus hispidus was predicted in silico to fail amplification due to mismatches between their sequences and the retained primer pair, particularly at the 3’ end of the reverse primer. This region is critical for primer binding, and mismatches or gaps in this area can significantly reduce amplification efficiency (Elbrecht and Leese 2017). As expected, PCR amplification of DNA from Pomatias elegans and Theba pisana L2 were unsuccessful, and these species were not detected in mock community controls. For Theba pisana L1, in silico results varied depending on the sequence considered, but PCR amplification was ultimately unsuccessful and it was not detected in the mock communities.

However, despite similar mismatches, PCR amplification tests successfully amplified DNA from Deroceras agreste, Deroceras reticulatum, Hygromia cinctella, Trochulus hispidus, and all tested specimens of Arion hortensis L2, though in silico predictions for the latter showed some haplotype-dependent variability in amplification potential. These results underscore the need for experimental validation, as in silico predictions may not fully capture amplification potential. They also highlight the need to account for both interspecific and intraspecific sequence variability when designing primers for eDNA studies, ensuring their applicability across a wide range of targeted taxa (Elbrecht and Leese 2017; Taberlet et al. 2018; Bruce et al. 2021). Otherwise, the in silico analyses suggest that the VRF-VRR primer pair could be useful for the detection of terrestrial gastropod species from a wide range of habitats in France (i.e. not only anthropized areas) and in the rest of Europe in future studies using eDNA or digestive DNA. However, this still requires experimental and/or field confirmation.

Another important step in marker development is their resolution i.e. the potential of the amplified fragment to unambiguously distinguish taxa (species or genetic lineages, as described for the reference database). The phylogenetic reconstruction based on the short 16S fragment revealed that nearly all species and lineages considered could be clearly distinguished. These promising results confirm the 16S database’s value for future research and expansion. Indeed, some species still lack sufficient or even any available sequences in the literature, as is the case for Ambigolimax valentianus.

Maintaining a taxonomically verified and updated reference database is crucial for deriving ecological insights. Here, we emphasize that BLAST-based identification can be limited not by the tool itself, but by incomplete or erroneous reference databases and unresolved taxonomy, which can hinder accurate species or lineage assignment from environmental DNA. Indeed, there are two types of constraints on the identification of terrestrial gastropods from mitochondrial data: the existence of highly differentiated genetic lineages within nominal species, suggesting species complexes or cryptic species, and the existence of hybrids and introgressed forms, possibly leading to morpho-molecular incongruences.

Regarding the existence of genetic lineages within species, Pinceel et al. (2004, 2005) identified five distinct mitochondrial (mtDNA) groups (S1 to S5) within Arion subfuscus, which appears polyphyletic in phylogenetic reconstructions. Arion subfuscus samples from the French sites were assigned to group S1 using the updated reference database of these lineages. Pinceel et al. (2004, 2005) noted that the high divergence between mtDNA lineages may reflect cryptic diversity, although nuclear markers and morphological data do not provide conclusive evidence for distinct species. Similarly, genetic divergence has been observed within other species complexes, such as Trochulus hispidus (Kruckenhauser et al. 2014) which is polyphyletic and the subspecies of Theba pisana (Greve et al. 2010), underscoring the fact that multiple lineages can exist within a single species. These cases highlight the need to include all described lineages in reference databases when identifying taxa from environmental or digestive samples. Indeed, although their taxonomic status is not yet confirmed, their ecological role may be different.

Beyond cases of taxonomic identification challenges at the genetic lineage level, possible cases of misidentification could occur due to complex patterns of hybridization. They concern, for example, Arion vulgaris and Arion ater s.l. (i.e. Arion ater s.s. and the two morphotypes of A. rufus according to Reise et al. (2020). Arion vulgaris tends to replace local species in anthropized areas, producing hybrids with Arion ater s.l. and a continuum of morphological intermediates (Reise et al. 2020). In the present study, molecular analysis of the French samples with Arion rufus external morphology allowed them to be assigned to Reise et al.’s (2020) “br” lineage corresponding to the predominantly British subspecies Arion ater rufus (i.e. Rowson et al.’s (2014) Arion rufus and Chevallier’s (1972, 1974) Arion rufus rufus).

It should also be noted that studies of terrestrial gastropods have revealed remarkably high levels of both intra- and interspecific divergence in mtDNA, often exceeding 10% (Parmakelis et al. 2013). This complicates BLAST-based identification and reinforces the need for comprehensive databases and phylogenetic placement of OTUs. The 95% threshold arbitrarily chosen in this study seems rather strict in this sense, but limits misidentifications.

Key considerations for metabarcoding in dietary analysis

How to interpret high-throughput sequencing results has always been a concern in metabarcoding studies, especially when analyzing fragmented and/or partially digested DNA. Whether to rely on a quantitative or qualitative approach depends on several factors such as research aims, species-specific amplification biases, target taxa and dietary diversity, and should be considered when evaluating sequencing results (Deagle et al. 2010, 2013, 2019; De Barba et al. 2014; Alberdi et al. 2019; Casey et al. 2019; Lamb et al. 2019; Gold et al. 2023; Shelton et al. 2023).

Extracting DNA from digestive contents rather than feces implies that the sequencing results are highly dependent on the time elapsed since prey consumption, regardless of the species consumed. Consequently, the number of sequences per species does not necessarily reflect their abundance in the diet but rather indicates that the prey was consumed more recently – or conversely, earlier and more fully digested (Deagle et al. 2010, 2019). Analyzing metabarcoding data from a quantitative perspective (e.g., sequence counts or relative read abundance i.e. RRA) is also particularly challenging when considering the bias introduced by the preferential amplification of primer pairs (also referred to “species-specific bias” in Thomas et al. 2016; Yang et al. 2021; Gold et al. 2023). This bias results from several factors, including sequence motifs, GC content, fragment length, primer mismatches and DNA degradation, which affect amplification efficiency (Deiner et al. 2017; Braukmann et al. 2019). The relative abundance of detected taxa is biased because the observed sequence counts often fail to reflect the initial DNA proportions, even in controlled mock communities (Deiner et al. 2017; Shelton et al. 2023).

The results from mock communities obtained in this study are consistent with this trend. Indeed, Cornu aspersum L1 is systematically overrepresented in the samples. This suggests that this species may be preferentially amplified during the annealing step. In the MIX “controlled-feeding” controls, this species was consistently detected at a sequence number equivalent to that of Arion hortensis L1, which was consumed a week later. Interestingly, Arion hortensis L1 yielded sequencing results that were consistent with expectations in the mock communities, suggesting it is not negatively affected by amplification bias. This supports the idea that Cornu aspersum L1 DNA hybridizes more effectively and thus biases its true sequence counts in samples. Conversely, other species appear to show the opposite trend. For example, Hygromia cinctella was consistently underrepresented in mock communities, was completely absent from the “controlled-feeding” controls despite being the first prey eaten. It was also not detected in the field replicates or in the 16 field samples, despite being sampled in two of the four study sites. These results are consistent with an in silico binding score of 254, indicating that DNA from this species would theoretically not be expected to amplify efficiently with the VRF-VRR primers. However, PCR tests still showed amplification bands for this species, suggesting that although its DNA can be amplified, it is likely to be penalized during the annealing step due to mismatches. This likely results in low sequence counts, often insufficient to pass OTU filtering thresholds. The same thought process can be applied to other species such as Testacella haliotidea or Deroceras agreste, where the in silico score is either below 120 but not equal to 0, or above 120 and sequence counts in mock communities are significantly below expectations.

These results highlight a critical loss of diversity information as demonstrated by the “controlled-feeding” controls. The process is inherently multivariate, with the amplification of one taxon’s DNA influenced by the co-occurrence of others within the same samples (Shelton et al. 2023). In addition, species-specific biases remain, making it difficult to recover accurate relative abundance information from amplicon data (Yang et al. 2021), although several studies have attempted to do so in diet studies of different organisms (Port et al. 2016; Thomas et al. 2016; Ando et al. 2020). This issue is exacerbated in complex matrices, such as dietary or environmental samples, where DNA degradation, inhibitors, and preferential amplification further complicate accurate quantification (De Barba et al. 2014). Given that the quantitative aspect of metabarcoding data, particularly for digestive contents, presents numerous interpretative challenges, it was deliberately chosen to focus on the qualitative aspect, specifically on the detection (presence/absence) of prey through OTU clustering. This approach is not merely a way to circumvent the limitations of quantitative analyses but is actually better suited to the study objectives. Indeed, the goal was to gain a comprehensive understanding of the predator’s overall diet across multiple individuals and sampling sites, rather than quantifying the relative consumption of each prey item at the individual level. By emphasizing prey detection rather than abundance estimates, this qualitative framework allows for a broader and more ecologically relevant perspective on dietary diversity.

Filtering threshold

Another critical element of metabarcoding analysis is the selection of an appropriate OTU filtering threshold. This choice directly affects species detection and quantification by influencing how sequences are grouped and background noise is removed. In metabarcoding, contamination and artefacts can arise at multiple stages, including field sampling (e.g., cross-contamination between samples), DNA extraction (e.g., introduction of external DNA from reagents), PCR amplification (e.g., carry-over contamination or sequencing errors), and sequencing (e.g., through tag jumping). Low-abundance OTUs often correspond to such contaminants or errors, making the filtering strategy essential (Taberlet et al. 2018; Zinger et al. 2019). An adequate threshold reduces spurious sequences while preserving ecological signal, improving dietary diversity estimates, and helping to limit artefacts from rare events such as secondary predation.

Secondary predation, i.e., the detection of DNA from prey previously consumed by the predator’s prey, represents an additional source of potential bias (Da Silva et al. 2019; Deagle et al. 2019). Its impact depends on predator ecology: DNA from secondary ingestion is typically more degraded and occurs in lower quantities than DNA from directly consumed prey, reducing its likelihood of amplification (Hardy et al. 2017; Mata et al. 2021). For generalist or omnivorous predators, however, secondary consumption can broaden the apparent diet and introduce biases if not carefully interpreted (Bowser et al. 2013; Da Silva et al. 2019). In the case of O. nungara, secondary predation is considered unlikely. Carnivorous gastropods are rare among the species in our study area (e.g. Oxychilus spp.; Díaz et al. 2025). Other predatory species, such as Testacella spp., mainly feed on earthworms, which would not be amplified by our primers (Stokes and Hirst 1958). Therefore, any secondary predation signal is expected to be minimal relative to the main trophic signal.

In metazoan metabarcoding, a common filtering threshold of 1% of total read abundance is effective in reducing contaminants and rare erroneous sequences. Optimal thresholds, however, vary with target taxa, barcoding regions, sequencing depth, environment, and study goals (Deagle et al. 2019; Zinger et al. 2019; Ando et al. 2020). In this study, the availability of lab-fed individuals provided a robust reference for calibration, allowing thresholds to be refined with known diets—a situation not always possible in wild populations or environmental samples. In such contexts, threshold determination must rely more heavily on controls such as mock communities, positive and negative controls, and replicates to approximate artefact levels and avoid over-filtering ecologically relevant taxa.

Extensive controls—including positive and negative controls, mock communities, and technical replicates—were employed here to estimate artefact rates and refine thresholds. PCR replicates are generally recommended when feasible to increase reliability and allow stochastic amplification events to be identified, although they were not implemented in the present study (Deiner et al. 2017; Zinger et al. 2019), in order to balance the trade-off between sequencing costs and the large number of samples processed, while relying on other validation procedures. Positive controls with known DNA mixtures further guide contamination assessment and threshold selection (De Barba et al. 2014; Ando et al. 2020). Mock community analyses suggested a 0.1% threshold, whereas controlled-feeding data indicated 0.4% is more appropriate. Because DNA from mock communities is less degraded than gut content DNA, lower thresholds are reasonable to account for fewer artefacts. The 0.4% threshold performed well for digestive content samples: it removed unassigned OTUs while retaining species consistent with field observations. For instance, all species detected in samples V6, V29, and V53 matched species recorded at their respective collection sites.

The choice of OTU filtering thresholds primarily depends on the taxon and study objectives, as no universal value exists. In this study, a single threshold was applied across all samples, calibrated and validated with mock and controlled-feeding experiments to ensure that rare but ecologically relevant taxa were retained. The purpose of this threshold is not to provide a universal value, but to illustrate a rigorous approach to OTU filtering. Filtering thresholds are highly sensitive to the diversity of diets or ecosystems: in dietary metabarcoding, specialists with narrow diets may lose rare prey even under mild thresholds, exaggerating inter-individual variation, while generalists may appear artificially homogeneous if low-abundance taxa are removed (Deagle et al. 2019; Littleford-Colquhoun et al. 2022). Similarly, in environmental DNA studies, species-rich ecosystems risk losing rare taxa under strict filtering, whereas species-poor ecosystems may lose a large fraction of observed diversity even at moderate thresholds (Ando et al. 2020). Although precise thresholds depend on the system, lower thresholds may be necessary in highly diverse diets or ecosystems to retain rare taxa, whereas higher thresholds can be more appropriate in species-poor systems where low-abundance sequences are more likely to be artefacts. Optimal thresholds depend on sequencing depth, taxon abundance distributions, artefact prevalence, and study objectives; mock communities and controls are essential to guide appropriate selection (Drake et al. 2022). Importantly, for environmental samples where reference individuals or lab-fed systems are absent, empirical validation using controls, replicates, and mock communities becomes even more critical to balance artefact removal and biodiversity retention. Overall, these considerations highlight the importance of carefully calibrating filtering thresholds to the diversity and complexity of the system under study, as well as the specific ecological questions being addressed.

First field results and perspectives

Diet estimation remains a major challenge in ecology. Here are presented the first results on the dietary diversity of the invasive species O. nungara, from specimens collected directly in their invaded habitats, three private gardens and a greenhouse. Prey diversity surveys were conducted at these sites, allowing for a comparison of gut content sequencing results with field observations. Species observed in the field were expected in gut contents, though not necessarily all, as O. nungara could exhibit selective feeding. If the predator is highly selective, or if certain prey are rare, the gut contents should contain fewer species than observed in the field. Poor sampling or incomplete coverage could lead to the reverse pattern. The results obtained for Saint-Medard-en-Jalles suggest that O. nungara might show selectivity, which could be influenced by factors such as prey accessibility (e.g., life cycles, defensive mechanisms) or an active feeding preference, as indicated by the observed disparity between field and gut contents species diversity. In contrast, at Concarneau, the positive correlation between species detected in the field and those found in the gut contents suggests a more opportunistic feeding behavior. At Cleguer, the detection of species in gut contents that are not recorded in field surveys indicates a potential gap in sampling effort. Indeed, field inventories focused on macrofauna, which can lead to an underrepresentation of micro-snails such as Lauria cylindracea and other small species. High-throughput sequencing of digestive DNA can therefore enhance diversity estimates, serving as a complementary tool to field surveys (Kartzinel et al. 2015; Jo et al. 2016; Amundsen and Sánchez‐Hernández 2019; De Sousa et al. 2019).

These findings highlight the importance of considering sampling effort and PCR biases when interpreting the predator’s feeding behavior. The contradictions also highlight the limited number of specimens and sites studied so far, preventing definitive ecological and trophic conclusions. However, O. nungara demonstrates a remarkable dietary diversity at this stage. Discarding the unidentified OTU and the taxon assigned to Deroceras sp., 16 gastropod species and lineages were detected in the digestive contents of only sixteen individuals, while a total of 28 species and lineages were recorded across sampled sites. Among these, there are widespread and common species across the sites, such as Cornu aspersum, Deroceras invadens, and Ambigolimax parvipenis, as well as rarer and more sporadic species like Arion hortensis L2, Arion intermedius, and Cochlicopa lubrica. A broader-scale analysis is now a thrilling perspective to determine whether O. nungara primarily adopts an opportunistic or selective feeding strategy, and what ecological impact it can have on its prey and their ecosystems.

Beyond this system, studies using environmental DNA (eDNA) to investigate terrestrial gastropods remain scarce, in contrast to aquatic mollusks where eDNA is increasingly applied to detect diverse species (Mulero et al. 2021; Zieritz et al. 2022; Vanderpool et al. 2024). This gap is particularly relevant given the growing presence of exotic predatory flatworms in regions with high gastropod endemism, such as Platydemus manokwari in Guadeloupe, Martinique, and Saint-Martin (Justine et al. 2021), and Bipalium vagum in Jamaica (Brown et al. 2022), with further expansions projected globally (Fourcade et al. 2022). Terrestrial gastropods are also consumed by a wide range of other taxa, including malacophagous beetles (Pseudoophonus rufipes, Pterostichus melanarius; Němec and Horsák 2024), snail-eating snakes (You et al. 2015; Poyarkov et al. 2022) and occasional snakes feeding on snails (Zamenis situla; Christopoulos and Kotselis 2023), rodents (Allen 2004), and omnivorous birds (Laporta-Ferreira and Salomão 2004; Morii et al. 2021). Studying these interactions with a species-level molecular assay can provide additional resolution and complement existing dietary studies.

The metabarcoding assay developed here, based on newly designed primers targeting gastropod DNA, provides a practical tool for both gut content analyses and environmental DNA studies. It can help investigate terrestrial gastropod consumption by different predator taxa, as well as explore gastropod presence in understudied habitats. While the primers were designed in the context of O. nungara predation, they could also be applied to other ecological situations involving terrestrial gastropods, offering additional opportunities for ecological research and conservation studies.

Conclusion

In this study, a workflow was designed and validated for the detection of terrestrial gastropods DNA and application in metabarcoding studies. A combination of in silico predictions, laboratory testing and field validation showed that it could be used to detect both common and rare gastropod species in anthropized areas. A key contribution of this work lies in the calibration of OTU filtering thresholds using mock communities and dietary controls, allowing to fine-tune detection sensitivity and reduce amplification artifacts. This methodological step significantly improved the reliability of dietary data recovered from digestive contents. Using this validated approach, we performed the first targeted dietary analysis of gastropod prey in the invasive flatworm O. nungara, revealing a wide range of consumed species. Interestingly, some prey species were detected despite not being inventoried during field surveys, highlighting the potential of digestive metabarcoding to improve biodiversity assessments, particularly in complex or under-sampled ecosystems. This framework may also be useful for investigating other predator-gastropod systems and for improving biodiversity assessments in terrestrial environments. Future studies will expand the dietary analysis of O. nungara to include a larger sample size, as well as exploring other predator-gastropod systems. This will enable us to gain a better understanding of the ecological interactions between introduced predators and their prey.

Acknowledgements

The authors would like to thank Agnès Gigon, Thomas Lerch and Yoan Fourcade for their help in collecting the samples, and Clémentine Loth for sequencing. Sampling missions and experiments were funded by the French National Research Agency, as part of the PLATWORM project n°ANR-21-CE02-0016.

Additional information

Conflict of interest

The authors have declared that no competing interests exist.

Ethical statement

No ethical statement was reported.

Use of AI

No use of AI was reported.

Funding

No funding was reported.

Author contributions

MV and VR designed the study. Collection of the samples was assured by MV, NM, SN, LD and VR. Laboratory work was performed by MV, NM and VR. MV conducted the analyses. MV wrote the first draft of the manuscript. All authors contributed to the final version and approved the submitted manuscript.

Author ORCIDs

Mathis Ventura https://orcid.org/0009-0001-6831-9654

Nicolas Mazuras https://orcid.org/0009-0004-0888-6659

Shanèze Noël https://orcid.org/0009-0001-5380-1085

Lise Dupont https://orcid.org/0000-0003-4824-2685

Virginie Roy https://orcid.org/0000-0003-1908-0074

Data availability

All of the data that support the findings of this study are available in the main text or Supplementary Information.

References

  • Aizpurua O, Budinski I, Georgiakakis P, Gopalakrishnan S, Ibañez C, Mata V, Rebelo H, Russo D, Szodoray-Parádi F, Zhelyazkova V, Zrncic V, Gilbert MTP, Alberdi A (2018) Agriculture shapes the trophic niche of a bat preying on multiple pest arthropods across Europe: Evidence from DNA metabarcoding. Molecular Ecology 27: 559–593. https://doi.org/10.1111/mec.14474
  • Alberdi A, Aizpurua O, Gilbert MTP, Bohmann K (2018) Scrutinizing key steps for reliable metabarcoding of environmental samples. Methods in Ecology and Evolution 9: 134–147. https://doi.org/10.1111/2041-210X.12849
  • Alberdi A, Aizpurua O, Bohmann K, Gopalakrishnan S, Lynggaard C, Nielsen M, Gilbert MTP (2019) Promises and pitfalls of using high‐throughput sequencing for diet analysis. Molecular Ecology Resources 19: 327–348. https://doi.org/10.1111/1755-0998.12960
  • Amundsen P, Sánchez‐Hernández J (2019) Feeding studies take guts – critical review and recommendations of methods for stomach contents analysis in fish. Journal of Fish Biology 95: 1364–1373. https://doi.org/10.1111/jfb.14151
  • Ando H, Mukai H, Komura T, Dewi T, Ando M, Isagi Y (2020) Methodological trends and perspectives of animal dietary studies by noninvasive fecal DNA metabarcoding. Environmental DNA 2: 391–406. https://doi.org/10.1002/edn3.117
  • Andruszkiewicz EA, Starks HA, Chavez FP, Sassoubre LM, Block BA, Boehm AB (2017) Biomonitoring of marine vertebrates in Monterey Bay using eDNA metabarcoding. PLOS ONE 12: e0176343. https://doi.org/10.1371/journal.pone.0176343
  • Baetscher DS, Locatelli NS, Won E, Fitzgerald T, McIntyre PB, Therkildsen NO (2023) Optimizing a metabarcoding marker portfolio for species detection from complex mixtures of globally diverse fishes. Environmental DNA 5: 1589–1607. https://doi.org/10.1002/edn3.479
  • Barraux A, Noël S, Roy V, Dupont L (2024) Challenges of molecular barcode-based identification of earthworm specimens for biodiversity assessment. Frontiers in Ecology and Evolution 12: 1358984. https://doi.org/10.3389/fevo.2024.1358984
  • Beentjes KK, Speksnijder AGCL, Schilthuizen M, Hoogeveen M, Van Der Hoorn BB (2019) The effects of spatial and temporal replicate sampling on eDNA metabarcoding. PeerJ 7: e7335. https://doi.org/10.7717/peerj.7335
  • Bohmann K, Evans A, Gilbert MTP, Carvalho GR, Creer S, Knapp M, Yu DW, De Bruyn M (2014) Environmental DNA for wildlife biology and biodiversity monitoring. Trends in Ecology & Evolution 29: 358–367. https://doi.org/10.1016/j.tree.2014.04.003
  • Boll PK, Leal-Zanchet AM (2016) Preference for different prey allows the coexistence of several land planarians in areas of the Atlantic Forest. Zoology: Analysis of Complex Systems, ZACS 119: 162–168. https://doi.org/10.1016/j.zool.2016.04.002
  • Bossuyt F, Milinkovitch MC (2000) Convergent adaptive radiations in Madagascan and Asian ranid frogs reveal covariation between larval and adult traits. Proceedings of the National Academy of Sciences of the United States of America 97: 6585–6590. https://doi.org/10.1073/pnas.97.12.6585
  • Braukmann TWA, Ivanova NV, Prosser SWJ, Elbrecht V, Steinke D, Ratnasingham S, De Waard JR, Sones JE, Zakharov EV, Hebert PDN (2019) Metabarcoding a diverse arthropod mock community. Molecular Ecology Resources 19: 711–727. https://doi.org/10.1111/1755-0998.13008
  • Bray JR, Curtis JT (1957) An Ordination of the Upland Forest Communities of Southern Wisconsin. Ecological Monographs 27: 325–349. https://doi.org/10.2307/1942268
  • Brown M-D, Lindo J, Robinson R (2022) First record of exotic terrestrial flatworms (Tricladida: Geoplanidae) Bipalium vagum Jones & Sterrer, 2005 and Dolichoplana striata Moseley, 1877 with confirmation of Platydemus manokwari de Beauchamp, 1963 in Jamaica. BioInvasions Records: 11. https://doi.org/10.3391/bir.2022.11.2.10
  • Bruce K, Blackman R, Bourlat SJ, Hellström AM, Bakker J, Bista I, Bohmann K, Bouchez A, Brys R, Clark K, Elbrecht V, Fazi S, Fonseca V, Hänfling B, Leese F, Mächler E, Mahon AR, Meissner K, Panksep K, Pawlowski J, Schmidt Yáñez P, Seymour M, Thalinger B, Valentini A, Woodcock P, Traugott M, Vasselon V, Deiner K (2021) A practical guide to DNA-based methods for biodiversity assessment. Pensoft Publishers, 90 pp. https://doi.org/10.3897/ab.e68634
  • Casey JM, Meyer CP, Morat F, Brandl SJ, Planes S, Parravicini V (2019) Reconstructing hyperdiverse food webs: Gut content metabarcoding as a tool to disentangle trophic interactions on coral reefs. Methods in Ecology and Evolution 10: 1157–1170. https://doi.org/10.1111/2041-210X.13206
  • Chevallier H (1972) Arionidae (Mollusca, Pulmonata) des Alpes et du Jura français. Haliotis 2: 7–23.
  • Chevallier H (1974) Les grands Arion de France (Mollusca, Pulmonata): taxonomie, biogeographie, ecologie, polymorphisme, croissance et cycle biologique. Verlag nicht ermittelbar.
  • Christopoulos A, Kotselis C (2023) Tasting or routine meal? First record of slugs (Gastropoda) consumption by Zamenis situla Linnaeus, 1758 (Squamata: Colubridae). Herpetology Notes 16: 237–240.
  • Clarke LJ, Soubrier J, Weyrich LS, Cooper A (2014) Environmental metabarcodes for insects: In silico PCR reveals potential for taxonomic bias. Molecular Ecology Resources 14: 1160–1170. https://doi.org/10.1111/1755-0998.12265
  • Corse E, Meglécz E, Archambaud G, Ardisson M, Martin J, Tougard C, Chappaz R, Dubut V (2017) A from‐benchtop‐to‐desktop workflow for validating HTS data and for taxonomic identification in diet metabarcoding studies. Molecular Ecology Resources 17. https://doi.org/10.1111/1755-0998.12703
  • Da Silva LP, Mata VA, Lopes PB, Pereira P, Jarman SN, Lopes RJ, Beja P (2019) Advancing the integration of multi‐marker metabarcoding data in dietary analysis of trophic generalists. Molecular Ecology Resources 19: 1420–1432. https://doi.org/10.1111/1755-0998.13060
  • De Barba M, Miquel C, Boyer F, Mercier C, Rioux D, Coissac E, Taberlet P (2014) DNA metabarcoding multiplexing and validation of data accuracy for diet assessment: Application to omnivorous diet. Molecular Ecology Resources 14: 306–323. https://doi.org/10.1111/1755-0998.12188
  • De Sousa LL, Silva SM, Xavier R (2019) DNA metabarcoding in diet studies: Unveiling ecological aspects in aquatic and terrestrial ecosystems. Environmental DNA 1: 199–214. https://doi.org/10.1002/edn3.27
  • Deagle BE, Chiaradia A, McInnes J, Jarman SN (2010) Pyrosequencing faecal DNA to determine diet of little penguins: Is what goes in what comes out? Conservation Genetics 11: 2039–2048. https://doi.org/10.1007/s10592-010-0096-6
  • Deagle BE, Thomas AC, Shaffer AK, Trites AW, Jarman SN (2013) Quantifying sequence proportions in a DNA‐based diet study using Ion Torrent amplicon sequencing: Which counts count? Molecular Ecology Resources 13: 620–633. https://doi.org/10.1111/1755-0998.12103
  • Deagle BE, Jarman SN, Coissac E, Pompanon F, Taberlet P (2014) DNA metabarcoding and the cytochrome c oxidase subunit I marker: Not a perfect match. Biology Letters 10: 20140562. https://doi.org/10.1098/rsbl.2014.0562
  • Deagle BE, Thomas AC, McInnes JC, Clarke LJ, Vesterinen EJ, Clare EL, Kartzinel TR, Eveson JP (2019) Counting with DNA in metabarcoding studies: How should we convert sequence reads to dietary data? Molecular Ecology 28: 391–406. https://doi.org/10.1111/mec.14734
  • Deiner K, Bik HM, Mächler E, Seymour M, Lacoursière‐Roussel A, Altermatt F, Creer S, Bista I, Lodge DM, De Vere N, Pfrender ME, Bernatchez L (2017) Environmental DNA metabarcoding: Transforming how we survey animal and plant communities. Molecular Ecology 26: 5872–5895. https://doi.org/10.1111/mec.14350
  • Díaz AC, Martin SM, Cao LM (2025) [September 3] New reports for Oxychilus draparnaudi (H. Beck, 1837) (Gastropoda: Oxychilidae) in the Province of Buenos Aires, Argentina. The Nautilus 139(1). http://sedici.unlp.edu.ar/handle/10915/180499
  • Drake LE, Cuff JP, Young RE, Marchbank A, Chadwick EA, Symondson WOC (2022) An assessment of minimum sequence copy thresholds for identifying and reducing the prevalence of artefacts in dietary metabarcoding data. Methods in Ecology and a 13: 694–710. https://doi.org/10.1111/2041-210X.13780
  • Elbrecht V, Leese F (2015) Can DNA-Based ecosystem assessments quantify species abundance? Testing primer bias and biomass—sequence relationships with an innovative metabarcoding protocol. PLOS ONE 10: e0130324. https://doi.org/10.1371/journal.pone.0130324
  • Elbrecht V, Leese F (2017) PrimerMiner: an r package for development and in silico validation of DNA metabarcoding primers. Methods in Ecology and Evolution 8: 622–626. https://doi.org/10.1111/2041-210X.12687
  • Elbrecht V, Taberlet P, Dejean T, Valentini A, Usseglio-Polatera P, Beisel J-N, Coissac E, Boyer F, Leese F (2016) Testing the potential of a ribosomal 16S marker for DNA metabarcoding of insects. PeerJ 4: e1966. https://doi.org/10.7717/peerj.1966
  • Ficetola GF, Coissac E, Zundel S, Riaz T, Shehzad W, Bessière J, Taberlet P, Pompanon F (2010) An In silico approach for the evaluation of DNA barcodes. BMC Genomics 11: 434. https://doi.org/10.1186/1471-2164-11-434
  • Ficetola GF, Pansu J, Bonin A, Coissac E, Giguet‐Covex C, De Barba M, Gielly L, Lopes CM, Boyer F, Pompanon F, Rayé G, Taberlet P (2015) Replication levels, false presences and the estimation of the presence/absence from eDNA metabarcoding data. Molecular Ecology Resources 15: 543–556. https://doi.org/10.1111/1755-0998.12338
  • Folmer O, Black M, Hoeh W, Lutz R, Vrijenhoek R (1994) DNA primers for amplification of mitochondrial cytochrome c oxidase subunit I from diverse metazoan invertebrates. Molecular Marine Biology and Biotechnology 3: 294–299.
  • Fourcade Y, Winsor L, Justine J-L (2022) Hammerhead worms everywhere? Modelling the invasion of bipaliin flatworms in a changing climate. Diversity & Distributions 28: 844–858. https://doi.org/10.1111/ddi.13489
  • Gargominy O (2020) BiodiversiClés: Escargots de France métropolitaine (MALACO-FR). PatriNat (OFB-CNRS-MNHN).
  • Garwood TJ, Moore SA, Fountain-Jones NM, Larsen PA, Wolf TM (2023) Species in the feces: DNA Metabarcoding to detect potential gastropod hosts of Parelaphostrongylus tenuis consumed by moose (Alces alces). Journal of Wildlife Diseases 59. https://doi.org/10.7589/JWD-D-22-00120
  • Gold Z, Shelton AO, Casendino HR, Duprey J, Gallego R, Van Cise A, Fisher M, Jensen AJ, D’Agnese E, Andruszkiewicz Allan E, Ramón-Laca A, Garber-Yonts M, Labare M, Parsons KM, Kelly RP (2023) Signal and noise in metabarcoding data. PLOS ONE 18: e0285674. https://doi.org/10.1371/journal.pone.0285674
  • Gouy M, Guindon S, Gascuel O (2010) SeaView Version 4: A multiplatform graphical user interface for sequence alignment and phylogenetic tree building. Molecular Biology and Evolution 27: 221–224. https://doi.org/10.1093/molbev/msp259
  • Greve C, Hutterer R, Groh K, Haase M, Misof B (2010) Evolutionary diversification of the genus Theba (Gastropoda: Helicidae) in space and time: A land snail conquering islands and continents. Molecular Phylogenetics and Evolution 57: 572–584. https://doi.org/10.1016/j.ympev.2010.08.021
  • Hardy N, Berry T, Kelaher BP, Goldsworthy SD, Bunce M, Coleman MA, Gillanders BM, Connell SD, Blewitt M, Figueira W (2017) Assessing the trophic ecology of top predators across a recolonisation frontier using DNA metabarcoding of diets. Marine Ecology Progress Series 573: 237–254. https://doi.org/10.3354/meps12165
  • Jo H, Ventura M, Vidal N, Gim J, Buchaca T, Barmuta LA, Jeppesen E, Joo G (2016) Discovering hidden biodiversity: The use of complementary monitoring of fish diet based on DNA barcoding in freshwater ecosystems. Ecology and Evolution 6: 219–232. https://doi.org/10.1002/ece3.1825
  • Justine J-L, Winsor L, Gey D, Gros P, Thévenot J (2020) Obama chez moi! The invasion of metropolitan France by the land planarian Obama nungara (Platyhelminthes, Geoplanidae). PeerJ 8: e8385. https://doi.org/10.7717/peerj.8385
  • Justine J-L, Gey D, Vasseur J, Thévenot J, Coulis M, Winsor L (2021) Presence of the invasive land flatworm Platydemus manokwari (Platyhelminthes, Geoplanidae) in Guadeloupe, Martinique and Saint Martin (French West Indies). Zootaxa 4951: 381–390. https://doi.org/10.11646/zootaxa.4951.2.11
  • Kartzinel TR, Chen PA, Coverdale TC, Erickson DL, Kress WJ, Kuzmina ML, Rubenstein DI, Wang W, Pringle RM (2015) DNA metabarcoding illuminates dietary niche partitioning by African large herbivores. Proceedings of the National Academy of Sciences of the United States of America 112: 8019–8024. https://doi.org/10.1073/pnas.1503283112
  • Kruckenhauser L, Duda M, Bartel D, Sattmann H, Harl J, Kirchner S, Haring E (2014) Paraphyly and budding speciation in the hairy snail (Pulmonata, Hygromiidae). Zoologica Scripta 43: 273–288. https://doi.org/10.1111/zsc.12046
  • Lamb PD, Hunter E, Pinnegar JK, Creer S, Davies RG, Taylor MI (2019) How quantitative is metabarcoding: A meta‐analytical approach. Molecular Ecology 28: 420–430. https://doi.org/10.1111/mec.14920
  • Littleford-Colquhoun BL, Freeman PT, Sackett VI, Tulloss CV, McGarvey LM, Geremia C, Kartzinel TR (2022) The precautionary principle and dietary DNA metabarcoding: Commonly used abundance thresholds change ecological interpretation. Molecular Ecology 31: 1615–1626. https://doi.org/10.1111/mec.16352
  • Lopes CM, De Barba M, Boyer F, Mercier C, Da Silva Filho PJS, Heidtmann LM, Galiano D, Kubiak BB, Langone P, Garcias FM, Gielly L, Coissac E, De Freitas TRO, Taberlet P (2015) DNA metabarcoding diet analysis for species with parapatric vs sympatric distribution: A case study on subterranean rodents. Heredity 114: 525–536. https://doi.org/10.1038/hdy.2014.109
  • Martins B, Silva-Rocha I, Mata VA, Gonçalves Y, Rocha R, Rato C (2022) Trophic interactions of an invasive gecko in an endemic-rich oceanic island: Insights using DNA metabarcoding. Frontiers in Ecology and Evolution 10: 1044230. https://doi.org/10.3389/fevo.2022.1044230
  • Mata VA, Rebelo H, Amorim F, McCracken GF, Jarman S, Beja P (2019) How much is enough? Effects of technical and biological replication on metabarcoding dietary analysis. Molecular Ecology 28: 165–175. https://doi.org/10.1111/mec.14779
  • Mata VA, da Silva LP, Veríssimo J, Horta P, Raposeira H, McCracken GF, Rebelo H, Beja P (2021) Combining DNA metabarcoding and ecological networks to inform conservation biocontrol by small vertebrate predators. Ecological Applications: A Publication of the Ecological Society of America 31: e02457. https://doi.org/10.1002/eap.2457
  • Mc Donnell RJ, Vendetti JE, Paine TD, Gormally MJ (2009) Slugs: A Guide to the Introduced and Native Fauna of California. University of California, Agriculture and Natural Resources, 37 pp. https://doi.org/10.3733/UZUM1090
  • Morii Y, Kitazawa M, Squires TE, Watanabe M, Watanabe Y, Saito T, Yamazaki D, Uchida A, Machida Y (2021) A complete dietary review of Japanese birds with special focus on molluscs. Scientific Data 8: 19. https://doi.org/10.1038/s41597-021-00800-6
  • Mulero S, Toulza E, Loisier A, Zimmerman M, Allienne J-F, Foata J, Quilichini Y, Pointier J-P, Rey O, Boissier J (2021) Malacological survey in a bottle of water: A comparative study between manual sampling and environmental DNA metabarcoding approaches. Global Ecology and Conservation 25: e01428. https://doi.org/10.1016/j.gecco.2020.e01428
  • Němec T, Horsák M (2024) Generalist carabid beetles are more malacophagous than previously recognized and cause diversified types of shell damage. Journal of Zoology (London, England) 324: 11–20. https://doi.org/10.1111/jzo.13179
  • Nichols RV, Vollmers C, Newsom LA, Wang Y, Heintzman PD, Leighton M, Green RE, Shapiro B (2018) Minimizing polymerase biases in metabarcoding. Molecular Ecology Resources 18: 927–939. https://doi.org/10.1111/1755-0998.12895
  • Noël S, Fourcade Y, Roy V, Gigon A, Dupont L (2025a) Experimental assessment of the predation pressure by the exotic flatworm Obama nungara in its introduced range. Current Zoology: zoaf025. https://doi.org/10.1093/cz/zoaf025
  • Noël S, Fourcade Y, Roy V, Bonnet G, Dupont L (2025b) Population dynamics of the exotic flatworm Obama nungara in an invaded garden. Ecology and Evolution 15: e70827. https://doi.org/10.1002/ece3.70827
  • Palumbi SR (1996) Nucleic Acids II: The polymerase chain reaction. In: Molecular Systematics, 2nd edition. Sinauer, Sunderland, 205–247.
  • Pansu J, De Danieli S, Puissant J, Gonzalez J-M, Gielly L, Cordonnier T, Zinger L, Brun J-J, Choler P, Taberlet P, Cécillon L (2015) Landscape-scale distribution patterns of earthworms inferred from soil DNA. Soil Biology & Biochemistry 83: 100–105. https://doi.org/10.1016/j.soilbio.2015.01.004
  • Pinceel J, Jordaens K, Van Houtte N, De Winter AJ, Backeljau T (2004) Molecular and morphological data reveal cryptic taxonomic diversity in the terrestrial slug complex Arion subfuscus/fuscus (Mollusca, Pulmonata, Arionidae) in continental north-west Europe. Biological Journal of the Linnean Society. Linnean Society of London 83: 23–38. https://doi.org/10.1111/j.1095-8312.2004.00368.x
  • Pinceel J, Jordaens K, Backeljau T (2005) Extreme mtDNA divergences in a terrestrial slug (Gastropoda, Pulmonata, Arionidae): Accelerated evolution, allopatric divergence and secondary contact. Journal of Evolutionary Biology 18: 1264–1280. https://doi.org/10.1111/j.1420-9101.2005.00932.x
  • Piñol J, Mir G, Gomez‐Polo P, Agustí N (2015) Universal and blocking primer mismatches limit the use of high‐throughput DNA sequencing for the quantitative metabarcoding of arthropods. Molecular Ecology Resources 15: 819–830. https://doi.org/10.1111/1755-0998.12355
  • Port JA, O’Donnell JL, Romero‐Maraccini OC, Leary PR, Litvin SY, Nickols KJ, Yamahara KM, Kelly RP (2016) Assessing vertebrate biodiversity in a kelp forest ecosystem using environmental DNA. Molecular Ecology 25: 527–541. https://doi.org/10.1111/mec.13481
  • Poyarkov NA, Nguyen TV, Pawangkhanant P, Yushchenko PV, Brakels P, Nguyen LH, Nguyen HN, Suwannapoom C, Orlov N, Vogel G (2022) An integrative taxonomic revision of slug-eating snakes (Squamata: Pareidae: Pareineae) reveals unprecedented diversity in Indochina. PeerJ 10: e12713. https://doi.org/10.7717/peerj.12713
  • Reise H, Schwarzer A-K, Hutchinson JMC, Schlitt B (2020) Genital morphology differentiates three subspecies of the terrestrial slug Arion ater (Linnæus, 1758) s.l. and reveals a continuum of intermediates with the invasive A. vulgaris Moquin-Tandon, 1855. Folia Malacologica 28: 1–34. https://doi.org/10.12657/folmal.028.001
  • Robeson MS, Khanipov K, Golovko G, Wisely SM, White MD, Bodenchuck M, Smyser TJ, Fofanov Y, Fierer N, Piaggio AJ (2018) Assessing the utility of metabarcoding for diet analyses of the omnivorous wild pig (Sus scrofa). Ecology and Evolution 8: 185–196. https://doi.org/10.1002/ece3.3638
  • Rowson B, Anderson R, Turner JA, Symondson WOC (2014) The slugs of Britain and Ireland: Undetected and undescribed species increase a well-studied, economically important fauna by more than 20%. PLOS ONE 9: e91907. https://doi.org/10.1371/journal.pone.0091907
  • Roy V, Ventura M, Fourcade Y, Justine J-L, Gigon A, Dupont L (2022) Gut content metabarcoding and citizen science reveal the earthworm prey of the exotic terrestrial flatworm, Obama nungara. European Journal of Soil Biology 113: 103449. https://doi.org/10.1016/j.ejsobi.2022.103449
  • Sambrook J, Russell D (2001) Preparation and analysis of eukaryotic genomic DNA. Molecular Cloning–A Laboratory Manual. Cold Spring Harbor Laboratory Press, New York, USA, 6.1-6.30.
  • Shehzad W, Riaz T, Nawaz MA, Miquel C, Poillot C, Shah SA, Pompanon F, Coissac E, Taberlet P (2012) Carnivore diet analysis based on next‐generation sequencing: Application to the leopard cat (Prionailurus bengalensis) in Pakistan. Molecular Ecology 21: 1951–1965. https://doi.org/10.1111/j.1365-294X.2011.05424.x
  • Shelton AO, Gold ZJ, Jensen AJ, D′Agnese E, Andruszkiewicz AE, Van Cise A, Gallego R, Ramón‐Laca A, Garber‐Yonts M, Parsons K, Kelly RP (2023) Toward quantitative metabarcoding. Ecology 104: e3906. https://doi.org/10.1002/ecy.3906
  • Thomas AC, Deagle BE, Eveson JP, Harsch CH, Trites AW (2016) Quantitative DNA metabarcoding: Improved estimates of species proportional biomass using correction factors derived from control material. Molecular Ecology Resources 16: 714–726. https://doi.org/10.1111/1755-0998.12490
  • Vanderpool DD, Wilcox TM, Young MK, Pilgrim KL, Schwartz MK (2024) Simultaneous species detection and discovery with environmental DNA metabarcoding: A freshwater mollusk case study. Ecology and Evolution 14: e11020. https://doi.org/10.1002/ece3.11020
  • Ventura M, Cucherat X, Hutchinson JMC, Noel S, Mazuras N, Dupont L, Roy V (2025) First records of the true Sicilian slug Deroceras panormitanum (Eupulmonata: Agriolimacidae) in France and mitochondrial sequences of three additional species of the genus Deroceras. Folia Malacologica 33. https://doi.org/10.12657/folmal.033.006
  • Yang C, Bohmann K, Wang X, Cai W, Wales N, Ding Z, Gopalakrishnan S, Yu DW (2021) Biodiversity Soup II: A bulk‐sample metabarcoding pipeline emphasizing error reduction. Methods in Ecology and Evolution 12: 1252–1264. https://doi.org/10.1111/2041-210X.13602
  • You C-W, Poyarkov Jr NA, Lin S-M (2015) Diversity of the snail-eating snakes Pareas (Serpentes, Pareatidae) from Taiwan. Zoologica Scripta 44: 349–361. https://doi.org/10.1111/zsc.12111
  • Zepeda-Mendoza ML, Bohmann K, Carmona Baez A, Gilbert MTP (2016) DAMe: A toolkit for the initial processing of datasets with PCR replicates of double-tagged amplicons for DNA metabarcoding analyses. BMC Research Notes 9: 255. https://doi.org/10.1186/s13104-016-2064-9
  • Zhou C, Zhu C, Cheng Y, Lei Y, Nan Y, Ouyang S, Wu X (2024) Exploring freshwater snail diversity and community structure in china’s largest lake using eDNA technology. Ecological Indicators 158: 111577. https://doi.org/10.1016/j.ecolind.2024.111577
  • Zieritz A, Lee PS, Eng WWH, Lim SY, Sing KW, Chan WN, Loo JS, Mahadzir FN, Ng TH, Yeo DCJ, Gan LX, Gan JY, Gibbins C, Zoqratt MZHM, Wilson J-J (2022) DNA metabarcoding unravels unknown diversity and distribution patterns of tropical freshwater invertebrates. Freshwater Biology 67: 1411–1427. https://doi.org/10.1111/fwb.13926
  • Zinger L, Bonin A, Alsos IG, Bálint M, Bik H, Boyer F, Chariton AA, Creer S, Coissac E, Deagle BE, De Barba M, Dickie IA, Dumbrell AJ, Ficetola GF, Fierer N, Fumagalli L, Gilbert MTP, Jarman S, Jumpponen A, Kauserud H, Orlando L, Pansu J, Pawlowski J, Tedersoo L, Thomsen PF, Willerslev E, Taberlet P (2019) DNA metabarcoding—Need for robust experimental designs to draw sound ecological conclusions. Molecular Ecology 28: 1857–1862. https://doi.org/10.1111/mec.15060

Supplementary materials

Supplementary material 1 

DNA composition

Mathis Ventura, Nicolas Mazuras, Shanèze Noël, Lise Dupont, Virginie Roy

Data type: xlsx

Explanation note: Mock communities’ composition per species per volume. All DNA were diluted to the same concentration, volumes went from 1 to 10 µL in NREP mocks, and 4 µL only in POS mocks.

This dataset is made available under the Open Database License (http://opendatacommons.org/licenses/odbl/1.0/). The Open Database License (ODbL) is a license agreement intended to allow users to freely share, modify, and use this Dataset while maintaining this same freedom for others, provided that the original source and author(s) are credited.
Download file (7.93 kb)
Supplementary material 2 

Reference database for 16S gene

Mathis Ventura, Nicolas Mazuras, Shanèze Noël, Lise Dupont, Virginie Roy

Data type: xlsx

Explanation note: List of accession number, taxonomy, sequence fragment and reference for all terrestrial gastropods samples used to construct the reference database for 16S gene.

This dataset is made available under the Open Database License (http://opendatacommons.org/licenses/odbl/1.0/). The Open Database License (ODbL) is a license agreement intended to allow users to freely share, modify, and use this Dataset while maintaining this same freedom for others, provided that the original source and author(s) are credited.
Download file (41.38 kb)
Supplementary material 3 

Sequence alignments

Mathis Ventura, Nicolas Mazuras, Shanèze Noël, Lise Dupont, Virginie Roy

Data type: xlsx

Explanation note: Sequence alignments detailed for each primer pair to discriminate O. nungara and earthworm consensus sequence from terrestrial gastropods consensus sequence. Coloured bases indicate difference with the designed primer pair.

This dataset is made available under the Open Database License (http://opendatacommons.org/licenses/odbl/1.0/). The Open Database License (ODbL) is a license agreement intended to allow users to freely share, modify, and use this Dataset while maintaining this same freedom for others, provided that the original source and author(s) are credited.
Download file (19.34 kb)
Supplementary material 4 

Reference database for all available reference sequences from NCBI projects

Mathis Ventura, Nicolas Mazuras, Shanèze Noël, Lise Dupont, Virginie Roy

Data type: ods

Explanation note: List of accession, complete taxonomy and in silico PCR results of all available reference sequences from NCBI projects.

This dataset is made available under the Open Database License (http://opendatacommons.org/licenses/odbl/1.0/). The Open Database License (ODbL) is a license agreement intended to allow users to freely share, modify, and use this Dataset while maintaining this same freedom for others, provided that the original source and author(s) are credited.
Download file (11.38 MB)
Supplementary material 5 

Species lists

Mathis Ventura, Nicolas Mazuras, Shanèze Noël, Lise Dupont, Virginie Roy

Data type: xlsx

Explanation note: Species lists of terrestrial gastropod species described in France and Europe.

This dataset is made available under the Open Database License (http://opendatacommons.org/licenses/odbl/1.0/). The Open Database License (ODbL) is a license agreement intended to allow users to freely share, modify, and use this Dataset while maintaining this same freedom for others, provided that the original source and author(s) are credited.
Download file (160.90 kb)
Supplementary material 6 

Bioinformatic

Mathis Ventura, Nicolas Mazuras, Shanèze Noël, Lise Dupont, Virginie Roy

Data type: xlsx

Explanation note: Results of in silico PCR using the VRF-VRR primer pair for available 16S sequences of terrestrial gastropod species described in France and Europe.

This dataset is made available under the Open Database License (http://opendatacommons.org/licenses/odbl/1.0/). The Open Database License (ODbL) is a license agreement intended to allow users to freely share, modify, and use this Dataset while maintaining this same freedom for others, provided that the original source and author(s) are credited.
Download file (130.32 kb)
Supplementary material 7 

Phylogenetic tree

Mathis Ventura, Nicolas Mazuras, Shanèze Noël, Lise Dupont, Virginie Roy

Data type: pdf

Explanation note: Neighbor Joining tree reconstructed using 96 terrestrial gastropod 16S sequences obtained by amplifying a short metabarcode of 162 bp using VRF-VRR primer pair, and 244 sequences from the literature. Bootstrap values (1000 repetitions) > 70% are indicated at nodes. The 38 species and lineages indicated in bold were sampled in the field and had their DNA extracted for the various analyses conducted in this study.

This dataset is made available under the Open Database License (http://opendatacommons.org/licenses/odbl/1.0/). The Open Database License (ODbL) is a license agreement intended to allow users to freely share, modify, and use this Dataset while maintaining this same freedom for others, provided that the original source and author(s) are credited.
Download file (255.73 kb)
Supplementary material 8 

High-throughput sequencing

Mathis Ventura, Nicolas Mazuras, Shanèze Noël, Lise Dupont, Virginie Roy

Data type: xlsx

Explanation note: Per metabarcoding control and field sample, details of the raw OTUs data obtained from bioinformatic analyses. Details of seed sequence, length (bp), affiliated species, and results of BLAST criteria are displayed. Filtering outcomes for field replicates and field samples are highlighted.

This dataset is made available under the Open Database License (http://opendatacommons.org/licenses/odbl/1.0/). The Open Database License (ODbL) is a license agreement intended to allow users to freely share, modify, and use this Dataset while maintaining this same freedom for others, provided that the original source and author(s) are credited.
Download file (49.32 kb)
Supplementary material 9 

Graph of results of gut contents sequencing

Mathis Ventura, Nicolas Mazuras, Shanèze Noël, Lise Dupont, Virginie Roy

Data type: pdf

Explanation note: Results of gut contents sequencing by relative read abundance (RRA %) for the three samples replicated. The figure shows the results after applying the stochasticity threshold and merging the replicates to produce consensus single samples.

This dataset is made available under the Open Database License (http://opendatacommons.org/licenses/odbl/1.0/). The Open Database License (ODbL) is a license agreement intended to allow users to freely share, modify, and use this Dataset while maintaining this same freedom for others, provided that the original source and author(s) are credited.
Download file (71.52 kb)
login to comment