Figures
Abstract
Nematodes are keystone actors of soil, freshwater and marine ecosystems, but the complexity of morphological identification has limited broad-scale monitoring of nematode biodiversity. DNA metabarcoding is increasingly used to assess nematode diversity but requires universal primers with high taxonomic coverage and high taxonomic resolution. Several primers have been proposed for the metabarcoding of nematode diversity, many of which target the 18S rRNA gene. In silico analyses have a great potential to assess key parameters of primers, including taxonomic coverage, resolution and specificity. Based on a recently-available reference database, we tested in silico the performance of fourteen commonly used and one newly optimized primer for nematode metabarcoding. Most primers showed very good coverage, amplifying most of the sequences in the reference database, while four markers showed limited coverage. All primers showed good taxonomic resolution. Resolution was particularly good if the aim was the identification of higher-level taxa, such as genera or families. Overall, species-level resolution was higher for primers amplifying long fragments. None of the primers was highly specific for nematodes as, despite some variation, they all amplified a large number of other eukaryotes. Differences in performance across primers highlight the complexity of the choice of markers appropriate for the metabarcoding of nematodes, which depends on a trade-off between taxonomic resolution and the length of amplified fragments. Our in silico analyses provide new insights for the identification of the most appropriate primers, depending on the study goals and the origin of DNA samples. This represents an essential step to design and optimize metabarcoding studies assessing nematode diversity.
Citation: Ficetola GF, Guerrieri A, Cantera I, Bonin A (2024) In silico assessment of 18S rDNA metabarcoding markers for the characterization of nematode communities. PLoS ONE 19(4): e0298905. https://doi.org/10.1371/journal.pone.0298905
Editor: Hideyuki Doi, University of Hyogo, JAPAN
Received: September 6, 2023; Accepted: January 31, 2024; Published: April 5, 2024
Copyright: © 2024 Ficetola et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All the analyses are based on two databases already available online: 18S-NemaBase and GenBank database: -The reference database used is available as supplementary appendix of Gattoni et al., 2023. Journal of Nematology 55 (https://github.com/WormsEtAl/18SNemaBase) -The GenBank database is available at https://ftp.ncbi.nlm.nih.gov/genbank/ -The script are available on figshare via the following DOI: 10.6084/m9.figshare.24722199.
Funding: G.F.F., A.G., I.C. and A.B. have been funded by the European Research Council under the European Community’s Horizon 2020 Programme, Grant Agreement no. 772284 (IceCommunities). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Nematodes are probably the most abundant animals on Earth, and are a crucial component of soil, freshwater and marine ecosystems [1–3]. Despite their importance, the complexity and labor of morphological identification has long limited broad-scale analyses of nematode biodiversity [4, 5]. The biodiversity of nematodes is estimated to be 1–10 million species, but less than 30,000 species have been described using morphology [6]. Recent advances in DNA metabarcoding have fostered the study of nematode biodiversity from a range of environments, highlighting their impressive diversity and the multiple key roles they play [2, 3, 7–12].
The identification of appropriate primers is a fundamental step of all DNA metabarcoding analyses [13, 14]. Several features are extremely important for the selection of primers. First, primers must have a low number of mismatches with the sequences of the target group [high taxonomic coverage; 15–18]. Second, primers should amplify highly variable regions to enable the identification of target taxa at a high taxonomic level [high resolution; 16, 17]. Third, short amplicons are generally favored, in order to reduce the cost of sequencing. Although recent advances in sequencing technologies now allow to sequence longer fragments with the same budget [19], the use of primers amplifying short fragments is relevant when working with environmental and ancient DNA, as it is often degraded and consists of short sequences [14, 20–23]. Finally, primers that only amplify the target taxon are frequently preferred (i.e. primers with high specificitity), because non-specific amplification can reduce the detection of target taxa, particularly if they show limited abundance [14, 17, 24, 25].
In silico approaches are extremely useful to assess key features of primers, including the number of mismatches with the target sequences (a key determinant of taxonomic coverage) and the potential taxonomic resolution [17]. In silico tests allow cheap and rapid comparisons of a very large number of primers and often provide a good estimate of the actual performance of primers across various taxa, that can later be confirmed by in vitro assays on real-world samples [11, 17, 26–29]. Accurate in silico assessments of primers for DNA metabarcoding require the availability of extensive, high-quality reference databases over which primers can be tested [17, 28]. The recent publication of a curated 18S rRNA database of nematode sequences [18S-NemaBase; 10] poses the basis for such assessments.
In this study, we built upon the 18S-NemaBase to compare the performance of 15 primers proposed for the metabarcoding of nematodes (Table 1). Using in silico PCR, we 1) assessed whether the selected primers are able to amplify a large proportion of nematode taxa (coverage), 2) tested the taxonomic resolution of primers and evaluated whether there is a trade-off between marker length and taxonomic resolution and 3) tested primer specificity, i.e., assessed whether they only amplify nematodes, or also amplify a broad range of other organisms. Our results help to evaluate the appropriateness of different primers for different aims, ranging from the analysis of potentially degraded environmental DNA (eDNA) to whole-organism community DNA [30].
Methods
Primer selection
All 15 primers selected for our analyses target the 18S rDNA region of nematodes (Table 1, Fig 1). Among these primers, 13 were selected because they have been identified by a previous review as the primers more commonly used in nematode metabarcoding [10]. One additional primer, Euka02 [31] is often used for metabarcoding of eukaryotic eDNA and it has been suggested to provide a good estimate of nematode diversity [8, 12, 31–33]. Furthermore, we developed a new primer pair (Nema02; see Table 1) by optimizing primer pair F548_A / R1912 from ref. [11]. More specifically, we performed in silico PCRs on the public sequence database GenBank v249 with the ecoPCR program [17] to evaluate Nematoda and non-Nematoda variability at each position of the sequences matching the F548_A and R1912 primers, and in a 10-base interval in 5’ and 3’ of these sequences. The objective was to fine-tune the primer sequences to maximize non-Nematoda variability while minimizing Nematoda variability in primer-matching sequences, especially in 3’, in order to increase specificity. Taxonomic resolution of the associated marker was evaluated using the ecostaxspecificity program of the OBITools package [34]. Compatibility of annealing temperatures and absence of problematic primer dimers or hairpins were checked using OligoCalc [http://biotools.nubic.northwestern.edu/OligoCalc.html; 35] and the OligoAnalyzer Tool (https://eu.idtdna.com/pages/tools/oligoanalyzer?returnurl=%2Fcalc%2Fanalyzer), respectively.
Primers were aligned with the Caenorhabditis elegans 18S rDNA gene (GenBank accession number: AY268117). Redrawn on the basis of ref. [10].
In silico PCR
The availability of high-quality, curated databases is pivotal to test the performance of metabarcoding primers. We based our analyses on the 18S-NemaBase [10], which represents the most complete and high-quality available reference database of nematode 18S rRNA. The original 18S-NemaBase includes 5231 sequences identified at family level or better, representing 214 families and 2734 species [10].
The performance of primers was tested using the ecoPCR program [17]. EcoPCR allows in silico assessment of amplification of a sequence on the basis of its match with a selected primer pair in a region of a specified length [17]. The sequence is selected from a given reference database. To work, the program requires the reference database in the ecoPCR format. Thus, we converted the 18S-NemaBase from the fasta to the ecoPCR format using the obiconvert command of the OBITools command suite [34]. The original 18S-NemaBase database consisted of 5231 sequences. However, 283 sequences (i.e. 5,4%) were excluded because they showed problems during the conversion from the fasta to the ecoPCR format (268 sequences) or because they were assigned to non-nematode taxa in 18S-NemaBase (15 sequences). Thus, we based our analyses on a total of 4948 sequences.
For in silico PCR, we allowed a maximum of 3 mismatches between each primer and the sequences in the database. For each primer pair, the length of amplified fragments was selected based on the literature (Table 1).
Running ecoPCR on the curated 18S-NemaBase allowed us to calculate two measures of primer performance:
- Taxonomic coverage, as the percentage of amplified sequences within the reference database;
- Resolution, i.e. the ability of markers to distinguish between closely-related taxa.
Taxonomic resolution was calculated using the procedure detailed in ref. [36]. First, all the sequences obtained in each ecoPCR were compared among them to produce a list of unique metabarcodes. We then obtained the list of taxa associated to each unique metabarcodes. Taxonomic resolution was tested at three levels: species, genus and family. We assessed if, for each unique metabarcodes and taxonomic level, all the amplified taxa belong to the same taxon. Let’s assume, for instance, that multiple species within one genus share the same metabarcode. This particular metabarcode shows a genus-level and a family-level resolution but not a species-level resolution. The average taxonomic resolution of markers was then calculated as the proportion of unique metabarcodes that have a species-level, genus-level and family-level resolution [36].
We used a linear regression to test whether there is a positive relationship between taxonomic resolution and the log-transformed average length of the amplified fragments. This analysis was run at the species-level resolution because all the primers showed excellent resolution at the genus- and family level (see Results).
Furthermore, we assessed the specificity of the tested primers using the whole GenBank database (version 249) instead of the 18S-NemaBase as reference database in the ecoPCR program. This time, we only retained one sequence per species, to avoid biases due to the overrepresentation of model species (e.g. Caenorhabditis elegans for nematodes). Then, specificity was measured as the percentage of nematodes amplified over non-target organisms [17]. Primers showing high specificity are in principle preferable because they have a higher probability of detecting target taxa, including rare species [14, 17, 28, 37, 38].
Finally, we used primer sequence logos (weblogos) to further assess the conservation of primers both in nematodes and in non-target taxa [14, 39]. In weblogos, we graphically represented the pattern of primer conservation, by retrieving the match of the forward and reverse primers against GenBank sequences, based on the results of the ecoPCR run over GenBank v249. Logos were built as stacks of symbols (A, C, G, T), with one stack for each position in the primer sequence. The height of each stack corresponds to the nucleotide conservation at each position, measured in bits and ranging from 0 (same probability for the four nucleotides) to 2 (perfect conservation of the position) [39]. Like specificity analyses (see above), weblogo analyses were run after randomly selecting only one amplicon per species, to avoid bias due to the overrepresentation of model species in GenBank. Weblogos were built using the ggseqlogo R package [39].
Results
Taxonomic coverage
The taxonomic coverage (proportion of amplified sequences compared to the 18S-NemaBase database) was highly variable across primers. The majority of primers showed very good to excellent coverage, amplifying >90% of sequences in the reference database, while four primers showed a limited coverage (Table 2, Fig 2). The three primers with the highest coverage were 3NDf-1132rmod, EcoF-EcoR and Euka02, all showing coverage ≥ 97%. These primers amplified fragments with very different lengths, spanning from 100 bp (Euka02) to more than 800 bp (1813F_2646R; Table 2).
Metabarcode length refers to the length of the amplified fragment.
Taxonomic resolution
All primers showed a good resolution if the aim was species-level identification. Even the primer with the lowest species-level resolution (Euka02), associated most (78%) of unique metabarcodes with just one nematode species (Table 2, Fig 3). Eight primers showed a species-level resolution >90%; all of them amplified fragments >350bp. The three primers with the highest species-level resolution (95%) showed generally low coverage (Table 2, Figs 2 and 3).
If the aim was genus-level identification, all primers showed excellent resolution, as they were able to tease apart 95% of genera or more. Resolution was even better if the target was family identification, with all primers showing a resolution ≥97% (Table 2, Fig 3). Overall, we observed a positive relationship between primer taxonomic resolution at the species level, and the length of the amplified metabarcodes (linear regression: F1,13 = 13.6, P = 0.003; R2 = 0.51; Fig 4), suggesting that the increase in the metabarcodes length facilitate taxonomic resolution at the species-level.
The size of symbols is proportional to primer coverage.
Specificity
When tested on the whole GenBank, primers showed specificity values that ranged between 1% or less (FO4-R22, SSU_F04-SSU_R22) and ~7% (1813F-2646R, Nema02; Fig 5a). These values indicate that all the markers amplify a very large number of non-nematode sequences. Nevertheless, some of the primers with high specificity only amplified a limited number of nematode taxa (Figs 1 and 5b). The primers with the best compromise between specificity and coverage included 18SILVOmidF-18SILVOmid, Euka02 and Nema02 (Figs 3–5). For both nematodes and non-nematodes, weblogos showed clear differences in the patterns of variability of bases in the regions matching the tested primers (Figs 6 and 7).
a) Specificity was measured as the N of nematode sequences / total N of amplified sequences. b) N of nematode species amplified by the different primer pairs, when the in silico PCR is run on the whole GenBank.
The height of stacks corresponds to the nucleotide conservation at that position; the height of symbols within stacks indicates the relative frequency of each nucleotide at that position.
The height of stacks corresponds to the nucleotide conservation at that position; the height of symbols within stacks indicates the relative frequency of each nucleotide at that position.
Discussion
Our analyses showed that primers suggested for the assessment of nematode diversity have heterogeneous performances, particularly in terms of taxonomic coverage and specificity. Conversely, the taxonomic resolution of all primers was generally good, and across-primer variation in resolution was strongly related to the well-known trade-off with metabarcode length (Fig 4). The selection of the most appropriate primer pair for the analyses of nematode biodiversity depends on the balance between multiple factors, including study aims, media from which DNA is extracted, and the adopted sequencing technology.
The taxonomic coverage was highly variable. The majority of primers showed very good to excellent coverage, as they amplified more than 90% (and sometimes >97%) of available sequences, still, some showed limited coverage. The primers with low coverage were either universal primers that amplify a large number of eukaryotes and are not designed specifically for nematodes (FO4—R22, 1391f- EukBr) [40–42], or primers from landmark studies of nematode phylogeny (1813F - 2646R, SSU_R22) [43, 44] that were developed using sequences available at that time. It should be highlighted that our bioinformatic analyses are based on the number and position of mismatches in the priming region. This parameter is particularly relevant in metabarcoding studies, when DNA is amplified by complex mixtures comprising the DNA of many different taxa. In these cases, a large number of mismatches is problematic. Species with less mismatches are amplified preferentially, while species with more mismatches, or with mismatches close to the 3’ end, tend to be overlooked [15, 16, 45, 46]. A few mismatches probably are less problematic in phylogenetic studies, where the DNA of just one species is extracted at each time directly from specimens [43, 44].
In silico assessments of primer coverage, like the one performed here, have their own limitations, as some taxa may be amplified in vitro but not in silico, and vice versa. In silico analyses focusing on curated databases can also miss key issues, such as non-specific amplification [28]. Nevertheless, several in vitro tests have confirmed the appropriateness of primers studied with in silico analyses [11, 26–28, 47, 48]. For instance, our results are in agreement with previous studies that used mock communities to compare the performance of MMSF-MMSR, NemF-18Sr2b, NF1_18Sr2b and SSU_F04-SSU_R22, and observed a limited coverage of SSU_F04-SSU_R22 [18, 49]. Similarly, Kenmotsu et al. [50] confirmed that F_1183-R_1631 and NF1_18Sr2b show a similar, very good performance; Geisen et al. [51] confirmed the excellent performance of 3NDf-1132rmod; and Guardiola et al. [31] suggested an excellent coverage for Euka02 [see also ref. 36 for in vitro tests confirming the excellent taxonomic coverage of this marker for all the tested invertebrate phyla]. Nevertheless, comparative in vitro tests performed on both mock communities and real samples will be extremely important to validate our conclusions on taxonomic coverage, particularly for primers that have received limited testing so far [e.g., Nema02, but see references in 11 for analyses confirming the good performance of related primers].
All primers showed good to excellent resolution on the considered reference database. Even the shortest primer (Euka02) showed a reasonably good ability to discriminate between species (Fig 3) and, for all primers, the frequency of genera sharing the same metabarcode was ≤5% (i.e. genus-level resolution was always 95% or higher). These findings are highly promising for the use of metabarcoding for nematode analyses. However, it is important to acknowledge some caveats. The use of high quality, curated databases is fundamental for all the metabarcoding analyses, being pivotal for assessments of marker performance, and for accurate taxonomic identification [14, 52–55]. Our analyses were run on a large, curated database containing about 5000 sequences from 214 families, 668 genera and 2734 species [10]. Unfortunately, this database only represents the currently described species and genera, but most nematodes inhabiting the Earth still require description [6, 56]. We stress that all measures of taxonomic resolution strongly depend on the available data [57]. For instance, if the reference database only includes one species within a given genus, analyses would return a species-level resolution, despite unanalysed species within that genus may share the same metabarcode [36]. Less optimistic resolution values might be obtained with broader reference databases.
A key issue of the analysed primers is that none of them is specific to nematodes, and they amplify a broad range of eukaryotes (Fig 5). Non-specific amplification can reduce the detection of rare taxa, and can even increase false positives [28, 37]. This can be particularly problematic when analyses target complex mixtures of DNA(e.g. eDNA extracted from soil) that comprise the DNA of both nematodes and other organisms [58, 59]. For instance, in vitro assessments of the EcoF-EcoR primers detected a very large number of non-nematode taxa, suggesting that this marker can be not appropriate for analyses only focusing on nematodes [48]. The low specificity of most primers make them valuable for whole analyses of eukaryote diversity in soil, sediments or aquatic environments [31, 40, 48]. Studies focusing on nematodes should thus assess whether the retrieved data are enough to obtain reliable estimates of species diversity / occurrence. Approaches such as rarefaction curves and analyses of detection probability can allow to assess whether key parameters, such as the number of replicates and sequencing depth, are enough to obtain robust biodiversity estimates, or need to be increased for well-grounded ecological inference [58, 60, 61]. Alternatively, fine-tuning the primer sequences so as to introduce variability at the 3’end of the sequence for non-nematode can greatly help increase the real marker specificity. For example, preliminary analyses of soil samples with the Nema02 marker showed that >50% of MOTUs and >70% of sequences were assigned to nematodes, despite the limited specificity of this marker in silico.
Within the primers with good taxonomic coverage, resolution was clearly related to the length of the amplified fragment (Fig 4). This is not unexpected, as longer fragments generally include more variable sites and thus can provide a better resolution [14; see below for further discussion]. Nevertheless, even the shortest fragments allow an excellent genus-level resolution. Many functional analyses of nematodes are performed at the genus level. For instance, Nemaplex (http://nemaplex.ucdavis.edu) is a major database of nematode traits, and provides traits at the genus-level resolution. Ensuring that primers provide robust identification at the genus level is thus extremely important for all the studies focusing on nematode traits and functional diversity.
The better resolution of markers amplifying long fragments is related to a frequent trade-off between taxonomic resolution and fragment length. Long markers generally have a larger number of informative bases, and are therefore expected to show a better capacity of discrimination between closely-related species. For instance, many barcoding studies use a standard marker amplifying a 658-bp long fragment of COI, because it is assumed to provide enough resolution to discriminate between closely-related species [62]. Nevertheless, the study of long markers can be problematic in some conditions. First, DNA extracted from difficult substrates (e.g., environmental DNA from sediments and water) or from museum specimens is often degraded, making the use of long markers challenging [14, 62–65]. Furthermore, some high-throughput sequencing technologies are particularly cost-effective for sequencing short amplicons (e.g. Illumina NovaSeq), while cost-effectiveness decreases and error rate increases if longer fragments are targeted [62]. Therefore, when planning their study, researchers must find the right balance considering study aims, targeted substrate and cost-effectiveness. Our analysis can help to identify the most appropriate primer pair depending on the study focus. Markers amplifying short fragments (<150 bp) can be preferred by studies extracting DNA from water or ancient sediments [66] or if the aim is the analysis of a massive amount of specimens that require highly cost-effective sequencing platforms [62]. Longer fragments (within 450–500 bp) can be appropriate for both metabarcoding of whole-organism community DNA, for intracellular eDNA, and for eDNA extracted from substrates protecting eDNA from degradation (e.g. developed soils with high clay content) [67, 68]. These fragments can be sequenced with platforms such as Illumina MiSeq, which still enable processing a large number of samples at reasonable prices. Primers well suited for this aim include 3NDf-1132rmod, MMSF-MMSR and, if confirmed by in vitro tests, Nema02. Among these primers, Nema02 is the one with the highest specificity for nematodes (Fig 5a and 5b). Finally, long fragments ensure the highest resolution and are particularly suited for whole-organism community DNA [but see 19 for a remarkable example of long-read analysis of DNA extracted from environmental samples]. It has been suggested that, beyond a given threshold, longer markers do not provide a better resolution because of the occurrence of a saturation point [62]. For nematodes, we did not detect such saturation point, as the highest resolution tended to increase with marker length also above 500 bp (Fig 4). The continuing developments of high-throughput sequencing are making long-read metabarcoding increasingly feasible at progressively more affordable prices.
Ongoing development of DNA metabarcoding are opening new avenues to the study of biodiversity, and to the identification of management priorities [12, 69–71]. Nematodes are increasingly recognized as key components of soil communities, and advances in molecular approaches to assess and monitor their biodiversity are extremely important for the growing knowledge on this highly diverse phylum [11, 56]. The selection of most appropriate markers is a fundamental step to maximise the information drawn by each study [61].
Acknowledgments
The authors acknowledge the support of the APC central fund of the University of Milan.
References
- 1. van den Hoogen J, Geisen S, Routh D, Ferris H, Traunspurger W, Wardle DA, et al. Soil nematode abundance and functional group composition at a global scale. Nature. 2019;572(7768):194–8. pmid:31341281
- 2. Delgado-Baquerizo M, Reich PB, Trivedi C, Eldridge DJ, Abades S, Alfaro FD, et al. Multiple elements of soil biodiversity drive ecosystem functions across biomes. Nature Ecol Evol. 2020;4(2):210–20. pmid:32015427
- 3. Crowther TW, van den Hoogen J, Wan J, Mayes MA, Keiser AD, Mo L, et al. The global soil community and its influence on biogeochemistry. Science. 2019;365:eaav0550. pmid:31439761
- 4. Sapkota R, Nicolaisen M. High-throughput sequencing of nematode communities from total soil DNA extractions. BMC Ecology. 2015;15(1):3. pmid:25880249
- 5. Porazinska DL, Giblin-Davis RM, Faller L, Farmerie W, Kanzaki N, Morris K, et al. Evaluating high-throughput sequencing as a method for metagenomic analysis of nematode diversity. Mol Ecol Resour. 2009;9(6):1439–50. pmid:21564930
- 6. Hodda M. Phylum Nematoda: a classification, catalogue and index of valid genera, with a census of valid species. Zootaxa. 2022;5114:001–289. pmid:35391386
- 7. Kerfahi D, Tripathi BM, Porazinska DL, Park J, Go R, Adams JM. Do tropical rain forest soils have greater nematode diversity than High Arctic tundra? A metagenetic comparison of Malaysia and Svalbard. Global Ecol Biogeogr. 2016;25:716–28.
- 8. Ohlmann M, Mazel F, Chalmandrier L, Bec S, Coissac E, Gielly L, et al. Mapping the imprint of biotic interactions on β-diversity. Ecol Lett. 2018;21(11):1660–9. pmid:30152092
- 9. Zawierucha K, Porazinska DL, Ficetola GF, Ambrosini R, Baccolo G, Buda J, et al. A hole in the nematosphere: tardigrades and rotifers dominate the cryoconite hole environment, whereas nematodes are missing. J Zool. 2021;313:18–36.
- 10. Gattoni K, Gendron EMS, Sandoval-Ruiz R, Borgemeier A, McQueen JP, Shepherd RM, et al. 18S-NemaBase: Curated 18S rRNA database of nematode sequences. J Nematol. 2023;55(1). pmid:37143483
- 11. Kawanobe M, Toyota K, Ritz K. Development and application of a DNA metabarcoding method for comprehensive analysis of soil nematode communities. Appl Soil Ecol. 2021;166:103974.
- 12. Guerrieri A, Cantera I, Marta S, Bonin A, Carteron A, Ambrosini R, et al. Local climate modulates the development of soil nematode communities after glacier retreat. Global Change Biol. 2024;30:e17057. pmid:38273541
- 13. Ficetola GF, Taberlet P. Towards exhaustive community ecology via DNA metabarcoding. Mol Ecol. 2023: pmid:36762839
- 14.
Taberlet P, Bonin A, Zinger L, Coissac E. Environmental DNA for biodiversity research and monitoring. Oxford: Oxford University Press; 2018 Apr 1789–93 p.
- 15. Piñol J, Mir G, Gomez-Polo P, Agustí N. Universal and blocking primer mismatches limit the use of high-throughput DNA sequencing for the quantitative metabarcoding of arthropods. Mol Ecol Resour. 2015;15:819–30. pmid:25454249
- 16. Marquina D, Andersson AF, Ronquist F. New mitochondrial primers for metabarcoding of insects, designed and evaluated using in silico methods. Mol Ecol Resour. 2019;19(1):90–104. pmid:30226026
- 17. Ficetola GF, Coissac E, Zundel S, Riaz T, Shehzad W, Bessière J, et al. An In silico approach for the evaluation of DNA barcodes. BMC Genomics. 2010;11:434. pmid:20637073
- 18. Ahmed M, Back MA, Prior T, Karssen G, Lawson R, Adams I, et al. Metabarcoding of soil nematodes: the importance of taxonomic coverage and availability of reference sequences in choosing suitable marker(s). Metabarcoding and Metagenomics. 2019;3:e36408.
- 19. Jamy M, Biwer C, Vaulot D, Obiol A, Jing H, Peura S, et al. Global patterns and rates of habitat transitions across the eukaryotic tree of life. Nature Ecol Evol. 2022;6:1458–70. pmid:35927316
- 20. Eichmiller JJ, Best SE, Sorensen PW. Effects of Temperature and Trophic State on Degradation of Environmental DNA in Lake Water. Environmental Science and Technology. 2016;50:1859–67. pmid:26771292
- 21. Jo T, Murakami H, Masuda R, Sakata MK, Yamamoto S, Minamoto T. Rapid degradation of longer DNA fragments enables the improved estimation of distribution and biomass using environmental DNA. Mol Ecol Resour. 2017;17(6):e25–e33. pmid:28449215
- 22. Bylemans J, Furlan EM, Gleeson DM, Hardy CM, Duncan RP. Does size matter? An experimental evaluation of the relative abundance and decay rates of aquatic environmental DNA. Environ Sci Technol. 2018;52(11):6408–16. pmid:29757618
- 23. Walker DM, Leys JE, Dunham KE, Oliver JC, Schiller EE, Stephenson KS, et al. Methodological considerations for detection of terrestrial small-body salamander eDNA and implications for biodiversity conservation. Mol Ecol Resour. 2017;17(6):1223–30. pmid:28296353
- 24. Coissac E, Riaz T, Puillandre N. Bioinformatic challenges for DNA metabarcoding of plants and animals. Mol Ecol. 2012;21:1834–47. pmid:22486822
- 25. MacDonald AJ, Sarre SD. A framework for developing and validating taxon-specific primers for specimen identification from environmental DNA. Mol Ecol Resour. 2017;17(4):708–20. pmid:27768246
- 26. Epp LS, Boessenkool S, Bellemain EP, Haile J, Esposito A, Riaz T, et al. New environmental metabarcodes for analysing soil DNA: potential for studying past and present ecosystems. Mol Ecol. 2012;21:1821–33. pmid:22486821
- 27. Clarke LJ, Soubrier J, Weyrich LS, Cooper A. Environmental metabarcodes for insects: in silico PCR reveals potential for taxonomic bias. Mol Ecol Resour. 2014;14:1160–70. pmid:24751203
- 28. Collins RA, Bakker J, Wangensteen OS, Soto AZ, Corrigan L, Sims DW, et al. Non-specific amplification compromises environmental DNA metabarcoding with COI. Methods Ecol Evol. 2019;10(11):1985–2001.
- 29. Van Nynatten A, Gallage KS, Lujan NK, Mandrak NE, Lovejoy NR. Ichthyoplankton metabarcoding: An efficient tool for early detection of invasive species establishment. Mol Ecol Resour. 2023;23(6):1319–33. pmid:37101312
- 30. Pawlowski J, Apothéloz-Perret-Gentil L, Altermatt F. Environmental DNA: What’s behind the term? Clarifying the terminology and recommendations for its future use in biomonitoring. Mol Ecol. 2020;29(22):4258–64. pmid:32966665
- 31. Guardiola M, Uriz MJ, Taberlet P, Coissac E, Wangensteen OS, Turon X. Deep-sea, deep-sequencing: Metabarcoding extracellular DNA from sediments of marine canyons. Plos One. 2015;10(10):e0139633. pmid:26436773
- 32. Calderón-Sanou I, Zinger L, Hedde M, Martinez-Almoyna C, Saillard A, Renaud J, et al. Energy and physiological tolerance explain multi-trophic soil diversity in temperate mountains. Divers Distrib. 2022;28:2549–64.
- 33. Rosero P, Crespo-Pérez V, Espinosa R, Andino P, Barragán A, Moret P, et al. Multi-taxa colonisation along the foreland of a vanishing equatorial glacier. Ecography. 2021;44:1010–21.
- 34. Boyer F, Mercier C, Bonin A, Le Bras Y, Taberlet P, Coissac E. OBITools: a unix-inspired software package for DNA metabarcoding. Mol Ecol Resour. 2016;16:176–82. pmid:25959493
- 35. Kibbe WA. OligoCalc: an online oligonucleotide properties calculator. Nucleic Acids Res. 2007;35(Web Server issue):W43–6. Epub 2007/04/25. pmid:17452344.
- 36. Ficetola GF, Bonin A, Boyer F, Valentini A, Meyer A, Dejean T, et al. Comparison of markers for the monitoring of freshwater benthic biodiversity through DNA metabarcoding. Mol Ecol. 2021;30:3189–202. pmid:32920861
- 37. Wilcox TM, McKelvey KS, Young MK, Jane SF, Lowe WH, Whiteley AR, et al. Robust detection of rare species using environmental DNA: The importance of primer specificity. Plos One. 2013;8(3):9. pmid:23555689
- 38. Leese F, Sander M, Buchner D, Elbrecht V, Haase P, Zizka VMA. Improved freshwater macroinvertebrate detection from environmental DNA through minimized nontarget amplification. Environmental DNA. 2021;3:261–76.
- 39. Wagih O. ggseqlogo: a versatile R package for drawing sequence logos. Bioinformatics. 2017;33(22):3645–7. pmid:29036507
- 40. Fonseca VG, Carvalho GR, Sung W, Johnson HF, Power DM, Neill SP, et al. Second-generation environmental sequencing unmasks marine metazoan biodiversity. Nat Commun. 2010;1(1):98. pmid:20981026
- 41.
Lane DJ. 16S/23S sequencing. In: Stackebrant E, Goodfellow M, editors. Nucleic acid technologies in bacterial systematic. New York: Wiley; 1991. p. 115–75.
- 42. Medlin L, Elwood HJ, Stickel S, Sogin ML. The characterization of enzymatically amplified eukaryotic 16S-like rRNA-coding regions. Gene. 1988;71(2):491–9. Epub 1988/11/30. pmid:3224833.
- 43. Blaxter ML, De Ley P, Garey JR, Liu LX, Scheldeman P, Vierstraete A, et al. A molecular evolutionary framework for the phylum Nematoda. Nature. 1998;392(6671):71–5. pmid:9510248
- 44. Holterman M, van der Wurff A, van den Elsen S, van Megen H, Bongers T, Holovachov O, et al. Phylum-wide analysis of SSU rDNA reveals deep phylogenetic relationships among nematodes and accelerated evolution toward crown clades. Mol Biol Evol. 2006;23(9):1792–800. pmid:16790472
- 45. Fonseca VG. Pitfalls in relative abundance estimation using eDNA metabarcoding. Mol Ecol Resour. 2018;18(5):923–6.
- 46. Wu JH, Hong PY, Liu WT. Quantitative effects of position and type of single mismatch on single base primer extension. Journal of microbiological methods. 2009;77(3):267–75. Epub 2009/03/17. pmid:19285527.
- 47. Kartzinel TR, Pringle RM. Molecular detection of invertebrate prey in vertebrate diets: trophic ecology of Caribbean island lizards. Mol Ecol Resour. 2015;15(4):903–14. pmid:25545675
- 48. Waeyenberge L, de Sutter N, Viaene N, Haegeman A. New insights into nematode DNA-metabarcoding as revealed by the characterization of artificial and spiked nematode communities. Diversity. 2019;11(4):52.
- 49. Sikder MM, Vestergård M, Sapkota R, Kyndt T, Nicolaisen M. Evaluation of metabarcoding primers for analysis of soil nematode communities. Diversity. 2020;12(10):388.
- 50. Kenmotsu H, Takabayashi E, Takase A, Hirose Y, Eki T. Use of universal primers for the 18S ribosomal RNA gene and whole soil DNAs to reveal the taxonomic structures of soil nematodes by high-throughput amplicon sequencing. PLOS ONE. 2021;16(11):e0259842. pmid:34780544
- 51. Geisen S, Snoek LB, ten Hooven FC, Duyts H, Kostenko O, Bloem J, et al. Integrating quantitative morphological and qualitative molecular methods to analyse soil nematode community responses to plant range expansion. Methods Ecol Evol. 2018;9(6):1366–78.
- 52. Leray M, Knowlton N, Ho SL, Nguyen BN, Machida RJ. GenBank is a reliable resource for 21st century biodiversity research. Proc Natl Acad Sci USA. 2019;116(45):22651–6. pmid:31636175
- 53. Morinière J, Balke M, Doczkal D, Geiger MF, Hardulak LA, Haszprunar G, et al. A DNA barcode library for 5,200 German flies and midges (Insecta: Diptera) and its implications for metabarcoding-based biomonitoring. Mol Ecol Resour. 2019;19(4):900–28. pmid:30977972
- 54. White HJ, León-Sánchez L, Burton VJ, Cameron EK, Caruso T, Cunha L, et al. Methods and approaches to advance soil macroecology. Global Ecol Biogeogr. 2020;29(10):1674–90.
- 55. Williams JW, Spanbauer TL, Heintzman PD, Blois J, Capo E, Goring SJ, et al. Strengthening global-change science by integrating aeDNA with paleoecoinformatics. Trends Ecol Evol. 2023. Epub 2023/05/26. pmid:37230884.
- 56. Hodda M. Phylum Nematoda: trends in species descriptions, the documentation of diversity, systematics, and the species concept. Zootaxa. 2022;5114:290–317. pmid:35391385
- 57. Weigand H, Beermann AJ, Ciampor F, Costa FO, Csabai Z, Duarte S, et al. DNA barcode reference libraries for the monitoring of aquatic biota in Europe: Gap-analysis and recommendations for future work. Sci Total Environ. 2019;678:499–524. pmid:31077928
- 58. Jurburg SD, Keil P, Singh BK, Chase JM. All together now: Limitations and recommendations for the simultaneous analysis of all eukaryotic soil sequences. Mol Ecol Resour. 2021;21(6):1759–71. pmid:33943001
- 59. Geisen S, Lara E, Mitchell E. Contemporary issues, current best practice and ways forward in soil protist ecology. Mol Ecol Resour. 2023;n/a(n/a). pmid:37259890
- 60. Ficetola GF, Pansu J, Bonin A, Coissac E, Giguet-Covex C, De Barba M, et al. Replication levels, false presences, and the estimation of presence / absence from eDNA metabarcoding data. Mol Ecol Resour. 2015;15:543–56. pmid:25327646
- 61. Zinger L, Bonin A, Alsos I, Bálint M, Bik H, Boyer F, et al. DNA metabarcoding—need for robust experimental designs to draw sound ecological conclusions. Mol Ecol. 2019;28:1857–62. pmid:31033079
- 62. Yeo D, Srivathsan A, Meier R. Longer is Not Always Better: Optimizing Barcode Length for Large-Scale Species Discovery and Identification. Syst Biol. 2020;69(5):999–1015. pmid:32065638
- 63. Troth CR, Sweet MJ, Nightingale J, Burian A. Seasonality, DNA degradation and spatial heterogeneity as drivers of eDNA detection dynamics. Sci Total Environ. 2021;768. pmid:33736342
- 64. Jo T, Minamoto T. Complex interactions between environmental DNA (eDNA) state and water chemistries on eDNA persistence suggested by meta-analyses. Mol Ecol Resour. 2021;21(5):1490–503. pmid:33580561
- 65. Sproul JS, Maddison DR. Sequencing historical specimens: successful preparation of small specimens with low amounts of degraded DNA. Mol Ecol Resour. 2017;17(6):1183–201. pmid:28199781
- 66. Parducci L, Bennet KD, Ficetola GF, Alsos IG, Suyama Y, Wood JR, et al. Ancient plant DNA in lake sediments. New Phytol. 2017;214:924–42. pmid:28370025
- 67. Levy-Booth DJ, Campbell RG, Gulden RH, Hart MM, Powell JR, Klironomos JN, et al. Cycling of extracellular DNA in the soil environment. Soil Biol Biochem. 2007;39(12):2977–91.
- 68. Pietramellara G, Ascher, Borgogni F, Ceccherini MT, Guerri G, Nannipieri P. Extracellular DNA in soil and sediment: fate and ecological relevance. Biology and Fertility of Soils. 2009;45:219–35.
- 69. Eisenhauer N, Buscot F, Heintz-Buschart A, Jurburg SD, Küsel K, Sikorski J, et al. The multidimensionality of soil macroecology. Global Ecol Biogeogr. 2021;30(1):4–10. pmid:33692654
- 70. Guerra CA, Bardgett RD, Caon L, Crowther TW, Delgado-Baquerizo M, Montanarella L, et al. Tracking, targeting, and conserving soil biodiversity. Science. 2021;371(6526):239. pmid:33446546
- 71. Zeiss R, Eisenhauer N, Orgiazzi A, Rillig M, Buscot F, Jones A, et al. Challenges of and opportunities for protecting European soil biodiversity. Conserv Biol. 2022;36(5):e13930. pmid:35510330
- 72. Stoeck T, Bass D, Nebel M, Christen R, Jones MDM, Breiner H-W, et al. Multiple marker parallel tag environmental DNA sequencing reveals a highly complex eukaryotic community in marine anoxic water. Mol Ecol. 2010;19(s1):21–31. pmid:20331767
- 73. Mangot J-F, Domaizon I, Taib N, Marouni N, Duffaud E, Bronner G, et al. Short-term dynamics of diversity patterns: evidence of continual reassembly within lacustrine small eukaryotes. Environ Microbiol. 2013;15(6):1745–58. pmid:23297806
- 74. Müller CA, de Mattos Pereira L, Lopes C, Cares J, dos Anjos Borges LG, Giongo A, et al. Meiofaunal diversity in the Atlantic Forest soil: A quest for nematodes in a native reserve using eukaryotic metabarcoding analysis. For Ecol Manage. 2019;453:117591.
- 75. Starke R, Kermer R, Ullmann-Zeunert L, Baldwin IT, Seifert J, Bastida F, et al. Bacteria dominate the short-term assimilation of plant-derived N in soil. Soil Biol Biochem. 2016;96:30–8.