Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Determining diet from faeces: Selection of metabarcoding primers for the insectivore Pyrenean desman (Galemys pyrenaicus)

  • Amaiur Esnaola ,

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Project administration, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Zoology and Animal Cell Biology, University of the Basque Country UPV/EHU, Leioa, Basque Country, Spain

  • Aitor Arrizabalaga-Escudero,

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Validation, Writing – original draft, Writing – review & editing

    Affiliation Department of Zoology and Animal Cell Biology, University of the Basque Country UPV/EHU, Leioa, Basque Country, Spain

  • Jorge González-Esteban,

    Roles Conceptualization, Investigation, Methodology, Supervision, Validation, Writing – review & editing

    Affiliation DESMA Estudios Ambientales, Sunbila, Navarre, Spain

  • Arturo Elosegi,

    Roles Conceptualization, Funding acquisition, Investigation, Supervision, Validation, Writing – review & editing

    Affiliation Department of Plant Biology and Ecology, University of the Basque Country UPV/EHU, Leioa, Basque Country, Spain

  • Joxerra Aihartza

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Supervision, Validation, Writing – review & editing

    Affiliation Department of Zoology and Animal Cell Biology, University of the Basque Country UPV/EHU, Leioa, Basque Country, Spain

Determining diet from faeces: Selection of metabarcoding primers for the insectivore Pyrenean desman (Galemys pyrenaicus)

  • Amaiur Esnaola, 
  • Aitor Arrizabalaga-Escudero, 
  • Jorge González-Esteban, 
  • Arturo Elosegi, 
  • Joxerra Aihartza


Molecular techniques allow non-invasive dietary studies from faeces, providing an invaluable tool to unveil ecological requirements of endangered or elusive species. They contribute to progress on important issues such as genomics, population genetics, dietary studies or reproductive analyses, essential knowledge for conservation biology. Nevertheless, these techniques require general methods to be tailored to the specific research objectives, as well as to substrate- and species-specific constraints. In this pilot study we test a range of available primers to optimise diet analysis from metabarcoding of faeces of a generalist aquatic insectivore, the endangered Pyrenean desman (Galemys pyrenaicus, É. Geoffroy Saint-Hilaire, 1811, Talpidae), as a step to improve the knowledge of the conservation biology of this species. Twenty-four faeces were collected in the field, DNA was extracted from them, and fragments of the standard barcode region (COI) were PCR amplified by using five primer sets (Brandon-Mong, Gillet, Leray, Meusnier and Zeale). PCR outputs were sequenced on the Illumina MiSeq platform, sequences were processed, clustered into OTUs (Operational Taxonomic Units) using UPARSE algorithm and BLASTed against the NCBI database. Although all primer sets successfully amplified their target fragments, they differed considerably in the amounts of sequence reads, rough OTUs, and taxonomically assigned OTUs. Primer sets consistently identified a few abundant prey taxa, probably representing the staple food of the Pyrenean desman. However, they differed in the less common prey groups. Overall, the combination of Gillet and Zeale primer sets were most cost-effective to identify the widest taxonomic range of prey as well as the desman itself, which could be further improved stepwise by adding sequentially the outputs of Leray, Brandon-Mong and Meusnier primers. These results are relevant for the conservation biology of this endangered species as they allow a better characterization of its food and habitat requirements.


The diet of many consumers is difficult to determine: direct observation can be methodologically challenging and visual identification of prey remains in faeces difficult [1], especially in the case of generalist feeders, which prey upon a large variety of species. Consumers rarely forage at random and prey choice can be affected by prey defence and escape strategies, by nutritional quality, or by spatial and temporal distributions of predators and prey, among other factors [2]. New molecular tools such as DNA metabarcoding allow non-invasive studies of diet, as well as assignation of the consumer species that produced a scat, thus preventing identification errors. These new techniques allow analysing the environmental DNA (eDNA) extracted from faeces [35] and the identification of both soft- and hard-bodied prey species to species level, what was impossible by means of traditional morphological techniques [6]. Furthermore, DNA metabarcoding facilitates high-resolution dietary analyses further disclosing trophic and habitat requirements of consumers and providing an invaluable tool to unveil food web structures [7], particularly useful for elusive and endangered species [811]. Especially, these innovative techniques make diet studies useful for species conservation. For instance, a cost-effective screening of multiple DNA metabarcodes in faeces detected a broad diversity of plants (99 taxa) in the diet of the vulnerable Italian hare (Lepus corsicanus), including items that leave no solid remains or that lack diagnostic taxonomic features [12]. Similarly, Schwarz et al. [13] documented dietary differences consistent across site and year in the diet of male and female harbor seals (Phoca vitulina), likely affecting on commercial prey such as salmon. Moreover, the foraging ecology of the Alpine long-eared bat Plecotus macrobullaris was inferred from the molecular analysis of faeces and from the ecological requirements of prey, which would be impossible by traditional radio-tracking methods [14]. Furthermore, the analysis of the prey consumed showed that the Mediterranean horseshoe bat Rhinolophus euryale does not only rely on the habitats where it directly hunts, but also on other habitats of vital importance for its prey’s larval stages, where bats do not forage [15]. Results like these help setting up guidelines for species management.

High throughput sequencing methods are effective when applied to the dietary analyses of predators [16] since they enable the examination of very degraded, fragmented and different DNA pieces without previous knowledge of prey identity [1,11,17]. However, some methodological constraints must be taken into account. On the one hand, DNA quality is affected by its transition time across the gut, as well as the exposure of scats to environmental conditions (e.g. temperature oscillations, sun, rain, humidity and fungal attacks) [2,18,19]. On the other hand, the high sensitivity of molecular methods may produce abundant false positives as a consequence of secondary predation, scavenging, or contamination from any source [20]. Finally, the completeness of diet characterization depends on methodological details such as the taxonomic coverage of primers, the spectrum of consumed prey species, PCR strategy, sequencing workflows, bioinformatics decisions or the information available in databases (e.g. [2123]).

Studies that target terrestrial animals usually rely on markers within the mitochondrial cytochrome c oxidase subunit I (COI) region, since it has a high copy number and variation suitable for allowing species-level identification [24,25]. Indeed, the COI has one of the most complete reference databases nowadays, particularly well represented for many invertebrate taxa in GenBank ( and BOLD System ( Nevertheless, even though the COI is short enough to be identified in fairly fragmented DNA sequences, for very fragmented sequences such as those usually found in faeces, shorter markers have been used successfully (e.g. [26]). The choice of the markers is usually guided by DNA reference databases, but also depends on the research question [27]. In this sense, Alberdi et al. [21] showed that the results of dietary analyses depend on the selection of marker regions, amplicon sizes, primers, as well as the taxonomic level required. While various primers do detect some prey species, detection of others is primer-dependant. If too long regions are amplified, some species will not be identified, resulting in false negatives. This effect will inflate the difference between samples or individuals, thus yielding skewed information about intraspecific variability. Therefore, the selection of the marker region and primers is a critical decision in any DNA metabarcoding study, as factors such as primer length or specificity have a great effect on the results [24,28].

False positives and false negatives in dietary studies can have strong implications in the interpretation of predator ecology, as well as in ecosystem management [29]. Successful identification of relevant prey taxa is key to obtain sound conclusions about the ecological role, trophic specialization and conservation of any predator—or their consumed prey [13,30]—. Besides, the need for simultaneous identification of diet and predator identity from faeces makes methodological decisions more demanding, as a broader phylogenetic spectrum must be targeted.

Here we conducted a DNA metabarcoding study to compare different primers for non-invasive determination of the diet and the identity of an aquatic predator. In particular, we aimed at assessing how different primer pairs—or their combinations—affect both the characterization of a phylogenetically diverse diet and the predator identity.

We chose as a model species the Pyrenean desman (Galemys pyrenaicus, É. Geoffroy Saint-Hilaire, 1811, Insectivora, Talpidae) a semi-aquatic insectivorous generalist mammal that lives in cool and clean mountain streams (Fig 1). It is endemic to the northern Iberian Peninsula and the Pyrenees, but its distribution area has been severely reduced during the last decades, being currently listed as Vulnerable in the Red List Categories by the IUCN [31]. Recent research has shown the desman to feed mainly on freshwater invertebrates [32,33] and to prefer riffles to runs or pools [34]. Even so, there is still a lack of information to adequately manage this endangered species [35,36]. Namely, it is still unknown to which extent the diet of desmans depend on prey availability, and which types of prey they select for; moreover, it is still unclear whether desmans’ reported habitat selection within rivers [34] reflects differences in prey availability or other factors. These and other questions ask for detailed diet studies.

Fig 1. A Pyrenean desman in the Elama Stream.

Photo by Joxerra Aihartza.

Our study provides valuable information about differences in primer efficiency when using DNA metabarcoding in diet studies, offering an overview of the taxonomic coverage provided by each primer set, as well as by their combinations. Moreover, regarding desman ecology, our results will be useful to better understand and compare the output of published studies (e.g. [32,33]), as well as to unveil the habitat requirements of this endangered species. This information is necessary to design and implement specific management actions for its conservation.

Materials and methods

Sample collection

We searched for faeces of the Pyrenean desman in the basins or the Urumea and Oria rivers, in the Basque Country (Northern Iberian Peninsula), between November 2015 and April 2016, either by prospecting rock crevices and roots, or by building artificial shelters (Table 1) specially designed for this species [37]. A total of 24 droppings were preserved in 98% ethanol and frozen at -80°C.

This study is part of a broader research on the spatial and trophic ecology of the Pyrenean desman, which met local legal requirements and was approved by the Ethics Committee for Animal Welfare of the University of the Basque Country (Ref. CEBA/M20/2016/022). No specific permissions were required for the activities carried out in this study, as samples were not invasively collected and did not involve manipulation of endangered or protected species.

Selection of universal primers

We selected five primer sets (Table 2) aiming at a broad taxonomic coverage of potential prey species. These primers amplify fragments of varying lengths within the COI region (Fig 2), currently used as the standard animal barcode region, which has a well-documented reference database.

Fig 2. Primer locations.

Visual representation of the marker locations in the mitochondrial COI (modified from Brandon-Mong et al. [22]).

DNA extraction, library preparation and NGS sequencing

DNA was extracted using the Qiagen Powerfecal DNA kit (Qiagen Iberia, S.L. Madrid), following the manufacturer guidelines. Subsequently, DNA was PCR amplified from extracts using the five primer sets (Table 2), at the Analytical Services (SGIker) of the University of the Basque Country, UPV/EHU. Samples were purified and a second reaction was performed to index each amplified product and attach Illumina adaptors using the Illumina Nextera v2 kit. Amplifications with Zeale, Gillet, Leray and Meusnier primer sets were performed with the Qiagen Multiplex PCR Kit protocol (Qiagen Iberia, S.L. Madrid, using 12.5 μL Qiagen 2X (1X final), 1.25 μL forward primer (10 μM; 0.5 μM final), 1.25 μL reverse primer (10 μM; 0.5 μM final), 8 μL H2O and 2 μL DNA, in a final volume of 25 μL. Amplification with Brandon-Mong primer set was performed with 12.5 μL KAPA HIFI 2X (1X final), 2.5 μL forward primer (10 μM; 1 μM final), 2.5 μL reverse primer (10 μM; 1 μM final), 5.5 μL H2O and 2 μL DNA, in a final volume of 25 μL. Each primer set had its own PCR program, modified from the reference to the used reactive, as indicated in the S1 Table. Once amplified, PCR outputs were sequenced in a Illumina MiSeq NGS platform (sequencing of 2x300 bp paired-end reads) with the MiSeq Reagent Kit v3 (600 cycle), following the manufacturer instructions.

Bioinformatic analyses

Paired-end reads were merged using USEARCH [42,43], demultiplexed by primers, adapter and primer sequences were removed, and reads were quality- and length-filtered using CUTADAPT [44]. Then, singletons were removed and the remaining sequences were screened for chimeras using USEARCH. UPARSE algorithm [45] was used to cluster sequences into Operational Taxonomic Units (OTUs) at a 97% similarity threshold. Finally, Genbank nt database was used to assign taxonomy to OTUs using BLAST ( Species level assignments were performed when query sequences matched reference sequences above 98% similarity and 75% overlap [16]. If query sequences matched more than one species in the database, the hit with the longest alignment length was selected. Besides, as a rule, only hits with e-value below 1e-20 were accepted [46] to make sure that the match did not occur by chance.

Subsequent analyses were performed taking into account the occurrence of identified prey taxa (the number of pellets that a taxon was found in) [47]. Primer outputs were also tested to see whether any of the OTUs built from them could also identify the predator itself, i.e. the Pyrenean desman.

Data analysis

To study the overall effect of primers on variation in species composition of diet, we performed a permutational multivariate ANOVA using adonis with 999 random permutations in vegan 2.4–6 package [48] for R version 3.4.3 [49]. A Jaccard distance measure was used to calculate dissimilarities between samples. We performed NMDS in vegan 2.4–6 package for R to visualize dissimilarities in species composition among samples. Pairwise differences in species composition between primers were also tested using the function pairwise.perm.manova in package RVAideMemoire 0.9-69-3 for R [50]. The variation in species composition within primer sets (i.e. the homogeneity) was also tested using the betadisper and permutest functions with 999 permutations in package vegan 2.4–6 for R [48]. Pairwise differences in homogeneity between primer sets were analysed using Tukey's HSD test in package vegan 2.4–6 for R [48].


The sequencing output differed considerably among primer sets, both in the amount of sequence reads and in the number of rough (total) OTUs (Fig 3). Gillet primers yielded the highest numbers of reads and rough OTUs, followed by Leray (S2 Table). Brandon-Mong, Gillet and Leray primers identified the desman itself, but only Gillet and Leray did so in all faecal samples; no other predator whose scats could be mistaken was identified. All five primers identified the most common prey taxa (namely Baetis sp., Hydropsyche sp., Odontocerum sp. and Psychoda sp.) but differed considerably in the less abundant prey groups (Fig 4 and S3 Table). Quantitatively, Gillet yielded the largest list of taxonomically assigned OTUs (19.15% of the OTUs), as well as the highest occurrence values. Zeale and Leray primers followed in rough OTUs and assignments (with the 38.5% and the 7.81% of the OTUs assigned, respectively), whilst Brandon-Mong and Meusnier primers showed the lowest efficiency, with less rough OTUs, and only 15.79% and 7.12% of them assigned, respectively. Taking into account the number of taxa assigned, Gillet primer set was the most efficient amplifying DNA of Mollusca, Annelida and arthropods such as Diptera, Ephemeroptera, Plecoptera, Trichoptera or Arachnida (S3 Table); Meusnier primer set was the most efficient amplifying Salmonids, whereas Zeale set was the best amplifying Coleoptera. Although the detected occurrence was lower, Leray primer set was also quite efficient amplifying Trichoptera and Coleoptera. Comparing Gillet with Zeale and Leray separately, Gillet primer set provided more information on Mollusca and arthropods such as Diptera, Ephemeroptera, Plecoptera, Trichoptera and Arachnida than Zeale, whereas the reverse occurred for Coleoptera, Psocoptera and Decapoda. Gillet primer sets outperformed Leray primers for Mollusca, Annelida and arthropods such as Coleoptera, Diptera, Ephemeroptera, Plecoptera, Psocoptera, Trichoptera and Arachnida, while Leray yielded slightly higher values for Nematoda.

Fig 3. Species-level taxonomic assignment of OTUs obtained for each primer set.

Number of OTUs and % of total sequences obtained.

Fig 4. Number of OTUs assigned to species and their occurrences for each primer set.

Results taking into account high taxonomical levels.

Primer choice had statistically significant effects on the resulting diet composition (F(4,115) = 15.712; R2 = 0.353; p = 0.001). This was clearly illustrated by the NMDS (Fig 5), where samples were clustered in 4 main groups: Zeale, Meusnier, Brandon-Mong and Leray+Gillet. There were significant overall and pairwise differences in species composition between primer sets (overall test: F = 10.425; p < 0.001; pairwise tests: all p = 0.001), but samples amplified with Zeale and Leray or Gillet differed the most in terms of species composition, whereas Leray and Gillet primer sets showed rather similar composition, although Leray seemed to yield higher variability. Moreover, differences in species composition were lowest between Meusnier and Gillet primer sets (p = 0.029), followed by Zeale and Meusnier (p = 0.013) and Gillet and Brandon-Mong (p = 0.003). On the contrary, the highest differences were found between Zeale and Gillet (p < 0.001), followed by Leray and Gillet (p = 0.001) (Fig 6).

Fig 5. NMDS ordination of samples.

Stress = 0.193; k = 2; non-metric fit R2 = 0.963. Dots represent individual desman samples and colours different primer sets. OTUs are represented with green letters. More distant dots indicated a more different prey composition.

Fig 6. Tukey's HDS plot for all the comparisons between primer sets.

Red colour represents significant differences (p < 0.05). The plot shows the differences in mean levels of groups where the largest difference in homogeneity was between Gillet and Zeale, followed by Gillet and Leray.

We optimized the potential prey species identified by accumulating the amplification outputs of the five primer sets. Gillet provided the highest prey identifications on its own (Fig 7). When combining Gillet with the other sets, the identified species number increased most with Zeale (a total of 129 prey species or accurate taxonomic groups, 37.2% more than only using Gillet). Subsequent combinations yielding the highest increases in species numbers were obtained by adding up Leray first, followed by Brandon-Mong and Meusnier primer sets, in this order, with quantitatively smaller increases in each step.

Fig 7. Number of prey species accumulations obtained by different and progressive primer combinations.

Progressive primer combinations (one+one, two+one, and so on) were selected according to the highest species accumulation values (black columns) obtained in previous combination-level. B refers to Brandon-Mong primers, G to Gillet, L to Leray, M to Meusnier and Z to Zeale.


Our study shows that the selection of molecular markers—even primers targeting different sections of the same gene—considerably affect the characterization of diet. This is particularly important in generalist predators with a phylogenetically diverse diet. Although some prey taxa were consistently detected by all five primer sets used—likely the ones most frequently consumed—, the resulting picture of dietary composition depended on the primer set considered. Our results confirmed those by Alberdi et al. [21] in bats, who had also observed contrasting performance of four primer sets targeting two different genetic markers (the COI and the 16S).

Primer performance differed not only quantitatively, in the total amounts of sequence reads, rough and taxonomically assigned OTUs, and numbers of identified putative prey specific taxa, but also qualitatively, in the proportion of the main phylogenetic groups each primer yielded. These differences may be due to the specific affinity of each primer set to amplify certain taxa, due to the amplicon length, or to the degeneration degree of each primer as well.

Specifically, since longer sequences of DNA resist digestion worse [26], the primer sets targeting longer DNA fragments are less efficient in detecting prey from faeces. The integrity of a sequence of a given length may be related to various factors such as differences in digestion among tissues, the time under digestion, retention time, etc., and this variation differs among prey species (reviewed in [51]). This is partially consistent with our case study, where Gillet primers, targeting almost the shortest fragment of COI gene (Table 1), yielded the highest output in terms of sequence reads, OTUs, and potential prey detected as well. On the contrary, Leray primers, which targeted a similarly broad taxonomic range but with much longer amplicons, retrieved less fragments of DNA and were unable to amplify some taxa that Gillet primer pairs did recover. Nevertheless, primers targeting almost the same stretch length—Gillet and Meusnier, for instance—performed unequally as well, both in terms of quantitative and qualitative data. This may be a consequence of the different degrees of degeneration of the primers [52]. In fact, Gillet and Leray primers, showing the highest degeneration degree (Table 2), are the most successful in terms of OTUs gathered after the PCR, but many of them could not be assigned to any known taxa (only 19.15% and 7.8% of them, respectively), suggesting that they would also amplify many other DNA substrates beyond the targeted COI fragment. On the contrary, primers with lesser degeneration—such as Zeale—produced a much smaller amount of OTUs, but a higher proportion of them (38.5%) were assigned to know taxa, showing a higher bond to the barcode region. Consequently, the varying persistence of relatively long DNA sequences after digestion, and the primers used to amplify them, affect the final perception about the intraspecific variation of individual diets. Therefore, the selection of suitable primer sets for diet analyses is crucial when a wide dietary variation is expected [53].

Recently, a pipeline for the diet analysis of the desman has been published [33], where DNA metabarcoding of old and fresh faecal samples was implemented with nested PCRs. This methodological contribution adds a pre-amplification step to increase the number of reads corresponding to the target taxa. Nevertheless, authors pointed out that their procedure could lead to an increase in the specificity of the amplification, thus losing some essential prey sequences. Our results suggest that such nested PCR would further increase primer bias, multiplying the effect of the two primer sets used. Hawlitschek et al. [33] also proposed that primer cocktails or pools of amplification products of more than one primer pair should be used in future studies to improve the amplification success of the target group. That is precisely what the present work tested and showed to be true.

Regarding the desman trophic ecology, two recent metabarcoding studies provided new data: Biffi et al. [32], employing only the Gillet primer set, described the desman as “more generalist than previously thought” as a consequence of its diverse summer diet, mostly based on Ephemeroptera and Plecoptera in the North of Pyrenees; Hawlitschek et al. [33], working in the north-western Iberian Peninsula with Meusnier primers and nested PCR, found that Ephemeroptera and Diptera were the most abundant prey groups. Our data, although carried out in a different area and more limited in sample size, also identified these taxa as relevant components of the desman winter diet. However, the number of species and their occurrence rates largely depended on the primer set considered. To which extent do the results of these studies show methodological differences rather than regional variations in desman prey availability? We can hardly tell. Our results suggest the characterization of the diet of desmans—or any other animal—using a single primer set to be prone to serious biases. These will weaken ecological conclusions such as predator-prey interactions or prey selection patterns, which may be key for implementing conservation measures.

A species might be characterised as generalist due to the overall consideration of many individual-level specialist diets [54] or due to the capacity of individuals to forage on a wide diversity of food items [55]. When analysing the diet of a generalist predator, the choice of primer sets may condition the interpretation of its trophic ecology and specialization. The way foraging habits are perceived can in turn affect the interpretation of community dynamics and ecosystem functioning [56], and thus, any biased description of diet can lead managers to misinterpret food chain structures and to take wrong conservation decisions [57,58]. For instance, trophic relationships have been described with molecular techniques for bats [5961], birds [62,63], rodents [64] and invertebrates such as snails [65], among others. I t is essential to invest in accurate trophic and spatial ecology studies to obtain detailed knowledge about the trophic requirements of a species. Our results stress the importance of combining different primer sets to detect the widest range of potential prey species and to avoid losing essential information. Methodological decisions affect the assessment of trophic requirements of any animal, as well as management measures based on this assessment [66].

Additionally, when working with endangered and elusive animals it is essential to unambiguously identify the source of faecal samples collected. So far, molecular tools have been useful to provide accurate dietary description from scats in a wide range of predators such as bats [46,67], carnivores [68,69], pinnipeds [8], birds [70] and the desman itself [32]. In the present study desman was identified from faeces with 3 of the tested primer sets, namely Leray, Gillet and Meusnier. However, only Leray and Gillet primer sets successfully identified the desman in all analysed faecal samples, confirming the lower performance of Meusnier to identify Chordata [40]. In general, in diet studies involving insectivorous predators with non-unequivocally identifiable faeces, at least one of the selected primers should be able to amplify DNA of the predator, reducing identification mistakes.

Our results suggest that the diet of Pyrenean desman can be characterised by combining multiple primers, but primer selection must also consider data accuracy and costs. In terms of cost-effectiveness, Gillet and Zeale primer sets would be the best combination to identify the widest taxonomic range of prey, as well as the desman itself. This combination would further improve stepwise by sequentially adding Leray, Brandon-Mong and Meusnier primers, from highest to lowest enrichment. Nevertheless, it must also be taken into account that to combine primers the sequences they amplify must not differ too much in length, to avoid biases towards the smallest amplicons in some NGS sequencing procedures. Thus, in our case the combination of Gillet and Zeale primers would likely result in less problems than the combination of Gillet and Leray, as fragment amplicons with Leray are double in size.

Overall, the present study shows that using different sub-regions in a specific marker yields contrasting results, and highlights the ecological relevance of considering several primer sets to characterize the prey spectrum of generalist predators. In the end, when taking conservation measures for threatened species the methodological procedure could be crucial. For instance, the diet of an endangered animal yields key information about its ecological role and its biology, information that can be essential to protect it. Thus, in diet metabarcoding studies it is necessary to test and assess different primer combinations. Trophic and habitat requirements are key factors in the conservation of many endangered species [71, 72]. Given that results on diet depend so much on the combination of primers used, testing and assessing different primer combinations is necessary in diet metabarcoding studies.

Supporting information

S1 Table. Details of PCR conditions for the five primer sets.


S2 Table. Gross sequence reads, sieved sequence reads and number of OTUs obtained with each primer set.


S3 Table. Number of species identified (N.Sp.) number of occurrences (Occ.) in each taxonomic group, for each primer set.



We thank Iñigo Mendiola and Aitor Lekuona (Provincial Council of Gipuzkoa), and Asunción Yarzabal and Iñaki Uranga (City Council of San Sebastián) for their assistance in making this work possible, and also Secuencing and Genotyping Unit—Genomic Facilities—SGIker (UPV/EHU/ERDF, EU) for technical and human support provided by them. The authors declare that no Conflict of Interests exists and that the commercial company DESMA Estudios Ambientales does not alter our adherence to PLOS ONE policies on sharing data and materials.


  1. 1. Clare EL, Fraser EE, Braid HE, Fenton MB, Hebert PDN. Species on the menu of a generalist predator, the eastern red bat (Lasiurus borealis): using a molecular approach to detect arthropod prey. Mol Ecol. 2009; 18(11): 2532–2542. pmid:19457192
  2. 2. Symondson WOC. Molecular identification of prey in predator diets. Mol Ecol. 2002; 11(4): 627–641. pmid:11972753
  3. 3. Bohmann K, Evans A, Gilbert MTP, Carvalho GR, Creer S, Knapp M, et al. Environmental DNA for wildlife biology and biodiversity monitoring. Trends Ecol Evol. 2014; 29(6): 358–367. pmid:24821515
  4. 4. Sheppard SK, Harwood JD. Advances in molecular ecology: tracking trophic links through predator-prey food-webs. Funct Ecol. 2005; 19(5): 751–762.
  5. 5. Taberlet P, Waits LP, Luikart G. Noninvasive genetic sampling: look before you leap. Trends Ecol Evol. 1999; 14(8): 323–327. pmid:10407432
  6. 6. Razgour O, Clare EL, Zeale MRK, Hanmer J, Schnell IB, Rasmussen M, et al. High-throughput sequencing offers insight into mechanisms of resource partitioning in cryptic bat species. Ecol Evol. 2011; 1(4): 556–570. pmid:22393522
  7. 7. Pompanon F, Deagle BE, Symondson WOC, Brown DS, Jarman SN, Taberlet P. Who is eating what: diet assessment using next generation sequencing. Mol Ecol. 2012; 21: 1931–1950. pmid:22171763
  8. 8. Casper RM, Jarman SN, Gales NJ, Hindell MA. Combining DNA and morphological analyses of faecal samples improves insight into trophic interactions: a case study using a generalist predator. Mar Biol. 2007; 152(4): 815–825.
  9. 9. Greenstone MH, Rowley DL, Lundgren JG, Pfannenstiel RS, Rehner SA. Barcoding generalist predators by polymerase chain reaction: carabids and spiders. Mol Ecol. 2005; 14(10): 3247–3266. pmid:16101789
  10. 10. Piggot MP, Taylor AC. Remote collection of animal DNA and its applications in conservation management and understanding the population biology of rare and cryptic species. Wildl Res. 2003; 30(1): 1–13.
  11. 11. Valentini A, Pompanon F, Taberlet P. DNA barcoding for ecologists. Trends Ecol Evol. 2009; 24(2): 110–117. pmid:19100655
  12. 12. Buglione M, Maselli V, Rippa D, de Filippo G, Trapanese M, Fulgione D. A pilot study on the application of DNA metabarcoding for non-invasive diet analysis in the Italian hare. Mamm Biol. 2018: 31–42.
  13. 13. Schwarz D, Spitzer SM, Thomas AC, Kohnert CM, Keates TR, Acevedo-Gutiérrez A. Large-scale molecular diet analysis in a generalist marine mammal reveals male preference for prey of conservation concern. Ecol Evol. 2018: 1–17.
  14. 14. Alberdi A, Garin I, Aizpurua O, Aihartza J. The foraging ecology of the mountain long-eared bat Plecotus macrobullaris revealed with DNA mini-barcodes. PLoS ONE. 2012; 7(4): e35692. pmid:22545129
  15. 15. Arrizabalaga-Escudero A, Garin I, García-Mudarra JL, Alberdi A, Aihartza J, Goiti U. Trophic requirements beyond foraging habitats: The importance of prey source habitats in bat conservation. Biol Conserv. 2015; 191: 512–519
  16. 16. Clare EL, Symondson WO, Broders H, Fabianek F, Fraser EE, MacKenzie A, et al. The diet of Myotis lucifugus across Canada: assessing foraging quality and diet variability. Mol Ecol. 2013; 23(15): 3618–3632. pmid:24274182
  17. 17. Pompanon F, Samadi S. Next generation sequencing for characterizing biodiversity: promises and challenges. Genetica. 2015; 143(2): 133–138. pmid:25613325
  18. 18. Deagle BE, Chiaradia A, McInnes J, Jarman SN. Pyrosequencing faecal DNA to determine diet of little penguins: is what goes in what comes out? Conserv Genet. 2010; 11(5): 2039–2048.
  19. 19. Oehm J, Juen A, Nagiller K, Neuhauser S, Traugott M. Molecular scatology: How to improve prey DNA detection success in avian faeces? Mol Ecol Resour. 2011; 11(4): 620–628. pmid:21676193
  20. 20. Sheppard SK, Bell J, Sunderland KD, Fenlon J, Skervin D, Symondson WOC. Detection of secondary predation by PCR analyses of the gut contents of invertebrate generalist predators. Mol Ecol. 2005; 14(14): 4461–4468. pmid:16313606
  21. 21. Alberdi A, Aizpurua O, Gilbert MTP, Bohmann K. Scrutinizing key steps for reliable metabarcoding of environmental samples. Methods Ecol Evol. 2017; 9(1): 134–147.
  22. 22. Brandon-Mong G-J, Gan H-M, Sing K-W, Lee P-S, Lim P-E, Wilson J-J. DNA metabarcoding of insects and allies: an evaluation of primers and pipelines. Bull Entomol Res. 2015; 105(6): 717–727. pmid:26344799
  23. 23. Divoll TJ, Brown VA, Kinne J, McCracken GF, O'Keefe JM. Disparities in second-generation DNA metabarcoding results exposed with accessible and repeatable workflows. Mol Ecol Resour. 2018; 18(3): 590–601. pmid:29455464
  24. 24. Elbrecht V, Leese F. Validation and development of COI metabarcoding primers for freshwater macroinvertebrate bioassessment. Front Environ Sci. 2017; 5: 1–11.
  25. 25. Hebert PDN, Cywinska A, Ball SL, deWaard JR. Biological identifications through DNA barcodes. Proc R Soc Lond B: Biol Sci. 2003; 270(1512): 313–321.
  26. 26. Deagle BE, Eveson JP, Jarman SN. Quantification of damage in DNA recovered from highly degraded samples-a case study on DNA in faeces. Front Zool. 2006; 3: 1–11.
  27. 27. Creer S, Deiner K, Frey S, Porazinska D, Taberlet P, Thomas WK, et al. The ecologist's field guide to sequence-based identification of biodiversity. Methods Ecol Evol. 2016; 7(9): 1008–1018.
  28. 28. Gibson J, Shokralla S, Porter TM, King I, Konynenburg SV, Janzen DH, et al. Simultaneous assessment of the macrobiome and microbiome in a bulk sample of tropical arthropods through DNA metasystematics. PNAS. 2014; 111(22): 8007–8012. pmid:24808136
  29. 29. Hines J, Gessner MO. Consumer trophic diversity as a fundamental mechanism linking predation and ecosystem functioning. J Anim Ecol. 2012; 81(6): 1146–1153. pmid:22676625
  30. 30. Waraniak JM, Baker EA, Scribner KT. Molecular diet analysis reveals predator–prey community dynamics and environmental factors affecting predation of larval lake sturgeon Acipenser fulvescens in a natural system. J Fish Biol. 2018. pmid:29956319
  31. 31. Fernandes G, Herrero J, Aulagnier S, Amori G. Galemys pyrenaicus. In: The IUCN Red List of Threatened Species; 2008. Available from [12 Jun 2018].
  32. 32. Biffi M, Gillet F, Laffaille P, Colas F, Aulagnier S, Blanc F, et al. Novel insights into the diet of the Pyrenean desman (Galemys pyrenaicus) using next-generation sequencing molecular analyses. J Mammal. 2017; 98(5): 1497–1507.
  33. 33. Hawlitschek O., Fernández-González A., Balmori-de la Puente A., Castresana J. A pipeline for metabarcoding and diet analysis from fecal samples developed for a small semi-aquatic mammal. PLoS ONE. 2018; 13(8): e0201763. pmid:30106960
  34. 34. Esnaola A, González-Esteban J, Elosegi A, Arrizabalaga-Escudero A, Aihartza J. Need for speed: Preference for fast‐flowing water by the endangered semi‐aquatic Pyrenean desman (Galemys pyrenaicus) in two contrasting streams. Aquat Conserv. 2018 Apr 27.
  35. 35. Estrategia para la conservación del Desmán Ibérico (Galemys pyrenaicus) en España. Madrid, Spain: Ministerio de Agricultura, Alimentación y Medio Ambiente (MAGRAMA); 2013.
  36. 36. Némoz M, Bertrand A. Plan national d'actions en faveur du desman des Pyrénées (Galemys pyrenaicus), 2009–2014. Toulouse, France: Société Française pour l'Etude et la Protection des Mammifères/Ministère de l'Ecologie, de l'Energie, du Développement Durable et de l'Aménagement du Territoire; 2008
  37. 37. González-Esteban J, Esnaola A, Aihartza A. A new sampling method to detect the Pyrenean desman (Galemys pyrenaicus). Hystrix. Forthcoming.
  38. 38. Gillet F, Tiouchichine M-L, Galan M, Blanc F, Némoz M, Aulagnier S, et al. A new method to identify the endangered Pyrenean desman (Galemys pyrenaicus) and to study its diet, using next generation sequencing from faeces. Mamm Biol. 2015; 80(6): 505–509.
  39. 39. Leray M, Yang JY, Meyer CP, Mills SC, Agudelo A, Ranwez V, et al. A new versatile primer set targeting a short fragment of the mitochondrial COI region for metabarcoding metazoan diversity: application for characterizing coral reef fish gut contents. Front Zool. 2013; 10(1): 34.
  40. 40. Meusnier I, Singer GAC, Landry J-F, Hickey DA, Hebert PDN, Hajibabaei M. A Universal DNA mini-barcode for biodiversity analysis. BMC Genomics. 2008; 9(1): 214.
  41. 41. Zeale MRK, Butlin RK, Barker GLA, Less DC, Jones G. Taxon-specific PCR for DNA barcoding arthropod prey in bat faeces. Mol Ecol Resour. 2011; 11(2): 236–244. pmid:21429129
  42. 42. Edgar RC. Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010; 26(19): 2460–2461. pmid:20709691
  43. 43. Edgar RC, Flyvbjerg H. Error filtering, pair assembly and error correction for next-generation sequencing reads. Bioinformatics. 2015; 31(21): 3476–3482. pmid:26139637
  44. 44. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J, 2011; 17(1): 10–12.
  45. 45. Edgar RC. UPARSE: highly accurate OTU sequences from microbial amplicon reads. Nat Methods. 2013; 10(10): 996–998. pmid:23955772
  46. 46. Vesterinen EJ, Lilley T, Laine VN, Wahlberg N. Next generation sequencing of fecal DNA reveals the dietary diversity of the widespread insectivorous predator Daubenton’s bat (Myotis daubentonii) in Southwestern Finland. PLoS ONE. 2013; 8(11): e0201763.
  47. 47. Shiel CB, McAney CM, Sullivan C, Fairley JS. Identification of arthropod fragments in bat droppings. London: Mammal Society; 1997.
  48. 48. Oksanen J, Blanchet FG, Kindt R, Legendre P. Others (2010) vegan: community ecology package. R package version, 2–0; 2016.
  49. 49. R Core Team (2017). R, version 3.4.3. R Foundation for Statistical Computing, Vienna, Austria. [cited 21 September 2018] Available from:
  50. 50. Hervé M (2014). RVAideMemoire: diverse basic statistical and graphical functions; 2014. Available from: [12 Jun 2018].
  51. 51. Clare EL. Molecular detection of trophic interactions: emerging trends distinct advantages, significant considerations and conservation applications. Evol Appl. 2014; 7: 1144–1157. pmid:25553074
  52. 52. Elbrecht V, Leese F. Validation and development of COI metabarcoding primers for freshwater macroinvertebrate bioassessment. Front Environ Sci. 2017; 5: 1–11.
  53. 53. De Barba M, Miquel C, Boyer F, Mercier C, Rioux D, Coissac E, et al. DNA metabarcoding multiplexing and validation of data accuracy for diet assessment: application to omnivorous diet. Mol Ecol Resour. 2014; 14(2): 306–323. pmid:24128180
  54. 54. Araujo MS, Bolnick DI, Layman CA. The ecological causes of individual specialisation. Ecol Lett. 2011; 14(9): 948–958. pmid:21790933
  55. 55. Clare EL, Barber BR, Sweeney BW, Hebert PDN, Fenton MB. Eating local: influences of habitat on the diet of little brown bats (Myotis lucifugus). Mol Ecol. 2011; 20(8): 1772–1780. pmid:21366747
  56. 56. Bolnick DI, Amarasekare P, Araújo MS, Bürger R, Levine JM, Novak M, et al. Why intraspecific trait variation matters in community ecology. Trends Ecol Evol. 2011; 26(4): 183–192. pmid:21367482
  57. 57. Bolnick DI, Svanbäck R, Fordyce JA, Yang LH, Davis JM, Hulsey CD, et al. The ecology of individuals: incidence and implications of individual specialization. Am Nat. 2003; 161(1): 1–28. pmid:12650459
  58. 58. Devictor V, Clavel J, Julliard R, Lavergne S, Mouillot D, Thuiller W, et al. Defining and measuring ecological specialization. J Appl Ecol. 2010; 47(1): 15–25.
  59. 59. Burgar JM, Murray DC, Craig MD, Haile J, Houston J, Stokes V, et al. Who's for dinner? High-throughput sequencing reveals bat dietary differentiation in a biodiversity hotspot where prey taxonomy is largely undescribed. Mol Ecol. 2013; 23(15): 3605–3617. pmid:24118181
  60. 60. Mata VA, Amorim F, Corley MFV, McCracken GF, Rebelo H, Beja P. Female dietary bias towards large migratory moths in the European free-tailed bat (Tadarida teniotis). Biol Letters. 2016; 12(3): 20150988.
  61. 61. Krauel JJ, Brown V, Westbrook JK, McCracken GF. Predator–prey interaction reveals local effects of high-altitude insect migration. Oecologia. 2018; 186(1): 49–58. pmid:29101468
  62. 62. Gerwing TG, Kim J-H, Hamilton DJ, Barbeau MA, Addison JA. Diet reconstruction using next-generation sequencing increases the known ecosystem usage by a shorebird. Auk. 2016; 133(2): 168–177.
  63. 63. Ando H, Setsuko S, Horikoshi K, Suzuki H, Umehara S, Inoue-Murayama M, et al. Diet analysis by next-generation sequencing indicates the frequent consumption of introduced plants by the critically endangered red-headed wood pigeon (Columba janthina nitens) in oceanic island habitats. Ecol Evol. 2018; 3(12): 4057–4069.
  64. 64. Lopes CM, De Barba M, Boyer F, Mercier C, Da Silva Filho PJS, Heidtmann LM, et al. DNA metabarcoding diet analysis for species with parapatric vs sympatric distribution: a case study on subterranean rodents. Heredity. 2015; 114(5): 525–536. pmid:25649502
  65. 65. Boyer S, Wratten SD, Holyoake A, Abdelkrim J, Cruickshank RH. Using next-generation sequencing to analyse the diet of a highly endangered land snail (Powelliphanta augusta) feeding on endemic earthworms. PLoS ONE. 2013; 8(9): e75962. pmid:24086671
  66. 66. Kohn MH, Wayne RK. Facts from feces revisited. Trends Ecol Evol. 1997; 12(6): 223–227. pmid:21238046
  67. 67. Arrizabalaga-Escudero A, Clare EL, Salsamendi E, Alberdi A, Garin I, Aihartza J, et al. Assessing niche partitioning of co-occurring sibling bat species by DNA metabarcoding. Mol Ecol. 2018; 27(5): 1273–1283. pmid:29411450
  68. 68. Farrel LE, Roman J, Sunquist ME. Dietary separation of sympatric carnivores identified by molecular analysis of scats. Mol Ecol. 2000; 9 (10): 1583–1590. pmid:11050553
  69. 69. Shehzad W, Riaz T, Nawaz MA, Miquel C, Poillot C, Shah SA, et al. Carnivore diet analysis based on next-generation sequencing: application to the leopard cat (Prionailurus bengalensis) in Pakistan. Mol Ecol. 2012; 21: 1951–1965. pmid:22250784
  70. 70. Marchesi L, Pedrini P, Sergio F. Biases associated with diet study methods in the Eurasian Eagle-Owl. J Raptor Res. 2002; 36(1): 11–16.
  71. 71. Sinclair ARE, Fryxell JM, Caughley G. Wildlife Ecology, Conservation and Management. 2nd ed. Oxford: Blackwell Publishing; 2006.
  72. 72. Sutherland WJ. The Conservation Handbook: Research, Management and Policy. 1st ed. Oxford: Blackwell Publishing; 2000.