Sequences from the DNA barcode region of the mitochondrial COI gene are an effective tool for specimen identification and for the discovery of new species. The Barcode of Life Data Systems (BOLD) (www.boldsystems.org) currently hosts 4.5 million records from animals which have been assigned to more than 490,000 different Barcode Index Numbers (BINs), which serve as a proxy for species. Because a fourth of these BINs derive from Lepidoptera, BOLD has a strong capability to both identify specimens in this order and to support studies of faunal overlap. DNA barcode sequences were obtained from 4503 moths from 329 sites across Pakistan, specimens that represented 981 BINs from 52 families. Among 379 species with a Linnaean name assignment, all were represented by a single BIN excepting five species that showed a BIN split. Less than half (44%) of the 981 BINs had counterparts in other countries; the remaining BINs were unique to Pakistan. Another 218 BINs of Lepidoptera from Pakistan were coupled with the 981 from this study before being compared with all 116,768 BINs for this order. As expected, faunal overlap was highest with India (21%), Sri Lanka (21%), United Arab Emirates (20%) and with other Asian nations (2.1%), but it was very low with other continents including Africa (0.6%), Europe (1.3%), Australia (0.6%), Oceania (1.0%), North America (0.1%), and South America (0.1%). This study indicates the way in which DNA barcoding facilitates measures of faunal overlap even when taxa have not been assigned to a Linnean species.
Citation: Ashfaq M, Akhtar S, Rafi MA, Mansoor S, Hebert PDN (2017) Mapping global biodiversity connections with DNA barcodes: Lepidoptera of Pakistan. PLoS ONE 12(3): e0174749. https://doi.org/10.1371/journal.pone.0174749
Editor: Sebastian D. Fugmann, Chang Gung University, TAIWAN
Received: January 6, 2017; Accepted: March 14, 2017; Published: March 24, 2017
Copyright: © 2017 Ashfaq et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: DNA barcode sequences included in the study are available in the following DOI: (dx.doi.org/10.5883/DS-MAMOT).
Funding: This research was enabled by grant HEC No. 20-1403/R& D/09 “Sequencing DNA Barcodes of Economically Important Insect Species from Pakistan” from the Higher Education Commission of Pakistan and by grant 106106-001 “Engaging Developing Nations in iBOL” from the International Development Research Centre in Canada. Sequence analysis was made possible by a grant from the Government of Canada through Genome Canada and the Ontario Genomics Institute in support of the International Barcode of Life (iBOL) project. This is a contribution to the Food From Thought project supported by the Canada First Research Excellence Fund. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Biodiversity inventories are a critical element of biogeographic analysis and have traditionally been based on morphological approaches [1–3]. However, molecular analysis  has the advantage of both revealing patterns of regional genetic divergence and allowing biodiversity comparisons at larger geographic and taxonomic scales [5,6]. DNA barcoding  has gained general acceptance for specimen identification and species discovery , resulting in the assembly of barcode records from nearly 500,000 animal species on the Barcode of Life Data Systems (BOLD) (www.boldsystems.org) . The use of DNA barcoding to examine genetic patterns  and to reveal biodiversity overlap in insect communities  is well documented. However, the implementation of the Barcode Index Number (BIN) system  as a proxy for species  is allowing DNA barcoding to support biodiversity assessment , providing the opportunity to accelerate biotic inventories [15–20]. As a consequence, BINs have already been used to examine animal biodiversity [21,22] at regional  and global  scales.
The order Lepidoptera has seen particularly intensive barcode analysis, work which has employed BINs to identify species , to analyze genetic structure , to discover regional species connections  and to explore biodiversity [28,29]. For example, Hausmann et al.  employed BINs to reveal genetic patterns in European geometrids, while Janzen et al.  used them to evaluate the diversity of skipper butterflies in Costa Rica. Zenker et al.  used BINs to examine the diversity of tiger moths in the Neotropics while Kekkonen & Hebert  used them to expose cryptic species complexes in Australian moths. These studies have not only validated the utility of BINs in biodiversity analysis , but have also expanded the barcode reference library for Lepidoptera which now includes one million records representing more than 116,000 BINs from 203 countries and dependent territories. As such, the Lepidoptera represent nearly a fourth of the 492,000 BINs on BOLD (accessed 23 December, 2016).
Global environmental change  and expanding human-mediated disturbance  have created the need for a better understanding of biodiversity connections at broad scales . The growing volume of DNA barcode data can address this need. The current study employs the Lepidoptera of Pakistan as a model to show how DNA barcoding can both expose regional biodiversity and enable the estimation of faunal overlap with other countries.
Materials and methods
No specific permissions were required for this study. Insects from private lands were collected only after the consent of the respective owners. The study did not involve endangered or protected species.
Specimens were collected at 329 sites across Pakistan (Fig 1) ranging in elevation from 17m to 4283m. Diverse environments were sampled from deserts in the south to agricultural lands in the central regions and forests in the northwest. Collections were made from 2010–2013 with light traps, UV illuminated sheets, Malaise traps, sweep nets and hand collections (employed for larvae). Specimens were killed in cyanide jars, and placed individually in paper envelopes before specimens were relaxed, pinned, labeled and stored in the collection at the Insect Molecular Biology Laboratory at the National Institute for Biotechnology and Genetic Engineering (NIBGE), Faisalabad. Tissue samples from identified specimens were also donated by the National Insect Museum at the National Agriculture Research Centre (NARC), by the University of Agriculture, Faisalabad, and by several amateur collectors.
Morphological identifications were performed at the NARC, at the Insect DNA Barcoding Museum (NIBGE), and at the Centre for Biodiversity Genomics (CBG) with input from several taxonomic experts (see acknowledgements). Where possible, morphological identifications were supplemented by matching barcode records from the Pakistan specimens with those already on BOLD. Specimen information, collection data, and specimen images were submitted to BOLD and are available under the dataset, DS-MAMOT which can be accessed through the following DOI (dx.doi.org/10.5883/DS-MAMOT).
A single leg was removed from each specimen with a sterile forceps and transferred to a 96-well microplate pre-loaded with 30 μl of 95% EtOH in each well. DNA extraction, PCR amplification and sequencing were performed at the Canadian Centre for DNA Barcoding (CCDB) following standard protocols (http://ccdb.ca/resources.php). PCR amplification of COI-5′ was performed with C_LepFolF and C_LepFolR (http://www.ccdb.ca/docs/CCDB_PrimerSets.pdf) primers. These primers are mixtures of LepF1  /LCO1490  and LepR1  /HCO2198 , respectively. Amplifications involved 12.5 μL reactions containing standard PCR ingredients  and 2 μL of DNA template. PCR products were analyzed on a 2% agarose E-gel® 96 system (Invitrogen Inc.) and amplicons were sequenced using the BigDye Terminator Cycle Sequencing Kit (v3.1) on an Applied Biosystems 3730XL DNA Analyzer. Sequences were assembled, aligned and edited using CodonCode Aligner (CodonCode Corporation, USA) before translation in MEGA5  to verify that they were free of stop codons before submission to BOLD. All sequences generated in this study are accessible on BOLD in the dataset DS-MAMOT which also provides GenBank accession numbers.
ClustalW nucleotide sequence alignments , genetic divergence, and neighbor-joining (NJ) clustering analysis were performed with MEGA5. The Kimura-2-Parameter (K2P)  distance model was used, along with pairwise deletion of missing sites, with nodal support estimated using 500 bootstrap replicates. Sequences meeting the quality criteria (>507 bp, <1% Ns, no stop codon or contamination flag) were assigned BINs by the Refined Single Linkage (RESL) algorithm implemented on BOLD . The RESL algorithm runs monthly on all eligible sequences in BOLD and the resulting BIN array is accessible through individual BIN pages. The presence or absence of a “barcode gap”  was determined for each species to test the reliability of its discrimination. Using the "Barcode Gap Analysis" (BGA) tool on BOLD, a species is unambiguously identifiable if its maximum intraspecific distance is less than its Nearest-Neighbor (NN) distance. The relationship between geographic distance (km) and intraspecific distance (K2P) was analyzed for each species with at least three individuals and three locations using the “Geo-Distance Correlation” tool on BOLD that employs two methods to facilitate analysis. The Mantel Test  is used to examine the significance of the relationship between the geographic distance and genetic divergence matrices on a species-by-species basis. The second analysis examines the relationship across all species by employing the Minimum Spanning Tree for the collection sites as a proxy for the geographic distribution of each species, creating a set of values (one for each species) which is regressed against the corresponding set of maximum intraspecific sequence divergences at COI . Geographic and genetic distances for all species were subsequently pooled in a single analysis enabling a scatter-plot of the two parameters with regression trendline generated in Excel.
BOLD was also searched for additional BINs of Lepidoptera from Pakistan, and the resultant 218 BINs were added to the 981 from this study before ascertaining BIN overlap with other countries. To permit this analysis, the 203 countries/ dependent territories with records were each assigned to a continent (Africa, Asia, Australia, Europe, North America, South America, Oceania) before BIN overlap was calculated in Excel and plotted using SimpleMappr (http://www.simplemappr.net/).
Nearly half of the specimens (2116/4503) (47%) were identified to a Linnaean species (by morphology) while another 1280 (28%) were placed in a genus. With one exception, the remaining 1107 (25%) specimens could be assigned to a family (S1 Table). Collectively, the specimens belong to 52 families, 108 subfamilies, 412 genera, 379 species and a total of 981 BINs (S1 Table). Most BINs (711/981) were represented by two or more individuals, but 270 were singletons (S2 Table). Table 1 lists the 52 families together with their specimen and BIN counts as well confamilial divergences (K2P). The Erebidae and Noctuidae possessed the largest number of specimens (1065, 830 respectively) and the most BINs (198, 168). By contrast, 29 families were represented by less than 10 specimens each, and eight by just a single specimen.
Sequence divergences are only reported for families with more than one BIN.
Intraspecific distances and BIN assignments for the 379 named species were calculated for the specimens from Pakistan (S3 Table). A third (35.6%) were represented by a single specimen while the rest had two or more records with Cnaphalocrocis medinalis having the highest count (71). These 379 species were assigned to 382 BINs. All species with multiple records were placed in a single BIN, barring Amyna axis, Bedellia somnulentella, Biston suppressaria, Chiasmia hebesata,and Theretra alecto, which were assigned to two, while two pairs of closely allied sphingids (Hyles chuvilini—H. stroehlei; Hippotion boerhaviae—H. rosetta) were each assigned to a single BIN. The species-BIN association was further supported by NJ clustering (S1 Fig). Excluding six species, Nebrarctia transversa (2.2%), Amyna axis (2.3%), Theretra alecto (2.3%), Odontopera muscularia (2.3%), Diaphania indica (2.5%), and Scirpophaga excerptalis (3.3%), the maximum intraspecific distance for all species was less than 2%. The maximum intraspecific distance for each species was lower than its NN distance (Fig 2A and 2B) and these values did not increase significantly with sample size (Fig 2C). However, as expected, divergences increased with taxonomic rank (Table 2) as conspecific divergences averaged 0.2% (range 0.0–3.3%) while congeneric (interspecific) (mean = 6.4%; range 2.7–16%) and confamilial (intergeneric) (mean = 11.3%; range 4.1–22.1%) were much higher. The inclusion of taxa unidentified to a species increased the mean confamilial divergence for 13 families and extended its range in the Crambidae and Erebidae (Table 1). When records from Pakistan were combined with those for conspecifics from other countries, there was more evidence of deep divergence (S3 Table). Intraspecific distances exceeded 2% in 89 species with particularly high values in Coleophora trifolii (K2P = 16.7%) and Emmelina monodactyla (K2P = 16.3%). Similarly, the incidence of BIN splits increased with 59 species assigned to two BINs, 19 to three BINs, 5 to four BINs, and one species (Glyphodes onychinalis) to six BINs.
This analysis only considers specimens that were assigned to a Linnaean species.
S3 Table reports the results from the Geo-Distance-Correlation analysis for the 237 species with more than three barcode records. The K2P distances for conspecifics from different regions ranged from 0.2% - 16.7% while geographic distances between collection sites ranged from 23–19,665 km. The Mantel Test (R2) for 148 of these species was <0.5, indicating no significant relationship between genetic and geographic distances (S3 Table). However, the scatter-plot of the pooled geographic and genetic distances for all 237 species indicated a weak relationship (R2 = 0.15) between the two measures (Fig 3).
Vertical downward arrow (red) indicates the percentage of species with less or more than 2% divergence across their range.
About 44% (427/981) of the BINs from Pakistan were shared with other countries while the remaining so far only possess records from this nation (S2 Table). Among the 1199 BINs of Pakistan Lepidoptera (218 from prior studies + 981 from this study), there was 21% overlap with the 856 BINs from India (21% of 856 BINs), Sri Lanka (21% of 86) or United Arab Emirates (20% of 436) and with other Asian nations (2.1% of 22,657). BIN sharing was far lower with nations on other continents including Africa (0.6% of 18,665), Australia (0.6% of 15,028), Europe (1.3% of 8,126), North and Central America (including the Caribbean) (0.1% of 31,283), Oceania (1.0% of 5189), and South America (0.1% of 19,863). Fifty-seven countries/ dependent territories, five with many (>500) BINs of Lepidoptera, did not share any species with Pakistan (Fig 4; S4 Table).
This study represents the first step toward the development of a DNA barcode reference library for the moths of Pakistan based upon the analysis of 4503 specimens which included representatives of 981 BINs (= proxy for species) from 52 families. Although about 300 species of butterflies are known from Pakistan [27,45,46], diversity estimates are only available for a few moth families [47–49]. As a first approximation, we hypothesize that about 8,000–10,000 species of Lepidoptera occur in Pakistan so barcode records cover just a tenth of the fauna. Among the 134 recognized families of moths (, www.boldsystems.org/), 52 were represented in this study. The Erebidae and Noctuidae dominated the collections with both the highest number of samples and BINs, an expected result since along with Geometridae  these two families are known for their species richness . Only 47% of the specimens could be identified to a valid species underscoring the challenge in using traditional taxonomic approaches to evaluate biodiversity in regions that lack well-developed taxonomic resources.
Intraspecific divergences were greater than 2% in just six of the 244 named species represented by two or more specimens. Even including these six taxa with deep intraspecific divergence, conspecific divergence was always less than the NN distance. This result mirrors that from other large-scale analyses on Lepidoptera . For example, Hebert et al.  found that only 9 of 1327 species from eastern North America showed intraspecific divergences above 1.5%. Another study which examined 1004 species from two distant European sites (1600 km apart) found that most (88%) of the shared species showed less than 2% maximum divergence .
The BIN system supported the genetic distinctiveness of the moth species from Pakistan as it assigned the 379 identified species to 382 BINs, mirroring results for the butterflies of this nation as 79 of 81 species were assigned to a unique BIN . Similar congruence between BINs and morphological species has also been established in other studies on Lepidoptera . For example, 90% of 92 Gelechiinae species from Finland  and 94% of 286 species of tiger moths from Brazil  were congruent with BINs. A similar pattern was reported for 3514 species of central European beetles as there was a 92% correspondence between morphospecies and BINs .
The integration of sequences from Pakistan with those from conspecifics from other regions substantially increased the frequency of deep sequence divergence; 89 species showed more than 2% intraspecific (maximum) distance and BIN splits were detected in 84 species. In fact, intraspecific divergence reached 16.7% in Coleophora trifolii, while specimens of Glyphodes onychinalis were assigned to six BINs. Prior studies have also reported cases of high intraspecific divergence and resultant BIN splits in Lepidoptera [57,58]. For example, 8% of the 1541 species of Noctuoidea analyzed from sites across Canada showed >2% intraspecific divergence and a similar incidence of BIN splits . Likewise, 12.4% of European lepidopterans (124/1004) possessed more than 2% sequence divergence at COI . Such cases may often reflect overlooked cryptic species , or simply misidentifications but this can only be confirmed by detailed taxonomic studies . The coupling of cryptic species complexes with the prevalence of misidentified specimens represent important barriers to the analysis of species overlap at larger geographic scales.
Prior barcode studies have also revealed cases where different species show such low divergence that they are assigned to the same BIN [13,25,26]. In the present study, just two species pairs, Hyles chuvilini—H. stroehlei and Hippotion boerhaviae—H. rosetta, fell into this category. By comparison, Zahiri et al.  found that 21 species of 1541 Canadian noctuoid species shared their BIN with at least one other species, while Huemer et al.  found that 16 of 1004 lepidopteran species in Europe shared the BINs.
Although a study on one genus of aquatic beetles suggested that geographic variation in barcode sequences represents an important complication for barcode analysis , other investigations have not supported this conclusion. In their study on 1004 species of European Lepidoptera, Huemer et al.  found an increase in intraspecific divergence with distance, but it was so small in relation to NN distances that it rarely compromised identifications. A weak relationship between geographical distance and intraspecific divergence was also noted in the present study, as in earlier work on the butterflies of Pakistan, but again species assignments were not impeded . As a consequence, it seems fair to conclude that while increased geographic coverage narrows the barcode gap, it rarely complicates species discrimination [61,62].
The order Lepidoptera includes about 180,000 described species with perhaps another 300,000 awaiting description [50,51]. Because barcode coverage is now available for 116,000 BINs, this order provides a good opportunity to evaluate how DNA barcoding can aid understanding of global biodiversity patterns. When distribution patterns for the 1199 BINs of this order from Pakistan were compared with all records from other regions, they indicated strong regional endemism as 21% of the BINs were shared with India and 2.1% with Asia while overlap with the faunas on other continents was just 0.1–1.3%. This result points towards a stronger regional biodiversity divergence, a conclusion supported by other studies [27,63]. However, the presence of unreported conspecifics in adjoining nations (Afghanistan, China, India, Iran) whose faunas have seen little barcode analysis may alter this assertion. Biodiversity loss  and the slow pace of conventional approaches to the documentation of the global fauna have created an urgent need for robust, rapid approaches for biodiversity analysis. The BIN system represents a partial solution as it circumvents the taxonomic impediment [12,19], enabling the automated comparisons of faunal overlap . When coupled with access to low-cost sequencing methods , DNA barcoding represents an enabler for the biodiversity assessments needed to support conservation programs .
S1 Fig. NJ analysis of moth species from Pakistan based on the analysis of 2116 COI sequences derived from 379 species.
Bootstrap values (>50%) (500 replicates) are shown above the branches. The scale bar shows K2P distances. The node for each species with multiple specimens is collapsed to a vertical line or triangle, with the horizontal depth indicating the level of intraspecific divergence. Analyses were conducted in MEGA5.
S1 Table. Records for moths from Pakistan with BINs and morphological identification.
S2 Table. Unique (Pakistan-only), non-unique (shared with other countries) and singleton Lepidoptera BINs from Pakistan.
S3 Table. Barcode Index Numbers (BINs), maximum intraspecific distances (K2P) and geo-distance correlation analysis for the identified species of moths from Pakistan and their conspecifics from other countries with public records on the Barcode of Life Data Systems (www.boldsystems.org).
S4 Table. Number of Lepidoptera BINs from other countries and continents shared with Pakistan.
We thank Axel Hausmann, Mari Kekkonen, Rodolphe Rougerie and Reza Zahiri for aiding with taxonomic assignments. We thank Axel Hausmann, Jerome Moriniere and one anonymous reviewer for their comments on an earlier version of this manuscript.
- Conceptualization: MA SM PDNH.
- Data curation: MA PDNH.
- Formal analysis: MA.
- Funding acquisition: MA SM PDNH.
- Investigation: MA SA MAR.
- Methodology: MA SA.
- Project administration: MA SM.
- Resources: MA SM PDNH MAR.
- Supervision: MA SM PDNH.
- Validation: MA PDNH.
- Visualization: MA.
- Writing – original draft: MA SA.
- Writing – review & editing: MA PDNH.
- 1. De Broyer C, Danis B. How many species in the Southern Ocean? Towards a dynamic inventory of the Antarctic marine species. Deep-Sea Res Pt II. 2011; 58: 5–17.
- 2. Joschko M, Fox CA, Lentzsch P, Kiesel J, Hierold W, Kruck S, et al. Spatial analysis of earthworm biodiversity at the regional scale. Agr Ecosyst Environ. 2006; 112:367–380.
- 3. Meza-Joya FL, Torres M. Spatial diversity patterns of Pristimantis frogs in the tropical Andes. Ecol Evol. 2016; 6:1901–1913. pmid:26929819
- 4. Drummond AJ, Newcomb RD, Buckley TR, Xie D, Dopheide A, Potter BCM, et al. Evaluating a multigene environmental DNA approach for biodiversity assessment. GigaScience. 2015; 4:46. pmid:26445670
- 5. Murray GGR, Wang F, Harrison EM, Paterson GK, Mather AE, Harris SR, et al. The effect of genetic structure on molecular dating and tests for temporal signal. Methods Ecol Evol. 2016; 7:80–89. pmid:27110344
- 6. Rosetti N, Remis MI. Spatial genetic structure and mitochondrial DNA phylogeography of Argentinean populations of the grasshopper Dichroplus elongatus. PLoS ONE. 2012; 7:e40807. pmid:22859953
- 7. Hebert PDN, Cywinska A, Ball SL, deWaard JR. Biological identifications through DNA barcodes. Proc Biol Sci. 2003; 270:313–321. pmid:12614582
- 8. Gwiazdowski RA, Foottit RG, Maw HEL, Hebert PDN. The Hemiptera (Insecta) of Canada: constructing a reference library of DNA barcodes. PLoS ONE. 2015; 10: e0125635. pmid:25923328
- 9. Ratnasingham S, Hebert PDN. BOLD: The Barcode of Life Data System (http://www.barcodinglife.org). Mol Ecol Notes. 2007; 7:355–364. pmid:18784790
- 10. Zhou X, Jacobus LM, DeWalt RE, Adamowicz SJ, Hebert PDN. Ephemeroptera, Plecoptera, and Trichoptera fauna of Churchill (Manitoba, Canada): insights into biodiversity patterns from DNA barcoding. J North Am Benthological Soc. 2010; 29:814–837.
- 11. Novotny V, Miller SE, Hulcr J, Drew RA, Basset Y, Janda M, et al. Low beta diversity of herbivorous insects in tropical forests. Nature. 2007; 448:692–695. pmid:17687324
- 12. Ratnasingham S, Hebert PDN. A DNA-based registry for all animal species: the Barcode Index Number (BIN) System. PLoS ONE. 2013; 8:e66213. pmid:23861743
- 13. Zahiri R, Lafontaine JD, Schmidt BC, Dewaard JR, Zakharov EV, Hebert PDN, et al. A transcontinental challenge—a test of DNA barcode performance for 1,541 species of Canadian Noctuoidea (Lepidoptera). PLoS ONE. 2014; 9:e92797. pmid:24667847
- 14. Gibson J, Shokralla S, Porter TM, King I, van Konynenburg S, Janzen DH, et al. Simultaneous assessment of the macrobiome and microbiome in a bulk sample of tropical arthropods through DNA metasystematics. Proc Natl Acad Sci USA. 2014; 111: 8007–8012. pmid:24808136
- 15. Janzen DH, Hajibabaei M, Burns JM, Hallwachs W, Remigio E, Hebert PDN, et al. Wedding biodiversity inventory of a large and complex Lepidoptera fauna with DNA barcoding. Philos Trans R Soc Lond B Biol Sci. 2005; 360:1835–1845. pmid:16214742
- 16. Ashfaq M, Hebert PDN, Mirza JH, Khan AM, Zafar Y, Mirza MS. Analyzing mosquito (Diptera: Culicidae) diversity in Pakistan by DNA barcoding. PLoS ONE. 2014; 9:e97268. pmid:24827460
- 17. Deharveng L, Bedos A, Daugeron C, Villemant C, Judson MLI. Organization, usefulness and limitations of an ATBI (All Taxa Biodiversity Inventory): the inventory of terrestrial invertebrates in the Mercantour National Park. Zoosystema. 2015; 37:9–30.
- 18. Brehm G, Hebert PDN, Colwell RK, Adams M-O, Bodner F, Friedemann K, et al. Turning up the heat on a hotspot: DNA barcodes reveal 80% more species of geometrid moths along an Andean elevational gradient. PLoS ONE. 2016; 11:e0150327. pmid:26959368
- 19. Hebert PDN, Ratnasingham S, Zakharov EV, Telfer AC, Levesque-Beaudin V, Milton MA, et al. Counting animal species with DNA barcodes: Canadian insects. Philos Trans R Soc B: Biol Sci. 2016; 371.
- 20. Wirta H, Varkonyi G, Rasmussen C, Kaartinen R, Schmidt NM, Hebert PDN, et al. Establishing a community-wide DNA barcode library as a new tool for arctic research. Mol Ecol Resour. 2016; 16:809–822. pmid:26602739
- 21. Schmidt S, Schmid-Egger C, Moriniere J, Haszprunar G, Hebert PDN. DNA barcoding largely supports 250 years of classical taxonomy: identifications for Central European bees (Hymenoptera, Apoidea partim). Mol Ecol Resour. 2015; 15:985–1000. pmid:25588628
- 22. Zenker MM, Rougerie R, Teston JA, Laguerre M, Pie MR, Freitas AVL, et al. Fast census of moth diversity in the Neotropics: a comparison of field-assigned morphospecies and DNA barcoding in tiger moths. PLoS ONE. 2016; 11:e0148423. pmid:26859488
- 23. Chambers EA, Hebert PDN. Assessing DNA barcodes for species identification in North American reptiles and amphibians in natural history collections. PLoS ONE. 2016; 11:e0154363. pmid:27116180
- 24. Ashfaq M, Prosser S, Nasir S, Masood M, Ratnasingham S, Hebert PDN, et al. High diversity and rapid diversification in the head louse, Pediculus humanus (Pediculidae: Phthiraptera). Sci Rep. 2015; 5:14188. pmid:26373806
- 25. Huemer P, Mutanen M, Sefc KM, Hebert PDN. Testing DNA barcode performance in 1000 species of European Lepidoptera: large geographic distances have small genetic impacts. PLoS ONE. 2014; 9:e115774. pmid:25541991
- 26. Hausmann A, Godfray HC, Huemer P, Mutanen M, Rougerie R, van Nieukerken EJ, et al. Genetic patterns in European geometrid moths revealed by the Barcode Index Number (BIN) system. PLoS ONE. 2013; 8:e84518. pmid:24358363
- 27. Ashfaq M, Akhtar S, Khan AM, Adamowicz SJ, Hebert PDN. DNA barcode analysis of butterfly species from Pakistan points towards regional endemism. Mol Ecol Resour. 2013; 13:832–843. pmid:23789612
- 28. Telfer AC, Young MR, Quinn J, Perez K, Sobel CN, Sones JE, et al. Biodiversity inventories in high gear: DNA barcoding facilitates a rapid biotic survey of a temperate nature reserve. Biodivers Data J. 2015; 30:2015.
- 29. Wilson J-J, Jisming-See S-W, Brandon-Mong G-J, Lim A-H, Lim V-C, Lee P-S, et al. Citizen science: the first Peninsular Malaysia butterfly count. Biodivers Data J. 2015; 3:e7159.
- 30. Janzen DH, Hallwachs W, Burns JM, Hajibabaei M, Bertrand C, Hebert PDN, et al. Reading the complex skipper butterfly fauna of one tropical place. PLoS ONE. 2011; 6:e19874. pmid:21857895
- 31. Kekkonen M, Hebert PDN. DNA barcode-based delineation of putative species: efficient start for taxonomic workflows. Mol Ecol Resour. 2014; 14:706–715. pmid:24479435
- 32. Janzen D, Hallwachs W. DNA barcoding the Lepidoptera inventory of a large complex tropical conserved wildland, Area de Conservacion Guanacaste (ACG), northwestern Costa Rica. Genome. 2016; 59(9):641–660. pmid:27584861
- 33. Finn DS, Zamora-Munoz C, Murria C, Sainz-Bariain M, Alba-Tercedor J. Evidence from recently deglaciated mountain ranges that Baetis alpinus (Ephemeroptera) could lose significant genetic diversity as alpine glaciers disappear. Freshwater Sci. 2016; 33:207–216.
- 34. McKinney LA, Fulkerson GM, Kick EL. Investigating the correlates of biodiversity loss: a cross-national quantitative analysis of threatened bird species. Hum Ecol Rev. 2009; 16:103–113.
- 35. Landry J-F, Nazari V, deWaard JR, Mutanen M, Lopez-Vaamonde C, Huemer P, et al. Shared but overlooked: 30 species of Holarctic Microlepidoptera revealed by DNA barcodes and morphology. Zootaxa. 2013; 3749:1–93. pmid:25113597
- 36. Hebert PDN, Penton EH, Burns JM, Janzen DH, Hallwachs W. Ten species in one: DNA barcoding reveals cryptic species in the neotropical skipper butterfly Astraptes fulgerator. Proc Natl Acad Sci USA. 2004; 101:14812–14817. pmid:15465915
- 37. Folmer O, Black M, Hoeh W, Lutz R, Vrijenhoek R. DNA primers for amplification of mitochondrial cytochrome c oxidase subunit I from diverse metazoan invertebrates. Mol Mar Biol Biotechnol. 1994; 3:294–299. pmid:7881515
- 38. Hebert PDN, Dewaard JR, Zakharov EV, Prosser SW, Sones JE, McKeown JTA, et al. A DNA 'barcode blitz': rapid digitization and sequencing of a natural history collection. PLoS ONE. 2013; 8:e68535. pmid:23874660
- 39. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011; 28:2731–2739. pmid:21546353
- 40. Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994; 22:4673–4680. pmid:7984417
- 41. Kimura M. A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol. 1980; 16:111–120. pmid:7463489
- 42. Meyer CP, Paulay G. DNA barcoding: error rates based on comprehensive sampling. PLoS Biol. 2005; 3:e422. pmid:16336051
- 43. Mantel N. The detection of disease clustering and a generalized regression approach. Cancer Res. 1967; 27:209–220. pmid:6018555
- 44. Blagoev GA, deWaard JR, Ratnasingham S, deWaard SL, Lu L, Robertson J, et al. Untangling taxonomy: a DNA barcode reference library for Canadian spiders. Mol Ecol Resour. 2016; 16:325–341. pmid:26175299
- 45. Roberts TJ. The butterflies of Pakistan. Oxford University Press, Karachi. 2001; 290p.
- 46. Perveen F, Khan A. Checklist of butterfly fauna from Kabal, Swat, Pakistan. J Advances Biol. 2013; 2:115–121.
- 47. Kamaluddin S, Younus MF, Attique T. A cladistic analysis of the family Sphingidae (Lepidoptera) from Pakistan and Azad Kashmir. FUUAST J Biol. 2014; 4:157–172.
- 48. Rafi MA, Sultan A, Kitching IJ, Pittaway AR, Markhasiov M, Khan MR, et al. The hawkmoth fauna of Pakistan (Lepidoptera: Sphingidae). Zootaxa. 2014; 3794:393–418. pmid:24870331
- 49. Yakovlev RV, Witt TH. Cossidae (Lepidoptera) of Pakistan. Biol Bull Bogdan Chmelnitskiy Melitopol State Pedagogical University. 2016; 6:67–76.
- 50. Capinera JL. Butterflies and moths. Encyclopedia Entomol. 2008; 4:626–672.
- 51. van Nieukerken EJ, Kaila L, Kitching IJ, Kristensen NP, Lees DC, Minet J, et al. Order Lepidoptera Linnaeus, 1758. In Zhang Z-Q. (Ed.), Animal Biodiversity: An outline of higher-level classification and survey of taxonomic richness. Zootaxa. 2011; 3148:212–221.
- 52. Zahiri R, Holloway JD, Kitching IJ, Lafontaine JD, Mutanen M, Wahlberg N. Molecular phylogenetics of Erebidae (Lepidoptera, Noctuoidea). Syst Entomol. 2012; 37: 102–124.
- 53. Hajibabaei M, Janzen DH, Burns JM, Hallwachs W, Hebert PDN. DNA barcodes distinguish species of tropical Lepidoptera. Proc Natl Acad Sci USA. 2006; 103:968–971. pmid:16418261
- 54. Hebert PDN, deWaard JR, Landry J-F. DNA barcodes for 1/1000 of the animal kingdom. Biol Lett. 2009; 23:359–362.
- 55. Kekkonen M, Mutanen M, Kaila L, Nieminen M, Hebert PDN. Delineating species with DNA barcodes: a case of taxon dependent method performance in moths. PLoS ONE. 2015; 10:e0122481. pmid:25849083
- 56. Hendrich L, Morinière J, Haszprunar G, Hebert PDN, Hausmann A, Köhler F, et al. A comprehensive DNA barcode database for Central European beetles with a focus on Germany: adding more than 3500 identified species to BOLD. Mol Ecol Resour. 2015; 15:795–818. pmid:25469559
- 57. Niu YQ, Nansen C, Li XW, Liu TX. Geographical variation of Plutella xylostella (Lepidoptera: Plutellidae) populations revealed by mitochondrial COI gene in China. J Appl Entomol. 2014; 138:692–700.
- 58. Ashfaq M, Hebert PDN. DNA barcodes for bio-surveillance: regulated and economically important arthropod plant pests. Genome. 2016; 59:933–945. pmid:27753511
- 59. Bortolus A. Error cascades in the biological sciences: the unwanted consequences of using bad taxonomy in ecology. Ambio. 2008; 37:114–118. pmid:18488554
- 60. Bergsten J, Bilton DT, Fujisawa T, Elliott M, Monaghan MT, Balke M, et al. The effect of geographical scale of sampling on DNA barcoding. Syst Biol. 2012; 61:851–869. pmid:22398121
- 61. Lukhtanov VA, Sourakov A, Zakharov EV, Hebert PDN. DNA barcoding Central Asian butterflies: increasing geographical dimension does not significantly reduce the success of species identification. Mol Ecol Resour. 2009; 9:1302–1310. pmid:21564901
- 62. Candek K, Kuntner M. DNA barcoding gap: reliable species identification over morphological and geographical scales. Mol Ecol Resour. 2015; 15:268–277. pmid:25042335
- 63. Werneck FP, Nogueira C, Colli GR, Sites JW, Costa GC. Climatic stability in the Brazilian Cerrado: implications for biogeographical connections of South American savannas, species richness and conservation in a biodiversity hotspot. J Biogeogr. 2012; 39:1695–1706.
- 64. Rohland N, Reich D. Cost-effective, high-throughput DNA sequencing libraries for multiplexed target capture. Genome Res. 2012; 22:939–946. pmid:22267522
- 65. Myers N, Mittermeier RA, Mittermeier CG, da Fonseca GAB, Kent J. Biodiversity hotspots for conservation priorities. Nature. 2000; 403:853–858. pmid:10706275