The concept of DNA barcoding for species identification has gained considerable momentum in animals because of fairly successful species identification using cytochrome oxidase I (COI). In plants, matK and rbcL have been proposed as standard barcodes. However, barcoding in complex genera is a challenging task.
Methodology and Principal Findings
We investigated the species discriminatory power of four reportedly most promising plant DNA barcoding loci (one from nuclear genome- ITS, and three from plastid genome- trnH-psbA, rbcL and matK) in species of Indian Berberis L. (Berberidaceae) and two other genera, Ficus L. (Moraceae) and Gossypium L. (Malvaceae). Berberis species were delineated using morphological characters. These characters resulted in a well resolved species tree. Applying both nucleotide distance and nucleotide character-based approaches, we found that none of the loci, either singly or in combinations, could discriminate the species of Berberis. ITS resolved all the tested species of Ficus and Gossypium and trnH-psbA resolved 82% of the tested species in Ficus. The highly regarded matK and rbcL could not resolve all the species. Finally, we employed amplified fragment length polymorphism test in species of Berberis to determine their relationships. Using ten primer pair combinations in AFLP, the data demonstrated incomplete species resolution. Further, AFLP analysis showed that there was a tendency of the Berberis accessions to cluster according to their geographic origin rather than species affiliation.
We reconfirm the earlier reports that the concept of universal barcode in plants may not work in a number of genera. Our results also suggest that the matK and rbcL, recommended as universal barcode loci for plants, may not work in all the genera of land plants. Morphological, geographical and molecular data analyses of Indian species of Berberis suggest probable reticulate evolution and thus barcode markers may not work in this case.
Citation: Roy S, Tyagi A, Shukla V, Kumar A, Singh UM, Chaudhary LB, et al. (2010) Universal Plant DNA Barcode Loci May Not Work in Complex Groups: A Case Study with Indian Berberis Species. PLoS ONE 5(10): e13674. doi:10.1371/journal.pone.0013674
Editor: Simon Joly, Montreal Botanical Garden, Canada
Received: December 11, 2009; Accepted: October 6, 2010; Published: October 27, 2010
Copyright: © 2010 Roy et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: Funding from Department of Biotechnology, Government of India is acknowledged. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
DNA barcoding is the use of short DNA sequences for species identification. Since its inception as an approach for large scale species identification –, several studies have reported the application of COI in a wide range of animal taxa , –. However, the attempts to identify a single locus for barcoding in plants have largely been unsuccessful , . There are growing evidences which suggest the need of deploying more than one locus for barcoding in plants –. The following regions have been suggested for plant DNA barcoding: ITS (the internal transcribed spacer region of the nuclear ribosomal genes), rbcL and psbA-trnH ; ITS and psbA-trnH ; rbcL ; rpoC1, rpoB and matK or rpoC1, matK and psbA-trnH ; matK, atpF-atpH and psbA-trnH or matK and psbK-psbI , psbA-trnH and rbcL ; psbA-trnH ; trnLUAA ; matK and psbA-trnH , matK and rbcL  matK, rbcL and trnH-psbA  and ITS2 , .
In most of the recent plant barcoding studies, the coding regions of matK and rbcL and the non-coding plastid intergenic spacer of trnH-psbA have been suggested as prime candidates for barcoding , . Following the first suggestion by Kress et al. (2005) , several subsequent reports projected trnH-psbA as a strong candidate for plant barcoding –, . However, Consortium for the Barcoding of Life (CBOL) disregarded trnH-psbA as it does not consistently provide bidirectional unambiguous sequencing reads . Erstwhile studies have focused predominantly on plastid regions for barcoding. Chase et al. (2005)  and Kress et al. (2005)  recovered highest mean percentage sequence divergence (2.81 and 5.7% respectively) for nrITS region for plant barcoding. However, the use of ITS region as barcode locus has often been considered unfavorable because of the presence of paralogs in several plant taxa. Yet, in other studies, ITS has been successfully used as barcode locus , , . More recently, ITS2 has been projected as an important plant barcode locus , . We examined four DNA barcoding loci (one nuclear- nrITS and three plastid loci- trnH-psbA, rbcL and matK) in 16 species of Berberis L. (Berberidaceae) from India. For validation of the techniques, we tested these loci in selected species of two other genera, Ficus L. (Moraceae), comprises keystone species in tropical rain forest ecosystems and Gossypium L. (Malvaceae), a pan tropical genus including the commercial cotton plants cultivated widely on the tropical and subtropical regions throughout the world.
The genus Berberis comprises of about 500 species , . Based on phytogeographic distribution Schneider (1905) divided the species of Berberis into two groups, Septentrionales and Australes, . The group Septentrionales (Old World) consists of ca 300 species occurring mainly in Eurasia but extending to North America (two species) and North Africa (four species). The group Australes (New World) contains about 200 species with most of them distributed in South America and a few in Middle America , . These geographical groups are supported by grouping based on morphological characters . A recent molecular study based on the internal transcribed spacer (ITS) sequences, supports the treatment of these two groups within Berberis . In subdividing these groups, Ahrendt (1961) accepted the schemes of Schneider (1905) and recognized 29 sections with some modifications. Ahrendt (1961) further subdivided these sections into numerous subsections. In India, Berberis is represented by 55 species , which according to Ahrendt (1961) belong to 8 sections and 7 subsections . The majority of the species are centered in the Himalayan region extending from Pakistan to Western China and to Central and Southern China. The 16 species selected for the present study represent wide geographical distribution across India and belong to 5 sections and subsections (Table S1). The detailed morpho-taxonomic characters of these sections and subsections were described by Ahrendt (1961) . The geographical locations of the selected species are indicated in Figure 1. Some species of Berberis are known for high medicinal value because of the presence of alkaloids, principally ‘berberine’ , which show activity against cholera, diarrhea, amoebiasis, malaria and leishmaniasis . Some species of Berberis give a high value wood dye while some others provide edible berries.
Circles indicate the regions from where species were collected. Exact GPS data are shown in Table S1. Region A (western Himalayas): B. aristata, B. asiatica, B. chitria, B. glaucocarpa, B. jaeschkeana, B. lycium, B. pachyacantha, B. umbellata; Region B (Eastern Himalayas): B. angulosa, B. griffithiana, B. insignis, B. macrosepala, B. replicata; Region C (Satpura - Central India): B. Hainesii; Region D (Nilgiri Hills-Southern India): B. tinctoria, B. wightiana; Region E (Gujrat): G. herbaceum, G. arboreum Region F (Assam): G. barbadense; Region G (Karnataka): G. herbaceum, G. arboretum, G. hirsutum Region H (Andhra Pradesh): G. hirsutum; Region I (Tamil Nadu): G. barbadense; Region J (Uttar Pradesh): Ficus species.
The taxonomy of Berberis is somewhat uncertain . For example, Orisi (1984) considered 17 Patagonian species in Argentina , where as Landrum (1999) synonymized several of these species and recognized only nine . Complexity in Berberis taxonomy has been attributed to hybridization and some degree of introgression in transitional zones which produce intermediate forms . Out of 500 species of Berberis, Ahrendt (1961) recorded as many as 70 species and infraspecific taxa with a suspected hybrid origin . Based on molecular phylogeny of Berberis species and previous taxonomic treatment of Landrum (1999) , Kim et al (2004) questioned the status of most sections and subsections in this genus . Although taxonomic revision of Indian Berberis, based on morpho-taxonomic parameters has been proposed , no attempt has been made to establish the species delimitation in Indian Berberis using molecular approaches. In this study, we examined the application of standard plant DNA barcode loci in this challenging group considering Indian species of Berberis.
In order to test the universality of the standard barcode loci we applied these loci to Ficus and Gossypium. Although, taxonomically the genus Ficus is considered to be quite difficult, it forms distinct natural groups in which many species are very common and conspicuous and can easily be identified even with sterile specimens . Ficus consists of about 1000 species of woody trees, shrubs, vines, epiphytes and hemi- epiphytes , occurring in most tropical and subtropical forests thought the world. In India, the genus comprises about 100 species which are distributed throughout the country with maximum diversity in Western Ghats and North Eastern India , . The candidate species of Ficus selected in the present study belong to three subgenera and six sections (Table S2) . Earlier phylogenetic study using ITS sequences indicates that three subgenera of Ficus studied here are monophyletic . The four species of Gossypium considered here are well studied both taxonomically and at molecular diversity level elsewhere , .
morphometric analysis, 3–5 representative accessions from each species of Berberis were studied, as described in material and methods. The character matrix thus developed is shown in Table S3. A consensus parsimony tree was developed using the matrix. The cladogram showed clear segregation of the accessions into distinct species clades (Figure S1).
For DNA barcoding, we examined 129 DNA sequences for ITS (GenBank accession numbers GU934610-GU934738), 78 for matK (GenBank accession numbers GU934739-GU934816), 97 for rbcL (GenBank accession numbers GU934817-GU934913), and 83 for trnH-psbA (GenBank accession numbers GU934914-GU934996) representing 16 species of Indian Berberis. PCRs were generally successful with all the four loci. The maximum success in PCR was observed with rbcL and ITS (97%), followed by trnH-psbA (92%) and matK (76%). Sequencing success ranged from 95% for rbcL to 85% for matK (Table 1). The alignment of sequences was straight forward except in case of trnH-psbA, due to high variation in sequence length. The mean sequence lengths of ITS (ITS1+5.8S+ITS2), matK, rbcL and trnH-psbA were 602.2, 488.1, 479.0 and 410.0 bp, respectively. The corresponding percentage frequencies of parsimony informative characters were 4.9, 1.8, 0.6 and 2.6 and the percentage variable sites were 6.1, 3.0, 0.8, and 3.2, respectively (Table 1). The genetic divergence within and between species was calculated. The highest mean intraspecific divergence was obtained in ITS and the lowest mean inter- and intraspecific divergence was obtained in case of rbcL (Table 1). ANOVA test showed ITS and trnH-psbA as the most divergent barcode loci at interspecific level followed by matK and rbcL (Table S4 A). At intraspecific level, rbcL was the least and ITS was the most divergent locus (Table S4 B). In multilocus analysis, ITS+trnH-psbA provided the highest divergence at interspecific level as compared to the two, three and four loci combinations (Table S5 A). At intra specific level, there was no significant difference in the sequence divergence between all combinations (Table S5 B). To evaluate the barcoding gap we looked at the minimum inter- and maximum intraspecific divergences for each locus. No distinct barcoding gap was noticed in any of the four loci (Figure 2, Table S6). To detect paralogs of ITS, if any, we cloned the PCR product from eight randomly selected species and sequenced at least eight clones from each species. None of the species showed the presence of multiple copies as evident from sequence data (GenBank accession numbers HM347877 to HM347940).
Sequence divergence for different gene regions in different species of Berberis. X-axis: maximum intraspecific, Y-axis: minimum interspecific K2P distances.
Phylogenetic methods were applied using each barcode locus taken alone and in combinations to evaluate species recovery. The NJ, MP and UPGMA methods were used for both single locus (Table 2 and Figure S2) and multilocus analysis with 500 bootstrap replicates (Table 2 and Figure S3). When all the sequences for a given locus were considered, ITS, matK and trnH-psbA were able to form species specific clade only in case of B. pachyacantha. Not a single species was recovered with rbcL using any of the three methods. The clades formed in the trees were mostly mixtures of several species. While the species specific clades in ITS, trnH-psbA and matK trees resolved with high bootstrap confidence levels (76–99%), the internal branches of the non species-specific clades with mixtures of species showed low bootstrap support (0 to 65%). A data set of 58 sequences representing 13 species was prepared to have at least three common accessions (except in B. angulosa) which were sequenced using all the four loci. In this data set, the species recovery and their bootstrap support increased for all the four loci (Table 2 and Figure S4). In multilocus analysis, we used combinations of two, three and four loci to see if species recovery was better than in single locus analysis (Table 2, Figure S3). In two loci combination, highest species recovery was observed with ITS+trnH-psbA and ITS+matK at 30.8% followed by matK+rbcL at 23.1% and ITS+rbcL at 15.4%. All three-locus combinations yielded maximum species recovery at 30.8% except ITS+trnH-psbA+rbcL which yielded 23.1% species recovery. The four loci combination did not provide better species recovery as compared to the best performing two or three loci combinations. We did not find any significant difference using three methods of phylogenetic tree construction with single locus analysis as far as recovery of species in Berberis is concerned. However, in some cases of multilocus analysis, NJ method provided better species recovery as compared to MP and UPGMA.
Character-based method was applied for species delineation as an alternative to the genetic divergence based approach, as diagnostic characters prevent the loss of information characteristic to distance approach , . Using simple (characters which are confined to a single nucleotide position) or compound (combined states at multiple nucleotide positions) diagnostic characters, it was observed that except in B. pachyacantha, none of the loci showed unique diagnostic character(s) to distinguish species in Berberis (Table 3). Species discrimination was also calculated using the criteria reported by CBOL plant working group  i.e. discrimination was considered successful, if the minimum interspecific K2P (Kimura-2-parameter) distance involving a species was larger than its maximum intraspecific K2P distance. No one species but B. pachyacantha exhibited minimum interspecific K2P distance higher than the maximum intraspecific K2P distances with ITS sequences and trnH-psbA. Other two loci did not meet these criteria in any of the species in Berberis (Table S7). The ratios of interspecific and intraspecific K2P distances were calculated for all the loci. ITS exhibited the highest inter to intraspecific K2P distances ratio, followed by trnH-psbA (Figure 3A).
Ratios could not be determined in some species where there was either single accession or sequencing failure in some loci.
Ficus and Gossypium
We tested all the four loci in Ficus and Gossypium to validate applicability of barcoding loci in unrelated genera. In all, we analyzed 33 accessions representing 11 species of Ficus (GenBank accession numbers ITS; HM368181-HM368213, matk; GU935030-GU935054, rbcL; GU935055-GU935086, trnH-psbA; GU935087-GU935117) and 51 accessions representing four species of Gossypium (GenBank accession numbers ITS; GU935118-GU935168, matk; GU935169-GU935214, rbcL; GU935215-GU935256, trnH-psbA; HM437871-HM437907). PCR amplification and sequencing were largely successful with all the four loci (Table 1). The mean sequence length, parsimony informative characters and percent variable sites for the four loci are shown in Table 1. In both Ficus and Gossypium, ITS exhibited the highest interspecific and rbcL and trnH-psbA the lowest intraspecific divergence (Table 1). In ANOVA test, ITS emerged as the most divergent barcode locus at interspecific level than all other loci in Ficus and Gossypium (Table S8). At intraspecific level rbcL and trnH-psbA were the least divergent loci in Ficus where as in Gossypium, all loci were equally divergent at inra specific level. The highest inter- to intraspecific K2P distance ratios were exhibited by ITS in both the genera (Figure 3B and C). In the phylogenetic analysis of Ficus, ITS exhibited 100% species recovery (Figure S5A, Table 2) followed by trnH-psbA (82%) (Figure S5D, Table 2). In Gossypium, ITS recovered 100% species (Figure S6A). Other loci could not distinguish the species in the two genera. Since all tested species of both Ficus and Gossypium were recovered using ITS, we did not apply multilocus combinations in these genera. ITS and trnH-psbA exhibited distinct barcoding gap in Ficus (Figure 4) where the minimum inter specific K2P distances were significantly higher than the maximum intraspecific K2P distances (t test, p = 0.02, and 0.006 respectively, Table S6). In Gossypium, none of the loci showed significant barcoding gaps (Table S6). In character-based approach, diagnostic characters were found in the ITS sequences of Ficus and Gossypium. Nine species of Ficus exhibited simple diagnostic characters in single or multiple positions and two species, F. elastica and F. virens exhibited compound characters. All the four species of Gossypium were identified by simple or compound diagnostic characters using ITS. trnH-psbA exhibited simple diagnostic character at single position in four species and simple diagnostic characters at multiple positions in other two species of Ficus. trnH-psbA showed simple diagnostic character at single position in G. barbadense. The matK and rbcL did not provide any diagnostic characters in Gossypium species. However, in case of Ficus, matK and rbcL exhibited simple diagnostic characters in three species (F. carica, F. hispida and F. recemosa) and two species (F. carica, F. virens) respectively (Table 3). In distance-based approach, the highest intraspecific K2P distance of trnH-psbA was lower than the lowest interspecific K2P distance in all the species of Ficus. Similar results were obtained using ITS except in F. religiosa where maximum intraspecific K2P distance was equal to minimum interspecific K2P distance. In Gossypium, the highest intraspecific K2P distance of ITS was lower than the lowest interspecific K2P distance in G. hirsutum and G. barbadense but not in case of G. herbaceum and G. arboretum (Table S9).
AFLP analysis of Berberis species
Using ten different primer combinations, a total of 784 bands were scored, 776 were polymorphic representing 98.9% of the total number of bands (Table 4). The number of bands varied among species. The distribution of total bands in different species is shown in Table 5. The number of polymorphic bands ranged from 46 for EcoRI+ACA/MseI+CTG to 97 for EcoRI+AAG/MseI+CAT (Table 4). Among the polymorphic bands, only eight bands were unique representing the three species, B. insignis, B. pachyacantha and B. replicata. The results of principal coordinate analysis, based on Jaccard's coefficient of similarity are shown in Figure 5. The first three cumulatively accounted for 18.7% of the total variance detected, comprising 10.2%, 4.9% and 3.6% from the first, second and third vectors respectively. Ordination of the first vector with second and third showed two distinct clusters corresponding to species of Eastern Himalayas, B. replicata, B. angulosa and B. insignis along with one species of Western Himalayas B. umbellata, and the remaining species formed a second cluster. The trends revealed by principal coordinate analysis were supported by UPGMA cluster analysis based on Jaccard's similarity matrix (Figure 6). The phenogram largely recognized the major two clusters as that of principal coordinate analysis. Within the cluster I, the species of B. umbellata, and B. replicata, and in cluster II the species of B. pachyacantha and B. hainesii were well separated. Other species were not recognized by AFLP method. However, there was a tendency of these species to cluster according to geographic location rather than species affiliation. For example, B. replicata, B. angulosa and B. insignis are exclusively from Eastern Himalayas grouped in one cluster. Similarly the species from central part of India, B. hainesii formed one distinct cluster while the species from Southern part of India, B. tinctoria, and B. wightiana formed another distinct cluster. The remaining species are from Western Himalayas and were mostly scattered. The cophenetic correlation coefficient was 0.86 (p≤0.01), which indicates good agreement with the cluster analysis with the original distance matrix. In addition, the Mantel test between Jaccard's distance matrix based on AFLP and geographical distance matrix derived from GPS data of respective samples was quite strong (r = 0.46, p≤0.001).
The three vectors, one, two and three contribute 10.2%, 4.9 and 3.6% of total variability respectively. Colors represent regions from where species were collected. Green: Western Himalaya, Red: Eastern Himalaya, Blue: Central India and Dark Red: Southern India.
The numbers preceding the abbreviated species name denotes the DNA number. Abbreviations are: B.ang: Berberis angulosa, B.ari: Berberis aristata, B.asi: Berberis asiatica, B.chi: Berberis chitria, B.glu: Berberis glaucocarpa, B.hai: Berberis hainesii, B.ins: Berberis insignis, B.lyc: Berberis lycium, B.pac: Berberis pachyacantha, B.rep: Berberis replicata, B.tin: Berberis tinctoria, B.umb: Berberis umbellata, B.wit: Berberis wightiana.
We investigated the species resolution ability of four barcode loci viz. ITS, matK, rbcL, and trnH-psbA in 16 species of Berberis, 11 species of Ficus and 4 species of Gossypium. In our study, the matK2.1a failed to give amplification in Berberis. The modified matK primer was largely successful in PCR amplification in all the three genera. There are mixed reports about PCR success and sequencing using matK primers, depending upon the use of particular primer and data sets , , . However, the other three primers, ITS, rbcL and trnH-psbA provided good amplification with the three test genera. In spite of successful design of genus specific matK primer for Berberis, the PCR success rate was lower (76%) in Berberis as compared to Ficus (85%) and Gossypium (100%). This lower success rate of PCR using genus-specific matK in Berberis may be due to the instability and the uniqueness of the primer's 3′-end in matK sequences of Berberis samples as reported in other cases , . A successful barcode locus is evaluated on the basis of its ability to recover monophyletic clusters corresponding to individual species . Based on this criterion, all the species of Ficus and Gossypium were recovered using ITS. None of the four loci could resolve the species in Berberis except B. pachyacantha. However, when we analyzed 58 common sequences for all the four loci with at least three accessions per species, the proportion of species recovery increased with ITS and trnH-psbA. The improvement in species recovery by ITS and trnH-psbA using this reduced sequence set as compared to initial sequence set of 129 and 83 respectively, was due to removal of some sequences having higher intraspecific divergence than the others within the same species. Such overlap will not affect species identification of unknowns in already characterized species but can have impact in incompletely sampled groups .
In Berberis, species recovery improved using multilocus combinations as compared to the single locus. The species recovery by two loci combination was as good as three or four loci combinations (30.8%) (Table 2). In most of the studies reported earlier, multilocus analysis of more than three loci did not provide significant gain in species recovery as compared to one or two loci combinations , , , . In most of these reports, ITS was not used in multilocus analysis. Our results indicate that ITS in combination with trnH-psbA provided better species discrimination as compared to matK+rbcL in these tested species for multilocus barcode analysis.
We examined the use of character-based approach as an alternative to distance-based approach for species delineation. In case of Berberis, ITS showed diagnostic characters only for the species B. pachyacantha. This species was also resolved with distance based approach. The tested species of Ficus and Gossypium were resolved by ITS sequences using either of the methods. Character-based approaches have been shown to be successful in species identification , . The study of Rach et al. (2008), focused only on simple characters i.e. a standalone diagnostic at a single nucleotide position . Wong et al. (2009) examined 64 shark species by character-based approaches . They identified 20 species of shark by simple character and 37 species using compound characters of two or three nucleotide positions in combination. The consideration of simple diagnostic characters at multiple positions in our study increases the level of confidence of a resolved species as compared to a single position. However, we noticed no additional advantage using diagnostic character-based approach over distance-based approach as far as percentage species resolution is concerned.
The four barcode loci did not resolve the sections and subsections of Indian Berberis except section ‘vulgaris’, represented by only one species, B. pachyacantha. Similarly, although the species of Ficus could be resolved using ITS and trnH-psbA, they failed to identify the subgenus and sections of the genus. A recent molecular study conducted on a large number of samples (100 species) using three nuclear markers (ITS, ETS and G3pdh) support neither subgeneric nor sectional classification in Ficus except in subgenus Sycidium .
Although matK and rbcL have been shown to provide high level of species recovery in several plant DNA barcoding studies on different floristic or biodiversity hotspots , , , , , , these loci were not found useful in many other studies dealing with specific taxonomic groups – as in the present study. Another leading barcoding locus proposed by several workers is trnH-psbA , , , which also did not work in the tested species of Berberis and Gossypium. However, it provided good species recovery in Ficus. In our study, ITS recovered one species in Berberis and all the tested species of Ficus and Gossypium. Several other studies have also reported ITS as one of the suitable markers for barcoding in plants , . Other studies described its inherent difficulties, e.g. low PCR success , , problem of secondary structure formation, resulting in poor quality sequence data ,  and multiple copy numbers , etc. We did not find any difficulty in PCR and sequencing for ITS (Table 1). In addition, no paralogos of ITS were found in Berberis and Ficus using PCR as well as sequencing of at least eight clones from PCR products of each of the tested eight species of Berberis. Natural hybridization and polyploidy are rare in Ficus . Therefore divergent paralogs of ITS may be uncommon in Ficus. It is also reported that ITS has undergone complete concerted evolution in Gossypium following allopolyploid speciation . These studies and our findings in ITS sequences of Berberis indicate none of the tested three genera have paralogs of ITS. Our results, especially in Ficus and Gossypium suggest that ITS holds a good promise as a candidate barcode locus, as also reported earlier .
The low level of barcoding success observed in Berberis is not uncommon in plants. Similar difficulties have earlier been reported in Aspalathus , Crocus  Solanum sect. Petota  and Carex . Even in the well studied taxonomic group, barley (Hordeum L.), Seberg et al. (2009) reported recognition of less than 50% species using matK and rpoC1 . In the morphologically distinct species of the Galapagos sun flower tree, Scalesia Arn, (Asteraceae), no variation was found in plastid loci and almost none in nuclear loci . In our study, the morpho-taxonomic parameters of the selected species were considered for species delimitation. In Berberis the phenogram derived from this matrix showed that morphologically the species are well delineated whereas none of the four loci tested could distinguish species of Indian Berberis, except B. pachyacantha. This prompted us to check the species relationship in Berberis and evaluate genetic basis for the delimitation of species using amplified fragment length polymorphism (AFLP). Though the species recovery increased using AFLP method, most of the species remained unresolved. None of the Ahrendt's (1961) sections or subsections in Berberis was recognized with the exception of B. pachyacantha, which belongs to section ‘vulgaris’. Only one species of section ‘vulgaris’ was included in the present study. It remains to be seen if the species would be resolved when other species of the section are included. Kim et al. (2004), in analyzing the ITS phylogeny of Berberis species including a few species from India (B. edgeworthiana, B. insignis, B. coriaria, B. hookeri) could not recognize the sections and subsections of Berberis proposed by Ahrendt (1961). That the species could not be fully resolved using AFLP has been noted before for complexes containing closely related or hybridizing taxa –. There are several reports on occurrence of hybridization in the genus Berberis –. The lack of resolution of most of the species e.g. B. asiatica, B. glaucocarpa, B. lycium of the section Asiaticae, B. chitria, B. aristata, B. tinctoria, and B. wightiana of the section Tinctoriae, B. angulosa of the section Angulosae and B. insignis of the section Wallichianae with AFLP data as well as ITS sequences indicates a probable hybridization in these species. This is further corroborated by the fact that in AFLP analysis there was a tendency of the species of Berberis to cluster according to their geographic location rather than to species identity. This indicates that, species discrimination seems possible with morphological characters, but reproductive isolation appears to be weak in Berberis and in many cases probably only affected by geographical barriers. These findings are consistent with either non-monophyletic or reticulate evolution of these species. Kim et al (2004)  using ITS sequences of 79 taxa of Berberis representing four major groups including Septentrionales and 22 sections in the genus showed that these traditional geographical groups are monophyletic. Therefore, the latter hypothesis is preferred because of evidence of hybridization and relatively young age (5.33-0.01 Ma) of Indian Berberis (Pleistocene record of Kashmir, India) .
The taxonomic problems described here for Indian Berberis are not unique. Similar problems were reported by Spooner (2009) for Solanum sect. Petota . Hawkes (1990) reported 232 species of sect. Petota  but Spooner and Salas (2006) reduced it to 190  and more recently, Spooner has converged these to about 110 species . Harlan and de Wet (1971) showed differences in the number of species recognized by different taxonomists in crops, e.g. 100 to 200 in wild relatives of potatoes, 2 to 24 in wheat and 1 to 31 in sorghum . These are some examples where taxonomic disputes still remain unresolved. Plant DNA barcoding in these cases may be problematic and contribute to complexities in search of universal loci for plant DNA barcode.
Barcoding in plant genera like Berberis with possible occurrence of natural hybridization and gene introgression may be quite challenging. The morphological, geographical and genomic diversity study in Indian species of Berberis indicates probable reticulate nature of the species. Our results with Ficus and Gossypium suggest that ITS and trnH-psbA are good candidates for plant DNA barcoding and the matK and rbcL, the standard barcode loci for plant barcoding do not work in all the tested species of these three genera.
Materials and Methods
Sampling and morphometric analysis
One hundred and sixty four accessions representing 16 species of Berberis were collected from four different geographical regions e.g. Eastern Himalayas, Western Himalayas, Central India and Sothern India (Figure 1). Out of these, 3–5 representative accessions from each species were evaluated by morpho-taxonomic analysis. All morphological characters were weighted equally. Multistate characters were unordered. The character matrix thus developed (Table S3) was used to generate the phenograms (Figure S1) with PAUP*4.0b . The parsimony trees were generated using bootstrap analyses with 10,000 replicates. Bootstrap searches were heuristic with simple addition of taxa, TBR branch-swapping and MulTrees turned off. We sampled at least two species from each region except central India where only one species occurs. Thirty three accessions of 11 species of Ficus and 51 accessions of 4 species of cultivated Gossypium were collected from different parts of India. Multiple accessions were included for each species. Specimen vouchers were deposited in the Herbarium of National Botanical Research Institute, India (LWG). Accession numbers including specimen collection locations are given in Table S10.
PCR and DNA sequencing
Genomic DNA was extracted from either fresh or silica gel dried leaf materials using DNeasy Plant Mini Kit (Qiagen, Germany) according to manufacturer's instructions. PCR amplification was performed in 50-µl reaction mixtures containing approximately 50–75 ng genomic DNA templates, 1.5 mM MgCl2, 0.2 mM of each dNTP, 1 µM of each primer, 0.1 mg BSA/ml and 1 unit Taq DNA polymerase. The thermocycler programme was 94°C for 1 min (1 cycle), 94°C for 40 sec, 48°C–52°C (depending upon primer sets used) for 35 cycles, 72°C for 40 sec and 72°C for 5 min (1 cycle). The primers, matK2.1a and matK3.2r reported by Plant Working Group failed to give PCR amplification in Berberis even after changing some PCR conditions including addition of DMSO. A modified matK forward primer, matK-NBRI (here after referred as matK) was designed after aligning the matK sequences of genera closely related to Berberis e.g. Nandina, Ranzania, Mahonia. The matK along with the reported matK3.2r primer was able to successfully amplify in Berberis, Ficus and Gossypium. For primer sequences and references see Table S11. The PCR products were cleaned by Qiaquick® PCR Purification kit (Qiagen, Germany). In a few cases, where multiple bands appeared, these were gel extracted and sequenced. Sequencing was carried out bidirectionally using automated capillary sequencer, ABI3730XL DNA analyzer (Applied Biosystems, UK). Pairwise alignments were made by using the sequences obtained from forward and reverse primers. Sequences which covered more than 70% overlap between forward and reverse sequences were considered (except a few sequence of matK where coverage was less than 50%). A minimum average QV of 30 was considered as quality sequences. DNA sequences were edited manually by visual inspection of the electropherograms of both end sequences using Sequencher 4.1.4. The GenBank accession numbers for the sequences are given in Table S10.
Each nrITS sequence was searched in nucleotide data base using BLAST, to confirm its plant origin rather than from a possible fungal contamination of the sample. In all cases the best match retrieved the plant species, either as the same plant species sequence as query sequences or as the nearest plant species (e.g. in most cases sequences of Indian Berberis species were not available in data base and B. thumbergii showed the best match). Secondly, we looked for the presence of the characteristic conserved motif in the 5.8S rRNA gene of angiosperm plant ITS sequences . The characteristic motif (5′- GAATTGCAGAATCC-3′) was found in all the ITS sequences where as the variant of the motif generally found in fungi (5′-GAATTGCAGAATTC-3′) was not found in any of the sequences.
To establish the species diversity of Indian species of Berberis at the genome level, we employed amplified fragment length ploymorphism analysis in 55 accessions of 13 species of Berberis. We used the same DNA samples as used in barcode analysis except in few cases where DNA quality was poor for AFLP assays. AFLP protocol was followed as described in user manual (AFLP Plant Mapping Kit, Applied Biosystem, and USA) with minor modifications. Briefly, 0.5–1.0 µg genomic DNA was digested with 10 U EcoRI and 10 U MseI in a 20 µL reaction and incubated at 37°C for 5 h. Following 15 min heat inactivation of enzymes, 20 µL of ligation master mix containing 75 pmol each MseI and EcoRI adapters with 20 U T4 DNA ligase in 1X T4 DNA ligase buffer was added and incubated overnight at 16°C. The digestion- ligation mixture was diluted with 160 µL sterile water. Pre-selective amplification was performed by using a tri-selective nucleotide (+3) at the 3′. Ten primer combinations were employed to detect polymorphism among different genotypes: EcoRI+ AAG/MseI+CAA, EcoRI +AAG/MseI+CAT, EcoRI+AAC/MseI+CTT,EcoRI+AAC/MseI+CTG, EcoRI+AGG/MseI+CAA EcoRI+AGG/MseI+CTG, EcoRI+ACC/MseI+CTC, EcoRI+ACC/MseI+CTT, EcoRI+ACG/MseI+CAC and EcoRI+ ACA/MseI+CTG. The EcoRI adapter primers were 5′ fluorescent labeled either with 6-carboxyfluorescein (FAM)/(JOE)/(NED). The MseI adapter primers were unlabeled. Each 25 µL reaction contained 5 µL diluted +1 reaction, 1X PCR buffer, 1.5 1 mM MgCl2, 300 µM dNTP, 4 pmol each Eco RI adapter +3 primer, 25 pmol Mse I 2 adapter +3 primer, and 1 U Taq DNA polymerase. The amplification profile was 94°C for 2 min, 10 cycles of 94°C for 20 s, 66°C for 30 s, 72°C for 2 min, reducing the annealing temperature by 1°C per cycle, followed by 30 cycles of 94°C for 30 s, 56°C for 30 s, 72°C for 2 min, ending with 72°C for 30 min.
AFLP data analysis
Selective +3 AFLP amplification products were resolved using automated sequencing gels on an ABI Prism1 3730xl DNA Analyzer. Image analysis was performed using GeneMapper version 4.0 (Applied Biosystems,USA), and by visual inspection. Fragments were sized by running dye-labeled standards in each well. GeneMapper automatically scores fragments ranging from 50–500 bp in length. Similarity of fragment size was assumed to indicate homology. Fragment data were recorded as ‘1’ (presence) or ‘0’ (absence) and data entered into binary data matrix as discrete variables. Jaccard's coefficient of similarity was calculated for all pair wise comparison and a dendrogram was made through cluster analysis using the unweighted pair group method based on arithmetic average (UPGMA). The correlation between the Jaccard's similarity and the cophenetic coefficients for the clusters was calculated. Jaccard's coefficient of similarity was used in principal co-ordinate analysis using the DCENTRE and EIGEN functions to resolve pattern of variation among and within species. The relative contribution significance of AFLP bands in species discrimination for the three co-ordinates was analyzed. The NTSYS-pc2.02e was used for all statistical analysis .
Correlation between Geographical and genetic distances
GPS data of samples were converted into a geographical distance matrix using WGS84 (World Geodetic System) model in Geographical Distance Matrix Generator programme . Mantel test was applied to find the correlation between geographical distance matrix and Jaccard's distance coefficient matrix obtained from AFLP data.
Cloning and sequencing of ITS for copy number detection
The gel eluted PCR fragments (as described above) of ITS sequences of 8 accessions from 8 species of Berberis were cloned separately into a pTZ57R/T (Fermentas, USA) TA cloning vector following standard protocol.
These were transformed into Escherichia coli DH5α competent cells. Eight colonies were randomly picked up for screening. Plasmid was isolated following standard protocol. Each clone was sequenced using M13 forward and reverse primers as described above.
The sequences were aligned by ClustalW and the inter- and intraspecific genetic distances were calculated using MEGA4  for each DNA barcode locus. The pair wise distances were calculated with the simplest K2P model implemented in MEGA4. The K2P model considers that transition and transversion happen at different rates and takes into account both transition and transversion rates to calculate the divergence between sequences. These considerations are important as far as variation at inter- and intraspecific level are considered. ANOVA with Bonferroni's Multiple Comparison Test was performed to compare mean inter- and intraspecific variability for each individual pair and all possible multilocus combinations of barcode loci. To exclude inequality of variances, if any, data was log transformed wherever required. The DNA barcoding gaps were evaluated by comparing the minimum interspecific divergence and the maximum intraspecific divergence by t-test. In order to assess the character based approach for barcoding, we first generated haplotype variation from the aligned sequences along with a reference sequence from GenBank data base for each locus using iBarcode web program . Identification and confirmation of unique characters were accomplished in a straight forward manner with respect to the respective reference sequence from the data base by visual inspection. Diagnostic characters were identified using at least three or more accessions per species. Species discrimination power was also calculated using distance approach following CBOL plant working group .
To evaluate whether the species were recovered as monophyletic, using the four barcode loci singly or in combinations, we used standard phylogenetic methods. Phylogenetic trees were made with MEGA4 using Neighbor Joining (NJ), Parsimony and Unweighted Pair Group Method with Arithmetic Mean (UPGMA). The NJ and UPGMA trees were built with K2P distance model and 500 bootstrap replicates. MP trees were built with default setting implemented at MEGA4. In all cases indels were treated as complete deletion. In multilocus analysis, 58 DNA sequences representing at least three common accessions were included per species (except in case of B. angulosa) for the four loci.
Unrooted Bootstrap 50% majority-rule consensus tree of Berberis species. Bootstrap support values are indicated on the nodes. The detailed morphological characters are as described in Table S3.
(0.05 MB PDF)
Strict consensus NJ, MP and UPGMA trees of Berberis species. A) ITS, (B) matK, (C) rbcL and (D) trnH-psbA. Numbers at the branch nodes are bootstrap values. Codes preceding the species name indicate DNA numbers corresponding to the accession numbers analyzed in this study.
(0.12 MB PDF)
Strict consensus NJ, MP and UPGMA trees based on sequences of different multilocus combinations in Berberis. A) ITS+trnH-psbA+matK+rbcL (B) ITS+matK+rbcL (C) ITS+trnH-psbA+rbcL (D) ITS+trnH-psbA+matK (E) trnH-psbA+matK+rbcL (F) ITS+matK (G) ITS+rbcL (H) ITS+trnH-psbA (I) trnH-psbA+matK (J) trnH-psbA+rbcL (K) matK+rbcL. Other details are as in Figure S2.
(0.19 MB PDF)
Strict consensus NJ, MP, UPGMA trees based on 58 common sequences of four loci in Berberis. (A) ITS, (B) matK, (C) rbcL and (D) trnH-psbA. Other details are, as in Figure S2.
(0.09 MB PDF)
Strict consensus NJ, MP, UPGMA trees based on ITS (A), matK (B), rbcL (C) and trnH-psbA (D) sequences of different species of Ficus. F.elastica and F.rumphi were represented by only one accession with trnH-psbA sequences.
(0.10 MB PDF)
Strict consensus NJ, MP, UPGMA trees based on ITS (A), matK (B), rbcL (C) and trnH-psbA (D) sequences of different species of Gossypium.
(0.08 MB PDF)
The classification and distribution of Berberis species according toAhrendt(1961). The arrangement of species is alphabetical. The detailed GPS data for distribution of the selected species is given in Table S10.
(0.02 MB PDF)
The classification of Ficus species according to Corner (1958).
(0.05 MB PDF)
The binary matrix developed on the basis of detailed morphological parameters considered for species delineation in Berberis. The character matrix developed on the basis of detailed morphological parameters considered for species delineation in Berberis.
(0.09 MB PDF)
One Way ANOVA with Bonferroni's multiple comparison tests to compare inter (A) and intraspecific (B) variability for each individual locus in Berberis.
(0.06 MB PDF)
One way ANOVA with Bonferroni's multiple comparison tests to compare inter (A) and intraspecific (B) variability for each possible multilocus combinations in Berberis.
(0.06 MB PDF)
Results of paired t-test to compare between minimum inter and maximum intraspecific K2P distances of different loci.
(0.06 MB PDF)
Minimum inter- and maximum intraspecific K2P distances of Berberis species for different loci and ability to discriminate species. These values could not be calculated in some cases (-) where there was either single accession or sequencing failure for the locus.
(0.08 MB PDF)
One way ANOVA with Bonferroni's multiple comparison tests to compare inter (A, Ficus and C, Gossypium) and intraspecific (B, Ficus and D, Gossypium) variability for each individual locus.
(0.03 MB PDF)
Minimum inter- and maximum intraspecific K2P distances of species of Ficus and Gossypium for different loci and ability to discriminate species. These values could not be calculated in some cases (-) where there was either single accession or sequencing failure for the locus.
(0.09 MB PDF)
List of Accessions and DNA numbers of different plant species along with Global Positioning System (GPS) data, and collector's name. In some cases GPS data could not be taken.
(0.09 MB PDF)
Primer sequences used in this study (listed 5′- to 3′).
(0.01 MB PDF)
The authors acknowledge The Department of Science & Technology, Government of India for JC Bose Fellowship to RT, JK Agri Genetics Ltd, Hyderabad and SN Jena for providing some of the Gossypium germplasms and SK Mandal for help in statistical analysis. The authors are thankful to the reviewers for their constructive comments in improving the presentation.
Conceived and designed the experiments: SR RT. Performed the experiments: SR AT VS AK UMS LBC BD NKN TH. Analyzed the data: SR AT SKB. Contributed reagents/materials/analysis tools: PKS RT. Wrote the paper: SR AT RT.
- 1. Hebert PDN, Cywinska A, Ball SL, DeWaard JR (2003) Biological identifications through DNA barcodes. Philos Trans, Ser B 270: 313–321.
- 2. Blaxter M (2003) Molecular systematics: Counting angels with DNA. Nature 421: 122–124.
- 3. Tautz D, Arctander P, Minelli A, Thomas RH, Vogler AP (2002) DNA points the way ahead in taxonomy. Nature 418: 479.
- 4. Smith MA, Fisher BL, Hebert PD (2005) DNA barcoding for effective biodiversity assessment of a hyperdiverse arthropod group: the ants of Madagascar. Philos Trans R Soc Lond B Biol Sci 360: 1825–1834.
- 5. Vences M, Thomas M, Bonett RM, Vieites DR (2005) Deciphering amphibian diversity through DNA barcoding: chances and challenges. Philos Trans R Soc Lond B Biol Sci 360: 1859–1868.
- 6. Ward RD, Zemlak TS, Innes BH, Last PR, Hebert PD (2005) DNA barcoding Australia's fish species. Philos Trans R Soc Lond B Biol Sci 360: 1847–1857.
- 7. Pennisi E (2007) Taxonomy. Wanted: a barcode for plants. Science 318: 190–191.
- 8. Rubinoff D, Cameron S, Will K (2006) Are plant DNA barcodes a search for the Holy Grail? Trends Ecol Evol 21: 1–2.
- 9. Chase MW, Cowan RS, Hollingsworth PM, Van den Berg C, Madrinan S, et al. (2007) A proposal for a standardised protocol to barcode all land plants. Taxon 56: 295–299.
- 10. Chase MW, Salamin N, Wilkinson M, Dunwell JM, Kesanakurthi RP, et al. (2005) Land plants and DNA barcodes: short-term and long-term goals. Philos Trans R Soc Lond B Biol Sci 360: 1889–1895.
- 11. Kress WJ, Erickson DL (2007) A two-locus global DNA barcode for land plants: the coding rbcL gene complements the non-coding trnH-psbA spacer region. PLoS ONE 2: e508.
- 12. Lahaye R, van der Bank M, Bogarin D, Warner J, Pupulin F, et al. (2008) DNA barcoding the floras of biodiversity hotspots. Proc Natl Acad Sci U S A 105: 2923–2928.
- 13. Taberlet P, Coissac E, Pompanon F, Gielly L, Miquel C, et al. (2007) Power and limitations of the chloroplast trnL (UAA) intron for plant DNA barcoding. Nucleic Acids Res 35: e14.
- 14. Kress WJ, Wurdack KJ, Zimmer EA, Weigt LA, Janzen DH (2005) Use of DNA barcodes to identify flowering plants. Proc Natl Acad Sci U S A 102: 8369–8374.
- 15. Newmaster SG, Fazekas AJ, Raghupathy S (2006) DNA barcoding in land plants: evaluation of rbcL in a multigene tired approach. Can J Bot 84: 335–341.
- 16. Shaw J, Lickey EB, Schilling EE, Small RL (2007) Comparison of whole chloroplast genome sequences to choose noncoding regions for phylogenetic studies in angiosperms:the tortoise & the hare III. Am J Bot 94: 275–288.
- 17. CBOL , Plant , Working , Group (2009) A DNA barcode for land plants. Proc Natl Acad Sci U S A 106: 12794–12797.
- 18. Kress WJ, Erickson DL, Jones FA, Swenson NG, Perez R, et al. (2009) Plant DNA barcodes and a community phylogeny of a tropical forest dynamics plot in Panama. Proc Natl Acad Sci U S A 106: 18621–18626.
- 19. Chen S, Yao H, Han J, Liu C, Song J, et al. (2010) Validation of the ITS2 region as a novel DNA barcode for identifying medicinal plant species. PLoS One 5: e8613.
- 20. Gao T, Yao H, Song J, Liu C, Zhu Y, et al. (2010) Identification of medicinal plants in the family Fabaceae using a potential DNA barcode ITS2. J Ethnopharmacol.
- 21. Edwards D, Horn A, Taylor D, Savolainen V, Hawkins J (2008) DNA barcoding of a large genus, Aspalathus L. (Fabaceae). Taxon 57: 1317–1327.
- 22. Ahrendt L (1961) Berberis and Mahonia: a taxonomic revision. J Linn Soc (bot) 57: 1–410.
- 23. Schneider CK (1905) Die Gattung Berberis (Euberberis). Vorarbeiten für eine Monographie. Bull Herb Boissier Ser 2 5: 33–48, 1. Bull Herb Boissier Ser 2 5: 33–148, 391–403, 449–464, 655–670, 800–812, 813–831.
- 24. Landrum LR (1999) Revision of Berberis (Berberidaceae) in Chile and adjacent Southern Argentina. Ann Mo Bot Gard 86: 793–834.
- 25. Kim Y-D, Kim S-H, Landrum LR (2004) Taxonomic and phytogeographic implications from ITS phylogeny in Berberis (Berberidaceae). J Plant Res 117: 175–182.
- 26. Rao RR, Hussain T, Datta B, Garg A (1998) Revision of the Berberidaceae of Indian-region II. Rheedea 3: 109–143.
- 27. Bottini MCJ, Bustosa AD, Sanso M, Jouve N, Poggio L (2007) Relationships in Patagonian species of Berberis (Berberidaceae) based on the characterization of rDNA internal transcribed spacer sequences. Bot J Linn Soc 153: 321–328.
- 28. Singh PB, Ambasta SS, Tripathi VN, Agrawal R, Singh M, et al. (1986) Blunt renal injury—an experience of 30 cases. Injury 17: 228–229.
- 29. Orsi MC (1984) Berberidaceae. Buenos Aires: INTA. pp. 325–348.
- 30. Corner EJH (1958) An introduction to the distrubution of Ficus. Reinwardtia 4: 325–355.
- 31. Corner EJH (1965) Check-list of Ficus in Asia and Australasia with keys to identification. Gardens' Bulletin Singapore 21: 1–186.
- 32. Hooker F (1888) Urticaceae. The flora of British India. London: L. Reev and Co. pp. 494–537.
- 33. King G (1888) The species of Ficus of the Indo-Malayan and Chinese countires. Annals of the Royal Botanic Garden, Calcatta. London: L. Reev and Co. pp. 1–185.
- 34. Weiblen GD (2000) Phylogenetic relationships of functionally dioecious FICUS (Moraceae) based on ribosomal DNA sequences and morphology. Am J Bot 87: 1342–1357.
- 35. Abdalla AM, Reddy OUK, El-Zik KM, Pepper AE (2001) Genetic diversity and relationships of diploid and tetraploid cottons revealed using AFLP. Theor Appl Genet 102: 222–229.
- 36. Wendel JF (1989) New World tetraploid cottons contain Old World cytoplasm. Proc Natl Acad Sci U S A 86: 4132–4136.
- 37. Desalle R (2007) Phenetic and DNA taxonomy; a comment on Waugh. Bioessays 29: 1289–1290.
- 38. Waugh J (2007) DNA barcoding in animal species: progress, potential and pitfalls. Bioessays 29: 188–197.
- 39. Miura F, Uematsu C, Sakaki Y, Ito T (2005) A novel strategy to design highly specific PCR primers based on the stability and uniqueness of 3′-end subsequences. Bioinformatics 21: 4363–4370.
- 40. Onodera K, Melcher U (2004) Selection for 3′ end triplets for polymerase chain reaction primers. Mol Cell Probes 18: 369–372.
- 41. Fazekas AJ, Burgess KS, Kesanakurti PR, Graham SW, Newmaster SG, et al. (2008) Multiple multilocus DNA barcodes from the plastid genome discriminate plant species equally well. PLoS ONE 3: e2802.
- 42. Meyer CP, Paulay G (2005) DNA barcoding: error rates based on comprehensive sampling. PLoS Biol 3: e422.
- 43. Kelly RP, Sarkar IN, Eernisse DJ, Desalle R (2007) DNA barcoding using chitons (genus Mopalia). Mol Ecol Notes 7: 177–183.
- 44. Starr JR, Naczi RFC, Chouinard BN (2009) Plant DNA barcodes and species resolution in sedges (Carex, Cyperaceae). Mol Ecol Resour 9: 151–163.
- 45. Rach J, Desalle R, Sarkar IN, Schierwater B, Hadrys H (2008) Character-based DNA barcoding allows discrimination of genera, species and populations in odonata. Proc Royal Soc B: Biol Sci 275: 237–247.
- 46. Wong EHK, Shivaji MS, Hanner RH (2009) Identifying sharks with DNA barcodes: assesing the utility of a nucleotide diagnostic approach. Mol Ecol Resour 9: 243–256.
- 47. Ronsted N, Weiblen GD, Savolainen V, Cook JM (2008) Phylogeny, biogeography, and ecology of Ficus section Malvanthera (Moraceae). Mol Phylogenet Evol 48: 12–22.
- 48. Alvarez I, Wendel JF (2003) Ribosomal ITS sequences and plant phylogenetic inference. Mol Phylogenet Evol 29: 417–434.
- 49. Storey WB (1975) Advances in fruit breeding. In: Janick J, Moore JN, editors. West Lafayette, Indiana, USA: Purdue University Press. pp. 568–589. Figs in.
- 50. Wendel JF, Schnabel A, Seelanan T (1995) Bidirectional interlocus concerted evolution following allopolyploid speciation in cotton (Gossypium). Proc Natl Acad Sci U S A 92: 280–284.
- 51. Nieto FG, Rosselló JA (2007) Better the devil you know? Guidelines for insightful utilization of nrDNA ITS in species-level evolutionary studies in plants. Mol Phyl Evol 44: 911–919.
- 52. Seberg O, Petersen G (2009) How many loci does it take to DNA barcode a crocus? PLoS ONE 4: e4598.
- 53. Spooner DM (2009) DNA barcoding will frequently fail in complicated groups: An example in wild potatoes. Mol Ecol Resour 9: 1177–1189.
- 54. Cervera MT, Storme V, Soto A, Ivens B, Van Montagu M, et al. (2005) Intraspecific and interspecific genetic and phylogenetic relationships in the genus Populus based on AFLP markers. Theor Appl Genet 111: 1440–1456.
- 55. Dodd RS, Kashani N (2003) Molecular differentiation and diversity among the California red oaks (Fagaceae; Quercus section Lobatae). Theor Appl Genet 107: 884–892.
- 56. Schmidt-Lebuhn AN (2007) Using amplified fragment length polymorphism (AFLP) to unravel species relationships and delimitations in Minthostachys (Labiatae). Bot J Linn Soc 153: 9–19.
- 57. Schmidt-Lebuhn AN, Kessler M, Kumar M (2006) Promiscuity in the Andes: Species relationshis in Polylepis (Rosaceae, Sanguisorbeae) based on AFLP and morphology. Syst Bot 31: 547–559.
- 58. Bottini MCJ, Premoli AC, Poggio L (1999) Hybrid speciation in Berberis L: an isoenzymatic approach. St.Louis, Missouri, USA: XVI International Botanical Congress.
- 59. Lubell JD, Brand MH, Lehrer JM, Holsinger KE (2008a) Detecting the influence of ornamental Berberis thunbergii var. atropurpurea in invasive populations of berberis thunbergii (Berberidaceae) using AFLP. Am J Bot 95: 700–705.
- 60. Lubell JD, Brand MH, Lehrer JM (2008b) AFLP identification of Berberis thunbergii cultivars, inter-specific hybrids and their parental species. J hort sci biotechnol 83:
- 61. Puri GS (1946) Some fossil leaves of Berberis from the Karewas of Kashmir. J Indian Bot Soc 25: 177–183.
- 62. Hawkes JG (1990) The potato: Evolution, biodiversity and genetic resources. Washington DC, USA: Smithsonian Institution Press.
- 63. Spooner DM, Salas A (2006) Structure, biosystematics and genetic resources in Handbook of potato production, improvement and post-harvest management. New York, USA: Haworth's Press. pp. 1–39.
- 64. Harlan JR, Wet JMJD (1971) Towards rational classification of cultivated plants. Taxon 20: 509–517.
- 65. Swofford DL (1998) PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods). Version 4. Sinauer Associates, Sunderland, Massachusetts.
- 66. Jobes DV, Thien LB (1997) A Conserved Motif in the 5.8S Ribosomal RNA (rRNA) Geneisa Useful Diagnostic Marker for Plant Internal Transcribed Spacer (ITS) Sequences. Plant Mol Biol Report 15: 326–334.
- 67. Rohlf FJ (2005) NTSYS-pc: Numerical taxonomy and multivariate analysis system, version2.02e. New York, USA: Applied Biostatistics, Port Jefferson.
- 68. Ersts PJ (Internet) Geographic Distance Matrix Generator (version 1.2.3). American Museum of Natural History, Center for Biodiversity and Conservation. http://biodiversityinformaticsamnhorg/open_source/gdmg, accessed on 22/4/2010.
- 69. Tamura K, Dudley J, Nei M, Kumar S (2007) Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol 24: 1596–1599.
- 70. Singer GA, Hajibabaei M (2009) iBarcode.org: web-based molecular biodiversity analysis. BMC Bioinformatics 10(Suppl 6): S14.