
Complete chloroplast genomes from apomictic Taraxacum (Asteraceae): Identity and variation between three microspecies


Chloroplast DNA sequences show substantial variation between higher plant species, and less variation within species, so they are typically excellent markers to investigate evolutionary, population and genetic relationships and phylogenies. We sequenced the plastomes of Taraxacum obtusifrons Markl. (O978); T. stridulum Trávniček ined. (S3); and T. amplum Markl. (A978), three apomictic triploid (2n = 3x = 24) dandelions from the T. officinale agg. We aimed to characterize the variation in plastomes, define relationships and correlations with the apomictic microspecies status, and refine placement of the microspecies in the evolutionary or phylogenetic context of the Asteraceae. The chloroplast genomes of accessions O978 and S3 were identical and 151,322 bp long (although nuclear genes are known to show variation between these accessions), while that of A978 was 151,349 bp long. All three genomes contained 135 unique genes, with an additional copy of the trnF-GAA gene in the LSC region and 20 duplicated genes in the IR region, along with short repeats, the typical major Inverted Repeats (IR1 and IR2, 24,431 bp long), and Large and Small Single Copy regions (LSC 83,889 bp and SSC 18,571 bp in O978). Between the two Taraxacum plastome types, we identified 28 SNPs. The distribution of polymorphisms suggests some parts of the Taraxacum plastome are evolving at a slower rate. There was a hemi-nested inversion in the LSC region that is common to Asteraceae, and an SSC inversion from ndhF to rps15 found only in some Asteraceae lineages. A comparative repeat analysis showed variation between Taraxacum and the phylogenetically close genus Lactuca, with many more direct repeats of 40 bp or more in Lactuca (1% larger plastome than Taraxacum). When individual genes and non-coding regions were used for Asteraceae phylogeny reconstruction, not all showed the same evolutionary scenario, suggesting that care is needed in interpreting relationships if a limited number of markers are used.
Studying genotypic diversity in plastomes is important to characterize the nature of evolutionary processes in nuclear and cytoplasmic genomes with the different selection pressures, population structures and breeding systems.


The organization of chloroplast genomes (plastomes) has similarities at the structural and gene level across higher plants [1, 2]. The DNA sequences show characteristic variation depending on their taxonomic position, and sequence fragments are widely exploited in molecular taxonomy [3]. The chloroplast (or, more generally, plastid) genome (plastome, ctDNA, cpDNA) shows maternal inheritance in most species [4] and normally there is only one haplotype in a plant. Since there is no sexual recombination among plastomes (although horizontal transfer of whole chloroplasts [5], or chloroplast capture [6] may occur), chloroplast markers can give robust phylogenies and are then used to estimate divergence times between lineages [7]. The sequencing of the first plastome in Nicotiana tabacum [8] has been followed by some 626 whole plastomes belonging to 133 different plant families (including 18 well-defined species from 16 genera in the Asteraceae) deposited in the NCBI Organelle Genome Resources database by early 2016 [9–11].

Typical angiosperm plastome sizes range from 135 to 160 kb (although much reduced in hemi-parasitic plants). The plastome has a conserved quadripartite structure composed of two copies (ca. 25 kb) of an Inverted Repeat (IR) which divides the remainder of the plastome into one Large and one Small Single Copy region (LSC and SSC) [1, 2]. One monophyletic clade within the legumes (including the tribes Cicereae, Hedysareae, Trifolieae and Fabeae and some other genera; see Wojciechowski [12]) and all conifers [13] have smaller plastomes in which one copy of the inverted repeat is missing, defining evolutionary lineages. Using whole plastid sequences from two orchid species, Luo et al. [14] demonstrated that chloroplast structure, gene order and content are similar but differ with expansions and contractions at the inverted repeat-small single-copy junction and ndh genes.

PCR-amplified sequences within plastomes are used extensively for species identification and reconstruction of phylogeny at around the species level. Several regions are consistently the most variable across angiosperm lineages and some are widely used, following amplification and sequencing, in barcoding approaches for purposes such as species discovery, floristic surveys, identification of plants, or identification of the composition of natural products (e.g. Bruni et al. [15]; Bruni et al. [16]; Hollingsworth et al. [17]): ndhF-rpl32, rpl32-trnL-UAG, ndhC-trnV, 5'rps16-trnQ, psbE-petL, trnT-psbD, petA-psbJ, and rpl16 (e.g. Dong et al. [18]). However, there is no universal ‘best’ region. The average number of regions applied to interspecific studies is about 2.5, which may be too few to access the full discriminating power of the plastome [19]. It is important to have multiple complete plastomes for species across a family as references, both to characterize any major structural changes, which would be difficult to identify from fragments, and to aid the design of conserved PCR primers to exploit polymorphic regions in larger samples within and between taxa.

What are the limits on use of chloroplast sequences for addressing taxonomic questions? The answer depends on the rates of evolution and nature of variation found at different regions of the plastome. Shaw et al. [19] commented on the use of plastome sequences at increasingly low taxonomic levels: the genes most commonly analysed after amplification by PCR may be appropriate for delineation of species but may not represent the most variable regions of the chloroplast. In date palm, chloroplast haplotypes may correlate with populations [20], although founder effects may be strong in such species. Särkinen and George [21] used full plastome sequences of Solanum chloroplasts to identify the most variable plastid markers, concluding that different chloroplast regions are appropriate for study of evolution at different taxonomic levels from family downwards.

In the Asteraceae, Wang et al. [22] have analysed 81 genes from chloroplasts of 70 different species, showing the family is monophyletic and branching is consistent with tribal relationships as understood on the basis of morphology. The Asteraceae family includes an inversion in the plastome relative to other eudicots [23]. The boundaries of a 22.8 kb inversion define a split within the family, and a second 3.3 kb inversion is nested within the larger inversion. Generally, one of the end points of the smaller inversion is upstream of the gene trnE, and the other end point is located between the gene trnC and rpoB. The two inversions are similar among members of the Asteraceae lineage suggesting that the second inversion event occurred within a short evolutionary time after the first event. Estimates of divergence times based on ndhF and rbcL gene sequences suggest that two inversions originated during the late Eocene (38–42 MYA), soon after the Asteraceae originated in the mid Eocene (42–47 MYA) [23].

The genus Taraxacum (Cichorieae, Asteraceae) is known for its complex reticulate evolution including polyploidy events, hybridization and apomixis [24], which makes it difficult to reconstruct a reliable phylogeny. Repeated hybridization between sexual (diploids or rarely tetraploids) and apomictic (triploids and higher ploidies) taxa, rapid colonization of wide areas by apomicts after the Last Glacial Maximum (LGM), low levels of morphological differentiation and remaining ancestral sequence polymorphisms have been of interest and a challenge to botanists for more than a century (e.g. Nägeli, having seen the results of Mendel [25], suggested that Mendel should investigate the apomictic Hieracium species, see [26–30]). Investigation of genotypic diversity in pure apomictic and mixed sexual-apomictic populations showed that variation arises from both mutation (accumulation of somatic mutations/allele divergence) and recombination (gene flow between sexual and apomictic individuals) [31–34]. Utilization of common chloroplast markers from coding and non-coding regions showed at best weak differentiation within the genus but helped to distinguish evolutionarily old and primitive from evolutionarily younger or more advanced groups of haplotypes [35, 36]. Nevertheless, observed haplotypes were not species specific, some being rare while others were frequent and shared among different and unrelated taxa, even between sexual and apomictic plants (e.g. [32, 34, 35]). Mes et al. [37] showed a high level of homoplasy in several non-coding plastome regions.

Here we aimed to sequence whole chloroplast genomes (plastomes) of three morphologically well-defined apomictic microspecies or agamospecies from the Taraxacum officinale aggregate (dandelions), namely T. obtusifrons, T. stridulum and T. amplum. Our goals were to characterize the nature and scale of differentiation between plastomes in three related apomictic taxa and see if there were features of plastome variation that may be a consequence of apomixis. We then aimed to find the evolutionary relationships between the plastomes in the microspecies, and place them phylogenetically in the genus Taraxacum, the tribe Cichorieae and the Asteraceae. The results also aimed to identify appropriate regions for use as markers in future studies comparing mutation and inheritance of the nuclear genome in the apomicts with the maternally inherited plastome.

Materials and methods

Plant material and DNA sequencing

Three agamospecies (2n = 3x = 24) of Taraxacum officinale agg. [section Taraxacum (formerly Ruderalia), Asteraceae], T. obtusifrons Markl. (O978); T. stridulum Trávniček ined. (S3); and T. amplum Markl. (A978) were germinated and planted in pots. The seeds came from the agamospermous progeny of maternal plants genotyped by nuclear markers by Majeský et al. [32] and ploidy was measured by chromosome counts and flow cytometry [32]. Geographical records of origin and voucher specimens are deposited in the Herbarium of the Department of Botany, Palacký University, Olomouc, Czech Republic (herbarium abbreviation: OL). Nuclear markers confirm the genotypes used for sequencing; plants were karyotyped showing 2n = 3x = 24 chromosomes, and voucher specimens of the sequenced plants have been deposited in the University of Leicester, UK, herbarium (LTR). Total DNA including nuclear, mitochondrial and plastome DNA was extracted from fresh green young leaves using standard cetyl-trimethyl-ammonium bromide (CTAB) methods [38] to obtain high quality DNA.

DNA was sequenced commercially (Interdisciplinary Center for Biotechnology Research, University of Florida, USA); accession S3 was sequenced with Illumina Miseq 2x300bp paired end reads while accessions O978 and A978 were sequenced using Illumina Hiseq500 2x150bp reads. About 59,258,642 paired-end reads were obtained for S3 (22 Gb), and 58,713,854 and 69,056,774 paired-end reads (12 Gb) were obtained for A978 and O978 respectively.

Sequence assembly

Assembly and analysis of the plastomes were performed on Ubuntu Linux 13.10, with Geneious version 7.1.4 and later [39]. Using paired end reads from S3, de novo assembly generated one large contig of >150,000 bp (420,584 reads) which was largely homologous to the Lactuca sativa var. salinas (DQ_383816; Asteraceae) [40] plastome which was then used to generate a consensus reference sequence. For A978 and O978, and for final assembly of the S3 plastome, all raw reads were mapped to the S3 reference (five iterations). The initial assembly showed some areas of double coverage of repeated regions, and minimal coverage at the four junctions between IRs and the SSC/LSC regions; repeated assembly to short regions corrected these, until uniform coverage with no assembly gaps, high similarity of all assembled reads to the consensus, and minimal unmatched paired reads were achieved. Plastome bases were numbered so that the first base pair after IR2, immediately before the trnH gene, became base number 1.
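Because the plastome is circular, choosing which base is "number 1" amounts to rotating the linear representation of the sequence. The following Python sketch illustrates the idea on a toy string; the function name `renumber_circular` and the toy sequence are hypothetical and are not part of the Geneious workflow described above.

```python
def renumber_circular(seq: str, new_start: int) -> str:
    """Rotate a circular sequence so that the base at 1-based
    position `new_start` becomes base number 1."""
    if not 1 <= new_start <= len(seq):
        raise ValueError("new_start must lie within the sequence")
    offset = new_start - 1  # convert to 0-based index
    return seq[offset:] + seq[:offset]


# Toy circular molecule (hypothetical, not real plastome sequence).
toy = "GGGGAAATTTCC"
rotated = renumber_circular(toy, 5)
print(rotated)
```

Rotation changes coordinates only; sequence length and content are unchanged, so region sizes and SNP counts are unaffected by the choice of origin.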

Plastome annotation

Coding sequences and their orientations were identified in the Taraxacum plastome, and genes, rRNAs and tRNAs were annotated with the Geneious annotation function and DOGMA (Dual Organellar Genome Annotator) [41], with reference to published plastomes. In particular, the Taraxacum annotation was optimized by comparison with Lactuca (DQ_383816) to identify gene and exon boundaries, and tRNA genes were further confirmed with the online tRNAscan-SE 1.21 search server [42]. A circular plastome map was drawn using the online program GenomeVX [43].

Short repeat motifs

REPuter [44] was used to identify and locate DNA repeats more than 20 bp long, including direct (forward), reverse, complementary and inverted (palindromic) repeats (90% identity; Hamming distance 2). TandemRepeatFinder [45] was used to find tandem repeats.
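The four dispersed repeat classes differ only in how the second copy relates to the first: identical (forward), reversed, complemented, or reverse-complemented (palindromic). The Python sketch below makes those definitions concrete for two equal-length motifs, allowing a Hamming distance of up to 2 as in the settings above. It is an illustration of the definitions only, not REPuter's search algorithm, and the function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def hamming(a: str, b: str) -> int:
    """Number of mismatching positions between equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def repeat_types(first: str, second: str, max_mismatch: int = 2) -> list[str]:
    """Return the repeat classes that relate `second` to `first`."""
    if len(first) != len(second):
        raise ValueError("motifs must have equal length")
    candidates = {
        "forward": first,
        "reverse": first[::-1],
        "complement": first.translate(COMPLEMENT),
        "palindromic": first.translate(COMPLEMENT)[::-1],  # reverse complement
    }
    return [name for name, target in candidates.items()
            if hamming(target, second) <= max_mismatch]


# Hypothetical 22 bp motif and its reverse complement.
motif = "ACGGTCAAGTTCGATACCGTAG"
partner = motif.translate(COMPLEMENT)[::-1]
print(repeat_types(motif, partner))
```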

Comparison of chloroplast features and phylogenetic analyses

To assess the extent of differences between Taraxacum and 21 Asteraceae accessions with full plastome sequences, GC content, genome size, gene content and the nature of the LSC, SSC and IR regions were compared. Further, we compared the plastid sequences among 18 species and 16 genera in Asteraceae by aligning the entire chloroplast sequences (downloaded from GenBank) with the three Taraxacum plastomes. Based on the primary alignment, regions with the highest sequence divergence were visualised in the mVISTA program [46] in Shuffle-LAGAN mode with default parameters to reveal their sequence variation. The alignments were visually checked and edited manually. Based on the comparison of plastome sequences, the regions with the highest levels of sequence polymorphism were chosen for further phylogenetic analyses. The aim of the phylogenetic analyses was to examine the congruence of the phylogenetic trees with respect to the placement of the three Taraxacum microspecies within the subsampled Asteraceae family (those with whole plastome sequences available) and with respect to the plastome region used for phylogeny reconstruction.

Maximum Likelihood fits of 24 different nucleotide substitution models for 22 accessions using the whole chloroplast genome plus 40 genic and inter-genic regions were calculated, and evolutionary analyses were conducted in MEGA6 [47].
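Model choice by the Bayesian information criterion (BIC) trades goodness of fit against the number of free parameters: BIC = k ln(n) − 2 ln L, and the model with the lowest value is preferred. The Python sketch below shows the calculation. All log-likelihoods, parameter counts and the number of sites are made-up illustrative values, not results from this study, and the model labels are used only as examples.

```python
import math


def bic(log_likelihood: float, n_params: int, n_sites: int) -> float:
    """Bayesian information criterion: k * ln(n) - 2 * lnL."""
    return n_params * math.log(n_sites) - 2.0 * log_likelihood


# Hypothetical fits: model -> (lnL, number of free parameters).
fits = {
    "JC": (-52000.0, 41),
    "HKY+G": (-50500.0, 46),
    "GTR+G": (-50495.0, 50),
}
n_sites = 10_000  # hypothetical alignment length

scores = {model: bic(lnl, k, n_sites) for model, (lnl, k) in fits.items()}
best = min(scores, key=scores.get)
print(best, round(scores[best], 1))
```

In this toy case the richer model improves the likelihood slightly but not enough to offset its extra parameters, so the simpler of the two is selected.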

Phylogenetic analysis was conducted using the maximum likelihood (ML) method based on the best-fitted model of evolution as outlined in S1 Table. The bootstrap consensus tree was inferred from 1000 replicates [48]. Branches corresponding to partitions reproduced in less than 50% of bootstrap replicates were collapsed. Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Joining and BioNJ algorithms to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach, and then selecting the topology with the superior log likelihood value. A discrete Gamma distribution was used to model evolutionary rate differences among sites (4 categories). All three codon positions were included. Analyses were conducted in MEGA6 [47]. Trees were built for the entire plastome, 24 non-coding intergenic regions and 11 coding regions (including one intron), with separate analyses for the LSC, SSC and IR regions and for the tRNA and rRNA genes, in order to evaluate intragenomic variation in rates of molecular evolution, using Nicotiana tabacum (Solanaceae) as the outgroup.


Structure of Taraxacum chloroplasts

Circular plastomes were assembled from the whole genome sequence data (average plastid coverage >2000 fold for each accession). The chloroplasts of accessions O978 and S3 were identical and 151,322 bp long, while A978 was 151,349 bp long. Fig 1 shows the circular map for the A978 accession, with genes, short repeats, the major Inverted Repeats (IR1 and IR2; 24,431 bp; see Fig 2), and LSC/SSC regions (LSC 83,889bp and SSC 18,571bp in O978 and S3). GC content (blue graph) was higher than average in the 7kb of the Inverted Repeat regions adjacent to the SSC.
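The reported region sizes can be checked against the total plastome length, since a quadripartite plastome consists of the LSC, the SSC and two copies of the IR. A short Python check using only the figures given above for O978 and S3:

```python
# Region lengths in bp for O978 and S3, as reported in the text.
LSC = 83_889
SSC = 18_571
IR = 24_431  # one copy; the plastome carries two (IR1 and IR2)

total = LSC + SSC + 2 * IR
print(total)  # 151322

# Share of the plastome taken up by each region.
lsc_fraction = LSC / total
ir_fraction = 2 * IR / total
print(f"LSC {lsc_fraction:.1%}, IRs {ir_fraction:.1%}")
```

The sum matches the stated plastome length of 151,322 bp, and the LSC accounts for roughly 55% of the plastome.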

Fig 1. Map of the plastome of Taraxacum amplum (A978).

Genes are shown inside or outside the circle to indicate clockwise or counterclockwise transcription direction respectively. The Inverted Repeat (IR, 24,431 bp) is indicated by a thicker line for IR1 and IR2. GC content is shown in the inner blue graph. Small Single Copy (SSC) and Large Single Copy (LSC) regions are indicated, and the inverted regions (Inv1 and Inv2) within the LSC relative to other species are shown as orange arcs. Short tandem repeats (microsatellites and minisatellites) are indicated by blue dots, palindromes by red dots, forward repeats by green dots and reverse repeats by black dots.

Fig 2. Dot-plot sequence comparison of Taraxacum and Nicotiana chloroplast sequences, showing the Inverted Repeats (IR1 and IR2), hemi-nested inversions between the two plastomes (Inv1 and Inv2) and inversion of the SSC.

Chloroplast genome polymorphism between Taraxacum microspecies

Between the two Taraxacum plastomes, there were 28 SNPs (9 transversions and 19 transitions; Chi-square = 15.1; p = 0.0001), occurring in all regions of the plastome (13 in LSC, 13 in SSC and 2 in IRs; Table 1 and Fig 1). Two SNPs in LSC genes (rpoC1 and accD) were non-synonymous changes with the other 9 SNPs in genes being synonymous. There were 16 indels between 1 and 24 bp long, all but one occurring within the LSC region (the LSC representing 55% of the plastome; p<0.001; Table 1). A unique 22bp insertion, the duplicated 11bp motif TGTAGACATAA in an intron of the trnL-UAA gene, was present in accession A978 (S1 Fig). Overall, non-coding regions show a higher sequence divergence than coding regions in Taraxacum (Table 1). In the sequence alignment, the highest divergence was seen in regions including the intergenic spacer of trnH-psbA, trnK-rps16, rps16-trnQ, trnS-trnC, trnC-petN, rpoC2-rps2, psbZ-trnG, trnG-trnfM, ycf3-trnS, trnT-trnL, trnF-ndhJ, trnM-atpE, petB-petD, trnN-ycf1, ycf1-rps15, ndhD-ccsA, rpl32-ndhF, psbI-trnS, ndhF-ycf1 and ndhI-ndhG.
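The two significance values in this paragraph can be approximated with elementary tests. The Python sketch below assumes, for the SNPs, a null hypothesis in which each of the three possible substitutions per base is equally likely (one transition to two transversions), and, for the indels, a binomial model in which each indel falls in the LSC with probability 0.55. Both null models are assumptions made here for illustration; the text does not state which tests were used.

```python
import math


def chi_square_ts_tv(transitions: int, transversions: int,
                     p_transition: float = 1 / 3) -> tuple[float, float]:
    """Goodness-of-fit chi-square (1 d.f.) against an expected ts:tv ratio."""
    n = transitions + transversions
    exp_ts = n * p_transition
    exp_tv = n - exp_ts
    stat = ((transitions - exp_ts) ** 2 / exp_ts
            + (transversions - exp_tv) ** 2 / exp_tv)
    # Upper tail of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value


def binomial_upper_tail(k: int, n: int, p: float) -> float:
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(math.comb(n, i) * p ** i * (1 - p) ** (n - i)
               for i in range(k, n + 1))


stat, snp_p = chi_square_ts_tv(transitions=19, transversions=9)
indel_p = binomial_upper_tail(k=15, n=16, p=0.55)
print(f"chi-square = {stat:.2f}, p = {snp_p:.5f}")
print(f"P(at least 15 of 16 indels in LSC) = {indel_p:.5f}")
```

Under these assumptions the chi-square statistic comes out at about 15.0, close to the reported 15.1, and the indel probability falls just below 0.001.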

Table 1. Transition/transversion and insertion/deletion events between Taraxacum microspecies S3/O978 and A978; where indel occurs in a gene, the gene name is indicated; other indels are intergenic.

Gene content and arrangement were identical in all three sequenced Taraxacum plastomes. The plastome contains 135 unique genes, including a total of 81 protein-coding genes (plus 9 duplicated in IR), 4 rRNA (all duplicated in the IR) and 38 unique tRNA genes (one in the SSC region, 23 in the LSC region and 7 duplicated in the IR region) with two copies of the trnF-GAA gene in the LSC region and four rRNA genes in the IR region (Table 2; Fig 1). Within the IRs, there are 19 genes duplicated: all four rRNA, seven tRNA and eight protein-coding genes. Only the 5' end of the ycf1 gene (467 bp) and the 3' end of rps19 (67 bp) are present in the IRs, and the gene rps12 is trans-spliced, with the 5' exon in the LSC and the remaining two exons in the IRs (Fig 1). There are 18 different intron-containing genes (of which six are tRNA coding genes). All intron-containing genes have one intron, except two (ycf3, clpP) that contain two introns. The trnK-UUU gene had the largest intron (2,557 bp), with another gene, matK, located in it (Table 3). Sequences have been submitted to GenBank (GenBank accession numbers: KX499523, KX499524, KX499525), and the full raw reads from the three genotypes have been uploaded into SRA with BioSample accessions: SAMN05300515, SAMN05300516, SAMN05300517.

Table 3. Intron and exon sizes in genes in the Taraxacum plastome.

A total of 26233 codons in S3 and O978, and 26253 codons in A978 represent the coding regions of 90 protein-coding genes. Codon usage was biased towards A and T at the third codon position. Among the codons, serine (8.8% and 8.9% of O978, A978 respectively) and methionine (1.77% and 1.80% of O978, A978 respectively) are the most and the least abundant amino acids (S2 Table).
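Codon usage statistics of this kind come from splitting each coding sequence into triplets and tallying them. The Python sketch below counts codons and measures the fraction ending in A or T; the function names and the toy coding sequence are hypothetical and serve only to illustrate the calculation.

```python
from collections import Counter


def codon_counts(cds: str) -> Counter:
    """Count in-frame codons in a coding sequence (trailing partial codon ignored)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return Counter(cds[i:i + 3] for i in range(0, usable, 3))


def third_position_at_fraction(counts: Counter) -> float:
    """Fraction of codons whose third base is A or T."""
    total = sum(counts.values())
    at = sum(n for codon, n in counts.items() if codon[2] in "AT")
    return at / total if total else 0.0


# Toy coding sequence (hypothetical): ATG TCT AAA GGT TCA TAA
toy_cds = "ATGTCTAAAGGTTCATAA"
counts = codon_counts(toy_cds)
at3 = third_position_at_fraction(counts)
print(sum(counts.values()), round(at3, 3))
```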

Investigation of the repeats present in the Taraxacum plastome showed five main types (complement, forward, reverse, palindromic and tandem) (Fig 3, S3 Table). The most abundant were short repeats with motifs of 21–30 nucleotides, except among tandem repeats, where the most abundant motifs were only 10–20 nucleotides long. Comparison with Lactuca (DQ_383816) showed differences in both the types of repeats present and their lengths (Fig 3).

Fig 3. Repetitive motif abundance in Taraxacum (only A978 shown since the three accessions were similar) and Lactuca plastomes.

C = Complement repeats, P = Palindromic repeats, F = Forward repeats, R = Reverse repeats.

Comparison of chloroplast features between Taraxacum and 21 accessions of Asteraceae and phylogenetic analyses

Comparison of chloroplasts between Taraxacum and other Asteraceae (Table 4) showed no dramatic differences in the compared features (Fig 4, numerical data in S4 Table). The most prominent difference was in the number of genes, with Taraxacum, together with Helianthus annuus (S4 Table), having the highest gene content (136 genes) of all the compared species. Genome size, GC content and size of the LSC did not vary considerably, while the SSC was slightly larger in two taxa (Parthenium argentatum and Leontopodium leiolepis) and the IR was smaller in Ageratina adenophora and Praxelis clematidea (Fig 4).

Fig 4. A radar-plot comparing features of the plastomes of 21 accessions of Asteraceae, showing, from inside to out, sizes of major plastome regions, GC content, genome size and number of different types of genes.

Table 4. List of plastomes from GenBank used for comparison.

Based on comparison of sequences of whole plastomes, higher sequence divergence was present within non-coding regions. The most divergent coding regions between the Taraxacum plastomes and the other 18 Asteraceae plastomes were rpoC1, rpoC2, trnL, accD, clpP, psbB, ndhD, ycf1, ndhA, rps16 and ndhF (S2 Fig). Using the Maximum Likelihood method and, for each tree, the nucleotide substitution model with the minimum Bayesian information criterion (BIC) value from MEGA6 ([47]; S1 Table), 41 trees were produced. In all of them, the three Taraxacum microspecies appeared as a clade which usually (in 33 of the 41 trees) showed a well-supported sister group relationship to Lactuca. This is consistent with both genera belonging to subfamily Cichorioideae. Some DNA regions showed either a paraphyletic (rRNA, tRNA, trnG-trnfM, petA-psbJ, clpP) or polyphyletic (trnH-psbA, rpoC2-rps2, trnS-trnC) Cichorioideae (S3 Fig), but most of these were relatively short sequences. Species in subfamily Carduoideae (belonging to the genera Cynara, Centaurea and Carthamus) were often sister to Cichorioideae (in 17 of the 41 trees), but in several trees other groups showed this relationship.


Species in the Asteraceae family have contrasting evolutionary pressures from intense selection by people in agricultural and weedy species, with presumably relaxed selection in favourable niches, and there are some invasive species with genetic bottlenecks. The species also have various breeding systems including apomixis, sporophytic self-incompatibility, cleistogamy, wind and insect pollination, and there is interest in the use of more apomictic crop species. With whole plastome sequences and comparisons between families, it will be valuable to identify the nature of evolutionary processes in nuclear and cytoplasmic genomes with the different selection pressures, population structures and breeding systems. Here, we provide a brief discussion of the main features of the Taraxacum plastome gained from sequencing of whole chloroplasts in three apomictic accessions.

Chloroplast genome polymorphisms between Taraxacum microspecies and differentiation power of plastome sequences at low taxonomic level

The three apomictic accessions for which whole plastome sequences were generated in the present study belong to the group of common dandelions (generally called the T. officinale aggregate). Sequenced individuals represent agamospermous progeny of maternal plants genotyped by nuclear markers by Majeský et al. [32]. This genotyping showed two defined groups (OSP and AMP) and supported the presence of nine tight genetic clusters among the nine studied apomictic accessions (for details see Majeský et al. [32]). The genotyping agreed with the morphologically-based division of the accessions into separate apomictic microspecies (a taxonomic rank for apomictic taxa based on morphology). Of the three apomictic microspecies sequenced in the present study, two (O978, S3) belong to the OSP group and A978 belongs to the AMP group. Despite their clear and robust nuclear differentiation, sequencing of the chloroplast trnL–trnF intergenic spacer showed they shared the cp1a haplotype: haplotype cp1a (haplotype 18a in Wittzell [35]) is the most common (derived) haplotype, shared among a wide spectrum of different sections (dandelion groups) in Taraxacum [32, 34, 35]. This suggests haplotype cp1a might be derived from the most recent common ancestor of many derived Taraxacum sections.

Van der Hulst et al. ([49], their Fig 3) identified three Taraxacum chloroplast haplotypes in more than two plants (namely C1, C2 and C4), and found these were not restricted to single clades based on nuclear marker data (AFLPs (amplified fragment length polymorphisms) and microsatellites). They were neither monophyletic nor congruent with nuclear markers, thus negating the model that matrilineal markers would delimit nuclear marker data to matrilineal groups and thus detect clonal lineages. However, this study employed population-based sampling (randomly sampled individuals within a ‘park lawn’). In such a habitat, many different morphological clones (microspecies) of different origin coexist (see e.g. [50, 51]). In the case of apomicts, like Taraxacum, nuclear markers are able to delimit clonal lineages [32, 52] and the extent of a clonal lineage can be supported by matrilineal markers, although not unambiguously (e.g. see Majeský et al. [34]). However, the markers used consider only a small fraction of the whole chloroplast and inevitably cannot discover all differences within particular chloroplast lineages. Whole plastome sequencing of well-defined samples measured all plastome sequence variability among the three apomictic dandelions. The plastome sequences were identical in the two apomictic accessions O978 and S3, belonging to the same morphological group OSP, and differed by 27 bp in length, 28 SNPs and 16 indels from A978, belonging to the different AMP group (Table 1).

What do these results show about the relationship between the apomictic microspecies whose plastomes we sequenced, following the work of Majeský et al. [32]? Plastomes evolve at a slower rate than nuclear markers, as noted by Wolfe [53]. While nuclear markers showed genetic boundaries between the O and S sub-groups, whole chloroplast sequences did not. This may point to the young evolutionary age of the two microspecies (T. obtusifrons and T. stridulum): they have not accumulated any chloroplast mutations between each other and their most recent common ancestor. Morphologically, they are well-defined as separate morphological units [32] with a low number of observed genotypes within investigated individuals from the O and S microspecies: two (T. obtusifrons) and four (T. stridulum) multilocus genotypes were detected by six nuclear SSRs (simple sequence repeats) among 21 and 23 genotyped individuals, while AFLPs showed only one AFLP-phenotype among 10 fingerprinted individuals of both microspecies. Apomictic reproduction cuts off a lineage from genetic recombination, so an asexual lineage is expected to diverge rapidly as a result of accumulation of mutations and transposon activity, which become the major generators of diversity and drivers of genome evolution [54, 55].

Comparison of Taraxacum plastome with other genera

Sequence comparison of the plastome of Taraxacum with the reference Nicotiana tabacum [8] revealed hemi-nested inversions in the LSC region (Inv1: 21,737 bp in S3/O978 and 21,711 bp in A978; Inv2: 2,543 bp in S3/O978 and 2,542 bp in A978; Figs 1 and 2). The nested inversion ended just upstream of the trnE-UUC gene within the large inversion. The other end-point of the inversion is located between the trnC-GCA and rpoB genes (Fig 5). The inversion in the LSC (Inv1 and Inv2; [23, 40]) is conserved across all 21 Asteraceae chloroplast sequences. Liu et al. [56] suggested that the LSC inversion region has undergone inversion followed by reinversion in Asteraceae, and that this could be a particularly active region for sequence rearrangements in the plastome: the existence of within-species variation in the presence of this major inversion supports the hypothesis that this region is a hotspot for inversion events (Fig 5).

Fig 5. Comparative plastome maps.

Endpoints of the large 22 kb inversion present in most Asteraceae and of a small inversion (3.3 kb in other Asteraceae).

Another large inversion between N. tabacum and Taraxacum (Figs 2 and 6) is present between base pair positions 108,321 in S3 and O978 (108,358 in A978) and 126,891 in S3 and O978 (126,919 in A978); it is flanked by inverted repeats and encompasses the entire SSC region (18,571 bp in S3, 18,561 bp in A978) (Figs 2, 6 and 7). The SSC inversion from ndhF to rps15 is present in all of the Asteraceae lineages involved in this study except Artemisia frigida (NC_020607) [56], Artemisia montana (NC_025910), Carthamus tinctorius (KP404628) [57], Centaurea diffusa (NC_024286) [76] and one reported Lactuca sativa (NC_007578) [59].

Fig 6. Comparative plastome maps.

Gene order and inversion of the SSC region. Gene sequences were annotated and indicated along the black lines. Genes above the black lines indicate their transcription in reverse direction and genes below the black lines represent their transcription in forward direction.

Fig 7. Comparative plastome maps.

Border position of LSC, IR and SSC region among the 20 Asteraceae plastomes. Genes are indicated by coloured boxes.

Comparison of features of the plastomes of 21 accessions of Asteraceae showed overall similarity of chloroplasts across a wide spectrum of evolutionary lineages. There were no dramatic differences even among representatives of the three main subfamilies (Carduoideae, Cichorioideae, Asteroideae), which may underline the overall high stability of chloroplast features at lower taxonomic levels (S4 Table, Fig 4). The most remarkable difference was seen in the total number of tRNA and coding genes (S4 Table), with Lasthenia burkei being the taxon with the lowest number of genes (119 genes in total, 79 coding genes and 20 tRNA genes) compared with the three Taraxacum accessions (136 genes in total, 90 coding genes and 38 tRNA genes). Holmquist [60] considered that recombinogenic domains of chromosomes may be GC rich. Fig 1 shows that the GC content was lower in the SSC region flanked by IR1 and IR2, and higher in 7 kb (of the 24 kb) of the IR regions 1 kb away from the SSC border, with an evident spike from low GC at the end of both IRs; both ends of Inv1 had a low GC content. Thus, as found by Walker et al. [61], high GC content was not associated with inversion breakpoints in the plastome.
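Regional GC profiles such as the one plotted in Fig 1 are typically obtained by computing GC content in windows along the sequence. The Python sketch below shows one simple way to do this; the window and step sizes, the function names and the toy sequence are assumptions for illustration and do not reflect the settings used to draw Fig 1.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


def gc_windows(seq: str, window: int, step: int) -> list[tuple[int, float]]:
    """GC fraction per window, as (1-based window start, GC fraction) pairs."""
    return [(start + 1, gc_fraction(seq[start:start + window]))
            for start in range(0, len(seq) - window + 1, step)]


# Toy sequence (hypothetical): an AT-only stretch followed by a GC-only stretch.
toy_seq = "ATATATATGCGCGCGC"
profile = gc_windows(toy_seq, window=8, step=8)
print(profile)
```

Comparing window values at known coordinates, such as inversion endpoints or IR borders, is then a matter of looking up the windows that contain them.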

The number of direct (forward), reverse, palindromic and tandemly repeated sequence motifs of various length classes in Taraxacum, compared with Lactuca (DQ_383816), can be seen in Fig 3 (see also S3 Table). The notable difference was the increased frequency of direct repeats more than 50bp long in Lactuca, where there were 27 compared to none in Taraxacum (37 compared to 4 repeats >40bp long). Liu et al. [56] commented on variation in number and variety of repeats in the Asteraceae plastomes. Repeats have a role in plastome organization, but like Liu et al. [56], we found no correlation between large repeats and rearrangement endpoints. Our comparative repeat analysis showed considerable variation between even Taraxacum and Lactuca, with many more direct repeats of 40bp or more in Lactuca (Fig 3; 1% larger plastome than Taraxacum). Relationships of repeats and mutation have been considered in chloroplast genomes [58], although in the related Taraxacum plastomes, SNPs and non-repeat indels showed little relationship with repeats.

Phylogenetic utility of chloroplast regions

Polymorphisms between the two Taraxacum plastomes, and between Taraxacum and other Asteraceae, were found in many chloroplast regions widely used for phylogenetic analysis. The presence of two copies of the trnF-GAA gene in the LSC is unusual and would make this region difficult to use for phylogeny and diversity studies (S4 Fig). Duplication of the trnF-GAA gene was already encountered by Wittzell [35], who, based on sequence variation of the trnL-trnF region in a number of different Taraxacum taxa, provided support for the informal division of dandelions into evolutionarily older and evolutionarily younger (derived) taxa. The duplicated trnF gene is not specific to Taraxacum, but is also present in other Asteraceae species compared here, namely Carthamus tinctorius, Guizotia abyssinica, Ageratina adenophora, Praxelis clematidea and Lasthenia burkei (S4 Fig). Thus, duplication of the trnF-GAA gene probably occurred several (at least three or four) times independently in the three main Asteraceae subfamilies: Asteroideae, Cichorioideae, and Carduoideae.

All three investigated apomictic Taraxacum microspecies formed a separate clade sister to Lactuca in all phylogenetic analyses (S3 Fig). This was expected because Taraxacum and Lactuca belong to the same evolutionary lineage, the Cichorioideae, within the Asteraceae family (no other species of Cichorioideae was included). It is also in accordance with current knowledge of relationships within the subfamily [62]. Despite the close relationship of the two genera, Taraxacum represents a different evolutionary lineage (Crepidinae) from Lactuca (Lactucinae) [62]; according to Tremetsberger et al. [63], these lineages diverged during the Miocene, at least 16.2 MYA. Because of the low level of sequence divergence between the investigated Taraxacum accessions, and because these microspecies represent only a small part of the species known in the genus, it is not possible to draw conclusions about their evolutionary relationships. In some of the phylograms accession A978 appeared to be basal to O978/S3, but other phylograms did not support this, and the relationships between the plastomes remained unresolved. Whole plastome sequences clearly provide far more discriminating power than individual markers for phylogeny reconstruction. Deeper insight into the evolution of the genus Taraxacum will require wider sampling of more distinct taxa. Kirschner et al. [36] used a parsimony analysis of morphological and chloroplast data (two intergenic spacers, psbA-trnH + trnL-trnF) in Taraxacum to show an overall lack of congruence. They suggested the conflict was a consequence of reticulation affecting morphology (and presumably nuclear markers), a process unlikely for chloroplast genomes. The psbA-trnH intergenic spacer was among the most divergent plastome regions in our analyses (in the sense of sequence divergence between the two distinct plastomes, A978 versus O978/S3; one SNP and a 5bp indel; Table 1), but as noted above, no sequence variation was observed among the three investigated accessions for the trnL-trnF intergenic spacer.

Both the more conserved coding regions and the variable non-coding regions of the chloroplast genome have proved useful for phylogenetic studies [61, 64], with faster rates of evolution in non-coding regions; however, the data here show that care is needed in interpretations based on single regions, such as those amplified as ‘barcode’ markers. Some incongruences may arise where mutations are recurrent (similarities that are not identical by descent), although rare male chloroplast transmission (e.g. [65, 66]) and recombination events cannot be ruled out.

It is important to select marker sequences which have a rate of evolution that is appropriate to the evolutionary distance of the accessions under analysis and the questions being addressed [67]. Walker et al. [61] have pointed out that rates of molecular evolution vary over the plastome, particularly in non-coding regions. Here, two of the plastomes, from accessions which are in well-defined clades based on morphology and nuclear DNA markers, were identical: without the full plastome sequence, there would always have been questions about whether the plastome markers we happened to use were appropriate. It was also evident that the most frequently used chloroplast markers (including trnL-trnF and matK) showed few polymorphisms, both between the O/S and A Taraxacum plastomes and for positioning Taraxacum with respect to other species.

Taraxacum microspecies, and of the species in the Cichorieae. This would enable comparisons of evolutionary rates of sexual and apomictic species, and between nuclear and plastome sequences. Tremetsberger et al. [63] used fossil-calibration based on pollen and a nuclear sequence to estimate divergence between species in the group, but the prehistoric and fossil record for the majority of the Asteraceae, including Taraxacum, is poor [63, 68, 69].


We expect whole genome sequencing [61] to be used increasingly for taxonomy and systematics, and for within-species biodiversity, population, phylogenetic and evolutionary projects. With the total cellular DNA used here, without enrichment for chloroplast sequences, 3.5 to 4% of reads mapped to the chloroplast (400 unreplicated plastomes per 1C (unreplicated haploid) nuclear genome), allowing robust assembly including the duplications and inversions. Even with automation, PCR amplification and sequencing of multiple regions of chloroplast and nuclear genomes is time-consuming and requires optimization, while whole plastome sequencing requires only DNA extraction and a service provider. Analysis and interpretation of whole-genome-sequencing results is, however, not yet optimized or routine.
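The link between the fraction of reads mapping to the chloroplast and the number of plastome copies per nuclear genome can be made explicit with a short calculation. The sketch below assumes that reads sample total DNA in proportion to sequence length and ignores mitochondrial DNA; the function name is hypothetical, and the nuclear genome size in the example is a placeholder chosen for illustration, not a value reported in this study.

```python
def plastome_copies_per_1c(read_fraction: float, plastome_bp: int, nuclear_1c_bp: float) -> float:
    """Estimate plastome copies (n) per 1C nuclear genome.

    Solves read_fraction = n * plastome_bp / (n * plastome_bp + nuclear_1c_bp) for n.
    """
    if not 0.0 < read_fraction < 1.0:
        raise ValueError("read_fraction must be strictly between 0 and 1")
    return read_fraction * nuclear_1c_bp / ((1.0 - read_fraction) * plastome_bp)


# Plastome length of O978 from this study; the 1C genome size is an ASSUMED placeholder.
PLASTOME_BP = 151_322
ASSUMED_NUCLEAR_1C_BP = 1.5e9  # hypothetical value, for illustration only

for fraction in (0.035, 0.04):
    copies = plastome_copies_per_1c(fraction, PLASTOME_BP, ASSUMED_NUCLEAR_1C_BP)
    print(f"{fraction:.3f} of reads -> about {copies:.0f} plastome copies per 1C")
```

Because the estimate scales linearly with the nuclear genome size, any uncertainty in that value carries over directly to the copy number.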

In the current study, we sequenced the full chloroplast genomes of three well-characterized apomictic Taraxacum microspecies. We provide fully annotated plastome sequences for the genus, which can be used in a diverse spectrum of further comparative analyses and provide a reference plastome for primer design in taxonomic and phylogenetic studies of the genus. We also showed low sequence divergence between the investigated apomictic taxa, which points to their recent origin (probably post-Pleistocene). The sequenced plastome (A978) may represent the most common recent chloroplast type involved in the origin of many evolutionarily younger Taraxacum taxa.

Supporting information

S1 Fig. Alignment of trnL-UAA sequences from 19 Asteraceae species, including the two Taraxacum plastomes (A978 and O978) sequenced in the present study.

Arrowhead indicates a 22bp insertion in A978 with respect to O978 and other species.


S2 Fig. Comparison of plastome sequences of 18 Asteraceae accessions, two Taraxacum plastomes generated in this study and 16 previously reported plastomes, using the mVISTA program.

The Y-scale represents the percent identity, ranging from 50 to 100%. Arrows above the graphs indicate the direction of transcription.


S3 Fig. Phylogenetic trees derived from maximum likelihood analysis of alignments of DNA sequences of 21 different Asteraceae species, for the whole plastome and 40 different chloroplast regions as indicated below the trees.

Numbers above nodes are bootstrap support values.


S4 Fig. Alignment of trnF-GAA sequence of investigated Asteraceae.


S1 Table. Maximum Likelihood fits of 24 different nucleotide substitution models for 22 accessions using the whole chloroplast genome plus 40 genic and inter-genic regions.

Evolutionary analyses were conducted in MEGA6 [47]. Models with the lowest BIC scores (Bayesian Information Criterion) are considered to describe the substitution pattern the best, and were used for the trees in S3 Fig. As noted in MEGA6, “non-uniformity of evolutionary rates among sites may be modelled by using a discrete Gamma distribution (+G) with 5 rate categories and by assuming that a certain fraction of sites are evolutionarily invariable (+I). Whenever applicable, estimates of gamma shape parameter and/or the estimated fraction of invariant sites are shown. For estimating ML values, a tree topology was automatically computed. All positions with less than 95% site coverage were eliminated. That is, fewer than 5% alignment gaps, missing data, and ambiguous bases were allowed at any position.” There were a total of 136267 positions in the whole genome dataset, and the number of positions in the separate alignments for each region is shown (total number of positions in the dataset). Abbreviations: GTR: General Time Reversible; HKY: Hasegawa-Kishino-Yano; TN93: Tamura-Nei; T92: Tamura 3-parameter; K2: Kimura 2-parameter; JC: Jukes-Cantor.
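For readers unfamiliar with the criterion, the Bayesian Information Criterion is BIC = k ln(n) - 2 lnL, where lnL is the maximized log-likelihood, k the number of estimated parameters and n the number of sites. The following sketch shows how the lowest-BIC model would be picked from a set of fits. The function names are hypothetical and the log-likelihoods and parameter counts are made-up numbers for illustration only; they are not values from S1 Table, and MEGA6 reports its own parameter counts.

```python
import math


def bic(log_likelihood: float, n_params: int, n_sites: int) -> float:
    """Bayesian Information Criterion: k * ln(n) - 2 * lnL (lower is better)."""
    return n_params * math.log(n_sites) - 2.0 * log_likelihood


def best_model(fits: dict, n_sites: int) -> str:
    """Return the model name with the lowest BIC; fits maps name -> (lnL, n_params)."""
    return min(fits, key=lambda name: bic(fits[name][0], fits[name][1], n_sites))


# Made-up fits (NOT from S1 Table): the richer model gains 10 log-likelihood units
# at the cost of 9 extra parameters.
toy_fits = {"simple": (-1000.0, 10), "rich": (-990.0, 19)}
print(best_model(toy_fits, n_sites=136267))
```

With a large number of sites the penalty term k ln(n) grows, so a richer model must improve the likelihood substantially before it is preferred.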


S2 Table. Codon usage and codon-anticodon recognition pattern of the 21 Asteraceae plastomes calculated by

Absolute numbers, values recalculated as per mille (1/1000) and proportions are shown; a heat map gives the relative usage of each codon.


S3 Table. Repetitive motif abundance in Taraxacum and Lactuca plastomes computed by REPuter and Tandem Repeats Finder.


S4 Table. Characteristics of plastomes of 21 different accessions of 16 Asteraceae genera.


Author Contributions

  1. Conceptualization: RHMS LM TS RG PHH.
  2. Data curation: RHMS LM TS RG PHH.
  3. Formal analysis: RHMS LM TS RG PHH.
  4. Funding acquisition: RHMS.
  5. Methodology: RHMS LM TS RG PHH.
  6. Resources: LM.
  7. Supervision: RHMS LM TS RG PHH.
  8. Validation: RHMS LM TS RG PHH.
  9. Writing – original draft: RHMS LM TS RG PHH.
  10. Writing – review & editing: RHMS LM TS RG PHH.

References


  1. Palmer JD, Thompson WF. Chloroplast DNA rearrangements are more frequent when a large inverted repeat sequence is lost. Cell. 1982; 29: 537–550. pmid:6288261
  2. Jansen RK, Raubeson LA, Boore JL, Chumley TW, Haberle RC, Wyman SK, et al. Methods for obtaining and analyzing whole chloroplast genome sequences. Methods Enzymol. 2005; 395: 348–384. pmid:15865976
  3. Hollingsworth PM, Forrest LL, Spouge JL, Hajibabaei M, Ratnasingham S, van der Bank M, et al. A DNA barcode for land plants. Proc Natl Acad Sci U S A. 2009; 106: 12794–12797. pmid:19666622
  4. Birky CW Jr. The inheritance of genes in mitochondria and chloroplasts: laws, mechanisms, and models. Annu Rev Genet. 2001; 35: 125–148. pmid:11700280
  5. Stegemann S, Keuthe M, Greiner S, Bock R. Horizontal transfer of chloroplast genomes between plant species. Proc Natl Acad Sci U S A. 2012; 109: 2434–2438. pmid:22308367
  6. Wang ZH, Peng H, Kilian N. Molecular phylogeny of the Lactuca alliance (Cichorieae subtribe Lactucinae, Asteraceae) with focus on their Chinese centre of diversity detects potential events of reticulation and chloroplast capture. PLoS One. 2013; 8: e82692. pmid:24376566
  7. Moore MJ, Soltis PS, Bell CD, Burleigh JG, Soltis DE. Phylogenetic analysis of 83 plastid genes further resolves the early diversification of eudicots. Proc Natl Acad Sci U S A. 2010; 107: 4623–4628. pmid:20176954
  8. Shinozaki K, Ohme M, Tanaka M, Wakasugi T, Hayashida N, Matsubayashi T, et al. The complete nucleotide sequence of the tobacco chloroplast genome: its gene organization and expression. EMBO J. 1986; 5: 2043. pmid:16453699
  9. Jansen RK, Cai Z, Raubeson LA, Daniell H, Leebens-Mack J, Müller KF, et al. Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns. Proc Natl Acad Sci U S A. 2007; 104: 19369–19374. pmid:18048330
  10. Moore MJ, Bell CD, Soltis PS, Soltis DE. Using plastid genome-scale data to resolve enigmatic relationships among basal angiosperms. Proc Natl Acad Sci U S A. 2007; 104: 19363–19368. pmid:18048334
  11. Parks M, Cronn R, Liston A. Increasing phylogenetic resolution at low taxonomic levels using massively parallel sequencing of chloroplast genomes. BMC Biol. 2009; 7: 1.
  12. Wojciechowski MF. IRLC (Inverted Repeat Lacking Clade). Version 11 July 2006: 11. The tree of life web project, 2006.
  13. Raubeson LA, Jansen RK. A rare chloroplast-DNA structural mutation is shared by all conifers. Biochem Syst Ecol. 1992; 20: 17–24.
  14. Luo J, Hou BW, Niu ZT, Liu W, Xue QY, Ding XY. Comparative chloroplast genomes of photosynthetic orchids: insights into evolution of the Orchidaceae and development of molecular markers for phylogenetic applications. PLoS One. 2014; 9: e99016. pmid:24911363
  15. Bruni I, De Mattia F, Galimberti A, Galasso G, Banfi E, Casiraghi M, Labra M. Identification of poisonous plants by DNA barcoding approach. Int J Legal Med. 2010; 124: 595–603. pmid:20354712
  16. Bruni I, Galimberti A, Caridi L, Scaccabarozzi D, De Mattia F, Casiraghi M, Labra M. A DNA barcoding approach to identify plant species in multiflower honey. Food Chem. 2015; 170: 308–15. pmid:25306350
  17. Hollingsworth PM, Li DZ, van der Bank M, Twyford AD. Telling plant species apart with DNA: from barcodes to genomes. Phil Trans R Soc B. 2016; 371: 20150338. pmid:27481790
  18. Dong W, Liu J, Yu J, Wang L, Zhou S. Highly variable chloroplast markers for evaluating plant phylogeny at low taxonomic levels and for DNA barcoding. PLoS One. 2012; 7: e35071. pmid:22511980
  19. Shaw J, Shafer HL, Leonard OR, Kovach MJ, Schorr M, Morris AB. Chloroplast DNA sequence utility for the lowest phylogenetic and phylogeographic inferences in angiosperms: The tortoise and the hare IV. Am J Bot. 2014; 101: 1987–2004. pmid:25366863
  20. Zehdi-Azouzi S, Cherif E, Moussouni S, Gros-Balthazard M, Naqvi SA, Ludeña B, et al. Genetic structure of the date palm (Phoenix dactylifera) in the Old World reveals a strong differentiation between eastern and western populations. Ann Bot. 2015; 116: 101–12. pmid:26113618
  21. Särkinen T, George M. Predicting plastid marker variation: can complete plastid genomes from closely related species help? PLoS One. 2013; 8: e82266. pmid:24312409
  22. Wang M, Cui L, Feng K, Deng P, Du X, Wan F, et al. Comparative analysis of Asteraceae chloroplast genomes: Structural organization, RNA editing and evolution. Plant Mol Biol Report. 2015; 33: 1526–1538.
  23. Kim KJ, Choi KS, Jansen RK. Two chloroplast DNA inversions originated simultaneously during the early evolution of the sunflower family (Asteraceae). Mol Biol Evol. 2005; 22: 1783–1792. pmid:15917497
  24. Asker S, Jerling L. Apomixis in plants. CRC Press; 1992.
  25. Mendel G. Versuche über Pflanzenhybriden. Verhandlungen des naturforschenden Vereines in Brünn. 1866; 4: 44.
  26. Richards AJ. The origin of Taraxacum agamospecies. Bot J Linn Soc. 1973; 66: 189–211.
  27. Mogie M, Ford H. Sexual and asexual Taraxacum species. Bot J Linn Soc. 1988; 35: 155–168.
  28. King LM, Schaal BA. Genotypic variation within asexual lineages of Taraxacum officinale. Proc Natl Acad Sci U S A. 1990; 87: 998–1002. pmid:2300590
  29. Nogler GA. Gametophytic apomixis. Springer; 1984.
  30. Kirschner J, Drábková LZ, Štěpánek J, Uhlemann I. Towards a better understanding of the Taraxacum evolution (Compositae–Cichorieae) on the basis of nrDNA of sexually reproducing species. Plant Syst Evol. 2015; 301: 1135–56.
  31. Van der Hulst RG, Mes TH, Den Nijs JC, Bachmann K. Amplified fragment length polymorphism (AFLP) markers reveal that population structure of triploid dandelions (Taraxacum officinale) exhibits both clonality and recombination. Mol Ecol. 2000; 9: 1–8. pmid:10652071
  32. Majeský Ľ, Vašut RJ, Kitner M, Trávníček B. The pattern of genetic variability in apomictic clones of Taraxacum officinale indicates the alternation of asexual and sexual histories of apomicts. PLoS One. 2012; 7: e41868. pmid:22870257
  33. Mes TH, Kuperus P, Kirschner J, Štepánek J, Štorchová H, Oosterveld P, et al. Detection of genetically divergent clone mates in apomictic dandelions. Mol Ecol. 2002; 11: 253–265. pmid:11856426
  34. Majeský Ľ, Vašut RJ, Kitner M. Genotypic diversity of apomictic microspecies of the Taraxacum scanicum group (Taraxacum sect. Erythrosperma). Plant Syst Evol. 2015; 301: 2105–24.
  35. Wittzell H. Chloroplast DNA variation and reticulate evolution in sexual and apomictic sections of dandelions. Mol Ecol. 1999; 8: 2023–2035. pmid:10632854
  36. Kirschner J, Štěpánek J, Mes TH, Den Nijs JC, Oosterveld P, Štorchová H, et al. Principal features of the cpDNA evolution in Taraxacum (Asteraceae, Lactuceae): a conflict with taxonomy. Plant Syst Evol. 2003; 239: 231–255.
  37. Mes TH, Kuperus P, Kirschner J, Stepanek J, Oosterveld P, Storchova H, et al. Hairpins involving both inverted and direct repeats are associated with homoplasious indels in non-coding chloroplast DNA of Taraxacum (Lactuceae: Asteraceae). Genome. 2000; 43: 634–641. pmid:10984175
  38. Doyle JJ. Isolation of plant DNA from fresh tissue. Focus. 1990; 12: 13–15.
  39. Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, et al. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 2012; 28: 1647–1649. pmid:22543367
  40. Timme RE, Kuehl JV, Boore JL, Jansen RK. A comparative analysis of the Lactuca and Helianthus (Asteraceae) plastid genomes: identification of divergent regions and categorization of shared repeats. Am J Bot. 2007; 94: 302–312. pmid:21636403
  41. Wyman SK, Jansen RK, Boore JL. Automatic annotation of organellar genomes with DOGMA. Bioinformatics. 2004; 20: 3252–3255. pmid:15180927
  42. Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997; 25: 955–964. pmid:9023104
  43. Conant GC, Wolfe KH. GenomeVx: simple web-based creation of editable circular chromosome maps. Bioinformatics. 2008; 24: 861–862. pmid:18227121
  44. Kurtz S, Choudhuri JV, Ohlebusch E, Schleiermacher C, Stoye J, Giegerich R. REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 2001; 29: 4633–4642. pmid:11713313
  45. Benson G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 1999; 27: 573. pmid:9862982
  46. Frazer KA, Pachter L, Poliakov A, Rubin EM, Dubchak I. VISTA: computational tools for comparative genomics. Nucleic Acids Res. 2004; 32: W273–279. pmid:15215394
  47. Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013; 30: 2725–2729. pmid:24132122
  48. Felsenstein J. Confidence limits on phylogenies: an approach using the bootstrap. Evolution. 1985; 39: 783–791.
  49. Van der Hulst RG, Mes TH, Falque M, Stam P, Den Nijs JC, Bachmann K. Genetic structure of a population sample of apomictic dandelions. Heredity. 2003; 90: 326–335. pmid:12692586
  50. Ford H. Life history strategies in two coexisting agamospecies of dandelion. Biol J Linn Soc. 1985; 25: 169–186.
  51. Richards AJ. Plant breeding systems. George Allen & Unwin; 1986.
  52. Kirschner J, Oplaat C, Verhoeven KJ, Zeisek V, Uhlemann I, Trávníček B, Räsänen J. Identification of oligoclonal agamospermous microspecies: taxonomic specialists versus microsatellites. Preslia. 2016; 88: 1–7.
  53. Wolfe KH, Li WH, Sharp PM. Rates of nucleotide substitution vary greatly among plant mitochondrial, chloroplast, and nuclear DNAs. Proc Natl Acad Sci U S A. 1987; 84: 9054–9058. pmid:3480529
  54. Richards AJ. A comparison of within-plant karyological heterogeneity between agamospermous and sexual Taraxacum (Compositae) as assessed by the nucleolar organiser chromosome. Plant Syst Evol. 1989; 163: 177–185.
  55. Heslop-Harrison JP, Brandes A, Taketa S, Schmidt T, Vershinin AV, Alkhimova EG, et al. The chromosomal distributions of Ty1-copia group retrotransposable elements in higher plants and their implications for genome evolution. Genetica. 1997; 100: 197–204. pmid:9440273
  56. Liu Y, Huo N, Dong L, Wang Y, Zhang S, Young HA, et al. Complete chloroplast genome sequences of Mongolia medicine Artemisia frigida and phylogenetic relationships with other plants. PLoS One. 2013; 8: e57533. pmid:23460871
  57. Lu C, Shen Q, Yang J, Wang B, Song C. The complete chloroplast genome sequence of Safflower (Carthamus tinctorius L.). Mitochondrial DNA A DNA Mapp Seq Anal. 2015; Mar 5: 1–3.
  58. Ahmed I, Biggs PJ, Matthews PJ, Collins LJ, Hendy MD, Lockhart PJ. Mutational dynamics of aroid chloroplast genomes. Genome Biol Evol. 2012; 4: 1316–1323. pmid:23204304
  59. Kanamoto H, Yamashita A, Okumura S, Hattori M, Tomizawa KI. The complete genome sequence of the Lactuca sativa (lettuce) chloroplast. Plant Cell Physiol. 2004; 45: S39–S39.
  60. Holmquist GP. Chromosome bands, their chromatin flavors, and their functional features. Am J Hum Genet. 1992; 51: 17. pmid:1609794
  61. Walker JF, Zanis MJ, Emery NC. Comparative analysis of complete chloroplast genome sequence and inversion variation in Lasthenia burkei (Madieae, Asteraceae). Am J Bot. 2014; 101: 722–729. pmid:24699541
  62. Kilian N, Gemeinholzer B, Lack H. Cichorieae. Systematics, evolution, and biogeography of Compositae. 2009: 343–383.
  63. Tremetsberger K, Gemeinholzer B, Zetzsche H, Blackmore S, Kilian N, Talavera S. Divergence time estimation in Cichorieae (Asteraceae) using a fossil-calibrated relaxed molecular clock. Org Divers Evol. 2013; 13: 1–3.
  64. Nie X, Lv S, Zhang Y, Du X, Wang L, Biradar SS, Tan X, Wan F, Weining S. Complete chloroplast genome sequence of a major invasive species, crofton weed (Ageratina adenophora). PLoS One. 2012; 7: e36869. pmid:22606302
  65. McCauley DE, Sundby AK, Bailey MF, Welch ME. Inheritance of chloroplast DNA is not strictly maternal in Silene vulgaris (Caryophyllaceae): evidence from experimental crosses and natural populations. Am J Bot. 2007; 94: 1333–1337. pmid:21636500
  66. Ellis JR, Bentley KE, McCauley DE. Detection of rare paternal chloroplast inheritance in controlled crosses of the endangered sunflower Helianthus verticillatus. Heredity. 2008; 100: 574–80. pmid:18301440
  67. Saeidi H, Rahiminejad MR, Vallian S, Heslop-Harrison JS. Biodiversity of diploid D-genome Aegilops tauschii Coss. in Iran measured using microsatellites. Genet Resour Crop Evol. 2006; 53: 1477–1484.
  68. Sterk AA, Hommels CH, Jenniskens MJPJ, Neuteboom JH, den Nijs JCM, Oosterveld P, et al. Paardebloemen: planten zonder vader. Variatie, evolutie en toepassingen van het geslacht paardebloem (Taraxacum). Utrecht: KNNV; 1987: 184–189.
  69. Richards AJ. The origin of Taraxacum agamospecies. J Linn Soc Bot. 1973; 66: 189–211.
  70. Dempewolf H, Kane NC, Ostevik KL, Geleta M, Barker MS, Lai Z, Stewart ML, Bekele E, Engels JM, Cronk QC, Rieseberg LH. Establishing genomic tools and resources for Guizotia abyssinica (Lf) Cass.–the development of a library of expressed sequence tags, microsatellite loci, and the sequencing of its chloroplast genome. Mol Ecol Resour. 2010; 10: 1048–58. pmid:21565115
  71. Kumar S, Hahn FM, McMahan CM, Cornish K, Whalen MC. Comparative analysis of the complete sequence of the plastid genome of Parthenium argentatum and identification of DNA barcodes to differentiate Parthenium species and lines. BMC Plant Biol. 2009; 17: 1.
  72. Liu PL, Wan Q, Guo YP, Yang J, Rao GY. Phylogeny of the genus Chrysanthemum L.: evidence from single-copy nuclear gene and chloroplast DNA sequences. PLoS One. 2012; 7: e48970. pmid:23133665
  73. Choi KS, Park S. The complete chloroplast genome sequence of Aster spathulifolius (Asteraceae); genomic features and relationship with Asteraceae. Gene. 2015; 572: 214–221. pmid:26164759
  74. Doorduin L, Gravendeel B, Lammers Y, Ariyurek Y, Chin-A-Woeng T, Vrieling K. The complete chloroplast genome of 17 individuals of pest species Jacobaea vulgaris: SNPs, microsatellites and barcoding markers for population and phylogenetic studies. DNA Res. 2011; 28: dsr002.
  75. Zhang Y, Li L, Yan TL, Liu Q. Complete chloroplast genome sequences of Praxelis (Eupatorium catarium Veldkamp), an important invasive species. Gene. 2014; 549: 58–69. pmid:25042453
  76. Turner KG, Grassa CJ. Complete plastid genome assembly of invasive plant, Centaurea diffusa. bioRxiv. 2014; 1: 005900.
  77. Curci PL, De Paola D, Danzi D, Vendramin GG, Sonnante G. Complete chloroplast genome of the multifunctional crop globe artichoke and comparison with other Asteraceae. PLoS One. 2015; 10: e0120589. pmid:25774672