The early-diverging eudicot order Trochodendrales contains only two monospecific genera, Tetracentron and Trochodendron. Although an extensive fossil record indicates that the clade is perhaps 100 million years old and was widespread throughout the Northern Hemisphere during the Paleogene and Neogene, the two extant genera are both narrowly distributed in eastern Asia. Recent phylogenetic analyses strongly support a clade of Trochodendrales, Buxales, and Gunneridae (core eudicots), but complete plastome analyses do not resolve the relationships among these groups with strong support. However, plastid phylogenomic analyses have not included data for Tetracentron. To better resolve basal eudicot relationships and to clarify when the two extant genera of Trochodendrales diverged, we sequenced the complete plastid genome of Tetracentron sinense using Illumina technology. The Tetracentron and Trochodendron plastomes possess the typical gene content and arrangement that characterize most angiosperm plastid genomes, but both genomes have the same unusual ~4 kb expansion of the inverted repeat region to include five genes (rpl22, rps3, rpl16, rpl14, and rps8) that are normally found in the large single-copy region. Maximum likelihood analyses of an 83-gene, 88 taxon angiosperm data set yield an identical tree topology as previous plastid-based trees, and moderately support the sister relationship between Buxaceae and Gunneridae. Molecular dating analyses suggest that Tetracentron and Trochodendron diverged between 44-30 million years ago, which is congruent with the fossil record of Trochodendrales and with previous estimates of the divergence time of these two taxa. We also characterize 154 simple sequence repeat loci from the Tetracentron sinense and Trochodendron aralioides plastomes that will be useful in future studies of population genetic structure for these relict species, both of which are of conservation concern.
Citation: Sun Y-x, Moore MJ, Meng A-p, Soltis PS, Soltis DE, Li J-q, et al. (2013) Complete Plastid Genome Sequencing of Trochodendraceae Reveals a Significant Expansion of the Inverted Repeat and Suggests a Paleogene Divergence between the Two Extant Species. PLoS ONE 8(4): e60429. doi:10.1371/journal.pone.0060429
Editor: Jonathan H. Badger, J. Craig Venter Institute, United States of America
Received: December 15, 2012; Accepted: February 26, 2013; Published: April 5, 2013
Copyright: © 2013 Sun et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This research was supported by Knowledge Innovation Project of Chinese Academy of Sciences (KSCX2-EW-J-20), National Natural Science Foundation of China grant (31070191) and U.S. National Science Foundation grant (ER-0431266). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
The eudicot order Trochodendrales  contains only two extant genera, both of which are monotypic: Trochodendron Sieb. & Zucc. and Tetracentron Oliver. Historically, these two genera have been treated either as the separate families Trochodendraceae and Tetracentraceae, or as the combined family Trochodendraceae –. The Trochodendraceae sensu APG III  appear to have been widespread in the Northern Hemisphere during the Paleogene and Neogene –. However, the two extant species of the family have small geographic ranges and are restricted to eastern Asia . Trochodendron aralioides Sieb. & Zucc. is a large, evergreen shrub or small tree native to the mountains of Japan to South Korea and Taiwan, and the Ryukyu Islands , , whereas Tetracentron sinense Oliver is a deciduous tree occurring in southwestern and central China and the eastern Himalayan regions. Both species are characterized by apetalous flowers arranged in cymose inflorescences and by loculicidal capsules that dehisce to release winged seeds , , , . Although earlier researchers reported that wood of Trochodendrales wood lacked vessels and thus suggested that Trochodendrales were among the earliest-diverging angiosperms, recent research has documented the presence of vessels in the wood of both genera , , .
Molecular phylogenetic studies, including analyses of complete plastid genome sequences, have routinely recovered Trochodendrales as an early-diverging member of the clade Eudicotyledoneae (sensu ; all italicized clade names follow this system), specifically as part of a strongly supported clade with Buxales and Gunneridae, or core eudicots –. However, the relationships among Trochodendrales, Buxales, and Gunneridae have often been only weakly supported. In the 17-gene analysis of Soltis et al. , which included data from all three plant genomes, Trochodendrales and Buxales were subsequent sisters to Gunneridae, with 100% and 98% BS support, respectively. However, other studies have found Buxales to be sister to Gunneridae with only weak support , , –, whereas in other analyses Trochodendrales have appeared as sister to Gunneridae , –.
Complete plastid genome sequences have been used increasingly over the past decade to resolve deep-level phylogenetic relationships that have been unclear based on only a few genes. For example, recent plastid phylogenomic studies have helped to resolve key relationships among the earliest-diverging Mesangiospermae  as well as early-diverging Eudicotyledoneae and Pentapetalae , . Indeed, the plastid genome represents an excellent source of characters for plant phylogenetics due to the generally strong conservation of plastid genome structure and its mix of sequence regions that vary tremendously in evolutionary rate –, which enable plastid genome sequence data to be applied to phylogenetic problems at almost any taxonomic level in plants , –. It is now relatively inexpensive to generate complete plastid genome sequence due to rapid improvements in next-generation sequencing (NGS) technologies , – and due to the relatively small size of the plastid genome (~150 kb) and its structural conservation, which enable dozens of plastomes to be multiplexed per sequencing lane and facilitate relatively straightforward genome assembly –.
Despite the promise of NGS technology for plastid genomics, the complete plastomes of only eight genera of early-diverging eudicots have been reported: Ranunculus (Ranunculaceae, Ranunculales), Megaleranthis (Ranunculaceae, Ranunculales), Nandina (Berberidaceae, Ranunculales), Nelumbo (Nelumbonaceae, Proteales), Platanus (Platanaceae, Proteales), Meliosma (Sabiaceae, Sabiales), Trochodendron (Trochodendraceae, Trochodendrales) and Buxus (Buxaceae, Buxales). Previous phylogenetic analyses based on some of these complete genomes have not fully resolved the relationships among early-diverging eudicots, however; in addition to the uncertainty surrounding relationships of Buxales, Trochodendrales, and Gunneridae, the positions of Sabiales and Proteales remain poorly supported –. Plastome taxon sampling is still sparse in these clades, however, and additional sampling may help elucidate these recalcitrant relationships.
In addition to their important role in phylogenetics, plastid genomes may be rich sources of population-level data. The non-recombination and uniparental inheritance of most plastid genomes can make plastid genomes extremely useful for population genetics, particularly for tracing maternal lineages –. For example, chloroplast simple sequence repeats (cpSSR) have been widely used in plant population genetics , including within early-diverging eudicots, where numerous cpSSR loci have been reported from the plastid genome of the endangered species Megaleranthis saniculifolia (Ranunculaceae) .
Here we report the complete plastid genome sequences of Tetracentron sinense and Trochodendron aralioides (the protein-coding and rRNA genes of Trochodendron cp genome were used for phylogenetic analyses in Moore et al. , but the cp genome structure of this genus has never been reported), as well as the results of new phylogenetic analyses based on adding Tetracentron and Megaleranthis genomes  to the 83-gene data set of Moore et al. . We also compare the plastid genome structure of Trochodendron and Tetracentron, including the characterization of a significant expansion of the inverted repeat in both taxa, and we estimate the divergence time between the two genera. Finally, we characterize the distribution and location of cpSSRs in both Tetracentron sinense and Trochodendron aralioides, which provided further opportunity to study the population genetic structures of these two ancient relict species.
Sequencing and Genome Assembly
Illumina paired-end sequencing produced 892.11 Mb of data for Tetracentron sinense. We obtained 9912310 raw reads of 90 bp in length. The N50 of contigs was 13,981 bp and the summed length of contigs was 143,709 bp. The mean coverage of this genome was 5424.2×. After de novo and reference-guided assembly, we obtained a cp genome containing nine gaps. PCR and Sanger sequencing were used for filling the gaps. Four junction regions between IRs and SSC/LSC were first determined based on de novo contigs, and subsequently confirmed by PCR amplifications and Sanger sequencing, sequenced results were compared with the assembled genome directly and no mismatch or indel was observed, which validated the accuracy of our assembly. The genome sequences of Tetracentron sinense and Trochodendron aralioides have been submitted to GenBank (GenBank IDs: KC608752 and KC608753).
General Features of the Tetracentron and Trochodendron Plastomes
The plastid genome size of Tetracentron sinense is 164,467 base pairs (bp) (Figure 1), and that of Trochodendron aralioides is 165,945 bp (Figure 2). Both genomes show typical quadripartite structure, consisting of two copies of an inverted repeat (IR) separated by the large single-copy (LSC) and small single-copy (SSC) regions (Table 1). The IR exhibits a significant expansion relative to most other angiosperms at the LSC/IR junction; specifically, the IR in both Tetracentron and Trochodendron has expanded to include the entirety of the rps19, rpl22, rps3, rpl16, rpl14, and rps8 genes (Figures 1, 2). The SSC/IR boundary occurs within the ycf1 gene, as is typical in angiosperms, but is slightly expanded in the Trochodendron genome to include 1461 bp of the 5′ end of ycf1 (versus 1083 bp in Tetracentron; Figure 3). This expansion of the IR at the SSC junction contributes to the difference in length between the two Trochodendrales plastomes; the remainder of the difference is largely the result of length differences among various noncoding regions (Table 2).
Both genomes contain 119 genes (79 protein-coding genes, 30 tRNA genes, and 4 rRNA genes) arranged in the same order, of which 24 are duplicated in the IR regions (Table 3). Sequence divergence between Tetracentron and Trochodendron in coding regions is low (Table 4, Figures 4, 5). Only 7 genes (rps11, rpoA, rpl32, rps16, ndhF, ycf1, and rpl36) exhibit divergences of more than 2%, and 12 genes have an identical sequence (Table 4, Figure 4). The genes ndhF, ycf1, and rpl36 have the highest sequence divergences (2.7%, 3.5% and 4.4%, respectively). The coding regions account for 57.5% and 57.3% of the Tetracentron and Trochodendron plastid genomes, respectively. For both cp genomes, single introns are present in 18 genes, whereas three genes (rps12, clpP, and ycf3) have two introns (Table 5). The overall genomic G/C nucleotide composition is 38.1% and 38.0% for Tetracentron and Trochodendron, respectively; detailed A/T contents of different regions of the plastome for both genomes are listed in Table 6. Due to the lower A/T content of the four rRNA genes, the IR regions possess lower A/T content than the single-copy regions.
Characterization of SSR Loci
In all, 154 SSR loci (77 each from Tetracentron sinense and Trochodendron aralioides) were detected in the two plastid genomes, of which 123 are mononucleotide repeats, 28 are dinucleotide repeats, two are trinucleotide repeats, and one is a tetranucleotide repeat (Table 7). Nearly all of the SSR loci are composed of A/T repeats (Table 7), and these SSR loci are mostly present in noncoding regions. The tetranucleotide locus identified in Tetracentron is in the first intron of ycf3. The two trinucleotide loci in Trochodendron are both located in the spacer region between trnK-UUU and rps16. The unique C mononucleotide repeat from Trochodendron is present in the trnV-ndhC intergenic spacer region.
Phylogenetic and Molecular Dating Analyses
ML analyses of the 83-gene, 88-taxon data set yielded a tree with a similar topology and bootstrap support (BS) values (Figure 6) as that of the plastid phylogenomic study of Moore et al. . The clades of Trochodendron+Tetracentron and Ranunculus+Megaleranthis were supported with 100% ML BS support. Trochodendrales are sister to the remaining angiosperms with high support (BS = 100%), but Buxaceae are sister to Gunneridae with only 67% BS support.
Numbers associated with branches are ML bootstrap support values. Error bars around nodes correspond to 95% highest posterior distributions of divergence times based on 6 fossils using the program BEAST. Eo = Eocene, Mi = Miocene, Ol. = Oligocene, Pa = Paleocene, Pl = Pliocene.
Molecular dating analyses suggest that Trochodendron and Tetracentron diverged between 44-30 million ago. The crown group 95% highest posterior density (HPD) age estimates for other major lineages of Pentapetalae were as follows: Superasteridae (115-109 mya), Dilleniaceae+Superrosidae (116-112 mya), Superrosidae (114-111 mya), Santalales (98-75 mya), Caryophyllales (76-60 mya), Asteridae (104-99 mya), Rosidae (111-108 mya), Vitaceae+Saxifragales (114-110 mya), and Saxifragales (109-107 mya).
Expansion of the IR Region in Trochodendrales Plastomes
The plastid genomes of Tetracentron and Trochodendron exhibit the typical gene content and genome structure of angiosperms , –, with the notable exception of a significantly expanded IR region (Figures 1, 2, 3). This ~4 kb expansion is responsible for the relatively large size of both Trochodendrales plastomes, which are ~4–5 kb larger than the typical upper size range of angiosperm plastid genomes, including those of nearly all other early-diverging eudicots (Table 8). Significant expansion, contraction, and even loss of the IR appears to be an evolutionarily uncommon phenomena but are nonetheless associated with much of the more significant variation in plastome size in angiosperms. For example, the largest known angiosperm plastome, that of Pelargonium x hortorum, also possesses the largest known IR, at ~76 kb in length . Other significant IR expansions and contractions have been found in Campanulaceae –, Apiaceae , and Lemna (Araceae) .
Impact of Additional Taxon Sampling on Basal Eudicot Phylogeny
The inclusion of Megaleranthis and Tetracentron in our analyses had no effect on the relationships among the major early-diverging eudicot lineages, and very little effect on support values. Of the basal splits among the eudicots with BS values less than 100% in both the current tree and that of Moore et al. , all were within 3% BS value. For example, the sister relationship of Buxales and Gunneridae is 70% in Moore et al.  vs. 67% with the inclusion of Megaleranthis and Tetracentron, and the sister relationship of Sabiales and Proteales has BS support of 80% in Moore et al.  vs. 83% in the current analyses. These similar values are unsurprising given that Tetracentron and Trochodendron are found to be relatively closely related in our analyses. Indeed, the relatively low sequence divergence between the Tetracentron and Trochodendron plastid genomes supports the taxonomic placement of Tetracentraceae within Trochodenraceae, as advocated by APG III . Although it is possible that the addition of the noncoding regions of the plastid genome (or at least those noncoding regions that can be aligned) to our data set may improve support for these relationships, we may have to look to the other plant genomes for a confident resolution of relationships among the early-diverging eudicots. In fact, the sister relationship of Buxales and Gunneridae received high support (BS = 98%) in the 17-gene analyses of Soltis et al. , which employed a combination of 11 plastid genes, 18S and 26S nuclear rDNA, and 4 mitochondrial genes. However, the sister relationship of Sabiales and Proteales were more poorly supported (BS = 59%) in Soltis et al. .
Divergence Time Between Tetracentron and Trochodendron
Cenozoic Trochodendrales fossils are known throughout the Northern Hemisphere, with the Paleocene Nordenskioldia the earliest certain fossil of the order –. Both Tetracentron and Trochodendron had wide distributions in the Northern Hemisphere during the Paleogene and Neogene. Fossil remains of Tetracentron have been found in Japan –, Idaho , Princeton, British Columbia and Republic, Washington , and Iceland ; Trochodendron fossil remains have been reported from Kamchatka , Japan , Idaho and Oregon –, Washington , and British Columbia . Our estimate of the divergence time between the two genera of Trochodendraceae (44-30 mya) encompasses the recent estimate of 37-31 mya from Bell et al. , which was based on analysis of 567 taxa and three genes, as well as the mid-Eocene estimate of ~45 mya derived from the rbcL analysis of Anderson et al. , which employed numerous fossil constraints from the early-diverging eudicots. The congruence among these studies and with the fossil record suggests that a mid- to late Eocene divergence for the two extant Trochodendraceae lineages may be a reasonable estimate.
Analysis of Plastid SSR Loci in the Trochodendrales
Because microsatellite loci, including cpSSRs, often exhibit high variation within species, they are considered valuable molecular markers for population genetics –. A limited number of SSR loci were recently characterized for Tetracentron , but no cpSSR loci are available for Trochodendraceae. The 77 cpSSR loci that were identified in both Tetracentron and Trochodendron represent ~42% more loci than the 54 loci reported in the plastid genome of Megaleranthis (Ranunculaceae), the only other early-diverging eudicot for which a comprehensive analysis of cpSSR loci is available. The abundant and varied cpSSR loci identified in Trochodendrales will be useful in characterizing the population genetics of both extant species, which are of conservation interest in the wild because of their relatively narrow, presumably relictual distributions, and decreasing numbers . Tetracentron is officially afforded second-class protection in China.
Materials and Methods
Sample Preparation, Sequencing, and Assembly
Fresh leaves of Tetracentron sinense were collected from the Kunming Institute of Botany at the Chinese Academy of Sciences, and a voucher was deposited at the Herbarium of Wuhan Botanical Garden, Chinese Academy of Science (HIB). Chloroplast DNA was isolated following the protocol of Zhang et al. , and an Illumina library was constructed following the manufacturer’s protocol (Illumina). The DNA was indexed by tag and sequenced together with eight other species in one lane of an Illumina Genome Analyzer IIx at Beijing Genomics Institute (BGI) in Shenzhen, China. Illumina Pipeline 1.3.2 was used conducting image analysis and base calling. Raw sequence reads produced by Illumina paired-end sequencing were filtered for high quality reads which were subsequently assembled into contigs with a minimum length of 100 bp using SOAPdenovo  with the Kmer = 57. Contigs were aligned to the Trochodendron aralioides plastid genome using BLAST (http://blast.ncbi.nlm.nih.gov/), and aligned contigs were ordered according to the reference genome.
Genome Annotation and Analysis
The Tetracentron and Trochodendron plastid genomes were annotated with DOGMA  and BLAST tools from NCBI (the National Center for Biotechnology Information). Physical maps were generated using GenomeVx  with subsequent manual editing. Sequence divergence between the Tetracentron and Trochodendron plastid genomes was evaluated using DnaSP version 5.10 , and genome sequence identity plots were generated using mVISTA  (http://genome.lbl.gov/vista/mvista/submit.shtml). Msatfinder ver. 1.6.8  was used to identify SSR loci by manually setting repeat units.
Phylogenetic and Divergence Time Analyses
All protein-coding sequences, as well as all rRNA sequences, were extracted from the Tetracentron and Megaleranthis plastome  and added manually to the 83-gene, 86-taxon alignment of Moore et al. . ML analyses were performed on the concatenated 83-gene data set using the following partitioning strategy: (1) codon positions 1 and 2 together; (2) codon position 3; and (3) rRNA genes. The optimal nucleotide sequence model was selected for each partition using jModelTest 2.1.1 using the Decision Theory (DT) criterion . The following models were selected: TVM+I+Γ for codon positions 1+2 and for codon position 3, and TIM1+ I+Γ for rRNA.
Partitioned ML analyses were conducted using GARLI 2.0 . A total of ten search replicates were conducted to find the optimal tree, and nonparametric bootstrap support was assessed with 100 replicates . All ML searches used random taxon addition to build starting trees.
Divergence times were estimated using BEAST version 1.7.4 , using the same dating strategies employed in Moore et al. . In addition to the three calibration points (used in Moore et al. ) of minimum ages of 131.8 mya for angiosperms –, 125 mya for eudicots , , and 85 mya for the most recent common ancestor of Quercus and Cucumis , we additionally constrained the stem lineage of Malpighiales using a minimum of 89.3 my  and the node uniting Calycanthus and Liriodendron using 98 my , and set the age of Proteales to a minimum of 98 my .
We thank the anonymous reviewers for their helpful comments on earlier versions of this manuscript.
Conceived and designed the experiments: JQL HCW. Performed the experiments: YXS MJM APM. Analyzed the data: YXS MJM. Contributed reagents/materials/analysis tools: YXS MJM JQL HCW. Wrote the paper: YXS MJM PSS DES HCW.
- 1. Angiosperm Phylogeny Group (2009) An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG III. Bot J Linn Soc 161: 105–121. doi: 10.1111/j.1095-8339.2009.00996.x
- 2. Smith AC (1945) A taxonomic review of Trochodendron and Tetracentron. J Arnold Arbor Harvard University 26: 123–142.
- 3. Cronquist A (1981) An Integrated System of Classification of Flowering Plants. New York: Columbia University Press.
- 4. Endress PK (1986) Reproductive structures and phylogenetic significance of extant primitive angiosperms. Plant Syst Evol 152: 1–28. doi: 10.1007/bf00985348
- 5. Endress PK, Igersheim A (1999) Gynoecium diversity and systematics of the basal eudicots. Bot J Linn Soc 130: 305–393. doi: 10.1111/j.1095-8339.1999.tb00528.x
- 6. Magallόn S, Crane PR, Herendeen PS (1999) Phylogenetic pattern, diversity, and diversification of eudicots. Ann Missouri Bot Gard 86: 297–372. doi: 10.2307/2666180
- 7. Pigg KB, Wehr WC, Ickert-Bond SM (2001) Trochodendron and Nordenskioldia (Trochodendraceae) from the Middle Eocene of Washington State, U.S.A. Int J Plant Sci. 162: 1187–1198. doi: 10.1086/321927
- 8. Crane PR (1989) Paleobotanical evidence on the early radiation of nonmagnoliid dicotyledons. Plant Syst Evol 162: 165–191. doi: 10.1007/bf00936916
- 9. Crane PR, Manchester SR, Dilcher DL (1990) A preliminary survey of fossil leaves and well-preserved reproductive structures from the Sentinel Butte Formation (Paleocene) near Almont, North Dakota. Fieldiana Geol NS 20: l–63.
- 10. Crane PR, Manchester SR, Dilcher DL (1991) Reproductive and vegetative structure of Nordenskioldia (Trochodendraceae), a vesselless dicotyledon from the early Tertiary of the Northern Hemisphere. Am J Bot 8: 1311–1334. doi: 10.2307/2445271
- 11. Manchester SR, Crane PR, Dilcher DL (1991) Nordenskioldia and Trochodendron fruits (Trochodendraceae) from the Miocene of northwestern North America. Bot Gaz 152: 357–368. doi: 10.1086/337898
- 12. Fields PF (1996a) The Succor Creek flora of the middle Miocene Sucker Creek Formation, southwestern Idaho and eastern Oregon: systematics and paleoecology. PhD diss. Michigan State University, East Lansing.
- 13. Fields PF (1996b) A Trochodendron infructescence from the 15 Ma Succor Creek flora in Oregon: a geographic and possibly temporal range extension. Am J Bot 83suppl: 110.
- 14. Manchester SR (1999) Biogeographical relationships of North American Tertiary floras. Ann Missouri Bot Gard 86: 472–522. doi: 10.2307/2666183
- 15. Grímsson F, Denk T, Zetter R (2008) Pollen, fruits, and leaves of Tetracentron (Trochodendraceae) from the Cainozoic of Iceland and western North America and their palaeobiogeographic implications. Grana 47: 1–14. doi: 10.1080/00173130701873081
- 16. Watson L, Dallwitz MJ (2006) The families of flowering plants: descriptions, illustrations, identification, information retrieval. Version 3.
- 17. Mabberley DJ (1987) The plant-book. Cambridge: Cambridge University Press.
- 18. Doweld AB (1998) Carpology, seed anatomy and taxonomic relationships of Tetracentron (Tetracentraceae) and Trochodendron (Trochodendraceae). Ann Bot 82: 413–443.
- 19. Li HF, Chaw SM, Du CM, Ren Y (2011) Vessel elements present in the secondary xylem of Trochodendron and Tetracentron (Trochodendraceae). Flora 206: 595–600. doi: 10.1016/j.flora.2010.11.018
- 20. Cantino PD, Doyle JA, Graham SW, Judd WS, Olmstead RG, et al. (2007) Towards a phylogenetic nomenclature of Tracheophyta. Taxon 56: 822–846. doi: 10.2307/25065865
- 21. Soltis DE, Soltis PS, Endress PK, Chase MW (2005) Phylogeny and Evolution of the Angiosperms. Sunderland, MA: Sinauer.
- 22. Qiu Y, Dombrovska O, Lee J, Li L, Whitlock BA, et al. (2005) Phylogenetic analysis of basal angiosperms based on nine plastid, mitochrondrial, and nuclear genes. Int J Plant Sci 166: 815–842. doi: 10.1086/431800
- 23. Qiu YL, Li L, Hendry TA, Li R, Taylor DW, et al. (2006) Reconstructing the basal angiosperm phylogeny: evaluating information content of mitochondrial genes. Taxon 55: 837–856. doi: 10.2307/25065680
- 24. Worberg A, Quandt D, Barniske A-M, Löhne C, Hilu KW, et al. (2007) Phylogeny of basal eudicots: insights from non-coding and rapidly evolving DNA. Org Divers Evol 7: 55–77. doi: 10.1016/j.ode.2006.08.001
- 25. Soltis DE, Moore MJ, Burleigh JG, Bell CD, Soltis PS (2010) Assembling the Angiosperm Tree of Life: progress and future prospects. Ann Missouri Bot Gard 97: 514–526. doi: 10.3417/2009136
- 26. Moore MJ, Soltis PS, Bell CD, Burleigh JG, Soltis DE (2010) Phylogenetic analysis of 83 plastid genes further resolves the early diversification of eudicots. Proc Natl Acad Sci USA 107: 4623–4628. doi: 10.1073/pnas.0907801107
- 27. Moore MJ, Hassan N, Gitzendanner MA, Bruenn RA, Croley M, et al. (2011) Phylogenetic analysis of the plastid inverted repeat for 244 species: insights into deeper-level angiosperm relationships from a long, slowly evolving sequence region. Int J Plant Sci 172: 541–558. doi: 10.1086/658923
- 28. Soltis DE, Smith S, Cellinese N, Refulio-Rodriquez NF, Olmstead R, et al. (2011) Inferring angiosperm phylogeny: a 17-gene analysis. Am J Bot 98: 704–730. doi: 10.3732/ajb.1000404
- 29. Qiu YL, Li LB, Wang B, Chen Z, Knoop V, et al. (2006) The deepest divergences in land plants inferred from phylogenomic evidence. Proc Natl Acad Sci USA 103: 15511–15516. doi: 10.1073/pnas.0603335103
- 30. Barniske AM, Borsch T, Müller K, Krug M, Worberg A, et al. (2012) Phylogenetics of early branching eudicots: Comparing phylogenetic signal across plastid introns, spacers, and genes. J Syst Evol 50: 85–108. doi: 10.1111/j.1759-6831.2012.00181.x
- 31. Hoot SB, Magallón S, Crane PR (1999) Phylogeny of basal eudicots based on three molecular data sets: atpB, rbcL and 18S nuclear ribosomal DNA sequences. Ann Mo Bot Gard 86: 1–32. doi: 10.2307/2666215
- 32. Soltis DE, Soltis PS, Chase MW, Mort M, Albach D, et al. (2000) Angiosperm phylogeny inferred from a combined data set of 18S rDNA, rbcL AND atpB sequences. Bot J Linn Soc 133: 381–461. doi: 10.1111/j.1095-8339.2000.tb01588.x
- 33. Moore MJ, Bell CD, Soltis PS, Soltis DE (2007) Using plastid genome-scale data to resolve enigmatic relationships among basal angiosperms. Proc Natl Acad Sci USA 104: 19363–19368. doi: 10.1073/pnas.0708072104
- 34. Jansen RK, Cai Z, Raubeson LA, Daniell H, DePamphilis CW, et al. (2007) Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns. Proc Natl Acad Sci USA 104: 19369–19374. doi: 10.1073/pnas.0709121104
- 35. Wolfe KH, Li WH, Sharp PM (1987) Rates of nucleotide substitution vary greatly among plant mitochondrial, chloroplast, and nuclear DNAs. Proc Natl Acad Sci USA 84: 9054–9058. doi: 10.1073/pnas.84.24.9054
- 36. Downie SR, Palmer JD (1992) Use of chloroplast DNA rearrangements in reconstructing plant phylogeny. In: Soltis PS, Soltis DE, Doyle JJ, eds. Molecular Systematics of Plants. New York: Chapman and Hall. 14–35.
- 37. Raubeson LA, Jansen RK (2005) Chloroplast genomes of plants. In: Henry R, ed. Diversity and Evolution of Plants-genotypic Variation in Higher Plants. Oxfordshire: CABI Publishing. 45–68.
- 38. Moore MJ, Dhingra A, Soltis PS, Shaw R, Farmerie WG, et al. (2006) Rapid and accurate pyrosequencing of angiosperm plastid genomes. BMC Plant Biol 6: 17. doi: 10.1186/1471-2229-6-17
- 39. Parks M, Cronn R, Liston A (2009) Increasing phylogenetic resolution at low taxonomic levels using massively parallel sequencing of chloroplast genomes. BMC Biol 7: 84. doi: 10.1186/1741-7007-7-84
- 40. Diekmann K, Hodkinson TR, Wolfe KH, van den Bekerom R, Dix PJ, et al. (2009) Complete chloroplast genome sequence of a major allogamous forage species, perennial ryegrass (Lolium perenne L.). DNA Res 16: 165–76. doi: 10.1093/dnares/dsp008
- 41. Kumar S, Hahn FM, McMahan CM, Cornish K, Whalen MC (2009) Comparative analysis of the complete sequence of the plastid genome of Parthenium argentatum and identification of DNA barcodes to differentiate Parthenium species and lines. BMC Plant Biol 9: 131. doi: 10.1186/1471-2229-9-131
- 42. Wu F-H, Chan M-T, Liao D-C, Hsu C-T, Lee Y-W, et al. (2010) Complete chloroplast genome of Oncidium Gower Ramsey and evaluation of molecular markers for identification and breeding in Oncidiinae, BMC Plant Biol. 10: 68. doi: 10.1186/1471-2229-10-68
- 43. Whitall JB, Syring J, Parks M, Buenrostro J, Dick C, et al. (2010) Finding a (pine) needle in a haystack: chloroplast genome sequence divergence in rare and widespread pines. Mol Ecol 19: 100–114. doi: 10.1111/j.1365-294x.2009.04474.x
- 44. Shendure J, Ji H (2008) Next-generation DNA sequencing. Nat Biotechnol 26: 1135–1145. doi: 10.1038/nbt1486
- 45. Stull GW, Moore MJ, Mandala VS, Douglas N, Kates H-R, et al.. (2013) A targeted enrichment strategy for massively parallel sequencing of angiosperm plastid genomes. App Plant Sci: in press.
- 46. Zhang YJ, Ma PF, Li DZ (2011) High-Throughput Sequencing of Six Bamboo Chloroplast Genomes: Phylogenetic Implications for Temperate Woody Bamboos (Poaceae: Bambusoideae). PLoS ONE 6: e20596. doi: 10.1371/journal.pone.0020596
- 47. Steele PR, Hertweck KL, Mayfield D, McKain MR, Leebens-Mack J, et al. (2012) Quality and quantity of data recovered from massively parallel sequencing: Examples in Asparagales and Poaceae. Am J Bot 99: 330–348. doi: 10.3732/ajb.1100491
- 48. Straub SCK, Parks M, Weitemier K, Fishbein M, Cronn RC, et al. (2012) Navigating the tip of the genomic iceberg: Next-generation sequencing for plant systematics. Am J Bot 99: 349–364. doi: 10.3732/ajb.1100335
- 49. McCauley DE, Stevens JE, Peroni PA, Raveill JA (1996) The spatial distribution of chloroplast DNA and allozyme polymorphisms within a population of Silene alba (Caryophyllaceae). Am J Bot 83: 727–31. doi: 10.2307/2445849
- 50. Small RL, Cronn RC, Wendel JF (2004) Use of nuclear genes for phylogeny reconstruction in plants. Aust Syst Bot 17: 145–70. doi: 10.1071/sb03015
- 51. Provan J, Powell W, Hollingsworth PM (2001) Chloroplast microsatellites: new tools for studies in plant ecology and evolution. Trends Ecol Evol 16: 142–147. doi: 10.1016/s0169-5347(00)02097-8
- 52. Kim YK, Park CW, Kim KJ (2009) Complete chloroplast DNA sequence from a Korean endemic genus, Megaleranthis saniculifolia, and its evolutionary implications. Mol Cells 27: 365–381. doi: 10.1007/s10059-009-0047-6
- 53. Shinozaki K, Ohem M, Tanaka M, Wakasugi T, Hayashida N, et al. (1986) The complete nucleotide sequence of tobacco chloroplast genome: its gene organization and expression. EMBO Journal 5: 2043–2049.
- 54. Palmer JD (1991) Plastid chromosomes: structure and evolution. In: Vasil IK, Bogorad L, eds. Cell Culture and Somatic Cell Genetics in Plants, Vol. 7A, The Molecular Biology of Plastids. San Diego, USA: Academic Press. 5–53.
- 55. Chumley TW, Palmer JD, Mower JP, Fourcade HM, Calie PJ, et al. (2006) The complete chloroplast genome sequence of Pelargonium x hortorum: organization and evolution of the largest and most highly rearranged chloroplast genome of land plants. Mol Biol Evol 23: 2175–2190. doi: 10.1093/molbev/msl089
- 56. Cosner ME, Jansen RK, Palmer JD, Downie SR (1997) The highly rearranged chloroplast genome of Trachelium caeruleum (Campanulaceae): multiple inversions, inverted repeat expansion and contraction, transposition, insertions/deletions, and several repeat families. Curr Genet 31: 419–429. doi: 10.1007/s002940050225
- 57. Knox EB, Palmer JD (1999) The chloroplast genome arrangement of Lobelia thuliniana (Lobeliaceae): Expansion of the inverted repeat in an ancestor of the Campanulales. Plant Syst Evol 214: 49–64. doi: 10.1007/bf00985731
- 58. Plunkett GM, Downie SR (2000) Expansion and contraction of the chloroplast inverted repeat in Apiaceae subfamily Apioideae. Syst Bot 25: 648–667. doi: 10.2307/2666726
- 59. Mardanov AV, Ravin NV, Kuznetsov BB, Samigullin TH, Antonov AS, et al. (2008) Complete sequence of the duckweed (Lemna minor) chloroplast genome: structural organization and phylogenetic relationships to other angiosperms. J Mol Evol 66: 555–564. doi: 10.1007/s00239-008-9091-7
- 60. Ozaki K (1987) Tetracentron leaves from the Neogene of Japan. Transactions and Proceedings of the Palaeontological Society, Japan, NS 146: 77–87.
- 61. Suzuki M, Joshi L, Noshira S (1991) Tetracentron wood from the Miocene of Noto Peninsula, Central Japan, with a short revision of homoxylic fossil woods. Bot Mag Tokyo 104: 34–48. doi: 10.1007/bf02493402
- 62. Manchester SR, Chen I (2006) Tetracentron fruits from the Miocene of western North America. Int J Plant Sci 167: 601–605. doi: 10.1086/503206
- 63. Pigg KB, Dillhoff RM, DeVore ML, Wehr WC (2007) New diversity among the Trochodendraceae from the Early/Middle Eocene Okanogan Highlands of British Columbia, Canada, and northeastern Washington State, United States. Int J Plant Sci 168: 521–532. doi: 10.1086/512104
- 64. Chelebaeva AI, Chigayeva GB (1988) The genus Trochodendron (Trochodendraceae) in Miocene of Kamchatka. Bot Zh 73: 315–318.
- 65. Bell CD, Soltis DE, Soltis PS (2010) The age and diversification of the angiosperm re-revisited. Am J Bot 97: 1296–1303. doi: 10.3732/ajb.0900346
- 66. Anderson CL, Bremer K, Friis EM (2005) Dating phylogenetically basal eudicots using rbcL sequences and multiple fossil reference points. Am J Bot 92: 1737–1748. doi: 10.3732/ajb.92.10.1737
- 67. Powell W, Morgante M, Mcdevitt R, Vendramin GG, Rafaslki JA (1995) Polymorphic simple sequence repeat regions in chloroplast genomes: applications to the population genetics of pines. Proc Natl Acad Sci USA 92: 7759–7763. doi: 10.1073/pnas.92.17.7759
- 68. Grassi F, Labra M, Scienza A, Imazio S (2002) Chloroplast SSR markers to assess DNA diversity in wild and cultivated grapevines. Vitis 41: 157–158.
- 69. Ebert D, Peakall R (2009) Chloroplast simple sequence repeats (cpSSRs): technical resources and recommendations for expanding cpSSR discovery and applications to a wide array of plant species. Mol Ecol Res 9: 673–690. doi: 10.1111/j.1755-0998.2008.02319.x
- 70. Yang Z, Lu R, Tao C, Chen S, Ji Y (2012) Microsatellites for Tetracentron sinense (Trochodendraceae), a Tertiary relict endemic to East Asia. Am J Bot 99: e320–e322. doi: 10.3732/ajb.1200012
- 71. Fu LG (1992) China Plant Red Data Book (Vol.1). Beijing: Science Press.
- 72. Li R, Zhu H, Ruan J, Qian W, Fang X, et al. (2010) De novo assembly of human genomes with massively parallel short read sequencing. Genome Res 20: 265–272. doi: 10.1101/gr.097261.109
- 73. Wyman SK, Jansen RK, Boore JL (2004) Automatic annotation of organellar genomes with DOGMA. Bioinformatics 20: 3252–3255. doi: 10.1093/bioinformatics/bth352
- 74. Conant GC, Wolfe KH (2008) GenomeVx: simple web-based creation of editable circular chromosome maps. Bioinformatics 24: 861–862. doi: 10.1093/bioinformatics/btm598
- 75. Rozas J, Sánchez-Delbarrio JC, Messeguer X, Rozas R (2003) DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics 19: 2496–2497. doi: 10.1093/bioinformatics/btg359
- 76. Frazer KA, Pachter L, Poliakov A, Rubin EM, Dubchak I (2004) VISTA: computational tools for comparative genomics. Nucleic Acids Res 32: W273–W279. doi: 10.1093/nar/gkh458
- 77. Thurston MI, Field D (2005) Msatfinder: detection and characterisation of microsatellites, version 1.6.8.
- 78. Darriba D, Taboada GL, Doallo R, Posada D (2012) jModelTest 2: more models, new heuristics and parallel computing. Nat Methods 9: 772. doi: 10.1038/nmeth.2109
- 79. Zwickl DJ (2006) Genetic algorithm approaches for the phylogenetic analysis of large biological sequence datasets under the maximum likelihood criterion. Ph.D. dissertation, The University of Texas at Austin.
- 80. Felsenstein J (1985) Confidence limits on phylogeny: An approach using the bootstrap. Evolution 39: 783–791. doi: 10.2307/2408678
- 81. Drummond AJ, Suchard MA, Xie D, Rambaut A (2012) Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol Biol Evol 29: 1969–1973. doi: 10.1093/molbev/mss075
- 82. Doyle JA (1992) Revised palynological correlations of the lower Potomac Group (USA) and the Cocobeach sequence of Gabon (Barremian-Aptian). Cretac Res 13: 337–349. doi: 10.1016/0195-6671(92)90039-s
- 83. Hughes NF (1994) The Enigma of Angiosperm Origins. Cambridge, UK: Cambridge Univ Press.
- 84. Brenner GJ (1996) Flowering Plant Origin, Evolution and Phylogeny. New York: Chapman and Hall. 91–115.
- 85. Friis EM, Pedersen KR, Crane PR (1999) Early angiosperm diversification: The diversity of pollen associated with angiosperm reproductive structures in Early Cretaceous floras from Portugal. Ann Mo Bot Gard 86: 259–296. doi: 10.2307/2666179
- 86. Doyle JA, Hotton CL (1991) Pollen and Spores: Patterns of Diversification. Oxford: Clarendon. 169–195.
- 87. Magallόn S, Castillo A (2009) Angiosperm diversification through time. Am J Bot 96: 349–365. doi: 10.3732/ajb.0800060
- 88. Friis EM, Eklund H, Pedersen KR, Crane PR (1994) Virginianthus calycanthoides gen. et sp. nov. – A calycanthaceous flower from the Potomac Group (Early Cretaceous) of eastern North America. Int J Plant Sci 155: 772–785. doi: 10.1086/297217
- 89. Crane PR, Pedersen KR, Friis EM, Drinnan AN (1993) Early Cretaceous (early to middle Albian) platanoid infl orescences associated with Sapindopsis leaves from the Potomac Group of North America. Syst Bot 18: 328–344. doi: 10.2307/2419407