Gentiana crassicaulis and G. straminea are alpine plants of Gentiana with important medicinal value and complex genetic backgrounds. In this study, the mitochondrial genomes (mtDNAs) of these two species were sequenced. The mtDNAs of G. crassicaulis and G. straminea are 368,808 and 410,086 bp long, respectively, 52 and 49 unique genes are annotated in the two species, and the gene arrangement varies widely. Compared to G. crassicaulis, G. straminea loses three effective genes, namely atp6, trnG-GCC and trnV-GAC. As a pseudogene, the atp6 gene of G. straminea is incomplete, which is rare in higher plants. We detected 1696 and 1858 pairs of long repeats and 213 SSRs and 250 SSs in the mtDNAs of G. crassicaulis and G. straminea, respectively. There are 392 SNPs and 18 InDels between the two genomes, and syntenic sequence and structural variation analysis show low collinearity between the two genomes. Chloroplast DNA transferring to mtDNA is observed in both species, and 46,511 and 55,043 bp transferred segments containing three tRNA genes are identified, respectively. Comparative analysis of mtDNAs of G. crassicaulis, G. straminea and four species of Gentianales determined 18 core genes, and there is no specific gene in G. crassicaulis and G. straminea. The phylogenetic tree based on mtDNAs places Gentianaceae in a branch of Gentianales. This study is the first to analyze the mtDNAs of Gentianaceae, which could provide information for analysis of the structure of mtDNAs of higher plants and phylogenetic research of Gentianaceae and Gentianales.
Citation: Ala KG, Zhao Z, Ni L, Wang Z (2023) Comparative analysis of mitochondrial genomes of two alpine medicinal plants of Gentiana (Gentianaceae). PLoS ONE 18(1): e0281134. https://doi.org/10.1371/journal.pone.0281134
Editor: Xiang Jia Min, Youngstown State University, UNITED STATES
Received: July 5, 2022; Accepted: January 13, 2023; Published: January 26, 2023
Copyright: © 2023 Ala et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The sequencing raw data generated in this study is available at GenkBank, BioProject: PRJNA831324. The mitochondrial genomes of Gentiana crassicaulis and G. straminea described here are accessible via GenBank IDs OM320814 and OM328072.
Funding: This work was supported by grants from the National Natural Science Foundation of China (No. 82073959 and No. 81173654). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
The mitochondrion is an important organelle in eukaryotic cells and is intimately associated with energy generation and synthesis of fatty acids and some active proteins . This genetically semi-autonomous organelle can encode genes related to its function . Mitochondrial genomes (mtDNAs) of plants originated from independent living prokaryotes, and phylogenetic analysis found that they were closely related to Proteobacteria . The mtDNA of Arabidopsis thaliana was first sequenced in 1997 . With the development of sequencing technology, more and more plants have the mtDNA sequenced. The mtDNAs of plants mainly have the following characteristics : (1) The genomes are large and genome size varies greatly among species. For example, the mtDNA of Viscum scurruloideum is only 66 kb in size , whereas Larix sibirica has the largest mtDNA size, with 11.7 Mb . (2) Most mitochondria of plants encode roughly the same genes, but the order of genes varies greatly in higher plants [8, 9]. (3) Exchange of genes occurs between mtDNA and other genomes. The plant chloroplast genomes (cpDNAs) and mtDNAs are known to have extensive and widespread homologies due to sequence transfer [10, 11], and the transfer of cpDNA to mtDNA has been going on for at least 300 million years . In addition, many mitochondrial genes have been found in the nuclear genomes of flowering plants . (4) Genome fragments have been repeated and lost many times. (5) Recombination between repeated sequences complicates the structure of the genome. (6) The genome changes rapidly in structure and slowly in DNA sequences. Although the mtDNA evolves more slowly than nuclear genomes and cpDNAs and is generally used in studies of high classification levels, such as genera, families or even higher , it has abundant length polymorphism and unique advantages, which have been increasingly applied in molecular identification and systematic research of plants. With further research, the significance of the mtDNAs of plants is expected to be further developed.
Gentiana L. is the largest genus of Gentianaceae, widely distributed in temperate alpine regions of Europe, Asia and North America. The Qinghai–Tibet Plateau in southwest China is the distribution center of this genus and the region with the most abundant species diversity and endemic species of this genus . Gentiana crassicaulis and G. straminea belong to Gentiana sect. Cruciata, have complex genetic backgrounds and are endemic to the Sino–Himalayan subregion . The roots and flowers of G. crassicaulis and G. straminea are the sources of traditional Chinese medicine and Tibetan medicine used to treat rheumatism pain, stroke hemiplegia, muscle cramps, joint pain and damp-heat jaundice [17–19]. The cpDNA of Gentiana sect. Cruciata was studied [20–22], and the rps16 pseudogene in the nonparasitic plants of the Asterids clade was identified for the first time. Based on sequences of chloroplast and mitochondrial fragments, a series of studies on molecular identification and systematic analysis were carried out [23, 24], and mtDNA showed significance in the identification of species in Gentiana sect. Cruciata . At present, there is no report on the mtDNA of Gentianaceae.
In this study, we sequenced the mtDNAs of two Gentianaceae species, namely G. crassicaulis and G. straminea, for the first time, in order to provide data and information for the study of genome structure and species evolution of higher plants, and systematic analysis and molecular identification of Gentianaceae and Gentianales.
Materials and methods
Plant materials and mtDNA extraction
Gentiana crassicaulis and G. straminea were collected from Tibet, China, with GPS locations of E 97°14.003’, N 31°09.172’ and E 97°14.021’, N 31°09.174’, respectively. Also, the location of the specimens is not within any protected area. The plant samples were collected and transported to the laboratory in fresh condition. The roots of plants were frozen in liquid nitrogen and stored at –80 °C. The mtDNA was isolated from roots with a modified CTAB method .
Genome sequencing and assembly
Genomic DNA was purified and fragmented to 300~500 bp to construct short-insert libraries, then sequenced on Illumina NovaSeq 6000 platform. Purified DNA was processed into 15-20k fragments and sequenced on PacBio Sequel II platform (LingEn Biotechnology Co., Ltd., Shanghai, China). Quality control of raw sequencing data was performed. To process the Illumina data, we performed the following steps: 1) removed adapter sequences in reads; 2) removed the bases containing non-AGCT at the 5’ end; 3) clipped the ends of reads with low quality (quality value <Q20); 4) removed reads containing up to 10% of N; 5) discarded small fragments with length <75 bp after adapter removal. PacBio data were processed as follows: 1) polymerase reads <200 bp were filtered out; 2) polymerase reads with quality <0.80 were filtered out; 3) subreads from the polymerase reads were extracted, and adapter sequences were filtered out; 4) subreads <200 bp were filtered out. Reads for each species-specific library have been deposited in the BioProject PRJNA831324.
Genome assembly involved the following steps: 1) Illumina sequencing data were assembled using GetOrganelle v1.7.1. BWA v0.7.17 was used to compare sequences of Illumina assembly to PacBio data. Then, the extracted PacBio data and Illumina data were mixed and assembled using SPAdes v3.14.1 . The sequences with sufficiently high coverage depth and long assembly length were selected as candidate sequences, and the mitochondrial scaffold sequences were confirmed by comparison with the NT library, with sequences connected according to overlaps. 2) Clean reads were compared with the mtDNA sequence, and Pilon v1.23 was used to correct the bases.
A homologous comparison was carried out to predict the genes of genomes. The protein sequences of the NCBI mitochondrial reference genome were compared to genomes of the samples using BLAST+ 2.7.1 (e-value ≤ 1E-5). Poor alignment results were filtered out, and redundancy was removed. Then, the integrity of genes and the boundaries between exons and introns were manually corrected to obtain a high accuracy conservative gene set. The databases used to annotate coding genes included NR, Swiss-Prot, eggNOG, KEGG and GO. The tRNA and rRNA genes were predicted by tRNAscan-SE  and RNAmmer 1.2 , respectively. OrganellarGenomeDRAW (OGDRAW) was used to map the genomes .
Repeat sequence analysis
There are a large number of repeats in mtDNAs of plants, with repeats of more than 30 bases considered as long repeats. REPuter  was used to analyse long repeats, and the parameters were set as follows: minimal repeat size, 30 bp; hamming distance, 3; maximum computed repeats, 5000. Four types of long repeats were identified as follows: F (Forward), R (Reverse), C (Complement) and P (Palindromic).
Simple sequence repeats (SSRs) exist widely in the mtDNAs of plants and are generally repeats composed of repeat units of 1–6 bp. MISA  was used to identify SSRs, in which the minimum repeats of mono-, di-, tri-, tetra-, penta- and hexanucleotides were set as 8, 5, 4, 3, 3 and 3, respectively. The minimum distance between two SSRs was set to 100 bp. Primer3  was used to design primers for SSRs.
Comparative analysis of mtDNAs of G. crassicaulis and G. straminea
MUMmer  was used to conduct a global comparison of the sequences of G. crassicaulis and G. straminea, find out the sites with differences between the sequences and conduct preliminary filtering to detect potential single nucleotide polymorphisms (SNPs). The 100 bp sequences on both sides of each SNP were extracted and compared with the assembly results using BLAT to verify the SNPs. SNPs <101 bp and those located in repeats were removed to obtain reliable SNPs.
LASTZ was used to compare the sequences of G. crassicaulis and G. straminea. The comparison data were processed by axt-correction, axtSort and axtBest programs to select the best results and obtain the initial Insertion/Deletions (InDels). Sequences of 150 bp on both sides of each InDel were compared with sequencing reads from another sample for verification, and reliable InDels were obtained by filtration.
MUMmer was used for comparing the mtDNAs of G. crassicaulis and G. straminea to confirm the large range of collinearity. LASTZ was then used to compare the regions to confirm the alignment and find areas of translocation (Trans), inversion (Inv) and translocation+inversion (Trans+Inv).
Transfer of cpDNA to mtDNA
The cpDNA sequences of G. crassicaulis and G. straminea in our previous study  were downloaded from GenBank with IDs KY595458 and KJ657732, respectively. BLASTn (NCBI-BLAST-2.2.30+) was used to determine the homologous regions of mtDNA and cpDNA, with parameters set as consistency >90% and length >200 bp.
Comparison of mtDNAs of Gentianales
Collinear and structural variation analyses were performed on the mtDNAs of G. crassicaulis, G. straminea and four species in Gentianales (Asclepias syriaca, Cynanchum auriculatum, Rhazya stricta and Scyphiphora hydrophyllacea) using MUMmer. Homologous genes in all species were identified as core genes (common genes). After removing core genes, dispensable genes (non-common genes) were obtained. Specific genes were genes specific only to a particular species. All dispensable genes were combined with core genes to form pan genes.
Phylogenetic relationships were analyzed based on the concatenated dataset of 29 protein-coding genes (nad1, nad2, nad3, nad4, nad4L, nad5, nad6, nad7, nad9, cob, cox1, cox2, cox3, atp1, atp4, atp8, atp9, ccmB, ccmC, ccmFC, ccmFn, matR, rpl5, rpl16, rps1, rps3, rps4, rps12 and rps13) of 33 species, including G. crassicaulis, G. straminea, 30 species of other Asterids plants and 1 species of the outgroup (Cycas taitungensis). PhyML 3.0 was used to construct the phylogenetic tree by the maximum-likelihood method and Bayesian correction, with bootstrap calculations repeated 1000 times . The GenBank IDs of mtDNAs used to construct the phylogenetic tree are shown in S1 Table.
Genome sequencing and assembly
Illumina NovaSeq 6000 platform and PacBio Sequel II platform were used to sequence the mtDNAs of G. crassicaulis and G. straminea. For the Illumina sequencing, 4603.1 Mb and 4818.6 Mb raw data, 4327.2 Mb and 4578.3 Mb clean data were obtained from G. crassicaulis and G. straminea, respectively. The Q20 values for G. crassicaulis and G. straminea were 97.54% and 97.48, and the Q30 values for G. crassicaulis and G. straminea were 92.82% and 92.74%, respectively. The Illumina sequencing was paired-end and the insert size was 450 bp in both species sequencing. In regard to the PacBio sequencing, subreads with N50 = 9697 bp and subreads with N50 = 10079 bp were generated from G. crassicaulis and G. straminea, respectively. The information of raw data and quality control is shown in S2 Table. The mtDNAs of G. crassicaulis and G. straminea are 368,808 and 410,086 bp, respectively, which are the first two mtDNAs of Gentianaceae. Compared with other mtDNAs of Gentianales, they are smaller than those of Asclepias syriaca (682 kb) and R. stricta (548 kb) in Apocynaceae but similar to those of Cynanchum auriculatum (426 kb) in Asclepiadaceae and Scyphiphora hydrophyllacea (354 kb) in Rubiaceae. The GC contents of the two genomes are 44.83% and 44.55%, respectively, which are close to most higher plants [36, 37]. The mtDNA sequences of G. crassicaulis and G. straminea have been uploaded to GenBank with IDs OM320814 and OM328072, respectively.
Fifty-two unique genes were annotated in the mtDNA of G. crassicaulis. There are 32 protein-coding genes, 17 tRNA genes and 3 rRNA genes among these unique genes. In addition, there are two copies of five protein-coding genes, three copies of four protein-coding genes, and two copies of seven tRNA genes. Forty-nine unique genes were annotated in the mtDNA of G. straminea. There are 31 protein-coding genes, 15 tRNA genes and 3 rRNA genes among these unique genes. In addition, there are two copies of seven protein-coding genes, two copies of seven tRNA genes, and three copies of four tRNA genes (Table 1). The mtDNAs of G. crassicaulis and G. straminea are assembled into two single-ring molecules, and the order of the genes varies greatly (Fig 1). Gentiana crassicaulis contains three more unique genes than G. straminea, which are atp6 (two copies), trnG-GCC (two copies) and trnV-GAC (two copies). The atp6 gene of G. straminea, as a pseudogene, is incomplete and exists in the form of a fragment. There are three copies of atp6 with the length of 71, 277 and 277 bp, respectively (Fig 2). Primers were designed for PCR amplification of atp6 regions in G. straminea, which further verified the accuracy of the sequences (S1 Data). An incomplete pseudogene of atp6, which can be regarded as missing, is rare in mtDNAs of angiosperms.
Genes (exons are shown as closed boxes) shown outside the curve are transcribed counterclockwise, whereas those inside are transcribed clockwise. Genes from the same protein complex are colored the same.
This figure is not to scale.
There are nine genes with introns in the mtDNAs of the two species, respectively, namely nad1, nad2, nad4, nad5, nad7, cox1, cox2, ccmFc and rps3. The number of introns is the same, and most introns are the same or similar in length (S3 Table). It is worth noting that nad2 and nad5 of G. crassicaulis contain two trans-splicing introns each, and nad1, nad2 and nad5 of G. straminea contain one, three and two trans-splicing introns, respectively. The structures of these NADH dehydrogenase genes containing trans-splicing introns vary greatly in different plants and first appeared in primitive vascular plants, which may be molecular markers of plant evolution [38–40].
Repeat sequence analysis
We detected 1696 pairs of long repeats in the mtDNA of G. crassicaulis, including 822 pairs of forward repeats, 2 pairs of reverse repeats, 1 pair of complement repeats, and 871 pairs of palindromic repeats. Lengths of repeat units range from 30 to 9683 bp. In addition, we detected 1858 pairs of long repeats in the mtDNA of G. straminea, including 985 pairs of forward repeats, 40 pairs of reverse repeats, 17 pairs of complement repeats, and 816 pairs of palindromic repeats. Lengths of repeat units range from 30 to 30,663 bp. In the two genomes, the number of repeats with a length of 30 to 34 bp is the largest, as 998 (58.5%) and 936 (50.4%), respectively (Fig 3, S4 Table).
As an important component of mtDNA, repeated sequences contribute much of the genome structure complexities through mediation of homologous recombinations . Generally, large repeats (>1000 bp) with high sequence similarity tend to recombine more frequently, medium repeats (100–1000 bp) recombine occasionally, and small repeats (<100 bp) rarely . The active repeat-mediated intragenomic recombinations in G. crassicaulis and G. straminea were detected with long reads database following Dong et al. . The repeated sequences in G. crassicaulis and G. straminea mtDNA contain small proportions (6.9% and 9.7%, respectively) of large and medium repeats, and no evidence of recombination activity was detected other than the 13 and 16 repeats shown in S4 Table. The length of the repeats and recombination rate are correlated, with no recombination evidence detected for repeat sequences of shorter than 100 bp. All the repeats with recombination activity showed low recombination frequencies less than 12%. The low recombination level provides evidence for the predominant master conformation in both species. Such low recombination frequency has also been seen in other plant mtDNAs, for example, in Silene conica and Silene noctiflora , tens of large repeats induced recombinations at a frequency around 5%; in Nymphaea colorata , ten recombinationally active repeat pairs were detected from 886,982 repeat pairs examined; and in Vigna radiata , no recombination activity was detected.
SSR is a common molecular marker to evaluate the genetic diversity of species. A total of 213 SSRs were detected in the mtDNA of G. crassicaulis, including 127 mononucleotide SSRs (59.6%), 12 dinucleotide SSRs (5.6%), 14 trinucleotide SSRs (6.6%), 45 tetranucleotide SSRs (21.1%), 12 pentanucleotide SSRs (5.6%) and 3 hexanucleotide SSRs (1.4%). A total of 250 SSRs were detected in the mtDNA of G. straminea, including 147 mononucleotide SSRs (58.8%), 20 dinucleotide SSRs (8.0%), 22 trinucleotide SSRs (8.8%), 41 tetranucleotide SSRs (16.4%), 17 pentanucleotide SSRs (6.8%) and 3 hexanucleotide (1.2%) (Fig 4). The number of A/T mononucleotide SSRs is the largest, with 118 and 134, respectively. Among all SSRs, only one SSR is longer than 20 bp (Table 2). The total SSRs of G. crassicaulis and G. straminea are 3361 and 3700 bp long, respectively, and primers have been designed for SSRs (S5 Table).
Comparative analysis of mtDNAs of G. crassicaulis and G. straminea
Comprehensive comparisons were performed in terms of synteny and homologous sequences of the two Gentiana species. SNP refers to the polymorphism of DNA sequences caused by a single nucleotide variation. InDel refers to the insertion and deletion of small fragments of the genome. A comparison of the mtDNAs of G. crassicaulis and G. straminea reveals 392 SNPs, including 354 SNPs in intergenic regions and 38 SNPs in coding regions, including 15 synonymous mutations, 22 non-synonymous mutations and 1 premature stop codon mutation. There are no synonymous or non-synonymous mutations in the start and stop codons of the SNPs in the coding region (S6 Table). In addition, there are 18 InDels, all of which are in intergenic regions and do not cause gene mutations (S7 Table).
By comparing homologous sequences between genomes, insertion and deletion could be observed, and the structural variation of genomes in the process of evolution. The coverages of collinear comparison of mtDNAs of G. crassicaulis and G. straminea are 60.89% and 54.81%, respectively (S8 Table). The number of comparison regions is 209, and the numbers of collinearity, Trans, Inv and Trans+Inv are 14, 87, 8 and 100, respectively (Fig 5, S9 Table). Structural variation study further clarifies the differences between the two genomes, and there are lots of Trans, Inv, Trans+Inv and Indels (Fig 6, S10 Table). The results show significant structural differences between the mtDNAs of G. crassicaulis and G. straminea, resulting in a distinct difference in gene order (Fig 1). And despite gene content being conserved, the numbers of copies changed, especially in some tRNAs (Table 1). Structural differences in mtDNA from the same genus of plants are relatively common, for example, a conventional single molecular structure was resolved in Broussonetia papyrifera (Moraceae) and B. monoica while a double-ring structure was resolved in B. kaempferi with gene content showing consistency except for a few tRNA genes . As a whole, homologous regions across the two Gentiana mtDNAs revealed high compositional and low structural conservation. According to Fu et al. 2021 , the time of divergence of G. straminea and its relative species in Gentiana sect. Cruciata was about 4.14 million years ago. The rearrangement rate of mtDNAs of the two species in this study was estimated using the number of rearrangement events (Trans / Inv -related events events) by dividing it by two times the divergence time and a rate of ~19.9 rearrangement events per million years was obtained, which is not surprising given prior studies of angiosperms . The mtDNAs of seed plants show high rates of rearrangement of at least 600-fold and rearrangement rates may vary greatly even within a single genus [48, 49].
The upper and lower bars represented the mitochondrial genomes. Dark-orange and blue regions in each bar represented the forward and reverse direction of the aligned genome, respectively. White regions in each bar represent the sequences that could not be aligned to the other genome. Lines between the two bars indicated the syntenic types and locations: magenta, blue-green, dark yellow and light-green represent collinearity, translocation, inversion and translocation+inversion, respectively.
The inner circle is the G. crassicaulis genome, and the outer circle is the G. straminea genome. Collinear: homologous region; translocation: translocation region; inversion: inverted region; tran+inver: translocation and inversion region; insertion: insertion region with length ≥50 bp; deletion: deletion region with length ≥50 bp; complexInDel: region without alignment but corresponding to location; forward_chain: forward chain of a genomic sequence, at which time the gene sitz markers increase clockwise; reverse_chain: the reverse chain of a genomic sequence, where the gene coordinates increase counterclockwise.
Transfer of cpDNA to mtDNA
Recent research has shown information exchange and transmission between chloroplasts and mitochondria in plants . The transfer of cpDNA to mitochondria occurs mostly in the ancestors of vascular plants . The length of mtDNA of G. crassicaulis is 2.48 times that of its cpDNA (148,757 bp), and the length of mtDNA of G. straminea is 2.75 times that of its cpDNA (148,991 bp). Both species show transfer of cpDNA to mtDNA (Fig 7). In the cpDNA and mtDNA of the two species, 21 and 24 transferred segments with total lengths of 46,511 and 55,043 bp are identified with a similarity greater than 90%, respectively. The length of the transferred segments in G. crassicaulis is 141–6551 bp and the length of the transferred segments in G. straminea is 109–6176 bp. Some of these transferred segments of cpDNA carry genes, in which the protein-coding genes usually become non-functional, while the tRNA genes mostly remain functional. This phenomenon has also been found in Magnolia biondii , Asclepias syriaca , R. stricta , Salvia miltiorrhiza  and other plants. Three tRNA genes, including trnW-CCA, trnV-GAC and trnI-CAU, from the cpDNA were identified in the mtDNA of G. crassicaulis. Three tRNA genes from the cpDNA, namely trnW-CCA, trnI-CAU and trnN-GUU, were also found in the mtDNA of G. straminea (S11 Table). Intracellular gene transfer of DNA from the chloroplast to the mitochondrion in higher plants is a common phenomenon, and the transferred fragments differ greatly. In the analysis of the homologs in 79 species of chloroplast-derived mitochondrion fragments, fragment containing trnW-CCA was also observed in Nelumbo nucifera, Nicotiana tabacum, Vitis vinifera and Liriodendron tulipifera, while the most common fragments were those that contain trnN-GTT, trnH-GTG and trnM-CAT .
The green lines within the circle show the regions of the chloroplast genome those have been inserted into different locations of the mitochondrial genome.
Comparison of mtDNAs of Gentianales
Collinearity and structural variation analysis were performed based on the mtDNAs of G. crassicaulis, G. straminea and four species in Gentianales, namely Asclepias syriaca, R. stricta, Cynanchum auriculatum and Scyphiphora hydrophyllacea. The coverages of collinear comparison only accounted for 18.40–26.02% of the genome of G. crassicaulis and 16.79–25.95% of the genome of G. straminea, respectively. A large number of Trans, Inv and Indels were found in pairwise comparison, and the proportion of collinearity in the comparison areas was only 12.14–14.58% for G. crassicaulis and 6.71–13.79% for G. straminea, respectively (S1 and S2 Figs). The results show that the mtDNAs of the two Gentiana species are markedly different from other species in Gentianales. There are 42 pan genes (all non-shared genes and shared genes) in the mtDNAs of these six species in Gentianales, including 18 core genes. Asclepias syriaca and R. stricta have one specific gene each (Fig 8, S12 Table). These core genes and specific genes could provide the basis for studying the functional differences among different species in Gentianales.
Gentiana crassicaulis and G. straminea belong to the Gentianaceae family in the Asterids clade. To identify its phylogenetic position, the mtDNAs of 32 species belonging to six orders of Asterids were selected for phylogenetic analysis, including Ericales, Gentianales, Solanales, Lamiales, Asterales and Apiales (S1 Table). Cycas taitungensis, from Cycadales, was set as an outgroup. In the phylogenetic tree, bootstrap values of 20 out of 31 nodes are 100%, and bootstrap values of 7 nodes are ≥90%. Gentiana crassicaulis and G. straminea are closely clustered on a branch with a bootstrap value of 100%. Gentianaceae is located in the branch of Gentianales, which is consistent with the classification of APG IV. The topologies of the phylogenetic tree mainly consist of two clades, one of which is composed of Euasterids I (Solanales, Gentianales and Lamiales) and Euasterids II (Asterales and Apiales), and the other is composed of basal Asterids (Ericales) (Fig 9). The systematic relationship of these orders is consistent with that based on cpDNAs in a previous study . It should be noted that the relationship of Solanales, Gentianales and Lamiales within Euasterids I remains poorly resolved [54, 55]. In this study, the result supports the position of Gentianales as the sister of Solanales, but the node connecting the three orders has a relatively low bootstrap value of 73%. This problem also exists in cpDNA-based phylogenetic tree . The system relationship among the orders in Euasterids I needs further analysis.
The tree was constructed via a maximum-likelihood analysis using PhyML 3.0 with 1000 bootstrap replications.
Gentiana crassicaulis and G. straminea are closely related species of Gentiana sect. Cruciata and their cpDNAs show a consistent structure and high sequence similarity , unlike their mtDNAs, which show great structural differences and are also very different from those of other species in Gentianales. Studies have shown that mtDNAs of plants differentiate structurally quickly, resulting in a large loss of collinear and shared DNA [56, 57]. For example, only 51% of the mtDNA of Vigna radiata is homologous to that of Glycine max, despite their differentiation for only 19 million years . The sizes of mtDNAs of plants vary greatly among different families and genera  but also among closely related species. For example, Gossypium arboreum, woody cotton, and Gossypium hirsutum, upland cotton, differ in mtDNA size by 65.6 kb. Moreover, only about half of mtDNA sequences are identical among different populations of Silene venosa . There is no positive correlation between genome size and gene number. The size of the mtDNA of G. crassicaulis is smaller than that of G. straminea, but it contains more genes than G. straminea. For another example, the mtDNA of Welwitschia mirabilis is more than twice as large as the mtDNA of Cycas taitungensis, but it has fewer genes than Cycas taitungensis . The main reason is that the mtDNA is paradoxically composed of intergenic regions that are evolving more rapidly than the surrounding DNA, resulting in sparse gene arrangement and an increasingly large and complex genome .
In this study, the deletion of atp6 is found in the mtDNA of G. straminea. There are widespread gene deletions in the mtDNAs of plants during evolution. The number of protein-coding genes decreases with plant evolution, and the lost genes are usually ribosome protein genes and succinate dehydrogenase genes . For example, sdh2 was lost in the earliest evolution of plants, and rps9, rps11 and rps16 were lost in higher plants when they differentiated from lower plants. Subsequently, in the differentiation of monocotyledons and dicotyledons, monocotyledons lost rps12, sdh3 and sdh4, and dicotyledons lost rps2. Furthermore, at differentiation of different classes and genera of higher plants, some genes may have been lost . The respiratory chain complex V genes in the mtDNAs of plants include atp1, atp4, atp6, atp8 and atp9. Deletion of these genes has also been detected in some plants. For example, Chondrus crispus loses atp1 , Arabidopsis thaliana and Nicotiana tabacum lose atp4 [4, 60], Nicotiana tabacum loses atp8 , and Citrullus lanatus has one additional source of five edited sites in the form of a partial atp9 pseudogene . The atp6 gene is absent in the single-cell green alga Chlamydomonas reinhardtii . Moreover, some plants, such as Lotus japonicus, contain the atp6 pseudogene in addition to the complete atp6 gene . There is no complete atp6 gene in G. straminea, and only the atp6 pseudogene exists, which is rare in angiosperms. The atp6 gene encodes the largest F0 subunit (A subunit) in the F1F0-ATP synthase system . Male sterility of rice, rape and radish is known to be related to atp6 mutation [65–67]. In our previous study, the rps16 pseudogene of the Asterids clade nonparasitic plant was first identified in the cpDNA of G. straminea . Is there a correlation between the specificity of genomic structure and the environment of G. straminea as a cold-tolerant alpine plant? How does the loss of relevant genes affect its function? All of these questions need to be further studied.
Genetic information is exchanged between the genomes of plant organelles (mitochondria and chloroplasts). Chloroplasts are highly genetically conserved with few foreign DNA, but the transfer of cpDNA to mtDNA occurs frequently . For example, the expansion of mtDNA of Cucurbita pepo may be related to the transfer of a large number of chloroplast sequences . Seventeen chloroplast sequences ranging in size from 32 bp to 6.7 kb were detected in mtDNA of rice, and they were scattered throughout mtDNA, accounting for 6.3% of the total length of mtDNA. No common motifs or secondary structures were found in the boundary regions of the integration of these foreign DNA sequences. It is speculated that the integration of the chloroplast DNA sequences is not site-specific and may be the result of non-homologous or random recombination . Meanwhile, the DNA sequences transferred from chloroplasts to mitochondria which are functional are limited to tRNA genes. Due to multiple sequence changes and the absence of RNA editing, transferred protein-coding sequences are often non-functional .
Loss of tRNA genes is also associated with the evolution of plants, which may occur during the differentiation of bryophytes and gymnosperms [63, 70]. There are huge differences in tRNA genes between the mtDNAs of higher plants and those of lower plants, and there are also differences in tRNA genes among plants of different families or genera of higher plants, as well as among the populations of the same species [44, 71]. An important feature of tRNA genes in the mtDNAs of plants is that they have dual origins; that is, some tRNA genes are directly inherited from mitochondrial ancestors, while the others are derived from the transferred sequences of chloroplasts. Therefore, tRNA genes in mtDNAs of plants could be divided into "tRNA originating from mitochondria" and "tRNA originating from chloroplasts" by source . For example, in the mtDNA of Boea hygrometrica, half of the 28 tRNA genes were identified as originating from the cpDNA . In the mtDNA of Spirodela polyrhiza, 4 out of 19 tRNA genes originated from the cpDNA . In this study, three tRNA genes are derived from the cpDNA in the mtDNA of G. crassicaulis and G. straminea, respectively, and G. crassicaulis contains two more unique tRNA genes than G. straminea.
The genetic background of Gentiana sect. Cruciata is complex, and there are hybrid individuals of different species , which makes it difficult to identify related species of this section. Previous studies showed incomplete concerted evolution of ITS sequences in many species of Gentiana sect. Cruciata, such as G. crassicaulis [23, 24]. MtDNAs and cpDNAs are maternal-inherited with moderate sizes and no heterozygote, and rapid progress has been made in the application of DNA barcoding developed from them in the identification of traditional Chinese medicine [5, 74]. A previous study also found that nad1/b-c and nad5/d-e from mtDNAs have implications for species identification in Gentiana sect. Cruciata . In our study, a large number of repeats and SSR markers are developed based on the mtDNAs of G. crassicaulis and G. straminea, which may be further used for molecular identification and genetic diversity of Gentiana and Gentianaceae.
In this study, the mtDNAs of alpine plants G. crassicaulis and G. straminea of Gentianaceae are sequenced for the first time, and the sequences and structures are compared and analyzed.
The mtDNAs of G. crassicaulis and G. straminea are 368,808 and 410,086 bp long, respectively, 52 and 49 unique genes are annotated in the two species. And the pseudogene atp6 was identified in G. straminea, which is rare in higher plants. A large number of repeats and SSR markers are developed and cpDNA transferring to mtDNA is observed in both species. The phylogenetic tree based on mtDNAs supports the traditional taxonomic view of Gentianaceae. The work could provide data for analysis of mtDNA structure and species evolution of higher plants and data for phylogenetic study and species identification of Gentianaceae.
S1 Data. Validation of sequences of atp6 gene.
S1 Fig. Collinear relationship between mitochondrial genomes of Gentiana crassicaulis (G. straminea) and other four Gentianales plants.
S2 Fig. Structural variation map of mitochondrial genomes of Gentiana crassicaulis (G. straminea) and other four Gentianales plants.
S1 Table. GenBank IDs of mitochondrial genomes used to construct the phylogenetic tree.
S2 Table. Illumina NovaSeq 6000 sequencing results and PacBio Sequel II sequencing results.
S3 Table. Characteristics of introns and exons in mitochondrial genes of Gentiana crassicaulis and G. straminea.
S4 Table. Long repeats of mitochondrial genomes of Gentiana crassicaulis and G. straminea.
S5 Table. SSRs of mitochondrial genomes of Gentiana crassicaulis and G. straminea.
S6 Table. Gentiana straminea SNP annotation results of the reference sequence of G. crassicaulis.
S7 Table. Gentiana straminea InDel annotation results of the reference sequence of G. crassicaulis.
S8 Table. Coverage statistics of collinearity comparison of Gentiana crassicaulis (Target) and G. straminea (Query).
S9 Table. Information of collinearity comparison of Gentiana crassicaulis (Target) and G. straminea (Query).
S10 Table. Information of structural variation analysis of Gentiana crassicaulis (Target) and G. straminea (Query).
S11 Table. Information exchange and transfer between chloroplasts and mitochondria in Gentiana crassicaulis and G. straminea.
S12 Table. Pan genes of 6 Gentianales plants.
Pan RZ, Wang XJ, Li NH. Phytophysiology. Higher Education Press, Beijing, China. 2012.
- 2. Picault N, Hodges M, Palmieri L, Palmieri F. The growing family of mitochondrial carriers in Arabidopsis. Trends in Plant Science. 2004; 9(3):138–146. pmid:15003237
- 3. Dyall SD, Brown MT, Johnson PJ. Ancient invasions: from endosymbionts to organelles. Science. 2004; 304(5668):253–257. pmid:15073369
- 4. Unseld M, Marrienfeld J R, Brandt P, Brennicke A. The mitochondrial genome of Arabidopsis thaliana contains 57 genes in 366,924 nucleotides. Nature Genetics. 1997; 15(1):57–61. pmid:8988169
- 5. Lu JN, Zhao ZL, Ni LH, Dorje G. Identification of medicinal plants based on mitochondrial DNA sequences. Chinese Traditional and Herbal Drugs. 2016; 47(10):1791–1796.
- 6. Skippington E, Barkman TJ, Rice DW, Palmer JD. Miniaturized mitogenome of the parasitic plant Viscum scurruloideum is extremely divergent and dynamic and has lost all nad genes. Proceedings of the National Academy of Sciences of the United States of America. 2015; 112(27):E3515–24. pmid:26100885
- 7. Putintseva YA, Bondar EI, Simonov EP, Sharov VV, Oreshkova NV, Kuzmin DA, et al. Siberian larch (Larix sibirica Ledeb.) mitochondrial genome assembled using both short and long nucleotide sequence reads is currently the largest known mitogenome. BMC Genomics. 2020; 21:654. pmid:32972367
- 8. Li S, Xue L, Su A, Lei B, Wang Y, Hua J. Progress on sequencing and alignment analysis of higher plant mitochondrial genomes. Journal of China Agricultural University. 2011; 16:22–27.
- 9. Lei B, Li S, Liu G, Wang Y, Su A, Hua J. Evolutionary analysis of mitochondrial genomes in higher plants. Molecular Plant Breeding. 2012; 10(4):490–500.
- 10. Huang CY, Grunheit N, Ahmadinejad N, Timmis JN, Martin W. Mutational decay and age of chloroplast and mitochondrial genomes transferred recently to angiosperm nuclear chromosomes. Plant Physiology. 2005; 138:1723–1733. pmid:15951485
- 11. Turmel M, Otis C, Lemieux C. The chloroplast and mitochondrial genome sequences of the charophyte Chaetosphaeridium globosum: insights into the timing of the events that restructured organelle DNAs within the green algal lineage that led to land plants. Proceedings of the National Academy of Sciences of the United States of America. 2002; 99:11275–11280. pmid:12161560
- 12. Wang D, Wu Y, Shih AC, Wu C, Wang Y, Chaw S. Transfer of Chloroplast Genomic DNA to Mitochondrial Genome Occurred At Least 300 MYA. Molecular Biology and Evolution. 2007; 24(9):2040–8. pmid:17609537
- 13. Adams KL, Palmer JD. Evolution of mitochondrial gene content: gene loss and transfer to the nucleus. Molecular Phylogenetics and Evolution. 2003; 29(3):380–395. pmid:14615181
- 14. Qiu YL, Li L, Wang B, Xue JY, Hendry TA, Li RQ, et al. Angiosperm phylogeny inferred from sequences of four mitochondrial genes. Journal of Systematics and Evolution. 2010; 48(6):391–425.
He TN. Genus Gentiana. In: He TN, editor. Flora Reipublicae Popularis Sinicae. Volume 62. Science Press, Beijing, China; 1988. pp. 14–257.
- 16. Zhang X L, Wang Y J, Ge X J, Yuan YM, Liu JQ. Molecular phylogeny and biogeography of Gentiana sect. Cruciata (Gentianaceae) based on four chloroplast DNA datasets. Taxon. 2009; 58(3):862–870.
Wang WY. Jie-Ji. In: Yang YC, editor. Tibetan Medicine. Qinghai People’s Publishing House, Xining, China; 1991. pp. 9–12.
Chinese Pharmacopoeia Commission. Pharmacopoeia of the People’s Republic of China. Volume 1. Chinese Medical Science and Technology Press, Beijing, China; 2020. pp. 282.
- 19. Zhao ZL, Dorje G, Wang ZT. Identification of medicinal plants used as Tibetan traditional medicine Jie-Ji. Journal of Ethopharmacology. 2010; 132 (1):122–126. pmid:20692329
- 20. Ni L, Zhao Z, Xu H, Chen S, Dorje G. The complete chloroplast genome of Gentiana straminea (Gentianaceae), an endemic species to the Sino-Himalayan subregion. Gene. 2016; 577(2):281–288. pmid:26680100
- 21. Ni L, Zhao Z, Xu H, Chen S, Dorje G. Chloroplast genome structures in Gentiana (Gentianaceae), based on three medicinal alpine plants used in Tibetan herbal medicine. Current Genetics. 2017; 63(2):241–252. pmid:27422574
- 22. Dong BR, Zhao ZL, Ni LH, Dorje G, Liu TH. Molecular markers based upon whole chloroplast genomes and identifying alpine Gentiana waltonii and G. lhassica (Gentianaceae). Acta Pharmaceutica Sinica. 2021; 56(9):2584–2591.
- 23. Xiong B, Zhao ZL, Ni LH, Gaawe D, Mi M. DNA-based identification of Gentiana robusta and related species. China Journal of Chinese Materia Medica. 2015; 40(23):4680–4685.
- 24. Ni LH, Zhao ZL, Xiong B, Gaawe D, Mi M. A strategy for identifying six species of Sect. Cruciata (Gentiana) in Gansu using DNA barcode sequences. Acta Pharmaceutica Sinica. 2016; 51(5):821–827. pmid:29878733
- 25. Lu JN, Zhao ZL, Ni LH, Gaawe D, Mi M. The identification of Sect. Cruciata (Gentiana) species using mtDNA nad1/b-c and nad5/d-e fragments. Acta Pharmaceutica Sinica. 2019; 54(1):166–172.
- 26. Chen J, Guan R, Chang S, Du T, Zhang H, Xing H. Substoichiometrically different mitotypes coexist in mitochondrial genomes of Brassica napus L. PLoS ONE. 2011; 6(3):e17662. pmid:21423700
- 27. Dmitry A, Anton K, Jeffrey SM, Pavel AP. HYBRIDSPADES: an algorithm for hybrid assembly of short and long reads. Bioinformatics. 2016; 32(7):1009–1015. pmid:26589280
- 28. Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Research. 1997; 25:955–964. pmid:9023104
- 29. Lagesen K, Hallin P, Rødland EA, Staerfeldt HH, Rognes T, Ussery D. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Research. 2007; 35:3100–3108. pmid:17452365
- 30. Lohse M, Drechsel O, Bock R. OrganellarGenomeDRAW (OGDRAW): a tool for the easy generation of high-quality custom graphical maps of plastid and mitochondrial genomes. Current Genetics. 2007; 52:267–274. pmid:17957369
- 31. Kurtz S, Schleiermacher C. REPuter: fast computation of maximal repeats in complete genomes. Bioinformatics. 1999; 15(5):426–427. pmid:10366664
- 32. Thiel T, Michalek W, Varshney R, Graner A. Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theoretical and Applied Genetics. 2003; 106 (3):411–422. pmid:12589540
- 33. Andreas U, Ioana C, Triinu K, Jian Y, Faircloth BC, Maido R, et al. Primer3-new capabilities and interfaces. Nucleic Acids Research. 2012; 15:e115. pmid:22730293
- 34. Maruyama F, Kobata M, Kurokawa K, Nishida K, Sakurai A, Nakano K, et al. Comparative genomic analyses of Streptococcus mutans provide insights into chromosomal shuffling and species-specific content. BMC Genomics. 2009; 10:358. pmid:19656368
- 35. Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Systematic Biology. 2010; 59:307–21. pmid:20525638
- 36. Wang W, Wu Y, Messing J. The mitochondrial genome of an aquatic plant, Spirodela polyrhiza. PLoS ONE. 2012; 7(10):e46747 pmid:23056432
- 37. Aljohi HA, Liu W, Lin Q, Zhao Y, Yu J. Complete sequence and analysis of coconut palm (Cocos nucifera) mitochondrial genome. PLoS ONE. 2016; 11(10):e0163990. pmid:27736909
- 38. Terasawa K, Odahara M, Kabeya Y, Kikugawa T, Sekine Y, Fujiwara M, et al. The mitochondrial genome of the moss Physcomitrella patens sheds new light on mitochondrial evolution in land plants. Molecular Biology and Evolution. 2007; 24(3): 699–709. pmid:17175527
- 39. Hecht J, Grewe F, Knoop V. Extreme RNA editing in coding islands and abundant microsatellites in repeat sequences of Selaginella moellendorffii mitochondria: the root of frequent plant mtDNA recombination in early tracheophytes. Genome Biology and Evolution. 2011; 3:344–358. pmid:21436122
- 40. Zhang T, Fang Y, Wang X, Deng X, Zhang X, Hu S, et al. The complete chloroplast and mitochondrial genome sequences of Boea hygrometrica: insights into the evolution of plant organellar genomes. PLoS ONE. 2012; 7(1):e30531. pmid:22291979
- 41. Gualberto JM, Newton KJ. Plant mitochondrial genomes: dynamics and mechanisms of mutation. Annual Review of Plant Biology. 2017; 68: 225–252. pmid:28226235
- 42. Dong S, Chen L, Liu Y, Wang Y, Zhang S. The draft mitochondrial genome of Magnolia biondii and mitochondrial phylogenomics of angiosperms. PLoS ONE. 2020; 15(4):e0231020. pmid:32294100
- 43. Dong S, Zhao C, Chen F, Liu Y, Zhang S, Wu H, et al. The complete mitochondrial genome of the early flowering plant Nymphaea colorata is highly repetitive with low recombination. BMC Genomics. 2018; 19:614. pmid:30107780
- 44. Sloan DB, Alverson AJ, Chuckalovcak JP, Wu M, McCauley DE, Palmer JD, et al. Rapid evolution of enormous, multichromosomal genomes in flowering plant mitochondria with exceptionally high mutation rates. PLoS Biology. 2012; 10(1):e1001241. pmid:22272183
- 45. Alverson AJ, Zhuo S, Rice DW, Sloan DB, Palmer JD. The mitochondrial genome of the legume Vigna radiata and the analysis of recombination across short mitochondrial repeats. PLoS ONE. 2011; 6(1): e16404. pmid:21283772
- 46. Lai C, Jie Wang, Kan S, Zhang S, Li P, Reeve WG, et al. Comparative analysis of mitochondrial genomes of Broussonetia spp. (Moraceae) reveals heterogeneity in structure, synteny, intercellular gene transfer, and RNA editing. Frontiers in Plant Science. 2022; 13:1052151. pmid:36531410
- 47. Fu PC, Twyford AD, Sun SS, Wang HY, Xia MZ, Tan CX, et al. Recurrent hybridization underlies the evolution of novelty in Gentiana (Gentianaceae) in the Qinghai-Tibetan Plateau. AoB Plants. 2021; 13(1):1–14. pmid:33510891
- 48. Cole LW, Guo W, Mower JP, Palmer JD. High and variable rates of repeat-mediated mitochondrial genome rearrangement in a genus of plants. Molecular Biology and Evolution. 2018; 35(11):2773–2785. pmid:30202905
- 49. Guo W, Grewe F, Fan W, Young GJ, Knoop V, Palmer JD, et al. Ginkgo and Welwitschia mitogenomes reveal extreme contrasts in gymnosperm mitochondrial evolution. Molecular Biology and Evolution. 2016; 33(6):1448–1460. pmid:26831941
- 50. Straub SCK, Cronn R C, Christopher E, Mark F, Aaron L. Horizontal transfer of DNA from the mitochondrial to the plastid genome and its subsequent evolution in milkweeds (Apocynaceae). Genome Biology and Evolution. 2013; 5:1872–1885. pmid:24029811
- 51. Park S, Ruhlman TA, Sabir JSM, Mutwakil MHZ, Baeshen MN, Sabir MJ, et al. Complete sequences of organelle genomes from the medicinal plant Rhazya stricta (Apocynaceae) and contrasting patterns of mitochondrial genome evolution across asterids. BMC Genomics. 2014; 15:405. pmid:24884625
- 52. Gao C, Wu C, Zhang Q, Zhao X, Li Z. Characterization of chloroplast genomes from two Salvia medicinal plants and gene transfer among their mitochondrial and chloroplast Genomes. Frontiers in Genetics. 2020; 11:574962. pmid:33193683
- 53. Gui S, Wu Z, Zhang H, Zheng Y, Zhu Z, Liang D, et al. The mitochondrial genome map of Nelumbo nucifera reveals ancient evolutionary features. Scientific Reports. 2016; 6:30158. pmid:27444405
- 54. Bremer B, Bremer K, Heidari N, Erixon P, Olmstead RG, Anderberg AA, et al. Phylogenetics of Asterids based on 3 coding and 3 non-coding chloroplast DNA markers and the utility of non-coding DNA at higher taxonomic levels. Molecular phylogenetics and evolution. 2002; 24(2):274–301. pmid:12144762
- 55. Ku C, Chung WC, Chen LL, Kuo CH. The complete plastid genome sequence of Madagascar periwinkle Catharanthus roseus (L.) G. Don: plastid genome evolution, molecular marker identification, and phylogenetic implications in Asterids. PLoS ONE. 2013; 8(6):e68518. pmid:23825699
Knoop V. Seed plant mitochondrial genomes: complexity evolving. In: Bock R, Knoop V, editors. Genomics of chloroplasts and mitochondria. Springer; 2012. pp.175–200.
Mower JP, Sloan DB, Alverson AJ. Plant mitochondrial genome diversity: the genomics revolution. In: Wendel JF, editor. Plant genome diversity. Volume 1. Springer; 2012. pp. 123–144.
- 58. Naito K, Kaga A, Tomooka N, Kawase M. De novo assembly of the complete organelle genome sequences of azuki bean (Vigna angularis) using next-generation sequencers. Breeding Science. 2013; 63:176–182. pmid:23853512
- 59. Burger G. Complete sequence of the mitochondrial DNA of the red alga Porphyra purpurea: cyanobacterial introns and shared ancestry of red and green algae. The Plant Cell. 1999; 11(9):1675–1694. pmid:10488235
- 60. Sugiyama Y, Watase Y, Nagase M, Makita N, Yagura S, Hirai A, et al. The complete nucleotide sequence and multipartite organization of the tobacco mitochondrial genome: comparative analysis of mitochondrial genomes in higher plants. Molecular Genetics and Genomics. 2005; 272(6):603–615. pmid:15583938
- 61. Xue JY, Liu Y, Li L, Wang B, Qiu YL. The complete mitochondrial genome sequence of the hornwort Phaeoceros laevis: retention of many ancient pseudogenes and conservative evolution of mitochondrial genomes in hornworts. Current Genetics. 2010; 56:53–61. pmid:19998039
- 62. Alverson AJ, Xiaoxin W, Rice DW, Stern DB, Kerrie B, Palmer JD. Insights into the evolution of mitochondrial genome size from complete sequences of Citrullus lanatus and Cucurbita pepo (Cucurbitaceae). Molecular Biology and Evolution. 2010; 27(6):1436. pmid:20118192
- 63. Kazakoff SH, Imelfort M, Edwards D, Koehorst J, Biswas B, Batley J, et al. Capturing the biofuel wellhead and powerhouse: the chloroplast and mitochondrial genomes of the Leguminous feedstock tree Pongamia pinnata. PLoS ONE. 2012; 7(12):e51687. pmid:23272141
- 64. Ji J, Huang W, Yin C, Gong Z. Mitochondrial cytochrome c oxidase and F1F0-ATPase dysfunction in peppers (Capsicum annuum L.) with cytoplasmic male sterility and its association with orf507 and Ψatp6-2 Genes. International Journal of Molecular Sciences. 2013; 14:1050–1068. pmid:23296278
- 65. Makaroff CA, Apel IJ, Palmer JD. The atp6 coding region has been disrupted and a novel reading frame generated in the mitochondrial genome of cytoplasmic male-sterile radish. Journal of Biological Chemistry. 1989; 264(20):11706–11713. pmid:2745412
- 66. Kadowaki KI, Suzuki T, Kazama S. A chimeric gene containing the 5′ portion of atp6 is associated with cytoplasmic male-sterility of rice. Molecular and General Genetics. 1990; 224(1):10–16. pmid:2148966
- 67. Landgren M, Zetterstrand M, Sundberg E, Glimelius K. Alloplasmic male-sterile Brassica lines containing B. tournefortii mitochondria express an ORF 3′ of the atp6 gene and a 32 kDa protein. Plant Molecular Biology. 1996; 32(5):879–890. pmid:8980539
- 68. Hao WL, Palmer JD. Fine-scale mergers of chloroplast and mitochondrial genes create functional, transcompartmentally chimeric mitochondrial genes. Proceedings of the National Academy of Sciences of the United States of America. 2009; 106(39):16728–16733. pmid:19805364
- 69. Notsu Y, Masood S, Nishikawa T, Kubo N, Akiduki G, Nakazono M, et al. The complete sequence of the rice (Oryza sativa L.) mitochondrial genome: frequent DNA sequence acquisition and loss during the evolution of flowering plants. Molecular genetics and genomics. 2002; 268:434–445. pmid:12471441
- 70. Ye N, Wang X, Li J, Bi C, Xu Y, Wu D, et al. Assembly and comparative analysis of complete mitochondrial genome sequence of an economic plant Salix suchowensis. PeerJ. 2017; 5(1):e3148. pmid:28367378
- 71. Wei S, Wang X, Bi C, Xu Y, Wu D, Ye N. Assembly and analysis of the complete Salix purpurea L. (Salicaceae) mitochondrial genome sequence. SpringerPlus. 2016; 5(1):1894. pmid:27843751
- 72. Sprinzl M, Steegborn C, Frank Hübel, Steinberg S. Compilation of tRNA sequences and sequences of tRNA genes. Nucleic Acids Research. 1998; 6(1):148–153. pmid:9399820
- 73. Li X, Wang L, Yang H, Liu J. Confirmation of natural hybrids between Gentiana straminea and G.siphonantha (Gentianaceae) based on molecular evidence. Frontiers of Biology in China. 2008; 3:470–476.
- 74. Chen SL, Yao H, Han JP, Xin TY, Pang XH, Shi CL, et al. Principles for molecular identification of traditional Chinese materia medica using DNA barcoding. China Journal of Chinese Materia Medica. 2013; 38(2):141–148. pmid:23672031