Figures
Abstract
Vigna reflexo-pilosa can be found in both wild and cultivated forms. It is the only tetraploid species in the genus Vigna in Fabaceae, occurring through hybridization between Vigna hirtella and Vigna trinervia, with the chromosome number of 2n = 4x = 44. V. reflexo-pilosa provides an invaluable gene pool for improving cultivated Vigna crop varieties. This study aimed to report the complete mitochondrial and chloroplast genomes of V. reflexo-pilosa. A total of 6,496,297 raw reads were generated from V. reflexo-pilosa using the long-read PacBio technology. The complete mitochondrial genome was assembled into a linear structure with a total length of 370,913 base pairs (bp) with 45.20% GC content. It contains 32 protein-coding genes, 18 transfer RNA genes, and 3 ribosomal RNA genes. A total of 520 RNA editing sites were detected in 30 protein-coding genes. The V. reflexo-pilosa mitochondrial genome shared large colinear blocks with Vigna radiata as compared to nine other mitochondrial genomes in Fabaceae. In addition, Vigna hirtella (male parent) and Vigna trinervia (female parent) were sequenced using the MGI sequencing technology. The complete chloroplast genome of V. reflexo-pilosa, V. hirtella, and V. trinervia was assembled into a circular structure with a total length of 150,967, 151,915 and 151,226 bp, respectively. All three chloroplast genomes consist of 128 genes. We found no evidence of shared genes between the mitochondrial and chloroplast genomes of V. reflexo-pilosa. Comparison of the three Vigna chloroplast genomes showed high levels of similarity between V. reflexo-pilosa and V. trinervia, revealing maternal inheritance of the chloroplast genomes. Based on both mitochondrial and chloroplast genes, phylogenetic trees showed that V. reflexo-pilosa is closely related to V. radiata. These genomes enhance our understanding of mitochondrial and chloroplast evolution of V. reflexo-pilosa and are valuable genetic resources in legumes.
Citation: Ruang-areerate P, Pinsupa S, Kongkachana W, Yoocha T, Phetchawang P, Paenpong P, et al. (2025) Assembly and analysis of the complete mitochondrial and chloroplast genomes of Vigna reflexo-pilosa. PLoS One 20(6): e0325243. https://doi.org/10.1371/journal.pone.0325243
Editor: Karthikeyan Adhimoolam, Jeju National University, KOREA, REPUBLIC OF
Received: July 31, 2024; Accepted: May 9, 2025; Published: June 11, 2025
Copyright: © 2025 Ruang-areerate et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The mitochondrial genome sequence of V. reflexo-pilosa was deposited under NCBI accession number OR654957 (https://www.ncbi.nlm.nih.gov/nuccore/OR654957). The chloroplast genome sequences of V. reflexo-pilosa, V. hirtella and V. trinervia were deposited under NCBI accession numbers PP728425, PP728426 and PP728427 (https://www.ncbi.nlm.nih.gov/nuccore/PP728425, https://www.ncbi.nlm.nih.gov/nuccore/PP728426, and https://www.ncbi.nlm.nih.gov/nuccore/PP728427), respectively.
Funding: This work was funded by 2000092. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
The genus Vigna belonging to the family Fabaceae includes over 100 species of five subgenera, which are Ceratotropis, Haydonia, Lasiosporon, Plectotropis and Vigna [1,2]. Among them, Vigna species in the subgenus Ceratotropis distribute across Asia, thus they are known as Asian Vigna [3]. Seven Asian Vigna species including Vigna aconitifolia (moth bean), Vigna angularis (azuki bean), Vigna mungo (black gram), Vigna radiata (mung bean), Vigna reflexo-pilosa (créole bean), Vigna stipulacea (Minni Payaru) and Vigna umbellata (rice bean) have been domesticated owing to their rapid growth and adaptability to various cropping systems [3]. V. reflexo-pilosa, which is a wild legume and a neglected crop distributed in Asia, is originated from interspecific hybridization from two genome donors, Vigna hirtella and Vigna trinervia [4,5]. It is a tetraploid legume species with the chromosome number of 2n = 4x = 44, whereas other Vigna species are diploid legume species with the chromosome number of 2n = 2x = 22 [3–5].
Mitochondrion plays a crucial role in energy production in animal and plant cells. In general, mitochondrial genomes are circular double-stranded DNA molecules that are various in contents, sizes and structures, especially in plants [6,7]. The mitochondrial genomes of legumes in Fabaceae are approximately 400 kb in length [8–12]. For example, the mitochondrial genomes of cultivated mung bean (V. radiata) and wild mung bean (V. radiata var. sublobata) are 401 and 403 kb in length, respectively [8,9]. The mitochondrial genomes of common bean (Phaseolus vulgaris) and wild soybean (Glycine soja) are also 390 kb and 400 kb in length, respectively [10,11].
Chloroplast is an organelle within plant cells that is specifically used for photosynthesis. Chloroplast genomes commonly consist of a large single-copy region (LSC), inverted repeats (IRs) and a single small-copy region (SSC) [12–22]. In legumes, chloroplast genomes are approximately 150 − 155 kb in length and encode about 126 − 130 genes [12–19]. For instance, the chloroplast genome of V. radiata and Glycine max is approximately 151 and 152 kb in length and consist of 127 and 130 genes, respectively [15,17].
Next-generation sequencing (NGS) is used to study genomes and genes associated with interesting traits. In plants, NGS has been successfully utilized to sequence complete mitochondrial genomes [9–12,23,24] and chloroplast genomes [12–22].
In this study, we sequenced, assembled, and annotated the mitochondrial genome of V. reflexo-pilosa and the chloroplast genomes of V. reflexo-pilosa, V. hirtella, and V. trinervia using the PacBio and MGI sequencing technologies, respectively. We analyzed mitochondrial RNA editing sites, repeats and SSRs, and compared our mitochondrial genome with related legume species. Transfer sequences between mitochondrial and chloroplast genomes of V. reflexo-pilosa were also evaluated. Moreover, we conducted a phylogenetic analysis of V. reflexo-pilosa using mitochondrial and chloroplast genes.
Materials and methods
Plant materials, DNA extraction, and sequencing
We used DNA of the créole bean (V. reflexo-pilosa var. reflexo-pilosa) accession AusTRCF30263 that was previously used in Pootakham et al. [5]. According to the previous study, DNA was extracted using CTAB buffer [25] and 25:24:1 phenol, chloroform, and isoamyl alcohol. DNA was precipitated using absolute ethanol and washed with 70% ethanol. DNA was resuspended in 10 mM Tris-HCl (pH 8.0) after air drying. DNA was purified using the Ampure PB beads kit (Pacific Biosciences, Menlo Park, CA, USA), and the integrity of DNA was assessed using the Pippin Pulse Electrophoresis system (Sage Science, Beverly, MA, USA).
To sequence mitochondrial (V. reflexo-pilosa) and chloroplast genomes (V. reflexo-pilosa, V. hirtella, and V. trinervia), DNA libraries and sequencing were prepared following the previous studies [5,26]. Briefly, the SMRTbell Express Template Prep Kit 2.0 was used to create the PacBio SMRTbell library from the high molecular weight DNA template, and the PacBio Sequel system (Pacific BioSciences, Menlo Park, CA, USA) was used to sequence the libraries and to produce raw reads for assembling a mitochondrial genome. Whole-genome libraries for the chloroplast genome sequencing were constructed following the MGIEasy Universal DNA Library Prep Set Instruction Manual (MGI Tech, San Jose, CA, USA) and sequenced on the DNBSEQ-G400 using the MGISEQ-2000RS Sequencing Flow Cell v3.0 following the manufacturer’s protocol.
Assembly and annotation
The long-read PacBio raw data were assembled using Canu v1.9 with the default parameters [27]. To extract mitochondrial contigs, all contigs were blasted against three published mitochondrial Vigna genomes, V. radiata (HM367685), V. angularis (NC 021092), and V. unguiculata (MW448464), using BLASTN [28]. Several mitochondrial contigs were identified and chosen when they have the longest length and cover all mitochondrial genomes. Moreover, the selected mitochondrial contig was blasted against known mitochondrial genes of V. radiata using BLASTN. Finally, genes and ribosomal RNAs (rRNAs) were identified using the mitofy software v2012 [29], and transfer RNAs (tRNAs) were detected using tRNAscan-SE v1.23 with the default parameters [30].
Chloroplast genomes were assembled and annotated using the GetOrganelle software v1.7.7.1 [31] and the web application GeSeq [32], respectively. Chloroplast protein-coding genes and rRNAs were identified using ARAGORN v1.2.36 [33] in the GeSeq software. Chloroplast tRNAs were identified using tRNAscan-SE v1.23. Furthermore, OGDRAW v1.3.1 [34] was used to display the circular structure of the chloroplast genomes.
To investigate chloroplast-derived regions, the mitochondrial genome of V. reflexo-pilosa was blasted against its chloroplast genome using BLASN, with a threshold of 1e-10 and >90% identity.
Identification of RNA editing sites, repeats, and SSRs
To identify candidate RNA editing sites, we used transcriptome data from a previous study [5]. The transcriptome data was mapped to the V. reflexo-pilosa mitochondrial genome using minimap v2.17 [35]. The RNA-seq mapping result was visualized using IGV v2.8.10 [36], and then RNA editing sites were manually checked for changing C-to-T (the plus stand of a gene) or G-to-A (the minus stand of a gene) with changes by at least 10%.
Repeat sequences in the V. reflexo-pilosa mitochondria genome were identified using the web application REPuter [37]. In addition, the simple sequence repeats (SSRs) of the V. reflexo-pilosa mitochondrial genome were detected using MISA v2.1 [38] with the default parameters (mononucleotide (the number of repeats = 10), dinucleotide (6), trinucleotide (5), tetranucleotide (5), pentanucleotide (5), and hexanucleotide (5)).
Comparative analysis of mitochondrial genomes
Structure variations in mitochondria genomes between V. reflexo-pilosa and three other Vigna species (V. angularis, V. radiata, and V. unguiculata) as well as seven plant species in Fabaceae (Glycine max, Glycine soja, Medicago polymorpha, Phaseolus vulgaris, Robinia pseudoacacia, Senna occidentalis, and Trifolium grandiflorum) were preformed using MAUVE v20150226 [39] and the D-GENIES web application [40]. The mitochondrial genomes of the ten species are available in the NCBI database with accession numbers (see in the method of phylogenetic analysis)
Furthermore, the sequence divergence of shared fragment mitochondrial sequences and 14 conserved genes between V. reflexo-pilosa and three other Vigna species (V. angularis, V. radiata, and V. unguiculata) was estimated using dnaSP v.6.12.03 [41].
Comparative analysis of chloroplast genomes
Sequence divergence among three chloroplast genomes (V. reflexo-pilosa, V. trinervia, and V. hirtella) was investigated using mVISTA with the Shuffe-LAGAN mode [42]. The V. reflexo-pilosa chloroplast genome was used as a reference.
Phylogenetic analysis
Phylogenetic analyses with the maximum likelihood (ML) method were carried out using MEGA X [43] based on both mitochondrial and chloroplast genes. The best-fit model was a GTR + I + G model that was evaluated in the mitochondrial and chloroplast gene sets using the best DNA/protein model tool in MEGA X. Node supports were performed with 1000 bootstrap replicates. Ginkgo biloba is used as an outgroup species.
For mitochondrial phylogeny, we used 14 mitochondrial genes including atp9, ccmB, ccmC, cob, cox1, cox3, nad3, nad4, nad6, nad7, nad9, rps3, rps4 and rps12, which were shared in V. reflexo-pilosa and 28 plant species. The 28 mitochondrial genomes from the NCBI database are 10 legumes in Fabaceae (Glycine max: NC_020455, Glycine soja: NC_039768, Medicago polymorpha: MW971562, Phaseolus vulgaris: NC_045135, Robinia pseudoacacia: MW448465, Senna occidentalis: NC_038221, Trifolium grandiflorum: NC_048501, Vigna angularis: NC_021092, Vigna radiata: NC_015121, and Vigna unguiculata: MW448464) and 18 other plants (Arabidopsis thaliana: NC_037304, Capsicum annuum: NC_024624, Citrullus lanatus: NC_014043, Cucumis sativus: NC_016005, Cucurbita pepo: NC_014050, Eucalyptus grandis: NC_040010, G. biloba: NC_027976, Gossypium barbadense: NC_028254, Luffa acutangula: NC_050067, Nicotiana tabacum: NC_006581, Oryza sativa: NC_066488, Populus alba: NC_041085, Salix purpurea: NC_029693, Solanum lycopersicum: NC_035963, Sorghum bicolor: NC_008360, Zea luxurians: NC_008333, Zea mays: NC_007982, and Zea perennis: NC_008331).
For chloroplast phylogeny, we used 54 chloroplast genes, including atpA, atpB, atpE, atpF, atpH, atpI, ccsA, clpP, matK, ndhB, ndhC, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, petA, petB, petD, petG, petL, petN, psaA, psaB, psaC, psaI, psaJ, psbA, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbM, psbN, psbT, psbZ, rbcL, rpl20, rpl23, rpl2, rpl36, rpoA, rpoB, rps2, rps3, rps4, rps7, rps8, and ycf3, were shared in the chloroplast genome of three Vigna species from our study (V. reflexo-pilosa, V. hirtella, and V. trinervia) and the 28 plant species, which are the same species in the construction of the mitochondrial evolutionary tree. The 28 chloroplast genomes from the NCBI database are 10 legumes in Fabaceae (G. max: PP712901, G. soja: NC_022868, M. polymorpha: NC_042848, P. vulgaris: NC_009259, R. pseudoacacia: NC_026684, S. occidentalis: OR478159, T. grandiflorum: NC_024034, V. angularis: NC_021091, V. radiata: NC_013843, and V. unguiculata: NC_018051) and the 18 other plants (A. thaliana: NC_000932, C. annuum: NC_018552, C. lanatus: NC_032008, C. pepo: NC_038229, C. sativus: NC_007144, E. grandis: NC_014570, G. biloba: NC_016986, G. barbadense: NC_008641, L. acutangula: MT381996, N. tabacum: NC_001879, O. sativa: NC_031333, P. alba: NC_008235, S. purpurea: NC_026722, S. lycopersicum: NC_007898, S. bicolor: NC_008602, Z. luxurians: NC_030301, Z. mays: NC_001666, and Z. perennis: NC_030300).
Results
Assembly and annotation of the V. reflexo-pilosa mitochondrial genome
A total of 6,496,297 PacBio long-reads were generated from V. reflexo-pilosa accession AusTRCF30263 and assembled into a mitochondrial genome. The complete mitochondrial genome of V. reflexo-pilosa was a linear structure and 370,913 bp in length (Fig 1). Four nucleotides, adenine (A), thymine (T), cytosine (C), and guanine (G), are present in percentages of approximately 27.52%, 27.28%, 22.62%, and 22.58%, respectively. The overall GC content is 45.20% (Table 1). The mitochondrial genome consisted of 53 genes, containing 32 protein-coding genes, 18 transfer RNA genes, and 3 ribosomal RNA genes as indicated in Fig 1 and Table 1.
Protein-coding genes, tRNA genes, and rRNA genes are shown on the line.
A total of 32 protein-coding genes were identified in the mitochondrial genome of V. reflexo-pilosa (Table 2). They were classified into nine groups including genes encoding ATP synthase, cytochrome c biogenesis, cytochrome c oxidase, cytochrome c reductase, maturase, NADH dehydrogenase, transport membrane protein, large ribosomal subunit, and small ribosomal subunit. Eight genes out of the 32 genes (ccmFc, nad1, nad2, nad4, nad5, nad7, rps3, and rps10) have introns as shown in Table 2. The sizes of protein-coding genes in the mitochondrial genome of V. reflexo-pilosa varied from 303 bp (nad4L and rps14) to 2002 bp (nad5).
Chloroplast genome assembly and annotation of V. reflexo-pilosa, V. hirtella and V. trinervia
The sizes of the complete chloroplast genomes of V. reflexo-pilosa, V. trinervia, and V. hirtella were 150,967 bp (Fig 2), 151,226 bp and 151,915 bp (S1 Fig), respectively. All three chloroplast genomes consist of 83 protein-coding genes, 36 tRNA genes, 8 rRNA genes, and a pseudogene. The chloroplast genome of V. reflexo-pilosa included a large single-copy (LSC) region of 80,704 bp, a small single-copy (SSC) region of 17,402 bp and two inverted repeat (IR) regions of 26,430 bp as shown in Table 3. The chloroplast genome of V. trinervia and V. hirtella also included the LSC region of 80,904 and 81,518 bp, the SSC region of 17,424 and 17,471 bp and the IR regions of 26,449 and 26,463 bp, respectively.
Known protein-coding genes, tRNA genes and rRNA genes are shown on the outside and inside of the circle. Colored genes were based on their functional groups.
A total of 128 protein-coding genes were identified in three Vigna chloroplast genomes and categorized into 17 functional groups (Table 4). These groups included genes encoding photosystem I subunit, photosystem II subunit, NADH dehydrogenase, cytochrome b/f complex subunit, ATP synthase subunit, rubisco, maturase, protease, envelope membrane protein, acetyl-CoA carboxylase, c-type cytochrome synthesis, large ribosomal subunit, small ribosomal subunit, RNA polymerase, ribosomal RNAs, transfer RNAs, and unknown function. Twelve genes out of the 128 genes have introns (atpF, clpP1, ndhA, ndhB, petB, petD, rpl2, rpl16, rpoC1, rps7, rps12, and ycf3).
RNA editing in the V. reflexo-pilosa mitochondrial genome
A total of 520 edited sites were identified in 30 protein-coding genes of the V. reflexo-pilosa mitochondrial genome (atp4, atp6, atp8, atp9, ccmB, ccmC, ccmFc, ccmFn, cob, cox1, cox3, mttB, matR, nad1, nad2, nad3, nad4, nad4L, nad5, nad6, nad7, nad9, rpl5, rpl16, rps1, rps3, rps4, rps10, rps12, and rps14), containing edited C to T (G to A) base changes (Fig 3). However, no edited site was found in the atp1 gene. The nad4 gene contained the highest number, with 45 editing sites followed by ccmFn and nad7 genes, each with 34 editing sites. Most genes typically start with the codon ATG. However, four genes (mttB, nad1, nad4L, and rps10) were edited at the second site of the start codon (ACG to ATG). Additionally, the editing site of ccmFc occurred at the first position of its stop codon (CGA to TGA).
Repeats and SSRs in the V. reflexo-pilosa mitochondrial genome
A total of 50 repeats, consisting of 27 forward and 23 palindromic repeats, were detected in the mitochondrial genome of V. reflexo-pilosa (S1 Table). The repeats ranged from 40 to 357 bp in length.
A total of 37 SSRs were identified (S2 Table). Most of the mononucleotide repeats (96.70%) are composed of A/T. Among the monomeric SSRs, thymine (T) and adenine (A) repeats were the most abundant, accounting for 57.58% and 39.39% of the total monomer repeats, respectively. The four dinucleotides are all composed entirely of AT/TA.
Collinearity analysis of mitochondrial genomes in Fabaceae
Mitochondrial genome synteny among eleven plant species in the family Fabaceae, including four Vigna species, P. vulgaris, G. max, G. soja, R. pseudoacacia, M. polymorpha, T. grandiflorum, and S. occidentalis, was evaluated. We found numerous collinear blocks that contained long and short ortholog sequences (Fig 4). Most of them were rearranged among species, except G. max and G. soja. Notably, the mitochondrial genomes of V. reflexo-pilosa and V. radiata have undergone less genome rearrangements than other Vigna species. Furthermore, a dot-plot analysis was also performed (Fig 5). The results showed numerous collinear blocks between V. reflexo-pilosa and V. radiata, but others were short and fragmented.
Locally collinear blocks identified among four Vigna species and seven other plant species in Fabaceae. Each colored region represents a local collinear block revealing the similarity between genomes. The connecting lines represent as connected blocks to trace each orthologous region.
Dot plot of four Vigna species and seven other plant species in Fabaceae. Horizontal coordinates in each box represent the mitochondrial Vigna reflexo-pilosa sequence, whereas vertical coordinates represent other mitochondrial sequences. Color lines denote the percentage of sequence identity.
Between V. reflexo-pilosa and other Vigna species, a total of six shared mitochondrial fragments (0.95–477.20 kb) were observed (S3 Table). Pi values of the six fragments ranged from 0.0088 (fragment 2) to 0.0159 (fragment 4). In addition, Pi values of the fourteen conserved genes in Fabaceae ranged from 0.0031 (nad7) to 0.0203 (nad9) (S4 Table). These results revealed low levels of divergence in the mitochondrial regions among Vigna species and plant species in Fabaceae.
Comparative chloroplast genomes
The differences among three Vigna chloroplast genomes were evaluated using mVISTA based on the annotated V. reflexo-pilosa chloroplast genome as a reference (Fig 6). The chloroplast genome of V. reflexo-pilosa was highly conserved with V. trinervia when compared with V. hirtella. The result showed a high divergence level between V. reflexo-pilosa and V. hirtella in several non-coding regions of LSC and SSC regions such as rpoB-trnC, psbZ-trnG, psaA-ycf3, psbA-trnH, and ycf1-rps15. Among the chloroplast genomes, a low divergence level was found in IR regions.
Horizontal and vertical axes show genome position and a percentage sequence identity, respectively.
Phylogenetic relationship of V. reflexo-pilosa
To reveal the evolution of V. reflexo-pilosa, ML phylogenetic trees were performed using mitochondrial and chloroplast genes (Fig 7). Based on 14 conserved genes among 29 mitochondrial genomes, the ML tree showed that V. reflexo-pilosa was in the clade of Fabaceae and was closer to V. radiata than to V. angularis and V. unguiculata with high bootstrap values (>90%) as shown in Fig 7A. Based on 54 conserved genes among 31 chloroplast genomes (the 29 species with the analysis of mitochondrial genomes and two Vigna species: V. trinervia and V. hirtella), the ML tree revealed that V. reflexo-pilosa was also in the clade Fabaceae and was closer to V. trinervia than to V. radiata, V. hirtella, V. angularis and V. unguiculata with 100 bootstrap values (Fig 7B).
(A) Maximum likelihood tree inferred from the concatenated data sets of 14 mitochondrial genes in 29 accessions (B) Maximum likelihood tree inferred from the concatenated data sets of 54 plastid genes in 33 accessions. Bootstrap support values less than 100% are denoted above branches.
Discussion
The mitochondrial genome of V. reflexo-pilosa was constructed using PacBio long-read data, resulting in a genome size of 370,913 bp with a GC content of 45.20%. The size of V. reflexo-pilosa mitochondrial genome was slightly smaller than the mitochondrial genome size of closely related species such as V. angularis (404,466 bp), V. radiata (401,262 bp), and V. unguiculata (383,314 bp) [8,12]. However, the overall GC and gene contents of the mitochondrial genomes in Vigna species are similar. The mitochondrial genome of V. reflexo-pilosa contains 31 unique protein-coding genes, which are consistent with other Vigna species such as V. angularis and V. radiata [8,12]. Additionally, the presence of two copies of the atp9 gene was observed in the V. reflexo-pilosa mitochondrial genome, similar to that found in V. radiata, V. angularis and Castanea mollissima [8,12,44]. Our results revealed the absence of seven mitochondrial genes (cox2, rpl2, rpl10, rps2, rps11, rps13, and sdh3) in V. reflexo-pilosa, which are concordant with V. angularis and V. radiata [8,12]. Notably, rpl2 is commonly lost in legumes, and cox2 is lost in the subfamily Phaseolinae [10].
The chloroplast genome of V. reflexo-pilosa (150,967 bp), V. trinervia (151,226 bp), and V. hirtella (151,915 bp) were slightly different with the result of a previous study (153,169 bp, 151,161 bp and 151,564 bp, respectively) [18]. Their chloroplast genome sizes are similar to V. radiata, which was found to be 151,271 bp [15], and P. vulgaris was 150,285 bp [19]. The number of genes of the three Vigna chloroplast genomes are similar to the number of genes in other legume chloroplast genomes that contain 126 − 130 genes including 108 − 111 unique genes and 17 − 19 genes which are duplicated in the IR region. [13–17,19].
While mitochondrial genomes in most plants often contain shared genes with chloroplast genomes [14], our finding demonstrated there was no gene transfer from chloroplast to mitochondria of V. reflexo-pilosa. Both mitochondrial and chloroplast genomes encode a set of genes that allow them to synthesize their own proteins. The independence of the two genomes indicates that each has preserved the necessary capacities to perform its essential functions.
In angiosperms, RNA editing is a post-transcriptional process that modifies genetic information by converting C to U, which was found in mitochondrial transcripts [45–47]. Interestingly, RNA-editing events were found in 30 genes of the mitochondrial genome of V. reflexo-pilosa, which were also observed in other plants such as Arabidopsis thaliana [48] and Calophyllum soulattri [49]. RNA-editing in several genes such as ccmFc, mttB, nad1, nad4L, nad5, and rps10 were consistent with presumed RNA-editing events in V. angularis [12] and P. vulgaris [10]. Remarkably, mttB, nad1, nad4L, and rps10 genes were edited on the second site of the start codon, and the ccmFc gene encoded C-terminal region of ccmF was edited on the first site of the stop codon, which was found in other plants such as Phaseolus vulgaris [10], Calophyllum soulattri [49], and Garcinia mangostana [50]. The ccmF genes are essential components of the electron transport chain in mitochondria [51]; therefore, the RNA-editing event affects their functions. Indeed, RNA editing can lead to the start and stop codon generation and affect the structure of protein in plant mitochondria [52]. For example, the nucleotide in the early position of exon2 of ccmFc gene in wheat mitochondria changed from an asparagine codon to a stop codon, which may lead to change protein structure and function [51].
Comparison of three Vigna chloroplast genomes using mVISTA supported that V. trinervia is a maternal parent of V. reflexo-pilosa due to the high similarity of their chloroplast sequences, which is concordant with several previous studies based on gene and genome sequences [4,5,18].
Phylogenetic analysis of V. reflexo-pilosa based on mitochondrial and chloroplast protein-coding genes provides insights on its evolution. Our tree and previous trees in Fabaceae using mitochondrial or chloroplast genes had highly similar topologies [2,18]. For example, the phylogenetic relationships based on single copy chloroplast genes among 23 Vigna accessions, V. reflexo-pilosa and V. trinervia showed a monophyletic group, and V. hirtella showed a polyphyletic group with V. reflexo-pilosa and V. trinervia [18]. To contrast, phylogenetic trees based on nuclear genes or nuclear rDNA-ITS regions, V. reflexo-pilosa and V. hirtella showed a monophyletic group, whereas V. trinervia showed a paraphyletic or polyphyletic group with V. reflexo-pilosa and V. hirtella [2,5]. Therefore, V. reflexo-pilosa, V. trinervia and V. hirtella are high relationships based on the results of phylogenetic trees, confirming V. reflexo-pilosa and its parents. Therefore, phylogenetic trees could represent and reflect the evolutionary history of V. reflexo-pilosa and related species using the difference of markers.
Conclusions
In this study, the complete mitochondrial genome of V. reflexo-pilosa and the chloroplast genomes of V. reflexo-pilosa. V. trinervia (female parent), and V. hirtella (male parent) were presented. The gene contents of the mitochondrial genome of V. reflexo-pilosa were similar to those of other Vigna species. There are 520 RNA-editing sites in 30 genes of the V. reflexo-pilosa mitochondrial genome. Comparison of mitochondrial genomes across Fabaceae species showed that the mitochondrial structure of V. reflexo-pilosa and V. radiata were similar than other Fabaceae species. In addition, high chloroplast sequence similarities were observed in V. reflexo-pilosa and V. trinervia, revealing maternal lineage. The phylogenetic analysis based on both mitochondrial and chloroplast genes showed that V. radiata is closely related to V. reflexo-pilosa. Thus, the complete mitochondrial and chloroplast genomes of V. reflexo-pilosa as well as its parents are a good source of genetic information for studying evolution and phylogeny in plants of the family Fabaceae.
Supporting information
S1 Figure. The complete chloroplast genome of V. trinervia and V. hirtella.
Circular structure of (A) V. trinervia and (B) V. hirtella chloroplast genome. Known protein-coding genes, tRNAs and rRNAs are shown on the outside and inside of the circle. Colored genes were based on their functional groups.
https://doi.org/10.1371/journal.pone.0325243.s001
(DOCX)
S1 Table. List of repeats in the V. reflexo-pilosa mitochondrial genome.
https://doi.org/10.1371/journal.pone.0325243.s002
(XLSX)
S2 Table. List of SSRs in the V. reflexo-pilosa mitochondrial genome.
https://doi.org/10.1371/journal.pone.0325243.s003
(XLSX)
S3 Table. Nucleotide diversity of six shared mitochondrial fragments between V. reflexo−pilosa and three other Vigna species.
https://doi.org/10.1371/journal.pone.0325243.s004
(DOCX)
S4 Table. Nucleotide diversity of 14 shared mitochondrial genes in V. reflexo-pilosa and ten reported plant species in the family Fabaceae.
https://doi.org/10.1371/journal.pone.0325243.s005
(DOCX)
References
- 1. Verma SK, Singh CK, Taunk J, Gayacharan, Chandra Joshi D, Kalia S, et al. Vignette of Vigna domestication: From archives to genomics. Front Genet. 2022;13:960200. pmid:36338960
- 2. Takahashi Y, Somta P, Muto C, Iseki K, Naito K, Pandiyan M, et al. Novel Genetic Resources in the Genus Vigna Unveiled from Gene Bank Accessions. PLoS One. 2016;11(1):e0147568. pmid:26800459
- 3.
Tomooka N, Vaughan DA, Moss H, Maxted N. The Asian Vigna. Springer Netherlands. 2002. https://doi.org/10.1007/978-94-010-0314-8
- 4. Chankaew S, Isemura T, Isobe S, Kaga A, Tomooka N, Somta P, et al. Detection of genome donor species of neglected tetraploid crop Vigna reflexo-pilosa (créole bean), and genetic structure of diploid species based on newly developed EST-SSR markers from azuki bean (Vigna angularis). PLoS One. 2014;9(8):e104990. pmid:25153330
- 5. Pootakham W, Sonthirod C, Naktang C, Yundaeng C, Yoocha T, Kongkachana W, et al. Genome assemblies of Vigna reflexo-pilosa (créole bean) and its progenitors, Vigna hirtella and Vigna trinervia, revealed homoeolog expression bias and expression-level dominance in the allotetraploid. Gigascience. 2022;12:giad050. pmid:37470496
- 6. Smith DR, Keeling PJ. Mitochondrial and plastid genome architecture: Reoccurring themes, but significant differences at the extremes. Proc Natl Acad Sci U S A. 2015;112(33):10177–84. pmid:25814499
- 7. Zardoya R. Recent advances in understanding mitochondrial genome diversity. F1000Res. 2020;9:F1000 Faculty Rev-270. pmid:32399193
- 8. Alverson AJ, Zhuo S, Rice DW, Sloan DB, Palmer JD. The mitochondrial genome of the legume Vigna radiata and the analysis of recombination across short mitochondrial repeats. PLoS One. 2011;6(1):e16404. pmid:21283772
- 9. Lin C-P, Schafleitner R, Chen C-Y, Lo H-F, Chen L-FO. The complete mitochondrial genome of wild mung bean (Vigna radiata var. sublobata TC1966). Mitochondrial DNA B Resour. 2019;4(2):3016–7. pmid:33365835
- 10. Bi C, Lu N, Xu Y, He C, Lu Z. Characterization and Analysis of the Mitochondrial Genome of Common Bean (Phaseolus vulgaris) by Comparative Genomic Approaches. Int J Mol Sci. 2020;21(11):3778. pmid:32471098
- 11. Asaf S, Khan AL, Al-Harrasi A, Kim TH, Lee I-J. The first complete mitochondrial genome of wild soybean (Glycine soja). Mitochondrial DNA B Resour. 2018;3(2):527–8. pmid:33474228
- 12. Naito K, Kaga A, Tomooka N, Kawase M. De novo assembly of the complete organelle genome sequences of azuki bean (Vigna angularis) using next-generation sequencers. Breed Sci. 2013;63(2):176–82. pmid:23853512
- 13. Tian S, Lu P, Zhang Z, Wu JQ, Zhang H, Shen H. Chloroplast genome sequence of Chongming lima bean (Phaseolus lunatus L.) and comparative analyses with other legume chloroplast genomes. BMC Genomics. 2021;22(1):194. pmid:33736599
- 14. Ha Y-H, Chang KS, Gil H-Y. Characteristics of chloroplast and mitochondrial genomes and intracellular gene transfer in the Korean endemic shrub, Sophora koreensis Nakai (Fabaceae). Gene. 2024;894:147963. pmid:37926173
- 15. Tangphatsornruang S, Sangsrakru D, Chanprasert J, Uthaipaisanwong P, Yoocha T, Jomchai N, et al. The Chloroplast Genome Sequence of Mungbean (Vigna radiata) Determined by High-throughput Pyrosequencing: Structural Organization and Phylogenetic Relationships. DNA Research. 2009;17(1):11–22.
- 16. Nawae W, Yundaeng C, Naktang C, Kongkachana W, Yoocha T, Sonthirod C, et al. The Genome and Transcriptome analysis of the vigna mungo chloroplast. Plants (Basel). 2020;9(9):1247. pmid:32967378
- 17. Saski C, Lee S-B, Daniell H, Wood TC, Tomkins J, Kim H-G, et al. Complete chloroplast genome sequence of Gycine max and comparative analyses with other legume genomes. Plant Mol Biol. 2005;59(2):309–22. pmid:16247559
- 18. Lee J, Kang YJ, Park H, Shim S, Ha J, Lee T, et al. Unraveling the maternal and paternal origins of allotetraploid Vigna reflexo-pilosa. Sci Rep. 2023;13(1):22951. pmid:38135720
- 19. Guo X, Castillo-Ramírez S, González V, Bustos P, Fernández-Vázquez JL, Santamaría RI, et al. Rapid evolutionary change of common bean (Phaseolus vulgaris L) plastome, and the genomic diversification of legume chloroplasts. BMC Genomics. 2007;8:228. pmid:17623083
- 20. Ruang-Areerate P, Kongkachana W, Naktang C, Sonthirod C, Narong N, Jomchai N, et al. Complete chloroplast genome sequences of five Bruguiera species (Rhizophoraceae): comparative analysis and phylogenetic relationships. PeerJ. 2021;9:e12268. pmid:34733586
- 21. Ruang-Areerate P, Yoocha T, Kongkachana W, Phetchawang P, Maknual C, Meepol W, et al. Comparative analysis and phylogenetic relationships of ceriops species (rhizophoraceae) and avicennia lanata (acanthaceae): insight into the chloroplast genome evolution between middle and seaward zones of mangrove forests. Biology (Basel). 2022;11(3):383. pmid:35336757
- 22. Ruang-Areerate P, Sangsrakru D, Yoocha T, Kongkachana W, U-Thoomporn S, Prathip Na Thalang O, et al. Comparative Chloroplast Genomes and Phylogenetic Relationships of True Mangrove Species Brownlowia tersa and Brownlowia argentata (Malvaceae). Curr Issues Mol Biol. 2025;47(2):74. pmid:39996795
- 23. Wei S, Wang X, Bi C, Xu Y, Wu D, Ye N. Assembly and analysis of the complete Salix purpurea L. (Salicaceae) mitochondrial genome sequence. Springerplus. 2016;5(1):1894. pmid:27843751
- 24. Ruang-Areerate P, Shearman J, Kongkachana W, Jomchai N, Yoocha T, U-Thoomporn S, et al. The complete mitochondrial genome of Luffa acutangula. Mitochondrial DNA B Resour. 2020;5(3):3208–9. pmid:33458114
- 25. Doyle JJ, Doyle JL. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochemical Bulletin. 1987;19:11–5.
- 26. Pootakham W, Naktang C, Sonthirod C, Kongkachana W, Narong N, Sangsrakru D, et al. Chromosome-level genome assembly of Indian mangrove (Ceriops tagal) revealed a genome-wide duplication event predating the divergence of Rhizophoraceae mangrove species. Plant Genome. 2022;15(3):e20217. pmid:35608212
- 27. Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 2017;27(5):722–36. pmid:28298431
- 28. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10:421. pmid:20003500
- 29. Alverson AJ, Wei X, Rice DW, Stern DB, Barry K, Palmer JD. Insights into the evolution of mitochondrial genome size from complete sequences of Citrullus lanatus and Cucurbita pepo (Cucurbitaceae). Mol Biol Evol. 2010;27(6):1436–48. pmid:20118192
- 30. Schattner P, Brooks AN, Lowe TM. The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res. 2005;33(Web Server issue):W686-9. pmid:15980563
- 31.
Jin J-J, Yu W-B, Yang J-B, Song Y, dePamphilis CW, Yi T-S, et al. GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Cold Spring Harbor Laboratory. 2018. https://doi.org/10.1101/256479
- 32. Tillich M, Lehwark P, Pellizzer T, Ulbricht-Jones ES, Fischer A, Bock R, et al. GeSeq - Versatile and accurate annotation of organelle genomes. Nucleic Acids Res. 2017;45: W6–W11.
- 33. Laslett D, Canback B. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res. 2004;32(1):11–6. pmid:14704338
- 34. Greiner S, Lehwark P, Bock R. OrganellarGenomeDRAW (OGDRAW) version 1.3.1: expanded toolkit for the graphical visualization of organellar genomes. Nucleic Acids Res. 2019;47(W1):W59–64. pmid:30949694
- 35. Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34(18):3094–100. pmid:29750242
- 36. Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, et al. Integrative genomics viewer. Nat Biotechnol. 2011;29(1):24–6. pmid:21221095
- 37. Kurtz S, Choudhuri JV, Ohlebusch E, Schleiermacher C, Stoye J, Giegerich R. REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 2001;29(22):4633–42. pmid:11713313
- 38. Beier S, Thiel T, Münch T, Scholz U, Mascher M. MISA-web: a web server for microsatellite prediction. Bioinformatics. 2017;33(16):2583–5. pmid:28398459
- 39. Darling ACE, Mau B, Blattner FR, Perna NT. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004;14(7):1394–403. pmid:15231754
- 40. Cabanettes F, Klopp C. D-GENIES: dot plot large genomes in an interactive, efficient and simple way. PeerJ. 2018;6:e4958. pmid:29888139
- 41. Rozas J, Ferrer-Mata A, Sánchez-DelBarrio JC, Guirao-Rico S, Librado P, Ramos-Onsins SE, et al. DnaSP 6: DNA Sequence Polymorphism Analysis of Large Data Sets. Mol Biol Evol. 2017;34(12):3299–302. pmid:29029172
- 42. Frazer KA, Pachter L, Poliakov A, Rubin EM, Dubchak I. VISTA: computational tools for comparative genomics. Nucleic Acids Res. 2004;32(Web Server issue):W273-9. pmid:15215394
- 43. Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol Biol Evol. 2018;35(6):1547–9. pmid:29722887
- 44. Guo H, Liu Q, Chen Y, Niu H, Zhao Q, Song H, et al. Comprehensive assembly and comparative examination of the full mitochondrial genome in Castanea mollissima Blume. Genomics. 2023;115(6):110740. pmid:37923179
- 45. Takenaka M, Verbitskiy D, van der Merwe JA, Zehrmann A, Brennicke A. The process of RNA editing in plant mitochondria. Mitochondrion. 2008;8(1):35–46. pmid:18326075
- 46. Hao W, Liu G, Wang W, Shen W, Zhao Y, Sun J, et al. RNA Editing and Its Roles in Plant Organelles. Front Genet. 2021;12:757109. pmid:34659369
- 47. Chu D, Wei L. The chloroplast and mitochondrial C-to-U RNA editing in Arabidopsis thaliana shows signals of adaptation. Plant Direct. 2019;3(9):e00169. pmid:31517178
- 48. Giegé P, Brennicke A. RNA editing in Arabidopsis mitochondria effects 441 C to U changes in ORFs. Proc Natl Acad Sci U S A. 1999;96(26):15324–9. pmid:10611383
- 49. Cadorna CAE, Pahayo DG, Rey JD. The first mitochondrial genome of Calophyllum soulattri Burm.f. Sci Rep. 2024;14(1):5112. pmid:38429360
- 50. Wee C-C, Nor Muhammad NA, Subbiah VK, Arita M, Nakamura Y, Goh H-H. Mitochondrial genome of Garcinia mangostana L. variety Mesta. Sci Rep. 2022;12(1):9480. pmid:35676406
- 51. Giegé P, Rayapuram N, Meyer EH, Grienenberger JM, Bonnard G. CcmF(C) involved in cytochrome c maturation is present in a large sized complex in wheat mitochondria. FEBS Lett. 2004;563(1–3):165–9. pmid:15063743
- 52. Tang W, Luo C. Molecular and Functional Diversity of RNA Editing in Plant Mitochondria. Mol Biotechnol. 2018;60(12):935–45. pmid:30244436