Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Comparative analysis of chloroplast genomes of seven Juniperus species from Kazakhstan

  • Shyryn Almerekova,

    Affiliation Molecular Genetics Laboratory, Institute of Plant Biology and Biotechnology, Almaty, Kazakhstan

  • Moldir Yermagambetova,

    Roles Formal analysis, Investigation, Writing – review & editing

    Affiliations Molecular Genetics Laboratory, Institute of Plant Biology and Biotechnology, Almaty, Kazakhstan, Faculty of Biology and Biotechnology, Al-Farabi Kazakh National University, Almaty, Kazakhstan

  • Smatulla Jumanov,

    Roles Investigation, Resources, Writing – review & editing

    Affiliation Aksu-Zhabagly Nature Reserve, Zhabagly, Turkestan Region, Kazakhstan

  • Saule Abugalieva,

    Roles Conceptualization, Project administration, Validation, Writing – original draft, Writing – review & editing

    Affiliations Molecular Genetics Laboratory, Institute of Plant Biology and Biotechnology, Almaty, Kazakhstan, Faculty of Biology and Biotechnology, Al-Farabi Kazakh National University, Almaty, Kazakhstan

  • Yerlan Turuspekov

    Roles Conceptualization, Supervision, Writing – original draft, Writing – review & editing

    yerlant@yahoo.com

    Affiliations Molecular Genetics Laboratory, Institute of Plant Biology and Biotechnology, Almaty, Kazakhstan, Faculty of Biology and Biotechnology, Al-Farabi Kazakh National University, Almaty, Kazakhstan

Correction

28 Jan 2025: The PLOS One Staff (2025) Correction: Comparative analysis of chloroplast genomes of seven Juniperus species from Kazakhstan. PLOS ONE 20(1): e0318596. https://doi.org/10.1371/journal.pone.0318596 View correction

Abstract

Juniperus species are shrubs or trees in the family Cupressaceae that play an important role in forest ecosystems. In this study, we report the complete sequences of the plastid (pt) genomes of five Juniperus species collected in Kazakhstan (J. communis, J. sibirica, J. pseudosabina, J. semiglobosa, and J. davurica). The sequences of the pt genomes of the five species were annotated in addition to two full pt genome sequences from J. sabina and J. seravschanica, which we have previously reported. The pt genome sequences of these seven species were compared to the pt genomes of Juniperus species available in the public NCBI database. The total length of the pt genomes of Juniperus species, including previously published pt genome data, ranged from 127,469 bp (J. semiglobosa) to 128,097 bp (J. communis). Each Juniperus plastome consisted of 119 genes, including 82 protein-coding genes, 33 transfer RNA and 4 ribosomal RNA genes. Among the identified genes, 16 contained one or two introns, and 2 tRNA genes were duplicated. A comparative assessment of pt genome sequences suggested the identification of 1145 simple sequence repeat markers. A phylogenetic tree of 26 Juniperus species based on the 82 protein-coding genes separated the Juniperus samples into two major clades, corresponding to the Juniperus and Sabina sections. The analysis of pt genome sequences indicated that accD and ycf2 were the two most polymorphic genes. The phylogenetic evaluation of 26 Juniperus species using these two genes confirmed that they can be efficiently used as DNA barcodes for phylogenetic analyses in the genus. The sequenced plastomes of these Juniperus species have provided a large amount of genetic data that will be valuable for future genomic studies of this genus.

Introduction

Juniperus L. is a genus in the family Cupressaceae that is mainly distributed throughout the Northern Hemisphere, from sea level to above the timberline. There are approximately 75 species in the genus, which belong to three different sections: Sabina, Juniperus, and Caryocedrus [1]. Species in the genus are widely distributed in dry foothills or at mid to high altitudes of Central Asian mountains, including Kazakhstan [2]. In Kazakhstan, the genus representatives are mainly found on dry mountain slopes and in alpine and subalpine belts [3, 4]. The country has eight species of Juniperus, including J. communis L., J. sibirica Burgsd., J. x media Dmitr., J. pseudosabina Fisch. et C.A. Mey., J. sabina L., J. semiglobosa Regel., J. seravschanica Kom. [5], and J. davurica Pall. [6]. Two of those species, J. communis and J. sibirica, belong to J. sect. Juniperus; the remaining six species belong to J. sect. Sabina [1]. Juniperus species play an important ecological role in the formation of mountain forest ecosystems as well as hydrobiological regulation and erosion prevention [1]. Juniperus species are highly significant in medicine and are used in perfumery and cosmetics [79]. Many chemical substances have been extracted from different Juniperus species, including cedrol [10, 11], glucosides [12, 13], polyphenols [14, 15], terpenoids, and tannins [1, 14, 16].

The genetic diversity of the species in the genus has been thoroughly studied using different types of DNA markers. In particular, markers such as random amplified polymorphic DNA (RAPD) [1720], inter-simple sequence repeats (ISSR) [19, 21, 22], amplified fragment length polymorphisms (AFLP) [2325], and single-nucleotide polymorphisms (SNP) [26] have been widely used to evaluate the genetic diversity of Juniperus. In the last 20 years, simple sequence repeats (SSRs), or microsatellites, have become the most popular marker; these are extensively employed in the population genetics of Juniperus species [2731]. However, the taxonomic relationships of many Juniperus species are dubious due to their complex morphological characteristics [32, 33]. For example, the species J. communis includes several morphological forms, which has led to a complex taxonomy in this species [34]. Another example is J. excelsa M. Bieb, which consists of morphologically similar species such as J. excelsa, J. polycarpos, and J. seravschanica [35]. A taxonomic reassessment of J. turkestanica and J. pseudosabina from Central Asia was conducted by Adams and Turuspekov (1998) [18] using RAPD markers. The analysis, based on less reliable RAPD markers [36], revealed that J. turkestanica and J. pseudosabina belong to a single species named J. pseudosabina. Therefore, further clarification of the phylogeny requires new molecular tools for more accurate resolution in the genus. The application of next-generation sequencing (NGS) technologies may provide further resources for genetic diversity assessments of Juniperus species. The rapid advancements in NGS technologies have allowed us to quickly and inexpensively obtain genome sequences [37, 38]. One promising direction in the genomic study of Juniperus is the evaluation of the genetic variation in plastid genomes. The Juniperus plastid genome is a circular molecule with a size of around 130,000 bp. Due to the small sizes of plastid genomes and their conserved gene content, structure, and uniparentally inherited characteristics, they may be informative when assessing the genetic diversity and phylogenetic relationships within the genus [39]. For instance, plastid genome markers (trnT-trnF, trnS-trnG, trnC-trnD, matK, rbcL, trnL, psbD, psbM, and petN) have been successfully used to resolve phylogenetic relationships in the Juniperus genus [4043].

Recently, promising steps have been accomplished in the characterization of the complete pt genomes of Juniperus species from China [4450], Pakistan [51], and Kazakhstan [52, 53]. A complete pt genome comparison of four Juniperus species from China has been published [54]. Chen et al. (2022) [54] reported that five significantly divergent regions, including accD, accD-rpl2, ycf1, ycf2, and rrn23-rrn4.5, could be used as DNA barcodes to identify relationships between different species and potential genetic markers. The authors concluded that further samples were needed for a reliable extrapolation of the phylogenetic relationships of Cupressaceae [54]. Thus, assessing genetic variations in newly sequenced pt genomes of the genus may be highly informative for studies of taxonomic relationships with higher precision. In this work, we report a comparison of the complete pt genome sequences of seven Juniperus species from Kazakhstan and discuss the possibilities for generating molecular markers for a phylogenetic analysis of the genus.

Materials and methods

Plant material and DNA extraction

The leaves of five Juniperus species were collected from different geographical locations in Kazakhstan in 2017 and 2020–2021 (Table 1) and dried in silica gel for DNA isolation. The permission for collecting plant material of Red Book species J. seravschanica was obtained from the Forestry and Wildlife Committee Ministry of Ecology, Geology and Natural Resources of the Republic of Kazakhstan. Total DNA was extracted from dried leaves using the cetyltrimethylammonium bromide (CTAB) method [55]. The quality and quantity of the isolated DNA of the Juniperus samples were evaluated using agarose gel electrophoresis and a NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific Inc., USA).

thumbnail
Table 1. Geographical locations of the collected Juniperus samples.

https://doi.org/10.1371/journal.pone.0295550.t001

Plastid genome sequencing, assembly, and annotation

Complete pt genome paired-end sequencing was performed using the Illumina NovaSeq 6000 platform (Illumina Inc., USA) at Macrogen Inc. (Seoul, Republic of Korea). DNA that passed quality control DNA was used for library construction using a TruSeq Nano DNA Kit (Illumina Inc., USA). The generated raw read FASTQ format files were used for the genome assembly. The raw data were checked using FastQC and trimmed using Trimmomatic tools [56]. Contigs were assembled from the trimmed reads using the SPAdes 3.13.0 [57] assembler approach. The assembled pt genomes were annotated using PGA [58] and GeSeq [59]. The graphical map of the plastid genomes of Juniperus species was generated using Organellar Genome DRAW (OGDRAW) v1.3.1 [60].

Genome comparisons, SSRs, and repeat analyses

The nucleotide diversity (Pi) of the seven complete pt genomes of the Juniperus species was calculated using the DnaSP (DNA Sequence Polymorphism) package, with a 200 bp step size and a 600 bp window length [61]. Simple sequence repeats (SSRs) were detected using MISA software [62] with the following thresholds: eight for mononucleotide repeats, four for dinucleotide repeats, four for trinucleotide repeats, and three for tetranucleotide, pentanucleotide, and hexanucleotide repeats. The forward (F), reverse (R), and palindromic (P) repeat elements were identified using the PERuter web- based program [63] with the following parameter settings: Hamming distance = 3 and minimum repeat size = 30 bp. In addition, each of the Juniperus plastid genomes was searched for tandem repeats using the Tandem Repeats Finder program [64] with the default settings.

Phylogenetic analysis

The concatenated sequences of 82 common protein-coding genes of 26 Juniperus samples were used for the phylogenetic analysis. Hesperocyparis stephensonii and Cupressus torulosa was used as the outgroup. The nucleotide sequences of the common protein-coding genes of five plastomes of Juniperus species from Kazakhstan were obtained in this study. The remaining 23 samples, including the outgroup, were extracted from the database of the National Center for Biotechnology Information (NCBI). Based on the Akaike information criterion, the best nucleotide substitution model (GTR+I+G) was calculated in jModelTest 2.1.10 [65]. Bayesian inference (BI) was carried out in MrBayes 3.2.7 [66]. FigTree 1.4.4 (http://tree.bio.ed.ac.uk/software/figtree/) was used to visualize and refine the BI-based phylogenetic tree. The maximum likelihood (ML) method [67] based tree was constructed using the MEGA 11 package [68] with 1000 bootstrap replicates. The newly obtained pt genome nucleotide sequences of the Juniperus species from Kazakhstan were deposited in GenBank. The accession numbers are listed in Table 1.

Results

Features of the plastomes of Juniperus species

DNA libraries of pt genomes from five Juniperus species (J. communis, J. sibirica, J. davurica, J. pseudosabina, and J. semiglobosa) were sequenced using the Illumina NovaSeq 6000 sequencing platform and compared with two previously reported pt genomes of J. sabina and J. seravschanica that had been collected in Kazakhstan. The largest amount of raw data in the compared pt genomes was in J. pseudosabina (4.88 GB), followed by J. davurica (4.38 GB), J. sibirica (4.37 GB), J. semiglobosa (4.18 GB), J. sabina (3.68 GB), J. seravschanica (3.27 GB), and J. communis (3.0 GB). The overall GC content of the assembled pt genomes of J. communis, J. sibirica, J. pseudosabina, J. semiglobosa, J. davurica, J. sabina, and J. seravschanica were 34.87%, 34.87%, 34.98%, 34.93%, 34.86%, 34.36%, and 34.45%, respectively (Table 2).

thumbnail
Table 2. Comparative characteristics of pt genomes of seven studied Juniperus species.

https://doi.org/10.1371/journal.pone.0295550.t002

The assembled lengths of the pt genomes ranged from 127,469 bp in J. semiglobosa to 128,097 bp in J. communis. The consensus circular gene map of the complete pt genomes of Juniperus species from Kazakhstan is shown in Fig 1. Individual circular maps for the five newly generated pt genomes are provided in S1 Fig. The nucleotide sequences of the complete pt genome data were deposited in the NCBI database. The accession numbers are provided in Tables 13.

thumbnail
Table 3. List of genes in pt genomes of J. communis, J. sibirica, J. pseudosabina, J. semiglobosa, and J. davurica.

One or two asterisks indicate one or two intron-containing genes, respectively; (x2) indicates the duplicated genes.

https://doi.org/10.1371/journal.pone.0295550.t003

The sequenced pt genomes of Juniperus species encoded 119 genes, including 82 protein-coding genes, 33 transfer RNA (tRNA) genes, and 4 ribosomal RNA (rRNA) genes (Table 2). Among the 119 identified genes, 16 (8 protein-coding genes and 6 tRNA genes) contained one or two introns; two tRNA genes were duplicated. The trnA-UGC, trnG-UCC, trnI-GAU, trnK-UUU, trnL-UAA, trnV-UAC, rpl16, rpl2, rpoC1, atpF, petB, petD, ndhA, and ndhB genes contained one intron, whereas rps12 and ycf3 contained two introns; two tRNA genes (trnI-CAU and trnQ-UUG) were duplicated (Table 3).

Comparative analysis of the plastomes of Juniperus species

The Pi values for the 82 analyzed protein-coding genes of the seven Juniperus complete pt genomes varied from 0.00036 to 0.04373, with an average value of 0.006. The assessment of the variable regions allowed us to identify accD, clpP-infA-matK, ycf1, ycf2, and ycf3 as the five most polymorphic genes (Pi > 0.013). The highest Pi value (0.044) was found in the accD gene (Fig 2). In several genes, more than one variable hotspot was identified. For instance, in accD, matK, and ycf2, two variable hotspots were detected, and ycf1 included more than three variable hotspots (Fig 2).

thumbnail
Fig 2. Nucleotide diversity (Pi) in the 82 protein-coding genes of Juniperus species.

Sliding window analysis with a window length of 600 bp and a step size of 200 bp.

https://doi.org/10.1371/journal.pone.0295550.g002

Phylogenetic analysis

A phylogenetic tree was constructed based on the concatenated sequences of the 82 common protein-coding genes acquired from the samples of 21 Juniperus species from the NCBI database and 5 samples from this study using the BI and ML analyses. Hesperocyparis stephensonii was applied as an outgroup in the analysis (Fig 3). The aligned length of the 82 protein-coding genes of the Juniperus samples, including the outgroup, was 82,953 bp. In general, 2990 bp (3.6%) of the nucleotide sequences of the coding genes out of the 82,953 bp aligned lengths were polymorphic.

thumbnail
Fig 3. Phylogenetic tree from BI and ML analyses based on 82 shared protein-coding genes of the Juniperus and outgroup samples.

The numbers above the branches indicate BI/ML support values. Number of clades provided according to Mao et al. (2010) [41].

https://doi.org/10.1371/journal.pone.0295550.g003

The phylogenetic tree separated the Juniperus samples into two major clades, corresponding to the Juniperus and Sabina sections. Two Juniperus species, J. communis and J. sibirica–both collected in Kazakhstan were clustered in the clade of the Juniperus section. The remaining five species were grouped in different subclades in the clade of the Sabina section (Fig 3).

Since the Pi values for the 82 identified genes in seven plastid genomes suggested that the most polymorphic genes were accD and ycf2 (Fig 2), these two genes were applied to reconstruct the phylogenetic tree for samples included in Fig 3. The BI output for the phylogenetic analysis of 26 taxa using accD and ycf2 (S2 Fig) showed a high similarity with the results in Fig 3.

SSRs and repeat sequences

In total, 1145 SSRs, or microsatellites, were identified in the seven Juniperus pt genomes using MISA software. The number of SSRs in individual Juniperus pt genomes ranged from 152 in J. seravschanica to 170 in J. sibirica and J. davurica (Table 4). Mononucleotide repeats (807) were the most common motifs in all the analyzed Juniperus species, with an average of 70.5% of the total amount of SSRs, followed by 241 dinucleotide (21%), 41 trinucleotide (3.6%), 49 tetranucleotide (4.3%), 5 pentanucleotide (0.4%), and 2 hexanucleotide (0.2%) repeats. The hexanucleotide repeats (AATATC/ATATTG) were least frequent in all seven pt genomes, appearing only in J. communis (1) and J. sibirica (1) species (Fig 4A). Most of the mononucleotide repeats were represented by the A/T motif in each plastome; the dinucleotide repeat was represented by the AT/AT motif, the trinucleotide repeat by AAG/CTT, the tetranucleotide repeat by AAAG/CTTT, and the pentanucleotide repeat by AATCC/ATTGG (Fig 4B).

thumbnail
Fig 4. Repeat sequence analysis of seven Juniperus plastomes.

(A) Number of repeat types; (B) Frequencies of detected SSRs; (C) Number of palindromic, forward, reverse, and tandem repeats.

https://doi.org/10.1371/journal.pone.0295550.g004

thumbnail
Table 4. Types and amounts of simple sequence repeat markers (SSRs) in the pt genomes of Juniperus species.

https://doi.org/10.1371/journal.pone.0295550.t004

Repetitive regions, including forward, reverse, palindromic, and tandem repeats in the seven Juniperus plastomes were identified using REPuter and Tandem Repeats Finder, respectively. A total of 1137 repetitive regions were detected, including 537 forward, 29 reverse, 64 palindromic, and 507 tandem repeats. Reverse repeats were rare in Juniperus plastomes and were found only in J. communis (11) and J. sibirica (18). The forward and tandem repeats were widely distributed across the Juniperus plastomes, with forward repeats ranging from 64 in J. sibirica to 84 in J. semiglobosa, and tandem repeats ranging from 63 in J. seravschanica to 80 in J. communis (Fig 4C).

Discussion

In this study, we have summarized our analysis of the complete pt genome sequences of seven Juniperus species from Kazakhstan, including J. sabina and J. seravschanica, which we have previously reported [52, 53]. The comparative assessment of the seven pt genomes suggested a high degree of similarity, as all species consisted of 119 genes, including 82 protein-coding genes, 33 tRNA genes, and 4 rRNA genes. The same number of genes has been found in J. chinensis, J. gaussenii, J. pingii and J. procumbens [54], J. microsperma [47], J. recurva [46], and J. tibetica [45]. All the Juniperus plastomes included in this study had lost rps16, whereas this gene was present in the J. formosana (KX832625.1) and J. osteosperma plastomes in GenBank. Similar to in other studies [4447], the rps12 and ycf3 genes included two intronic regions, trnI-CAU, and trnQ-UUG were duplicated in the Juniperus pt genomes. The sequenced genomes in this study were characterized by a lack of inverted repeats, like in other species from the Cupressaceae family [4547, 54]. This contrasts with the majority of angiosperm chloroplast genomes that are characterized by two inverted repeats, which divide the genome into large and small single-copy regions [69]. The evaluation of 82 protein-coding genes in seven pt genomes suggested that the total length of these genes was 82,953 bp. An assessment of the nucleotide sequences revealed that 3.6% were polymorphic, confirming the conserved nature of the pt genomes at the genus level [70, 71]. The size of the pt genomes of these seven species was within the expected range (127–128 Kb) of other sequenced pt genomes of Juniperus members [46, 47, 49]. The GC content ranged from 34.36% to 34.98% (Table 2); the same value has been observed in other Juniperus pt genomes, including those of J. cedrus, J. tibetica, J. squamata, J. chinensis, J. gaussenii, J. pingii, and J. procumbens [44, 45, 48, 54]. Despite the strong similarity of these seven pt genomes, several highly divergent regions, including accD, clpP-infA, matK, matK-ndhA, ycf1, ycf2, and ycf3, were identified. In particular, the specific regions of accD and ycf2 were characterized by high levels of nucleotide diversity (Fig 2), confirming the results reported in other studies of Juniperus species [54, 72]. Therefore, the highlighted sequence regions of accD and ycf2 could be used as candidate molecular markers for DNA barcoding and phylogenetic analyses.

In addition to the determination of single-nucleotide polymorphic sequences, the comparative assessment of pt genomes is a powerful approach for the identification of informative simple sequence repeat (SSR) markers [73, 74]. SSRs are widely used as molecular markers in population genetics, species identification, and phylogenetic analyses in plants [73, 75, 76]. In this study, the evaluation of the pt genomes of seven Juniperus species from Kazakhstan allowed the identification of 1145 SSRs (Table 4). The dominant SSRs were mononucleotides with A/T as the highest content, confirming the results of previous studies [70, 77, 78]. Most of the pt SSRs were located in the ycf1 gene and intergenic regions (S1 Table). Our study showed that forward repeats (537) were the most abundant compared to reverse (29) and palindromic (64) repeats. The lengths of all three types of repeats ranged from 31 to 261 bp, similar to those in a previously published report by Androsiuk et al. [79]. As in a previous report on Juniperus pt genomes [54], a complement repeat was not observed in any of the seven studied Juniperus plastomes.

Nucleotide sequences of pt genomes can also be efficiently used in the molecular systematics of Juniperus species. Juniperus is known as a taxonomically controversial genus because of its diverse morphological characteristics [32, 33]. In most plants, including Juniperus, the pt genome is uniparentally inherited [39]. Despite its slower evolution and lack of recombination, it has variable regions, which can potentially resolve the phylogenetic relationships at the family or the genus level [80]. Unfortunately, previous studies in Juniperus used sequences of short DNA regions in nuclear and plastid genomes, which may have potentially affected the precision of the taxonomic relationships [18, 54]. Therefore, we reconstructed a phylogenetic tree using the concatenated sequences of 82 protein-coding genes from 26 samples of different Juniperus species, including seven species from Kazakhstan and an outgroup (Fig 3). Similar to published reports using plastid and nuclear DNA regions [41, 81], an assessment of the phylogenetic tree in this work allowed the separation of all samples into two clades that corresponded to the Juniperus and Sabina sections of the genus and confirmed the monophyletic origin of the genus [1, 80]. The phylogenetic tree, which was based on the concatenated sequences of 82 pt genes, grouped all 26 samples into four out of five clades of the Sabina section as reported by Mao et al. (2010) [41]. Compared with the classical work of Mao et al. (2010) [41], this study showed several different patterns when assessing Juniperus’ taxonomy. First, in clade I, J. tibetica was found to be a predecessor of J. pseudosabina (Fig 3), which was correctly shown in Adams and Schwarzbach (2013) [43]. Second, the assessment of clade III suggested that J. davurica was an ancestral species to J. sabina (Fig 3). In contrast, Adams and Schwarzbach (2013) [43] suggested otherwise, and data from Mao et al. (2010) [41] did not resolve the difference between these two species. An evaluation of Fig 3 also revealed that J. semiglobosa and J. microsperma were two sister species, unlike in previous reports [41, 43] where J. microsperma was shown to be a predecessor of J. semiglobosa. Third, clade IV was populated by J. seravschanica (Fig 3), which was not studied by Mao et al. (2010) [41]. Our phylogenetic tree also suggested that J. seravschanica was a predecessor species in this clade (Fig 3). Fig 3 shows that J. chinensis and J. gaussenii were identified as two sister species. Only 0.4% of the nucleotides in the 82 studied genes were polymorphic, whereas the phylogenetic tree generated by Mao et al. (2010) [41] suggested that J. gaussenii was older than J. chinensis. Also, to confirm the robustness of identified divergent regions in plastid genomes (Fig 2), accD and ycf2 were applied to reconstruct the BI phylogenetic tree of 26 taxa and 2 outgroups using 82 genes (Fig 3). The profile of the phylogenetic tree using these two genes (S2 Fig) was nearly similar to the profile of the dendrogram based on concatenated sequences of 82 genes (Fig 3). The result suggests that accD and ycf2 could be efficiently used as candidate molecular markers for DNA barcoding and phylogenetic analyses. Thus, the sequenced plastomes of Juniperus species have provided an immense amount of genetic data that will help to improve resolution in the phylogeny of the genus.

Conclusion

In this study, we annotated the complete pt genomes of seven species of the Juniperus genus grown in Kazakhstan, including two pt genomes (J. sabina and J. seravschanica) we previously reported. The size of the pt genomes of these seven species was within the expected range (127–128 Kb) of other reported sequences of the genus. The comparative assessment of the seven pt genomes showed that all consisted of 119 genes, including 82 protein-coding genes, 33 tRNA genes, and 4 rRNA genes. The total length of the 82 protein-coding genes in the seven pt genomes was 82,953 bp, in which only 3.6% were polymorphic, confirming the conserved nature of the pt genomes of the genus. Several highly divergent regions, including accD, clpP-infA, matK, matK-ndhA, ycf1, ycf2, and ycf3, were identified. The specific regions of accD and ycf2 were characterized by high levels of nucleotide diversity and would be useful for a DNA barcoding assessment of the genus. Our study predicted the availability of 1145 SSRs in the pt genomes of the genus, majority of which were located in the DNA region ycf1 and in intergenic regions. A phylogenetic tree using 26 samples of different Juniperus species, including seven species from Kazakhstan and an outgroup, was constructed using the concatenated sequences of the 82 genes and the BI and ML approaches. The phylogenetic tree allowed the separation of all samples into two clades that corresponded with the Juniperus and Sabina sections and confirmed the monophyletic origin of the genus. The phylogenetic analysis also highlighted several differences compared with classical taxonomic studies in J. sect. Sabina. Thus, the analysis of the complete sequences of seven pt genomes provided valuable data, which could be used in genetics studies of Juniperus.

Supporting information

S1 Fig. Individual circular maps for five newly generated pt genomes of Juniperus species.

https://doi.org/10.1371/journal.pone.0295550.s001

(PDF)

S2 Fig. Bayesian inference phylogenetic tree of species in the genus Juniperus based on concatenated nucleotide sequences of accD and ycf2 genes.

https://doi.org/10.1371/journal.pone.0295550.s002

(PDF)

S1 Table. SSRs identified in Juniperus pt genomes.

https://doi.org/10.1371/journal.pone.0295550.s003

(XLSX)

Acknowledgments

The authors thank PhD Levente Laczkó (Department of Metagenomics, University of Debrecen, Hungary) for his advices on the selection of statistical methods in the study.

In memory of Dr Robert Adams (Baylor University, Texas, USA) ‐ an outstanding scientist, great teacher, and friend.

References

  1. 1. Adams RP. Junipers of the world: The genus Juniperus. 4th ed. Bloomington (IN): Trafford Publishing; 2014
  2. 2. Eastwood A, Lazkov G, Newton AC. The red list of trees of Central Asia. 2009.
  3. 3. Flora of Kazakhstan (Flora Kazakhstana). Vol. 1. Alma-Ata; 1956.
  4. 4. Yermagambetova MM., Abugalieva SI., Turuspekov YK., Almerekova SS. Conspectust of the genus Juniperus L. growing in Kazakhstan. Konspekt roda Juniperus L., proizrastayushchego v Kazakhstane. [Tr Prikl Bot Genet Sel]. 2022; 183: 161–170. [In Russian, with English summary]. https://doi.org/10.30901/2227-8834-2022-3-161-170
  5. 5. Abdulina SA. Checklist of vascular plants of Kazakhstan. Almaty: Academy of Sciences of Kazakhstan; 1999.
  6. 6. Kotukhov YA. On the discovery of Daurian juniper in the Kazakhstan Altai. O nakhodke mozhzhevel’nika daurskogo v Kazakhstanskom Altaye. Leninogorsk; 1974.
  7. 7. Stewart CD, Jones CD, Setzer WN. Essential oil compositions of Juniperus virginiana and Pinus virginiana, two important trees in Cherokee traditional medicine. Am. J. Essent Oil Nat Prod. 2014; 2: 17–24.
  8. 8. Koundal R, Kumar A, Thakur S, Agnihotri VK, Chand G, Singh RD. Seasonal variation in phytochemicals of essential oil from Juniperus communis needles in western Himalaya. Journal of essential oil research. 2015; 27(5): 406–411. https://doi.org/10.1080/10412905.2015.1043399
  9. 9. Sharma A, Flores-Vallejo RC, Cardoso-Taketa A, Villarreal ML. Antibacterial activities of medicinal plants used in Mexican traditional medicine. J Ethnopharmacol. 2017; 208: 264–329. pmid:27155134
  10. 10. Jin KS, Lee JY, Hyun SK. et al. Juniperus chinensis and the functional compounds, cedrol and widdrol, ameliorate α-melanocyte stimulating hormone-induced melanin formation in B16F10 cells. Food Sci Biotechnol. 2015; 24: 611–618. https://doi.org/10.1007/s10068-015-0080-5
  11. 11. Özek G, Schepetkin IA, Yermagambetova M, Özek T, Kirpotina LN, Almerekova SS, et al. Innate Immunomodulatory Activity of Cedrol, a Component of Essential Oils Isolated from Juniperus Species. Molecules. 2021; 26(24): 7644. https://doi.org/10.3390/molecules26247644
  12. 12. Inatomi Y, Iida N, Murata H, Inada A, Murata J, Lang FA, et al. A pair of new atropisomeric cupressuflavone glucosides isolated from Juniperus communis var. depressa. Tetrahedron Letters. 2005; 46(38): 6533–6535 https://doi.org/10.1016/j.tetlet.2005.07.091
  13. 13. De Marino S, Festa C, Zollo F, Rusolo F, Capone F, Guerriero E, et al. Phytochemical profile of Juniperus oxycedrus ssp. oxycedrus berries: A new monoterpene glucoside and evaluation of the effects on cancer cell lines. Phytochem Lett. 2014; 10: 152–159. https://doi.org/10.1016/j.phytol.2014.08.015
  14. 14. Ennajar M, Bouajila J, Lebrihi A, Mathieu F, Abderraba M, Raies A, et al. Chemical composition and antimicrobial and antioxidant activities of essential oils and various extracts of Juniperus phoenicea L. (Cupressacees). J Food Sci. 2009; 74(7): M364–M371. https://doi.org/10.1111/j.1750-3841.2009.01277.x
  15. 15. Amalich S, Fadili K, Fahim M, Hilali FE, Zaïr T. Polyphenols content and antioxidant power of fruits and leaves of Juniperus phoenicea L. From Tounfite (Morocco). Mor J Chem. 2016; 4(1): 177–186. https://doi.org/10.48317/IMIST.PRSM/morjchem-v4i1.4013
  16. 16. Adams RP, Skopec MM, Muir JP. Comparison of leaf terpenoids and tannins in Juniperus osteosperma from woodrat (Neotoma lepida) browsed and non-browsed trees. Phytologia. 2016; 98(1): 17–25.
  17. 17. Adams RP, Demeke T. Systematic relationships in Juniperus based on random amplified polymorphic DNAs (RAPDs). Taxon. 1993; 42(3): 553–571. https://doi.org/10.2307/1222534
  18. 18. Adams RP, Turuspekov Y. Taxonomic reassessment of some Central Asian and Himalayan scale‐leaved taxa of Juniperus (Cupressaceae) supported by random amplification of polymorphic DNA. Taxon. 1998; 47(1): 75–83. https://doi.org/10.2307/1224021
  19. 19. Adams RP, Schwarzbach AE, Pandey RN. The concordance of terpenoid, ISSR and RAPD markers, and ITS sequence data sets among genotypes: an example from Juniperus. Biochem Syst Ecol. 2003; 31(4): 375–387. https://doi.org/10.1016/S0305-1978(02)00157-6
  20. 20. Kasaian J, Behravan J, Hassany M, Emami SA, Shahriari F, Khayyat MH. Molecular characterization and RAPD analysis of Juniperus species from Iran. Genet Mol Res. 2011; 10(2): 1069–74. pmid:21710457.
  21. 21. Meloni M, Perini D, Filigheddu R, Binelli G. Genetic variation in five Mediterranean populations of Juniperus phoenicea as revealed by inter-simple sequence repeat (ISSR) markers. Ann Bot. 2006; 97(2): 299–304. pmid:16311272.
  22. 22. Rahimian Boogar A, Salehi H. ISSR-based genetic diversity assessment of five populations of Juniperus polycarpos K. Koch in southern habitats of Iran. Flower and Ornamental Plants. 2021; 6(1): 1–12.
  23. 23. Terrab A, Schönswetter P, Talavera S, Vela E, Stuessy TF. Range-wide phylogeography of Juniperus thurifera L., a presumptive keystone species of western Mediterranean vegetation during cold stages of the Pleistocene. Mol. Phylogenet Evol. 2008; 48(1): 94–102. https://doi.org/10.1016/j.ympev.2008.03.018
  24. 24. Juan A, Fay MF, Pastor J, Juan R, Fernández I, Crespo MB. Genetic structure and phylogeography in Juniperus oxycedrus subsp. macrocarpa around the Mediterranean and Atlantic coasts of the Iberian Peninsula, based on AFLP and plastid markers. Eur J For Res. 2012; 131: 845–856. https://doi.org/10.1007/s10342-011-0558-5
  25. 25. Rumeu B, Vargas P, Nogales M. Phylogeography and genetic structure of the threatened Canarian Juniperus cedrus (Cupressaceae). Bot J Linn.2014; 175(3): 376–394. https://doi.org/10.1111/boj.12172
  26. 26. García C, Guichoux E, Hampe A. A comparative analysis between SNPs and SSRs to investigate genetic variation in a juniper species (Juniperus phoenicea ssp. turbinata). Tree Genet Genomes. 2018; 14: 1–9. https://doi.org/10.1007/s11295-018-1301-x
  27. 27. Zhang Q, Yang YZ, Wu GL, Zhang DY, Liu JQ. Isolation and characterization of microsatellite DNA primers in Juniperus przewalskii Kom (Cupressaceae). Conserv Genet. 2008; 9: 767–769. https://doi.org/10.1007/s10592-007-9387-y
  28. 28. Sertse D, Gailing O, Eliades NG, Finkeldey R. (2013). Transferability and application of microsatellites (SSRs) from Juniperus communis L. to Juniperus procera Hochst. Ex endl. Open Journal of Genetics. 2013; 3(2): 115–126.
  29. 29. Teixeira H, Rodríguez-Echeverría S, Nabais C. Genetic diversity and differentiation of Juniperus thurifera in Spain and Morocco as determined by SSR. PLoS One. 2014; 9(2): e88996. https://doi.org/10.1371/journal.pone.0088996
  30. 30. Reim S, Lochschmidt F, Proft A, Tröber U, Wolf H. Genetic structure and diversity in Juniperus communis populations in Saxony, Germany. Biodiv Res Conserv. 2016; 42(1): 9–18. https://doi.org/10.1515/biorc-2016-0008
  31. 31. Lu D, Huang H, Wang A, Zhang G. Genetic Evaluation of Juniperus sabina L. (Cupressaceae) in Arid and Semi-Arid Regions of China Based on SSR Markers. Forests. 2022; 13(2): 231. https://doi.org/10.3390/f13020231
  32. 32. Adams RP, Morris JA, Schwarzbach AE. The Evolution of Caribbean Juniperus (Cupressaceae): Terpenoids, RAPDs and DNA SNPs data. Phytologia. 2008; 90: 103–120.
  33. 33. Sadykova GA, Hantemirova EV, Polezhaeva MA, Aliev KU. Genetic Variability of Tree Junipers of Section Sabina: Data from Dagestan, Armenia, and Crimea. Russian Journal of Genetics. 2021; 57: 1223–1228.
  34. 34. Adams RP, Schwarzbach AE. Taxonomy of Juniperus section Juniperus: Sequence analysis of nrDNA and five cpDNA regions. Phytologia. 2012; 94(2): 280–297.
  35. 35. Hojjati F, Kazempour-Osaloo S, Adams RP, Assadi M. Molecular phylogeny of Juniperus in Iran with special reference to the J. excelsa complex, focusing on J. seravschanica. Phytotaxa. 2018; 375(2): 135–157. https://doi.org/10.11646/phytotaxa.375.2.1
  36. 36. Bhagyawant SS. RAPD-SCAR markers: an interface tool for authentication of traits. J Biosci Med. 2016; 4: 1–9.
  37. 37. Alkan C, Sajjadian S, Eichler EE. Limitations of next-generation genome sequence assembly. Nat Methods. 2011; 8: 61–65. pmid:21102452
  38. 38. Hu T, Chitnis N, Monos D, Dinh A. Next-generation sequencing technologies: An overview. Hum Immunol. 2021; 82(11): 801–811. pmid:33745759
  39. 39. Bock R. Structure, function, and inheritance of plastid genomes. In: Cell and molecular biology of plastids. Berlin and Heidelberg: Springer; 2007.
  40. 40. Zhang Q, Chiang TY, George M, Liu JQ, Abbott RJ. Phylogeography of the Qinghai‐Tibetan Plateau endemic Juniperus przewalskii (Cupressaceae) inferred from chloroplast DNA sequence variation. Mol Ecol. 2005; 14(11): 3513–3524. https://doi.org/10.1111/j.1365-294X.2005.02677.x
  41. 41. Mao K, Hao G, Liu J, Adams RP, Milne RI. Diversification and biogeography of Juniperus (Cupressaceae): variable diversification rates and multiple intercontinental dispersals. New Phytologist. 2010; 188(1): 254–272. https://doi.org/10.1111/j.1469-8137.2010.03351.x
  42. 42. Terry RG, Adams RP. A molecular re-examination of phylogenetic relationships among Juniperus, Cupressus, and the Hesperocyparis-Callitropsis-Xanthocyparis clades of Cupressaceae. Phytologia. 2015; 97(1): 66–74.
  43. 43. Adams RP, Schwarzbach AE. Phylogeny of Juniperus using nrDNA and four cpDNA regions. Phytologia. 2013; 95(2): 179–187.
  44. 44. Guo Q, Bianba D, Zheng W. Characterization of the complete chloroplast genome of Juniperus cedrus (Cupressaceae). Mitochondrial DNA Part A. 2016; 27(6): 4355–4356. https://doi.org/10.3109/19401736.2015.1089498
  45. 45. Miao J, Tso S, Li J, Xie S, Mao K. The complete chloroplast genome of Juniperus tibetica (Cupressaceae), the conifer that occupies the highest known treeline in the Northern Hemisphere. Mitochondrial DNA Part B. 2019; 4(1): 609–611. https://doi.org/10.1080/23802359.2018.1561229
  46. 46. Song X, Li J, Tso S, Xie S, Li L, Mao Q, et al. Characterization of the complete chloroplast genome of Juniperus recurva (Cupressaceae), the Dropping Juniper from the Himalaya. Mitochondrial DNA Part B. 2019; 4(1): 1219–1220. https://doi.org/10.1080/23802359.2019.1591194
  47. 47. Tso S, Li J, Xie S, Miao J, Hu Q, Mao K. Characterization of the complete chloroplast genome of Juniperus microsperma (Cupressaceae), a rare endemic from the Qinghai-Tibet Plateau. Conserv Genet Resour. 2019; 11: 325–328. https://doi.org/10.1007/s12686-018-1027-y
  48. 48. Xie S, Li J, Miao J, Xu J, Mao K. The complete chloroplast genome of Juniperus squamata (Cupressaceae), a shrubby conifer from Asian Mountains. Mitochondrial DNA Part B. 2019; 4(2): 2137–2139. https://doi.org/10.1080/23802359.2019.1623105
  49. 49. Zhang X, Qu X, Li J, Huang J, Jia Y, Chen S. The complete chloroplast genome sequence of Juniperus saltuaria (Cupressaceae). Mitochondrial DNA Part B. 2020; 5(3): 2565–2567. https://doi.org/10.1080/23802359.2020.1781559
  50. 50. Wang M, Li R. The complete chloroplast genome of Juniperus rigida (Cupressaceae). Mitochondrial DNA Part B. 2022; 7(7): 1340–1341. https://doi.org/10.1080/23802359.2022.2098858
  51. 51. Irfan S, Rana SSA, Sani IA, Haun L, Weixue M, Jabeen R, et al. Characterization of the Complete Chloroplast Genome Sequence of Juniperus polycarpos K. Koch (Cupressaceae), from Ziarat, Pakistan. Indian J Sci Technol. 2021; 14(36): 2806–2814. https://doi.org/10.17485/IJST/v14i36.151
  52. 52. Almerekova S, Yermagambetova M, Abugalieva S, Turuspekov Y. The complete chloroplast genome sequencing data of Juniperus sabina L. (Cupressaceae Bartl.) from Kazakhstan. Data in Brief. 2022; 45: 108644. https://doi.org/10.1016/j.dib.2022.108644
  53. 53. Yermagambetova M, Abugalieva S, Turuspekov Y, Almerekova S. Illumina sequencing data of the complete chloroplast genome of rare species Juniperus seravschanica (Cupressaceae) from Kazakhstan. Data in Brief. 2023; 46: 108866. https://doi.org/10.1016/j.dib.2022.108866
  54. 54. Chen C, Xia X, Peng J, Wang D. Comparative analyses of six complete chloroplast genomes from the genus Cupressus and Juniperus (Cupressaceae). Gene. 2022; 837: 146696. https://doi.org/10.1016/j.gene.2022.146696
  55. 55. Doyle JJ, Doyle JL. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochemistry Bulletin. 1987; 19: 11–15.
  56. 56. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014; 30(15): 2114–2120. pmid:24695404
  57. 57. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012; 19(5): 455–477. pmid:22506599
  58. 58. Qu XJ, Moore MJ, Li DZ, Yi TS. PGA: A software package for rapid, accurate, and flexible batch annotation of plastomes. Plant Methods. 2019; 15: 50. pmid:31139240
  59. 59. Tillich M, Lehwark P, Pellizzer T, Ulbricht-Jones ES, Fischer A, Bock R, et al. GeSeq-versatile and accurate annotation of organelle genomes. Nucleic Acids Res. 2017; 45(W1): W6–W11. https://doi.org/10.1093/nar/gkx391
  60. 60. Greiner S, Lehwark P, Bock R. OrganellarGenomeDRAW (OGDRAW) version 1.3.1: expanded toolkit for the graphical visualization of organellar genomes. Nucleic Acids Res. 2019; 47: W59-W64. https://doi.org/10.1093/nar/gkz238
  61. 61. Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009; 25(11): 1451–1452. pmid:19346325
  62. 62. Beier S, Thiel T, Münch T, Scholz U, Mascher M. MISA-web: a web server for microsatellite prediction. Bioinformatics. 2017; 33: 2583–2585. pmid:28398459
  63. 63. Kurtz S, Choudhuri JV, Ohlebusch E, Schleiermacher C, Stoye J, Giegerich R. REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 2001; 29(22): 633–4642. pmid:11713313
  64. 64. Benson G Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 1999; 27: 573–580. pmid:9862982
  65. 65. Darriba D, Taboada GL, Doallo R, Posada D. jModelTest 2: more models, new heuristics and parallel computing. Nat Methods. 2012; 9: 772. pmid:22847109
  66. 66. Ronquist F, Huelsenbeck JP. Mrbayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003; 19: 1572–1574. pmid:12912839
  67. 67. Tamura K, Nei M. Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol Biol Evol. 1993; 10: 512–526. pmid:8336541
  68. 68. Tamura K, Stecher G, Kumar S. MEGA 11: Molecular Evolutionary Genetics Analysis Version 11. Mol Biol Evol. 2021; 38(7): 3022–3027. pmid:33892491
  69. 69. Hirao T, Watanabe A, Kurita M, Kondo T, Takata K. Complete nucleotide sequence of the Cryptomeria japonica D. Don. chloroplast genome and comparative chloroplast genomics: diversified genomic structure of coniferous species. BMC Plant Biol. 2008; 8(1): 1–20. https://doi.org/10.1186/1471-2229-8-70
  70. 70. Cheon KS, Kim KA, Kwak M, Lee B, Yoo KO. The complete chloroplast genome sequences of four Viola species (Violaceae) and comparative analyses with its congeneric species. PLoS One. 2019; 14(3): e0214162. https://doi.org/10.1371/journal.pone.0214162
  71. 71. Wu L, Cui Y, Wang Q, Xu Z, Wang Y, Lin Y, et al. Identification and phylogenetic analysis of five Crataegus species (Rosaceae) based on complete chloroplast genomes. Planta. 2021; 254: 1–12. https://doi.org/10.1007/s00425-021-03667-4
  72. 72. Chen J, Hao Z, Xu H, Yang L, Liu G, Sheng Y, et al. The complete chloroplast genome sequence of the relict woody plant Metasequoia glyptostroboides Hu et Cheng. Front Plant Sci. 2015; 6:447. https://doi.org/10.3389/fpls.2015.00447
  73. 73. Ebert D, Peakall ROD. Chloroplast simple sequence repeats (cpSSRs): technical resources and recommendations for expanding cpSSR discovery and applications to a wide array of plant species. Mol Ecol Resour. 2009; 9(3): 673–690. pmid:21564725
  74. 74. Song SL, Lim PE, Phang SM, Lee WW, Hong DD, Prathep A. Development of chloroplast simple sequence repeats (cpSSRs) for the intraspecific study of Gracilaria tenuistipitata (Gracilariales, Rhodophyta) from different populations. BMC Res Notes. 2014; 7(1): 1–9.https://doi.org/10.1186/1756-0500-7-77
  75. 75. Ebadi A, Ghaderi N, Vafaee Y. Genetic diversity of Iranian and some European grapes as revealed by nuclear and chloroplast microsatellite and SNP molecular markers. J Hortic Sci Biotechnol. 2019; 94(5): 599–610. https://doi.org/10.1080/14620316.2019.1585210
  76. 76. Zhu M, Feng P, Ping J, Li J, Su Y, Wang T. Phylogenetic significance of the characteristics of simple sequence repeats at the genus level based on the complete chloroplast genome sequences of Cyatheaceae. Ecol Evol. 2021; 11(20): 14327–14340. pmid:34707858
  77. 77. Biju VC, Shidhi PR, Vijayan S, Rajan VS, Sasi A, Janardhanan A, et al. The Complete Chloroplast Genome of Trichopus zeylanicus, And Phylogenetic Analysis with Dioscoreales. The plant genome. 2019; 12(3): 1–11. https://doi.org/10.3835/plantgenome2019.04.0032
  78. 78. Mehmood F, Shahzadi I, Ahmed I, Waheed MT, Mirza B. Characterization of Withania somnifera chloroplast genome and its comparison with other selected species of Solanaceae. Genomics. 2020; 112(2): 1522–1530. https://doi.org/10.1016/j.ygeno.2019.08.024
  79. 79. Androsiuk P, Jastrzębski JP, Paukszto Ł, Okorski A, Pszczółkowska A, Chwedorzewska KJ,et al. The complete chloroplast genome of Colobanthus apetalus (Labill.) Druce: genome organization and comparison with related species. PeerJ. 2018; 6: e4723. https://doi.org/10.7717/peerj.4723
  80. 80. Kim K, Nguyen VB, Dong J, Wang Y, Park JY, Lee SC, Yang TJ. Evolution of the Araliaceae family inferred from complete chloroplast genomes and 45S nrDNAs of 10 Panax-related species. Sci Rep. 2017; 7(1): 1–9. pmid:28687778
  81. 81. Little DP. Evolution and circumscription of the true Cypresses (Cupressaceae: Cupressus). Syst Bot. 2006; 31: 461–480. https://doi.org/10.1600/036364406778388638