Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Population genetic diversity and structure of the endangered species Tetracentron sinense Oliver (Tetracentraceae) with SNPs based on RAD sequencing

  • Zhong–Qiong Tian,

    Roles Formal analysis, Funding acquisition, Resources, Software, Visualization, Writing – original draft

    Affiliations Key laboratory of southwest China wildlife resources conservation (Ministry of Education), China West Normal University, Nanchong, China, Key Laboratory of Biodiversity and Environment on the Qinghai-Tibetan Plateau, Ministry of Education, Lhasa, China

  • Chao–Yang Jiang,

    Roles Investigation, Validation

    Affiliation Key Laboratory of Biodiversity and Environment on the Qinghai-Tibetan Plateau, Ministry of Education, Lhasa, China

  • Yu–Min Shu,

    Roles Investigation, Validation

    Affiliation Key Laboratory of Biodiversity and Environment on the Qinghai-Tibetan Plateau, Ministry of Education, Lhasa, China

  • Huan Zhang,

    Roles Investigation, Validation

    Affiliation Institute of plant adaptation and utilization in southwest mountain, China West Normal University, Nanchong, China

  • Qiong La,

    Roles Funding acquisition, Investigation, Project administration, Validation, Writing – review & editing

    Affiliation Key laboratory of southwest China wildlife resources conservation (Ministry of Education), China West Normal University, Nanchong, China

  • Xiao-Hong Gan

    Roles Funding acquisition, Investigation, Project administration, Supervision, Visualization, Writing – review & editing

    bhgan@cwnu.edu.cn

    Affiliations Key Laboratory of Biodiversity and Environment on the Qinghai-Tibetan Plateau, Ministry of Education, Lhasa, China, Institute of plant adaptation and utilization in southwest mountain, China West Normal University, Nanchong, China

Correction

4 Sep 2025: The PLOS One Staff (2025) Correction: Population genetic diversity and structure of the endangered species Tetracentron sinense Oliver (Tetracentraceae) with SNPs based on RAD sequencing. PLOS ONE 20(9): e0331705. https://doi.org/10.1371/journal.pone.0331705 View correction

Abstract

Tetracentron sinense Oliv. (T. sinense), as a tertiary living fossil, has experienced a significant decline in population numbers. Currently, genetic resources depletion and human activities have led to habitat fragmentation of relict and endangered plants, despite the abundant evidence of its medicinal, economic, and ecological value. Conservation strategies were clarified and evaluated based on the genetic structure characteristics and diversity patterns among 25 wild populations using Restriction site-associated DNA sequencing (RAD-seq) technology. Through SNP calling, filtering, genetic diversity analysis, discriminant analysis of principal components (DAPC), maximum-likelihood phylogenetic tree, and ADMIXTURE clustering, significant population structure and differentiation were identified. The results revealed a total of 2,169 single nucleotide polymorphisms (SNPs), indicating lower genetic variation but higher genetic differentiation (He: 0.10, I: 0.16, Fst: 0.33). Analysis of molecular variance (AMOVA) showed that genetic variation within populations accounted for 77% of the total variance. DAPC, maximum-likelihood phylogenetic tree, and ADMIXTURE clustering analysis grouped the 25 populations into five distinct clades influenced by isolation, restricted gene flow, and complex topography. To preserve the genetic integrity of T. sinense, it is recommended to establish conservation units corresponding to different geographic clades, with a focus on populations with low/high genetic diversity by implementing artificial reproduction and germplasm resource nurseries. Given the species' vulnerable conservation status, urgent implementation of the aforementioned conservation strategies is necessary to safeguard the remaining genetic resources.

Introduction

Genetic diversity represents a cornerstone of biodiversity conservation, with its preservation recognized as critical for maintaining ecosystem resilience and evolutionary potential [1]. Conservation strategies prioritize the maintenance of genetic variation both within and among populations, as this variation underpins species' capacity to adapt to environmental change. Declines in genetic diversity heighten inbreeding risks and erode adaptive potential, thereby threatening the long-term viability of populations [2]. The erosion of genetic diversity in endangered plant species significantly elevates extinction vulnerability by reducing adaptive capacity and ecological resilience. Consequently, comprehensive characterization of genetic architecture—including population structure, differentiation patterns, and allelic variation—constitutes an imperative precursor to effective conservation planning. Such analyses enable evidence-based prioritization of conservation units and management interventions, particularly for taxa like T. sinense that persist in fragmented populations with critically low effective population sizes [3,4].

As the sole extant species within the genus Tetracentron, T. sinense is endemic to East Asian riparian ecosystems and forest understories [5]. This Arcto-Tertiary relict, often termed a ‘living fossil’ due to its morphological conservatism, exhibits a robust paleobotanical record traceable to the Eocene epoch (53.0–36.5 million years ago) [6]. T. sinense holds paramount importance for its economic applications (furniture, ornamental horticulture, timber production) and medicinal properties (e.g., anticancer, antibacterial compounds). However, unsustainable harvesting, anthropogenic disturbance, and habitat fragmentation have precipitated a marked reduction in population size [7]. Consequentially, this species is listed in Appendix III of CITES [8,9]. Despite its conservation priority, population genetic studies leveraging high-throughput sequencing (HTS) technologies remain scarce for T. sinense.

Previous investigations on T. sinense have addressed diverse ecological and physiological aspects, including pollination biology, seedling establishment, community dynamics, photosynthetic ecophysiology, and population structure [4,1017]. However, comprehensive population genetic studies remain conspicuously limited. Analyses using chloroplast sequences and inter-simple sequence repeat (ISSR) markers have identified distinct genetic clusters among populations [4,18,19]. Moreover, genetic variation assessments revealed the presence of evolutionarily significant units (ESUs) across its range. Critical threats to its survival—such as inefficient pollination systems, reduced genetic diversity, and geographic isolation—have been identified as primary drivers of endangerment. The recent completion of a high-quality chromosome-level genome assembly for T. sinense has underscored the critical need for developing robust genetic markers to comprehensively evaluate population genetic diversity and adaptive potential [19].

Population genetic studies in widely distributed species provide a robust statistical framework for understanding natural population evolution. Advances in HTS, or next-generation sequencing (NGS), have enabled the genomic development of numerous genetic markers. Restriction site-associated DNA sequencing (RAD-seq) using Illumina platforms is cost-effective and produces highly reliable data, making it a powerful tool for population genomic analyses [20]. Single nucleotide polymorphisms (SNPs) represent the most abundant form of genomic variation within species, providing detailed insights into population genetic architecture [21]. SNP genotyping has emerged as a powerful tool for characterizing natural population evolution and informing conservation genetics. In this study, we utilized SNPs to characterize the genetic diversity, structure, and differentiation of T. sinense [22]. By leveraging a large SNP dataset, we investigated key population genetic parameters critical for understanding evolutionary processes. The methodological framework and findings presented here are anticipated to facilitate the development and implementation of targeted conservation strategies for this endangered species.

Materials and methods

Sample collection and DNA extraction

T. sinense is distributed primarily in central and southern China. A total of 164 individuals were collected from 25 wild populations across China (Fig 1, Table 1, S1 Table). Samples were collected with a minimum spatial distance of 50 meters to minimize kinship bias, and leaf tissues were preserved using silica gel desiccation. Genomic DNA was extracted from dried leaves via the cetyltrimethylammonium bromide (CTAB) protocol [23]. Voucher specimens were deposited in the herbarium of China West Normal University (Specimen ID: MCS202005).

thumbnail
Table 1. The geographical location, population code, and sample size of 25 T. sinense populations.

https://doi.org/10.1371/journal.pone.0324161.t001

thumbnail
Fig 1. Geographic distribution of 25 T. sinense populations.

Notes: reprinted from Li, S., Gan, X., Han, H. et al. Low within-population genetic diversity and high genetic differentiation among populations of the endangered plant T. sinense revealed by inter-simple sequence repeat analysis. Annals of Forest Science 75, 74 (2018). under a CC BY license, with permission from Annals of Forest Science, original copyright 2018.

https://doi.org/10.1371/journal.pone.0324161.g001

RAD library preparation and sequencing

RAD-seq libraries were constructed following the protocol described by Baird et al. [24], with minor modifications. Genomic DNA was first digested with the restriction endonuclease TaqαI. Digested fragments were then ligated to P1 adapters containing sequences for PCR amplification, Illumina sequencing, and sample-specific barcodes, along with the TaqαI recognition site. After adapter ligation, DNA fragments were size-selected via agarose gel electrophoresis to isolate target bands in the 300–500 bp range. The selected fragments were ligated to P2 adapters, and paired-end libraries with 150 bp insert sizes were prepared for sequencing on the Illumina HiSeqTM platform at Majorbio Company (Shanghai, China).

Sequence alignment and SNP genotyping

Raw sequencing data were demultiplexed using barcodes to assign reads to individual samples. Adapter-trimmed reads were evaluated for sequencing quality and data volume. To ensure transparency, we explicitly declare that we performed de novo assembly of the genome using Stacks v2.4 software [25], a critical step for readers unfamiliar with the workflow. The Stacks v2.4 [25] (ulian Catchen and Nicolas Rochette, Urbana, IL, USA) software was utilized for clustering sequence fragments post quality control, followed by SNP detection and filtering using the resulting vcf-file with VCFtools v.0.1.16 [26]. Initial processing included: (1) adapter removal and 5' -end trimming of non-AGCT bases; (2) quality trimming of low-quality ends (quality score < 20); (3) removal of reads with ≥10% ambiguous bases (Ns); and (4) exclusion of fragments shorter than 25 bp after trimming. Clean reads were mapped to samples to ensure data integrity for downstream analysis.

Genetic diversity

Genetic diversity parameters for T. sinense were estimated using Stacks v2.4 [25], including observed allele count (Na), effective allele count (Ne), expected heterozygosity (He), Shannon information index (I), and nucleotide diversity (π). Neutrality tests were conducted using VCFtools v0.1.16 [26] to compute Tajima' s D at 95% confidence intervals across all loci [27,28]. Analysis of variance (ANOVA) and post-hoc multiple comparisons were performed to evaluate diversity index differences among the 25 populations [29]. Molecular variance partitioning was implemented in Arlequin v3.5 to quantify significant genetic variation among and within populations [30].

Population structure, phylogenetic tree, and principal components analysis

To characterize the genetic structure of the 25 T. sinense populations, SNPs were first filtered using PLINK v1.9 [31], followed by genetic structure estimation with ADMIXTURE v1.3.0 [32]. Genetic cluster numbers (K) ranging from 1 to 27 were evaluated via 10-fold cross-validation (CV), with 27 independent runs performed for each K value. The optimal K was determined based on the lowest CV error in the ADMIXTURE analysis. Additionally, the optimal population structure of 164 T. sinense individuals from 25 populations was inferred using adegenet v2.1.11 [33], integrated with geographic coordinates. A maximum-likelihood (ML) phylogeny was reconstructed in RAxML v7.3 [34] under the GTRGAMMA substitution model with 1000 bootstrap replicates. Population phylogenetic trees were generated using the RAxML ML algorithm. Individual clustering patterns were visualized via principal component analysis (PCA) based on SNP-derived genetic distances. Genetic relationships among individuals were characterized using discriminant analysis of principal components (DAPC) implemented in the adegenet package (R v3.6.1) [33]. Kinship coefficients were estimated with GCTA v1.92.1 [35], while identity-by-state (IBS) genetic distances were computed using PLINK v1.90 [31] to generate a distance matrix.

Population linkage disequilibrium analyses

Pairwise linkage disequilibrium (LD) among 2,169 genome-wide SNPs across all scaffolds in 164 Tetracentron sinense individuals was estimated using allele frequency correlations (r²) via PopLDdecay v3.41 [36]. LD analysis quantified r² values to characterize marker associations. Population genetic differentiation (FST) was calculated using VCFtools v0.1.16 [26], while gene flow (Nm) estimates were derived from Fst values using the equation:

Ethics statement

Because this study was carried out on an endangered species, I confirmed that I complied with all relevant institutional, national and international guidelines. This study was supported by National Natural Science Foundation of China, including handling these plants and collecting samples. The Nature Reserve Authority granted permissions to Professor Xiao–hong Gan for using the endangered species of plant (T. sinense).

Results

Quality and processing of sequence data

A total of 164 samples from 25 T. sinense populations (JFS, HG, FP, BDGS, MLZ, TJH, HK, WL, KP, BMXS, QZMS, LGS, MCS, FJS, BH, DFD, SHS, SNJ, HBY, KKS, DSH, BSJ, EM, WFHH, GLGS) were subjected to RAD-seq for SNP identification. Sequencing with the TaqαI restriction enzyme yielded 1,755 million base pairs (Mbp) of raw data. Clean reads per sample ranged from 6,917,919–26,051,926, with sequence quality scores (Q30) averaging 94.62% (range: 87.37–96.27%). Genome-wide GC content varied from 41.22% to 59.54%, with a mean of 47.24% (S2 Table, S1 Fig). Following initial quality filtering, 6,158,926 SNPs were identified. After applying stringent filtering criteria (missing rate < 50%, minor allele frequency ≥ 0.05), 38,097 high-quality SNPs were retained for genotyping analysis. Subsequent population genetic diversity and structure analyses utilized 2,169 polymorphic markers derived from high-quality unlinked SNPs.

Genetic diversity at species and population levels

We summarized the genetic diversity parameters of T. sinense at the species-level. The average values of Na, Ne, He, I, and π were 2.0, 1.17, 0.13, 0.24, and 0.23, respectively (Tables 2, 3, and S3). Based on the entire dataset analyzed using VCFtools [26], the Tajima' s D value at the species-level was 0.67 (Table 3). Notably, the significantly positive Tajima' s D values suggest a potential decrease in population size and/or the action of balancing selection [27].

thumbnail
Table 2. Genomic diversity statistics calculated at the species and population level of 25 T. sinense populations.

https://doi.org/10.1371/journal.pone.0324161.t002

thumbnail
Table 3. Pi and Tajima' s D values among 5 clades of T. sinense.

https://doi.org/10.1371/journal.pone.0324161.t003

Genetic diversity parameters for T. sinense are summarized in Tables 2, 3, and S2. At the species level, average values for observed alleles (Na = 2.00), effective alleles (Ne = 1.17), expected heterozygosity (He = 0.13), Shannon' s index (I = 0.24), and nucleotide diversity (π = 0.23) were recorded. Tajima' s D analysis of the full dataset using VCFtools [26,27] yielded a species-level value of 0.6741 (Table 3). Notably, the significantly positive Tajima' s D suggests potential population size reduction and/or balancing selection acting on the species [27].

At the population level, genetic diversity parameters varied as follows: Na ranged from 1.28 (GLGS) to 1.50 (BSJ), Ne from 1.00 (HG) to 1.19 (BMXS), He from 0.08 (MLZ) to 0.12 (BSJ), and I from 0.13 (MLZ) to 0.19 (BSJ) (Table 2). Congruent trends were observed in He and I metrics. Notably, WFHH, BDGS, and MLZ populations displayed relatively high genetic diversity, whereas HBY and HG populations exhibited significantly lower values (Table 2).

Population genetic structure, principal components analysis, and phylogenetic analysis

To characterize population genetic relationships, genetic structure analysis was performed using adegenet and ADMIXTURE v1.3.0 [32,33]. Adegenet analysis revealed K = 5 as the optimal clustering solution for the 164 T. sinense individuals, with the sum of squares reaching a plateau at this value (Fig 2A). Principal component analysis (PCA) results, congruent with adegenet findings, also supported five genetic clusters (Fig 2B). Cluster composition was as follows, Cluster1: BH, BMXS, BSJ, GLGS, HK, KP, TJH, WL; Cluster2: DFD, FJS, JFS, LGS, SHS; Cluster3: DSH, KKS; Cluster4: BDGS, FP, HBY, HG, MCS, QZMS, SNJ, WFHH, MLZ; Cluster5: EM (monotypic). Detailed cluster assignments are provided in Supplementary S4 Table.

thumbnail
Fig 2. Population genetic structure of T. sinense according based on the adegenet and ADMIXTURE analysis.

A, Best choice of number of clusters based on adegenet analysis; B, PCA result of the five clusters based on adegenet analysis; C, the patterns are shown with the CV error value for the K value (K = 2–27) showing the best K = 5; D, the five clades are represented in different colors (CladeI-Vare denoted by blue, purple, yellow, green and red, respectively).

https://doi.org/10.1371/journal.pone.0324161.g002

The optimal number of genetic clusters (K) for the 164 T. sinense individuals was determined via CV error minimization in ADMIXTURE analysis. CV errors ranged from 0.35 to 0.66 across K values 1–27, with the lowest error (0.35) observed at K = 5 (Fig 2C, Supplementary S5 Table). Complete results for all tested K values (1–27) are presented in Supplementary S2 Fig. At K = 5, individuals from 25 populations were partitioned into five genetically distinct clades with maximal probability: CladeI: 8 populations from the Hengduan and Wuling Mountains; CladeII: 4 populations from the Wuling Mountains; CladeIII: 5 populations from the Minshan Mountains; CladeIV: 3 populations from the Hengduan Mountains; CladeV: 5 populations from the Qinling-Daba Mountains. Geographic distributions and clade assignments are visualized in Fig 2D and detailed in Supplementary S6 Table.

DAPC was performed to characterize genetic structure among 164 T. sinense individuals. Principal components 1–3 (PC1–PC3) accounted for 33.62% of cumulative variance (Fig 3A and B, Supplementary S7 Table). DAPC plots revealed distinct clustering of the five genetic groups identified in ADMIXTURE analysis, confirming their genetic divergence.

thumbnail
Fig 3. DAPC and Maximum-likelihood (ML) phylogenetic analysis of T. sinense samples, with the proportion of the variance explained being 15.03% for PC1, 10.49% for PC2 and 8.1% for PC3.

A, PC1 V PC2; B, PC1 V PC3; C, ML phylogenetic tree of 164 individuals from 25 T. sinense populations constructed with the genome–wide SNP dataset.

https://doi.org/10.1371/journal.pone.0324161.g003

A ML phylogenetic tree was reconstructed using high-quality unlinked SNPs with RAxML v7.3 [34]. The tree exhibited strong bootstrap support across most branches, validating its reliability. Five major clades were resolved, CladeI: DFD, EM, SHS, LGS, KKS, JFS, DSH, FJS; CladeII: SNJ, BDGS, MLZ, QZMS, WFHH; CladeIII: TJH, BSJ, BH, HK, WL; CladeIV: KP, BMXS, GLGS; CladeV: SNJ, MCS, HG, HBY, FP. Subclustering within CladeI and CladeIII was observed; CladeI-Subclade1: KKS, JFS, FJS, LGS, SHS; CladeI-Subclade2: DFD, EM, DSH; CladeIII-Subclade1: BSJ, BH, TJH; CladeIII-subclade2: WL, HK, BH. These results are visualized in Fig 3C.

Population differentiation, LDdecay, and AMOVA analyses

Population differentiation coefficients (Fst) among the 25 T. sinense populations averaged 0.33 (range: 0.14–0.50) across 164 individuals (Table 4). According to Wright' s guidelines, Fst values >0.25 indicate substantial population divergence. In this study, Fst values between certain clades fell below 0.25, whereas others exceeded this threshold, with the highest differentiation observed between CladesII and IV (Fst = 0.50). Gene flow (Nm) estimates among the five clades ranged from 0.28 (CladesIV and V) to 1.53 (CladesII and V) (Table 5).

Analysis of molecular variance (AMOVA) indicated that 77% of total genetic variance resided within populations, with only 23% distributed among populations (Table 6). This pattern suggests that the majority of genetic diversity in T. sinense occurs within populations. A calculated Nm value of 1.66 indicates moderate gene flow among populations.

thumbnail
Table 6. Analysis of molecular variance (AMOVA) analysis of genetic variation within and among populations in T. sinense.

https://doi.org/10.1371/journal.pone.0324161.t006

LD dynamics were characterized across the five inferred clades. CladeI exhibited the slowest LD attenuation, with r² values remaining ≥0.94 up to 9 kb. Conversely, CladeV displayed the most rapid LD decay, where r² dropped below 1.00 at ~3 kb, followed by CladesIII and IV (r² = 1.00 at 9 kb). CladeII maintained r² ≥ 1.00 until ~15 kb. Integrating evidence from high nucleotide diversity, phylogenetic relationships, and the shortest LD decay distance, CladeV was hypothesized to represent the evolutionary origin center (Fig 4, Table 7). Geographically, CladeV and CladeIV were the most isolated (Fst = 0.47), indicating strong genetic differentiation. Additionally, CladeIV harbored the lowest genetic diversity among clades, further supporting the Qinling-Daba Mountains as the species' origin center.

thumbnail
Table 7. Linkage disequilibrium decay statistics of r2 and LD distance of five clades.

https://doi.org/10.1371/journal.pone.0324161.t007

Discussion

Effective conservation of endangered species requires characterization of genetic diversity and population structure. He and I are widely recognized as key metrics of genetic diversity, where higher values reflect greater genetic variability and reduced population uniformity. In this study, interpopulation He and I averaged 0.10 and 0.16, respectively (Table 2). Compared to other endangered taxa—Tetraena mongolica (He = 0.35, I = 0.27), Davidia involucrata (He = 0.34, I = 0.51), Chosenia arbutifolia (He = 0.37, I = 0.53), and Sinowilsonia fienryi (He = 0.28, I = 0.40)—T. sinense exhibited significantly lower genetic diversity [3740]. Populations from Sichuan and Guizhou provinces showed elevated diversity, likely due to Southwest China' s favorable climatic conditions (humidity and temperature) supporting species persistence. Conversely, GLGS, BMXS, and KP populations in Yunnan displayed reduced diversity, potentially resulting from thermal stress. Despite overall low genetic diversity, a moderate gene flow estimate (Nm = 1.66) indicates residual adaptive potential in wild T. sinense populations. Previous research highlights plant breeding systems and gene flow as critical determinants of genetic diversity [41,42]. Limited pollen dispersal and short-distance seed movement in T. sinense likely exacerbate inbreeding depression, increasing homozygosity and exposing deleterious recessive alleles. These factors contribute to the species' endangered status, marked by skewed age structures with abundant juveniles but few mature individuals.

Genetic differentiation in Tetracentron sinense arises from long-term genetic isolation and species-specific biological traits. Analysis of Fst, a metric quantifying clade-level divergence, revealed an average genetic differentiation of 0.33 among the five clades, indicating substantial population subdivision. AMOVA results further showed that 77% of genetic variation resided within populations. While conservation initiatives have prioritized protecting populations in nature reserves, the scarcity of mid-aged individuals poses challenges for population renewal. High intrapopulation genetic diversity suggests the species retains adaptive potential for environmental shifts.

Sampling sites for T. sinense are situated in montane nature reserves, preserving habitat heterogeneity. Significant genetic differentiation among populations has resulted from prolonged geographic isolation. Comparable trends of reduced genetic variation and elevated differentiation have been documented in other endangered taxa, including Loropetalum subcordatum, Omphalogramma souliei, and Cypripedium japonicum [4345]. Climate change and anthropogenic activities have driven recent population declines in T. sinense. Collectively, these results suggest that reduced genetic variation and elevated differentiation in T. sinense stem from prolonged isolation, climate change, and anthropogenic pressures.

Consistent genetic clustering into five clades was inferred by adegenet and Admixture analyses for 164 T. sinense individuals across 25 populations. Subtle discrepancies in cluster composition—adegenet Cluster1 encompassing Admixture CladesIII and IV, adegenet Cluster4 merging Admixture CladesII and V, and adegenet Clusters2 and 3 corresponding to Admixture CladeI—likely arise from algorithmic differences between software platforms. Despite these nuances, the overall congruence in clustering patterns validates the robustness of the inferred population structure. Using RAD-seq, we identified five geographically structured genetic clusters in T. sinense, aligning with Sun et al.' s [18] K = 5 optimal solution and previous refugia hypotheses [4,18,46]. Geographically, CladeI occupied the southern Sichuan Basin (DFD/EM/DSH on the Sichuan-Guizhou border and KKS/JFS/FJS/LGS/SHS near Wuling Mountains), CladeII spanned northern Wuling and southern Qinling-Daba Mountains, CladeIII resided in northern Minshan Mountains, CladeIV occupied southwestern Hengduan Mountains, and CladeV inhabited Qinling-Daba Mountains. These clades are partitioned by major mountain systems (Hengduan, Wuling, Minshan, Qinling-Daba), underscoring historical isolation' s role in shaping distributions [4]. Comparative analysis revealed: (1) Li et al.' s ClusterI corresponds to our merged CladesI and III, while their ClusterII aligns with our CladesII and V; (2) Sun et al. [18] classified Gansu populations (RW/RT) as a distinct CladeI, whereas we integrated them into broader phylogeographic groupings. Population genetic structure reflects spatiotemporal genetic variation distribution [41,47], with this study' s findings advancing understanding of T. sinense’s evolutionary history.

These findings indicate that genetic clustering in T. sinense is primarily structured by major mountain systems (Hengduan, Wuling, Minshan, and Qinling-Daba Mountains), which have historically served as refugia for multiple taxa in the region, including Oxyria sinensis, Roscoea humeana, Pinus armandii, and Eurycorymbus cavaleriei [4851]. Geographical isolation mediated by these ranges likely contributes to the species' low genetic variation, high differentiation, and unique genetic structure. The observed disjunct distributions are hypothesized to result from intense Quaternary glaciations and tectonic activity (e.g., Tibetan Plateau uplift) [5256]. For isolated populations, restricted long-distance gene flow—exacerbated by topographic barriers—may drive patterns similar to those documented in Phellodendron amurense, Tetraena mongolica, and Gymnocarpos przewalskii [39,57,58]. Prolonged geographic isolation can lead to reproductive isolation and elevated extinction risk, as reflected in the fragmented genetic structure of T. sinense populations.

Classified as a relict endangered species in China [9], T. sinense has been the focus of recent conservation efforts through nature reserve establishment to safeguard its germplasm resources. Maintaining intact genetic diversity is critical for preserving evolutionary potential to adapt to future environmental changes. Conservation strategies include: (1) integrated in situ/ex situ preservation and reintroduction for low-diversity populations (HBY, HG, BSJ, BH); (2) germplasm nurseries with seed collection for high-diversity populations (SHS, BDGS, FP, WL); (3) gene flow augmentation from large source to small sink populations to mitigate inbreeding depression and genetic drift [58,59]; (4) clade-specific management units aligned with the five evolutionary clusters identified via admixture and DAPC analyses; and (5) genetic rescue interventions (immigration, pollen exchange, controlled hybridization) for isolated populations. Ex situ preservation and reintroduction are particularly vital for sustaining genetic diversity in this endangered species.

Conclusions

The current study examined the population genetic structures and conservation strategies for the relict and endangered plant, T. sinense, in central and southern China. Using genome-wide SNPs (RAD-seq) analysis, we found low genetic diversity and high differentiation in T. sinense. Additionally, we confirmed the presence of five conservation management units, with the Hengduan, Wuling, Minshan, and Qinling-Daba Mountains serving as refuges in China. The populations in the Qinling-Daba Mountains are considered the original center of T. sinense. This study represents the first exploration of genetic information through SNP analysis. Furthermore, we provide theoretical guidance for future conservation efforts.

Supporting information

S1 Fig. The quality distribution of sequencing.

https://doi.org/10.1371/journal.pone.0324161.s001

(TIF)

S2 Fig. Clades from 1–27 based on Admixture analysis.

https://doi.org/10.1371/journal.pone.0324161.s002

(PDF)

S1 Table. Detail informations of T. sinense samples.

https://doi.org/10.1371/journal.pone.0324161.s003

(XLSX)

S2 Table. The raw data of the 164 T. sinense individuals.

https://doi.org/10.1371/journal.pone.0324161.s004

(XLSX)

S3 Table. Single locus nucleotide diversity (π) value.

https://doi.org/10.1371/journal.pone.0324161.s005

(XLSX)

S4 Table. 164 T. sinense individuals based on the adegenet analysis.

https://doi.org/10.1371/journal.pone.0324161.s006

(XLSX)

S5 Table. K value and CV error in Admixture analysis.

https://doi.org/10.1371/journal.pone.0324161.s007

(XLSX)

S6 Table. Admixture determined clades of 164 T. sinense individuals.

https://doi.org/10.1371/journal.pone.0324161.s008

(XLSX)

S7 Table. Principal components (PC1, PC2 and PC3) of 164 T. sinense individuals.

https://doi.org/10.1371/journal.pone.0324161.s009

(XLSX)

Acknowledgments

The authors are also grateful to 25 nature reserves and plant protection stations for their help with the geographic information of the distribution of T. sinense. This study has been supported by National Natural Science Foundation of China (32070371 to X.G., 31760127 to Q.L), High-level graduate research project, Xizang University (2021-GSP-B019 to Z.T.), and authors thank them here. All claims expressed in this article are solely those of the author and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

  1. 1. Adams WM, Aveling R, Brockington D, Dickson B, Elliott J, Hutton J, et al. Biodiversity conservation and the eradication of poverty. Science. 2004;306(5699):1146–9. pmid:15539593
  2. 2. Holsinger K. Conservation of genetic diversity in rare and endangered plants. The unity of evolutionary biology. 1991. p. 626–32.
  3. 3. Hedrick P. Genetic conservation in captive populations and endangered species. Appl Popul Biol. New York: Springer. 1992. p. 45–68.
  4. 4. Li S, Gan X, Han H, Zhang X, Tian Z. Low within-population genetic diversity and high genetic differentiation among populations of the endangered plant Tetracentron sinense Oliver revealed by inter-simple sequence repeat analysis. Annals of Forest Science. 2018;75(3).
  5. 5. Yang Z, Lu R, Tao C, Chen S, Ji Y. Microsatellites for Tetracentron sinense (Trochodendraceae), a Tertiary relict endemic to East Asia. Am J Bot. 2012;99(8):e320-2. pmid:22837412
  6. 6. Wu Z, Sun H, Zhou Z, Peng H, Li D. Origin and differentiation of endemism in the flora of China. Front Biol China. 2007;2(2):125–43.
  7. 7. Wang Y-F, Lai G-F, Efferth T, Cao J-X, Luo S-D. New glycosides from Tetracentron sinense and their cytotoxic activity. Chem Biodivers. 2006;3(9):1023–30. pmid:17193335
  8. 8. Frankel O, Soulé M. Conservation and evolution. 1981.
  9. 9. Fu L. Plant red book in China—rare and endangered plants (Book I). Beijing: China Science Press. 1992.
  10. 10. Cao L, Gan X, He S. Effect of different geographical provenances and matrix on seed germination and seeding initial growth of Tetracentron sinense. Guihaia. 2012;32(5):656–62.
  11. 11. Gan X, Xie D, Cao L. Sporogenesis and development of gametophytes in an endangered plant, Tetracentron sinense Oliv. Biol Res. 2012;45(4):393–8. pmid:23558997
  12. 12. Gan X, Cao L, Zhang X, Li H. Floral biology, breeding system and pollination ecology of an endangered tree Tetracentron sinense Oliv. (Trochodendraceae). Bot Stud. 2013;54(1):50. pmid:28510885
  13. 13. Hong-Yan H, Ning X, Shan L, Xue-Mei Z. The effect of low temperature during imbibition on germination characteristics of Tetracentron sinense (Tetracentraceae) seeds. Plant Divers. 2015;37(05):586.
  14. 14. Tian Z, Li H, Li W, Gan X, Zhang X, Fan Z. Structural characteristics and niches of dominant tree populations in <em>Tetracentron sinense</em> communities: implications for conservation. Bot Sci. 2018;96(2):157–67.
  15. 15. Zhang H, Duan F, Li Y, Wang Q, Lu X, Gan X, et al. Population structure and quantitative characteristics of Tetracentron sinense (Trochodendraceae) in Leigong Mountain Nature Reserve, China. Bot Sci. 2020;98(1):86–100.
  16. 16. Fan W, Li W, Zhang X, Gan X. Photosynthetic Physiological Characteristics of Tetracentron sinense Oliv in Different DBH Classes and the Factors Restricting Regeneration. J Plant Growth Regul. 2021;41(5):1943–52.
  17. 17. Lu X, Xu N, Chen Y, Li Y, Gan X. Effects of Light Intensity and Ground Cover on Seedling Regeneration of Tetracentron sinense Oliv. J Plant Growth Regul. 2020;40(2):736–48.
  18. 18. Sun Y, Moore MJ, Yue L, Feng T, Chu H, Chen S, et al. Chloroplast phylogeography of the East Asian Arcto‐Tertiary relict Tetracentron sinense (Trochodendraceae). Journal of Biogeography. 2014;41(9):1721–32.
  19. 19. Li M, Yang Y, Xu R, Mu W, Li Y, Mao X, et al. A chromosome-level genome assembly for the tertiary relict plant Tetracentron sinense oliv. (trochodendraceae). Mol Ecol Resour. 2021;21(4):1186–99. pmid:33486895
  20. 20. Acquadro A, Barchi L, Portis E, Carrasquilla-Garcia N, Tirone M, Lanteri S. Rad2seq: an efficient protocol for plant genotyping by sequencing. In: IX International Symposium on Artichoke, Cardoon and Their Wild Relatives. Not specified. 2015.
  21. 21. Brookes AJ. The essence of SNPs. Gene. 1999;234(2):177–86. pmid:10395891
  22. 22. Chen X, Gao Z. The study and application of DNA molecular marker technique. Mol Plant Breed. 2019;17:1970–7.
  23. 23. Dovu J, Doyle J. A rapid DNA isolation procedure for small quantities of fresh leaf material. Phytochem Bull. 1987;19:11–5.
  24. 24. Baird NA, Etter PD, Atwood TS, Currey MC, Shiver AL, Lewis ZA, et al. Rapid SNP discovery and genetic mapping using sequenced RAD markers. PLoS One. 2008;3(10):e3376. pmid:18852878
  25. 25. Catchen J, Hohenlohe PA, Bassham S, Amores A, Cresko WA. Stacks: an analysis tool set for population genomics. Mol Ecol. 2013;22(11):3124–40. pmid:23701397
  26. 26. Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27(15):2156–8. pmid:21653522
  27. 27. Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics. 1989;123(3):585–95. pmid:2513255
  28. 28. Anderson CJ, Tay WT, McGaughran A, Gordon K, Walsh TK. Population structure and gene flow in the global pest, Helicoverpa armigera. Mol Ecol. 2016;25(21):5296–311. pmid:27661785
  29. 29. Hu Y, Wang L, Xie X, Yang J, Li Y, Zhang H. Genetic diversity of wild populations of Rheum tanguticum endemic to China as revealed by ISSR analysis. Biochemical Systematics and Ecology. 2010;38(3):264–74.
  30. 30. Excoffier L, Lischer HEL. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010;10(3):564–7. pmid:21565059
  31. 31. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75. pmid:17701901
  32. 32. Alexander DH, Lange K. Enhancements to the ADMIXTURE algorithm for individual ancestry estimation. BMC Bioinformatics. 2011;12:246. pmid:21682921
  33. 33. Jombart T. adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics. 2008;24(11):1403–5. pmid:18397895
  34. 34. Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22(21):2688–90. pmid:16928733
  35. 35. Yang J, Lee SH, Goddard ME, Visscher PM. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet. 2011;88(1):76–82. pmid:21167468
  36. 36. Zhang C, Dong S-S, Xu J-Y, He W-M, Yang T-L. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics. 2019;35(10):1786–8. pmid:30321304
  37. 37. Chen J-M, Zhao S-Y, Liao Y-Y, Gichira AW, Gituru RW, Wang Q-F. Chloroplast DNA phylogeographic analysis reveals significant spatial genetic structure of the relictual tree Davidia involucrata (Davidiaceae). Conserv Genet. 2014;16(3):583–93.
  38. 38. He Y, Ji W, Zhou L, Ma J, Ji X, Li H. Genetic diversity and relationships of endangered Sinowilsonia henryi hemsl by issr. 2014.
  39. 39. Cheng J, Kao H, Dong S. Population genetic structure and gene flow of rare and endangered Tetraena mongolica Maxim. revealed by reduced representation sequencing. BMC Plant Biol. 2020;20(1):391. pmid:32842966
  40. 40. Wang Y, Jiao Z, Zheng J, Zhou J, Wang B, Zhuge Q, et al. Population Genetic Diversity and Structure of an Endangered Salicaceae Species in Northeast China: Chosenia arbutifolia (Pall.) A. Skv. Forests. 2021;12(9):1282.
  41. 41. Hongxin W, Zhi-Ang H. Plant breeding system, genetic structure and conservation of genetic diversity. Biodiversity Science. 1996;04(2):92–6.
  42. 42. Morjan CL, Rieseberg LH. How species evolve collectively: implications of gene flow and selection for the spread of advantageous alleles. Mol Ecol. 2004;13(6):1341–56. pmid:15140081
  43. 43. HUANG Y, ZHANG C, LI D. Low genetic diversity and high genetic differentiation in the critically endangeredOmphalogramma souliei(Primulaceae): implications for its conservation. J of Sytematics Evolution. 2009;47(2):103–9.
  44. 44. Gong W, Gu L, Zhang D. Low genetic diversity and high genetic divergence caused by inbreeding and geographical isolation in the populations of endangered species Loropetalum subcordatum (Hamamelidaceae) endemic to China. Conserv Genet. 2010;11(6):2281–8.
  45. 45. Tian HZ, Han LX, Zhang JL, Li XL, Kawahara T, Yukawa T, et al. Genetic diversity in the endangered terrestrial orchid Cypripedium japonicum in East Asia: Insights into population history and implications for conservation. Sci Rep. 2018;8(1):6467. pmid:29691494
  46. 46. Liu P-L, Zhang X, Mao J-F, Hong Y-M, Zhang R-G, E Y, et al. The Tetracentron genome provides insight into the early evolution of eudicots and the formation of vessel elements. Genome Biol. 2020;21(1):291. pmid:33267872
  47. 47. Brown AH. Isozymes, plant population genetic structure and genetic conservation. Theor Appl Genet. 1978;52(4):145–57. pmid:24317500
  48. 48. Wang J, Gao P, Kang M, Lowe AJ, Huang H. Refugia within refugia: the case study of a canopy tree (Eurycorymbus cavaleriei) in subtropical China. Journal of Biogeography. 2009;36(11):2156–64.
  49. 49. Liu L, Hao Z-Z, Liu Y-Y, Wei X-X, Cun Y-Z, Wang X-Q. Phylogeography of Pinus armandii and its relatives: heterogeneous contributions of geography and climate changes to the genetic differentiation and diversification of Chinese white pines. PLoS One. 2014;9(1):e85920. pmid:24465789
  50. 50. Meng L, Chen G, Li Z, Yang Y, Wang Z, Wang L. Refugial isolation and range expansions drive the genetic structure of Oxyria sinensis (Polygonaceae) in the Himalaya-Hengduan Mountains. Sci Rep. 2015;5:10396. pmid:26013161
  51. 51. Chu X, Gugger PF, Li L, Zhao J, Li Q. Responses of an endemic species (Roscoea humeana) in the Hengduan Mountains to climate change. Diversity and Distributions. 2021;27(11):2231–44.
  52. 52. Sun H, Li Z. Qinghai-tibet plateau uplift and its impact on tethys flora. Advances in Earth Sci. 2003;18(6):852–62.
  53. 53. Hedrick KA, Seong YB, Owen LA, Caffee MW, Dietsch C. Towards defining the transition in style and timing of Quaternary glaciation between the monsoon-influenced Greater Himalaya and the semi-arid Transhimalaya of Northern India. Quaternary International. 2011;236(1–2):21–33.
  54. 54. Shi H, Shi X, Glasmacher UA, Yang X, Stockli DF. The evolution of eastern Sichuan basin, Yangtze block since Cretaceous: Constraints from low temperature thermochronology. Journal of Asian Earth Sciences. 2016;116:208–21.
  55. 55. Xie C, Xie D-F, Zhong Y, Guo X-L, Liu Q, Zhou S-D, et al. The effect of Hengduan Mountains Region (HMR) uplift to environmental changes in the HMR and its eastern adjacent area: Tracing the evolutionary history of Allium section Sikkimensia (Amaryllidaceae). Mol Phylogenet Evol. 2019;130:380–96. pmid:30240912
  56. 56. Hou K, Qian H, Zhang Y, Zhang Q. Influence of tectonic uplift of the Qinling Mountains on the paleoclimatic environment of surrounding areas: Insights from loess–paleosol sequences, Weihe Basin, central China. CATENA. 2020;187:104336.
  57. 57. Yang H, Li X, Liu D, Chen X, Li F, Qi X, et al. Genetic diversity and population structure of the endangered medicinal plant Phellodendron amurense in China revealed by SSR markers. Biochemical Systematics and Ecology. 2016;66:286–92.
  58. 58. Zhang H-X, Wang Q, Jia S-W. Genomic Phylogeography of Gymnocarpos przewalskii (Caryophyllaceae): Insights into Habitat Fragmentation in Arid Northwestern China. Diversity. 2020;12(9):335.
  59. 59. Pulliam HR. Sources, Sinks, and Population Regulation. The American Naturalist. 1988;132(5):652–61.