Figures
Abstract
To explore the genetic diversity of wild tea plants, the methods of the phenotypic traits and SSR molecular marker analysis were adopted to analyze the 40 samples of 7 wild tea populations in Chongzuo, Guangxi. The results showed that the coefficient of variation of 18 descriptive phenotypic traits ranged from 15.76% to 50.64%, and the diversity index ranged from 0.38 to 1.04. The coefficient of variation of the four numerical phenotypic traits ranged from 31.91% to 56.86%. The diversity index ranged from 1.70 to 1.93. Based on phenotypic trait clustering, the 40 tea plant samples were divided into three distinct groups. A total of 117 alleles (Na) were detected by 14 pairs of SSR primers. The polymorphism information content (PIC) of primers ranged from 0.25 to 0.87. The study showed that the optimal K value for population number, based on genetics, was 2. Meanwhile, the genetic distance among populations ranged from 0.24–0.74, the percentage of variation among populations was 20.46%. The genetic distance between the 40 tea plant resources ranged from 0.20–0.95, and the percentage of variation between individuals was 74.22%. When the individual genetic distance was 0.33, the 40 tea plant resources were divided into three groups. This study elucidates the genetic diversity of 40 wild tea plant accessions from seven populations in Chongzuo, Guangxi, which can provide a theoretical basis for breeding superior tea cultivars and conserving tea plant resources.
Citation: Wei Z, Wu F, Feng H, Li Z, Jian Q, Zhao Y, et al. (2025) Genetic diversity analysis of wild tea plant resources in Chongzuo based on phenotypic traits and SSR molecular markers. PLoS One 20(11): e0337070. https://doi.org/10.1371/journal.pone.0337070
Editor: Muhammad Abdul Rehman Rashid, Government College University Faisalabad, PAKISTAN
Received: August 6, 2025; Accepted: November 4, 2025; Published: November 18, 2025
Copyright: © 2025 Wei et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: This research received support from Guangxi South Subtropical Agricultural Science Research Institute self-raised project (NYS2025ZC13), the Basic scientific research business project of Guangxi Academy of Agricultural Sciences (Guikenong 2023YM39, Guikenong 2021YT167), the Guangxi Agricultural Science and Technology Innovation Alliance Project (Guangxi Agricultural Science Technology Alliance 202506-1) and the Innovation and Application Team of Microbial Fermented Tea (NYS2025ST08).
Competing interests: The authors have declared that no competing interests exist.
1. Introduction
Tea (Camellia sinensis [L.] O. Kuntze) is an evergreen plant belonging to the family Theaceae and the genus Camellia [1–3]. Through long-term evolution and widespread cultivation, it has developed numerous varieties and ecotypes. The species is naturally distributed across Southwest China, South China and northern parts of Southeast Asia. Chongzuo in Guangxi, located in southwest of China (21 ° 36 '−23 ° 22' N, 106 ° 33'] −108 ° 6 'E), has a long history of tea cultivation. This region harbors many wild tea plant populations and holds significant potential for future development [4]. As such, Chongzuo offers considerable value for research on the origin of tea plants, biodiversity, and breeding of superior cultivars [5].
Genetic diversity analysis plays a crucial role in the identification and evaluation of germplasm resources, aiding in the discovery and selection of superior germplasm [6]. Phenotypic trait analysis offers an intuitive, straightforward and cost-effective approach to assess genetic diversity in tea plants, helping to elucidate patterns of genetic variation based on observable characteristics. For instance, a study of 89 wild tea resources from Yingde revealed rich genetic diversity [7]. Similarly, research on 195 arbor-type tea germplasm resources in Guizhou Province showed considerable variation in the coefficient of variation among different numerical trait types, although the genetic diversity indices differed only slightly [8]. Phenotypic traits enable rapid identification of desirable characteristics, thereby improving breeding efficiency. However, these traits are easily influenced by environmental factors, and relying solely on phenotypic data may compromise the accuracy of genetic diversity assessments. In contrast, Molecular marker technology provides a more accurate and stable genetic diversity analysis.
SSR molecular marker technology is widely used in genetic diversity studies of tea plants due to its advantages of high polymorphism, good repeatability, co-dominant inheritance and abundance [9]. For example, this method was applied to analyze the genetic diversity and relationships among 45 Dali tea germplasm samples and 5 Pu ‘er tea germplasm samples from the Lancang River Basin. The results indicated that Dali tea germplasm resources exhibit moderate genetic diversity and richness, with classification correlating to geographical origin [10]. Similarly, in a study of wild tea germplasm from Liubao Town and Nandu Town in Wuzhou City, SSR markers revealed abundant genetic diversity and high utilization value of the local tea resources [3]. Currently, there are few reports on the genetic diversity of wild tea resources in Chongzuo area. Therefore, evaluating the wild tea resources in this region will facilitate the discovery, innovative utilization, and conservation of valuable germplasm. In this study, phenotypic traits and SSR markers were used to analyze the genetic diversity of 40 wild tea plant samples from seven populations in Chongzuo. Th findings aim to provide a theoretical basis for the identification, conservation and utilization of unique tea resources in Chongzuo and Guangxi, as well as to support future breeding efforts.
2. Materials and methods
2.1. Plant material
The 40 samples of wild tea plants were collected from Chongzuo, Guangxi. detailed information (Table 1) .
2.2. Observation method of phenotypic traits of tea plant
The phenotypic traits of 40 tea samples were observed and statistically counted (Table 2), referring to the “Specification and Data Standards for the Description of Tea Germplasm Resources” [11]. The descriptive phenotypic traits were repeatedly observed 10 times, and the numerical phenotypic traits were repeatedly measured 3 times. Finally, the result of this research was the average value of the three measurements.
2.3. SSR primer information
The primer sequences were based on (‘Camellia sinensis (L.) O. Kuntze cv. Shuchazao’) genome design and selection of some of the reported SSR primers for tea plants, detailed information were shown in Table 3 [12–13], and the primers were synthesized by Wuhan Tianyi Huayu Gene Technology Co., Ltd.
2.4. Sequence amplification
Genomic DNA was extracted from tea samples using a plant tissue genomic DNA extraction kit. The quality and concentration of DNA were assessed using a micro-volume nucleic acid protein analyzer, and the purity was evaluated by 1% agarose gel electrophoresis. And then the qualified DNA was stored at −20°C for future use. The PCR amplification system included 500 ng of template DNA, 1 μL each of forward and reverse primers, 12.5 μL of 2 × TaqPCR MasterMix enzyme, and ddH2O to a final volume of 25 μL. The PCR amplification program was set as following: 95°C for 5 minutes for initial denaturation; 95°C for 30 seconds, followed by gradient annealing from 62°C to 52°C for 30 seconds, and extension at 72°C for 30 seconds, running for 10 cycles; 95°C for 30 seconds, 52°C for 30 seconds, and 72°C for 30 seconds, for 25 cycles; a final extension at 72°C for 20 minutes, and storage at 4°C. Following PCR, amplification products were analyzed using fluorescent capillary electrophoresis. GeneMarker software was used to analyze the results, including determining allele counts, peak patterns, and genotypes for each sample.
2.5. Data analysis
Phenotypic data were organized and analyzed using Excel 2019, calculating the distribution frequency, diversity index (H’), standard deviation, mean, and coefficient of variation (). SPSS 22.0 was used to standardize all phenotypic data, followed by hierarchical clustering analysis, and the construction of a dendrogram using the system clustering method. The Genetic diversity index (H’) was calculated as: H′ = -ΣPiln Pi. Where Pi is the frequency of the i-th descriptive value of the trait. For quantitative traits, the Genetic diversity index (H’) was computed after qualitative treatment, assigning different grades to each observation following the method of Guo et al [14].
The SSR marker amplification results were read as “1” for a presence of a band at a given locus and “0” for absence, forming a 1/0 matrix. In GenAlEx version 6.501, SSR locus polymorphism was assessed, including the calculation of observed alleles (Na), effective alleles (Ne), Shannon ′ s information index (I), polymorphism information content (PIC), observed heterozygosity (Ho), and expected heterozygosity (He). Variation, differentiation, and genetic differentiation coefficient (Fst) as well as gene flow (Nm) among and within populations were calculated. Genetic distances between populations were computed using PowerMarker software. Clustering analysis was performed and a circular dendrogram was drawn with the UPGMA method. A Mantel test was performed using the skbio package in Python to assess the correlation between genetic distance (based on SSR markers) and phenotypic Euclidean distance. STRUCTURE 2.3.4 was used to analyze the population structure of 40 samples, with settings of K = 10, a burn-in period of 10,000, and MCMC (Markov Chain Monte Carlo) set to 100,000. Each K value was run 20 times, and the optimal ΔK value was calculated.
3. Result
3.1. Descriptive phenotypic Characterization
The statistical results of the distribution frequency, coefficient of variation (CV), and Shannon-weaver diversity index of the descriptive phenotypic traits of the tested tea resources were shown in Table 4. The coefficients of variation for the 18 descriptive traits ranged from 15.76% to 50.64% (all > 10%), with an average of 29.48%, indicating considerable variability among these traits, among which the coefficient of leaf color was the largest and tree type was the smallest. The coefficient of variation of 13 traits was higher than the average value, such as tree posture, branch density, bud and leaf pubescence, leaf color, leaf surface, leaf blade, serration sharpness, serration density, serration depth, leaf base, leaf tip, leaf margin and leaf size was higher than the average value, indicating that these 13 traits varied greatly in the tested tea resources. The Shannon-weaver diversity index ranged from 0.38 to 1.04, with an average of 0.76. The coefficient of serration sharpness was the largest and the tree type was the smallest. The diversity indexes of 12 traits were higher than the average value, such as branch density, bud leaf color, leaf shape, leaf color, leaf surface, leaf blade, leaf quality, serration sharpness, serration density, leaf tip, leaf margin and leaf size were higher than the average value, and the diversity indexes of leaf color, serration sharpness, serration density, leaf tip and leaf size were larger (≥1), indicating that the variation types of these 5 traits were rich.
3.2. Numerical phenotypic trait analysis
The results of numerical phenotypic traits of the tested tea resources were shown in Table 5, The leaf length of 40 tea resources ranged from 4.66 to 18.25, with an average of 10.42. The leaf width ranged from 1.78 to 6.57, with an average of 3.61. The maximum length-width ratio was 7.6, the minimum value was 1.17, and the average value is 3.12. The leaf area ranged from 7.93 to 69.68, with an average of 27.10. The coefficients of variation (CV) for the quantitative traits ranged from 31.91% to 56.86%, all exceeding 10%, with an average of 41.43%. This indicates considerable variability and relatively low consistency among these traits. The Shannon-weaver diversity index was the highest length-width ratio (1.93) and the smallest leaf area (1.70), with an average value of 1.86, and the figures of leaf width, leaf length and length-width ratio were higher than those of phenotypic traits. Overall, the numerical phenotypic traits of the 40 tested tea resources showed rich diversity.
3.3. Cluster analysis of phenotypic traits
The cluster analysis was carried out based on the Euclidean genetic distance by systematic clustering method (Fig 1). The results showed that when the Euclidean distance was 20, the 40 tea plants samples were divided into three groups. Group I contained 10 resources, including 5 resources of the Raoxiu population (RS) and 5 resources of the Jiangbian population (JB). The principal tree type of Group I was small tree type, with the morphology characteristics of medium leaf size, erect growth habit, dense branching, yellow-green bud leaves, flat leaf surface, flat leaf surface, hard leaf texture, obtuse leaf serrations, flat leaf margin, and semi-erect leaf attitude. Group Ⅱ contained 25 resources, Group II contained 25 resources, including 10 resources from Changqiao population (CQ), 5 resources from Xinlin population (XL), 5 resources from Liutou population (LT) and 5 resources from Nanei population (NN). The principal tree type of Group II was small tree type, with the morphological characteristics of moderate to dense branching density, yellow-green to pale green bud leaves, oblong to lanceolate leaf shape, slightly raised to raised leaf surface, cuneate leaf base, ascending to slightly ascending leaf insertion angle, and small to medium leaf size. Group Ⅲ contained 5 resources, all from Qingshan population (QS). The principal morphological characteristics include: arboreous habit, large leaf size, sparse branching density, green bud leaves, sparsely pubescent of buds and leaf, oblong-elliptic leaf shape, dark green leaf color, slightly raised leaf surface, slightly recurved leaf blade, medium leaf blade texture, sharp and dense leaf serrations, cuneate leaf base, obtuse leaf apex, undulate leaf margin, and horizontal leaf attitude. Group III was distinguished from Group I and II primarily by tree-structured, branch density, bud leaf coloration, leaf color, leaf insertion angle, and leaf size. Group I and II were differentiated mainly by bud leaf coloration, leaf surface morphology, leaf quality, and serration acuteness.
3.4. Polymorphism analysis of SSR loci
The polymorphism of 40 tea samples was detected by 14 pairs of primers (Table 6). A total of 117 alleles (Na) were detected totally, with an average number of 8.36 alleles. Among these data, the number of alleles detected at A5 and CSK065 was the least, which was 4, and the number of alleles detected at CsFM1715 was the most, which was 15. A total of 58.86 effective number of alleles (Ne) were detected at 14 loci, ranging from 1.36 to 8.41, with an average of 4.20, accounting for 50.4%. The Shannon diversity index (I) ranged from 0.54 to 2.36, with a mean value of 1.56. The observed heterozygosity (Ho) ranged from 0.08 to 0.87, with an average of 0.57. The expected heterozygosity (He) ranged from 0.26 to 0.88, with an average of 0.69. Among the 14 loci, 13 loci of the expected heterozygosity (He) were greater than the observed heterozygosity (Ho), and the observed heterozygosity (Ho) of 1 locus was greater than the expected heterozygosity (He). The mean value of fixation index (F) is 0.19. The polymorphism information content (PIC) ranged from 0.25 to 0.87, with an average of 0.66. Among them, there were 12 loci with PIC greater than 0.5, 1 locus with PIC between 0.25 and 0.5, and 1 locus with PIC less than 0.25. The SSR primers selected by the research had a high level of amplification site polymorphism on the tested materials, and the polymorphism of different SSR loci was significantly different, indicating that the 40 ancient tea tree resources tested had rich genetic diversity.
3.5. Population genetic diversity analysis
The results of genetic diversity of the seven populations were shown in Table 7. The number of alleles (Na) ranged from 2.29 to 5.50, with an average of 3.21. The effective number of alleles (Ne) ranged from 1.81 to 3.71, with an average of 2.51. The proportion of effective alleles ranged from 67.36% to 88.35%, and the alleles were evenly distributed in each population. The observed heterozygosity (Ho) of the population ranged from 0.45 to 0.70, with an average of 0.56. The expected heterozygosity (He) ranged from 0.36 to 0.65, with an average of 0.50. Shannon diversity index (I) ranged from 0.60 to 1.34, with an average of 0.89. In conclusion, these results demonstrate that all seven populations possess rich genetic diversity.
3.6. Genetic structure analysis
The genetic structure analysis of 40 tea resources showed that ΔK was the largest when it is at K = 2 (Fig 2). The 40 tea plant resources could be divided into two subgroups (Fig 3). There were 35 resources in subgroup I (blue) and 5 resources in subgroup II (red). Among them, 5 tea plant samples (S1-S5) of QS were distributed in subgroup II, and 35 tea plant samples (S6-S40) of the other 6 populations were distributed in subgroup I. According to the Q value distribution of the two subgroups, among the 40 tested tea tree resources, the Q value of S6-S40 was > 0.9, and the Q value of S1-S5 was < 0.6, indicating that the genetic background of QS was complex.
Numbers of S1 to S40 are shown in Table 1.
3.7. Analysis of the population genetic relationship
The genetic distance among the seven tea populations ranged from 0.24 to 0.74 (Table 8). Among them, the genetic distance of JB and RX was the shortest, and the genetic distance of QS and LT was the longest. The genetic distance clustering among the seven tea tree populations showed that (Fig 4), when the genetic distance was 0.20, it was clustered into two categories. Among them, the QS was clustered into group I alone, and the CQ, LT, JB, RX, NN and XL were clustered into group II. The QS had the farthest genetic relationship with other native populations. At the genetic distance of 0.17, the six populations in group II were clustered into two groups. Among the six populations, NN and XL were clustered into one group, and CQ, LT, JB and RX were clustered into the other group. NN and XL are distantly related to CQ, LT, JB, and RX populations. At the genetic distance of 0.12, only JB and RX were clustered together, and the genetic relationship was the closest. The population genetic relationship showed that the QS was far from the other six populations.
3.8. Analysis of genetic relationship of single tea plant
The genetic distance among the 40 tea resources ranged from 0.20 to 0.95 (Fig 5). When the genetic distance was 0.33, 40 tea samples were divided into three groups. The five resources of the QS were clustered into the first category, accounting for 12.5% of the survey resources. The five resources of QS were clustered into group I, accounting for 12.5% of the tested resources. Group II had 31 tea plant resources, accounting for 77.5% of the tested resources. Among them, tea resources from different regions were clustered into different subgroups according to genetic distance, and the subgroups were not clustered according to the same geographical location. Group III contained 4 tea plant resources, accounting for 10.00% of the survey resources, which were S8, S9 of RX and S12, S16 of CQ. The single tea plant genetic relationship showed that the genetic relationship among the 5 resources of the QS and the other 35 resources was distant, which is consistent with the results of the population genetic relationship analysis.
To evaluate the relationship between clustering based on phenotypic traits and SSR markers, a Mantel test was conducted between the genetic distance matrix (SSR markers) and the Euclidean distance matrix (phenotypic traits). A significant correlation was found (r = 0.211, P = 0.003 < 0.05), indicating that phenotypic divergence among the 40 tea plant samples is associated with their genetic differentiation.
3.9. Population genetic differentiation and gene flow analysis
The results of genetic differentiation coefficient (Fst) and gene flow (Nm) of 7 tea populations were shown in Table 9. The range of Fst among populations was 0.06–0.38. The Fst value between QS and NN was the largest, and the Fst value between QS and LT, XL, JB, RX, CQ were greater than 0.25. The Fst values between NN and (LT, JB, RX, XL), between XL and (LT, RX), and between RX and LT ranged from 0.15 to 0.25. The Fst values among other populations were less than 0.15. The gene flow (Nm) ranged from 0.41 to 4.06. The Nm between QS and LT, XL, JB, RX, CQ, NN was less than 1, while it was greater than 4 between CQ and JB, and the Nm of other groups was in the range of 1–4. The results of population genetic differentiation and gene flow showed that the genetic differentiation between QS and the other six populations was greater, and gene flow was lower.
3.10 analysis of molecular variance among populations
The analysis of molecular variance among 7 populations was shown in Table 10. The genetic variation among individuals was 74.22%, the genetic variation among populations was 20.46%, and the genetic variation among individuals within populations was 5.32%. The genetic variation of the tested materials mainly came from populations and individuals.
4. Discussion
Guangxi wild tea plants are widely distributed with unique local characteristics [15,16]. Studies have demonstrated the high genetic diversity and significant utilization value in several wild tea plant resources from Guangxi, including Jinxiu wild tea [17], the Sanjiang tea plant population [18], and wild tea resources from Liubao Town in Wuzhou, Guangxi [19]. After long-term natural selection or artificial domestication, wild tea plants showed diversity and stability variation from molecular level to phenotypic traits, which laid a foundation for the screening of tea plant germplasm resources and genetic breeding research [20].
Plant phenotypic traits are affected by the interaction of plant genotype and growth environment. The description and identification of traits are the most basic methods and approaches for germplasm resources research. The coefficient of variation and diversity index are important indicators to reflect the genetic diversity of crops, and are important basis for distinguishing different populations of tea plants [21]. In this study, the diversity of descriptive phenotypic traits and numerical phenotypic traits of 40 tea plant samples were analyzed. The coefficient of variation of 18 descriptive phenotypic traits ranged from 15.76% to 50.64%, with an average of 29.48%, all of which exceeded 10% [22], indicating that these descriptive traits had poor consistency and were rich in variation. The diversity index of descriptive phenotypic traits ranged from 0.38 to 1.04, and the mean value was 0.76. It was higher than the diversity index of descriptive phenotypic traits reported by 195 arbor tea trees in Guizhou (0.67) [8], Sandu wild tea (0.26) [23], and large tea plants in central Guizhou (0.42) [24]. The coefficient of variation of numerical phenotypic traits ranged from 0.32 to 0.57, indicating that these numerical phenotypic traits had poor consistency and were rich in variation, which was higher than that of Gaolu tea population [25]. The diversity index of numerical phenotypic traits ranged from 1.70 to 1.93, with an average of 1.86, which was higher than 1 (the average value of tea resources in China was 0.96) [8]. It indicated that the phenotypic traits of the tested tea plants resources were rich in variation. Cluster analysis of phenotypic traits showed that the materials in each group could be distinguished by population, indicating that the phenotypic traits of each group were significantly different. The three groups were distinguished by tree-structured, branch density, bud leaf coloration, leaf color, leaf insertion angle and leaf size, leaf surface morphology, leaf quality and serration acuteness, indicating that these nine phenotypes were the main factors for the differences in phenotypic traits of the tested tea resources.
In this study, the average number of alleles (Na) detected by 14 pairs of SSR primers was 8.36, the average number of effective alleles (Ne) was 4.20, the average Shannon diversity index (I) was 1.56, and the average polymorphism information content (PIC) was 0.66. It is higher than the mean values of Na, Ne and PIC reported in the analysis of genetic diversity of Shiqian tea resources based on SSR molecular markers [26]. It is higher than the average value of Na, Ne and I reported in a study on the genetic diversity and population structure of wild tea germplasm from Badong County, Hubei Province [27]. The above results indicate that the 14 pairs of SSR primers screened in this study are highly polymorphic and that the alleles are evenly distributed within the population, making them suitable for subsequent research.
The genetic diversity analysis of 7 tea plant populations in Chongzuo using 14 pairs of SSR primers showed that the mean value of Shannon diversity index (I) was 0.89. The I value is a parameter that measure the level of genetic diversity within a population. Higer values of I indicate increased population diversity and richer genetic diversity [28]. The average Shannon diversity index in this study was higher than the average Shannon diversity index of 122 Yunnan wild tea plants (0.88) [29], and of 38 Huangjincha individuals (0.55) [30]. According to the concept of heterozygosity, when the heterozygosity is greater than 0.5, the population has relatively rich genetic diversity, and the closer the observed heterozygosity value is to the expected heterozygosity value, the higher the genetic diversity of the population [31,32]. The mean values of observed heterozygosity and expected heterozygosity in this study were 0.56 and 0.50 respectively, which were relatively close to each other, higher than the mean values of observed heterozygosity (0.32) and expected heterozygosity (0.28) of Anxi tea resources in Fujian [33], and also higher than the mean values of observed heterozygosity (0.37) and expected heterozygosity (0.30) of Wuyishan tea resources [34]. The results of I and heterozygosity in this study indicate that the seven tea tree populations in Chongzuo possess high genetic diversity.
The genetic structure of a population is defined by the composition of its subgroups, which in turn reveals the distinctive characteristics of those subgroups. In this study, ΔK is the largest when K is 2, indicating that the optimal population number is 2. According to the method of Evanno et al. [35], the 40 analyzed tea plant samples were clustered into two subgroups. Five materials in subgroup II were all from QS, and the Q value was less than 0.6. The research showed that when the Q-value of a test material in the subgroup exceeded 0.6, it suggested a relatively simple genetic background; conversely, a lower Q-value corresponded to a more complex genetic relationship [36]. The results of this study showed that the genetic background of QS was complex.
Analysis of the genetic relationships estimated that the genetic distance among the seven tea tree populations ranged from 0.24 to 0.74. When the genetic distance was 0.20, the QS was clustered into one group alone, and there was a clear boundary between the group of QS and the group of other six populations. The genetic distance among the 40 tea plant individuals ranged from 0.20 to 0.95, which was greater than that among populations, indicating substantial genetic differences among individuals. When the individual genetic distance was 0.33, the 40 samples were divided into three groups, and five samples from the QS were clustered into one group. The genetic distance clustering between populations and individuals showed that the QS and the five samples contained had specific genetic background, which may be due to long-term natural selection and certain genetic variation. QS can be used as one of the key materials for screening excellent tea resources.
Genetic drift, mating system, and gene flow are important factors affecting genetic structure. Among them, gene flow can counteract genetic drift and reduce genetic differentiation [37]. It has been reported that the genetic differentiation coefficient (Fst) reflects the degree of population differentiation, with a larger Fst value indicating a higher level of differentiation [38]. Gene flow (Nm) can measure the level of gene flow among populations. Generally, when Nm is greater than 4, the gene flow among populations is sufficient. If Nm is less than 1, it is less [39]. In this study, the (Fst) among the QS and the other six tea populations was greater than 0.25 and the Nm was less than 1. It is speculated that the QS may be due to the increase of genetic differentiation caused by genetic drift, resulting in a high degree of genetic differentiation among the QS and the other six tea populations. There was a certain degree of gene flow among the other 6 populations (except QS), and the degree of genetic differentiation was not high. This was consistent with the results of molecular variance analysis between populations (the genetic variation between populations was 20.46%, and the variation between individuals was 74.23%). The tea plant populations in different regions are affected by natural selection, artificial selection, geographical environment and distribution area, which will cause the change of population genetic structure, the gradual increase of genetic difference and the formation of certain genetic differentiation. The comparative analysis of genetic structure and genetic differentiation of tea plant samples in Guangdong and Guangxi showed that the degree of genetic differentiation was large due to natural environmental factors [40]. In this study, the altitude of QS was 716.53 meters, which was higher than that of other tea populations. Therefore, it was speculated that the altitude difference was one of the reasons for the high genetic differentiation of this population.
In this study, both the population-based and individual-based kinship clustering analyses, derived from phenotypic clustering and SSR marker-based population genetic structure, successfully distinguished the five tea plants resources from the Qingshan (QS) population. Furthermore, the Mantel test indicated a significant correlation between phenotypic and genetic distances, suggesting that the five QS resources possess distinct phenotypic characteristics and a complex genetic background. Except for the five resources of QS, the consistency of phenotypic clustering and SSR marker clustering results of other resources was poor, which has been reported in other crops [41,42]. The phenotype observed in this study is only a part of the phenotype of wild tea resources, and the SSR marker represents a difference in DNA, and this genetic difference has little or no correlation with phenotype. On the other hand, most phenotypic traits are determined by multiple genes. The number of SSR primers in this study is limited and cannot represent all the DNA information of the observed phenotypic traits.
5. Conclusions
The genetic differences among wild tea resources in Chongzuo area of Guangxi are large, the genetic basis is wide, and the genetic diversity is rich, especially the genetic background of QS. This result provides a reference for further excavation and utilization of Chongzuo wild tea germplasm resources, cultivation of new tea varieties and development of new tea products in the future. In addition, in order to avoid the loss of characteristics due to the mixed evolution among different tea plant populations, key areas should be selected or resource nurseries should be established to protect the wild tea plants germplasm resources in Chongzuo.
References
- 1. Li W, Shi X, Guo W, Banerjee AK, Zhang Q, Huang Y. Characterization of the complete chloroplast genome of Camellia granthamiana (Theaceae), a Vulnerable species endemic to China. Mitochondrial DNA B Resour. 2018;3(2):1139–40. pmid:33474444
- 2. Yang YQ, Juan T, Wang F, Peng SL, Chen J, Tan MY. Research and application progress in chloroplast genome of tea plant. Biotechnology Bulletin. 2024;40(02):20–30.
- 3. Wang LB, Huang LY, Teng CQ, Wu LY, Cheng H, Yu CP. Genetic and Phylogenetic Analysis for Germplasm Resources of Camellia sinensis from Wuzhou City. J Tea Science. 2022;42(05):601–9.
- 4. Wen LX. Process optimization and quality analysis for wild tea in Jinxiu County of Guangxi. Food Research and Development. 2019;40(09):87–94.
- 5. Wang PS, Yu FL. The geographic distribution, diversity and utilization of wild tea camellias in China. J Tea Science. 2002;:105–8.
- 6.
Yao MZ. Studies on genetic diversity and structure of tea germplasm in China based on ISSR and ETS-SSR markers. Zhejiang: Zhejiang University. 2009.
- 7. Zheng JY, Xie MW, Ceng ZH, Ya H, Yan CY. Genetic diversity and primitive analysis of leaf phenotypic traits of wild tea in Yingde. J Tropical and Subtropical Botany. 2024;32(02):219–28.
- 8. Yang DX, Qiao DH, Yang H, Chen J, Li Y, Liang SH, et al. Genetic Diversity of Guizhou Arboreous Tea Plants Based on Phenotypic Traits and Biochemical Components. China Tea. 2023;45(07):37–45.
- 9. Mu XT, Liao ZC, Ling CJ, Zheng X, Lin JY, He JP. Genetic diversity and relationship analysis of 55 wild tea germplasm from Qingyuan based on SSR markers. China Tea. 2023;45(02):36–43.
- 10. Huang LC, Chen HY, Luo YJ, Tao YL, Lan ZQ. Study on Genetic Diversity of Camellia taliensis Population in Lancang River Basin Based on SSR Markers. Journal of Agricultural Biotechnology. 2025;33(01):68–78.
- 11.
Chen L. Description specification and data standard of tea germplasm resources. China Agriculture Press. 2007.
- 12. Cai YM, Jiang SF, Mao GG, Li MD, Diao HL, Wu DF. Tea germplasm resources: Genetic diversity and relationship analysis based on SSR molecular markers. Chinese Agricultural Science Bulletin. 2023;39(30):54–60.
- 13. Shi BX, Zheng MX, Xiang YP, Wang B, Shi JJ, Chen JJ. Genetic diversity analysis and molecular ID construction of tea resources in Wangcang County. China Tea. 2023;45(07):46–53.
- 14. Guo RH, Liu ZZ, Cai YL, Cao MJ, Shi YS, Song YC. Evaluation of phenotypic diversity for two maize landrace populations having historical importance in China. J Maize Sciences. 2009;17(02):24–8.
- 15. Zhou YH, Qiao XY, Ma CL, Qiao TT, Jin JQ, Yao MZ. Genetic diversity and structure of tea landraces from Guangxi based on EST-SSR analysis. Scientia Silvae Sinicae. 2011;47(03):59–67.
- 16. Li CC, Deng HQ, Zhuge TQ. The present situation, issues and protection suggestions of wild ancient tea trees in Guangxi. J Guangxi Agriculture. 2018;33(04):44–6.
- 17. Chen YY, Yuan SS, Wu CL, Lai XF, Zhao WF, Zhu Y. Molecular fingerprint and diversity analysis of wild tea trees in Jinxiu county, Guangxi province. Guangdong Agricultural Sciences. 2016;43(03):60–5.
- 18. Meng RG, Chen L, Xu Y, Lin W, Zhou QW, Xie YL. Genetic diversity analysis of tea genetic resources in Sanjiang, Guangxi. J Tea Science. 2023;43(02):147–58.
- 19. Huang HC, Jiang WX, Liang CX, Yao Q, Nie HQ, Bai TD. EST-SSR-based Analysis on Genetic Diversity of Ancient Liupao Tea Trees and Their Progeny in Guangxi. Molecular Plant Breeding. 2021;19(07):2410–8.
- 20. Ding Y, Zheng XX, Huang HT, Mao YX, Zhao Y. Analysis of agronomic traits and genetic diversity of four major tea populations in Zhejiang Province, China. Acta Agriculturae Zhejiangensis. 2023;35(02):364–72.
- 21. Ding Z, Li YX, Yuan Y, Wang HY, Liu XS, Jiang CJ. Phenotypic Traits and Genetic Diversity of Elite Tea Population in Anhui (China). J Tea Science. 2018;38(02):155–61.
- 22. Zhang LJ, Li GS, Bai CK, Wen MM, Zhang ZQ. Evaluation and correlation analysis on quantitative traits of Cornusofficinalis germplasm resources. J Plant Genetic Resources. 2012;13(04):655–9.
- 23. Yang C, Guo C, Qiao DH, Guo Y, Liang SH, Li Y. Diversity analysis of phenotypic characters and biochemical components in Sandu wild tea plant. Jiangsu Agric Sci. 2023;51(08):111–9.
- 24. Huang Z, Song QF, Yin J, Li F, Niu SZ. Genetic Diversity of Tea Germplasm in Central Guizhou Based on Morphological Characters. Molecular Plant Breeding. 2021;19(15):5189–203.
- 25. Wu YT, Wang XY, Wei LH. Survey and analysis of the diversity and adaptability of phenotypic traits in the population of Gaolu Tea. Modern Agricultural Sci Technol. 2024;(04):117–9.
- 26. Lin KQ, Li YX, Wei J, Wang AR, Yu YC, Duan CL, et al. Southwest China Journal of Agricultural Sciences. 2024;37(07):1435–41.
- 27. Cui QM, Liang JB, Ma HJ, Hu SL, Chen QH, Wu LY. Genetic diversity and population structure relationship analysis of wild tea germplasm resources in Badong County, Hubei Province. J Tea Science. 2024;44(02):193–206.
- 28. Sork VL. Genomic Studies of Local Adaptation in Natural Plant Populations. J Hered. 2017;109(1):3–15. pmid:29045754
- 29. Zhou M, Li YY, Sun XM, Wang JJ, Xie J, Cheng H. Genetic diversity assessment of ancient tea plants in Yunnan province of China revealed by EST-SSR markers. Jiangsu Agricultural Sciences. 2013;41(12):22–7.
- 30. Yang Y, Liu Z, Zhao Y, Liang GQ, Zhao X. Genetic Diversity and Relationship of Huangjincha Cultivar Based on EST-SSR Markers. J Tea Science. 2009;29(03):236–42.
- 31. Tang CW, Yu X, Liu WJ, Hao Z, Shi L, Xing WT. Stndy on genetic diversity and genetic differentiation of 13 sheep populations in Xinjiang Uygur Autonomous Region. J Domestic Animal Ecology. 2011;32(01):13–9.
- 32. Hansson B, Westerberg L. On the correlation between heterozygosity and fitness in natural populations. Mol Ecol. 2002;11(12):2467–74. pmid:12453232
- 33. Zhu YY, Yu WT, Gao SL, Lv SY, Wang P, Jin WM. The diversity of tea germplasm resources and genetic relationship of ‘Tieguanyin’-derived varieties in Anxi, Fujian, China. Acta Agriculturae Zhejiangensis. 2024;36(07):1591–601.
- 34. Liu C, Yu W, Cai C, Huang S, Wu H, Wang Z, et al. Genetic Diversity of tea plant (Camellia sinensis (L.) Kuntze) germplasm resources in wuyi mountain of china based on single nucleotide polymorphism (SNP) markers. Horticulturae. 2022;8(10):932.
- 35. Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol. 2005;14(8):2611–20. pmid:15969739
- 36. Wu C-L, Zhang Q-Q, Dong B-X, Li S-F, Zhang C-Q. Analysis of Genetic Structure and Genetic Relationships of Partial Maize In-bred Lines in China. A A S. 2010;36(11):1820–31.
- 37. Yeh FC, Boyle TJB. Population genetic analysis of codominant and dominant markers and quantitative traits. Belg J Bot. 1997;129.
- 38. Ren Z, Bai Q, Su SC. Genetic diversity analysis of Pistacia chinensis Bunge based on SSR markers. Acta Botanica Boreali-Occidentalia Sinica. 2022;42(09):1530–9.
- 39. Yang J, Zhang LL, Zhang WJ, Chen LH, Zheng GH, Li YJ. Population Structure and Genetic Differences of Tea Germplasm Resources in Fujian. J Tea Science. 2023;43(06):769–83.
- 40. Qiao XY, Qiao TT, Zhou YH, Jin JQ, Ma CL, Yao MZ. Comparative analysis of genetic structure and differentiation of Guangdong and Guangxi tea germplasms based on EST-SSR markers. Scientia Agricultura Sinica. 2011;44(16):3297–311.
- 41. Liu YQ, Tian ZM, Qi LP, Gong XC, Feng Y, Zhang YS. Fingerprint construction and genetic diversity analysis of 20 potato varieties (lines). Jiangsu Agricultural Sci. 2022;50(21):41–6.
- 42. Song LS, Yang X, Xia JJ, Yu GQ, Peng ST, Duan YH. Phenotypic comparison and genetic diversity of the starch-type potatoes. J Plant Genetic Resources. 2025;26(06):1202–14.