Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Core collection construction and genetic diversity analysis of tea plant (Camellia sinensis [L:] O. Kuntze) accessions in Huangshan city using SSR markers

  • Xu Ruan,

    Roles Conceptualization, Data curation, Investigation, Methodology, Writing – original draft

    Affiliation Tea Research Institute, Anhui Academy of Agricultural Sciences, Hefei, Anhui, China

  • Qiong Wu,

    Roles Conceptualization, Data curation, Project administration, Supervision, Writing – review & editing

    Affiliation Tea Research Institute, Anhui Academy of Agricultural Sciences, Hefei, Anhui, China

  • Xiaoyu Jiao,

    Roles Investigation

    Affiliation Tea Research Institute, Anhui Academy of Agricultural Sciences, Hefei, Anhui, China

  • Dandan Liu,

    Roles Software, Visualization

    Affiliation Tea Research Institute, Anhui Academy of Agricultural Sciences, Hefei, Anhui, China

  • Minghui Sun,

    Roles Validation

    Affiliation Tea Research Institute, Anhui Academy of Agricultural Sciences, Hefei, Anhui, China

  • Leigang Wang,

    Roles Investigation

    Affiliation Tea Research Institute, Anhui Academy of Agricultural Sciences, Hefei, Anhui, China

  • Wenjie Wang

    Roles Conceptualization, Funding acquisition, Project administration, Resources, Writing – review & editing

    Affiliation Tea Research Institute, Anhui Academy of Agricultural Sciences, Hefei, Anhui, China

Abstract

Assessing genetic diversity and building a core collection is essential to advancing tea plant breeding. In this study, ten SSR markers exhibiting robust amplification and polymorphism were employed to genotype 292 tea accessions sourced from various regions in Huangshan city. The results revealed significant genetic variation, encompassing 180 alleles. Genetic structure was evaluated using neighbor-joining clustering, principal coordinate analysis, and Structure analyses, which categorized the tea accessions into two primary clusters. The genetic diversity within these clusters demonstrated high similarity, likely due to their close geographical proximity. A core collection was established utilizing Core Hunter software, resulting in the selection of 35% of the accessions to effectively represent the genetic diversity of the entire collection. This core collection comprises 102 tea accessions, preserving a high percentage of allele richness and genetic diversity. This research offers valuable insights into genomics research and the sustainable management of tea plant genetic resources in Huangshan city.

Introduction

Tea (Camellia sinensis [L.] O. Kuntze) is a perennial woody plant and the world’s most popular beverage crop, originating in southwest China with a rich history spanning over 3,000 years [13]. As a self-incompatible species, it exhibits significant genetic diversity due to prolonged natural selection [4]. The climate in Anhui province is particularly suitable for tea cultivation [5]. The region boasts an abundance of tea germplasm, with tea production primarily concentrated in the southern areas of the Yangtze River and the Dabie Mountains. Huangshan city, situated in Anhui province, is renowned for its long-standing tradition of producing famous teas such as Huangshan Maofeng, Taiping Houkui, and Keemun black tea, alongside a wealth of local tea landraces. However, the widespread adoption of clonal tea varieties has led to a decline in these landraces in Huangshan city in recent years. Furthermore, excessive harvesting of wild tea plants has contributed to a reduction in natural populations. To preserve resource diversity and support breeding research, it is essential to gather and conserve wild and landrace resources in Huangshan city. In response, the Tea Research Institute of Anhui Academy of Agricultural Sciences established the Anhui Landraces Germplasm Tea Repository in 2020. This initiative has successfully collected over 90,000 landrace seeds from 27 major tea-producing counties and districts, encompassing nearly the entire tea-producing area in Anhui province. The repository features separate rows designated for planting landraces from each area, with each row containing approximately 100 landraces individuals. Overall, the repository houses nearly 100,000 landraces individuals, including about 30,000 tea landraces individuals sourced from 300 villages within Huangshan city.

It is essential to construct a core collection that accurately represents genetic diversity while minimizing the number of accessions to effectively protect and utilize diverse landraces [6,7]. Currently, research on core collection has been conducted across various crops, including cotton [8], peanut [9], potato [10] and radish [11]. A core collection comprising 532 accessions was randomly selected from the China National Germplasm Tea Repository, based on cultivated region grouping and phenotypic data, with approximately 20% of the accessions included [12]. In 2004, 126 tea accessions were selected from a total of 615 tea plants in China as a core collection; genetic diversity and agronomic trait analyses indicated that this collection was well representative [13]. Recently, research on core germplasm collections has primarily relied on morphological and molecular markers [14]. The main molecular markers employed include Amplified fragment length polymorphism (AFLP), Random amplified polymorphic DNA (RAPD), Single nucleotide polymorphism (SNP), and Simple Sequence Repeats (SSR), among other technologies [1517]. SSR is particularly favored due to its high polymorphism, substantial informational content, and co-dominant inheritance.

23 SSR markers were employed to analyze 788 tea accessions from around the world, resulting in the construction of a core collection comprising 192 accessions that encapsulated the variation present within the 788 accessions [18]. A targeted core collection was subsequently established using 33 SSR markers for 462 tea accessions [19], which included 100 accessions (21.6% of the total collection) originating from four regions: 73 from Korea, 22 from China, 4 from Japan, and 1 from India. Additionally, 171 tea accessions were characterized from Sri Lankan and Indian germplasms phenotypically and genotypically using 28 SSR markers, identifying 21 Sri Lankan and 18 Indian accessions as part of the core collections [20]. The population structure of 573 tea accessions in Anhui Province was analyzed using 60 SSR markers, leading to the development of a core collection of 115 accessions that effectively represented the genetic diversity of the original samples [21]. Notably, there remains a gap in research regarding the core collection of tea landraces in Anhui Province. In this study, 292 tea accessions were selected from Huangshan city, sourced from the Anhui Landraces Germplasm Tea Repository, comprising 287 landraces and 5 cultivars. To assess genetic diversity and population structure, ten highly polymorphic SSR primers were utilized. The objective was to construct a core collection and provide insights for the conservation and utilization of tea accessions in Anhui province.

Materials and methods

Primer screening

Ten SSR markers were selected from the 74 pairs of tea plant SSR markers developed by Ma [22] (Table 1). Subsequently, 10 pairs of SSR primers were synthesized by Shanghai Sheng Gong Co., Ltd. (Shanghai, China) for further experiments.

thumbnail
Table 1. Information of 10 SSR primers used in this study.

https://doi.org/10.1371/journal.pone.0322209.t001

Plant materials

A total of 287 landraces were sampled from the Anhui Landraces Germplasm Tea Repository. They were collected from towns in She County (SX), Xiuning County (XN), Qimen County (QM), Yi County (YX), Huizhou District (HZ), and Huangshan District (HS). The number of towns was 13, 20, 16, 7, 5, and 8 respectively, and the number of selected villages of each town was 69, 48, 49, 34, 42, and 50 respectively. One landrace from each village was selected randomly. In addition, 5 cultivars, such as Wancha No. 4 (WC04), Wancha No. 5 (WC05), Anhui No. 1 (AH01), Anhui No. 3 (AH03), and Caoxi No. 1 (CX01) were included, bringing the total to 292 experimental materials (S1 Table).

DNA extraction and PCR amplification

Young leaves of tea plants were collected and extracted using a Tiangen Express plant genomic DNA kit (Beijing, China). The quality of the extracted DNA was assessed via 1% agarose gel electrophoresis. The DNA was subsequently diluted to a concentration of 30 ng/µL and stored at −20°C for future experiments. For the PCR experiment, reagents were sourced from Jinsha Corporation (Beijing, China). The concentration of the PCR product was estimated using agarose gel electrophoresis; 1 µL of the product was diluted to 10 ng/µL with water. The mixture was prepared with an internal plate at a HiDi ratio of 1:130. The diluted 1 µL product was combined with the mixture at a ratio of 1:9, denatured at 95°C for 5 minutes, and then placed on the sample stand of an ABI3730XL sequencer (ABI, USA) for capillary electrophoresis detection.

Data analysis

GenAlEx v6.5 [23] was employed to calculate various genetic parameters for each microsatellite marker, including the number of alleles (Na), effective number of alleles (Ne), observed heterozygosity (Ho), expected heterozygosity (He), unbiased expected heterozygosity (Uhe), hybridization coefficient (F), and Shannon’s Information index (I). The polymorphism information content (PIC) was determined using PowerMarker V3.25 software [24]. Additionally, F-statistical calculations (FIS, FIT, and FST) and principal coordinate analysis (PCoA) were conducted in GenAlEx v6.5, in conjunction with Microsoft Excel. Genetic distances were clustered using PowerMarker V3.25 [24] following the neighbor-joining clustering [25], and the resulting phylogenetic trees were enhanced and edited using the ITOL website [26].

Structure V2.3.4 [27] was employed for population structure analysis utilizing a Bayesian model-based hybrid analysis approach. The Markov chain Monte Carlo (MCMC) was configured with 100,000 iterations and a run length of 100,000, while the number of genetically homogeneous clusters (K value) was set to range from 2 to 20, with 10 repeated runs for each K value. The optimal K value was determined using the highest ΔK method as proposed by Evanno [28]. Subsequently, the optimal set of core germplasm collection was extracted using Core Hunter V3.0 [29], which employs a local search algorithm to maximize genetic diversity and allele richness. Ten sample components were tested (10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, and the initial component) with Core Hunter V3.0, based on the reported distribution of core germplasm components of woody plants [3032]. T-tests were conducted between the core and initial germplasm groups using Microsoft Excel. The smallest core germplasm (P≤0.05) that was not significantly different from the 100% population group was selected to construct the optimal core collection [33].

Results

Genetic diversity of tea accessions

A total of 180 alleles were identified within 292 tea accessions using 10 SSR markers. Each tea plant was successfully genotyped with these alleles, demonstrating the strong discriminatory ability of the SSR markers (Table 2). The markers exhibited significant genetic variation, with the number of alleles per locus (Na) ranging from 15 to 22 (mean = 18.00), the effective number of alleles (Ne) from 4.00 to 9.71 (mean = 7.02), observed heterozygosity (Ho) from 0.14 to 0.61 (mean = 0.34), expected heterozygosity (He) from 0.75 to 0.90 (mean = 0.85), unbiased expected heterozygosity (uHe) from 0.75 to 0.90 (mean = 0.85), Shannon’s diversity index (I) from 1.74 to 2.56 (mean = 2.22), and polymorphic information content (PIC) from 0.72 to 0.89 (mean = 0.83). The SSR marker TM045 exhibited the highest allele count (22), while TM069, TM076, and TM085 had the lowest (15). Observed heterozygosity was consistently lower than expected heterozygosity across all markers. The high level of polymorphism was evident, as indicated by a PIC value exceeding 0.7 for all markers. Notably, the TM077 marker proved particularly effective in capturing genetic diversity, with the highest observed PIC value of 0.89.

thumbnail
Table 2. Genetic diversity parameters for the 292 accessions using SSR markers.

https://doi.org/10.1371/journal.pone.0322209.t002

The genetic diversity index values displayed minimal variation in Huangshan city. The SX population exhibited the highest values for Na and I, while the HS population demonstrated the highest values for Ne and Ho, along with the lowest fixation index (F) (Table 3).

thumbnail
Table 3. Genetic diversity parameters for six populations.

https://doi.org/10.1371/journal.pone.0322209.t003

Genetic structure of tea accessions

The genetic relationships among the test materials were examined by constructing a phylogenetic tree using the neighbor-joining clustering, based on Nei’s genetic distance (Fig 1). The 292 accessions were broadly categorized into two groups: Group I and Group II. Notably, the tea accessions from the test regions appeared in both groups and were not classified according to geographical distance. This suggests that the genetic diversity among the accessions is quite similar, likely due to their close geographical proximity, which has resulted in the clustering of these accessions.

thumbnail
Fig 1. Neighbor-Joining cluster diagram of 292 tea accessions.

The Neighbor-Joining tree was constructed based on Nei’s genetic distance. Group I (blue) includes 112 accessions, and Group II (red) includes 180 accessions.

https://doi.org/10.1371/journal.pone.0322209.g001

A total of 292 tea plant accessions were analyzed using principal coordinate analysis (PCoA) with GenAlEx version 6.503 to validate the findings of the phylogenetic examination. The PCoA indicated that the first two coordinates explained 5.66% and 5.48% of the total genetic variation, respectively (Fig 2). This analysis demonstrated that the tea accessions were grouped into two distinct clusters, corroborating the conclusions drawn from the phylogenetic study.

thumbnail
Fig 2. Principal coordinate analysis of 292 tea accessions.

Due to the close relationship, all the accessions were not distinguished, and could be divided into two groups by auxiliary lines.

https://doi.org/10.1371/journal.pone.0322209.g002

The analysis of the population’s genetic structure revealed the highest ΔK value at K=2, suggesting that the model’s fit was most accurate when the accessions were categorized into two groups (Fig 3).

thumbnail
Fig 3. Structure analysis of 292 tea accessions.

(A) The population structure of tea plants was determined using STRUCTURE 2.3.4 software (K = 2–6), (B) Estimated average likelihood L(K) distribution (mean ± SD) from 2 to 20 possible clusters (K), (C) Delta K distribution based on the rate of change in L(K) between continuous K values.

https://doi.org/10.1371/journal.pone.0322209.g003

The accessions in Cluster I and Cluster II comprise diverse populations that closely align with the results obtained from Neighbor-Joining cluster. These findings suggest that there is minimal variation in the genetic diversity of tea accessions in Huangshan city.

Core collection development and assessment

Nine candidate core collections of varying sizes were generated using the Core Hunter V3.0 software. To identify the most efficient core size, these nine core sizes were compared against the entire population based on six genetic diversity parameters: Na, Ne, Ho, He, uHe, and I (Table 4).

thumbnail
Table 4. Comparison of genetic diversity parameters of different fractions of core collections.

https://doi.org/10.1371/journal.pone.0322209.t004

The core sizes comprising 10% of the total germplasm were found to be significantly different (P ≤ 0.05) from the full population regarding the effective population size (Ne) and expected heterozygosity (He). Additionally, core sizes representing 15% of the total exhibited significant differences (P ≤ 0.05) in the fragrance information index (I). Specifically, when core collections constituted 10% of the total germplasm, the genetic diversity parameters Ne and He demonstrated significant differences (P ≤ 0.05). Conversely, at 15% core collection size, the fragrance information index I displayed significant differences. Furthermore, when the number of alleles accounted for 35%, the number of alleles (Na) showed significant differences. However, other parameters, including observed heterozygosity (Ho), unbiased expected heterozygosity (uHe), and fixation index (F), did not exhibit substantial differences at any percentage.

As a result, it was determined that a core size of 35% represented the most optimal core germplasm collection, comprising 102 tea accessions. This core collection included 17 accessions from QM, 22 from SX, 14 from XN, 14 from YX, 18 from HS, and 15 from HZ (Fig 4B). The composition was as follows (S2 Table). In comparison to the entire collection, the core collection maintained 82.62% of Na, 100.48% of I, and 115.15% of Ho. Furthermore, it demonstrated a 108.77% increase in Ne, a 102.41% increase in He, and a 103.57% increase in uHe. The research indicates that the core and the whole collection exhibit a similar distribution (Fig 4A).

thumbnail
Fig 4. (A) PCoA of the core collection (n=102) and whole collection (n=292) were based on genotypic data from SSR markers. (B) Comparison of core collection and initial whole collection.

https://doi.org/10.1371/journal.pone.0322209.g004

Discussion

Genetic diversity and population structure

The identification and evaluation of germplasm resources are essential for their protection and utilization, serving as a critical foundation for germplasm innovation, genetic enhancement, and the development of plant breeding [34]. Currently, the population structure and genetic differences of 208 tea landraces in Fujian province have been studied [35]. Additionally, Wang examined the genetic diversity and phylogenetic relationships of 50 ancient tea accessions in Guangxi province [36]. Anhui Province is rich in tea landrace resources, including Huangshan, Qimen, Jinji, and other species [37]. However, there is a scarcity of studies focusing on the population structure and genetic differences of landraces, with most research primarily addressing clonal varieties and related aspects.

In this research, a substantial number of tea landraces from Huangshan city were collected for the first time to analyze their genetic diversity and population structure. The findings indicated that the population from She County exhibited the highest genetic variability (Na = 13.60, I = 2.12) (Table 4). This is attributed to the fact that She County has the largest tea plantation area in Huangshan city, amounting to 18,900 hectares, with the majority being tea landraces [38]. The slow promotion of clonal tea plants has contributed to the effective protection of the landraces. The genetic distance among the populations in Huangshan city is relatively small, allowing for the population structure to be broadly categorized into two groups. The results of Neighbor-Joining, PCoA, and Structure analyses support this conclusion.

The core collection of tea accessions

Determining the number of core collections is essential. Currently, the proportion of core collections for woody plants typically ranges from 10% to 45% [39]. For instance, the Chinese fir [31] has a core collection proportion of 42.9% (300 out of 700), while Toona sinensis [40] accounts for 20% (208 out of 1040). Ginkgo biloba L. [41] has a core collection proportion of 12.3% (27 out of 101), and Camellia oleifera C.Abel [42] represents 15% (45 out of 300). In this study, we employed Core Hunter 3.0 software to select 35% of core collections from the accessions of Huangshan city, resulting in a total of 102 accessions. Among these, 19 accessions were selected, representing 17.6% of the QM accessions. Additionally, 22 accessions were selected from SX (21.6%), 14 from XN (13.7%), 14 from YX (13.7%), 18 from HS (17.6%), and 15 from HZ (14.7%).

Allele retention is a crucial factor in evaluating the construction of core germplasm. For instance, when establishing the seed bank for sugarcane (Saccharum officinarum L.), the objective was to preserve at least 70% of allele diversity and other indicators of genetic diversity [43]. A comparison between the core collection and the entire collection revealed that the core collection exhibited higher genetic diversity, retaining 82.62% of allele richness. The indicators Ne, I, Ho, He, and uHe not only did not decrease but also showed an increase, indicating a balanced geographical distribution (Table 4). Based on these findings, the construction of core collections offers good representation and coverage. This study represents the first evaluation of genetic diversity in tea landrace resources in Huangshan city, and the successful establishment of core collections provides a foundation for the identification and conservation of tea landrace resources in China.

Conclusions

In this study, ten pairs of SSR markers were utilized to analyze the genetic diversity and structure of tea accessions in Huangshan city. The results indicated significant genetic diversity among the accessions, with a close genetic distance observed between county populations. This close genetic distance may be attributed to the geographical proximity of the counties. A core collection was established, encompassing 35% of the entire accessions, which included 19 accessions from Qimen County, 22 from She County, 14 from Xiuning County, 14 from Yi County, 18 from Huangshan District, and 15 from Huizhou District. The selection and establishment of this core germplasm provide a crucial foundation and theoretical support for the effective management, protection, and utilization of germplasm resources in Huangshan city.

Supporting information

S1 Table. Geographic distribution of 292 collected tea accessions.

https://doi.org/10.1371/journal.pone.0322209.s001

(DOCX)

S2 Table. Information of the reserved accessions.

https://doi.org/10.1371/journal.pone.0322209.s002

(DOCX)

Acknowledgments

The researchers would like to acknowledge the agricultural administration department of Huangshan city to provide the excellent support in this project.

References

  1. 1. Mukhopadhyay M, Mondal TK, Chand PK. Biotechnological advances in tea (Camellia sinensis [L.] O. Kuntze): a review. Plant Cell Rep. 2016;35(2):255–87. pmid:26563347
  2. 2. Wang X, Feng H, Chang Y, Ma C, Wang L, Hao X, et al. Population sequencing enhances understanding of tea plant evolution. Nat Commun. 2020;11(1):4447. pmid:32895382
  3. 3. Liu C. The origin and distribution of tea trees and tea in China. JTSR. 2023;12(2):1–8.
  4. 4. Tan L-Q, Wang L-Y, Wei K, Zhang CC, Wu LY, Qi GN, et al. Floral transcriptome sequencing for SSR marker development and linkage map construction in the tea plant (Camellia sinensis). PLoS One. 2013;8(11):e81611. pmid:24303059
  5. 5. Su H, Wu W, Wan X, Ning J. Discriminating geographical origins of green tea based on amino acid, polyphenol, and caffeine content through high-performance liquid chromatography: Taking Lu’an guapian tea as an example. Food Sci Nutr. 2019;7(6):2167–75. pmid:31289665
  6. 6. Frankel OH, Brown AHD. Current plant genetic resources - a critical appraisal. “Genetic perspectives of germplasm conservation. In: Arber WK, Llimensee K, Peacock WJ, Stralinger P, editors. Genetic Manipulation Impact on Man and Society. Cambridge: Cambridge University Press; 1984. p. 161-70.
  7. 7. Brown AHD. Core collections: a practical approach to genetic resources management. Genome. 1989;31(2):818–24.
  8. 8. Egan LM, Conaty WC, Stiller WN. Core collections: is there any value for cotton breeding? Front Plant Sci. 2022;13:895155. pmid:35574064
  9. 9. Otyama PI, Kulkarni R, Chamberlin K, Ozias-Akins P, Chu Y, Lincoln LM, et al. Genotypic characterization of the U.S. peanut core collection. G3 (Bethesda). 2020;10(11):4013–26. pmid:32887672
  10. 10. Bamberg J, del Rio A, Kinder D, Louderback L, Pavlik B, Fernandez C. Core collections of potato (solanum) species native to the USA. Am J Potato Res. 2016;93(6):564–71.
  11. 11. Lee YJ, Mun J-H, Jeong Y-M, Joo S-H, Yu H-J. Assembly of a radish core collection for evaluation and preservation of genetic diversity. Hortic Environ Biotechnol. 2018;59(5):711–21.
  12. 12. Wang X, Chen L, Yang Y. Establishment of core collection for Chinese tea germplasm based on cultivated region grouping and phenotypic data. Front Agric China. 2011;5(3):344–50.
  13. 13. LI J, Jiang C. Preliminary construction of tea core germplasm in China. Journal of Anhui Agricultural University. 2004;31(3):282–7.
  14. 14. Herrmann D, Flajoulot S, Barre P, Huyghe C, Ronfort J, Julier B. Comparison of morphological traits and molecular markers to analyse diversity and structure of alfalfa (Medicago sativa L.) cultivars. Genet Resour Crop Evol. 2017;65(2):527–40.
  15. 15. Wang RJ, Gao XF, Kong XR, Yang J. An efficient identification strategy of clonal tea cultivars using long-core motif SSR markers. Springerplus. 2016;5(1):1152. pmid:27504250
  16. 16. Zhou Q, Mu K, Ni Z, Liu X, Li Y, Xu L. Analysis of genetic diversity of ancient Ginkgo populations using SSR markers. Industrial Crops and Products. 2020;145:111942.
  17. 17. Li J-W, Li H, Liu Z-W, Wang Y-X, Chen Y, Yang N, et al. Molecular markers in tea plant (Camellia sinensis): Applications to evolution, genetic identification, and molecular breeding. Plant Physiol Biochem. 2023;198:107704. pmid:37086694
  18. 18. Taniguchi F, Kimura K, Saba T, Ogino A, Yamaguchi S, Tanaka J. Worldwide core collections of tea (Camellia sinensis) based on SSR markers. Tree Genetics Genomes. 2014;10(6):1555–65.
  19. 19. Hyun DY, Gi GY, Sebastin R, Cho GT, Kim SH, Yoo E, et al. Utilization of phytochemical and molecular diversity to develop a target-oriented core collection in tea germplasm. Agronomy. 2020;10(11):1667.
  20. 20. Kottawa-Arachchi JD, Ranatunga MAB, Sharma RK, Chaudhary HK, Attanayake RN, Amarakoon AMT, et al. Morpho-molecular genetic diversity and population structure analysis to enrich core collections in tea [Camellia sinensis (L.) O. Kuntze] germplasm of Sri Lanka and India. Genet Resour Crop Evol. 2023;71(6):2597–616.
  21. 21. TAO L, TING Y, CHEN H, WEN H, XIE H, LUO L, et al. Core collection construction of tea plant germplasm in Anhui Province based on genetic diversity analysis using simple sequence repeat markers. Journal of Integrative Agriculture. 2023;22(9):2719–28.
  22. 22. Ma J-Q, Zhou Y-H, Ma C-L, Yao M-Z, Jin J-Q, Wang X-C, et al. Identification and characterization of 74 novel polymorphic EST-SSR markers in the tea plant, Camellia sinensis (Theaceae). Am J Bot. 2010;97(12):e153-6. pmid:21616837
  23. 23. Peakall R, Smouse PE. GenAlEx 6.5: genetic analysis IN Excel. Population genetic software for teaching and research--an update. Bioinformatics. 2012;28(19):2537–9. pmid:22820204
  24. 24. Liu K, Muse SV. PowerMarker: an integrated analysis environment for genetic marker analysis. Bioinformatics. 2005;21(9):2128–9. pmid:15705655
  25. 25. Gascuel O, Steel M. Neighbor-joining revealed. Mol Biol Evol. 2006;23(11):1997–2000. pmid:16877499
  26. 26. Letunic I, Bork P. Interactive Tree Of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. 2019;47(W1):W256–9. pmid:30931475
  27. 27. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155(2):945–59. pmid:10835412
  28. 28. Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol. 2005;14(8):2611–20. pmid:15969739
  29. 29. De Beukelaer H, Davenport GF, Fack V. Core Hunter 3: flexible core subset selection. BMC Bioinformatics. 2018;19(1):203. pmid:29855322
  30. 30. Belaj A, Dominguez-García M del C, Atienza SG, Martín Urdíroz N, De la Rosa R, Satovic Z, et al. Developing a core collection of olive (Olea europaea L.) based on molecular markers (DArTs, SSRs, SNPs) and agronomic traits. Tree Genetics & Genomes. 2011;8(2):365–78.
  31. 31. Duan H, Cao S, Zheng H, Hu D, Lin J, Cui B, et al. Genetic characterization of chinese fir from six provinces in southern china and construction of a core collection. Sci Rep. 2017;7(1):13814. pmid:29062029
  32. 32. Feng Y, Yang Z, Tan J, Li H, Chen X. Selection of first generation nucleus population of Pinus massoniana in Guangxi. Journal of Northeast Forestry University. 2018;46(12):20–4.
  33. 33. Lv J, Li C, Zhou C, Chen J, Li F, Weng Q, et al. Genetic diversity analysis of a breeding population of Eucalyptus cloeziana F. Muell. (Myrtaceae) and extraction of a core germplasm collection using microsatellite markers. Industrial Crops and Products. 2020;145:112157.
  34. 34. Glaszmann JC, Kilian B, Upadhyaya HD, Varshney RK. Accessing genetic diversity for crop improvement. Curr Opin Plant Biol. 2010;13(2):167–73. pmid:20167531
  35. 35. Yang J, Zhang L, Zhang W, Chen L, ZHENG G, LI Y. Population structure and genetic differences of tea germplasm resources in Fujian. Journal of Tea Science Research. 2023;43(6):769–83.
  36. 36. Xiaoyun W, Shaobo T, Shishi L. ISSR analysis of genetic diversity of ancient tea resources in Guilin. Journal of Guangxi Agriculture. 2023;38(6):18–22.
  37. 37. Wang X, Wang L, Hao X, Li N, Huang J, Ding C. A review and prospects of the development, innovation, and genetic breeding of the tea plant in China. Journal of Huazhong Agricultural University. 2022;41(5):1–8.
  38. 38. Jiang W. Thought and suggestion of high quality development of Shexian tea industry. Local Agriculture. 2024;4(4):16–8.
  39. 39. Liu J, Gao S, Xu Y, Wang M, Ngiam JJ, Rui Wen NC, et al. Genetic diversity analysis of sapindus in china and extraction of a core germplasm collection using EST-SSR markers. Front Plant Sci. 2022;13:857993. pmid:35685004
  40. 40. Dai J, Fan Y, Diao S, Yin H, Han X, Liu J. Construction of core collection and phenotypic evaluation of toona sinensis. Forests. 2023;14(6):1269.
  41. 41. Khojand S, Zeinalabedini M, Azizinezhad R, Imani A, Ghaffari MR. Identification of the core collection in Iranian almond germplasm: utilizing morphological traits and evaluating biochemical properties of the collection. Genet Resour Crop Evol. 2023;71(6):3037–59.
  42. 42. Wang P, Su J, Wu H, Chen Y, Xie Y, Wang H, et al. Analysis of germplasm genetic diversity and construction of a core collection in Camellia oleifera C.Abel by integrating novel simple sequence repeat markers. Genet Resour Crop Evol. 2023;70(5):1517–30.
  43. 43. Balakrishnan R, Nair NV, Sreenivasan TV. Genetic Resources and Crop Evolution. 2000;47(1):1–9.