Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Restriction site-associated DNA sequencing (RAD-seq) of tea plant (Camellia sinensis) in Sichuan province, China, provides insights into free amino acid and polyphenol contents of tea

  • Xiaoping Wang ,

    Contributed equally to this work with: Xiaoping Wang, Minshan Sun

    Roles Conceptualization, Investigation, Methodology, Writing – original draft, Writing – review & editing

    wangxp922@sina.com

    Affiliation Tea Research Institute, Tea Refining and Innovation Key Laboratory of Sichuan Province, Sichuan Academy of Agricultural Sciences, Chengdu, PR China

  • Minshan Sun ,

    Contributed equally to this work with: Xiaoping Wang, Minshan Sun

    Roles Data curation, Writing – review & editing

    Affiliation Tea Research Institute, Tea Refining and Innovation Key Laboratory of Sichuan Province, Sichuan Academy of Agricultural Sciences, Chengdu, PR China

  • Yuanyuan Xiong,

    Roles Writing – original draft

    Affiliation Tea Research Institute, Tea Refining and Innovation Key Laboratory of Sichuan Province, Sichuan Academy of Agricultural Sciences, Chengdu, PR China

  • Xiao Liu,

    Roles Data curation

    Affiliation Tea Research Institute, Tea Refining and Innovation Key Laboratory of Sichuan Province, Sichuan Academy of Agricultural Sciences, Chengdu, PR China

  • Chunhua Li,

    Roles Writing – original draft

    Affiliation Tea Research Institute, Tea Refining and Innovation Key Laboratory of Sichuan Province, Sichuan Academy of Agricultural Sciences, Chengdu, PR China

  • Yun Wang,

    Roles Writing – original draft

    Affiliation Tea Research Institute, Tea Refining and Innovation Key Laboratory of Sichuan Province, Sichuan Academy of Agricultural Sciences, Chengdu, PR China

  • Xiaobo Tang

    Roles Writing – original draft

    Affiliation Tea Research Institute, Tea Refining and Innovation Key Laboratory of Sichuan Province, Sichuan Academy of Agricultural Sciences, Chengdu, PR China

Abstract

Worldwide, tea is a popular beverage; within the realm of Chinese tea, Sichuan tea holds particular significance for its role in the origin and composition of Chinese tea cultivars. Sichuan tea is noted for its rich content of free amino acids (FAAs) and tea polyphenols (TPs), which has made it an important subject for studying genetic diversity and the genes regulating these compounds. In this study, 139 varieties of tea were collected from areas in Sichuan Province, China, with similar geographical and climatic conditions. The FAA content was approximately 3% and the TP content was approximately 17%. Using RAD sequencing, 5,656,224 variant loci were identified, primarily comprising SNPs (94.17%) and indels (5.83%). Evolutionary analysis revealed that genetic divergence was not closely linked to the collection location. Population structure analysis confirmed a division into two main populations having a similar composition to the phylogenetic clusters. Screening for FAA-related SNPs identified significant loci associated with 33 genes that potentially regulate FAA content. Similarly, TP-related analysis pinpointed 8 SNPs significantly linked to 20 candidate genes. Notably, genetic associations hinted at the genes involved in the stress response and the accumulation of phenolic compounds, enhancing the understanding of determinants of tea quality. This research underscores the potential for molecular breeding based on genetic insights, suggesting pathways to improve the FAA and TP contents in tea. These findings not only provide a solid foundation for exploring gene–chemical interactions but also offer practical strategies for improving the nutritional and sensory attributes of tea cultivars through informed breeding practices.

Introduction

Tea, a magical beverage originating from ancient China, is accompanied by thousands of years of culture and history [1]. At present, tea, as an important economic crop in Asia, Africa, America, and elsewhere, has been widely developed and planted, and has an important contribution to the agricultural economy [2]. In China, tea plants are primarily cultivated in the southwest, south of the Yangtze River, and north of the Yangtze River regions [3, 4]. Following the selection and cultivation of specific flavors and aromas, several renowned tea cultivars have emerged, including “Longjing,” “Dahongpao,” and “Pu-erh.”

Sichuan is a major tea-producing area in China in which Ya’an, Chengdu, and Emeishan are famous varieties of tea [4]. In Sichuan tea, the original color of the tea leaves is usually maintained, providing a crisp taste and unique aroma. The rich resource of tea cultivars in Sichuan provides a basis for the further cultivation of even better varieties, and facilitate the study of the molecular mechanism of the formation of the unique flavor of Sichuan tea.

Tea is rich in nutrients, including free amino acids (FAAs), tea polyphenols (TPs), and vitamins. FAAs primarily contribute to the refreshing taste of tea, not only endowing tea with its unique flavor but also exerting vital physiological functions in the human body [58]. For example, theanine has antidepressant and mood-boosting effects, providing a significant role in improving people’s mental health [912]. TPs are the primary cause of the antioxidant properties of tea, capable of scavenging free radicals, delaying aging, and preventing various chronic diseases [13, 14]. Additionally, TPs exhibit antibacterial and antiviral bioactivities, which hold considerable significance for maintaining human health [1518].

The advent of high-throughput technology has facilitated the widespread application of restriction site-associated DNA sequencing (RAD-seq) in the development of molecular markers associated with desirable traits in a diverse array of plants [1925]. Currently, the genomic data for tea plant have been completed, and by investigating the data of target traits, combined with genome-wide association analysis (GWAS), the molecular marker sites and candidate genes related to traits can be rapidly identified [2, 2630]. Qiu et al. used GWAS to investigate the diversity of flavonoid metabolites in tea [31]. Through screening, Lu et al. identified four genes related to the plant size and leaf color of tea plant using GWAS. The results indicated that these genes were associated with the size, leaf color, etc [32]. Jiang et al. employed GWAS to identify aroma substances in 70 tea cultivars from Fujian province and Liu et al. revealed the phylogenetic relationships of four typical tea landraces from Hunan province [3335].

In this study, we collected 139 commonly used tea cultivars in Sichuan. We measured the concentrations of FAAs and TPs within these samples. Subsequently, we employed restriction site associated DNA sequencing (RAD-seq) and GWAS to explore the genetic evolution of these teas, as well as the identification of molecular markers and candidate genes related to the FAA and TP content in Sichuan teas. The results provide a reference for the study of flavor differences in Sichuan tea cultivars, as well as a scientific reference for the more accurate localization of key genes for tea flavor formation and variety breeding.

Materials and methods

Plant materials

From plants cultivated under the same soil and environmental conditions in Meishan City, Sichuan province (29.81°N, 103.17°E), 139 tea plant accessions were collected for RAD-seq construction. The geographic origins of these accessions were mainly from different areas of the Sichuan province, China, and comprised 91 from Yaan, 9 from Chengdu, 8 from Meishan, 14 from Guangyuan, and 17 from Mianyang (S1 Table). In the spring of April, buds with two tea plant leaves showing good growth were taken as materials, and the samples were stored in the refrigerator at −80°C after being frozen in liquid nitrogen.

Detection of FAAs and TPs

The total FAA content in tea was determined using ninhydrin colorimetry (GB/T 8314–2013) [36]. The detection of TPs in tea was performed using the Folin phenol method referenced in the National Standards of China (GB/T 8313–2018) [37]. Normal distribution analysis with Kolmogorov-Smirnov test was performed, histogram and violin plot were drawn by GraphPad Prism 9.5.0.

DNA extraction and sequencing

Genomic DNA samples from all accessions were extracted using a CTAB method and incubated for 1 hour at 37°C with 2 μL of RNase-A [38]. The genomic DNA was digested with the restriction enzyme EcoRI with the addition of the P1 adapter. The ligated products were then pooled, broken into small fragments, and fragments of 300–700 bp were separated using 2% agarose gel electrophoresis. The P2 adapter was added and PCR amplification was performed to enrich RAD tags. The DNA libraries were using the Illumina sequencing platform.

Data processing

The sequencing original data from the Illumina platform were screened using fastp (version 0.18.0) [39]. Reads with unknown nucleotides (N) ≥ 10%, Phred quality score ≤ 20 with more than 50% base content, and those containing connectors were excluded. Subsequently, the filtered reads were compared with the reference genome using BWA software (version 0.7.12) [40]. The results were sorted using Sambamba and repeated sequences were labeled [41]. The reference C. sinensis genome was downloaded from the BIG database (accession no.: GWHACFB00000000) [2]. Variants were detected by GATK (version 4.0) with the filter parameters (−Window 4, −filter "QD < 4.0 || FS > 60.0 || MQ < 40.0 ", −G_filter "GQ < 20") [42, 43].

Population structure, principal component analysis, and phylogenetic tree construction

The population structure of 139 tea plants was conducted using Admixture (version 1.3.0), and the tested K was set from 1 to 9 [44]. According to the cross-validation method, the best genetic cluster, K, was determined and the results were plotted using R. Pophelper software (v2.2.7) was used to plot the genetic composition of each sample [45]. PCA and kinship was analyzed using GCTA (version 1.92.2) [46]. A phylogenetic tree was conducted using MEGA (version 7.0) and the NJ algorithm with 500 bootstrap replicates.

Genome-wide association analysis

Results with a heterozygosity ratio greater than 0.8 and a minor allele frequency (MAF) less than 0.05 were removed. The decay of linkage disequilibrium (LD) was evaluated using PopldDecay software [47]. A genome-wide association study (GWAS) was performed based on the GLM, GLM(Q), MLM(K), and MLM(QK) models to calculate the relationships between markers and phenotyping data using the GEMMA (v0.98.1) [48].

Results

Phenotypic analysis of free amino acids and tea polyphenols

The 139 varieties were all collected from the Sichuan Province of China, mainly within 8 areas with relatively similar geographic and climatic environments. The FAA and TP contents of these 139 varieties all showed a normal distribution, with the FAA content mainly distributed at approximately 3% and the TP content mainly distributed at approximately 17% (Fig 1A–1D).

thumbnail
Fig 1. The frequency distribution histogram of FAAs and TPs.

(A) FAA content distribution. (B) Frequency distribution histogram of FAA. (C) TP content distribution. (D) Frequency distribution histogram of TP.

https://doi.org/10.1371/journal.pone.0314144.g001

Statistics of sequencing data

The EcoRI cleavage sites contained in the 15 chromosomes of the tea plant genome were evaluated, with a total of 1,274,788 sites found, of which the number of effective cleavage sites was 866,855 (Fig 2A, S2 Table). After sequencing 139 samples, we obtained 511 Gb of clean reads, with an average of more than 3 Gb of data per sample, and the Q30 values of the samples were all approximately 90% (S3 Table). The mapping rate of the obtained clean reads was greater than 96% for all sample data after matching to the reference genome (S4 Table).

thumbnail
Fig 2. The distribution of RF and the analysis of SNPs and indels.

(A) Chromosome distribution map of RF. (B) The location information of SNPs and indels. (C) The functional annotation of SNPs and indels. (D) Transition and transversion statistics of SNPs.

https://doi.org/10.1371/journal.pone.0314144.g002

Through the analysis, we screened a total of 5,656,224 variant loci, including 5,326,438 SNP loci (94.17%) and 329,786 indel loci (5.83%). The two most prominent functions of loci were nonsynonymous SNVs (88,990) and synonymous SNVs (80,574) (Fig 2B). These SNPs were mainly distributed in the intergenic region, followed by the intronic region, and only 175,323 were distributed in the exonic region, in addition to a small number of SNPs distributed in the upstream and downstream regions of the gene (Fig 2C). Among these SNPs, there were 3,779,918 SNPs in the transition category, mainly in the G->A and C->T categories, and there were 1,546,520 SNPs in the transversion category (Fig 2D).

Evolutionary analysis

After filtering for rare alleles, high deletion rates, and high heterozygosity, we retained 1,179,501 variation loci, including 1,107,524 SNP loci and 71,977 indel loci. The phylogenetic analysis of these individuals showed that these varieties diverged depending on the collection location, suggesting that they may have a relatively similar have genetic background (Fig 3A). The analysis of their components showed that all varieties were mainly divided into two groups (Fig 3B), with group I containing approximately 100 varieties and group II containing approximately 33 varieties, and a further 5 varieties clustered into cluster III (Fig 3A and 3B). The results of the linkage disequilibrium (LD) decay values of all the populations showed that the R2 value decreased rapidly from 0.1 to 0.05 in the 0–50 kb range, indicating the high precision of the correlation analysis results (Fig 3C).

thumbnail
Fig 3. The phylogenetic evolution tree and principal component of the all accessions.

(A) Evolutionary tree. Green circle, CDCZ. Red circle, CDQL. Blue check, GYQC. Green triangle, MSHY. Red rectangle, MYAX. Red star, YAMS. Green star, YAYJ. (B) Principal component analysis (PCA) of 139 tea plant cultivars. (C) LD decay analysis. GYQC, Guangyuan Qingchuan. MYBC, Mianyang Beichuan. MYAX, Mianyang Anxian. CDQZ, Chengdu Chongzhou. CDQL, Chengdu Qionglai. YAMS, Yaan Mingshan. YAYJ, Yaan Yajiang. MSHY, Meishan Hongya.

https://doi.org/10.1371/journal.pone.0314144.g003

Population structure analysis

The values of the cross-validation (CV) errors were calculated for each value of K from 2 to 9. The results showed that all accessions could be divided into two populations (K = 2), showing similar results to the PCA and evolutionary analysis (Fig 4A and 4B). When K = 2, the composition of the two populations corresponded essentially to clusters I and II in the phylogenetic tree, and when K = 2, the compositional varieties of the three populations corresponded basically to the three clusters in the evolutionary tree (Figs 3A and 4C).

thumbnail
Fig 4. Population structure of 139 tea plant cultivars.

(A) Cross-validation of K values. (B) Heat map of relationship between samples. (C) Stacked diagram of the tea plant community structure.

https://doi.org/10.1371/journal.pone.0314144.g004

Screening of FAA-related SNPs and candidate genes

We analyzed the SNPs associated with the FAA contents of four models, and the results showed that under p < 0.00001 conditions, the four methods identified a total of 30 significant SNP loci, of which 14 SNP loci were shared (Fig 5A). The genes within 50 kb of these significantly related SNP loci were analyzed, and the results showed that there were 33 related genes (Fig 5B). These loci may be significantly associated with the FAA content, and these candidate genes were distributed on six chromosomes and two scaffolds: Chr1 with one SNP locus and the associated genes, including Cha01g016240 (prolyl 4-hydroxylase 10, P4H10), Cha01g016250, Cha01g016260, Cha01g016270 (phospholipase D delta, PLDDELTA), and Cha01g016280 (serine/arginine-rich splicing factor SR45a, SR45a); Chr3 with one SNP locus associated with three candidate genes (Cha03g018190, Cha03g018200, Cha03g018210); Chr8 with one SNP locus associated with Cha08g000060, Cha08g000070, Cha08g000080 (5-methyltetrahydropteroyltriglutamate–homocysteine methyltransferase, METE); Chr9 with one SNP locus associated with Cha09g016500, Cha09g016510, Cha09g016520 (nuclear transcription factor Y subunit B-5, NFYB5), Cha09g016530 (cullin-1, CUL1); three significant SNP sites associated with ten genes on Chr5, such as Cha05g013620 (Lectin receptor-like kinase 4, LecRK4), Cha05g013630 (LecRK4), Cha05g013660 (LecRK3) (Fig 5C and 5D, and S5 Table).

thumbnail
Fig 5. FAA-related SNPs and candidate genes.

(A) FAA-related SNPs screened from the four models. (B) FAA-related genes screened from the four models. (C) Manhattan plots of FAA association analysis by GLM(Q). Negative log10-transformed P-values from a genome-wide scan are plotted against position on each of the 15 chromosomes. Red and black dots indicates the genome-wide significance threshold (p > = 4 and 5, respectively). (D) Quantile–Quantile (Q–Q) plots of FAA traits. The red line represents the predicted values and the blue dots represent the observed values, showing the difference between the predicted value and the observed value.

https://doi.org/10.1371/journal.pone.0314144.g005

Screening of TP-related SNPs and candidate genes

The same analysis of the SNPs associated with TP showed that eight SNPs were significantly associated (p < 0.00001) by four of the methods (Fig 6A). There were 20 genes potentially associated with these SNPs (Fig 6B). The results of the GLM(Q) model showed that these SNPs were mainly distributed onto chromosomes Chr1, Chr2, Chr4, Chr8, Chr9, and Chr11, which were significantly correlated with TPs (Fig 6C and 6D). The genes associated with the SNP loci on chromosome Chr1 were Cha01g002220 (NAC078), Cha01g002230 (NAC078), Cha01g002240 (homology of malate dehydrogenase 1, MDH1), Cha01g002250 (homology of ACYL-CoA-BINDING PROTEIN4, ACBP4), Cha01g002260, and Cha01g006610 (non-specific lipid transfer protein 1) and the genes linked to the SNP loci on chromosomes Chr2 and Chr4 were Cha02g000510 (MYB3) and Cha04g018610 (fluG), respectively; the genes associated with the significant SNP loci on chromosome 8 were Cha08g010570, Cha08g010580, Cha08g010590, and Cha08g010600, and the genes associated with the SNP loci on chromosome Chr9 were Cha09g017840 (histone acetyltransferase, HAT), Cha09g017850 (VESICLE-ASSOCIATED MEMBRANE PROTEIN721, VAMP721), and Cha09g017860 (vacuolar protein sorting 13, VPS13), in addition to the genes associated with the SNP loci on Chr11, Cha11g000470 (glycerol-3-phosphate acyltransferase 3, GPAT3), Cha11g000480 (GPAT3), Cha11g000490, Cha11g000500, Cha11g000510, Cha11g000520, Cha11g000530 (long chain acyl-CoA synthetase 1, LACS1), and Cha11g000540 (LACS1) (S6 Table).

thumbnail
Fig 6. TP-related SNPs and candidate genes.

(A) TP-related SNPs were screened using four models. (B) FAA-related genes were screened using four models. (C) Manhattan plots of TP association analysis by GLM(Q). Negative log10-transformed P-values from a genome-wide scan are plotted against position on each of 15 chromosomes. Red and black dots indicates the genome-wide significance threshold (p > = 4 and 5, respectively). (D) Quantile–Quantile (Q–Q) plots of TP trait. The red line represents the predicted values and the blue dots represents the observed values, showing the difference between the predicted value and the observed value.

https://doi.org/10.1371/journal.pone.0314144.g006

Discussion

Tea is one of the most important economic crops in China, and Sichuan is a major tea-producing region in southwest China that is the origin of a series of well-known tea cultivars with unique flavors and textures. In this study, RAD sequencing was performed on 139 tea species from several major tea-producing areas in the Sichuan province, China. The analysis of these data can provide useful resources for the collection and molecular breeding of teas from the Sichuan region. The results of these data suggest that the 139 tea accessions from the Sichuan region collected in this study consist of two populations.

The main cultivated tea cultivars belong to the genus Camellia L., section Thea (L.) Dyer, which contains the two major varieties of C. sinensis var. sinensis and C. sinensis var. assamica (Masters) Chang [2]. Wang et al. collected multiple accessions in the Sichuan region in their study, and the results showed that these accessions mainly belonged to C. sinensis var. sinensis, which may predict that multiple accessions in this study also belong to C. sinensis var. sinensis. Lu et al. analyzed 120 ancient tea plants collected in Guizhou and Yunnan Provinces and found that the natural environment or artificial breeding selection had less selection pressure on ancient tea plants [32]. In this study, we were unable to collect samples from other tea-producing regions for further analysis. The collection sites of multiple accessions in this study were mainly located in hilly areas with altitudes usually of 800–1,500 m. Most of the plants and growing conditions were similar to those of C. sinensis var. sinensis, which usually has smaller leaves, cold tolerance, and has similar characteristics when growing as a shrub [27, 49, 50]. Jiang et al. analyzed 70 tea cultivars, mainly from Fujian Province, and shows that tea in Fujian is mainly composed of three groups [33]. In terms of population structure and evolutionary relationships, the tea cultivars in Sichuan province mainly diverged into two populations with a highly similar genetic background and typical regional characteristics (Figs 3 and 4).

The contents of total catechins, caffeine, FAA, TP, vitamins, and other components in tea have a direct influence on the flavor of tea, and systematic analyses of FAA and TP contents of tea in Sichuan will provide guidance for breeding for the high content of these two substances. A systematic study of the contents of FAA and TP in tea will guide the breeding of tea with high contents of these two substances. GWAS is a commonly used analytical method, and, in this study, we found that the results obtained from four different analytical models had a high degree of similarity, which indicated a high degree of confidence in the results.

We selected 33 genes based on the FAA analysis results, but the functions of multiple genes are currently unknown. Among them, we found three LECRK homologous genes on chromosome Chr 5. In multiple plant species, LECRK is associated with salt stress and cold stress [5154]. Furthermore, on chromosome 13, the PGIP1 gene is closely associated with biotic and abiotic stress in multiple species [5557]. The varieties used in this study were collected from higher altitude regions and belong to C. sinensis var. sinensis. They have a strong ability to tolerate cold stress, which suggests a relationship between the FAA content in tea leaves and cold stress.

The GWAS results of TPs revealed 24 candidate genes, including genes homologous to MYB3 on chromosome Chr2, which has been reported to activate the phenylpropanoid pathway and promote the accumulation of polyphenolic substances in tobacco [58]. LACS is a key enzyme involved in the activation of fatty acids into the corresponding CoA esters [59]. Fatty acyl-CoA intermediates may alter the biosynthetic pathways of polyphenolic compounds [60, 61].

Conclusions

In this study, we collected 139 tea accessions from Sichuan Province, China, and studied the distribution of their SNPs using RAD-seq. The results revealed that these 139 accessions were mainly divided into two genetic populations. The GWAS analysis of FAA and TP contents and SNPs identified 14 SNPs significantly associated with FAAs, and identified 33 candidate genes. Moreover, 8 SNPs were significantly associated with TPs, along with 20 candidate genes. These findings provide a reference for further research into the regulatory relationship between these genes and the content of FAAs and TPs in tea, offering a genetic resource for future molecular breeding.

Supporting information

S1 Table. The list of 139 accessions information.

https://doi.org/10.1371/journal.pone.0314144.s001

(XLSX)

S2 Table. Evaluation of genomic enzyme digestion.

https://doi.org/10.1371/journal.pone.0314144.s002

(XLSX)

Acknowledgments

We are grateful to the Henan Assist Research Biotechnology Co., Ltd (Zhengzhou, China) for assisting in sequencing and bioinformatics analysis.

References

  1. 1. Lu H, Zhang J, Yang Y, Yang X, Xu B, Yang W, et al. Earliest tea as evidence for one branch of the Silk Road across the Tibetan Plateau. Scientific reports. 2016;6:18955. Epub 2016/01/08. pmid:26738699; PubMed Central PMCID: PMC4704058.
  2. 2. Wang X, Feng H, Chang Y, Ma C, Wang L, Hao X, et al. Population sequencing enhances understanding of tea plant evolution. Nat Commun. 2020;11(1):4447. Epub 2020/09/09. pmid:32895382; PubMed Central PMCID: PMC7477583.
  3. 3. Chen WH. A discussion on the evolution and classification of the chinese tea varieties. Acta Horticulturae Sinica. 1964;3(2):191–8.
  4. 4. Kan NC. Study on Geographical Distribution of Wild Tea Trees in Sichuan and Chongqing, and Origin of Tea Tree. southwest china Journal of Agricultural sciences. 2013;26(01):382–5.
  5. 5. Li J, Ma J, Li Q, Fan S, Fan L, Ma H, et al. Determination of 35 Free Amino Acids in Tea Using Ultra-Performance Liquid Chromatography Coupled With Quadrupole Time-of-Flight Mass Spectrometry. Front Nutr. 2021;8:767801. Epub 2021/12/28. pmid:34957181; PubMed Central PMCID: PMC8697017.
  6. 6. Yang BH, Gao Y, Li HM, Ye SB, He HX, Xie SR. Rapid prediction of yellow tea free amino acids with hyperspectral images. Plos One. 2019;14(2):e0210084. Epub 2019/02/21. WOS:000459307000009; PubMed Central PMCID: PMC6382264. pmid:30785888
  7. 7. Zhao F, Wu W, Wang C, Wang X, Liu H, Qian J, et al. Dynamic change of oligopeptides and free amino acids composition in five types of tea with different fermentation degree processed from the same batch of fresh tea (Camelilia Sinensis. L.) leaves. Food Chemistry. 2023;404(Pt A):134608. Epub 2022/10/23. pmid:36272301.
  8. 8. Zhu YC, Luo YH, Wang PP, Zhao MY, Li L, Hu XS, et al. Simultaneous determination of free amino acids in Pu-erh tea and their changes during fermentation. Food Chemistry. 2016;194:643–9. Epub 2015/10/17. WOS:000364248900085. pmid:26471603
  9. 9. Shao J, Wei Y, Wei XL. A comprehensive review on bioavailability, safety and antidepressant potential of natural bioactive components from tea. Food Research International. 2022;158:111540. Epub 2022/07/16. WOS:000847827500010. pmid:35840236
  10. 10. Unno K, Furushima D, Nomura Y, Yamada H, Iguchi K, Taguchi K, et al. Antidepressant Effect of Shaded White Leaf Tea Containing High Levels of Caffeine and Amino Acids. Molecules. 2020;25(15):3550. Epub 2020/08/07. WOS:000559113600001; PubMed Central PMCID: PMC7435365. pmid:32756488
  11. 11. Unno K, Muguruma Y, Inoue K, Konishi T, Taguchi K, Hasegawa-Ishii S, et al. Theanine, Antistress Amino Acid in Tea Leaves, Causes Hippocampal Metabolic Changes and Antidepressant Effects in Stress-Loaded Mice. International Journal of Molecular Sciences. 2020;22(1):193. Epub 2021/01/01. pmid:33379343; PubMed Central PMCID: PMC7795947.
  12. 12. Yin C, Gou LS, Liu Y, Yin XX, Zhang L, Jia GG, et al. Antidepressant-like Effects of L-theanine in the Forced Swim and Tail Suspension Tests in Mice. Phytotherapy Research. 2011;25(11):1636–9. Epub 2011/03/23. WOS:000297307000009. pmid:21425373
  13. 13. Li X, Chen S, Ouyang KH, Wang WJ. Effects of polysaccharides from Yingshan Yunwu tea on free amino acids, flavor nucleotides and antioxidant abilities in chickens. Res Vet Sci. 2022;149:11–20. Epub 2022/06/19. pmid:35716518.
  14. 14. Mao Y, Li H, Wang Y, Fan K, Song Y, Han X, et al. Prediction of Tea Polyphenols, Free Amino Acids and Caffeine Content in Tea Leaves during Wilting and Fermentation Using Hyperspectral Imaging. Foods (Basel, Switzerland). 2022;11(16):2537. Epub 2022/08/27. pmid:36010536; PubMed Central PMCID: PMC9407140.
  15. 15. Li J, Feng CM, Jin JY, Yang WQ, Wang ZL. Current understanding on antibacterial mechanisms and research progress of tea polyphenols as a supplementary disinfectant for drinking water. Journal of water and health. 2022;20(11):1611–28. Epub 2022/12/01. WOS:000879445600001. pmid:36448612
  16. 16. Xiang Q, Cheng L, Zhang RL, Liu YA, Wu ZF, Zhang X. Tea Polyphenols Prevent and Intervene in COVID-19 through Intestinal Microbiota. Foods (Basel, Switzerland). 2022;11(4):506. Epub 2022/02/26. ARTN 506 WOS:000806175800001; PubMed Central PMCID: PMC8871045. pmid:35205982
  17. 17. Zhang N, Liu WG, Qian KJ. antibacterial effect of tea polyphenols combined with common antibiotics on multidrug-resistant. Minerva medica. 2020;111(6):536–43. Epub 2020/04/29. WOS:000599429700004.
  18. 18. Zhang WL, Jiang WB. Antioxidant and antibacterial chitosan film with tea polyphenols-mediated green synthesis silver nanoparticle via a novel one-pot method. International Journal of Biological Macromolecules. 2020;155:1252–61. Epub 2019/11/15. WOS:000536122500130. pmid:31726160
  19. 19. Yang H, Wei CL, Liu HW, Wu JL, Li ZG, Zhang L, et al. Genetic Divergence between Camellia sinensis and Its Wild Relatives Revealed via Genome-Wide SNPs from RAD Sequencing. PLoS One. 2016;11(3):e0151424. Epub 2016/03/11. pmid:26962860; PubMed Central PMCID: PMC4786323 Jian-Bo Jian is an employee of BGI-ShenZhen, and the company declared that they will not develop products or apply for patents and marketed products based on this publication. This does not alter the authors’ adherence to all PLOS ONE policies on sharing data and materials.
  20. 20. Bus A, Hecht J, Huettel B, Reinhardt R, Stich B. High-throughput polymorphism detection and genotyping in Brassica napus using next-generation RAD sequencing. BMC Genomics. 2012;13:281. Epub 2012/06/26. pmid:22726880; PubMed Central PMCID: PMC3442993.
  21. 21. Bateman RM, Sramkó G, Paun O. Integrating restriction site-associated DNA sequencing (RAD-seq) with morphological cladistic analysis clarifies evolutionary relationships among major species groups of bee orchids. Ann Bot. 2018;121(1):85–105. Epub 2018/01/13. pmid:29325077; PubMed Central PMCID: PMC5786241.
  22. 22. Xu LY, Wang LY, Wei K, Tan LQ, Su JJ, Cheng H. High-density SNP linkage map construction and QTL mapping for flavonoid-related traits in a tea plant (Camellia sinensis) using 2b-RAD sequencing. BMC Genomics. 2018;19(1):955. Epub 2018/12/24. pmid:30577813; PubMed Central PMCID: PMC6304016.
  23. 23. Yamashita H, Uchida T, Tanaka Y, Katai H, Nagano AJ, Morita A, et al. Genomic predictions and genome-wide association studies based on RAD-seq of quality-related metabolites for the genomics-assisted breeding of tea plants. Scientific reports. 2020;10(1):17480. Epub 2020/10/17. pmid:33060786; PubMed Central PMCID: PMC7562905.
  24. 24. Hsu YM, Wang SS, Tseng YC, Lee SR, Fang H, Hung WC, et al. Assessment of genetic diversity and SNP marker development within peanut germplasm in Taiwan by RAD-seq. Scientific reports. 2022;12(1):14495. Epub 2022/08/26. pmid:36008445; PubMed Central PMCID: PMC9411510.
  25. 25. Nagano Y, Tashiro H, Nishi S, Hiehata N, Nagano AJ, Fukuda S. Genetic diversity of loquat (Eriobotrya japonica) revealed using RAD-Seq SNP markers. Scientific reports. 2022;12(1):10200. Epub 2022/06/24. pmid:35739209; PubMed Central PMCID: PMC9226044.
  26. 26. Chen S, Wang P, Kong W, Chai K, Zhang S, Yu J, et al. Gene mining and genomics-assisted breeding empowered by the pangenome of tea plant Camellia sinensis. Nat Plants. 2023;9(12):1986–99. Epub 2023/11/28. pmid:38012346.
  27. 27. Xia E, Tong W, Hou Y, An Y, Chen L, Wu Q, et al. The Reference Genome of Tea Plant and Resequencing of 81 Diverse Accessions Provide Insights into Its Genome Evolution and Adaptation. Mol Plant. 2020;13(7):1013–26. Epub 2020/05/01. pmid:32353625.
  28. 28. Liu YJ, Chen S, Jiang CK, Liu HR, Wang JY, He WZ, et al. Combined QTL mapping, GWAS and transcriptomic analysis revealed a candidate gene associated with the timing of spring bud flush in tea plant (Camellia sinensis). Horticulture Research. 2023;10(9):uhad149. ARTN uhad149 WOS:001097280000006. pmid:37691963
  29. 29. Chen JD, Zheng C, Ma JQ, Jiang CK, Ercisli S, Yao MZ, et al. The chromosome-scale genome reveals the evolution and diversification after the recent tetraploidization event in tea plant. Horticulture Research. 2020;7(1):63. WOS:000531392800006. pmid:32377354
  30. 30. Wang Z, Huang R, Moon D-G, Ercisli S, Chen L. Achievements and prospects of QTL mapping and beneficial genes and alleles mining for important quality and agronomic traits in tea plant (Camellia sinensis). Beverage Plant Research. 2023;3(1):22.
  31. 31. Qiu H, Zhang X, Zhang Y, Jiang X, Ren Y, Gao D, et al. Depicting the genetic and metabolic panorama of chemical diversity in the tea plant. Plant Biotechnol J. 2024;22(4):1001–16. Epub 2023/12/04. pmid:38048231; PubMed Central PMCID: PMC10955498.
  32. 32. Lu LT, Chen HF, Wang XJ, Zhao YC, Yao XZA, Xiong BA, et al. Genome-level diversification of eight ancient tea populations in the Guizhou and Yunnan regions identifies candidate genes for core agronomic traits. Horticulture Research. 2021;8(1):190. Epub 2021/08/12. WOS:000683738000001; PubMed Central PMCID: PMC8355299. pmid:34376642
  33. 33. Jiang L, Xie S, Zhou C, Tian C, Zhu C, You X, et al. Analysis of the genetic diversity in tea plant germplasm in fujian province based on restriction site-associated DNA sequencing. Plants. 2023;13(1):100. Epub 2024/01/11. pmid:38202408; PubMed Central PMCID: PMC10780744.
  34. 34. Liu Z, Zhao Y, Yang P, Cheng Y, Huang F, Li S, et al. Population whole-genome resequencing reveals the phylogenetic relationships and population structure of four Hunan typical tea landraces. Beverage Plant Research. 2023;3(1):9.
  35. 35. Huang F, Duan J, Lei Y, Liu Z, Kang Y, Luo Y, et al. Genetic diversity, population structure and core collection analysis of Hunan tea plant germplasm through genotyping-by-sequencing. Beverage Plant Research. 2022;2(1):1–7.
  36. 36. Shao JL, Li QW, Dong BS, Liu HC, SHu JH. Determination of totaI free-amino acid in tea by Nihydrin colorimetry. China Food Additives. 2008;19(2):162–5.
  37. 37. Wang XL. Discussion on the Repeatability Regulation of Tea Polyphenols in GB/T 8313–2018. China Tea. 2023;45(10):25–30.
  38. 38. Hai TY, Fang GC, Qing ZM, Song CC, Bing CR. Application of ISSR markers to the genetic polymorphism of Camellia sinensis. Journal of Fujian Agriculture and Forestry University. 2007;36(1):51–5.
  39. 39. Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34(17):i884–i90. pmid:30423086
  40. 40. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25(14):1754–60. Epub 2009/05/20. pmid:19451168; PubMed Central PMCID: PMC2705234.
  41. 41. Tarasov A, Vilella AJ, Cuppen E, Nijman IJ, Prins P. Sambamba: fast processing of NGS alignment formats. Bioinformatics. 2015;31(12):2032–4. Epub 2015/02/24. pmid:25697820; PubMed Central PMCID: PMC4765878.
  42. 42. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20(9):1297–303. Epub 2010/07/21. pmid:20644199; PubMed Central PMCID: PMC2928508.
  43. 43. Van der Auwera GA, Carneiro MO, Hartl C, Poplin R, del Angel G, Levy‐Moonshine A, et al. From FastQ data to high confidence variant calls: the genome analysis toolkit best practices pipeline. Current protocols in bioinformatics. 2018;43(1):11.0.1-.0.33. Epub 2014/11/29. pmid:25431634; PubMed Central PMCID: PMC4243306.
  44. 44. Alexander DH, Novembre J, Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Research. 2009;19(9):1655–64. pmid:19648217
  45. 45. Francis RM. pophelper: an R package and web app to analyse and visualize population structure. Mol Ecol Resour. 2017;17(1):27–32. Epub 2016/02/07. pmid:26850166.
  46. 46. Yang J, Lee SH, Goddard ME, Visscher PM. GCTA: a tool for genome-wide complex trait analysis. American journal of human genetics. 2011;88(1):76–82. Epub 2010/12/21. pmid:21167468; PubMed Central PMCID: PMC3014363.
  47. 47. Zhang C, Dong SS, Xu JY, He WM, Yang TL. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics. 2019;35(10):1786–8. Epub 2018/10/16. pmid:30321304.
  48. 48. Zhou X, Stephens M. Genome-wide efficient mixed-model analysis for association studies. Nat Genet. 2012;44(7):821–4. Epub 2012/06/19. pmid:22706312; PubMed Central PMCID: PMC3386377.
  49. 49. Chen L, Yu FL, Tong QQ. Discussions on phylogenetic classification and evolution of section Thea. JOURNAL OF TEA SCIENCE. 2000;20(2):89–94.
  50. 50. Yang JB, Yang J, Li HT, Zhao Y, Yang SXJCGIsolation and characterization of 15 microsatellite markers from wild tea plant (Camellia taliensis) using FIASCO method. Conservation Genetics. 2009;10(5):1621–3.
  51. 51. Sun XL, Yu QY, Tang LL, Ji W, Bai X, Cai H, et al. GsSRK, a G-type lectin S-receptor-like serine/threonine protein kinase, is a positive regulator of plant tolerance to salt stress. J Plant Physiol. 2013;170(5):505–15. Epub 2013/01/02. pmid:23276523.
  52. 52. Ma Q, Chen C, Zeng Z, Zou Z, Li H, Zhou Q, et al. Transcriptomic analysis between self- and cross-pollinated pistils of tea plants (Camellia sinensis). BMC Genomics. 2018;19(1):289. Epub 2018/04/27. pmid:29695246; PubMed Central PMCID: PMC5918555.
  53. 53. Sawicki M, Rondeau M, Courteaux B, Rabenoelina F, Guerriero G, Gomès E, et al. On a Cold Night: Transcriptomics of Grapevine Flower Unveils Signal Transduction and Impacted Metabolism. Int J Mol Sci. 2019;20(5). Epub 2019/03/08. pmid:30841651; PubMed Central PMCID: PMC6429367.
  54. 54. Shang H, Li P, Zhang X, Xu X, Gong J, Yang S, et al. The Gain-of-Function Mutation, OsSpl26, Positively Regulates Plant Immunity in Rice. Int J Mol Sci. 2022;23(22). Epub 2022/11/27. pmid:36430644; PubMed Central PMCID: PMC9697700.
  55. 55. Di Matteo A, Bonivento D, Tsernoglou D, Federici L, Cervone F. Polygalacturonase-inhibiting protein (PGIP) in plant defence: a structural view. Phytochemistry. 2006;67(6):528–33. Epub 2006/02/07. pmid:16458942.
  56. 56. Prabhu SA, Wagenknecht M, Melvin P, Gnanesh Kumar BS, Veena M, Shailasree S, et al. Immuno-affinity purification of PglPGIP1, a polygalacturonase-inhibitor protein from pearl millet: studies on its inhibition of fungal polygalacturonases and role in resistance against the downy mildew pathogen. Mol Biol Rep. 2015;42(6):1123–38. Epub 2015/01/19. pmid:25596722.
  57. 57. Dong C, Li R, Wang N, Liu Y, Zhang Y, Bai S. Apple vacuolar processing enzyme 4 is regulated by cysteine protease inhibitor and modulates fruit disease resistance. J Exp Bot. 2022;73(11):3758–73. Epub 2022/03/09. pmid:35259265.
  58. 58. Zong Y, Li S, Xi X, Cao D, Wang Z, Wang R, et al. Comprehensive Influences of Overexpression of a MYB Transcriptor Regulating Anthocyanin Biosynthesis on Transcriptome and Metabolome of Tobacco Leaves. Int J Mol Sci. 2019;20(20). Epub 2019/10/19. pmid:31623091; PubMed Central PMCID: PMC6829574.
  59. 59. De Marcos Lousa C, van Roermund CW, Postis VL, Dietrich D, Kerr ID, Wanders RJ, et al. Intrinsic acyl-CoA thioesterase activity of a peroxisomal ATP binding cassette transporter is required for transport and metabolism of fatty acids. Proc Natl Acad Sci U S A. 2013;110(4):1279–84. Epub 2013/01/05. pmid:23288899; PubMed Central PMCID: PMC3557071.
  60. 60. Zhang YM, Rock CO. Evaluation of epigallocatechin gallate and related plant polyphenols as inhibitors of the FabG and FabI reductases of bacterial type II fatty-acid synthase. The Journal of biological chemistry. 2004;279(30):30994–1001. Epub 2004/05/11. pmid:15133034.
  61. 61. Milke L, Ferreira P, Kallscheuer N, Braga A, Vogt M, Kappelmann J, et al. Modulation of the central carbon metabolism of Corynebacterium glutamicum improves malonyl-CoA availability and increases plant polyphenol synthesis. Biotechnology and bioengineering. 2019;116(6):1380–91. Epub 2019/01/27. pmid:30684355.