Tissue-Specific, Development-Dependent Phenolic Compounds Accumulation Profile and Gene Expression Pattern in Tea Plant [Camellia sinensis]

Phenolic compounds in tea plant [Camellia sinensis (L.)] play a crucial role in determining tea flavor and have a number of key pharmacological benefits for human health. The present research aimed to profile the tissue-specific, development-dependent accumulation pattern of phenolic compounds in tea plant. A total of 50 phenolic compounds were identified qualitatively using liquid chromatography in tandem mass spectrometry technology, of which 29 were quantified based on their fragmentation behaviors. Most of the phenolic compounds were higher in the younger leaves than in the stem and root, whereas the total amount of proanthocyanidins was unexpectedly higher in the root. The expression patterns of 63 structural and regulator genes involved in the shikimic acid, phenylpropanoid, and flavonoid pathways were analyzed by quantitative real-time polymerase chain reaction and cluster analysis. Based on the similarity of their expression patterns, the genes were classified into two main groups, C1 and C2; the genes in group C1 had high relative expression levels in the root or low levels in the bud and leaves. The expression patterns of genes in the C2-2-1 and C2-2-2-1 groups were probably responsible for the development-dependent accumulation of phenolic compounds in the leaves. Enzymatic analysis suggested that the accumulation of catechins was influenced simultaneously by catabolism and anabolism. Further research is recommended to characterize the expression patterns of the various genes and to explain the variation in the contents of different compounds across growth stages and organs.


Introduction
Tea is one of the three most popular nonalcoholic beverages consumed throughout the world. Numerous epidemiological studies indicate that daily consumption of green tea is part of a lifestyle that supports health and longevity [1][2][3]. Research reports suggested that the consumption of tea [Camellia sinensis (L.)] had a protective effect on reducing the incidence of all cancers [4,5], decreasing body weight and body fat various biological processes [19], metabolic pathways [16] and regulatory networks [20]. Extensive studies on model organisms such as tobacco, Arabidopsis thaliana, and Medicago truncatula have facilitated the understanding of the regulation system and subcellular location of the flavonoid pathway [21][22][23]. In recent years, more research has been conducted on the synthetic regulation, transport and galloylation of phenolic compounds, and on the polymerization of PAs [6,23-28]. There are both similarities and differences in the biosynthetic pathway of phenolic compounds between model plants and tea plant. For instance, the galloylated catechins, including epigallocatechin gallate (EGCG) and epicatechin gallate (ECG), account for up to 76% of catechins in the tea plant [29,30]. It has been believed that flavonoids are synthesized exclusively in the endoplasmic reticulum and then transferred to vacuoles by multidrug resistance-associated proteins or multidrug and toxic compound extrusion proteins [31,32]. However, catechins were possibly located mainly in the chloroplasts of mesophyll cells and in vessel walls rather than in the vacuoles [33]. Light radiation had no significant effect on the accumulation of anthocyanin and galloylated catechins in field-grown tea plants [34], which was inconsistent with observations in model organisms.
Little research has focused on the function of structural and regulatory genes in the metabolism of phenolic compounds in tea plant, and very few structural and regulatory genes have been deposited in the National Center for Biotechnology Information (NCBI) database. A C. sinensis transcriptome database containing 127,094 unigenes has recently been obtained by high-throughput Illumina large-scale ribonucleic acid (RNA) sequencing in the Key Laboratory of Tea Biochemistry and Biotechnology, and a large number of predicted structural and regulatory genes related to the metabolism of phenolic compounds have been screened out from this database [35]. A genome-wide bioinformatic analysis of transcription factors in the biosynthesis of phenolic compounds has been published [36].
In order to predict the function of these genes, the profile of phenolic compounds in tea plant was established by liquid chromatography in tandem mass spectrometry technology, and the expression patterns of candidate genes were investigated using quantitative real-time polymerase chain reaction (qRT-PCR) and classified by clustering analysis. In addition, the metabolism of galloylated catechins was studied by in vitro enzymology analysis. The present research attempted to combine genetic and metabolic analyses and to gain insight into the biological connections between genetic and biochemical pathways.

Plant Materials
Based on the extent of growth maturity of C. sinensis, the following samples were collected from tea plants more than 3 years old during early summer from the Experimental Tea Garden, Anhui Agricultural University, Anhui, China (latitude: 31.86N, longitude: 117.27E, altitude: 20 m above mean sea level): young shoots (bud, first leaf, second leaf, third leaf, fourth leaf, young stems), and tender roots. The collected samples were immediately frozen in liquid nitrogen and stored at −80°C prior to analysis.

Extraction and Identification of Phenolic Compounds
The total phenolic compounds were extracted as follows: a 0.5 g sample (fresh leaves, young stem, or root) was ground in liquid nitrogen and extracted with 5 mL extraction solution [80% methanol:1% hydrochloric acid (HCl)] using an ultrasonic sonicator for 10 min at room temperature. After centrifugation at 4000 × g for 15 min, the residues were re-extracted twice as above, and the supernatants were filtered through a 0.22 μm membrane.
The liquid chromatography (LC)-time of flight (TOF)-mass spectrometer (MS) system used in this study consisted of a quaternary pump with a vacuum degasser, thermostated column compartment, autosampler, diode array detector (DAD), and TOF-MS from Agilent Technologies (Palo Alto, CA, USA). A Phenomenex Synergi 4 μ Fusion-RP80 column (particle size: 5 μm, length: 250 mm, and internal diameter: 4.6 mm) was used at a flow rate of 1.0 mL min⁻¹. The column oven temperature was set at 25°C. The mobile phase consisted of 1% acetic acid in water and 100% acetonitrile; the proportion of the latter increased linearly from 0 to 10% (v/v) at 10 min, to 13% at 30 min, to 16% at 65 min, to 33% at 81 min, and to 90% at 85 min, and was held at 90% until 90 min. The DAD was set at 280 nm and 340 nm for real-time monitoring of the peak intensities. Ultraviolet (UV) spectra were continuously recorded from 200 nm to 600 nm for the identification of plant components. Mass spectra were acquired simultaneously using electrospray ionization in the positive and negative ionization modes at a fragmentation voltage of 175 V over the range of m/z 100 to 2000. A drying gas flow of 12 L min⁻¹, drying gas temperature of 325°C, nebulizer pressure of 35 psi, and capillary voltage of 3500 V were used.
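The linear gradient program described for the LC-TOF-MS run can be read as a piecewise-linear function of time. The following is a minimal Python sketch of that reading, not part of the original method; the function name `percent_acetonitrile` and the explicit (0 min, 0%) starting point are assumptions introduced for illustration.

```python
from bisect import bisect_right

# (time in min, % acetonitrile v/v) breakpoints as stated in the text.
# The (0, 0) starting point is inferred from "increased linearly from 0".
LC_TOF_GRADIENT = [(0, 0), (10, 10), (30, 13), (65, 16), (81, 33), (85, 90), (90, 90)]


def percent_acetonitrile(t_min, program=LC_TOF_GRADIENT):
    """Linearly interpolate the acetonitrile fraction (% v/v) at time t_min."""
    times = [t for t, _ in program]
    if not times[0] <= t_min <= times[-1]:
        raise ValueError("time outside the gradient program")
    i = bisect_right(times, t_min) - 1  # index of the segment start
    if i == len(program) - 1:           # exactly at the final breakpoint
        return float(program[-1][1])
    (t0, b0), (t1, b1) = program[i], program[i + 1]
    return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)


if __name__ == "__main__":
    for t in (5, 20, 83, 90):
        print(t, percent_acetonitrile(t))
```

The same function could be applied to the UPLC program by passing a different list of breakpoints as `program`.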
The ultra performance liquid chromatography (UPLC)-MS/MS system used in this study consisted of a quaternary pump with a vacuum degasser, thermostated column compartment, autosampler, DAD, and triple quadrupole mass spectrometer (QQQ) from Agilent Technologies (Palo Alto, CA, USA). An Agilent ZORBAX RRHD Eclipse Plus C18 column (particle size: 1.8 μm, length: 100 mm, and internal diameter: 2.1 mm) was used at a flow rate of 0.2 mL min⁻¹. The column oven temperature was set at 40°C. The mobile phase consisted of 0.4% acetic acid in water and 100% acetonitrile; the proportion of the latter increased linearly from 0 to 7% (v/v) at 10 min, was held at 7% until 22 min, and then increased to 11% at 25 min, to 12% at 30 min, to 14% at 31 min, to 35% at 43 min, and to 80% at 47 min. Mass spectra were acquired simultaneously using electrospray ionization in the positive and negative ionization modes over the range of m/z 100 to 2000. A drying gas flow of 6 L min⁻¹, drying gas temperature of 350°C, nebulizer pressure of 45 psi, and capillary voltage of 3500 V were used.
The phenolic compounds were identified qualitatively using LC-MS by comparing the retention times (tR), wavelengths of maximum absorbance (λmax), protonated/deprotonated molecules ([M+H]⁺/[M−H]⁻), and major fragment ions with those of the authentic standards and published literature.
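Matching an observed ion against an expected protonated or deprotonated molecule comes down to a mass calculation. The sketch below, which is illustrative only, computes theoretical m/z values from an elemental formula using standard monoisotopic masses. The molecular formulas (EGCG as C22H18O11, a B-type procyanidin dimer as C30H26O12, caffeine as C8H10N4O2) are textbook values rather than data from this study, and the 10 ppm tolerance is an arbitrary assumption.

```python
import re

# Standard monoisotopic masses (u) of the most abundant isotopes.
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "O": 15.99491462, "N": 14.00307401}
PROTON = 1.00727647  # mass of a proton (u)


def monoisotopic_mass(formula):
    """Sum monoisotopic masses for a simple formula such as 'C22H18O11'."""
    mass = 0.0
    for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        mass += MONOISOTOPIC[element] * (int(count) if count else 1)
    return mass


def ion_mz(formula, mode):
    """Return m/z of [M+H]+ (mode='positive') or [M-H]- (mode='negative')."""
    m = monoisotopic_mass(formula)
    if mode == "positive":
        return m + PROTON
    if mode == "negative":
        return m - PROTON
    raise ValueError("mode must be 'positive' or 'negative'")


def matches(observed_mz, formula, mode, tol_ppm=10.0):
    """True if the observed m/z lies within tol_ppm of the theoretical value."""
    theoretical = ion_mz(formula, mode)
    return abs(observed_mz - theoretical) / theoretical * 1e6 <= tol_ppm


if __name__ == "__main__":
    print(round(ion_mz("C22H18O11", "negative"), 4))  # EGCG, [M-H]-
    print(round(ion_mz("C30H26O12", "negative"), 4))  # procyanidin dimer, [M-H]-
    print(round(ion_mz("C8H10N4O2", "positive"), 4))  # caffeine, [M+H]+
```

In practice, retention time, λmax and fragment ions would still be needed, since isomers such as EC and C share the same formula.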

Extraction and Quantitative Determination of Phenolic Compounds
MS-based multiple reaction monitoring (MRM) mode was used for simultaneous quantitation of phenolic compounds. The extraction method and MS-MRM conditions were similar to the above described qualitative analysis.
Spectrophotometric analysis of anthocyanins was performed as suggested by Pang et al. [24]. Total anthocyanin concentration was calculated using the molar absorbance of cyanidin-3-O-glucoside (XinRan Biological, Shanghai, China).
The PAs were extracted according to the methods of Martin et al. [39] and Pang et al. [24] with some modifications. The sample (0.5 g each of fresh leaves, stems, and roots) was ground in liquid nitrogen and extracted with 5 mL extraction solution (70% acetone:0.5% acetic acid) by sonication for 10 min at room temperature. The extract was centrifuged to remove the debris, and the residues were re-extracted twice as mentioned above.
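Converting an absorbance reading into a concentration with a molar absorbance is an application of the Beer-Lambert law. The sketch below shows the arithmetic only; it is not the cited protocol. The default molar absorptivity (26,900 L mol⁻¹ cm⁻¹) and molar mass (449.2 g mol⁻¹) are commonly quoted literature values for cyanidin-3-O-glucoside and are assumptions here, as are the function name and the 1 cm path length.

```python
def anthocyanin_mg_per_l(absorbance, dilution_factor=1.0,
                         molar_absorptivity=26900.0,  # L mol^-1 cm^-1 (assumed)
                         molar_mass=449.2,            # g mol^-1 (assumed)
                         path_cm=1.0):                # cuvette path length (assumed)
    """Beer-Lambert estimate in cyanidin-3-O-glucoside equivalents (mg/L)."""
    if molar_absorptivity <= 0 or path_cm <= 0:
        raise ValueError("molar_absorptivity and path_cm must be positive")
    molar_conc = absorbance / (molar_absorptivity * path_cm)   # mol/L
    return molar_conc * molar_mass * 1000.0 * dilution_factor  # mg/L


if __name__ == "__main__":
    # Hypothetical reading, not a measurement from this study.
    print(anthocyanin_mg_per_l(0.269))
```

Dividing the result by the fresh weight extracted per unit volume would express it per gram of tissue.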
For the analysis of soluble PAs, 1 mL of pooled supernatant mixed with 1 mL of water was extracted with 2 mL of chloroform. The aqueous supernatant (0.5 mL) was then added to 3 mL n-butanol-HCl (95/5 v/v) and incubated at 95°C for 1 h. The supernatant was cooled to room temperature, and the absorbances at 550 nm and 600 nm were recorded. The control treatment was performed under the same conditions without boiling. Absorbance values were converted into PA equivalents using a standard curve of procyanidin B2.
For the analysis of insoluble PAs, the residues were added to 3 mL n-butanol-HCl (95/5 v/v) and incubated at 95°C for 1 h. The detection method was similar to that used for soluble PAs. The hydrolysates were then subjected to LC-MS analysis (as described above).
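The conversion of absorbance into procyanidin B2 equivalents through a standard curve can be sketched as a least-squares line fit followed by inversion. The example below is illustrative only: the standard concentrations and absorbances are hypothetical, and it assumes a single absorbance reading corrected by the unboiled control, because the text does not state how the 550 nm and 600 nm readings were combined.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need at least two paired points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def pa_equivalents(a_heated, a_control, slope, intercept):
    """Convert control-corrected absorbance to standard equivalents."""
    corrected = a_heated - a_control  # subtract the unboiled control
    return (corrected - intercept) / slope


if __name__ == "__main__":
    # Hypothetical procyanidin B2 standards (ug/mL) and their absorbances.
    std_conc = [0.0, 25.0, 50.0, 100.0]
    std_abs = [0.0, 0.1, 0.2, 0.4]
    slope, intercept = fit_line(std_conc, std_abs)
    print(pa_equivalents(0.35, 0.05, slope, intercept))
```

The result is in the units of the standard series and would still need scaling by dilution and sample mass.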
The primer sequences are listed in Table S1. The PCR mixture contained cDNA template (approximately 0.01 μg/μL), 10 μL SYBR Green PCR Master Mix (Takara, DaLian, China), and 200 nmol L⁻¹ of each gene-specific primer in a final volume of 20 μL. qRT-PCR assays were performed using a CFX96™ optical reaction module (Bio-Rad, USA). The PCRs were performed in 96-well optical reaction plates with the following program: 95°C for 30 sec, followed by 40 cycles of 95°C for 5 sec and 60°C for 30 sec (58°C for 30 sec for root). The specificity of amplicons was verified by melting curve analysis (55°C to 95°C). Expression of messenger RNA was assessed by evaluating threshold cycle ($C_T$) values in quadruplicate reactions. Values were normalized against the expression level of the housekeeping gene glyceraldehyde-3-phosphate dehydrogenase (GAPDH). The relative expression values were evaluated by the $2^{-\Delta\Delta C_T}$ method: $\Delta C_T = C_{T,\mathrm{target}} - C_{T,\mathrm{GAPDH}}$ and $-\Delta\Delta C_T = -(\Delta C_{T,\mathrm{target}} - \Delta C_{T,\mathrm{bud}})$, where $C_{T,\mathrm{target}}$ and $C_{T,\mathrm{GAPDH}}$ were the threshold cycles of the target gene and the housekeeping gene GAPDH, respectively.
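The relative-expression calculation described here, with GAPDH as the reference gene and the bud as the calibrator sample, can be sketched in a few lines of Python. The function names and the threshold-cycle values in the example are hypothetical; averaging replicate reactions before taking the difference is one common convention and is an assumption, not something the text specifies.

```python
from statistics import mean


def delta_ct(ct_target, ct_reference):
    """Delta CT: mean CT of the target minus mean CT of the reference gene."""
    return mean(ct_target) - mean(ct_reference)


def relative_expression(ct_target_sample, ct_gapdh_sample,
                        ct_target_bud, ct_gapdh_bud):
    """Relative expression by 2^(-delta delta CT), using the bud as calibrator."""
    ddct = (delta_ct(ct_target_sample, ct_gapdh_sample)
            - delta_ct(ct_target_bud, ct_gapdh_bud))
    return 2.0 ** (-ddct)


if __name__ == "__main__":
    # Hypothetical quadruplicate CT values, not data from this study.
    leaf_target, leaf_gapdh = [24.0, 24.1, 23.9, 24.0], [18.0, 18.0, 18.1, 17.9]
    bud_target, bud_gapdh = [25.0, 25.0, 25.1, 24.9], [18.0, 18.1, 17.9, 18.0]
    print(relative_expression(leaf_target, leaf_gapdh, bud_target, bud_gapdh))
```

By construction the bud itself has a relative expression of 1, so values above 1 indicate higher expression than in the bud.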
The experiments were repeated thrice.

Clustering Method
The hierarchical clustering of the gene expression data was performed using the between-groups linkage method in SPSS Statistics 17.0 (Statistical Package for the Social Sciences).
[Table footnote: retention times refer to Figure 2 and to Figure 3 (tR2, min).]
[37,40]. All enzyme assays were conducted in phosphate buffer.
The UGGT and ECGT reaction solutions were prepared as described by Liu et al. [37]. The GCH assay solution was incubated at 30°C for 0.5 h in a total volume of 2.5 mL containing 50 mM phosphate buffer (pH 6.5), 0.2 mM EGCG or ECG, 4 mM ascorbic acid, and crude enzyme extract (0.4 mg total protein).
The above enzyme reactions were terminated by adding ethyl acetate. Each of the reaction products was extracted three times with 3 mL ethyl acetate, and the extract was evaporated, redissolved in 500 μL methanol, and used directly for the analyses of enzymatic reaction products.
The experiments were repeated thrice.

Identification of Phenolic Compounds via LC-TOF-MS and UPLC-QQQ-MS/MS
Phenolic compounds in tea plant were analyzed qualitatively by LC-TOF-MS and UPLC-QQQ-MS/MS, and they were identified (Table 1) [17,41-44]. A total of 64 metabolites were detected successfully in a single measurement, of which 50 were identified as phenolic compounds (Table 1). All the compounds were analyzed in negative ionization mode except for anthocyanidin, theanine, caffeine, theobromine, and theophylline, which were detected in positive ionization mode.
Nine phenolic acids among the 50 phenolic compounds consisted of hydroxybenzoic acid (HBA) and hydroxycinnamic acid (HCA) derivatives, and they were identified by direct comparison of data with authentic standards (Table 1, Figs. 2 and 3, Fig. S1) and literature [17]. The gallic acid (GA) derivatives, which belong to the HBA derivatives, included the following: GA (peak 11), β-glucogallin (βG, peak 10), and galloylquinic acid (GQA, peaks 1 and 13). HCA derivatives comprised p-coumaroylquinic acid (peaks 23 and 32) and caffeoylquinic acid (peak 27). Quinic acid (peaks 4 and 7) showed no useful UV-visible absorption spectrum but could be detected by MS.
The identity of eight monomers of flavan-3-ol (catechins) was confirmed by direct comparison of their absorbance spectra and tR with the authentic standards (Table 1, Figs. 2 and 3, Fig. S2). The main types were flavan-3-ols with di- or tri-hydroxyl groups in the B-ring, which included EGCG (peak 33), ECG (peak 43), EC (peak 31), EGC (peak 21), C (peak 25), and GC (peak 16). Epiafzelechin gallate (peak 19) and epiafzelechin (peak 36), which carry a single hydroxyl group in the B-ring, were present at low content. Among them, EGCG was the major characteristic phenolic compound in tea plant and is also a functional ingredient of tea beverages.
About 12 flavonol derivatives with mono- to tri-hydroxyl groups in the B-ring were detected in the fresh tea leaves (

Quantitative Determination of Phenolic Compounds via UPLC-QQQ-MS/MS
To gain insight into the biological relationship between the metabolism of phenolic compounds and gene expression, the accumulation of phenolic compounds was comprehensively measured in leaves at different developmental stages and in different organs (Table 2).
The accumulation patterns of quinic acid, GA derivatives, and HCA derivatives varied widely from the bud to the fourth leaf (Table 2). Galloylquinic acid was the predominant compound among the GA derivatives. The contents of the three gallic acid derivatives decreased remarkably with the development of leaves. For instance, the contents of β-glucogallin and galloylquinic acid in the fourth leaf fell to 10.37% and 11.69%, respectively, of those in the bud. The accumulation of HCA derivatives was found to be developmentally regulated in the leaf. The content was highest in the first leaf, followed by the bud, and was significantly reduced in the fourth leaf. GA and HCA derivatives were present only in trace amounts in both stem and root, but quinic acid was highly accumulated in the stem.
The amount of total catechins declined gradually from the bud to the fourth leaf, and decreased significantly from the leaves and stem to the root (Table 2). The contents of GC and C were low in the leaves, and those of EGC and EC gradually increased during leaf development. In contrast, the galloylated catechins EGCG and ECG were predominant in the leaves and declined gradually with the development of leaves. Their contents in the fourth leaf were reduced to 80.79% and 53.78%, respectively, of those in the bud. In terms of content and components, the catechins, especially galloylated catechins (EGCG and ECG), were abundant and diverse in both leaves and stems. On the contrary, the contents of catechins were very low in the roots. Galloylated catechins (EGCG and ECG) and catechins with tri-hydroxyl groups in the B-ring (GC and EGC) were found in trace amounts, while only EC was detected in relatively larger amounts.
Both UPLC-QQQ-MS/MS and n-butanol-HCl hydrolysis assays indicated that the content of PAs was much lower than that of the flavan-3-ol monomers and followed the same trend as EGCG and ECG in the leaves at different developmental stages (Table 2). From the bud to the fourth leaf, the content of PAs declined to 36.43% and 57.52% in the two assays, respectively.
Surprisingly, the total amount of PAs in the root was higher than in the leaves and stems. UPLC-QQQ-MS/MS findings showed that the PAs in the leaves and stems were primarily procyanidin dimers and trimers (Table 2). However, dimers (m/z 577) to pentamers (m/z 1553) of flavan-3-ol were detected in the root of tea plant (Fig. S5). The anthocyanidins released from PAs by n-butanol-HCl hydrolysis were primarily those with di-hydroxyl groups in the B-ring in the roots, whereas anthocyanidins with mono-, di-, and tri-hydroxyl groups in the B-ring were simultaneously detected in the leaves (Fig. S6).
The contents of total flavonol derivatives were higher in the first and second leaves than in the buds, and they changed dramatically in the leaves at different developmental stages (Table 2). For instance, the content of flavonol derivatives in the first leaf was almost three times that in the bud. The amount of flavonol derivatives in the leaves was markedly higher than in the stem and root. Interestingly, flavonols with tri-hydroxyl groups in the B-ring were absent in the root.
Spectrophotometric analysis showed that the anthocyanin content was very low in the leaves and decreased gradually with the development of leaves (Table 2).
Overall, the accumulation patterns of different phenolic compounds varied among organs and among leaves at different developmental stages. This suggested that different biosynthesis pathways gave rise to the phenolic compounds. These pathways were regulated along with the development of the tea plant, although most phenolic compounds shared the main trunk of the flavonoid biosynthesis pathway (Table 2, Fig. 4).

The Analyses of Gene Expression and Enzymatic Activity
Very few structural and regulator genes related to the metabolism of phenolic compounds in tea plant were available in the NCBI database. In total, 63 predicted structural and regulator genes related to the metabolism of phenolic compounds were screened out from the NCBI database and from the transcriptome database obtained by high-throughput Illumina sequencing (Table S1).
To acquire the key genes involved in the metabolism of phenolic compounds, the expression patterns of these predicted genes in the leaves at different developmental stages and different organs were investigated, and a representative hierarchical cluster of the gene expression data was performed using the between-groups linkage method (Fig. 5). Based on the similarity of their expression patterns, the 63 screened genes were classified into two main groups: C1 and C2; and four subgroups were clustered further in C2 group.
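Between-groups linkage is the SPSS name for what is commonly called average linkage (UPGMA): at each step the two clusters with the smallest mean pairwise distance between their members are merged. The sketch below illustrates that rule in plain Python; it is not the SPSS procedure itself. The Euclidean distance measure, the function name and the toy expression profiles are all assumptions, since the text does not report the distance measure used.

```python
from itertools import combinations
from math import dist


def average_linkage(profiles, n_clusters):
    """Agglomerative clustering with between-groups (average) linkage.

    profiles: dict mapping a gene name to its expression vector.
    Returns a sorted list of clusters, each a sorted list of gene names.
    """
    if not 1 <= n_clusters <= len(profiles):
        raise ValueError("n_clusters must be between 1 and the number of genes")
    clusters = [[name] for name in profiles]

    def linkage(a, b):
        # Mean of all pairwise distances between members of the two clusters.
        total = sum(dist(profiles[x], profiles[y]) for x in a for y in b)
        return total / (len(a) * len(b))

    while len(clusters) > n_clusters:
        # Find the closest pair of clusters and merge them.
        i, j = min(combinations(range(len(clusters)), 2),
                   key=lambda p: linkage(clusters[p[0]], clusters[p[1]]))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return sorted(sorted(c) for c in clusters)


if __name__ == "__main__":
    # Hypothetical log-scale profiles over (bud, leaf, stem, root).
    toy = {
        "root_high_1": [0.0, 0.0, 0.0, 5.0],
        "root_high_2": [0.0, 0.5, 0.0, 4.5],
        "bud_high_1": [5.0, 3.0, 1.0, 0.0],
        "bud_high_2": [4.5, 3.0, 1.5, 0.0],
    }
    print(average_linkage(toy, 2))
```

Cutting the merge sequence at two clusters mirrors the split into two main groups; cutting at a larger number of clusters would expose subgroups.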
The genes in group C1 had high relative expression levels in the root and low levels in the bud and leaves. For instance, the structural genes CsPAL1, CsPAL2, and Cs4CL1 were highly expressed in the root; and CsDFR2 had a low relative expression level in the bud and leaves compared with the other genes in its multi-gene family (Fig. 6). Further verification is required to understand whether these expression patterns are relevant to the high accumulation of PAs in the root (Fig. 4).
Figure 7 (caption fragment). The UGGT activity was assayed with the substrate gallic acid. The ECGT activity was assayed with the substrates EGC and β-glucogallin. The GCH activity was assayed with the substrate EGCG. doi:10.1371/journal.pone.0062315.g007
CsMYB4-5 and CsMYB4-6 in group C1 were clustered into subgroup 4 of the 27 C. sinensis R2R3-MYB subgroups, and CsbHLH2-1 and CsbHLH2-2 in group C1 were clustered into subfamily 2 of the 9 C. sinensis bHLH subfamilies in a recent study [36]. The MYB proteins of subgroup 4 were shown to be representative factors for phenolic acid metabolism and lignin biosynthesis, acting through interaction with bHLH proteins [50,51], and the bHLH proteins of subfamilies 2 and 24 were predicted to be involved in flavonoid metabolism [36]. This suggested that the high expression levels of CsMYB4-5, CsMYB4-6, CsbHLH2-1 and CsbHLH2-2 (Fig. 5) might lead to the low phenolic acid levels in the root (Fig. 4).
The expression patterns of several subgroup genes in group C2 revealed the development-dependent character (Fig. 5), which might be responsible for the difference in accumulation pattern of phenolic compounds in the leaves at different developmental stages (Fig. 4).
The genes in subgroup C2-2-1 were highly expressed in the bud, and their expression decreased significantly with the development of the leaves (Fig. 5). These observations were in accordance with the accumulation patterns of gallic acid derivatives, HCA derivatives, anthocyanidins, and galloylated catechins (Fig. 4). Some genes in this subgroup have been shown to be involved in the biosynthesis of gallic acid derivatives and galloylated catechins. For instance, CsUGT75E2 and CsUGT75E3 grouped in group L of the phylogenetic tree together with Arabidopsis UGT75B1 (data not shown); the latter has been shown to catalyze the formation of glucose esters of benzoates at the carboxyl group of the aglycone [52]. Glucose esters of aglycones (such as β-glucogallin) were considered to be biosynthetic intermediates, and the high energy of the glucose ester might drive the transfer of the aglycone to a further acceptor [37]. Although the expression patterns of CsUGT75E2 and CsUGT75E3 were similar to the pattern of UGGT activity (Figs. 5 and 7), the function of these genes remains to be identified. Likewise, CsSCPL might be involved in the galloylation of catechins in tea plant, and further confirmation is needed to determine whether ECGT is encoded by CsSCPL1 or CsSCPL3.
The expression levels of genes in group C2-2-2-1 were higher in the first and second leaves (Fig. 5), in accordance with the accumulation patterns of flavonol derivatives and nongalloylated catechins (Fig. 4). This suggested that the expression of structural genes such as CsCHS1, CsF3'5'H1, CsF3'5'H2, CsFLS2 and CsUGT78E1, and of regulator genes such as CsMYB5-1, CsMYB4-2, CsMYB7-1, CsbHLH24-5, CsbHLH24-3 and CsbHLH24-2, might be responsible for the biosynthesis of flavonol derivatives and nongalloylated catechins. Likewise, the low expression of CsF3'5'H1, CsF3'5'H2, and CsFLS2 in the root might directly lead to the lack of compounds with tri-hydroxyl groups in the B-ring, such as catechins (EGCG, EGC and GC), flavonols (myricetin), and anthocyanidins (delphinidin), in the root (Table 2, Fig. 5 and Fig. S6). Published literature showed that subgroup 7 of the MYB gene family participated in the regulation of flavonol biosynthesis [53], and subfamily 24 of the bHLH gene family was identified as a regulator of flavonoid or anthocyanin metabolism [54]. The high expression of CsMYB7-1 and of some CsbHLHs such as CsbHLH24-5, CsbHLH24-3 and CsbHLH24-2 in the tender leaf might enhance the accumulation of flavonol derivatives in association with the above structural genes (Figs. 4 and 5).
Some structural genes related to the phenylpropanoid pathway and the flavonoid synthetic pathway in group C2-2-2-2 had higher expression levels in the mature leaves. Further studies are required to understand their functions.
Notably, the hierarchical cluster analysis, which was based on the similarity of the expression patterns, did not show the actual transcriptional level of the genes. The CT value can be used to indicate the actual transcriptional level and to compare side by side the expression of the members of a multi-gene family within a sample. Different transcriptional levels were observed within some multi-gene families such as PAL1/PAL2/PAL3, 4CL1/4CL2, F3'H1/F3'H2/F3'H3, DFR1/DFR2, LAR1/LAR2, and ANR1/ANR2 (Fig. 6), which suggested that these genes belonged to multifunctional gene families.
Besides anabolism, the accumulation of phenolic compounds could be influenced by catabolism. The accumulation of nongalloylated catechins was considered to arise from the hydrolysis of galloylated catechins. The activities of enzymes involved in the metabolism of galloylated catechins were investigated in leaves at different developmental stages and in different organs (Fig. 7). During leaf development, the activity of GCH increased, in contrast to the activities of UGGT and ECGT, which led directly to an increase in nongalloylated catechins (EC and EGC) (Table 2, Figs. 4 and 7).
In addition, no obvious activities of UGGT, ECGT, and GCH were detected in the root. The lower contents of GA and βG (the substrates for galloylation of catechins), coupled with the lower activities of UGGT and ECGT, possibly accounted for the absence of the galloylated catechins (ECG and EGCG) in the roots.

Discussion
Over the past few years, flavonoids have attracted considerable attention, and numerous studies have revealed their diverse pharmacological activities and biological functions [16,22,28,55-57].
A variety of phenolic compounds occur in tea plant. The polyphenols in fresh tea leaves and other tea products include flavonols (O-glycosylated flavonols and acylated glycosylated flavonols), flavan-3-ols (catechins, methylated catechins, and PAs), phenolic acid derivatives, and flavones [18]. Simultaneous determination of polyphenols may help to elucidate the synthetic diversity and regulatory complexity of polyphenol biosynthesis in tea. HPLC with UV detection has been used extensively for the analysis of phenolic compounds, especially catechins (flavan-3-ols) [29,58-60]. However, the inability to detect some compounds simultaneously with catechins by HPLC has created the need for other analytical options. For example, quinic acid did not have an absorption spectrum in the UV-visible range. Likewise, ECG and quercetin 3-O-galactosylrutinoside could not be separated effectively on an HPLC column, and flavones and PAs were difficult to quantify precisely because of their low content. Some compounds could not be identified owing to the lack of standard substances. The MS-MRM mode could substantially improve the simultaneous determination of these phenolic compounds in tea plant. In this study, 50 polyphenols were qualitatively analyzed by means of LC-TOF-MS and UPLC-QQQ-MS/MS (Table 1, Figs. 2 and 3), and 29 phenolic compounds were qualitatively and quantitatively analyzed simultaneously by UPLC-QQQ-MS/MS (Table 2).
Several studies have focused on the tissue-specific, development-dependent, and externally stimulated (including sugar, hormones, drought, wounding, and UV-B irradiation) accumulation of phenolic compounds and the related gene expression in tea plant [43,44,61-68]. The results obtained in this study were in agreement with the previous research. The accumulation of phenolic compounds in leaves was developmentally regulated, and the contents of most phenolic compounds, such as gallic acid derivatives, HCA derivatives, galloylated catechins, PAs, and anthocyanidin, were highest in the bud or first leaf and declined gradually along with the development of leaves. Some other phenolic compounds, such as quinic acid, nongalloylated catechins, and flavonol derivatives, were very low in the bud but increased markedly along with the development of leaves.
The expression patterns of the basic genes related to the accumulation of total polyphenols at different stages of tea leaf development, and their relationship with catechin concentration, have been investigated in many studies. However, the understanding of the metabolic flux of phenolic compounds in tea plant is still scanty because only a few genes have been deposited in the NCBI database. Based on the recent C. sinensis transcriptome database, C. sinensis phylogenetic trees have been constructed for R2R3-MYB and bHLH regulatory proteins using the previous Arabidopsis data, with further classification into 27 subgroups and 32 subfamilies [36].
A hierarchical clustering analysis classified the 63 screened genes into two main groups, C1 and C2. The expression patterns of genes in the C2 subgroups were development-dependent, and the expression patterns of genes in the C2-2-1 and C2-2-2 subgroups were probably related to the accumulation of phenolic compounds in leaves. For instance, the expression of CsUGT75E2, CsUGT75E3, CsSCPL1, and CsSCPL3 was consistent with the accumulation of galloylated catechins, and these genes might be involved in the biosynthesis of galloylated catechins [37]. CsMYB7-1 in the C2-2-2-1 group was another example. The genes in subgroup 7 might participate in the regulation of flavonol biosynthesis [53,69]. The expression of CsMYB7-1 was consistent with the high accumulation of flavonol derivatives. Subfamilies 2, 5, and 24 of the bHLH gene family in plants were identified as regulators of flavonoid or anthocyanin metabolism [54]. The expression patterns of some CsbHLHs were similar to that of CsMYB7-1.
The total amount of PAs was highest in the root. However, the root lacked gallic acid derivatives, galloylated catechins, and flavonols and flavan-3-ols with tri-hydroxyl groups in the B-ring (Table 2 and Fig. 4). The CsF3'5'Hs were responsible for the tri-hydroxyl group in the B-ring of phenolic compounds; and CsUGT75E2, CsUGT75E3, CsSCPL1, and CsSCPL3 might contribute to the galloylation of catechins. Expression of these genes was absent in the root (Fig. 5). R2R3-MYBs of subgroup 4 were shown to be associated with phenolic acid metabolism and lignin biosynthesis. CsMYB4-5 and CsMYB4-6 were highly expressed in roots, which suggested that they might be closely involved in repressing phenolic acid biosynthesis.
The biosynthesis and regulation of PAs are quite well understood in A. thaliana, M. truncatula, and other model plants, and studies on the relevant regulatory genes have been carried out. MYB TFs shown to be involved in the regulation of PA biosynthesis include the following: TRANSPARENT TESTA2 from Arabidopsis thaliana [70], MYBPA1 and MYBPA2 from Vitis vinifera [71], and DkMyb4 from the fruit of Diospyros kaki [72]. Recently, it was reported that the M. truncatula PA regulator, an R2R3-MYB transcription factor of subgroup 5, acted as a key regulator of PA biosynthesis rather than anthocyanin biosynthesis; it positively regulated CHS, F3H, ANS, and ANR through a probable activation of WD40-1 [22]. MATE1, a precursor transporter involved in the biosynthesis of PAs, has been isolated and characterized from the model legume M. truncatula [23,73,74]. Nevertheless, relatively little is known about the pivotal structural gene for the polymerization of PAs from monomeric flavan-3-ols. It is not clear why PA polymerization took place in the root rather than in the leaves of tea plant. Galloylation of catechins at position 3 of the C-ring could possibly interfere with the polymerization in the leaves.
PAs are rapidly up-regulated by stresses such as pathogen infection, wounding and herbivory [75][76][77]. The high accumulation of PAs protects plants, including tea plant, from attack by microbial pathogens, insect pests and larger herbivores [78].
It is notable that the accumulation profile of the nongalloylated catechins might be influenced by both synthesis and catabolism. The level of ANR expression was not fully consistent with the accumulation of epicatechins in this study. The results of Pang showed that the recombinant ANR1 and ANR2 proteins produce EC, C, EGC and GC [79]. In our experiment (data not shown), however, using partly purified ANR enzyme from tea plant, only EC and EGC were detected, without any trace of C and GC. These results suggested that the function of the recombinant proteins may not be consistent with that of the enzyme in tea plant, or that there may be another ANR gene in tea plant. In our transgenic experiment (data not shown), the level of ANR expression was well consistent with the content of DMACA-stained compounds. In fact, the accumulation of EC is affected jointly by biosynthesis, hydrolysis of galloylated catechins [38], polymerization [24] and galloylation [37].

Conclusion
Of the 50 phenolic compounds identified, 29 were quantified based on their fragmentation behaviors. The accumulation of phenolic compounds in the tea plant was developmentally regulated in bud, leaves, and root; and the content was higher in younger leaves. The expression patterns of genes in the C2-2-1 and C2-2-2-1 groups were probably responsible for the development-dependent accumulation of phenolic compounds in the leaves. Further research may help to understand the expression patterns and functions of the various genes involved in the biosynthesis and/or metabolism of different compounds in tea plant, and the reasons for the variation in their content across growth stages and organs.