Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Transcriptome Analysis and Development of SSR Molecular Markers in Glycyrrhiza uralensis Fisch.

Abstract

Licorice is an important traditional Chinese medicine with clinical and industrial applications. Genetic resources of licorice are insufficient for analysis of molecular biology and genetic functions; as such, transcriptome sequencing must be conducted for functional characterization and development of molecular markers. In this study, transcriptome sequencing on the Illumina HiSeq 2500 sequencing platform generated a total of 5.41 Gb clean data. De novo assembly yielded a total of 46,641 unigenes. Comparison analysis using BLAST showed that the annotations of 29,614 unigenes were conserved. Further study revealed 773 genes related to biosynthesis of secondary metabolites of licorice, 40 genes involved in biosynthesis of the terpenoid backbone, and 16 genes associated with biosynthesis of glycyrrhizic acid. Analysis of unigenes larger than 1 Kb with a length of 11,702 nt presented 7,032 simple sequence repeats (SSR). Sixty-four of 69 randomly designed and synthesized SSR pairs were successfully amplified, 33 pairs of primers were polymorphism in in Glycyrrhiza uralensis Fisch., Glycyrrhiza inflata Bat., Glycyrrhiza glabra L. and Glycyrrhiza pallidiflora Maxim. This study not only presents the molecular biology data of licorice but also provides a basis for genetic diversity research and molecular marker-assisted breeding of licorice.

Introduction

Licorice is an important herbal medicine because of its high medicinal value and applications in light and food industries. In the Pharmacopoeia of the People’s Republic of China [1], three species of Glycyrrhiza, namely, G. uralensis Fisch., G. inflata Bat., and G. glabra L., are listed as authentic medicinal licorice. Licorice is mainly distributed in China, especially in the northeast and north China, as well as in northwest arid, semi-arid, and desert regions.

The active ingredients of licorice include saponins, polysaccharides, flavonoids, and triterpenes; among these content, total saponins, including glycyrrhizic acid, glycrrhetinic acid, and neoisoliquiritin, exert pharmacological effects, such as protection against hepatotoxicity and anti-inflammatory [24]. Although licorice has been widely investigated in chemical and pharmacological fields, the metabolic pathways of the active ingredients of this plant have been rarely studied. In particular, insufficient genome and transcriptome sequencing data complicate research on the metabolic pathway of glycyrrhizic acid [5, 6]. At the genome level, RNA sequencing can be used for gene screening analysis to detect gene expression and differences [7, 8]. This technology has been widely utilized for studies on medicinal plants, such as Polygonum cuspidatum, because of its high flux and repeatability, wide detection range, and quantitative accurate characteristics [9, 10].

In this study, the latest HiSeq 2500 platform was used for licorice transcriptome sequencing to completely utilize licorice genes and germplasm resources. The resulting sequence data were assembled and annotated. Genes related to glycyrrhizic acid biosynthesis of secondary metabolites were found. This research not only research glycyrrhizic acid biosynthesis of secondary metabolites, but also provides a basis for gene annotation and discovery. In addition, a large number of molecular markers for simple sequence repeats (SSR) were predicted and developed for licorice. These markers can be used for future studies on gene mapping, linkage map development, genetic diversity analysis, and marker-assisted selection breeding of G. uralensis.

Materials and Methods

1. Plant materials

Plant material was collected from a 4-year-old fresh healthy G. uralensis plant grown in a field in Beijing, China (the Beijing University of Chinese Medicine Endangered Medicinal Plant Research and Testing Base). roots, stems, and leaves were immediately stored in liquid nitrogen for analysis. In addition, Glycyrrhiza uralensis Fisch., Glycyrrhiza inflata Bat., Glycyrrhiza glabra L., Glycyrrhiza pallidiflora Maxim.were chosen to detect polymorphism of primer pairs.

2. RNA isolation and transcriptome sequencing

Total RNA was extracted from the roots, stems, and leaves by using an Ed lai kit (Ed Biological Technology Co., Ltd., Beijing; article number: RN40). Nanodrop, Qubit 2.0, and Agilent 2100 were used to determine RNA purity, concentration, and integrity, respectively. mRNA was purified and enriched from the total RNA by using poly (T) low-adsorption magnetic beads. mRNA was interrupted at a high temperature to select its suitable length. Synthesis was continued to the second cDNA chain to purify cDNA. Finally, the resulting cDNA from the mixture of roots, stems, and leaves (3:1:1) was used to construct the transcriptome sequence library. The cDNA library was enriched through PCR and subjected to Illumina HiSeq 2500 high-throughput sequencing. The transcriptome datasets are available in the NCBI Sequence Read Archive (SRA) under accession number SRX1295883.

3. De novo sequence assembly

The cDNA library was sequenced using the Illumina HiSeq 2500 system. Raw image data from sequencing were transformed by base calling into raw sequence data and defined as raw reads. Clean data were generated from the raw data through data processing, including removal of low-quality reads and adapter sequences. The clean reads were subjected to de novo assembly using the Trinity software to recover full-length transcripts across a broad range of expression levels; this technique presents sensitivity similar to genome alignment methods [11]. Transcriptome assembly was then conducted.

4. Annotation of unigenes

For functional annotation of unigenes, the sequences were compared using the following databases: Nr (), Swiss–Prot (http://www.uniprot.org/), gene ontology (GO) (http://www.geneontology.org/), Clusters of Orthologous Groups (COG) (http://www.ncbi.nlm.nih.gov/cog/), Kyoto Encyclopedia of Genes and Genomes (KEGG) (http://www.genome.jp/kegg/), and Non-redundant Nucleotide (Nt) (). Comparison was performed using the BLASTX algorithm set at an E-value ≤ 1e−5, with the annotation information of homologous genes in the library.

5. Development and detection of SSR molecular markers for licorice

Unigenes larger than 1 Kb were subjected to SSR analysis by using the MISA software (http://pgrc.ipk-gatersleben.de/misa/misa.html). Search criteria included the number of repetitions for mono-, di-, tri-, tetra-, penta-, and hexa-nucleotides, with repetition times of 10, 6, 5, 4, 3, 3, and 2. Primers for each SSR were designed using the Primer 6 software. A total of 69 primer pairs were obtained and used for amplification. Detailed information about the designed primers is shown in S1 Table. DNA for PCR amplification was extracted from different samples through cetrimonium bromide method [12]. PCR amplification was conducted as follows: denaturation at 94°C for 3 min, followed by 35 cycles of 94°C for 40 s, 55°C–60°C for 30 s, and 72°C for 60 s. Final extension was performed at 72°C for 5 min. PCR products were analyzed through electrophoresis on 2.5% agarose gels.

Results and Analysis

1. RNA sequencing and assembly of licorice transcriptome

To obtain transcriptome information of licorice, we extracted the total RNA from the roots, stems, and leaves mixed at 3:1:1 ratio and sequenced through HiSeq 2500. A total of 26,766,870 raw sequencing reads were generated. By removing the adaptors and low-quality data, we obtained 5.41 Gb clean reads, 45.22% GC, and 0.05% N. The base quality value Q30 reached 86.59%, which indicates satisfactory sequencing quality of the licorice samples. The obtained data were used for further analysis.

Sample data were merged and assembled using the Trinity software. By using the overlapping information in high-quality reads, we obtained 3,114,638 contigs with an average length of 51.00 nt and N50 length of 48 nt (Table 1). The contigs were clustered according to the similarity of the paired-end information and contigs. The clustering yielded 87,242 transcripts with an average length of 1121.92 nt in the assembled part (Table 1). Further assembly generated 46,641 unigenes (total length of 36,725,337 nt and average length of 787.40 nt) (Table 1). For transcripts, the size range of 1000–2000 nt accounted for most about 25.11% of all transcripts, and unigene with lengths 200–300 nt accounted for most about 32.78% of all unigene, the frequency distribution of transcripts and unigenes are shown in Fig 1 and Fig 2, respectively.

2. Annotation of licorice functional genes

To identify the gene function and GO classification of licorice, we annotated the unigenes through BLAST search against the non-redundant database (Nr/Nt), with a significance threshold of an e-value of 1 × 10−5. A total of 29,614 licorice unigenes were obtained through BLAST sequence comparison analysis.

GO is used to classify gene function and describe the functional attributes of genes and gene products in an organism. Among 29,614 licorice unigenes, 29,389 unigenes were got Nr annotations, accordingly, 22,244 of 29,389 unigenes with Nr annotations were annotated with GO information. The GO classification system comprises three large categories: molecular function, biological process, and cellular components, which can be further divided into 58 small categories (Fig 3). Among all unigenes with GO annotations, 45.59% belong to Biological Process, 22.70% to Cellular Component, and 27.71% to Molecular Function. In Biological Process, oxidation–reduction process (GO: 0055114) accounts for the largest proportion, followed by regulation of transcription, DNA template (GO: 0006355), and protein phosphorylation (GO: 0006468). In Cellular Component, nucleus (GO: 0005634) accounts for the largest proportion, followed by plasma membrane (GO: 0005886) and integral component of membrane (GO: 0016021). In Molecular Function, ATP binding (GO: 0005524) accounts for the largest proportion, followed by zinc-ion binding (GO: 0008270) and DNA binding (GO: 0003677).

thumbnail
Fig 3. Functional annotation of unigenes based on GO categories.

https://doi.org/10.1371/journal.pone.0143017.g003

A total of 8458 of 29389 Nr-annotated unigenes were annotated in the COG database. Among 25 COG categories, only the general function prediction (2313) accounts for the largest proportion, followed by replication recombination and repair (1108), as well as transcription (1068). About 3.97% (336) unigenes present unknown function (Fig 4) and regarded as unique genes of licorice.

thumbnail
Fig 4. Functional classification of licorice based on COG.

https://doi.org/10.1371/journal.pone.0143017.g004

The KEGG database is employed to analyze gene products in the metabolic pathway of cells and determine their functions. About 6451 unigenes are in contrast with the KEGG database, of which 178 are involved in metabolic pathways. Ribosome (448) contains the most number of unigenes, followed by the plant hormone signal transduction (248), oxidative phosphorylation (221), RNA transport (182), and starch and sucrose metabolism (180). The distribution of pathway containing more than 50 genes, based on the KEGG database, is shown in Fig 5.

thumbnail
Fig 5. Distribution of pathway based on the KEGG database.

https://doi.org/10.1371/journal.pone.0143017.g005

3. Main metabolism-related genes of licorice

Pharmacological research on licorice has mainly focused in glycyrrhizic and glycyrrhetinic acids, flavonoids, and polysaccharides [13]. Glycyrrhizic acid is an oleanane-type pentacyclic triterpene compound synthesized from mevalonic acid (MVA). Three molecules of acetyl COA condensation form 3-hydroxy-3-methyl glutaric acid-COA, resulting in the formation of MVA under the catalysis of HMG-CoA reductase. Focal phosphorylation, decarboxylation, and dehydration of the compound generate isopenteny diphosphate (IPP), which is an isomer of dimethylally diphosphate (DMAPP). The combination of IPP and DMAPP forms geranyl pyrophosphate (GPP), whereas the combination of GPP and IPP forms farnesyl diphosphate (FPP). The connected end-to-end connection squalene, namely, 2,3-oxidized squalene, generates β-amyrin under the action of the β-AS enzyme. The triterpenoid skeleton is then formed under the action of triterpenoid cyclase and through a series of reactions, such as adding oxygen to form glycyrrhizic acid [14]. Analysis of the KEGG pathways showed that 40 kinds of enzymes are involved in the terpenoid backbone biosynthesis. Moreover, annotation of 29,614 genes revealed that 16 genes participate in the synthesis of 11 kinds of enzymes, such as β-amyrin (Table 2). Nevertheless, the corresponding gene sequences of nine types of enzymes were not found. The genes corresponding to the biosynthesis of IPP and DMPPEC1.1.1.88, EC2.7.4.2, EC1.1.1.267, EC2.7.7.60, and EC5.3.3.2 were also not found. The corresponding genes of the other synthesized GPP, FPP, squalene, 2,3-oxidized squalene, and β-amyrin EC2.5.1.1, EC2.5.1.10, EC2.5.1.21, EC1.14.99.7 and 5.4.99 were also not detected.

thumbnail
Table 2. Enzymes related to glycyrrhizic acid metabolic pathways.

https://doi.org/10.1371/journal.pone.0143017.t002

4. Development and SSR locus analysis

A total of 11,702 unigenes with a length of more than 1 Kb were found in the licorice transcriptome. Unigenes present a total length of 22,739,272 bp. To develop new molecular markers, we used the MISA software (http://pgrc.ipk-gatersleben.de/misa/misa.html) to determine potential microsatellites defined as mono- to hexa-nucleotide motifs. A total of 7,032 potential SSR loci were detected. The frequency of SSRs is 60.10%, and the average distribution distance is 3,234 bp. The SSR loci in unigenes are 48,611,547. Each unigene contained more than one SSR loci. The SSR locus numbers of mono-, di-, tri-, tetra-, penta-, and hexa-nucleotide repeats are 3,394, 1,692, 1,814, 101, 19, and 12, respectively (Table 3).

To identify licorice SSRs, we analyzed 66 kinds of repeat primitives (Table 4). Mono-, di-, tri-, tetra-, penta-, and hexa-nucleotide repeat motifs present 2, 4, 10, 21, 17, and 12 types, respectively. From the frequency of occurrence of different repeat primitives, the most abundant type is A/T, accounting for 46.23% of the total SSRs, followed by AG/CT (15.84%) and AAG/CTT (6.23%) repeats. Among the di-nucleotide repeat primitive, AG/CT appears frequently, accounting for 65.84% of the di-nucleotide SSR. In the tri-nucleotide repeat primitive, AAG/CTT appears frequently, accounting for 24% of tri-nucleotide SSR.

thumbnail
Table 4. Amounts of different SSR repeat motifs in licorice.

https://doi.org/10.1371/journal.pone.0143017.t004

A total of 1,681 SSR sites were randomly selected from the SSR-containing sequences to design SSR primers with the Primer 6.0 software. Sixty-nine SSR pairs were randomly designed and synthesized. Sixty-four pairs were successfully used for PCR amplification of genomic DNA (Fig 6), whereas the five remaining pairs failed to generate PCR products at the same annealing temperatures. 53 pairs PCR products present the expected sizes and 11 pairs PCR products are larger than the expected sizes, which could be due to the fact that the PCR products contain introns. And 64 primer pairs polymorphic were detected in Glycyrrhiza uralensis Fisch., Glycyrrhiza inflata Bat., Glycyrrhiza glabra L. and Glycyrrhiza pallidiflora Maxim. by the 2.5% agarose electrophoresis analysis, the results showed that 33 pairs of primers were polymorphism in different species.

thumbnail
Fig 6. Photograph of PCR amplification results for SSR markers in licorice.

The first line is the DNA ladder. The subsequent lines are the PCR products generated using different primers.

https://doi.org/10.1371/journal.pone.0143017.g006

Discussion

1. Licorice RNA-seq technology

Many technologies have been used to analyze and quantify the transcriptome of model or non-model organisms, such as Arabidopsis, rice, radish (Raphanus sativus L.), and Haloxylon ammodendron; as such, these techniques are vital to elucidate the complexity of growth and development of organisms. For medicinal plants, organ formation and development are controlled by complex interactions among genetic and environmental factors. The transcriptome data in publicly available libraries are insufficient and limited to describe the complex mechanisms of gene expression, as well as the genetic characteristics of species. Therefore, new generation of high-throughput sequencing technologies has been used as a powerful and cost-efficient tool for research on non-model organisms [9, 15]. In this experiment, we used RNA-Seq technology and obtained 5.41 Gb of clean data and 46, 641 unigenes from the assembly of the clean data (Table 1). The N50 length of the unigenes is 1,395 nt, with an average length of 787.40 nt. The results are comparable with the obtained unigenes in the recently published transcriptome analyses of other plant species, such as H. ammodendron (N50 = 1,354 bp, average length = 728 bp) [15], Reaumuria soongorica (N50 = 1,109 bp, average length = 677 bp) [16], and radish (Raphanus sativus L.) (N50 = 1,256 bp, average length = 820 bp) [17]. Longer unigenes may be obtained because of the developed Trinity software, which is a powerful software package for de novo assembly and generates increased number of full-length transcripts [11].

In this study, 29,614 unigenes were functionally annotated, whereas 17,027 (36.51%) did not obtain functional annotations. Unigenes may contain known functions of protein sequences because they are relatively short and lack conservative functional sequence. Known genes were not matched because their sequences contain missing parts and are relatively short. Moreover, unigenes contain non-coding RNA. In this regard, sequences were not functionally annotated because of insufficient number of unigenes and limited public information database.

2. Licorice genes related to the isoprenoid biosynthesis pathway

The isoprenoid biosynthesis pathway can synthesize kinetin, gibberellic acid, carotenoid, chlorophyll, sterols, monoterpenes, terpenes, and dolichol secondary metabolites [18]. The triterpene compound of glycyrrhizic acid is synthesized in the isoprene metabolic pathway. In this study, through transcriptome sequencing and KEGG database annotation, 40 kinds of enzymes are involved in terpenoid backbone biosynthesis, and 16 genes participate in the mevalonate pathway synthesis; moreover, 11 enzymes are coded in 29,614 annotated genes of licorice. Nevertheless, nine genes related to enzyme synthesis were not found. Li [5] studied the gene expression of wild licorice for 5 years and found 18 kinds of enzyme involved in licorice saponin synthesis; of these enzymes, 16 participate in the MVA synthesis of mevalonate kinase (EC2.7.1.36) and MEP synthesis pathway DXP synthase (EC2.2.1.7), whereas two enzymes are not related to the annotated genes. This result is significantly related to the sequencing of the samples with different ages and periods; transcriptome sequencing also show the different periods at which sample genes are expressed [19]. Nine kinds of enzyme in the synthesis of licorice saponins were not found. In different periods, the content of glycyrrhizin differs but the conclusion remains controversial. Liu [20] found that cultivated licorice with different ages presents varied contents of glycyrrhizin, total flavones, and polysaccharides; moreover, the highest content of glycyrrhizin was observed in the third year of cultivation. Sun [19] showed that the content of glycyrrhizin was higher after 4 years of cultivation. In addition, the quantity of synthesized saponin differs between the wild and cultivated Glycyrrhiza; licorice grows faster under cultivated conditions than that under wild conditions, resulting in higher primary metabolite contents. Although secondary metabolites are major components of traditional Chinese medicinal materials, their accumulation is related to adversity stress and are thus beneficial for accumulation of licorice saponin [5]. Therefore, we aim to design and conduct detailed tests and analyses by using different periods and ages of licorice material in the future.

3. Characteristics of SSR molecular markers

In the analysis of the SSR polymorphism loci of the licorice transcriptome with more than 1 Kb length and 11,702 unigenes by using the MISA software, a total of 7,032 SSR loci were detected with a frequency of 60.10% and an average distribution distance of 3,234 bp. In the licorice SSR loci, the most frequent repeat type is mono-nucleotide with 3,394 (48.27%), followed by tri- and di-nucleotide repeats, with 1,814 (25.80%) and 1,692 (24.06%), respectively. This distribution frequency differs from those of most plant genomes, such as field pea, faba bean, and autotetraploid Alfalfa, in which the most abundant repeat motif is tri-nucleotide (57.7%, 61.7%, and 61.19%, respectively) [10, 21]; in Sesamum, the most abundant repeat motif is di-nucleotide repeat motifs [22]. Autotetraploid Alfalfa, field pea, faba bean, and P. cuspidatum do not have mononucleotide repeat sequences, which could be due to the different standards used in SSR search criteria [9, 10, 21, 22]. In this study, we explored the mono-nucleotide repeat motifs in licorice; during the process, a condition where the mono-nucleotide repeat is dominant was generated, which decreases the number of other nucleotide repeats.

In this study, the occurrence frequency of tri-nucleotide repeats (25.80%) is higher than the di-nucleotide repeat frequencies (24.06%). Studies on P. cuspidatum [9], autotetraploid Alfalfa [10], Asteraceae (Mikania micrantha) [23], Asteraceae (Chrysanthemum nankingense) [24], and radish (R. sativus L.) [17] demonstrated similar conclusion. The di-nucleotide repeats of other plants, namely, rubber tree [25], Sesamum [22], and blunt snout bream [26], have higher frequencies than tri-nucleotide repeat frequencies. This finding may be due to the different genetics of different species and standards used for SSR search. In licorice di-nucleotide repeat motifs, AG/CT appeared the most, accounting for 15.84% of SSR. This result is consistent with that in Sesamum [22] and radish [17]. In plants, the presence of CT repeat sequence to 5′UTRs is probably related to reverse transcription and has a significant role in gene regulation [27]. By contrast, in the licorice tri-nucleotide repeat motifs, AAG /CTT appeared most, accounting for 6.23% of the total SSR. This result is consistent with that in Sesamum [22], and radish [17]; conversely, in rubber tree [25] and Asteraceae (C. nankingense) [24], AAG/TTC and CCA/GGT were the most abundant, respectively, which could be due to the frequency used in different encoding proteins of species.

Among 69 primer pairs, 64 (92.75%) were amplified successfully. The PCR success rate was similar to that in Sesamum [22], lower than that in rubber tree [25], and higher than that reported in a previous study [10]. These results suggest that the quality of assembled unigenes were high, and SSRs identified in our study could be used for future analysis.

Supporting Information

S1 Table. Sequences of 69 primer pairs for SSR markers.

https://doi.org/10.1371/journal.pone.0143017.s001

(XLS)

Author Contributions

Conceived and designed the experiments: YL. Performed the experiments: YL PZ MS. Analyzed the data: YL MS. Contributed reagents/materials/analysis tools: JH MQ WW CL. Wrote the paper: YL.

References

  1. 1. China Pharmacopoeia Committee. Pharmacopoeia of People’s Republic of China, Vol. 1, Chemical Industry Press, Beijing, 2010, pp. 283–284.
  2. 2. Shibata S. A drug over the millennia: pharmacognosy, chemistry, and pharmacology of licorice. Yakuqaku Zasshi. 2000; 120 (10): 849–862.
  3. 3. Matsui S, Matesumoto H, Sonoda Y, Ando K, Aizu-Yokota E, Sato T, et al. Glycyrrhizin and related compounds down-regulate produntion of inflammatory chemokines IL-8 and eotaxin 1 in a human lung fibroblast cell line. Int Immunopharmacol. 2004; 4(13): 1633–1644. pmid:15454116
  4. 4. Wang B, Wang YX, Zhao HY, Zong Y, Xu JJ. Research progress of the major components and the pharmacological effect of Glycyrrhiza uralensis fisch. Journal of Jilin Medical College. 2013; 34(3): 215–218.
  5. 5. Li Y, Luo HM, Sun C, Song JY, Wu Q, Wang N, et al. EST analysis reveals putative genes involved in glycyrrhizin biosynthesis. BMC Genomics. 2010; 11: 268 pmid:20423525
  6. 6. Ramilowski Jordan A.1, Sawai Satoru, Seki Hikaru, Mochida Keiichi, Yoshida Takuhiro. et al. Glycyrrhiza uralensis transcriptome landscape and study of phytochemicals. PCP. 2013; 54(5): 697–710
  7. 7. Wang XC, Yang ZR, Wang M, Li W, Li SC. High- throughput Sequencing Technology and Its Application. China Biotechnology. 2012; 32(1): 109–114.
  8. 8. Zhang QF, Li J, Fan ZX,Yang LQ,Bu X, Application of High–Throughput Sequencing Technology in Agricultural Research. Shandong Agricultural sciences. 2013; 45(1): 137–140.
  9. 9. Hao DC, Ma P, Mu J, Chen SL, Xiao PG, Peng Y, et al. De novo characterization of the root transcriptome of a traditional Chinese medicinal plant Polygonum cuspidatum. Science China Life Science. 2012; 55(5): 452–466.
  10. 10. Liu ZP, Chen TL, Ma LC, Zhao ZG, Zhao Patrick X., Nan ZB, et al. Global Transcriptome Sequencing using the Illumina Platform and the Development of EST-SSR Markers in Autotetraploid Alfalfa. PLOS One. 2013; 8(12): e83549. pmid:24349529
  11. 11. Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011; 29(7): 644–652. pmid:21572440
  12. 12. Doyle JJ, Doyle JL. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytotchemical Bulletin. 1987; 19:11–15.
  13. 13. Gao XY, Wang WQ, Wei SL, Li WD. Review of pharmacologica leffects of Glycyrrhiza Radix and its bioactive compounds. China Journal of Chinese Materia Medica. 2009; 34(21): 2695–2700. pmid:20209894
  14. 14. Li G, Zhou CM, Jiang XL, He ZP. Research of licorice cultivation and glycyrrhizic acid biosynthesis and regulation. Journal of Chinese Medicinal Materials. 2004; 27(6): 462–465.
  15. 15. Long Y, Zhang JW, Tian XJ, Wu SS, Zhang Q, Zhang JP, et al. De novo assembly of the desert tree Haloxylon ammodendron (C. A. Mey.) based on RNA-Seq data provides insight into droughtresponse, gene discovery and marker identification. BMC Genomics. 2014; 15: 1111. pmid:25511667
  16. 16. Shi Y, Yan X, Zhao P, Yin H, Zhao X, Xiao H, et al. Transcriptomic analysis of a tertiary relict plant, extreme xerophyte Reaumuria soongorica to identify genes related to drought adaptation. PLOS One. 2013; 8(5): e63993. pmid:23717523
  17. 17. Wang SF, Wang XF, He QW, Liu XX, Xu WL, Li LB, et al. Transcriptome analysis of the roots at early and late seedling stages using Illumina paired-end sequencing and development of EST-SSR markers in radish. Plant Cell Rep. 2012; 31(8):1437–1447. pmid:22476438
  18. 18. Chappell J. Bioehemistry and molecular biology of the isoprenoid biosynthetic pathway in plants. Annu. Rev. Plant. Physiol. Plant Mol. Biol. 1995; 46: 521–547.
  19. 19. Sun L, Yu JG, Li DY, Luo XZ, Zhao CJ, Yang SL, et al. Compare of contents Study in glycyrrhizin and liguiritin, Journal of Chinese Medicinal Materials. 2001; 24(08): 550–552.
  20. 20. Liu JR, Zhao WB, Wang HY, Jiang FS, Xiang Yi, Li XY, et al. Output of Cultivated Glycyrrhizia in Different Growth Stages and Analytical Comparison cf Its Active Ingredients. Shanghai Journal of Traditional Chinese Medicine. 2004; 8 (11): 56–58.
  21. 21. Kaur Sukhjiwan, Pembleton Luke W, Cogan Noel Ol, Savin KeithW, Leonforte Tony, Paull Jeffrey, et al. Transcriptome sequencing of field pea and faba bean for discovery and validation Of SSR genetic marker. BMC Genomics. 2012; 13: 104. pmid:22433453
  22. 22. Wei WL, Qi XQ, Wang LiH, Zhang YX, Hua W, Li DH, et al. Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers. BMC Genomics. 2011; 12: 451. pmid:21929789
  23. 23. Yan YB, Huang YL, Fang XT, Lu L, Zhou RC, Ge XJ, et al. Development and characterization of EST-SSR markers in the invasive weed Mikania micrantha (Asteraceae). American Journal of Botany. 2011; e1–e3. pmid:21613074
  24. 24. Wang HB, Jiang JF, Chen SM, Qi XY, Peng H, Li PR, et al. Next-Generation Sequencing of the Chrysanthemum nankingense (Asteraceae) Transcriptome Permits Large-Scale unigene Assembly and SSR Marker Discovery. PLOS One. 2013; 8(4): e62293. pmid:23626799
  25. 25. Li DJ, Deng Z, Qin B, Liu XH, Men ZH. De novo assembly and characterizationg of bark transcriptome using Illumina sequencing and development of EST-SSR marker in rubber tree (Hevea brasiliensis Muell. Arg.) BMC Genomics. 2012; 13:192. pmid:22607098
  26. 26. Gao ZX, Luo W, Liu H, Zeng C, Liu XL, Yi SK, et al. Transcriptome Analysis and SSR/SNP Markers Information of the Blunt Snout Bream (Megalobrama amblycephala). PLOS One. 2012; 7(8): e42637. pmid:22880060
  27. 27. Martienssen R A, Colot V: DNA methylation and epigenetic inheritance in plants and filamentous fungi. Science. 2001; 293: 1070–1074. pmid:11498574