Transcriptome Analysis and Development of SSR Molecular Markers in Glycyrrhiza uralensis Fisch.

Yaling Liu; Pengfei Zhang; Meiling Song; Junling Hou; Mei Qing; Wenquan Wang; Chunsheng Liu

doi:10.1371/journal.pone.0143017

Abstract

Licorice is an important traditional Chinese medicine with clinical and industrial applications. Genetic resources of licorice are insufficient for studies of its molecular biology and gene function; transcriptome sequencing is therefore needed for functional characterization and development of molecular markers. In this study, transcriptome sequencing on the Illumina HiSeq 2500 platform generated a total of 5.41 Gb of clean data. De novo assembly yielded a total of 46,641 unigenes. Comparison analysis using BLAST annotated 29,614 unigenes. Further study revealed 773 genes related to biosynthesis of secondary metabolites of licorice, 40 genes involved in biosynthesis of the terpenoid backbone, and 16 genes associated with biosynthesis of glycyrrhizic acid. Analysis of the 11,702 unigenes longer than 1 kb revealed 7,032 simple sequence repeats (SSRs). Sixty-four of 69 randomly designed and synthesized SSR primer pairs amplified successfully, and 33 primer pairs were polymorphic among Glycyrrhiza uralensis Fisch., Glycyrrhiza inflata Bat., Glycyrrhiza glabra L. and Glycyrrhiza pallidiflora Maxim. This study not only presents molecular biology data for licorice but also provides a basis for genetic diversity research and molecular marker-assisted breeding of licorice.

Citation: Liu Y, Zhang P, Song M, Hou J, Qing M, Wang W, et al. (2015) Transcriptome Analysis and Development of SSR Molecular Markers in Glycyrrhiza uralensis Fisch. PLoS ONE 10(11): e0143017. https://doi.org/10.1371/journal.pone.0143017

Editor: Shilin Chen, Chinese Academy of Medical Sciences, Peking Union Medical College, CHINA

Received: May 3, 2015; Accepted: October 29, 2015; Published: November 16, 2015

Copyright: © 2015 Liu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The transcriptome datasets are available in the NCBI Sequence Read Archive (SRA) under accession number SRX1295883.

Funding: This study was supported by the Chinese herbal medicine standardization production technology service platform [Ministry of Consumption (2011) 340], Wang WQ; the National Natural Science Foundation of China (31400285), Liu YL; National Ministry of Science and Technology support project – Optimization of licorice standardization planting base in north China and series products of comprehensive exploitation and research (2011BAI07B02), Qing M; the Inner Mongolia autonomous region science and technology innovation to guide the bounty program – Quality evaluation of wild and cultivated radix glycyrrhizae in Inner Mongolia region (65861161449), Qing M.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Licorice is an important herbal medicine because of its high medicinal value and applications in light and food industries. In the Pharmacopoeia of the People’s Republic of China [1], three species of Glycyrrhiza, namely, G. uralensis Fisch., G. inflata Bat., and G. glabra L., are listed as authentic medicinal licorice. Licorice is mainly distributed in China, especially in the northeast and north China, as well as in northwest arid, semi-arid, and desert regions.

The active ingredients of licorice include saponins, polysaccharides, flavonoids, and triterpenes; among these, total saponins, including glycyrrhizic acid, glycyrrhetinic acid, and neoisoliquiritin, exert pharmacological effects, such as protection against hepatotoxicity and anti-inflammatory activity [2–4]. Although licorice has been widely investigated in chemical and pharmacological fields, the metabolic pathways of the active ingredients of this plant have been rarely studied. In particular, insufficient genome and transcriptome sequencing data complicate research on the metabolic pathway of glycyrrhizic acid [5, 6]. At the genome level, RNA sequencing can be used for gene screening and for detecting gene expression and expression differences [7, 8]. This technology has been widely utilized for studies on medicinal plants, such as Polygonum cuspidatum, because of its high throughput, repeatability, wide detection range, and quantitative accuracy [9, 10].

In this study, the latest HiSeq 2500 platform was used for licorice transcriptome sequencing to make full use of licorice genes and germplasm resources. The resulting sequence data were assembled and annotated. Genes related to the biosynthesis of secondary metabolites, including glycyrrhizic acid, were identified. This research not only examines the biosynthesis of glycyrrhizic acid but also provides a basis for gene annotation and discovery. In addition, a large number of simple sequence repeat (SSR) molecular markers were predicted and developed for licorice. These markers can be used for future studies on gene mapping, linkage map development, genetic diversity analysis, and marker-assisted selection breeding of G. uralensis.

Materials and Methods

1. Plant materials

Plant material was collected from a fresh, healthy 4-year-old G. uralensis plant grown in a field in Beijing, China (the Beijing University of Chinese Medicine Endangered Medicinal Plant Research and Testing Base). Roots, stems, and leaves were immediately stored in liquid nitrogen for analysis. In addition, Glycyrrhiza uralensis Fisch., Glycyrrhiza inflata Bat., Glycyrrhiza glabra L., and Glycyrrhiza pallidiflora Maxim. were chosen to test the polymorphism of the primer pairs.

2. RNA isolation and transcriptome sequencing

Total RNA was extracted from the roots, stems, and leaves by using an Ed lai kit (Ed Biological Technology Co., Ltd., Beijing; article number: RN40). Nanodrop, Qubit 2.0, and Agilent 2100 were used to determine RNA purity, concentration, and integrity, respectively. mRNA was purified and enriched from the total RNA by using poly (T) low-adsorption magnetic beads. The mRNA was then fragmented at high temperature to obtain fragments of suitable length. cDNA synthesis was carried through to the second strand, and the resulting cDNA was purified. Finally, the cDNA from the mixture of roots, stems, and leaves (3:1:1) was used to construct the transcriptome sequencing library. The cDNA library was enriched through PCR and subjected to Illumina HiSeq 2500 high-throughput sequencing. The transcriptome datasets are available in the NCBI Sequence Read Archive (SRA) under accession number SRX1295883.

3. De novo sequence assembly

The cDNA library was sequenced using the Illumina HiSeq 2500 system. Raw image data from sequencing were transformed by base calling into raw sequence data and defined as raw reads. Clean data were generated from the raw data through data processing, including removal of low-quality reads and adapter sequences. The clean reads were subjected to de novo assembly using the Trinity software to recover full-length transcripts across a broad range of expression levels; this technique presents sensitivity similar to genome alignment methods [11]. Transcriptome assembly was then conducted.

4. Annotation of unigenes

For functional annotation of unigenes, the sequences were compared against the following databases: Nr, Swiss–Prot (http://www.uniprot.org/), gene ontology (GO) (http://www.geneontology.org/), Clusters of Orthologous Groups (COG) (http://www.ncbi.nlm.nih.gov/cog/), Kyoto Encyclopedia of Genes and Genomes (KEGG) (http://www.genome.jp/kegg/), and Non-redundant Nucleotide (Nt). Comparison was performed using the BLASTX algorithm with an E-value threshold of ≤ 1 × 10⁻⁵, and each unigene was assigned the annotation information of its homologous genes in the databases.
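The E-value filtering step described above can be sketched as follows. This is a minimal illustration, assuming the BLAST results are available in the standard 12-column tabular format (`-outfmt 6`); the function name `best_hits` and the example records are hypothetical and are not taken from the study.

```python
import csv
import io

# Standard column order of BLAST tabular output (-outfmt 6).
BLAST6_COLUMNS = [
    "qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
    "qstart", "qend", "sstart", "send", "evalue", "bitscore",
]

def best_hits(handle, max_evalue=1e-5):
    """Keep, for each query, the highest-scoring hit with evalue <= max_evalue."""
    best = {}
    reader = csv.DictReader(handle, fieldnames=BLAST6_COLUMNS, delimiter="\t")
    for row in reader:
        if float(row["evalue"]) > max_evalue:
            continue  # hit is not significant at the chosen threshold
        current = best.get(row["qseqid"])
        if current is None or float(row["bitscore"]) > float(current["bitscore"]):
            best[row["qseqid"]] = row
    return best

# Made-up example records: two hits for unigene_1, one weak hit for unigene_2.
example = (
    "unigene_1\tprotA\t90.0\t100\t10\t0\t1\t300\t1\t100\t1e-30\t200\n"
    "unigene_1\tprotB\t80.0\t100\t20\t0\t1\t300\t1\t100\t1e-10\t90\n"
    "unigene_2\tprotC\t40.0\t50\t30\t0\t1\t150\t1\t50\t0.5\t25\n"
)
hits = best_hits(io.StringIO(example))
print(sorted(hits))                # unigenes that received an annotation
print(hits["unigene_1"]["sseqid"])  # subject used to annotate unigene_1
```

In this sketch a unigene counts as annotated only if at least one hit passes the threshold, so `unigene_2` is left without annotation.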

5. Development and detection of SSR molecular markers for licorice

Unigenes longer than 1 kb were subjected to SSR analysis by using the MISA software (http://pgrc.ipk-gatersleben.de/misa/misa.html). Search criteria included the number of repetitions for mono-, di-, tri-, tetra-, penta-, and hexa-nucleotides, with repetition times of 10, 6, 5, 4, 3, 3, and 2. Primers for each SSR were designed using the Primer 6 software. A total of 69 primer pairs were obtained and used for amplification. Detailed information about the designed primers is shown in S1 Table. DNA for PCR amplification was extracted from the different samples by the cetyltrimethylammonium bromide (CTAB) method [12]. PCR amplification was conducted as follows: denaturation at 94°C for 3 min, followed by 35 cycles of 94°C for 40 s, 55°C–60°C for 30 s, and 72°C for 60 s. Final extension was performed at 72°C for 5 min. PCR products were analyzed through electrophoresis on 2.5% agarose gels.
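To make the idea of a repeat-number search criterion concrete, the following sketch scans a sequence for perfect SSRs. It is not the MISA program, and the minimum repeat numbers in `MIN_REPEATS` are illustrative placeholders rather than the settings used in this study; all names are hypothetical.

```python
# Illustrative minimum repeat numbers per motif length (assumed values).
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}

def is_primitive(motif):
    """True if the motif is not itself a repetition of a shorter unit (e.g. 'AGAG')."""
    n = len(motif)
    return not any(n % d == 0 and motif[:d] * (n // d) == motif for d in range(1, n))

def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return (start, motif_length, motif, repeat_count) for each perfect SSR found."""
    seq = seq.upper()
    results = []
    for k, min_rep in sorted(min_repeats.items()):
        i = 0
        while i + k <= len(seq):
            motif = seq[i:i + k]
            if not set(motif) <= set("ACGT") or not is_primitive(motif):
                i += 1
                continue
            reps = 1
            # Count how many consecutive copies of the motif follow position i.
            while seq[i + reps * k:i + (reps + 1) * k] == motif:
                reps += 1
            if reps >= min_rep:
                results.append((i, k, motif, reps))
                i += reps * k  # jump past the repeat just reported
            else:
                i += 1
    return results

# Synthetic test sequence containing one mono-, one di-, and one tri-nucleotide SSR.
demo = "GGC" + "A" * 12 + "CC" + "AG" * 7 + "TTC" + "AAG" * 5 + "C"
for start, k, motif, reps in find_ssrs(demo):
    print(start, motif, reps)
```

Skipping non-primitive motifs keeps a long mono-nucleotide run from also being reported as a di- or tri-nucleotide repeat.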

Results and Analysis

1. RNA sequencing and assembly of licorice transcriptome

To obtain transcriptome information for licorice, we extracted total RNA from the roots, stems, and leaves, mixed the samples at a 3:1:1 ratio, and sequenced them on the HiSeq 2500. A total of 26,766,870 raw sequencing reads were generated. After removing the adaptors and low-quality data, we obtained 5.41 Gb of clean data with a GC content of 45.22% and an N content of 0.05%. The base quality value Q30 reached 86.59%, which indicates satisfactory sequencing quality of the licorice samples. The obtained data were used for further analysis.
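As a rough illustration of how such quality figures are derived, the sketch below computes GC content, N content, and the Q30 percentage from read sequences and their quality strings. It assumes Phred+33 quality encoding and uses hypothetical function names and toy reads; it is not the pipeline used in this study.

```python
def base_composition(reads):
    """Return (GC %, N %) over all bases in the given read sequences."""
    total = sum(len(r) for r in reads)
    gc = sum(r.upper().count("G") + r.upper().count("C") for r in reads)
    n = sum(r.upper().count("N") for r in reads)
    return 100.0 * gc / total, 100.0 * n / total

def q30_percent(quality_strings, offset=33):
    """Percentage of bases whose Phred quality is at least 30 (Phred+33 assumed)."""
    total = sum(len(q) for q in quality_strings)
    high = sum(1 for q in quality_strings for ch in q if ord(ch) - offset >= 30)
    return 100.0 * high / total

# Toy data: 'I' encodes Q40, '?' encodes Q30, and '5' encodes Q20 under Phred+33.
reads = ["ACGT", "GGCN"]
quals = ["II55", "??55"]
gc, n = base_composition(reads)
q30 = q30_percent(quals)
print(gc, n, q30)
```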

Sample data were merged and assembled using the Trinity software. By using the overlapping information in high-quality reads, we obtained 3,114,638 contigs with an average length of 51.00 nt and N50 length of 48 nt (Table 1). The contigs were clustered according to the paired-end information and the similarity among contigs. The clustering yielded 87,242 transcripts with an average length of 1121.92 nt (Table 1). Further assembly generated 46,641 unigenes (total length of 36,725,337 nt and average length of 787.40 nt) (Table 1). Among transcripts, the 1000–2000 nt size class was the most abundant, accounting for about 25.11% of all transcripts; among unigenes, the 200–300 nt size class was the most abundant, accounting for about 32.78% of all unigenes. The frequency distributions of transcripts and unigenes are shown in Fig 1 and Fig 2, respectively.
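The summary statistics used here can be illustrated with a short sketch. N50 is taken in its usual sense: the length L such that sequences of length L or longer contain at least half of the total assembled bases. The function name and the toy length list are hypothetical; only the unigene totals are taken from the text.

```python
def n50(lengths):
    """Length L such that sequences of length >= L cover at least half of all bases."""
    if not lengths:
        raise ValueError("lengths must not be empty")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length

# Toy example: total = 54, and the three longest sequences (10 + 9 + 8) reach half.
toy_lengths = [2, 3, 4, 5, 6, 7, 8, 9, 10]
toy_n50 = n50(toy_lengths)

# Average unigene length recomputed from the totals reported in the text.
mean_unigene_length = 36_725_337 / 46_641
print(toy_n50, round(mean_unigene_length, 2))
```

Because N50 weights sequences by their length, it is less affected by the many short sequences in an assembly than the plain average is.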

Fig 1. Frequency distribution of transcripts.

https://doi.org/10.1371/journal.pone.0143017.g001

Fig 2. Frequency distribution of unigenes.

https://doi.org/10.1371/journal.pone.0143017.g002

Table 1. Assembly results.

https://doi.org/10.1371/journal.pone.0143017.t001

2. Annotation of licorice functional genes

To identify the gene function and GO classification of licorice, we annotated the unigenes through BLAST search against the non-redundant databases (Nr/Nt), with a significance threshold of an E-value of 1 × 10⁻⁵. A total of 29,614 licorice unigenes were annotated through BLAST sequence comparison analysis.

GO is used to classify gene function and describe the functional attributes of genes and gene products in an organism. Among the 29,614 annotated licorice unigenes, 29,389 received Nr annotations, and 22,244 of these were further annotated with GO information. The GO classification system comprises three large categories, namely molecular function, biological process, and cellular component, which can be further divided into 58 small categories (Fig 3). Among all unigenes with GO annotations, 45.59% belong to Biological Process, 22.70% to Cellular Component, and 27.71% to Molecular Function. In Biological Process, oxidation–reduction process (GO: 0055114) accounts for the largest proportion, followed by regulation of transcription, DNA-templated (GO: 0006355), and protein phosphorylation (GO: 0006468). In Cellular Component, nucleus (GO: 0005634) accounts for the largest proportion, followed by plasma membrane (GO: 0005886) and integral component of membrane (GO: 0016021). In Molecular Function, ATP binding (GO: 0005524) accounts for the largest proportion, followed by zinc ion binding (GO: 0008270) and DNA binding (GO: 0003677).

Fig 3. Functional annotation of unigenes based on GO categories.

https://doi.org/10.1371/journal.pone.0143017.g003

A total of 8,458 of the 29,389 Nr-annotated unigenes were annotated in the COG database. Among the 25 COG categories, general function prediction only (2,313) accounts for the largest proportion, followed by replication, recombination and repair (1,108) and transcription (1,068). About 3.97% (336) of these unigenes have unknown function (Fig 4) and are regarded as unique genes of licorice.

Fig 4. Functional classification of licorice based on COG.

https://doi.org/10.1371/journal.pone.0143017.g004

The KEGG database is employed to analyze gene products in the metabolic pathways of cells and determine their functions. A total of 6,451 unigenes were matched against the KEGG database, of which 178 are involved in metabolic pathways. Ribosome (448) contains the largest number of unigenes, followed by plant hormone signal transduction (248), oxidative phosphorylation (221), RNA transport (182), and starch and sucrose metabolism (180). The distribution of pathways containing more than 50 genes, based on the KEGG database, is shown in Fig 5.

Fig 5. Distribution of pathway based on the KEGG database.

https://doi.org/10.1371/journal.pone.0143017.g005

3. Main metabolism-related genes of licorice

Pharmacological research on licorice has mainly focused on glycyrrhizic and glycyrrhetinic acids, flavonoids, and polysaccharides [13]. Glycyrrhizic acid is an oleanane-type pentacyclic triterpene compound synthesized from mevalonic acid (MVA). Condensation of three molecules of acetyl-CoA forms 3-hydroxy-3-methylglutaryl-CoA (HMG-CoA), which is converted to MVA under the catalysis of HMG-CoA reductase. Pyrophosphorylation, decarboxylation, and dehydration of MVA generate isopentenyl diphosphate (IPP), which is an isomer of dimethylallyl diphosphate (DMAPP). The combination of IPP and DMAPP forms geranyl pyrophosphate (GPP), whereas the combination of GPP and IPP forms farnesyl diphosphate (FPP). Two FPP molecules are connected end to end to form squalene, which is oxidized to 2,3-oxidosqualene; the latter generates β-amyrin under the action of the β-AS enzyme. The triterpenoid skeleton is formed under the action of triterpenoid cyclase, and a series of subsequent reactions, such as oxidation, yields glycyrrhizic acid [14]. Analysis of the KEGG pathways showed that 40 kinds of enzymes are involved in terpenoid backbone biosynthesis. Moreover, annotation of the 29,614 genes revealed that 16 genes participate in the synthesis of 11 kinds of enzymes in the pathway leading to β-amyrin (Table 2). Nevertheless, the corresponding gene sequences of nine types of enzymes were not found. The genes corresponding to the biosynthesis of IPP and DMAPP (EC 1.1.1.88, EC 2.7.4.2, EC 1.1.1.267, EC 2.7.7.60, and EC 5.3.3.2) were not found. The corresponding genes for the synthesis of GPP, FPP, squalene, 2,3-oxidosqualene, and β-amyrin (EC 2.5.1.1, EC 2.5.1.10, EC 2.5.1.21, EC 1.14.99.7, and EC 5.4.99) were also not detected.

Table 2. Enzymes related to glycyrrhizic acid metabolic pathways.

https://doi.org/10.1371/journal.pone.0143017.t002

4. Development and SSR locus analysis

A total of 11,702 unigenes longer than 1 kb were found in the licorice transcriptome. These unigenes have a total length of 22,739,272 bp. To develop new molecular markers, we used the MISA software (http://pgrc.ipk-gatersleben.de/misa/misa.html) to identify potential microsatellites, defined as mono- to hexa-nucleotide motifs. A total of 7,032 potential SSR loci were detected. The frequency of SSRs is 60.10%, and the average distribution distance is 3,234 bp. The SSR loci in unigenes are 48,611,547. Each unigene contained more than one SSR loci. The numbers of mono-, di-, tri-, tetra-, penta-, and hexa-nucleotide repeat loci are 3,394, 1,692, 1,814, 101, 19, and 12, respectively (Table 3).
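The summary figures above can be recomputed from the reported counts. The sketch below assumes that the SSR frequency is the number of SSR loci divided by the number of unigenes examined and that the average distribution distance is the total examined length divided by the number of SSR loci; these definitions are inferred from the numbers and are not stated explicitly in the text.

```python
# Counts reported in the text (Table 3).
unigenes_examined = 11_702
total_length_bp = 22_739_272
ssr_counts = {
    "mono": 3394, "di": 1692, "tri": 1814,
    "tetra": 101, "penta": 19, "hexa": 12,
}

total_ssrs = sum(ssr_counts.values())
frequency_percent = 100.0 * total_ssrs / unigenes_examined  # SSR loci per 100 unigenes
average_distance_bp = total_length_bp / total_ssrs          # one SSR per this many bp
shares = {k: round(100.0 * v / total_ssrs, 2) for k, v in ssr_counts.items()}

print(total_ssrs)
print(round(frequency_percent, 2), round(average_distance_bp))
print(shares)
```

Under these assumed definitions the recomputed values agree with the reported ones to within rounding.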

Table 3. Numbers of SSR repeat types in licorice.

https://doi.org/10.1371/journal.pone.0143017.t003

To characterize licorice SSRs, we analyzed 66 kinds of repeat motifs (Table 4). Mono-, di-, tri-, tetra-, penta-, and hexa-nucleotide repeats present 2, 4, 10, 21, 17, and 12 motif types, respectively. In terms of frequency of occurrence, the most abundant motif is A/T, accounting for 46.23% of the total SSRs, followed by AG/CT (15.84%) and AAG/CTT (6.23%). Among the di-nucleotide repeat motifs, AG/CT is the most frequent, accounting for 65.84% of the di-nucleotide SSRs. Among the tri-nucleotide repeat motifs, AAG/CTT is the most frequent, accounting for 24% of the tri-nucleotide SSRs.
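Motif names such as AG/CT suggest that motifs are grouped together with their reverse complements. The sketch below illustrates one common way to do this, treating all cyclic rotations of a motif and of its reverse complement as the same type; this grouping rule is an assumption for illustration and the function names are hypothetical.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(motif):
    return motif.translate(COMPLEMENT)[::-1]

def is_primitive(motif):
    """True if the motif is not a repetition of a shorter unit (e.g. 'AA' or 'AGAG')."""
    n = len(motif)
    return not any(n % d == 0 and motif[:d] * (n // d) == motif for d in range(1, n))

def canonical(motif):
    """Alphabetically smallest rotation of the motif or of its reverse complement."""
    candidates = []
    for s in (motif, reverse_complement(motif)):
        candidates.extend(s[i:] + s[:i] for i in range(len(s)))
    return min(candidates)

def motif_classes(k):
    """All distinct motif types of length k under this grouping."""
    motifs = ("".join(p) for p in product("ACGT", repeat=k))
    return {canonical(m) for m in motifs if is_primitive(m)}

print(canonical("CT"), canonical("CTT"))
print([len(motif_classes(k)) for k in (1, 2, 3)])
```

Under this rule there are at most 2, 4, and 10 possible mono-, di-, and tri-nucleotide types, which matches the numbers of types reported above for these three classes.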

Table 4. Amounts of different SSR repeat motifs in licorice.

https://doi.org/10.1371/journal.pone.0143017.t004

A total of 1,681 SSR sites were randomly selected from the SSR-containing sequences to design SSR primers with the Primer 6.0 software. Sixty-nine SSR primer pairs were randomly selected and synthesized. Sixty-four pairs successfully amplified genomic DNA (Fig 6), whereas the five remaining pairs failed to generate PCR products at the same annealing temperatures. For 53 pairs, the PCR products were of the expected size; for 11 pairs, the products were larger than expected, possibly because the amplified regions contain introns. The 64 primer pairs were then tested for polymorphism in Glycyrrhiza uralensis Fisch., Glycyrrhiza inflata Bat., Glycyrrhiza glabra L. and Glycyrrhiza pallidiflora Maxim. by electrophoresis on 2.5% agarose gels; 33 primer pairs were polymorphic among the species.

Fig 6. Photograph of PCR amplification results for SSR markers in licorice.

The first lane is the DNA ladder. The subsequent lanes are the PCR products generated using different primers.

https://doi.org/10.1371/journal.pone.0143017.g006

Discussion

1. Licorice RNA-seq technology

Many technologies have been used to analyze and quantify the transcriptomes of model and non-model organisms, such as Arabidopsis, rice, radish (Raphanus sativus L.), and Haloxylon ammodendron; these techniques are vital for elucidating the complexity of growth and development of organisms. For medicinal plants, organ formation and development are controlled by complex interactions among genetic and environmental factors. The transcriptome data in publicly available libraries are insufficient to describe the complex mechanisms of gene expression and the genetic characteristics of species. Therefore, a new generation of high-throughput sequencing technologies has been used as a powerful and cost-efficient tool for research on non-model organisms [9, 15]. In this experiment, we used RNA-Seq technology and obtained 5.41 Gb of clean data, from which 46,641 unigenes were assembled (Table 1). The N50 length of the unigenes is 1,395 nt, with an average length of 787.40 nt. These results are comparable with the unigenes obtained in recently published transcriptome analyses of other plant species, such as H. ammodendron (N50 = 1,354 bp, average length = 728 bp) [15], Reaumuria soongorica (N50 = 1,109 bp, average length = 677 bp) [16], and radish (Raphanus sativus L.) (N50 = 1,256 bp, average length = 820 bp) [17]. The longer unigenes may be attributable to the Trinity software, a powerful package for de novo assembly that generates an increased number of full-length transcripts [11].

In this study, 29,614 unigenes were functionally annotated, whereas 17,027 (36.51%) did not obtain functional annotations. Some unigenes may not match protein sequences of known function because they are relatively short and lack conserved functional sequences. Known genes may also remain unmatched when the assembled sequences are incomplete and relatively short. Moreover, some unigenes may be non-coding RNAs. In this regard, sequences were not functionally annotated because of the limitations of the unigenes themselves and the limited information in public databases.

2. Licorice genes related to the isoprenoid biosynthesis pathway

The isoprenoid biosynthesis pathway can synthesize kinetin, gibberellic acid, carotenoids, chlorophyll, sterols, monoterpenes, terpenes, and dolichol secondary metabolites [18]. The triterpene compound glycyrrhizic acid is synthesized through the isoprenoid metabolic pathway. In this study, transcriptome sequencing and KEGG database annotation showed that 40 kinds of enzymes are involved in terpenoid backbone biosynthesis and that 16 genes participate in the mevalonate pathway; moreover, 11 enzymes are encoded among the 29,614 annotated genes of licorice. Nevertheless, genes for nine enzymes were not found. Li [5] studied the gene expression of wild licorice for 5 years and found 18 kinds of enzymes involved in licorice saponin synthesis; of these enzymes, 16 participate in the MVA synthesis pathway, with mevalonate kinase (EC 2.7.1.36), and the MEP synthesis pathway, with DXP synthase (EC 2.2.1.7), whereas two enzymes are not related to the annotated genes. This difference is closely related to the different ages and sampling periods of the sequenced samples, because transcriptome sequencing reflects the genes expressed in a sample at a particular period [19]. In the present study, nine kinds of enzymes in the synthesis of licorice saponins were not found. The content of glycyrrhizin differs among growth periods, but the conclusions remain controversial. Liu [20] found that cultivated licorice of different ages presents varied contents of glycyrrhizin, total flavones, and polysaccharides; moreover, the highest content of glycyrrhizin was observed in the third year of cultivation. Sun [19] showed that the content of glycyrrhizin was higher after 4 years of cultivation. In addition, the quantity of synthesized saponin differs between wild and cultivated Glycyrrhiza; licorice grows faster under cultivated conditions than under wild conditions, resulting in higher primary metabolite contents. Although secondary metabolites are major components of traditional Chinese medicinal materials, their accumulation is related to adversity stress, and stress conditions are thus beneficial for the accumulation of licorice saponin [5]. Therefore, we aim to design and conduct detailed tests and analyses by using licorice material of different periods and ages in the future.

3. Characteristics of SSR molecular markers

In the MISA analysis of the 11,702 licorice unigenes longer than 1 kb, a total of 7,032 SSR loci were detected, with a frequency of 60.10% and an average distribution distance of 3,234 bp. Among the licorice SSR loci, the most frequent repeat type is mono-nucleotide, with 3,394 loci (48.27%), followed by tri- and di-nucleotide repeats, with 1,814 (25.80%) and 1,692 (24.06%), respectively. This distribution differs from those of most plant genomes, such as field pea, faba bean, and autotetraploid alfalfa, in which the most abundant repeat motif is tri-nucleotide (57.7%, 61.7%, and 61.19%, respectively) [10, 21]; in Sesamum, the most abundant repeat motif is di-nucleotide [22]. Mono-nucleotide repeats were not reported for autotetraploid alfalfa, field pea, faba bean, and P. cuspidatum, which could be due to the different SSR search criteria used [9, 10, 21, 22]. In this study, mono-nucleotide repeats were included in the search; as a result, mono-nucleotide repeats became the dominant type, which lowers the proportions of the other repeat types.

In this study, the frequency of tri-nucleotide repeats (25.80%) is higher than that of di-nucleotide repeats (24.06%). Studies on P. cuspidatum [9], autotetraploid alfalfa [10], Asteraceae (Mikania micrantha) [23], Asteraceae (Chrysanthemum nankingense) [24], and radish (R. sativus L.) [17] reached a similar conclusion. In other species, namely, rubber tree [25], Sesamum [22], and blunt snout bream [26], di-nucleotide repeats are more frequent than tri-nucleotide repeats. This difference may be due to the different genetics of the species and the different criteria used for the SSR search. Among the licorice di-nucleotide repeat motifs, AG/CT is the most frequent, accounting for 15.84% of the total SSRs. This result is consistent with those in Sesamum [22] and radish [17]. In plants, the presence of CT repeat sequences in 5′ UTRs is probably related to reverse transcription and has a significant role in gene regulation [27]. Among the licorice tri-nucleotide repeat motifs, AAG/CTT is the most frequent, accounting for 6.23% of the total SSRs. This result is consistent with those in Sesamum [22] and radish [17]; conversely, in rubber tree [25] and Asteraceae (C. nankingense) [24], AAG/TTC and CCA/GGT were the most abundant, respectively, which could be due to differences among species in the usage frequency of the encoded amino acids.

Among the 69 primer pairs, 64 (92.75%) amplified successfully. The PCR success rate was similar to that in Sesamum [22], lower than that in rubber tree [25], and higher than that reported in a previous study [10]. These results suggest that the quality of the assembled unigenes was high and that the SSRs identified in our study could be used for future analysis.

Supporting Information

S1 Table. Sequences of 69 primer pairs for SSR markers.

https://doi.org/10.1371/journal.pone.0143017.s001

(XLS)

Author Contributions

Conceived and designed the experiments: YL. Performed the experiments: YL PZ MS. Analyzed the data: YL MS. Contributed reagents/materials/analysis tools: JH MQ WW CL. Wrote the paper: YL.

References

1. China Pharmacopoeia Committee. Pharmacopoeia of People’s Republic of China, Vol. 1, Chemical Industry Press, Beijing, 2010, pp. 283–284.
2. Shibata S. A drug over the millennia: pharmacognosy, chemistry, and pharmacology of licorice. Yakuqaku Zasshi. 2000; 120 (10): 849–862.
- View Article
- Google Scholar
3. Matsui S, Matesumoto H, Sonoda Y, Ando K, Aizu-Yokota E, Sato T, et al. Glycyrrhizin and related compounds down-regulate produntion of inflammatory chemokines IL-8 and eotaxin 1 in a human lung fibroblast cell line. Int Immunopharmacol. 2004; 4(13): 1633–1644. pmid:15454116
- View Article
- PubMed/NCBI
- Google Scholar
4. Wang B, Wang YX, Zhao HY, Zong Y, Xu JJ. Research progress of the major components and the pharmacological effect of Glycyrrhiza uralensis fisch. Journal of Jilin Medical College. 2013; 34(3): 215–218.
- View Article
- Google Scholar
5. Li Y, Luo HM, Sun C, Song JY, Wu Q, Wang N, et al. EST analysis reveals putative genes involved in glycyrrhizin biosynthesis. BMC Genomics. 2010; 11: 268 pmid:20423525
- View Article
- PubMed/NCBI
- Google Scholar
6. Ramilowski Jordan A.1, Sawai Satoru, Seki Hikaru, Mochida Keiichi, Yoshida Takuhiro. et al. Glycyrrhiza uralensis transcriptome landscape and study of phytochemicals. PCP. 2013; 54(5): 697–710
- View Article
- Google Scholar
7. Wang XC, Yang ZR, Wang M, Li W, Li SC. High- throughput Sequencing Technology and Its Application. China Biotechnology. 2012; 32(1): 109–114.
- View Article
- Google Scholar
8. Zhang QF, Li J, Fan ZX,Yang LQ,Bu X, Application of High–Throughput Sequencing Technology in Agricultural Research. Shandong Agricultural sciences. 2013; 45(1): 137–140.
- View Article
- Google Scholar
9. Hao DC, Ma P, Mu J, Chen SL, Xiao PG, Peng Y, et al. De novo characterization of the root transcriptome of a traditional Chinese medicinal plant Polygonum cuspidatum. Science China Life Science. 2012; 55(5): 452–466.
- View Article
- Google Scholar
10. Liu ZP, Chen TL, Ma LC, Zhao ZG, Zhao Patrick X., Nan ZB, et al. Global Transcriptome Sequencing using the Illumina Platform and the Development of EST-SSR Markers in Autotetraploid Alfalfa. PLOS One. 2013; 8(12): e83549. pmid:24349529
- View Article
- PubMed/NCBI
- Google Scholar
11. Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011; 29(7): 644–652. pmid:21572440
- View Article
- PubMed/NCBI
- Google Scholar
12. Doyle JJ, Doyle JL. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytotchemical Bulletin. 1987; 19:11–15.
- View Article
- Google Scholar
13. Gao XY, Wang WQ, Wei SL, Li WD. Review of pharmacologica leffects of Glycyrrhiza Radix and its bioactive compounds. China Journal of Chinese Materia Medica. 2009; 34(21): 2695–2700. pmid:20209894
- View Article
- PubMed/NCBI
- Google Scholar
14. Li G, Zhou CM, Jiang XL, He ZP. Research of licorice cultivation and glycyrrhizic acid biosynthesis and regulation. Journal of Chinese Medicinal Materials. 2004; 27(6): 462–465.
- View Article
- Google Scholar
15. Long Y, Zhang JW, Tian XJ, Wu SS, Zhang Q, Zhang JP, et al. De novo assembly of the desert tree Haloxylon ammodendron (C. A. Mey.) based on RNA-Seq data provides insight into droughtresponse, gene discovery and marker identification. BMC Genomics. 2014; 15: 1111. pmid:25511667
- View Article
- PubMed/NCBI
- Google Scholar
16. Shi Y, Yan X, Zhao P, Yin H, Zhao X, Xiao H, et al. Transcriptomic analysis of a tertiary relict plant, extreme xerophyte Reaumuria soongorica to identify genes related to drought adaptation. PLOS One. 2013; 8(5): e63993. pmid:23717523
- View Article
- PubMed/NCBI
- Google Scholar
17. Wang SF, Wang XF, He QW, Liu XX, Xu WL, Li LB, et al. Transcriptome analysis of the roots at early and late seedling stages using Illumina paired-end sequencing and development of EST-SSR markers in radish. Plant Cell Rep. 2012; 31(8):1437–1447. pmid:22476438
- View Article
- PubMed/NCBI
- Google Scholar
18. Chappell J. Bioehemistry and molecular biology of the isoprenoid biosynthetic pathway in plants. Annu. Rev. Plant. Physiol. Plant Mol. Biol. 1995; 46: 521–547.
- View Article
- Google Scholar
19. Sun L, Yu JG, Li DY, Luo XZ, Zhao CJ, Yang SL, et al. Compare of contents Study in glycyrrhizin and liguiritin, Journal of Chinese Medicinal Materials. 2001; 24(08): 550–552.
- View Article
- Google Scholar
20. Liu JR, Zhao WB, Wang HY, Jiang FS, Xiang Yi, Li XY, et al. Output of Cultivated Glycyrrhizia in Different Growth Stages and Analytical Comparison cf Its Active Ingredients. Shanghai Journal of Traditional Chinese Medicine. 2004; 8 (11): 56–58.
- View Article
- Google Scholar
21. Kaur Sukhjiwan, Pembleton Luke W, Cogan Noel Ol, Savin KeithW, Leonforte Tony, Paull Jeffrey, et al. Transcriptome sequencing of field pea and faba bean for discovery and validation Of SSR genetic marker. BMC Genomics. 2012; 13: 104. pmid:22433453
- View Article
- PubMed/NCBI
- Google Scholar
22. Wei WL, Qi XQ, Wang LiH, Zhang YX, Hua W, Li DH, et al. Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers. BMC Genomics. 2011; 12: 451. pmid:21929789
- View Article
- PubMed/NCBI
- Google Scholar
23. Yan YB, Huang YL, Fang XT, Lu L, Zhou RC, Ge XJ, et al. Development and characterization of EST-SSR markers in the invasive weed Mikania micrantha (Asteraceae). American Journal of Botany. 2011; e1–e3. pmid:21613074
- View Article
- PubMed/NCBI
- Google Scholar
24. Wang HB, Jiang JF, Chen SM, Qi XY, Peng H, Li PR, et al. Next-Generation Sequencing of the Chrysanthemum nankingense (Asteraceae) Transcriptome Permits Large-Scale unigene Assembly and SSR Marker Discovery. PLOS One. 2013; 8(4): e62293. pmid:23626799
- View Article
- PubMed/NCBI
- Google Scholar
25. Li DJ, Deng Z, Qin B, Liu XH, Men ZH. De novo assembly and characterizationg of bark transcriptome using Illumina sequencing and development of EST-SSR marker in rubber tree (Hevea brasiliensis Muell. Arg.) BMC Genomics. 2012; 13:192. pmid:22607098
- View Article
- PubMed/NCBI
- Google Scholar
26. Gao ZX, Luo W, Liu H, Zeng C, Liu XL, Yi SK, et al. Transcriptome Analysis and SSR/SNP Markers Information of the Blunt Snout Bream (Megalobrama amblycephala). PLOS One. 2012; 7(8): e42637. pmid:22880060
27. Martienssen RA, Colot V. DNA methylation and epigenetic inheritance in plants and filamentous fungi. Science. 2001; 293: 1070–1074. pmid:11498574
