Adventitious rooting is the most important mechanism underlying vegetative propagation and an important strategy for plant propagation under environmental stress. The present study was conducted to obtain transcriptomic data and examine gene expression using RNA-Seq and bioinformatics analysis, thereby providing a foundation for understanding the molecular mechanisms controlling adventitious rooting. Three cDNA libraries constructed from mRNA samples from mung bean hypocotyls during adventitious rooting were sequenced. These three samples generated a total of 73 million, 60 million, and 59 million 100-bp reads, respectively. These reads were assembled into 78,697 unigenes with an average length of 832 bp, totaling 65 Mb. The unigenes were aligned against six public protein databases, and 29,029 unigenes (36.77%) were annotated using BLASTx. Among them, 28,225 (35.75%) and 28,119 (35.62%) unigenes had homologs in the TrEMBL and NCBI non-redundant (Nr) databases, respectively. Of these unigenes, 21,140 were assigned to gene ontology classes, and a total of 11,990 unigenes were classified into 25 KOG functional categories. A total of 7,357 unigenes were annotated to 4,524 KOs, and 4,651 unigenes were mapped onto 342 KEGG pathways using BLAST comparison against the KEGG database. A total of 11,717 unigenes were differentially expressed (fold change>2) during the root induction stage, with 8,772 unigenes down-regulated and 2,945 unigenes up-regulated. A total of 12,737 unigenes were differentially expressed during the root initiation stage, with 9,303 unigenes down-regulated and 3,434 unigenes up-regulated. A total of 5,334 unigenes were differentially expressed between the root induction and initiation stage, with 2,167 unigenes down-regulated and 3,167 unigenes up-regulated. qRT-PCR validation of the 39 genes with known functions indicated a strong correlation (92.3%) with the RNA-Seq data. The GO enrichment, pathway mapping, and gene expression profiles reveal molecular traits for root induction and initiation. This study provides a platform for functional genomic research with this species.
Citation: Li S-W, Shi R-F, Leng Y (2015) De Novo Characterization of the Mung Bean Transcriptome and Transcriptomic Analysis of Adventitious Rooting in Seedlings Using RNA-Seq. PLoS ONE 10(7): e0132969. https://doi.org/10.1371/journal.pone.0132969
Editor: Turgay Unver, Cankiri Karatekin University, TURKEY
Received: March 29, 2015; Accepted: June 19, 2015; Published: July 15, 2015
Copyright: © 2015 Li et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Data Availability: All relevant data are within the paper and its Supporting Information files. All sequences files are available from DDBJ/EMBL/GenBank under the accession number GBXO01000001-GBXO01078617.
Funding: This work was supported by the National Natural Science Foundation of China (31260090).
Competing interests: The authors have declared that no competing interests exist.
Adventitious roots refer to roots that form from any tissue that is not a root, such as leaves and stems. Adventitious rooting is one of the most important mechanisms of vegetative propagation in plants and one of the most important methods for the commercial production of horticultural species throughout the world . As an alternative or supplement to seed propagation in ecosystems where soil disturbances occur frequently, adventitious rooting is an important plant response to environmental stresses and a strategy for plant propagation under stress . The formation of adventitious roots has been associated with an important aspect of tissue dedifferentiation that involves shifting cells from normal morphogenetic pathways to functions associated with the development of root primordia . This shift leads to de novo root formation and a multitude of metabolic changes involving the enzymes and macromolecules associated with the induction, initiation, and development of root primordia in plant cuttings . Although the physiological and biochemical changes that occur during adventitious root formation have been extensively studied, the molecular mechanisms involved remain less well understood.
Various molecular and genetic approaches have been used to study adventitious root development in Arabidopsis and other plants . The physiological and biochemical changes that occur during the complex process of in vitro root development must be attributed to the presence and activity of metabolic pathways. In turn, these metabolic pathways must be controlled by the regulation of RNA transcription. Identifying the RNA transcription profile during this process will thus improve our understanding of the fundamental processes that control adventitious rooting. To this end, several studies have sought to investigate the transcriptional changes and differences in gene expression that occur during adventitious root formation using proteomic and cDNA microarrays [6–10]. Using the latter method, Brinker et al. (2004) identified 220 genes that were changed significantly during root development in hypocotyl cuttings of Pinus contorta . Proteomic analyses were also used to investigate the proteins involved in the adventitious rooting of Arabidopsis thaliana mutants by Sorin et al. (2006), who identified 11 proteins predicted to be involved in different biological processes, including the regulation of auxin homeostasis and light-associated metabolic pathways . Using the Medicago GeneChip, Holmes et al. (2010) identified 904 and 993 up- and down-regulated probe sets in root-forming cultures of Medicago truncatula as well as significant changes in metabolism, signaling and the expression of transcription factors linked to in vitro adventitious root formation processes . Recently, using a NimbleGen microarray, Rigal et al. (2012) identified 7,107 transcript levels that changed during early stages of adventitious root development in the model tree Populus trichocarpa .
A major limitation of the microarray method is that only a portion of the total transcripts can be assayed. Many genes are not represented on the microarrays, while genes from large and highly similar families may yield ambiguous expression results due to non-specific hybridization . Recently, a high-throughput deep-sequencing technology (i.e., next-generation sequencing, NGS), RNA-Seq, has been widely used to explore transcriptomic data and study gene expression at the whole genome level in model and non-model organisms [12, 13]. Emerging de novo short read assembly technology has been successfully applied to identify gene expression profiles and discover new genes without a reference genome sequence [13, 14]. This technology platform enables the precise elucidation of transcripts present within a particular sample and can be used to calculate gene expression based on absolute transcript abundance .
The process of adventitious rooting consists of three successive but interdependent physiological stages, namely, induction, initiation and expression. The induction stage comprises molecular and biochemical events without visible changes. The initiation stage is characterized by cell division and organization of the root primordia [1, 16]. Studies in herbaceous plants reveal that the critical events that culminate in the formation of adventitious roots in hypocotyl cuttings occur within the first 3–12 h after excision of the primary roots [3, 17]. In mung bean hypocotyl cuttings, the induction stage lasts from 0 h to 12 h after primary root excision, and the initiation stage lasts from 12 h to 48 h. The first emerging adventitious root primordia were clearly visible at 48 h and adventitious roots grown through the epidermis of the hypocotyls within 72 h of the start of the cutting cultures [16, 18]. In cuttings of woody plants such as in Malus and Populus, cell divisions as early as 48 h after auxin exposure [17, 19]. Early significant physiological and biochemical changes in endogenous hormone pools occur during the first 48 h after excision . Transcriptome monitoring in Populus trichocarpa cuttings revealed significant shifts during 0–48 h time period after excision. 27% of the genes were differentially regulated between 0 and 6 h, 36% between 6 and 24 h, and 4% between 24 and 48 h [2,19]. The critical dedifferentiation events during the process of adventitious rooting occur within these two stages.
Herein, we exploited RNA-Seq technology to characterize the mung bean transcriptome and further to highlight global changes in gene expression during early stage of root development (i.e., induction and initiation stages) in mung bean hypocotyl cuttings. Mung bean is one of the most important tropical grain legumes that serves as a significant and a cheap source of carbohydrates and easily digestible protein for the people of Asia and Africa, but increasingly extends into Australia, USA, Canada and Ethiopia . However, the genomic and transcriptomic data of this plant have not been revealed so far. Furthermore, this plant has been widely used as a model plant species for studying physiological, biochemical, and molecular mechanisms under the process of adventitious root formation [16, 18, 22–25]. In present study, we aimed to characterize the molecular basis of physiological processes that occur during early stage of root development and to identify differentially expressed genes (DEGs) and metabolic pathways. Real-time quantitative PCR was used to validate several of the transcriptional changes observed.
Materials and Methods
Plant material and culture conditions
Mung bean [Vigna radiata (L.) R. Wilczek] seeds were washed in distilled water and sterilized in a 6% sodium hypochlorite solution for 15 min. The seeds were subsequently washed three times in sterile distilled water, sown in Petri dishes (30 seeds per a 12 cm- Petri dishes), covered with a 5 mm-layer of sterilized perlite, and incubated in a growth chamber at 25±1°C for 36 h in the dark and then at 25±1°C with a 14-h photoperiod under white fluorescent lamps (PAR of 100 μM m−2 s−1). Five days after germination, seedlings that were 5 cm in height were used for the experiments. To investigate gene expression changes during adventitious rooting, the primary roots of the seedlings were removed from the bases of the hypocotyls, and the resulting explants (10 per beaker) were cultured in 50-mL beakers containing 40 mL sterilized distilled water for 6 h (Wat6) or 24 h (Wat24) under the same aseptic conditions applied to the seedling culture. The basal 0.5 cm of each hypocotyl, where adventitious roots originated in vitro, was cut and harvested after a 6- or 24-h incubation. The same parts of seedling hypocotyls were directly harvested and used as the control tissues (Con) (Fig 1). The three parallel treatments were set in each group. All of the harvested tissues were immediately frozen in liquid nitrogen and stored at -80°C until further analysis.
Adventitious root primordia are visible at 48 h after the primary roots excision and adventitious roots grow through the epidermis of the hypocotyls within 96 h. The basal 0.5 cm of hypocotyls at 0 h (Con), 6 h (Wat6), and 24 h (Wat24) after the primary roots excision and incubation in water were harvested and used as study samples.
Total RNA extraction
The tissues from 10 hypocotyls were fully ground in liquid nitrogen, and approximately 50 mg of tissue powder was mixed with 600 μL buffer Rlysis-P (from kit SK8631, Sangon, Shanghai, China) in a 1.5 mL RNase-free tube for 5 min in a water bath at 65°C to ensure sufficient lysis. Next, 60 μL buffer PCA (from kit SK8631, Sangon) was added and mixed thoroughly, and the mixture was incubated at -20°C for 3 min. After centrifugation at 10,000 g for 5 min at 4°C, an equal volume of cooled phenol chloroform (phenol water) was added to the supernatant, mixed, and then centrifuged at 12,000 g for 5 min at 4°C. An equal volume of cooled chloroform was added to the supernatant and mixed. Following centrifugation at 12,000 g for 5 min at 4°C, an equal volume of cooled isopropanol was added to the supernatant, shaken gently, and left to precipitate for 10 min. After centrifugation at 12,000 g for 20 min at 4°C, the pellet was recovered, washed twice with 75% ethanol, dried for 5–15 min at ambient temperature, dissolved in 50 μL RNase-free water, and stored at -80°C. A 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA) was used to confirm RNA integrity with RNA Integrity Number (RIN) values of 8.1–9.9. RNA concentration was determined using a NanoDrop ND-1000 Spectrophotometer (NanoDrop, Wilmington, DE, USA).
cDNA library construction and transcriptome sequencing
Equal amounts of total RNA from each sample were pooled to construct the cDNA library. Oligo(dT) 25 beads (Invitrogen) were used to enrich for poly(A) mRNAs from the total RNA pool. Following purification, the mRNA was cleaved into fragments using Fragment Mix reactive system at 94°C for 4 min. First-strand cDNA was synthesized using Superscript II reverse transcriptase (18064–014, Invitrogen), First Strand Master Mix, random hexamer (N6) primers, and the fragmented mRNA templates. The reaction was performed at 25°C for 10 min, 42°C for 50 min, 70°C for 15 min, and then held at 4°C. Subsequently, the second strand cDNA was synthesized using Second Strand Master Mix (18064–014, Invitrogen). The synthesized dscDNA fragments were purified with Agencourt AMPure XP Beads (Agencourt). The End Repair Control and AMPure XP beads were used to repair the 3' ends and purify the repaired cDNA fragments. Subsequently, adenylation of the 3' ends of the cDNA fragments was conducted using Klenow exo (M0212L, NEB). After end repair and A-tailing, Illumina paired-end adapters were ligated to the cDNA fragments using T4 Ligase (Fermentas) and purified twice with AMPure XP Beads. To prepare the cDNA sequencing library, the ligated cDNA was enriched and amplified using selective PCR. The PCR procedure was performed as follows: 98°C for 30 s; 15 cycles of 98°C for 10 s, 60°C for 30 s, 72°C for 30 s, and 72°C for 5 min; holding at 4°C, followed by purification with AMPure XP beads. The quality and quantity of the cDNA library were measured using the Agilent 2100 Bioanalyzer and Qubit 2.0 (Life Technologies). Finally, paired-end sequencing of the constructed cDNA library was carried out at Sangon Biotech. Co. Ltd. (Shanghai, China) on an Illumina HiSeq 2000 system (Illumina).
De novo assembly and sequence clustering
The raw reads were filtered, and high-quality clean read data were obtained by deleting adaptor sequences, removing reads containing more than 5% ambiguous bases (undetermined bases, N) and low-quality reads (reads containing more than 10% bases with a Q-value ≤20). The de novo assembly of the clean reads was carried out using the TRINITY paired-end assembly method (Trinity RNA-Seq r2013-02-25, http://trinityrnaseq.sourceforge.net/)  with an optimized k-mer length of 25. The assembled sequences were clustered with Chrysalis, a module of Trinity. The longest transcript that could not be extended on either end within each clustered loci was defined as a unigene. The assembled unigenes (longer than 200 bp) have been deposited in the Transcriptome Shotgun Assembly Sequence Database (http://www.ncbi.nih.gov/genbank/tsa.html) at DDBJ/EMBL/GenBank under the accession number GBXO01000001-GBXO01078617.
Similarity searches were performed using locally installed BLAST+ v2.2.27 software . The transcripts and unigenes were subjected to similarity searches against protein and nucleotide sequence databases using BLASTx and MEGABLAST, respectively, at an e-value cut-off of e-5. BLAST annotations were filtered using either subject or query coverage (>30%) and sequence identity (>50% for megablast and >30% for blastx).
Mapping reads, calling variations and quantifying transcripts
Due to the lack of a reference sequence, the assembled transcripts were assumed to be the reference sequence to compute transcript expression levels [26, 28, 29]. The expression values were used to create an expression profile with the help of Agilent's GeneSpring program. The read sequences were aligned against these transcript reference sequences using BWA-0.6.2-http://bio-bwa.sourceforge.net/  in the end-to-end alignment mode.
Functional annotation and classification
All resulting unigenes that exceeded 200 bp in length were annotated according to their sequence similarity to previously annotated genes. First, the unigenes were aligned using BLASTx to the public protein databases NR, SWISS-PROT, TrEMBL, Pfam, and CDD with similarity set at >30% and an E-value ≤1e-5. The KOG (Clusters of Orthologous Groups for eukaryotic complete genomes) and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway annotations were performed by sequence comparisons against the two databases using BLASTALL and KAAS software (ftp://ftp.ncbi.nih.gov/blast/executables/release/2.2.18/) with an E-value ≤1e-5. The resulting blast hits were processed using Blast2GO software (version 2.3.5, http://www.blast2go.de/)  with an E-value threshold of 1e-5 to retrieve associated GO terms. GO classification was achieved using WEGO software . The results that presented the best alignment were used to identify the sequence direction and to predict the coding regions using BLASTx searches against protein databases, with the priority order of NR, SWISS-PROT, KEGG and KOG if conflicting results were obtained. The ESTScan software  was used to analyze the unigenes that did not align to any of the above databases. KEGG mapping was used to determine the metabolic pathways. Enzyme codes were extracted, and KEGG [34–36] pathways were retrieved from the KEGG web server (http://www.genome.jp/kegg/). To further enrich the pathway annotations, unigenes were submitted to the KEGG Automatic Annotation Server (KAAS) , and the single-directional best hit information method was selected. To identify the enriched pathways, the phyper test was used to measure the relative coverage of the annotated KEGG orthologous groups of a pathway against the transcriptome background, and the pathways with a p-value ≤0.05 were classified as enriched.
Expression analysis and identification of DEGs
The expression levels of unigenes were measured by mapping back the number of clean reads to the assembled unigenes using BWA-0.6.2-http://bio-bwa.sourceforge.net/ . The number of clean reads mapped to each unigene was calculated and then normalized to RPKM (reads per Kb per million reads) using ERANGE3.1 software . Unigene expression levels were analyzed using the DEGseq R package  with the MARS (MA-plot-based method with Random Sampling) model. The DEGs between each pair of samples were screened using the Audic-Claverie algorithm  with an FDR threshold of ≤0.001 and an absolute value of log2 ≥1. Multiple test corrections of the p-value and FDR were performed with the Benjamini-Hochberg correction .
Real-time quantitative reverse transcription PCR validation
To validate the transcriptome data, 39 genes with known functions that were assumed to play roles in adventitious root initiation were selected for further analysis. Hypocotyl tissues were harvested from three biological replicates subjected to the same experimental design as that of the samples subjected to Illumina sequencing for RNA-Seq. Total RNA was extracted with TRIzol reagent (Invitrogen, Carlsbad, CA, USA) and purified on RNeasy mini spin columns (Qiagen) with on-column DNase I treatment according to the manufacturer’s protocol. RNA integrity was examined with an Agilent Bioanalyzer 2100 (Agilent Technologies). First strand cDNA was synthesized using the AMV First Strand cDNA Synthesis Kit (Roche Applied Science, Mannheim, Germany) according to the manufacturer’s instructions. The gene-specific primer pairs (S8 Table) were designed using Primer Premier 5.0 software (Applied Biosystems, Foster City, CA, USA) according to the confirmed sequences. Real-time PCR was run in a LightCycler480 II (Roche Applied Science) with ABI SYBR Green PCR Master Mix (ABI, Foster, USA). The thermal cycling program was 95°C for 3 min and 40 cycles of 95°C for 15 s and 60°C for 40 s. Melting curve analysis was carried out for each primer set to verify the presence of a single melting peak after amplification. ‘No cDNA’ samples (water) and ‘no RT’ samples were included as negative controls. Output data were generated with Sequence Detector version 1.3.1 software (ABI) and evaluated using Student’s t-tests with the delta-delta Ct method described by Livak and Schmittgen . The standard error of the mean was calculated for the three biological replicates. Expression levels were calculated relative to the reference gene using the comparative threshold cycle method.
Solexa RNA paired-end sequencing
Total RNA extraction and cDNA synthesis were performed from three samples: hypocotyls (Con), hypocotyls after primary root excision and incubation in water for 6 h (Wat6, root induction stage), and hypocotyls after primary root removal and incubation in water for 24 h (Wat24, root initiation stage). The three cDNA libraries were sequenced separately using the Illumina HiSeq 2000 system and respectively generated 7.361e+09 bp, 5.998e+09 bp, and 5.885e+09 bp raw reads. Raw reads were subjected to quality control using SeqQC. The ratio of >Q20 bases was more than 87% across the three libraries. The percentages of undetermined bases (Ns) were 0.144%, 0.137%, and 0.224% in the three libraries, respectively (Table 1). After deleting adapter sequences and discarding low-quality sequences from the raw data, 6.832 Gbp (92.81% of the total reads), 5.558 Gbp (92.66% of the total reads), and 5.557 Gbp (94.42% of the total reads) of high-quality reads were obtained for the three libraries, respectively. The average length of the clean reads exceeded 95 bp, and the ratio of retained reads was more than 95% by pre-processing (Table 1). To assess the contamination of the processed reads, random sets of one hundred thousand sequences were aligned against the Nr database. The results are presented in S1 Table. This assay indicated that the sequencing quality was high enough for further analysis. These processed paired-end reads were used for transcript assembly.
De novo assembly
The paired-end de novo assembly of the processed reads was performed using the TRINITY transcriptome assembly software program. After filtering out repetitive sequences and those shorter than 200 bases in length, a total of 133,287 transcripts (166 Mb) with a sequence length > 200 bp were generated. The total length of the transcripts was 1.66e+08 bases, and the mean length of the transcripts was approximately 1248 bases (Table 2). The average GC content of the transcripts was 37.84%, indicating that the transcripts were AT-rich at 62.16% (Table 3; S1 Fig). The N50 was 2132 in this assembly, which was higher than most other plant transcriptome assemblies [12, 26, 28, 42, 43]. The higher the N50 value, the better the assembly . Further clustering using the Chrysalis cluster module of TRINITY resulted in 78,697 unigenes (65 Mb), which represented the longest transcripts in sequence length within each loci. Approximately 47% (37,438) of the unigenes had a length that exceeded 500 bp (Table 2; S2 Fig). It has been demonstrated that longer transcripts are easier and more likely to be mapped to correct transcript sequences . The lengths of the assembled transcripts and unigenes are shown in S2 Fig. The ratios of mapped reads were 93.55%, 94.08%, and 94.04%, and the expression ratios of unigenes were 91.92% (72,342), 84.71% (66,663), and 82.19% (64,680) in the Con, Wat6, and Wat24 samples, respectively, demonstrating a decreasing trend in gene expression during root development (Table 3).
Functional annotation of the unigenes
As a non-model plant, the mung bean unigenes obtained in this RNA-Seq analysis were aligned against the six public protein databases, Nr (NCBI non-redundant (nr) database), the SWISS-PROT protein database, TrEMBL, Pfam, KOG (Clusters of Orthologous Groups of proteins in eukaryotes), and CDD with the criteria of similarity >30% and E-value ≤1e-5. Approximately 36.77% of the unigenes (29,029) were annotated using BLASTx. Among them, 28,084 (35.69%), 27,934 (35.50%), 19434 (24.62%), 16704 (21.16%), 12738 (16.14%), and 11990 (15.19%) unigenes could be annotated using the TrEMBL, Nr, SWISS-PROT, CDD, Pfam, and KOG databases, respectively. A four-way Venn diagram was constructed to depict the shared sets of transcripts annotated by the four databases (S3 Fig).
The blast statistics showed that 88.11% of the unigenes exhibited strong homology (E-value < 10–20), and 68.71% exhibited very strong homology (E-value < 10–50) to available plant sequences in the TrEMBL database, most of which belonged to Glycine max. The percentage of unigenes with both a bitscore >1000 and an E-value = 0 account for 32.25% (Table 4, S2 Table). The 10 top-hit species based on Nr annotation indicated that 81% of the unigenes can be annotated with sequences from Glycine max, while nearly 96% of the unigenes can be annotated with sequences from 5 top-hit species, including Glycine max, Cicer arietinum, Medicago truncatula, Vitis vinifera, and Phaseolus vulgaris (S4 Fig). Gene ontology (GO) category analysis assigned 61,357, 65,653, and 28,948 unigenes to the GO terms cellular component, biological process, and molecular function, respectively (Fig 2). The top-3 GO subcategories under cellular component are cell (14,114 unigenes, 17.9%), organelle (10,186 unigenes, 12.9%), and cell part (14,114 unigenes, 17.8%). The top-2 GO subcategories under biological process are metabolic process (11,841 unigenes, 15.0%) and cellular process (12,789 unigenes, 16.2%). The top-2 GO subcategories under molecular function are binding (12,800 unigenes, 16.2%) and catalytic activity (11,023, 14.0%). A total of 11,990 unigenes were classed into 25 KOG functional categories, with the top 3 subcategories identified as signal transduction mechanisms (1,704 unigenes, 14.2%), general function prediction only (1,530 unigenes, 12.8%), and posttranslational modification, protein turnover, chaperones (1,309 unigenes, 10.9%) (Fig 3). A total of 7,357 unigenes were annotated to 4,524 KOs (KEGG Orthology), and 4,651 unigenes were mapped into 342 KEGG pathways (Fig 4). The top 10 pathways are presented in Table 5.
Unigenes with BLASTx matches against the plant Nr database were classified into three main GO categories (biological process, cellular component, molecular function) and 57 sub-categories. The left-hand scale on the y-axis shows the percentage of the unigenes in each of the categories. The right-hand scale on the y-axis indicates the number of the unigenes in the same category.
Unigenes were assigned to one or more of the 25 COG classification categories.
GO enrichment analysis
GO enrichment analysis is a proven method to identify primary biological functions. The functional enrichment of DEGs indicated that, at FDR≤0.05, 258 GOs were enriched in the Wat6 versus Con (Wat6:Con), 183 GOs were enriched in the Wat24 versus Con (Wat24:Con), and 222 GOs were enriched in Wat24 versus Wat6 (Wat24:Wat6). The functional enrichment of the DEGs revealed different GOs in the three samples. For example, the functions of oxidoreductase activity, response to oxidative stress, DNA binding transcription factor activity, and photosynthesis were enriched in Wat6, while the functions of ribosome and translation were enriched in Wat24 (S3 Table). These results suggest that profound cellular and metabolic reorganization occurs during the root induction stage.
GO enrichment further demonstrated that total of 897 terms were significantly regulated with 595 up-regulated and 302 down-regulated in Wat6:Con, whereas total of 487 terms were significantly regulated with 232 up-regulated and 255 down-regulated in Wat24:Con, and total of 484 terms were significantly regulated with 128 up-regulated and 356 terms down-regulated from Wat6 to Wat24 (Table 6). The up-regulation and down-regulation of GO categories are presented in Fig 5. In the group of down-regulated terms, the proportions of unigenes in each GO category exhibited a trend of Wat6 > Wat24 > Wat24:Wat6, with the exception of the subcategory of nutrient reservoir activity in molecular function, suggesting that the more significant down-regulation of DEGs occurs during the root induction stage. In the group of up-regulated terms, the major GO categories exhibited a trend of Wat24 > Wat6 > Wat24:Wat6, with the exceptions of rhythmic process in biological process and extracellular matrix in cellular component, suggesting that the more significant up-regulation of DEGs occurs during the root induction stage. Comparing between the down-regulated and up-regulated groups, we found that the significant down-regulated categories appeared under molecular function, with the top subcategories of protein binding transcription factor activity, nucleic acid binding transcription factor activity, and molecular transducer activity. The significant up-regulated categories appeared under both cellular component and molecular function, with the top subcategories of antioxidant activity, structural molecule activity, and nutrient reservoir activity in molecular function and extracellular matrix, extracellular region part, cell junction, and macromolecular complex in the cellular component category. The top 10 significant up- and down-regulated GO categories are listed in Table 7. Among the top-10 up-regulated GO groups, GO:0003735, GO:0005840, GO:0005198, GO:0022626, GO:0044445, GO:0006412, GO:0044391, and GO:0030529 were all up-regulated in Wat6, Wat24, and Wat24:Wat6. Moreover, the top-10 up-regulated GO categories were identical in Wat24 and Wat24:Wat6. However, in the down-regulated groups, only GO:0001071 and GO:0003700 were both down-regulated in Wat6 and Wat24; the others were all associated with different GO categories across the three samples (Table 7, S4 Table). These results indicate that nearly the same groups of GO categories were significantly up-regulated at the root induction and initiation stages, including ribosome, structural constituent of ribosome, structural molecule activity, translation, ribonucleoprotein complex, ribosomal subunit, cytosolic ribosome, non-membrane-bounded organelle, intracellular non-membrane-bounded organelle, and cytosolic part. Clearly, these GO categories are associated with protein synthesis. However, several distinct GO categories were significantly down-regulated at the root induction and initiation stages, including nucleic acid binding transcription factor activity, sequence-specific DNA binding transcription factor activity, RNA biosynthetic process, DNA integration, nucleic acid binding transcription factor activity, sequence-specific DNA binding transcription factor activity, and nucleic acid metabolic process. These GO categories are associated with RNA transcription. Interestingly, GO:0016491, oxidoreductase activity, was significantly up-regulated in Wat6 but significantly down-regulated in Wat24:Wat6, suggesting an increase in cellular oxidoreductase activity during the root induction stage that became a decrease during the root initiation stage. Compared with Wat6, the significant down-regulated GO categories include response to chemical stimulus, oxidoreductase activity, response to endogenous stimulus, response to auxin stimulus, response to stimulus, response to hormone stimulus, and response to organic substance. Clearly, these GO categories involve responses to stimulus and hormone signaling.
KEGG enrichment analysis
Pathway enrichment analysis revealed that 9, 11, and 9 pathways were the significant difference pathways enriched in Wat6, Wat24, and Wat24:Wat6, respectively. Further analysis indicated that 5, 5, and 3 pathways were significantly (RDF ≤0.05) down-regulated and 14, 6, and 6 pathways were significantly up-regulated in Wat6, Wat24, and Wat24:Wat6, respectively (Table 6). These results indicate that more KOs were up-regulated than down-regulated, especially in Wat6, suggesting that the key up-regulation of KOs occurred during the root induction stage. KEGG enrichment analysis further indicated that ko03010 (ribosome), ko0094 (phenylpropanoid biosynthesis), ko00360 (phenylalanine metabolism), and ko00909 (sesquiterpenoid and triterpenoid biosynthesis) were all up-regulated in Wat6, Wat24, and Wat24:Wat6. The significant down-regulated KOs during Wat6 were photosynthesis, carbon fixation in photosynthetic organisms, carotenoid biosynthesis, nitrogen metabolism, sphingolipid metabolism, glycerolipid metabolism, and porphyrin and chlorophyll metabolism. The significant down-regulated KOs during Wat24 were cutin, diterpenoid biosynthesis, cytokine-cytokine receptor interaction, and circadian rhythm—plant, and those in Wat24:Wat6 were oxidative phosphorylation, nitrogen metabolism, plant hormone signal transduction, diterpenoid biosynthesis, photosynthesis, and cysteine and methionine metabolism. Among them, ko00195 (photosynthesis) and ko00910 (nitrogen metabolism) were down-regulated in both Wat6 and Wat24:Wat6, suggesting that photosynthesis and nitrogen metabolism were continuously down-regulated from the root induction stage to the root initiation stage (Tables 8 and 9, S5 Table). The principal aspects of the KEGG enrichment results were consistent with the GO enrichment results.
Gene expression profiling during adventitious rooting
Gene expression levels can be estimated from Illumina sequencing based on the number of clean reads for a gene. The RPKM method  was used to calculate the expression abundances of unigenes during adventitious rooting. The results indicated that the unigenes numbered with RPKM = 100–500, RPKM = 500–1000, and RPKM≥1000 exhibited a clearly increasing trend from Con to Wat24, suggesting that the expression abundances of certain genes greatly increased during root development (Table 3). A total of 11,717 unigenes showed differential expression (log2 ≥1) in Wat6, with 8,772 unigenes down-regulated and 2,945 unigenes up-regulated. A total of 12,737 unigenes showed differential expression during Wat24, with 9,303 unigenes down-regulated and 3,434 unigenes up-regulated. Compared with Wat6, a total of 5,334 unigenes showed differential expression in the Wat24 sample, with 2,167 unigenes down-regulated and 3,167 unigenes up-regulated. These results indicate that 74.9% and 73.04% of the DEGs were down-regulated at the root induction and initiation stages, respectively, while 59.4% of the DEGs were up-regulated from the root induction stage to the initiation stage (Table 10). Further analysis revealed that 283 unigenes were specifically up-regulated DEGs and 546 unigenes were specifically down-regulated DEGs in Wat6; 619 and 753 unigenes were specifically up- and down-regulated DEGs in Wat24; and 424 and 163 unigenes were specifically up- and down-regulated DEGs from Wat6 to Wat24. Most of the specifically expressed DEGs were low-abundance genes (read number ≤100). For example, among the specifically expressed DEGs with a read number ≥100, 34 were up-regulated and 11 were down-regulated in Wat6, 69 were up-regulated and 11 were down-regulated in Wat24, and 29 were up-regulated and 0 were down-regulated from Wat6 to Wat24. Moreover, among the specifically expressed DEGs with both a read number ≥100 and log2 ≥4, 209 unigenes were up-regulated and 96 were down-regulated in Wat6, 238 were up-regulated and 59 were down-regulated in Wat24, and 100 were up-regulated and 34 were down-regulated from Wat6 to Wat24 (Table 10). These results indicate that many more specific DEGs were significantly up-regulated than down-regulated during adventitious root induction and initiation.
Specifically up- and down-regulated unigenes during adventitious root induction
To evaluate the changes in DEGs during adventitious root induction and initiation, we selected the top 50 DEGs with both a read number >1000 and log2 >5 (fold change >32) (S6 Table). After filtering out the unigenes termed hypothetical protein, uncharacterized protein, and unknown in the database, the remaining DEGs are listed in Tables 11, 12 and 13. Among the top-25 genes with more than 32-fold up-regulation in the Wat6 sample, the most abundantly expressed genes (read number >1000) include five cationic peroxidase genes (Vr39448, Vr31128, Vr22610, Vr39339, and Vr39180), two pathogenesis-related protein genes (Vr39039 and Vr36526), two anthocyanin metabolism-associated genes (Vr36323 and Vr36176), and two isoflavone metabolism-associated genes (Vr38993 and Vr35207). The other important genes include basic chitinase class 3 (Vr40472) and trypsin protease inhibitor precursor (Vr35851). It is worth noting that an auxin-related gene, auxin efflux carrier (Vr21159), was significantly up-regulated. However, only six genes with more than 32-fold down-regulation appeared in the top DEGs list, including three MYB transcription factor genes (Vr40489, Vr39799, and Vr13836), polyprotein precursor gene (Vr38043), S-type anion channel SLAH3-like gene (Vr24590), and auxin-induced protein 5NG4-like gene (Vr55469) (Table 12). The other genes with more than 16-fold down-regulation include heat shock 70 kDa protein-like (Vr40796 and Vr42894), ABC transporter G family member 22-like (Vr50534), serine glyoxylate aminotransferase 2 (Vr41217), probable E3 ubiquitin-protein ligase HERC1-like (Vr15096), putative organic cation transport protein (Vr56588), and histidine kinase 1-like isoform X2 (Vr33063) (S6 and S7 Tables).
Specifically up- and down-regulated unigenes during adventitious root initiation
There were 33 highly abundant (read number >1000) genes with more than 32-fold (log2 > 5) up-regulation in the Wat24 sample. Similar to the Wat6 sample, six cationic peroxidase 1-like genes, two pathogen-related protein genes, a polygalacturonase gene, a polygalacturonase PG1 precursor gene, a basic chitinase class 3 gene, and a trypsin protease inhibitor precursor gene were all significantly up-regulated in the Wat24 sample. However, many other genes were exclusively up-regulated in Wat24, such as patatin group A-3-like (Vr43029 and Vr58791), ethylene-responsive transcription factor ERF086-like (Vr42199), 7-ethoxycoumarin O-deethylase-like (Vr38890), potassium transporter 5-like (Vr18948), peroxidase C3-like isoform 2 (Vr40216 and Vr41032), casparian strip membrane protein 2 (Vr36698), and proline-rich protein (Vr34521). Only two genes, polyprotein precursor (Vr38043) and auxin-induced protein 5NG4-like (Vr48206), which were also observed in Wat6, were down-regulated more than 32-fold (Table 12; S5 and S6 Tables).
We further analyzed the DEGs between the Wat6 and Wat24 samples. Seven genes with a read number >1000 were up-regulated by more than 32-fold from Wat6 to Wat24, including casparian strip membrane protein 2 (Vr36698), vignain-like (Vr44673), early nodulin-like protein 1-like (Vr59584), low-temperature-induced 65 kDa protein-like (Vr34411), LEA-18 (Vr35442), and ef1a (Vr35419) (Table 13). In addition, the genes auxin-binding protein ABP19a-like (Vr51177) and casparian strip membrane protein 3 (Vr68124) were specifically expressed in Wat24 compared with Wat6. A number of the important genes that exhibited highly abundant expression and more than 16-fold up-regulation include two heat shock 70 kDa protein-like genes (Vr42894 and Vr40796), two MYB transcription factor MYB114 genes (Vr40489 and Vr39799), two patatin group A-3-like genes (Vr43029 and Vr42547), a metacaspase-9-like gene (Vr39095), and a probable E3 ubiquitin-protein ligase HERC1-like gene (Vr15096) (S6 Table). Compared with the Wat6 sample, only two genes with reads >1000 were down-regulated by more than 32-fold, including a thiazole biosynthetic enzyme gene, chloroplastic-like gene (Vr41972), and circadian clock-associated FKF1 gene (Vr49846) (Table 13). Four genes with reads >1000 were down-regulated more than 16-fold: formate dehydrogenase (Vr13406), GIR1 (Vr38378), beta-glucosidase 47-like (Vr41355), and GDSL esterase/lipase (Vr45510) (S6 and S7 Tables).
Validation of gene expression
To validate the differential expression data obtained through statistical comparisons of RPKM values, a total of 39 interesting DEGs of four types: 17 auxin signaling-related genes, 14 stress response-related genes, 3 LATERAL ORGAN BOUNDARY (LBD)-DOMAIN genes, and 3 internal reference genes were selected for validation of the transcriptomic data using real-time quantitative PCR (qRT-PCR). Detailed information on these genes is presented in S8 Table. According to the RNA-Seq results and the study published by Jian et al. , we selected three genes: CPY20, eIF5A, and ACTIN (Actin-related protein 4), as internal reference genes for qRT-PCR. The qRT-PCR results showed that CPY20 was the most stable housekeeping gene, so it was used to calculate the relative expression levels in this study. Out of the 39 selected genes, 36 showed a strong correlation (92.3%) to the RNA-Seq data (Fig 6). The qRT-PCR results confirmed that PER1, PER2, ADH1, LBD29, LBD41, and PIN1 were significantly up-regulated at the two time points; AUX22C, AUX15A, and QORL (Quinone oxidoreductase-like protein) were significantly up-regulated at Wat6 but returned to their original levels by Wat24; and the other genes showed a significant reduction at both time points.
The gene expression levels measured by qRT-PCR were compared with that of RNA-Seq. White histograms represent expression levels determined by RNA-Seq in RPKM units (left axis), while grey columns represent gene expression levels determined by qRT-PCR and normalized to three control genes (right axis). Bars represent the mean (± SE) of three experiments. Different letters (a, b, and c) represent statistically significant differences (P < 0.01) among the data of qRT-PCR, analysed using Student’s t-test.
Transcriptomic data can reveal gene expression profiles and give fundamental insights into biological processes. As a high-throughput, accurate and low-cost method, RNA-Seq, a new next-generation sequencing (NGS) method, has been widely applied to analyze transcriptomes qualitatively and quantitatively. NGS has proven to be a powerful tool for DEG screening, especially for species without available genomic information [42, 43]. In this study, the Illumina HiSeq 2000 platform was used to perform a de novo transcriptome sequencing analysis of the mung bean to better understand gene expression changes during adventitious rooting. Pooled RNA samples from hypocotyls and hypocotyls sampled at two time points after primary root excision were used to construct cDNA libraries for deep sequencing. This sequencing generated 7.36 Gbp, 5.998 Gbp, and 5.885 Gbp of sequence data, and obtained approximately 68.32 million, 55.58 million, and 55.57 million paired-end clean reads in the mung bean hypocotyls 0 h, 6 h, and 24 h after primary root excision, respectively. The newly developed Trinity method was used for de novo reads assembly. The Trinity method can recover more full-length transcripts across a broad range of expression levels and provides a unified, sensitive solution for transcriptome reconstruction in species without a reference genome, similar to methods that rely on genome alignments . Another study demonstrated that Trinity was a better approach than was SOAPdenovo for assembly, as the assembled unigenes did not contain gaps, and the average unigene length was nearly twice the length of those produced by SOAPdenovo . After de novo assembly, we obtained 78,697 unigenes with a mean length of 832 bp, which is longer than has been reported previously in studies using the same technology [26, 28, 42, 43]. Among the total number of unigenes, 91.92% (72,342), 84.71% (66,663), and 82.19% (64,680) of the unigenes were expressed in the Con, Wat6, and Wat24 samples, respectively. Consequently, the read number, mapped read number, and expressed genes show decreasing trends during adventitious rooting.
To understand the gene expression profile during rooting, the clean reads were mapped back to the assembled unigenes using the BWA-0.6.2 software. The number of reads mapped to each unigene was then counted and normalized using RPKM . Gene expression values were measured using the method described by DEGseq R package . We identified a total of 11,717 unigenes that showed differential expression (fold change>2) during the adventitious root induction stage, whereas 12,737 unigenes showed differential expression during the adventitious root initiation stage. Between the induction stage and the initiation stage, 5,334 unigenes showed differential expression, suggesting their possible role in the activation of the primordium and root meristem formation. Using a DNA microarray method, Rigal et al. (2012) studied gene expression changes during adventitious rooting in the model tree Populus trichocarpa. Their results indicated that 5,781 genes were differentially expressed in the organization of the adventitious root primordium; 6,538 genes were differentially expressed during primordium differentiation; and 1,146 genes were differentially expressed between these two stages . In another similar study using cDNA microarrays, Brinker et al. (2004) identified 220 genes that changed significantly during root development in hypocotyl cuttings of Pinus contorta . The results obtained suggest that RNA-Seq is a sensitive, low-cost, and accurate method for deep-sequencing transcriptome of plant without available genomic information and was able to identify more DEGs during the early stages of adventitious rooting relative to the results of DNA microarrays. This technology also enables the precise elucidation of transcripts in the samples.
GO enrichment analysis indicated that the majority of GO categories significantly up-regulated at the root induction and initiation stages were protein synthesis-related, including ribosome, structural constituent of ribosome, translation, ribonucleoprotein complex, ribosomal subunit, cytosolic ribosome, non-membrane-bounded organelle, intracellular non-membrane-bounded organelle, and cytosolic part. Conversely, the significantly down-regulated GO categories were DNA, RNA synthesis-related, and signal transduction-related, which included DNA integration, RNA biosynthetic, nucleic acid metabolic process, nucleic acid binding transcription factor activity, sequence-specific DNA binding transcription factor activity. These results indicate that during the root induction stage, the cells experience an increase in the assembly of ribosomes and protein synthesis and a reduction of DNA and RNA synthesis . GO categories related to response to stimulus and hormone signaling, such as response to chemical stimulus, oxidoreductase activity, response to endogenous stimulus, response to auxin stimulus, response to stimulus, response to hormone stimulus, and response to organic substance were significantly up-regulated at the root induction stage and down-regulated at the root initiation stage.
KEGG enrichment revealed that pathways such as ribosome, phenylpropanoid biosynthesis, phenylalanine metabolism, and terpenoid biosynthesis were up-regulated, whereas pathways such as photosynthesis, carbon fixation, carotenoid biosynthesis, nitrogen metabolism, sphingolipid metabolism, glycerolipid metabolism, cutin, cytokine-cytokine receptor interaction, oxidative phosphorylation, and plant hormone signal transduction were significantly down-regulated during the early stage of adventitious rooting. The loss of the photosynthetic function of hypocotyl cells was also revealed in hypocotyl cuttings of P. contorta at an early stage of adventitious root formation .
Although many genes specifically involved in the regulation of adventitious rooting have been identified in several plant species, the global profiling of gene expression during this process is not well studied using transcriptomic method. To better understand gene expression patterns during early root development, we selected the genes that exhibited greater than 32-fold changes and higher abundant expression (read number >1000). The most highly up-regulated unigenes encoded proteins involved in (1) functions related to stress, such as cationic peroxidase, pathogenesis-related protein, 7-ethoxycoumarin O-deethylase-like (cytochrome P450 monooxygenase), peroxidase C3-like isoform 2, low-temperature-induced 65 kDa protein-like (water stress-induced), early nodulin-like protein 1-like (phytocyanin family of blue copper proteins, a ubiquitous family of plant cupredoxins), late embryogenesis abundant protein (LEA-18, water stress-induced), heat shock 70 kDa protein-like; anthocyanin metabolism-associated and flavone metabolism-associated genes; (2) functions related to cell wall remodeling, such as polygalacturonase and polygalacturonase PG1 precursor (a pectin lyase-like superfamily protein), endoglucanase 17-like, peroxidase C3-like isoform 2 (lignin biosynthesis activity); (3) functions related to protein and lipid metabolism, such as ef1a, metacaspase-9-like (a peptidase), vignain-like (a peptidase), E3 ubiquitin-protein ligase HERC1-like, trypsin protease inhibitor precursor, and patatin group A-3-like (phospholipase A2 activity); (4) functions related to auxin transport and signal transduction, such as auxin efflux carrier, auxin-binding protein ABP19a-like, and proline-rich protein; and (5) a function act as transcription factors, such as ethylene-responsive transcription factor ERF086-like and the MYB family MYB114. In addition, the gene encoding casparian strip membrane protein 2 (unknown function) was specifically expressed in Wat24 compared with Wat6 (Table 14).
During the auxin-induced root initiation stage in hypocotyl cuttings of Pinus contorta, genes involved in cell replication and cell wall weakening and a transcript encoding a PINHEAD/ZWILLE-like protein were up-regulated, while genes related to auxin transport, photosynthesis, and cell wall synthesis were down-regulated. During the root meristem formation stage, the transcript abundance of genes involved in auxin transport, auxin responsive transcription, and cell wall synthesis, as well as a gene encoding a B-box zinc finger-like protein, was increased, while transcripts encoding proteins involved in cell wall weakening were decreased . The most highly over-expressed transcripts during the induction stage of adventitious rooting in the stem cuttings of Populus trichocarpa were from genes that encoded proteins involved in cell wall remodeling, such as glycoside hydrolases (GHs), pectate lyases, pectin esterases and expansins, auxin-, gibberellin-, or ethylene-responsive genes, as well as genes that have been implicated in signaling, such as Ser/Thr protein kinases. The members of the AP2/ERF, MYB, NAC, WRKY, and bHLH transcription factor families exhibited significant expression changes during this process . Table 14 summarizes the most differentially expressed genes during early stage of adventitious rooting with or without auxin treatment in several plant species investigated using DNA microarray or RNA-seq technologies. This summary indicates that, during early stage of adventitious rooting, the common gene functional categories occur in all plants investigated, including stress response, cell wall weakening and modification, plant hormone signaling, signal transduction, and transcription factors. Although the same genes appear in this list, the auxin-induced expression genes, even the plant-specific expression genes are also distinct. For example, genes encoding histone H3 and CDC2 associated with cell replication, genes encoding PINHEAD/ZWILLE-like protein, DNA binding protein, and B-box zinc finger like protein involved in signal transduction, were induced by IBA; and genes encoding AUX1-like and SAMS were repressed by IBA in Pinus contorta . Genes encoding GH3, indole-3-acetate O-methyltransferase, and cytokinin oxidases involved in plant hormone signaling, and gene encoding glutathione S-transferases (GSTs) in glutathione synthesis, were induced by IBA; and genes encoding adenylate isopentenyltransferase and cytokinin hydroxylases involved in plant hormone signaling were down-regulated by IBA in Camellia sinensis . Genes encoding lateral root primordium (lrp1), SCARECROW-like6, PISTILLATA, and AINTEGUMENTA LIKE1 (PtAIL1, PtPLT1.1, and PtAIL9) in AP2/ERF transcription factor family, were up-regulated in Populus trichocarpa without auxin treatment . Genes encoding metacaspase-9-like (a peptidase), vignain-like (a peptidase), and trypsin protease inhibitor precursor involved in protein degradation, gene encoding patatin group A-3-like (a phospholipase) involved in lipid metabolism, and genes encoding potassium transporter 5-like related to transporter, were up-regulated; and genes encoding S-type anion channel SLAH3-like and organic cation transport protein functioning as transporters were down-regulated in Vigna radiata in this study.
The profile of the top up-regulated genes indicates that stress-response processes are paramount during the early stage of adventitious rooting. The excision of primary roots, acting as a type of wounding stress, causes an oxidative burst leading to oxidative stress during adventitious rooting in mung bean seedlings [16, 18, 23–25]. Many studies have shown that in the course of adventitious rooting, peroxidase (POD) activity is sharply reduced during the induction stage, increased during the initiation stage and gradually reduced during the expression stage [16, 23, 47, 48]. Peroxidases comprise a large family of enzymes that function as antioxidants and also respond to water stress . Late embryogenesis abundant (LEA) proteins act as water-binding molecules, membrane-stabilizers, and ion modulators and are induced by drought stress [49–52]. Increases in LEA proteins and pathogenesis-related proteins indicate that the plants were exposed to water stress . During the root primordia formation stage in P. contorta, transcripts encoding enzymes of the flavonoid pathway were up-regulated . One of these flavonoid pathway proteins, chalcone synthase, and a pathogenesis-related protein contribute to a constitutive defense barrier in the root epidermis of the pea . Increases in the expression of isoflavone reductase-like and isoflavone 2'-hydroxylase-like genes suggests a role for the flavonoid pathway in response to stress during early stages of rooting.
The regulation of genes with potential roles in cell wall remodeling is an essential process for adventitious root formation. In this study, the top up-regulated genes included many that encode proteins involved in cell wall synthesis and loosening, such as the significantly up-regulated genes endoglucanase 17-like and endo-1,3(4)-beta-glucanases, which are members of the glycoside hydrolase family and potentially participate in cell wall loosening . The phenylpropanoid biosynthesis and phenylalanine metabolism pathways were also significantly up-regulated. The phenylpropanoid polymer complex is the component of lignin that accumulates between cellulose, hemicellulose and pectin components in the cell wall. Phenylpropanoid synthesis starts from phenylalanine . The common derivatives from phenylpropanoid pathway, phenolic acids, flavonoids, and lignin , are crucial regulators in cell division and differentiation  and stimulate in vitro rooting . During the early stage of adventitious rooting, genes with the potential to be active in cell wall synthesis were down-regulated, while genes involved in weakening cell walls were up-regulated, suggesting that the cell walls were undergoing remodeling and weakening [6, 9].
Genes associated with auxin transport and signaling are regulated during the early stage of adventitious rooting. During the root induction stage, the gene encoding auxin efflux carrier was significantly up-regulated. Auxin efflux carriers control auxin distribution to establish and maintain auxin concentration gradients in various tissues , triggering the establishment of new growth axes . During the root initiation stage, an auxin signaling gene, auxin-binding protein ABP19a-like, was specifically expressed. ABP1 has known to mediate rapid cellular auxin effects through the non-transcriptional auxin response pathway and is essential for auxin-regulated processes. ABP1 also regulates the expression of AUX/IAA genes [61, 62], suggesting that active transport of auxin starts during the early stage of root meristem formation. However, down-regulation was observed in another auxin carrier complex gene, ABC transporter G family member 22-like, which functions in cellular auxin efflux and influx .
In this study, the gene encoding the ethylene-responsive transcription factor ERF017-like was significantly down-regulated during the root initiation stage. During root primordia formation in P. contorta cuttings, the gene encoding an ethylene responsive element binding protein (EREBP)-like protein was down-regulated . Ethylene biosynthesis has been demonstrated to be required for adventitious root formation, and there is crosstalk between ethylene and auxin during the process of adventitious root formation in tomato [64, 65]. These results suggest that ethylene signaling also mediates adventitious rooting.
The MYB transcription factor is an important regulator of the initiation and primordium formation of adventitious roots. Interestingly, three genes encoding the MYB transcription factor MYB134 were significantly down-regulated during the root induction stage but significantly up-regulated during the root initiation stage (S7 Table and Fig 6) in this study. In cuttings of Populus trichocarpa, MYB family expression levels changed the most during primordium differentiation . MYB77 acts in a synergistic manner with ARF7 to enhance the expression of auxin-responsive genes and mediate the auxin response . These results suggest that members of the MYB transcription factor family mediate the initiation of and the formation of adventitious root primordium.
In recent years, we have made great efforts to reveal the mechanisms that regulate adventitious rooting in plants at the physiological and molecular levels using a model plant Vigna radiata (L.) R. Wilczek, a tropical legume that serves as a significant source of dietary protein for the people of Asia and Africa. In this paper, we provide the first study to report the transcriptome of seedling hypocotyls of V. radiata and in vitro adventitious rooting of hypocotyl cuttings using RNA-Seq analysis. We obtained 78,697 assembled unigenes using the Trinity de novo assembly method. Among these unigenes, 72,342, 66,663, and 64,680 genes were expressed in hypocotyls or at the 6 h and 24 h time points during adventitious rooting, respectively, but only 29,029 (36.77%) unigenes could be annotated using public databases. The global transcriptomic data reveal that profound cellular and metabolic reorganization occurs during the root induction stage. We used gene clustering and the enrichment of GO terms and KEGG pathways to describe the overall biological processes regulated during this developmental process. We also used RPKM analysis to investigate the differentially expressed gene profiles at the three developmental stages. Furthermore, real-time quantitative PCR was used to confirm the differential expression levels observed for 39 of the unigenes. The results obtained using RNA-Seq were consistent with the average expression levels in three biological replicates. Further investigation of the transcriptional changes at more closely spaced developmental stages will provide additional valuable information. Our full transcript abundance analysis, presented in S2 and S7 Tables, represents a useful resource for further insight into mung bean transcriptome and in vitro adventitious root development and for candidate gene selection.
S3 Fig. Venn diagram of number of unigenes annotated by BLASTx with an E-value threshold of 10−5 against protein databases.
The numbers in the circles indicate the number of unigenes annotated by single or multiple databases. The Venn diagram shows unigenes unique to each database and which are shared amongst different databases.
S4 Fig. Percentage of unigenes matching the 10 top species using BLASTx against the Nr database.
S1 Table. Statistics of random 100,000 sequences alignment against Nr database.
S2 Table. BLAST results against the NCBI Nr database for all the assembled unigenes with an E-value threshold of 1e-5.
S3 Table. Top 10 significant GOs in the three samples.
S4 Table. Top list of significantly up- and down-regulated GOs.
S5 Table. Top list of significantly up- and down-regulated KOs.
S6 Table. Top list of significantly up- and down-regulated DEGs in the three samples.
S7 Table. The expression abundance of unigenes in the three samples presented as read number and RPKM.
Conceived and designed the experiments: SWL. Performed the experiments: RFS YL. Analyzed the data: SWL RFS. Contributed reagents/materials/analysis tools: SWL. Wrote the paper: SWL.
- 1. Li S-W, Xue L, Xu S, Feng H, An L. Mediators, genes and signaling in adventitious rooting. Bot Rev. 2009; 75: 230–247.
- 2. Ramirez-Carvajal GA, Davis JM. Cutting to the base: Identifying regulators of adventitious rooting. Plant Signal Behav. 2010; 5: 281–283. pmid:20037469
- 3. De Klerk GJ, Van Der Krieken W, De Jong JC. The formation of adventitious roots: new concepts, new possibilities. In Vitro Cell Dev Biol Plant. 1999; 35: 189–199.
- 4. Batish DR, Singh HP, Kaur S, Kohli RK, Yadav SS. Caffeic acid affects early growth, and morphogenetic response of hypocotyl cuttings of mung bean (Phaseolus aureus). J Plant Physiol. 2008; 165: 297–305. pmid:17643552
- 5. Gutierrez L, Bussell JD, Pacurar DI, Schwambach J, Pacurar M, Bellini C. Phenotypic plasticity of adventitious rooting in Arabidopsis is controlled by complex regulation of AUXIN RESPONSE FACTOR transcripts and microRNA abundance. Plant Cell. 2009; 21: 3119–3132. pmid:19820192
- 6. Brinker M, van Zyl L, Liu W, Craig D, Sederoff RR, Clapham DH, et al. Microarray analyses of gene expression during adventitious root development in Pinus contorta. Plant Physiol. 2004; 135: 1526–1539. pmid:15247392
- 7. Sorin C, Negroni L, Balliau T, Corti H, Jacquemot MP, Davanture M, et al. Proteomic analysis of different mutant genotypes of Arabidopsis led to the identification of 11 proteins correlating with adventitious root development. Plant Physiol. 2006; 140: 349–364. pmid:16377752
- 8. Holmes P, Djordjevic MA, Imin N. Global gene expression analysis of in vitro root formation in Medicago truncatula. Fun Plant Biol. 2010; 37: 1117–1131.
- 9. Rigal A, Yordanov YS, Perrone I, Karlberg A, Tisserant E, Bellini C, et al. The AINTEGUMENTA LIKE1 homeotic transcription factor PtAIL1 controls the formation of adventitious root primordia in poplar. Plant Physiol. 2012; 160: 1996–2006. pmid:23077242
- 10. Holmes P, Goffard N, Weiller GF, Rolfe BG, Imin N. Transcriptional profiling of Medicago truncatula meristematic root cells. BMC Plant Biol. 2008; 8: 1–21.
- 11. Sweetman C, Wong DCJ, Ford CM, Drew DP. Transcriptome analysis at four developmental stages of grape berry (Vitis vinifera cv. Shiraz) provides insights into regulated and coordinated gene expression. BMC Genomics. 2012; 13: 691. pmid:23227855
- 12. Annadurai RS, Jayakumar V, Mugasimangalam RC, Katta MAVSK, S Anand, S Gopinathan, et al. Next generation sequencing and de novo transcriptome analysis of Costus pictus D. Don, a non-model plant with potent anti-diabetic properties. BMC Genomics. 2012; 13: 663. pmid:23176672
- 13. Turktas M, Kurtoglu KY, Korado G, Zhang B, Hernande P, Unver T. Sequencing of plant genomes—a review. Turk J Agric For. 2014; 38: 1–16.
- 14. Huang H-H, Xu L-L, Tong Z-K, Lin E-P, Liu Q-P, Cheng L-J, et al. De novo characterization of the Chinese fir (Cunninghamia lanceolata) transcriptome and analysis of candidate genes involved in cellulose and lignin biosynthesis. BMC Genomics. 2012; 13: 648. pmid:23171398
- 15. Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008; 5: 621–628. pmid:18516045
- 16. Li S-W, Xue L, Xu S, Feng H, An L. Hydrogen peroxide acts as a signal molecule in the adventitious root formation of mung bean seedlings. Environ Exp Bot. 2009; 65: 63–71.
- 17. De Klerk GJ, Keppel M, Terbrugge J, Meekes H. Timing of the phases in adventitious root-formation in apple microcuttings. J Exp Bot. 1995; 46: 965–972.
- 18. Yang W, Zhu C, Ma X, Li G, Gan L, Ng D, et al. Hydrogen peroxide is a second messenger in the salicylic acid-triggered adventitious rooting process in mung bean seedlings. PLoS ONE. 2013; 8(12): e84580. pmid:24386397
- 19. Ramirez-Carvajal GA, Morse AM, Dervinis C, Davis JM. The cytokinin type-B response regulator PtRR13 is a negative regulator of adventitious root development in Populus. Plant Physiol. 2009; 150: 759–771. pmid:19395410
- 20. De Klerk GJ, Arnholdt-Schmitt B, Lieberei R, Neumann KH. Regeneration of roots, shoots and embryos: physiological, biochemical and molecular aspects. Biol Plant. 1997; 39: 53–66.
- 21. Schafleitner R, Nair RM, Rathore A, Wang Y-W, Lin C-Y, Chu S-U, et al. The AVRDC—The World Vegetable Center mungbean (Vigna radiata) core and mini core collections. BMC Genomics. 2015; 16:344. pmid:25925106
- 22. Wiesmann Z, Riov J, Epstein E. Comparison of movement and metabolism of indole-3-acetic acid and indole-3-butyric acid in mung bean cuttings. Physiol Plant. 1988; 74: 556–560.
- 23. Li S-W, Xue L, Xu S, Feng H, An L. IBA-induced changes in antioxidant enzymes during adventitious rooting in mung bean seedlings: the role of H2O2. Environ Exp Bot. 2009; 66: 442–450.
- 24. Bai X, Todd CD, Desikan R, Yang Y, Hu X. N-3-oxo-decanoyl-L-homoserine-lactone activates auxin-induced adventitious root formation via hydrogen peroxide- and nitric oxide-dependent cyclic GMP signaling in mung bean. Plant Physiol. 2012; 158: 725–736. pmid:22138973
- 25. Li S-W, Leng Y, Feng L, Zeng X-Y. Involvement of abscisic acid in regulating antioxidative defense systems and IAA-oxidase activity and improving adventitious rooting in mung bean [Vigna radiata (L.)Wilczek] seedlings under cadmium stress. Environ Sci Pollut Res. 2014; 21:525–537.
- 26. Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011; 29: 644–652. pmid:21572440
- 27. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+: architecture and applications. BMC Bioinformatics. 2009; 10: 421. pmid:20003500
- 28. Xu DL, Long H, Liang JJ, Zhang J, Chen X, Li JL, et al. De novo assembly and characterization of the root transcriptome of Aegilops variabilis during an interaction with the cereal cyst nematode. BMC Genomics. 2012; 13: 133. pmid:22494814
- 29. Barrero RA, Chapman B, Yang Y, Moolhuijzen P, Keeble-Gagnère G, Zhang N, et al. De novo assembly of Euphorbia fischeriana root transcriptome identifies prostratin pathway related genes. BMC Genomics. 2011; 12: 600. pmid:22151917
- 30. Langmead B, Salzberg SL. Fast gapped-read alignment with bowtie 2. Nat Methods. 2012; 9: 357–359. pmid:22388286
- 31. Conesa A, Gotz S, García-Gómez JM, Terol J, Talón M, Robles M. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005; 21: 3674–3676. pmid:16081474
- 32. Ye J, Fang L, Zheng H, Zhang Y, Chen J, Zhang Z, et al. WEGO: a web tool for plotting GO annotations. Nucleic Acids Res. 2006; 34: W293–W297. pmid:16845012
- 33. Iseli C, Jongeneel CV, Bucher P. ESTScan a program for detecting, evaluating, and reconstructing potential coding regions in EST sequences. Proceedings/International Conference on Intelligent Systems for Molecular Biology; ISMB International Conference on Intelligent Systems for Molecular Biology. 1999. pp. 138–148.
- 34. Kanehisa M, Goto S, Kawashima S, Okuno Y, Hattori M. The KEGG resource for deciphering the genome. Nucleic Acids Res. 2004; 32: D277–D280. pmid:14681412
- 35. Kanehisa M, Goto S. KEGG Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 2000; 28: 27–30. pmid:10592173
- 36. Mao X, Cai T, Olyarchuk JG, Wei L. Automated genome annotation and pathway identification using the KEGG Orthology (KO) as a controlled vocabulary. Bioinformatics. 2005; 21: 3787–3793. pmid:15817693
- 37. Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 2007; 35(suppl 2): W182–W185.
- 38. Wang L, Feng Z, Wang X, Wang X, Zhang X. DEGseq: an R package for identifying differentially expressed genes from RNA-Seq data. Bioinformatics. 2010; 26: 136–138. pmid:19855105
- 39. Audic S, Claverie JM. The significance of digital gene expression profiles. Genome Res. 1997; 7: 986–995. pmid:9331369
- 40. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Stat So Ser B. 1995; 57: 289–300.
- 41. Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real time quantitative PCR and the 2-ΔΔCt method. Methods. 2001; 25: 402–408. pmid:11846609
- 42. Qiu Q, Ma T, Hu Q, Liu B, Wu Y, Zhou H, et al. Genome-scale transcriptome analysis of the desert poplar, Populus euphratica. Tree Physiol. 2011; 31: 452–461. pmid:21427158
- 43. Wong MM, Cannon CH, Wickneswari R. Identification of lignin genes and regulatory sequences involved in secondary cell wall formation in Acacia auriculiformis and Acacia mangium via de novo transcriptome sequencing. BMC Genomics. 2011; 12: 342. pmid:21729267
- 44. Feng C, Chen M, Xu C-J, Bai L, Yin X-R, Li X, et al. Transcriptomic analysis of Chinese bayberry (Myrica rubra) fruit development and ripening using RNA-Seq. BMC Genomics. 2012; 13: 19. pmid:22244270
- 45. Jian B, Liu B, Bi Y, Hou W, Wu C, Han T. Validation of internal control for gene expression study in soybean by quantitative real-time PCR. BMC Mol Biol. 2008; 9:59. pmid:18573215
- 46. Wei K, Wang L-Y, Wu L-Y, Zhang C-C, Li H-L, Tan L-Q, et al. Transcriptome analysis of indole-3-butyric acid-induced adventitious root formation in nodal cuttings of Camellia sinensis (L.). PLoS ONE 9(9): 2014; e107201. pmid:25216187
- 47. Hatzilazarou SP, Syros TD, Yupsanis TA, Bosabalidis AM, Economou AS. Peroxidases, lignin and anatomy during in vitro and ex vitro rooting of gardenia (Gardenia jasminoides Ellis) microshoots. J Plant Physiol. 2006; 163: 827–836. pmid:16777530
- 48. Metaxas D, Syros T, Yupsanis T, Economou AS. Peroxidases during adventitious rooting in cuttings of Arbutus unedo and Taxus baccata as affected by plant genotype and growth regulator treatment. Plant Growth Regul. 2004; 44: 257–266.
- 49. Alvarez S, Marsh EL, Schroeder SG, Schachtman DP. Metabolomic and proteomic changes in the xylem sap of maize under drought. Plant Cell Environ. 2008; 31: 325–340. pmid:18088330
- 50. Hand SC, Menze MA, Toner M, Boswell L, Moore D. LEA proteins during water stress: not just for plants anymore. Annu Rev Physio. 2011; l73: 115–134.
- 51. Rorat T. Plant dehydrins—Tissue location, structure and function. Cell Mol Biol Lett. 2006; 11: 536–556. pmid:16983453
- 52. Wang W, Vinocur B, Altman A. Plant responses to drought, salinity and extreme temperatures: towards genetic engineering for stress tolerance. Planta. 2003; 218: 1–14. pmid:14513379
- 53. Mylona P, Moerman M, Yang WC, Gloudemans T, Van de Kerckhove J, van Kammen A, et al. The root epidermis specific pea gene RH2 is homologous to a pathogenesis-related gene. Plant Mol Biol. 1994; 26: 39–50. pmid:7948884
- 54. Cosgrove DJ. Growth of the plant cell wall. Nat Rev Mol Cell Biol. 2005; 6: 850–861. pmid:16261190
- 55. Davin LB, Jourdes M, Patten AM, Kim KW, Vassao DG, Lewis NG. Dissection of lignin macromolecular configuration and assembly: Comparison to related biochemical processes in allyl/propenyl phenol and lignan biosynthesis. Nat Prod Rep. 2008; 25: 1015–1090. pmid:19030603
- 56. Boudet AM, LaPierre C, Grima-Pettenati J. Biochemistry and molecular biology of lignification. New Phytol. 1995; 129:203–236.
- 57. Tamagnone L, Merida A, Stacey N, Plaskitt K, Parr A, Chang CF, et al. Inhibition of phenolic acid metabolism results in precocious cell death and altered cell morphology in leaves of transgenic tobacco plants. Plant Cell. 1998; 10: 1801–1816. pmid:9811790
- 58. Macedo ES, Sircar D, Cardoso HG, Peixe A, Arnholdt-Schmitt B. Involvement of alternative oxidase (AOX) in adventitious rooting of Olea europaea L. microshoots is linked to adaptive phenylpropanoid and lignin metabolism, Plant Cell Rep. 2012; 31: 1581–1590. pmid:22544084
- 59. Zazimalova E, Krecek P, Skupa P, Hoyerova K, Petrasek J. Polar transport of the plant hormone auxin—the role of PIN-FORMED (PIN) proteins. Cell Mol Life Sci. 2007; 64: 1621–1637. pmid:17458499
- 60. Chapman EJ, Estelle M. Mechanism of auxin-regulated gene expression in plants. Annu Rev Genet. 2009; 43: 265–285. pmid:19686081
- 61. Braun N, Wyrzykowska J, Mulle P, David K, Couch D, Perrot-Reichenmann C, et al. Conditional repression of AUXIN BINDING PROTEIN1 reveals that it coordinates cell division and cell expansion during postembryonic shoot development in Arabidopsis and tobacco. Plant Cel.l 2008; 20: 2746–2762.
- 62. Tromas A, Braun N, Muller P, Khodus T, Paponov IA, Palme K, et al. The AUXIN BINDING PROTEIN 1 is required for differential auxin responses mediating root growth. PLoS One. 2009; 4: e6648. pmid:19777056
- 63. Cho M, Lee SH, Cho H-T. P-glycoprotein4 displays auxin efflux transporter-like action in Arabidopsis root hair cells and tobacco cells. Plant Cell. 2007; 19:3930–3943. pmid:18156217
- 64. Negi S, Sukumar P, Liu X, Cohen JD, Muday GK. Genetic dissection of the role of ethylene in regulating auxin-dependent lateral and adventitious root formation in tomato. Plant J. 2010; 61: 3–15. pmid:19793078
- 65. Vidoz ML, Loreti E, Mensuali A, Alpi A, Perata P. Hormonal interplay during adventitious root formation in flooded tomato plants. Plant J. 2010; 63: 551–562. pmid:20497380
- 66. Shin R, Burch AY, Huppert KA, Tiwari SB, Murphy AS, Guilfoyle TJ, et al. The Arabidopsis transcription factor MYB77 modulates auxin signal transduction. Plant Cell. 2007; 19: 2440–2453. pmid:17675404