Figures
Abstract
GRAS transcriptional factors have diverse functions in plant growth and development, and are named after the first three transcription factors, namely, GAI (GIBBERELLIC ACID INSENSITIVE), RGA (REPRESSOR OF GAI) and SCR (SCARECROW) identified in this family. Knowledge of the GRAS gene family in maize remains was largely unknown, and their characterization is necessary to understand their importance in the maize life cycle. This study identified 86 GRAS genes in maize, and further characterized with phylogenetics, gene structural analysis, genomic loci, and expression patterns. The 86 GRAS genes were divided into 8 groups (SCL3, HAM, LS, SCR, DELLA, SHR, PAT1 and LISCL) by phylogenetic analysis. Most of the maize GRAS genes contain one exon (80.23%) and closely related members in the phylogenetic tree had similar structure and motif composition. Different motifs especially in the N-terminus might be the sources of their functional divergence. Segmental- and tandem-duplication occurred in this family leading to expansion of maize GRAS genes and the expression patterns of the duplicated genes in the heat map according to the published microarray data were very similar. Quantitative RT-PCR (qRT-PCR) results demonstrated that the expression level of genes in different tissues were different, suggesting their differential roles in plant growth and development. The data set expands our knowledge to understanding the function of GRAS genes in maize, an important crop plant in the world.
Citation: Guo Y, Wu H, Li X, Li Q, Zhao X, Duan X, et al. (2017) Identification and expression of GRAS family genes in maize (Zea mays L.). PLoS ONE 12(9): e0185418. https://doi.org/10.1371/journal.pone.0185418
Editor: Meng-xiang Sun, Wuhan University, CHINA
Received: January 19, 2017; Accepted: September 12, 2017; Published: September 28, 2017
Copyright: © 2017 Guo et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: This work was supported by the Ministry of Agriculture of China (grant number 2016ZX08009002), the foundation of the Ministry of Education Key Laboratory of Cell Activities and Stress Adaptations (grant number lzujbky-2016-bt05), the National Natural Science Foundation of China (grant number 31301080) and Funds of Shandong "Double Tops" Program.
Competing interests: The authors have declared that no competing interests exist.
Introduction
The GRAS gene family is an important plant-specific transcription factor family whose name is an acronym of the first three identified members: GIBBERELLIC ACID INSENSITIVE (GAI), REPRESSOR OF GA1 (RGA), and SCARECROW (SCR) [1]. Typically, GRAS proteins are 400–700 amino acids and exhibit some C-terminal homology. A common characteristic of all GRAS proteins is the presence of 5 carboxy-terminal motifs in the order of: leucine heptad repeat Ⅰ(LHRI), VHIID motif, leucine heptad repeat Ⅱ(LHRⅡ), PFYRE motif, and SAW motif [2–3]. Leucine heptad repeats are frequently found in bZIP transcription factors, which are important for protein-protein interactions [4]. The VHIID and PFYRE motifs are consistently found and highly conserved in GRAS proteins. The SAW motif is less conserved, but has 3 conserved pairs of amino acids found with consistent spacing: R-E, W-G, and W-W. While the function of these motifs is unknown, their highly conserved nature suggests that they are critical to GRAS protein function [5]. These conserved motifs can directly affect the function of GRAS proteins; in fact, mutations in the SAW and PFYRE motifs of SLR1 and RGA proteins result in huge phenotypic variation in Arabidopsis [6–7]. The order of these conserved motifs is similar across most GRAS proteins. In contrast, the N-terminus is variable, except for DELLA subfamily that contains 2 conserved motifs (named DELLA and TVHYNP) and the variable length and sequence of N-terminus seems like the major contributor for their gene-specific functions [8, 9, 10]. The GRAS family is divided into 7 subfamilies at first, DELLA, SCARECROW (SCR), LATERAL SUPPRESSOR (LS), HAIRY MERISTEM (HAM), Phytochrome A signal transduction 1 (PAT1), SHORT-ROOT (SHR) and SCARECROW-LIKE9 (SCL9) [1, 4]. Then the SCL9 was renamed as LISCL, and a new subfamily SCL3 was established and the GRAS gene family was divided into eight distinct branches, namely LISCL, PAT1, SCL3, DELLA, SCR, SHR, LS and HAM, based on studies on the model plants Arabidopsis and rice [7]. Later, the GRAS family was divided into ten subfamilies, DELLA, AtSCR, AtLAS, HAM, AtPAT1, AtSHR, AtSCL3 and LISCL, DLT, AtSCL4/7 [9]. In other studies, the GRAS genes can be divided into at least 13 branches [11] in Populus, Arabidopsis and rice, and 16 branches [12] in Medicago truncatula. So far, the GRAS gene family has been genome-wide explored in several plant species, including Populus, Arabidopsis, rice, Chinese cabbage, Prunus mume, tomato, sacred lotus, grapevine, Isatis indigotica, Medicago truncatula, Castor Beans, and pine [11, 12, 13, 14, 15, 16, 17, 18, 19, 20].
The GRAS gene family plays a crucial role in diverse plant growth and development processes, including gibberellin signal transduction [21, 22], phytochrome A signal transduction [23, 24], axillary meristem initiation [25], shoot meristem maintenance [26], root radial patterning [27, 28]. For example, PAT1 and SCL21 are mainly involved in phytochrome A (phyA) signal transduction pathways in Arabidopsis thaliana. Light signaling via the phyA photoreceptor controls basic plant developmental processes, such as de-etiolation and hypocotyl elongation [24]. SCARECROW-LIKE 13 (SCL13) mainly involves in phyB signal transduction pathways to control basic plant developmental processes like PAT1 and SCL21 in Arabidopsis thaliana [29]. SCARECROW-LIKE 3 (SCL3) is involved in signal transduction pathways through gibberellins (GAs) and acts as a positive regulator to integrate and maintain a functional GA pathway by attenuating the DELLA repressors in the root endodermis [30]. DELLA subfamily is a representative subfamily of GA signaling that has been analyzed in detail. Gain- or loss-of-function mutants of the DELLA genes in Arabidopsis, maize, wheat, rice (Oryza sativa) and barley show GA-insensitive dwarf or GA constitutive response phenotypes [21, 31, 32, 33]. Studies of the Arabidopsis ga1-3 (RGA) and the rice SLENDER RICE 1 (SLR1) have demonstrated that these DELLA proteins function in the nucleus and are degraded rapidly when the plants are treated with GA [6, 34]. The degradation of the DELLA proteins is thought to be an essential event in GA signal transduction. Mutations which lack the DELLA motif or the surrounding regions cannot be degraded by treated with GA, and show a GA-insensitive dwarf phenotype [6, 35, 36]. Previous studies showed that miR171 can regulate some GRAS genes. The miR171-targeted SCL transcription factors SCL6/SCL6-IV, SCLL22/SCL6-III, SCL27/SCL6-II (also known as hairy meristems [HAM] and lost meristems [LOM]) have been demonstrated to play an important role in the proliferation of meristematic cells, polar organization and chlorophyll synthesis [3, 37–41]. DELLA-regulated POR expression is, at least in part, mediated by miR171-targeted SCLs in light [42]. MOC1, a member of LS subfamily, is a key gene for controlling rice tillers, which may improve the production of crops [43].
Compared with other families of transcription factors, very few researches have explored the whole genome of the GRAS families. The identification of GRAS members in different species was slightly different among studies. There were 32 to 34 genes identified from Arabidopsis [10, 11, 44], 57 and 60 GRAS genes were identified in rice in two reports [10, 11]. In addition, 68 GRAS transcription factors were identified in Medicago truncatula [12]. As more species have their genome sequenced available, more GRAS proteins could be identified among them. Furthermore, the genome-wide comparisons of GRAS family members may also be performed among several important species for evolutionary analysis.
Maize is one of the most crops in the world and it has tremendous value for providing food, forage, pharmaceuticals, and other industrial products. To improve root growth, plant height and seed size, it is necessary to explore the GRAS family in maize. With the availability of maize genome sequences [45], it is possible for us to identify all the GRAS family genes in maize and find the right gene which is very necessary for production or growth.
In this study, we conducted a genome-wide analysis for all the members of GRAS family in maize. The GRAS genes were identified with database on the website, conducted phylogenetic relationships and analyzed their protein structures and gene structures. We discerned their locations on the chromosomes and their expression patterns as well. Then, qRT-PCR was performed to confirm the expression patterns getting from the database. The data presented here is necessary to explore systematically the gene function of the maize GRAS family genes.
Results and discussion
Genome-wide identification of GRAS family members in maize
It is possible to identify all GRAS gene family members in maize because the maize genome has been sequenced [45]. Here we identified 86 GRAS transcription factors (ZmGRAS1-ZmGRAS86) from the maize genome and the gene location, the number of amino acids, molecular weight, theoretical pI, were analyzed and summarized in Table 1. The length of amino acid sequences encoded by ZmGRAS varied from 111 amino acids (aa) to 734 aa, and molecular weight ranged from 12308.9 to 72083.7 kDa and the pI varied from 4.4973 to 7.7965. The average value of pI was 6.44532, suggesting that the maize GRAS proteins tended to acidic. In addition, alternative splicing was found in 18 ZmGRAS genes, with 2 to 4 alternative splice forms (S1 Table).
However, inconsistent with our results, there were 104 and 112 GRAS genes of maize in the PlantTFDB website (http://planttfdb.cbi.pku.edu.cn/family.php?sp=Zma&fam=GRAS) and PlnTFDB website (http://plntfdb.bio.uni-potsdam.de/v3.0/fam_mem.php?family_id=GRAS&sp_id=ZMA), respectively (S2 Table). The number of the GRAS genes in the two websites was greater than our results. After analysis carefully, we found that the 104 GRAS genes of maize in the PlantTFDB website were very different from the 112 GRAS genes in the PlnTFDB website. Genes written in the red words were the different genes between the two websites in S2 Table. The 104 GRAS genes in the PlantTFDB website included alternative splicing genes. If the “genes” containing splice variants were considered one gene, there remained only 86 GRAS genes which was consistent with our result above. For example, the gene “GRMZM2G015080”in the PlantTFDB website existed two transcripts which were considered to be two genes, but only one gene, actually. In addition to alternative splicing, most of the GRAS genes in the PlnTFDB website could not be found in the MaizeGDB website (http://www.maizegdb.org/gene_center/gene) probably owning to the low version. We downloaded the GRAS protein sequences of Arabidopsis thaliana, Medicago truncatula, Oryza sativa and Sorghum bicolor from the PlantTFDB website.
Very few reports have been published on Zea mays GRAS proteins, to our knowledge, three of these genes were previously described, such as, D9 (ZmGRAS12) [45, 46], D8 [ZmGRAS54) [45, 47, 48] and SCR (ZmGRAS48) [49, 50](Table 1). The relatively high number of GRAS genes in maize may be due to the maize genome experiencing tandem and large-scale segmental duplications [51].
Phylogenetic analysis of GRAS genes
To study evolutionary relationships between GRAS transcription factors, the sequences of Arabidopsis, Medicago truncatula, Oryza sativa and Sorghum bicolor were downloaded for alignment and were used to conduct phylogenetic tree by PHYLIP (Version 3.695) using Neighbor-Joining method [52, 53]. The 86 maize GRAS proteins comprise 8 subfamilies (SCL3, HAM, LS, SCR, DELLA, SHR, PAT1 and LISCL) by clade support values, tree topology and Arabidopsis classification (Fig 1) [7]. Each of the SCR and LS subfamily contained only four maize GRAS genes and was the relatively small subfamily of the whole subfamilies. The number of maize GRAS genes in SHR, PAT1, DELLA, HAM, SCL3 was very similar about ten, while the LISCL subfamily was very different from the above subfamilies that contained the largest number of maize GRAS genes and this branch was also the largest branch in Fig 1, contained 106 members from Zea mays, Arabidopsis thaliana, Oryza sativa, Sorghum bicolor and Medicago truncatula. In this family, GRAS genes of eudicot plants Arabidopsis and Medicago truncatula didn’t clustered together with others tightly.
The proteins are clustered into 8 subgroups, signed in 8 different colors, representing subfamilies of SCR (blue), SCL3 (red), LISCL (green), PAT1 (purple), SHR (light blue), DELLA (orange), HAM (light purple), LS (pink). Gene ID of GRAS gene family members from Arabidopsis thaliana, Oryza sativa, Sorghum bicolor and Medicago truncatula were listed in the supporting information (S3 Table).
ZmGRAS protein sequence alignments and conserved motifs
In Arabidopsis, GRAS proteins have 5 conserved domains in the C terminus, named as LHR I, VHIID, LHR II, PFYRE and SAW [2–3]. To identify conserved domains, we performed an alignment within ZmGRAS protein sequences using Clustal X, (Version 2.0) [54]. Multiple sequence alignments of the 86 predicted ZmGRAS proteins led to the discovery of five conserved C-terminal GRAS domains mentioned above, which were similar to Arabidopsis GRAS proteins (S1 Fig). The conserved motifs for each GRAS protein were also identified using MEME (http://meme.sdsc.edu/meme/intron.html) (Fig 2B; S4 Table). A total of 20 conserved motifs were identified (named Motif1-20) and detailed information for each motif was listed in S4 Table. Motifs (Motif1, 2, 3, 4, 5, 6, 9 and 11) were widely distributed in most maize C-terminus of GRAS proteins, while the N-terminus contained various motifs, such as motif15, 16, 17, which was consistent with the previous conclusions that C-terminal region of the GRAS proteins was more conserved than the N-terminal region [2]. Although the motifs of the N-terminal regions of the GRAS genes were variable, it increased functional diversity and the complexity of biological networks of the GRAS genes [9, 10]. Genes among different subfamilies had divergent motifs in the N-terminal regions, but most of the GRAS proteins in the same subfamily had similar motifs. For example, In LISCL subfamily, there are three specific motifs (motif15, motif16, and motif17) in its C-terminus (Fig 2B; S2 Fig). The results were consistent with previous study that the pattern of protein disorder could be more conserved through evolution than the amino acid sequence in the N-terminus [55].
(A) The phylogenetic tree was constructed by PHYLIP using NJ method. The genes marked by the red lines were duplicated gene pairs mentioned in the following paragraph. (B) The motif sizes are indicated at the bottom of the figure. Different motifs are indicated by different colors numbered from motif1-20, and the combined P-values are shown on the left side of the figure. The same color in different proteins refers to the same motif. The structural features of the 20 motifs were listed in S4 Table. (C) The structures of the 86 putative maize GRAS genes. The exons and introns are represented by red boxes and black lines.
Structural organization of ZmGRAS genes and chromosomal localization
The overall pattern of intron positions can affect phylogenetic relationships when analyzing gene family evolution [56]. To evaluate the diversity of the GRAS genes, we analyzed the structure of each maize GRAS gene. The result showed that 69 (80.23%) ZmGRAS genes with non-intron and only 17 (19.77%) genes with 1–5 intron (Fig 2C). There were nine genes with one intron, five genes with two introns, two genes with three introns, and only one gene with five introns. In addition, most GRAS gene members of the same branch generally showed similar exon-intron structures.
To investigate the chromosomal distribution of the GRAS family in maize, the physical location information of the maize GRAS genes on chromosomes according to the phytozome database (http://phytozome.jgi.doe.gov/, v3) was used to draw the map. The 86 GRAS genes demonstrated a nonrandom distribution. More than one third of the GRAS transcription factors were found on two chromosomes: chromosome 4 (n = 17, 19.77%) and chromosome 1 (n = 13, 15.12%), and only 3 (3.49%) on chromosome 8. ZmGRAS genes were not found on the short arms of chromosome 6, 7 and 8, and only one GRAS gene (ZmGRAS5) was found on the short arm of chromosome 3. Eleven ZmGRAS genes were clustered at the end of the short arm of chromosome 4.
Duplication events are of interest in across many taxa, and maize originates from an ancient allotetraploid event and has undergone several rounds of polyploidy [45, 57]. We identified eleven duplicated genes with highly amino acid sequence and structure similarities, and all of them contain only one exon. These duplicated genes belonged to six groups with six, six, four, two, two, two genes in SHR, PAT1, LISCL, DELLA, SCL3, LS subfamily, respectively, each of the duplicated genes contained two genes with very close genetic relationship (Fig 2; Fig 3). Five of these duplicated genes were distributed on chromosome 1 and none of them on chromosome 4, and 6. In addition, two pairs duplicated genes on chromosome 2 and 7 (ZmGRAS10/ZmGRAS61, ZmGRAS32/ZmGRAS46) belonging to the same SHR subfamily. These results suggested that segmental duplication has played a role in subfamily’s origin [40].
The chromosome number is indicated at the top of each chromosome. Eleven pairs of paralogues are indicated by the red line.
Expression of ZmGRAS genes in different tissues and developmental stages
GRAS transcription factors have important roles in plant growth and development, such as cell maintenance and proliferation, axillary shoot meristem formation, root radial pattering, and male gametogenesis. Genes expressed high in particular tissues may play essential roles in the development of the tissues. The expression of GRAS genes in different tissues and different developmental stages were analyzed using published microarray data (Maize eFP Browser, http://bar.utoronto.ca/efp_maize/cgi-bin/efpWeb.cgi). The expression data included 13 different tissues, including germinating seed 24 H (seedling), coleoptile, radicle, stem and SAM (V1), first internode, immature tassel, meiotic tassel, anthers, primary root, pooled leaves, silks, base of stage 2 leaf (adult leaf), and embryo 24 DAP. The expression patterns of 75 genes were found in this database (Fig 4).
In the heat map, columns represent genes, while rows represent different tissues, including germinating seed 24 H (seedling), coleoptile, radicle, stem and SAM (V1), first internode, immature tassel, meiotic tassel, anthers, primary root, pooled leaves, silks, base of stage 2 leaf (adult leaf), and embryo 24 DAP. The color changes from green to red represent the relative low or high expression in leaves respectively. The red lines represent eleven duplicated genes pairs.
In Fig 4, partial genes in a branch with similar expression patterns in different tissues were laid in the same branch in Fig 2, for example, the nine pair duplicated genes (81.82%, ZmGRAS10/61, ZmGRAS12/54, ZmGRAS13/84, ZmGRAS18/34, ZmGRAS27/47, ZmGRAS36/50, ZmGRAS41/71, ZmGRAS46/32, ZmGRAS58/64) in which genes within the same branch had similar expression patterns (Fig 2; Fig 4), but the duplicated gene pair ZmGRAS40/62 belonging to the PAT1 subfamily had different expression pattern (Fig 4). We analyzed the regulator elements of the 2000bp promoter region of the gene ZmGRAS40 and ZmGRAS62 using plantcare (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/) and New PLACE (https://sogo.dna.affrc.go.jp/cgi-bin/sogo.cgi?lang=en&pj=640&ac-tion=page&page=newplace) online websites and found that the regulator elements in front of the two promoters were not the same (S5 Table). For example, in the 2000bp promoter region of gene ZmGRAS62 there were four “MYBPLANT motif” which couldn’t be found in the 2000bp promoter of gene ZmGRAS40, the “MYBPLANT motif” was distributed in four positions upstream of the start codon, respectively, “-387bp”, “-495bp”, “-1489bp”, “-1662bp”which was plant MYB binding site according to the New PLACE website. The AmMYB308 and AmMYB330 transcription factors from Antirrhinum regulated phenylpropanoid and lignin biosynthesis in tobacco [58]. The Lignin was one of the components of the cell wall and filled with the cellulose framework to enhance the mechanical strength of the plant, which is conducive to the organization of water transport, and affect the growth and development of organs including leaf and stem. Additionally, in the promoter of the gene ZmGRAS62, there existed GA-responsive elements (GARE2OSREP1 and GAREAT) which didn’t existed in the promoter of the gene ZmGRAS40 according to the New PLACE website [59, 60]. Gibberellins (GAs) were phytohormones that regulate various aspects of plant development, including germination dormancy, leaf morphogenesis and shoot and root growth, etc [61]. We speculated that different expression level between the gene ZmGRAS62 and ZmGRAS40 at the stage of first internode (V5), base of stage 2 leaf (V5) and pooled leave possibly owning to the above reason. But further experiments were needed to verify this hypothesis. Additionally, partial genes with similar expression patterns belonged to different branch. For example, ZmGRAS60 and ZmGRAS34 had similar expression patterns, but they were divided into SHR subfamily and LISCL subfamily, respectively.
Real-time quantitative RT-PCR (qPCR) was used to validate the microarray data. Eleven genes were selected to confirm expression patterns in primary root, pooled leaf, coleoptile, SAM, adult leaf, silk, seedling, meiotic tassel and immature tassel. As shown in Fig 5, 6 genes (except ZmGRAS69, ZmGRAS40, ZmGRAS19, ZmGRAS12 and ZmGRAS62) had similar expression patterns between the qPCR data and microarray data. For example, ZmGRAS67 had higher expression in seedling, and ZmGRAS10, ZmGRAS61 were mainly expressed in SAM. However, ZmGRAS69 had high expression in seedlings in the qPCR, in contrast to the pooled leaf microarray data. These conflicting results may result from the different plant materials, different growth conditions, and different experimental conditions. These results suggest that some GRAS genes with different expression levels in different organs might play key roles in plant development. Several GRAS genes may also have unique functions during specific developmental stages.
cDNAs from nine different tissues of eight developmental stages were used to detect the expression of eleven GRAS genes in which ZmGRAS40, ZmGRAS54 and ZmGRAS62 have multiple splice variants. The quantitative primers for ZmGRAS40, ZmGRAS62 were used to test all of the variants, and the quantitative primers for the gene ZmGRAS54 were used to test the main two transcripts (GRMZM2G144744-T01 and GRMZM2G144744-T02). The relative expression levels which were normalized to ACTIN were determined by the comparative CT method (2−ΔΔCT ) [62] Three biological replicates were conducted for each experiment.
Conclusion
86 putative GRAS family genes were identified from maize via sequence comparison between maize, Arabidopsis, rice, Medicago truncatula and Sorghum bicolor. Only a few genes from this transcription factor family have been previously characterized in detail in maize. Thus, this work is the first comprehensive and systematic analysis of GRAS transcription factors in maize. The number of the maize GRAS family genes is larger than other plants, suggesting that they might encounter gene segmental-duplication and tandem-duplication during the evolution. Due to the fact that all of the maize GRAS genes expressed differentially, the genes possibly encountered sub-functionalization or neo-functionalization. So, it’s reasonable to explore each gene for its specific role in maize growth and development, to support the current work on maize molecular breeding.
Materials and methods
Database search for GRAS genes in maize
The whole protein sequences of Zea mays were downloaded from the MaizeGDB (http://www.maizegdb.org/, v3). The protein sequences for 34 Arabidopsis GRAS genes were downloaded from plant TFDB (http://planttfdb.cbi.pku.edu.cn/). HMMER 3.0 software was obtained from the HMMER website (http://hmmer.janelia.org/) and was employed in searching for GRAS proteins in the entire protein dataset of Zea mays with a cut-off E-value of 1e-5 using PF03514.11 which was the newest HMM model for the GRAS transcription factor family downloaded from the Pfam database (http://pfam.xfam.org/) [63, 64] as a query. Genes were classified according to the distance homology with Arabidopsis and rice genes [7].
Phylogenetic analysis of maize GRAS proteins
All GRAS protein sequences of Arabidopsis, rice, Medicago truncatula, and Sorghum bicolor were downloaded from plant TFDB (http://planttfdb.cbi.pku.edu.cn/). Then, together with the 86 maize GRAS proteins, the multiple sequence alignment was performed using Clustal X, (Version 2.0) [54]. The aligned sequences were then subjected to phylogenetic analysis by Neighbor Joining (NJ) method using PHYLIP (Version 3.695) with 1000 bootstrap replicates [52, 53].
Chromosomal location of maize GRAS genes
All GRAS genes were mapped to maize chromosomes based on information available at the Phytozome website (http://phytozome.jgi.doe.gov/, v3). The map was drafted using photoshop CS3 based on chromosome size.
Analysis and distribution of conserved motifs and exon-intron structures
The exon-intron organization of GRAS genes was determined by the online GSDS 2.0 tools (Gene Structure Display Server) [65] based on the CDS sequence and corresponding genomic sequences which were obtained from the website (https://phytozome.jgi.doe.gov/pz/portal.html). Multiple EM for Motif Elicitation (MEME, http://meme-suite.org/) [66] was used to search for possible conserved motifs in the complete amino acid sequences of maize GRAS proteins using the default settings.
Gene duplication analysis of maize GRAS genes
Gene duplication events of GRAS genes in maize B73 were investigated. We defined the gene duplication using the following criteria: 1) the alignment of whole protein length covered >80% of the longest gene, 2) the aligned region had an identity >80% and 3) only one duplication event was counted for tightly linked genes. The duplicated gene pairs were connected by red lines using photoshop CS3.
Expression patterns of GRAS family in maize
The data of expression patterns of GRAS family genes in maize was found in Maize eFP Browser (http://bar.utoronto.ca/efp_maize/cgi-bin/efpWeb.cgi). The expression patterns of different genes were searched by primary gene ID. The expression level of different tissues was put in a table, then analyzed using Cluster (v3.0) [67] and Java Treeview (v1.1.6).
Plant materials and growth conditions
Maize seeds (Zea mays L. cvB73) were grown in soil under greenhouse conditions at 25°C/22°C (day/night) with a photoperiod of 16/8 h (day/night) for 2 weeks.
RNA isolation and real-time quantitative RT-PCR expression analysis
Primary root and pooled leaf were sampled when the first leaf is fully extended (V1), coleoptile was sampled 6 days after sowing (6 DAS), SAM was sampled when the three leaves were fully extended(V3), adult leaf was sampled when the seven leaves were extended(V5), silk was sampled when the silks emerge from the husk(R1), seedling was sampled from 24 h after imbibition, meiotic tassel was sampled when the eighteen leave were extended(V18), immature tassel was sampled when the thirteen leave were extended(V13) [68, 69], these nine different tissue materials were collected and stored for RNA isolation. Total RNA was extracted using Trizol (Invitrogen, Carlsbad, CA, USA). All the primers for qPCR were designed using QuantPrimer (http://quantprime.mpimp-golm.mpg.de/) (S6 Table). We selected the best pair of primers for qRT-PCR, the specificity of primers was tested by melting curve. A single peak indicates that the amplification product is specific and the corresponding PCR results were used for data analysis. Reverse transcription was performed with 5 μg total RNA as the template by using the TransScript® II One-Step gDNA Removal and cDNA Synthesis SuperMix (TRANSGEN BIOTECH, AH311). Quantitative RT-PCR (qRT-PCR) was carried out on the Bio-RAD CFX96 using the Real SYBR Mixture (CWBIO, CW0760). The results were analyzed with the Bio-RAD CFX Manager software. Three biological replicates were performed.
Supporting information
S1 Table. Details of splicing variants of maize GRAS genes.
https://doi.org/10.1371/journal.pone.0185418.s001
(DOC)
S2 Table. The maize GRAS genes in the PlantTFDB and PlnTFDB websites.
https://doi.org/10.1371/journal.pone.0185418.s002
(DOCX)
S3 Table. Gene ID of GRAS gene family members from four model plants: Arabidopsis thaliana, Medicago truncatula, Oryza sativa and Sorghum bicolor.
https://doi.org/10.1371/journal.pone.0185418.s003
(DOCX)
S4 Table. The structural features of motif 1–20.
https://doi.org/10.1371/journal.pone.0185418.s004
(DOC)
S5 Table. Partial of the different cis-regulate elements in the 2000bp promoter region of the gene ZmGRAS40 and ZmGRAS62.
https://doi.org/10.1371/journal.pone.0185418.s005
(DOCX)
S6 Table. Primers used in the Real-time quantitative RT-PCR.
https://doi.org/10.1371/journal.pone.0185418.s006
(DOCX)
S1 Fig. C-terminal conserved domains of maize GRAS genes.
https://doi.org/10.1371/journal.pone.0185418.s007
(TIF)
S2 Fig. Three N-terminal specific motifs of LISCL subfamily.
https://doi.org/10.1371/journal.pone.0185418.s008
(TIF)
Acknowledgments
We would like to thank Tingting Zhang (Shandong Agricultural University, China) for her critical suggestions. We also thank Xiaodong Xue, Qinxia Li and Caihua Qin (Shandong Agricultural University, China) for the data analysis and processing.
References
- 1. Bolle C. The role of GRAS proteins in plant signal transduction and development. Planta. 2004;218: 683–692. pmid:14760535
- 2. Pysh LD, Wysocka-Diller JW, Camilleri C, Bouchez D, Benfey PN. The GRAS gene family in Arabidopsis: sequence characterization and basic expression analysis of the SCARECROWLIKE genes. Plant J. 1999;18: 111–119. pmid:10341448
- 3. Gallagher KL, Benfey PN. Both the conserved GRAS domain and nuclear localization are required for SHORT-ROOT movement. Plant J. 2009;57: 785–797. pmid:19000160
- 4. Guiltinan MJ, Miller L. Molecular characterization of the DNA-binding and dimerization domains of the bZIP transcription factor, EmBP-1. Plant Mol Biol. 1994;26: 1041–1053. pmid:7811964
- 5. Heery DM, Kalkhoven E, Hoare S, Parker MG. A signature motif in transcriptional co-activators mediates binding to nuclear receptors. Nature. 1997;387: 733–736. pmid:9192902
- 6. Itoh H, Ueguchi-Tanaka M, Sato Y, Ashikari M, Matsuoka M. The gibberellin signaling pathway is regulated by the appearance and disappearance of SLENDER RICE1 in nuclei, Plant Cell. 2002;14: 57–70. pmid:11826299
- 7. Tian C, Wan P, Sun S, Li J, Chen M. Genome-wide analysis of the GRAS gene family in rice and Arabidopsis. Plant Mol Biol. 2004;54: 519–532. pmid:15316287
- 8. Sun XL, Jones WT, Harvey D, Edwards PJB, Pascal SM, Kirk C, et al. N-terminal domains of DELLA proteins are intrinsically unstructured in the absence of interaction with GID1/gibberellic acid receptors. J Biol Chem. 2010;285(15): 11557–71. pmid:20103592
- 9. Sun X, Xue B, Jones WT, Rikkerink E, Dunker AK, Uversky VN. A functional required unfoldome from the plant kingdom: intrinsically disordered N-terminal domains of GRAS proteins are involved in molecular recognition during plant development. Plant Mol Biol. 2011;77: 205–23. pmid:21732203
- 10. Sun X, Jones WT and Rikkerink EH. GRAS proteins: the versatile roles of intrinsically disordered proteins in plant signalling. Biochem J. 2012;442(1): 1–12. pmid:22280012
- 11. Liu X, Widmer A. Genome-wide comparative analysis of the GRAS gene family in Populus, Arabidopsis and rice. Plant Mol Biol Report. 2014;32: 1129–1145.
- 12. Song L, Tao L, Cui H, Ling L, Guo C. Genome-wide identification and expression analysis of the GRAS family proteins in Medicago truncatula. Acta Physiol Plant. 2017;39: 93.
- 13. Song XM, Liu TK, Duan WK, Ma QH, Ren J, Wang Z, et al. Genome-wide analysis of the GRAS gene family in Chinese cabbage. Genomics. 2014;103(1): 135–146. pmid:24365788
- 14. Lu JX, Wang T, Xu Z, Sun L, Zhang Q. Genome-wide analysis of the GRAS gene family in Prunus mume. Mol Gent Genomics. 2015;290: 303–17.
- 15. Abarca D, Pizarro A, Hernández I, Sánchez C, Solana SP, Del Amo A, et al. The GRAS gene family in pine: transcript expression patterns associated with the maturation-related decline of competence to form adventitious roots. BMC Plant Biol. 2014;14: 354. pmid:25547982
- 16. Huang W, Xian Z, Kang X, Tang N, Li Z. Genome-wide identification, phylogeny and expression analysis of GRAS gene family in tomato. BMC Plant Biol. 2015;15: 209. pmid:26302743
- 17. Wang Y, Shi S, Zhou Y, Zhou Y, Yang J, Tang X. Genome-wide identification and characterization of GRAS transcription factors in sacred lotus (Nelumbo nucifera). Peer J. 2016;4: e2388. pmid:27635351
- 18. Sun X, Xie Z, Zhang C, Mu Q, Wu W, Wang B, et al. A characterization of grapevine of GRAS domain transcription factor gene family. Funct Integr Genomics. 2016;16: 347–363. pmid:26842940
- 19. Xu W, Chen Z, Ahmed N, Han B, Cui Q, Liu A. Genome-Wide Identification, Evolutionary Analysis, and Stress Responses of the GRAS Gene Family in Castor Beans. Int J Mol Sci. 2016 Jun 24;17(7). pmid:27347937
- 20. Zhang L, Li Q, Chen JF, Chen WS. Computational identification and systematic classification of novel GRAS genes in Isatis indigotica. Chin J Nat Med. 2016;14(3):161–76. pmid:27025363
- 21. Peng J, Carol P, Richards DE, King KE, Cowling RJ, Murphy GP, et al. The Arabidopsis GAI gene defines a signaling pathway that negatively regulates gibberellin responses. Genes Dev. 1997;11: 3194–3205. pmid:9389651
- 22. Silverstone AL, Ciampaglio CN, Sun T. The Arabidopsis RGA gene encodes a transcriptional regulator repressing the gibberellin signal transduction pathway. Plant Cell. 1998;10: 155–169. pmid:9490740
- 23. Bolle C, Koncz C, Chua NH. PAT1, a new member of the GRAS family, is involved in phytochrome A signal transduction. Genes Dev. 2000;14: 1269–1278. pmid:10817761
- 24. Torres-Galea P, Hirtreiter B, Bolle C. Two GRAS Proteins, SCARECROW-LIKE21 and PHYTOCHROME A SIGNAL TRANSDUCTION1, Function Cooperatively in Phytochrome A Signal Transduction1. Plant Physiol. 2013;161: 291–304. pmid:23109688
- 25. Greb T, Clarenz O, Schafer E, Muller D, Herrero R, Schmitz G, et al. Molecular analysis of the LATERAL SUPPRESSOR gene in Arabidopsis reveals a conserved control mechanism for axillary meristem formation. Genes Dev. 2003;17: 1175–1187. pmid:12730136
- 26. Stuurman J, Jäggi F, Kuhlemeier C. Shoot meristem maintenance is controlled by a GRAS-gene mediated signal from differentiating cells. Genes Dev. 2002;16: 2213–2218. pmid:12208843
- 27. Di Laurenzio L, Wysocka-Diller J, Malamy JE, Pysh L, Helariutta Y, Freshour G, et al. The SCARECROW gene regulates an asymmetric cell division that is essential for generating the radial organization of the Arabidopsis root. Cell. 1996;86(3): 423–433. pmid:8756724
- 28. Helariutta Y, Fukaki H, Wysocka-Diller J, Nakajima K, Jung J, Sena G, et al. The SHORT-ROOT gene controls radial patterning of the Arabidopsis root through radial signaling. Cell. 2000;101: 555–567. pmid:10850497
- 29. Torres-Galea P, Huang LF, Chua NH, Bolle C. The GRAS protein SCL13 is a positive regulator of phytochrome-dependent red light signaling, but can also modulate phytochrome A responses.Mol Genet Genomics. 2006;276: 13–30. pmid:16680434
- 30. Heo JO, Chang KS, Kima IA, Lee MH, Lee SA, Song SK, et al. Funneling of gibberellin signaling by the GRAS transcription regulator SCARECROW-LIKE 3 in the Arabidopsis root. PNAS. 2011;108: 2166–2171. pmid:21245304
- 31. Chandler PM, Marion-Poll A, Ellis M, Gubler F. Mutants at the Slender1 locus of barley cv Himalaya. Molecular and physiological characterization. Plant Physiol. 2002;129: 181–190. pmid:12011349
- 32. Peng J, Richards DE, Hartley NM, Murphy GP, Devos KM, Flintham JE, et al. ‘Green revolution’ genes encode mutant gibberellin response modulators. Nature. 1999;400: 256–261. pmid:10421366
- 33. Ikeda A, Sonoda Y, Vernieri P, Perata P, Hirochika H, Yamaguchi J. The slender rice mutant, with constitutively activated gibberellin signal transduction, has enhanced capacity for abscisic acid level. Plant Cell Physiol. 2002;43: 974–979. pmid:12354914
- 34. Silverstone AL, Jung HS, Dill A, Kawaide H, Kamiya Y, Sun TP. Repressing a repressor: gibberellin-induced rapid reduction of the RGA protein in Arabidopsis. Plant Cell. 2001;13: 1555–1566. pmid:11449051
- 35. Dill A, Sun T. Synergistic derepression of gibberellin signalling by removing RGA and GAI function in Arabidopsis thaliana. Genetics. 2001;159: 777–785. pmid:11606552
- 36. Gubler F, Chandler PM, White RG, Llewellyn DJ, Jacobsen JV. Gibberellin signaling in barley aleurone cells: control of SLN1 and GAMYB expression. Plant Physiol. 2002;129: 191–200. pmid:12011350
- 37. Llave C, Kasschau KD, Rector MA, Carrington JC. Endogenous and silencing-associated small RNAs in plants. Plant Cell. 2002; 14: 1605–1619. pmid:12119378
- 38. Rhoades MW, Reinhart BJ, Lim LP, Burge CB, Bartel B, Bartel DP. Prediction of plant microRNA targets. Cell. 2002;110: 513–520. pmid:12202040
- 39. Schulze S, Schafer BN, Parizotto EA, Vionnet O, Theres K. LOST MERISTEMS genes regulate cell differentiation of central zone descendants in Arabidopsis shoot meristem. Plant J. 2010;64: 668–678. pmid:21070418
- 40. Curaba J, Talbot M, Li Z, Helliwell C. Over-expression of microRNA171 affects phase transitions and floral meristem determinancy in barley. BMC Plant Biol. 2013;13: 6. pmid:23294862
- 41. Wang L, Mai YX, Zhang YC, Luo Q, Yang HQ. MicroRNA171c-targeted SCL6-II, SCL6-III, and SCL6-IV genes regulate shoot branching in Arabidopsis. Mol Plant. 2010;3: 794–806. pmid:20720155
- 42. Ma Z, Hu X, Cai W, Huang W, Zhou X, Luo Q, et al. Arabidopsis miR171-targeted scarecrow-like proteins binding to GT cis-elements and mediate gibberellin-regulated chlorophyll biosynthesis under light condition. Plos Genet. 2014;10: e1004519. pmid:25101599
- 43. Li X, Qian Q, Fu Z, Wang Y, Xiong G, Zeng D, et al. Control of tillering in rice. Nature. 2003;422: 618–621. pmid:12687001
- 44. Lee MH, Kim B, Song SK, Heo JO, Yu NI, Lee SA, et al. Large-scale analysis of the GRAS gene family in Arabidopsis thaliana. Plant Mol Biol. 2008;67: 659–670. pmid:18500650
- 45. Schnable PS, Ware D, Fulton RS, Stein JC, Wei F, Pasternak S, et al. The B73 maize genome: complexity diversity dynamics, Science. 2009;326: 1112–1115. pmid:19965430
- 46. Lawit SJ, Wych HM, Xu D, Kundu S, Tomes DT. Maize DELLA Proteins dwarf plant8 and dwarf plant9 as Modulators of Plant Development. Plant Cell Physiol. 2010;51(11): 1854–1868. pmid:20937610
- 47. Thornsberry JM, Goodman MM, Doebley J, Kresovich S, NielsenD, Buckler ES 4th. Dwarf8 polymorphisms associate with variation in flowering time. Nat Genet. 2001;28: 286–289. pmid:11431702
- 48. Andersen JR, Schrag T, Melchinger AE, Zein I, Lübberstedt T. Validation of Dwarf8 polymorphisms associated with flowering time in elite European inbred lines of maize (Zea mays L.) Theor Appl Genet. 2005;111: 206–217. pmid:15933874
- 49. Lim J, Helariutta Y, Specht CD, Jung J, Sims L, Bruce WB, et al. Molecular analysis of the SCARECROW gene in maize reveals a common basis for radial patterning in diverse meristems. Plant Cell. 2000;12: 1307–1318. pmid:10948251
- 50. Slewinski TL, Anderson AA, Zhang C, Turgeon R. Scarecrow plays a role in establishing Kranz anatomy in maize leaves. Plant Cell Physiol. 2012;53(12): 2030–7. pmid:23128603
- 51. Cannon SB, Mitra A, Baumgarten A, Young ND, May G. The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana. BMC Plant Biol. 2004;4: 10. pmid:15171794
- 52. Revell LJ, and Chamberlain SA. Rphylip: an R interface for PHYLIP. Methods Ecol Evol. 2014;5: 976–981.
- 53. Shimada MK, and Nishida T. A modification of the PHYLIP program: A solution for the redundant cluster problem, and an implementation of an automatic bootstrapping on trees inferred from original data. Mol Phylogenet Evol. 2017;109: 409–414. pmid:28232198
- 54. Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, et al. Clustal W and Clustal X Version 2.0. Bioinformatics. 2007;23: 2947–2948. pmid:17846036
- 55. Fuxreiter M, Simon I, Bondos S. Dynamic protein-DNA recognition: beyond what can be seen. Trends Biochem Sci. 2011;36: 415–423. pmid:21620710
- 56. Patthy L. Intron-dependent evolution: preferred types of exons and introns. FEBS Lett. 1987;214: 1–7. pmid:3552723
- 57. Wei F, Coe E, Nelson W, Bharti AK, Engler F, Butler E, et al. Physical and genetic structure of the maize genome reflects its complex evolutionary history. PLoS Genet. 2007;3: e123. pmid:17658954
- 58. Tamagnone L, Merida A, Parr A, Mackay S, Culianez-Macia FA, Roberts K, et al. The AmMYB308 and AmMYB330 transcription factors from antirrhinum regulate phenylpropanoid and lignin biosynthesis in transgenic tobacco. Plant Cell. 1998;10: 135–154. pmid:9490739
- 59. Sutoh K, Yamauchi D. Two cis-acting elements necessary and sufficient for gibberellin-upregulated proteinase expression in rice seeds. Plant J. 2003;34: 636–645.
- 60. Ogawa M, Hanada A, Yamauchi Y, Kuwahara A, Kamiya Y, Yamaguchi S. Gibberellin biosynthesis and response during Arabidopsis seed germination. Plant Cell. 2003;15: 1591–1604. pmid:12837949
- 61. Claeys H, De Bodt S, Inzé D. Gibberellins and DELLAs: central nodes in growth regulatory networks. Trends Plant Sci. 2014;19: 231–239. pmid:24182663
- 62. Livak KJ, and Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods. 2001;25: 402–408. pmid:11846609
- 63. Eddy SR. Accelerated profile HMM searches. PLoS Comput Biol. 2011;7(10): e1002195. pmid:22039361
- 64. Finn RD, Bateman A, Clements J, Coggill P, Eberhardt RY, Eddy SR, et al. Pfam: the protein families database. Nucleic Acids Res. 2014;42: D222–D230. pmid:24288371
- 65. Hu B, Jin J, Guo AY, Zhang H, LuoJ, Gao G. GSDS 2.0: An upgraded gene feature visualization server. Bioinformatics. 2015;31: 1296–1297. pmid:25504850
- 66. Bailey TL, Johnson J, Grant CE, Noble WS.The MEME Suite. Nucleic Acids Res. 2015;43: 39–49.
- 67. De Hoon MJL, Imoto S, Nolan J, and Miyano S. Open source clustering software. Bioinformatics. 2004;20(9): 1453–1454. pmid:14871861
- 68. Sekhon RS, Lin HJ, Childs KL, Hansey CN, Buell CR, Leon ND, et al. Genome-wide atlas of transcription during maize development. Plant J. 2011;66: 553–563. pmid:21299659
- 69. Winter D, Vinegar B, Nahal H, Ammar R, Wilson GV, Provart NJ. An “Electronic Fluorescent Pictograph” browser for exploring and analyzing large-scale biological data sets. PloS one. 2007;2(8): e718. pmid:17684564