Figures
Abstract
Sweet corn has become a popular food worldwide. It possesses six-times more sugar than field corn due to the presence of recessive shrunken2 (sh2) gene. Despite availability of diverse sweet corn germplasm, comprehensive characterization of sh2 has not been undertaken so far. Here, entire Sh2 gene (7320 bp) among five field corn-(Sh2Sh2) and six sweet corn-(sh2sh2) inbreds was sequenced. A total of 686 SNPs and 372 InDels were identified, of which three SNPs differentiated the wild-(Sh2) and mutant-(sh2) allele. Ten InDel markers were developed to assess sh2 gene-based diversity among 23 sweet corn and 25 field corn lines. Twenty-five alleles and 47 haplotypes of sh2 were identified among 48 inbreds. Among markers, MGU-InDel-2, MGU-InDel-3, MGU-InDel-5 and MGU-InDel-8 had PIC>0.5. Major allele frequency varied from 0.458–0.958. The gene sequence of these maize inbreds was compared with 25 orthologues of monocots. Sh2 gene possessed 15–18 exons with 6-225bp among maize, while it was 6–21 exons with 30-441bp among orthologues. While intron length across maize genotypes varied between 67-2069bp, the same among orthologues was 57–2713 bp. Sh2-encoded AGPase domain was more conserved than NTP transferase domain. Nucleotide and protein sequences of sh2 in maize and orthologues revealed that rice orthologue was closer to maize than other monocots. The study also provided details of motifs and domains present in sh2 gene, physicochemical properties and secondary structure of SH2 protein in maize inbreds and orthologues. This study reports detailed characterization and diversity analysis in sh2 gene of maize and related orthologues in various monocots.
Citation: Chhabra R, Muthusamy V, Baveja A, Katral A, Mehta B, Zunjare RU, et al. (2022) Allelic variation in shrunken2 gene affecting kernel sweetness in exotic-and indigenous-maize inbreds. PLoS ONE 17(9): e0274732. https://doi.org/10.1371/journal.pone.0274732
Editor: Amit Kumar Singh, National Bureau of Plant Genetic Resources, INDIA
Received: March 11, 2022; Accepted: September 3, 2022; Published: September 22, 2022
Copyright: © 2022 Chhabra et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting information files.
Funding: RC acknowledges financial support from ‘Women Scientist-A Scheme’ (File No. WOS-A/LS-306/2018), Department of Science and Technology, Govt. of India, and ‘Young Scientists' Scheme’ [Sanction No. YSS/2015/001029], Science and Engineering Research Board (SERB). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Maize serves as a source of food and nutritional security in the developing countries [1]. Of the various special types, sweet corn has gained popularity as a fresh and processed vegetable [2]. It differs from other maize types in terms of kernel sweetness and possesses higher amount of sugar content [3, 4]. Demand for sweet corn has substantially increased in last few years due to urbanization, increased consumption and availability of organized food processing industries [5, 6]. US$ 1541.72 million and US$ 1475 million worth of import and export were recorded in sweet corn during 2020, respectively [7]. The kernel sweetness is genetically controlled, thus identification of mutations in gene(s) involved in the starch biosynthesis pathway provides gateway to enhance sweetness in kernels [8, 9]. Amongst various mutations reported till date, a recessive shrunken2 (sh2) gene located on chromosome 3L at bin location 3.09, has been widely used in the sweet corn breeding programmes [10]. Recessive sh2 mutation was first identified as loss of function gene [11] and it was later artificially selected by the breeders to develop sweet corn germplasm globally [12, 13]. Using this sh2 mutant, large number of sh2-based sweet corn inbreds and hybrids has been developed and commercialized [8, 14]. Sh2 gene encodes the large subunit of ADP-glucose pyrophosphorylase (AGPase) that catalyzes the conversion of glucose-1-phosphate into ADP-glucose in the starch biosynthesis pathway [15]. Recessive sh2-based sweet corn genotypes also known as ‘extra sweet corn’ or ‘super sweet’ accumulate six times higher sugar as compared to ordinary maize at the milky ripe stage [16]. The sh2-based varieties have longer shelf life and are better suited for extended storage. This makes the sh2-based sweet corn more popular over other types especially in the Asian countries [14]. Though the sh2 gene in maize was cloned in 1990s [17], limited information on functional and structural diversity of sh2 gene among diverse maize inbreds is available. This is primarily due to the fact that breeding for sweet corn is limited to a few countries as compared to field corn (used for dried grain purpose) that is grown in more than 180 countries worldwide [18]. In South-East Asian countries, sh2-based sweet corn has become very popular in recent times and several composites and hybrids are now available for commercial cultivation [19, 20]. Diverse sweet corn germplasm has been developed by selective introgression of sh2 allele from donor sources. Two major versions of mutant alleles viz., (i) sh2-Reference (sh2-Ref) [13] and (ii) sh2-i (sh2-intermediate) created by Dr. Gyula Fiscor [21] have been reported in maize. Recessive sh2-Ref allele due to complete loss of function causes extreme crumbling of grains upon maturity [12], while sh2-i allele having partial loss of function causes much lesser shrunken phenotype [22]. Of these, sh2-Ref has been mainly used in the sweet corn cultivar development worldwide [8]. Manicacci et al. [23] analyzed the variation in partial gene sequence (coding regions: 1228 bp) of complete wild type Sh2 allele (7320 bp) among a set of 50 maize and teosinte accessions. However, diversity of entire sequence of sh2-Ref allele (including 5’UTR, coding, non-coding and 3’UTR of >7000 bp) of diverse sweet corn inbreds has not been analyzed and compared with the different wild type (Sh2) alleles present in the traditional inbreds. Though various studies on diversity of sweet corn inbreds using genome-wide SSRs have been conducted [24, 25], no effort has been directed to understand the nucleotide diversity in the entire sh2 gene affecting kernel sweetness in maize. Further, comparative analysis of sh2 gene of maize with its orthologues among related monocots is also not available in the public domain. The present study was therefore targeted to (i) characterize sequence of sh2 gene in diverse mutant and wild-type inbreds, (ii) identify haplotypes of sh2 gene using gene-based InDel markers among diverse inbreds, and (iii) study evolutionary relationship of maize sh2 gene with its orthologues from related monocots.
Materials and methods
Genetic material
Five diverse inbreds of wild- (Sh2-Wild1 to Sh2-Wild5), and six shrunken-type (sh2-mutant1 to sh2-mutant6) were selected to characterize the wild- (Sh2: first alphabet in capital letter) and mutant (sh2: first alphabet in small letter) alleles present among the inbreds (S1 Table). Forty-eight genotypes including 23 sweet corn inbreds with recessive sh2-Ref (hereafter sh2) and 25 diverse wild-type lines (Sh2 allele) were used to study the diversity in the Sh2 gene using gene-based markers. These diverse inbreds have been developed by various breeding centres of India as well as by CIMMYT, Mexico (S2 Table).
Genomic DNA isolation, amplification and sequencing of sh2 gene in maize
Five wild- and six shrunken-type inbreds were used for characterization of sh2 gene. Genomic DNA was isolated from seeds of inbreds employing modified sodium dodecyl sulphate (SDS) extraction protocol [26]. Thirteen overlapping primers were designed with the help of Primer3 Online software covering 7320 bp of Sh2 gene (GenBank accession no. M81603: wild type allele of Sh2; hereafter Sh2-Wild-M81603) which amplify the specific fragments ranging from 500-900bp in selected wild and shrunken-type inbreds. These primers were custom-synthesized from M/S Sequencher Pvt. Ltd. PCR was carried out on Veriti 96 well thermal cycler (M/s. Applied Biosystems) in a 50μl reaction in triplicates that consisted of 100ng template DNA, 1x OnePCR™ Mix (GeneDireX Ready-to-use PCR master mix) and 0.4 μM of each forward and reverse primer. The PCR conditions were as follows: initial denaturation at 95°C for 5 min, 35 cycles consisting of denaturation at 95°C for 45 s, primer annealing at 60°C for 45 s, primer extension at 72°C for 1 min, and final extension at 72°C for 5 min. Ten microlitre of the amplicon was checked on 2.0% Seakem LE agarose gel and remaining was processed for sequencing from M/s. Sequencher Pvt. Ltd.
Sequence alignment of gene in selected sh2- and wild type maize inbreds
The sequencing results were evaluated using BioEdit and MEGA tool using ClustalW alignment to study variation in SNPs or InDels among wild (Sh2) and mutant (sh2) inbreds [27, 28]. The alignment file was then subjected to DnaSP6 software version 6.11.01 to calculate the number of SNPs, InDels, haplotypes, number of polymorphic sites, haplotype gene diversity and nucleotide diversity [29]. Putative SNPs which clearly differentiated the wild (Sh2) and mutant (sh2) allele, were then analyzed using bioinformatics tools viz., SOFTBERRY’s FGENESH and RegRNA2.0 for their functionality [30, 31].
Analysis of gene diversity in sh2- and wild-type maize inbreds
Based on complete sequence of Sh2 gene in 11 selected genotypes (five wild-type and six mutants), identified InDel polymorphisms were considered for further marker development (Fig 1, Table 1). The PCR reactions with InDel markers were carried out on standard PCR conditions and annealing temperature for each primer pair was optimized according to the Tm. The PCR products were run on 4% metaphor agarose and 8% polyacrylamide gel electrophoresis (PAGE) depending upon the size of the InDel. The gel profile was analysed in DARwin v6.0 to calculate dissimilarity matrix using Jaccard’s coefficient, and principal coordinate analysis (PCoA) was performed to estimate diversity of sh2 gene [32]. Dendrogram was constructed from Newick tree employing iTOL (Interactive tree of life) online software [33]. Total number of alleles, major allele frequency, gene diversity, polymorphism information content (PIC) and heterozygosity were also calculated using PowerMarker v3.25 [34].
Phylogenetic analysis of sh2 orthologues in selected monocots
Orthologues of Sh2 were searched and the sequences were retrieved in related monocots viz., Oryza sativa var. japonica, O. sativa var. indica, Brachypodium distachyon, Setaria italica, Sorghum bicolor, Triticum aestivum, Hordeum vulgare and Aegilops tauschii across Ensembl Plants’ database using the protein BLAST (BLASTp) tool with an expectation value (e-value) ≤1e-5 [35]. A total of 37 sequences, viz., 11 generated in the present study for Z. mays, one Sh2-Wild-M81603 sequence of Z. mays available in the public domain and 25 orthologous sequences, were used for phylogenetic study (S3 Table). Phylogenetic analysis was performed on the basis of nucleotide and protein sequences using MEGA7 software.
Gene structure and promoter prediction of sh2 gene
To predict 5’UTR, transcription start site (TSS), intron-exon boundaries and polyA tail, maize sh2 gene of the selected inbreds and othologues was then submitted to Softberry’s gene annotation tool (FGENESH). It was based on Hidden Markov Model (HMM) and promoter prediction was also performed with Neural network tool on BDGP (Berkeley Drosophila Genome Project) with the minimum promoter score of 0.8 [36].
Homology modelling of SH2 protein
To build homology models of protein at different complex levels, a web-based integrated service, SWISS-MODEL was employed [37]. Various parameters, viz., GMQE (Global Model Quality Estimation), QSQE (Quaternary Structure Quality Estimate), Oligomeric state and QMEAN were studied to build a putative model for the protein under investigation. Protein data bank (PDB) file of the selected model was then submitted to RAMPAGE server for Ramachandran Plot assessment [38]. It shows statistical distribution of backbone dihedral angles ϕ and ψ in different combinations [39].
Domains analysis and physicochemical properties of SH2 protein
Protein domains and features in SH2 protein (large sub-unit of AGPase) were identified from Ensembl Plants’ database. These domains were annotated by seven major protein signature databases [PANTHER, CDD (Conserved domain database), Gene3D, PROSITE, Superfamily, Pfam and TIGRFAM through InterPro]. Further, in selected inbreds and orthologues, protein domain search was carried out with MOTIF Search tool. The different physicochemical properties of SH2, viz., molecular weight, aliphatic index (AI), isoelectric point, Grand average of hydropathy (GRAVY) and instability index were estimated for primary structure of all the retrieved and predicted proteins using ProtParam tool of ExPASy and PEPSTATS tools [40, 41].
Results
Sequence characterization of sh2 gene in selected mutant and wild-type inbreds of maize
Sequence analysis of entire sh2 gene among 11 selected inbreds (five wild type and six mutants) along with the Sh2-Wild-M81603 revealed a total of 686 SNPs and 372 InDels. Parsimony informative sites and singleton variable sites were 253 and 323, respectively. Number of identified haplotypes was 12 with a haplotype gene diversity of 1.000, while standard deviation of haplotype diversity and variance were 0.034 and 0.00116, respectively. Tajima’s D value was -0.94172 (statistically non-significant, P>0.10) with nucleotide diversity (Pi) of -0.94172 and theta (per site) from Eta of 0.02874. Average length of InDel was 6.004 bp with InDel diversity (ki) of 23.879. Forty-six conserved regions were observed with p-value ranging from 0.0000–0.0426. Tajima’s D-values for two populations of mutant and wild were calculated to be -1.30991 and -0.40253, respectively, which were not statistically significant at P>0.10. Among all polymorphic sites, only seven SNPs clearly differentiated the mutant and wild-type allele, viz., two (SNP583 and SNP755) in 5’UTR and five (SNP5112, SNP5228, SNP5379, SNP6226 and SNP6446) in intronic regions, but only three of these (SNP583, SNP755 and SNP5112) were exploited to develop markers [8]. Cluster diagram employing Neighbour-Joining method with a total branch length of 0.101 revealed diverse relationships among mutants (sh2) and wild-type (Sh2) inbreds (Fig 2a). Among the inbreds, (i) sh2-mutant2 and sh2-mutant3, and (ii) sh2-mutant1 and sh2-mutant4 were clustered together, while among the wild type inbreds, (i) Sh2-Wild2 and Sh2-Wild4, and (ii) Sh2-Wild-M81603 and Sh2-Wild5 were together. The significance of identified polymorphisms in coding sequences viz., non-synonymous (Ka)/synonymous (Ks) nucleotide diversity ratio was estimated for sh2 gene in selected maize genotypes. Among the selected 11 maize inbreds, sh2-mutant1 and sh2-mutant4 had the minimum Ka/Ks ratio, which indicated very less non-synonymous changes, therefore having less deviation from evolutionary point of view. Sh2-Wild3 had the highest Ka/Ks ratio followed by Sh2-Wild4, hence they were more significantly diverse suggesting that particular allele is under selective pressure (S4 Table).
Diversity and haplotype analysis of sh2 gene among diverse maize inbreds using InDel markers
Sh2-sequence comparison among the wild and mutant inbreds revealed presence of 372 InDels. Ten InDels of >2bp were exploited for gene-based marker diversity using 48 diverse genotypes (Table 1). All the InDel markers (MGU-InDel-1 to MGU-InDel-10) were polymorphic among the diverse genotypes. A total of 25 alleles were generated with allelic range of 1 to 3 per locus (Table 2). Major allele frequency ranged from 0.458 (MGU-InDel-5) to 0.958 (MGU-InDel-9) with mean of 0.6227. Mean PIC and gene diversity were 0.38 and 0.46, respectively. No heterozygosity was found in the panel of inbreds using these gene-based markers (Table 2). Genetic dissimilarity ranged from 0 to 0.94 with an average of 0.54. Genetic dissimilarity grouped the diverse genotypes into six major clusters. Cluster-V comprised of the highest number of inbreds (13 inbreds) followed by cluster-IV with 12 inbreds, while cluster-VI comprised of only two genotypes (HKI-1128 and HKI-1105) (Fig 3). Further, marker scores were exploited for haplotype study among 48 inbreds. In total, 47 haplotypes were observed with VQL-1 and UMI-1230 possessing the same haplotype. All the 23 sweet corn inbreds having recessive sh2 allele had different haplotypes as depicted by InDel markers. A pictorial representation of haplotypes in respective genotypes is presented in Fig 4.
Phylogenetic analysis of sh2 sequences of maize and its orthologues in monocots
The phylogenetic analysis was performed to understand the evolutionary relationship of sh2 gene among a total of 37 accessions of Z. mays, S. bicolour, T. aestivum, A. tauschii, O. sativa (indica and japonica), H. vulgare, S. italica and B. distachyon. Eleven sequences of maize generated under the study, and Sh2-Wild-M81603 were used for the phylogenetic analysis. Clustering method based on nucleotide separated 37 genotypes (12 maize and 15 orthologues) into four major clusters with the branch length of 3.810 (Fig 5). The percentage of replicate trees ranged from 27–100% in which the associated taxa grouped together with bootstrap test of 10000 replicates. Cluster-A and cluster-B had 23 genotypes and one genotype, respectively. The cluster-A was further divided into two sub-clusters, viz., -A1 with 12 and -A2 with 11 genotypes. Interestingly, all maize sequences were grouped into single cluster-A1. Phylogenetic analysis revealed that the largest cluster-A consisted of 12 members of maize, four members of rice, two members of wheat species, and one each of A. tauschii, B. distachyon, H. vulgare and S. italica. While, cluster-C comprised of two members of rice, one member each of A. tauschii, B. distachyon, H. vulgare, S. bicolour and S. italica. Cluster-D included seven members comprising two of rice, and one each of A. tauschii, B. distachyon, H. vulgare, S. bicolor and S. italica. No species-specific cluster could be identified in cluster-B, -C and -D indicating sequence diversity among different accessions of crops for sh2 locus.
Gene structure and promoter prediction for sh2 in maize and its orthologues
Gene structure prediction revealed 15–18 exons among maize sequences, while 6–21 exons were predicted among the orthologues. It also predicted transcription start site (TSS), positions of first, internal and last coding sequences (CDSf, CDSi and CDSl) and PolyA signal sequence. Twenty four of the predicted genes were found to be complete with respect to protein coding sequence as all the genes had CDSf, CDSi and CDSl. The exons and introns boundaries of sh2 gene in maize genotypes and among the orthologues have been annotated in Fig 5. Length of the introns across the maize inbreds (sequenced in the present study) ranged from a minimum of 67 bp in sh2-mutant6 to maximum of 2069 bp in Sh2-Wild2. The gene architecture of selected orthologues when compared with 12 maize sequences demonstrated that the minimum (57) and maximum (2713) number of bases in introns were in A. tauschii (NW_017912774.1). The comparative lengths of exons across the sh2 sequences of maize varied from 6bp (Sh2-Wild2) to 225bp observed in all 11 maize sh2 sequences generated under the study. Sh2-Wild-M81603 possessed 16 exons ranging from 6 to 250bp in length. In case of orthologues, the number of bases in exon varied from 30bp in A. tauschii (NW_017912774.1) to 441bp in S. italica (NC_028458.1) (Fig 5). All genes were predicted with a strong promoter having score between 0.9–1.0.
Characterization of SH2 protein in selected mutant and wild-type inbreds of maize
The comparison of protein sequences of wild and mutant inbreds with reference sequence, SH2-AAB52952.1 (GenBank accession: Sh2-Wild-M81603) showed 159 variable regions (amino acid substitutions) and 43 InDels. There were 17 InDels among the mutant sequences and 26 among the wild sequences, when compared with SH2-AAB52952.1 protein sequence (Table 3). The evolutionary history among predicted SH2 protein in mutant- and wild- type maize genotypes was generated with branch length of 0.13439593 (Fig 2b). Phylogenetic analysis revealed two major clusters viz., -A and–B, each having nine and three inbreds, respectively. Cluster-B possessed all mutant version of sh2 gene (SH2-mutant3, SH2-mutant5 and SH2-mutant6), while cluster-A had mix of both mutant- and wild- type SH2 proteins.
Phylogenetic analysis of SH2 protein sequences of maize and its orthologues
Clustering method based on protein sequences separated 37 genotypes (12 maize and 15 orthologues) into three key clusters viz., cluster -A, -B and -C with twenty three, seven and two genotypes, respectively. The cluster-A can be further separated into two sub-clusters -A1 and -A2 with 14 and nine members, respectively (S1 Fig). Maize SH2 protein was grouped into a single sub-cluster-A1. The clustering pattern also revealed that maize SH2 protein is closer to rice SH2 orthologues compared to other monocots. Cluster-A3 comprised of two accessions of T. aestivum and one each of O. sativa var. japonica, O. sativa var. indica, S. bicolor, H. vulgare, A. tauschii, S. italica and B. distachyon, while cluster-B had seven accessions comprising two accessions of O. sativa var. japonica and one each of S. bicolor, A. tauschii, S. italica, O. sativa var. indica and T. aestivum. Cluster-C also had seven accessions, one each of O. sativa var. japonica, O. sativa var. indica, S. bicolor, H. vulgare, A. tauschii, S. italica and B. distachyon.
Homology modelling of SH2 protein
The protein sequence of large subunit of AGPase (SH2) was searched for similar templates. A total of 726 templates were found identical to target sequence having identity between 18.46 and 49.76 (Fig 6a). Although the similarity was not very high, top four templates were selected for modelling having GMQE score between 0.64–0.66, which depicted the expected quality of the model and QSQE score of 0.49–0.52. All those templates were found in homotetramer state and had 2ADP and 1ADQ as ligands. The 1yp4.1.A template (Glucose-1-phosphate adenylyl transferase small subunit) found by HHBlits with an identity score of 49.76% was chosen as the best template for homology modelling for large subunit of AGPase. The Z-score is a factor of model quality which calculates the total energy of the protein structure. The curves obtained in protein-protein interaction fingerprint (PPI fingerprint) provided information of template interface, and its value less than 0 represented that the residues on interface were less vulnerable to mutate than those on surface, which was in agreement with our model. The QMEAN Z-score denoting ‘degree of nativeness’ was -1.17 (Fig 6a–6d).
The assessment of Ramachandran plot of large subunit of AGPase for stereochemical and kinetic properties showed 95% (490) of amino acids were in most favoured zone, 4% (20) in allowed region and 0.6% (3) in outlier region. MolProbity score was 1.4 with a clash score of 1.04. Although AGPase has a well-documented role in starch metabolism, where ATP and glucose-1-phosphate are major substrates, no other substrate/ligand was predicted for developed model. However, online server Raptor-X-binding site prediction determined Mg+2 as ligand with pocket multiplicity value of 97 with p-value 1.75e-13 on amino acid residues viz., L89, G90, G91, G92, T93, G94, S95, Q96, T106, P107, A169, F180, Q181, G182, T183, S186, S210, G211, D212 and S296, depicting that the predicted pocket is true (Fig 6e).
Domains, motifs and features of SH2 protein
The SH2-AAB52952.1 protein was annotated with 14 domains with three major domains including (i) AGPase domains (1, 2 and 3); (ii) nucleotide triphosphate transferase (NTP) and (iii) bacterial transferase hexapeptide domains (Table 4, S2a and S2b Fig). In most of the sh2 mutants, AGPase domain was intact as in SH2-AAB52952.1 sequence, but NTP transferase domain was distorted or short in length in SH2-mutant5 and SH2-mutant6 (S5 Table). Among the orthologues, all the accessions possessed NTP transferase domain except seven accessions, viz., Q10Q61 (O. sativa var. japonica), I1H8B7 (B. distachyon), A0A287P195 (H. vulgare), C5WTQ1 (S. bicolor), B8AQHO (O. sativa var. indica), K4AAH8 (S. italica) and M8BCF1 (A. tauschii) which had only AGPase domain. Besides, these three domains, some of the orthologues also possessed beta-helix, fucokinase, CTP transferase, DUF4889, NHR2 and DUF4954 domains.
Physicochemical properties of SH2 protein in diverse maize inbreds and its orthologues
The coding sequence of SH2 protein in maize was 1908 bp long with a translated protein of 516 amino acids. Protparam and PEPSTATS analyses of the SH2-AAB52952.1 protein revealed that it consisted of 60 negatively charged and 55 positively charged amino acids. Aliphatic index and GRAVY were found to be 85.06 and -0.207, respectively, which indicated the non-polar nature of SH2 protein (Table 5). The instability index was computed to be 38.88, which classified the protein as stable. The predicted secondary structure had 138 α-helices, 26 β-turns, 238 random coils and 114 extended strands (S3a Fig). The SH2 protein was predicted with four unfolded regions with a total of 53 amino acids out of 516 amino acids, and the longest unfolded region had 26 residues (S3b Fig).
When SH2 protein in maize genotypes was compared with each other, and it was found that leucine is predominant followed by either serine or glycine in all wild genotypes with a range of 41–56% leucine, but in case of SH2-mutants, glycine was the predominant amino acid except SH2-mutant2 (leucine was maximum with 46%) and SH2-mutant4 (serine was predominant with 47%). In SH2-AAB52952.1, serine and leucine were equally predominant followed by glycine and alanine. Among the selected genotypes of maize, SH2-mutant1 had instability index of 42.10 depicting its unstable nature. The aliphatic index among the 12 genotypes of maize ranged from 82.39–87.73, which was comparable to SH2-AAB52952.1 protein, depicting its thermostable soluble nature, while negative GRAVY value revealed the non-polar nature of the peptide.
The sequence of SH2 protein in selected maize genotypes, when compared to orthologues of selected monocots, revealed that out of selected five accessions of O. sativa var. japonica, three (Q688T8, Q7G065 and Q6AVT2) were stable and two (Q0D713 and Q10Q61) were unstable proteins. Proteins of B. distachyon and H. vulgare were stable, whereas out of three orthologues of S. bicolor, two (C5WTQ1 and A0A1Z5R3X9) were unstable with instability index of 40.79 and 45.89, respectively. In case of O. sativa var. indica, out of four selected orthologues, only one (K4AAH8) was unstable with instability index of 40.18, whereas out of three orthologues of T. aestivum, only (A5GZ74) one was highly stable with the value of 33.32. Among the three orthologues of A. tauschii, one accession (M8AY46) was highly stable, one M8BCF1 (39.60) was moderately stable, while A0A0U4H00 (42.86) was unstable. Aliphatic index of selected protein sequences ranged from 78.57 (A0A1Z5R3X9) in S. bicolor, which suggested its lower solubility and it went upto 101.24 (B8AQH0) in O. sativa var. indica, showing increased solubility of this protein, as compared to other orthologues (Table 3). The SOPMA comparison of selected maize sequences and sh2 orthologues showed that the percentage of α-helices in those proteins ranged from 21.96–33.26%; with minimum in SH2-mutant1 and maximum in SH2-Wild2 sequence of maize, whereas that of extended strands varied from 19.5–27.07%. The percentage of β-turns ranged from 4.79–8.45% and that of random coils varied from 33.7–50.39%.
Discussion
Sweet corn generates livelihood to millions of farmers due to its diverse usage worldwide [8, 10, 20]. In maize, several recessive genes like sh2, sugary1 (su1) and sugary enhancer1 (se1) enhance kernel sweetness, and often used alone or in combination for the development of sweet corn hybrids [42]. Of these, sh2-based sweet corn is more popular in Asian countries as it possesses three-time more sweetness besides having better shelf-life after harvest over su1 types [16, 20]. The large subunit of AGPase is encoded by Sh2 gene [43]. Mutation in large subunit caused noteworthy decrease in endosperm’s AGPase activity and starch content. Of these, sh2 mutant has been abundantly used in sweet corn cultivar development across countries [44]. Sh2 gene has been reported to have low nucleotide diversity and highly conserved protein sequence among crops [23]. In most of the plants, AGPase is a tetrameric protein complex with two large and two small subunits [45]. However, various mutations have been reported in large subunit of AGPase encoded by Sh2. In maize, SNPs, InDels and chromosomal rearrangements in sh2 gene have been reported [12, 46–48]. Tuncel et al. [49] reported that mutation in OsAGPL2 gene in rice had severely shrivelled seeds. AGPase activity in endosperm of 16 barley mutants had 15–25% of that observed in wild-type [50]. Thus, loss or impaired functions in various mutants of sh2 cause elevated kernel sweetness in the endosperm [10]. In the present investigation, comprehensive characterization of sh2 gene in a diverse set of mutant- and wild- type inbreds of maize has been undertaken and compared with the orthologous sequences of the related monocot species.
Diversity analysis in sh2 gene among exotic and indigenous maize inbreds
Gene-based InDel markers showed a low degree of polymorphism among the selected genotypes with an allele frequency of an average of 2.5 alleles per locus. However, the allele frequency is much lower than that observed in a microsatellite-based diversity analysis performed among sweet corn genotypes [24, 25, 51] and Zheng et al. [52] reported 3.26–5.20 alleles/locus. Babu et al. [53] reported higher number of alleles/locus using SSRs, as compared to that employing lysine- and tryptophan-biosynthesis pathway-specific candidate gene-based SSR in a set of quality protein maize (QPM) inbreds. These SSRs are present mostly in non-genic region, and high level of allelic polymorphism is due to reasons like recombination errors, unequal crossing over and replication slippage at the SSR locus [54]. On the other hand, high degree of mutations within a gene is not tolerated as functional errors that may lead to loss of the genotype from the populations. Average PIC among the 48 genotypes was also low compared to the earlier studies dealt with genome-wide SSRs [24, 25, 50, 52]. This is in accordance with the study carried out by Manicacci et al. [23] where low degree of nucleotide polymorphisms was observed in coding region of sh2 gene across 50 accessions of maize and teosinte. Further, absence of heterozygosity in sh2 gene depicted the complete homozygous nature of the inbreds. This was possibly due to stringent maintenance of the inbreds due to their strict self-pollination for many generations. However, some of the genome-wide SSR loci distributed throughout the genome earlier revealed some degree of heterozygosity among the sweet corn inbreds [25].
With the advancement of sequencing techniques, nowadays it is possible to generate haplotype information for identifying unknown germplasm [55]. Haplotypic structural variability strongly affects the frequency and distribution of recombination events in maize. Beyond allele frequencies, haplotype data collected in population samples contain information about the history of allelic associations in gene genealogies, and this is of tremendous potential for conservation genomics. In a study carried out by Yao and Schnable [56], cis-effects were identified that examined up to 3-fold differences in recombination rates across the a1-sh2 interval among the different haplotypes in maize and teosinte due to several large InDel polymorphisms in teosinte relative to maize. This permits the dissection of variation in introgression rates across the genome, thus unfolding the evolutionary processes [55]. Retrieving information about haplotypes within populations considerably improves the estimation of various factors relevant to evolutionary conservation. In the present study, InDel-based markers were used to identify 47 haplotypes of sh2 gene in a diverse set of maize inbreds of exotic and indigenous origin. Two inbreds viz., VQL-1 and UMI-1230 showed similar haplotype thereby suggesting the origin of Sh2 gene from a common ancestry. InDel markers are co-dominant and involves simple and easy PCR assay, and requires standard gel electrophoresis for separation of amplicons. Further, InDel-based assay does not involve high cost as required in sequence-based approach, as shown in similar studies carried out on su1 and fatb genes for allelic diversity and comprehensive molecular characterization [2, 57]. Our study showed that Sh2-specific InDel markers developed in the present study would be useful in identifying haplotypes of unknown lines to understand their lineage.
Molecular characterization of sh2 gene and large subunit of AGPase
Upon characterization of sh2 gene in diverse mutant- and wild-type inbreds, it was observed that each genotype possessed different allele, thereby showing highly diverse nature of selected eleven lines. Most of the identified SNPs and InDels were present in either 5’UTR or intronic region. Sh2 gene in both mutant-and wild- type maize inbreds had longer introns. However, the dissimilarity of all eleven sequences with Sh2-M81603 (GenBank Accession) was found to be very low (1.7 to 5.1%). Manicacci et al [23] studied molecular evolution of Sh2 gene in maize by comparing coding region (4669 bp) on different accessions of maize and teosinte. They also found very less nucleotide variations suggesting a purifying selection effect in whole species predating domestication. In the present study as well, majority of SNPs and InDels were in the non-coding regions thereby suggesting the conserved functional nature of the sh2 gene in maize. We, earlier have developed, SNP-based allele-specific markers surrounding these three SNPs for marker-assisted selection of sh2 gene in sweet corn breeding programme [8].
The 3D structure of proteins is important to study protein dynamics, function, as well as protein-ligands interaction. If we look upon the structure of large subunit of AGPase in eukaryotes, the protein is a tetramer comprising two separate subunits having catalytic and modulatory roles. Modulatory subunit is related to changes in enzyme regulation in different plant tissues. Prediction of the AGPase secondary structure suggests similar folding pattern as in other sugar nucleotide pyrophosphorylases, revealing that they evolved from the same antecedent [58]. The atomic resolution structure of AGPase was first proposed by Jin et al. [59] which provided strong understandings for the mechanism of enzyme catalysis and its allosteric regulation. However, the 3D structure of large subunit of AGPase in maize is not available in public domain. Therefore, homology modelling was performed as a practical option to decipher the 3D model for the same. The statistics of residues in favoured and allowed region and a very low percentage in the outlier region suggested that the Ramachandran plot for AGPase is acceptable. Among the orthologues, the average of pI was 7.12 (5.27–9.29) depicting that the enzyme probably precipitates in basic buffers. Though in some of the orthologues, the pI is very much less than 7, viz., ~5.0–5.8, signifying that in those species, protein will apparently be precipitated in acidic buffer. This fact will be helpful in isoelectric focussing during purification of recombinant SH2 proteins by improvement of buffer systems. In longer proteins, presence of more charged amino acids is useful in buffering the effect of variations in their composition and can keep neutral pI close to 7.4. Wide variation in the pI of the SH2 protein than their size (range: 45653.59–68833.88 g/mol) synchronized with the report that the molecular weight of orthologous proteins was much more conserved than their isoelectric point [60].
Computation of Instability index (Ii) helped in predicting in vivo half-life (T1/2) of SH2 protein in different species. Previously, it was reported that proteins with Ii> 40 have a T1/2 of less than 5h, whereas those with <40 have a longer T1/2 of 16h [61]. In our study, SH2 protein in 22 genotypes out of 37 was stable and in others, it was thermally unstable. GRAVY indices of SH2 sequences ranged from -0.259 to -0.015 reflecting the hydrophobic nature of the amino acids [62]. AGPases of cereal crops are readily denatured at high temperatures. AGPase of maize endosperm loses 74–96% of its activity upon heating at 57–60°C for 5 min [63]. This could be the one of the possible reasons of better heat tolerance ability in maize compared to many other cereals like wheat [64]. Aliphatic index is another important criterion which measures relative volume occupied by the aliphatic side chains of the amino acids such as alanine, leucine, isoleucine and valine, and also serves as a measure of thermostability of proteins [65]. Lu et al. [66] compared 110 pairs of homologous mesophilic and thermophilic proteins for their amino acid composition and found that due to more leucine residues, thermophilic proteins have higher value of GRAVY and aliphatic index. Similar to earlier reports, there was frequent occurrence and conservation of leucine residues in case of SH2 protein in maize as well as in orthologues. Higher aliphatic index implies that an increased thermostability of the proteins may support an increase in their solubility and again emphasizes the conclusion that thermostability and solubility are correlated positively with each other. Among the sh2 mutant and wild genotypes of maize, value of AI showed its thermostability over a wide range of temperatures. On the other hand, when SH2 orthologues were compared with each other, the accession of O. sativa var. indica (B8AQH10), was found to have maximum AI (101.24), followed by the orthologue in S. italica (K4AAH8:100.96) and O. sativa var. japonica (Q10Q61: 100.51), depicting their very high thermostability and hence soluble nature.
Phylogenetic relationship of sh2 gene and large subunit of AGPase among maize and orthologues
DNA and protein sequence variations among different orthologues provide precise information on the divergence of alleles in various crops that can be exploited in phylogenetic and evolutionary studies. Among the eight monocots studied in the present investigation, one of the orthologues in A. tauschii with six exons, had two longer introns in it, while the second orthologue, had 13 exons with intermediate size of introns. However, T. aestivum had the maximum number of exons (21) in one of the orthologues with smaller length of introns [67]. In an investigation on comparative analysis of AGPases with special emphasis on wheat carried out by Batra et al. [68], it was suggested that during evolution, introns of large subunit genes got divided and resulted in smaller introns in monocots. Introns in orthologues of S. italica and S. bicolor were larger in length as intron-12 in case of maize. Long introns are generally favoured as they intensify the natural selection efficiency by releasing Hill-Robertson (HR) effect. This showed that introns might be involved in decreasing intragenic HR effect between locations which are under the impact of natural selection in finite populations. In a gene, a positive correlation was reported between the intronic burden and its evolutionary conservation [69] and the negative correlation was observed for gene expression with the total size of introns and number of introns. Genes possessing higher intronic burden had smaller density of nonsense and missense mutations in coding regions, suggesting that genes are under stronger pressure from purifying selection. Further, the clustering pattern in the present study revealed that maize sh2 gene is closer to rice sh2 orthologues compared to other monocots. This indicated that sh2 gene of rice and wheat diverged much later as compared to earlier divergence of sh2 in wheat, sorghum, barley, foxtail millets, Brachypodium and Aegilops.
The domain study in protein sequences revealed that the AGPase domain is more conserved than the NTP-transferase domain. Less polymorphism viz., only 0–5 amino acid substitutions in the inbreds viz., SH2-mutant2 (5) >SH2-Wild4 (4) >SH2-mutant3 (3) >SH2-Wild2 (2) >SH2-mutant1 (1) >SH2-mutant4 (0), might be due to positive selection pressure. Generally, wild ancestors show more variations than their domesticated descendants. It might be during the course of evolution when some of the cereal crops like wheat and barley stopped responding to the effectors, 3-PGA and Pi, whereas maize and rice retained this mechanism of allosteric regulation. The AGPase in rice endosperm aligns closer to that of maize in terms of allosteric regulation. Since the primary sequences of small subunit of AGPase in different species are more conserved, the differences in allosterically regulated properties are perhaps due to the differences in large subunit. In a study on rice AGPases conducted by Tuncel et al. [49], alignment of the primary sequences of larger subunit showed that it shared 77% identity with that of maize, while it shared lesser identity of 71% with large subunit of wheat and barley. The phylogenetic trees generated from nucleotide and protein sequence data were shown to be consistent in cluster formation, and SH2 protein of the maize indicated the conservation of AGPase domain. The evolutionary analyses of SH2 protein in maize revealed that the division of cluster-A1 corresponded well with domain structures and sequence conservation. The reason behind the SH2 proteins of other monocots being available in different cluster could be due to similar functional relationship.
The present investigation studied allelic variation in sh2 gene among 48 exotic- and indigenous- maize inbreds of sweet- and field- corn types. The study revealed that diverse allelic haplotypes of sh2 gene were present in sweet corn and field corn inbreds. The study also provided details of gene architecture and evolutionary linkages in sh2 gene, and physicochemical properties and secondary structure of large sub-unit of AGPase protein encoded by sh2. This study reports detailed characterization and allelic diversity analysis of sh2 gene in a set of diverse maize inbreds and various orthologues in related monocots.
Supporting information
S1 Table. Pedigree details of Sh2 mutant and Sh2-wild genotypes used in present study.
https://doi.org/10.1371/journal.pone.0274732.s001
(DOCX)
S2 Table. Source institute of 48 maize inbreds.
https://doi.org/10.1371/journal.pone.0274732.s002
(DOCX)
S3 Table. List of selected sh2 inbreds, wild inbreds and orthologues of Sh2 gene in monocots.
https://doi.org/10.1371/journal.pone.0274732.s003
(DOCX)
S4 Table. Evolutionary distance and synonymous and non-synonymous scores in comparison with reference to Sh2-Wild-M81603 allele.
https://doi.org/10.1371/journal.pone.0274732.s004
(DOCX)
S5 Table. Variations identified in AGPase and NTP transferase domain in selected maize genotypes.
https://doi.org/10.1371/journal.pone.0274732.s005
(DOCX)
S1 Fig. Evolutionary relationship of SH2 protein in maize and its orthologues in selected monocots.
https://doi.org/10.1371/journal.pone.0274732.s006
(TIF)
S2 Fig. Domains in SH2 protein; (a) in SH2-AAB52952.1 protein and (b) Comparison of SH2 protein mutants and wild inbreds.
https://doi.org/10.1371/journal.pone.0274732.s007
(TIF)
S3 Fig. (a) Secondary structure prediction and (b) FoldIndex and disorder in SH2 protein.
https://doi.org/10.1371/journal.pone.0274732.s008
(TIF)
Acknowledgments
The field and laboratories facilities provided by ICAR-IARI, New Delhi, and off-season nursery facility by IIMR-WNC, Hyderabad, are duly acknowledged.
References
- 1. Prasanna BM, Palacios-Rojas N, Hossain F, Muthusamy V, Menkir A, Dhliwayo T, et al. Molecular breeding for nutritionally enriched maize: status and prospects. Front Genet. 2020; 10:1392. pmid:32153628
- 2. Chhabra R, Muthusamy V, Gain N, Katral A, Prakash NR, Zunjare RU, et al. Allelic variation in sugary1 gene affecting kernel sweetness among diverse-mutant and-wild-type maize inbreds. Mol Genet Genom. 2021; 23:1–8. pmid:34159441
- 3. Mehta BK, Muthusamy V, Zunjare RU, Baveja A, Chauhan HS, Chhabra R, et al. Biofortification of sweet corn hybrids for provitamin-A, lysine and tryptophan using molecular breeding. J Cereal Sci. 2020; 96:103093.
- 4. Chauhan HS, Chhabra R, Rashmi T, Muthusamy V, Zunjare RU, Mishra SJ, et al. Impact of vte4 and crtRB1 genes on composition of vitamin-E and provitamin-A carotenoids during kernel-stages in sweet corn. J Food Comp Analysis. 2022; 105:104264.
- 5. Mehta BK, Muthusamy V, Baveja A, Chauhan HS, Chhabra R, Bhatt V, et al. 2020. Composition analysis of lysine, tryptophan and provitamin-A during different stages of kernel development in biofortified sweet corn. J Food Comp Anal. 2020; 94:103625.
- 6. Baveja A, Muthusamy V, Panda KK, Zunjare RU, Das AK, Chhabra R, et al. Development of multinutrient-rich biofortified sweet corn hybrids through genomics-assisted selection of shrunken2, opaque2, lcyE and crtRB1 genes. J Appl Genet. 2021; 62:419–429. pmid:33886083
- 7.
FAOSTAT, 2020. https://www.fao.org/faostat/en/#data (data accessed on 22nd August, 2022).
- 8. Chhabra R, Hossain F, Muthusamy V, Baveja A, Mehta BK, Zunjare RU. Development and validation of gene‐based markers for shrunken2‐Reference allele and their utilization in marker‐assisted sweet corn (Zea maysSachharata) breeding programme. Plant Breed. 2019; 139:1135–1144.
- 9. Mehta BK, Chhabra R, Muthusamy V, Zunjare RU, Baveja A, Chauhan HS, et al. Expression analysis of β-carotene hydroxylase1 and opaque2 genes governing accumulation of provitamin-A, lysine and tryptophan during kernel development in biofortified sweet corn. 3 Biotech. 2021; 11:325. pmid:34194909
- 10. Hossain F, Nepolean T, Vishwakarma AK, Panday N, Prasanna BM, Gupta HS. Mapping and validation of microsatellite markers linked to sugary1 and shrunken2 genes in maize. J Plant Biochem Biotechnol. 2015; 24:135–142.
- 11. Mains EB. Heritable characters in maize: Linkage of a factor for shrunken endosperm with the a1 factor for aleurone color. J Heredity. 1949; 40(1):21–4.
- 12. Kramer V, Shaw JR, Senior ML, Hannah LC. The Sh2-R allele of the maize shrunken-2 locus was caused by a complex chromosomal rearrangement. Theor Appl Genet. 2015; 128:445–452. pmid:25504539
- 13. Neuffer MG. An allele of sh2. Maize Genetics Cooperation Newsletter. 1996; 70:324–328.
- 14. Lertrat K, Pulam T. Breeding for increased sweetness in sweet corn. Int J Plant Breed. 2007; 1:27–30.
- 15. Obana Y, Omoto D, Kato C, Matsumoto K, Nagai Y, Kavakli IH, et al. Enhanced turnover of transitory starch by expression of up regulated ADP-glucose pyrophosphorylase in Arabidopsis thaliana. Plant Sci. 2006; 170:1–11.
- 16. Khanduri A, Prasanna BM, Hossain F, Lakhera PC. Genetic analysis and association studies of yield components and kernel sugar concentration in sweet corn. Indian J Genet. 2010; 70:257–263.
- 17. James MG, Robertson DS, Myers AM. Characterization of the maize gene sugary1, a determinant of starch composition in kernels. Plant Cell. 1995; 7:417–429. pmid:7773016
- 18.
FAOSTAT. 2019. http://www.fao.org/faostat/en/#data/QCL/visualize (data accessed on 11th March, 2022).
- 19. Yadav OP, Hossain F, Karjagi CG, Kumar B, Zaidi PH, Jat SL, et al. Genetic Improvement of maize in India: Retrospect and prospects. Agric Res. 2015; 4:325–338.
- 20. Chhabra R, Hossain F, Muthusamy V, Baveja A, Mehta B, Zunjare RU. Mapping and validation of Anthocyanin1 pigmentation gene for its effectiveness in early selection of shrunken2 gene governing kernel sweetness in maize. J Cereal Sci. 2019; 87:258–265.
- 21. Neuffer MG. An allele of sh2. Maize Genetics Cooperation Newsletter. 1996; 70:324–328.
- 22. Dodson-Swenson HG, Tracy WF. Endosperm carbohydrate composition and kernel characteristics of shrunken2-intermediate (sh2-i/sh2-i Su1/Su1) and shrunken2-intermediate–sugary1 reference (sh2-i/sh2-i su1-r/su1-r) in sweet corn. Crop Sci. 2015; 55:2647–2656.
- 23. Manicacci D, Falque M, Le Guillou S, Piégu B, Henry AM, Le Guilloux M, et al. Maize Sh2 gene is constrained by natural selection but escaped domestication. J Evol Biol. 2006; 20:503–516. pmid:17305816
- 24. Hung NT, Huyen TN, Loc NV, Cuong BM. The application of SSR molecular indicator to assess the purity and genetic diversity of waxy corn inbred lines. J ISSASS. 2012; 18:45–54.
- 25. Mehta B, Hossain F, Muthusamy V, Baveja A, Zunjare R, Jha SK, et al. Microsatellite-based genetic diversity analyses of sugary1-, shrunken2- and double mutant- sweet corn inbreds for their utilization in breeding programme. Physiol Mol Biol Plants. 2017; 23:411–420. pmid:28461728
- 26.
Dellaporta SL, Wood J, Hicks JB. Maize DNA miniprep. In: Malberg J, Messing , Sussex I, editors. Plant molecular biology. New York: Cold Spring Harbor; 1985. pp. 363.
- 27.
Hall TA. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. In Nucleic acids symposium series 1999 (Vol. 41, No. 41, pp. 95–98). [London]: Information Retrieval Ltd., c1979-c2000.
- 28. Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016; 33(7):1870–4 pmid:27004904
- 29. Rozas J, Ferrer-Mata A, Sánchez-DelBarrio JC, Guirao-Rico S, Librado P, Ramos-Onsins SE, et al. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol Biol Evol. 2017; 34(12):3299–302 pmid:29029172
- 30. Solovyev V, Kosarev P, Seledsov I, Vorobyev D. Automatic annotation of eukaryotic genes, pseudogenes and promoters. Genome Biol. 2006; 7(1):1–2. pmid:16925832
- 31. Chang TH, Huang HY, Hsu JBK, Weng SL, Horng JT, Huang HD. An enhanced computational platform for investigating the roles of regulatory RNA and for identifying functional RNA motifs. BMC Bioinform. 2013; 14(Suppl 2):S4. pmid:23369107
- 32.
Perrier X, Flori A, Bonnot F. Data analysis methods. In: Hamon P, Seguin M, Perrier X, Glaszmann JC, editors. Genetic diversity of cultivated tropical plants. London: CRC Press; 2003. pp. 43–76.
- 33. Letunic I, Bork P. Interactive Tree Of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. 2019; 47:W256–259. pmid:30931475
- 34. Liu K, Muse SV. PowerMarker: integrated analysis environment for genetic marker data. Bioinformatics. 2005; 21:2128–2129
- 35.
Bolser D, Staines DM, Pritchard E, Kersey P. Ensembl plants: integrating tools for visualizing, mining, and analyzing plant genomics data. In Plant bioinformatics 2016 (pp. 115–140). Humana Press, New York, NY.
- 36. Waibel A, Hanazawa T, Hinton G, Shikano K, Lang KJ. Phoneme recognition using time-delay neural networks. IEEE Transactions on Acoustics, Speech, and Signal Processing. 1989; 37(3):328–339.
- 37. Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 2018; 46(W1):W296–303. pmid:29788355
- 38. Lovell SC, Davis IW, Arendall WB III, De Bakker PI, Word JM, Prisant MG, et al. Structure validation by Cα geometry: ϕ, ψ and Cβ deviation. Proteins: Struct Funct Bioinfo. 2003; 50(3):437–450.
- 39. Ramachandran GN. Molecular structure of collagen. International review of connective tissue research. 1963; 1:127–182. pmid:14110864
- 40. Gasteiger E, Gattiker A, Hoogland C, Ivanyi I, Appel RD, Bairoch A. ExPASy: the proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res. 2003; 31(13):3784–3788. pmid:12824418
- 41. Rice P, Longden I, Bleasby A. EMBOSS: the European molecular biology open software suite. Trends Genet. 2000; 16(6):276–277. pmid:10827456
- 42. Revilla P, Anibas CM, Tracy WF. Sweet corn research around the world 2015–2020. Agronomy. 2021; 11(3):534.
- 43. Preiss J, Danner S, Summers PS, Morell M, Barton CR, Yang L, et al. Molecular characterization of the brittle-2 gene effect on maize endosperm ADP glucose pyrophosphorylase subunits. Plant Physiol. 1990; 92:881–885. pmid:16667400
- 44.
Tracy WF. Sweet corn. In: Hallauer AR, editors. Specialty corns. New York: CRC Press; 2001. pp. 155–196
- 45. Batra R, Nelles DA, Pirie E, Blue SM, Marina RJ, Wang H, et al. Elimination of toxic microsatellite repeat expansion RNA by RNA-targeting Cas9. Cell. 2017; 170: 899–912. pmid:28803727
- 46. Okita TW, Nakata PA, Anderson JM, Sowokinos J, Morell M, Preiss J. The subunit structure of potato tuber ADP-glucose pyrophosphorylase. Plant Physiol. 1990; 93:785–790. pmid:16667537
- 47. Lal SK, Giroux MJ, Brendel V, Vallejos CE, Hannah LC. The maize genome contains a Helitron insertion. Plant Cell. 2003; 15:381–391. pmid:12566579
- 48. Lee JK, Park JY, Kim JH, Kwon SJ, Shin JH, Hong SK, et al. Genetic mapping of the Isaac-CACTA transposon in maize. Theor Appl Genet. 2006; 113:16–22. pmid:16783589
- 49. Tuncel A, Kawaguchi J, Ihara Y, Matsusaka H, Nishi A, Nakamura T, et al. The rice endosperm ADP-glucose pyrophosphorylase large subunit is essential for optimal catalysis and allosteric regulation of the heterotetrameric enzyme. Plant Cell Physiol. 2014; 55:1169–1183. pmid:24747952
- 50. Johnson KL, Jones BJ, Bacic A, Schultz CJ. Thefasciclin-like arabinogalactan proteins of Arabidopsis. A multigene family of putative cell adhesion molecules. Plant Physiol. 2003; 133:1911–1925. pmid:14645732
- 51. Park JS, Park JY, Park KJ, Lee JK. Genetic diversity among waxy corn accessions in Korea revealed by microsatellite markers. Korean J Breed Sci. 2008; 40:250–257.
- 52. Zheng H, Wang H, Yang H, Wu J, Shi B, Cai R, et al. Genetic diversity and molecular evolution of Chinese waxy maize germplasm. PLoS One. 2013; 8:1–11. pmid:23818949
- 53. Babu BK, Pooja P, Bhatt JC, Agrawal PK. Characterization of Indian and exotic quality protein maize (QPM) and normal maize (Zea mays L.) inbreds using simple sequence repeat (SSR) markers. Afr J Biotechnol. 2012; 11:9691–9700.
- 54. Tautz D, Schlotterer C. Simple sequences. Curr Opin Genet Dev. 1994; 4:832–837. pmid:7888752
- 55. Leitwein M, Duranton M, Rougemont Q, Gagnaire PA, Bernatchez L. Using haplotype information for conservation genomics. Trends Ecol Evol. 2020; 35:245–258. pmid:31810774
- 56. Yao H, Schnable PS. Cis-effects on meiotic recombination across distinct a1-sh2 intervals in a common Zea genetic background. Genetics. 2005; 170:1929–1944. pmid:15937141
- 57. Katral A, Muthusamy V, Zunjare RU, Chhabra R, Maman S, Yadava DK, et al. Allelic variation in Zmfatb gene defines variability for fatty acids composition among diverse maize genotypes. Front Nutr. 2022;9. pmid:35600823
- 58. Ballicora MA, Iglesias AA, Preiss J. ADP-glucose pyrophosphorylase: a regulatory enzyme for plant starch synthesis. Photosynth Res. 2004; 79:1–24. pmid:16228397
- 59. Jin X, Ballicora MA, Preiss J, Geiger JH. Crystal structure of potato tuber ADP‐glucose pyrophosphorylase. EMBO J. 2005; 24:694–704. pmid:15692569
- 60. Nandi B, Nandy RK, Sarkar A, Ghose AC. Structural features, properties and regulation of the outer-membrane protein W (OmpW) of Vibrio cholerae. Microbiol. 2005; 151:2975–2986. pmid:16151208
- 61. Rogers S, Wells R, Rechsteiner M. Amino acid sequences common to rapidly degraded proteins: the PEST hypothesis. Sci. 1986; 234:364–368. pmid:2876518
- 62. Kyte J, Doolittle RF. A simple method for displaying the hydropathic character of a protein. J Mol Biol. 1982; 157:105–132. pmid:7108955
- 63. Hwang SK, Singh S, Maharana J, Kalita S, Tuncel A, Rath T, et al. Mechanism underlying heat stability of the rice endosperm cytosolic ADP-glucose pyrophosphorylase. Front Plant Sci. 2019; 10:70. pmid:30804963
- 64. Sanchez B, Rasmussen A, Porter JR. Temperatures and the growth and development of maize and rice: a review. Glob Chang Biol. 2014; 20:408–417. pmid:24038930
- 65. Gupta SK, Rai AK, Kanwar SS, Sharma TR. Comparative analysis of zinc finger proteins involved in plant disease resistance. PLoS One. 2012. pmid:22916136
- 66. Lu B, Wang G, Huang P. A comparison of amino acid composition of proteins from thermophiles and mesophiles. Wei Sheng Wu XueBao = ActamicrobiologicaSinica. 1998; 38:20–25. pmid:12549384
- 67. Jo BS, Choi SS. Introns: the functional benefits of introns in genomes. Genomics Inform. 2015; 13:112. pmid:26865841
- 68. Batra R, Saripalli G, Mohan A, Gupta S, Gill KS, Varadwaj PK, et al. Comparative analysis of AGPase genes and encoded proteins in eight monocots and three dicots with emphasis on wheat. Front Plant Sci. 2017; 8:19. pmid:28174576
- 69. Gorlova O, Fedorov A, Logothetis C, Amos C, Gorlov I. Genes with a large intronic burden show greater evolutionary conservation on the protein level. BMC Evol Biol. 2014; 14:1–7. pmid:24629165