Lysine acetylation is a reversible, dynamic protein modification regulated by lysine acetyltransferases and deacetylases. Recent advances in high-throughput proteomics have greatly contributed to the success of global analysis of lysine acetylation. A large number of proteins of diverse biological functions have been shown to be acetylated in several reports in human cells, E.coli, and dicot plants. However, the extent of lysine acetylation in non-histone proteins remains largely unknown in monocots, particularly in the cereal crops. Here we report the mass spectrometric examination of lysine acetylation in rice (Oryza sativa). We identified 60 lysine acetylated sites on 44 proteins of diverse biological functions. Immunoblot studies further validated the presence of a large number of acetylated non-histone proteins. Examination of the amino acid composition revealed substantial amino acid bias around the acetylation sites and the amino acid preference is conserved among different organisms. Gene ontology analysis demonstrates that lysine acetylation occurs in diverse cytoplasmic, chloroplast and mitochondrial proteins in addition to the histone modifications. Our results suggest that lysine acetylation might constitute a regulatory mechanism for many proteins, including both histones and non-histone proteins of diverse biological functions.
Citation: Nallamilli BRR, Edelmann MJ, Zhong X, Tan F, Mujahid H, Zhang J, et al. (2014) Global Analysis of Lysine Acetylation Suggests the Involvement of Protein Acetylation in Diverse Biological Processes in Rice (Oryza sativa). PLoS ONE 9(2): e89283. https://doi.org/10.1371/journal.pone.0089283
Editor: Mario Soberón, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Mexico
Received: August 23, 2013; Accepted: January 15, 2014; Published: February 20, 2014
Copyright: © 2014 Nallamilli et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: Peng’s group is supported by USDA grant 2011-67013- 30125. MJ Edelmann is supported by Southern Crop Fund 322038-191400-027000-241080 and 322431-191400-027000-329180 (USDA ARS). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Protein acetylation is a highly conserved posttranslational modification (PTM) in prokaryotes and eukaryotes, that is probably more conserved than protein phosphorylation , although it is considered to be less common than phosphorylation or ubiquitination , . Acetylation is a covalent modification, where an acetyl group is transferred from acetyl-coenzyme A by acetyltransferases, and it might either affect the α-amino group of a protein N-terminus, or the ε-amino group of a lysine residue. The N-terminal acetylation, catalyzed by N-terminal acetyltransferases (NATs) is an irreversible modification, which occurs during the protein synthesis. Conversely, lysine acetylation at the ε-amino group is a reversible, dynamic modification regulated by lysine acetyltransferases (KATs) and lysine deacetylases (KDACs), and is much less common than widespread N-terminal acetylation. It has been originally identified as a post-translational modification of histones by Allfrey et al., 1964  and since then it has been found to be highly conserved in both prokaryotes and eukaryotes .
Lysine acetylation in histones plays a central role in epigenetic control of gene expression by regulation of the chromatin structure. Acetylation neutralizes the positive charges of lysine residues and decreases histone affinity to negatively-charged DNA molecules thereby increasing the accessibility of DNA to transcription factors , . Furthermore, transcriptional co-regulators and chromatin remodeling factors can also recognize the acetylated lysine residues. Histones can be modified by acetylation on multiple residues, and the most important acetylation targets include K9, K14, K18, K23, and K27 of histone H3, K5, K8, K12, K16, and K20 of histone H4, K5, K9, and K13 of H2A, as well as K5, K12, K15, and K20 of H2B , . Histone acetylation on lysine residues results in relaxed state of chromatin structure, which is often associated with increased gene activity, whereas deacetylation leads to a compact state of chromatin structure and thus transcriptional repression. Acetyltransferases, which catalyze the lysine acetylation, are classified as type A or B, depending on their subcellular distribution. Type A acetyltransferases are involved in the acetylation of nuclear histones, whereas type B acetyltransferases are cytoplasmic proteins involved in acetylation of histones in cytoplasm .
Recent advances in high-throughput mass spectrometry-based proteomics have made substantial contributions to the global analysis of lysine acetylation and a large number of acetylated non-histone proteins of diverse biological function have been identified. For example, 195 acetylated proteins on 388 sites have been identified in HeLa cells and mouse liver mitochondria with the help of acetyllysine antibody and high-throughput mass spectrometry . Similar methodology used in combination with high-resolution mass spectrometric analysis yielded identification of 3600 lysine acetylation sites on 1750 proteins in human cells . Interestingly, a large number of acetylation sites are also shown in prokaryotes, such as Escherichia coli and Salmonella , . For instance 138 acetylation sites on 91 proteins have been identified in the global analysis of Escherichia coli’s acetylome .
Plant proteins can also be acetylated, highlighted by the fact that multiple acetyltransferases and deacetylases regulating this modification have been identified in the genome of various plant species. The Arabidopsis genome contains at least twelve histone acetyltransferases (HAT) and eighteen histone deacetylases (HDAC) . The rice genome contains 19 HDAC genes  and using the chromatin database, seven HAT genes were identified in rice genome (http://www.chromdb.org/). It has been shown that histone acetylation plays a significant role in the regulation of cell cycle, development, flowering time, and hormone signal transduction in plants . Dynamic and reversible changes in histone H3 acetylation is observed at two submergence-inducible genes, alcohol dehydrogenase 1 (ADH1) and pyruvate decarboxylase 1 (PDC1) in rice . It has been shown that H3K9 and H4K12 acetylation status is elevated in euchromatic regions in rice . Increased H3K9 acetylation at the RFT1 locus (RICE FLOWERING LOCUS T 1) is correlated with the activation of RFT1 transcription, which encodes a mobile flowering signal and promotes floral transition under short-day conditions in rice , . Tan et al., 2011  carried out large-scale analysis of histone modifications in response to cell wall removal and regeneration in rice. They found that differential H3K18 and H3K23 acetylation is closely associated with cell wall removal. Acetylation on H3K18 and H3K23 were identified and quantified using isotope labeling assisted mass spectrometry-based approach.
Despite the extensive studies on lysine acetylation in histones, the extent of lysine acetylation in non-histone proteins in plants has remained largely unknown until recently. In 2011, two large-scale analyses of lysine acetylation in Arabidopsis were reported. One study identified 91 acetylated sites on 74 proteins of diverse functional classes, and another one identified 64 acetylated sites on 57 proteins , . These results indicated that lysine acetylation is important in the regulation of key metabolic enzymes in Arabidopsis. The identified acetylated proteins included photosynthesis-related proteins, such as Photo system II (PSII) subunits, light-harvesting chlorophyll a/b-binding proteins, RuBisCO large and small subunits, and chloroplastic ATP synthase (β-subunit) . Interestingly in 2012, one more study related to lysine acetylation in Grapevine (Vitis vinifera) was reported with the identification of 138 lysine acetylated sites . Rice serves as the staple food for over half of the world’s population and it is a model plant for plant biological studies of monocots, particularly the cereal crops. The presence of a large number of acetyltranferases and deacetylases in the rice genome suggests that acetylation of non-histone proteins may also play an essential role in rice development and metabolism. Here we report a proteomics study of lysine acetylation in rice. We identified 60 lysine acetylated sites on 44 proteins controlling diverse biological functions in various cellular components. Gene ontology analysis clearly demonstrated that lysine acetylation in rice is not limited to histones, but occurs in diverse proteins localized in compartments such as cytoplasm, nucleus, chloroplasts and mitochondria.
Materials and Methods
Plant Materials and Growth Conditions
The growth conditions of rice (Oryza sativa, cultivar Nipponbare) were the same as in our previous study , . Plants were grown in the greenhouse of the Department of Biochemistry and Molecular Biology, Mississippi State University, MS, USA.
Suspension Cell Culture
Rice (Oryza sativa) NB2P suspension cell cultures were maintained as reported , . Briefly, suspension cells were grown at 24°C with constant shaking on a gyratory shaker at 150 rpm in B5 liquid medium (pH 5.7) containing 20 g/L sucrose, 0.5 g/L MES, 2.0 mg/L 2,4-dichlorophenoxyacetic acid (2,4-D), 2 g/L casein enzymatic hydrolysate, and 0.005% (w/v) pectinase. Suspension cells were subcultured weekly. Healthy looking suspension cells were harvested three days after subculture for protein extraction.
Protein Extraction and Tryptic Digestion
Proteins were isolated using phenol extraction method ,  from three biological replicate samples. Ten grams of exponentially growing cell suspension culture was harvested three days after subculture and ground in liquid nitrogen. Protein extracts were prepared in an extraction buffer (0.9 M sucrose, 0.5 M Tris-HCl pH 8.7, 0.05 M EDTA, 0.1 M KCl, and 2% β-mercaptoethanol added freshly), mixed with an equal volume of saturated phenol (pH 8.0) and then homogenized for 10 minutes. The homogenate was centrifuged at 2500×g for 10 minutes, the phenol phase was recovered and the phenol extraction was repeated three times. The final collection of phenol was mixed with five volumes of precipitation buffer (methanol with 0.1 M ammonium acetate and 1% β -mercaptoethanol). Precipitation was carried out at −70°C overnight. The precipitant was recovered by centrifugation at 13400×g for 10 minutes and the pellet was washed three times with cold precipitation buffer and then three times with ice cold 70% ethanol. The protein pellet was lyophilized to powder in a speed vacuum (LABCONCO, model LYPH-LOCK 6) and stored at −70°C for further analysis. The triplicates protein samples were digested with trypsin at pH 7.8 using a trypsin:substrate ratio of 1∶40, followed by peptide purification by C18 Sep-Pak columns (#WAT020515, Waters, USA) exactly as reported .
Immunoaffinity Purification of Lysine-acetylated Peptides
Triplicate of samples were used for affinity purification of the lysine-acetylated peptides. The volume of peptide samples was concentrated using vacuum centrifugation and dissolved in 400 µl MOPS immunoprecipitation buffer (50 mM MOPS pH 7.2, 10 mM NaH2PO4, 50 mM NaCl). Peptides were incubated with 30 µl anti-acetyl-lysine antibody conjugated to agarose (ICP0388, Immunechem, USA) for 12 hours at 4°C as it has been previously published . In the next step, agarose beads were washed four times at 4°C with the MOPS immunoprecipitation buffer and once with deionized water. The acetylated peptides were eluted from the resin using 50 µl 0.1% formic acid, which was incubated with the beads for 15 minutes at room temperature with gentle shaking. The elution step was repeated twice. The samples were further purified by reversed-phase C18 columns, and after the volume was concentrated on vacuum centrifuge, the sample volume was adjusted to 40 µl in solvent containing 2% acetonitrile and 0.1% formic acid.
Protein Identification by Mass Spectrometry
Peptides were separated by using 75-µm i.d. ×15 cm reversed-phase column (fused-silica C18 column, Thermo) controlled by an Ultimate 3000 nanoflow HPLC (Dionex). Peptides were eluted using a 180-minute gradient (2%–95% solvent B containing 95% acetonitrile and 0.1% formic acid) at a flow rate of 0.5 µl/min, and introduced into an Orbitrap Velos mass spectrometer (Thermo Fisher). The Orbitrap was operated in the data-dependent mode, where the full scan MS spectra (300–2,000 amu) were acquired with a resolution of 100,000 and analyzed by an FT-MS analyzer. The five most intense ions were then selected for collision-induced (CID) fragmentation in the OrbiTrap at normalized collision energy of 35% and activation time of 40 ms.
The acquired data were analyzed by Proteome Discoverer 1.3 (Thermo) using Sequest, with minimum precursor mass of 350 Da and maximum 5000 Da. The signal to noise threshold used for FT was 1.5. The searches were done against the NCBI reference genome database for Oryza sativa (August 2011) using only full tryptic peptides with maximum missed cleavage site of four, precursor mass tolerance of 10 ppm and fragment mass tolerance of 0.8 Da. The protein modifications allowed in the search included N-terminal acetylation (+42.011 Da), carbamidomethyl on cysteine (+57.021 Da), oxidation on methionine (+15.995 Da), acetylation on lysine (+42.011 Da), and trimethylation on lysine (+42.047 Da). The maximum number of peptides considered was 500 and the fragment ion cutoff percentage was 0.1. The peptides were grouped by mass and sequence, and the proteins were also subjected to protein grouping, taking into account only the PSMs with confidence at least medium and delta Cn>0.15. The Peptide Validator option was used, where the reverse decoy database search was performed. The following cut-off values were used: the results were filtered similarly as reported Kelly et al., 2009 , using the maximum Cn value of 0.1, the normalized XCorr values of 1.5, 2, 2.5, 3 and 4 for charge states of 1+, 2+, 3+, 4+, 5, and the peptide rank of 1 .
Gene Ontology Analysis
Functional classification of proteins was carried out according to the gene ontology (GO) rules using the AgBase at http://www.agbase.msstate.edu/ . Three independent gene ontologies were used to describe the function of gene products such as cellular component (CC), molecular function (MF) and biological process (BP). GO annotations were obtained from GORetriever, a tool available at AgBase . GOSlimViewer tool was used to obtain the summary data to generate GO pie charts (AgBase). Proteins without annotations at AgBase were searched in other databases, including NCBI, UniProt, Gramene and TIGR (Rice Genome Annotation Resource) .
Western Blot Analysis
Protein samples were isolated from four different rice tissues such as suspension cells, endosperm (5 days after anthesis), flower and leaf. Proteins samples were separated on SDS-polyacrylamide gel and transferred to PVDF (Millipore) membrane. Acetylated Lys antibody (ImmuneChem) was used in a 1∶1000 dilution as reported .
Detection of Lysine Acetylation in Rice
Antibodies specifically recognizing acetylated lysine have been successfully used to detect the lysine acetylated proteins in different organisms such as Arabidopsis, human cells and E. coli , , , , . To detect the status of lysine acetylation in rice proteome, proteins from four different tissues, including suspension cells, endosperm, flower and leaf, were examined by Western blotting using antibodies specific for acetylated lysine. Multiple major protein bands with molecular weight higher than histones were detected (Fig. 1), indicating that lysine acetylation is probably not limited to histones but also occurs in other rice proteins. Most clearly, four distinct protein bands were co-migrated with 52, 25, 19, and 6 kD standard protein markers (Fig. 1). In addition, multiple weaker bands plus smears were observed, which were different from the patterns revealed by Coomassie blue stain.
Protein samples were collected from different tissues: 1. Suspension cells, 2. Endosperm (5 days after anthesis), 3. Flower, and 4. Leaf. Equal amount of protein samples were loaded for western blot analysis. Proteins samples were separated on SDS-polyacrylamide gel and transferred to PVDF (Millipore) membrane. Acetylated Lys antibody (ImmuneChem) was used in a 1∶1000 dilution as previously described Finkemeier et al., 2011 .
Global Mapping of Acetylated Peptides by Immunoaffinity Purification Coupled with LC-MS/MS
To map lysine acetylation on a global scale, we used immunoaffinity enrichment combined with high-resolution mass spectrometry. Rice cell suspension cultures were selected as experimental materials because they provide good amount of protein extracts of actively dividing cells and are easy to maintain and collect. Proteins extracted from exponentially growing cells were digested with trypsin and the generated peptides were subjected to affinity enrichment with antibodies specific for acetyllysine. Antibody based immunoaffinity purifications were highly recommended for the enrichment of specific modified peptides because antibodies can easily distinguish proteins with different modifications such as acetylation, methylation and trimethylation. The enriched peptides were analyzed by a high-resolution Orbitrap mass spectrometer operated in the fourier transform (FT) mode. High-resolution Orbitrap mass spectrometer was successfully used to distinguish different protein modifications like acetylation and trimethylation in different organisms. The mass difference of the two modifications is 0.03 Da. Our mass spectrometer was operated at the resolving power of 100,000 on the orbitrap, which was sufficient to distinguish between these two modifications . The criteria we used for identifying acetylated peptide included an accurate measurement of precursor and product ions and a mass difference of m/z 170 between two “y” or “b” ions to establish an acetylated lysine. In our previous studies , we included another criterion, a unique immonium ion at m/z 126.1 of acetylated lysine, which is a further fragment ion induced by the loss of NH3 from the acetylated lysine immonium ions at m/z 143.1, typical of an acetylated lysine residue. Due to the substantial improvement of mass spectrometory resolution, we did not observe inconsistencies between accurate ion measurements and immonium ion detection in our preliminary studies. More importantly, the samples we used for mass analysis were enriched with antibodies specific for acetylated lysine. Although the acetyl group and the trimethyl group are very close in mass, they are highly different in structure and property. The antibodies can easily distinguish these two groups. The anti-acetyl-lysine antibody (ImmuneChem) we used is highly specific for the acetyl group and has been successfully used to global identification of lysine acetylated proteins in different organisms including Arabidopsis, human cells and E. coli , , , , . In the acetyl group affinity enriched sample, the trimethylated peptides are largely eliminated. The chance for the acetylation modification to be confused by trimethylation is small, if not impossible. In addition, we noted that immonium ions were not used as a criterion in the identification of acetylated lysine in multiple publications using advanced mass spectrometry analysis , , . Therefore, we did not include immonium ion data here. Using this approach followed by data analysis and manual examination of the mass spectra, we identified 74 lysine acetylated sites from 52 different proteins of diverse biological functions. Among them, 8 peptides had an acetylation at the C-terminals and 6 peptides might have acetylaton at both interior and C-terminal sites. All the other peptides have interior acetylated lysine. C-terminal lysine acetylation has been observed in other acetylome studies, where trypsin was used for protein digestion , . The proposed explanation is that the C-terminal lysine could be digested by cellular proteinases, and in this case these lysines could be in the C-terminus prior to the tryptic digestion . Alternatively, pre-existing C-terminal acetylation may have critical biological functions in many proteins . Plant cells have a big vacuole which occupies up to 90% of the cell volume. The vacuole contains a large amount of various proteinases which are released during sample grinding. The detection of C-terminal lysine acetylation is highly possible. Given that trypsin does not cleave after modified acetyl lysine, however, the C-terminal lysine acetylation sites were not included in our further analyses. Apart from the C-terminal acetylated lysine sites, we had identified 60 lysine acetylated sites from 44 different proteins with high confidence (Table 1, Table S1 and Figure S1). Some examples of the mass spectra are shown in Fig. 2 and all the other spectra are shown in Figure 1. Many non-histone proteins, which had not been previously shown to be modified by lysine acetylation, were found to be acetylated.
The detected b- and y- fragment ion series are shown. The CID fragmentation spectrum of acetylated peptide unique to (A) Dihydroorotate dihydrogenase, (B) Enolase, (C) Histone H2A, and (D) Histone H4.
The Acetylation Sites Display Substantial Sequence Bias
To understand the regulation of protein acetylation and identify a possible consensus motif, we examined the occupancy frequency of amino acids in positions surrounding the identified acetylated lysine residues. Substantial bias in amino acid distribution was observed from position −20 to +20 around the modified lysine although a well-defined consensus sequence was not detected. For example, charged amino acid K and nonpolar amino acids A, G and L presented in much higher frequency than the average of the 20 amino acids in these positions (Table S2). In contrast, charged amino acid H and the aromatic amino acids (W, Y, and F) were presented in much lower frequency. The amino acids in positions −2 to + 2 around the lysine acetylation sites were examined in Arabidopsis . We found that rice and Arabidopsis shared high sequence similarity in these positions. For example, the preferred top five amino acids in −2 position were A, G, L, E, and R in rice (Fig. 3A) and G, L, A, E, and T in Arabidopsis in the presented order. The preferred top five amino acids in the −1 position were A, E, G, S, and R in rice (Fig. 3A) and G, A, D, S, and E in Arabidopsis. The amino acids from −6 to + 6 were examined in human cells . Plants and human cells share little sequence similarity in positions from −3 to +2 around the acetylation sites. Interestingly, the amino acids share moderate degree of similarity in positions −6 to −4 and +3 to +6. For example, the charged amino acids K, R, and E presented in high frequency in these positions in both rice and human cells (Table S2 and Fig. 3A). Secondary structure analysis showed that the acetylation sites distribution was about 52% in coil, 39% in helix, and 9% in beta-strand (Fig. 3B and Table S3), which was highly similar to the distribution in human cells .
(A) Amino acid frequency percentages for ±6 amino acids from lysine acetylated site. Letter K in the center represents the acetylated lysine. Amino acid frequency percentages are calculated by considering all identified lysine acetylated sites on different proteins in rice. (B) Different secondary structural regions of acetylated lysines identified in rice.
Gene Ontology Analysis of the Acetylated Proteins
For a comprehensive analysis of the distribution and function of the acetylated proteins, we carried out GO analysis using the AgBase at http://www.agbase.msstate.edu/ . GO annotations of identified proteins were obtained from different databases such as AgBase, NCBI, UniProt, Gramene and TIGR (Rice Genome Annotation Resource) . Three independent categories of gene ontologies were used to describe the function of gene products, which were cellular component, molecular function and biological process in which the gene product participates. Analysis of protein distribution within the cellular component (Fig. 4A ) indicated that the majority of identified lysine acetylated proteins were localized in nucleus (17.2%), followed by organelle (13.7%), plastid (10.3%), cytoplasm (8.6%), mitochondria (5.1%), plasma membrane (5.1%), peroxisome (3.4%), vacuole (1.7%), cell wall (1.7%), etc. Total nuclear proteins occupy a substantial fraction of the acetylated proteins in rice. Compared with the subcellular distribution profile of Arabidopsis acetylated proteins, we identified more proteins in nucleus, mitochondria, and plasma membrane. In contrast, the percentage of proteins in cytoplasm was smaller (Fig. 5).
The pie chart was generated using the analysis results of the GOSlimViewer tool at AgBase. Percentage distribution of the unique proteins was used to make the pie charts. Gene distribution was grouped on the basis of Cellular Components (A), Molecular Functions (B) and Biological Processes (C).
Cellular components of identified rice acetylated proteins were compared with the recently reported Arabidopsis acetylated proteins data .
Molecular function analysis showed that proteins with ion binding (21.3%), oxidoreductase activity (8.7%), DNA binding (7.7%), nuclease activity (5.8%), RNA binding (2.9%), methyltransferase activity (2.9%), kinase activity (1.9%), and hydrolase activity(1.9%) account for over half of all the identified proteins (Fig. 4B). Other molecular functions included: ligase activity (1.9%), lyase activity (1%), nucleotidyltransferase activity (1%), peptidase activity (1%), transcription factor activity (1%), translation factor activity (1%) and transferase activity (1%). Biological process analysis showed that response to stress (10.6%), DNA metabolic processes (8.8%), cellular nitrogen metabolic process (7%), biosynthetic process (6.1%), chromosome organization (4.4%), cellular component assembly (4.4%), carbohydrate metabolic process (3.5%) and catabolic process (2.6%) were the major biological processes in which the acetylated proteins are involved (Fig. 4C).
Selected Examples of Acetylated Proteins
1) Lysine acetylation of nuclear proteins.
Histones are well known for their modification by lysine acetylation and methylation. In the present study, we identified six histone proteins with 14 lysine acetylation sites (Table 1). K13 and K17 of histone H4; K43 and K46 of histone H2B.7; K7 and K13 of histone H2A are found to be lysine acetylated. In addition to these core histones, we found that three core histone H2A/H2B/H3/H4-domain containing proteins (Q94JJ4 (Histone H2B.4), Q6F362 (Histone H2B.9) and Q0JQP0 (Os01g0149400 protein)) were also acetylated. Transcription factors such as BHLH transcription factor, GATA zinc finger family protein, and basic helix-loop-helix DNA-binding protein are also acetylated. Transposon proteins and retrotransposon proteins were the biggest group of acetylated nuclear proteins with 9 members being acetylated (Table 1). Two lysine acetylated sites were identified in two transposon proteins which belonged to CACTA, En/Spm sub-class (Q7XLN2 and Q2QRJ1). Four other lysine acetylated sites were identified in two mutator sub-class transposon proteins. We also identified five retrotransposon proteins with acetylated lysine sites in six positions. Out of the five acetylated retrotransposon proteins, one protein belong to Ty3-gypsy subclass, one protein belongs to Ty1-copia subclass and three proteins belong to an unclassified subclass (Table 1).
2) Lysine acetylation of protein kinases.
We identified one rice protein kinase to be lysine acetylated and its function is presently unknown. Two lysine acetylation sites (K128 and K140) were identified as acetylation sites in lectin-like receptor kinase 1 (Table 1 and Fig. 6). In plants, lectin-like receptor kinases have diverse role in plant microorganism interaction and oligosaccharide signal transduction.
3) Lysine acetylation of metabolic proteins.
Several important metabolic proteins (Table 1 and Fig. 6) were found to be lysine acetylated including glyceraldehyde-3-phosphate dehydrogenase (GAPDH, Q7FAH2), enolase (B9G3A0), cytochrome P450 72A1 (Q8L4Q4), and dihydroorotate dehydrogenase (Q6Z744). Previous studies already indicated that nuclear translocation of GAPDH is mediated by acetylation of three lysine residues at position117, 227 and 251 in human cells . Lysine acetylome studies have indicated that protein acetylation plays a critical role in the regulation of metabolism in Arabidopsis, Escherichia coli and human cells , , , and our results are consistent with these preceding reports.
4) Lysine acetylation of acetyltransferase.
Acetyltransferases are responsible for addition of an acetyl moiety to the lysine residue of substrate proteins. We identified one lysine acetylated site (K158) in the acetyltransferase protein (A2ZVA8) (Table 1), which is not yet characterized. Previously, it has been shown that P/CAF, a histone acetyltransferase enzyme, is self-acetylated in humans . Acetylation of P/CAF enhances its acetyltransferase activity, suggesting that acetylation of acetyltrasferases may also regulate the acetylation activity in rice.
5) Lysine acetylation of hypothetical function unknown proteins.
Among the lysine acetylated proteins, the hypothetical proteins with unknown function were a major group, including 10 proteins with 12 lysine-acetylated sites (Table 1). Further investigations on the molecular functions of these proteins are required to fully appreciate the function of protein acetylation in the control of cellular activities.
1. The Size of Rice Acetylome
Lysine acetylation has been shown to be a widespread protein modification from E. coli to human cells , –. It was first described in histones, but we now know that it targets a wide range of proteins, particularly the proteins involved in metabolism and protein complexes . In spite of the genome-wide examination of acetylomes in multiple organisms, the precise acetylome size and the regulatory mechanism of protein acetylation are still largely unknown for non-histone proteins, particularly in plants. Wu et al., 2011  identified sixty-four lysine modification sites in 57 proteins in Arabidopsis. Meanwhile, Finkermeier et al., 2011  identified 91 lysine acetylation sites on 74 proteins. Melo-Braga identified 138 acetylation sites in grapewine . In our studies in rice, we identified 60 lysine acetylated sites on 44 proteins. The identification of the acetylated sites and proteins has substantially advanced our understanding on protein acetylation in plants and their role in cellular activities. It is evident that protein acetylation plays a much broader role than merely regulation of histone functions. Interestingly, we found that only three out of the 44 acetylated proteins in our list were in common with those reported in Arabidopsis studies. These three proteins were cytochrome P450, an armadillo repeat-containing protein, and a glyceraldehyde-3-phosphate dehydrogenase protein. Nuclear proteins were the largest group in our identified proteins. However, we identified very few photosynthesis proteins probably because we used dark grown suspension cells as protein source. In contrast, proteins involved in photosynthesis represented a significant proportion of the total observed acetylated proteins in the two reports in Arabidopsis , . These studies suggested that each of the three reports identified only a small fraction of a large plant acetylome. Analysis of the acetylated histone peptides identified in these studies strongly supports this conclusion. In Arabidopsis, about 15 acetylation sites have been reported in the four core histones . Finkemeier et al., 2011  identified two acetylation sites in H2B, one site on H3 and three sites on H4. Wu et al., 2011  identified one acetylation site on H3 and two sites on H4. In this study, we identified two acetylation sites on H4, two sites on H2B, and two sites on H2A. The low coverage of the core histone modification sites in these three studies clearly suggests that a large portion of the plant acetylome has not been discovered. In human cells, 3600 lysine acetylation sites on 1750 proteins were identified , including all the known acetylated histone sites. In this study, they further separated the peptides from immunoaffinity purification by means of isoelectric focusing the peptides into 12 fractions and made use of diverse lysine deacetylase inhibitors. These methods can also be used to further investigate the plant acetylome. In addition, mutants of the lysine deacetylase genes can be very helpful as well. Meanwhile, it is worth to note that protein biochemistry studies in plants have always been a challenge compared with other organisms. For example, tandem affinity protein purification has become a very successful routine protein purification method in many organisms. In plants, however, proteins successfully purified with this approach are still very limited –. This could be due to cell wall effect on protein extraction or the plant secondary compounds that may interfere with affinity purification.
To overcome the problem in plants, different approaches should be used. Further separation of peptides from immunoaffinity purification by means of isoelectric focusing into multiple fractions and make use of diverse lysine deacetylase inhibitors and mutants may lead to a better coverage of the lysine acetylome in plants .
2. Comparison of Lysine Acetylation in Different Organisms
Even though lysine acetylation first identified for histone proteins, now it is considered as widespread modification for different non histone proteins. Recent global proteomic analysis studies further extended the scope of lysine acetylation in different organisms such as E. coli, salmonella, mouse, human, and Drosophila , –, –, –. Most of the large scale acetylation studies are reported in bacteria, mouse and human cell lines. In plants, only Arabidopsis and grape vine have been studied –. Little is known about the lysine acetylation beyond the histone proteins in major crop plants such as rice, maize, wheat and sorghum. Our results present an example of protein acetylation in monocots. We summarized all the published protein acetylation data and our rice results in Table S4 for the convenience of comparison.
3. The Features of the Acetylation Sites in Rice
We examined the protein sequence 20 amino acids upstream and 20 amino acids downstream of the acetylation sites in details (Table S2 and Fig. 3A). Although a clear consensus sequence was not discovered, sequence bias around the acetylation site was evident. Interestingly, the amino acid bias shares common features with Arabidopsis and human cells to a certain degree. In Arabidopsis, only the amino acids in four positions around the acetylation sites were examined . In the −1 position, the top five amino acids in terms of frequency of occurrence were G, A, D, S, and E. In rice, this order was the following: A, E, G, S, and R. In the −2 position, the top five amino acid presence order was G, L, A, E, and T and in rice it was the following: A, G, L, E, and R. The results clearly indicate that rice and Arabidopsis share high degree of similarity in amino acid preference. Compared with human cells , the rice acetylation sites share little similarity in positions from −3 to +2. Interestingly, however, from position −4 to −6 and +3 to +6 both human and rice cells have similar preference to charged amino acids K, R, and D. Our further analysis showed that the bias to these three charged amino acids extend to −20 and +20 in both directions (Table S2). Other preferred amino acids include A, G, L, S, and T. Interestingly, the charged amino acid histidine is well underrepresented in sites around the acetylated lysine. Other underrepresented amino acids include W, Y, and F. The distribution of the acetylated sites among local secondary structure was about 52% in coil, 39% in helix, and 9% in beta-strand (Fig. 3B and Table S3), which was similar to the distribution in human cells . Once more comprehensive analysis of the amino acid sequences of acetylated proteins is completed, the feature of the acetylation sites will be revealed.
4. Diverse Function of the Acetylated Proteins
The acetylated proteins identified in our studies are involved in diverse molecular processes with wide biological functions. Nuclear proteins represent 17.2% of the acetylated proteins. These proteins regulate genome stability (transposon and retrotransposon proteins), transcription (transcription factors and histone modifications), and genome reverse transcription (gag-pol protein). Protein involved in translational activity is also subjected to lysine acetylation, which include translation initiation factor IF-3-like protein (Q6K674).
Another important group of acetylated proteins we detected are metabolic proteins, which includes glyceraldehyde-3-phosphate dehydrogenase (Q7FAH2), enolase (B9G3A0), cytochrome P450 72A1 (Q8L4Q4), and dihydroorotate dihydrogenase (Q6Z744) (Table 1 and Fig. 6). Glyceraldehyde 3-phosphate dehydrogenase (GAPDH) is a key enzyme in glycolysis pathway. This enzyme is also lysine acetylated in Arabidopsis and E. coli , . Enolase is another glycolysis pathway enzyme that was shown to be lysine acetylated in both our study and E. coli . The fact that these and other glycolytic enzymes were shown to be acetylated in bacteria, mammals and plants suggests that functional regulation of acetylation in glycolysis might be conserved. Cytochrome P450 proteins are heme-thiolate proteins that play a key role in biosynthesis of lignins, terpenoids, alkaloids, sterols, and fatty acids. They are also involved in the herbicide detoxification, phytoalexin synthesis, pigment biosynthesis, and phytohormone synthesis in plants . In Arabidopsis, cytochrome P450 (At5g45340) is also lysine acetylated, suggesting that cytochrome acetylation is conserved in plants. We identified two lysine acetylation sites in armadillo repeat-containing protein. Armadillo family proteins are involved in many functions like cell death, cell division and cytoskeleton organization in many plants. An Arabidopsis protein from this group, armadillo repeat-containing kinesin related protein, has also been shown to be acetylated, indicating that acetylation may play a role in the regulation of this protein group .
Lysine acetylation is known to have a negative crosstalk with ubiquitination and sumoylation, therefore constituting a regulatory switch controlling protein stability and function , . Effects of acetylation on E3 ubiquitin ligase activity in human and animals have already been documented, such as acetylation-mediated inhibition of Mdm2, a ubiquitin E3 ligase and an important negative regulator of p53 . Similar to the ubiquitination, SUMO (Small Ubiquitin-like Modifier) proteins modify their targets in the process called sumoylation, regulating their stability, localization and transcription. SUMO proteins are first activated by an E1 enzyme, followed by conjugation via E2 enzyme, and finally they are ligated to the ε-amino group of lysine residue in the target protein by the E3 ligase . In this study, we found that E3 SUMO-protein ligase SIZ2 was lysine acetylated (Table 1 and Fig. 6). Our results provide a basis for further mechanistic studies of the regulatory mechanism of SUMO E3 ligase activity in rice. Interestingly, we also found that a protein exosome complex exonuclease is lysine acetylated in rice. Exoribonuclease complex is involved in the degradation of unstable mRNAs containing AU-rich elements (AREs) within their 3′-untranslated regions, implying a possible role of lysine acetylation in the degradation of mRNAs as well.
In summary, our global proteomic study of lysine acetylation has revealed that this modification is much more abundant in rice than previously anticipated, and it targets a large group of non-histone proteins with broad biological functions. Several of the acetylated proteins identified in our studies are also acetylated in Arabidopsis or other organisms. Our results suggest that lysine acetylation is a highly conserved modification, constituting a common regulatory mechanism in the control of cellular activities.
Fragmentation spectra of lysine acetylated peptides identified in rice.
Summary of lysine acetylated peptides identified in rice.
Amino acid frequency percentages for ±20 amino acids from lysine acetylated site.
Detailed secondary structural characteristics of acetylated lysines identified in rice.
We thank Ken Pendarvis (Institute for Genomics, Biocomputing and Biotechnology, MSU) for technical help in mass spectrometry.
Conceived and designed the experiments: ZP BRRN MJE. Performed the experiments: BRRN MJE HM JZ. Analyzed the data: BRRN MJE XZ FT BN ZP. Contributed reagents/materials/analysis tools: ZP MJE. Wrote the paper: BRRN ZP MJE.
- 1. Choudhary C, Kumar C, Gnad F, Nielsen ML, Rehman M, et al. (2009) Lysine acetylation targets protein complexes and co-regulates major cellular functions. Science 325: 834–840.
- 2. McEwan DG, Dikic I (2011) The Three Musketeers of Autophagy: phosphorylation, ubiquitylation and acetylation. Trends Cell Biol 21: 195–201.
- 3. Allfrey VG, Faulkner R, Mirsky AE (1964) Acetylation and Methylation of Histones and Their Possible Role in the Regulation of Rna Synthesis. Proc Natl Acad Sci U S A 51: 786–794.
- 4. Liu Z, Cao J, Gao X, Zhou Y, Wen L, et al. (2011) CPLA 1.0: an integrated database of protein lysine acetylation. Nucleic Acids Res 39: D1029–1034.
- 5. Mutskov V, Gerber D, Angelov D, Ausio J, Workman J, et al. (1998) Persistent interactions of core histone tails with nucleosomal DNA following acetylation and transcription factor binding. Mol Cell Biol 18: 6293–6304.
- 6. Puig OM, Belles E, Lopez-Rodas G, Sendra R, Tordera V (1998) Interaction between N-terminal domain of H4 and DNA is regulated by the acetylation degree. Biochim Biophys Acta 1397: 79–90.
- 7. Hollender C, Liu Z (2008) Histone deacetylase genes in Arabidopsis development. J Integr Plant Biol 50: 875–885.
- 8. Servet C, Conde e Silva N, Zhou DX (2010) Histone acetyltransferase AtGCN5/HAG1 is a versatile regulator of developmental and inducible gene expression in Arabidopsis. Mol Plant 3: 670–677.
- 9. Chen ZJ, Tian L (2007) Roles of dynamic and reversible histone acetylation in plant development and polyploidy. Biochim Biophys Acta 1769: 295–307.
- 10. Kim SC, Sprung R, Chen Y, Xu Y, Ball H, et al. (2006) Substrate and functional diversity of lysine acetylation revealed by a proteomics survey. Mol Cell 23: 607–618.
- 11. Zhang J, Sprung R, Pei J, Tan X, Kim S, et al. (2009) Lysine acetylation is a highly abundant and evolutionarily conserved modification in Escherichia coli. Mol Cell Proteomics 8: 215–225.
- 12. Wang Q, Zhang Y, Yang C, Xiong H, Lin Y, et al. (2010) Acetylation of metabolic enzymes coordinates carbon source utilization and metabolic flux. Science 327: 1004–1007.
- 13. Pandey R, Muller A, Napoli CA, Selinger DA, Pikaard CS, et al. (2002) Analysis of histone acetyltransferase and histone deacetylase families of Arabidopsis thaliana suggests functional diversification of chromatin modification among multicellular eukaryotes. Nucleic Acids Res 30: 5036–5055.
- 14. Hu Y, Qin F, Huang L, Sun Q, Li C, et al. (2009) Rice histone deacetylase genes display specific expression patterns and developmental functions. Biochem Biophys Res Commun 388: 266–271.
- 15. Tsuji H, Saika H, Tsutsumi N, Hirai A, Nakazono M (2006) Dynamic and reversible changes in histone H3-Lys4 methylation and H3 acetylation occurring at submergence-inducible genes in rice. Plant Cell Physiol 47: 995–1003.
- 16. Yin BL, Guo L, Zhang DF, Terzaghi W, Wang XF, et al. (2008) Integration of cytological features with molecular and epigenetic properties of rice chromosome 4. Mol Plant 1: 816–829.
- 17. Komiya R, Ikegami A, Tamaki S, Yokoi S, Shimamoto K (2008) Hd3a and RFT1 are essential for flowering in rice. Development 135: 767–774.
- 18. Zhou DX, Hu Y (2010) Regulatory function of histone modification in controlling rice gene expression and plant growth. Rice 3: 103–111.
- 19. Tan F, Zhang K, Mujahid H, Verma DP, Peng Z (2011) Differential histone modification and protein expression associated with cell wall removal and regeneration in rice (Oryza sativa). J Proteome Res 10: 551–563.
- 20. Finkemeier I, Laxa M, Miguet L, Howden AJ, Sweetlove LJ (2011) Proteins of diverse function and subcellular location are lysine acetylated in Arabidopsis. Plant Physiol 155: 1779–1790.
- 21. Wu X, Oh MH, Schwarz EM, Larue CT, Sivaguru M, et al. (2011) Lysine acetylation is a widespread protein modification for diverse proteins in Arabidopsis. Plant Physiol 155: 1769–1778.
- 22. Melo-Braga MN, Verano-Braga T, Leon IR, Antonacci D, Nogueira FC, et al. (2012) Modulation of protein phosphorylation, N-glycosylation and Lys-acetylation in grape (Vitis vinifera) mesocarp and exocarp owing to Lobesia botrana infection. Mol Cell Proteomics 11: 945–956.
- 23. Zhang J, Nallamilli BR, Mujahid H, Peng Z (2010) OsMADS6 plays an essential role in endosperm nutrient accumulation and is subject to epigenetic regulation in rice (Oryza sativa). Plant J 64: 604–617.
- 24. Nallamilli BR, Zhang J, Mujahid H, Malone BM, Bridges SM, et al. (2013) Polycomb group gene OsFIE2 regulates rice (Oryza sativa) seed development and grain filling via a mechanism distinct from Arabidopsis. PLoS Genet 9: e1003322.
- 25. Lee TJ, Shultz RW, Hanley-Bowdoin L, Thompson WF (2004) Establishment of rapidly proliferating rice cell suspension culture and its characterization by fluorescence-activated cell sorting analysis. Plant Mol. Biol. Rep 22: 259–267.
- 26. Mujahid H, Tan F, Zhang J, Nallamilli BR, Pendarvis K, et al. (2013) Nuclear proteome response to cell wall removal in rice (Oryza sativa). Proteome Sci 11: 26.
- 27. Chitteti BR, Tan F, Mujahid H, Magee BG, Bridges SM, et al. (2008) Comparative analysis of proteome differential regulation during cell dedifferentiation in Arabidopsis. Proteomics 8: 4303–4316.
- 28. Li G, Nallamilli BR, Tan F, Peng Z (2008) Removal of high-abundance proteins for nuclear subproteome studies in rice (Oryza sativa) endosperm. Electrophoresis 29: 604–617.
- 29. Sobott F, Watt SJ, Smith J, Edelmann MJ, Kramer HB, et al. (2009) Comparison of CID versus ETD based MS/MS fragmentation for the analysis of protein ubiquitination. J Am Soc Mass Spectrom 20: 1652–1659.
- 30. Schwer B, Eckersdorff M, Li Y, Silva JC, Fermin D, et al. (2009) Calorie restriction alters mitochondrial protein acetylation. Aging Cell 8: 604–606.
- 31. Kelly TJ, Lerin C, Haas W, Gygi SP, Puigserver P (2009) GCN5-mediated transcriptional control of the metabolic coactivator PGC-1beta through lysine acetylation. J Biol Chem 284: 19945–19952.
- 32. MacCoss MJ, Wu CC, Yates JR 3rd (2002) Probability-based validation of protein identifications using a modified SEQUEST algorithm. Anal Chem 74: 5593–5599.
- 33. McCarthy FM, Wang N, Magee GB, Nanduri B, Lawrence ML, et al. (2006) AgBase: a functional genomics resource for agriculture. BMC Genomics 7: 229.
- 34. Ouyang S, Zhu W, Hamilton J, Lin H, Campbell M, et al. (2007) The TIGR Rice Genome Annotation Resource: improvements and new features. Nucleic Acids Res 35: D883–887.
- 35. Montgomerie S, Sundararaj S, Gallin WJ, Wishart DS (2006) Improving the accuracy of protein secondary structure prediction using structural alignment. BMC Bioinformatics 7: 301.
- 36. Zybailov B, Sun Q, van Wijk KJ (2009) Workflow for large scale detection and validation of peptide modifications by RPLC-LTQ-Orbitrap: application to the Arabidopsis thaliana leaf proteome and an online modified peptide library. Anal Chem 81: 8015–8024.
- 37. Wang D, Thompson P, Cole PA, Cotter RJ (2005) Structural analysis of a highly acetylated protein using a curved-field reflectron mass spectrometer. Proteomics 5: 2288–2296.
- 38. Cheema A, Knights CD, Rao M, Catania J, Perez R, et al. (2010) Functional mimicry of the acetylated C-terminal tail of p53 by a SUMO-1 acetylated domain, SAD. J Cell Physiol 225: 371–384.
- 39. Ventura M, Mateo F, Serratosa J, Salaet I, Carujo S, et al. (2010) Nuclear translocation of glyceraldehyde-3-phosphate dehydrogenase is regulated by acetylation. Int J Biochem Cell Biol 42: 1672–1680.
- 40. Santos-Rosa H, Valls E, Kouzarides T, Martinez-Balbas M (2003) Mechanisms of P/CAF auto-acetylation. Nucleic Acids Res 31: 4285–4292.
- 41. Zhang K, Sridhar VV, Zhu J, Kapoor A, Zhu J (2007) Distinctive core histone post-translational modification patterns in Arabidopsis thaliana. Plos one 11: e1210.
- 42. Rigaut G, Shevchenko A, Rutz B, Wilm M, Mann M (1999) generic protein purification method for protein complex characterization and proteome exploration. Nat Biotechnol 17: 1030–1032.
- 43. Puig O, Caspary F, Rigaut G, Rutz B, Bouveret E, et al. (2001) The tandem affinity purification (TAP) method: a general procedure of protein complex purification. Methods 24: 218–229.
- 44. Rohila JS, Chen M, Cerny R, Fromm ME (2004) Improved tandem affinity purification tag and methods for isolation of protein heterocomplexes from plants. Plant J 38: 172–181.
- 45. Rubio V, Shen Y, Saijo Y, Liu Y, Gusmaroli G, et al. (2005) An alternative tandem affinity purification strategy applied to Arabidopsis protein complex isolation. Plant J 41: 767–778.
- 46. Rohila JS, Chen M, Chen S, Chen J, Cerny R, et al. (2006) Protein-protein interactions of tandem affinity purification-tagged protein kinases in rice. Plant J 46: 1–13.
- 47. Van Leene J, Stals H, Eeckhout D, Persiau G, Van De Slijke E, et al. (2007) A tandem affinity purification-based technology platform to study the cell cycle interactome in Arabidopsis thaliana. Mol Cell Proteomics 6: 1226–1238.
- 48. Brown AP, Affleck V, Fawcett T, Slabas AR (2006) Tandem affinity purification tagging of fatty acid biosynthetic enzymes in Synechocystis sp. PCC6803 and Arabidopsis thaliana. J Exp Bot 57: 1563–1571.
- 49. Rivas S, Romeis T, Jones JD (2002) The Cf-9 disease resistance protein is present in an approximately 420-kilodalton heteromultimeric membrane-associated complex at one molecule per complex. Plant Cell 14: 689–702.
- 50. Abe M, Fujiwara M, Kurotani K, Yokoi S, Shimamoto K (2008) Identification of dynamin as an interactor of rice GIGANTEA by tandem affinity purification (TAP). Plant Cell Physiol 49: 420–32.
- 51. De Lucia F, Crevillen P, Jones AM, Greb T, Dean C (2008) A PHD-polycomb repressive complex 2 triggers the epigenetic silencing of FLC during vernalization. Proc Natl Acad Sci U S A 105: 16831–16836.
- 52. Chen Y, Zhao W, Yang JS, Cheng Z, Luo H, et al. (2012) Quantitative acetylome analysis reveals the roles of SIRT1 in regulating diverse substrates and cellular pathways. Mol Cell Proteomics 11: 1048–1062.
- 53. Jeffers V, Sullivan WJ Jr (2012) Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii. Eukaryot Cell 11: 735–742.
- 54. Weinert BT, Wagner SA, Horn H, Henriksen P, Liu WR, et al. (2011) Proteome-wide mapping of the Drosophila acetylome demonstrates a high degree of conservation of lysine acetylation. Sci Signal 4: ra48.
- 55. Yu BJ, Kim JA, Moon JH, Ryu SE, Pan JG (2008) The diversity of lysine-acetylated proteins in Escherichia coli. J Microbiol Biotechnol 18: 1529–1536.
- 56. Zhang K, Zheng S, Yang JS, Chen Y, Cheng Z (2013) Comprehensive profiling of protein lysine acetylation in Escherichia coli. J Proteome Res 12: 844–851.
- 57. Zhao S, Xu W, Jiang W, Yu W, Lin Y, et al. (2010) Regulation of cellular metabolism by protein lysine acetylation. Science 327: 1000–1004.
- 58. Pan G, Zhang X, Liu K, Zhang J, Wu X, et al. (2006) Map-based cloning of a novel rice cytochrome P450 gene CYP81A6 that confers resistance to two different classes of herbicides. Plant Mol Biol 61: 933–943.
- 59. Caron C, Boyault C, Khochbin S (2005) Regulatory cross-talk between lysine acetylation and ubiquitination: role in the control of protein stability. Bioessays 27: 408–415.
- 60. Hunter T (2007) The age of crosstalk: phosphorylation, ubiquitination, and beyond. Mol Cell 28: 730–738.
- 61. Wang X, Taplick J, Geva N, Oren M (2004) Inhibition of p53 degradation by Mdm2 acetylation. FEBS Lett 561: 195–201.
- 62. Park HJ, Kim WY, Park HC, Lee SY, Bohnert HJ, et al. SUMO and SUMOylation in plants. Mol Cells 32: 305–316.