A comprehensive examination of the lysine acetylation targets in paper mulberry based on proteomics analyses

Rocky desertification is a bottleneck that reduces ecological and environmental security in karst areas. Paper mulberry, a unique deciduous tree, shows good performance in rocky desertification areas. Its resistance mechanisms are therefore of high interest. In this study, a lysine acetylation proteomics analysis of paper mulberry seedling leaves was conducted in combination with the purification of acetylated protein by high-precision nano LC-MS/MS. We identified a total of 7130 acetylation sites in 3179 proteins. Analysis of the modified sites showed a predominance of nine motifs. Six positively charged residues: lysine (K), arginine (R), and histidine (H), serine (S), threonine (T), and tyrosine (Y) occurred most frequently at the +1 position, phenylalanine (F) was both detected both upstream and downstream of the acetylated lysines; and the sequence logos showed a strong preference for lysine and arginine around acetylated lysines. Functional annotation revealed that the identified enzymes were mainly involved in translation, transcription, ribosomal structure and biological processes, showing that lysine acetylation can regulate various aspects of primary carbon and nitrogen metabolism and secondary metabolism. Acetylated proteins were enriched in the chloroplast, cytoplasm, and nucleus, and many stress response-related proteins were also discovered to be acetylated, including PAL, HSP70, and ERF. HSP70, an important protein involved in plant abiotic and disease stress responses, was identified in paper mulberry, although it is rarely found in woody plants. This may be further examined in research in other plants and could explain the good adaptation of paper mulberry to the karst environment. However, these hypotheses require further verification. Our data can provide a new starting point for the further analysis of the acetylation function in paper mulberry and other plants.


Introduction
Ecosystem degradation and soil erosion are major problems that restrict the regional economy in karst areas. Ecological restoration is a focus in these regions, and the selection of suitable vegetation can be challenging because of these problems. The strong ability of paper mulberry to withstand environmental pressures indicates that further exploration of its resistance mechanisms is necessary. Proteins, as important type components encoded by genes that control the entire metabolic process. Protein regulation encompasses multilayered and interconnected transcriptional and translational processes [1]. This process begins with transcription from DNA and the splicing of genes into RNA molecules, which are subsequently translated into polypeptides later [2]. However, in the absence of posttranslational modifications, proteins likely cannot perform as many functions. Posttranslational modifications (PTMs) of histone are well-known for their critical roles in cellular pathways, as they can change the physicochemical properties of proteins and affect their activity and stability [3,4]. When PTM occurs to a protein, it will directly change the protein's binding ability and function, introducing new functions by introducing new functional groups such as acetyl, phospho, ubiquityl, succinyl and methyl groups [5]. Among PTMs, lysine acetylation was first discovered on histone tails where chromatin structure and gene expression are regulated. Changes in cellular lysine acetylation status can also alter metabolic enzyme activity and provide an adaptive mechanism for specific metabolic changes in cells [6].
Paper mulberry (Broussonetia papyrifera) is a unique fiber-bearing economic forest species in China due to its wide distribution, fast growth, strong adaptability and resistance to extreme environments. In recent years, it has been continuously developed and applied in different fields, especially as leaves for animal feed and bark and branches as sources of advanced papermaking materials and medicines. Based on transcriptomics, proteomics, phosphorylated proteomics and glycoproteomics studies, whole-genome analysis is now widely applied among archaea, bacteria and real bacteria. The acetylation of proteins in all three life fields, including nuclear biology, has been explored [7,8]. Some studies have been performed on lysine acetylation, and many histone acetyl transferases [9][10][11] have been identified in plants. Many studies have also shown that histone acetylation is involved in the responses to different environmental factors, including light and low temperature, to help plants better adapt to extreme conditions. It is evident that protein acetylation plays a much broader role than merely regulating of histone functions [12]. At the same time, qualitative, quantitative and functional analysis of key proteins in specific key metabolic pathways can be performed [13]. An interaction between the protein acetylation modification in paper mulberry and its anti-stress mechanism has been reported, but the types of acetylases in paper mulberry and the acetylation of important biologically-related pathways such as photosynthesis and metabolism remain to be further explored. Moreover, the modifications and expression patterns are unclear.
In this study, we aimed to identify systematically characterize lysine acetylation sites in tree proteins. This research will enhance our understanding of how paper mulberry has adapted to extreme karst environments.

Plant materials
The experimental materials were selected from Guizhou in August 2019 at a coordinate longitude of 106.66136 and latitude of 26.45024., and cutting seedlings with the same growth were cultivated by using the cutting propagation technique. The plant grows in greenhouse conditions (22˚C -25˚C) and under natural light cycles. About a month later, when the plant height was approximately 20 cm, 15g of uniformly growing leaves were collected from the vegetative stage of the plant, frozen immediately in liquid nitrogen and stored at -80˚C until used for RNA extraction. All experiments were performed at least 3 times using independently collected and extracted tissues, unless otherwise indicated.
Protein extraction. Each sample was ground in liquid nitrogen into cell powder and then transferred to a 5-mL centrifuge tube. After that, four volumes of lysis buffer (8 M urea, 1% Triton-100, 10 mM dithiothreitol, and 1% Protease Inhibitor Cocktail) were added to the cell powder, followed by sonication three times on ice using a high-intensity ultrasonic processor. The remaining debris was removed by centrifugation at 20,000 × g at 4˚C for 10 min. Finally, the protein was precipitated with cold 20% TCA for 2 h at -20˚C. After centrifugation at 12,000 × g at 4˚C for 10 min, the supernatant was discarded. The remaining precipitate was washed with cold acetone three times. The protein was redissolved in 8 M urea and the protein concentration was determined with a BCA Kit according to the manufacturer's instructions.
Trypsin digestion. For digestion, the protein solution was reduced with 5 mM dithiothreitol for 30 min at 56˚C and alkylated with 11 mM iodoacetamide for 15 min at 25˚C in darkness. The protein sample was then diluted by adding 100 mM TEAB to a urea concentration less than 2 M. Finally, trypsin was added at a 1:50 trypsin-to-protein mass ratio for the first digestion overnight and at a 1:100 trypsin-to-protein mass ratio for a second 4 h digestion.
TMT/iTRAQ labeling. After trypsin digestion, the peptide was desalted with a Strata X C18 SPE column and vacuum-dried. The peptide was then reconstituted in 0.5 M TEAB and processed with the TMT Kit/iTAQ Kit according to the manufacturer's protocol for the TMT Kit/iTRAQ Kit.
HPLC fractionation. The tryptic peptides were fractionated by high pH reverse-phase HPLC using a Thermo Betasil C18 column (5 μm particles, 10 mm ID, 250 mm length). Briefly, the peptides were first separated with a gradient of 8% to 32% acetonitrile (pH 9.0) over 60 min into 60 fractions. Then, the peptides were combined and dried by vacuum centrifugation.
LC-MS/MS analysis. The tryptic peptides were dissolved in 0.1% formic acid and directly loaded onto a homemade reversed-phase analytical column (15-cm length, 75 μm i.d.). The gradient consisted of an increase from 6% to 23% solvent B (0.1% formic acid in 98% acetonitrile) for 26 min, 23% to 35% for 8 min, climbing to 80% over 3 min and the holding at 80% for the last 3 min, all at a constant flow rate of 400 nL/min using an EASY-nLC 1000 UPLC system.
The peptides were subjected to an NSI source followed by tandem mass spectrometry (MS/ MS) in a Q ExactiveTM Plus (Thermo) coupled online to the UPLC. The applied electrospray voltage applied was 2.0 kV. The m/z scan range was 350 to 1800 for a full scan, and intact peptides were detected in the Orbitrap at a resolution of 70,000. Peptides were then selected for MS/MS using the NCE setting of 28, and the fragments were detected in the Orbitrap at a resolution of 17,500. The data-dependent procedure that alternated between one MS scan followed by 20 MS/MS scans with 15.0 s dynamic exclusion. Automatic gain control (AGC) was set at 5E4. The fixed first mass was set to 100 m/z. Database search. The resulting MS/MS data were processed using the Maxquant search engine (v.1.5.2.8). Tandem mass spectra were searched against a database concatenated with a reverse decoy database. The protein sequence database is derived from the transcriptome sequencing data of previous studies (sequences: 25,412). Trypsin/P was specified as a cleavage enzyme, and up to 4 missing cleavages were allowed. The mass tolerance for precursor ions was set as 20 ppm in the first search and 5 ppm in the main search, and the mass tolerance for fragment ions was set to 0.02 Da. Carbamidomethyl on Cys was specified as a fixed modification, acetylation and oxidation on Met were specified as variable modifications. The FDR was adjusted to < 1%, and the minimum score for modified peptides was set to > 40.

Domain annotation.
Identified protein domain functional descriptions were annotated by InterProScan based on the protein sequence alignment method, and the InterPro domain database was used (http://www.ebi.ac.uk/interpro/).

Functional enrichment
Gene ontology enrichment analysis. Proteins were classified by GO annotation into the following three categories: biological process, cellular component and molecular function. For each category, a two-tailed Fisher's exact test was employed to test the enrichment of the identified modified protein against all proteins from the species database. The GO terms with a corrected p-value < 0.05 were considered significant.
Pathway enrichment analysis. The Kyoto Encyclopedia of Genes and Genomes (KEGG) database was used to identify enriched pathways using a two-tailed Fisher's exact test to test the enrichment of the identified modified protein against all proteins in the species database. A pathway with a corrected p-value < 0.05 was considered significant. These pathways were classified into hierarchical categories according to the KEGG website.
Protein domain enrichment analysis. For each category of proteins, the InterPro (a resource that provides functional analysis of protein sequences by classifying them into families and predicting the presence of domains and important sites) database was researched, and a two-tailed Fisher's exact test was employed to test the enrichment of the identified modified protein against all proteins in the species database. Protein domains with a corrected pvalue < 0.05 were considered significant.
Enrichment-based clustering. For further hierarchical clustering based on differentially modified protein functional classification, we first collated all the categories obtained after enrichment along with their P-value, and then filtered for those categories that were enriched in at least one of the clusters with a P-value <0.05. This filtered P-value matrix was transformed by the function x = −log10 (P-value). Finally, these x values were z-transformed for each functional category. The scores were then clustered by one-way hierarchical clustering in Genesis. Cluster membership was visualized by a heat map using the "heatmap.2" function from the "gplots" R-package.
Protein-protein interaction network. All differentially expressed modified protein database accessions or sequences were searched against the STRING database version 10.5 for protein-protein interactions. Only interactions between the proteins belonging to the searched data set were selected, thereby excluding external candidates. STRING defines a metric called the "confidence score" to define interaction confidence; we fetched all interactions that had a confidence score >0.7. The Interaction network from STRING was visualized in the R package "networkD3".

Results and discussion
Paper mulberry is a perennial tree species characterized by a higher growth rate and greater adaptability to adverse environments than other species. Although paper mulberry is a pioneer tree species in the karst region, we know very little about the resistance mechanisms of this plant at the genomic level. Protein acetylation, a type of PTMs, has been revealed to play critical roles in various physiological processes related to adaptive reactions [14][15][16]. Therefore, we used PTM technology and LC-MS/MS to systematically express the lysine-acetylated proteins in paper mulberry to further study its adaptive mechanism, and the workflow of experimental procedures used in the study was shown in Fig 1a. A total of 54617 secondary spectra were obtained by mass spectrometry during the identification (Fig 1b). When the mass spectrometry secondary spectra were searched against the protein theory data, the available efficiency was 14267, the spectrum utilization rate was 26.1%, and the peptides were resolved into 9005 peptides and 7018 acetylated peptides. A total of 7,130 acetylation sites were identified on 3179 proteins in this study and are included in the subsequent analyses (S1 Table).

Subsequent analysis of the acetylation sites by bioinformatics analysis confirms the modification sites and modified proteins
In this identification, the lengths of the peptides ranged from 7 to 30 amino acids, and most were between 7 and 25 amino acids (Fig 1c). The number of modification sites per protein ranged from 1 to 21 (Fig 1d), and more than 50% of proteins contained only one acetylation site. It is well-known that the histone core is a site for histone acetylation [17], for instance, rice GF14e encodes a 14-3-3 protein that negatively affects cell death and disease resistance in rice [18], and plant 14-3-3 proteins modulate important cellular processes by interacting with a diverse range of target proteins [19]. Additionally, different species may have different protein acetylation levels, sometimes varying significantly. In a comprehensive analysis of protein and nitrogen nutrition, a total of 1286 proteins with lysine in tea plant species were shown to be acetylated [20]; moreover, tea leaves in different periods will have different amounts of protein acetylated [21]. However, as the first lysine acetylation group map of woody plants, this analysis is expected to provide valuable resources for future PTM research.

Motif characteristics of the acetylated peptides
To better understand the features of the acetylated sites, the flanking amino acid residues from positions -10 to +10 around the acetylated lysine were analyzed (S2 Table). Motif analysis of the modified sites showed a predominance of 12 motifs, as shown in Fig 2a, including D � K ac R, K ac� Y, K ac� R, Y � K ac S, K ac� H, K ac� S, K ac� F, K ac� K, K ac� T, K ac� N, K ac� D, Y � K ac , K ac� W, T � K ac , F � K ac , and K ac� V. Among these motifs (Fig 2a and 2b), the following six positively charged residues: lysine (K), arginine (R), and histidine (H), serine (S), threonine (T), and tyrosine (Y), occurred most frequently at the +1 position, which were all in positively charged. Amino acid biases may reflect a bona fide preference or may be due to the preference of antibodies used for selective enrichment of acetylated peptides [22,23]. In previous studies, three motifs common in rice (K, H, and F) also existed in the Gram-negative marine bacterium V. parahemolyticus [24]. phenylalanine (F) was detected both upstream and downstream of the acetylated lysines. F is an essential amino acid and a precursor of thousands of secondary metabolites in animals that cannot synthesize F, indicating that these animals must obtain F directly or indirectly from plants [25]. Moreover, R, H, Y, and S are the most common amino acids in roseosporus motifs and Camellia sinensis [26, 27], Choudhary et al. also found that amino acids with a bulky side chain (mainly Y and F) were enriched in the -2 and +1 positions in human cells [28]. Therefore, the motif analysis suggests that acetylation preferentially occurs at alkaline and positively amino acids in nearby regions in paper mulberry, may be conserved and are important for lysine acetylation in plants, which may help elucidate the acetylation structure of paper mulberry.

Functional annotation of acetylated proteins
GO analysis is an important bioinformatics analysis method and tool for determining the expressing various properties of genes and gene products and can explain the biological effects of proteins from different perspectives in paper mulberry. We performed a statistical analysis of the distribution of proteins corresponding to the identified modification sites in the GO secondary annotation.
In the biological process category (Fig 3a), the most prevalent process GO terms were cellular metabolic process (12%), organic substance metabolic process (11%), and primary metabolic process (10%). It is also worth noting that response to stress accounted for 5%, mainly including salt stress protein and response to oxidative stress protein (S3 Table). Salt stress has an inhibitory effect on plants photosynthesis, and as the external salt concentration increases, the degree of inhibition is greater [29]. This allows us to understand the reasons why paper mulberry can grow well under salinization in karst areas. In addition, the ability of chloroplasts of drought-affected plants to use CO 2 in carbon assimilation is limited under the light, and therefore energy consumption is reduced [30], the proportion of electrons transferred to O 2 is relatively increased [31]; thus, O 2 and H 2 O 2 can be formed. With metal ion catalysis, it can form more active and aggressive -OH [32][33][34]. In the face of oxidative stress caused by drought, paper mulberry can provide rich antioxidants for stress proteins, intuitively providing evidence for the resistance of the tree in the molecular function classification. Among the cellular component category, intracellular (22%), intracellular organelle (19%), and membrane-bound organelle (18%) were the main components (Fig 3b). We also observed that most

PLOS ONE
acetylated proteins were related to heterocyclic compound binding (12%), organic cyclic compound binding (12%), hydrolase activity (11%), transferase activity (10%) and protein binding (10%) in the molecular function; others proteins accounted for a large proportion and the remainder accounted for less than 10% (Fig 3c). According to the functional annotation (S3 Table), enzymes related translation, transcription, ribosomal structure and biogenesis were identified. However, the majority of the enzymes are involved in metabolism-related categories, such as carbohydrate transport and metabolism, amino acid transport and metabolism, and nucleotide transport and metabolism. Early research found that metabolism-related enzymes can play an important role in the regulation of cellular metabolism. The observation of lysine-acetylated substrate protein in bacterial metabolic enzymes verifies the key functional role of such modifications in metabolism [35]. The various mechanisms play diverse role in the regulation of metabolic enzymes [36,37], and some proteins showing homology between different species have been characterized as being involved in lysine acetylation, indicating their evolutionary and functional conservation [38,39]. Subcellular localization analysis of acetylated proteins from paper mulberry shows that most proteins are expected to localize to the chloroplast (40.83%), cytoplasm (24.44) % and nucleus (17.71%), while others accounted for only 17.03% (Fig 3d).
A protein domain is a conserved part of a given protein sequence and structure that can evolve, function and exist independently of the rest of the protein chain. To better determine the proteins more prone to acetylation, we performed protein domain enrichment analysis on the acetylated proteome to confirm the previous conclusion (Fig 4a). We noticed that some

PLOS ONE
protein complexes, including parts of the chloroplast (≧68.56%) and plastid (≧66.85%), were preferentially acetylated. We learned from other studies that photosynthetic activities can be represented as the growth potentials of the plant [40][41][42][43], and that chlorophyll plays an important role in the response of leaf photosynthesis to environmental stresses [44]. Others have suggested that lysine acetylation may be an important posttranslational modification in the chloroplast [45]. Increasing photosynthetic energy-use efficiency and enhancing photosynthetic capacity may be the most successful mechanisms for alien species invasion and adaptability to adverse environments [46,47]. Meanwhile, structural constituents of ribosomes (20.15%), copper ion binding (19.49%), anion binding (12.85%), aminoacyl-tRNA ligase activity (12.67%), ligase activity, forming carbon-oxygen bonds (12.67%), oxidoreductase activity, and acting on NAD(P)H (12.53%) were also enriched (Fig 4a), which are important in various cellular functions in plants. A total of 20 pathways were enriched in the KEGG enrichment (Fig 4b), including limonene and pinene degradation, TCA cycle, proteasome, glyoxylate and dicarboxylate metabolism, ribosome and so on. The cellular localization and function of proteins are often dictated by their domains [40], among in the identified protein domains (Fig  4c), glutathione-, thioredoxin-, and ATPase-related proteins were significantly enriched.
This result indicates that lysine acetylation tends to target large macromolecular complexes, as has been reported in humans [40], associated with various processes, such as substance transport and metabolism, oxidation-reduction, protein synthesis, and chromatin remodeling [48]. Lysine acetylation also provides material transport and plays roles in metabolism and redox reactions, which can play a role in the regulation of cellular metabolism and stress responses in fraternization. Expansion, antioxidant, metabolic, and detoxification effects may be related to the adaptability of paper mulberry to extreme environments and medicinal functions.
In general, for paper mulberry, lysine acetylation may regulate various aspects of primary carbon and nitrogen metabolism as well as secondary metabolism. However, further confirmation is required. Key acetylated proteins should be purified and selected, and functional studies should be performed to examine what happens following the vitro site-directed mutagenesis of lysine to arginine or glutamine [49][50][51].

Abundant lysine acetylation in photosynthesis and phenylalanine metabolism of paper mulberry
Previously large-scale proteomics studies have shown that lysine acetylation is widespread in the mitochondria [52,53], but the analysis of acetylated proteins from paper mulberry suggesting that the acetylated proteins located in the chloroplast might play important roles in regulating photosynthesis.
Our study showed that 16 acetylated proteins were involved in photosynthesis (Fig 5a). This process is catalyzed by these multisubunit membrane-protein complexes, including photosystems I and II, the photosynthetic electron transport, and F-type ATPases. Additionally, phenylalanine metabolism plays an important role in plant growth, development and pathological/stress response processes. In the KEGG pathway, there were 6 acetylated proteins in the phenylalanine pathway (Fig 5b), the differentially expressed genes were annotated as phenylalanine ammonia-lyase (PLA), aspartate aminotransferase, histidinol-phosphate aminotransferase, aromatic-L-amino-acid/L-tryptophan decarboxylase, enoyl-coA hydratase and aspartate aminotransferase, mitochondrial, which are key synthase genes in the lignin monomer synthesis pathway. The downstream branch pathways are mainly divided into the flavonoid synthesis pathways and lignin synthesis pathways [54]. Entry into a specificity downstream branch pathway generates a specific metabolic product of benzene propane. For example, the downstream

PLOS ONE
branch pathway promoted the reaction of phenyl propane metabolites, including coumarin, flavonol, lignin, cork vinegar and other benzene compounds (phytoalexin, protection factor, flower and fruit pigment); these structural cell wall components and signaling molecules, such as play a different role in plant growth and development.
To our knowledge, photosynthesis plays a very important role in plant responses to drought stress, and the phenylalanine metabolic pathway is of great significance to mulberry structure, as it also produces a variety of secondary metabolites such as flavonoids, lignin, plant protections, chlorogenic acid, etc. These secondary metabolites have important roles in plant growth and development, disease resistance and adverse reaction resistance.

Interaction network of acetylated protein
To better understand the biological pathways and visualize virtually all cellular processes [55], we further built a PPI network for acetylated proteins (Fig 6, S4 Table). The detailed interaction of acetylation regulation processes and modified proteins can be deeply investigated in depth from this network (S4 Table). As shown in the figure (Fig 6), a large and complex acetylated PPI network was constructed based on the number of proteins. In total, 1071 acetylated proteins were connected with one another; among them, 91 acetylated proteins were defined as nodes. The top cluster was the ribosome network, which consisted of 84 ribosome-associated proteins. Thirty-one amino-tRNA biosynthesis-related proteins were contained in the cluster II, while the remaining proteins were grouped as glycolysis/gluconeogenesis, the citrate cycle (TCA cycle), glyoxylate and dicarboxylate metabolism, and carbon fixation in photosynthetic organisms. Node degree is a key parameter for evaluating proteins in the network; among them, 4 proteins displayed the highest degree (�40, S4 Table) and were located in chloroplasts, indicating that these proteins are dominant in chloroplasts. These results indicate that acetylated proteins are involved in a broad protein interaction network in paper mulberry.

Acetylation associated with stress adaptation and protein in paper mulberry
Considering the challenging environment of the karst landform where the paper mulberry is located, such as drought and lack of nutrients, we specifically analyzed the acetylated proteins that may directly respond to stress. Based on the annotation, PAL (Phenylalanine ammonialyase), HSP70 (heat shock protein 70), and ERF (Ethylene-responsive transcription factor) were identified. The PAL family in the plants Arabidopsis (Arabidopsis thaliama), poplar (Populus trichocarpa) and rice (Oryza sativa) is composed of 4, 5 and 9 members, respectively, and they play an important role in resisting external environmental stress [56][57][58]. Moreover, HSP70 and ERF are both well-known for their function in cell response to stresses, such as those imposed by thermal stimuli and osmotic pressure [59][60][61].
It has been discovered this year that HSP70 is mainly distributed in the cytoplasm, endoplasmic reticulum, mitochondria and chloroplast [62]. It is an important protein in plant abiotic and disease stress response. At the present, a large number of studies have shown that temperature is closely related to HSP70 expression. Drought stress can promote HSP70 expression in wheat, rice, maize and Eupatorium adenophorum [63][64][65][66], indicating that HSP70 may play a role as a molecular chaperone under adverse conditions to improve the ability of plants to cope with adverse environments by maintaining the stable conformation of proteins related to plant growth and development [67]. However, there is little evidence of HSP70 expression in woody plants. In this study, HSP70 acetylation sites were found, displayed 12 sites, and the deduced amino acids (aa) sequence contained 706 aa. Comparison on this protein's sequences in other nine plants showed an overall consistency is 94.4% (Fig 7). The protein amino acid sequence conservation at the N-terminus is lower than that at the C-terminus, but there is a highly conserved motif structure (VIDADEFDS) at the C-terminus. In this study, the acetylated protein BrHSP70 is a very important member of the HSP family, it exists widely in nature and is highly conserved. Although different species have significant evolutionary differences, the evolution of HSP70 genes in plants is conserved [68]. Both proteins have an N-terminal accounting binding region and a C-terminal substrate binding region. There is an approximately 45kDa accounting binding region at the N terminal, that can hydrolyze APT and an approximately 25kDa substrate binding region at the C terminal can be exposed to the outside of the polypeptide substrate. The unfolded hydrophobic domain binds specifically, and SBD and NBD are connected by a twisted chain structure to function as a molecular chaperone [69][70][71]. The acetylated of BrHSP70, a typical motif characteristic at the C-terminus, directs the proteins to the cytoplasm. It is speculated that BrHSP70 can regulate the physiological functions of cells under adversity stress and participate in the correct assembly of proteins as a PLOS ONE molecular chaperone to maintain specific protein conformations. We can infer that this lysine acetylation probably participates in the specific adaptation to the arid environment in paper mulberry, but the real role needs to be further verified.

Conclusion
This study provides lysine acetylation data for paper mulberry in the karst region. A total of 7,130 acetylation sites were identified on the 3179 proteins. The acetylation sites are located in different organelles and involved in various processes. Abundant lysine acetylation was discovered in the chloroplast, cytoplasm and nucleus. The acetylation of the key stress resistance protein HSP70 in the woody plant paper mulberry was clearly characterized for the first time. In addition, our method may be applicable to the comprehensive determination of lysine acetylation in other plants.
Supporting information S1