Protein phosphorylation is a complex regulatory event that is involved in the signaling networks that affect virtually every cellular process. The protein phosphorylation may be a novel source for discovering biomarkers and drug targets. However, a systematic analysis of the phosphoproteome in patients with SLE has not been performed. To clarify the pathogenesis of systemic lupus erythematosus (SLE), we compared phosphoprotein expression in PBMCs from SLE patients and normal subjects using proteomics analyses. Phosphopeptides were enriched using TiO2 from PBMCs isolated from 15 SLE patients and 15 healthy subjects and then analyzed by automated LC-MS/MS analysis. Phosphorylation sites were identified and quantitated by MASCOT and MaxQuant. A total of 1035 phosphorylation sites corresponding to 618 NCBI-annotated genes were identified in SLE patients compared with normal subjects. Differentially expressed proteins, peptides and phosphorylation sites were then subjected to bioinformatics analyses. Gene ontology(GO) and pathway analyses showed that nucleic acid metabolism, cellular component organization, transport and multicellular organismal development pathways made up the largest proportions of the differentially expressed genes. Pathway analyses showed that the mitogen-activated protein kinase (MAPK) signaling pathway and actin cytoskeleton regulators made up the largest proportions of the metabolic pathways. Network analysis showed that rous sarcoma oncogene (SRC), v-rel reticuloendotheliosis viral oncogene homolog A (RELA), histone deacetylase (HDA1C) and protein kinase C, delta (PRKCD) play important roles in the stability of the network. These data suggest that aberrant protein phosphorylation may contribute to SLE pathogenesis.
Citation: Zhang X, Ma H, Huang J, Dai Y (2012) Characterization of the Phosphoproteome in SLE Patients. PLoS ONE 7(12): e53129. doi:10.1371/journal.pone.0053129
Editor: Leighton R. James, University of Florida, United States of America
Received: August 4, 2012; Accepted: November 23, 2012; Published: December 28, 2012
Copyright: © 2012 Zhang et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This study was financially supported by National Nature Science Foundation of China (grant number 30972741\C080701). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Protein phosphorylation is a widespread post-translational modification (PTM). Reversible protein phosphorylation, in which phosphate groups are enzymatically added by protein kinases and removed from proteins by phosphatases, often serves as a molecular switch in signaling pathways. Disruptions in phosphorylation-mediated cell signaling events are connected with numerous diseases , , , , . Furthermore, the abnormal expression of protein kinases is an important cause or component of many pathologies. Therefore, the characterization of the phosphorylation sites of proteins within various signaling pathways can enhance the understanding of specific disease pathologies . Phosphoproteomics is defined as the study of the components of the proteome that undergo phosphorylation. Systemic lupus erythematosus (SLE) is a classical autoimmune disease. The disease incidence is nine times greater in women than in men , and its estimated prevalence in China is 37.7/100,000 persons . However, the details of SLE etiology remain poorly understood. In this study, we thoroughly explored the phosphopeptide proteome of human Peripheral blood mononuclear cells (PBMCs) using a highly sensitive Liquid chromatography-mass spectrometry (LC-MS/MS) system, improved software for phosphopeptide identification and subsequent analysis with an elaborate bioinformatics strategy, including gene ontology (GO) analysis, pathway analysis and protein network analysis. The rich data from the proteomic analysis also provides insight into the pathogenesis of SLE.
Materials and Methods
Patient Assessments and Classifications
This study protocols and consent forms were approved by the Second Clinical Medical College (Shenzhen People’s Hospital) of Jinan University and adhere to the Helsinki Declaration guidelines on ethical principles for medical research involving human subjects. Written informed consent was obtained from all participants. A group of 15 SLE patients who had never been treated with disease-modifying antirheumatic drugs (DMARDs) or other immunosuppressive drugs was recruited for this study. Patients treated with nonsteroidal anti-inflammatory drugs or other symptomatic treatments were excluded. All patients satisfied the American College of Rheumatology classification criteria for SLE. In addition, we choose 15 healthy subjects as controls.
PBMCs Isolation and Protein Extraction
Peripheral blood mononuclear cells (PBMCs) were separated by a Ficoll-Paque (Sigma, St. Louis, MO, USA) density gradient centrifugation according to the manufacturer’s instructions. In brief, 2 ml blood (with EDTA as an anticoagulant) was layered on 3 ml Ficoll-Hypaque (Sigma) and centrifuged for 25 min at 1300 rpm at room temperature. Mononuclear cells at the interface were aspirated with a Pasteur pipette, washed twice in PBS with centrifugation for 10 min at 900 rpm at room temperature and resuspended in 500 µl SDT lysate (Invitrogen, Carlsbad, USA). The samples were then stored at −80°C until further use.
Phosphopeptides from digested peptides were enriched using the Phosphopeptide Enrichment TiO2 kit (Calbiochem, San Diego, CA) according to the manufacturer's instructions. Briefly, the tryptic digest was dried, dissolved in 200 µL TiO2 Phosphobind buffer containing 50 g/L 2,5-dihydroxybenzoic acid (DHB) and then mixed with 50 µL TiO2 Phosphobind Resin. After a 40 minute incubation, the supernatant was discarded, and the TiO2 resin was washed twice with the washing buffer. Then, elution buffer was added to elute the phosphopeptides in two batches. The eluted supernatant was pooled and dried by evaporation for LC-MS/MS analysis.
The dried phosphopeptides were subjected to LC-MS/MS analysis with a Finnigan Surveyor High Performance Liquid Chromatography(HPLC) system coupled with a LTQ-Orbitrap XL mass spectrometer (Thermo Electron, San Jose, CA). Briefly, the peptide mixtures were loaded onto a C18 column (100 µm i.d., 10 cm long, 5 µm, resin from Michrom Bioresources, Auburn, CA) using an autosampler. Peptides were eluted in a 5–35% gradient of buffer solution over 180 min and then detected in the LTQ-Orbitrap XL mass spectrometer as described previously , .
Raw MS Data Analysis
Raw Orbitrap full-scan MS and ion trap MSA spectra were processed using the MaxQuant algorithms , . Peptides and proteins were identified by Mascot through automated database matching of all tandem mass spectra against an in-house curated concatenated target database. Scoring was performed in MaxQuant as described previously. We required strict trypsin enzyme specificity and allowed up to two missed cleavage sites. Cysteine carbamidomethylation (Cys, +57.021464 Da) was searched as a fixed modification, whereas N-acetylation of proteins (N-terminal, +42.010565 Da), oxidized methionine (Met,+15.994915 Da), and serine, threonine, and tyrosine phosphorylations (Ser/Thr/Tyr, +79.966331 Da) were searched as variable modifications.
Peptide filtering and Phosphorylation Site Identification
The Mascot result files were imported into the MaxQuant software suite for further processing. In MaxQuant, we defined the estimated false discovery rate (FDR) of all peptide and protein identifications at 1% by automatically filtering based on peptide length, mass error precision estimates, and the Mascot scores of all forward and reversed peptide identifications. The final estimate of true phosphorylated amino acids remaining within all identified phosphopeptide sequences was calculated in MaxQuant based on the localization probabilities of all assumed threonine, serine and tyrosine phosphorylation sites using the PTM score algorithm, as described previously .For protein identification, we used IPI database. A protein group was removed if all identified peptides assigned to this protein group were also assigned to another protein group. Tosort out a single protein member from a protein group, we chose the protein from the Swiss-Prot database and with the highest sequence coverage. When using label-free approach to identify differently expressed protein and calculating the coefficient of variance, the number of spectra of each protein was logarithmtransformed.
Different Gene Screening and Statistical Analyses
For screening of phosphorylation sites between the two groups, we used the following method. 1, caculate the fold change between the two groups. 2. set threshold value is 1, that is the average fold change between SLE patients group and healthy controls group was more than or equal to 2 folds; and the p value of single sample t-test was less than or equal to 0.05. T-test was conducted using MATLAB 7.5. 3. labeling the gene name corresponding protein according to the NCBI annotation information.
The expression values calculated for the differential proteins and peptides were used for distance and average to determine linkage for gene ontology (GO) analysis. In pathway analysis, interactions between genes in the range of genomes were analyzed by downloading the pathway data in Kyoto Encyclopedia of Genes and Genomes(KEGG). Finally, the results of the above data were merged into a comprehensive gene inter-relationship network. The established gene network was able to directly reflect the inter-relationships between genes at a whole-cell level, as well as the stability of the gene regulatory network.
The Clinical Characteristics of the Study Population
A total of 30 subjects were in the study group, which included 15 SLE patients and 15 healthy controls. In Table 1, the clinical characteristics of the study population are summarized.
Pretreatment of the Raw Data and Screening of Different Genes
A phosphorylated peptide reagent kit was used to enrich the sample for phosphorylated proteins, thus combining protein separation enrichment technology and mass spectrometry technology. The detailed information on the identified phosphoproteins/phosphopeptides according to the mass spectrometric results (Table S1). A total of 1035 phosphorylation sites, corresponding to 618 NCBI-annotated genes, were identified as differentially modified in SLE patients compared with normal subjects.
GO Annotation and Analysis of the Differences in Phosphoproteins
The phosphoproteins characterized in the study were evaluated based on their molecular function, biological process and cellular component annotations. As shown in Figure 1, proteins from various cellular components (e.g., the nucleus, plasma membrane, cytosol, cytoskeleton, and Golgi apparatus) were included. The most enriched cellular components were nuclear proteins and proteins associated with the plasma membrane, cytosol or cytoskeleton. Functionally, the phosphoproteins characterized in the study are diverse. We grouped the identified phosphoproteins into several categories based on their molecular functions as annotated in the Swiss-Prot database. The distribution of the phosphoproteins among the various functional categories is shown in Figure 2. The largest group is comprised of proteins with roles in protein binding. The other three largest groups are proteins involved in catalytic activity, nucleic acid binding and nucleotide binding. The distribution of phosphoproteins by biological process is shown in Figure 3. The largest group contains proteins related to nucleobase, nucleoside, nucleotide and nucleic acid metabolism. Two other large groups are the proteins involved in cellular component organization and transport.
The most enriched cellular components were nuclear proteins and proteins associated with the plasma membrane, cytosol or cytoskeleton. The information was compiled from Gene Ontology annotations.
The largest group is constituted by protein binding followed by catalytic activity and nucleic acid binding. The information was compiled from Gene Ontology annotations.
The largest group contains proteins related to nucleobase, nucleoside, nucleotide and nucleic acid metabolism. Two other large groups are the proteins involved in cellular component organization and transport. The information was gathered based on Gene Ontology annotations.
Signaling Pathway Analyses
We next wanted to determine whether specific pathways are enriched in the set of proteins present in our phosphotyrosine database. Similar to the strategy used for the GO analysis, we mapped differentially modified genes to the KEGG pathway database using GenMAPP v2.1 and then performed a statistical test to identify enriched metabolic pathways, using P<0.05 as the standard. We selected 50 metabolic pathways (Table 2). The top KEGG pathway was the MAPK signaling pathway (Figure 4).
Red marks indicate the genes with differential phosphorylation profiles.
Gene Network Analysis
We integrated the following three different interaction relationships: 1) the gene regulation and protein modification relationships of genes in the KEGG database and other relationships; 2) interaction data from high-flux experiments, such as protein-protein interactions confirmed by yeast two-hybrid; 3) gene-gene interactions described in the literature. Specifically, we downloaded the pathway data from KEGG database and analyzed genome-wide genetic interactions in R (http://www.r-project.org/) and downloaded the KEGGSOAP package (http://www.bioconductor.org/packages/2.4/bioc/html/KEGGSOAP.html). Finally, we integrated the relationships in a gene network (Figure 5). Genes with large numbers of connections were referred to as “hub” genes. Hub genes often play important roles in network stability. We identified SRC, RELA, HDA1C and PRKCD as hub genes in our network (Figure 6).
The network can reflect the relationship between genes from the situation as a whole. Blue means expression, gray means binding and purple means ptmod (post-transcription modification).
Protein phosphorylation is the most common posttranslational modification (PTM) in the biosphere , . Approximately 30% of proteins can be phosphorylated  at threonine, tyrosine and serine residues . Protein phosphorylation becomes disordered when protein kinase or phosphatase activity is overexpressed or inhibited, resulting in abnormal cellular activities and producing cell damage or even cancer , . Phosphoproteomics requires powerful analytical technologies and bioinformatics tools. Several recent reviews have summarized the development of various phosphoproteomic methodologies. These methods typically combine different separation strategies with mass spectrometry , , . The successful application of proteomic technologies to biomedical and clinical research has enabled the discovery of disease-specific biomarkers for diagnosis and treatment monitoring, thus offering insight into the underlying pathologies of diseases and identification of new therapeutic targets.
In this study, we used a phosphorylated peptide reagents kit to enrich the samples for phosphorylated proteins and then combined this technique with mass spectrometry technology. A total of 1035 phosphorylation sites corresponding to 618 annotated genes were identified as differentially modified in SLE compared with normal subjects.
GO analyses showed that the most highly differentially expressed genes were related to nucleic acid metabolism, cellular component organization, transportation, protein modification, cell cycle, cell communication, multicellular organismal development, carbohydrate metabolic process, lipid metabolism and protein translation processes. Nucleic acid metabolism, cellular component organization, transport and multicellular organismal development were the dominant processes. Pathway analysis showed that 50 metabolic pathways are modified in SLE pathogenesis. Notably, MAPK signaling, actin cytoskeleton regulation, chemokine signaling pathway, Fc gamma R-mediated phagocytosis, Herpes simplex infection, spliceosome, vascular smooth muscle contraction and RNA transport process components made up a larger proportion of the genes in these 50 metabolic pathways. The MAPK signaling pathway was highlighted as the most important pathway.
SLE is a chronic autoimmune disorder that is characterized by lymphocyte abnormalities and autoantibody production . Hoffmant showed that immune tolerance defects in the peripheral blood T-lymphocytes of SLE patients related to the abnormal regulation of the MAPK signaling pathway, which directly results in abnormal TCR-mediated intracellular signaling and T lymphocyte function , . The MAPK signaling pathway has important functions in many types of mammalian cells. Mitogen-activated protein kinases (MAPKs) are serine and threonine protein kinases that can be activated by phosphorylation in response to extracellular stimuli, such as mitogens, growth factors, cytokines, and osmotic stress , . The activation of MAPK pathways has been shown to be a potential pro-inflammatory mechanism in rheumatoid arthritis , , . During inflammation, MAPK is activated in various immune cells, and its activation is closely related to stress responses and apoptosis . Our results demonstrated that the MAPK signal pathway is abnormally activated in PBMCs from SLE patients, which provided an experimental basis for researching SLE pathogenesis and exploring new therapies. We believe that interventions in or regulation of this signaling pathway may be useful therapies for treating SLE and related diseases.
SRC was the first protein found to have tyrosine protein kinase activity, and its activity is itself regulated by phosphorylation and dephosphorylation. MAPK signaling pathways control multiple physiological processes and are involved in a variety of diseases. Ras, the activating protein upstream of the MAPK pathway, is directly regulated by SRC activity. The activation of the MAPK pathway downstream of Src phosphorylation leads to transcriptional activation. Meanwhile, the inhibition of MAPK pathway activation can partially reverse the effects of SRC protein activity . In particular, as suggested by protein network analysis, genes with many connections within the network were identified as the hub genes. Hub genes often play an important role in the stability of the network. We found that SRC, RELA, HDA1C and PRKCD were the hub genes in our network. These results demonstrated that SRC plays a central role in the stability of the network, suggesting it is important in the pathogenesis of SLE, which provides an experimental basis for researching the pathogenesis of lupus and exploring new treatment methods for SLE.
This experiment thoroughly characterized the phosphorylated protein expression profile in PBMCs of SLE patients. These data will serve as a reference and supplement to help us better understand the pathogenesis of SLE. Furthermore, interventions that modulate the activities of the involved genes and pathways may be able to block or slow the onset and development of SLE.
The information of phosphoproteins/phosphopeptides. The detailed information on the identified phosphoproteins/phosphopeptides according to the mass spectrometric results.
Conceived and designed the experiments: YD. Performed the experiments: HM XZ. Analyzed the data: XZ HM. Contributed reagents/materials/analysis tools: JH. Wrote the paper: XZ HM.
- 1. Zahid S, Oellerich M, Asif AR, Ahmed N (2012) Phosphoproteome profiling of substantia nigra and cortex regions of Alzheimer's disease patients. J Neurochem 121: 954–963.
- 2. Di Domenico F, Sultana R, Barone E, Perluigi M, Cini C, et al. (2011) Quantitative proteomics analysis of phosphorylated proteins in the hippocampus of Alzheimer's disease subjects. J Proteomics 74: 1091–1103.
- 3. Chien KY, Liu HC, Goshe MB (2011) Development and application of a phosphoproteomic method using electrostatic repulsion-hydrophilic interaction chromatography (ERLIC), IMAC, and LC-MS/MS analysis to study Marek's Disease Virus infection. J Proteome Res 10: 4041–4053.
- 4. Wu HY, Tseng VS, Chen LC, Chang HY, Chuang IC, et al. (2010) Identification of tyrosine-phosphorylated proteins associated with lung cancer metastasis using label-free quantitative analyses. J Proteome Res 9: 4102–4112.
- 5. Popova TG, Turell MJ, Espina V, Kehn-Hall K, Kidd J, et al. (2010) Reverse-phase phosphoproteome analysis of signaling pathways induced by Rift valley fever virus in human small airway epithelial cells. PLoS One 5: e13805.
- 6. Giorgianni F, Zhao Y, Desiderio DM, Beranova-Giorgianni S (2007) Toward a global characterization of the phosphoproteome in prostate cancer cells: identification of phosphoproteins in the LNCaP cell line. Electrophoresis 28: 2027–2034.
- 7. Rahman A, Isenberg DA (2008) Systemic lupus erythematosus. N Engl J Med 358: 929–939.
- 8. Xiang YJ, Dai SM (2009) Prevalence of rheumatic diseases and disability in China. Rheumatol Int 29: 481–490.
- 9. Li X, Zhang Y, Zeng X, Yang L, Deng Y (2011) Chemical profiling of bioactive constituents in Sarcandra glabra and its preparations using ultra-high-pressure liquid chromatography coupled with LTQ Orbitrap mass spectrometry. Rapid Commun Mass Spectrom 25: 2439–2447.
- 10. Fernbach NV, Planyavsky M, Muller A, Breitwieser FP, Colinge J, et al. (2009) Acid elution and one-dimensional shotgun analysis on an Orbitrap mass spectrometer: an application to drug affinity chromatography. J Proteome Res 8: 4753–4765.
- 11. Cox J, Matic I, Hilger M, Nagaraj N, Selbach M, et al. (2009) A practical guide to the MaxQuant computational platform for SILAC-based quantitative proteomics. Nat Protoc 4: 698–705.
- 12. Cox J, Mann M (2008) MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat Biotechnol 26: 1367–1372.
- 13. Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, et al. (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks. Cell 127: 635–648.
- 14. Melo-Braga MN, Braga TV, Leon IR, Antonacci D, Nogueira FC, et al. (2012) Modulation of protein phosphorylation, glycosylation and acetylation in grape (Vitis vinifera) mesocarp and exocarp due to Lobesia botrana infection. Mol Cell Proteomics 11: 945–956.
- 15. Lin SY, Li TY, Liu Q, Zhang C, Li X, et al. (2012) Protein phosphorylation-acetylation cascade connects growth factor deprivation to autophagy. Autophagy 8: 1385–1386.
- 16. Raggiaschi R, Gotta S, Terstappen GC (2005) Phosphoproteome analysis. Biosci Rep 25: 33–44.
- 17. Wu HY, Tseng VS, Liao PC (2007) Mining phosphopeptide signals in liquid chromatography-mass spectrometry data for protein phosphorylation analysis. J Proteome Res 6: 1812–1821.
- 18. Yang F, Stenoien DL, Strittmatter EF, Wang J, Ding L, et al. (2006) Phosphoproteome profiling of human skin fibroblast cells in response to low- and high-dose irradiation. J Proteome Res 5: 1252–1260.
- 19. Ge F, Xiao CL, Bi LJ, Tao SC, Xiong S, et al. (2010) Quantitative phosphoproteomics of proteasome inhibition in multiple myeloma cells. PLoS One 5: e13095.
- 20. Hoffert JD, Knepper MA (2008) Taking aim at shotgun phosphoproteomics. Anal Biochem 375: 1–10.
- 21. Collins MO, Yu L, Choudhary JS (2007) Analysis of protein phosphorylation on a proteome-scale. Proteomics 7: 2751–2768.
- 22. Gafken PR, Lampe PD (2006) Methodologies for characterizing phosphoproteins by mass spectrometry. Cell Commun Adhes 13: 249–262.
- 23. La Cava A (2009) Lupus and T cells. Lupus 18: 196–201.
- 24. Kyttaris VC, Tsokos GC (2004) T lymphocytes in systemic lupus erythematosus: an update. Curr Opin Rheumatol 16: 548–552.
- 25. Hoffman RW (2004) T cells in the pathogenesis of systemic lupus erythematosus. Clin Immunol 113: 4–13.
- 26. De Luca A, Maiello MR, D'Alessio A, Pergameno M, Normanno N (2012) The RAS/RAF/MEK/ERK and the PI3K/AKT signalling pathways: role in cancer pathogenesis and implications for therapeutic approaches. Expert Opin Ther Targets 16 Suppl 2: S17–27.
- 27. Chang L, Karin M (2001) Mammalian MAP kinase signalling cascades. Nature 410: 37–40.
- 28. Lopez-Santalla M, Salvador-Bernaldez M, Gonzalez-Alvaro I, Castaneda S, Ortiz AM, et al. (2011) Tyr(3)(2)(3)-dependent p38 activation is associated with rheumatoid arthritis and correlates with disease activity. Arthritis Rheum 63: 1833–1842.
- 29. Kanbe K, Chen Q, Nakamura A, Hobo K (2011) Inhibition of MAP kinase in synovium by treatment with tocilizumab in rheumatoid arthritis. Clin Rheumatol 30: 1407–1413.
- 30. Thiel MJ, Schaefer CJ, Lesch ME, Mobley JL, Dudley DT, et al. (2007) Central role of the MEK/ERK MAP kinase pathway in a mouse model of rheumatoid arthritis: potential proinflammatory mechanisms. Arthritis Rheum 56: 3347–3357.
- 31. Niu X, Fan T, Li W, Xing W, Huang H (2012) The anti-inflammatory effects of sanguinarine and its modulation of inflammatory mediators from peritoneal macrophages. Eur J Pharmacol 689: 262–269.
- 32. Kline CL, Olson TL, Irby RB (2009) Src activity alters alpha3 integrin expression in colon tumor cells. Clin Exp Metastasis 26: 77–87.