Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Mass spectrometry-based serum peptidome profiling accurately and reliably predicts outcomes of pemetrexed plus platinum chemotherapy in patients with advanced lung adenocarcinoma

  • Lin Wang,

    Affiliation Department of Lung Cancer, Affiliated Hospital of Academy of Military Medical Sciences, Beijing, China

  • Chuanhao Tang,

    Affiliation Department of Lung Cancer, Affiliated Hospital of Academy of Military Medical Sciences, Beijing, China

  • Bin Xu,

    Affiliation National Center of Biomedical Analysis, Beijing, China

  • Lin Yang,

    Affiliation Department of Lung Cancer, Affiliated Hospital of Academy of Military Medical Sciences, Beijing, China

  • Lili Qu,

    Affiliation Department of Lung Cancer, Affiliated Hospital of Academy of Military Medical Sciences, Beijing, China

  • Liangliang Li,

    Affiliation Department of Lung Cancer, Affiliated Hospital of Academy of Military Medical Sciences, Beijing, China

  • Xiaoyan Li,

    Affiliation Department of Lung Cancer, Affiliated Hospital of Academy of Military Medical Sciences, Beijing, China

  • Weixia Wang,

    Affiliation Department of Lung Cancer, Affiliated Hospital of Academy of Military Medical Sciences, Beijing, China

  • Haifeng Qin,

    Affiliation Department of Lung Cancer, Affiliated Hospital of Academy of Military Medical Sciences, Beijing, China

  • Hongjun Gao,

    Affiliation Department of Lung Cancer, Affiliated Hospital of Academy of Military Medical Sciences, Beijing, China

  • Kun He,

    Affiliation National Center of Biomedical Analysis, Beijing, China

  • Xiaoqing Liu

    Affiliation Department of Lung Cancer, Affiliated Hospital of Academy of Military Medical Sciences, Beijing, China

Mass spectrometry-based serum peptidome profiling accurately and reliably predicts outcomes of pemetrexed plus platinum chemotherapy in patients with advanced lung adenocarcinoma

  • Lin Wang, 
  • Chuanhao Tang, 
  • Bin Xu, 
  • Lin Yang, 
  • Lili Qu, 
  • Liangliang Li, 
  • Xiaoyan Li, 
  • Weixia Wang, 
  • Haifeng Qin, 
  • Hongjun Gao



Although pemetrexed plus cis/carboplatin has become the most effective chemotherapy regimen for patients with advanced lung adenocarcinoma, predictive biomarkers are not yet available, and new tools to identify chemosensitive patients who would likely benefit from this treatment are desperately needed. In this study, we constructed and validated predictive peptide models using the serum peptidome profiles of two datasets.


One hundred eighty-three patients treated with first-line platinum-based pemetrexed treatment for advanced lung adenocarcinoma were retrospectively enrolled and randomized into the training (n = 92) or validation (n = 91) set, and pre-treatment serum samples were analyzed using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF-MS) and ClinProTools software. Serum peptidome profiles from the training set were used to identify potential predictive peptide biomarkers and construct a predictive peptide model for accurate group discrimination; which was then used to classify validation samples into “good” and “poor” outcome groups. The clinical outcomes of objective response rate (ORR), disease control rate (DCR), progression-free survival (PFS), and overall survival (OS) were analyzed based on the classification result.


Eight potential peptide biomarkers were identified. A predictive peptide model based on four distinct m/z features (2,142.12, 3,316.19, 4,281.94, and 6,624.02 Da) was developed based on the clinical outcomes of training set patients after first-line pemetrexed plus platinum treatment. In the validation set, the good group had significantly higher ORR (49.1% vs. 8.3%, P <0.001) and DCR (96.4% vs. 47.2%, P <0.001), and longer PFS (7.3 months vs. 2.7 months, P <0.001) vs. the poor group. However, the model did not predict OS (13.6 months vs. 12.7 months, P = 0.0675).


Our predictive peptide model could predict pemetrexed plus platinum treatment outcomes in patients with advanced lung adenocarcinoma and might thus facilitate appropriate patient selection. Further studies are needed to confirm these findings.


Currently, adenocarcinoma is the most common histological subtype of lung cancer, the leading cause of cancer-related deaths worldwide [1]. Despite great progress in targeted and immune therapies, chemotherapy remains the standard treatment for lung cancer. The American Society of Clinical Oncology suggests that patients with advanced non-squamous non-small cell lung cancer (NSCLC) patients who are not suitable for targeted therapy or immunotherapy should receive a platinum-based combination of two cytotoxic drugs [2]. In this context, pemetrexed is one of the most effective agents when combined with cisplatin or carboplatin [3,4]. However, a previous report stated a pooled objective response rate (ORR) and median progression-free survival (PFS) of 37.8% and 5.7 months, respectively, for platinum-based pemetrexed chemotherapy, indicating that many patients do not respond to this regimen [5]. Furthermore, unlike targeted or immune therapies, there are no clinical biomarkers to indicate which patients would benefit from pemetrexed or platinum-based chemotherapy. Recently, thymidylate synthase (TS) and excision repair cross-complementation group 1 (ERCC1) showed promise as predictive biomarkers for pemetrexed and platinum-based agents, respectively; however, these biomarkers must be assessed in tumor tissues and have not been validated prospectively in patients with lung adenocarcinoma [6,7]. Furthermore, a single predictive biomarker strategy is unrealistic because pemetrexed and platinum are usually administered in combination. Accordingly, new treatment selection tools are desperately needed to enhance the efficacy of this important regimen.

Recently, proteomic/peptidomic analyses, which complement genetic analyses, have become integral to investigations of tumor biology [8]. In addition, protein/peptide signatures can be tested using serum samples and may more accurately characterize the disease characteristics and development. Serum marker classification models yield patterns of multiple serum biomarkers, which provide better sensitivity and discrimination relative to a single biomarker. Many studies have tested the ability of proteomics/peptidomics strategies to facilitate early tumor detection and identify patients who would benefit from specific targeted therapies [914]. However, this method has not previously been used to predict chemotherapeutic outcomes.

Of the various proteomic/peptidomic techniques, matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry (MS) is uniquely suited to the analysis of complex biological samples [15]. MALDI-TOF MS devices comprise three main components: an ion source to ionize molecules and transfer them into a gas phase, a mass analysis device that separates molecules according by mass, and a detector that monitors the separated ions. During MALDI-TOF MS analysis, the separation of molecules by mass facilitates the creation of a mass spectrum characterized by ion masses and intensities. Previously, MALDI-TOF MS and bioinformatics have been combined to develop diagnostic or predictive models that could eventually be used in clinical practice [12,1618].

In this study, we used weak cation exchange magnetic beads coupled with MALDI-TOF MS to obtain serum peptidome profiles of a training dataset, which were then used to construct a predictive peptide model. The discriminative ability of this model was then tested using a validation dataset. Specifically, we used the ORR, DCRPFS, and OS to assess the predictive value of our predictive classification model.

Materials and methods

Patients and samples

A study overview is provided in Fig 1. A total of 183 patients with advanced lung adenocarcinoma who were treated with first-line pemetrexed plus platinum at the Affiliated Hospital of Academy of Military Medical Sciences from December 2012 to November 2014 were enrolled in this retrospective study. The eligibility criteria included a confirmed new diagnosis of advanced lung adenocarcinoma (stage IIIB or stage IV), no history of prior chemotherapy or targeted therapy, good organ function, and an Eastern Cooperative Oncology Group performance status of 0–2.

Fig 1. Study overview.

This flowchart describes the construction and testing of serum-based predictive peptide models for patients with advanced lung adenocarcinoma who were treated with first-line pemetrexed plus platinum-based chemotherapeutic regimens.

All patients received at least two cycles of pemetrexed plus cisplatin (500 mg/m2 and 75 mg/m2, respectively) or carboplatin (area under the curve: 5). Tumors were assessed at baseline using computed tomography, and the same radiological assessment was repeated every two cycles to assess the disease status. Disease response and progression were assessed using the Response Evaluation Criteria in Solid Tumors, version 1.0. Patients whose lesions disappeared during treatment were classified as having achieved a complete response (CR). Patients with a ≥30% decrease in the target lesion size were classified as having achieved a partial response (PR). Patients with a change in lesion size ranging from an increase of <20% to a decrease of <30% and no new lesions were classified as having stable disease (SD). Patients with an increase of ≥20% in lesion size or with new lesions were classified as having progressive disease (PD). The ORR was defined as the sum of the CR and PR rates, and the disease control rate (DCR) was defined as the sum of the ORR and SD rate.

PFS was defined as the time interval from the start of treatment to disease progression. OS was defined as the time from the date of diagnosis to the date of death or the last follow-up date (November 20, 2016). The smoking status was determined from patients’ medical records, and those who had smoked >100 cigarettes in lifetime were considered smokers. Pre-treatment (baseline) serum samples were obtained from consenting patients and collected in vacuum blood collection tubes containing coagulant and separation gel, separated via centrifugation (10 min at 4,000 rpm, 4°C), and stored at –80°C until analysis.

This study’s retrospective design was approved by the ethics committee of Affiliated Hospital of Academy of Military Medical Sciences (approval #2012-11-171). All patients provided informed consent to receive treatment and for the testing of their serum samples.

Study population and outcomes

Responses to chemotherapy were classified as CR, PR, SD and PD. As the sex ratio was imbalanced in favor of male patients, all patients were stratified and randomized into the training and validation sets by sex and treatment response before the analyses to balance the influences of these factors. The training group was used to develop peptide models that could discriminate patients who would and would not benefit from treatment. The validation group was then used to test the predictive power of the model derived from the training set.

Patients in the training group were divided into four clinical groups: PD (disease progression at ≤1.5 months), short SD (disease stability for ≤3 months), long SD (disease stability for >3 months), and PR. To optimize classification parameters, representative spectra of each clinical group were selected from among the training group. Good clinical outcomes included a CR, PR, or long SD, whereas and poor clinical outcomes included PD or short SD.

Sample preparation and mass analysis (peptide profiling)

Serum samples were thawed on ice and fractionated using weak cation exchange magnetic beads (MB-WCX, YiXin Bochuan Bio-Technique Co. Ltd., Beijing, China) before MS analysis, which was conducted from November 25 to December 5, 2016. These magnetic beads, which exhibited good peptide-capturing performance, were used to fractionate serum samples according to the manufacturer’s instructions. We added 5 μL of the serum sample to a pre-mixture of 10 μL of binding solution and 7 μL of MB-WCX beads in a polymerase chain reaction tube. The solution was intensively mixed, incubated for 5 min, and placed on a magnetic separator to isolate the unbound solution. The bound peptides were eluted from the magnetic beads after two rounds of bead separation and washing. Finally, 1 μL of the peptide eluate was mixed with 1 mL of MALDI-TOF matrix (a saturated solution of 4-hydroxy-3,5-dimethoxycinnamic acid in 50% acetonitrile with 0.5% trifluoroacetic acid), which was then spotted onto the sample anchor spots of an AnchorChip target plate (Bruker Daltonics Inc., Bremen, Germany). The MALDI-TOF MS analyses were performed on an Ultraflex III MALDI-TOF MS device (Bruker Daltonics Inc.) with the following settings: linear positive ion mode, repetition rate of 200 Hz, ion source voltages of 25 kV and 23.50 kV, lens voltage of 6.5 kV, pulsed ion extraction time of 100 ns, and nitrogen pressure of 1,700–2,000 mbar. All signals with a signal-to-noise ratio of >5 in a mass range of 800–10,000 Da were recorded using FlexAnalysis software (version 3.4; Bruker Daltonics Inc.). The peptidomic patterns and models were processed using ClinPro Tools bioinformatics software (version 3.0; Bruker Daltonics Inc.).

Data processing and statistical analysis

The following workflow was used for the data processing and analysis with FlexAnalysis and ClinPro Tools software: each spectrum was normalized to its total ion current and recalibrated according to the prominent and common m/z values, after which the baseline was subtracted, the peaks were smoothed before detection, and the peak areas were calculated for each spectrum. All peak signals were processed for noise reduction using a top-hat baseline in the 800–10,000 Da range. For the peptide peaks, the expressions of the same mass-to-charge ratios (m/z) were compared between the good and poor response groups using parametric testing (t test); we also tested whether these peptide patterns could be classified.

During model construction, we used only the spectra and clinical outcome data of the training set. Three algorithms (genetic algorithm [GA], supervised neural networks [SNN], and quick classifier [QC]) were used to establish the prediction models. Next, each model was applied to the validation set to test its ability to identify patients with good and poor responses. The validation process was performed in a blinded manner, and samples that had been classified prior to obtaining clinical outcome data were available to the investigators.

ORR and DCR outcomes were compared using the chi-square test or Fisher's exact test, whereas and PFS and OS were analyzed using the Kaplan–Meier method and the log-rank test. Survival outcomes were reported as median durations with 95% confidence intervals (CIs). Multivariable Cox proportional hazard analyses were performed to evaluate the relevance of various clinical features. All statistical tests were two-tailed, and a P-value of <0.05 was considered statistically significant. All statistical tests were performed using SPSS software (version 19; SPSS Inc., Chicago, IL, USA).


Patient characteristics

The clinical characteristics of the training and validation groups are presented in Table 1 and S1 Dataset (DOI: 10.6084/m9.figshare.5043280). Of the 183 patients included in this study, 92 and 91 were assigned to the training and validation groups, respectively. Most patients had stage IV disease and an Eastern Cooperative Oncology Group performance status of 1. All patients had histologically diagnosed adenocarcinoma and received first-line pemetrexed plus cis/carboplatin treatment.

Comparison of peptidomic data

We compared peptidome MS data from the good and poor responders in the training set. A total of 136 peptide peaks were detected; of these, eight peaks differed significantly between the good and poor outcome groups. Patients who achieved good clinical outcomes exhibited highly expressed mass spectra of m/z 3316.19, 6624.02, 2142.12, 4281.94, 3773.02, 3029.28, 3955.87, and 3323.95 Da (Table 2 and Figs 2 and 3).

Table 2. The eight mass peaks significantly differentially expressed in the training set.

Fig 2. Average serum peptide fingerprints.

(2A) Peptide profile of the good outcome group (green, n = 56) generated using ClinPro Tools. (2B) Peptide profile of the poor outcome group (red, n = 36) generated using ClinPro Tools. x-axis, mass to charge ratio; y-axis, relative intensity.

Fig 3. Significantly expressed mass peaks.

Eight peaks were significantly differentially expressed between the good outcome (green line) and poor outcome (red line) groups. x-axis, mass to charge ratio; y-axis, relative intensity.

Construction of predictive peptide models in the training set

The ClinPro Tools 3.0 software package (Bruker Daltonics) was used to analyze all serum sample data derived from the training set. The processed data were then used for visualization and statistical analysis. Statistically significant differences in peptide quantities were determined using Welch’s t-test at a significance level of P < 0.05. Data from the training set were subjected to three different mathematical model algorithms: the Genetic Algorithm (GA), Supervised Neural Network (SNN), and Quick Classifier (QC). We assessed the performances of these three models by considering the cross-validation and recognition capability. Finally, the QC algorithm was used to set a prediction model, and a peptidome pattern classification was constructed. This model was based on four significantly different peaks at m/z 2,142.12, 3,316.19, 4,281.94, and 6624.02 Da, which provided a recognition capability of 94.74% and cross-validation value of 91.74% (Table 3). This classification model correctly identified 100% of patients with poor outcomes and 83.47% of patients with good outcomes in the training set.

Table 3. Included peptide peaks and performances of predictive classification models.

Blind testing of the predictive peptide model in the validation set

Once the peptide prediction model was established with all parameters frozen, it was tested in the validation set of 91 patients in a blinded manner. All the samples were successfully classified as chemo good (n = 55, 60.4%) or chemo poor (n = 36, 39.6%). In other words, the model identified 55 and 36 patients as having achieved good and poor responses, respectively. When we compared the ORR, DCR, PFS, and OS of these two groups, we found that the chemo good group exhibited a significantly higher ORR (49.1% vs. 8.3%, P <0.001) and DCR (96.4% vs. 47.2%, P <0.001) and significantly longer PFS (7.3 months [95% CI: 6.735–7.865 months] vs. 2.7 months [95% CI: 0.939–4.461 months], P <0.001). However, the peptide model was not predictive of OS (13.6 months [95% CI: 11.109–16.091 months] vs. 12.7 months [95% CI: 10.201–15.199 months], P = 0.0675) (Table 4 and Fig 4).

Fig 4. Kaplan–Meier survival analysis.

(4A) Progression-free survival according to baseline classification results in the validation set (7.3 months [95% confidence interval (CI): 6.735–7.865 months] vs. 2.7 months [95% CI: 0.939–4.461 months], P <0.001). (4B) Overall survival according to baseline classification results in the validation set (13.6 months [95% CI: 11.109–16.091 months] vs. 12.7 months [95% CI: 10.201–15.199 months], P = 0.0675).

Multivariable analysis of the validation set

Cox multivariable analyses of PFS according to classification results, age, sex, and smoking status were performed using validation group data. The analyses demonstrated that only the classification results were independently associated with a PFS benefit, as patients who achieved good responses had a significantly lower risk of progression (hazard ratio [HR]: 0.124, 95% CI: 0.067–0.227, log-rank P <0.001).

Identification of peptide peaks

The LTQ Orbitrap-MS/MS successfully identified two of the eight peptides that had been differentially expressed between the good and poor outcome subgroups in the training set. All eight peaks were down-regulated in the poor outcome subgroup, and an MS/MS analysis of two down-regulated peaks, 2,142.12 Da and 3,316.19 Da, revealed respective sequences of K.AVEYYFASDASAVIEHTNR.V and K.NGVDGVYSADPNKDASAVKFDTLTHLDIINK.G. These sequences corresponded to fragments of glucosamine-fructose-6-phosphate aminotransferase and uridylate kinase, respectively (Table 2 and Fig 5).

Fig 5. Identification of two differentially expressed serum peptides.

Purified peptides from the poor outcome (red line) and good outcome (green line) group were sequenced using LTQ-Orbitrap-MS/MS. 5A and 5B present the fragment ion spectra of the sequences K.AVEYYFASDASAVIEHTNR.V and K.NGVDGVYSADPNKDASAVKFDTLTHLDIINK.G, respectively.


In this study, we combined magnetic bead-assisted serum peptide capture with MALDI-TOF MS to compare the peptidomic profiles of patients who exhibited different clinical outcomes in response to first-line pemetrexed plus platinum-based chemotherapy for advanced lung adenocarcinoma. Our results indicated that our novel, serum peptide pattern-based prediction model was useful for discriminating patients who would benefit from this regimen. To the best of our knowledge, this is the first attempted use of this methodology to predict the clinical outcomes of pemetrexed plus platinum-based chemotherapy in this patient population.

Although targeted therapies and immune checkpoint inhibitors have significantly improved the outcomes of selected patients with non-small cell lung cancer, chemotherapy remains the mainstay of treatment for thousands of lung cancer patients worldwide [1921]. Unfortunately, chemotherapy is still administered clinically using a “one-size fits all” approach. Accordingly, predictive biomarkers are desperately needed to identify chemosensitive patients and select appropriate drug combinations, thus avoiding unnecessary toxicities and costs and eventually improving patients' outcomes. To date, two molecules, ERCC1 and TS, have been identified as potential biomarkers [6,7,22]. However, biomarker studies have been hindered by the frequent administration of combinations of chemotherapeutic agents. Meanwhile, the availability and heterogeneity of tumor samples continue to present challenges to clinicians who must select patients [23]. Therefore, novel, noninvasive patient selection tools used to develop predictive models of treatment outcomes must better integrate multiple markers.

The human low molecular weight (1–10 KDa) serum peptidome includes many cytokines, peptide hormones, endogenous peptide products, and protein fragments, some of which may be uniquely suited for diagnostic, prognostic, or predictive peptide biomarker discovery [18,2428]. In other words, the serum peptidome contains an enormous wealth of unexplored biomarker information. Multiple biomarkers or biomarker patterns are now widely recognized as useful clinical tools, and various proteomic approaches have been applied to biological fluid-based biomarker discovery. Moreover, MALDI-TOF MS has exhibited good performance in the low-molecular-mass range [16] and can detect low molecular weight peptides at high levels of sensitivity and resolution, and is therefore considered a useful and standard method for serum peptidome profiling in many diseases [29]. In the present clinical proteomics study, we advantageously combined MALDI-TOF profiling combined with bioinformatics (FlexAnalysis and ClinPro Tools in this study) to the discovery of peptide biomarker patterns in a human disease.

However, the co-existence of highly abundant proteins and other factors in human serum makes it difficult to directly analyze the blood peptidome. Accordingly, targeted peptide enrichment is required [30]. One option for complex sample management is the elimination of the most abundant proteins. However, this depleted protein fraction may contain important disease biomarkers [31]. In 2007, Fiedler et al. revealed that MB-MALDI-TOF MS (magnetic bead-based fractionation flowed by MALDI-TOF MS) is likely to be more sensitive than SELDI-TOF MS for some later generated peaks [32]. Magnetic beads have been developed to purify and fractionate the proteome in serum, as well as in other body fluids. Accordingly, the MB-WCX (weak cation exchange magnetic beads) peptide profiling kit was developed and used to enrich serum peptides, as well as low molecular weight peptides (1–10 KDa), prior to MALDI-TOF MS analysis. Many studies have confirmed that magnetic bead fractionation plus MALDI-TOF MS is a highly sensitive and reproducible approach to serum profiling in different cancers [9,10,33]. To ensure reproducibility in the present study, the same researcher fractioned all serum samples using MB-WCX, and MALDI-TOF MS analyses were performed on the same day. ClinPro Tools 3.0 software was then used to analyze serum peptidome profiles and establish a predictive peptide model with high levels of cross-validation and recognition that could accurately distinguish patients with good or poor outcomes after pemetrexed plus platinum-based chemotherapy.

To eliminate potential influences of previous treatments, we only included previously untreated patients who had received first-line pemetrexed plus platinum-based chemotherapy, as Lazzari et al. reported that serum peptidome profiles could change during the course of treatment [34]. We also collected pre-treatment serum samples, as we inferred that these might allow us to more accurately evaluate the original disease state, a critical component of model construction, and avoid changes induced by chemotherapy.

Previous studies demonstrated that algorithm-based models generated using similar methods could identify patients who were sensitive to epidermal growth factor receptor tyrosine kinase inhibitors (EGFR-TKIs) [12,2426]. Unlike those studies, however, we combined the response and PFS to optimize our classification algorithm (rather than either outcome alone). We classified patients in the training set as having good or poor responses in an attempt to provide broad coverage of all patients, and identified eight potential biomarkers that distinguished these patients through our exploration of inter-group peptidome differences. Although three algorithms were used to develop a predictive classification model, our final successful predictive model was based on four peptides and a QC algorithm that could discriminate between patients with good and poor outcomes.

In the validation set, our predictive model identified chemosensitive patients, or those classified in the chemo good, as having a significantly higher ORR and DCR and significantly longer PFS relative to the chemo poor group. Cox multivariable analyses also confirmed that only these classification results correlated independently with a PFS benefit. Most patients with a PR (27/33) or PD (19/21) were classified correctly, thus yielding a high level of accuracy. Moreover, patients with long SD were also distinguished from those with short SD. The study results suggest that patients with a poor classification might exhibit primary resistance to pemetrexed plus platinum-based chemotherapy. Although the predictive peptide model did not predict OS outcomes, we cannot exclude a prognostic role of this peptide model among patients receiving the designated chemotherapy regimen. To confirm whether the predictive power of our peptide model is specific to a pemetrexed plus platinum-based regimen, cohorts of patients treated with the targeted therapy or other chemotherapy regimens should compared in the future.

Interestingly, we observed differences in eight mass spectra when we compared patients with good and poor responses. The two peptide peaks in our model, m/z 2,142.12 and m/z 3,316.19, were fragments of glutamine-fructose-6-phosphate aminotransferase and uridylate kinase. In this context, glucosamine-6-phosphate synthase is the first and rate-limiting enzyme in the hexosamine biosynthetic pathway, and uridylate kinases (also known as UMP kinases) are key enzymes in the synthesis of nucleoside triphosphates. Ours is the first study to identify these enzymes as potential tumor markers, and we speculate that higher levels of these enzymes might correlate with sensitivity to pemetrexed/platinum because all eight peaks were downregulated in patients who experienced poor treatment responses. However, further studies are needed to confirm whether these biomarkers are valid, sensitive, and specific.

In conclusion, the results of this study demonstrate that patients with advanced lung adenocarcinoma and different responses to pemetrexed plus platinum-based chemotherapy have distinct serum peptidome profiles. Specifically, we constructed a noninvasive, highly sensitive, and high-throughput predictive peptide model to predict the clinical outcomes of this patient population in response to the indicated chemotherapy regimen. The results from our validation set suggest that this predictive peptide model can accurately and reliably discriminate chemosensitive patients and thus could be useful as a tool for the clinical selection of patients who would benefit from this regimen. However, further studies are needed to confirm the clinical value of our model.


We are grateful to all of the patients whose blood donations made this study possible.

Author Contributions

  1. Conceptualization: X. Liu LW.
  2. Data curation: LW.
  3. Formal analysis: LW BX.
  4. Funding acquisition: Xiaoqing Liu.
  5. Investigation: LY LQ LL.
  6. Methodology: KH BY.
  7. Project administration: LW.
  8. Resources: X. Li WW HQ HG.
  9. Software: BX LW.
  10. Supervision: X. Liu.
  11. Validation: CT.
  12. Writing – original draft: LW.
  13. Writing – review & editing: LW.


  1. 1. Siegel RL, Miller KD, Jemal A (2016) Cancer statistics, 2016. Ca A Cancer Journal for Clinicians 66: 7. pmid:26742998
  2. 2. Ettinger D S, Wood D E, Aisner D L, Akerley W, Bauman J, Chirieac LR, et al. Non-Small Cell Lung Cancer, Version 5.2017, NCCN Clinical Practice Guidelines in Oncology[J]. Journal of the National Comprehensive Cancer Network Jnccn, 2017, 15(4):504. pmid:28404761
  3. 3. Tomasini P, Barlesi F, Mascaux C, Greillier L (2016) Pemetrexed for advanced stage nonsquamous non-small cell lung cancer: latest evidence about its extended use and outcomes. Therapeutic Advances in Medical Oncology 8: 198–208.4. pmid:27239238
  4. 4. Scagliotti GV, Parikh P, von Pawel J, Biesma B, Vansteenkiste J, Manegold C, et al. (2008) Phase III study comparing cisplatin plus gemcitabine with cisplatin plus pemetrexed in chemotherapy-naive patients with advanced-stage non-small-cell lung cancer. J Clin Oncol 26: 3543–3551. pmid:18506025
  5. 5. Xiao H-Q, Tian R-H, Zhang Z-H, Du K-Q, Ni Y-M (2016) Efficacy of pemetrexed plus platinum doublet chemotherapy as first-line treatment for advanced nonsquamous non-small-cell-lung cancer: a systematic review and meta-analysis. OncoTargets and therapy 9: 1471. pmid:27042115
  6. 6. Liu Y, Yin T-J, Zhou R, Zhou S, Fan L, Zhang RG, et al. (2013) Expression of thymidylate synthase predicts clinical outcomes of pemetrexed-containing chemotherapy for non-small-cell lung cancer: a systemic review and meta-analysis. Cancer chemotherapy and pharmacology 72: 1125–1132. pmid:24067998
  7. 7. Han Y, Liu J, Sun M, Zhang Z, Liu C, Sun Y, et al. (2016) A Significant Statistical Advancement on the Predictive Values of ERCC1 Polymorphisms for Clinical Outcomes of Platinum-Based Chemotherapy in Non-Small Cell Lung Cancer: An Updated Meta-Analysis. Disease markers 2016.
  8. 8. Coombes KR, Morris JS, Hu J, Edmonson SR, Baggerly KA (2005) Serum proteomics profiling—a young technology begins to mature. Nature biotechnology 23: 291–292. pmid:15765078
  9. 9. Schwamborn K, Krieg RC, Grosse J, Reulen N, Weiskirchen R, Knuechel R, et al. (2009) Serum proteomic profiling in patients with bladder cancer. European urology 56: 989–997. pmid:19282097
  10. 10. Freed GL, Cazares LH, Fichandler CE, Fuller TW, Sawyer CA, Stack BC Jr, et al. (2008) Differential Capture of Serum Proteins for Expression Profiling and Biomarker Discovery in Pre-and Posttreatment Head and Neck Cancer Samples. The Laryngoscope 118: 61–68. pmid:18043497
  11. 11. Orvisky E, Drake SK, Martin BM, Abdel-Hamid M, Ressom HW, Varghese RS, et al. (2006) Enrichment of low molecular weight fraction of serum for MS analysis of peptides associated with hepatocellular carcinoma. Proteomics 6: 2895–2902. pmid:16586431
  12. 12. Taguchi F, Solomon B, Gregorc V, Roder H, Gray R, Kasahara K, et al. (2007) Mass spectrometry to classify non–small-cell lung cancer patients for clinical outcome after treatment with epidermal growth factor receptor tyrosine kinase inhibitors: a multicohort cross-institutional study. Journal of the National Cancer Institute 99: 838–846. pmid:17551144
  13. 13. Amann JM, Lee J-W, Roder H, Brahmer J, Gonzalez A, Schiller JH, et al. (2010) Genetic and proteomic features associated with survival after treatment with erlotinib in first-line therapy of non-small cell lung cancer in Eastern Cooperative Oncology Group 3503. Journal of Thoracic Oncology 5: 169–178. pmid:20035238
  14. 14. Kuiper J, Lind J, Groen H, Roder J, Grigorieva J, Roder H, et al. (2012) VeriStrat® has prognostic value in advanced stage NSCLC patients treated with erlotinib and sorafenib. British journal of cancer 107: 1820–1825. pmid:23079575
  15. 15. Villanueva J, Philip J, Entenberg D, Chaparro CA, Tanwar MK, Holland EC, et al. (2004) Serum peptide profiling by magnetic particle-assisted, automated sample processing and MALDI-TOF mass spectrometry. Analytical chemistry 76: 1560–1570. pmid:15018552
  16. 16. Petricoin EF, Liotta LA (2003) Mass spectrometry-based diagnostics: the upcoming revolution in disease detection. Clinical Chemistry 49: 1228–1229.
  17. 17. Chung CH, Seeley EH, Roder H, Grigorieva J, Tsypin M, Roder J, et al. (2010) Detection of tumor epidermal growth factor receptor pathway dependence by serum mass spectrometry in cancer patients. Cancer Epidemiology Biomarkers & Prevention 19: 358–365.
  18. 18. Carbone DP, Ding K, Roder H, Grigorieva J, Roder J, Tsao MS, et al. (2012) Prognostic and predictive role of the VeriStrat plasma test in patients with advanced non-small-cell lung cancer treated with erlotinib or placebo in the NCIC Clinical Trials Group BR.21 trial. Journal of Thoracic Oncology Official Publication of the International Association for the Study of Lung Cancer 7: 1653.
  19. 19. Shea M, Costa DB, Rangachari D (2015) Management of advanced non-small cell lung cancers with known mutations or rearrangements: latest evidence and treatment approaches. Therapeutic Advances in Respiratory Disease 10: 113. pmid:26620497
  20. 20. Sundar R, Cho BC, Brahmer JR, Soo RA (2015) Nivolumab in NSCLC: latest evidence and clinical potential. Therapeutic Advances in Medical Oncology 7: 85–96. pmid:25755681
  21. 21. Brahmer J, Reckamp KL, Baas P, Crinò L, Eberhardt WE, Poddubskaya E, et al. (2015) Nivolumab versus Docetaxel in Advanced Squamous-Cell Non–Small-Cell Lung Cancer. New England Journal of Medicine 373: 1627. pmid:26412456
  22. 22. Olaussen KA, Postelvinay S (2016) Predictors of chemotherapy efficacy in non-small-cell lung cancer: a challenging landscape. Annals of Oncology: mdw321.
  23. 23. Lu J, Wang W, Xu M, Li Y, Chen C, Wang X (2016) A global view of regulatory networks in lung cancer: An approach to understand homogeneity and heterogeneity. Seminars in Cancer Biology.
  24. 24. Lai ZW, Petrera A, Schilling O (2015) The emerging role of the peptidome in biomarker discovery and degradome profiling. Biological chemistry 396: 185–192. pmid:25229414
  25. 25. Klupczynska A, Swiatly A, Hajduk J, Matysiak J, Dyszkiewicz W, Pawlak K, et al. (2016) Identification of Serum Peptidome Signatures of Non-Small Cell Lung Cancer. International Journal of Molecular Sciences 17.
  26. 26. Wu X, Liang W, Hou X, Lin Z, Zhao H, Huang Y, et al. (2013) Serum proteomic study on EGFR-TKIs target treatment for patients with NSCLC. Onco Targets Ther 6: 1481–1491. pmid:24204163
  27. 27. Carbone DP, Salmon JS, Billheimer D, Chen H, Sandler A, Roder H, et al. (2010) VeriStrat classifier for survival and time to progression in non-small cell lung cancer (NSCLC) patients treated with erlotinib and bevacizumab. Lung Cancer 69: 337–340. pmid:20036440
  28. 28. Wang N, Cao Y, Song W, He K, Li T, Wang J, et al. (2014) Serum peptide pattern that differentially diagnoses hepatitis B virus-related hepatocellular carcinoma from liver cirrhosis. Journal of Gastroenterology & Hepatology 29: 1544–1550.
  29. 29. Kisluk J, Ciborowski M, Niemira M, Kretowski A, Niklinski J (2014) Proteomics biomarkers for non-small cell lung cancer. Journal of Pharmaceutical & Biomedical Analysis 101: 40–49.
  30. 30. Nanjappa V, Thomas JK, Marimuthu A, Muthusamy B, Radhakrishnan A, Sharma R, et al. (2014) Plasma Proteome Database as a resource for proteomics research: 2014 update. Nucleic Acids Research 42: D959. pmid:24304897
  31. 31. Yadav AK, Bhardwaj G, Basak T, Kumar D, Ahmad S, Priyadarshini R, et al. (2011) A systematic analysis of eluted fraction of plasma post immunoaffinity depletion: implications in biomarker discovery. PLoS One 6: e24442. pmid:21931718
  32. 32. Fiedler GM, Baumann S, Leichtle A, Oltmann A, Kase J, Thiery J, et al. (2007) Standardized peptidome profiling of human urine by magnetic bead separation and matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. Clinical Chemistry 53: 421. pmid:17272489
  33. 33. Schaub NP, Jones KJ, Nyalwidhe JO, Cazares LH, Karbassi ID, Semmes OJ, et al. (2009) Serum Proteomic Biomarker Discovery Reflective of Stage and Obesity in Breast Cancer Patients. Journal of the American College of Surgeons 208: 970–978. pmid:19476873
  34. 34. Lazzari C, Spreafico A, Bachi A, Roder H, Floriani I, Garavaglia Det al. (2012) Changes in plasma mass-spectral profile in course of treatment of non-small cell lung cancer patients with epidermal growth factor receptor tyrosine kinase inhibitors. Journal of Thoracic Oncology Official Publication of the International Association for the Study of Lung Cancer 7: 40.