Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

New label-free methods for protein relative quantification applied to the investigation of an animal model of Huntington Disease

  • Flora Cozzolino ,

    Contributed equally to this work with: Flora Cozzolino, Alfredo Landolfi

    Roles Investigation, Methodology, Validation, Writing – original draft, Writing – review & editing

    Affiliations Department of Chemical Sciences, University of Naples "Federico II", Naples, Italy, CEINGE Advanced Biotechnologies, Naples, Italy

  • Alfredo Landolfi ,

    Contributed equally to this work with: Flora Cozzolino, Alfredo Landolfi

    Roles Investigation, Methodology, Writing – original draft

    Affiliations Department of Chemical Sciences, University of Naples "Federico II", Naples, Italy, CEINGE Advanced Biotechnologies, Naples, Italy

  • Ilaria Iacobucci,

    Roles Formal analysis, Investigation, Validation, Visualization, Writing – review & editing

    Affiliations Department of Chemical Sciences, University of Naples "Federico II", Naples, Italy, CEINGE Advanced Biotechnologies, Naples, Italy

  • Vittoria Monaco,

    Roles Formal analysis, Investigation, Validation, Visualization, Writing – review & editing

    Affiliation CEINGE Advanced Biotechnologies, Naples, Italy

  • Marianna Caterino,

    Roles Formal analysis, Validation, Visualization, Writing – review & editing

    Affiliation Department of Molecular Medicine and Medical Biotechnologies, University of Naples "Federico II", Naples, Italy

  • Simona Celentano,

    Roles Visualization

    Affiliation CEINGE Advanced Biotechnologies, Naples, Italy

  • Chiara Zuccato,

    Roles Funding acquisition, Validation, Writing – review & editing

    Affiliations Department of Biosciences, University of Milan, Milan, Italy, Istituto Nazionale di Genetica Molecolare "Romeo ed Enrica Invernizzi", Milan, Italy

  • Elena Cattaneo,

    Roles Writing – review & editing

    Affiliations Department of Biosciences, University of Milan, Milan, Italy, Istituto Nazionale di Genetica Molecolare "Romeo ed Enrica Invernizzi", Milan, Italy

  • Maria Monti

    Roles Conceptualization, Funding acquisition, Supervision, Writing – original draft, Writing – review & editing

    montimar@unina.it

    Affiliations Department of Chemical Sciences, University of Naples "Federico II", Naples, Italy, CEINGE Advanced Biotechnologies, Naples, Italy

Abstract

Spectral Counts approaches (SpCs) are largely employed for the comparison of protein expression profiles in label-free (LF) differential proteomics applications. Similarly, to other comparative methods, also SpCs based approaches require a normalization procedure before Fold Changes (FC) calculation. Here, we propose new Complexity Based Normalization (CBN) methods that introduced a variable adjustment factor (f), related to the complexity of the sample, both in terms of total number of identified proteins (CBN(P)) and as total number of spectral counts (CBN(S)). Both these new methods were compared with the Normalized Spectral Abundance Factor (NSAF) and the Spectral Counts log Ratio (Rsc), by using standard protein mixtures. Finally, to test the robustness and the effectiveness of the CBNs methods, they were employed for the comparative analysis of cortical protein extract from zQ175 mouse brains, model of Huntington Disease (HD), and control animals (raw data available via ProteomeXchange with identifier PXD017471). LF data were also validated by western blot and MRM based experiments. On standard mixtures, both CBN methods showed an excellent behavior in terms of reproducibility and coefficients of variation (CVs) in comparison to the other SpCs approaches. Overall, the CBN(P) method was demonstrated to be the most reliable and sensitive in detecting small differences in protein amounts when applied to biological samples.

Introduction

In recent years, Proteomics has gained centrality in Omics studies for basic and translational applications, especially for diagnostic purposes and for targeted and/or personalized medicine [13]. Differential proteomics approaches are attracting particular attention due to the possibility to compare protein expression profiles from multiple biological conditions, e. g. wild type vs mutant or vs pharmacologically treated, etc [4,5].

Different methodologies have been developed to carry out the qualitative-quantitative analysis of the protein content in samples using both labeled and label-free approaches. The labeling based methods employed fluorescent (e. g. Difference in Gel Electrophoresis (DIGE)) [6,7] or isotopic reagents (as for Isobaric Tag for Relative and Absolute Quantitation (iTRAQ), Stable Isotope Labeling with Amino acids in Cell culture (SILAC) and Tandem Mass Tagging (TMT) [812]) to differently label the two or more proteomes under investigation. These strategies provide higher levels of reproducibility because are based on the contemporaneous electrophoretic separation and/or tandem mass spectrometry (MS/MS) analysis of the samples. Despite these advantages, labeling procedures are time-consuming and very expensive for the high cost of the labeling reagents [13].

More recently, Label-Free (LF) approaches have been introduced for quantification of proteomic profiles by exploiting liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) analyses [1416]. These procedures [1719] represented an effective solution to overcome these drawbacks due to their reduced costs and simplified sample preparation. Nevertheless, longer analysis time and high-performance MS/MS instruments are needed.

Label-Free quantification has emerged as a consequence of the great technical advances in the development and design of high-resolution LC-MS/MS instruments [20], such as the Orbitrap mass analyzer [21]. This analytical approach relies on the measurement of two main parameters, Extracted Ion Chromatogram (XIC) and Spectral Counts (SpCs) [19,22]. Measurement of XIC is a MS1-based strategy in which the ion current associated with each peptide ion is individually considered. The relative quantification of proteins can then be obtained by evaluating the total current associated with all the peptides belonging to a specific protein in the two conditions [23]. SpCs quantification is an MS/MS-based procedure. The number of fragmentation events, (i.e. the spectral counts), measured for all the peptides belonging to the same protein in each sample are summed up resulting in the relative quantification of the protein in the different conditions [24,25].

Both XIC and SpCs methods need massive use of tandem mass spectrometry, producing a large amount of data that needs to be normalized, i.e. corrected for instrumental or uncontrollable variations before becoming suitable for quantitative purposes. In particular, specifically, designed bioinformatic tools are required to manage and process data, while appropriate normalization methods have to be used [26,27].

Normalization procedure has been assessed in multiple ways by several authors [28,29]. In general, XIC methods require steps of higher complexity such as isotopic patterns resolution, feature detection, retention time alignment, etc. Therefore, well defined and automated processing and normalization algorithms have been integrated into specifically designed software packages, as for MaxQuant [3032].

In SpCs methods, instead, fragmentation events are crucial for both protein identification and quantification. Commonly, software workflows for SpCs quantification result in discrete, unnormalized data with normalization left to the user [33,34]. Although XIC-based methods show a greater accuracy and linear range, the access to the last generation high-resolution instruments and/or computational processing for XIC data are often prohibitive, with the consequence that SpC methods are still a valuable alternative for a large number of applications, as demonstrated by recent literature [3538].

Here we propose a new Complexity Based Normalization (CBN) method introducing new adjustment factors in the SpC normalization formula related to the complexity of the analyzed samples in terms of total identified proteins (CBN(P)) or spectral counts (CBN(S)). Both CBN methods are compared with the multiple label-free quantification modes based on the SpC approaches, namely Ratio of Spectral Count (RSC) [33] and Normalized Spectra Abundance Factor (NSAF) [39].

The new proposed normalization approach was optimized on standard protein mixtures with different complexity, and then applied to the differential analysis of the proteome profiles extracted from the brain cortex of zQ175 knock-in mice [40], an animal model of Huntington Disease (HD), in comparison with the wild-type counterpart. The CBN(P) method was demonstrated to be the most reliable and sensitive in detecting small differences in protein amounts when applied to biological samples.

Materials and methods

Preparation of standard mixtures

Six standard tryptic peptide mixtures (Waters, Milford, Massachusetts, US) including Bovine Hemoglobin Chain A (HBA, Uniprot ID P01966, B. Taurus), Bovine Hemoglobin Chain B (HBB, P02070, B. Taurus), Bovine Serum Albumin (BSA, P02769, B. Taurus), yeast Enolase (ENO, P00924, S. Cerevisiae), yeast Alcohol Dehydrogenase (ADH, P00330, S. Cerevisiae) and rabbit Glycogen Phosphorylase (PYG, P00489, O. Cuniculus) were dissolved in 0.2% formic acid (FA) and spiked into an E. Coli total tryptic digest (Waters, Milford, Massachusetts, US) solution, so that final concentration of the digest reaches 67 ng/μL. In all mixtures obtained and indicated with A-E letters, HBA, HBB, BSA and PYG were always added in a fixed amount, while only ENO and ADH were added in variable amounts ranging from 1/25 to 5 times the fixed standards (i.e. 0.94–23.5ng/μl for ENO and 0.76–19 ng/μl for ADH). The obtained standard mixtures have been employed for the optimization of mass spectrometry analyses and for setting quantification methods.

The zQ175 mouse model

All protocols involving animals were carried out in accordance with institutional guidelines in compliance with Italian law (D. Lgs no. 2014/26, implementation of the 2010/63/UE) and authorization n.324/2015-PR issued May 6, 2015, by Ministry of Health. The Ethics Committee of the University of Milano approved studies in mice (Ethical Approval 74/13; Ethical Approval 74/14). JAX stock number was B6J.129S1-Htt tm1Mfc /190ChdiJ. For biochemical analyses, animals were euthanized by dislocation and all efforts were made to minimize suffering. Genotyping of the zQ175 (C57BL/6J) mouse colony was performed by PCR of genomic DNA obtained from tail samples (Nucleo Spin Tissue, Macherey-Nagel, catalog 740952.250) at weaning. CAG repeats of zQ175 mice were sized by using the following PCR primers: forward: CATTCATTGCCTTGCTGCTAAG; reverse: CTGAAACGACTTGAGCGACTC. Cycling conditions were 94°C for 10 minutes, 30 cycles × (96°C for 30 seconds, 57°C for 30 seconds, 72°C for 30 seconds), 72°C for 7 minutes.

Preparation of total protein lysates from mouse cortical tissues

For proteomics and biochemical analyses, animals were euthanized by dislocation and the brains were rapidly removed and the cerebral cortices were immediately excised from the brain, frozen in liquid nitrogen and smashed. Specifically, cortical tissues from three symptomatic homozygous zQ175 mutant and three wild type (WT) mice at 50 weeks of age were used. The samples were then lysed by combining chemical and mechanical methods. Cortical tissues were crushed and repeatedly pipetted in the lysis buffer (50 mM Tris-HCl pH 7.4, 150 mM NaCl, 0.1% v/v Sodium Dodecyl Sulphate (SDS), 1 mM Phenylmethylsulfonyl fluoride (PMSF), 1% v/v Nonidet P-40 (NP40), protease inhibitors EDTA-free), then passed ten times each through a syringe. Following these mechanical processes, they were put on a stirring wheel, (20 minutes at 4°C), and then centrifuged, (16,100 x g, 30 minutes, 4°C). After the lysis procedure, supernatants from all samples were collected and quantified according to Bradford protein assay and 50 μg from each were used for the Differential Proteomics experiment and Western Blot assays.

SDS-PAGE and in situ hydrolysis

50μg from each sample were suspended in Laemmli buffer containing 100 mM dithiothreitol (DTT), boiled for 10 minutes and loaded on a 20x20 cm 4% - 15% bis-acrylamide gradient gel. After SDS-PAGE separation, the gel was stained with Coomassie Blue Brilliant and each lane was excised in 29 slices by using the same cutting scheme. All gel slices were in situ hydrolyzed as previously reported [41]. Finally, the peptide mixtures were vacuum-dried and resuspended in 0.2% formic acid (FA) solution to be analyzed by LC-MS/MS.

LC-MS/MS analyses

Quantitative data were collected on an LTQ Orbitrap XL (ThermoScientific, Waltham, MA) coupled to a nanoLC system (nanoEasy II). All peptide mixtures were analyzed using the same chromatographic conditions, i.e. 3 microliters of each sample were fractionated onto a C18 capillary reverse-phase column (100 mm, 75 μm, 5 μm) working at 250 nl/min flow rate, using a linear gradient of eluent B (0.2% formic acid in 95% acetonitrile) in A (0.2% formic acid and 2%acetonitrile in MilliQ water) from 5% to 40% in 80 minutes was run. MS/MS analyses were performed using Data-Dependent Acquisition (DDA) mode: one MS scan (mass range from 400 to 1800 m/z) was followed by MS/MS scans of the five most abundant ions in each MS scan, applying a dynamic exclusion window of 40 seconds. All samples were run in duplicates.

Protein identification and quantification

Raw data obtained from nanoLC-MS/MS were analyzed with MaxQuant 1.5.2 integrated with the Andromeda search engine [42]. To that end, an appropriate Fasta file was generated by downloading from UniProt and subsequently merging amino acid sequences of both standard proteins and whole E. Coli proteome.

The selected parameters for protein identification were the following: minimum 2 peptides, at least 1 unique; variable modifications allowed were methionine oxidation and pyroglutamate formation on N-terminal glutamine; accuracy for the first search was set to 10 ppm, then lowered to 5 ppm in the main search; 0.01 FDR was used, with a reverse database for decoy; retention time alignment and second peptides search functions were allowed. Protein quantification has been performed only using unique unmodified peptides.

As output, MaxQuant resulted in discrete spectral counts, which were then normalized by using NSAF, RSC, or CBNs approaches.

As concerns data collected for the Differential Proteomics analysis on zQ175 mice, following MaxQuant analysis data, each couple of technical duplicate was mediated for every protein, obtaining a set of 6 data (3 for WT and 3 for zQ175). These values were employed for manual normalization according to the above mentioned SpCs methods.

Western blot and densitometric analyses

Mice brains total protein extracts were separated by SDS-PAGE and then transferred onto nitrocellulose membranes (Bio-rad, Hercules, California, US). Membranes were blocked with 5% non-fat milk and then incubated with the following antibodies: rabbit polyclonal anti-hnRNPH (Abcam, ab154894), rabbit polyclonal anti-SERPIN B6 (Abcam, ab233229; Proteintech, 14962-1-AP), rabbit polyclonal anti-IRGM (Abcam, ab118569), rabbit polyclonal anti-UQCRQ (Proteintech, 14975-1-AP), rabbit monoclonal anti-HOMER1 (Abcam, ab184955), rabbit monoclonal anti-HTT (Cell Signalling Technology, mAb#5656), rabbit polyclonal anti-OSBPL2 (Proteintech, 14751-1-AP), mouse monoclonal anti-β-Actin (Origene, TA811000) and mouse monoclonal anti-Vinculin (Sigma-Aldrich, V9131). Membranes were incubated with horseradish peroxidase-conjugated secondary antibody (1:5000 for mouse host antibodies and 1:10000 for rabbit host antibodies) for 45 minutes at room temperature and the signals were detected by enhanced chemiluminescence (ECL) detection system (Thermo Fisher Scientific, Inc., Waltham, MA).

For densitometric analyses, the software Quantity One 4.6.8 (Bio-Rad, Hercules, California, US) has been employed; all protein band intensities were normalized with the corresponding β–actin signal, except HTT, whose intensities were normalized on vinculin signal.

Multiple-Reaction Monitoring (MRM) analyses

As additional validation method multiple-reaction monitoring (MRM) approach was employed. 50 μg of cell lysates obtained as described above were digested by trypsin onto S-Trap filters, according to the manufacturer protocol (Protifi, Huntington, NY). The three biological replicates of zQ175 and wild-type mice were pooled respectively, and HTT, SERPIN B6, HOMER1, OSPBL2, SAMM50, APO-A4, CAMK2A, STIP1, RIC8A, ATP2A2 peptide transitions were monitored with a Xevo-TQS mass spectrometer coupled to a nanoAcquity UHPLC (Waters, Milford, MA, US) equipped with IonKey CHIP interface. Peptide mixtures were separated on peptide BEH C18 130Ǻ, 1.7μm, 150 μm x 50mm, iKey by using a linear gradient of eluent B (95% acetonitrile LC-MS grade (Sigma Aldrich, St. Louis, Missouri, US), 0.2% formic acid (Sigma Aldrich, St. Louis, Missouri, US)) from 7% to 95% over 115 minutes working at a flow rate of 3μl/min. Raw data were processed with Skyline v20.1.0.155 (MacCoss Lab Software, Dept of Genome Sciences, UW) and the total area of each peptide transition was used for the relative quantification of the specific proteins. For each protein, at least two prototypic peptides were selected and at least two transitions for each parent ion were monitored, as in silico predicted by using Skyline. Proteins were not monitored all together but in three different runs, both for wild type and zQ175: run A, included HTT, HOMER1, RIC8A; run B, ATP2A2, APO-A4, OSBPL2; run C, SAMM50, SERPIN B6, CAMK2A, STIP1 and in each run ACTIN was monitored as the internal standard. Each run was analyzed in duplicate and the total area of each peptide transitions was used for the relative quantification of the specific proteins, normalized for actin FC.

Statistical analyses

Multiexperiment Viewer v4.9.0 [43], (MeV) was employed to perform statistical analysis of MaxQuant output. Western blots results were evaluated by univariate statistical analysis using GraphPad Prism 8.0, and the results are presented as the mean ± standard deviation (SD) by three biological replicates. The statistical significance of the observed difference in western blot analysis was determined by parametric (Welch’s t) or non-parametric (Mann–Whitney test) tests when data failed the Shapiro–Wilk normality test. Mass spectrometry data (LF) statistical significance was determined by the unpaired Student’s t-test and differences were considered statistically significant at Benjamini-Hochberg corrected p-value (FDR)<0.05 [44].

Functional over-representation analysis

STRING database (https://string-db.org/) [45] (STRING v11: Protein-Protein Association Networks With Increased Coverage, Supporting Functional Discovery in Genome-Wide Experimental Datasets) was used to perform a pathways enrichment analysis on differentially expressed proteins in mouse brains identified in each method (NSAF, Rsc, CBN(P), CBN(S)).

Results

Development of a new method for Spectral Counts normalization

A new mathematic method, called Complexity Based Normalization (CBN), was developed for the normalization of Spectral Counts data in differential proteomics experiments to calculate the relative Fold Change for each protein between two experimental conditions. The new formula (1) employed for SpCs normalization is reported as follows in comparison with the Normalized Spectra Abundance Factor (NSAF) (2) and the Spectral Counts log Ratio (RSC) (3) methods.

(1)(2)(3)

In CBN methods (1), Sx,M are the Spectral Counts of protein x in mixture M, tM are the total spectral counts of mixture M, f is the complexity-based adjustment factor. In addition to common variables, in NSAF method (2) Li represents the amino acid length of "i" protein, while P is the total number of proteins identified in the analysis. Finally, in the RSC method (3) f is a fixed adjustment factor = 0.5.

The output of these formulas consists of a numeric value representing the quantity of a specific protein in each mixture that is used to calculate the relative Fold Change (FC). Both RSC and CBN require the use of an adjustment factor "f" to avoid the presence of missing values. In the RSC method, Beissbarth et al. [46] have fixed the f value constant to 0.5. For both CBN methods we propose the employment of a variable adjustment factor according to sample complexity. Specifically, the CBN formula has been developed with f = 1/P (CNB(P)), where P is the number of proteins occurring in the MaxQuant output data or with f = 1/t (CBN(S)), where t is the sum of all spectral counts associated to all proteins in all mixtures. As a consequence of these definitions, the adjustment factor becomes complexity-based, since the P and t values are both related to the total number of proteins in the sample.

Setting up the normalization methods on E. Coli proteome standard sample

The newly developed CBN formulas were evaluated in comparison with existing normalization methods in differential proteomics experiments using the label-free procedure applied to the E. Coli proteome. Commercial tryptic digests from six different proteins (HBA, HBB, and BSA from B. Taurus, ENO and ADH from S. Cerevisiae and PYG, from O. Cuniculus) were spiked into a fixed matrix consisting of a constant amount of E. Coli total proteins tryptic digest, to mimic a real sample environment. Five mixtures (A, B, C, D, E) were prepared as described above, in which only ADH and ENO tryptic peptides were added in different amounts. Each mixture was analyzed by nanoLC-MS/MS in duplicates and the raw data processed by using MaxQuant. Following MaxQuant analysis, a set of unnormalized SpCs data was obtained and used to test the different normalization methodologies, NSAF, Rsc, and the newly developed CBNs, either in terms of total identified proteins (CBN(P)) or spectral counts (CBN(S)).

A total of 409 proteins (P), including the six spiked standards were identified in all samples by MaxQuant analysis, together with 17837 total spectral counts (t) (S1 Table). The P and t experimental values were then introduced in the CBN formulas for the computation of the f factor.

As a first step, the reproducibility of the different approaches was tested making use of the two technical replicates prepared and analyzed for each mixture. As an example, Fig 1A–1D shows the corresponding scatter plots concerning the quantitative measurements of all proteins identified in the two replicates of mixture A by the four different normalization methods used, i.e. NSAF, RSC, CBN(P) with f = 1/P and CBN(S) with f = 1/t.

thumbnail
Fig 1. Scatter plots of the quantitative measurements of all the E. Coli proteins.

Scatter plots of the quantitative measurements of all the E. Coli proteins identified in the two replicates of mixture A by the four different normalization methods used, i.e. NSAF (panel A), RSC (panel B), CBN(P) with f = 1/P (panel C) and CBN(S) with f = 1/t (panel D). On the X-axis is reported the normalized Spectral Counts of replicate 1, and on the Y-axis the normalized Spectral Counts of replicate 2.

https://doi.org/10.1371/journal.pone.0238037.g001

When the data produced by the applied method were perfectly reproducible between the two technical replicates, the scatter plot should result in a perfectly linear behavior (R2 = 1) and, most importantly, should have a unitary slope.

As showed in Fig 1A–1D, a very poor correlation occurred when data were elaborated by NSAF and Rsc in comparison to CBN methods, with a high number of points largely scattered, therefore suggesting a lower level of reproducibility. Linear regression best-fit values are summarized in S2 Table. It should be underlined that the scatter plots associated with the two CBN methods showed quite similar shape although the value of the adjustment factor was greatly different. Furthermore, as shown in panel 1B, a large scattering of data that could not be accommodated in a linear behavior was observed in the RSC normalization method.

Fig 2A reports the slopes calculated for the best fitting lines in each pair of technical replicates analyzed with the four methods for all samples except mixture B, since it showed a very low reproducibility in all methods (S1 Fig), suggesting the occurrence of technical problems in the LC-MS/MS analysis. Findings reported in Fig 2A showed that all the SpCs methods displayed an acceptable behavior except for RSC, whose median value was much lower than the expected value (i.e. 1) suggesting that a fixed adjustment factor "f" strongly affects the median value. The reproducibility of each method was also evaluated by calculating the coefficient of variation (CV) in the quantification of E. Coli proteins throughout the ten LC-MS/MS runs in which the amount of E. Coli tryptic peptides was constant (Fig 2B). The RSC method showed a very good behavior with a low coefficient of variation whereas the NSAF displayed a greater dispersion of data and a very high CV value. This is very likely due to the absence of any adjustment factor in this method making NSAF very susceptible to quantitative biases caused by the absence of data. As showed, CBN(S) performed well, showing a CV dispersion intermediate between NSAF and Rsc. The best result occurred in CBN(P), which presented the lowest CV values contained in the narrowest dispersion window, suggesting it as the most precise among the investigated methods.

thumbnail
Fig 2. Comparative analysis of all spectral counts normalization methods applied to E. Coli proteome.

(A) Median values for the best fitting slopes calculated for each pair of technical replicates for all samples analyzed (except mixture B) with each normalization method. (B) Coefficient of variation (CV) for the evaluation of data dispersion for all the normalization methods used in the analysis of E. Coli proteome. (C) Representation of the logarithmic Fold Change distribution around the theoretical value indicated by the point line for all normalization methods. (D) Box plots of the CV for the fold change in the mix E / mix D pair, calculated on all the four possible pairs of the technical replicates. In panels B and D the box and whisker extremes represent 25–75%.

https://doi.org/10.1371/journal.pone.0238037.g002

These preliminary data strengthened our hypothesis that differences in Fold Change values are due to the different sizes of the adjustment factor, since the shape of the data is identical.

The normalization methods were then tested for their ability to correctly assess protein quantification when two different sets of data were compared. Therefore, the five mixtures and the corresponding replicates were compared by two according to the scheme: A1/B1, A2/B1, A1/B2, A2/B2, etc., and the Fold Change (FC) values of all proteins in each pair were calculated. As the amount of E. Coli proteome was constant in all samples, the expecting Folding Change value for each protein should have been equal to 1 (Log2 (FC) = 0, Fig 2C).

For each pair, the mean value of the Fold Change and its associated CV was calculated. As an example, in Fig 2D the data related to the four possible combinations of mixture E/mixture D pair were reported. Inspection of the results showed a very regular distribution of data points for the CBN(P) method, with more than 75% of values having an associated CV of 25% or less. NSAF and Rsc displayed a slightly more dispersed distribution of Fold Change values, while CBN(S) showed the highest dispersion of data.

Then, we moved to evaluate the statistical relevance of proposed methods, through the calculation of False Discovery Rate (FDR). An unpaired Student's t-test was performed to extrapolate the E. Coli statistically significant Differentially Expressed Proteins (DEPs): the latter represented the pool of false positives, since no variations were expected in E. Coli proteins. The FDRs, calculated as the percentage of false positives of total identified proteins, resulted in 5.1% for Rsc, 6.1% for both CBNs, and 7.6% for NSAF. These values are very close to the reference one (5%), indicating a high reliability of all methods, excepted NSAF, which showed the highest value of false positives.

Furthermore, for each method the values of Fold Change cutoffs were estimated, treating the standard mixtures A—C and D—E as a couple of samples to be compared (A and C vs D and E).

The FC cutoffs were calculated for each method by evaluating the dispersion of FCs of the unchangeable proteins of E. Coli proteome, considered not statistically significant according to unpaired Student's t-test (p<0.05). The FC thresholds were defined by the lower Q1-(IQR*1.5) and upper Q3+(IQR*1.5) extremes, where Q1, Q3 represented 25 and 75 percentiles respectively, and IQR is defined as the inter-quartile distance. The obtained FC cutoffs were summarized in Table 1:

These findings showed very similar results for CBN(S), Rsc, and NSAF, while CBN(P) performed quite differently. In particular, its narrowest cutoffs again confirmed that CBN(P) is the most reliable method in the detection of slight differences in protein expression levels (low FCs), which in biological samples might be relevant.

Comparison of normalization methods for the quantification of the six standard proteins spiked within the E. Coli proteome

The different normalization methods were then evaluated in the quantification of the Fold Changes of the six standard proteins spiked within the E. Coli proteome.

The Fold Change values were determined by all methods for the six standard proteins spiked within the E. Coli proteome in all mixtures (S3 Table).

The performances of the different normalization methods were evaluated for the quantification of BSA, HBA, HBB, and PYG proteins, whose concentration was constant in all mixtures, and for quantification of ENO and ADH, whose amount changed in the samples. Fig 3A reports plots showing the distribution of the logarithmic Fold Change for unchangeable proteins obtained with the four normalization methods. All methods displayed similar results indicating a general high accuracy in the FC measurements, as mainly demonstrated in BSA and HBB, where the mean values agree to the true value (FC = 0). In HBA the best fit occurred in CBN(P) method, confirming very good performances in terms of accuracy and precision. However, almost all measured values scattered within 0.3 logarithmic units from the theoretical values, corresponding to an absolute Fold Change of 1.2, which is reasonably considered as an unchanged amount of protein.

thumbnail
Fig 3. Fold change analysis comparing all spectral counts normalization methods.

(A) The mean Fold Change values for unchangeable proteins (BSA, HBA, HBB, and PYG) within the ten different pairs of mixtures calculated by the four normalization methods. The theoretical values are 0 and it is indicated by a point line. (B) Comparison of the experimental and theoretical Fold Change values for the relative quantification of ENO and ADH obtained by using the SpC-based methods and reported in the logarithmic scale. (C) Table reporting linear regression best-fit values for all ENO and ADH linear regressions.

https://doi.org/10.1371/journal.pone.0238037.g003

The linear correlations between experimental and theoretical values obtained for all different normalization methods were investigated for ENO and ADH, whose concentrations were changeable (Fig 3B). Very similar correlation values were obtained for both proteins in the calculation of relative abundance determined by NSAF, Rsc and CBNs, suggesting that all methods are equally able to correctly evaluate differences in the amount of proteins. For ENO, linear correlations were confirmed also decreasing 10 times the concentration of the lowest point reported in Fig 3B (data not shown), confirming a good behavior for all methods.

In conclusion, the quantitative measurement using standard mixtures demonstrated that both the newly proposed CBN methods resulted reliable in the estimation of all expected FCs, either when the protein amount changed or when it was constant, for which they showed a very good adherence to theoretical data even in a complex mixture.

Application of CBN methods to the differential proteomics analysis of samples from cerebral cortex from Huntington’s Disease mice

Once the effectiveness of the newly developed SpC-based normalization methods was defined, the two CBN approaches were challenged with the analysis of more complex samples, in which the expression level of a large number of proteins is expected to significantly change. A label-free differential proteomics experiment was performed on the brain cortex proteome from mice affected by Huntington’s Disease (HD) (homozygous mutant HTT knock in line zQ175) [40] and corresponding control samples (WT).

Equal amounts of total protein extracts from three wild type and three mutant mice cortices were separated by SDS-PAGE (S2 Fig) and each of the six lanes was cut in 29 slices that were in situ hydrolyzed with trypsin. The corresponding 174 peptide mixtures were then analyzed by nanoLC-MS/MS in duplicates.

The resulting set of 348 LC-MS/MS raw data (available via ProteomeXchange with identifier PXD017471) was processed by MaxQuant for protein identification and quantification (S4 Table). The set of unnormalized SpCs data was then elaborated by the newly developed CBN methods (CBN(P) and CBN(S)). The other methods (NSAF and Rsc) were used for comparison. The lists of proteins were then filtered according to their differential expression and statistical significance (FDR<5%) by using the Multi experiment Viewer, as described above. Statistically significant up- and down-regulated proteins, identified by each method according to FC cut off values previously calculated, were reported in the histogram in Fig 4A. It is surprising to note that the two CBN methods led to the identification of a quite different number of statistically significant proteins. In CBN(S) as well as Rsc, it was nearly 140, whereas the CBN(P) as well as the NSAF approach led to the identification of only about a hundred of significant proteins.

thumbnail
Fig 4. Visualization of identified proteins with all methods in the WT and HD mouse model.

(A) The histogram shows for each method the number of proteins that appeared to be both differentially expressed and statistically significant. (B) Venn diagram referred to all proteins. The central area represents the proteins common to all methods.

https://doi.org/10.1371/journal.pone.0238037.g004

In particular, the specific and shared proteins were highlighted in a Venn diagram (Fig 4B). Details of statistically significant identified proteins were summarized in S5 Table.

The reliability of the proposed normalization methods was further confirmed by quantification of some selected proteins by two independent methodologies, i.e. western blot analyses and Multiple Reaction Monitoring (MRM) mass spectrometry analysis.

In western blot assays (Fig 5A), the expression levels of IRGM1, OSBPL2 and SERPIN B6, which were up-regulated in HD mice and hnRNP H, HTT, SAMM50, UQCRQ and HOMER1, whose expression was instead decreased in HD mice were evaluated on total protein extracts of three WT and three HD mice (the same samples employed for the proteomic experiment), in duplicate.

thumbnail
Fig 5. Validation of a selected group of proteins differentially expressed in WT and HD mouse model.

(A) Western blot assays performed on total protein extracts from three mutant (zQ175(1), zQ175(2), zQ175(3)), and three wild-type (wt(1), wt(2), wt(3)) mice with antibodies against the selected proteins. β-actin was used for normalization. (B) MRM superimposed traces of transitions of one of CAMK2A proteotypic peptide reported for zQ175 (left panel) and WT (right panel) together with one ACTIN peptide transition. (C) Densitometric analysis of data from the western blot of panel A. The indicated values in the graph represent the percentage of arbitrary units compared to WT to which 100% was assigned. Results are represented as the as mean ± SD (standard deviation). The statistical significance was evaluated by parametric (Welch’s) or non-parametric (Mann-Whitney) tests when data failed the Shapiro–Wilk normality test. * p < 0.05, ** p < 0.01 *** p < 0.001, **** p < 0.0001. (D) Fold Change measured by MRM analysis of pooled HD and WT samples, respectively.

https://doi.org/10.1371/journal.pone.0238037.g005

Upon the densitometric analysis, each sample was normalized with the intensity of β-actin developed on the same membrane, except HTT normalized with the intensity of vinculin. The normalized intensities were then evaluated by univariate statistical analysis and the results summarized in Table 2 and graphically reported in Fig 5C.

thumbnail
Table 2. Summary of proteins FCs validated by western blot and/or MRM.

https://doi.org/10.1371/journal.pone.0238037.t002

Since in several cases the differences in expression levels were about 20–30%, the western blot technique might not be enough reliable in detections on a restricted number of replicates.

Therefore, we integrated validation experiments by quantifying some proteins by using a tandem mass spectrometry methodology: the Multiple Reaction Monitoring (MRM). HTT, SERPIN B6, HOMER1, OSBPL2, and SAMM50 were also monitored by MRM, together with other proteins, APO-A4, CAMK2A, STIP1, RIC8A and ATP2A2, whose antibodies were not available or not efficient in western blot analyses. ACTIN was monitored too and employed as an internal standard for normalization procedure.

In detail, three zQ175 and WT cortex brain samples (the same samples employed for the proteomic experiment) were pooled respectively, and few micrograms were employed for a shotgun proteomics experiment by using the MRM targeted approach. Peptide mixtures were separated by nanoLC-MS/MS in duplicate and two or more transitions of at least two proteotypic peptides were monitored for each of the above proteins and employed for relative protein quantification. Areas of monitored transitions (Fig 5B, S6 Table) were measured, mediated among transitions, and all peptides belonging to the same protein, employed to calculate FC, which were normalized with FC measured for ACTIN in each couple of runs. FC results are summarized in Table 2 and graphically reported in Fig 5D.

A strong agreement among all measured FCs emerged by comparing the results obtained by western blot and MRM, confirming the reliability in the validation of both methods. Moreover, for all proteins, the variation trends detected by LF approaches were confirmed by both methods (Table 2) also in the detection of slight FCs.

In detail, levels of HTT protein are significantly reduced in HD mice compared to controls, according to previously observed [47]. This protein was identified by Rsc and CBN approaches as statistically significant, although the three methods measured very different FC values: CBN(S) FC = 0.08, CBN(P) FC = 0.59, Rsc FC = 0.14. By comparing these values with those obtained by western blot (0.42) and MRM (0.59), the FC measured by CBN(P) was the only perfectly in agreement with both values. This finding was not a random or a sporadic event, but it was recurrent in all proteins, from those quantified also in other methods (HTT, SAMM50, HOMER1, IRGM1, RIC8A, STIP1, OSBPL2, UQCRQ) to those statistically significant only for CBN(P) (CAMK2A, APO-A4, ATP2A2) (Table 2).

hnRNP H and UQCRQ are proteins identified as statistically significant only by CBN(S), while SERPIN B6 varied significantly for all methods excepted CBN(P). hnRNP H and SERPIN B6 were endorsed by western blot assays, founding their densitometric mean values statistically significant. Surprisingly, FC values calculated by CBN(S) for these proteins strongly agreed with those obtained from the other methods. Altogether these data confirm that the quantification of results obtained by normalization of the differential proteomic data performed by the CBN methods is reliable, although CBN(P) was confirmed to be the best in terms of precision and reliability in comparison with CBN(S) and the other investigated methods.

The results of the differential proteomics experiments were then evaluated on a functional basis by gathering the differentially expressed proteins within biological functional networks. A bioinformatic analysis involving all the differentially expressed proteins identified by all methods were carried out using KEGG pathways and the STRING databases. The results are summarized in Table 3 where the main pathways showing FDR<0.05 are reported. Functional classes showing the lower FDR values include metabolic pathways, oxidative phosphorylation, and neurodegenerative diseases (Parkinson's Disease, Huntington's Disease, Alzheimer's Disease), confirming the effectiveness of the newly developed methods not only in terms of sensitivity in the detection of single up- or down-regulated proteins, but also looking at the biological meaning of aggregated results.

thumbnail
Table 3. List of functional pathways identified by STRING analysis.

https://doi.org/10.1371/journal.pone.0238037.t003

Discussion

Label-Free methods for differential proteomics investigations generate a large amount of data that needs to be normalized, i.e. corrected for instrumental or uncontrollable variations before becoming employable for quantitative purposes. The choice of suitable normalization formula is a not trivial aspect in these approaches, because it might strongly affect the experimental results.

For Spectral Counts based Label-Free approaches, normalization procedure has been assessed in different ways, resorting in some cases to the use of fixed and arbitrary corrective factors, such as in Rsc formula [33], to overcome biases associated with the presence of missing values in the analysis, as occurs in NSAF method [39] lacking of adjustment elements.

In light of these considerations, we decide to test new formulas for SpCs data normalization, in which the corrective factor was not an arbitrary choice of the operator, but strictly and directly defined by an intrinsic property of the system under investigation, such as its complexity.

A new approach defined Complexity Based Normalization (CBN) was introduced for the normalization of Spectral Counts data in differential proteomics experiments to calculate the relative protein Fold Changes. The new formula employed a complexity-based adjustment factor generating two different methods, CBN(P) where f = 1/P and CBN(S) in which f = 1/t with P and t referring to the total number of identified proteins (P) or the total spectral counts in the LC-MS/MS analysis (t), respectively.

The performances in terms of reliability, precision, accuracy, linearity, and sensitivity of the two CBN methods in the analysis of raw data from differential proteomics experiments carried out according to the label-free strategies were evaluated in comparison with existing SpC-based normalization approaches, NSAF and Rsc.

Methods evaluation was carried out by analyzing a set of samples consisting of six different standard proteins spiked in an E. Coli total tryptic digest matrix and the capability of the methods to quantify both the total matrix and the spiked proteins were monitored. Both CBN(P) and CBN(S) showed very good reproducibility in the analysis of the technical replicates and were able to correctly quantify the unchanged amount of E. Coli proteins. When the accuracy of all methods was compared, CBN(P) showed the best behavior, while the CBN(S) performance was comparable with the one showed by NSAF and RSC methods; a similar trend has been detected also for unchangeable spiked proteins.

In addition, other differences could be observed between all methods when the coefficient of variation in the quantification of E. Coli proteins was examined. Although both CBN methods displayed a low dispersion of data and low CV, CBN(P) displayed a better performance than CBN(S) suggesting that the precision is affected by the size of the adjustment factor.

The evidence that the choice of the adjustment factor value might affect performances of SpC-based normalization approaches is confirmed also for NSAF and RSC methods. In fact, NSAF showed low reproducibility greater dispersion of data and very high CV value especially in the quantification of the lowest abundant proteins. This is likely due to the absence of any adjustment factor in this method making NSAF very susceptible to quantitative biases caused by the absence of data. The Rsc method showed low reproducibility in the quantification of lowest abundance proteins in the quantitative analysis of E. Coli proteome but low coefficient of variation in the quantitation of spiked proteins suggesting again that a fixed adjustment factor "f" might affect the analysis of a large set of data.

Otherwise, all quantitative methods were characterized by a wide linearity concentration-range when changeable spiked proteins were quantified, proving a response directly proportional to the protein amount in a wide concentration range.

When CBNs methods were tested on biological complex samples (protein extracts from the brain of HD mice vs WT), both methods generated reproducible, accurate, sensible, and reliable quantitative data. Their FC trends were always confirmed by western blot and/or MRM validations, suggesting that a complexity-based adjustment factor properly works in the correction of output data, leading to suitable final quantitative measurements of protein expression variations.

However, the two CBN methods performed differently in the Fold Change calculation. Although CBN(P) recognized a lower number of changeable significant proteins, it has proven to be the most reliable method to appreciate minimal statistically significant changes in protein expression levels, since its FC cutoff was the narrowest among those calculated for other methods.

Moreover, when the FC values calculated by all methods for selected proteins were compared with those calculated by western blot and/or MRM, a surprisingly perfect accordance was found solely with FCs derived from CBN(P), suggesting that this method performs as the highest reliable and accurate among all. CBN(S), in most cases, performed more like Rsc and NSAF than CBN(P).

Nevertheless, CBN(S) method showed a higher sensitivity in the detection of statistically relevant FC of low abundant proteins that give rise to a low number of SpCs, such as UQCRQ and hnRNP H. These proteins were statistically significant solely for CBN(S), but their FCs were confirmed by western blot assays with a good accordance.

Differences in performances between CBN(P) and CBN(S) might be due to the dimension of the adjustment factor, which is not negligible, considering that the difference between total SpCs and total proteins is about two orders of magnitude (in mice analysis, total SpCs = 2.45x105 and P = 2860).

Moreover, over-representation analysis of CBNs data carried out by STRING reveals that the most of proteins flows in molecular pathways affected in HD brain tissue and other neurodegenerative model systems, thus confirming the robustness and the biological coherence of data provided by these new methods. Indeed, processes such as energy metabolism [5,4851], oxidative phosphorylation and mitochondria functionality [5254], calcium homeostasis [55,56], already known to be deregulated in cortical samples from HD mice, were identified, thus indicating the highest correlation level of functional data.

In conclusion, in this study, we demonstrated how all SpCs normalization methods are strongly affected by the presence/absence or by the value of adjustment factors “f”. The presence of a correction factor allows overcoming the effect of the absence of data in FC calculation and leads to methods with the lowest coefficient of variation. Moreover, the association of the "f" with sample complexity makes the operator free from the choice of the best value for the correction factor.

Our data have shown that CBN(S) and CBN(P) are both a viable alternative to other existing SpC-based quantification methods. Furthermore, both CBN(S) and CBN(P) are two sensitive methods, although each shows a different reactivity: CBN(P) is capable of appreciating small but statistically and biologically significant FC variations in proteins well represented in the proteome; on the contrary, the CBN(S) enhances the differences in levels of protein expression in lower abundant proteins.

Finally, if someone would ask which methods we prefer, our choice falls on CBN(P), because it performs as the most reliable and sensitive “sensor” for FCs. In fact, from our point of view, it is preferable to work on a lower number of data which, however, are the result of more stringent and reliable selection standards, rather than getting lost in a sea of data often difficult to interpret from a biological point of view.

Supporting information

S1 Fig. Median values for the best fitting slopes calculated for each pair of technical replicates including mixture B for all samples analyzed with each normalization method.

https://doi.org/10.1371/journal.pone.0238037.s001

(PDF)

S2 Fig. Images of the SDS-PAGE of mice samples.

Three biological replicates cortices of zQ175 and WT mice were loaded and gel slices were cut following the scheme reported.

https://doi.org/10.1371/journal.pone.0238037.s002

(PDF)

S3 Fig. Images of the entire membranes whose inserts are reported in Fig 5A.

Full-length western blots. Technical replicates of HTT, IRGM1, Homer1, OSBPL2, hnRNP H, UQCRQ and Samm50 in zQ175 and WT mice were developed with same antibodies. Technical replicates for SerpinB6 were developed with two different antibodies, as reported below the images.

https://doi.org/10.1371/journal.pone.0238037.s003

(PDF)

S1 Table. Details of the qualitative and quantitative mass spectrometric analysis of standard mixtures spiked in standard E. Coli proteome.

In sheet 1, raw data generated by MaxQuant following the bioinformatic analysis for protein identification and quantification are reported. The processed protein groups applying each SpCs method normalization formulas are disclosed in sheets 2, 3, 4, and 5.

https://doi.org/10.1371/journal.pone.0238037.s004

(XLS)

S2 Table. Linear regression best-fit value referred to MixA.

Included is Slope, Y-intercept, and X-intercept calculated by NSAF, RSC, CBN(P), and CBN(S) methods.

https://doi.org/10.1371/journal.pone.0238037.s005

(XLS)

S3 Table. Fold Change value (FC) and expected Fold Change (exp FC) value by NSAF, RSC, CBN(P) and CBN(S) methods.

Included is Fold Change value (FC) and expected Fold Change (exp FC) value of standard proteins (HBA, HBB, BSA, ADH, PYG, and ENO) measured and/or calculated in B/A, C/A, D/A, E/A, C/B, D/B, E/B, D/C, E/C and E/C mixture by using NSAF, RSC, CBN(P), CBN(S).

https://doi.org/10.1371/journal.pone.0238037.s006

(XLS)

S4 Table. Details of the qualitative and quantitative mass spectrometric analysis of zQ175 and Wild-Type mouse HD model.

In sheet 1, raw data generated by MaxQuant following the bioinformatic analysis for protein identification and quantification are reported. The processed protein groups applying each SpCs method normalization formulas are disclosed in sheets 2, 3, 4, and 5.

https://doi.org/10.1371/journal.pone.0238037.s007

(XLS)

S5 Table. Statistically significant identified proteins.

In the table are reported the FCs and p-values of statistically significant identified proteins (p value<0.05) and FDR (<5%). Protein names, gene names, Uniprot codes and peptides are also indicated as headers.

https://doi.org/10.1371/journal.pone.0238037.s008

(XLSX)

S6 Table. Summary of MRM results.

In the table are reported the FCs values of proteins validated by Multiple Reaction Monitoring (MRM). Run, protein names, peptide sequences, Collision Energy, m/z of precursor ions, m/z of fragment ions, total transition area, peptide FC, and averaged and normalized FC for each protein.

https://doi.org/10.1371/journal.pone.0238037.s009

(XLSX)

Acknowledgments

We wish to acknowledge the CHDI Foundation for zQ175 mice and prof. Piero Pucci for a support for a critical review of the contents.

References

  1. 1. Duarte TT, Spencer CT. Personalized proteomics: the future of precision medicine. Proteomes, 2016, Dec; 4(4): 29. pmid:27882306
  2. 2. Chandler RJ, Sloan JL, Dorko K, Cusmano-Ozog K, Ingenito L, et al. The proteome of methylmalonic acidemia (MMA): the elucidation of altered pathways in patient livers. Molecular Biosystems, 2016 Feb;12(2):566–74. pmid:26672496
  3. 3. Greco V, Piras C, Pieroni L, Ronci M, Putignani L, Roncada P, et al. Applications of MALDI-TOF mass spectrometry in clinical proteomics. Expert Rev Proteomics, 2018 Aug;15(8):683–696. pmid:30058389
  4. 4. Cao Y, Li Z, Mao L, Cao H, Kong J, Yu B, et al. The use of proteomic technologies to study molecular mechanisms of multidrug resistance in cancer. European Journal of Medicinal Chemistry,2019 Jan 15;162:423–434. pmid:30453249
  5. 5. Langfelder P, Cantle JP, Chatzopoulou D, Wang N, Gao FF, Al-Ramahi I, et al. Integrated genomics and proteomics define huntingtin CAG length-dependent networks in mice. Nature Neuroscience, 2016 Apr;19(4):623–33. pmid:26900923
  6. 6. Timms JF, Cramer R. Difference gel electrophoresis. Proteomics, 2008, 8, 4886:4897. pmid:19003860
  7. 7. Caterino M, Pastore A, Strozziero MG, Di Giovamberardino G, Imperlini E, Scolamiero E, et al. The proteome of cblC defect: in vivo elucidation of altered cellular pathways in humans. J Inherit Metab Dis., 2015 Sep;38(5):969–79. pmid:25585586
  8. 8. Evans C, Noirel J, Ow SY, Salim M, Pereira-Medrano AG, Couto N, et al. An insight into iTRAQ: where do we stand now? Analytical and Bioanalytical Chemistry, 2012, 404, 1011:1027. pmid:22451173
  9. 9. Staubach S, Wenzel A, Beck BB, Rinschen MM, Müller S, Hanisch FG. Autosomal Tubulo interstitial Kidney Disease-MUC1 Type: Differential Proteomics Suggests that Mutated MUC1 (insC) Affects Vesicular Transport in Renal Epithelial Cells. Proteomics,2018 Apr;18(7). pmid:29436780
  10. 10. Chen X, Wei S, Ji Y, Guo X, Yang F. Quantitative proteomics using SILAC: principles, applications, and developments, Proteomics, 2015, 15, 3175:3192. pmid:26097186
  11. 11. Altelaar AF, Frese CF, Preisinger C, Hennrich ML, Schram AW, Timmers HT, et al. Benchmarking stable isotope labeling based quantitative proteomics. J Proteomics, 2013 Aug 2;88:14–26. pmid:23085607
  12. 12. Zhang L,. Elias JE. Relative Protein Quantification Using Tandem Mass Tag Mass Spectrometry.Methods in Molecular Biology, 2017,1550:185–198.
  13. 13. Patel VJ, Thalassinos K, Slade SE, Connolly JB, Crombie A, Murrell J C, et al. A Comparison of Labeling and Label-Free Mass Spectrometry-Based Proteomics Approaches. Journal of Proteome Research, 2009, 8, 3752:3759. pmid:19435289
  14. 14. Latterich M, Abramovitz M, Leyland-Jones B. Proteomics: new technologies and clinical applications, European Journal of Cancer, 2008, 44, 2737:2741. pmid:18977654
  15. 15. Bantscheff M, Schirrle M, Sweetman G, Rick J, Kuster B. Quantitative mass spectrometry in proteomics: a critical review, Analytical and Bioanalytical Chemistry, 2007, 389, 1017:1031. pmid:17668192
  16. 16. Bantscheff M, Lemeer S, Savitski MM, Kuster B. Quantitative mass spectrometry in proteomics: critical review update from 2007 to the present, Analytical and Bioanalytical Chemistry, 2012, 404, 939:965. pmid:22772140
  17. 17. Neilson KA, Ali NA, Muralidharan S, Mirzaei M, Mariani M, Assadourian G, et al. Less label, more free: Approaches in label‐free quantitative mass spectrometry, Proteomics, 2011, 11, 535:553. pmid:21243637
  18. 18. Elliott MH, Smith DS, Parker CE, Borchers C. Current trends in quantitative proteomics, Journal of Mass Spectrometry, 2009, 44, 1637:1660. pmid:19957301
  19. 19. Blein-Nicolas M, Zivy M. Thousand and one ways to quantify and compare protein abundances in label-free bottom-up proteomics, Biochimica et Biophysica Acta (BBA), 2016 Aug;1864(8):883–95. pmid:26947242
  20. 20. Di Girolamo F, Lante I, Muraca M, Putignani L. The role of mass spectrometry in the “omics” era, Current Organic Chemistry, 2013, 17, 2891:2905. pmid:24376367
  21. 21. Scigelova M, Makarov A. Orbitrap mass analyzer–overview and applications in proteomics, Proteomics, 2006, 6, Suppl 2, 16:21. pmid:17031791
  22. 22. Wang M, You J, Bemis KG, Tegeler TJ, Brown DP. Label-free mass spectrometry-based protein quantification technologies in proteomic analysis, Briefings in Functional Genomics and Proteomics, 2008, 7, 329:339. pmid:18579615
  23. 23. Matzke MM, Brown JN, Gritsenko MA, Metz TO, Pounds JG, Rodland KD, et al. A comparative analysis of computational approaches to relative protein quantification using peptide peak intensities in label‐free LC‐MS proteomics experiments, Proteomics, 2013, 13, 493:503. pmid:23019139
  24. 24. Lundgren DH, Hwang S, Wu L, Han DK. Role of spectral counting in quantitative proteomics, Expert Reviews of Proteomics, 2010, 7, 39:53. pmid:20121475
  25. 25. Hoehenwarter W, Wienkoop S. Spectral counting robust on high mass accuracy mass spectrometers, Rapid Communications in Mass Spectometry, 2010, 24, 3609:3614. pmid:21108307
  26. 26. Cappadona S, Baker PR, Cutillas PR, Heck AJ, van Breukelen B. Current challenges in software solutions for mass spectrometry-based quantitative proteomics, Amino Acids, 2012, 43, 1087:1108. pmid:22821268
  27. 27. Nahnsen S, Bielow C, Reinert K, Kohlbacher O. Tools for label-free peptide quantification, Molecular and Cellular Proteomics, 2013, 12, 549:556. pmid:23250051
  28. 28. Välikangas T, Suomi T, Elo LL. A comprehensive evaluation of popular proteomics software workflows for label-free proteome quantification and imputation, Briefings in bioinformatics, 2018, 19(1):1–11.
  29. 29. Välikangas T, Suomi T, Elo LL. A systematic evaluation of normalization methods in quantitative label-free proteomics, Brief Bioinform, 2018;19(1):1–11. pmid:27694351
  30. 30. Cox J, Hein MY, Lubert CA, Paron I, Nagaraj N, Mann M. Accurate proteome-wide label-free quantification by delayed normalization and maximal peptide ratio extraction, termed MaxLFQ, Molecular and Cellular Proteomics, 2014, 13, 2513:2526. pmid:24942700
  31. 31. Cox J, Mann M. MaxQuant enables high peptide identification rates, individualized ppb-range mass accuracies and proteome-wide protein quantification, Nature Biotechnology, 2008, 26, 1367:1372. pmid:19029910
  32. 32. Gokce E, Shuford CM, Franck WL, Dean RA, Muddiman DC. Evaluation of normalization methods on GeLC-MS/MS label-free spectral counting data to correct for variation during proteomic workflows, Journal of American Society of Mass Spectrometry, 2011, 22, 2199:2208. pmid:21952779
  33. 33. Zhu W, Smith JW, Huang CM. Mass spectrometry-based label-free quantitative proteomics, Journal of Biomedicine and Biotechnology, 2010,2010:840518. pmid:19911078
  34. 34. Old WM, Meyer-Arendt K, Aveline-Wolf L, Pierce KG, Mendoza A, Sevinsky JR, et al. Comparison of label-free methods for quantifying human proteins by shotgun proteomics, Molecular & Cellular Proteomics, 2005, 4, 1487:1502. pmid:15979981
  35. 35. Vitko Dijana, Cho Patricia S, Kostel Stephen A, DiMartino Shannon E, Cabour Lily D, Migliozzi Matthew A, et al. Characterizing Patients With Recurrent Urinary Tract Infections in Vesicoureteral Reflux: A Pilot Study of the Urinary Proteome. Mol Cell Proteomics 2020 Mar;19(3):456–466. Epub 2020 Jan 2. pmid:31896675
  36. 36. Lee HY, Kim EG, Jung HR, Jung JW, Kim HB, Cho JW, et al. Refinements of LC-MS/MS Spectral Counting Statistics Improve Quantification of Low Abundance Proteins. Sci Rep. 2019 Sep 20;9(1):13653. pmid:31541118
  37. 37. Hsu CW, Chang KP, Huang Y, Liu HP, Hsueh PC, Gu PW, et al. Proteomic Profiling of Paired Interstitial Fluids Reveals Dysregulated Pathways and Salivary NID1 as a Biomarker of Oral Cavity Squamous Cell Carcinoma. Mol Cell Proteomics. 2019 Oct;18(10):1939–1949. pmid:31315917
  38. 38. Tucci P, Portela M, Chetto CR, González-Sapienza G, Marín M. Integrative proteomic and glycoproteomic profiling of Mycobacterium tuberculosis culture filtrate. PLoS One. 2020 Mar 3;15(3):e0221837. eCollection 2020. pmid:32126063
  39. 39. Paoletti AC, Parmely TJ, Tomomori-Sato C, Sato S, Zhu D, Conaway RC, et al. Quantitative proteomic analysis of distinct mammalian Mediator complexes using normalized spectral abundance factors, Proceedings of the National Academy of Sciences of the United States of America, 2006 December 12, 103 (50) 18928–18933. pmid:17138671
  40. 40. Menalled LB, Kudwa AE, Miller S, Fitzpatrick J, Watson-Johnson J, Keating N, et al. Comprehensive behavioral and molecular characterization of a new knock-in mouse model of Huntington's disease: zQ175, PLoS One. 2012, 7(12):e49838. pmid:23284626
  41. 41. Bianconcini A, Lupo A, Capone S, Quadro L, Monti M, Zurlo D, et al. Transcriptional activity of the murine retinol-binding protein gene is regulated by a multiprotein complex containing HMGA1, p54 nrb/NonO, protein-associated splicing factor (PSF) and steroidogenic factor 1 (SF1)/liver receptor homologue 1 (LRH-1), Int J Biochem Cell Biol, 2009; 41(11):2189–203. pmid:19389484
  42. 42. Cox J, Neuhauser N, Michalski A, Scheltema RA, Olsen JV, Mann M. Andromeda: a peptide search engine integrated into the MaxQuant environment, Journal of Proteome Research, 2011, 10, 1794:1805. pmid:21254760
  43. 43. Howe E, Holton K, Nair SD, Schlauch D, Sinha R, Quackenbush J. MeV: MultiExperiment Viewer, Biomedical Informatics for Cancer Research, 2010, 267:277.
  44. 44. Karlsson L, González-Alvarado MN, Larrosa-Flor M, Osman A, Börjesson M, Blomgren K, et al. Constitutive PGC-1α Overexpression in Skeletal Muscle Does Not Improve Morphological Outcome in Mouse Models of Brain Irradiation or Cortical Stroke. Neuroscience 2018, 384, 314–328. pmid:29859976
  45. 45. Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, et al. STRING V11: Protein-Protein Association Networks with Increased Coverage, Supporting Functional Discovery in Genome-Wide Experimental Datasets. Nucleic Acids Res. 2019, 47 (D1), D607–D613. pmid:30476243
  46. 46. Beissbarth T, Hyde L, Smyth GK, Job C, Boon W, Tan S, et al. Statistical modeling of sequencing errors in SAGE libraries, Bioinformatics, 2004, 20, Suppl 1, i31:i39. pmid:15262778
  47. 47. Franich NR, Basso M, André EA, Ochaba J, Kumar A, Thein S, et al. Striatal Mutant Huntingtin Protein Levels Decline with Age in Homozygous Huntington's Disease Knock-In Mouse Models. J Huntingtons Dis., 2018, 7(2):137–150. pmid:29843246
  48. 48. Liu X, Miller BR, Rebec GV, Clemmer DE. Protein Expression in the Striatum and Cortex Regions of the Brain for a Mouse Model of Huntington's Disease, Journal of Proteome Research, 2007 Vol.6, 3134–3142. pmid:17625815
  49. 49. Choudhury KR, Das S, Bhattacharyya NP. Differential proteomic and genomic profiling of mouse striatal cell model of Huntington's disease and control; probable implications to the disease biology, Journal Of Proteomics, 2016 Vol. 132, 155–166. pmid:26581643
  50. 50. Mochel F, Haller RG. Energy deficit in Huntington disease: why it matters, The Journal of Clinical Investigations, 2011 Vol. 121, Issue 2, 493–499. pmid:21285522
  51. 51. van der Burg JMM, Bacos K, Wood NI, Lindqvist A, Wierup N, Woodman B, et al. Increased metabolism in the R6/2 mouse model of Huntington's disease, Neurobiology of Disease, 2008, Vol.29, 41–51. pmid:17920283
  52. 52. Damiano M, Galvan L, Déglon N, Brouillet E. Mitochondria in Huntington's disease, Biochimica et Biophysica Acta, 2010 Vol. 1802, 52–61. pmid:19682570
  53. 53. Pandey M, Mohanakumar KP, Usha R. Mitochondrial functional alterations in relation to pathophysiology of Huntington’s disease, Journal of Bioenergetics and Biomembranes, 2010 Vol. 42, 217–226. pmid:20464463
  54. 54. Orr AL, Li S, Wang C, Li H, Wang J, Rong J, et al. N-Terminal Mutant Huntingtin Associates with Mitochondria and Impairs Mitochondrial Trafficking The Journal of Neuroscience, 2008, Vol.28, Issue 11, 2783–2792. pmid:18337408
  55. 55. Giacomello M, Hudec R, Lopreiato R. Huntington's disease, calcium, and mitochondria, Biofactors, 2011 Vol.37, Issue 3, 206–218. pmid:21674644
  56. 56. Giacomello M, Oliveros JC, Naranjo JR, Carafoli E. Neuronal Ca2+ dyshomeostasis in Huntington disease, Prion, 2013 Vol.7, Issue 1, 76–84. pmid:23324594