Bottom-up, integrated -omics analysis identifies broadly dosage-sensitive genes in breast cancer samples from TCGA

Bobak D. Kechavarzi; Huanmei Wu; Thompson N. Doman

doi:10.1371/journal.pone.0210910

Abstract

The massive genomic data from The Cancer Genome Atlas (TCGA), including proteomics data from Clinical Proteomic Tumor Analysis Consortium (CPTAC), provides a unique opportunity to study cancer systematically. While most observations are made from a single type of genomics data, we apply big data analytics and systems biology approaches by simultaneously analyzing DNA amplification, mRNA and protein abundance. Using multiple genomic profiles, we have discovered widespread dosage compensation for the extensive aneuploidy observed in TCGA breast cancer samples. We do identify 11 genes that show strong correlation across all features (DNA/mRNA/protein) analogous to that of the well-known oncogene HER2 (ERBB2). These genes are generally less well-characterized regarding their role in cancer and we advocate their further study. We also discover that shRNA knockdown of these genes has an impact on cancer cell growth, suggesting a vulnerability that could be used for cancer therapy. Our study shows the advantages of systematic big data methodologies and also provides future research directions.

Citation: Kechavarzi BD, Wu H, Doman TN (2019) Bottom-up, integrated -omics analysis identifies broadly dosage-sensitive genes in breast cancer samples from TCGA. PLoS ONE 14(1): e0210910. https://doi.org/10.1371/journal.pone.0210910

Editor: Aamir Ahmad, University of South Alabama Mitchell Cancer Institute, UNITED STATES

Received: October 15, 2018; Accepted: January 3, 2019; Published: January 17, 2019

Copyright: © 2019 Kechavarzi et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The data underlying the results presented in the study are available from The Cancer Genome Atlas (TCGA) at https://portal.gdc.cancer.gov.

Funding: The authors received no specific funding for this work. Eli Lilly provided support in the form of salaries for authors BDK and TND. The funder provided tuition reimbursement for BDK. The funder did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.

Competing interests: We have the following interests. BDK and TND are employed by Eli Lilly. This research project is independent from Eli Lilly, but as a part of the Ph.D. dissertation project of Bobak, the first author. There are no patents, marketed products, or products in development due to the affiliation with Eli Lilly and this affiliation does not alter our adherence to all PLOS ONE policies on sharing data and materials.

Introduction

The scientific literature is replete with papers highlighting the complex interplay between chromosomal instability, aneuploidy, and cancer (e.g. [1–4]). Aneuploidy, the state of having other than the canonical or “euploid” number of chromosomes (46 for humans), is with only rare exceptions (Down syndrome, Trisomy 18) lethal in human embryonic development [5]. By contrast, aneuploidy is observed with very high frequency in cancer, leading the eminent German biologist Theodor Boveri to speculate as early as 1902 [6] that aneuploidy might have a causative role in the disease.

Despite previous investigations, there are still important questions. Is aneuploidy a cause or a side-effect of cancer? If the former, what factors associated with aneuploidy contribute to cancer cell fitness? Are there deleterious impacts of aneuploidy in cancer and how are they mitigated during tumorigenesis? More generally, what is the broader impact of aneuploidy on gene expression and resulting phenotypes?

DNA studies have found that amplification of chromosomal arms such as 20q and 8q [7] [8] occurs with high prevalence and correlates with cancer severity. Understanding how these amplifications impact gene expression and protein production is of great interest. The conventional wisdom regarding gene transcription and translation has been that “dosage” generally correlates with product: DNA to RNA to protein. Indeed, a recent report finds no evidence for widespread dosage compensation in yeast [9].

It is customary to use mRNA transcript abundance to identify disease-associated genes, but the impact of mRNA abundance on protein production is poorly understood. Correlational methods yield weak associations, even when considering protein half-lives and other chemical properties [10–13]. Other efforts have been made to integrate mRNA dynamics (half-life and fold energy) and RNA Binding Protein (RBP) interactions with expression data in S. cerevisiae and S. pombe to aid in predicting protein production from gene expression. Vogel et al. [14] illustrate how sequence elements (sequence lengths, secondary structures, etc.) can be used to explain variation in protein abundance. Understanding how DNA, RNA, and protein interact is a non-trivial task, but considering any of these features in isolation may yield sub-optimal results. This understanding could provide crucial details about tumorigenesis and cancer evolution, and may hold clues to potential cancer treatments.

In 2015 approximately 40,000 women died of breast cancer in the US alone [15]. In an effort to better profile and understand cancer, large public efforts have been initiated to gather patient data and comprehensively investigate it. The Cancer Genome Atlas (TCGA) collects data for patients across 34 types of cancer profiled using a wide array of ‘-omics’ platforms [16]. The unprecedented availability of cancer data, like TCGA, affords insights into the genomic foundation of these lethal diseases.

In this paper, we apply big data methods in a systematic fashion to observe the impact of DNA dosage on mRNA transcript levels and subsequent protein concentrations. We identify the prevalence of dosage compensation in TCGA breast cancer samples (BRCA), highlight dosage-sensitive genes, and investigate the role of these genes in cancer cell line survival.

Material and methods

The data used in this study were downloaded from multiple resources, including TCGA [17], Clinical Proteomic Tumor Analysis Consortium (CPTAC) [18], the Catalogue of Somatic Mutations in Cancer (COSMIC) [19], and Achilles short hairpin RNA (shRNA) data [20]. The data and processing approaches are briefly described below. Fig 1 illustrates the overall workflow.

Fig 1. Bottom-up, integrated analysis workflow.

Visual representation of the analytical workflow for identifying broadly dosage-sensitive genes. Green portions represent mRNA-based steps, red protein, blue CNV. Integrated and filtering steps are white. Briefly, data were acquired from their sources, joined with metadata, normalized, integrated, then filtered.

https://doi.org/10.1371/journal.pone.0210910.g001

The Cancer Genome Atlas (TCGA)

RNAseq V2 data of 114 normal control patients and 1102 patients with breast invasive carcinoma (BRCA) were downloaded from TCGA. For each of the 20532 genes, the median of the 114 normal values was used as an estimated baseline, denoted the Normal median RSEM. The fold change of each gene, Δ_exp(gene), was calculated for every patient as:

Δ_exp(gene) = log₂(Tumor RSEM / Normal median RSEM) (Eq 1)
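The fold-change step can be sketched as follows. This is a minimal illustration, assuming the ratio is taken on a log₂ scale against the per-gene normal median (as the Results describe for Eq 1). The function name, the optional `pseudocount` argument, and the toy values are hypothetical and not part of the published pipeline.

```python
import numpy as np

def log2_fold_change(tumor_rsem, normal_rsem, pseudocount=0.0):
    """Per-gene log2 fold change of each tumor sample against the normal median.

    tumor_rsem:  array-like of shape (n_genes, n_tumor_samples)
    normal_rsem: array-like of shape (n_genes, n_normal_samples)
    pseudocount: hypothetical offset to guard against zero counts; the
                 source does not state whether one was used.
    """
    tumor = np.asarray(tumor_rsem, dtype=float)
    normal = np.asarray(normal_rsem, dtype=float)
    # Baseline: median RSEM of each gene across all normal samples.
    normal_median = np.median(normal, axis=1, keepdims=True)
    return np.log2((tumor + pseudocount) / (normal_median + pseudocount))

# Toy data (illustrative only): 2 genes, 3 normal samples, 2 tumor samples.
normal = [[4.0, 8.0, 2.0], [16.0, 16.0, 16.0]]
tumor = [[16.0, 4.0], [4.0, 64.0]]
fc = log2_fold_change(tumor, normal)
```

In this toy case the first gene has a normal median of 4, so a tumor value of 16 corresponds to a fold change of 2 on the log₂ scale.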

The corresponding patient metadata was downloaded and mapped based on sample IDs extracted from the TCGA barcode. Level 3 TCGA copy number variant (CNV) data was extracted for all available patients. The TCGA CNV pipeline transforms a CNV value into a segment mean, where Segment mean = log₂(CNV/2). A copy number can be derived from the segment means by calculating 2 * (2^{segment mean}). With this, the diploid regions will have a segment mean value of zero, amplified regions will have positive values, and deletions will have negative values.
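The relationship between segment mean and copy number stated above can be checked with a short sketch. The function names are illustrative assumptions; only the two formulas come from the text.

```python
import math

def segment_mean_to_copy_number(segment_mean):
    """Invert the transform: copy number = 2 * 2**segment_mean."""
    return 2.0 * 2.0 ** segment_mean

def copy_number_to_segment_mean(copy_number):
    """Segment mean = log2(copy number / 2); zero for diploid regions."""
    return math.log2(copy_number / 2.0)

# Diploid, single-copy loss, and a four-copy amplification.
examples = {sm: segment_mean_to_copy_number(sm) for sm in (-1.0, 0.0, 1.0)}
```

A segment mean of 0 maps back to two copies, positive values to more than two, and negative values to fewer than two.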

Clinical proteomic tumor analysis consortium (CPTAC)

Mass spectrometry (MS) data for breast invasive carcinoma was downloaded from CPTAC; these abundances were reported as the log₂-ratio of the expression of the sample to a common, healthy pool [21]. Patient mRNA, protein, and CNV data was matched using TCGA barcode. Gene identifications for MS data were made using previously established methods [22] and were provided by CPTAC. mRNA, protein, and CNV data for a given gene were joined by gene symbol. The unshared relative protein abundance was matched to 106 patients with mRNA and protein abundance data [18].

Gene amplification and deletion

Segment regions were mapped to the UCSC genome coordinates for the hg38 build of the human genome. For regions that covered multiple genes, the segment mean was assigned to each gene. For genes with multiple calls, the maximum value was kept.
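One way to picture the segment-to-gene mapping is the sketch below. It assumes simple closed-interval overlap on the same chromosome. The gene symbols, coordinates, and function name are hypothetical placeholders rather than real hg38 annotations or the authors' code.

```python
def gene_segment_means(genes, segments):
    """Assign each gene the maximum segment mean among overlapping segments.

    genes:    {symbol: (chrom, start, end)}
    segments: iterable of (chrom, start, end, segment_mean)
    Genes with no overlapping segment are omitted from the result.
    """
    result = {}
    for symbol, (g_chrom, g_start, g_end) in genes.items():
        overlapping = [
            mean
            for s_chrom, s_start, s_end, mean in segments
            if s_chrom == g_chrom and s_start <= g_end and s_end >= g_start
        ]
        if overlapping:
            # Multiple calls for one gene: keep the maximum value.
            result[symbol] = max(overlapping)
    return result

# Hypothetical coordinates for illustration only.
genes = {
    "GENE_A": ("chr1", 100, 200),
    "GENE_B": ("chr1", 150, 400),
    "GENE_C": ("chr2", 10, 50),
}
segments = [("chr1", 50, 120, 0.5), ("chr1", 121, 500, -0.3)]
per_gene = gene_segment_means(genes, segments)
```

Here GENE_A spans both segments and keeps the larger value, GENE_B overlaps only the second segment, and GENE_C has no call.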

For genes across all 106 samples with protein and mRNA data, we observe 1,052,345 segment means in total. The average segment mean was 0.12 ± 0.02. We define a gene as amplified if its segment mean is greater than 0.2 and deleted if it is less than -0.2 [23]. Doing so, we find 837,531 normal segments, 213,361 amplified segments, and 1,453 segments with deletion events across the patients. Of those, 9835 genes were uniquely identified as normal, 9831 as amplified, and 1247 as deleted. Interestingly, only 15 patients of the 106 had no deletions.
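The amplification and deletion calls described above amount to a three-way threshold on the segment mean. A minimal sketch follows, assuming strict inequalities at ±0.2 as stated. The function name and the toy values are illustrative.

```python
from collections import Counter

def classify_segment_mean(segment_mean, threshold=0.2):
    """Label a segment mean as amplified, deleted, or normal."""
    if segment_mean > threshold:
        return "amplified"
    if segment_mean < -threshold:
        return "deleted"
    return "normal"

# Toy segment means, including the boundary values +/-0.2.
calls = [classify_segment_mean(v) for v in (-0.8, -0.2, 0.0, 0.12, 0.2, 0.35)]
counts = Counter(calls)
```

Values exactly at the threshold are treated as normal under this reading of the definition.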

To normalize protein and mRNA expression to a similar scale, a z-score normalization for protein and mRNA fold changes was performed as follows:

z = (x − μ) / σ (Eq 2)

where x represents the expression of a given gene, μ the mean expression for the data set, and σ the standard deviation.
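The z-score normalization can be illustrated with the following sketch. It assumes the population standard deviation (ddof = 0), which the text does not specify. The input values are toy numbers loosely spanning the reported ranges, not data from the study.

```python
import numpy as np

def z_score(values):
    """Standardize values to zero mean and unit standard deviation."""
    x = np.asarray(values, dtype=float)
    sigma = x.std()  # population standard deviation (ddof=0); an assumption
    if sigma == 0:
        raise ValueError("z-score is undefined when all values are identical")
    return (x - x.mean()) / sigma

# Toy fold changes on different dynamic ranges (illustrative only).
mrna_fc = [-16.0, -1.5, 0.0, 1.5, 17.0]
protein_fc = [-6.5, -0.5, 0.0, 0.7, 8.5]
mrna_z = z_score(mrna_fc)
protein_z = z_score(protein_fc)
```

After the transform both vectors share a mean of zero and unit spread, which is what allows the two data types to be compared on one scale.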

Cancer gene profiling

The Cancer Gene Census was downloaded from COSMIC [19]. Genes labelled as amplified were selected and mapped to the results of the integrated genomics analysis to annotate correlational signatures. The Pearson correlation coefficient was calculated pairwise between protein abundance fold-change, mRNA fold-change, and CNV amplification. Any gene with all correlation coefficients above 0.70 is called a “Broadly Dosage-Sensitive Gene,” or BDSG. This stringent cutoff was chosen to single out genes whose dosage sensitivity is highly unusual. Generally, there was poor correlational concordance among the features (Fig A in S1 File and Table A in S1 File). Genes below this threshold may still be informative, but they are not exemplars of this particular genomic conservation.
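The BDSG criterion can be sketched as a check on the three pairwise Pearson coefficients for one gene across patients. This is an illustration under stated assumptions: the function name is hypothetical, the cutoff is exposed as a parameter, and the per-patient values are invented toy data.

```python
import numpy as np

def is_broadly_dosage_sensitive(cnv, mrna, protein, cutoff=0.70):
    """Return (flag, pairwise) where flag is True if every pairwise
    Pearson correlation among CNV, mRNA, and protein exceeds the cutoff."""
    r = np.corrcoef([cnv, mrna, protein])
    pairwise = (r[0, 1], r[0, 2], r[1, 2])  # CNV-mRNA, CNV-protein, mRNA-protein
    return all(value > cutoff for value in pairwise), pairwise

# Toy per-patient values for one gene (illustrative only).
cnv = [0.0, 0.2, 0.5, 1.0, 1.5]
mrna = [0.1, 0.3, 0.9, 2.1, 2.9]
concordant_protein = [0.0, 0.2, 0.6, 1.2, 2.0]
compensated_protein = [0.5, -0.4, 0.6, -0.5, 0.4]

flag_concordant, _ = is_broadly_dosage_sensitive(cnv, mrna, concordant_protein)
flag_compensated, _ = is_broadly_dosage_sensitive(cnv, mrna, compensated_protein)
```

The second toy gene tracks copy number at the mRNA level but not at the protein level, so it fails the criterion even though one of its three correlations is high.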

Achilles shRNA

Achilles shRNA knockdown data was downloaded and subset for the BDSG genes, a selection of housekeeping genes [24], and genes acting as oncosuppressors or oncogenes (S1 Table). shRNA hairpins for each gene were selected based on the second-lowest log-ratio to avoid false positives. Hierarchical clustering was performed via the clustermap function in Python on the genes, as well as the cell types. Additional hierarchical clustering was performed excluding any cell types not related to breast models.
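The hairpin selection rule can be sketched as below for a single gene in a single cell line. The fallback for genes with only one hairpin is an assumption, since the text does not say how such cases were handled. Gene names and scores are invented for illustration, and the clustering step is not reproduced here.

```python
def second_lowest_score(hairpin_scores):
    """Summarize a gene by its second-lowest hairpin log-ratio.

    Using the second-lowest value, rather than the minimum, reduces the
    influence of a single extreme hairpin.
    """
    ordered = sorted(hairpin_scores)
    if not ordered:
        raise ValueError("at least one hairpin score is required")
    # Assumption: with a single hairpin, fall back to that value.
    return ordered[1] if len(ordered) > 1 else ordered[0]

# Toy log-ratios per gene (illustrative only).
hairpins = {
    "GENE_A": [-2.5, -0.4, -1.1, 0.3],
    "GENE_B": [-0.2],
}
gene_scores = {gene: second_lowest_score(s) for gene, s in hairpins.items()}
```

For GENE_A the extreme value of -2.5 is set aside and -1.1 is reported.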

Results and discussion

Genomics and protein analysis

RSEM distributions for the 106 breast cancer patients for 20531 genes were plotted for both the cancer and healthy samples. Initial observations of RSEM values show similar quantiles suggesting that global expression distributions are similar between tumor-matched normal and tumor samples (Fig B in S1 File). This alleviates concern for the impact of any batch-effect or other temporal anomalies in sample processing.

Wilcoxon signed-rank test of gene expression between healthy and cancer samples detected that over 10,000 genes had significantly different expression (p < 0.00005). This emphasizes that the cancer transcriptome varies dramatically from normal tissue.

The log₂ fold-change of mRNA from cancer to healthy was calculated as described in Eq 1, giving a mean mRNA fold change of -9.0*10⁻² ± 1.5 (max = 17, min = –16). This indicates a class of genes that have minimal change and another subset that shows large changes in expression, as shown by the distribution (Fig C in S1 File).

Fold changes were plotted for protein expression. For proteins, the mean fold change was 3*10⁻² ± 0.67 (max = 8.5, min = –6.5). Since the dynamic range of these effects is not equivalent in the two datasets, they were z-score normalized (Fig D in S1 File). After normalization (Fig E in S1 File), the overall distributions of mRNA and protein fold changes were not statistically different (p = 0.96, Wilcoxon Test). Therefore, our normalization practice is sufficient to integrate the datasets and attempt to find relationships. These relationships are visualized in Fig 2. The strikingly poor correlation between the two features emphasizes the difficulty in accurately inferring protein levels from given mRNA expression values. A D’Agostino’s K-squared test of the mRNA and protein correlations led to the rejection of the null hypothesis that the data was Gaussian (p < 0.0005). Further examination of QQ plots conveys a trend for a marginally lighter left tail and a heavier right tail to the distribution (Fig F in S1 File).

Fig 2. mRNA fold change versus protein fold change.

This represents the matched protein (y-axis) and mRNA (x-axis) z-score normalized, log₂ fold changes of disease to healthy samples for 106 patients and 20531 genes. Poor correlation amongst the points suggests that mRNA changes do not necessarily predict protein production changes.

https://doi.org/10.1371/journal.pone.0210910.g002

mRNA and protein dysregulation relative to copy number variation

There is a strong positive trend for correlation of mRNA fold change with segment means. This is intuitive: if a genomic region is amplified or deleted, transcriptional activity is impacted.

A surprising result is the number of anti-correlated genes when comparing mRNA versus protein fold-changes (Fig G in S1 File). There are 185 genes out of the 9835 with weak, negative correlations between mRNA and protein fold-changes. A negative correlation suggests that a regulatory mechanism is either additionally suppressing or enhancing transcript abundance. These correlations are of relatively low magnitude and lack statistical significance, but there may be underlying biological mechanisms at play that may be of interest in future studies. This notion of regulatory interactions is emphasized again by observing the correlation of protein changes to segment means (Fig H in S1 File). In this case there are generally weak correlations, supporting the concept that precursor genomic features are often not a reliable predictor of protein concentration [25].

Selecting genes and subsequent data based on the 20q chromosomal arm, a locus known to be frequently amplified in breast cancer [26], exemplifies how DNA amplification, mRNA abundance, and protein abundance may be discordant. In the case of the gene SLPI for one sample (TCGA barcode: D8-A13Y), there is strong DNA amplification, yet mRNA and protein abundance are very low. SLPI encodes secretory leukocyte protease inhibitor, which antagonizes paclitaxel in ovarian cancer cells [27]. Conversely, there are cases of concordance between amplification, mRNA abundance, and protein abundance. In the case of RIMS4, we see very high fold changes and amplification in certain members of the cohort. KM-survivability analysis [28] based on microarray data of over 1000 breast cancer patients indicates that high RIMS4 expression has a positive prognostic impact (P < 4.0 × 10⁻⁷).

Cancer gene profiling identifies broadly dosage-sensitive genes (BDSGs)

The mRNA, protein, and CNV data for genes labeled as amplified in breast cancer from COSMIC are shown in Fig 3A. Genes from across the genome that meet the Pearson correlation criteria (i.e., all correlations above 0.75) are displayed in Fig 3D and listed in Table 1. Among the genes in Table 1 is ERBB2 (HER2, Fig 3C), a well-known oncogene. In both Fig 3C and Fig 3D, we observe a strong dosage sensitivity that is atypical across the genome. The remaining 11 genes in Fig 3D are not identified in COSMIC at all. GRB7 (growth factor receptor-bound protein 7) overlaps with HER2 pathways and is coexpressed with HER2 in esophageal cancer [29]. RPS6KB1 is a kinase whose alterations have been associated with an increased risk of colorectal cancer [30]. These broadly dosage-sensitive genes (BDSGs) are observations on a seemingly rare conservation of the central dogma, yet they have minimal functional annotations in the scientific literature.

Fig 3. Protein vs mRNA fold changes with CNV amplification.

Figures represent the z-score normalized, log₂ fold change of mRNA (x-axis) versus protein (y-axis) and are colored by CNV segment mean values for samples from patients in the TCGA Breast Cancer dataset selected for: (a) Genes in the COSMIC database labelled as amplified, (b) known oncogene MYC, (c) known oncogene and BDSG HER2/ERBB2, and (d) all BDSGs. The trend in Fig 3A exemplifies how oncogenicity does not always correlate with dosage-sensitivity.

https://doi.org/10.1371/journal.pone.0210910.g003

Table 1. Broadly dosage-sensitive genes (BDSGs).

https://doi.org/10.1371/journal.pone.0210910.t001

shRNA data defines the role of BDSGs in cancer cell line growth

Fig 4 shows results of testing the impact of BDSGs on cancer cell line growth using shRNA. TUBB is a common housekeeping gene and the signature illustrated in Fig 4A is typical of this role. When it is knocked down by shRNA there is a very deleterious effect on cancer cell line growth, and it stands out as a singleton in both heatmaps. In Fig 4A we observe that PPME1 and UBE2Z consistently behave as tumor suppressor genes (TSGs) across all cell types; their silencing promotes cell viability. In breast cancer cell lines these genes cluster closely with PTEN and RB1, which were included as typical breast cancer TSGs. In contrast, GRB7 and RPS6KB1 have a generally negative impact on cell line viability in Fig 4A. However, when considering just breast cancer-specific cell lines, we observe that these genes cluster closely and exclusively with ERBB2. The differences in clustering behavior suggest that, unlike PPME1 and UBE2Z, GRB7 and RPS6KB1 act as oncogenes very similar to ERBB2. In fact, GRB7 is co-located with ERBB2 and may be upregulated as an adaptation to HER2 [29]. According to the breast-specific cell type clustering of shRNA data, BDSGs do not display a subtype-specific role. They are generally tumor suppressors or oncogenes across all breast cell lines.

Fig 4. Heatmap and hierarchical clustering of shRNA knockdown.

Section (a) represents all cell lines available in the Achilles project and (b) breast-specific cell lines. Rows are the selected genes: blue are housekeeping genes, red are oncogenes, green are oncosuppressors, and light blue are BDSGs; columns are the cell lines. Red cell values represent cellular proliferation, blue cellular death, and white no change. The clustering of ERBB2 and two BDSGs (GRB7, RPS6KB1) in (b) suggests an oncogenic role in breast cancer. PPME1 and UBE2Z signatures in (a) and (b) suggest an overall oncosuppressive role.

https://doi.org/10.1371/journal.pone.0210910.g004

Conclusions

In this paper we have shown that in the TCGA breast cancer cohort there is widespread dosage compensation for the extensive aneuploidy that is observed. The dosage of DNA does not generally correlate well with mRNA, nor does the latter correlate well with protein levels. A total of 11 genes show strong correlation across all features (DNA/mRNA/protein), analogous to that of the well-known oncogene HER2 (ERBB2). We refer to these genes as “Broadly Dosage-Sensitive Genes” or BDSGs. We note they are much less characterized in the literature as to their role, if any, in cancer. We advocate further study of BDSGs to better understand their potential effects on cancer. This may lead to new therapies for cancer or biomarkers for improved cancer detection.

From shRNA data, we show that knockdown of these genes has an impact on cancer cell growth. We speculate that tumor cells adopt unusual ploidies to take advantage of amplifications and deletions that functionally implicate only subsets of genes. These tumor cells may compensate for the dosage of a large number of “passenger” genes. This may be a vulnerability that could be used for cancer therapy, for example by de-repressing mRNA and/or protein production from these passenger genes. This may leave the tumor cell with potentially catastrophic levels of unneeded molecules or disrupted biological pathways.

We also caution that there may be significant pitfalls in drawing conclusions from a single type of genomics data. For example, gene expression (mRNA) data is widely used to infer biological pathway activation, but Fig 2 suggests this would be extremely misleading for exploring protein levels of Cancer Gene Census genes in TCGA Breast samples.

Supporting information

S1 Table. Housekeeping genes, oncosuppressors, and oncogenes selected for Achilles shRNA analysis.

https://doi.org/10.1371/journal.pone.0210910.s001

(XLSX)

S1 File. Additional figures profiling data characteristics.

https://doi.org/10.1371/journal.pone.0210910.s002

(PDF)

Acknowledgments

The authors would like to acknowledge Dr. Xiaowen Liu for advice regarding proteomic datasets, and Dr. Farhana Merzoug for advice on shRNA selection criteria.

References

1. Gordon DJ, Resio B, Pellman D. Causes and consequences of aneuploidy in cancer. Nat Rev Genet. 2012;13: 189. Available: http://dx.doi.org/10.1038/nrg3123 pmid:22269907
2. Giam M, Rancati G. Aneuploidy and chromosomal instability in cancer: a jackpot to chaos. Cell Div. 2015;10: 3. pmid:26015801
3. Santaguida S, Amon A. Short- and long-term effects of chromosome mis-segregation and aneuploidy. Nat Rev Mol Cell Biol. 2015;16: 473. Available: http://dx.doi.org/10.1038/nrm4025 pmid:26204159
4. Bloomfield M, Duesberg P. Inherent variability of cancer-specific aneuploidy generates metastases. Molecular Cytogenetics. London; 2016. pmid:28018487
5. O’Connor C. Chromosomal Abnormalities: Aneuploidies. Nat Educ. 2008;1: 172.
6. Boveri T. Concerning the origin of malignant tumours by Theodor Boveri. Translated and annotated by Henry Harris. J Cell Sci. The Company of Biologists Ltd; 2008; 1–84. pmid:18089652
7. Parris TZ, Kovacs A, Hajizadeh S, Nemes S, Semaan M, Levin M, et al. Frequent MYC coamplification and DNA hypomethylation of multiple genes on 8q in 8p11-p12-amplified breast carcinomas. Oncogenesis. 2014;3: e95. pmid:24662924
8. Tabach Y, Kogan-Sakin I, Buganim Y, Solomon H, Goldfinger N, Hovland R, et al. Amplification of the 20q chromosomal arm occurs early in tumorigenic transformation and may initiate cancer. PLoS One. 2011;6: e14632. pmid:21297939
9. Torres EM, Springer M, Amon A. No current evidence for widespread dosage compensation in S. cerevisiae. Odom DT, editor. Elife. eLife Sciences Publications, Ltd; 2016;5: e10996. pmid:26949255
10. Dumaual CM, Steere BA, Walls CD, Wang M, Zhang Z-Y, Randall SK. Integrated Analysis of Global mRNA and Protein Expression Data in HEK293 Cells Overexpressing PRL-1. PLoS One. 2013;8: e72977. pmid:24019887
11. Greenbaum D, Colangelo C, Williams K, Gerstein M. Comparing protein abundance and mRNA expression levels on a genomic scale. Genome Biol. 2003;4: 117. Available: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC193646/ pmid:12952525
12. Maier T, Güell M, Serrano L. Correlation of mRNA and protein in complex biological samples. FEBS Lett. 2009;583: 3966–3973. http://dx.doi.org/10.1016/j.febslet.2009.10.036 pmid:19850042
13. Tuller T, Kupiec M, Ruppin E. Determinants of Protein Abundance and Translation Efficiency in S. cerevisiae. PLoS Comput Biol. 2007;3: e248. pmid:18159940
14. Vogel C, de Sousa Abreu R, Ko D, Le S, Shapiro BA, Burns SC, et al. Sequence signatures and mRNA concentration can explain two‐thirds of protein abundance variation in a human cell line. Mol Syst Biol. 2010;6. pmid:20739923
15. Cancer Facts & Figures 2015 [Internet]. 2015. Available: http://www.cancer.org/research/cancerfactsstatistics/cancerfactsfigures2015/index
16. Tibes R, Qiu Y, Lu Y, Hennessy B, Andreeff M, Mills GB, et al. Reverse phase protein array: validation of a novel proteomic technology and utility for analysis of primary leukemia specimens and hematopoietic stem cells. Mol Cancer Ther. 2006;5: 2512–2521. pmid:17041095
17. National Cancer Institute, National Human Genome Research Institute. The Cancer Genome Atlas [Internet]. 2010 [cited 1 Jan 2015]. Available: http://cancergenome.nih.gov/
18. Ellis MJ, Gillette M, Carr SA, Paulovich AG, Smith RD, Rodland KK, et al. Connecting Genomic Alterations to Cancer Biology with Proteomics: The NCI Clinical Proteomic Tumor Analysis Consortium. Cancer Discov. 2013;3: 1108–1112. pmid:24124232
19. Forbes SA, Beare D, Gunasekaran P, Leung K, Bindal N, Boutselakis H, et al. COSMIC: exploring the world’s knowledge of somatic mutations in human cancer. Nucleic Acids Res. 2015;43: D805–D811. pmid:25355519
20. Cowley GS, Weir BA, Vazquez F, Tamayo P, Scott JA, Rusin S, et al. Parallel genome-scale loss of function screens in 216 cancer cell lines for the identification of context-specific genetic dependencies. Sci Data. 2014;1: 140035. Available: http://dx.doi.org/10.1038/sdata.2014.35 pmid:25984343
21. Edwards NJ, Markey SP, Stein SE. Protein Reports–CPTAC Common Data Analysis Pipeline (CDAP) [Internet]. 2016. Available: https://cptac-data-portal.georgetown.edu/cptac/documents/CDAP_ProteinReports_description_20160503.pdf
22. Reiter L, Claassen M, Schrimpf SP, Jovanovic M, Schmidt A, Buhmann JM, et al. Protein Identification False Discovery Rates for Very Large Proteomics Data Sets Generated by Tandem Mass Spectrometry. Mol & Cell Proteomics. 2009;8: 2405–2417. Available: http://www.mcponline.org/content/8/11/2405.abstract
23. Laddha S V, Ganesan S, Chan CS, White E. Mutational Landscape of the Essential Autophagy Gene BECN1 in Human Cancers. Mol Cancer Res. 2014;12: 485–490. pmid:24478461
24. Technologies N. Reference Genes for Normalization of Expression Data [Internet]. 2009 [cited 1 Jan 2016]. Available: https://www.nanostring.com/application/files/7014/8943/0117/TN_Normalization_of_Expression_Data.pdf
25. Liu Y, Beyer A, Aebersold R. On the Dependency of Cellular Protein Levels on mRNA Abundance. Cell. 2016. pp. 535–550. pmid:27104977
26. Garcia-Murillas I, Sharpe R, Pearson A, Campbell J, Natrajan R, Ashworth A, et al. An siRNA screen identifies the GNAS locus as a driver in 20q amplified breast cancer. Oncogene. Nature Publishing Group; 2014;33: 2478–2486. pmid:23752180
27. Rasool N, LaRochelle W, Zhong H, Ara G, Cohen J, Kohn EC. Secretory Leukocyte Protease Inhibitor, SLPI, Antagonizes Paclitaxel in Ovarian Cancer Cells. Clin Cancer Res. 2010;16: 600–609. pmid:20068074
28. Györffy B, Lanczky A, Eklund AC, Denkert C, Budczies J, Li Q, et al. An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients. Breast Cancer Res Treat. 2010;123: 725–731. pmid:20020197
29. Tanaka S, Mori M, Akiyoshi T, Tanaka Y, Mafune K, Wands JR, et al. Coexpression of Grb7 with Epidermal Growth Factor Receptor or Her2/erbB2 in Human Advanced Esophageal Carcinoma. Cancer Res. 1997;57: 28. Available: http://cancerres.aacrjournals.org/content/57/1/28.abstract pmid:8988034
30. Slattery ML, Lundgreen A, Herrick JS, Wolff RK. Genetic variation in RPS6KA1, RPS6KA2, RPS6KB1, RPS6KB2, and PDK1 and risk of colon or rectal cancer. Mutat Res Mol Mech Mutagen. 2011;706: 13–20. http://dx.doi.org/10.1016/j.mrfmmm.2010.10.005

[ref1] 1. Gordon DJ, Resio B, Pellman D. Causes and consequences of aneuploidy in cancer. Nat Rev Genet. Nature Publishing Group, a division of Macmillan Publishers Limited. All Rights Reserved.; 2012;13: 189. Available: http://dx.doi.org/10.1038/nrg3123 pmid:22269907
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Giam M, Rancati G. Aneuploidy and chromosomal instability in cancer: a jackpot to chaos. Cell Div. 2015;10: 3. pmid:26015801
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Santaguida S, Amon A. Short- and long-term effects of chromosome mis-segregation and aneuploidy. Nat Rev Mol Cell Biol. Nature Publishing Group, a division of Macmillan Publishers Limited. All Rights Reserved.; 2015;16: 473. Available: http://dx.doi.org/10.1038/nrm4025 pmid:26204159
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Bloomfield M, Duesberg P. Inherent variability of cancer-specific aneuploidy generates metastases. Molecular Cytogenetics. London; 2016. pmid:28018487
View Article
PubMed/NCBI
Google Scholar

[14] View Article

[15] PubMed/NCBI

[16] Google Scholar

[ref5] 5. O’Connor C. Chromosomal Abnormalities: Aneuploidies. Nat Educ. 2008;1: 172.
View Article
Google Scholar

[18] View Article

[19] Google Scholar

[ref6] 6. Boveri T. Concerning the origin of malignant tumours by Theodor Boveri. Translated and annotated by Henry Harris. J Cell Sci. The Company of Biologists Ltd; 2008; 1–84. pmid:18089652
View Article
PubMed/NCBI
Google Scholar

[21] View Article

[22] PubMed/NCBI

[23] Google Scholar

[ref7] 7. Parris TZ, Kovacs A, Hajizadeh S, Nemes S, Semaan M, Levin M, et al. Frequent MYC coamplification and DNA hypomethylation of multiple genes on 8q in 8p11-p12-amplified breast carcinomas. Oncogenesis. 2014;3: e95. pmid:24662924
View Article
PubMed/NCBI
Google Scholar

[25] View Article

[26] PubMed/NCBI

[27] Google Scholar

[ref8] 8. Tabach Y, Kogan-Sakin I, Buganim Y, Solomon H, Goldfinger N, Hovland R, et al. Amplification of the 20q chromosomal arm occurs early in tumorigenic transformation and may initiate cancer. PLoS One. 2011;6: e14632. pmid:21297939
View Article
PubMed/NCBI
Google Scholar

[29] View Article

[30] PubMed/NCBI

[31] Google Scholar

[ref9] 9. Torres EM, Springer M, Amon A. No current evidence for widespread dosage compensation in S. cerevisiae. Odom DT, editor. Elife. eLife Sciences Publications, Ltd; 2016;5: e10996. pmid:26949255
View Article
PubMed/NCBI
Google Scholar

[33] View Article

[34] PubMed/NCBI

[35] Google Scholar

[ref10] 10. Dumaual CM, Steere BA, Walls CD, Wang M, Zhang Z-Y, Randall SK. Integrated Analysis of Global mRNA and Protein Expression Data in HEK293 Cells Overexpressing PRL-1. PLoS One. 2013;8: e72977. pmid:24019887

[ref11] 11. Greenbaum D, Colangelo C, Williams K, Gerstein M. Comparing protein abundance and mRNA expression levels on a genomic scale. Genome Biol. 2003;4: 117. Available: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC193646/ pmid:12952525

[ref12] 12. Maier T, Güell M, Serrano L. Correlation of mRNA and protein in complex biological samples. FEBS Lett. 2009;583: 3966–3973. Available: http://dx.doi.org/10.1016/j.febslet.2009.10.036 pmid:19850042

[ref13] 13. Tuller T, Kupiec M, Ruppin E. Determinants of Protein Abundance and Translation Efficiency in S. cerevisiae. PLoS Comput Biol. 2007;3: e248. pmid:18159940

[ref14] 14. Vogel C, de Sousa Abreu R, Ko D, Le S, Shapiro BA, Burns SC, et al. Sequence signatures and mRNA concentration can explain two‐thirds of protein abundance variation in a human cell line. Mol Syst Biol. 2010;6. pmid:20739923

[ref15] 15. Cancer Facts & Figures 2015 [Internet]. 2015. Available: http://www.cancer.org/research/cancerfactsstatistics/cancerfactsfigures2015/index

[ref16] 16. Tibes R, Qiu Y, Lu Y, Hennessy B, Andreeff M, Mills GB, et al. Reverse phase protein array: validation of a novel proteomic technology and utility for analysis of primary leukemia specimens and hematopoietic stem cells. Mol Cancer Ther. 2006;5: 2512–2521. pmid:17041095

[ref17] 17. National Cancer Institute, National Human Genome Research Institute. The Cancer Genome Atlas [Internet]. 2010 [cited 1 Jan 2015]. Available: http://cancergenome.nih.gov/

[ref18] 18. Ellis MJ, Gillette M, Carr SA, Paulovich AG, Smith RD, Rodland KK, et al. Connecting Genomic Alterations to Cancer Biology with Proteomics: The NCI Clinical Proteomic Tumor Analysis Consortium. Cancer Discov. 2013;3: 1108–1112. pmid:24124232

[ref19] 19. Forbes SA, Beare D, Gunasekaran P, Leung K, Bindal N, Boutselakis H, et al. COSMIC: exploring the world’s knowledge of somatic mutations in human cancer. Nucleic Acids Res. 2015;43: D805–D811. pmid:25355519

[ref20] 20. Cowley GS, Weir BA, Vazquez F, Tamayo P, Scott JA, Rusin S, et al. Parallel genome-scale loss of function screens in 216 cancer cell lines for the identification of context-specific genetic dependencies. Sci Data. 2014;1: 140035. Available: http://dx.doi.org/10.1038/sdata.2014.35 pmid:25984343

[ref21] 21. Edwards NJ, Markey SP, Stein SE. Protein Reports–CPTAC Common Data Analysis Pipeline (CDAP) [Internet]. 2016. Available: https://cptac-data-portal.georgetown.edu/cptac/documents/CDAP_ProteinReports_description_20160503.pdf

[ref22] 22. Reiter L, Claassen M, Schrimpf SP, Jovanovic M, Schmidt A, Buhmann JM, et al. Protein Identification False Discovery Rates for Very Large Proteomics Data Sets Generated by Tandem Mass Spectrometry. Mol & Cell Proteomics. 2009;8: 2405–2417. Available: http://www.mcponline.org/content/8/11/2405.abstract

[ref23] 23. Laddha S V, Ganesan S, Chan CS, White E. Mutational Landscape of the Essential Autophagy Gene BECN1 in Human Cancers. Mol Cancer Res. 2014;12: 485–490. pmid:24478461

[ref24] 24. NanoString Technologies. Reference Genes for Normalization of Expression Data [Internet]. 2009 [cited 1 Jan 2016]. Available: https://www.nanostring.com/application/files/7014/8943/0117/TN_Normalization_of_Expression_Data.pdf

[ref25] 25. Liu Y, Beyer A, Aebersold R. On the Dependency of Cellular Protein Levels on mRNA Abundance. Cell. 2016. pp. 535–550. pmid:27104977

[ref26] 26. Garcia-Murillas I, Sharpe R, Pearson A, Campbell J, Natrajan R, Ashworth A, et al. An siRNA screen identifies the GNAS locus as a driver in 20q amplified breast cancer. Oncogene. Nature Publishing Group; 2014;33: 2478–2486. pmid:23752180

[ref27] 27. Rasool N, LaRochelle W, Zhong H, Ara G, Cohen J, Kohn EC. Secretory Leukocyte Protease Inhibitor, SLPI, Antagonizes Paclitaxel in Ovarian Cancer Cells. Clin Cancer Res. 2010;16: 600–609. pmid:20068074

[ref28] 28. Györffy B, Lanczky A, Eklund AC, Denkert C, Budczies J, Li Q, et al. An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients. Breast Cancer Res Treat. 2010;123: 725–731. pmid:20020197

[ref29] 29. Tanaka S, Mori M, Akiyoshi T, Tanaka Y, Mafune K, Wands JR, et al. Coexpression of Grb7 with Epidermal Growth Factor Receptor or Her2/erbB2 in Human Advanced Esophageal Carcinoma. Cancer Res. 1997;57: 28. Available: http://cancerres.aacrjournals.org/content/57/1/28.abstract pmid:8988034

[ref30] 30. Slattery ML, Lundgreen A, Herrick JS, Wolff RK. Genetic variation in RPS6KA1, RPS6KA2, RPS6KB1, RPS6KB2, and PDK1 and risk of colon or rectal cancer. Mutat Res Mol Mech Mutagen. 2011;706: 13–20. Available: http://dx.doi.org/10.1016/j.mrfmmm.2010.10.005