Adjuvant hormonal therapy is administered to all early stage ER+ breast cancers, and has led to significantly improved survival. Unfortunately, a subset of ER+ breast cancers suffer early relapse despite hormonal therapy. To identify molecular markers associated with early relapse in ER+ breast cancer, an outlier analysis method was applied to a published gene expression dataset of 268 ER+ early-stage breast cancers treated with tamoxifen alone. Increased expression of sets of genes that clustered in chromosomal locations consistent with the presence of amplicons at 8q24.3, 8p11.2, 17q12 (HER2 locus) and 17q21.33-q25.1 were each found to be independent markers for early disease recurrence. Distant metastasis free survival (DMFS) after 10 years for cases with any amplicon (DMFS = 56.1%, 95% CI = 48.3–63.9%) was significantly lower (P = 0.0016) than cases without any of the amplicons (DMFS = 87%, 95% CI = 76.3% –97.7%). The association between presence of chromosomal amplifications in these regions and poor outcome in ER+ breast cancers was independent of histologic grade and was confirmed in independent clinical datasets. A separate validation using a FISH-based assay to detect the amplicons at 8q24.3, 8p11.2, and 17q21.33-q25.1 in a set of 36 early stage ER+/HER2- breast cancers treated with tamoxifen suggests that the presence of these amplicons are indeed predictive of early recurrence. We conclude that these amplicons may serve as prognostic markers of early relapse in ER+ breast cancer, and may identify novel therapeutic targets for poor prognosis ER+ breast cancers.
Citation: Bilal E, Vassallo K, Toppmeyer D, Barnard N, Rye IH, Almendro V, et al. (2012) Amplified Loci on Chromosomes 8 and 17 Predict Early Relapse in ER-Positive Breast Cancers. PLoS ONE 7(6): e38575. https://doi.org/10.1371/journal.pone.0038575
Editor: Jin Q. Cheng, H.Lee Moffitt Cancer Center & Research Institute, United States of America
Received: January 19, 2012; Accepted: May 7, 2012; Published: June 13, 2012
Copyright: © 2012 Bilal et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: EB was supported in part by a grant from the New Jersey Commission on Cancer Research. SG has received support from the National Institutes of Health, and the New Jersey Commission on Cancer Research. HR is funded by the Norwegian Cancer Association, The Norwegian Research Council, Raagholdt Foundation, Thorsteds Foundation and Radiumhospitalets legater. VA is funded by Cellex Foundation. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: SG is a consultant for Ibris Inc. SG, EB and GB are co-inventors on a patent entitled “GENETIC MARKERS AND DIAGNOSTIC METHODS FOR RESISTANCE OF BREAST CANCER TO HORMONAL THERAPIES” United States Patent Application Serial No. 13/220,434. This does not alter the authors' adherence to all the PLoS ONE policies on sharing data and materials. The other authors have no competing interests.
Hormone therapy is widely used for treatment of estrogen receptor positive (ER+) breast cancer and has been shown to result in significantly improved survival and lower rates of recurrence (reviewed in , ). However, a significant subset of ER+ breast cancer patients treated with adjuvant hormone therapy suffer early disease recurrence. These poor prognosis ER+ tumors tend to have higher grade and show higher proliferative indices and may not be “addicted” to ER –dependent signalling, making them resistant to hormone therapy and prone to early relapse (reviewed in –). A better understanding of the mechanisms underlying the early relapse of some ER+ breast cancers may lead to better prognostic assays, and to new targeted therapeutic strategies for these poor prognosis cancers.
Several assays have been developed to distinguish ER+ patients likely to do well with hormonal therapy from those likely to have early disease progression. The best validated of these is the Oncotype Dx® assay  from Genomic Health, Inc., based on RT-PCR measurement of mRNA levels of 21 genes. ER+ breast cancer patients whose tumors have low ODx Recurrence Scores (RS) do well with adjuvant hormonal therapy alone, while tumors with high ODx RS are more likely to benefit from the addition of chemotherapy to hormonal therapy. Other panels of genes, such as the Genomic Grade Index panel , and clinical markers such as histological grade, are also used to classify patients into good or poor prognosis classes. In addition, molecular signatures from clustering methods applied to gene-expression data are also able to separate ER+ breast cancers into good prognosis (Luminal A) and poor prognosis (Luminal B) classes –. However, several studies have shown that, when the prognostic assays are compared to the gene expression based sub-classification of breast cancers, these assays are essentially identifying Luminal A tumors (low grade, highly ER+ breast cancers, HER2-) as being good prognosis, and Luminal B, ER+ breast cancers (which are ER+, mostly intermediate-to-high grade, some with HER2 amplification) as poor prognosis –.
Although gene expression based assays such as Oncotype Dx have prognostic and predictive utility, they do not identify the biologic pathways driving resistance in the poor prognosis tumors. Moreover the optimal strategy for “Intermediate Risk” ODx RS, found in up to 30% of ER+ cancers, is not clear at present. In contrast, the presence of the HER2 amplicon, in ER+ breast cancers, has both clear prognostic value and identifies a clear and effective therapeutic target. ER+ breast cancers with HER2-amplification tend to have early recurrence if treated with hormonal therapy alone, likely because the activation of the HER2 pathway leads to independence from ER- mediated signalling (see reviews above, also , ). Moreover, therapy that specifically targets HER2 has been shown to dramatically improve outcome in HER2+ patients. Thus all breast cancers are now routinely tested for the presence of HER2 amplification.
As HER2 amplicon genes are part of the 21 gene panel used in determining the Oncotype Dx recurrence score (RS), breast cancers with HER2 amplification generally have high RS, high histological grade, and a high genomic grade and are easily and correctly identified as poor prognosis by the assay. However, the majority of poor prognosis ER+ cancers with high ODx RS do not have HER2 amplification . Indeed only patients with ER+ tumors and no evidence of HER2 amplification have Oncotype DX assays performed in most clinical settings. At present there is little insight into the mechanism driving estrogen independence and growth in poor prognosis ER+/HER2- breast cancers.
In order to gain insight into the biology of these poor prognosis ER+/HER2- breast cancers, we analyzed a public gene expression data set of early stage ER+ breast cancers treated with tamoxifen using a novel method. Sets of outlier genes whose expression correlated with clinical outcome were analyzed to identify either molecular pathways or enrichment of chromosomal regions. Four separate regions of the genome were identified whose amplification was highly predictive of poor prognosis in early stage ER+ breast cancers treated with tamoxifen. As expected, one of these was the HER2 amplicon on 17q12 , ; validating our methods as being able to identify relevant amplicons. The other three amplification regions were in 8q24.3, 8p11.2 and 17q21.33-q25.1. Although these loci have previously been identified as regions of amplification in subsets of breast cancer , their association with tamoxifen resistance in ER+/HER2- breast cancers is novel. The presence of these amplicons in ER+/HER2- breast cancer and their association with poor prognosis was validated in several independent data sets . Taken together, these findings demonstrate that these amplicons are strong predictors of early relapse in ER+ breast cancers.
Outlier Genes and Patterns Associated with Tamoxifen Treatment Response
A gene expression dataset (published by Loi et al. , ) containing 268 patients with early stage ER+ breast cancers treated with local therapy and adjuvant tamoxifen with 9+ years of available clinical follow-up data, was analyzed. Clinical characteristics of this set have been previously described (Table S1).
Genes whose expression values were outliers in at least 10 samples in this dataset were identified and analyzed for their correlation with distant metastasis free survival. Outlier genes for which there was a significant difference in distant metastasis free survival between samples having outlier expression when compared to samples with normal expression, were identified and retained (see Methods for details). Table S2 has the set of outlier genes, hazard ratios, log-rank P values and outlier scores.
Principle component analysis (PCA) demonstrated that the outlier genes separated into 3 clusters (Figure 1A). Survival analysis of these clusters showed that one cluster contained genes whose over-expression associated with poor prognosis, and the other two contained genes over-expressed in good prognosis samples. The set of outlier genes in each cluster was analysed using Gene Ontology (GO)  to identify pathways and potential chromosomal amplifications associated with outcome (Table S3). Pathways enriched in over-expressed outliers associated with good prognosis included development, cell adhesion, and immune response genes. Of note, no clusters of outliers associated with good prognosis suggestive of an underlying amplicon were detected. Outlier genes whose over-expression was associated with poor prognosis had a significant enrichment of genes in cell cycle pathways. Analysis of outliers for clustering by chromosomal location identified putative amplification of four chromosomal regions associated with poor prognosis: 17q12, 17q21.33-q25.1, 8p11.2 and 8q24.3. The presence of genomic amplification in any of these regions leads to outlier expression of their genes, and is a marker of poor prognosis in ER+ breast cancer.
Principal component analysis of high outlier genes (A) and low outlier genes (B) associated with differential distant metastasis free survival are shown. The figure represents the projection of each gene’s outlier profile on the first two principal components of the corresponding matrix. Gene clusters associated with good prognosis are circled in blue while gene clusters associated with bad prognosis are circled in red. Over-expressed genes associated with poor prognosis, which map to the chromosomal regions 8q24.3, 8p11.2, 17q21.33-q25.1 and 17q12, and are associated with specific GO pathways are labeled with different colors.
Cell cycle pathway outliers contained genes associated with proliferation and included many of the genes used to define the Genomic Grade Index . This confirms prior observations that proliferation-associated genes are strong markers of poor prognosis in ER+ breast cancer. The known amplicon on 17q12  associated with poor prognosis was also identified. This amplicon contains the HER2 gene and is known to be associated with relative resistance to hormonal therapy and poor prognosis in ER+ breast cancer. The other putative amplicons in 17q21.33-q25.1 –, 8p11.2 ,  and 8q24.3  have been previously reported as amplified in subsets of breast cancers but their association with tamoxifen resistance in ER+ breast cancer is a novel finding. The full list of outlier genes identified in these amplified chromosomal regions is listed in Table 1, with potential oncogenes highlighted in red. Some of these genes have been previously identified as playing a role in tumorigenesis or cancer progression including, WHSC1L1 , , CLTC –, HSF1 , and LSM1 . Of note the FGFR1 which has been implicated in hormonal resistance in ER+ breast cancer , is present at the edges of the 8p11.2 amplicon, but is not present in our minimal amplicon defined by our analysis (see Table S2). Similarly MYC, another oncogene reported to induce hormone resistance in breast cancer, is upstream of the 8p24.3 amplicon defined by our analysis, and is not associated with poor outcome in this dataset (see Table S2).
For under-expressed outliers, a similar analysis showed that relative under-expression of the cell cycle pathway was associated with good prognosis, while under-expression of the immune response and cell adhesion pathway was associated with poor prognosis (Figure 1B). This mirrors the results for over-expressed outlier genes and confirms the strong association of the cell cycle, immune response and cell adhesion pathways with prognosis in ER+ breast cancers.
Correlations between Cell Cycle Pathway and Putative Amplicons in 17q12, 17q21.33-q25, 8p11.2, and 8q24.3
To examine the inter-relationship between the cell cycle pathway and the four potential amplicons identified by our analysis, a correlation matrix of all genes associated with poor outcome was computed (Figure S1). Correlations between the presence of each amplicon and any amplicon or the cell cycle pathway are shown in Table S4. We find that the cell cycle pathway correlates partly with all the amplicons (Figure S1, Table S4), suggesting that activation of cell cycle pathway is associated with chromosomal amplifications in 17q12, 17q21.33-q25, 8p11.2, and 8q24.3. Of note, the cell cycle genes themselves are not located in these amplicon regions.
The association between cell cycle and putative amplicons was further examined. Samples with enrichment of any of the four amplicons or the cell cycle pathway were identified by requiring at least 50% of gene markers in each group to be over-expressed, i.e. marked as a high outlier in the respective sample. It was found that in most samples (90.5%), over-expression of cell cycle genes display at least one of the four chromosomal amplifications, suggesting a causal relationship between tumor proliferation and the presence of these amplicons.
However, chromosomal amplifications in 17q12, 17q21.33-q25, 8p11.2, and 8q24.3 have poor to medium correlations with each other (Table S4), suggesting that the presence and effects of each amplicon may be functionally independent.
Presence of Amplicons is Associated with Poor Outcome in ER+ Breast Cancers in Multiple Independent Datasets
The effects of the presence of outliers in the cell cycle pathway, and of each of the four amplicons on distant metastasis free survival (DMFS) was determined. Presence of cell cycle pathway genes was found associated with significantly lower DMFS (log-rank P = 0.0013), as well as higher hazard ratio (HR = 9.71, 95% CI = 3.3–28.6) in ER+ breast cancers, compared to tumors that lack this signature (Figure 2A). Presence of any of the four amplicons was also associated with lower DMFS compared to tumors without amplicons (Figure 2B). Hazard ratios for samples with amplicons on 17q12, 17q21.33-q25, 8p11.2 or 8q24.3 vs. no amplicons were 4.09, 3.14, 3.75, and 4.29 respectively, while log-rank P values for the DMFS differences were 6.3e−07, 3.0e−04, 5.7e−06, and 2.2e−06.
A) Kaplan-Meier curves of the samples in the primary dataset (GSE6532) enriched for over-expressed cell cycle genes versus the rest of samples that don’t show this feature. Patients with cell cycle activated genes show a significant decrease in distant metastasis free survival rate (HR = 9.71, 95% CI = 3.3–28.6; P<0.0001). B) Kaplan-Meier curves of the ER+ samples in the primary dataset (GSE6532) stratified by presence of putative amplicons in 17q12, 17q21.33-q25.1, 8p11.2 and 8q24.3. Patients that show any one of the chromosomal amplifications have significantly higher relapse rates when compared to samples without any amplifications: 17q12 (HR = 4.09, 95% CI = 3.84–21.99; P = 6.3e−07), 17q21.33– q25.1 (HR = 3.14, 95% CI = 2.17–13.62; P = 3.0e−04), 8p11.2 (HR = 3.75, 95% CI = 3.18–18.31; P = 5.7e−06), and 8q24.3 (HR = 4.29, 95% CI = 4.32–34.08; P = 2.2e−06). C) Analysis of combined gene expression data of 624 ER+ breast cancers from multiple published data sets. Outlier analysis was performed to identify cases with evidence of amplification at 17q12, 17q22, 8p11.2, and 8q24.3 and those without evidence of any amplification. Kaplan-Meier curves of relapse free survival for ER+ samples with each of the four amplicons, and samples containing no amplicon are plotted: 17q12 (HR = 2.30, 95% CI = 1.45–3.64; P = 4.0e−04), 17q22 (HR = 3.07, 95% CI = 1.99–4.73; P<1.0e−04), 8p11.2 (HR = 1.96, 95% CI = 1.23–3.13; P = 4.9e−3), 8q24.3 (HR = 2.38, 95% CI = 1.60–3.55; P<1.0e−04) D) Kaplan-Meier curves of overall survival for the ER+ samples in the test CGH dataset (GSE22133) with each of the 4 amplicons, as well as samples that don’t have any of the chromosomal amplifications. Analysis of the CGH data identified amplification peaks at each of the four regions that overlap with the previously identified loci. Patients that show any one of the chromosomal amplifications have significantly higher event rates than those without any of the amplifications: 17q12 (HR = 2.61, 95% CI = 1.51–5.51; P = 6.8e−04), 17q22 (HR = 3.02, 95% CI = 1.76–5.18; P = 7.3e−05), 8p11.2 (HR = 2.65, 95% CI = 1.48–4.74; P = 1.3e−03), and 8q24.3 (HR = 2.12, 95% CI = 1.24–3.65; P = 6.7e−03). Log-rank tests were used to calculate all the P values.
For validation, we first analyzed a data set of 624 early stage ER+ breast cancers for which relapse free survival data was available. This dataset included patients from over 20 published studies for whom gene expression data were combined as previously described . Samples from the training set (GEO accession number GSE6532) were specifically excluded from this combined data. The clinical characteristics of the rest of the samples are listed in Table S5. This gene expression data was analyzed to identify breast cancers that had outlier patterns consistent with presence of the amplicons in 17q12, 17q21.33-q25, 8p11.2, and 8q24.3, as described in the Methods section. Kaplan-Meyer curves of the survival fraction for ER+ cases showed that samples which scored as having any of the putative amplicons, as assayed by outlier analysis, had significantly poorer relapse free survival compared to samples having no amplicons (Figure 2C), validating our results.
In these datasets, the amplicons were imputed by analysis of gene expression outliers in regions of known amplification. To test whether the presence of genomic amplification is directly responsible for these findings, a separate CGH array dataset  (GEO accession number GSE22133) with 359 samples and 8.1 years of median follow-up survival information was analyzed. We retained only the 222 ER+ samples for our analysis (Table S6). Although we expect that the ER+ cancers would have received adjuvant tamoxifen therapy, patients were not uniformly treated and specifics of the exact treatments were unavailable for this dataset. Copy number estimates obtained from GEO were segmented using circular binary segmentation (CBS) , followed by identification of significant amplification peaks with the GISTIC  algorithm as described in . Amplification peaks were detected in 17q12, 17q22, 8p11.2 and 8q24.3 which overlapped the regions previously found by gene expression analysis. Correlation analysis between samples with these amplicons showed little to medium associations (Table S7) similar to the previously obtained values in Table S4.
Overall survival curves for samples with/without amplifications of 17q12, 17q22, 8p11.2 and 8q24.3 regions were determined using GISTIC and Kaplan-Meier estimator, and are shown in Figure 2D. This analysis showed that the presence of an amplicon in any of these four regions is associated with significantly worse outcome. Hazard ratios for samples with amplicons in 17q12, 17q22, 8p11.2 and 8q24.3 vs. no amplicons were 2.61, 3.02, 2.65, and 2.12 respectively, while log-rank P values for the survival difference were 6.8e−04, 7.3e−05, 1.3e−03, and 6.7e−03. Of note, 17q22 as identified by GISTIC, is a peak region included in the previously defined amplicon 17q21.33-q25 which contains a considerable number of outlier genes in the 17q22 locus (see Table 1).
Associations between Presence of Amplicons and Oncotype Dx Recurrence Score
A validated marker of poor outcome in ER+ breast cancers with hormonal treatment is the Oncotype Dx assay . This assay uses a weighted, linear combination of the expression of 21 genes to generate a single recurrence score RS. The genes used in this assay consist of HER2, GRB7, GSTM1, CD68, BAG1, invasion markers (MMP11, CTSL2), proliferation markers (Ki67, STK15, Survivin, CCNB1, MYBL2) as well as estrogen and reference markers. We used this gene panel and to generate a relative Oncotype Dx Recurrence Score using normalized expression levels and published weights . This calculation of a relative Oncotype Dx recurrence score from gene expression array data is based on prior studies which have such relative scores to correlate with both outcome and molecular subtype , . This relative RS score was able to separate poor prognosis samples from good prognosis samples in the tamoxifen treated sample set of 268 patients, validating this relative ODx score as being prognostic(Figure S2). We found (Figure S3) that the presence of any of these amplicons was associated with higher recurrence scores, while ER+ cancers lacking the amplicons had lower recurrence scores. A potentially significant finding was that the relative Oncotype Dx scores of tumors with amplicons 17q21.33-q25.1, 8p11.2 and 8q24.3 was lower than that of tumors with HER2 amplification (Figure S3), while their prognosis was similarly poor (Figure 2B). This observation suggests that Oncotype Dx may, in some cases, underestimate the risk of poor prognosis in tumors with these amplifications, and that some of tumors classified as “intermediate risk” by Oncotype Dx may in fact be high risk tumors.
To further test the hypothesis that regions 17q21.33-q25.1, 8p11.2 and 8q24.3 are likely to be amplified in ER+/HER2- breast cancer samples having high Oncotype Dx recurrence scores because of upregulation of cell-cycle genes, a set of 14 ER+/HER2- breast cancer samples with known Oncotype Dx scores was evaluated for the presence of 17q21.33-q25.1, 8p11.2 or 8q24.3 amplifications using FISH. Out of 14 samples, 8 had high recurrence scores (RS) (>30) and 6 had low scores (<18). As shown in Figure S4 and Table 2, cancers with high RS had amplification of at least one of these regions, while almost all cancers with low RS did not exhibit any amplification at these chromosomal locations.
Associations between Presence of Amplicons and Histologic Grade
Histologic grade is also a strong predictor of outcome in ER+ breast cancer, with low grade tumors having good outcome with hormone therapy and high grade tumors having poor outcome in this setting . In order to rule out the possibility that the presence of the amplicons is a surrogate for high histologic grade, a multivariate Cox analysis (Table 3) was performed on the training data set (GEO accession number GSE6532) to explore the relation between the presence of any of the four amplicons and other clinical markers (patient age, tumor size, node status, tumor grade and HER2 status) as well as the relative Oncotype Dx score calculated from gene expression data. We found that the presence of amplicons was a significant predictor of distant metastasis (HR = 2.53, P = 0.0067), more so than, tumor size (HR = 1.38, P = 0.0180), histologic grade (HR = 0.44, P = 0.0959) or ODx RS (HR = 1.08, P = 0.3838). If the amplicon covariate was removed from the Cox analysis, then significant predictors of distant metastasis become ODx RS (HR = 1.19, P = 0.0487) and tumor size (HR = 1.29, P = 0.0330).
We also analyzed the ability of the presence of any amplicon to discriminate outcome in intermediate grade tumors, which is a clinical grade category with unclear prognostic significance. Two datasets, (GSE6532 training set with gene expression data, and GSE22133 validation set with CGH data) where annotated pathologic grade information was available were analyzed. Kaplan-Meier curves comparing distant relapse rates for intermediate grade tumors with any of these four amplicons versus cases with none of the amplicons (Figure 3A) in the training set GSE6532, were significantly different (HR = 3.22, 95% CI = 1.6–6.5; P = 0.0012). This was true also for Kaplan-Meier curves comparing overall survival for intermediate grade cancers with any of the 4 amplicons versus cases with none of the amplicons (Figure 3B) in the test set GSE22133 (HR = 3.01, 95% CI = 1.2–7.6; P = 0.0200). Together, these results demonstrate that the amplicon associated risk categories have a discriminatory power beyond that of standard histologic grade.
Kaplan-Meier curves comparing distant relapse rates for intermediate grade cancers with any of the 4 amplicons versus cancers with none of the amplicons (A) in the training set GSE6532 (HR = 3.22, 95% CI = 1.6–6.5; P = 0.0012). Also shown Kaplan-Meier curves comparing overall survival for intermediate grade cancers with any of the 4 amplicons versus cancers with none of the amplicons (B) in the test set GSE22133 (HR = 3.01, 95% CI = 1.2–7.6; P = 0.0200).
FISH-based Assay has Potential Prognostic Value in ER+/HER2- Breast Cancers Treated with Hormone Therapy
A multiplexed FISH assay to detect 8p11.2, 17q22 and 8q24.3 amplicons in FFPE sections was developed using prelabeled FISH probes from validated BACs (Bacterial Artificial Chromosomes). The specificity of each probe was tested on metaphase chromosome spreads and hybridized to the corresponding chromosomal locations. The FISH assay was applied to 36 ER+/HER2− samples from the MicMa cohort that were treated with adjuvant hormonal therapy  (Figure 4). Tumor samples, present in tissue microarray format, were scored for amplification of each amplicon by averaging signals in 20 tumor cells/sample. The KM curves for systemic relapse free survival in this cohort for patients with and without any amplicon are shown in Figure 4. There is a trend for decreased relapse-free survival in patients scored as having any amplicon, vs. having no amplicon, but given the small sample size, this did not achieve statistical significance (P = 0.1041). The thresholds for amplification were optimized using the outcomes in this sample set, and thus require independent validation in future studies. Of note, very few relapses occurred earlier than 1500 days in the no-amplicon group, whereas the majority of relapse in the any-amplicon group occurred before 1500 days.
ER+/HER2- samples with systemic relapse events were probed for the amplification of 8p11.2, 17q22 and 8q24.3 by multiplexed FISH assay as described in Methods. A) Out of 36 samples, 15 had at least one region amplified. Kaplan-Meier curves for relapse free survival is shown for cancers having at least one amplicon vs cancer having no amplicons (HR = 2.31, 95% CI = 0.66–8.06; P = 0.1041 (Gehan-Willcoxon) or P = 0.1886 (Mantel-Cox)). B) A typical image of multicolor FISH in a breast cancer specimen. This cell has evidence of amplification of both 17q23.1 and 8p11 loci, but normal 8q24 loci.
Currently Oncotype Dx assays are routinely used to predict outcome and guide treatment for early stage ER+/HER2- breast cancer patients in the US. A high Oncotype Dx recurrence score can identify patients likely to have poor outcome with hormonal therapy alone, and who may benefit most from the addition of chemotherapy. However, such prognostic assays are expensive, often have intermediate risk scores with unclear predictive value, do not give biological insight into mechanisms driving poor prognosis, and do not identify potential therapeutic targets.
The results presented in this paper demonstrate that the presence of amplifications in chromosomal regions 17q21.33-q25.1, 8p11.2 and 8q24.3 are strong markers of poor prognosis in ER+/HER2- breast cancers. Our results suggest that these amplicons may function in a manner similar to HER2 amplification in identifying ER+ breast cancers with poor outcome and relative resistance to hormone therapy. In our primary dataset of 268 patients treated with tamoxifen, of the 44 patients who suffered distant metastasis within the first 4 years after diagnosis, only 30% were identified as having only 17q12 (HER2+) amplification, while an additional 42% had amplification of one of the three other amplicons. These results suggest that the presence of other amplicons, besides HER2, is associated with early relapse in ER+ breast cancer.
The presence of these amplicons is also associated with higher expression of proliferative genes/cell cycle genes that drive a high Oncotype Dx (ODx) recurrence score. Direct analysis of clinical specimens for amplification of these regions using FISH also demonstrated that the presence of amplification in each locus is associated with high ODx scores, while tumors that lack any of the amplicons have low recurrence ODx scores. This indicates that identification of chromosomal amplifications in defined regions, by a cost effective FISH assay, may be a clinically useful biomarker for predicting poor outcome in early stage ER+/HER2- breast cancers. Moreover, analysis of relative ODx scores in gene expression data sets suggests that ODx may underestimate risk associated with presence of amplicons, and that some tumors classified as intermediate risk by ODx assay may instead be poor prognosis tumors marked by a high risk amplicon. These finding will need to be validated in future studies analyzing a larger set of ER+ breast cancers with known ODx scores and clinical outcome.
In addition to their usefulness as biomarkers of risk in ER+ breast cancers, the chromosomal regions 17q21.33-q25.1, 8p11.2 and 8q24.3 contain oncogenes that may be valuable as therapeutic targets for novel drug therapies. Genes in the 8p11.2 region identified as putative oncogenes and therapeutic targets include U6 snRNA-associated Sm-like protein (LSM1), Wolf-Hirschhorn syndrome candidate 1-like 1 (WHSC1L1), and the RNA Polymerase III subunit BRF2 in region 8p11.2. Of note BRF2 has the highest associated HR, and was recently identified as a putative oncogene in squamous cell lung cancer , , , . Fibroblast growth factor receptor-1 (FGFR1), recently identified as a putative driver of endocrine resistance in breast cancer , is at the edges of the 8p11.2 amplicon that we found associated with early relapse in ER+ breast cancer treated with tamoxifen. Its outlier profile is associated with poor survival with hazard ratio of 1.8 and a log-rank P value of 0.046 (Table S2). However, in our analysis FGFR1 is not the strongest candidate in this region, and was not present in our minimal amplicon region defined by our methods.
As seen in Figure S1, the majority of outlier genes associated with poor prognosis on the q arm of chromosome 8 are clustered in the region 8q24.3 with the rest of them scattered all the way to 8q11.2. This suggests that in some cases the whole q arm of chromosome 8 is amplified or that there are a number of amplicons on 8q that correlate with 8q24.3. Slightly more upstream of 8q24.3 there is a well known oncogene MYC, a key estrogen effector, that has been reported to induce tamoxifen resistance when over-expressed . Although MYC could also contribute to the effect of this amplicon on early relapse, it was not identified in our analysis as strongly associated with differential survival (log-rank P = 0.042, Table S2) compared to more distal genes, suggesting it may contribute to only a minority of cases containing this amplicon. Other potential genes driving tumorigenesis in 8q24.3 include Heat Shock Transcription Factor 1 (HSF1), which has been shown to be a powerful potentiator of tumorigenesis , , . Also of interest is YWHAZ, a member of the 14-3-3 family of proteins. High expression of YWHAZ has been associated with poor clinical outcome in ER+ breast cancer . Down-regulation of YWHAX can restore tamoxifen sensitivity to tamoxifen resistant populations of MCF-7 cells, suggesting it may play a direct role in mediating hormone resistance .
Of the chromosomal regions identified in this study, 17q21.33-q25.1 is the least understood. Situated downstream of a much better known amplicon 17q12 (HER2+), it is known to be amplified and correlated with high grade tumors and poor prognosis . However, there is still no definite identification of driver oncogenes in this region. Possible candidates are CLTC, involved in gene fusions in B-cell lymphomas and non-small cell lung carcinomas, and RAD51C involved in DNA repair and homologous recombination. The gene with highest HR for relapse in this region is Prohibitin (PHB), a transcriptional regulator that has been shown to have both oncogenic and tumor suppressor capability in different contexts. Of note PHB has been reported to associate with and inhibit ER-mediated transcriptional function, suggesting it may play a role in modulating ER-function .
Another gene associated with early relapse is Cyclin D1 (CCND1, log-rank P = 5.7e-06, Table S2)  located on chromosomal band 11q13, which is another well known amplification site . However CCND1 is also a cell cycle marker and its expression is associated with proliferation. Thus the association of high CCND1 expression with poor outcome may in part reflect its role in proliferation and not just as a driver oncogene. This region was not identified as an independent potential amplicon in our analysis. Intriguingly there are reports of an association between 11q13 amplification and amplification of 8p12 , ,  in breast cancers, with some reports demonstrating a physical association between these domains .
Analysis for the presence of 17q21.33-q25.1, 8p11.2 and 8q24.3 amplicons by a multiplexed FISH assay in 36 ER+/HER2- samples from the MicMa cohort showed there is a trend towards decreased relapse free survival in patients who have amplicons compared to those who do not. Although small numbers limit statistical significance, these results are encouraging and suggest that a FISH-based assay could be developed as a prognostic tool. Future studies that evaluate large, well annotated clinical data sets are necessary to validate the FISH assay and determine whether these amplicons can be of used as predictive and prognostic markers in ER+ breast cancer.
In summary, the data presented here suggest that amplification of chromosomal regions 17q21.33-q25.1, 8p11.2 and 8q24.3 is strongly associated with early relapse in early stage ER+/HER2- breast cancers treated with hormonal therapy, and correlates with high Oncotype Dx recurrence scores. These chromosomal regions also contain genes whose over-expression may directly drive early relapse and/or hormone independence in ER+ breast cancers, and may be candidates for targeted therapy. Assays to identify the presence of amplicons may then both identify patients at high risk of relapse with hormonal therapy alone, and also potentially help determine what targeted therapy may be most appropriate to improve outcome in these poor prognosis cancers.
Materials and Methods
Clinical samples obtained at CINJ-UMDNJ were pre-existing archived samples that were de-identified and obtained without individual consent under a protocol approved by the Institutional Review Board of UMDNJ (Piscataway/NewBrunswick Campus). Samples from Radium Hospital for which clinical outcome data were available were obtained with written patient consent under a protocol approved by Regional Ethical Committee of South Eastern Norway (REK sør-øst).
Three breast cancer gene expression datasets from Loi et al. ,  were downloaded from (GEO:www.ncbi.nlm.nih.gov/geo, accession number GSE6532). The sets are abbreviated as KIT, OXFT and GUYT representing the institutions of origin: Uppsala University Hospital, John Radcliffe Hospital, and Guys Hospital. They comprised of data from 81, 109 and 87 ER+ breast cancer samples from patients treated with tamoxifen with 9 years median clinical follow-up on Affymetrix U133A/B (KIT & OXFT) and U133Plus2 (GUYT) platforms. After MAS5 normalization, probes were retained only for genes found on both platforms. Expression values were log2 transformed and multiple probes/gene compressed to the probe with highest median expression across samples.
Supervised Outlier Analysis of Gene Expression Datasets
Expression values were median centered and divided by the median absolute deviation (MAD) as described in Tomlins et al. . This step was performed separately for KIT, OXFT and GUYT datasets to avoid distribution biases. Outlier low/high cut-off values for each gene were defined as those which were outside the 10/90% quantile cutoffs across samples (results were insensitive to varying the quantile cut-off by +/−5%). High/low outlier genes for each sample array were identified using these cutoffs. The dataset is now reduced to three binary matrices of size Ngenes x Nsamples, one matrix for non-outliers and one each for high and low outliers. This process was implemented separately for each dataset (KIT, OXFT, GUYT) and the resulting matrices merged by concatenation over samples.
The high/low outlier matrices B1 and B2, with entries 1/0 if gene i in sample j was/was-not an outlier, were analysed further. Genes with <10 outliers across samples were discarded as not informative for statistical inference. For each remaining gene, the distribution of outliers across samples defines two classes: the sample set with “aberrant” (outlier) expression and the sample set with “normal” expression. Kaplan-Meier curves were used to identify the genes where these classes had a significant differential survival based on a log-rank test at p<0.05 (complete list in Table S2).
Identification of Predictive Gene Patterns for Tamoxifen Sensitivity
Do the outlier genes defined as above represent gene categories of clinical interest? For this to be true and statistically significant, sets of genes must exist with similar outlier classes - i.e., they must be over/under-expressed in roughly the same set of samples. This corresponds to identifying tightly correlated clusters of outlier genes and samples in the binary matrices B1 or B2. These were identified using the Phi coefficient (equivalent to a Pearson correlation between rows of matrices B1 or B2) as follows:
Let C1 and C2 be the covariance matrices between the rows of B1 and B2 respectively. Then, is the matrix of correlation coefficients between the outlier profiles of the genes in B1 or B2. Clusters of tightly correlated genes were identified by iteratively removing row i and column i with where if and otherwise, until a stable set was obtained. Here, stability means that the size of the reduced matrix R’ stops changing. PCA plots of the resulting reduced matrices B1 and B2 were used to identify distinct groups of highly correlated genes for further analysis (eg. pathway enrichment ).
The identified genes were mapped to chromosomal locations and amplified regions identified using a sliding window 25 genes wide with a pace of 5 genes (varying the window size and/or pace by 5–10 genes did not affect results. The Fisher Exact test  was used to assess significance. The Benjamini-Hochberg method  was used to implement FDR <5% by converting p-values to q-values. For each array, chromosomal regions with q <0.05 were marked as potential amplifications and ordered by frequency in the cluster sample set.
Relative Oncotype Dx Scores
The Relative Oncotype Dx score is calculated using normalized gene expression values of the set of genes from the original score together with their published weights . The genes used in calculating this score are: HER2, GRB7, GSTM1, CD68, BAG1, invasion markers MMP11, CTSL2, proliferation markers Ki67, STK15, Survivin, CCNB1, MYBL2 and hormonal markers ER, PGR, BCL2, and SCUBE2. A separate score is calculated for each group and then combined in a final score:
Fluorescence in situ Hybridization (FISH)
Prelabeled FISH probes for BAC clones RP11-1065N2, RP11-90P5 and RP11-1136N16 were purchased (Empire Genomics, Buffalo, NY) and tested on metaphase chromosome spreads. FISH experiments were performed on 14 4 µm paraffin embedded breast cancer tissue slides, collected from ER+/HER2- breast cancer patients treated in 2007–2009 at Robert Wood Johnson University Hospital, New Jersey, USA. Hybridization was performed on hybrite for 16–24 hours at 37°C, and slides washed, first with 4x SSC for 3 min at 37°C then with 0.1% NP-40 (Vysis, Downers Grove, IL, USA) for 30 sec at room temperature. Slides were scored for chromosomal amplification by counting signals in 20 tumor cells and averaging.
A multiplex FISH assay was also developed to score the amplicons on specimens from 36 ER+/HER2- patients treated with hormone therapy from the MicMa cohort that were available in a tissue microarray format . The probes were labelled by Nick translation with nucleotides labelled with Green-dUTP, Alexa Fluor 594-5-dUTP and HyPer5 dCTP, respectively. Scoring of FISH signals was done by acquiring z-stacks of the whole thickness of each sample and using a Nikon Ti microscope attached to a Yokogawa spinning-disk confocal unit. Non-tumor cells such as fibroblasts or lymphocytes were used as internal controls. The amplicon count for the sample was the average number of counts over 20 cells. We consider the sample to have an amplicon if its amplicon count is ≥3.5 for 8p11.2, ≥4.0 for 17q12 and ≥2.8 for 8q24.3. These thresholds were obtained by finding the optimal thresholds associated with survival difference between the cases that had at least one amplicon against the cases that had none in this sample set.
Clustergram of the correlation matrix between selected over-expressed genes identify pathways and amplicons for poor survival under tamoxifen treatment. The Phi coefficients between gene pairs of highly expressed outlier genes associated with tamoxifen resistance in Figure 1A produce a correlation matrix. The figure shows the resulting heatmap of this correlation matrix using hierarchical clustering using Pearson correlation distance and complete linkage. Genes in the same pathway or chromosomal region are clustered together as marked.
Survival curves for samples with high/low Oncotype Dx scores. Kaplan-Meier curves showing significantly lower survival (HR = 2.81, 95% CI = 1.7–4.5; P<0.0001) for tumor samples with high Oncotype Dx scores (ODx score >0) versus low Oncotype Dx scores (ODx score <0).
Oncotype Dx and presence of amplicons in ER+ breast cancer. Relative Oncotype Dx scores calculated across all 3 datasets (GSE6532) as outlined in Methods, are shown as mean values with standard errors for each group of samples listed on the vertical axes. Note that the Oncotype Dx scores for patients with the three novel amplicons are less than that for HER2 amplicon, in spite of their similar poor survival. This suggests that Oncotype Dx does not adequately assess the presence of these novel amplicons, and may underestimate risk in some cases.
Analysis of amplicon status using FISH in a cohort of tamoxifen treated ER+/HER2− breast cancers. This figure shows typical FISH images from analysis of FFPE slides for samples with/without amplicons using probes specific to each amplicon.
Clinical information file. Excel 2003 file containing clinical characteristics of the breast tumor samples used in the gene expression analysis from the study by Loi et al. , . Relative Oncotype Dx scores together with pathway/amplicon presence for each sample are also listed.
Survival associated with outlier genes. Excel 2003 file containing a table of outlier association results for all genes used in the analysis. Along with the outlier scores and hazard ratios, corresponding P values are also listed.
Gene patterns associated with tamoxifen response. Gene Ontology pathway/chromosomal location enrichment results in the primary gene expression dataset GSE6532. Significance was assessed using Fisher Exact Test.
Sample correlations between gene patterns associated with tamoxifen resistance. Sample correlations between cell cycle pathway and amplicons associated with tamoxifen resistance in the primary gene expression dataset GSE6532. Values represent Phi coefficients measuring the strength of association between the group of samples that over-express cell cycle genes and amplicons 17q12, 17q21.33-q25.1, 8p11.2 and 8q24.3. The last column lists the percentage counts of ER+ samples with the associated pathway/amplicons. Highlighted in bold are correlation values significant at P<0.01 except for self correlations.
Clinical information file. Excel 2003 file containing clinical characteristics of the breast tumor samples used in the analysis of the combined gene expression data set from the study by Györffy et al. . Amplicon presence as found by the unsupervised outlier analysis are also listed.
Clinical information file. Excel 2003 file containing clinical characteristics of the breast tumor samples used in the CGH data analysis. Amplicon presence as found by GISTIC is also listed for each sample.
Sample correlations between amplicons 17q12, 17q22, 8p11.2 and 8q24.3 in an independent CGH array data set. Phi coefficients measuring the strength of association between amplicons 17q12, 17q22, 8p11.2 and 8q24.3 in the test CGH dataset GSE22133. The last column lists the percentage counts of ER+ samples with the associated amplicons. Highlighted in bold are correlation values significant at P<0.01 except for self correlations.
Conceived and designed the experiments: EB SG GB AB-D. Performed the experiments: EB KV HR IR VA. Analyzed the data: EB HR IR VA SG GB DT NB AL. Contributed reagents/materials/analysis tools: EB DT NB AB-D. Wrote the paper: EB HR GB AB-D SG.
- 1. Clarke M, Collins R, Davies C, Godwin J, Gray R, et al. (1998) Tamoxifen for early breast cancer: an overview of the randomised trials. Lancet 351: 1451–1467.
- 2. Clarke M, Collins R, Darby S, Davies C, Evans V, et al. (2005) Effects of chemotherapy and hormonal therapy for early breast cancer on recurrence and 15-year survival: an overview of the randomised trials. Lancet 365: 1687–1717. doi. pp. 10.1016/S0140–6736(05)66544-0.
- 3. Hurvitz SA, Pietras RJ (2008) Rational management of endocrine resistance in breast cancer: a comprehensive review of estrogen receptor biology, treatment options, and future directions. Cancer 113: 2385–2397. doi:10.1002/cncr.23875.
- 4. Gnant M, Dubsky P, Fitzal F, Blaha P, Schoppmann S, et al. (2009) Maintaining bone density in patients undergoing treatment for breast cancer: is there an adjuvant benefit? Clin Breast Cancer 9 Suppl 1: S18–27. doi:10.3816/CBC.2009.s.002.
- 5. Rastelli F, Crispino S (2008) Factors predictive of response to hormone therapy in breast cancer. Tumori 94: 370–383.
- 6. Ma CX, Sanchez CG, Ellis MJ (2009) Predicting endocrine therapy responsiveness in breast cancer. Oncology (Williston Park, NY) 23: 133–142.
- 7. Paik S, Shak S, Tang G, Kim C, Baker J, et al. (2004) A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. The New England journal of medicine 351: 2817–2826. doi:10.1056/NEJMoa041588.
- 8. Sotiriou C, Wirapati P, Loi S, Harris A, Fox S, et al. (2006) Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis. Journal of the National Cancer Institute 98: 262–272. doi:10.1093/jnci/djj052.
- 9. Alexe G, Dalgin GS, Ramaswamy R, Delisi C, Bhanot G (2007) Data perturbation independent diagnosis and validation of breast cancer subtypes using clustering and patterns. Cancer informatics 2: 243–274.
- 10. Alexe G, Dalgin GS, Scanfeld D, Tamayo P, Mesirov JP, et al. (2007) High expression of lymphocyte-associated genes in node-negative HER2+ breast cancers correlates with lower recurrence rates. Cancer research 67: 10669-10676. doi. pp. 10.1158/0008–5472.CAN-07–0539.
- 11. Dalgin GS, Alexe G, Scanfeld D, Tamayo P, Mesirov JP, et al. (2007) Portraits of breast cancer progression. BMC bioinformatics 8: 291. doi. pp. 10.1186/1471–2105-8-291.
- 12. Loi S, Haibe-Kains B, Desmedt C, Lallemand F, Tutt AM, et al. (2007) Definition of clinically distinct molecular subtypes in estrogen receptor-positive breast carcinomas through genomic grade. Journal of clinical oncology: official journal of the American Society of Clinical Oncology 25: 1239–1246. doi:10.1200/JCO.2006.07.1522.
- 13. Cheang MCU, Chia SK, Voduc D, Gao D, Leung S, et al. (2009) Ki67 index, HER2 status, and prognosis of patients with luminal B breast cancer. Journal of the National Cancer Institute 101: 736–750. doi:10.1093/jnci/djp082.
- 14. Sorlie T, Perou CM, Tibshirani R, Aas T, Geisler S, et al. (2001) Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proceedings of the National Academy of Sciences of the United States of America 98: 10869–10874. doi:10.1073/pnas.191367098.
- 15. Osborne CK, Schiff R (2011) Mechanisms of endocrine resistance in breast cancer. Annu Rev Med 62: 233–247. doi. pp. 10.1146/annurev–med-070909-182917.
- 16. Osborne CK, Shou J, Massarweh S, Schiff R (2005) Crosstalk between estrogen receptor and growth factor receptor pathways as a cause for endocrine therapy resistance in breast cancer. Clin Cancer Res 11: 865s–70s.
- 17. Arpino G, Wiechmann L, Osborne CK, Schiff R (2008) Crosstalk between the estrogen receptor and the HER tyrosine kinase receptor family: molecular mechanism and clinical implications for endocrine therapy resistance. Endocrine reviews 29: 217–233. doi. pp. 10.1210/er.2006–0045.
- 18. Arpino G, Green SJ, Allred DC, Lew D, Martino S, et al. (2004) HER-2 amplification, HER-1 expression, and tamoxifen response in estrogen receptor-positive metastatic breast cancer: a southwest oncology group study. Clinical cancer research: an official journal of the American Association for Cancer Research 10: 5670–5676. doi. pp. 10.1158/1078–0432.CCR-04-0110.
- 19. Naylor T, Greshock J, Wang Y, Colligon T, Yu QC, et al. (2005) High resolution genomic analysis of sporadic breast cancer using array-based comparative genomic hybridization. Breast cancer research: BCR 7: R1186–98. doi:10.1186/bcr1356.
- 20. Naume B, Zhao X, Synnestvedt M, Borgen E, Russnes HG, et al. (2007) Presence of bone marrow micrometastasis is associated with different recurrence risk within molecular subtypes of breast cancer. Mol Oncol 1: 160–171. doi:10.1016/j.molonc.2007.03.004.
- 21. Loi S, Haibe-Kains B, Desmedt C, Wirapati P, Lallemand F, et al. (2008) Predicting prognosis using molecular profiling in estrogen receptor-positive breast cancer treated with tamoxifen. BMC genomics 9: 239. doi. pp. 10.1186/1471–2164-9-239.
- 22. The Gene Ontology project in 2008 (2008) Nucleic acids research 36: D440–4.
- 23. Borg A, Baldetorp B, Fernö M, Killander D, Olsson H, et al. (1994) ERBB2 amplification is associated with tamoxifen resistance in steroid-receptor positive breast cancer. Cancer letters 81: 137–144.
- 24. Kallioniemi A, Kallioniemi O-P, Piper J, Tanner M, Stokke T, et al. (1994) Detection and Mapping of Amplified DNA Sequences in Breast Cancer by Comparative Genomic Hybridization. Proceedings of the National Academy of Sciences of the United States of America 91: 2156–2160.
- 25. Parssinen J, Kuukasjarvi T, Karhu R, Kallioniemi A (2007) High-level amplification at 17q23 leads to coordinated overexpression of multiple adjacent genes in breast cancer. British Journal of Cancer 96: 1258–1264. doi:10.1038/sj.bjc.6603692.
- 26. Orsetti B, Courjal F, Cuny M, Rodriguez C, Theillet C (1999) 17q21-q25 aberrations in breast cancer: combined allelotyping and CGH analysis reveals 5 regions of allelic imbalance among which two correspond to DNA amplification. Oncogene 18: 6262–6270. doi:10.1038/sj.onc.1203006.
- 27. Gelsi-Boyer V, Orsetti B, Cervera N, Finetti P, Sircoulomb F, et al. (2005) Comprehensive profiling of 8p11-12 amplification in breast cancer. Molecular cancer research: MCR 3: 655–667. doi. pp. 10.1158/1541–7786.MCR-05-0128.
- 28. Cingoz S, Altungoz O, Canda T, Saydam S, Aksakoglu G, et al. (2003) DNA copy number changes detected by comparative genomic hybridization and their association with clinicopathologic parameters in breast tumors. Cancer genetics and cytogenetics 145: 108–114.
- 29. Bernard-Pierrot I, Gruel N, Stransky N, Vincent-Salomon A, Reyal F, et al. (2008) Characterization of the recurrent 8p11-12 amplicon identifies PPAPDC1B, a phosphatase protein, as a new therapeutic target in breast cancer. Cancer research 68: 7165-7175. doi. pp. 10.1158/0008–5472.CAN-08-1360.
- 30. Stec I, van Ommen GJ, den Dunnen JT (2001) WHSC1L1, on human chromosome 8p11.2, closely resembles WHSC1 and maps to a duplicated region shared with 4p16.3. Genomics 76: 5–8.
- 31. De Paepe P, Baens M, van Krieken H, Verhasselt B, Stul M, et al. (2003) ALK activation by the CLTC-ALK fusion is a recurrent event in large B-cell lymphoma. Blood 102: 2638–2641.
- 32. Argani P, Lui MY, Couturier J, Bouvier R, Fournet J-C, et al. (2003) A novel CLTC-TFE3 gene fusion in pediatric renal adenocarcinoma with t(X;17)(p11.2;q23). Oncogene 22: 5374–5378.
- 33. Patel AS, Murphy KM, Hawkins AL, Cohen JS, Long PP, et al. (2007) RANBP2 and CLTC are involved in ALK rearrangements in inflammatory myofibroblastic tumors. Cancer genetics and cytogenetics 176: 107–114.
- 34. Dai C, Whitesell L, Rogers AB, Lindquist S (2007) Heat shock factor 1 is a powerful multifaceted modifier of carcinogenesis. Cell 130: 1005–1018.
- 35. Streicher KL, Yang ZQ, Ethier SP (2007) Transforming function of the LSM1 oncogene in human breast cancers with the 8p11–12 amplicon. Oncogene: 2104–2114. doi:10.1038/sj.onc.1210002.
- 36. Turner N, Pearson A, Sharpe R, Lambros M, Geyer F, et al. (2010) FGFR1 amplification drives endocrine therapy resistance and is a therapeutic target in breast cancer. Cancer research 70: 2085-2094. doi. pp. 10.1158/0008–5472.CAN-09-3746.
- 37. Györffy B, Schäfer R (2008) Meta-analysis of gene expression profiles related to relapse-free survival in 1,079 breast cancer patients. Breast Cancer Res Treat 118: 433–441. doi. pp. 10.1007/s10549–008-0242-8.
- 38. Jonsson G, Staaf J, Vallon-Christersson J, Ringner M, Holm K, et al. (2010) Genomic subtypes of breast cancer identified by array comparative genomic hybridization display distinct molecular and clinical characteristics. Breast cancer research: BCR 12: R42. doi:10.1186/bcr2596.
- 39. Venkatraman ES, Olshen AB (2007) A faster circular binary segmentation algorithm for the analysis of array CGH data. Bioinformatics (Oxford, England) 23: 657–663. doi:10.1093/bioinformatics/btl646.
- 40. Beroukhim R, Getz G, Nghiemphu L, Barretina J, Hsueh T, et al. (2007) Assessing the significance of chromosomal aberrations in cancer: methodology and application to glioma. Proceedings of the National Academy of Sciences of the United States of America 104: 20007–20012. doi:10.1073/pnas.0710052104.
- 41. Fan C, Oh DS, Wessels L, Weigelt B, Nuyten DSA, et al. (2006) Concordance among gene-expression-based predictors for breast cancer. N Engl J Med 355: 560–569. doi:10.1056/NEJMoa052933.
- 42. Singh L, Wilson AJ, Baum M, Whimster WF, Birch IH, et al. (1988) The relationship between histological grade, oestrogen receptor status, events and survival at 8 years in the NATO (‘Nolvadex’) trial. Br J Cancer 57: 612–614.
- 43. Lockwood WW, Chari R, Coe BP, Thu KL, Garnis C, et al. (2010) Integrative genomic analyses identify BRF2 as a novel lineage-specific oncogene in lung squamous cell carcinoma. PLoS Med 7: e1000315. doi:10.1371/journal.pmed.1000315.
- 44. Butt AJ, McNeil CM, Musgrove EA, Sutherland RL (2005) Downstream targets of growth factor and oestrogen signalling and endocrine resistance: the potential roles of c-Myc, cyclin D1 and cyclin E. Endocrine-related cancer 12: 47–59. doi:10.1677/erc.1.00993.
- 45. Frasor J, Chang EC, Komm B, Lin C-Y, Vega VB, et al. (2006) Gene Expression Preferentially Regulated by Tamoxifen in Breast Cancer Cells and Correlations with Clinical Outcome. Cancer Research 66: 7334–7340. doi. pp. 10.1158/0008–5472.CAN-05-4269.
- 46. Bergamaschi A, Christensen BL, Katzenellenbogen BS (2011) Reversal of endocrine resistance in breast cancer: interrelationships among 14-3-3ζ, FOXM1, and a gene signature associated with mitosis. Breast Cancer Res 13: R70. doi:10.1186/bcr2913.
- 47. He B, Feng Q, Mukherjee A, Lonard DM, DeMayo FJ, et al. (2008) A repressive role for prohibitin in estrogen signaling. Mol Endocrinol 22: 344–360. doi. pp. 10.1210/me.2007–0400.
- 48. Ishii Y, Waxman S, Germain D (2008) Tamoxifen Stimulates the Growth of Cyclin D1–Overexpressing Breast Cancer Cells by Promoting the Activation of Signal Transducer and Activator of Transcription 3. Cancer Research 68: 852–860. doi. pp. 10.1158/0008–5472.CAN-07-2879.
- 49. Karlsson E, Ahnstrom Waltersson M, Bostner J, Perez-Tenorio G, Olsson B, et al (2009) Comprehensive Genomic and Transcriptomic Analysis of the 11q13 Amplicon in Breast Cancer. Cancer Res 69: 5166. doi. pp. 10.1158/0008–5472.SABCS-09-5166.
- 50. Bautista S, Theillet C (1998) CCND1 and FGFR1 coamplification results in the colocalization of 11q13 and 8p12 sequences in breast tumor nuclei. Genes, chromosomes & cancer 22: 268–277.
- 51. Kwek SS, Roy R, Zhou H, Climent J, Martinez-Climent JA, et al. (2009) Co-amplified genes at 8p12 and 11q13 in breast tumors cooperate with two major pathways in oncogenesis. Oncogene 28: 1892–1903. doi:10.1038/onc.2009.34.
- 52. Tomlins SA, Rhodes DR, Perner S, Dhanasekaran SM, Mehra R, et al. (2005) Recurrent fusion of TMPRSS2 and ETS transcription factor genes in prostate cancer. Science (New York, NY) 310: 644–648. doi:10.1126/science.1117679.
- 53. Fisher RA (1922) On the Interpretation of χ2 from Contingency Tables, and the Calculation of P. Journal of the Royal Statistical Society 85: 87–94. doi:10.2307/2340521.
- 54. Benjamini Y, Hochberg Y (1995) Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society Series B (Methodological) 57: 289–300.
- 55. Wiedswang G, Borgen E, Kåresen R, Kvalheim G, Nesland JM, et al. (2003) Detection of isolated tumor cells in bone marrow is an independent prognostic factor in breast cancer. J Clin Oncol 21: 3469–3478. doi:10.1200/JCO.2003.02.009.