Recent genome-wide association studies (GWASs) have identified candidate genes contributing to cancer risk through low-penetrance mutations. Many of these genes were unexpected and, intriguingly, included well-known players in carcinogenesis at the somatic level. To assess the hypothesis of a germline-somatic link in carcinogenesis, we evaluated the distribution of somatic gene labels within the ordered results of a breast cancer risk GWAS. This analysis suggested frequent influence on risk of genetic variation in loci encoding for “driver kinases” (i.e., kinases encoded by genes that showed higher somatic mutation rates than expected by chance and, therefore, whose deregulation may contribute to cancer development and/or progression). Assessment of these predictions using a population-based case-control study in Poland replicated the association for rs3732568 in EPHB1 (odds ratio (OR) = 0.79; 95% confidence interval (CI): 0.63–0.98; Ptrend = 0.031). Analyses by early age at diagnosis and by estrogen receptor α (ERα) tumor status indicated potential associations for rs6852678 in CDKL2 (OR = 0.32, 95% CI: 0.10–1.00; Precessive = 0.044) and rs10878640 in DYRK2 (OR = 2.39, 95% CI: 1.32–4.30; Pdominant = 0.003), and for rs12765929, rs9836340, rs4707795 in BMPR1A, EPHA3 and EPHA7, respectively (ERα tumor status Pinteraction<0.05). The identification of three novel candidates as EPH receptor genes might indicate a link between perturbed compartmentalization of early neoplastic lesions and breast cancer risk and progression. Together, these data may lay the foundations for replication in additional populations and could potentially increase our knowledge of the underlying molecular mechanisms of breast carcinogenesis.
Citation: Bonifaci N, Górski B, Masojć B, Wokołorczyk D, Jakubowska A, Dębniak T, et al. (2010) Exploring the Link between Germline and Somatic Genetic Alterations in Breast Carcinogenesis. PLoS ONE 5(11): e14078. https://doi.org/10.1371/journal.pone.0014078
Editor: Kelvin Yuen Kwong Chan, The University of Hong Kong, China
Received: July 2, 2010; Accepted: November 2, 2010; Published: November 22, 2010
Copyright: © 2010 Bonifaci et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: The work was supported by the Spanish Society of Medical Oncology 2009 grant to JB and MAP, the Ramón Areces Foundation XV and FIS (09/02483) grants, and the Biomedical Research Centre Network for Epidemiology and Public Health group 55 to MAP, and the “Roses contra el Càncer” Foundation and RTICCC RD06/0020/0028 grants to JB. MAP was supported by the “Ramón y Cajal” Young Investigator program of the Spanish Ministry of Science and Innovation and NB by an IDIBELL fellowship. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
With the advent of technical and methodological advances, several GWASs identifying common genetic variation associated with risk of developing cancer have been completed recently . Thus, initiatives such as the National Cancer Institute's Cancer Genetic Markers of Susceptibility (CGEMS) and efforts carried out by deCODE Genetics and the Breast Cancer Association Consortium have led to the identification of breast cancer risk alleles in single nucleotide polymorphisms (SNPs) replicated across populations –. Intriguingly, illustrating the unbiased nature of GWASs, most hits have corresponded to a priori unexpected candidate genes. In this context, the involvement of biological processes beyond the canonical DNA damage response in breast cancer is further suggested by the observed differential influence of low-penetrance risk alleles among BRCA1 and BRCA2 mutation carriers –.
A potential common characteristic of the unexpected low-penetrance susceptibility genes is the previously identified contribution to tumorigenesis, but at the somatic level. Common genetic variation in loci encoding for FGFR2 and MAP3K1 influences risk of breast cancer , , and these genes were previously found to be somatically mutated in diverse neoplasias including breast cancer , . In addition, and central to the understanding of cancer progression, common risk alleles showed differential influence according to ERα tumor status , and variation in the locus encoding for ERα, ESR1, also influences risk of breast cancer , . More recently, additional breast cancer susceptibility loci have been described that include CDKN2A/B as candidates . While these observations suggest a “germline-somatic” link in breast carcinogenesis, an analogous situation may exist for other neoplasias. Variation in loci encoding for CDH1 and SMAD7 influences risk of colorectal cancer ,  and, similarly, these genes were previously identified as inactivated or deregulated in tumors –. Moreover, deregulated germline expression of a paradigmatic proto-oncogene, MYC, may be a common mechanism of tumorigenesis in epithelial tissues –. However, despite some evidence of a germline-somatic link, as yet there is no explicit evaluation of this hypothesis and its potential usefulness in replication studies. Here we present an examination of this link through analysis of the CGEMS GWAS breast cancer dataset and subsequent assessment of the predictions in a case-control study of incident breast cancer in Poland.
Distribution of somatic gene sets in ordered breast cancer GWAS results
Previously, analysis of the CGEMS GWAS dataset using the lowest genotypic P value per gene locus suggested true associations in genes annotated with Gene Ontology (GO) biological process terms linked to somatic events , . However, since there is a positive correlation between the extension of a given locus and the number of SNPs it may contain (and, therefore, the possibility of significant association results being obtained by chance), an unadjusted GWAS rank is biased at its lowest P values for specific processes in which large gene products frequently participate , ,  (Fig. 1A). Nevertheless, cancer genes tend to expand across large genomic regions , and examination of eight genes likely involved in breast cancer through low-penetrance mutations–CASP8, COX11, ESR1, FGFR2, LSP1, MAP3K1, RAD51L1 and TOX3 –, , –showed a trend for larger genomic loci (mean () genomic extension = 211 kilo bases (kb) and standard deviation (σ) = 283 kb; compared to = 66 kb and σ = 128 kb for all annotated genes in the CGEMS GWAS rank).
A, Original GWAS results ranked according to the lowest genotypic association test P value per gene locus (unadjusted for genomic extension; taken SNPs in defined genomic window of ±10 kb relative to the first and last exons of a given gene). The Y-axis indicates the number of SNPs per gene locus while the X-axis indicates the lowest association P value per gene locus. Bias can be appreciated as the number of SNPs per gene locus increases at lower P values. B, GWAS results ranked according to the lowest association P value per gene locus but adjusted by genomic extension through case-control permutations. Compared to the previous graph, the bias largely disappears. C, Following the rank in B, the Y-axis indicates odds ratios (ORs) of allele effects and density distributions of gene sets (driver kinases correspond to a light lilac curve; the rest of the genome in the GWAS dataset is shown by a dark lilac curve), while the X-axis indicates the log-transformed association P values, previously adjusted by genomic extension. As indicated by the density curves, SNPs mapping to driver kinase loci are relatively more frequent at lower association adjusted P values. This observation is supported by GSEA results using the same CGEMS GWAS adjusted rank; nominal P<0.001 and FDR-adjusted P = 0.010 (Table S2). D, Similarly to the graph in C, distribution of CRGs in the CGEMS GWAS rank adjusted through permutations.
Having identified caveats to the ranking of GWAS results, we performed 10,000 permutations of case-control status and used the null distribution of t statistics from the age-adjusted partial correlation analysis to correct the original rank, which then showed an unbiased distribution (Fig. 1B). Prior to the evaluation of somatic sets, analysis of GO biological process terms in the GWAS permutation P values rank did not show any significant asymmetry using the Gene Set Enrichment Analysis (GSEA) tool  with multiple testing correction by the false discovery rate (FDR) approach . Nonetheless, most processes with nominally significant P values were those previously highlighted, which are associated with somatic events ,  (Table S1). This observation appears to agree with recently described results of pathway-based analysis of the same GWAS dataset .
Next, evaluation of somatic sets related to cancer prognosis and treatment response prediction, and to genetic and genomic alterations (see Materials and Methods), revealed significant asymmetrical distribution of “driver kinases” , ; that is, kinases whose deregulation through frequent somatic mutation contributes to tumor development and/or progression (“driver mutations”). In contrast, “passenger mutations” were defined as essentially neutral and linked to the inherent genetic instability in cancer cells , . Thus, the driver kinases set was found to be biased towards the top (nominal significant association results) of the GWAS permutation rank (GSEA nominal P<0.001; FDR-adjusted P = 0.010) (Fig. 1C and Table S2). Among the remaining of somatic sets evaluated, only cooperation response genes (CRGs) to oncogenic mutations  showed a trend for a distribution similar to that of driver kinases (GSEA nominal P = 0.080; FDR-adjusted P value = 0.25) (Fig. 1D), although the intersection between both sets only contained two genes (Table S2). Therefore, in somatic cancer genes, common genetic variation in driver kinase loci might frequently influence risk of breast cancer.
The set of driver kinases contained a benchmark gene, FGFR2 , , and a locus recently replicated in an independent study, BMPR1B . Nevertheless, a significant bias was still observed following exclusion of these two loci (GSEA nominal P = 0.001; FDR-adjusted P = 0.048), which suggests that variation at additional driver kinase loci influences risk of breast cancer. Importantly, using the set of non-driver kinases–either the subsequent equivalent set as originally statistically ordered or the total set (n = 344) –did not reveal significant bias (GSEA nominal P = 0.99 and 0.66, respectively), which reinforces the idea of frequent involvement of driver kinases. However, if only the individual statistical data for each locus were considered, most of the driver kinase loci would perhaps not have been selected for replication in other populations.
Independent association results for common variation in driver kinase loci
Given the possible bias in GWAS rank identified above, we examined the top 20 driver kinase variants in the original rank (Table S3, including details of the CGEMS and results below) in a case-control study of incident breast cancer in Szczecin (Poland), previously used in other replications . Applying genotyping quality controls and Hardy-Weinberg equilibrium analysis, 16 SNPs representing an identical number of driver kinase loci (i.e., a single SNP for each locus and representing the strongest potential statistical association) were examined for their association with risk of breast cancer using 880 controls and 1,173 cases (see Materials and Methods). In this analysis, the rs3732568 variant in the ephrin type-B receptor 1 (EPHB1) locus was found to be associated with risk of breast cancer: OR = 0.79, 95% CI: 0.63–0.98; Ptrend = 0.031 (Table 1). Further evaluation of this association through 10,000 case-control permutations in our study gave a similar significance value, Ptrend = 0.034. Importantly, this association was in the same direction and with similar magnitude to the result in the CGEMS GWAS: age-adjusted OR = 0.78, 95% CI: 0.64–0.94; Ptrend = 0.009.
While deregulated expression or function of EPHs and EPH receptors is thought to play a critical role in the initial stages of epithelial neoplasia , , recent analysis of early breast cancer expression changes suggests a link between disruption of cell adhesion and extracellular matrix pathways, and the risk of developing breast cancer . Analysis of this recent dataset also revealed an early expression change of EPHB1, between normal breast tissue and atypical ductal hyperplasia (Fig. 2). This alteration consisted of infra-expression in hyperplasia, akin to its potential role in the compartmentalization of early neoplastic lesions . Together, association studies, early expression changes in carcinogenesis and the regulation of cell adhesion suggest the involvement of EPHB1 in risk of breast cancer.
The graphs show expression profiles in histologically normal (HN) breast tissues versus patient-matched atypical ductal hyperplasia (ADH) and ductal carcinoma in situ (DCIS) . Results of two EPHB1 microarray probes (names shown at the top) and the corresponding significance P values are shown.
Next, given accepted models of inherited breast cancer susceptibility , we examined associations with risk at early age of diagnosis (≤40 years old). This analysis indicated two additional potential associations: rs6852678 in CDKL2, recessive model OR = 0.32, 95% CI: 0.10–1.00; P = 0.044; and rs10878640 in DYRK2, dominant model OR = 2.39, 95% CI: 1.32–4.30; P = 0.003 (Table 2). Results for rs6852678 appeared to be consistent with CGEMS GWAS analysis; age-adjusted recessive model OR = 0.71, 95% CI: 0.53–0.95; P = 0.019; however, the pattern for rs10878640 might be more complex (CGEMS GWAS ORs = 1.05 and 0.68 for heterozygotes and minor allele homozygotes, respectively).
Having potential differences by ERα tumor status, we next examined associations in ERα-positive and -negative breast cancer patients. Thus, rs3732568 in EPHB1 showed a similar influence on either type of breast cancer (Table 3)–which is consistent with an overall significant association–and rs12765929 in BMPR1A and rs9836340 in EPHA3 showed a potential major impact on the risk of ERα-negative breast cancer (P for difference in OR (interaction) by ERα status <0.05), while rs4707795 in EPHA7 showed a differential effect between ERα-negative versus ERα-positive breast cancer risk (Pinteraction = 0.007) (Table 3). None of these additional candidates linked to ERα tumor status, or those linked to an early age of diagnosis above, showed significant expression differences at early stages of breast carcinogenesis as EPHB1. On the other hand, the remaining SNPs examined in this study after applying quality controls and Hardy-Weinberg equilibrium analysis (i.e., 10 out of 16), did not show significant associations following CGEMS evidence (Table S3). Together, the gene-set based analysis of GWAS data and the subsequent replication attempt might indicate that common genetic variation in specific driver kinase loci, and particularly in EPH receptor genes, influence risk of breast cancer.
Evaluation of a germline-somatic link in breast carcinogenesis suggests a role for driver kinases and, perhaps to a lesser extent, genes with a synergistic response to oncogenic mutations. This study might be limited by the assignment of the lowest genotypic P value per gene locus within a defined genomic window (i.e., ±10 kb)–thus excluding a large proportion of variation that cannot be assigned to a specific known gene–and by its focus on the additive model of influence of risk alleles when adjusted through case-control permutations. Future analyses taking into account the potential perturbation of germline gene expression by, for example, common variation at distant regulatory regions may improve the identification of susceptibility genes using GWAS complete data. Another limitation in the interpretation of the results presented here may lie in the case-control study designs: the CGEMS addressed breast cancer risk in postmenopausal women, while the Polish study was relatively enriched in early-onset cases. Therefore, studies in additional populations, with diverse designs, are warranted to corroborate the results shown here.
The results of the replication study may be consistent with previously detected somatic genetic alterations and/or functional roles. Somatic mutations in CDKL2 were nonsense and were only detected in breast and ovarian cancer cell lines or tumors , . CDKL2 (also known as p56 or KKIAMRE) is the most distant member of the CDC2-related serine/threonine protein kinase family, involved in epidermal growth factor signaling , but with a mostly uncharacterized function. DYRK2 was found to be mutated in breast and central nervous system tumors, in nonsense and missense alterations, respectively , . The functional role of DYRK2 in the DNA damage response  may link to CGEMS GWAS results for RAD51L1 : loss of DYRK2 function alters the activation of apoptosis in response to DNA damage via ATM , which may therefore promote carcinogenesis.
Having revealed potential associations linked to known somatic alterations, the most striking results of this study may concern the identification of risk alleles at three EPH receptor loci. EPH-mediated signaling regulates important biological process altered in carcinogenesis, such as cell-to-cell communication, and cell migration and adhesion via the actin cytoskeleton , . Thus, through RHO and RAS/MAPK activities , this signaling pathway has been implicated in the maintenance of epithelial tissue architectures and is therefore thought to act as a tumor suppressor , . These observations may indicate that, similarly to colorectal tumorigenesis , EPH-mediated compartmentalization of early breast tissue neoplastic lesions is critical to prevent the subsequent emergence of carcinoma. Therefore, through a germline expression or functional perturbation, EPHB1 may contribute to the observed variability in the transition from an in situ lesion to an invasive carcinoma . While the associations revealed here warrant further replication in other populations, the existing data could potentially increase current knowledge of the genetic basis and molecular mechanisms of breast carcinogenesis.
Materials and Methods
The National Cancer Institute CGEMS initiative has conducted genome-wide association studies to identify common genetic variants and the corresponding functionally affected genes involved in breast cancer and prostate cancer susceptibility. An initial CGEMS whole genome scan was designed to study the main effect of SNPs on breast cancer risk in postmenopausal women . The study involved 1,145 invasive postmenopausal breast cancer cases and 1,142 matched controls from the Nurses' Health Study nested case-control study . Results of the CGEMS GWAS of breast cancer were obtained upon approval of a Data Access Request.
In our previous analyses , , ordered CGEMS GWAS results (i.e., ranks) corresponded to the lowest P value per gene for the genotypic test in a genomic region of +/−10 kb at each gene locus, defined by the Ensembl human genome release 57. Assigned SNPs were curated using Ensembl gene annotations. We  and others  noted that such ranks were biased along with the genomic extension–and therefore with the number of SNPs–per gene locus. To adjust for this bias, several statistical strategies are possible , including carrying out permutations of the case-control status to correct the significance of the original statistic. In our analysis, considering typed and informative SNPs in each gene locus, we first chose the maximum absolute value of the t statistic from the age-adjusted partial correlation in the additive model. Next, 10,000 permutations of the same informative SNPs were performed to create a null distribution for this maximum t statistic, which was used to assess its significance corrected by number of SNPs.
The distribution of gene sets in ranked GWAS results was examined using the non-parametric algorithm in the GSEA tool, with default values for all parameters  except for the set size when appropriated. In GSEA, a pre-defined gene set is mapped to a rank–in our case genes/loci ordered according to the adjusted association statistic–to assess potential bias using an enrichment score that reflects the degree to which this set is overrepresented at the extremes of the entire ranked list. In the interpretation of the results, caution should be taken when considering sets of different size. In our study, different hypotheses were examined independently (i.e., gene sets linked to prognosis, prediction or genetic/genomic somatic alterations), and P values were corrected for multiple testing within each group : 1) genes whose expression in primary breast tumors was associated with patient prognosis and/or metastasis –; 2) genes whose expression in primary breast tumors was associated with patient therapeutic treatment response –; 3) genes whose expression levels differed according to ERα breast tumor status or grade , or in response to 17β-estradiol ; and 4) genes with somatic genetic and/or genomic somatic alterations (Table S2). This last group was made up of five sets : i/ driver kinases (conditional probability of containing driver mutations >0.70, n = 119 as defined previously , of which 95 were uniquely mapped in the GWAS rank); ii/ CRGs to oncogenic mutations ; iii/ cancer gene census, somatically-mutated only , ; iv/ genes affected by somatic chromosomal rearrangements and/or fusions ; and v/ amplified and over-expressed cancer genes  (Table S2).
Gene expression analysis
Raw expression microarray data on breast cancer progression  were downloaded from the Gene Expression Omnibus reference GSE16873 and normalized with robust multiarray average (RMA)  and significance analysis was performed using the significance analysis of microarray (SAM) algorithm .
Study samples in Poland and association study
A case-control study of unselected invasive breast cancer collected between 1996 and 2003 in Szczecin (Poland) was analyzed. The series included 976 cases of breast cancer unselected for age and an additional group of 367 cases of breast cancer diagnosed at age 50 or below. Therefore, the series was enriched for early-onset cases: mean age of diagnosis was 52.4 years (range 19–88). Subjects were unselected for family history and 15% of cases reported a first- or second-degree relative with breast cancer. The participation rate exceeded 70% among women with breast cancer invited to enroll. Collected information included year of birth, age at diagnosis of breast and/or ovarian cancer, tumor bilaterality, family history (first- and second-degree relatives with breast and/or ovarian cancer) and tumor pathological features in >80% of cases (ERα and progesterone receptor status, and grade). Cases were also examined for BRCA1 founder mutations in Poland  and, if positive, excluded from the association study (n = 50). The control group included cancer-free adult women from the same population (920 women with mean age of diagnosis of 56.7, range 20–91) taken from the healthy adult patients of five family doctors practicing in the Szczecin region. These individuals were selected randomly from the patient lists of the participating doctors. The study was carried out with informed consent of the probands and approved by local ethics committees. Genotypes were obtained using Sequenom iPLEX chemistry at the International Hereditary Cancer Center. Quality controls were of >95% calling for each SNP and >90% of calls per sample. Thus, in the set of 16 SNPs, we observed an average concordance rate of 98.7% of genotype calls using 3.3% replicates. Genotypes of 880 controls and 1,173 cases were effectively analyzed using conditional and unconditional logistic regressions (age adjustment using similar strata size; 20–46, 46–56, 56–66, and 66–91 years old).
The authors are indebted to the CGEMS initiative and to Dr. L.A. Emery and colleagues for making their data available. We also wish to thank the study participants in Poland for their generous contribution.
Conceived and designed the experiments: CL CC MAP. Performed the experiments: NB BG BM DW CC. Analyzed the data: NB BG BM DW TD. Contributed reagents/materials/analysis tools: NB BG BM DW AJ TD AB JSM JB JD SN JL CL CC. Wrote the paper: MAP.
- 1. Easton DF, Eeles RA (2008) Genome-wide association studies in cancer. Hum Mol Genet 17: R109–115.DF EastonRA Eeles2008Genome-wide association studies in cancer.Hum Mol Genet17R109115
- 2. Hunter DJ, Kraft P, Jacobs KB, Cox DG, Yeager M, et al. (2007) A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer. Nat Genet 39: 870–874.DJ HunterP. KraftKB JacobsDG CoxM. Yeager2007A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer.Nat Genet39870874
- 3. Thomas G, Jacobs KB, Kraft P, Yeager M, Wacholder S, et al. (2009) A multistage genome-wide association study in breast cancer identifies two new risk alleles at 1p11.2 and 14q24.1 (RAD51L1). Nat Genet 41: 579–584.G. ThomasKB JacobsP. KraftM. YeagerS. Wacholder2009A multistage genome-wide association study in breast cancer identifies two new risk alleles at 1p11.2 and 14q24.1 (RAD51L1).Nat Genet41579584
- 4. Easton DF, Pooley KA, Dunning AM, Pharoah PD, Thompson D, et al. (2007) Genome-wide association study identifies novel breast cancer susceptibility loci. Nature 447: 1087–1093.DF EastonKA PooleyAM DunningPD PharoahD. Thompson2007Genome-wide association study identifies novel breast cancer susceptibility loci.Nature44710871093
- 5. Stacey SN, Manolescu A, Sulem P, Thorlacius S, Gudjonsson SA, et al. (2008) Common variants on chromosome 5p12 confer susceptibility to estrogen receptor-positive breast cancer. Nat Genet 40: 703–706.SN StaceyA. ManolescuP. SulemS. ThorlaciusSA Gudjonsson2008Common variants on chromosome 5p12 confer susceptibility to estrogen receptor-positive breast cancer.Nat Genet40703706
- 6. Ahmed S, Thomas G, Ghoussaini M, Healey CS, Humphreys MK, et al. (2009) Newly discovered breast cancer susceptibility loci on 3p24 and 17q23.2. Nat Genet 41: 585–590.S. AhmedG. ThomasM. GhoussainiCS HealeyMK Humphreys2009Newly discovered breast cancer susceptibility loci on 3p24 and 17q23.2.Nat Genet41585590
- 7. Antoniou AC, Sinilnikova OM, McGuffog L, Healey S, Nevanlinna H, et al. (2009) Common variants in LSP1, 2q35 and 8q24 and breast cancer risk for BRCA1 and BRCA2 mutation carriers. Hum Mol Genet 18: 4442–4456.AC AntoniouOM SinilnikovaL. McGuffogS. HealeyH. Nevanlinna2009Common variants in LSP1, 2q35 and 8q24 and breast cancer risk for BRCA1 and BRCA2 mutation carriers.Hum Mol Genet1844424456
- 8. Antoniou AC, Spurdle AB, Sinilnikova OM, Healey S, Pooley KA, et al. (2008) Common breast cancer-predisposition alleles are associated with breast cancer risk in BRCA1 and BRCA2 mutation carriers. Am J Hum Genet 82: 937–948.AC AntoniouAB SpurdleOM SinilnikovaS. HealeyKA Pooley2008Common breast cancer-predisposition alleles are associated with breast cancer risk in BRCA1 and BRCA2 mutation carriers.Am J Hum Genet82937948
- 9. Antoniou AC, Sinilnikova OM, Simard J, Leone M, Dumont M, et al. (2007) RAD51 135G→C modifies breast cancer risk among BRCA2 mutation carriers: results from a combined analysis of 19 studies. Am J Hum Genet 81: 1186–1200.AC AntoniouOM SinilnikovaJ. SimardM. LeoneM. Dumont2007RAD51 135G→C modifies breast cancer risk among BRCA2 mutation carriers: results from a combined analysis of 19 studies.Am J Hum Genet8111861200
- 10. Hansen RM, Goriely A, Wall SA, Roberts IS, Wilkie AO (2005) Fibroblast growth factor receptor 2, gain-of-function mutations, and tumourigenesis: investigating a potential link. J Pathol 207: 27–31.RM HansenA. GorielySA WallIS RobertsAO Wilkie2005Fibroblast growth factor receptor 2, gain-of-function mutations, and tumourigenesis: investigating a potential link.J Pathol2072731
- 11. Stephens P, Edkins S, Davies H, Greenman C, Cox C, et al. (2005) A screen of the complete protein kinase gene family identifies diverse patterns of somatic mutations in human breast cancer. Nat Genet 37: 590–592.P. StephensS. EdkinsH. DaviesC. GreenmanC. Cox2005A screen of the complete protein kinase gene family identifies diverse patterns of somatic mutations in human breast cancer.Nat Genet37590592
- 12. Garcia-Closas M, Chanock S (2008) Genetic susceptibility loci for breast cancer by estrogen receptor status. Clin Cancer Res 14: 8000–8009.M. Garcia-ClosasS. Chanock2008Genetic susceptibility loci for breast cancer by estrogen receptor status.Clin Cancer Res1480008009
- 13. Zheng W, Long J, Gao YT, Li C, Zheng Y, et al. (2009) Genome-wide association study identifies a new breast cancer susceptibility locus at 6q25.1. Nat Genet 41: 324–328.W. ZhengJ. LongYT GaoC. LiY. Zheng2009Genome-wide association study identifies a new breast cancer susceptibility locus at 6q25.1.Nat Genet41324328
- 14. Dunning AM, Healey CS, Baynes C, Maia AT, Scollen S, et al. (2009) Association of ESR1 gene tagging SNPs with breast cancer risk. Hum Mol Genet 18: 1131–1139.AM DunningCS HealeyC. BaynesAT MaiaS. Scollen2009Association of ESR1 gene tagging SNPs with breast cancer risk.Hum Mol Genet1811311139
- 15. Turnbull C, Ahmed S, Morrison J, Pernet D, Renwick A, et al. (2010) Genome-wide association study identifies five new breast cancer susceptibility loci. Nat Genet 42: 504–507.C. TurnbullS. AhmedJ. MorrisonD. PernetA. Renwick2010Genome-wide association study identifies five new breast cancer susceptibility loci.Nat Genet42504507
- 16. Houlston RS, Webb E, Broderick P, Pittman AM, Di Bernardo MC, et al. (2008) Meta-analysis of genome-wide association data identifies four new susceptibility loci for colorectal cancer. Nat Genet 40: 1426–1435.RS HoulstonE. WebbP. BroderickAM PittmanMC Di Bernardo2008Meta-analysis of genome-wide association data identifies four new susceptibility loci for colorectal cancer.Nat Genet4014261435
- 17. Broderick P, Carvajal-Carmona L, Pittman AM, Webb E, Howarth K, et al. (2007) A genome-wide association study shows that common alleles of SMAD7 influence colorectal cancer risk. Nat Genet 39: 1315–1317.P. BroderickL. Carvajal-CarmonaAM PittmanE. WebbK. Howarth2007A genome-wide association study shows that common alleles of SMAD7 influence colorectal cancer risk.Nat Genet3913151317
- 18. Levy L, Hill CS (2006) Alterations in components of the TGF-beta superfamily signaling pathways in human cancer. Cytokine Growth Factor Rev 17: 41–58.L. LevyCS Hill2006Alterations in components of the TGF-beta superfamily signaling pathways in human cancer.Cytokine Growth Factor Rev174158
- 19. Wheeler JM, Kim HC, Efstathiou JA, Ilyas M, Mortensen NJ, et al. (2001) Hypermethylation of the promoter region of the E-cadherin gene (CDH1) in sporadic and ulcerative colitis associated colorectal cancer. Gut 48: 367–371.JM WheelerHC KimJA EfstathiouM. IlyasNJ Mortensen2001Hypermethylation of the promoter region of the E-cadherin gene (CDH1) in sporadic and ulcerative colitis associated colorectal cancer.Gut48367371
- 20. Guilford P, Hopkins J, Harraway J, McLeod M, McLeod N, et al. (1998) E-cadherin germline mutations in familial gastric cancer. Nature 392: 402–405.P. GuilfordJ. HopkinsJ. HarrawayM. McLeodN. McLeod1998E-cadherin germline mutations in familial gastric cancer.Nature392402405
- 21. Richards FM, McKee SA, Rajpar MH, Cole TR, Evans DG, et al. (1999) Germline E-cadherin gene (CDH1) mutations predispose to familial gastric cancer and colorectal cancer. Hum Mol Genet 8: 607–610.FM RichardsSA McKeeMH RajparTR ColeDG Evans1999Germline E-cadherin gene (CDH1) mutations predispose to familial gastric cancer and colorectal cancer.Hum Mol Genet8607610
- 22. Pomerantz MM, Ahmadiyeh N, Jia L, Herman P, Verzi MP, et al. (2009) The 8q24 cancer risk variant rs6983267 shows long-range interaction with MYC in colorectal cancer. Nat Genet 41: 882–884.MM PomerantzN. AhmadiyehL. JiaP. HermanMP Verzi2009The 8q24 cancer risk variant rs6983267 shows long-range interaction with MYC in colorectal cancer.Nat Genet41882884
- 23. Tuupanen S, Turunen M, Lehtonen R, Hallikas O, Vanharanta S, et al. (2009) The common colorectal cancer predisposition SNP rs6983267 at chromosome 8q24 confers potential to enhanced Wnt signaling. Nat Genet 41: 885–890.S. TuupanenM. TurunenR. LehtonenO. HallikasS. Vanharanta2009The common colorectal cancer predisposition SNP rs6983267 at chromosome 8q24 confers potential to enhanced Wnt signaling.Nat Genet41885890
- 24. Solé X, Hernández P, de Heredia ML, Armengol L, Rodríguez-Santiago B, et al. (2008) Genetic and genomic analysis modeling of germline c-MYC overexpression and cancer susceptibility. BMC Genomics 9: 12.X. SoléP. HernándezML de HerediaL. ArmengolB. Rodríguez-Santiago2008Genetic and genomic analysis modeling of germline c-MYC overexpression and cancer susceptibility.BMC Genomics912
- 25. Sotelo J, Esposito D, Duhagon MA, Banfield K, Mehalko J, et al. (2010) Long-range enhancers on 8q24 regulate c-Myc. Proc Natl Acad Sci U S A 107: 3001–3005.J. SoteloD. EspositoMA DuhagonK. BanfieldJ. Mehalko2010Long-range enhancers on 8q24 regulate c-Myc.Proc Natl Acad Sci U S A10730013005
- 26. Bonifaci N, Berenguer A, Díez J, Reina O, Medina I, et al. (2008) Biological processes, properties and molecular wiring diagrams of candidate low-penetrance breast cancer susceptibility genes. BMC Med Genomics 1: 62.N. BonifaciA. BerenguerJ. DíezO. ReinaI. Medina2008Biological processes, properties and molecular wiring diagrams of candidate low-penetrance breast cancer susceptibility genes.BMC Med Genomics162
- 27. Medina I, Montaner D, Bonifaci N, Pujana MA, Carbonell J, et al. (2009) Gene set-based analysis of polymorphisms: finding pathways or biological processes associated to traits in genome-wide association studies. Nucleic Acids Res 37: W340–344.I. MedinaD. MontanerN. BonifaciMA PujanaJ. Carbonell2009Gene set-based analysis of polymorphisms: finding pathways or biological processes associated to traits in genome-wide association studies.Nucleic Acids Res37W340344
- 28. Kraft P, Raychaudhuri S (2009) Complex diseases, complex genes: keeping pathways on the right track. Epidemiology 20: 508–511.P. KraftS. Raychaudhuri2009Complex diseases, complex genes: keeping pathways on the right track.Epidemiology20508511
- 29. Stanley SM, Bailey TL, Mattick JS (2006) GONOME: measuring correlations between GO terms and genomic positions. BMC Bioinformatics 7: 94.SM StanleyTL BaileyJS Mattick2006GONOME: measuring correlations between GO terms and genomic positions.BMC Bioinformatics794
- 30. Furney SJ, Higgins DG, Ouzounis CA, López-Bigas N (2006) Structural and functional properties of genes involved in human cancer. BMC Genomics 7: 3.SJ FurneyDG HigginsCA OuzounisN. López-Bigas2006Structural and functional properties of genes involved in human cancer.BMC Genomics73
- 31. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, et al. (2005) Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 102: 15545–15550.A. SubramanianP. TamayoVK MoothaS. MukherjeeBL Ebert2005Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.Proc Natl Acad Sci U S A1021554515550
- 32. Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Statist Soc Ser B 57: 289–300.Y. BenjaminiY. Hochberg1995Controlling the false discovery rate: a practical and powerful approach to multiple testing.J Roy Statist Soc Ser B57289300
- 33. Menashe I, Maeder D, Garcia-Closas M, Figueroa JD, Bhattacharjee S, et al. (2010) Pathway analysis of breast cancer genome-wide association study highlights three pathways and one canonical signaling cascade. Cancer Res 70: 4453–4459.I. MenasheD. MaederM. Garcia-ClosasJD FigueroaS. Bhattacharjee2010Pathway analysis of breast cancer genome-wide association study highlights three pathways and one canonical signaling cascade.Cancer Res7044534459
- 34. Stratton MR, Campbell PJ, Futreal PA (2009) The cancer genome. Nature 458: 719–724.MR StrattonPJ CampbellPA Futreal2009The cancer genome.Nature458719724
- 35. Greenman C, Stephens P, Smith R, Dalgliesh GL, Hunter C, et al. (2007) Patterns of somatic mutation in human cancer genomes. Nature 446: 153–158.C. GreenmanP. StephensR. SmithGL DalglieshC. Hunter2007Patterns of somatic mutation in human cancer genomes.Nature446153158
- 36. McMurray HR, Sampson ER, Compitello G, Kinsey C, Newman L, et al. (2008) Synergistic response to oncogenic mutations defines gene class critical to cancer phenotype. Nature 453: 1112–1116.HR McMurrayER SampsonG. CompitelloC. KinseyL. Newman2008Synergistic response to oncogenic mutations defines gene class critical to cancer phenotype.Nature45311121116
- 37. Saetrom P, Biesinger J, Li SM, Smith D, Thomas LF, et al. (2009) A risk variant in an miR-125b binding site in BMPR1B is associated with breast cancer pathogenesis. Cancer Res 69: 7459–7465.P. SaetromJ. BiesingerSM LiD. SmithLF Thomas2009A risk variant in an miR-125b binding site in BMPR1B is associated with breast cancer pathogenesis.Cancer Res6974597465
- 38. Wokolorczyk D, Gliniewicz B, Sikorski A, Zlowocka E, Masojc B, et al. (2008) A range of cancers is associated with the rs6983267 marker on chromosome 8. Cancer Res 68: 9982–9986.D. WokolorczykB. GliniewiczA. SikorskiE. ZlowockaB. Masojc2008A range of cancers is associated with the rs6983267 marker on chromosome 8.Cancer Res6899829986
- 39. Merlos-Suárez A, Batlle E (2008) Eph-ephrin signalling in adult tissues and cancer. Curr Opin Cell Biol 20: 194–200.A. Merlos-SuárezE. Batlle2008Eph-ephrin signalling in adult tissues and cancer.Curr Opin Cell Biol20194200
- 40. Vaught D, Brantley-Sieders DM, Chen J (2008) Eph receptors in breast cancer: roles in tumor promotion and tumor suppression. Breast Cancer Res 10: 217.D. VaughtDM Brantley-SiedersJ. Chen2008Eph receptors in breast cancer: roles in tumor promotion and tumor suppression.Breast Cancer Res10217
- 41. Emery LA, Tripathi A, King C, Kavanah M, Mendez J, et al. (2009) Early dysregulation of cell adhesion and extracellular matrix pathways in breast cancer progression. Am J Pathol 175: 1292–1302.LA EmeryA. TripathiC. KingM. KavanahJ. Mendez2009Early dysregulation of cell adhesion and extracellular matrix pathways in breast cancer progression.Am J Pathol17512921302
- 42. Cortina C, Palomo-Ponce S, Iglesias M, Fernández-Masip JL, Vivancos A, et al. (2007) EphB-ephrin-B interactions suppress colorectal cancer progression by compartmentalizing tumor cells. Nat Genet 39: 1376–1383.C. CortinaS. Palomo-PonceM. IglesiasJL Fernández-MasipA. Vivancos2007EphB-ephrin-B interactions suppress colorectal cancer progression by compartmentalizing tumor cells.Nat Genet3913761383
- 43. Claus EB, Risch NJ, Thompson WD (1990) Using age of onset to distinguish between subforms of breast cancer. Ann Hum Genet 54: 169–177.EB ClausNJ RischWD Thompson1990Using age of onset to distinguish between subforms of breast cancer.Ann Hum Genet54169177
- 44. Taglienti CA, Wysk M, Davis RJ (1996) Molecular cloning of the epidermal growth factor-stimulated protein kinase p56 KKIAMRE. Oncogene 13: 2563–2574.CA TaglientiM. WyskRJ Davis1996Molecular cloning of the epidermal growth factor-stimulated protein kinase p56 KKIAMRE.Oncogene1325632574
- 45. Taira N, Yamamoto H, Yamaguchi T, Miki Y, Yoshida K (2010) ATM augments nuclear stabilization of DYRK2 by inhibiting MDM2 in the apoptotic response to DNA damage. J Biol Chem 285: 4909–4919.N. TairaH. YamamotoT. YamaguchiY. MikiK. Yoshida2010ATM augments nuclear stabilization of DYRK2 by inhibiting MDM2 in the apoptotic response to DNA damage.J Biol Chem28549094919
- 46. Brantley-Sieders DM, Zhuang G, Hicks D, Fang WB, Hwang Y, et al. (2008) The receptor tyrosine kinase EphA2 promotes mammary adenocarcinoma tumorigenesis and metastatic progression in mice by amplifying ErbB2 signaling. J Clin Invest 118: 64–78.DM Brantley-SiedersG. ZhuangD. HicksWB FangY. Hwang2008The receptor tyrosine kinase EphA2 promotes mammary adenocarcinoma tumorigenesis and metastatic progression in mice by amplifying ErbB2 signaling.J Clin Invest1186478
- 47. Schnitt SJ (2009) The transition from ductal carcinoma in situ to invasive breast cancer: the other side of the coin. Breast Cancer Res 11: 101.SJ Schnitt2009The transition from ductal carcinoma in situ to invasive breast cancer: the other side of the coin.Breast Cancer Res11101
- 48. Colditz GA, Hankinson SE (2005) The Nurses' Health Study: lifestyle and health among women. Nat Rev Cancer 5: 388–396.GA ColditzSE Hankinson2005The Nurses' Health Study: lifestyle and health among women.Nat Rev Cancer5388396
- 49. van 't Veer LJ, Dai H, van de Vijver MJ, He YD, Hart AA, et al. (2002) Gene expression profiling predicts clinical outcome of breast cancer. Nature 415: 530–536.LJ van 't VeerH. DaiMJ van de VijverYD HeAA Hart2002Gene expression profiling predicts clinical outcome of breast cancer.Nature415530536
- 50. Chi JT, Wang Z, Nuyten DS, Rodriguez EH, Schaner ME, et al. (2006) Gene expression programs in response to hypoxia: cell type specificity and prognostic significance in human cancers. PLoS Med 3: e47.JT ChiZ. WangDS NuytenEH RodriguezME Schaner2006Gene expression programs in response to hypoxia: cell type specificity and prognostic significance in human cancers.PLoS Med3e47
- 51. Chang HY, Nuyten DS, Sneddon JB, Hastie T, Tibshirani R, et al. (2005) Robustness, scalability, and integration of a wound-response gene expression signature in predicting breast cancer survival. Proc Natl Acad Sci U S A 102: 3738–3743.HY ChangDS NuytenJB SneddonT. HastieR. Tibshirani2005Robustness, scalability, and integration of a wound-response gene expression signature in predicting breast cancer survival.Proc Natl Acad Sci U S A10237383743
- 52. Perou CM, Sorlie T, Eisen MB, van de Rijn M, Jeffrey SS, et al. (2000) Molecular portraits of human breast tumours. Nature 406: 747–752.CM PerouT. SorlieMB EisenM. van de RijnSS Jeffrey2000Molecular portraits of human breast tumours.Nature406747752
- 53. Liu R, Wang X, Chen GY, Dalerba P, Gurney A, et al. (2007) The prognostic role of a gene signature from tumorigenic breast-cancer cells. N Engl J Med 356: 217–226.R. LiuX. WangGY ChenP. DalerbaA. Gurney2007The prognostic role of a gene signature from tumorigenic breast-cancer cells.N Engl J Med356217226
- 54. Minn AJ, Gupta GP, Siegel PM, Bos PD, Shu W, et al. (2005) Genes that mediate breast cancer metastasis to lung. Nature 436: 518–524.AJ MinnGP GuptaPM SiegelPD BosW. Shu2005Genes that mediate breast cancer metastasis to lung.Nature436518524
- 55. Ramaswamy S, Ross KN, Lander ES, Golub TR (2003) A molecular signature of metastasis in primary solid tumors. Nat Genet 33: 49–54.S. RamaswamyKN RossES LanderTR Golub2003A molecular signature of metastasis in primary solid tumors.Nat Genet334954
- 56. Ayers M, Symmans WF, Stec J, Damokosh AI, Clark E, et al. (2004) Gene expression profiles predict complete pathologic response to neoadjuvant paclitaxel and fluorouracil, doxorubicin, and cyclophosphamide chemotherapy in breast cancer. J Clin Oncol 22: 2284–2293.M. AyersWF SymmansJ. StecAI DamokoshE. Clark2004Gene expression profiles predict complete pathologic response to neoadjuvant paclitaxel and fluorouracil, doxorubicin, and cyclophosphamide chemotherapy in breast cancer.J Clin Oncol2222842293
- 57. Chang JC, Wooten EC, Tsimelzon A, Hilsenbeck SG, Gutierrez MC, et al. (2005) Patterns of resistance and incomplete response to docetaxel by gene expression profiling in breast cancer patients. J Clin Oncol 23: 1169–1177.JC ChangEC WootenA. TsimelzonSG HilsenbeckMC Gutierrez2005Patterns of resistance and incomplete response to docetaxel by gene expression profiling in breast cancer patients.J Clin Oncol2311691177
- 58. Ma XJ, Wang Z, Ryan PD, Isakoff SJ, Barmettler A, et al. (2004) A two-gene expression ratio predicts clinical outcome in breast cancer patients treated with tamoxifen. Cancer Cell 5: 607–616.XJ MaZ. WangPD RyanSJ IsakoffA. Barmettler2004A two-gene expression ratio predicts clinical outcome in breast cancer patients treated with tamoxifen.Cancer Cell5607616
- 59. Wang XD, Reeves K, Luo FR, Xu LA, Lee F, et al. (2007) Identification of candidate predictive and surrogate molecular markers for dasatinib in prostate cancer: rationale for patient selection and efficacy monitoring. Genome Biol 8: R255.XD WangK. ReevesFR LuoLA XuF. Lee2007Identification of candidate predictive and surrogate molecular markers for dasatinib in prostate cancer: rationale for patient selection and efficacy monitoring.Genome Biol8R255
- 60. van de Vijver MJ, He YD, van't Veer LJ, Dai H, Hart AA, et al. (2002) A gene-expression signature as a predictor of survival in breast cancer. N Engl J Med 347: 1999–2009.MJ van de VijverYD HeLJ van't VeerH. DaiAA Hart2002A gene-expression signature as a predictor of survival in breast cancer.N Engl J Med34719992009
- 61. Carroll JS, Meyer CA, Song J, Li W, Geistlinger TR, et al. (2006) Genome-wide analysis of estrogen receptor binding sites. Nat Genet 38: 1289–1297.JS CarrollCA MeyerJ. SongW. LiTR Geistlinger2006Genome-wide analysis of estrogen receptor binding sites.Nat Genet3812891297
- 62. Futreal PA, Coin L, Marshall M, Down T, Hubbard T, et al. (2004) A census of human cancer genes. Nat Rev Cancer 4: 177–183.PA FutrealL. CoinM. MarshallT. DownT. Hubbard2004A census of human cancer genes.Nat Rev Cancer4177183
- 63. Forbes SA, Tang G, Bindal N, Bamford S, Dawson E, et al. (2010) COSMIC (the Catalogue of Somatic Mutations in Cancer): a resource to investigate acquired mutations in human cancer. Nucleic Acids Res 38: D652–657.SA ForbesG. TangN. BindalS. BamfordE. Dawson2010COSMIC (the Catalogue of Somatic Mutations in Cancer): a resource to investigate acquired mutations in human cancer.Nucleic Acids Res38D652657
- 64. Stephens PJ, McBride DJ, Lin ML, Varela I, Pleasance ED, et al. (2009) Complex landscapes of somatic rearrangement in human breast cancer genomes. Nature 462: 1005–1010.PJ StephensDJ McBrideML LinI. VarelaED Pleasance2009Complex landscapes of somatic rearrangement in human breast cancer genomes.Nature46210051010
- 65. Santarius T, Shipley J, Brewer D, Stratton MR, Cooper CS (2010) A census of amplified and overexpressed human cancer genes. Nat Rev Cancer 10: 59–64.T. SantariusJ. ShipleyD. BrewerMR StrattonCS Cooper2010A census of amplified and overexpressed human cancer genes.Nat Rev Cancer105964
- 66. Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, et al. (2003) Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 4: 249–264.RA IrizarryB. HobbsF. CollinYD Beazer-BarclayKJ Antonellis2003Exploration, normalization, and summaries of high density oligonucleotide array probe level data.Biostatistics4249264
- 67. Tusher VG, Tibshirani R, Chu G (2001) Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci U S A 98: 5116–5121.VG TusherR. TibshiraniG. Chu2001Significance analysis of microarrays applied to the ionizing radiation response.Proc Natl Acad Sci U S A9851165121
- 68. Gorski B, Byrski T, Huzarski T, Jakubowska A, Menkiszak J, et al. (2000) Founder mutations in the BRCA1 gene in Polish families with breast-ovarian cancer. Am J Hum Genet 66: 1963–1968.B. GorskiT. ByrskiT. HuzarskiA. JakubowskaJ. Menkiszak2000Founder mutations in the BRCA1 gene in Polish families with breast-ovarian cancer.Am J Hum Genet6619631968