Skip to main content
Advertisement
  • Loading metrics

Mutations in tumor signaling, metastases, and synthetic lethality establish distinct patterns

  • Bengi Ruken Yavuz,

    Roles Data curation, Formal analysis, Methodology, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Cancer Innovation Laboratory, National Cancer Institute, Frederick, Maryland, United States of America

  • Ugur Sahin,

    Roles Formal analysis, Methodology, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Molecular Biology and Genetics, Koç University, Istanbul, Turkey

  • Hyunbum Jang,

    Roles Conceptualization, Methodology, Writing – original draft, Writing – review & editing

    Affiliations Cancer Innovation Laboratory, National Cancer Institute, Frederick, Maryland, United States of America, Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, Maryland, United States of America

  • Ruth Nussinov ,

    Roles Conceptualization, Funding acquisition, Methodology, Project administration, Writing – original draft, Writing – review & editing

    ntuncbag@ku.edu.tr (NT); nussinor@mail.nih.gov (RN)

    Affiliations Cancer Innovation Laboratory, National Cancer Institute, Frederick, Maryland, United States of America, Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, Maryland, United States of America, Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel

  • Nurcan Tuncbag

    Roles Conceptualization, Funding acquisition, Methodology, Project administration, Supervision, Visualization, Writing – original draft, Writing – review & editing

    ntuncbag@ku.edu.tr (NT); nussinor@mail.nih.gov (RN)

    Affiliations Department of Chemical and Biological Engineering, College of Engineering, Koç University, Istanbul, Turkey, School of Medicine, Koç University, Istanbul, Turkey, Koc University Research Center for Translational Medicine (KUTTAM), Istanbul, Turkey

Abstract

Effective identification of oncogenic mutations is essential for diagnosis, forecasting resistance, and metastasis in remission. It is required for an optimal drug regimen. We develop a framework to discover mutations that co-exist in different oncoproteins, and those that are excluded, likely encoding oncogene-induced senescence. First, mapping the proteins onto pathways assists combinatorial drug selections and helps to detect metastases. Second, it provides the molecular basis for synthetic lethality, to date investigated at the genome level. Our pan-cancer profiles of ~60,000 tumor sequences, detect 3424 co-existing tumor-specific mutations. Mapping them onto pathways indicates that they preferentially promote specific primary tumors. We uncover metastatic mutations and provide metastatic breast-cancer markers. This work not only clarifies the mechanistic basis of intratumor mutational diversity but usefully reveals markers for metastasis in patients’ genomes and introduces a novel computational framework for detecting metastasis based on tumor mutational profiles. Mapping the mutations onto pathways provides an invaluable metastasis-targeting resource, guiding drug combinations.

Author summary

Cancer is driven by combinations of mutations in specific proteins in certain cellular pathways; identifying these combinations can reveal new ways to treat the disease more effectively. We analyzed nearly 60,000 tumor genomes from a wide range of cancers to identify mutations that frequently occur together in different cancer driving proteins, as well as combinations that rarely appear in the same tumor. Co-occurring mutations may indicate signaling pathways that are prone to cancer growth and metastasis. In contrast, mutually exclusive mutations may point to processes like senescence or vulnerability to specific treatments. By mapping these mutation patterns onto known cellular pathways, we found thousands of tumor-specific mutation pairs, many linked to distinct cancer types. We also identified key mutations associated with metastasis, including markers specific to metastatic breast cancer. Our study introduces a new computational framework for detecting metastasis and guides drug selection based on the tumor mutational profile. This approach reframes tumor heterogeneity from cell signaling perspective and provides a resource for developing personalized cancer therapies.

Introduction

Here we discover and interpret patterns of mutations in tumor signaling and metastases. Decades ago, it was proposed that cancer evolution requires more than a single mutation [14]. The minimal number has been debated since, and their cancer-specific identification in the proteins, and pathways, were sought [57]. This is crucial since subclones harboring rare resistant mutations proliferate following decimation of sensitive cells [8]. These couple with gene rearrangements, fusions, and overexpression. All increase the number of active protein states [9]. Passenger mutations were observed to be unevenly distributed across cancer genomes, appearing to stem from regional chromatin accessibility [10,11]. Below, we briefly overview mutations in tumor signaling and metastases from our perspective.

Co-occurrence (or exclusivity) of two mutations in different genes (i.e., in trans) are a random event in cancer evolution. If two mutations in different genes occur less or more frequently than expected, they are mutually exclusive or co-occurring, respectively [12,13]. Mutual exclusivity was mostly observed in genes in the same or redundant pathways; co-occurrence was mostly observed in different, or parallel pathways [14]. In our definition, if the pathways recruit the same downstream protein families, they are redundant; if evolutionary-independent, they are parallel. Exclusivity has been attributed to the expectation that acquiring a second strong mutation in the same pathway is unsustainable for the cell since it risks its death [12,13]. It was proposed as associated with tumor subtype, synthetic lethality, and positive selection [15]. We clarified that co-occurrence is restricted [16,17], since a sustained, additive effect on signaling strength of same — or redundant— pathway driver mutations is likely to hyperactivate the proliferation signal, triggering an oncogene-induced senescence (OIS) cellular program [1820].

OIS is a mechanism the cancer cell uses to proliferate- yet dodge cell death. Cancer cells opt for combinations of potent mutations. As early as 1953 [1], and again in 1969 [2], 1971 [3], 1999 [4], 2002 [5], 2015 [21], and more recently, it has been established that multiple co-existing mutations are required for the emergence of cancer. We also know that the number increases during proliferation. Yet, cells survive, suggesting that OIS combinations were sidestepped, or more likely, since somatic mutations emerge sporadically, cells harboring excluded combinations did not endure [22].

OIS, a tumor-suppressive mechanism arresting cell cycle progression, can be the main reason for excluding the co-occurrence of driver mutations in the same, here MAPK, pathway [2325]. Irreversible senescence expresses strong oncogenic stimuli [26]. Overexpression of oncogenes such as KRAS, BRAF, and MYC, which also generate strong proliferation signal, induce OIS, as does increased PI3K/AKT signaling. Loss of PTEN, which negatively regulates the PI3K/AKT pathway, can trigger OIS through a p53-dependent pathway [27], or via coupling with strong PI3K mutational variants. Both constitutively generate signaling lipid PIP3. Sustained hyperactivation of the PI3K/AKT/mTORC1 pathway results in OIS [28,29]. Examples of mutually exclusive relations between gene pairs include BRCA2-TP53, BRCA1-PARP1, and PTEN-PIK3CA in breast cancer; and BRCA1-CCNE1, BRAF-KRAS, ERBB2-KRAS in ovary cancer [30]. Tumor suppressors can be advantageous and detrimental contingent upon the oncogenic context [31]. A third hit—within the same or a different gene in the pathway—may represent alternative routes to tumorigenesis [32].

The mechanisms underlying the mutual exclusivity of oncogenes may depend on the tumor type and interacting oncoproteins. KRASG12D and BRAFV600E are mutually exclusive in lung cancer. Their co-expression triggers oncogene-induced senescence (OIS) [33]. Impaired B-Raf activation may avoid OIS, contributing to mutual exclusivity among certain BRAF mutations [34]. Activating mutations in RAS- and RAF-family proteins, especially the strong BRAFV600E hotspot is one example [35,36]. Absence of co-occurring NRAS and BRAF variants, such as BRAFV600E, in the TCGA skin cutaneous melanoma cohort illustrates mutual exclusivity. Similarly, BRAF and RAS gene family mutations are mutually exclusive in metastatic colorectal cancer tumors [35].

Here we aim to discover (i) cancer-specific mutations that co-exist (doublets or double mutations) in different proteins in tumor-specific signaling, in primary tumors and in metastases, and (ii) those that are excluded. These are significant aims. For the first (i), knowledge of mutations that co-exist in different proteins and mapped onto pathways, can aid in knowledge-based combination drug strategy [37,38], and identify cancer metastases. For the second (ii) synthetic lethality, where a strong mutation in one gene product can be sustained but not in two, parallels excluded multiple strong drivers, clarifying its molecular basis. Related to (i), is the cardinal question of whether same tumor type cells which harbor common driver mutations are likely to adopt common drug resistance mechanisms.

Here we reveal rare but impactful mutation patterns across proteins that drive distinct phenotypes, informing oncology of drug combinations. We observe that mutations disrupting similar functions are mutually exclusive. As to metastatic tumors, some mutations in trans are significantly higher. We provide a list of actionable mutations for metastatic tumors in breast cancer cohort and a tree of double, triple, and quadruple co-existing patterns across the pan-cancer metastatic tumors. These also address the questions of why these signatures, and what can they tell us about the possible molecular mechanisms which are involved. As to synthetic lethality, which has been of broad interest due to its pharmacological potential [3942], we provide a list, a rationale, and raise the possibility that OIS can provide the molecular basis of synthetic lethality.

Results

Tissue distribution and functional relevance of co-existing mutation pairs in cancer

To understand how mutations influence cancer evolution and how pre-existing mutations affect tumor initiation and metastasis [43], we analyzed statistically significant mutation pairs in different proteins using ~60,000 tumor samples from TCGA and GENIE pan-cancer datasets [44]. After identifying significant pairs, we mapped them to pathways and tissues and used Frequency Pattern Growth Tree analysis (treating each tumor as a transaction and alterations as items) to identify potential metastatic markers from co-existing mutations. Our analytical approach is outlined in Fig 1.

thumbnail
Fig 1. Overview of the study.

We obtained tumor mutations from TCGA and AACR GENIE datasets. Then, we applied Fisher’s Exact Test followed by Benjamini-Hochberg multiple corrections. After obtaining 3424 significant mutation pairs, we mapped their components to pathways and tissues for functional interpretation. This helped identify co-altered pathways and protein pairs harboring the mutations specific to tissues. We also identified metastatic tumor markers by utilizing the Frequency Pattern (FP) Growth Tree algorithm.

https://doi.org/10.1371/journal.pcbi.1013351.g001

The distribution of metastatic tumors across tissues with at least 50 samples is shown in Fig 2A. Notably, skin, breast, intestine, lung, ovary, and prostate cancers include a relatively higher proportion of metastatic cases, although primary tumors still outnumber metastases in all tissues.

thumbnail
Fig 2. Statistics of mutation pairs in different proteins.

(A) The windrose plot shows the number of primary (blue) and metastatic (light blue) tumors among tissues with at least 50 samples. The number of all tumors for each cancer type is specified in parenthesis. Breast, bowel, and lung have the highest number of primary tumors. Although the total number of metastatic tumors are less than primary tumors overall, skin, breast, intestine, lung, ovary, and prostate are among the relatively more metastatic. (B) There are 3424 significant mutation pairs in different gene (Fisher’s Exact Test p < 0.05, Benjamini-Hochberg q < 0.3). Their compositions are D + P (n = 1554), D + D (n = 1268) and P + P (n = 602) where D is “Driver” and P is “Passenger” mutation. (C) Box plot showing the frequency of mutation pairs constituents. For each pair, the component with lower frequency is put into “Constituent 1” and the other into “Constituent 2”. The figure shows that most pairs are composed of a frequent and a rare mutation. (D) Stacked bar plot of the mutation pairs in different proteins for 3307 co-existing and 117 excluded. For co-existing mutations, the distribution is as follows: D + D = 1155, D + P = 1550, P + P = 602. Excluded pairs include 113 D + D and 4 D + P. D (Driver), P (Passenger). (E) We call a protein pair A|B if there are mutations on proteins A and B. We divided the pairs into two groups: if the pair belongs to a common pathway, we put it into “Shared Signaling Pathway”, otherwise “Not Share Signaling Pathway”. 1284 protein pairs share at least one signaling pathway where the corresponding mutations are: 754 D + D, 480 D + P, 36 P + P). 638 of them do not share a signaling pathway, and 205 are D + D, 324 D + P, 108 P + P. For the remaining pairs, at least one of the proteins harboring them does not have pathway information. Co-existing mutations are usually present on protein pairs residing in different pathways and this could imply some driver mutations need a helper passenger mutation from a different pathway.

https://doi.org/10.1371/journal.pcbi.1013351.g002

Analyzing missense mutations yielded 3,424 significant mutation pairs across 285 proteins (Fisher Exact Test p < 0.05, Benjamini-Hochberg q < 0.3) (S1 Table). Using the Catalog of Validated Oncogenic Mutations from the Cancer Genome Interpreter [45], mutations were labeled as drivers (D) or passengers (P). The pairs comprised 1554 D + P, 1268 D + D, and 602 P + P combinations (Fig 2B). We categorized mutation pairs by placing the lower frequency mutation as “Constituent 1” and the higher as “Constituent 2” (Fig 2C, and Fig A in S1 Text). Most pairs showed one frequent and one rare mutation. Of the total pairs, 3307 co-existed and 117 were excluded (Fig 2D, S1 Text). Co-existing pairs included 1155 D + D, 1550 D + P, and 602 P + P, while excluded pairs comprised 113 D + D and 4 D + P.

To assess whether mutually exclusive gene pairs in our analysis were enriched for synthetic lethal interactions, we compared our results with established synthetic lethality databases (SynLethDB and BioGRID [46,47]). We identified that 32% (12/37) of mutually exclusive pairs overlapped with known synthetic lethal interactions. Statistical analysis using Fisher’s exact test revealed significant enrichment (p-value = 9.33 x 10-6, OR = 6.32), indicating that mutual exclusivity strongly correlates with synthetic lethality. The overlapping pairs included prominent cancer-related genes such as AKT1|PIK3CA, BRAF|KRAS, and EGFR|TP53. In contrast, co-occurring gene pairs showed no significant enrichment for synthetic lethality (p-value = 0.895, OR = 0.54), suggesting that mutual exclusivity serves as a more reliable indicator of synthetic lethal relationships and may be valuable for identifying clinically relevant therapeutic targets.

To address whether mutually exclusive mutations potentially encoding oncogene-induced senescence (OIS), we conducted Gene Ontology enrichment analysis at MSigDB of the mutually exclusive mutation set. This analysis revealed significant overrepresentation of processes critical to cellular stress responses, including apoptotic processes (FDR q-value = 3.13 x 10-10), regulation of cell population proliferation (FDR q-value = 1.2e-9), and regulation of programmed cell death (FDR q-value = 7.04 x 10-9). Additional enrichment was observed in pathways involving phosphorus metabolic process regulation, protein modification processes, and phosphorylation regulation (FDR q-values between 7.38 x 10-10 and 3.31 x 10-9). These enriched terms reflect key hallmarks of oncogene-induced senescence, particularly the balance between proliferative signaling and growth arrest characterizing the senescent state, providing substantial evidence supporting our hypothesis that mutually exclusive mutations are enriched in pathways contributing to or responding to oncogene-induced stress [48,49].

Our analysis identified 3424 pairs with significant co-existing mutations (denoted as A|B when mutations occur in proteins A and B in the same tumors). Analysis of pathway relationships using the KEGG database (detailed methods in S1 Text) revealed 1280 protein pairs sharing at least one pathway (754 D + D, 487 D + P, 39 P + P), while 638 pairs (206 D + D, 324 D + P, 108 P + P) did not share pathways (Fig 2E). We identified 1,284 protein pairs containing 3,424 doublets and examined their tissue distribution in two categories: oncogene-oncogene (OG-OG) and oncogene-tumor suppressor (OG-TSG) pairs (detailed methods in S1 Text). Node colors show the fraction of samples with mutations in both proteins, normalized to samples with mutations in either protein per tissue. For clarity, we displayed OG-OG pairs with >10 mutation pairs (47 total) (Fig B in S1 Text) and OG-TSG pairs with >20 mutation pairs (54 total) (Panel B of Fig B in S1 Text).

The odds ratio analysis assessing whether the co-existing mutation constituents occur more (or less) frequently than expected under the null hypothesis of independence identified 3,307 co-existing mutation pairs. Notably, 45% of the significant co-occurring pairs involved a driver–passenger (D–P) combination, suggesting that passenger mutations may not arise entirely at random. Instead, they may be functionally tolerated or even selected in the presence of specific driver mutations, hinting at a possible cooperative or permissive relationship. In this context, the passenger mutation could contribute to pathway crosstalk, fine-tune signaling, or influence the tumor phenotype in a driver-dependent context. Our findings suggest a non-random pattern of co-occurrence, possibly indicating that certain passenger mutations are selectively maintained in the presence of specific drivers, potentially supporting tumor progression through complementary or compensatory mechanisms across pathways.

Most mutually exclusive pairs comprise driver mutations of varying frequencies, indicating strong oncogenic signals are complemented by less frequent mutations. These patterns highlight the prognostic potential of mutation combinations (detailed methods in S1 Text), and functional evaluation for therapeutics.

Co-existing KRAS and TP53 mutations alter transcriptional networks in pancreatic cancer

A driver alteration can influence protein signaling and transcriptional profiles. In transcriptional regulation networks, altered proteins may impact upstream activities via feedback loops [50]. We analyzed transcriptional differences between pancreatic adenocarcinomas (PAAD) with mutation pairs and those with single mutations.

KRASG12D is one of the most frequent mutations in pancreatic cancer, often paired with TP53 mutations that impair DNA binding. Among 1502 PAAD tumors, 1007 harbor KRASG12D mutations. Of these, 397 tumors (from TCGA and ACCR GENIE) exhibit significant mutation pairs involving KRASG12D and TP53, including mutations at positions 248 and 273 (DNA contact sites) and position 175 (affecting Zinc ion binding, destabilizing p53). Group 1 includes 24 PAAD tumors with KRASG12D-TP53 significant mutation pairs, while Group 2 (71 tumors) contains single mutations in either KRAS or TP53, or mutations not contributing to significant pairs. Group1 and Group 2 contain tumors only from TCGA with the available transcriptomic data in PAAD.

Transcriptomic analysis revealed 394 differentially expressed genes (Fig 3, 137 upregulated, 257 downregulated; adjusted p-value < 0.05, |log2FC| > 0.5) between Groups 1 and 2, with p-values corrected for multiple hypothesis testing using the Benjamini-Hochberg method. Group 1 tumors showed enrichment of proteins involved in immune response, cell proliferation, and cell-cell signaling. Using the TRRUST (v2) dataset, 55 upstream transcription factors (TFs) were identified (Fig 3), including SP1 (Sp1), SP3 (Sp3), NFKB1, JUN and TP53 (p53). A regulatory network constructed with CancerGeneNet highlighted upregulated proliferation phenotypes, aligning with the transcriptional impact of TP53 driver mutations compared to single mutations [51,52].

thumbnail
Fig 3. Transcriptome analysis of tumors with mutation pairs.

Comparison of gene expression profiles of double mutant and single mutant TP53 cases in KRASG12 mutant PAAD tumors revealed 394 differentially expressed genes (137 upregulated and 257 downregulated) between Group 1 and Group 2. Group 1: PAAD tumors having at least one significant pair composed of KRASG12 and TP53 mutations); Group 2: PAAD tumors having mutation either in KRAS or in TP53 or mutation in any of these proteins that does not contribute to a significant pair. The heatmap shows the z-scores of differentially expressed genes (DEGs) across patient tumors. 55 TFs are retrieved as the transcription factors regulating the differentially expressed genes including the main regulators SP1, SP3, NFKB1, JUN and TP53 obtained from TRRUST version 2 dataset, a manually curated database of human and mouse transcriptional regulatory networks. SP1 (encodes Sp1), SP3 (encodes Sp3), NFKB1, JUN and TP53 (encodes p53).

https://doi.org/10.1371/journal.pcbi.1013351.g003

Sp1 (Specificity Protein 1) is overexpressed in various tumor types and correlates with poor patient survival. High levels of Sp1, Sp3, and Sp4 contribute to cancer cell development, survival, and migration to tissues, including the pancreas [53,54]. In pancreatic ductal adenocarcinoma (PDAC), co-occurring TP53 and KRAS mutations drive tumor growth and metastasis through interaction with CREB-1 (cAMP responsive element binding protein 1) [55]. Mutant p53 and CREB-1 upregulate FOXA1, a pro-metastatic pioneer transcription factor that activates its network and promotes WNT/β-catenin signaling, a key driver of PDAC metastasis. Inhibiting CREB-1 reduces FOXA1 and β-catenin expression, limiting PDAC metastasis. Additionally, TP53 missense mutations in KRAS-transformed PDAC are linked to poor differentiation and exhibit gain-of-function properties [56].

Collectively, these show that co-existing KRAS and TP53 mutations alter transcriptional networks in pancreatic cancer, calling for reconsideration of PDAC drug regimen combinations accounting for both to restrain drug resistance.

Functionally similar alterations do not co-exist in tumors

We define alterations with similar phenotypic outcomes as “functionally equivalent,” which rarely or never co-occur and are often mutually exclusive. One example involves KRASG12D and CDKN2A deletions in pancreatic cancer. Mutually exclusive pairs may act in separate pathways but yield similar phenotypic effects. While co-existing mutations in KRAS are rare, they drive downstream changes promoting a proliferation phenotype. Fig 4A highlights mutations in KRASG12D-mutated PAAD tumors, where alterations in SMAD4, CDKN2A, U2AF1, and GNAS are mutually exclusive and rarely co-occur. CDKN2A and SMAD4 mutations, located in binding regions, rarely coexist in tumor proteins. CDKN2A mutations (R58*/Q, R80*, H83N/R/Y, and D84G) occur at the CDK6 interface (PDB: 1BI7) (Fig 4B), with all but R58*/Q spatially clustered. These mutations disrupt CDKN2A-CDK6 interactions, inhibiting cyclin D binding and retinoblastoma phosphorylation, thereby suppressing proliferation. SMAD4 mutations at D351 and R361 localize to the SMAD4/SMAD3 heterodimer interface [57] (PDB: 1U7F (trimer)) (Fig 4C). These mutations disrupt SMAD homo- and hetero-oligomerization. All mutually exclusive mutations shown in Fig 4 are on non-overlapping paths linked to SMAD4 (encodes Smad4) and their functional outcome may be similar. U2AF1 and GNAS (encodes Gαs) paths are linked to Ras and downstream proliferation phenotype (Fig 4D).

thumbnail
Fig 4. Co-existing mutations in PAAD tumors.

(A) Oncoprint of the mutations that coexist with KRASG12. KRASG12 partners with CDKN2A, SMAD4, GNAS, and U2AF1 which show a mutually exclusive pattern. (B) The structure of CDKN2A/CDK6 complex shows that the mutations R58, R80, H83, and D84 are in the interface region between CDKN2A and CDK6. (C) SMAD4 mutations D351 and R361 are in the interface between SMAD4/SMAD3 complex. (D) A subnetwork of highly frequent alterations co-existing with KRASG12 leading to proliferation and oncogenic signals.

https://doi.org/10.1371/journal.pcbi.1013351.g004

Co-existence of MYC amplification and CDKN2A deletion is relatively rare. MYC drives proliferation, while CDKN2A, a tumor suppressor, restricts cell cycle progression in response to oncogenic or cellular stress. Their cooperation is linked to TP53 mutations in the DNA-binding domain. This aligns with the absence of multiple strong alterations in the same or functionally similar pathways, as such combinations can trigger OIS [58,59].

Altogether, such data pointing that functionally similar alterations do not co-exist in tumors, bespeak of which drug combination may be ineffective.

Distinct oncogenic signaling networks drive breast and pancreatic cancer subtypes

The frequency of co-occurring mutations varies across tissue types, with affected proteins spanning multiple signaling pathways. We analyzed 3,424 significant mutation pairs, examining their tissue-specific distribution across 46 KEGG pathways (see S1 Text). Bubble plot (Fig 5A) illustrates the number and fraction of tumors with mutation-associated pathways across 10 tissues and 25 pathways. Bowel and uterus tissues show mutation pairs across all pathways, while breast, lung, and pancreas tissues exhibit accumulation in specific pathways, including PI3K/AKT, MAPK, FoxO, Neurotrophin, and mTOR. Mutation pairs in breast, lung, and pancreas tissues accumulating in specific pathways prompt the construction of cancer-specific subnetworks for breast and pancreatic tissues. Seed proteins with co-existing mutations were selected based on thresholds: breast (≥0.5%) and pancreas (≥1.5%).

thumbnail
Fig 5. Breast cancer specific subnetwork.

(A) Bubble plot representing the number and fraction of pathway-mutation pair- tumors in each tissue by node size and color, respectively. On the x-axis and y-axis there are 10 tissues and 25 pathways, respectively. Bowel and uterus have mutation pairs across all listed pathways while in breast, lung and pancreas tissues they accumulate in specific pathways including PI3K/AKT, MAPK, FoxO, Neurotrophin, mTOR. (B) Breast cancer specific subnetwork shows seed proteins, oncogenes and tumor suppressors obtained with the Page Rank algorithm from Omni Path with the seed proteins that have mutation pairs, and the tissue specific fraction is greater than 0.5. Blue nodes are the proteins from the PPI network, pink nodes are the seed proteins which are mutation pair components. Transcription factors (TFs) are V-shaped nodes. The border color is green if the gene is tumor suppressor gene and red if oncogene. The edges are solid lines if there is an edge between nodes in the PPI network that contribute to a mutation pair. Dashed lines depict the direct interaction in the PPI. Source and target shape are derived from TRRUST representing the activation or repression. TFs RB1 (encodes the protein Rb1), APC (encodes the protein APC), CTNNB1 (encodes Catenin Beta-1), MYC (encodes the protein Myc), TP53 (encodes the protein p53).

https://doi.org/10.1371/journal.pcbi.1013351.g005

From 8,512 breast and 1,853 pancreatic tumors, a directed PPI network with 20,029 nodes and 276,179 edges was derived from OmniPath [60] (see S1 Text). For breast cancer, 15 seed proteins were used, with initialization scores set to 1 for the PageRank algorithm (α = 0.85). The resulting subnetworks contained 59 nodes and 343 edges for breast (Fig 5B and Fig C in S1 Text) and 40 nodes and 267 edges for pancreas (Fig D in S1 Text). Transcription factors regulating subnetwork nodes were identified, along with their functional roles (activation or repression) to determine arrow types.

To evaluate the robustness of gene prioritization to the choice of the damping factor (α) in the personalized PageRank algorithm, we performed a sensitivity analysis across multiple α values (0.70, 0.75, 0.80, 0.90, and 0.95). Using the same tissue-specific seed genes and the OmniPath protein–protein interaction (PPI) network [60], we compared gene rankings generated with each α value to those obtained with α = 0.85 using Spearman rank correlation. The correlations were consistently high across all tested values, demonstrating strong agreement in rankings. For the pancreas-specific analysis, Spearman coefficients ranged from ρ = 0.9922 to ρ = 0.9997, while for the breast-specific analysis, they ranged from ρ = 0.9944 to ρ = 0.9998. These findings indicate that gene prioritizations are highly robust to variations in the damping factor, supporting the reliability of results obtained with α = 0.85.

For breast cancer, the resulting subnetwork includes the TFs RB1 (encodes the protein Rb1), APC (encodes the protein APC), CTNNB1 (encodes Catenin Beta-1), MYC (encodes the protein Myc), TP53 (encodes the protein p53). Oncogenes or tumor suppressor proteins (nodes) have red and green borders, respectively. Gene set enrichment analysis in MsigDB [61,62] and Webgestalt [63] yielded subnetwork proteins enriched in the following pathways prominently (panels A and B of Fig E in S1 Text): GPCR Signaling (41 proteins overlapped), G-alpha signaling events (28 genes), Chemokine signaling pathway (25 genes), GnRH signaling pathway (21 genes), Gap junction (20 genes), Endothelins (19 genes), LPA receptor mediated events (19 genes). GPCRs are crucial in tumorigenesis, angiogenesis, and metastasis by modulating heterotrimeric G-proteins pathways [64]. GnRH agonists inhibit cancer cell growth and invasion while suppressing ovarian steroid production [65,66]. Chemokines activate pathways like PI3K/AKT/NF-κB and MAPK/ERK, promoting proliferation [67,68]. Targeting these pathways offers potential for breast cancer treatments [69]. In the pancreas cancer subnetwork, key TFS such as APC (encodes APC), RB1 (encodes pRb), DNMT3A, TP53 (p53), MYC, CTNNB1 (Beta Catenin-1) and SMAD4 (Smad4) contribute to mutation pairs. Gene set enrichment analysis with WebGestalt (Panels A and B of Fig F in in S1 Text) revealed upregulated pathways like PI3K/AKT/NF-κB and MAPK/ERK signaling, and platelet activation.

To contextualize our results and distinguish biologically meaningful signals from random network effects, we examined pathways enriched in fewer than 50% of subnetworks reconstructed from randomized interactomes in breast and pancreatic cancers. These infrequently enriched pathways may represent more robust, non-random features of the original network topology. Notably, pathways such as IL-17 signaling, Hedgehog signaling, and Necroptosis—known to regulate inflammation, cell death, and immune evasion—appeared in only 39–44% of random subnetworks, suggesting genuine biological relevance in breast cancer subnetworks. In contrast, pathways like Autoimmune thyroid disease, Phagosome, and systemic lupus erythematosus were rarely enriched (≤10%), and several—including Olfactory transduction and Lysosome—were never enriched in the random subnetworks.

Pathways such as GPCR signaling, GnRH signaling, and chemokine-mediated cascades, while repeatedly enriched, have broad and pleiotropic roles in cellular signaling and may dominate enrichment outputs due to their general connectivity in the interactome. Despite this, the consistent enrichment of key oncogenic pathways—such as PI3K/AKT and MAPK signaling—in both the original and randomized settings support their centrality in cancer progression and aligns with the transcription factor disruptions identified in the pancreatic cancer subnetwork. While these pathways appear most prominent in terms of overlap size, their recurrence in analyses of randomized networks suggests that they may arise due to inherent network topology rather than specific biological signal.

In the pancreas cancer subnetwork, several signaling pathways—including IL-17, Adipocytokine, and Hedgehog signaling—were each enriched in 40% of subnetworks, while Pertussis signaling, involving GPCR activation, appeared in 44%. These pathways are known to influence inflammation, metabolism, and developmental signaling, all central to tumor biology. Oocyte meiosis (25%) may reflect dysregulation in cells. Their selective presence in the original network, but low recurrence across randomized versions, suggests these enrichments reflect disease-specific biological features.

Complex biological functions are mostly managed through protein-protein interactions (PPIs), and mutations occurring in proteins may alter the cell phenotypes [70]. Alterations perturbing PPIs can impact drug outcomes [7173]. Detection of druggable oncoPPIs [71] is promising since some could constitute more cancer-specific therapeutic targets [74,75], although the pre-existing mutation load can impact the efficacy [8]. Among the Fujian cohort mutations classified as metastatic markers in gastric carcinomas, one is a PTPRT mutation [76], and another, Chondrosarcoma TERT promoter mutation is a metastasis marker [77]. A real-world clinicogenomic dataset permitted the discovery of biomarkers that predict treatment outcomes that affect patients’ survival [78]. Treatment-specific genetic alterations in non-small cell lung cancer (NSCLC) include mutually exclusive and co-occurring mutations [79,80]. A whole-genome sequencing of a large Swedish cohort study [81] aimed to identify prognostically relevant co-occurring mutations in microsatellite stable colorectal cancers. The analysis revealed 23 significant co-mutations, with specific pairs such as APC-TCF7L2 and BRAFV600E-RNF43 showing distinct associations with favorable or poor survival outcomes, highlighting the added prognostic value of co-mutation profiling over single-gene analysis.

Overall, coupled with the literature, our analysis indicates that pathways carrying co-existing mutations can help in fathoming the evolution of certain cancer types and guide target selection.

Co-existing mutations are mostly in primary tumors and some rare mutations are signatures of breast cancer metastatic tumors

Genomic comparisons between primary and metastatic tumors show only moderate increases in mutation burden, with common mutations in TP53, MYC, and CDKN2A [8285]. While these studies reveal genomic similarities, key questions remain about the molecular mechanisms driving metastatic phenotypes and their activation [86,87]. Analysis of 40,979 tumors across 25 cancer types found no distinct mutational patterns associated with metastasis [85].

Analysis of the AACR GENIE cohort’s primary and metastatic breast cancer samples revealed that while over 70% of mutation pairs exist in primary tumors, some pairs are metastasis specific. Using frequency pattern growth tree analysis (Fig 6), we examined mutation patterns in the Metastatic Breast Cohort (n = 2385), comprising 52% IDC, 25% BRCA, 9% ILC, 4.5% BRCNOS, and other subtypes. The FP growth tree, analyzing 456 mutations across 2385 samples, yielded 31 nodes and 51 edges, with 40 significant mutation pairs serving as potential metastatic markers (Fig 6A, S2 Table). Strong associations emerged between mutations in ESR1, GATA3, and PIK3CA in metastatic tumors, particularly ESR1 mutations at positions 380, 536, 537, and 538 pairing with PIK3CA mutations at positions 542, 545, and 1047 (Fig 6B). A volcano plot analysis (Fig 7A) illustrated co-existence and mutual exclusivity patterns of metastatic markers, while further analysis confirmed ESR1|PIK3CA mutation pairs as significant metastatic indicators (Fig 7B, p-value < 10-13). GATA3 mutations at positions 293, 335, and 408 showed coupling with ESR1 or PIK3CA mutations, consistent with GATA3’s known functional relationship with ESR1 in transcriptional regulation [88]. The relationships between mutations were visualized in a bubble plot (Fig 7C), excluding 330 pairs with fewer than three co-existing mutations. Analysis of 393 metastatic breast tumors from the Hartwig Medical Foundation (HMF) dataset [89] reveals that 22 tumors harboring both PIK3CA and ESR1 mutations. Specifically, PIK3CA mutations at positions 542, 545, and H1047 paired with ESR1 mutations at positions 380, 536, 537, and 538, confirming the metastatic markers identified by the FP growth tree algorithm.

thumbnail
Fig 6. Rare mutations among the co-existing mutations are specific to metastatic tumors.

(A) Frequency pattern growth tree of missense mutations shows breast cancer metastatic markers where links represent association between mutations. Edges represent correlation strength. Ancestral mutations are shown as dark blue nodes, leading to consequential light blue mutations. Mutation frequencies are represented by node sizes. Mutation pairs accommodated by ESR1, GATA3, and PIK3CA proteins are enriched in metastatic tumors. 45 mutation pairs from 31 nodes and 51 edges in the FP growth tree are among those evaluated as metastatic markers. (B) ESR1|PIK3CA mutations are enriched in breast cancer metastatic tumors derived from FP Growth Tree in panel A. ESR1 mutations at positions 380, 536, 537 and 538 are paired with the major drivers of PIK3CA at positions 542, 545, and 1047.

https://doi.org/10.1371/journal.pcbi.1013351.g006

thumbnail
Fig 7. Metastatic marker statistics.

(A) We visualize each alteration pair with a volcano plot, where the x-axis represents and the y-axis represents . Pairs with indicate strong effect sizes, and a threshold of corresponds to p < 0.05, our chosen significance level as this threshold is widely used in biological research to denote statistical significance. (B) Propensity of mutation pairs from ESR1|PIK3CA and from other proteins where one side is from ESR1 or PIK3CA in metastatic tumors. (C) The bubble plot evaluating all possible relationships of the mutations at the FP growth tree, where node color depicts coexistence characteristics, and node radius is proportional to the Chi-Square value of the same cohort.

https://doi.org/10.1371/journal.pcbi.1013351.g007

In addition to these, we provide a tree of double, triple, and quadruple co-existing patterns across the pan-cancer metastatic tumors (Panels A and B of Fig G in S1 Text). To construct the tree in Panel B of Fig G, a minimum support of 7 × 10 ⁻ ⁵ and a minimum confidence threshold of 0.95 were used. In the tree, the node TP53R273 is followed by either one of the mutations on ATRX at the positions 1045, 1049, and 1426 where the third node of the triplet is IDH1R132. IDH1, TP53, and ATRX mutations are associated with low-grade astrocytomas collaborate to inhibit SOX2 (a nucleosome-binding pioneer TF acting to make genes accessible to the transcription machinery) and prevent the development of human neural stem cells [90,91]. Quadruples include mutations from BRAF, MSH3, BLM, RNF43, PLK2, NBN. In response to the context-dependent nature of cancer genomics, we derived a breast cancer–specific co-mutation tree illustrating doublet, and triplet mutation patterns, rather than performing a pan-cancer analysis. Due to the limited sample size of the metastatic breast cancer cohort, we applied less stringent thresholds (minimum support = 0.0007, minimum threshold = 0.01). The triplets in metastatic breast cancer include ESR1D538+PIK3CAH1047 + SF3B1K700 and PIK3CAE545 + GATA3M293+NUP93E14. To improve interpretability, direct mutation pairs connected only to the null root were excluded. The resulting tree is shown in Fig H in S1 Text, and the corresponding data is provided in S3 Table.

As breast cancer represents 25% of female cancers globally, with hormone receptor-positive being the most prevalent subtype, marked by estrogen receptor (ER) expression. While endocrine therapy targeting ER is standard, resistance often develops through ESR1 mutations, affecting up to 36% of metastatic breast cancer (MBC) patients [92]. Studies by Gerratana et al. revealed that ESR1 codon 537 variants associated with ER and RAF pathway SNVs, MYC CNVs, and bone metastases, while codon 538 variants correlate with cell cycle SNVs and liver metastases. PIK3CA codons 1047 and 542 showed associations with CNVs, informing therapeutic strategies [93].

Thus, although primary and metastatic tumors share driver mutations [85], most are in primary tumors. Some rare mutations appear enriched in metastatic tumors and may serve as potential markers, although further validation with larger datasets is required to establish their significance as metastatic signatures. A more comprehensive cross-cancer metastases analysis is vital for early detection.

Discussion

Tumorigenesis involves complex protein alterations [9497], where co-existing mutations in different proteins (in trans) undergo evolutionary selection [12,13]. While mutations in parallel pathways often co-exist, those in identical or redundant pathways typically show mutual exclusion, reflecting cellular sustainability limits [14]. This exclusion pattern links synthetic lethality and oncogene-induced senescence (OIS) [1520,98], where hyperactivation of proliferation signals becomes unsustainable.

Understanding cancer evolution requires systematic investigation of mutation epistasis [15,98,99102], supported by both driver-passenger mutation classification tools [30] and methods for identifying mutually exclusive gene sets [103]. There are several tools for identifying biologically meaningful, mutually exclusive gene sets [30], including statistical frameworks (MEScan [103], MEGSA [104], CoMET [105]) and driver module detection (MEMo [106], CoMDP [107]). These approaches, ranging from statistical frameworks to driver module detection [104108], culminate in tools like SELECT [108] that capture evolutionarily dependent alterations affecting drug response. Our analysis of in trans mutations identified 3424 significant co-existing mutations, predominantly comprising one rare and one frequent mutation, with 3307 co-existing pairs and the remainder being mutually exclusive. Co-existing mutations can enhance oncogenic signaling, while mutual exclusivity may result from unsustainable hyperactivity through same-pathway mutations or combinations with pioneer transcription factors like FOXA1 [109]. The FP-growth tree method [110] revealed increased frequency of PIK3CA and ESR1 mutation pairs in metastatic versus primary tumors.

Previous research on in cis mutations (multiple mutations in the same proteins) [44,51,111113] showed increased oncogenic activity and drug sensitivity, with double mutations enhancing PI3K signaling and tumor growth. Clinical trials suggest double-mutant breast cancers respond better to PI3K inhibitors than single-mutant cases [111], though exceptions exist, as seen in EGFRL858/T790’s increased resistance compared to EGFRL858. Building on this, studies of PI3Kα variants showed that co-existing mutations—often combining a hotspot with a weaker mutation—can shift conformational ensembles toward more active states and expose mutation-specific cryptic pockets, underscoring the need for variant-specific combination allosteric therapies [114].

Molecular studies have advanced our understanding of metastatic mechanisms and prognostic factors in cancers, particularly focusing on mutation patterns and their clinical implications. Wang et al. analyzed 114 Chinese NSCLC patients, finding higher metastasis in older patients with TP53 mutations, while EGFR, ASXL2, and STK11 mutations correlated with better survival [115]. Studies on thyroid cancer highlighted advances in biomarker detection enabling personalized treatment [116]. Analysis of 154 lung cancer brain metastases revealed greater tumor heterogeneity than primary tumors, with shared mutations (TTN, TP53), enhanced mitochondrial metabolism but suppressed immunity; combining mitochondrial-targeting drugs with immunotherapy improved mouse survival [117]. In 1817 KRAS-mutant lung adenocarcinomas, KEAP1 and SMARCA4 mutations correlated with metastasis and poor survival, while STK11’s impact varied by metastatic site and KEAP1 status [118]. Potential prognostic biomarkers in lung adenocarcinoma were identified through integrative analysis of mRNAs, miRNAs, and lncRNAs and cell cycle–related ceRNA network [119].

A limitation of our approach is the lack of redundancy reduction by tissue type, which may have allowed cancers with larger sample sizes—such as breast and pancreas—to dominate some signals. While this enabled pan-cancer discovery, it may introduce tissue bias. We later addressed this by identifying the tissue distribution of key doublets and selecting case studies from cancer types with both enrichment and sufficient sample size. Yet another limitation of our approach is the reliance on KEGG as pathway database; while KEGG is widely used and well-curated, different pathway resources such as Reactome [120] or WikiPathways [121] may yield alternative pathway overlaps due to variations in pathway definitions and gene assignments.

This study addresses two critical yet understudied challenges in cancer research: (i) proposing candidate markers potentially linked to metastatis; (ii) mutations whose combinations may be linked to oncogene-induced senescence (OIS), thus are rare, or absent in cancer genomes. As to point (ii), we conjecture that the absence of these combinations may be because the oncogenic signal they generate is too strong for the cell to tolerate. Mutations related to OIS likely emerge sporadically during cancer evolution, and further work is needed to validate this hypothesis. Cells do not have a mechanism for a priori negative selection. Rather, negative selection, or exclusion, is the outcome of cell death. Here we provide a list, a rationale, and suggest that OIS can provide the molecular basis to synthetic lethality, which has been investigated on the gene level. As to (i), to our knowledge, we provide a first candidate list of metastatic markers specific to metastatic samples, including actionable mutations for metastatic tumors, as well as a tree of double, triple, and quadruple co-occurrence patterns across the pan-cancer metastatic. This list (S2 Table) may help in early metastases detection. Advanced computational approaches allow detection of patterns. Co-existing mutation signatures for metastatic tumors, coupled with identification of the respective proteins and pathways, can create interactive maps. Linking them with drugs can create immensely useful tools for the attending physicians [122].

Finally, while the usefulness of metastatic markers to drug discovery is self-evident, exactly how to identify OIS mutations in protein pairs is less so, as indeed is also the case in synthetic lethality [123]. OIS restriction is observed in same- and redundant-pathways. Classically, drugs aim to block protein activity and downstream signaling. Introducing drugs that recreate their action is expected to lead to cell death. At the same time, controlling the signal is challenging, as strong signals can be associated with cell proliferation, but too strong a signal may result in oncogene-induced senescence.

Materials and methods

Data collection and processing

The data and preprocessing protocol is same as in [44] and the corresponding code and input data files are available at https://github.com/bengiruken.

Identification of significant co-existing mutations

Out of a total of 395,801 mutations with a variant allele frequency (VAF) greater than 0.125 [44], 20,157 mutations were found in at least three non-hypermutated tumor samples. We generated all possible binary combinations of these 20,157 mutations, excluding pairs from the same gene. For each unique mutation pair, we created a contingency table. This process yielded 366,573 potential co-existing mutations in different genes, which we evaluated across 59,048 non-hypermutated tumor samples using Fisher’s Exact Test for statistical significance where the contingency table is as follows (Table 1):

where d = 59,048 − (a + b + c) and 59,048 is the number of non-hypermutated samples. We did not apply any initial filter based on the number of tumors with co-existing mutations—cases where a = 0 were also tested. We then adjusted for multiple comparisons using the Benjamini-Hochberg method and proceeded with downstream analyses using 3,424 mutation pairs that had a q-value below 0.3 and were observed in at least three tumor samples. If a co-existing mutation constituent is among the 5601 driver mutations from the Catalog of Validated Oncogenic Mutations (Cancer Genome Interpreter [45]), it is labeled as known driver (D), otherwise passenger (P).

Transcriptome analysis

To identify differentially expressed genes in the group of patients with co-existing mutations compared to the single mutant patient group, we downloaded RNA-seq transcriptome data of the TCGA project from the cBioPortal database (https://www.cbioportal.org). We used median Transcripts Per Kilobase Million (TPM) values of RNA-seq data of PAAD cohort of TCGA. For the PAAD cohort, 177 patients with TPM values, we constructed two groups, where Group 1 is tumors having at least one significant co-existing mutation of type KRASG12D/V/C + TP53mutation and Group 2 is tumors having either single mutant KRASG12D/V/C or single mutant TP53. We calculated the log2FC value of each gene between the double mutant and single mutant groups by using the formula:

(1)

We identified differentially expressed genes between the double mutant tumors group and the single mutant tumors group (comparison of means of TPM values by Mann-Whitney U-Test). If || > 0.5 and adjusted p-value < 0.05 we considered the corresponding gene as differentially upregulated or downregulated in the double mutant group. Benjamini-Hochberg procedure to control the false discovery rate (FDR) for multiple correction was performed using the multipletests function in the statsmodels Python package. We continued our analysis with these differentially expressed genes, and calculated z-scores of each gene as follows:

(2)

where is µ is the mean of TPM values and σ is the standard deviation across all samples in the double mutant and the single mutant groups. After obtaining z-scores for each gene, we sorted the genes as downregulated and upregulated and represented these values as a heatmap (https://seaborn.pydata.org).

We used Webgestalt (http://www.webgestalt.org) for the gene set enrichment analysis where the functional database is selected from Reactome and significantly up- or down-regulated pathways are found. FDR threshold is selected 0.05 and the list of genes ranked by their logFC values are given as input.

Personalized PageRank algorithm

We used a network diffusion-based algorithm to find the most affected region of the interactome given a set of nodes. Given a directed or undirected graph G(v,e) where v ∈ V and e ∈ E and a set of seed nodes S ⊆ V, the personalized PageRank algorithm solves the seed set expansion problem, where it finds which additional nodes may exist in the community besides the nodes in S and ranks them according to their importance. We used the PageRank [124] function implemented in Python networkx library [125]. The damping parameter, alpha, is selected as 0.85 and the number of iterations are 100.

We applied the personalized PageRank algorithm to prioritize genes based on their proximity to tissue-specific seed genes within the OmniPath PPI network [60]. Following standard practice in network analyses and the original PageRank formulation [126], we set the damping factor α to 0.85, which balances local and global network exploration. To assess sensitivity to this parameter, we recomputed PageRank scores using α values of 0.70, 0.75, 0.80, 0.90, and 0.95, keeping the network and seed genes constant. We then compared gene rankings derived at each α to the rankings obtained with α = 0.85 using Spearman rank correlation, a non-parametric statistic appropriate for evaluating rank-order consistency.

Tissue specific subnetworks

To determine tissue specific seed proteins to perform Personalized PageRank algorithm for obtaining tissue specific subnetworks, we calculated the fraction of double-mutant tumors among all tumors in the corresponding tissue for each doublet. Then we evaluated the gene couples harboring co-existing mutation components as seed proteins if the fraction of the tumors with co-existing mutations is greater than 25%. We assigned initial weights as 1. We run the PageRank algorithm on the PPI network from OmniPath [60] with alpha = 0.85. We added the edges with score greater than the threshold 0.001 to the breast specific subnetworks.

We evaluated 46 signaling pathways from KEGG. We obtained pathway-double-mutant tumors by taking the union of all double mutants when at least one of the proteins belong to the corresponding pathway. We filtered the pathways that have less than 80 double mutant tumors and tissues and pathways that have less than 10% double mutant tumor rate. We constructed cancer type specific subnetworks for breast and pancreas tissues with seed proteins that have co-existing mutations and tissue specific double mutant fraction is greater than a certain threshold (thresholds are as follows: breast, 0.5; pancreas, 1.5). The number of seed proteins for breast cancer and pancreas cancer are 12 and 23, respectively.

Shared pathway membership between genes was determined based on overlap in KEGG gene sets, not by matching pathway names, to ensure functional relevance rather than semantic similarity.

To evaluate the robustness of pathway enrichment results against the native topology of the PPI network, we generated 100 randomized directed networks using a degree-preserving edge-swapping algorithm. For each randomized network, PageRank scores were computed using a damping factor of 0.85 and a fixed personalization vector. Genes exceeding a threshold score of 0.002 were used to reconstruct subnetworks.

We then performed pathway enrichment using the KEGG 2021 Human gene sets in gmt format with the GSEApy [127] package. A local Enrichr-based enrichment approach with retry mechanisms was used to ensure reliable results. For each pathway enriched in the original subnetwork, its frequency across the 100 randomized subnetworks was calculated and expressed as a percentage. This allowed identification of pathways that were specifically enriched in the original network and not due to network structural properties.

FP Growth Tree construction

We used the mlextend library’s FreqItems and AssociationRules functions for mining frequent item sets and association rules. The FP growth algorithm is selected for tree construction where each node represents one alteration and each edge in the tree represents the association of the nodes [110]. In the constructed tree, all nodes in the path from root to the distant node are associated with each other and strongly present together in the tumors. The implementation of this workflow or additional documentation on how the FP-Tree algorithm is adapted for this specific use case is available at https://github.com/ugur0sahin/FMPSeeker.

The tendency of the alterations to be specific to metastatic tumor is calculated by

(3)

where xi is the number of metastatic tumors having co-existing mutation i, Ni is the number of tumors having co-existing mutation i, X is the number of metastatic tumors and N is the number tumors in our dataset.

To assess co-occurrence among alterations in the metastatic breast cancer cohort, we first restricted our alteration matrix to this subset of patients. We then evaluated all unique pairs of alterations using itertools.combinations() in Python.

For each alteration pair (A, B), we extracted binary presence/absence vectors across samples from a pandas DataFrame and constructed a 2 × 2 contingency table based on the four possible joint states:

  • a: both alterations present,
  • b: only alteration A present,
  • c: only alteration B present,
  • d: neither present.

These counts form the basis for statistical testing. We used scipy.stats.fisher_exact to compute both the odds ratio (OR), calculated as

and the Fisher exact test p-value for each pair. Additionally, we applied scipy.stats.chi2_contingency to compute Pearson’s chi-squared statistic and corresponding p-value.

Note: We present further details of the downstream analyses of co-existing mutations in the supplemental information S1 Text.

Supporting information

S1 Text. Contains supplemental text, methods and figures.

https://doi.org/10.1371/journal.pcbi.1013351.s001

(DOCX)

S1 Table. Statistics of co-existing mutations on different proteins.

https://doi.org/10.1371/journal.pcbi.1013351.s002

(XLSX)

S2 Table. Metastatic breast cancer co-existing mutation markers.

https://doi.org/10.1371/journal.pcbi.1013351.s003

(XLSX)

S3 Table. Metastatic breast cancer triplet mutations in FP Growth tree.

https://doi.org/10.1371/journal.pcbi.1013351.s004

(XLSX)

References

  1. 1. Nordling CO. A new theory on cancer-inducing mechanism. Br J Cancer. 1953;7(1):68–72. pmid:13051507
  2. 2. Ashley DJ. The two “hit” and multiple “hit” theories of carcinogenesis. Br J Cancer. 1969;23(2):313–28. pmid:5788039
  3. 3. Knudson AG Jr. Mutation and cancer: statistical study of retinoblastoma. Proc Natl Acad Sci U S A. 1971;68(4):820–3. pmid:5279523
  4. 4. Boland CR, Ricciardiello L. How many mutations does it take to make a tumor? Proc Natl Acad Sci U S A. 1999;96(26):14675–7. pmid:10611270
  5. 5. Tomlinson I, Sasieni P, Bodmer W. How many mutations in a cancer? Am J Pathol. 2002;160(3):755–8. pmid:11891172
  6. 6. Kur J, Hasan N, Szybalski W. Physical and biological consequences of interactions between integration host factor (IHF) and coliphage lambda late p’R promoter and its mutants. Gene. 1989;81(1):1–15. pmid:2553535
  7. 7. Liu Y, Zhang W, Jang H, Nussinov R. SHP2 clinical phenotype, cancer, or RASopathies, can be predicted by mutant conformational propensities. Cell Mol Life Sci. 2023;81(1):5. pmid:38085330
  8. 8. Nussinov R, Tsai C-J, Jang H. Anticancer drug resistance: An update and perspective. Drug Resist Updat. 2021;59:100796. pmid:34953682
  9. 9. Nussinov R, Yavuz BR, Jang H. Molecular principles underlying aggressive cancers. Signal Transduct Target Ther. 2025;10(1):42. pmid:39956859
  10. 10. Polak P, Karlić R, Koren A, Thurman R, Sandstrom R, Lawrence M, et al. Cell-of-origin chromatin organization shapes the mutational landscape of cancer. Nature. 2015;518(7539):360–4. pmid:25693567
  11. 11. Salvadores M, Mas-Ponte D, Supek F. Passenger mutations accurately classify human tumors. PLoS Comput Biol. 2019;15(4):e1006953. pmid:30986244
  12. 12. Campbell PJ. Cliques and Schisms of Cancer Genes. Cancer Cell. 2017;32(2):129–30. pmid:28810139
  13. 13. Nussinov R, Tsai C-J, Jang H. Are Parallel Proliferation Pathways Redundant? Trends Biochem Sci. 2020;45(7):554–63. pmid:32345469
  14. 14. Nussinov R, Zhang M, Maloney R, Jang H. Drugging multiple same-allele driver mutations in cancer. Expert Opin Drug Discov. 2021;16(8):823–8. pmid:33769165
  15. 15. van de Haar J, Canisius S, Yu MK, Voest EE, Wessels LFA, Ideker T. Identifying Epistasis in Cancer Genomes: A Delicate Affair. Cell. 2019;177(6):1375–83. pmid:31150618
  16. 16. Nussinov R, Tsai C-J, Jang H. A New View of Activating Mutations in Cancer. Cancer Res. 2022;82(22):4114–23. pmid:36069825
  17. 17. Nussinov R, Tsai C-J, Jang H. Allostery, and how to define and measure signal transduction. Biophys Chem. 2022;283:106766. pmid:35121384
  18. 18. Cisowski J, Sayin VI, Liu M, Karlsson C, Bergo MO. Oncogene-induced senescence underlies the mutual exclusive nature of oncogenic KRAS and BRAF. Oncogene. 2016;35(10):1328–33. pmid:26028035
  19. 19. Cavalli G, Biavasco R, Borgiani B, Dagna L. Oncogene-induced senescence as a new mechanism of disease: the paradigm of erdheim-chester disease. Front Immunol. 2014;5:281. pmid:24982657
  20. 20. Zhang M, Jang H, Nussinov R. PI3K Driver Mutations: A Biophysical Membrane-Centric Perspective. Cancer Res. 2021;81(2):237–47. pmid:33046444
  21. 21. Tomasetti C, Marchionni L, Nowak MA, Parmigiani G, Vogelstein B. Only three driver gene mutations are required for the development of lung and colorectal cancers. Proc Natl Acad Sci U S A. 2015;112(1):118–23. pmid:25535351
  22. 22. Nussinov R, Jang H, Tsai C-J, Cheng F. Precision medicine review: rare driver mutations and their biophysical classification. Biophys Rev. 2019;11(1):5–19. pmid:30610579
  23. 23. Schmitt CA, Wang B, Demaria M. Senescence and cancer - role and therapeutic opportunities. Nat Rev Clin Oncol. 2022;19(10):619–36. pmid:36045302
  24. 24. Wang B, Kohli J, Demaria M. Senescent Cells in Cancer Therapy: Friends or Foes? Trends Cancer. 2020;6(10):838–57. pmid:32482536
  25. 25. Liu X-L, Ding J, Meng L-H. Oncogene-induced senescence: a double edged sword in cancer. Acta Pharmacol Sin. 2018;39(10):1553–8. pmid:29620049
  26. 26. Zhu H, Blake S, Kusuma FK, Pearson RB, Kang J, Chan KT. Oncogene-induced senescence: From biology to therapy. Mech Ageing Dev. 2020;187:111229. pmid:32171687
  27. 27. Jung SH, Hwang HJ, Kang D, Park HA, Lee HC, Jeong D, et al. mTOR kinase leads to PTEN-loss-induced cellular senescence by phosphorylating p53. Oncogene. 2019;38(10):1639–50. pmid:30337688
  28. 28. Chan KT, Blake S, Zhu H, Kang J, Trigos AS, Madhamshettiwar PB, et al. A functional genetic screen defines the AKT-induced senescence signaling network. Cell Death Differ. 2020;27(2):725–41. pmid:31285545
  29. 29. Tu Z, Aird KM, Zhang R. RAS, cellular senescence and transformation: the BRCA1 DNA repair pathway at the crossroads. Small GTPases. 2012;3(3):163–7. pmid:22751483
  30. 30. Deng Y, Luo S, Deng C, Luo T, Yin W, Zhang H, et al. Identifying mutual exclusivity across cancer genomes: computational approaches to discover genetic interaction and reveal tumor vulnerability. Brief Bioinform. 2019;20(1):254–66. pmid:28968730
  31. 31. Blair LM, Juan JM, Sebastian L, Tran VB, Nie W, Wall GD, et al. Oncogenic context shapes the fitness landscape of tumor suppression. Nat Commun. 2023;14(1):6422. pmid:37828026
  32. 32. Park S, Supek F, Lehner B. Higher order genetic interactions switch cancer genes from two-hit to one-hit drivers. Nat Commun. 2021;12(1):7051. pmid:34862370
  33. 33. Cisowski J, Bergo MO. What makes oncogenes mutually exclusive? Small GTPases. 2017;8(3):187–92. pmid:27416373
  34. 34. Köhler M, Röring M, Schorch B, Heilmann K, Stickel N, Fiala GJ, et al. Activation loop phosphorylation regulates B-Raf in vivo and transformation by B-Raf mutants. EMBO J. 2016;35(2):143–61. pmid:26657898
  35. 35. Ahronian LG, Sennott EM, Van Allen EM, Wagle N, Kwak EL, Faris JE, et al. Clinical Acquired Resistance to RAF Inhibitor Combinations in BRAF-Mutant Colorectal Cancer through MAPK Pathway Alterations. Cancer Discov. 2015;5(4):358–67. pmid:25673644
  36. 36. Rajagopalan H, Bardelli A, Lengauer C, Kinzler KW, Vogelstein B, Velculescu VE. Tumorigenesis: RAF/RAS oncogenes and mismatch-repair status. Nature. 2002;418(6901):934. pmid:12198537
  37. 37. Nussinov R, Yavuz BR, Jang H. Anticancer drugs: How to select small molecule combinations? Trends Pharmacol Sci. 2024;45(6):503–19. pmid:38782689
  38. 38. Nussinov R, Weichhart T, Dlamini Z, Gibbons DL, Van Seuningen I, Konen J, et al. Directions to overcome therapy resistance in cancer. Trends Pharmacol Sci. 2024;45(6):467–71. pmid:38760267
  39. 39. Huang A, Garraway LA, Ashworth A, Weber B. Synthetic lethality as an engine for cancer drug target discovery. Nat Rev Drug Discov. 2020;19(1):23–38. pmid:31712683
  40. 40. Shaheen M, Allen C, Nickoloff JA, Hromas R. Synthetic lethality: exploiting the addiction of cancer to DNA repair. Blood. 2011;117(23):6074–82. pmid:21441464
  41. 41. Liu Q-W, Yang Z-W, Tang Q-H, Wang W-E, Chu D-S, Ji J-F, et al. The power and the promise of synthetic lethality for clinical application in cancer treatment. Biomed Pharmacother. 2024;172:116288. pmid:38377739
  42. 42. Setton J, Zinda M, Riaz N, Durocher D, Zimmermann M, Koehler M, et al. Synthetic Lethality in Cancer Therapeutics: The Next Generation. Cancer Discov. 2021;11(7):1626–35. pmid:33795234
  43. 43. Tomlinson IP, Novelli MR, Bodmer WF. The mutation rate and cancer. Proc Natl Acad Sci U S A. 1996;93(25):14800–3. pmid:8962135
  44. 44. Yavuz BR, Tsai C-J, Nussinov R, Tuncbag N. Pan-cancer clinical impact of latent drivers from double mutations. Commun Biol. 2023;6(1):202. pmid:36808143
  45. 45. Tamborero D, Rubio-Perez C, Deu-Pons J, Schroeder MP, Vivancos A, Rovira A, et al. Cancer Genome Interpreter annotates the biological and clinical relevance of tumor alterations. Genome Med. 2018;10(1):25. pmid:29592813
  46. 46. Wang J, Wu M, Huang X, Wang L, Zhang S, Liu H, et al. SynLethDB 2.0: a web-based knowledge graph database on synthetic lethality for novel anticancer drug discovery. Database (Oxford). 2022;2022:baac030. pmid:35562840
  47. 47. Breitkreutz B-J, Stark C, Reguly T, Boucher L, Breitkreutz A, Livstone M, et al. The BioGRID Interaction Database: 2008 update. Nucleic Acids Res. 2008;36(Database issue):D637-40. pmid:18000002
  48. 48. Hanahan D, Weinberg RA. Hallmarks of cancer: the next generation. Cell. 2011;144(5):646–74. pmid:21376230
  49. 49. Rodier F, Campisi J. Four faces of cellular senescence. J Cell Biol. 2011;192(4):547–56. pmid:21321098
  50. 50. Alon U. Network motifs: theory and experimental approaches. Nat Rev Genet. 2007;8(6):450–61. pmid:17510665
  51. 51. Gorelick AN, Sánchez-Rivera FJ, Cai Y, Bielski CM, Biederstedt E, Jonsson P, et al. Phase and context shape the function of composite oncogenic mutations. Nature. 2020;582(7810):100–3. pmid:32461694
  52. 52. Iannuccelli M, Micarelli E, Surdo PL, Palma A, Perfetto L, Rozzo I, et al. CancerGeneNet: linking driver genes to cancer hallmarks. Nucleic Acids Res. 2020;48(D1):D416–21. pmid:31598703
  53. 53. Hedrick E, Cheng Y, Jin U-H, Kim K, Safe S. Specificity protein (Sp) transcription factors Sp1, Sp3 and Sp4 are non-oncogene addiction genes in cancer cells. Oncotarget. 2016;7(16):22245–56. pmid:26967243
  54. 54. Safe S, Shrestha R, Mohankumar K, Howard M, Hedrick E, Abdelrahim M. Transcription factors specificity protein and nuclear receptor 4A1 in pancreatic cancer. World J Gastroenterol. 2021;27(38):6387–98. pmid:34720529
  55. 55. Kim MP, Li X, Deng J, Zhang Y, Dai B, Allton KL, et al. Oncogenic KRAS Recruits an Expansive Transcriptional Network through Mutant p53 to Drive Pancreatic Cancer Metastasis. Cancer Discov. 2021;11(8):2094–111. pmid:33839689
  56. 56. Zhou Y, Jin J, Ji Y, Zhang J, Fu N, Chen M, et al. TP53 missense mutation reveals gain-of-function properties in small-sized KRAS transformed pancreatic ductal adenocarcinoma. J Transl Med. 2023;21(1):872. pmid:38037073
  57. 57. Shi Y, Hata A, Lo RS, Massagué J, Pavletich NP. A structural basis for mutational inactivation of the tumour suppressor Smad4. Nature. 1997;388(6637):87–93. pmid:9214508
  58. 58. Schmitt CA, Wang B, Demaria M. Senescence and cancer - role and therapeutic opportunities. Nat Rev Clin Oncol. 2022;19(10):619–36. pmid:36045302
  59. 59. Kumari R, Jat P. Mechanisms of Cellular Senescence: Cell Cycle Arrest and Senescence Associated Secretory Phenotype. Front Cell Dev Biol. 2021;9:645593. pmid:33855023
  60. 60. Türei D, Korcsmáros T, Saez-Rodriguez J. OmniPath: guidelines and gateway for literature-curated signaling pathway resources. Nat Methods. 2016;13(12):966–7. pmid:27898060
  61. 61. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102(43):15545–50. pmid:16199517
  62. 62. Mootha VK, Lindgren CM, Eriksson K-F, Subramanian A, Sihag S, Lehar J, et al. PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet. 2003;34(3):267–73. pmid:12808457
  63. 63. Wang J, Vasaikar S, Shi Z, Greer M, Zhang B. WebGestalt 2017: a more comprehensive, powerful, flexible and interactive gene set enrichment analysis toolkit. Nucleic Acids Res. 2017;45(W1):W130–7. pmid:28472511
  64. 64. Lappano R, Maggiolini M. GPCRs and cancer. Acta Pharmacol Sin. 2012;33(3):351–62. pmid:22266725
  65. 65. Huerta-Reyes M, Maya-Núñez G, Pérez-Solis MA, López-Muñoz E, Guillén N, Olivo-Marin J-C, et al. Treatment of Breast Cancer With Gonadotropin-Releasing Hormone Analogs. Front Oncol. 2019;9:943. pmid:31632902
  66. 66. Garrido MP, Hernandez A, Vega M, Araya E, Romero C. Conventional and new proposals of GnRH therapy for ovarian, breast, and prostatic cancers. Front Endocrinol (Lausanne). 2023;14:1143261. pmid:37056674
  67. 67. Mishra P, Banerjee D, Ben-Baruch A. Chemokines at the crossroads of tumor-fibroblast interactions that promote malignancy. J Leukoc Biol. 2011;89(1):31–9. pmid:20628066
  68. 68. Mollica Poeta V, Massara M, Capucetti A, Bonecchi R. Chemokines and Chemokine Receptors: New Targets for Cancer Immunotherapy. Front Immunol. 2019;10:379. pmid:30894861
  69. 69. Liu H, Yang Z, Lu W, Chen Z, Chen L, Han S, et al. Chemokines and chemokine receptors: A new strategy for breast cancer therapy. Cancer Med. 2020;9(11):3786–99. pmid:32253815
  70. 70. Jubb HC, Pandurangan AP, Turner MA, Ochoa-Montaño B, Blundell TL, Ascher DB. Mutations at protein-protein interfaces: Small changes over big surfaces have large impacts on human health. Prog Biophys Mol Biol. 2017;128:3–13. pmid:27913149
  71. 71. Cheng F, Zhao J, Wang Y, Lu W, Liu Z, Zhou Y, et al. Comprehensive characterization of protein-protein interactions perturbed by disease mutations. Nat Genet. 2021;53(3):342–53. pmid:33558758
  72. 72. Cheng F, Liang H, Butte AJ, Eng C, Nussinov R. Personal Mutanomes Meet Modern Oncology Drug Discovery and Precision Health. Pharmacol Rev. 2019;71(1):1–19. pmid:30545954
  73. 73. Nussinov R, Jang H, Tsai C-J, Cheng F. Review: Precision medicine and driver mutations: Computational methods, functional assays and conformational principles for interpreting cancer drivers. PLoS Comput Biol. 2019;15(3):e1006658. pmid:30921324
  74. 74. Ivanov AA. Explore Protein-Protein Interactions for Cancer Target Discovery Using the OncoPPi Portal. Methods Mol Biol. 2020;2074:145–64. pmid:31583637
  75. 75. Ivanov AA, Revennaugh B, Rusnak L, Gonzalez-Pecchi V, Mo X, Johns MA, et al. The OncoPPi Portal: an integrative resource to explore and prioritize protein-protein interactions for cancer target discovery. Bioinformatics. 2018;34(7):1183–91. pmid:29186335
  76. 76. Chen C, Shi C, Huang X, Zheng J, Zhu Z, Li Q, et al. Molecular Profiles and Metastasis Markers in Chinese Patients with Gastric Carcinoma. Sci Rep. 2019;9(1):13995. pmid:31570735
  77. 77. Zhang Y, Chen Y, Yang C, Seger N, Hesla AC, Tsagkozis P, et al. TERT promoter mutation is an objective clinical marker for disease progression in chondrosarcoma. Mod Pathol. 2021;34(11):2020–7. pmid:34108637
  78. 78. Liu R, Rizzo S, Waliany S, Garmhausen MR, Pal N, Huang Z, et al. Systematic pan-cancer analysis of mutation-treatment interactions using large real-world clinicogenomics data. Nat Med. 2022;28(8):1656–61. pmid:35773542
  79. 79. Smith MR, Wang Y, D’Agostino R Jr, Liu Y, Ruiz J, Lycan T, et al. Prognostic Mutational Signatures of NSCLC Patients treated with chemotherapy, immunotherapy and chemoimmunotherapy. NPJ Precis Oncol. 2023;7(1):34. pmid:36973365
  80. 80. Zhang F, Wang J, Xu Y, Cai S, Li T, Wang G, et al. Co-occurring genomic alterations and immunotherapy efficacy in NSCLC. NPJ Precis Oncol. 2022;6(1):4. pmid:35042953
  81. 81. Nunes L, Stenersen JM, Kryeziu K, Sjöblom T, Glimelius B, Lothe RA, et al. Co-occurring mutations identify prognostic subgroups of microsatellite stable colorectal cancer. Mol Cancer. 2024;23(1):264. pmid:39587554
  82. 82. Zhao Z-M, Zhao B, Bai Y, Iamarino A, Gaffney SG, Schlessinger J, et al. Early and multiple origins of metastatic lineages within primary tumors. Proc Natl Acad Sci U S A. 2016;113(8):2140–5. pmid:26858460
  83. 83. Liu G, Zhan X, Dong C, Liu L. Genomics alterations of metastatic and primary tissues across 15 cancer types. Sci Rep. 2017;7(1):13262. pmid:29038591
  84. 84. Martínez-Jiménez F, Movasati A, Brunner SR, Nguyen L, Priestley P, Cuppen E, et al. Pan-cancer whole-genome comparison of primary and metastatic solid tumours. Nature. 2023;618(7964):333–41. pmid:37165194
  85. 85. Christensen DS, Ahrenfeldt J, Sokač M, Kisistók J, Thomsen MK, Maretty L, et al. Treatment Represents a Key Driver of Metastatic Cancer Evolution. Cancer Res. 2022;82(16):2918–27. pmid:35731928
  86. 86. Birkbak NJ, McGranahan N. Cancer Genome Evolutionary Trajectories in Metastasis. Cancer Cell. 2020;37(1):8–19. pmid:31935374
  87. 87. Patel SA, Rodrigues P, Wesolowski L, Vanharanta S. Genomic control of metastasis. Br J Cancer. 2021;124(1):3–12. pmid:33144692
  88. 88. Theodorou V, Stark R, Menon S, Carroll JS. GATA3 acts upstream of FOXA1 in mediating ESR1 binding by shaping enhancer accessibility. Genome Res. 2013;23(1):12–22. pmid:23172872
  89. 89. Priestley P, Baber J, Lolkema MP, Steeghs N, de Bruijn E, Shale C, et al. Pan-cancer whole-genome analyses of metastatic solid tumours. Nature. 2019;575(7781):210–6. pmid:31645765
  90. 90. Modrek AS, Golub D, Khan T, Bready D, Prado J, Bowman C, et al. Low-Grade Astrocytoma Mutations in IDH1, P53, and ATRX Cooperate to Block Differentiation of Human Neural Stem Cells via Repression of SOX2. Cell Rep. 2017;21(5):1267–80. pmid:29091765
  91. 91. Ohba S, Kuwahara K, Yamada S, Abe M, Hirose Y. Correlation between IDH, ATRX, and TERT promoter mutations in glioma. Brain Tumor Pathol. 2020;37(2):33–40. pmid:32227259
  92. 92. Herzog SK, Fuqua SAW. ESR1 mutations and therapeutic resistance in metastatic breast cancer: progress and remaining challenges. Br J Cancer. 2022;126(2):174–86. pmid:34621045
  93. 93. Gerratana L, Davis AA, Velimirovic M, Clifton K, Hensing WL, Shah AN, et al. Interplay between ESR1/PIK3CA codon variants, oncogenic pathway alterations and clinical phenotype in patients with metastatic breast cancer (MBC): comprehensive circulating tumor DNA (ctDNA) analysis. Breast Cancer Res. 2023;25(1):112. pmid:37784176
  94. 94. Boca SM, Kinzler KW, Velculescu VE, Vogelstein B, Parmigiani G. Patient-oriented gene set analysis for cancer mutation data. Genome Biol. 2010;11(11):R112. pmid:21092299
  95. 95. Reiter JG, Baretti M, Gerold JM, Makohon-Moore AP, Daud A, Iacobuzio-Donahue CA, et al. An analysis of genetic heterogeneity in untreated cancers. Nat Rev Cancer. 2019;19(11):639–50. pmid:31455892
  96. 96. Mendiratta G, Ke E, Aziz M, Liarakos D, Tong M, Stites EC. Cancer gene mutation frequencies for the U.S. population. Nat Commun. 2021;12(1):5961. pmid:34645806
  97. 97. Mendiratta G, Jones MK, Stites EC. How often is each gene mutated within the cancer patient population? Mol Cell Oncol. 2022;9(1):2065176. pmid:35529901
  98. 98. El Tekle G, Bernasocchi T, Unni AM, Bertoni F, Rossi D, Rubin MA, et al. Co-occurrence and mutual exclusivity: what cross-cancer mutation patterns can tell us. Trends Cancer. 2021;7(9):823–36. pmid:34031014
  99. 99. Rogers ZN, McFarland CD, Winters IP, Naranjo S, Chuang C-H, Petrov D, et al. A quantitative and multiplexed approach to uncover the fitness landscape of tumor suppression in vivo. Nat Methods. 2017;14(7):737–42. pmid:28530655
  100. 100. Harris TJR, McCormick F. The molecular pathology of cancer. Nat Rev Clin Oncol. 2010;7(5):251–65. pmid:20351699
  101. 101. Sanchez-Vega F, Mina M, Armenia J, Chatila WK, Luna A, La KC, et al. Oncogenic Signaling Pathways in The Cancer Genome Atlas. Cell. 2018;173(2):321-337.e10. pmid:29625050
  102. 102. Sinkala M. Mutational landscape of cancer-driver genes across human cancers. Sci Rep. 2023;13(1):12742. pmid:37550388
  103. 103. Liu S, Liu J, Xie Y, Zhai T, Hinderer EW, Stromberg AJ, et al. MEScan: a powerful statistical framework for genome-scale mutual exclusivity analysis of cancer mutations. Bioinformatics. 2021;37(9):1189–97. pmid:33165532
  104. 104. Hua X, Hyland PL, Huang J, Song L, Zhu B, Caporaso NE, et al. MEGSA: A Powerful and Flexible Framework for Analyzing Mutual Exclusivity of Tumor Mutations. Am J Hum Genet. 2016;98(3):442–55. pmid:26899600
  105. 105. Leiserson MDM, Wu H-T, Vandin F, Raphael BJ. CoMEt: a statistical approach to identify combinations of mutually exclusive alterations in cancer. Genome Biol. 2015;16(1):160. pmid:26253137
  106. 106. Ciriello G, Cerami E, Aksoy BA, Sander C, Schultz N. Using MEMo to discover mutual exclusivity modules in cancer. Curr Protoc Bioinformatics. 2013;Chapter 8:8.17.1-8.17.12. pmid:23504936
  107. 107. Zhang J, Wu L-Y, Zhang X-S, Zhang S. Discovery of co-occurring driver pathways in cancer. BMC Bioinformatics. 2014;15(1):271. pmid:25106096
  108. 108. Mina M, Raynaud F, Tavernari D, Battistello E, Sungalee S, Saghafinia S, et al. Conditional Selection of Genomic Alterations Dictates Cancer Evolution and Oncogenic Dependencies. Cancer Cell. 2017;32(2):155-168.e6. pmid:28756993
  109. 109. Iwafuchi-Doi M, Donahue G, Kakumanu A, Watts JA, Mahony S, Pugh BF, et al. The Pioneer Transcription Factor FoxA Maintains an Accessible Nucleosome Configuration at Enhancers for Tissue-Specific Gene Activation. Mol Cell. 2016;62(1):79–91. pmid:27058788
  110. 110. Han J, Pei J, Yin Y. Mining frequent patterns without candidate generation. ACM SIGMOD Rec. 2000;29(2):1–12.
  111. 111. Vasan N, Razavi P, Johnson JL, Shao H, Shah H, Antoine A, et al. Double PIK3CA mutations in cis increase oncogenicity and sensitivity to PI3Kα inhibitors. Science. 2019;366(6466):714–23. pmid:31699932
  112. 112. Saito Y, Koya J, Araki M, Kogure Y, Shingaki S, Tabata M, et al. Landscape and function of multiple mutations within individual oncogenes. Nature. 2020;582(7810):95–9. pmid:32494066
  113. 113. Saito M, Momma T, Kono K. Targeted therapy according to next generation sequencing-based panel sequencing. Fukushima J Med Sci. 2018;64(1):9–14. pmid:29628467
  114. 114. Jang H, Yavuz BR, Zhang M, Liu Y, Nussinov R. The mechanism of oncogenic PI3K lipid kinase variants at the membrane and their cryptic pockets. bioRxiv. 2025;:2025.06.26.661751. pmid:40666967
  115. 115. Wang K, Ye M, Mo Z, Huang X, Li Y, Wei S. Comprehensive analysis of genomic alterations and novel prognostic biomarkers, and establishment of prediction models of metastasis in metastatic non-small cell lung cancer. J Cancer. 2025;16(1):339–50. pmid:39744567
  116. 116. Cendra AS, Pekarek L, Pedrejón BI, Bernier L, Cervantes EDR, Cendra CS, et al. New Horizons of Biomarkers in Metastatic Thyroid Cancer. J Cancer. 2025;16(1):241–64. pmid:39744583
  117. 117. Duan H, Ren J, Wei S, Yang Z, Li C, Wang Z, et al. Integrated analyses of multi-omic data derived from paired primary lung cancer and brain metastasis reveal the metabolic vulnerability as a novel therapeutic target. Genome Med. 2024;16(1):138. pmid:39593114
  118. 118. Boiarsky D, Lydon CA, Chambers ES, Sholl LM, Nishino M, Skoulidis F, et al. Molecular markers of metastatic disease in KRAS-mutant lung adenocarcinoma. Ann Oncol. 2023;34(7):589–604. pmid:37121400
  119. 119. Yang Y, Zhang S, Guo L. Characterization of Cell Cycle-Related Competing Endogenous RNAs Using Robust Rank Aggregation as Prognostic Biomarker in Lung Adenocarcinoma. Front Oncol. 2022;12:807367. pmid:35186743
  120. 120. Wu G, Haw R. Functional Interaction Network Construction and Analysis for Disease Discovery. Methods Mol Biol. 2017;1558:235–53. pmid:28150241
  121. 121. Agrawal A, Balcı H, Hanspers K, Coort SL, Martens M, Slenter DN, et al. WikiPathways 2024: next generation pathway database. Nucleic Acids Res. 2024;52(D1):D679–89. pmid:37941138
  122. 122. Zhou Y, Zhao J, Fang J, Martin W, Li L, Nussinov R, et al. My personal mutanome: a computational genomic medicine platform for searching network perturbing alleles linking genotype to phenotype. Genome Biol. 2021;22(1):53. pmid:33514395
  123. 123. Hopkins BD, Goncalves MD, Cantley LC. Insulin-PI3K signalling: an evolutionarily insulated metabolic driver of cancer. Nat Rev Endocrinol. 2020;16(5):276–83. pmid:32127696
  124. 124. Dominich S. The modern algebra of information retrieval. Springer Science & Business Media. 2008. p. 330.
  125. 125. Exploring network structure, dynamics, and function using Networkx2008. 2008.
  126. 126. Brin S, Page L. The anatomy of a large-scale hypertextual web search engine (Reprint from Computer Networks and ISDN Systems, vol 30, pg 107-117, 1998). Comput Netw. 2012;56(18):3825–33. WOS:000 pmid:312922400003
  127. 127. Fang Z, Liu X, Peltz G. GSEApy: a comprehensive package for performing gene set enrichment analysis in Python. Bioinformatics. 2023;39(1):btac757. pmid:36426870