• Loading metrics

EGFR Signal-Network Reconstruction Demonstrates Metabolic Crosstalk in EMT

  • Kumari Sonal Choudhary,

    Affiliations Center for Systems Biology, University of Iceland, Reykjavik, Iceland, Biomedical Center, University of Iceland, Reykjavik, Iceland

  • Neha Rohatgi,

    Affiliations Center for Systems Biology, University of Iceland, Reykjavik, Iceland, Biomedical Center, University of Iceland, Reykjavik, Iceland

  • Skarphedinn Halldorsson,

    Affiliations Center for Systems Biology, University of Iceland, Reykjavik, Iceland, Biomedical Center, University of Iceland, Reykjavik, Iceland


  • Eirikur Briem,

    Affiliations Biomedical Center, University of Iceland, Reykjavik, Iceland, Stem Cell Research Unit, Department of Anatomy, School of Health Sciences, University of Iceland, Reykjavík, Iceland, Department of Laboratory Hematology, Landspitali-University Hospital, Reykjavik, Iceland

  • Thorarinn Gudjonsson,

    Affiliations Biomedical Center, University of Iceland, Reykjavik, Iceland, Stem Cell Research Unit, Department of Anatomy, School of Health Sciences, University of Iceland, Reykjavík, Iceland, Department of Laboratory Hematology, Landspitali-University Hospital, Reykjavik, Iceland

  • Steinn Gudmundsson,

    Affiliation Center for Systems Biology, University of Iceland, Reykjavik, Iceland

  • Ottar Rolfsson

    Affiliations Center for Systems Biology, University of Iceland, Reykjavik, Iceland, Biomedical Center, University of Iceland, Reykjavik, Iceland

EGFR Signal-Network Reconstruction Demonstrates Metabolic Crosstalk in EMT

  • Kumari Sonal Choudhary, 
  • Neha Rohatgi, 
  • Skarphedinn Halldorsson, 
  • Eirikur Briem, 
  • Thorarinn Gudjonsson, 
  • Steinn Gudmundsson, 
  • Ottar Rolfsson


Epithelial to mesenchymal transition (EMT) is an important event during development and cancer metastasis. There is limited understanding of the metabolic alterations that give rise to and take place during EMT. Dysregulation of signalling pathways that impact metabolism, including epidermal growth factor receptor (EGFR), are however a hallmark of EMT and metastasis. In this study, we report the investigation into EGFR signalling and metabolic crosstalk of EMT through constraint-based modelling and analysis of the breast epithelial EMT cell model D492 and its mesenchymal counterpart D492M. We built an EGFR signalling network for EMT based on stoichiometric coefficients and constrained the network with gene expression data to build epithelial (EGFR_E) and mesenchymal (EGFR_M) networks. Metabolic alterations arising from differential expression of EGFR genes was derived from a literature review of AKT regulated metabolic genes. Signaling flux differences between EGFR_E and EGFR_M models subsequently allowed metabolism in D492 and D492M cells to be assessed. Higher flux within AKT pathway in the D492 cells compared to D492M suggested higher glycolytic activity in D492 that we confirmed experimentally through measurements of glucose uptake and lactate secretion rates. The signaling genes from the AKT, RAS/MAPK and CaM pathways were predicted to revert D492M to D492 phenotype. Follow-up analysis of EGFR signaling metabolic crosstalk in three additional breast epithelial cell lines highlighted variability in in vitro cell models of EMT. This study shows that the metabolic phenotype may be predicted by in silico analyses of gene expression data of EGFR signaling genes, but this phenomenon is cell-specific and does not follow a simple trend.

Author Summary

The epidermal growth factor receptor (EGFR) signaling cascade is one of the key signaling pathways that are involved in the induction of Epithelial Mesenchymal Transition (EMT) and tumor metastasis. These signaling cascades often affect metabolic fate in tumor cells and control their progression. Here we demonstrate a method to build a mathematical model of the EGFR signaling cascade and use it to study signaling in EMT and how signaling affects metabolism. The model was used to obtain a list of potential signaling and metabolic targets of EMT. These targets may aid in the understanding of the molecular mechanisms that underlie EMT and metastasis. Our results further highlight the heterogeneity of cell models used to study EMT and support the idea of cell specific anti-cancer interventions.


Epithelial to mesenchymal transition (EMT) is a developmental process where polarized epithelial cells transition to an invasive mesenchymal-like phenotype through molecular reprogramming that leads to degradation of the extra-cellular matrix (ECM) and the loss of cell polarity. Following recruitment to specific sites at distant locations within the developing embryo, the mesenchymal cells may revert back to the epithelial phenotype by a process known as mesenchymal to epithelial transition (MET), thereby seeding new epithelial tissues [1]. Although EMT is fundamental for several developmental processes and wound healing, dysregulation of EMT may cause cancer cells to initiate metastasis and form secondary tumors at distant sites [13].

EMT is induced by a number of distinct molecular processes [1]. These include the binding of several growth factors, including the platelet derived growth factor (PDGF), insulin-like growth factor (IGF), neuregulin and epidermal growth factor (EGF) to their cognate cell-surface receptors, leading to receptor activation [4]. This activates downstream signaling pathways that regulate the control of specific transcription factors, cell-surface proteins and microRNAs [5]. EMT is also involved in reorganization and expression of cytoskeletal proteins and production of ECM-degrading enzymes [1]. This series of events leads to increased expression of mesenchymal markers like N-cadherin and vimentin and decreased expression of epithelial markers such as E-cadherin [6]. Binding of EGF to its cognate epidermal growth factor receptor (EGFR) family has been shown to stimulate EMT in breast cancer cells [7,8], leading to altered expression of E-cadherin and vimentin [8,9]. Activated EGFR signaling suppresses E-cadherin expression either by promoting its endocytosis [10] or by enhancing the expression of transcription factors (TFs) like Snail and Twist [11,12]. As a result, the cells may transition from epithelial to mesenchymal phenotype with spindle like morphology [8]. EGFR regulates mammary gland development and in certain aggressive breast cancer cells has been shown to regulate invasion and migration [8]. The most common signaling cascades activated downstream of EGFR are PI3K/Akt, Ras/Raf/Mek and DAG/IP3 and CaM signaling, that affect cell cycle progression, inhibition of apoptosis, angiogenesis, tumor cell motility, and metastases (Fig 1) [13,14].

Fig 1. An overview of downstream signaling pathways induced by EGFR signaling.

Three main pathways: AKT, RAS/MAPK and CaM and DAG/IP3 are induced downstream of active EGFR signaling.

EMT is likely to impact metabolism, but the effects are not as widely studied as cancer metabolism [15,16] Cancer cells exhibit a shift of ATP generation from oxidative phosphorylation to aerobic glycolysis known as the Warburg effect [17]. This leads to a higher rate of glycolysis in cancer cells. Cancer cells also tend to show enhanced glutamine metabolism which has been shown to contribute cancer cell migration [18]. Signaling pathways have often been associated with metabolic consequences, but can themselves be influenced by metabolism. Interestingly, up-regulated glycolysis has been linked with higher AKT signaling in cancer cells [19,20]. However, the mechanistic manner in which metabolism is affected during EMT is unknown.

Computational approaches such as Constraint-based modeling and analysis (COBRA) techniques are very useful in analysis of the complex biological networks like signaling networks [2123]. Prior efforts of modeling of signaling cascades include modeling of the EGFR pathway [21,22,2426], TLR signaling [2729], JAK-STAT signaling [23], MAPK pathway [30] and interleukin 1 signaling [31]. COBRA techniques mainly focus on the use of physio-chemical and biological constraints and are sparsely dependent on kinetic data that has limited availability. Protocols for the generation of biochemical networks and computational algorithms/methods for querying these networks are now well established [32,33]. They involve conversion of biological data (e.g. genomic, metabolic, and regulatory) to a mathematical reaction format. This allows better definition of regulatory changes associated with specific events such as EMT and exploration of metabolic alterations associated with the process. For example, how altered EGFR signaling is propagated to a metabolic phenotype can be investigated using COBRA methodology.

In this study, we built a computational signaling network of EGFR to query how the expression of signaling genes can affect metabolic alterations during EMT in human breast epithelial cells. An EGFR signalling network was reconstructed (EGFR_SN) and was constrained with the transcriptomics data of the breast epithelial cell line D492 and its mesenchymal counterpart D492M [34] to form EGFR_E and EGFR_M networks. D492 is an E6/E7 viral oncogene immortalized human breast epithelial basal cell line with stem cell like properties that differentiates into both myoepithelial and luminal cells and has the ability to undergo branching morphogenesis when grown in a 3D reconstituted basement membrane matrix [35]. The 3D co-culture of D492 cells with human endothelial cells led to establishment of mesenchymal cells with spindle-like morphology called D492M. The D492M cells have high expression of N-cadherin and vimentin and low expression of E-cadherin that is typical for cells that have undergone EMT [34].

The EMT specific signaling network of D492 and D492M enabled us to investigate differential gene expression of EGFR signalling genes. This was subsequently extended to other breast epithelial cell lines and their mesenchymal counterparts. Through extensive literature review, the EGFR_SN network was linked to metabolic genes that were likely to be affected. The differential flux values in EGFR_E and EGFR_M allowed the assessment of how the altered signaling affects metabolic gene expression and metabolism in D492 and D492M cells. Increased flux within the AKT and RAS/MAPK signaling pathways was predicted in D492 as compared to D492M. The in silico predicted increase in flux of the AKT pathway induced higher glycolytic activity in D492 cells. This suggested that there may be an EMT-related decrease in glycolysis in D492M as compared to D492 cells, which was confirmed in vitro by glucose uptake and lactate secretion measurements. Comparative analysis of EGFR signaling networks in three other breast epithelial cell lines showed that regulation of signaling pathways are cell specific and follow no simple trend.

Results and Discussion

EGFR signaling network reconstruction

In order to capture how altered EGFR signaling is propagated through metabolic pathways in the breast epithelium, we built a constraint based EGFR network, EGFR_SN. To generate EGFR_SN, the EGFR pathway map (Reactome ID: R-HSA-177929) was downloaded from the Reactome database [36]. Several modifications such as incorporation of gene-protein reaction (GPRs) rules/association, additions of modifiers, inhibitors and activators and removal of dead ends were made to the Reactome pathway to make it feasible for analysis with the COBRA methodology (methods section). Due to the incorporated GPRs, experimental data (e.g, gene expression, proteomic, fluxomic data) can now be mapped onto EGFR_SN, thereby relating genomic information to the reactions in the network.

The resulting reconstructed network EGFR_SN accounts for 182 reactions, 216 genes, 152 reacting species, 11 inhibitors and 2 activators. The 182 reactions were divided between 83 internal reactions and 99 exchange reactions. Exchange reactions were added in order to remove dead-ends in the network and acquire feasibility. The number of exchange reactions added here is large compared to what is typically present in metabolic networks, where exchange reactions are added mainly to allow for accumulation of metabolites and secretion of wastes. Internal reactions represent connections between internal signaling components, while the exchange reactions represent connections of the system boundary with the environment. Fig 2 shows a sub-network of EGFR_SN representing AKT signaling.

Fig 2. Sub-network of EGFR_SN network.

A part of the EGFR signaling network representing AKT signaling. Nodes indicate the reacting component and edges denote the reactions. Red nodes indicate inhibitors which were manually added to the model in the form of GPRs (information obtained from the Reactome database), white nodes indicate signaling components present in the original SBML file downloaded from the Reactome database and green nodes indicate exchange reactions which were added to the model to remove dead ends. Node ‘B’ and ‘P’ indicates binding, and phosphorylation, respectively. Edges: → transition, ⟞ inhibition, ⫯ catalysis.

D492 and D492M specific EGFR signaling networks

Microarray gene expression profiles of the human breast epithelial cell line D492 and the mesenchymal like D492M were used to constrain the EGFR_SN network to build an EMT specific signaling model. D492 can be used as a model for studying EMT for which biological data is available: mRNA, micro-RNA, cell phenotypic data, growth curves etc. [34,37]. D492 has previously been used in determining the role of microRNAs in EMT [37], studies related to branching morphogenesis [38] and more recently it was used to investigate the role of EGFR as a tumor suppressor [39].

To derive epithelial and mesenchymal specific signaling networks, gene expression data from D492 and D492M was mapped onto the EGFR_SN network. The pipeline is described in Fig 3. Differentially regulated (up-regulated and down-regulated) genes in the two cell lines led to two different signaling networks: EGFR_E and EGFR_M for D492 and D492M respectively, as described in the methods section.

Fig 3. The pipeline employed to generate context specific signaling networks EGFR_E and EGFR_M of an immortalized breast epithelial cell line (D492) and mesenchymal cell line (D492M).

SBML pathway map from Reactome was converted to mathematical format using the COBRA toolbox. This network was further modified to remove dead ends and include GPRs for modifiers, activators and inhibitors, resulting in the formation of the EGFR_SN network. The EGFR_SN network was constrained with the transcriptomic data of D492 and D492M to form EGFR_E and EGFR_M networks. Flux differences between EGFR_E and EGFR_M were used to further predict metabolic phenotype and target reactions critical for the reversal of mesenchymal to epithelial phenotype.

Network simulation of D492 and D492M

Flux analysis discriminates between D492 and D492M EGFR signaling phenotypes.

EGFR_E and EGFR_M networks were investigated through flux analysis. Flux distribution in EGFR_E and EGFR_M networks were estimated using random sampling [21,40] (Fig 4). Fig 4A shows the probabilty density for the flux in selected reactions, where red and blue lines denote EGFR_M and EGFR_E, respectively. Fig 4A, panel A1 shows that the most probable flux for this reaction in EGFR_E is higher than in EGFR_M, indicating this reaction is more active in EGFR_E. The panels A2-A6 belonging to AKT and RAS/MAPK pathways show that the most probable flux in EGFR_M is lower than the most probable flux in EGFR_E. The graphs in panels A7-A9, belong to the CaM pathway and are in contrast to panels A2-A4, indicating higher flux in EGFR_M.

Fig 4. Flux differences between epithelial network (EGFR_E) and mesenchymal network (EGFR_M).

A) Probability density estimates for the flux values in selected reactions as obtained by random sampling. The blue curve represents the flux distribution of EGFR_E and the red curve that of EGFR_M. Vertical axis denote probability and flux values are represented on the horizontal axis. AU: arbitrary units B) Relative mean flux for each reaction in EGFR_E and EGFR_M through the AKT, RAS and DAG/IP3 and CaM pathways. Higher flux within reactions in AKT and RAS/MAPK pathways are observed in the EGFR_E network compared to EGFR_M while CaM and DAG/IP3 have higher flux in EGFR_M. Negative values denote higher flux in EGFR_E and positive values denotes higher flux in EGFR_M. Numerical values of these fluxes are given in supplementary file (S1 File).

Fig 4B shows the relative mean flux values in each reaction within the AKT, RAS/MAPK and DAG/IP3 and CaM pathways. The EGFR_E network had higher flux through the AKT and RAS/MAPK pathways. In contrast, the transition from epithelial (EGFR_E) to mesenchymal (EGFR_M) phenotype promotes higher flux through the calcium CaM signaling pathway in the EGFR_M network. Moreover, flux was also increased in the di-acyl glycerol (DAG) and inositol trisphosphate (IP3) pathway in the EGFR_M network, which constitutes an important part of the CaM pathway.

Predicting reversal of mesenchymal to epithelial phenotype

We next studied how the flux in the EGFR_M network could be modified so that it became similar to the EGFR_E flux, in order to identify how the EMT process could be reversed. This was achieved by using an optimization algorithm similar to the MOMA algorithm which is frequently used to study perturbations in metabolic networks [41]. The algorithm searches for a flux distribution in EGFR_M, which most closely resembles the average flux values obtained previously for the EGFR_E model with random sampling, with minimum relaxation in the reaction bounds in EGFR_M while maintaining steady state conditions, see the methods section for more details.

The algorithm highlighted five reactions whose bounds needed to be relaxed such that the flux distribution of EGFR_M is close to that of EGFR_E. These included three internal reactions: 1) phosphorylation of the MAP2K dimer by RAF, 2) phosphorylation of MAPKs by MAP2Ks (RAF/MAPK pathway), 3) PIP2 conversion to PIP3 by PIK3 (AKT pathway), and two exchange reactions, belonging to the RAF/MAPK pathway. Through GPRs we identified that there are 22 genes associated with the reactions predicted above (S1 Table). However, among these 22 genes, MAPK1, NRAS, HRAS and EGFR genes were overexpressed in D492 as compared to D492M and the inhibitor PTEN was overexpressed in D492M.

Based on these predictions and by analysing the flux differences between EGFR_E and EGFR_M, we hypothesize that increased AKT and RAS/MAPK signaling in D492 epithelial cells promotes their transition to the D492M mesenchymal like phenotype. However, after attainment of a mesenchymal state, AKT and RAS/MAPK signaling is reduced and alternative pathways such as CaM signaling gets activated which may be involved in the maintenance of the mesenchymal state. We also hypothesize that up-regulation of MAPK1, NRAS, HRAS and EGFR and down-regulation of PTEN inhibitor in EGFR_M may lead to its transformation into EGFR_E.

Interestingly, most of the in silico predicted targets from our study have previously been implicated to play a role in EMT. The activation of MAPK1 protein has been shown to induce EMT in MCF10 breast epithelial cells by phosphorylation and consequent stabilization of Twist1 [42]. In another study, the silencing of MAPK1 led to increased expression of E-cadherin and a decrease in vimentin and Snail expression in human cervical cancer cells [43], suggesting its role in the transition from epithelial to mesenchymal phenotype. HRAS and Slug together have been shown to induce the expression of vimentin and enhance cell migration in pre-malignant MCF10A breast epithelial cells [43]. Further, activation of EGFR has been shown to induce the expression of Twist by activating STAT3, suggesting a prominent role of EGFR in EMT [11]. In a recent study where mesenchymal cells were generated from D492 by overexpressing HER2 (ErBb2) [39], subsequent overexpression of EGFR promoted mesenchymal to epithelial transition. In light of this finding, and based on our in silico predictions, we made EGFR overexpressing D492M cell line (D492MEGFR) (method section). D492MEGFR cells showed higher phosphorylation of AKT and ERK1/2 (Fig 5), although reversal of mesenchymal to epithelial phenotype was not observed (S1 Fig). Thus, we hypothesize that for complete reversal of mesenchymal to epithelial phenotype, the MAPK1, NRAS and HRAS genes may need to be overexpressed in addition to EGFR, along with PTEN inhibition in D492M. Likewise, maintaining expression of the MAPK1, NRAS, HRAS and EGFR in epithelial cells while suppressing the expression of PTEN is expected to inhibit EMT. At present, a number of EGFR inhibitors have been approved by the FDA for cancer treatment. These include Cetuximab and Panitumumab that are monoclonal antibodies against EGFR, and Erlotinib, Gefitinib and Lapatinib which are specific tyrosine kinase inhibitors against EGFR. All these compounds are prescribed against advanced, late-stage or metastatic cancers [44,45]. The findings we present here suggest that inhibition of EGFR may contribute to or increase the risk of EMT in cancers of epithelial origin. Indeed, the overexpression of HER2 in D492 cells has recently been shown to suppress EGFR expression and induce EMT [39]. Our predictions also indicate that PTEN inhibition can help maintain the epithelial phenotype, thereby preventing EMT and metastasis. However, recent findings suggest that loss of PTEN function may promote tumor progression in a mouse model [46] and EMT in human colon cancer cells [47], highlighting the fact that further research on the role of PTEN in EMT is needed.

Fig 5. EGFR overexpression in D492M.

(A) Real-Time Quantitative Reverse Transcription PCR of EGFR in D492, D492MEGFR and D492MEmpty, normalized to GAPDH. EGFR transcription level is significantly higher in D492MEGFR compared to D492MEmpty but does not reach the D492 EGFR transcription level. (B) Protein expression of EGFR, Phospho-p44/42 MAPK (ERK1/2) and Phospho-AKT determined by Western blotting. Overexpression of EGFR in D492M leads to increased protein expression of EGFR in D492MEGFR compared to D492MEmpty, but EGFR protein expression does not reach the D492 level. Overexpression of EGFR in D492M leads to increased MAPK (Erk1/2) and Akt phosphorylation which is higher than in both D492MEmpty and D492.

Taken together, these above results demonstrate the EGFR_SN network can be used to study in detail the differences in EGFR signaling between epithelial and mesenchymal cells during EMT and identify gene targets which could possibly be used to hinder or even revert EMT.

Crosstalk of the signaling and metabolism in D492 and D492M

We next studied how changes in AKT signaling could influence metabolic gene expression in EMT. The AKT pathway was chosen since it has previously been shown to affect various metabolic pathways, e.g. glycolysis and fatty acid metabolism in cancer cells [19,48]. To reflect how alterations in the AKT signaling pathway may affect metabolism, a literature based survey was conducted to identify the connections between this pathway and its target metabolic genes. This suggested that active AKT signaling induces the expression of metabolic genes involved in glycolysis, fatty acid metabolism and purine and pyrimidine metabolism, while it suppresses the expression of metabolic genes involved in gluconeogenesis (Table 1).

Table 1. Predicted expression of metabolic genes regulated by AKT in D492 and D492M cells.

The ratio of average flux in each signaling reaction of the AKT pathways in the EGFR_M and EGFR_E network was used to identify differences in relative metabolic gene expression in the D492 and D492M cells. The higher flux observed in the AKT pathway in the EGFR_E network suggested increased expression of genes belonging to glycolysis, fatty acid and purine/pyrimidine metabolism in D492 in comparison to D492M (Table 1). Higher expression of glycolytic genes suggested higher glycolytic activity in the D492 cells.

To test these in silico predictions, we compared the predicted expression of the metabolic genes with their corresponding relative expression values in the microarray data set [34] and compared relative expression of metabolic genes in D492 and D492M. Up- and down-regulated metabolic genes were identified based on differential expression and significance measurements, as analyzed by SAM [49], included in the microarray dataset. A cut-off of 0.05 on the significance measure was used. The relative gene expression of 13 out of 15 metabolic genes (86.6%) affected by AKT signaling were in agreement with the predicted in silico expression (Table 1). Since the experimental expression values of the metabolic genes were not used to generate the EGFR_E and EGFR_M networks, this data corresponds to an independent validation set for the in silico predictions.

However, the gene expression of 2 out of 15 (13.3%) of the metabolic genes (GLUT1 and SREBF1) from the microarray data and in silico predictions were not in agreement. This suggests an alternate level of regulation that metabolic genes may encounter during EMT, in addition to the direct regulation by molecular signaling pathways. For accuracy measure we did not take into account metabolic genes which did not have any detectable expression values (denoted NA). Ultimately, we observed that the expression of most of the metabolic genes directly affected by AKT signaling during EMT was correctly predicted in our D492 model system.

Glycolytic flux is increased in D492 as compared to D492M and consistent with model predictions

Gene expression measurements of metabolic genes are not necessarily a quantitative predictor of their metabolic activity. We therefore tested our in silico prediction that glycolytic activity is higher in D492 than in D492M by measuring the proliferation rates and glucose consumption and lactate secretion rate to estimate glycolytic activity in vitro. Cells with a higher rate of proliferation may have higher nutrient and energy requirements and consequently greater metabolic demand [59]. Cell proliferation assays showed that the D492 cells had a higher growth rate than D492M (Fig 6A). Spent medium of D492 and D492M cells was analyzed to measure glucose and lactate levels. Higher glucose levels and lower lactate levels in cultured supernatants of the D492M cells indicated a lower rate of glucose consumption and lactate secretion rates in D492M cells (Fig 6B). This indicates a lower glycolytic rate in D492M cells compared to D492 and suggests a shift in the glycolytic capacity of the cells in response to EMT and is in agreement with the in silico predictions that indicated higher gene expression of glycolytic enzymes in D492 cells.

Fig 6. Proliferation rate and glycolytic activity are higher in D492 than in D492M cells.

A) Cell proliferation assay demonstrated a higher growth rate of D492 cells compared to D492M cells. B) Spent medium analysis of glucose (dashed lines) and lactate (solid lines) shows higher glucose uptake and lactate secretion in D492 cells (blue) than in D492M cells (red). C) Calculated glucose uptake and lactate secretion rates indicate higher glycolytic flux rates per cell per hour in D492 cells than D492M cells. Data represents results from 3 independent experiments. Error bars represent standard deviation in a single experiment done in triplicate. mM: milli molar, fmol: femto molar.

We conclude that this EMT related decrease in aerobic glycolysis appears to be driven by an overall decrease in the expression of glycolytic enzymes due to down-regulated AKT signaling in D492M cells. To the best of our knowledge, this is the first report that demonstrates that in silico network predictions can be used to study the influence of a molecular signaling pathways, such as AKT, on the metabolic outcome during EMT in breast epithelial cells. Our findings are in agreement with a previous study on human non-small cell lung carcinoma (NSCLC) cells [60] which demonstrated a decrease in aerobic glycolysis during EMT but are in contrast to results obtained for MCF7 breast epithelial cells that have undergone EMT [61].

EMT metabolic network constrained with AKT signaling regulated metabolic genes

Results from the previous section showed that the metabolic phenotype could be accurately predicted by in silico analyses of the changes in the expression of AKT signaling genes. Further, we investigated how these changes in AKT signaling, that impact the expression of metabolic genes (Table 1), are propagated through other metabolic pathways during EMT. Based on RECON2 [62], we have also built an EMT metabolic network (MODEL1602080000). This metabolic network was built by constraining RECON2 with microarray gene expression data [34] of metabolic genes during EMT in D492 and D492M. RECON2 is a global human metabolic reconstruction that has been used previously to investigate regulation of metabolism in diseases like obesity and diabetes [40,62]. In this study, we constrained our EMT metabolic network with the metabolic genes that were predicted to be regulated by changes in AKT signaling (Table 1). This led to the formation of epithelial metabolic (Met_E) and mesenchymal metabolic (Met_M) networks, specific for AKT signaling regulated metabolism during EMT (described in methods section). Metabolic differences between the Met_E and Met_M models were identified based on differences in their relative flux span (methods section). Reactions carrying higher flux in Met_E compared to Met_M included reactions that are involved in N-glycan metabolism, Glycolysis, Fatty acid synthesis, Fatty acid oxidation, nucleotide interconversion and pentose phosphate pathway. Reactions carrying higher flux in Met_M involved Glutathione, glycerophospholipid, and inositol phosphate metabolism. Alteration of metabolic pathways, including N-glycan, glutathione metabolism, glycolysis, fatty acid and purine metabolism that we observed from our constructed Met_E and Met_M metabolic networks, have been shown to play an important role in the regulation of EMT [19,48,57,60,63,64]. However, the details of the mechanism are still unknown. A list of the metabolic reactions similarly predicted to be affected by the AKT pathway in the Met_E and Met_M networks are provided in the supplementary file (S2 File). In conclusion, this method was able to predict metabolic pathways that may be affected downstream upon activation of AKT signaling in breast epithelial cells during EMT. This method extends the approach of associating metabolic phenotype with regulation of signaling pathways. Further, it also suggests the possibility of determining the metabolic regulation in cases that are limited by the availability of metabolomic data or the gene expression data of metabolic genes. Although these predictions are context specific in relation to AKT signaling, the integration of other signaling pathways affected during EMT may give a more coherent picture of altered metabolism.

Comparative analysis of the EGFR signaling network between D492 and three other human breast epithelial cell lines

We next studied whether a general trend of higher flux in the reactions of the AKT and RAS pathways (downstream of EGFR) was observed in other human breast epithelial cell lines, when compared to their mesenchymal counterparts. We mapped the microarray transcriptomic datasets for the three human breast epithelial cell lines (HMLE, MCF-7 and MCF-10A) onto the EGFR_SN network similar to the method used for the D492 cells (methods section). Comparisons between cell lines were done in terms of the ratio between flux in the mesenchymal network vs flux in the corresponding epithelial network (Fig 7). Numerical values of the fluxes within AKT, RAS and CaM pathways in HMLE, MCF-7 and MCF-10A are given in supplementary file (S3 File).

Fig 7. Comparison of flux distribution in the reactions within AKT, RAS and CaM pathway in different breast epithelial cell lines.

The heat map has been generated using log2 relative mean flux of the reactions within the AKT, RAS and DAG/IP3 and CaM pathways in epithelial and mesenchymal cells of D492, MCF, MCF10A and HMLE cells. Exchange reactions are not included in the heat map. The negative and positive values as depicted on the color scale denote higher flux in reactions of epithelial and mesenchymal cells, respectively. D492M, MCF7_mirna, MCF7_snail, MCF10_snail, MCF10A_tgfb, HMLE_slug, HMLE_twist and HMLE_snail are EMT derived mesenchymal cells.

Higher flux through the reactions in the AKT pathway was observed in both the D492 and HMLE epithelial cells, suggesting that the HMLE cells may have similar metabolic phenotype as D492 (Fig 7). In contrast, the mesenchymal counterparts of MCF7 and MCF10A cells had higher flux in the AKT pathway (Fig 7), suggesting that cells that have undergone EMT may have increased glycolytic activity. This is in agreement with Kondaveeti et al. who have reported an increase in glycolytic activity post-EMT in MCF-7 cells as a result of increased expression of glucose transporters and lactate dehydrogenase [61]. The flux in the RAS/MAPK pathway was higher in the D492 and the MCF7 epithelial cells than in their mesenchymal counterparts. Analysis of DAG/IP3 and CaM pathway showed that D492M, MCF7 mesenchymal cells and those of HMLE which have undergone EMT due to induction of Twist, had higher flux as compared to their epithelial counterpart (Fig 7). Thus, no general pattern was observed in the flux distributions between the epithelial and mesenchymal networks for the different breast epithelial cell lines. Induction of EMT by different factors (viral induction of SNAIL, SLUG, TWIST, miR374a or TGFβ1 treatment) also seemed to differentially regulate signaling pathways as was evident in the MCF10A and HMLE cells. For example, the induction of EMT in HMLE cells by overexpression of Slug and Twist resulted in different flux patterns in the RAS/MAPK and DAG/IP3 and CaM pathways (Fig 7). Similar effect was seen by Deshiere et al., where they have shown that TGFβ1 treatment and CK2b silencing activate divergent signaling pathways, that ultimately lead to EMT in MCF-10A cells [65]. Finally, we compared the predicted metabolic phenotypes to the metabolic gene expression data for the MCF7, MCF10A and HMLE cell lines similar to the method used for the D492 cells, however the gene expression data of metabolic genes were not statistically significant and hence was not included in our study (S1S4 Tables).

In summary, we observed that different cell lines may affect different signaling regulation during EMT. Moreover, variation in the methods of EMT induction may dictate differential regulation of the signaling and metabolic cross talk.


Herein we have demonstrated a method to build a stoichiometric model of the EGFR signaling (EGFR_SN) network employing COBRA methods that aids in understanding the differential activation of downstream EGFR signaling pathways during EMT. Epithelial and mesenchymal specific EGFR signaling networks were obtained by integrating microarray transcriptomics data of signaling genes from the D492 breast epithelial and mesenchymal cells with EGFR_SN. The epithelial and mesenchymal networks were used to predict the expression of metabolic genes. The predicted expression values were in agreement with transcriptomics data of metabolic genes as well as biochemical data that demonstrated higher glycolytic activity in D492 epithelial cells. Furthermore, signaling genes leading to reversion to the epithelial phenotype (MET) via up- regulation in the mesenchymal cells were predicted. Additional in vitro testing would be required to confirm these in silico predictions. Thus, in this study we showed that the metabolic phenotype can be predicted in silico using gene expression profiles of EGFR signaling components.

EGFR_SN is not limited in scope to the investigation of EMT. The signalling network could be used to highlight signaling-metabolic crosstalk in different cell types for which metabolic reconstructions exist [66] or for different conditions [66] where EGFR signalling is known to be influential [67]. Furthermore, the network described herein could be expanded to allow for more comprehensive coverage of signalling pathways of relevance to EMT. Signaling networks for platelet derived growth factor (PDGF), insulin like growth factor (IGF), and vascular endothelial growth factor (VEGF) for example, could be constructed and co-integrated. The co-integration of these signaling networks regulated by different growth factors would give more comprehensive knowledge of cross talk between signaling and metabolic pathways during EMT.

The pipeline developed for D492 was used on other cell models representative of breast epithelial cell lines. We assumed that different breast epithelial cells models would have similar signaling patterns and hence could have a general interpretation of regulation of signaling pathways during EMT. In contrast to our hypothesis, the regulation of signaling pathways showed no general pattern and appeared to be a cell-specific phenomenon. Flux values in the AKT pathway from our network, suggest that there may be an EMT related decrease in aerobic glycolysis in both the HMLE and D492 cell lines, while the opposite was observed in the MCF-7 and MCF-10A breast epithelial cells. There are several possible explanations for this disparity. First, the highly complex nature of the regulation of EMT, which may be differentially regulated by the cellular micro environment or EMT inducing factors as seen in our in silico predictions. Second, our study of EMT signal transduction is primarily based on transcriptional signatures which may or may not necessarily correlate with the translational output (protein-levels) [68,69]. This last issue might be addressed by co-integrating transcriptomics and proteomics/phospho-proteomics data in order to obtain more accurate models. Such a strategy was recently reported where it was used to reconstruct a metabolic network for predicting of metabolic signatures in diabetes patients [70]. The disparity between cell models suggests considerable heterogeneity of the cell models used for EMT research in general. Finally, these efforts highlight a lack of comprehensive datasets available that accurately describe EMT and ultimately hinder mechanistic understanding of the genotype phenotype relationship underlying EMT. The direct link between regulation of signaling pathways and the consequent metabolic phenotype may be of clinical interest, as metabolically based therapeutics to combat cancer EMT could be masked by inaccurate metabolic understanding.


Reconstruction of signaling network

The EGFR pathway network was downloaded from the Reactome database which is curated and peer reviewed [71]. The EGFR pathway was then converted from SBML to COBRA format for further analyses. This conversion is based on the stoichiometric coefficients of the reacting species provided in the Reactome pathway and also requires setting constraints on each reaction in the form of lower and upper bounds, which determine the minimum and maximum allowable reaction rates (fluxes), respectively [33]. Flux in a signaling network is defined as the rates of phosphorylation, de-phosphorylation, dimerization, or binding of proteins.

The COBRA model was represented by an m by r stoichiometric matrix S, where m denotes the number of network components (metabolites, proteins, and complexes) and r the number of network reactions. Reactions within the network were mass-balanced. The system was assumed to be at steady state, which means that the fluxes v = (v1,…,vm) satisfy the equations Sv  =  0. The upper and lower bounds of all the internal reactions were set to 1000 and zero, respectively. A lower bound of zero was used since all the reactions are irreversible.

The initial COBRA model contained many dead ends and was infeasible due to network gaps [72]. Dead-ends represents those reacting species which are either only produced or only consumed in the network, leading to blocked reactions, i.e. reactions unable to carry flux.To remove the dead ends and obtain a feasible model, exchange reactions were introduced allowing uptake and secretion of components across the system boundary. By adding exchange reaction for all reacting species, a feasible model was obtained. In this model, the network topology becomes irrelevant since all demands on the network can be met by the exchange reactions. To avoid this situation, an optimization algorithm, ‘relax_rxns’ (S1 Dataset) was developed that enabled a feasible steady state network, while minimizing uptake/secretion (exchange) of the dead-end molecules. The methodology is similar to the one used by Vardi et al. [21]. First, the lower bounds of all the internal reactions were set to 1 to force removal of blocked reactions (reactions with zero flux) and consequently removal of the dead-end species. Exchange reactions were then added for every reacting species in the network, initially with all uptake and secretion rates set to zero. The optimization algorithm returns a minimal set of exchange reactions that need to be present in order to remove all dead ends. These reactions were included in the final model, but the other exchange reactions were removed. The optimization problem was formulated as follows: (1) (2) (3) (4) (5) (6) (7)

The decision variables are v, the flux values in individual reactions, pj and nj which represent the amount of relaxation of upper and lower bounds for reaction j, respectively and binary variables yj which indicate whether reaction j is relaxed or not. The objective is to minimize the number of reactions that are relaxed (1). The steady state mass balance constraints are represented by (2), Rr is the set of reactions that are to be relaxed (3) with the corresponding upper and lower bounds set to zero. The set Rn represents all the remaining reactions (4) with the corresponding upper bounds set to 1000 and lower bounds set to 1. The value of the constant M in (5) was set to 1000. The optimization model was implemented in Matlab (Mathworks, Natick, MA, USA) using the CVX modeling language [73,74] and solved using the Gurobi solver [75]. The version of CVX used in this study supports binary variables as those in constraint (7).

Modelling modifiers, activators and inhibitors

The signaling network from section 4.1 was extended to include modifiers, activators and inhibitors. Modifiers are phosphorylated protein entities that further phosphorylate downstream targets. In the original Reactome pathway, the modifiers were not included as reacting species in reactions, such that they were not connected to downstream targets. For this reason modifiers were included as the reacting species in their target reactions. We included the modifiers in the same way as described by Dasika et al. [22]. Briefly, modifier mod1 acts as a modifier for the transition of A to B. In order to avoid unambiguous stoichiometry, we added mod1p, as a product of mod1 during the reaction as shown below.

Activators and inhibitors are responsible for positive and negative regulation of the reaction, respectively. Activators and inhibitors were included in the model via gene-protein reaction rules as described in the next section.

Addition of gene information

The original Reactome signaling network described the transmission of signal, in terms of activation or inhibition of the subsequent downstream entities. This did not include any enzymatic reactions or the gene-protein rules required to map gene expression data [33]. The signaling network was therefore modified to include GPRs by associating each network reaction with genes encoding modifiers, activators and inhibitors. Reactions not having any information of modifiers, activators or inhibitors were not designated with GPRs. For the GPR generation, we employed UniProt IDs of the protein entities within the Reactome pathway to identify genes, which were then associated with reactions using Boolean logic. Multiprotein complexes were represented with an ‘AND’ operator, while isoforms were represented by an ‘OR’ operator to create a corresponding Boolean rule. Wherever an inhibitor was involved, the corresponding genes were prefixed by a ‘NOT’ operator in the GPR. Exchange reactions were also assigned GPRs. If a reaction is catalysed by a modifier and inhibited by an inhibitor, the presence of modifier will activate the reaction, while an inhibitor will inhibit the reaction. GPRs for all the reactions are provided in the S1 File in the sheet named “GPRs”. The blank rows denote that they were not assigned any GPRs. Fig 8 below, illustrates examples of GPR generation.

Fig 8. The rules for adding GPRs.

Reaction1 is catalyzed by a modifier which is a multiprotein complex of gene1 and gene2. The combination of both the genes are required for the reaction to take place. Reaction2 is inhibited in presence of inhibitor which is an isoform of gene3 and gene4. Presence of either gene3 or gene4 will inhibit Reaction2. A ‘NOT’ operator is assigned to the GPR of Reaction2 to indicate inhibition. A GPR is not assigned to Reaction 3 in this example.

The resulting network is referred to as EGFR_SN. A spreadsheet containing all the network reactions, reacting species, modifiers, inhibitor, activators, and GPRs in different sheets is provided in the supplementary file (S1 File). All the models used in this study are provided in the supplementary dataset (S1 Dataset).

Microarray data integration

Gene expression data for the D492 and D492M cell lines was obtained from Sigurdsson et al [34]. The microarray expression data for the HMLE, MCF-7 and MCF-10A cell lines was obtained from NCBI GEO [76], GEO IDs: GSE52593 [77], GSE43495 [78], GSE58252 [79], GSE39358 [80], and GSE28569 [65]. Illumina/Affimetrix IDs within the microarray data were mapped with the Uniprot IDs of the genes in the EGFR_SN network using the Python API of bioDBnet, biological DataBase network. Mapped gene expression of the EGFR signaling genes for each cell line is provided in the supplementary material S4 File with sheets named after each cell line. Negative values indicate higher gene expression in epithelial cells and positive values indicate higher gene expression in mesenchymal cells. Integration of expression data with the EGFR_SN network and subsequent analysis was performed in Matlab. The cut-off value on the relative fold change between epithelial and mesenchymal cell lines, to determine up- and down-regulated genes, depended on the relative expression of housekeeping genes and the statistical significance (p-value ≤ 0.05) of the fold change. Accordingly, a cut-off of ≥ 2 was considered for D492 cells, 0.5 for MCF7 and MCF10A cells and 0.3 for HMLE cells. Since GPRs link genes with reactions, up- and down-regulated genes identified consequently up-regulated and down-regulated reactions. The change in the expression value of each gene in the EGFR_SN signaling network was used to define the upper and lower flux bounds of its associated reactions, such that up-regulated reactions were allowed to have higher flux values and down-regulated reactions were allowed to have lower flux values. Since, the upper bounds on individual fluxes are essentially infinite, up-regulation in an epithelial model was simulated by downregulating the corresponding reaction in its mesenchymal counterpart, and vice versa. Flux bounds of the up-regulated reactions in D492 were constrained by an arbitrary factor one-hundredth of the initial bounds in D492M and vice versa. Similarly, flux bounds of the activated and inhibited reactions were constrained. Activation in the epithelial model was simulated by downregulating the corresponding reaction in its mesenchymal counterpart and vice versa, while inhibition was simulated by downregulating the corresponding reaction in the same model. This led to the formation of EGFR_E and EGFR_M networks for D492 and D492M respectively.

Random sampling was used to obtain flux distributions in the networks [81] using the COBRA toolbox [32], technical details of which can be found in Supplementary methods (S1 Methods).

Flux differences in epithelial and mesenchymal networks and crosstalk to metabolism

Flux differences in individual reactions in the mesenchymal and epithelial networks were quantified in terms of fold changes, vM(i) / vE(i) where vM(i) and vE(i) represent the average flux in reaction i for the mesenchymal and epithelial networks, respectively. Reactions which had vM(i) / vE(i) greater than 1 carried higher flux in the mesenchymal network. A ratio below 1 indicates higher flux in the epithelial network. A literature based survey provided evidence of whether a metabolic gene is positively or negatively regulated by AKT signaling (S5 Table). The prediction of the metabolic gene expression derives from whether EGFR_E or EGFR_M have higher flux in AKT signaling.

Minimization of the distance between the Mesenchymal and Epithelial flux distributions

The optimization algorithm of section 4.1 was modified by replacing the objective function (1) by (1a) where v and yj represents the decision variables as before, vE are fixed values representing the mean flux distribution of EGFR_E, obtained from random sampling and ||. || represents the Euclidean norm. The upper and lower bounds for each flux correspond to the values from the EGFR_M network. The algorithm returns a set of reactions in EGFR_M whose bounds can be relaxed in order to obtain a flux distribution that resembles that of EGFR_E.

The effects of active AKT signaling on the EMT metabolic network

We constrained RECON 2 [62] using the microarray data from Sigurdsson et al. [34] to generate an EMT metabolic network (submitted to Biomodels: MODEL1602080000). This reconstruction consists of all the metabolic reactions encoded in both the D492 and D492M cell lines. This EMT metabolic network has information on GPRs connecting each reaction with the genes of the enzymes catalyzing the reaction. Since GPRs associate genes with corresponding reactions, the metabolic genes predicted to be up-regulated in epithelial cells due to AKT signaling (Table 1) led to the identification of up-regulated reactions in D492 and similarly were determined in D492M. This information was then used to define the upper and lower flux bounds of the affected reactions in the EMT metabolic network to form Met_E (metabolic epithelial) and Met_M (metabolic mesenchymal) networks (Fig 9). Up- and down-regulation of the Met_E and Met_M models was simulated as described in section 4.4. Specifically, up-regulation in an epithelial metabolic model was simulated by downregulating the corresponding reaction in its mesenchymal counterpart, and vice versa. Flux bounds of the up-regulated metabolic reactions due to AKT signaling in D492 were constrained by an arbitrary factor, one-hundredth of the initial bounds in D492M and vice versa.

Fig 9. The pipeline used to determine the effects of active AKT signaling on the EMT metabolic network.

The metabolic genes predicted to be altered in D492 and D492M dependent on the AKT signaling network, were used to constrain the EMT metabolic network generated from Recon2, in order to identify downstream metabolic pathways that are affected by AKT activation.

The flux values through each reaction in both the models were determined through random sampling method and the relative flux span, sM(i)/ sE(i) was used to quantify the flux differences between the networks. Here, sM(i) and sE(i) represent the average flux in reaction i for the mesenchymal and epithelial cells, respectively.

Cell proliferation assay and metabolite measurements

For proliferation assays and spent medium analysis, 1.5 × 104 D492 or D492M cells were cultured in 48-well plates (Costar) in 200μL H14 medium as previously described [35]. Spent medium was collected and cells were fixed in ice-cold methanol at 0 h, 24 h, 48 h and 72 h. To estimate proliferation, cells were stained with crystal violet, washed thoroughly with water, dissolved in 30% acetic acid and read in a spectrophotometer at 570 nm. The observed growth rates were 0.0276 h−1 for D492 and 0.0161 h−1 for D492M. The glucose and lactate concentrations in the spent medium were measured in an ABL90 blood gas analyser (Radiometer, Brønshøj, Denmark). Glucose uptake and lactate secretion per cell were calculated for each cell line as described in [82], based on the ABL90 measurements and growth rates.

Viral transduction

Vectors used for viral production were acquired from Addgene, pBABE-EGFR and empty backbone (#11011, #1764, respectively) and were used as provided. Phoenix HEK293 cells were used for retroviral (EGFR) virus production, using Arrestin transfection (Life Technologies). D492M cells were transduced overnight with viral supernatant containing 8 μg/ml Polybrene (Sigma-Aldrich). EGFR and empty backbone cells were selected using 2 μg/ml puromycin (Life Technologies).

Real-Time Quantitative Reverse Transcription PCR

Total RNA was isolated using TRI-Reagent solution (Ambion) and reverse transcribed using SuperScript IV (Invitrogen). The resulting cDNA was used for Real-Time Quantitative Reverse Transcription PCR, in Maxima Probe/ROX qPCR Master Mix (Thermo Scientific) with primer pairs and probes for EGFR (Hs00540086_m1, Life Technologies), ZEB1 (Hs00232783_m1, Life Technologies) and GAPDH (Hs99999905_m1, Life Technologies). Experiments were done in triplicates on 7500 Real Time PCR System (Applied Biosystems). EGFR mRNA levels were normalized to GAPDH and relative mRNA differences were calculated using the 2ΔCt method.

Western blotting

Proteins were isolated using RIPA lysis buffer supplemented with protease and phosphatase inhibitors (Life Technologies). For Western blot analysis 5 μg of protein lysates were loaded per lane on NuPage 10% Bis-Tris gels (Life Technologies) in 2-(N-morpholino)ethanesulfonic acid (MES) running buffer (Life Technologies). Samples were denatured using 10% mercaptoethanol at 95°C for 10 minutes before loading. Samples were transferred to Immobilon FL PVDF membranes (Millipore) and blocked in Li-cor blocking buffer for 1 hour. Primary antibodies were incubated overnight at 4°C and secondary IRDye antibodies were incubated at room temperature for 1 hour (Licor). The following primary antibodies were used for Western blotting: Actin antibody (Abcam, ab3280), EGF Receptor (Cell Signaling, CS#4267), Phospho-Akt (Ser473) (Cell Signaling, CS#4060), Phospho-p44/42 MAPK (Erk1/2) (Thr202/Tyr204) (Cell Signaling, CS#4370), N-Cadherin (BD Biosciences, 610921), E-Cadherin (BD Biosciences, 610182) and Cytokeratin 14 (Abcam, ab15461). Near-infrared fluorescence visualization was measured using Odyssey CLx scanner (Li-Cor, Cambridge, UK).

Supporting Information

S1 Dataset. Codes and all the models in COBRA format used in this study.


S1 File. A spreadsheet describing EGFR_SN attributes, along with flux distribution in EGFR_E and EGFR_M for D492 cells.


S2 File. Flux distribution in Met_E and Met_M.


S3 File. Flux distribution within EGFR signaling cascade in MCF7, MCF10A and HMLE cell lines.


S4 File. Expression data mapped to EGFR signaling genes for each cell line.


S2 Methods. Modelpipeline: Detailed procedure for generation and simulation of EGFR signaling network.


S1 Fig. Epithelial and mesenchymal marker expression.

(A) Western blotting for epithelial markers E-Cadherin and CK14 and mesenchymal marker N-Cadherin. Overexpression of EGFR in D492M does not revert the mesenchymal phenotype towards an epithelial phenotype. D492MEGFR retains N-Cadherin expression and does not gain E-Cadherin or CK14 expression. (B) Real-Time Quantitative Reverse Transcription PCR of the EMT transcription factor ZEB1 in D492, D492MEGFR and D492MEmpty normalized to GAPDH. ZEB1 transcription was not detected in D492 and the transcription level of ZEB1 was unchanged in D492MEGFR compared to D492MEmpty. D492MEGFR retains mesenchymal ZEB1 expression.


S1 Table. Gene associated with the reactions important for the reversal of EGFR_M to EGFR_E.


S2 Table. Predicted expression of metabolic genes regulated by AKT in HMLE cells.

ER: Microarray Expression data in TWIST, SLUG and SNAIL induced HMLE cells respectively. PE: Proposed Expression. Predictions in agreement with microarray data are highlighted in green and that otherwise are highlighted in orange.


S3 Table. Predicted expression of metabolic genes regulated by AKT in MCF10A cells.

ER: Microarray Expression data in TWIST, SLUG and SNAIL induced HMLE cells respectively. PE: Proposed Expression. Predictions in agreement with microarray data are highlighted in green and that otherwise are highlighted in orange.


S4 Table. Predicted expression of metabolic genes regulated by AKT in MCF7 cells.

ER: Microarray Expression data in TWIST, SLUG and SNAIL induced HMLE cells respectively. PE: Proposed Expression. Predictions in agreement with microarray data are highlighted in green and that otherwise are highlighted in orange.


S5 Table. Regulation of metabolic gene expression by AKT signaling.

“Reference” column lists the studies from which the influence of AKT signaling on the expression of the corresponding metabolic genes was derived. +1 and -1 denotes positive and negative regulation, respectively.



The authors would like to acknowledge Miha Skalic and Guðrún Valdimarsdóttir for valuable discussions. We would also like to thank the members of Stem cell research unit, Biomedical Centre for valuable discussion on this manuscript.

Author Contributions

Conceived and designed the experiments: KSC OR. Performed the experiments: KSC SH EB. Analyzed the data: KSC NR SH EB TG SG OR. Contributed reagents/materials/analysis tools: TG. Wrote the paper: KSC SH SG OR.


  1. 1. Kalluri R, Weinberg RA. The basics of epithelial-mesenchymal transition. J Clin Invest. 2009;119: 1420–8. pmid:19487818
  2. 2. Polyak K, Weinberg R a. Transitions between epithelial and mesenchymal states: acquisition of malignant and stem cell traits. Nat Rev Cancer. 2009;9: 265–73. pmid:19262571
  3. 3. Mani SA, Guo W, Liao M-J, Eaton EN, Ayyanan A, Zhou AY, et al. The epithelial-mesenchymal transition generates cells with properties of stem cells. Cell. Elsevier; 2008;133: 704–15. pmid:18485877
  4. 4. Huber MA, Kraut N, Beug H. Molecular requirements for epithelial-mesenchymal transition during tumor progression. Curr Opin Cell Biol. 2005;17: 548–58. pmid:16098727
  5. 5. Garg M. Epithelial-mesenchymal transition—activating transcription factors—multifunctional regulators in cancer. World J Stem Cells. 2013;5: 188–95. pmid:24179606
  6. 6. De Craene B, Berx G. Regulatory networks defining EMT during cancer initiation and progression. Nat Rev Cancer. Nature Publishing Group; 2013;13: 97–110. pmid:23344542
  7. 7. Thiery JP, Acloque H, Huang RYJ, Nieto MA. Epithelial-mesenchymal transitions in development and disease. Cell. 2009;139: 871–90. pmid:19945376
  8. 8. Hardy KM, Booth BW, Hendrix MJC, Salomon DS, Strizzi L. ErbB/EGF signaling and EMT in mammary development and breast cancer. J Mammary Gland Biol Neoplasia. 2010;15: 191–9. pmid:20369376
  9. 9. Misra A, Pandey C, Sze SK, Thanabalu T. Hypoxia activated EGFR signaling induces epithelial to mesenchymal transition (EMT). PLoS One. 2012;7: e49766. pmid:23185433
  10. 10. Lu Z, Ghosh S, Wang Z, Hunter T. Downregulation of caveolin-1 function by EGF leads to the loss of E-cadherin, increased transcriptional activity of β-catenin, and enhanced tumor cell invasion. Cancer Cell. 2003;4: 499–515. pmid:14706341
  11. 11. Lo H-W, Hsu S-C, Xia W, Cao X, Shih J-Y, Wei Y, et al. Epidermal growth factor receptor cooperates with signal transducer and activator of transcription 3 to induce epithelial-mesenchymal transition in cancer cells via up-regulation of TWIST gene expression. Cancer Res. 2007;67: 9066–76. pmid:17909010
  12. 12. Lee K, Gjorevski N, Boghaert E, Radisky DC, Nelson CM. Snail1, Snail2, and E47 promote mammary epithelial branching morphogenesis. EMBO J. EMBO Press; 2011;30: 2662–74.
  13. 13. Chang F, Lee JT, Navolanic PM, Steelman LS, Shelton JG, Blalock WL, et al. Involvement of PI3K/Akt pathway in cell cycle progression, apoptosis, and neoplastic transformation: a target for cancer chemotherapy. Leukemia. Nature Publishing Group; 2003;17: 590–603. pmid:12646949
  14. 14. McCubrey JA, Steelman LS, Chappell WH, Abrams SL, Wong EWT, Chang F, et al. Roles of the Raf/MEK/ERK pathway in cell growth, malignant transformation and drug resistance. Biochim Biophys Acta. 2007;1773: 1263–84. pmid:17126425
  15. 15. Cairns RA, Harris IS, Mak TW. Regulation of cancer cell metabolism. Nat Rev Cancer. Nature Publishing Group, a division of Macmillan Publishers Limited. All Rights Reserved.; 2011;11: 85–95. pmid:21258394
  16. 16. Li L, Li W. Epithelial-mesenchymal transition in human cancer: comprehensive reprogramming of metabolism, epigenetics, and differentiation. Pharmacol Ther. 2015;150: 33–46. pmid:25595324
  17. 17. WARBURG O. On the origin of cancer cells. Science. 1956;123: 309–14. Available: pmid:13298683
  18. 18. Amoêdo ND, Valencia JP, Rodrigues MF, Galina A, Rumjanek FD. How does the metabolism of tumour cells differ from that of normal cells. Biosci Rep. 2013;33.
  19. 19. Elstrom RL, Bauer DE, Buzzai M, Karnauskas R, Harris MH, Plas DR, et al. Akt stimulates aerobic glycolysis in cancer cells. Cancer Res. 2004;64: 3892–9. pmid:15172999
  20. 20. Pavlova NN, Thompson CB. The Emerging Hallmarks of Cancer Metabolism. Cell Metab. Elsevier; 2016;23: 27–47. pmid:26771115
  21. 21. Vardi L, Ruppin E, Sharan R. A linearized constraint-based approach for modeling signaling networks. J Comput Biol. 2012;19: 232–40. pmid:22300322
  22. 22. Dasika MS, Burgard A, Maranas CD. A computational framework for the topological analysis and targeted disruption of signal transduction networks. Biophys J. 2006;91: 382–98. pmid:16617070
  23. 23. Papin JA, Palsson BO. The JAK-STAT signaling network in the human B-cell: an extreme signaling pathway analysis. Biophys J. 2004;87: 37–46. pmid:15240442
  24. 24. Wang DYQ, Cardelli L, Phillips A, Piterman N, Fisher J. Computational modeling of the EGFR network elucidates control mechanisms regulating signal dynamics. BMC Syst Biol. BioMed Central Ltd; 2009;3: 118. pmid:20028552
  25. 25. Araujo RP, Petricoin EF, Liotta LA. A mathematical model of combination therapy using the EGFR signaling network. Biosystems. 2005;80: 57–69. pmid:15740835
  26. 26. Ruths D, Muller M, Tseng J-T, Nakhleh L, Ram PT. The signaling petri net-based simulator: a non-parametric strategy for characterizing the dynamics of cell-specific signaling networks. PLoS Comput Biol. 2008;4: e1000005. pmid:18463702
  27. 27. Li F, Thiele I, Jamshidi N, Palsson BØ. Identification of potential pathway mediation targets in Toll-like receptor signaling. PLoS Comput Biol. 2009;5: e1000292. pmid:19229310
  28. 28. Aurich MK, Thiele I. Contextualization Procedure and Modeling of Monocyte Specific TLR Signaling. PLoS One. 2012;7.
  29. 29. Richard G, Belta C, Julius AA, Amar S. Controlling the outcome of the Toll-like receptor signaling pathways. PLoS One. 2012;7: e31341. pmid:22363624
  30. 30. Orton RJ, Sturm OE, Vyshemirsky V, Calder M, Gilbert DR, Kolch W. Computational modelling of the receptor-tyrosine-kinase-activated MAPK pathway. Biochem J. 2005;392: 249–61. pmid:16293107
  31. 31. Sharan R, Karp RM. Reconstructing Boolean models of signaling. J Comput Biol. 2013;20: 249–57. pmid:23286509
  32. 32. Schellenberger J, Que R, Fleming RMT, Thiele I, Orth JD, Feist AM, et al. Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0. Nat Protoc. 2011;6: 1290–307. pmid:21886097
  33. 33. Thiele I, Palsson BØ. A protocol for generating a high-quality genome-scale metabolic reconstruction. Nat Protoc. Nature Publishing Group; 2010;5: 93–121. pmid:20057383
  34. 34. Sigurdsson V, Hilmarsdottir B, Sigmundsdottir H, Fridriksdottir AJR, Ringnér M, Villadsen R, et al. Endothelial induced EMT in breast epithelial cells with stem cell properties. PLoS One. Public Library of Science; 2011;6: e23833. pmid:21915264
  35. 35. Gudjonsson T, Villadsen R, Nielsen HL, Rønnov-Jessen L, Bissell MJ, Petersen OW. Isolation, immortalization, and characterization of a human breast epithelial cell line with stem cell properties. Genes Dev. 2002;16: 693–706. pmid:11914275
  36. 36. Croft D, Mundo AF, Haw R, Milacic M, Weiser J, Wu G, et al. The Reactome pathway knowledgebase. Nucleic Acids Res. 2014;42: D472–7. pmid:24243840
  37. 37. Hilmarsdóttir B, Briem E, Sigurdsson V, Franzdóttir SR, Ringnér M, Arason AJ, et al. MicroRNA-200c-141 and ΔNp63 are required for breast epithelial differentiation and branching morphogenesis. Dev Biol. 2015;403: 150–61. pmid:25967125
  38. 38. Sigurdsson V, Ingthorsson S, Hilmarsdottir B, Gustafsdottir SM, Franzdottir SR, Arason AJ, et al. Expression and functional role of sprouty-2 in breast morphogenesis. PLoS One. 2013;8: e60798. pmid:23573284
  39. 39. Ingthorsson S, Andersen K, Hilmarsdottir B, Maelandsmo GM, Magnusson MK, Gudjonsson T. HER2 induced EMT and tumorigenicity in breast epithelial progenitor cells is inhibited by coexpression of EGFR. Oncogene. Macmillan Publishers Limited; 2015.
  40. 40. Väremo L, Nookaew I, Nielsen J. Novel insights into obesity and diabetes through genome-scale metabolic modeling. Front Physiol. 2013;4: 92. pmid:23630502
  41. 41. Segrè D, Vitkup D, Church GM. Analysis of optimality in natural and perturbed metabolic networks. Proc Natl Acad Sci U S A. 2002;99: 15112–7. pmid:12415116
  42. 42. Hong J, Zhou J, Fu J, He T, Qin J, Wang L, et al. Phosphorylation of serine 68 of Twist1 by MAPKs stabilizes Twist1 protein and promotes breast cancer cell invasiveness. Cancer Res. 2011;71: 3980–90. pmid:21502402
  43. 43. Li X- W, Tuergan M, Abulizi G. Expression of MAPK1 in cervical cancer and effect of MAPK1 gene silencing on epithelial-mesenchymal transition, invasion and metastasis. Asian Pac J Trop Med. 2015;8: 937–943. pmid:26614994
  44. 44. Diaz R, Nguewa PA, Parrondo R, Perez-Stable C, Manrique I, Redrado M, et al. Antitumor and antiangiogenic effect of the dual EGFR and HER-2 tyrosine kinase inhibitor lapatinib in a lung cancer model. BMC Cancer. BioMed Central; 2010;10: 188. pmid:20459769
  45. 45. Kim H-P, Han S-W, Kim S-H, Im S-A, Oh D-Y, Bang Y-J, et al. Combined lapatinib and cetuximab enhance cytotoxicity against gefitinib-resistant lung cancer cells. Mol Cancer Ther. 2008;7: 607–15. pmid:18347147
  46. 46. Ebbesen SH, Scaltriti M, Bialucha CU, Morse N, Kastenhuber ER, Wen HY, et al. Pten loss promotes MAPK pathway dependency in HER2/neu breast carcinomas. Proc Natl Acad Sci U S A. 2016.
  47. 47. Bowen KA, Doan HQ, Zhou BP, Wang Q, Zhou Y, Rychahou PG, et al. PTEN loss induces epithelial—mesenchymal transition in human colon cancer cells. Anticancer Res. 2009;29: 4439–49. Available: pmid:20032390
  48. 48. Porstmann T, Griffiths B, Chung Y-L, Delpuech O, Griffiths JR, Downward J, et al. PKB/Akt induces transcription of enzymes involved in cholesterol and fatty acid biosynthesis via activation of SREBP. Oncogene. 2005; 6465–6481. pmid:16007182
  49. 49. Tusher VG, Tibshirani R, Chu G. Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci U S A. 2001;98: 5116–21. pmid:11309499
  50. 50. Shen W. Akt and Mammalian Target of Rapamycin Regulate Separate Systems of Proteolysis in Renal Tubular Cells. J Am Soc Nephrol. 2006;17: 2414–2423. pmid:16885413
  51. 51. Barthel A, Okino ST, Liao J, Nakatani K, Li J, Whitlock JP, et al. Regulation of GLUT1 Gene Transcription by the Serine/Threonine Kinase Akt1. J Biol Chem. 1999;274: 20281–20286. pmid:10400647
  52. 52. Lochhead PA, Coghlan M, Rice SQ, Sutherland C. Inhibition of GSK-3 selectively reduces glucose-6-phosphatase and phosphatase and phosphoenolypyruvate carboxykinase gene expression. Diabetes. 2001;50: 937–46. Available: pmid:11334436
  53. 53. Miyamoto S, Murphy AN, Brown JH. Akt mediates mitochondrial protection in cardiomyocytes through phosphorylation of mitochondrial hexokinase-II. Cell Death Differ. 2008;15: 521–9. pmid:18064042
  54. 54. Vander Heiden MG, Plas DR, Rathmell JC, Fox CJ, Harris MH, Thompson CB. Growth factors can influence cell growth and survival through effects on glucose metabolism. Mol Cell Biol. 2001;21: 5899–912. Available: pmid:11486029
  55. 55. Novellasdemunt L, Tato I, Navarro-Sabate A, Ruiz-Meana M, Méndez-Lucas A, Perales JC, et al. Akt-dependent activation of the heart 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase (PFKFB2) isoenzyme by amino acids. J Biol Chem. 2013;288: 10640–51. pmid:23457334
  56. 56. Yoon S, Lee MY, Park SW, Moon JS, Koh YK, Ahn YH, et al. Up-regulation of acetyl-CoA carboxylase alpha and fatty acid synthase by human epidermal growth factor receptor 2 at the translational level in breast cancer cells. J Biol Chem. 2007;282: 26122–26131. pmid:17631500
  57. 57. Wang W, Fridman a., Blackledge W, Connelly S, Wilson I a., Pilz RB, et al. The Phosphatidylinositol 3-Kinase/Akt Cassette Regulates Purine Nucleotide Synthesis. J Biol Chem. 2009;284: 3521–3528. pmid:19068483
  58. 58. Saha Arindam, Connelly Stephen, Jiang Jingjing, Shunhui Zhuang DTA, Tony Phan, Pilz Renate B. and GRB . Akt Phosphorylation and Regulation of Transketolase Is a Nodal Point for Amino Acid Control of Purine Synthesis. Mol Cell. 2014;55: 264–276. pmid:24981175
  59. 59. DeBerardinis RJ, Lum JJ, Hatzivassiliou G, Thompson CB. The biology of cancer: metabolic reprogramming fuels cell growth and proliferation. Cell Metab. 2008;7: 11–20. pmid:18177721
  60. 60. Thomson S, Petti F, Sujka-Kwok I, Mercado P, Bean J, Monaghan M, et al. A systems view of epithelial-mesenchymal transition signaling states. Clin Exp Metastasis. 2011;28: 137–55. pmid:21194007
  61. 61. Kondaveeti Y, Guttilla Reed IK, White BA. Epithelial-mesenchymal transition induces similar metabolic alterations in two independent breast cancer cell lines. Cancer Lett. 2015;364: 44–58. pmid:25917568
  62. 62. Thiele I, Swainston N, Fleming RMT, Hoppe A, Sahoo S, Aurich MK, et al. A community-driven global reconstruction of human metabolism. Nat Biotechnol. Nature Publishing Group, a division of Macmillan Publishers Limited. All Rights Reserved.; 2013;31: 419–25. pmid:23455439
  63. 63. Guo J, Li X, Tan Z, Lu W, Yang G, Guan F. Alteration of N-glycans and expression of their related glycogenes in the epithelial-mesenchymal transition of HCV29 bladder epithelial cells. Molecules. Multidisciplinary Digital Publishing Institute; 2014;19: 20073–90. pmid:25470275
  64. 64. Liu R-M, Gaston Pravia KA. Oxidative stress and glutathione in TGF-beta-mediated fibrogenesis. Free Radic Biol Med. 2010;48: 1–15. pmid:19800967
  65. 65. Deshiere A, Duchemin-Pelletier E, Spreux E, Ciais D, Combes F, Vandenbrouck Y, et al. Unbalanced expression of CK2 kinase subunits is sufficient to drive epithelial-to-mesenchymal transition by Snail1 induction. Oncogene. Macmillan Publishers Limited; 2013;32: 1373–83. pmid:22562247
  66. 66. Uhlen M, Fagerberg L, Hallstrom BM, Lindskog C, Oksvold P, Mardinoglu A, et al. Tissue-based map of the human proteome. Science (80-). American Association for the Advancement of Science; 2015;347: 1260419–1260419. pmid:25613900
  67. 67. Gazdar AF, Minna JD. Deregulated EGFR signaling during lung cancer progression: mutations, amplicons, and autocrine loops. Cancer Prev Res (Phila). 2008;1: 156–60.
  68. 68. Vogel C, Marcotte EM. Insights into the regulation of protein abundance from proteomic and transcriptomic analyses. Nat Rev Genet. 2012;13: 227–32. pmid:22411467
  69. 69. Maier T, Güell M, Serrano L. Correlation of mRNA and protein in complex biological samples. FEBS Lett. 2009;583: 3966–73. pmid:19850042
  70. 70. Väremo L, Scheele C, Broholm C, Mardinoglu A, Kampf C, Asplund A, et al. Proteome- and transcriptome-driven reconstruction of the human myocyte metabolic network and its use for identification of markers for diabetes. Cell Rep. 2015;11: 921–33. pmid:25937284
  71. 71. Vastrik I, D’Eustachio P, Schmidt E, Joshi-Tope G, Gopinath G, Croft D, et al. Reactome: a knowledge base of biologic pathways and processes. Genome Biol. BioMed Central Ltd; 2007;8: R39. pmid:17367534
  72. 72. Rolfsson O, Palsson BØ, Thiele I. The human metabolic reconstruction Recon 1 directs hypotheses of novel human metabolic functions. BMC Syst Biol. BioMed Central Ltd; 2011;5: 155. pmid:21962087
  73. 73. Grant M, Boyd S. CVX: Matlab software for disciplined convex programming, version 2.0 beta.
  74. 74. Grant M, Boyd S. Graph implementations for nonsmooth convex programs, Recent Advances in Learning and Control (a tribute to M. Vidyasagar). Lect Notes Control Inf Sci. 2008; 95–110.
  75. 75. Gurobi Optimization I. Gurobi Optimizer Reference Manual. 2015;
  76. 76. Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, et al. NCBI GEO: archive for functional genomics data sets—update. Nucleic Acids Res. 2013;41: D991–5. pmid:23193258
  77. 77. Javaid S, Zhang J, Anderssen E, Black JC, Wittner BS, Tajima K, et al. Dynamic Chromatin Modification Sustains Epithelial-Mesenchymal Transition following Inducible Expression of Snail-1. Cell Rep. 2013;5: 1679–1689. pmid:24360956
  78. 78. Tam WL, Lu H, Buikhuisen J, Soh BS, Lim E, Reinhardt F, et al. Protein kinase C α is a central signaling node and therapeutic target for breast cancer stem cells. Cancer Cell. 2013;24: 347–64. pmid:24029232
  79. 79. McGrail DJ, Mezencev R, Kieu QMN, McDonald JF, Dawson MR. SNAIL-induced epithelial-to-mesenchymal transition produces concerted biophysical changes from altered cytoskeletal gene expression. FASEB J. 2015;29: 1280–9. pmid:25491311
  80. 80. Cai J, Guan H, Fang L, Yang Y, Zhu X, Yuan J, et al. MicroRNA-374a activates Wnt/β-catenin signaling to promote breast cancer metastasis. J Clin Invest. 2013;123: 566–79. pmid:23321667
  81. 81. Schellenberger J, Palsson BØ. Use of randomized sampling for analysis of metabolic networks. J Biol Chem. 2009;284: 5457–61. pmid:18940807
  82. 82. Jain M, Nilsson R, Sharma S, Madhusudhan N, Kitami T, Souza AL, et al. Metabolite profiling identifies a key role for glycine in rapid cancer cell proliferation. Science. 2012;336: 1040–4. pmid:22628656