Figures
Abstract
We investigate how protein-protein interactions (PPIs) can regulate carbohydrate metabolism in Escherichia coli. We specifically investigated the stoichiometry of 378 PPIs involving carbohydrate metabolic enzymes. In 48 interactions, the interactors were much more abundant than the enzyme and are thus likely to affect enzyme activity and carbohydrate metabolism. Many of these PPIs are conserved across thousands of bacteria including pathogens and microbial species. E. coli adapts to different cellular environments by adjusting the quantities of the interacting proteins (25 PPIs) in a way that the protein-enzyme interaction (PEI) is a likely mechanism to regulate its metabolism in specific environments. We predict 3 PPIs (RpsB-AdhE, DcyD-NanE and MinE-Yccx) previously not known to regulate metabolism.
Citation: Chowdhury S, Fong SS, Uetz P (2025) The protein interactome of Escherichia coli carbohydrate metabolism. PLoS ONE 20(2): e0315240. https://doi.org/10.1371/journal.pone.0315240
Editor: Hari S. Misra, Gandhi Insititute of Technology and Management, INDIA
Received: April 29, 2024; Accepted: November 21, 2024; Published: February 4, 2025
Copyright: © 2025 Chowdhury et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data for this study are within the paper and its Supporting Information files.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
1 Introduction
Carbohydrate metabolism (CHM) is responsible for converting sugars and other compounds into other metabolites and ATP and occurs in all organisms from bacteria to humans [1]. Carbohydrate metabolism includes about a dozen sub-pathways in bacteria and thus can involve more than a hundred metabolic enzymes [1]. In E. coli, the focus of this study, CHM involves 13 sub-pathways (Table 1 and Fig 6).
Central carbon metabolism is the shortest route for producing energy and biomass in Escherichia coli [3]. In addition, CHM provides the substrates for many other metabolic pathways, such as lipid or amino acid metabolism [2, 4]. Given that carbohydrate and energy metabolism in general is critical for cellular survival, it is usually tightly regulated [3]. Different regulatory mechanisms have been identified, including gene expression [5], post-translational modification (PTMs) [6], enzyme localization [7], or allosteric control [8]. In addition, protein-protein interactions (PPIs) are a common mechanism to regulate protein activity. Although numerous enzymes have been shown to be regulated by PPIs, it is difficult to study the impact of PPIs on metabolism on a large scale. For instance, we have demonstrated how several metabolic enzymes in Escherichia coli are regulated by PPIs, including enzymes in energy metabolism, carbohydrate metabolism or amino acid metabolism [8–11]. Notably, a single protein such as Hpr can both activate and repress enzymes in the same pathway such as glycolysis [10]. It is well-known that many enzymes are regulated by interactions in large regulatory networks [12].
Building upon a previous pilot study on glycolysis [13], here we investigate the connection between interactomics (PPI networks) and metabolomics (metabolic networks) on a more comprehensive scale, focusing on carbohydrate metabolism (CHM). We wondered to what extent carbohydrate metabolism can be regulated via protein-enzyme interactions of its enzymes [14] and how many such interactions are there for different carbohydrate pathways [15]. For example, if a protein binds an enzyme metabolizing inositol phosphate, that protein-enzyme interaction (PEI) will be considered as part of the inositol phosphate pathway. Central carbohydrate pathways such as glycolysis, TCA cycle, pyruvate metabolism and HMP shunt have a high number of PEIs which is not surprising as these are primary fueling pathways with high number of enzymes [16–18]. Importantly, a good number of uncharacterized proteins also bind carbohydrate enzymes, possibly shedding light on novel metabolic regulatory mechanisms [19, 20].
Proteins interacting with carbohydrate metabolic enzymes may perform important functions as these E. coli PEIs can be predicted in more than 4000 bacterial genomes (40% of the known genomes). Such potentially conserved interactions are good candidates for future experimental analysis and thus may reveal novel regulators of metabolism [21, 22] (Table 3). Therefore, a further understanding of these carbohydrate metabolism PEIs may contribute to applications such as biofuel production [23–25], antibiotic development [11, 26] or comprehending metabolism functions of the human microbiome [27].
In this study, we identified 25 PEIs which likely perform metabolic regulatory roles based on the abundance of enzymes and their interacting proteins [28]. We verify this by using published protein abundance data of 2600 E. coli proteins across 22 environmental conditions, including multiple carbon sources [29].
2 Materials and methods
2.1 Metabolic enzymes and their pathway information
We obtained the list of metabolic genes of carbohydrate metabolism divided into different carbohydrate pathways from KEGG [2, 30] (Table 1, for full list see S1 Table in S1 File). These genes were mapped onto proteins of E. coli K-12 (proteome ID: UP000000625) using UniProt’s ID mapping feature [31]. This resulted in a list of all E. coli metabolic enzymes categorized into 15 carbohydrate metabolic pathways.
2.2 Protein-enzyme interaction data
We used the IntAct database to obtain E. coli protein-protein interactions (PPI) [32]. The PPIs in IntAct are annotated by interaction type, detection method and experimental evidence giving them a PPI score. If the PPI score exceeds a certain threshold based on these parameters, it is included in the database with the literature evidence information [33, 34].
IntAct divides the PPIs into spoke and non-spoke expanded interactions. Spoke-expanded interactions are PPIs obtained from protein complexes from which binary PPIs are inferred without knowing whether these interactions physically take place [32, 35]. Hence, we focused on non-spoke expanded interactions that is, primarily binary protein-protein interactions whether they are from (binary) protein complexes or other sources. From this set, we chose the PPIs with at least one of the two proteins being a metabolic enzyme. A list of E. coli metabolic enzymes was already obtained earlier from KEGG and UniProt (Section 2.1). This list of protein-enzyme interactions (PEIs) had a total of 299 binding partners (“interactors”) and 378 interactions (S1 Table in S1 File).
2.3 Inferring what kind of proteins bind metabolic enzymes
Interactors were classified into enzymes, metabolic proteins, and other (un-) characterized proteins. UniProt provided the EC number for proteins which are enzymes [36]. We used this feature to classify the interactors into enzymes and non-enzymes first. Then ToppGene (a gene enrichment software [37]) was used to classify the enzyme and non-enzyme interactors into metabolic and non-metabolic (Fig 1). The list of interactors was then processed in ToppGene to produce a list of Gene Ontology (GO) Biological Process terms along with the interactors falling under that GO term [38]. Only the “Metabolic Process” GO terms were chosen. If the interactor listed in these GO terms was an enzyme, it was called 1) metabolic enzyme (Acetyl-CoA carboxylase, Enolase); the remaining enzymes were called 2) non-metabolic enzymes (e.g., Multiphosphoryl transfer protein, ATP-binding protein) (Figs 1 and 5). In the same way, we classified the non-enzyme interactors into 3) metabolic (e.g., Cytochrome YdhU, Cellulose biosynthesis protein) and 4) non-metabolic proteins (e.g., Biopolymer transport protein, Inner membrane protein). Interactors whose protein names mention “Uncharacterized” (e.g., Fimbrial-like protein) or lacked GO annotations (e.g., Electron transport protein) were collectively called “unknown proteins”.
2.4 Visualizing carbohydrate metabolism using KEGG mapper
In order to put carbohydrate metabolism in a broader context, we highlighted CHM in E. coli’s metabolome (KEGG: map01100) in Fig 9. The figure was constructed using KEGG mapper and each of E. coli’s 15 different carbohydrate metabolic pathways were color-coded separately [2, 39].
2.5 Conservation of E. coli PEIs
UniProt and EggNog provided clusters of ortholog (COG) group IDs. This COG group lists all the bacterial ortholog (OG) species where that protein is found as an ortholog [40]. We obtained that list of OG species from EggNOG for all E. coli enzymes and their interactors. Then, for each PEI, we counted the number of OG species in which both the enzyme and its interactor co-occurs and we called this number “conservation value” (Figs 2 and 8). S1 Table in S1 File lists the conservation values for all the E. coli PEIs. Presence of the orthologs of both the enzyme and interactor from E. coli is likely to indicate a conserved interaction in that ortholog species [41]. We have also listed some bacterial species known to cause diseases in Table 3 where many E. coli PEIs (or rather predicted PEIs) are conserved.
YhiJ-SgbE is an example of a poorly conserved interaction with YhiJ present in 8 genomes and SgbE present in 7128 genomes, hence both proteins are found in only 7 genomes (the conservation value, CV, thus being 7). Similarly, the CV of NusG-GapA is 10,277, given that orthologs of both proteins are found in 10,277 genomes. Note that these values represent predicted PPIs.
2.6 Protein abundance of E. coli protein-enzyme interactions
Schmidt et al. experimentally determined the abundance of ~2600 E. coli proteins in 22 different growth conditions [29], measured as protein copies per cell (Figs 3, 9, 11, 14 and S1 Table in S1 File). We used these values to determine the in vivo ratios of PEIs.
In specific growth media (A), E. coli transcription factor ArcA will increase the expression of co-regulated genes FrdB and FrdA (B) and thus the enzyme complex (fumarate reductase) which metabolizes fumarate (A). The interactor FrdB becomes the regulatory factor which allosterically regulates the activity of the metabolic enzyme FrdA. In x-axis, G_ means glucose media, so, G_osm_stress is glucose media with high NaCl concentration. _E and _I for each media in x axis are the same media but _E shows abundance of enzyme and _I shows abundance of interactor in that media.
2.7 Obtaining protein structures, identifying contact residues at protein-protein interfaces, and calculating their fractions
Single protein structures as well as protein-protein interaction complexes in Section 3.7 and 3.8 were obtained with the help of ColabFold [42]. This is an AlphaFold-based protein structure software. The Predicted Template Modeling (PTM) and (Interface PTM) iPTM scores [43] mentioned for each modeled protein and protein-enzyme structure respectively, range from 0 to 1, indicating high quality and reliability of the predicted 3D structure if a score is > = 0.95.
ChimeraX software was then used to visualize the predicted protein structures, and identify the binding site residues at PPI interfaces [44]. This software considers two atoms in contact if they are within a 4 Å distance [45]. Therefore, with the help of ChimeraX, we determined which residues are in contact (interface residues) between two proteins [48]. Once we obtained those, we calculated the fraction (%) of overlapping binding sites between secondary interactors, primary interactors and metabolic enzymes (Section 3.7) using the structures from ColabFold [46, 88]. We first counted how many residues are involved in the primary interaction, that is, interface residues between the primary interactor and the enzyme. After that, we determined how many of these residues are also involved in the secondary interaction, that is, interface residues between secondary interactor and primary interactor (or enzyme). We report these fractions (%) of overlapping binding residues between primary and secondary interactions in Table 5 and Fig 12.
2.8 Calculating buried surface area and binding energy
The buried surface area (BSA) is the surface area of proteins which is found at the PPI interface [46]. Before calculating BSA, first, we obtained structures of individual proteins making PPIs from ColabFold [42]. Then, by loading the structures (PDB files) of the binding proteins and then of the PPI complex in PyMol, we obtained the surface area of those proteins (or PPIs) structures by using the “get_area” PyMol command [47]. Once we have the surface area (SA) of 1) two interacting proteins individually and 2) PPI complex, BSA is the sum of surface areas of two proteins excluding surface area of the protein-protein complex. For example, the BSA of secondary interaction RplD-Def = SA of RplD + SA of Def–SA of RplD-Def complex. We calculated the buried surface area (BSA) between secondary interactor and primary interactor/enzyme (secondary interaction) and between primary interactor and metabolic enzyme (primary interaction). Then, we compared those values to find which interaction (primary or secondary) has the bigger BSA (Fig 12) [47].
We finally determined the binding energy (ΔG) between two proteins [48] using the PRODIGY server [49], were inputting the two PDB files for two proteins binding each other outputs the ΔG value of their interaction. We used the server to determine the ΔG value of secondary (secondary interactor—primary interactor/enzyme) and primary interactions (primary interactor—metabolic enzyme) (Fig 12). However, given the uncertainty of these calculations based on the predicted structures and interfaces, we decided not to use these results in our final analysis.
3 Results & discussion
3.1 Selecting 16 PEIs as probable regulators of metabolism
We illustrate in Fig 4 how we started from 378 PEIs and reached the number of 16 PEIs which were later reported as potential metabolism regulators. The figure shows how we filtered out PEIs during each analysis.
3.2 The carbohydrate metabolism protein interactome
There are 242 enzymes which metabolize carbohydrates (Table 1), including 146 enzymes (60%) which bind 299 other proteins (or interactors). Multiple enzymes can have a common interactor or multiple interactors binding one enzyme (Table 2). A summary of these is provided in Table 2.
Carbohydrate metabolism enzymes interact primarily with metabolic enzymes. This is not surprising as proteins of similar functions often bind to each other [41, 50]. It is well-known that carbohydrate metabolism is controlled through protein-protein interactions between the enzymes of this pathway. For instance, interactions between GltA of the TCA cycle and PykF of glycolysis [51], Gnd of the Pentose-phosphate shunt and PckA of the TCA cycle are well known [52]. Carbohydrate enzymes also bind 12 enzymes not involved in the 11 E. coli metabolic pathways [2], such as Chaperonin GroEL [53], Arginine import ATP-binding proteins (AraG) [54], Flagellum-specific ATP synthase (FliI) [55] and Multiphosphoryl transfer protein 1 (FryA) [56].
45 interactors of carbohydrate enzymes are metabolic proteins but not enzymes (indicated by not having an EC number). These proteins mainly include transcription factors, chaperones and ribosomal proteins. There are 59 non-metabolic proteins interacting with metabolic enzymes and thus may play a role in metabolism (Fig 5) [57].
Shown are the 299 interactors of carbohydrate metabolism. They also interact with 104 non-enzymes comprising 45 metabolic and 59 non-metabolic proteins. E. coli carbohydrate enzymes also bind 54 proteins of unknown function.
Surprisingly, E. coli carbohydrate enzymes also interact with 54 proteins that are either uncharacterized or that lack annotation (S1 Table in S1 File). Among them, 3 unknown proteins show potentially relevant expression levels: YadG, YeeD and DcrB. These three proteins are at least 3 times more abundant than their enzyme partners GadB, MurA and GloC respectively. Investigating these interactions involving unknown proteins experimentally may shed light on their metabolic or non-metabolic roles [20, 58, 59].
3.3 Count of protein-protein interactions per carbohydrate pathway
The protein-enzyme interactions were classified into different metabolic pathways based on the pathway of the enzyme. For example, PEI GlpK-Eno will be part of glycolysis, given that Enolase is a glycolytic enzyme. We found that pathways with more metabolic enzymes bind more proteins and thus, have more PEIs (Fig 6). For instance, glycolysis and TCA cycle have 26 and 19 enzymes and 83 and 70 PEIs, respectively. Metabolites of pathways such as glycolysis and TCA often function as branchpoints to other metabolic pathways, hence they affect these other pathways too, such as amino acid or lipid metabolism. Therefore, these pathways have large numbers of enzymes and it is very likely that PPIs regulate these pathways [60].
Pathways with more enzymes bind more proteins compared to pathways with low count of enzymes.
It is not surprising that pathways with more enzymes bind more interactors. Therefore, we used another measure to reflect on the count of PEIs per metabolic pathway which is how many interactors bind a metabolic enzyme on average. The density value is equal to the number of interactors divided by the number of enzymes and we calculated it for each of the 11 carbohydrate pathways. The average of those values was 2.68. Glycolysis, TCA cycle and pyruvate metabolism pathways had density values > 3. Inositol phosphate metabolism, galactose metabolism and the pentose & glucoronate interconversions pathway had density values of 3.5, 2.9 and 4.3 respectively (Fig 7). This means that these pathways with a low number of enzymes (2, 14 and 6) bind a relatively high number of interactors (7, 38 and 26 respectively). Overall, out of 11 E. coli metabolic pathways [30] including lipid, amino acid or nucleotide metabolism pathways, we find that carbohydrate metabolism has the largest number of enzymes, reactions, interactors and thus, PEIs. Fig 7 illustrates how much space carbohydrate metabolism pathway takes in the entire E. coli metabolome with density values shown in () after the pathway names.
With 11 different sub-systems, we found that E. coli’s carbohydrate metabolic pathway has the largest number of enzymes (146), proteins (299) and protein-enzyme interactions (378) compared to other pathways such as lipid (33,108,119) or amino acid metabolism (111,200,261) [3, 61]. The number in parentheses are average interactor counts per enzyme (density values) for that pathway. The E. coli metabolome map can be found at https://www.genome.jp/pathway/map01100.
3.4 Conservation of carbohydrate PEIs
A protein with orthologs in a large number of species is more conserved and therefore, may play important biological functions [62]. Thus, we tested the conservation of E. coli PEIs by counting in how many bacterial genomes these PEIs co-occur as orthologs. Overall, we considered 10000 bacterial genomes listed in the EggNOG database [40]. For each 378 PEIs, we calculated the number of bacterial genomes where both the interactor and its binding metabolic enzyme co-occur. The average number of bacterial genomes where E. coli PEIs are found is 4307 (green line in Fig 8). Therefore, in 43% of bacterial genomes, proteins are predicted to bind carbohydrate enzymes.
E. coli pathways including central carbohydrate pathways have more conserved PEIs than others. The average conservation value per pathway is shown by a X and for all pathways, it is shown by a vertical line. Each dot represents the number of bacterial genomes where a PEI co-occurs as orthologs, so, in total, there are 378 dots for 378 PEIs.
If we focus on individual pathways, the average conservation value per pathway (X in Fig 8) is higher than 4307 for the central carbohydrate pathways of glycolysis, TCA cycle and HMP shunt. If metabolic regulation happens at protein-protein interaction (PPI) levels, we assume it will happen more for these pathways [3, 63]. Pathways which E. coli uses only in specific metabolic environments like starch and sucrose, ascorbate and aldarate, inositol phosphate metabolism and amino sugar and nucleotide sugar metabolism were less conserved as the average conservation value (X) was less than 4307 for these pathways (Fig 8). Overall, all pathways span a large range of conservation, i.e. some may be highly conserved in one species but not in others. This is not surprising, given that closely related species will have higher conservation levels than distantly related, ecologically divergent species.
Of the 378 carbohydrate metabolism PEIs in E. coli (Table 2), on average, there are 150 PEIs found in a bacterial genome. We list the top 5 bacterial genomes in each of these three groups in Table 3. For example, 354 E. coli PEIs are found as orthologs in Shigella sonnei 53G which is the causative agent of bacterial dysentery [64]. Shigella and other γ-proteobacteria are close relatives of E. coli, therefore, the number of co-occurring PEIs is high in them. With increasing phylogenetic distance, the number of orthologous pairs decreases, but equally important, the similarity of proteins also decreases, so many of the predicted interactions based on sequence similarity may be lost in evolution. Nevertheless, we expect that PEIs can regulate metabolism not only in E. coli but in many other species where they co-occur. Hence, experimental investigation of E. coli PEIs may help us to 1) develop novel therapeutic strategies [65], 2) understand novel metabolic functions of human microbiome [66] and 3) design new applications in biotechnology [67].
3.5 Abundances of metabolic enzymes and their binding partners suggest novel metabolic regulation
We wondered how many of the 378 PEIs may be metabolic regulators in E. coli. Importantly, regulation often depends on the stoichiometry of an enzyme and its regulator [68]. Hence, we obtained abundance values in the form of protein copies per cell for the E. coli proteins from Schmidt et al [29] who determined abundance of approx. 2600 E. coli proteins in 22 different environments. These conditions included multiple carbon sources, complex media, chemostat cultures as well as cultures under stress (Fig 9 x axis). Schmidt et al [29] provides abundance data for 179 PEIs out of 378 that we identified earlier. For an interactor to regulate the functioning of the enzyme when it binds it, it can be assumed that it should be in higher amounts to do it when it binds the metabolic enzyme [69]. Following this assumption, we calculated the ratio of Interactor/Enzyme copies for each of the 179 PEIs. Then, we computed how many PEIs had ratios of at least 100, 10, 5 and 2 in at least one environmental condition (Fig 10). We focused on an arbitrary ratio of Interactor/Enzyme copies > = 5 in at least one growth media (Fig 10). Much higher thresholds (e.g., 100-fold) result in only 7 PEIs for the next sets of investigations and a very low threshold (2-fold) may lead to false positives (73) [70]. Therefore, we somewhat arbitrarily chose 5-fold as threshold which gave us 48 PEIs (out of 179) for further investigations.
The ratio (abundance of interactor/abundance of enzyme) for 25 E. coli PEIs (y axis) is shown for 22 different E. coli environments (x axis). Ratios were divided into 5 different ranges shown as a colored brick. Blue bricks denote a ratio of 0–0.5 while a green brick shows values from 10–100. A blue or black brick indicates that the enzyme is more or equally abundant than the interactor in that media. A green, yellow or red brick shows that the interactor is more abundant than the enzyme in that media. G_ means glucose media, so, for example G_42C means glucose media at 42°C.
27% of E. coli PEIs (48 PEIs out of 179 PEIs) showed fold change of > = 5 in at least one cell growth condition (blue bar) [29]. These cases were investigated in more detail to delineate their abundance properties in 22 different metabolic states (Fig 8) [29].
In 48 PEIs the number of interactor molecules was at least 5 times than the number of enzyme molecules in at least one growth media. To identify which of the 48 PEIs have conditional effects (possible regulatory) we identified those PEIs for which the ratio of interactor copies/enzyme copies is > = 5 in one growth media and < = 1 in a different growth media, indicating a condition-dependent change in the PEI. Condition-dependent changes to PEIs are a significant indicator associated with regulation of metabolism [28, 71]. Out of 48 PEIs, 25 PEIs (Fig 9) showed condition-dependent changes in abundance ratios. A large range of ratios, from > 100 copies of interactor:enzyme (shown in red in Fig 9) to < 0.5 (indicating more enzyme than interactor, shown in blue in Fig 9) was observed.
3.6 Some enzyme abundances change independent from interactors
Absolute values of interactor and enzyme abundance were also analyzed in the 22 different growth conditions. We found 4 PEIs (among 25) where only the enzyme changed its abundance. These 4 PEIs were 1) Frr-GalE, 2) Frr-GalK, 3) YadG-GadB and 4) Eno-GlpK.
Frr is a ribosomal protein which binds the enzymes GalE and GalK. GalK is galactokinase (GalK) and UDP-glucose 4-epimerase (GalE), and these enzymes get activated in media with galactose [72]. We observed that across all 22 growth conditions, Frr is consistently higher in abundance than GalK and GalE except galactose media where both these enzymes become more abundant than Frr (S1 and S2 Figs).
GadB is usually expressed at low levels while its binding partner; YadG is highly abundant across all growth media. Glucose media at pH6 (“G_pH6) is an exception with GadB still being lower in abundance than YadG but the difference in abundance between these two proteins becomes very small due to upregulation of GadB (S3 Fig). GadB expression is known to be upregulated when E. coli faces acidic stress [73].
The Eno-GlpK interaction is an enzyme-enzyme interaction between glycolytic enzyme Enolase and Glycol kinase GlpK. In 21 of the 22 growth conditions, GlpK is lower in abundance than its interactor, Eno, with growth on glycerol being the one exception [74] (Fig 11). In complex media Glycerol + AA, GlpK shows an increase in abundance. In these 4 cases the metabolic enzyme increased abundance levels when the metabolic function was required independently from interactor abundances.
GlpK; glycerol kinase is a glycerol processing enzyme which increases its abundance in glycerol and glycerol + AA media. Its binding partner Eno may not be responsible for this increase in abundance of GlpK as it does not show itself any condition specific abundances. On the x axis, G means glucose media and G_pH6 means glucose media at pH6. For each of the 22 conditions, there are duplicates of that such as Fumarate_E and Fumarate_I. _E means the abundance of Enzyme in that condition whereas _I mean abundance of Interactor in that condition.
3.7 Enzymes and interactors compete with other proteins to bind to each other
Enzymes are not just regulated by single interactors but yet other proteins may compete or interfere with these interactions. We called these proteins “secondary interactors” and focused on 21 such PEIs [75]. These 21 interactions, involved primary interactors with at least 5-fold excess over the enzyme in at least one growth media but switched to a different ratios in other growth media (Section 3.5). Note that 48 PEIs had primary interactors with at least 5-fold excess over their enzyme in at least one growth media but only 25 PEIs among them reversed their stoichiometry in some condition(s), so we focused on those remaining 23 PEIs (48–25). There are 2 cases to consider (Fig 12):
Case 1. In those 23 PEIs, there can be a secondary interactor with abundance higher than both the metabolic enzyme and the primary interactor, so, they may occupy most of the binding sites on the enzyme and hence, the primary interactor may not be able to bind the metabolic enzyme (Fig 12A).
Case 2. The primary interactor is more abundant than the metabolic enzyme, but the abundance of the secondary interactor is much higher than the primary interactor, so it will occupy most of the binding sites on the primary interactor and the metabolic enzyme may not get chance to bind the primary interactor (Fig 12B).
Primary interactions (A) YiaD-GlgX (Case 1) and (B) FdoG-FdoH (Case 2) may get affected by their secondary interactors. The structures were obtained from ColabFold [42] and the PTM scores [43] were above 0.6 for all of them reflecting high confidence in predicting the structures of these proteins accurately. The residues highlighted in red on primary interactor are binding sites where all secondary interactors and metabolic enzyme bind that protein.
In order to investigate the cases described above, we obtained the abundance values of secondary interactors of enzymes (Case 1) and primary interactors (Case 2) in Glucose and LB media and investigated the impact of abundance, binding sites, surface area, and binding energy:
Abundance: In order to determine which PEIs get affected by secondary interactions, we used the abundance of secondary interactors of an enzyme. For instance, Case 1 is represented by MsrA (687 copies per cell) which binds GlgX (111 copies) (Fig 12A and Table 4). LeuB (4588), IlvH (1533) and MqsA (34) bind IlvI (441) as its secondary interactors (Table 4). For Case 2, we calculated the abundance ratios of (secondary interactor / primary interactor) / (primary interactor / metabolic enzyme). If this ratio is > = 5, we considered that secondary interactors might hinder the primary interactor–enzyme binding (Section 3.5). Examples of this cases are PEIs FdoG-FdoH (Fig 12B) or Def-FucO (see Table 4 for other examples).
Int1, Int2 = primary interactor, secondary interactor of enzyme. G:L = abundance in glucose and LB media (a single value indicates that only abundances for media with glucose are available, e.g. in the case of GlgX).
Binding sites: Secondary interactors can block primary interactors if they compete for the same binding (Fig 12). Therefore, for the 5 PEIs in Table 4, we attempted to determine the binding sites among enzymes and (primary and secondary) interactor [76] as described in (Section 2.7) [42, 77]. The main results of this analysis are summarized in Table 5.
Overall, the PEIs YiaD-GlgX (Fig 12B), RimM-IlvI and DcrB-GloC tested positive for Criteria 1 (Abundance) and Criteria 2 (Binding sites). However, they did not give the results we hypothesized in Criteria 3 (BSA) and Criteria 4 (ΔG), hence we excluded this data from further analysis. We must keep in mind that we included the effects of all secondary interactors (cumulative abundance) in Criteria 1 but in Criteria 2, 3 and 4, we focused on only the most highly abundant secondary interactor binding either the enzyme (Case 1) or primary interactor (Case 2) (Table 5). Therefore, a single secondary interaction may not show a high BSA but if cumulative effects of all secondary interactors is considered, these 3 PEIs may test positive in Criteria 2 or 3 but it is very difficult to determine and would make things bit complex. We predict PEIs Def-FucO and FdoG-FdoH (Fig 12B) have more chances of getting obstructed by secondary interactors RplQ/D and FdhE based on these criteria. It is very hard to predict the actuality of secondary interactions computationally, however, in this study for the first time, we report it as a novel metabolic regulatory mechanism affecting the interaction between a primary interactor and metabolic enzyme.
Note that it is possible that secondary interactors may be both competitors [75] or activators [78]. For example, secondary interactor FdhE may also act as a promoter of Formate dehydrogenase complex formation [79].
3.8 Predicting potential regulators of Escherichia coli carbohydrate metabolism
3.8.1 Multi-level regulation of metabolism.
The fumarate reductase complex is a good example for multi-level regulation of protein complexes in metabolism. FrdB interacts with metabolic enzyme FrdA to form the fumarate reductase enzyme complex [84]. FrdA is the catalytic subunit and FrdB is the regulatory subunit [86]. The original studies did not investigate the endogenous stoichiometry in any details, but Schmidt et al. [29] found both of them at high abundance in fumarate media (Fig 11A). Transcription factor ArcA (Aerobic respiration control protein) causes high production of these two proteins when a major nutrient is fumarate (Fig 11B) [85].
3.8.2 Enzyme-enzyme interactions.
Enzymes catalyzing one metabolic reaction often bind and regulate other enzymes (Fig 13). Out of 16 PEI candidates (Table 6), 4 of them are enzyme-enzyme interactions. 1) GltA (TCA cycle) binds PykF (glycolysis) (Fig 13) [51, 87]. 2) Gnd pentose-phosphate shunt) binds PckA (TCA cycle) [52, 89]. 3) Gnd also binds enzyme PepD (amino acid metabolism). Both these proteins are highly expressed in Salmonella Typhi, the causative agent of typhoid in humans [90]. 4) Glycolytic enzyme PykA interacts with GatD (galactose pathway) and both are inhibited by DicF which is an sRNA helping E. coli adapt to changing environments [88]. Overall, we found that metabolic enzymes prefer other metabolic enzymes as their binding partners (Fig 5) [50, 98–100].
3.8.3 Protein-protein interactions in sulfur metabolism.
Among the 16 PEI regulators, one protein interacts with three different carbohydrate enzymes. The putative sulfur carrier protein TsuB binds MurA (peptidoglycan biosynthesis), GabT (Butanoate metabolism) and YahK (glycolysis). TsuB has been reported to physically interact with many proteins to facilitate thiosulfate uptake in E. coli [91, 92] which is an important process during carbohydrate metabolism regulation [93, 94]. Therefore, further experimental investigations on these 3 PEIs involving TsuB may give us more insights on what role sulfur assimilation plays during carbohydrate metabolism in E. coli.
3.8.4 Protein-enzyme interaction specific literature evidence.
The functions of 3 PEIs are supported by published data: TalB-LeuD, YbgI-YqhD and PtsH-ManX. PtsH (= Hpr) binds another phosphotransferase system protein, ManX and both these proteins are inhibited by the transcription repressor Mlc. Mlc is known to repress the expression of Pts proteins when E. coli does not need glycolysis in low glucose conditions [5, 96]. ManX and PtsH both are Pts proteins [101]. PtsH protein interacts with and allosterically regulates multiple E. coli metabolic enzymes [10].
Next, interactor YbgI binds the metabolic enzyme YqhD and both YbgI and YqhD are known to increase their amounts in LB condition when E. coli cells experience high vanillin in their cell environments [97].
Fig 14 illustrates both Interactor TalB and its binding partner, enzyme LeuD decrease their abundance when E. coli goes through stationary phases. This is happening due to the function of a third protein, ClpX [95]. This protein degrades both TalB and LeuD when E. coli switches to stationary phase implying these proteins facilitate cell growth.
Both these interacting proteins become low in abundance due to growth inhibitor ClpX degrading them [95] when E. coli goes through stationary phase highlighted in pink circle. G_ means glucose media, e.g., G_42C is glucose media at 42°C. E indicates abundance of enzyme and I = abundance of interactor in that media.
4 Conclusion
This project is the first attempt to look at regulation of carbohydrate metabolism via the binding partners of metabolic enzymes in Escherichia coli. Although highly desirable, the experimental investigations of the predicted PEIs was beyond the scope of this study. However, we selected 16 PEIs as good candidates for experimental analysis with a good chance of these predictions being metabolism regulators. Our study predicts 3 PEIs as metabolism regulators previously not studied: RpsB-AdhE, DcyD-NanE & MinE-YccX (Table 6) and many others involving uncharacterized proteins (Fig 5). Therefore, this opens doors for experimental biologists to investigate these PEIs further and to develop novel applications of these PEIs in E. coli and other bacteria. In this study, we showed how metabolism regulation can be dependent on quantities of binding proteins helping E. coli survive in multiple cell environments. Apart from that, based on features including abundance, their predicted binding affinity with other proteins etc., we reported potentially new protein-interaction regulatory mechanisms such as secondary interactions that can either obstruct a primary protein-enzyme interaction or increase its strength by forming metabolic enzyme complexes. Lastly, we were able to uncover new protein-enzyme interactions that are likely to function during sulfur metabolism, gene regulation-metabolism crosstalk and stationary phase in E. coli.
Supporting information
S1 Fig. Absolute abundance of PEI Frr-GalK in 22 different growth conditions.
https://doi.org/10.1371/journal.pone.0315240.s001
(TIF)
S2 Fig. Absolute abundance of PEI Frr-GalK in 22 different growth conditions.
https://doi.org/10.1371/journal.pone.0315240.s002
(TIF)
S3 Fig. Absolute abundance of PEI YadG-GadB in 22 different growth conditions.
https://doi.org/10.1371/journal.pone.0315240.s003
(TIF)
S4 Fig. Enzyme IlvB depends (enzyme of butanoate and C5-branched dibasic acid metabolism) on its interactor IlvN for its function.
https://doi.org/10.1371/journal.pone.0315240.s004
(TIF)
S5 Fig. Enzyme FdoG, (dehydrogenizing formate) and its interactor, FdhE simultaneously increase abundance dramatically in LB but are less abundant in other growth conditions.
https://doi.org/10.1371/journal.pone.0315240.s005
(TIF)
References
- 1. Dashty M. A quick look at biochemistry: Carbohydrate metabolism. Clinical Biochemistry. 2013;46: 1339–1352. pmid:23680095
- 2. Kanehisa M, Furumichi M, Tanabe M, Sato Y, Morishima K. KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 2017;45: D353–D361. pmid:27899662
- 3. Noor E, Eden E, Milo R, Alon U. Central carbon metabolism as a minimal biochemical walk between precursors for biomass and energy. Mol Cell. 2010;39: 809–820. pmid:20832731
- 4. Caspi R. MetaCyc: a multiorganism database of metabolic pathways and enzymes. Nucleic Acids Research. 2006;34: D511–D516. pmid:16381923
- 5. Commichau FM, Stülke J. Trigger enzymes: bifunctional proteins active in metabolism and in controlling gene expression: Trigger enzymes in transcription regulation. Molecular Microbiology. 2007;67: 692–702. pmid:18086213
- 6. Sternisha SM, Miller BG. Molecular and cellular regulation of human glucokinase. Arch Biochem Biophys. 2019;663: 199–213. pmid:30641049
- 7. Jurkowska RZ, Jeltsch A. Enzymology of Mammalian DNA Methyltransferases. Adv Exp Med Biol. 2016;945: 87–122. pmid:27826836
- 8. Rodionova IA, Goodacre N, Do J, Hosseinnia A, Babu M, Uetz P, et al. The uridylyltransferase GlnD and tRNA modification GTPase MnmE allosterically control Escherichia coli folylpoly-γ-glutamate synthase FolC. J Biol Chem. 2018;293: 15725–15732. pmid:30089654
- 9. Rodionova IA, Goodacre N, Babu M, Emili A, Uetz P, Saier MH. The Nitrogen Regulatory PII Protein (GlnB) and N-Acetylglucosamine 6-Phosphate Epimerase (NanE) Allosterically Activate Glucosamine 6-Phosphate Deaminase (NagB) in Escherichia coli. J Bacteriol. 2018;200. pmid:29229699
- 10. Rodionova IA, Zhang Z, Mehla J, Goodacre N, Babu M, Emili A, et al. The phosphocarrier protein HPr of the bacterial phosphotransferase system globally regulates energy metabolism by directly interacting with multiple enzymes in Escherichia coli. J Biol Chem. 2017;292: 14250–14257. pmid:28634232
- 11. Zamanian Azodi M, Peyvandi H, Rostami-Nejad M, Safaei A, Rostami K, Vafaee R, et al. Protein-protein interaction network of celiac disease. Gastroenterol Hepatol Bed Bench. 2016;9: 268–277. pmid:27895852
- 12. Lee D-Y, Fan LT, Park S, Lee SY, Shafie S, Bertók B, et al. Complementary identification of multiple flux distributions and multiple metabolic pathways. Metabolic Engineering. 2005;7: 182–200. pmid:15885617
- 13. Chowdhury S, Hepper S, Lodi MK, Saier MH, Uetz P. The Protein Interactome of Glycolysis in Escherichia coli. Proteomes. 2021;9: 16. pmid:33917325
- 14. Rausell A, Juan D, Pazos F, Valencia A. Protein interactions and ligand binding: From protein subfamilies to functional specificity. Proc Natl Acad Sci USA. 2010;107: 1995–2000. pmid:20133844
- 15. Chandel NS. Carbohydrate Metabolism. Cold Spring Harb Perspect Biol. 2021;13: a040568. pmid:33397651
- 16. Nie H, Ju H, Fan J, Shi X, Cheng Y, Cang X, et al. O-GlcNAcylation of PGK1 coordinates glycolysis and TCA cycle to promote tumor growth. Nat Commun. 2020;11: 36. pmid:31911580
- 17. Stincone A, Prigione A, Cramer T, Wamelink MMC, Campbell K, Cheung E, et al. The return of metabolism: biochemistry and physiology of the pentose phosphate pathway. Biol Rev. 2015;90: 927–963. pmid:25243985
- 18. Zuo J, Tang J, Lu M, Zhou Z, Li Y, Tian H, et al. Glycolysis Rate-Limiting Enzymes: Novel Potential Regulators of Rheumatoid Arthritis Pathogenesis. Front Immunol. 2021;12: 779787. pmid:34899740
- 19. Poverennaya E, Kiseleva O, Romanova A, Pyatnitskiy M. Predicting Functions of Uncharacterized Human Proteins: From Canonical to Proteoforms. Genes (Basel). 2020;11: 677. pmid:32575886
- 20. Hu P, Janga SC, Babu M, Díaz-Mejía JJ, Butland G, Yang W, et al. Global functional atlas of Escherichia coli encompassing previously uncharacterized proteins. PLoS Biol. 2009;7: e96. pmid:19402753
- 21. Heckmann D, Lloyd CJ, Mih N, Ha Y, Zielinski DC, Haiman ZB, et al. Machine learning applied to enzyme turnover numbers reveals protein structural correlates and improves metabolic models. Nat Commun. 2018;9: 5252. pmid:30531987
- 22. Paliy O, Gunasekera TS. Growth of E. coli BL21 in minimal media with different gluconeogenic carbon sources and salt contents. Appl Microbiol Biotechnol. 2007;73: 1169–1172. pmid:16944129
- 23.
Kaneko JJ. Carbohydrate Metabolism and Its Diseases. Clinical Biochemistry of Domestic Animals. Elsevier; 1997. pp. 45–81. https://doi.org/10.1016/B978-012396305-5/50004-X
- 24. Büsing F, Hägele FA, Nas A, Hasler M, Müller MJ, Bosy-Westphal A. Impact of energy turnover on the regulation of glucose homeostasis in healthy subjects. Nutr Diabetes. 2019;9: 22. pmid:31395858
- 25. Chen C-Y, Zhao X-Q, Yen H-W, Ho S-H, Cheng C-L, Lee D-J, et al. Microalgae-based carbohydrates for biofuel production. Biochemical Engineering Journal. 2013;78: 1–10.
- 26. Wiktorowicz JE, Soman KV. Discovery of Candidate Biomarkers. Adv Exp Med Biol. 2016;919: 443–462. pmid:27975230
- 27. Rosen CE, Palm NW. Functional Classification of the Gut Microbiota: The Key to Cracking the Microbiota Composition Code: Functional classifications of the gut microbiota reveal previously hidden contributions of indigenous gut bacteria to human health and disease. BioEssays. 2017;39: 1700032. pmid:28976007
- 28. Hua Q, Yang C, Oshima T, Mori H, Shimizu K. Analysis of Gene Expression in Escherichia coli in Response to Changes of Growth-Limiting Nutrient in Chemostat Cultures. Appl Environ Microbiol. 2004;70: 2354–2366.
- 29. Schmidt A, Kochanowski K, Vedelaar S, Ahrné E, Volkmer B, Callipo L, et al. The quantitative and condition-dependent Escherichia coli proteome. Nat Biotechnol. 2016;34: 104–110. pmid:26641532
- 30. Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28: 27–30. pmid:10592173
- 31. The UniProt Consortium, Bateman A, Martin M-J, Orchard S, Magrane M, Agivetova R, et al. UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Research. 2021;49: D480–D489. pmid:33237286
- 32. Kerrien S, Aranda B, Breuza L, Bridge A, Broackes-Carter F, Chen C, et al. The IntAct molecular interaction database in 2012. Nucleic Acids Res. 2012;40: D841–846. pmid:22121220
- 33. Orchard S, Ammari M, Aranda B, Breuza L, Briganti L, Broackes-Carter F, et al. The MIntAct project—IntAct as a common curation platform for 11 molecular interaction databases. Nucl Acids Res. 2014;42: D358–D363. pmid:24234451
- 34. Porras P, Barrera E, Bridge A, Del-Toro N, Cesareni G, Duesbury M, et al. Towards a unified open access dataset of molecular interactions. Nat Commun. 2020;11: 6144. pmid:33262342
- 35. Spirin V, Mirny LA. Protein complexes and functional modules in molecular networks. Proc Natl Acad Sci USA. 2003;100: 12123–12128. pmid:14517352
- 36. Duvaud S, Gabella C, Lisacek F, Stockinger H, Ioannidis V, Durinx C. Expasy, the Swiss Bioinformatics Resource Portal, as designed by its users. Nucleic Acids Research. 2021;49: W216–W227. pmid:33849055
- 37. Chen J, Bardes EE, Aronow BJ, Jegga AG. ToppGene Suite for gene list enrichment analysis and candidate gene prioritization. Nucleic Acids Research. 2009;37: W305–W311. pmid:19465376
- 38. Gene Ontology Consortium. The Gene Ontology (GO) database and informatics resource. Nucleic Acids Research. 2004;32: 258D – 261. pmid:14681407
- 39. Nishida K, Ono K, Kanaya S, Takahashi K. KEGGscape: a Cytoscape app for pathway data integration. F1000Res. 2014;3: 144. pmid:25177485
- 40. Hernández-Plaza A, Szklarczyk D, Botas J, Cantalapiedra CP, Giner-Lamia J, Mende DR, et al. eggNOG 6.0: enabling comparative genomics across 12 535 organisms. Nucleic Acids Research. 2023;51: D389–D394. pmid:36399505
- 41. Pagel P, Mewes H-W, Frishman D. Conservation of protein–protein interactions–lessons from ascomycota. Trends in Genetics. 2004;20: 72–76. pmid:14746987
- 42. Mirdita M, Schütze K, Moriwaki Y, Heo L, Ovchinnikov S, Steinegger M. ColabFold: making protein folding accessible to all. Nat Methods. 2022;19: 679–682. pmid:35637307
- 43. Zhu W, Shenoy A, Kundrotas P, Elofsson A. Evaluation of AlphaFold-Multimer prediction on multi-chain protein complexes. Cowen L, editor. Bioinformatics. 2023;39: btad424. pmid:37405868
- 44. Pettersen EF, Goddard TD, Huang CC, Meng EC, Couch GS, Croll TI, et al. UCSF ChimeraX: Structure visualization for researchers, educators, and developers. Protein Science. 2021;30: 70–82. pmid:32881101
- 45.
Adhikari B, Cheng J. Protein Residue Contacts and Prediction Methods. In: Carugo O, Eisenhaber F, editors. Data Mining Techniques for the Life Sciences. New York, NY: Springer New York; 2016. pp. 463–476. https://doi.org/10.1007/978-1-4939-3572-7_24 pmid:27115648
- 46. Chakravarty D, Guharoy M, Robert CH, Chakrabarti P, Janin J. Reassessing buried surface areas in protein–protein complexes. Protein Science. 2013;22: 1453–1457. pmid:23934783
- 47. Yuan S, Chan HCS, Hu Z. Using PyMOL as a platform for computational drug design. WIREs Comput Mol Sci. 2017;7: e1298.
- 48. Du X, Li Y, Xia Y-L, Ai S-M, Liang J, Sang P, et al. Insights into Protein-Ligand Interactions: Mechanisms, Models, and Methods. Int J Mol Sci. 2016;17: 144. pmid:26821017
- 49. Honorato RV, Koukos PI, Jiménez-García B, Tsaregorodtsev A, Verlato M, Giachetti A, et al. Structural Biology in the Clouds: The WeNMR-EOSC Ecosystem. Front Mol Biosci. 2021;8: 729513. pmid:34395534
- 50. Walhout AJM, Vidal M. Protein interaction maps for model organisms. Nat Rev Mol Cell Biol. 2001;2: 55–63. pmid:11413466
- 51. Siddiquee KAZ, Arauzo-Bravo MJ, Shimizu K. Effect of a pyruvate kinase (pykF -gene) knockout mutation on the control of gene expression and metabolic fluxes in Escherichia coli. FEMS Microbiology Letters. 2004;235: 25–33.
- 52. Alteri CJ, Smith SN, Mobley HLT. Fitness of Escherichia coli during urinary tract infection requires gluconeogenesis and the TCA cycle. PLoS Pathog. 2009;5: e1000448. pmid:19478872
- 53. Taguchi H, Koike-Takeshita A. In vivo client proteins of the chaperonin GroEL-GroES provide insight into the role of chaperones in protein evolution. Front Mol Biosci. 2023;10: 1091677. pmid:36845542
- 54. Schneider E, Eckey V, Weidlich D, Wiesemann N, Vahedi-Faridi A, Thaben P, et al. Receptor–transporter interactions of canonical ATP-binding cassette import systems in prokaryotes. European Journal of Cell Biology. 2012;91: 311–317. pmid:21561685
- 55. Minamino T, Kinoshita M, Namba K. Insight Into Distinct Functional Roles of the Flagellar ATPase Complex for Flagellar Assembly in Salmonella. Front Microbiol. 2022;13: 864178. pmid:35602071
- 56. Reizer J, Saier MH. Modular multidomain phosphoryl transfer proteins of bacteria. Current Opinion in Structural Biology. 1997;7: 407–415. pmid:9204284
- 57. Pan C, Li B, Simon MC. Moonlighting functions of metabolic enzymes and metabolites in cancer. Molecular Cell. 2021;81: 3760–3774. pmid:34547237
- 58. Babu M, Bundalovic-Torma C, Calmettes C, Phanse S, Zhang Q, Jiang Y, et al. Global landscape of cell envelope protein complexes in Escherichia coli. Nat Biotechnol. 2018;36: 103–112. pmid:29176613
- 59. Konc J, Hodošček M, Ogrizek M, Trykowska Konc J, Janežič D. Structure-Based Function Prediction of Uncharacterized Protein Using Binding Sites Comparison. MacKerell AD, editor. PLoS Comput Biol. 2013;9: e1003341. pmid:24244144
- 60. Ammar EM, Wang X, Rao CV. Regulation of metabolism in Escherichia coli during growth on mixtures of the non-glucose sugars: arabinose, lactose, and xylose. Sci Rep. 2018;8: 609. pmid:29330542
- 61. Phue J-N, Noronha SB, Hattacharyya R, Wolfe AJ, Shiloach J. Glucose metabolism at high density growth ofE. coli B andE. coli K: Differences in metabolic pathways are responsible for efficient glucose utilization inE. coli B as determined by microarrays and Northern blot analyses. Biotechnol Bioeng. 2005;90: 805–820. pmid:15806547
- 62. Sharan R, Suthram S, Kelley RM, Kuhn T, McCuine S, Uetz P, et al. Conserved patterns of protein interaction in multiple species. Proc Natl Acad Sci U S A. 2005;102: 1974–1979. pmid:15687504
- 63. Sabnis NA, Yang H, Romeo T. Pleiotropic regulation of central carbohydrate metabolism in Escherichia coli via the gene csrA. J Biol Chem. 1995;270: 29096–29104. pmid:7493933
- 64. Thompson CN, Duy PT, Baker S. The Rising Dominance of Shigella sonnei: An Intercontinental Shift in the Etiology of Bacillary Dysentery. Clements ACA, editor. PLoS Negl Trop Dis. 2015;9: e0003708. pmid:26068698
- 65. Passalacqua KD, Charbonneau M-E, O’Riordan MXD. Bacterial Metabolism Shapes the Host–Pathogen Interface. Kudva IT, Cornick NA, editors. Microbiol Spectr. 2016;4: 4.3.13. pmid:27337445
- 66. Chowdhury S, Fong SS. Computational Modeling of the Human Microbiome. Microorganisms. 2020;8: 197. pmid:32023941
- 67. Liang L, Liu R, Freed EF, Eckert CA. Synthetic Biology and Metabolic Engineering Employing Escherichia coli for C2–C6 Bioalcohol Production. Front Bioeng Biotechnol. 2020;8: 710. pmid:32719784
- 68. Allison WT, DuVal MG, Nguyen-Phuoc K, Leighton PLA. Reduced Abundance and Subverted Functions of Proteins in Prion-Like Diseases: Gained Functions Fascinate but Lost Functions Affect Aetiology. Int J Mol Sci. 2017;18: 2223. pmid:29064456
- 69. Vogel C, Marcotte EM. Insights into the regulation of protein abundance from proteomic and transcriptomic analyses. Nat Rev Genet. 2012;13: 227–232. pmid:22411467
- 70. Swets JA. The science of choosing the right decision threshold in high-stakes diagnostics. American Psychologist. 1992;47: 522–532. pmid:1595983
- 71. Arike L, Valgepea K, Peil L, Nahku R, Adamberg K, Vilu R. Comparison and applications of label-free absolute proteome quantification methods on Escherichia coli. J Proteomics. 2012;75: 5437–5448. pmid:22771841
- 72. Wilson DB, Hogness DS. THE ENZYMES OF THE GALACTOSE OPERON IN ESCHERICHIA COLI. I. PURIFICATION AND CHARACTERIZATION OF URIDINE DIPHOSPHOGALACTOSE 4-EPIMERASE. J Biol Chem. 1964;239: 2469–2481. pmid:14235524
- 73. Small PLC, Waterman SR. Acid stress, anaerobiosis and gadCB: lessons from Lactococcus lactis and Escherichia coli. Trends in Microbiology. 1998;6: 214–216. pmid:9675796
- 74. Yeh JI, Kettering R, Saxl R, Bourand A, Darbon E, Joly N, et al. Structural characterizations of glycerol kinase: unraveling phosphorylation-induced long-range activation. Biochemistry. 2009;48: 346–356. pmid:19102629
- 75. Pawson T, Nash P. Protein–protein interactions define specificity in signal transduction. Genes Dev. 2000;14: 1027–1047. pmid:10809663
- 76. Davis FP, Sali A. The overlap of small molecule and protein binding sites within families of protein structures. PLoS Comput Biol. 2010;6: e1000668. pmid:20140189
- 77. Bryant P, Pozzati G, Elofsson A. Improved prediction of protein-protein interactions using AlphaFold2. Nat Commun. 2022;13: 1265. pmid:35273146
- 78. Huthmacher C, Gille C, Holzhütter H-G. A computational analysis of protein interactions in metabolic networks reveals novel enzyme pairs potentially involved in metabolic channeling. Journal of Theoretical Biology. 2008;252: 456–464. pmid:17988690
- 79. Benoit S, Abaibou H, Mandrand-Berthelot MA. Topological analysis of the aerobic membrane-bound formate dehydrogenase of Escherichia coli. J Bacteriol. 1998;180: 6625–6634. pmid:9852007
- 80. Mitra A, Sarma SP. Escherichia coli ilvN Interacts with the FAD Binding Domain of ilvB and Activates the AHAS I Enzyme. Biochemistry. 2008;47: 1518–1531. pmid:18193896
- 81. Vyazmensky M, Zherdev Y, Slutzker A, Belenky I, Kryukov O, Barak Z, et al. Interactions between large and small subunits of different acetohydroxyacid synthase isozymes of Escherichia coli. Biochemistry. 2009;48: 8731–8737. pmid:19653643
- 82. Tao H, Bausch C, Richmond C, Blattner FR, Conway T. Functional Genomics: Expression Analysis of Escherichia coli Growing on Minimal and Rich Media. J Bacteriol. 1999;181: 6425–6440.
- 83. Lüke I, Butland G, Moore K, Buchanan G, Lyall V, Fairhurst SA, et al. Biosynthesis of the respiratory formate dehydrogenases from Escherichia coli: characterization of the FdhE protein. Arch Microbiol. 2008;190: 685–696. pmid:18716757
- 84. Cole ST, Grundström T, Jaurin B, Robinson JJ, Weiner JH. Location and Nucleotide Sequence of frdB, the Gene Coding for the Iron‐Sulphur Protein Subunit of the Fumarate Reductase of Escherichia coli. European Journal of Biochemistry. 1982;126: 211–216. pmid:6751816
- 85. Cecchini G, Schröder I, Gunsalus RP, Maklashina E. Succinate dehydrogenase and fumarate reductase from Escherichia coli. Biochimica et Biophysica Acta (BBA)—Bioenergetics. 2002;1553: 140–157. pmid:11803023
- 86. Starbird CA, Maklashina E, Sharma P, Qualls-Histed S, Cecchini G, Iverson TM. Structural and biochemical analyses reveal insights into covalent flavinylation of the Escherichia coli Complex II homolog quinol:fumarate reductase. Journal of Biological Chemistry. 2017;292: 12921–12933. pmid:28615448
- 87. Li Q, Zhao P, Yin H, Liu Z, Zhao H, Tian P. CRISPR interference-guided modulation of glucose pathways to boost aconitic acid production in Escherichia coli. Microb Cell Fact. 2020;19: 174. pmid:32883305
- 88. Balasubramanian D, Ragunathan PT, Fei J, Vanderpool CK. A Prophage-Encoded Small RNA Controls Metabolism and Cell Division in Escherichia coli. mSystems. 2016;1: e00021–15. pmid:27822514
- 89. Conway T, Cohen PS. Commensal and Pathogenic Escherichia coli Metabolism in the Gut. Microbiol Spectr. 2015;3. pmid:26185077
- 90. Safi AUR, Mansour Salih M, Rahman H, Khattak B, El Askary A, Hussain Khalifa E, et al. Immunoaffinity-based mass spectrometric characterization of immunoreactive proteins of Salmonella Typhi. Saudi Journal of Biological Sciences. 2023;30: 103502. pmid:36458097
- 91. Ikei M, Miyazaki R, Monden K, Naito Y, Takeuchi A, Takahashi YS, et al. YeeD is an essential partner for YeeE-mediated thiosulfate uptake in bacteria and regulates thiosulfate ion decomposition. Søgaard-Andersen L, editor. PLoS Biol. 2024;22: e3002601. pmid:38656967
- 92. Morigasaki S, Umeyama A, Kawano Y, Aizawa Y, Ohtsu I. Defect of RNA pyrophosphohydrolase RppH enhances fermentative production of L-cysteine in Escherichia coli. J Gen Appl Microbiol. 2020;66: 307–314. pmid:32779574
- 93. Romano AH, Van Vranken NJ, Preisand P, Brustolon M. Regulation of the Thiobacillus intermedius glucose uptake system by thiosulfate. J Bacteriol. 1975;121: 577–582. pmid:1112773
- 94. Sekowska A, Kung HF, Danchin A. Sulfur metabolism in Escherichia coli and related bacteria: facts and fiction. J Mol Microbiol Biotechnol. 2000;2: 145–177. pmid:10939241
- 95. Weichart D, Querfurth N, Dreger M, Hengge-Aronis R. Global Role for ClpP-Containing Proteases in Stationary-Phase Adaptation of Escherichia coli. J Bacteriol. 2003;185: 115–125.
- 96. Kimata K, Inada T, Tagami H, Aiba H. A global repressor (Mlc) is involved in glucose induction of the ptsG gene encoding major glucose transporter in Escherichia coli. Molecular Microbiology. 1998;29: 1509–1519. pmid:9781886
- 97. Pattrick CA, Webb JP, Green J, Chaudhuri RR, Collins MO, Kelly DJ. Proteomic Profiling, Transcription Factor Modeling, and Genomics of Evolved Tolerant Strains Elucidate Mechanisms of Vanillin Toxicity in Escherichia coli. Tullman-Ercek D, editor. mSystems. 2019;4: e00163–19. pmid:31186336
- 98.
Bhattacharyya S, Ranganathan S, Chowdhury S, Adkar BV, Khrapko M, Shakhnovich EI. A comprehensive map of transient protein-protein interactions and their structural features in the E. coli 1-carbon metabolism pathway. bioRxiv; 2024. p. 2024.02.17.580827. https://doi.org/10.1101/2024.02.17.580827
- 99. Durek P, Walther D. The integrated analysis of metabolic and protein interaction networks reveals novel molecular organizing principles. BMC Syst Biol. 2008;2: 100. pmid:19032748
- 100. Zhang Y, Fernie AR. Stable and Temporary Enzyme Complexes and Metabolons Involved in Energy and Redox Metabolism. Antioxidants & Redox Signaling. 2021;35: 788–807. pmid:32368925
- 101. Deutscher J, Francke C, Postma PW. How phosphotransferase system-related protein phosphorylation regulates carbohydrate metabolism in bacteria. Microbiol Mol Biol Rev. 2006;70: 939–1031. pmid:17158705