Figures
Abstract
The selection of optimal enzyme concentration in multienzyme cascade reactions for the highest product yield in practice is very expensive and time-consuming process. The modelling of biological pathways is a difficult process because of the complexity of the system. The mathematical modelling of the system using an analytical approach depends on the many parameters of enzymes which rely on tedious and expensive experiments. The artificial neural network (ANN) method has been successively applied in different fields of science to perform complex functions. In this study, ANN models were trained to predict the flux for the upper part of glycolysis as inferred by NADH consumption, using four enzyme concentrations i.e., phosphoglucoisomerase, phosphofructokinase, fructose-bisphosphate-aldolase, triose-phosphate-isomerase. Out of three ANN algorithms, the neuralnet package with two activation functions, “logistic” and “tanh” were implemented. The prediction of the flux was very efficient: RMSE and R2 were 0.847, 0.93 and 0.804, 0.94 respectively for logistic and tanh functions using a cross validation procedure. This study showed that a systemic approach such as ANN could be used for accurate prediction of the flux through the metabolic pathway. This could help to save a lot of time and costs, particularly from an industrial perspective. The R-code is available at: https://github.com/DSIMB/ANN-Glycolysis-Flux-Prediction.
Citation: Ajjolli Nagaraja A, Fontaine N, Delsaut M, Charton P, Damour C, Offmann B, et al. (2019) Flux prediction using artificial neural network (ANN) for the upper part of glycolysis. PLoS ONE 14(5): e0216178. https://doi.org/10.1371/journal.pone.0216178
Editor: Marie-Joelle Virolle, Universite Paris-Sud, FRANCE
Received: January 28, 2019; Accepted: April 15, 2019; Published: May 8, 2019
Copyright: © 2019 Ajjolli Nagaraja et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: AAN is supported by a PhD grant from Region Reunion and European Union (FEDER) under European operational program INTERRG V-2014-2020 file number 20161449, tiers 234273.
Competing interests: The authors have declared that no competing interests exist.
Abbreviations: ANN, artificial neural network; PGI, phosphoglucoisomerase; PFK, 6-phosphofructokinase; FBA, fructose bisphosphate aldolase; TPI, triose phosphate isomerase; RMSE, root mean square error
Introduction
The emergence of genomics, transcriptomics and proteomics, along with improvements in information technology, helped us to integrate the information, build the mathematical in silico model of a biological system and observe its behaviour [1–5]. The integration of different “-omics” data helped us to understand the genetic difference between the phenotypes, to identify the molecular signature [6,7] and use metabolic engineering [8,9] etc. There have been many attempts to model biological systems, like Saccharomyces cerevisiae [4,10–12], Escherichia coli [13–15], other organisms [3] and many plant metabolic networks for observing and predicting the behaviour of a system using different methods [2,16].
Many different kinds of mathematical models exist to study biological systems [17,18]. Several approaches have been developed to determine or estimate the flux through the metabolic pathway [19–21]. Based on the data and constraints used, the mathematical modelling can be classified into two broad categories [2,16] i.e., kinetic modelling or mechanistic modelling [22–24], and constraint-based or stoichiometric modelling [12,25,26]. The kinetic model defines the reaction mechanism in the system using kinetic parameters to evaluate rate laws. These rate laws are defined from the experiment, assuming that the experimental conditions are similar to in-vivo conditions [27]. To build a kinetic model, the system has been made as simple as possible, while retaining system behaviour. The modelling of enzymes like phosphofructokinase could be problematic and might need more parameters than other enzymes [28]. Determining the kinetic parameter is expensive and time consuming; some parameters could be more difficult to measure. Although many enzymatic assays are described in the literature, sometimes it is necessary to modify the assay for new enzymes or to find a new one. In some cases, for example, following enzyme reaction through spectrophotometers or spectrofluorimeters, this is difficult due to no absorption or emission signals [29] linked to the reactants. Most of the available kinetic data are obtained from in-vitro studies using purified enzymes which might not represent the exact properties of in-vivo enzymes [23]. For example: The Vmax value measured in vitro may not represent the value of an in vivo system because of the destruction of enzyme complexes, cellular organisation and the absence of an unknown inhibitor or activator [30,31]. A constraint-based model uses physiochemical constraints like mass balance, thermodynamic constraints, etc., in the modelling, to observe and study the behaviour of the system [25]. There are different methods, like flux balance analysis [32] and metabolic flux analysis [33]. Flux balance analysis is an approach to studying biochemical networks on a genomic scale, which includes all the known metabolite reactions, and the genes that encode for a particular enzyme. The data from genome annotation or existing knowledge is used to construct the network [5,34] and the physicochemical constraints are used to predict the flux distribution, considering that the total product formed must be equal to the total substrate consumed in steady state conditions [32]. This method is used to predict the growth rate [5,32,34,35] or the production of a particular metabolite [36]. Metabolic flux analysis, an experimental based method, allows the quantification of metabolite in the central metabolism using the Carbon-labelled substrate [33,37,38]. The labelled substrate is allowed to distribute over the metabolic network and is measured using NMR [39] or mass spectrometry [32].
Many of the biomolecules like organic acids [40,41], antibiotics [42–44], bioethanol etc. [45,46] have been used in the pharmaceutical and food industries and as energy sources. Biomolecule production is attracting the attention of biologists and industries due to the decrease in non-renewable resources and global warming [47,48]. Synthetic biology and systems biology help to obtain the highest yield of biomolecules from the source [49–51].
Glycolysis is the centre of the metabolic system in all living organisms. It is an anaerobic pathway present in almost all living cells and helps in ATP generation. Glycolysis has been established as the central core for the fermentation. It contributes to the production of different metabolites, like citric acid, succinic acid, amino acids, etc., through pyruvate, the end product of glycolysis [52].
The neural network is an architecture, modelled on the brain, that is organized with neurons and synapses present in a structure of nodes (formal neuron) connected together [53,54]. Each numerical input corresponds to the input layer, and the value to predict (variable to explain) corresponds to the last level, the output layer. Between those two layers, intermediary nodes are present, built specifically and in sufficient number to model the problem; they form the hidden layer. The neural network has been successfully applied in different fields of science such as physics [55,56], environmental science [57–59] and data mining [60,61] for the prediction of different features in the system. The ANN is also core for deep learning [62]. The artificial neural network could be used to predict the product outcome (i.e. flux through the pathway) when combined with Flux balance analysis or other modelling approaches.
The ANN has been used earlier in predicting the fluxes from 13C labelling of metabolites in mammalian gluconeogenesis by M.R. Antoniewicz et al. [21]. Three linear regression modelling methods, multiple linear regression (MLR), principal component regression (PCR) and partial least square regression (PLS) were run on simulated data and compared to the ANN. The study showed that ANN, that requires the larger sample (>200) performed better than the other methods for flux prediction using new mass isotopomer data [21].
Due to the challenges in estimating the flux using different methods like constraint-based and kinetic-based modelling approaches [63–66], we developed a simple method using artificial neural networks. This method is based purely on the existing experimental data and hence does not require kinetic parameters as in kinetic modelling and no prior information is required regarding the stoichiometry of the metabolic pathway. In this study, an artificial neural network was built to estimate the flux using enzyme concentrations for the upper part of glycolysis as input. Finding the optimum enzyme concentration—which gives the highest product—through experiments is very tedious and expensive. The neural network approach could be helpful for choosing the optimum enzyme concentrations which enhance the final product concentration without experimental setup, within a short period of time. Experiments were carried out in order to: i) assess the structure of the ANN using three different approaches, ii) evaluate different activation functions and iii) compare the prediction of flux of NAD+ to the fluxes predicted by Fievet & al. (2006).
Materials and methods
Principle of artificial neural networks
The base element of ANN is the perceptron, defined in 1958 by Rosenblatt [67]. A combination function computes a value based on the input layer and some weights. This is a weighted sum ∑nipi (observed node) of the ni values in the input layer. To define the output value, a function called activation function is applied to this value. We note ni for the node i, the weight pi corresponds to the connection between node i, the observed node and the activation function f, associated with the observed node (Fig 1).
The structure of an ANN is defined by the number of layers and nodes, by the way they are linked (activation function) and the method to estimate the weights.
Input for building the ANN model
The flux measurement data from in-vitro reconstructed upper part of glycolysis [20] was used to build the artificial neural network (Fig 2). The input for the ANN model consists of concentrations of enzymes phosphoglucoisomerase (PGI), phosphofructokinase (PFK), fructose bisphosphate aldolase (FBA) and triose phosphate isomerase (TPI) in mg/L, and the output is flux J (μM/s) measured as the NADH consumption by glycerol-3-phosphate dehydrogenase (G3PDH). The flux measured through the upper part of glycolysis is indirect and we assume that most of the NADH in the system is consumed during the measurement. The data has been normalised using min-max method before building the neural network.
HK-hexokinase; PGI-phopshoglucoisomerase; PFK-phosphofructokinase; FBA-fructose bisphosphate aldolase; TPI- triose phosphate isomerase; G3PDH- glycerol-3-phosphate dehydrogenase, CK- Creatine kinase.
Experimental details
The upper part of glycolysis was reconstructed in vitro (Fig 2), with constant concentration of hexokinase andglycerol-3-phosphate dehydrogenase, while the other four enzymes (PGI, PFK, FBA and TPI) concentrations varied. The total enzyme concentration of the four enzymes (PGI, PFK, FBA and TPI) was constant at 101.9 mg/L. The NADH consumption using the glycerol-3-phosphate dehydrogenase is monitored every 2s with Uvikon 850 spectrometer at 390 nm from 60 to 120s. The linear slope of NADH was calculated as the flux through the pathway. The assays were performed in triplicate by Fiévet et al., 2006, at 25°C, by adding 1mM ATP at pH 7.5. Data is given in supplementary information (Table A in S1 File).
Structure of ANN
The artificial neural network built with a single layer of hidden units [68] using statistical tool R (version 3.4.3) (R Core Team (2013). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. http://www.R-project.org/) using three different packages: nnet (version 7.3–12) [69], neuralnet (version 1.33) [70] and RSNNS (version 0.4–10) [71].
The network consists of three layers: a) Input (I), b) Hidden layer (H) and c) Output. These layers are connected by edges or neurons. The weighted sum of neuron inputs is submitted to a function which conditions neuron activation. There is no rule for deciding the number of neuron units in a single hidden layer; to choose the best algorithm out of three (i.e. nnet, neuralnet and RSNNS), we first chose the number of hidden units according to Equation 1 and compared the RMSE (Equation 2) and coefficient of determination (R2) (Equation 3) values between the three methods. The algorithm with the lowest RMSE value and highest R2value during the leave-one-out cross validation was chosen as an algorithm of interest and the effect of the number of hidden units on RMSE and R2 was analysed between 1 to 25 hidden units.
Equation 1: Where Nh is number of hidden units; Ns: number of sampling in training data; Ni: number of input neurons; No: number of output neurons; α: arbitrary scaling factor 2–10.
Equation 2: Where RMSE is root mean square error, Yi is ANN predicted values; yi is experimental values; n is number of predictions.
Equation 3: Where R2 is coefficient of determination; Yi ANN is predicted values; yi is experimental values; n is number of predictions, Ӯ is average of experimental values.
Results and discussion
The three-neural network algorithms: nnet [69], neuralnet [70] and RSNNS [71] are built with a hidden number of units ranging from 9 to12, as shown in Equation 1. The RMSE and coefficient of determination were compared between algorithms during leave-one-out cross validation. Out of three algorithms, the neuralnet performed better than the other two (Table B in S1 File), allowing the option of choosing two different activation functions i.e., logistic (sigmoidal) and tanh (Equation 4 and Equation 5 respectively).
Equation 4: logistic activation function.
Equation 5: tanh activation function.
Using the neuralnet model, with “logistic” and “tanh” activation functions, the effect of the number of hidden units on RMSE and R2 was studied (Fig 3) with a leave-one-out-cross-validation procedure (Table C in S1 File). The logistic function with 13 hidden units gives a RMSE of 0.847, R2 of 0.93 and tanh function RMSE of 0.804, and R2 of 0.94 with 6 hidden units. The R-script to build the ANN with leave-one-out cross validation is given in R-Scripts A in S1 File.
Effect of the number of hidden units on RMSE (A) and coefficient of determination (B) in activation function logistic (filled circle, solid line) and tanh (open circle and dotted lines).
Experimental flux was compared with the ANN predicted flux by leave-one-outcross-validation procedure, using chosen hidden units with logistic and tanh function (Fig 4). The effects of enzyme concentrations on the predicted flux and experimental flux were compared, and found to follow a similar trend (Fig 5).
Filled and open circles represent logistic and tanh activation functions respectively.
Filled circle and open circle are enzyme concentration vs predicted flux with logistic and tanh activation functions respectively, open triangles represent the experiment.
During the cross validation of the neural network model, a negative flux value was predicted for one combination of enzymes (Table 1: Index-3). This is because, during a leave-one-out procedure (LOOcv), one combination of the four enzymes is not included in the model training and has to be predicted. The negative value shows the poor ability of the ANN model to predict the outliers, i.e. a combination that is not close (in terms of PGI, PFK, FBA and TPI concentrations) to those included in the training data set.
The original study by J. B. Fievet et al. developed a model to predict flux. As the authors mentioned in their article, their flux predictor overestimates the observed flux by a constant factor. The predicted flux in their method, has an R2 value of 0.86, whereas an ANN approach with logistic function showed an R2 value to be 0.93 and in case of tanh activation function, an R2 of 0.94, obtained with leave-one-out cross validation, which implies that the ANN approach is more efficient in predicting the flux than the method developed in the Fievet study. The effect of enzyme concentrations on the predicted flux by both methods follows a similar trend.
The difference between actual flux and ANN predicted flux was an average of 0.57 μM/s for logistic and for tanh, with a standard deviation of 0.63 and 0.57 respectively (Table 1), whereas the Fievet & al. study (2006) showed an average of 3.3 and a standard deviation of 2.2 with actual predicted values. Fievet et al. stated that their method overestimates the flux values by a constant factor of 1.38. Hence by dividing the predicted flux values by 1.38, corrected values were obtained. The new average corrected value is 1.04 and standard deviation is 0.78 with the experimental values. This indicates that the ANN method performs better than the method described in the original study. This ANN-based method provides additional degrees of freedom over the method proposed in Fievet & al. (2006). Indeed, the number of degrees of freedom increases with the number of hidden units. This makes it possible to obtain an important advantage regarding the error inherent to the learning phase.
Conclusion
Kinetic modelling of metabolic pathways is challenging because of the difficulties in estimating the kinetic parameters [29,65], and is sometimes expensive because of the high-cost substrates and technologies involved [72,73], whereas the constraint-based model does not use any kinetic parameters but is efficient enough to predict the flux of metabolites. Choosing the optimum enzyme concentrations for the highest flux could be a challenge when conducting experiments. Using artificial intelligence with available experimental data can help us find a quicker and more cost-effective solution for biological problems.
In this study, a neural network model was built successfully with two different activation functions: i.e., logistic (sigmoidal) and tanh, with RMSE and R2 values of 0.847, 0.93 and 0.804, 0.94 respectively. The difference between actual flux and ANN predicted flux was an average of 0.57 for both activation functions. The J. B. Fievet et al. method after the correction has a RMSE of 1.30, with a 1.05 difference between predicted and observed flux, which clearly indicates that the ANN method works better than the other method.
It has not escaped our attention that the artificial neural network model depends on the diversity of the training data, and hence training the model with a maximum of variability in the concentration of enzymes plays a crucial role. The model built in this study might not be enough to extrapolate a model for all other enzyme concentrations. In future, the study will be extended to predict the wide range of flux and the enzyme concentration for a particular flux. Experiments in a wet laboratory will be carried out for an upcoming application paper. Even though further work has to be performed, the artificial neural network approach is a promising method in metabolic pathway modelling and could find its place in metabolic engineering and industrial scale biomolecule synthesis.
Supporting information
S1 File.
Table A: Input used to build artificial neural network from Fievet et al., 2016. Table B: Comparision of RMSE and R-squared values during the leave-one-out crossvalidation between neuralnet, nnet and RSNNS algorithm. Table C: RMSE and R-squared for values between the activation functions logistics and tanh during leave-one-out cross validation for the neuralnet method. R-Scripts A: R-script used to obtained the Table 1. The different activation function and number of hidden units are selected using "act.fct = " and "hidden = ".
https://doi.org/10.1371/journal.pone.0216178.s001
(DOCX)
Acknowledgments
AAN is supported by a PhD grant from the Region Reunion and European Union (FEDER) under European operational program INTERRG V-2014-2020 file number 20161449, tiers 234273.
References
- 1. Fukushima A, Kusano M, Redestig H, Arita M, Saito K. Integrated omics approaches in plant systems biology. Curr Opin Chem Biol. 2009;13(5–6):532–8. pmid:19837627
- 2. Stelling J. Mathematical models in microbial systems biology. Curr Opin Microbiol. 2004;7(5):513–8. pmid:15451507
- 3. Pereira B, Miguel J, Vilaça P, Soares S, Rocha I, Carneiro S. Reconstruction of a genome-scale metabolic model for Actinobacillus succinogenes 130Z. BMC Syst Biol. 2018;12.
- 4. Nookaew I, Olivares-Hernández R, Bhumiratana S, Nielsen J. Genome-Scale Metabolic Models of Saccharomyces cerevisiae. In: Methods in molecular biology (Clifton, NJ) [Internet]. 2011. p. 445–63. Available from: http://www.ncbi.nlm.nih.gov/pubmed/21863502
- 5. Schilling CH, Covert MW, Famili I, Church GM, Edwards JS, Palsson BO. Genome-Scale Metabolic Model of Helicobacter pylori 26695. J Biotechnol. 2002;184(16):4582–93.
- 6. Acharjee A, Kloosterman B, Visser RGF, Maliepaard C. Integration of multi-omics data for prediction of phenotypic traits using random forest. BMC Bioinformatics [Internet]. 2016;17(5):363–73. Available from: http://dx.doi.org/10.1186/s12859-016-1043-4
- 7. Craig E. Wheelock VG, David Balgoma BN, Brandsma J, PaulJ. Skipp, Stuart Snowden Dominic Burg, D’Amico A, IldikoHorvath , et al. Application of ‘omics technologies to biomarker discovery in inflammatory lung diseases. Eur Respir J. 2013;42(3):802–25. pmid:23397306
- 8. Vemuri GN, Aristidou AA. Metabolic Engineering in the -omics Era: Elucidating and Modulating Regulatory Networks. Microbiol Mol Biol Rev. 2005;69(2):197–216. pmid:15944454
- 9. Chen GQ. Omics Meets Metabolic Pathway Engineering. Cell Syst [Internet]. 2016;2(6):362–3. Available from: pmid:27237740
- 10. Duarte NC, Herrgård MJ, Palsson BØ. Reconstruction and validation of Saccharomyces cerevisiae iND750, a fully compartmentalized genome-scale metabolic model. Genome Res [Internet]. 2004 Jul;14(7):1298–309. Available from: pmid:15197165
- 11. Förster J, Famili I, Fu P, Palsson BØ, Nielsen J. Genome-scale reconstruction of the Saccharomyces cerevisiae metabolic network. Genome Res [Internet]. 2003 Feb;13(2):244–53. Available from: http://www.ncbi.nlm.nih.gov/pubmed/12566402 pmid:12566402
- 12. Price ND, Reed JL, Palsson B. Genome-scale models of microbial cells: Evaluating the consequences of constraints. Nat Rev Microbiol. 2004;2(11):886–97. pmid:15494745
- 13. Feist AM, Henry CS, Reed JL, Krummenacker M, Joyce AR, Karp PD, et al. A genome-scale metabolic reconstruction for Escherichia coli K-12 MG1655 that accounts for 1260 ORFs and thermodynamic information. Mol Syst Biol [Internet]. 2007 Jan 1 [cited 2018 Jan 30];3(1):121. Available from: http://www.ncbi.nlm.nih.gov/pubmed/17593909
- 14. Reed JL, Palsson BØ. Thirteen Years of Building Constraint-Based In Silico Models of Escherichia coli. J Biotechnol. 2003;185(9):2692–9.
- 15. Weaver DS, Keseler IM, Mackie A, Paulsen IT, Karp PD. A genome-scale metabolic flux model of Escherichia coli K– 12 derived from the EcoCyc database. BMCSystems Biol [Internet]. 2014;8(79):1–24. Available from: http://www.biomedcentral.com/1752-0509/8/79
- 16. Rios-Estepa R, Lange BM. Experimental and mathematical approaches to modeling plant metabolic networks. Phytochemistry. 2007;68(16–18):2351–74. pmid:17561179
- 17. Arturo J, Mora M. System biology: Mathematical modeling of biological systems. Int J Adv Res Comput Eng Technol. 2016;5(8):2243–6.
- 18. VOLKENSTEIN MV. Mathematical Modeling of Biological Processes. Physics and Biology. Springer; 2012. 154 p.
- 19. Nikoloski Z, Perez-Storey R, Sweetlove LJ. Inference and prediction of metabolic network fluxes. Plant Physiol. 2015;169:1443–55. pmid:26392262
- 20. Fiévet JB, Dillmann C, Curien G, de Vienne D. Simplified modelling of metabolic pathways for flux prediction and optimization: lessons from an in vitro reconstruction of the upper part of glycolysis. Biochem J [Internet]. 2006;396(2):317–26. Available from: http://www.ncbi.nlm.nih.gov/pubmed/16460310 pmid:16460310
- 21. Antoniewicz MR, Stephanopoulos G, Kelleher JK. Evaluation of regression models in metabolic physiology: Predicting fluxes from isotopic data without knowledge of the pathway. Metabolomics. 2006;2(1):41–52. pmid:17066125
- 22. Almquist J, Cvijovic M, Hatzimanikatis V, Nielsen J. Kinetic models in industrial biotechnology–Improving cell factory performance. Metab Eng [Internet]. 2014;24:38–60. Available from: pmid:24747045
- 23. Steuer R, Gross T, Selbig J, Blasius B. Structural kinetic modeling of metabolic networks. Proc Natl Acad Sci U S A [Internet]. 2006 Aug 8 [cited 2018 Feb 6];103(32):11868–73. Available from: http://www.ncbi.nlm.nih.gov/pubmed/16880395 pmid:16880395
- 24. Srinivasan S, Cluett WR, Mahadevan R. Constructing kinetic models of metabolism at genome-scales: A review. Biotechnol J. 2015;10(9):1345–59. pmid:26332243
- 25. Covert MW, Famili I, Palsson BO. Identifying Constraints that Govern Cell Behavior: A Key to Converting Conceptual to Computational Models in Biology? Biotechnol Bioeng. 2003;84(7):763–72. pmid:14708117
- 26. Vijayakumar S, Conway M, Lió P, Angione C. Seeing the wood for the trees: a forest of methods for optimization and omic-network integration in metabolic modelling. Brief Bioinform. 2018;19(6):1218–35. pmid:28575143
- 27.
Cornish-Bowden A and Wharton C W. Enzyme kinetics (In Focus). oxford: IRL press ltd; 1988.
- 28. Teusink B, Passarge J, Reijenga CA, Esgalhado E, van der Weijden CC, Schepper M, et al. Can yeast glycolysis be understood in terms of in vitro kinetics of the constituent enzymes? Testing biochemistry. Eur J Biochem [Internet]. 2000 Sep;267(17):5313–29. Available from: http://www.ncbi.nlm.nih.gov/pubmed/10951190 pmid:10951190
- 29. Bisswanger H. Enzyme assays. Perspect Sci [Internet]. 2014;1:41–55. Available from: http://dx.doi.org/10.1016/j.pisc.2014.02.005
- 30. Wright BE, Butler MH, Albe KR. Systems analysis of the tricarboxylic acid cycle in dictyostelium discoideum. I. The basis for model construction. J Biol Chem. 1992;267(5):3101–5. pmid:1737766
- 31. Albe KR, Wright BE. Systems Analysis of the Tricarboxylic Acid Cycle in Dictyostelium discoideum II. control Analysis. J Biol Chem. 1992;267(No. 5, Issue of February 15):3106–14. pmid:1737767
- 32. Orth JD, Thiele I, Palsson BO. What is flux balance analysis? Nat Biotechnol [Internet]. 2010;28(3):245–8. Available from: pmid:20212490
- 33. Wiechert W. 13C Metabolic Flux Analysis. Metab Eng [Internet]. 2001 Jul;3(3):195–206. Available from: https://www.sciencedirect.com/science/article/pii/S1096717601901879 pmid:11461141
- 34. Edwards JS, Palsson BO. The Escherichia coli MG1655 in silico metabolic genotype: Its definition, characteristics, and capabilities. Proc Natl Acad Sci. 2002;97(10):5528–33.
- 35. Jamshidi N, Palsson B. Investigating the metabolic capabilities of Mycobacterium tuberculosis H37Rv using the in silico strain iNJ661 and proposing alternative drug targets. BMC Syst Biol. 2007;1(26):1–20.
- 36. Claudia S, Quintero JC. Flux Balance Analysis in the Production of Clavulanic Acid by Streptomyces clavuligerus. Biotechnol Prog. 2015;31(5):1226–36. pmid:26171767
- 37. Christensen B, Nielsen J. Metabolic Network Analysis of Penicillium chrysogenum Using C-Labeled Glucose. Biotechnol Bioeng. 2000;68(6):652–9. pmid:10799990
- 38. Chen X, Alonso AP, Allen , Doug K.Reed JL, Shachar-Hill Y. Synergy between 13C-metabolic flux analysis and flux balance analysis for understanding metabolic adaption to anaerobiosis in E. coli. Metab Eng [Internet]. 2010;13(1):38–48. Available from: pmid:21129495
- 39. de Graaf AA, Mahle M, Möllney M, Wiechert W, Stahmann P, Sahm H. Determination of full 13C isotopomer distributions for metabolic flux analysis using heteronuclear spin echo difference NMR spectroscopy. J Biotechnol. 2000;77(1):25–35. pmid:10674212
- 40. Mead CACMHGC, Chopra I. Organic Acids: Chemistry. Antibacterial Activity and Practical Applications. Adv Microb Physiol. 1991;32:87–108. pmid:1882730
- 41. Alsaheb RAA, Aladdin A, Othman NZ, Malek RA, Leng OM, Aziz R, et al. Lactic acid applications in pharmaceutical and cosmeceutical industries. J Chem Pharm Res. 2015;7(10):729–35.
- 42. Weissman KJ, Leadlay PF. Combinatorial biosynthesis of reduced polyketides. Nat Rev Microbiol. 2005;3(12):925–36. pmid:16322741
- 43. Awan AR, Blount BA, Bell DJ, Shaw WM, Ho JCH, McKiernan RM, et al. Biosynthesis of the antibiotic nonribosomal peptide penicillin in baker’s yeast. Nat Commun [Internet]. 2017;8(May):1–8. Available from: http://dx.doi.org/10.1038/ncomms15202
- 44. Haris S, Fang C, Bastidas-Oyanedel JR, Prather KJ, Schmidt JE, Thomsen MH. Natural antibacterial agents from arid-region pretreated lignocellulosic biomasses and extracts for the control of lactic acid bacteria in yeast fermentation. AMB Express [Internet]. 2018;8(1):1–7. Available from:
- 45. Khattak WA, Ul-Islam M, Ullah MW, Yu B, Khan S, Park JK. Yeast cell-free enzyme system for bio-ethanol production at elevated temperatures. Process Biochem [Internet]. 2014;49(3):357–64. Available from: http://dx.doi.org/10.1016/j.procbio.2013.12.019
- 46. Zhang YHP, Sun J, Zhong JJ. Biofuel production by in vitro synthetic enzymatic pathway biotransformation. Curr Opin Biotechnol [Internet]. 2010;21(5):663–9. Available from: pmid:20566280
- 47. Zhang YHP. Renewable carbohydrates are a potential high-density hydrogen carrier. Int J Hydrogen Energy [Internet]. 2010;35(19):10334–42. Available from: http://dx.doi.org/10.1016/j.ijhydene.2010.07.132
- 48. Yim H, Haselbeck R, Niu W, Pujol-Baxley C, Burgard A, Boldt J, et al. Metabolic engineering of Escherichia coli for direct production of 1,4-butanediol. Nat Chem Biol [Internet]. 2011;7(7):445–52. Available from: pmid:21602812
- 49. Anderson LA, Ahsanul Islam M, Prather KLJ. Synthetic biology strategies for improving microbial synthesis of “green” biopolymers. J Biol Chem [Internet]. 2018;293,:5053–61. Available from: pmid:29339554
- 50. Lee JW, Na D, Park JM, Lee J, Choi S, Lee SY. systems metabolic engineering of microorganisms for natural and non-natural chemicals. Nat Chem Biol [Internet]. 2012;8(6):536–46. Available from: pmid:22596205
- 51. Zeng J, Teo J, Banerjee A, Chapman TW, Kim J, Sarpeshkar R. A Synthetic Microbial Operational Amplifier. ACS Synth Biol [Internet]. 2018;7(9):2007–13. Available from: pmid:30152993
- 52. Liu Jingjing, Li Jianghua Shin Hyun-dong, Liu Long, Guocheng Du JC. Protein and metabolic engineering for the production of organic acids. Bioresour Technol [Internet]. 2017 Sep 1 [cited 2017 Oct 6];239:412–21. Available from: http://www.sciencedirect.com/science/article/pii/S0960852417305424?via%3Dihub pmid:28538198
- 53. Hassoun MH. Fundamentals of Artificial Neural Networks. Proc IEEE. 1996 Jun;84(6):906.
- 54. Jain AK, Mao J, Mohiuddin KM. Artificial neural networks: A tutorial. Computer (Long Beach Calif). 1996;29(3):31–44.
- 55. Kolanoski H. Application of artificial neural networks in particle physics. Nucl Instruments Methods Phys Res Sect A. 1995;367:14–20.
- 56. Giuseppe Carleo MT. Solving the quantum many-body problem with artificial neural networks. Science (80-). 2017;355(6325):602–6.
- 57. LIU ZeLin PENG ChangHui, XIANG WenHua, TIAN DaLun, DENG XiangWen ZM. Application of artificial neural networks in global climate change and ecological research: An overview. Chinese Sci Bull. 2010;55(34):3853–63.
- 58. Pawul M, Śliwka M. Application of artificial neural networks for prediction of air pollution levels in environmental monitoring. J Ecol Eng. 2016;17(4):190–6.
- 59. Ahmed Gamal El-Din, Daniel W Smith and MGE-D. The application of artificial neural network in wastewater treatment. J Environ Eng Sci. 2004;3:81–95.
- 60. Kamruzzaman SM, Jehad Sarkar AM. A new data mining scheme using artificial neural networks. Sensors. 2011;11(5):4622–47. pmid:22163866
- 61. Eide Å, Johansson R, Lindblad T, Lindsey CS. Data mining and neural networks for knowledge discovery. Nucl Instruments Methods Phys Res Sect A Accel Spectrometers, Detect Assoc Equip. 1997;389(1–2):251–4.
- 62. Schmidhuber J. Deep Learning in neural networks: An overview. Neural Networks [Internet]. 2015;61:85–117. Available from: pmid:25462637
- 63. Raman K, Chandra N. Flux balance analysis of biological systems: Applications and challenges. Brief Bioinform. 2009;10(4):435–49. pmid:19287049
- 64. Allen DK, Libourel IGL, Shachar-Hill Y. Metabolic flux analysis in plants: Coping with complexity. Plant, Cell Environ. 2009;32(9):1241–57.
- 65. Vasilakou E, Machado D, Theorell A, Rocha I, Nöh K, Oldiges M, et al. Current state and challenges for dynamic metabolic modeling. Curr Opin Microbiol. 2016;33(1):97–104.
- 66. Rohwer JM. Kinetic modelling of plant metabolic pathways. J Exp Bot. 2012;63(6):2275–92. pmid:22419742
- 67. Rosenblatt F. The perceptron: A probabilistic model for information storage and organization in the brain. Psychol Rev. 1958;65(6):386–408. pmid:13602029
- 68. Hornik K, Maxwell S, White H. Multilayer Feedforward Networks are Universal Approximators. Neural Networks [Internet]. 1989 [cited 2018 Jan 29];2:359–66. Available from: https://ac.els-cdn.com/0893608089900208/1-s2.0-0893608089900208-main.pdf?_tid=2e455254-04f2-11e8-acb3-00000aacb35d&acdnat=1517230064_28e5e7c6145756cbc7aa95e108aa024c
- 69.
Venables WN , Ripley B. D. Modern Applied Statistics with S-Plus [Internet]. 4th ed. Springer. New York: Springer; 2002. Available from: http://www.stats.ox.ac.uk/pub/MASS4
- 70. Günther F, Fritsch S. neuralnet: Training of Neural Networks. R J. 2010;2(1):30–8.
- 71. Bergmeir C, Benitez JM. Neural Networks in R Using the Stuttgart Neural Network Simulator: RSNNS. J Stat Softw. 2012;46(7):1–26.
- 72. Gupta R, Rathi P, Gupta N, Bradoo S. Lipase assays for conventional and molecular screening: an overview. Biotechnol Appl Biochem. 2003;37(1):63.
- 73. Schmeier S, Hakenberg J, Klipp E, Leser U, Kowald A. Finding Kinetic Parameters Using Text Mining. Omi A J Integr Biol. 2004;8(2):131–52.