Figures
Abstract
Metabolic networks map complex biochemical reactions within organisms, which is crucial for understanding cellular processes and metabolite flow. This study focuses on inferring the directionality of interactions in metabolomics networks. Given the challenge of using steady-state data, we benchmark various methods, including statistical scores and neural network approaches, on synthetic yet realistic biological models. Our findings highlight the relative success of a few methods in some cases where the interaction mechanism is known, whereas other methods show limited effectiveness.
Citation: Leibovich N, Cuperlovic-Culf M (2026) Pairwise causal discovery in biochemical networks: A survey on directionality inference within complex networks from stationary observations. PLoS One 21(6): e0349617. https://doi.org/10.1371/journal.pone.0349617
Editor: Stephan N. Witt, Louisiana State University Health Sciences Center, UNITED STATES OF AMERICA
Received: November 14, 2025; Accepted: May 2, 2026; Published: June 16, 2026
Copyright: © 2026 Leibovich, Cuperlovic-Culf. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting information files.
Funding: This study was financially supported by the National Research Council of Canada (NRC) in the form of an Artificial Intelligence for Design (AI4D) Challenge program award (https://nrc.canada.ca/en/research-development/research-collaboration/programs/artificial-intelligence-design-challenge-program) received by NL and MC-C. No additional external funding was received for this study. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Metabolic networks function as intricate maps of biochemical reactions within an organism, illustrating the complex web of interactions that govern cellular processes. Understanding the directionality of these reactions is crucial for tracking the flow of metabolites through pathways and illuminating the functional dynamics of cellular metabolism. This information is relevant across various fields, including systems biology, biotechnology, and drug discovery [1–5]. In this context, we focus on the interpretation of the metabolomics network.
The metabolome consists of hundreds of correlated compounds, offering substantial information within the metabolomics network that extends beyond the mere levels of individual metabolites [6–8]. Focusing solely on individual metabolites can obscure potential intervention targets due to collinearity and confounding factors; however, network models can effectively incorporate these interrelationships within the metabolome [6–10]. The significance of biological networks in human diseases has been widely acknowledged [11,12], particularly in their role in the identification of causal associations [13].
Interaction networks are typically inferred using time series measurements or pseudo-time trajectories, employing statistical tools such as Bayesian inference and maximum likelihood alongside machine learning algorithms [14–24]. Notably, a wide range of recent studies have explored methods for inferring networks from temporal data, as highlighted in the partial list as follows [25–35]. However, it is important to note that these methods require the records of synchronized ordered data, which is often unmeasurable in various observational contexts.
Here, we aim to infer the direction of interactions from stationary data associated with the analyzed dataset, see Fig 1. In general, the interaction network can be curated from prior biological knowledge, or structured from experimental data [36–39]. The experimental networks are directly derived from data, for example, untargeted metabolomics generated from mass spectrometry [36–39]. One can derive the functional connectivity, which provides valuable insights into the statistical dependencies arising from the collective dynamics of interactions between pairs of units [40–46]. These statistical dependencies among components can be characterized by well-known measures such as correlation, mutual information, or their analyses using silencing methods [44–48]. However, these measures are symmetric and consequently do not capture the direction of interactions, resulting in the inference of non-directed networks. In contrast, one may consider non-symmetric metrics, such as partial correlations or dependency analyses. Specifically, partial correlation and various kernel-based metrics have been proposed to analyze associations between variables while accounting for confounders within the functional connectivity network [49–54]. However, these approaches require recording data from all other interacting variables in the network, including any suspected confounders. Such comprehensive data is often unavailable in many systems.
Our focus is on methods designed to uncover the directional influence between these two variables. However, determining the orientation of the network from static data poses a great challenge, necessitating keen analytical and computational techniques.
In particular, we recognize that the complete metabolic network under consideration has not been fully characterized. As a result, neither the overall network topology nor other confounding variables have been adequately described or measured. Consequently, we focus on methods designed to infer the pairwise causal direction between two correlated variables. Furthermore, we emphasize that earlier benchmark analyzes of pairwise inference methods [55–57] evaluate performance with respect to various causal mechanisms between pairs of variables, without taking into account the context of interaction networks in biological processes. Such networks can introduce intricate influences due to signal propagation, even when the two variables do not directly interact with one another.
Determining the network orientation from steady-state data is therefore challenging. First, we examine various methods for deciphering interaction directions using steady-state data and highlight their domains of failure and success. We survey both statistical scores, which allow inference from small data sets, and methods based on machine learning and neural network approaches, which require large data sets for training the models. We tested these methods on multiple interaction mechanisms, with and without additional noise.
Main results
As mentioned, our aim is to infer the direction of interaction within a partially observed interaction network between biochemical components to gain insights into some biological processes. To do so, we benchmark existing methods to infer directionality from pairwise data. Our contribution lies in the following. Previous comparisons between methods collate various interaction mechanisms, combining linear and non-linear effects, and additive and multiplicative noise. Nevertheless, these previously examined mechanisms do not necessarily capture the complexity of biochemical reaction networks; which possess multiple confounders that may remain unknown, including biochemical cycles and interaction loops, complex signal propagation throughout the network, and which the pairwise interactions mechanism is unknown.
Benchmarking methods for analyzing synthetic data
We use several interaction mechanisms within complex networks. Specifically, we use the Michaelis-Menten model, which describes gene interaction networks [58–60], as well as networks of coupled Rössler oscillators, and coupled Goodwin oscillators, which represent a prototypical biological oscillator that characterizes various biological processes such as circadian clocks [61–65], and brain function [66]. These synthetic models are commonly used in literature for describing biological processes. More details on these models can be found in S1 Appendix.
Generally, the investigated interactions can be derived from either knowledge-based or experimental networks [36–39]. In our synthetic data, all examined edges correspond to mathematically defined interactions, which means they are knowledge-based reactions. We analyze both random Erdős–Rényi networks, characterized by a binomial degree distribution, and Barabási–Albert scale-free networks, which exhibit a power-law degree distribution. We start with considering the Erdős–Rényi networks, where the results for Barabási–Albert networks are given in Subsect. Network structure dependence: Scale-Free vs Erdős–Rényi Models below. Examination of these two network models enables a comprehensive evaluation of network-dependent behaviors relevant to our assessment.
First, we consider random interaction networks following the Erdös–Rényi model. In Fig 2 we show the observations of pairs for the interaction mechanisms considered. Given that both variables are affected by additional influences from other network variables, along with inherent noise for each variable, which also propagates throughout the network, determining the direction of the interaction is highly challenging [67].
For consistent visualization, we present the direction of interaction yi → yj for all panels, where for each we sample N=102 data points.
We investigate a range of methods, including those grounded in statistical scores, stacking techniques, and artificial neural networks (NNs). The statistical scores we analyzed include the Additive Noise Model (ANM), Conditional Distribution Similarity (CDS), Information Geometric Causal Inference (IGCI), and Regression-Error-based Causal Inference (RECI). Additionally, we examined various stacking methods, such as voting, eXtreme Gradient Boosting (XGBoost), and a recently introduced Causal Ensemble Learning method based on Support Measure Machines (CEMM). Furthermore, we explore several proposed deep neural network architectures, such as the Neural Causation Coefficient (NCC) and Randomize Causation Coefficient (RCC). A comprehensive overview and description of these inference methods can be found in Table 1 and in S1 Appendix. The implementations were based on [68] wherever applicable.
The techniques that have been tested provide a rich composition of methods, each incorporating different approaches to inference. This variety offers a solid foundation for benchmarking assessment. However, we have not examined all possible methods that have been proposed in the literature. Certain methods were excluded for several reasons, including limitations in computational capacity, the unavailability of published code from the relevant articles, and a lack of applicability to our data. Specifically, some methods require information that is inaccessible or involve data types that do not align with the data of our interest. As mentioned, the methods are reviewed in Table 1 and S1 Appendix.
Performance on random synthetic networks
We show in Fig 3 that for the biochemical reaction networks we examined, most of the methods we assessed struggle with direction determination, as no method presents excellent accuracy, say beyond 0.9. This somewhat inferior accuracy is expected, especially for the statistical scores, since they rely on strong assumptions about the statistical characteristics of the data, which are not necessarily complied with within our systems. In general, an accuracy of around 0.5 indicates a random choice between two equally probable cases, while an accuracy lower than that implies systematic errors in the statistical scores from their unfulfilled assumptions.
Here the interaction networks are from Erdős–Rényi type.
However, we found that some methods perform better than others. In particular, the NCC method yields results that surpass the baseline of random choice located at 0.5, with accuracies ranging between 0.7 and 0.9. We find that the NCC results exceed other methods for various interaction mechanisms and noise scales that were considered. We further investigate insights from these results in the following subsections.
Interestingly, for the coupled Rössler oscillators, we found that the CEMM, XGBoost, and CDS methods produced relatively successful results, albeit falling short compared to the NCC method (see Fig 3). Among stacking methods, XGBoost and CEMM demonstrated accuracy that places them second to NCC, achieving an approximate accuracy of 0.7. Additionally, for the Rössler coupled oscillators, the statistical score of CDS exhibits similar accuracy for larger values of N. In contrast, for the other interaction mechanisms that were examined, the CDS, CEMM, and XGBoost stacking methods, along with the other inference approaches tested—excluding NCC—failed to accurately infer the direction of the interaction.
In addition, our findings indicate some dependence on the noise level. We tested the performance for three noise levels. For data synthesized with the Goodwin mechanism, a higher noise level seems preferable, and success was found for the NCC method. Other dynamic mechanisms show comparable results for all noise levels tested. Additionally, the finite number of sampled data points, N, induced additional measurement noise from the finite sampling itself. For small N, most methods show an accuracy close to 0.5, which signifies the chance accuracy of a dummy classifier for the two possible directions. As N increases, some methods improve their accuracy, while for others the accuracy remains at the random chance or even decreases below it, due to systematic errors that emerged from the statistical assumption. Given that observations are typically of the size of samples, we do not consider a larger set of data points. Furthermore, we examine the results when varying the level of missing data using imputation. We have tested multiple imputation methods: replacement of the missing values with the mean, iteratively predicting missing values using the MICE (Multiple Imputation by Chained Equations) method [69], and the imputation using the K-Nearest Neighbors (KNN) approach [70]. The dependence of varying the rate of missing values on the accuracy of the methods for inference of the interactions’ direction shows similar results, where the accuracy of NCC model exceeds the random classifier results, see S1 Appendix.
Further inspection of the NCC model
To expand the analyses even further, we have tested the NCC method as a candidate for a directionality inference method. We first examine the performance of the model trained on a given dataset drawn from one mechanism, to infer interaction direction from another mechanism, see Fig 4. As expected, each NCC model performs better on the data mechanism it was trained on. Obviously, for a trained model to be effective, the training data must be similar to the data aimed to infer. Nevertheless, in the biological processes under question, one does not necessarily have such information.
As expected, using training and testing datasets from different mechanisms is ineffective.
We also trained an additional model using simulated data generated with randomly selected interaction mechanisms, referred to as “mixed.” Unlike before, we did not specify particular mechanisms. This new model yielded slightly improved results, exceeding the random accuracy for two different mechanisms we examined.
The analysis considers Kendall’s and Spearman’s
correlation coefficients, as well as the mutual information between variables, to evaluate their observed dependence. Higher values of these metrics indicate that the dynamics of one variable are influenced by the dynamics of the other. Our findings show that a strong dependence is associated with a higher probability of correctly determining the direction of interaction, as shown in Fig 5. In our synthetic data, all examined edges correspond to defined interactions in which one variable categorically influences the other. In other words, we consider the edge to be given from prior knowledge on the process [36–38]. Nevertheless, weak interactions may be obtained empirically due to sampling errors, noise propagation from other network regions, or confounding factors affecting both variables. Fig 5 shows that edges with weakly measured interactions are less likely to be inferred correctly. Most of these interactions, however, would have likely been omitted in an experimental network since their pairwise dependence would be lower than the required threshold. As was previously noted, integration of knowledge-based networks with experimental networks would help to improve the identification and interpretation in biologically relevant contexts [36–39].
Results for the Kendall coefficient (left), Spearman
coefficient (middle), and mutual information (right) show that stronger dependence between the two variables corresponds to increased success rates in determining the direction of interaction between them.
Network structure dependence: Scale-Free vs Erdös–Rényi models
Up to this point, we have presented results for edges within random Erdös–Rényi-type networks. However, scale-free networks are frequently reported in complex biological systems [71–74]. A key property of these networks is their degree distribution, which follows a power-law pattern, where k is the degree and the exponent
typically ranges between 2 and 3. Fig 6 compares random networks generated using the Erdös–Rényi model (Fig 6A) and the Barabási–Albert scale-free model (Fig 6B). Although both networks have the same mean degree, their topologies differ substantially. In particular, Barabási–Albert network exhibits ‘hubs’ where some nodes have high degrees, while the vast majority of nodes have a degree of one and, as such, are connected to only a single node. Fig 6C shows the performance of directional inference methods for interactions governed by Michaelis–Menten dynamics, where nodes represent variables in scale-free networks. The results show a consistent pattern: among the tested methods, the NCC approach demonstrates superior performance over the others.
(A) Example of a random network generated using the Erdös–Rényi model and its degree distribution, which follows a binomial pattern. (B) Example of a scale-free network generated using the Barabási–Albert model, exhibiting a power-law degree distribution. Both networks share the same mean degree but display markedly different topologies. (C) Performance of directional inference methods applied to reactions (edges) drawn from scale-free interaction networks. Simulations were performed using Michaelis–Menten dynamics with a noise scale of 0.5 and a dataset size of N = 103.
Model performance for realistic biological systems
We test the applicability of the above models for inference of the direction of the interaction within realistic biological processes. To do so, we simulate these biological processes and test the performance of the inferred interaction direction. Specifically, we first use a previously proposed biologically realistic system – a model that is based on the E. coli gene regulatory network (GNR) provided in the DREAM challenge [75–77]. In addition, we simulate the Bile Acid Synthesis Pathway (BASP) model specified in [78] to describe a part of cholesterol metabolism. Finally, we simulate sphingolipid metabolism using computationally determined enzyme kinetic parameters and a detailed metabolic network based on KEGG: map00600. The details of all models are provided in the S1 Appendix.
Testing the models on synthetic, yet realistic, biological process data enables us to evaluate the anticipated performance of the inference method in a controlled environment before applying it to actual data of interest. We utilize our pre-trained NCC models and apply them to the biological processes mentioned above, see Fig 7. Our simulation results indicate that the inference approaches explored demonstrate limited success in elucidating the interactions within the examined biological network.
The models used for direction inference are NCC with training data as specified in each panel. Models are detailed in (S1 Appendix).
Discussion and summary
Inferring the direction of interaction between two components within a complex biochemical network from time-independent observations is an important yet challenging task. One key significance of inferring directed biochemical networks in human health research lies in the ability to control and recover metabolic performance in faulty or suboptimally operating cells, such as those with mutations [5]. Additionally, it is essential to understand how signal transduction networks influence cancer cell behavior, including proliferation, survival, invasiveness, and drug resistance [3]. Therefore, the study of biological interaction networks, and in particular the inference of the direction of biochemical reactions, is essential as it may eventually lead to new therapeutic approaches and personalized treatments.
We examined various methods to determine the directionality of the interaction while considering the characteristics of the biological observations of metabolites; the small data set, many variables that are unknown or unobserved, the multiplicity of confounders, the propagation of noise through the network, the unknown dynamics mechanism between the variables, and more. Since many variables are undetermined, we focused on pairwise directionality inference. The directionality inference approaches discussed in the paper focus on determining a single direction between a pair of variables. We note that the analysis excludes bidirectional edges, which describe reversible reactions. Considering this type of interaction requires expansion of the methods beyond their current capacity of binary classification of direction. Further examination of whether the raw scores for each direction, or other computational approaches, may be utilized to distinguish reversible reactions is left for future work.
We compared various approaches and found that they are limited in determining the direction of influence effectively. This limitation may stem from unfulfilled assumptions, such as the presence of directed acyclic graphs or the introduction of additional noise. Furthermore, complexity can increase due to different interaction mechanisms, multiple undetermined confounders, and various sources of noise. Therefore, we suggest that methods for determining the orientation of interaction should be tailored specifically to the system being studied whenever possible.
The inference of pairwise directionality from multiple snapshots, especially in the absence of temporal information, should be approached with caution. In particular, one must be wary of interpreting the interaction directionality as indicative of ’causality.’ This term refers to scenarios where direct changes or interventions in one variable influence the state of another variable. Consequently, ’causality’ is typically discussed in the context of temporal data, where direct interventions and their subsequent effects can be observed over time. However, while some argue that ’causality’ should only apply to temporal data, others extend its definition to include an interpretation based on static observations – this dissension remains open for further research.
References
- 1. Cornelius SP, Kath WL, Motter AE. Realistic control of network dynamics. Nat Commun. 2013;4:1942. pmid:23803966
- 2. Timme M, Casadiego J. Revealing networks from dynamics: an introduction. J Phys A: Math Theor. 2014;47(34):343001.
- 3. Kolch W, Halasz M, Granovskaya M, Kholodenko BN. The dynamic control of signal transduction networks in cancer cells. Nat Rev Cancer. 2015;15(9):515–27. pmid:26289315
- 4. Karrer TM, Kim JZ, Stiso J, Kahn AE, Pasqualetti F, Habel U, et al. A practical guide to methodological considerations in the controllability of structural brain networks. J Neural Eng. 2020;17(2):026031. pmid:31968320
- 5. Motter AE, Gulbahce N, Almaas E, Barabási A-L. Predicting synthetic rescues in metabolic networks. Mol Syst Biol. 2008;4:168. pmid:18277384
- 6. Weckwerth W. Metabolomics in systems biology. Annu Rev Plant Biol. 2003;54:669–89. pmid:14503007
- 7. Gao J, Tarcea VG, Karnovsky A, Mirel BR, Weymouth TE, Beecher CW, et al. Metscape: a Cytoscape plug-in for visualizing and interpreting metabolomic data in the context of human metabolic networks. Bioinformatics. 2010;26(7):971–3. pmid:20139469
- 8. Karnovsky A, Weymouth T, Hull T, Tarcea VG, Scardoni G, Laudanna C, et al. Metscape 2 bioinformatics tool for the analysis and visualization of metabolomics and gene expression data. Bioinformatics. 2012;28(3):373–80. pmid:22135418
- 9. Weckwerth W, Morgenthal K. Metabolomics: from pattern recognition to biological interpretation. Drug Discov Today. 2005;10(22):1551–8. pmid:16257378
- 10. Perez De Souza L, Alseekh S, Brotman Y, Fernie AR. Network-based strategies in metabolomics data analysis and interpretation: from molecular networking to biological interpretation. Expert Rev Proteomics. 2020;17(4):243–55. pmid:32380880
- 11. Barabási A-L, Oltvai ZN. Network biology: understanding the cell’s functional organization. Nat Rev Genet. 2004;5(2):101–13. pmid:14735121
- 12. Vidal M, Cusick ME, Barabási A-L. Interactome networks and human disease. Cell. 2011;144(6):986–98. pmid:21414488
- 13. Schadt EE, Lamb J, Yang X, Zhu J, Edwards S, Guhathakurta D, et al. An integrative genomics approach to infer causal associations between gene expression and disease. Nat Genet. 2005;37(7):710–7. pmid:15965475
- 14. Su EY, Spangler A, Bian Q, Kasamoto JY, Cahan P. Reconstruction of dynamic regulatory networks reveals signaling-induced topology changes associated with germ layer specification. Stem Cell Reports. 2022;17(2):427–42. pmid:35090587
- 15. Guillén-Gosálbez G, Miró A, Alves R, Sorribas A, Jiménez L. Identification of regulatory structure and kinetic parameters of biochemical networks via mixed-integer dynamic optimization. BMC Syst Biol. 2013;7:113. pmid:24176044
- 16. Michailidis G, d’Alché-Buc F. Autoregressive models for gene regulatory network inference: sparsity, stability and causality issues. Math Biosci. 2013;246(2):326–34. pmid:24176667
- 17. Schmidt H, Cho K-H, Jacobsen EW. Identification of small scale biochemical networks based on general type system perturbations. FEBS J. 2005;272(9):2141–51. pmid:15853799
- 18. Kirst C, Timme M, Battaglia D. Dynamic information routing in complex networks. Nat Commun. 2016;7:11061. pmid:27067257
- 19. Kim J, Bates DG, Postlethwaite I, Heslop-Harrison P, Cho K-H. Least-squares methods for identifying biochemical regulatory networks from noisy measurements. BMC Bioinformatics. 2007;8:8. pmid:17212835
- 20.
Pang J, Wang A. Iterative Structural Inference of Directed Graphs. In: Advances in Neural Information Processing Systems 35, 2022. 8717–30. https://doi.org/10.52202/068431-0634
- 21. Gong H, Klinger J, Damazyn K, Li X, Huang S. A novel procedure for statistical inference and verification of gene regulatory subnetwork. BMC Bioinformatics. 2015;16 Suppl 7(Suppl 7):S7. pmid:25952938
- 22. Larvie JE, Sefidmazgi MG, Homaifar A, Harrison SH, Karimoddini A, Guiseppi-Elie A. Stable Gene Regulatory Network Modeling From Steady-State Data. Bioengineering (Basel). 2016;3(2):12. pmid:28952574
- 23. Warne DJ, Baker RE, Simpson MJ. Simulation and inference algorithms for stochastic biochemical reaction networks: from basic concepts to state-of-the-art. J R Soc Interface. 2019;16(151):20180943. pmid:30958205
- 24. Yu D, Righero M, Kocarev L. Estimating topology of networks. Phys Rev Lett. 2006;97(18):188701. pmid:17155589
- 25. Wang L, Huang S, Wang S, Liao J, Li T, Liu L. A survey of causal discovery based on functional causal model. Engineering Applications of Artificial Intelligence. 2024;133:108258.
- 26. Gong C, Zhang C, Yao D, Bi J, Li W, Xu Y. Causal Discovery from Temporal Data: An Overview and New Perspectives. ACM Comput Surv. 2024;57(4):1–38.
- 27. Bhaskar D, Magruder DS, Morales M, De Brouwer E, Venkat A, Wenkel F, et al. Inferring Dynamic Regulatory Interaction Graphs from Time Series Data with Perturbations. Proc Mach Learn Res. 2024;231:22. pmid:40678737
- 28. Theocharous A, Gregoriou GG, Sapountzis P, Kontoyiannis I. Temporally Causal Discovery Tests for Discrete Time Series and Neural Spike Trains. IEEE Trans Signal Process. 2024;72:1333–47.
- 29. Wan J, Kataoka J, Sivakumar J, Peña E, Che Y, Sayama H, et al. Sparse Bayesian Learning for Sequential Inference of Network Connectivity From Small Data. IEEE Trans Netw Sci Eng. 2024;11(6):5892–902.
- 30. Casadiego J, Maoutsa D, Timme M. Inferring Network Connectivity from Event Timing Patterns. Phys Rev Lett. 2018;121(5):054101. pmid:30118266
- 31.
Rouillard L, Ambrogioni L, Wassermann D. Robust and highly scalable estimation of directional couplings from time-shifted signals. In: 2024. https://arxiv.org/abs/240602545
- 32. Yang J, Huang J, Chen Q. Construction of time series causal network based on partial rank correlation. Knowledge-Based Systems. 2024;295:111865.
- 33. Irribarra N, Michell K, Bermeo C, Kristjanpoller W. A multi-head attention neural network with non-linear correlation approach for time series causal discovery. Applied Soft Computing. 2024;165:112062.
- 34. Casadiego J, Nitzan M, Hallerberg S, Timme M. Model-free inference of direct network interactions from nonlinear collective dynamics. Nat Commun. 2017;8(1):2192. pmid:29259167
- 35.
Mizera A, Pang J, Tong TP, Wang A. Benchmarking Structural Inference Methods for Interacting Dynamical Systems with Synthetic Data. In: Advances in Neural Information Processing Systems 37, 2024. 135129–85. https://doi.org/10.52202/079017-4295
- 36. Ma’ayan A. Introduction to network analysis in systems biology. Sci Signal. 2011;4(190):tr5. pmid:21917719
- 37. Zhao C, Dong J, Deng L, Tan Y, Jiang W, Cai Z. Molecular network strategy in multi-omics and mass spectrometry imaging. Curr Opin Chem Biol. 2022;70:102199. pmid:36027696
- 38. Amara A, Frainay C, Jourdan F, Naake T, Neumann S, Novoa-Del-Toro EM, et al. Networks and Graphs Discovery in Metabolomics Data Analysis and Interpretation. Front Mol Biosci. 2022;9:841373. pmid:35350714
- 39. Maizels RJ, Briscoe J. Gene regulatory networks: from correlative models to causal explanations. Nat Rev Genet. 2026;27(6):485–98. pmid:41803457
- 40. Pearl J. Causal inference in statistics: An overview. Statistics Surveys. 2009;3:96–146.
- 41. Liu F, Chan L. Causal Inference on Discrete Data via Estimating Distance Correlations. Neural Comput. 2016;28(5):801–14. pmid:26890344
- 42.
Marx A, Vreeken J. Telling cause from effect using MDL-based local and global regression. In: 2017 IEEE International Conference on Data Mining (ICDM), 2017. 307–16.
- 43. Marx A, Vreeken J. Telling cause from effect by local and global regression. Knowl Inf Syst. 2018;60(3):1277–305.
- 44. Rosato A, Tenori L, Cascante M, De Atauri Carulla PR, Martins Dos Santos VAP, Saccenti E. From correlation to causation: analysis of metabolomics data using systems biology approaches. Metabolomics. 2018;14(4):37. pmid:29503602
- 45. Peel L, Peixoto TP, De Domenico M. Statistical inference links data and theory in network science. Nat Commun. 2022;13(1):6794. pmid:36357376
- 46. Cliff OM, Bryant AG, Lizier JT, Tsuchiya N, Fulcher BD. Unifying pairwise interactions in complex dynamics. Nat Comput Sci. 2023;3(10):883–93. pmid:38177751
- 47. Barzel B, Barabási A-L. Network link prediction by global silencing of indirect correlations. Nat Biotechnol. 2013;31(8):720–5. pmid:23851447
- 48. Glymour C, Zhang K, Spirtes P. Review of Causal Discovery Methods Based on Graphical Models. Front Genet. 2019;10:524. pmid:31214249
- 49. Palus M, Vejmelka M. Directionality of coupling from bivariate time series: how to avoid false causalities and missed connections. Phys Rev E Stat Nonlin Soft Matter Phys. 2007;75(5 Pt 2):056211. pmid:17677152
- 50. Eichler M, Dahlhaus R, Sandkühler J. Partial correlation analysis for the identification of synaptic connections. Biol Cybern. 2003;89(4):289–302. pmid:14605893
- 51. Aghdam R, Ganjali M, Zhang X, Eslahchi C. CN: a consensus algorithm for inferring gene regulatory networks using the SORDER algorithm and conditional mutual information test. Mol Biosyst. 2015;11(3):942–9. pmid:25607659
- 52. Zhang X, Zhao J, Hao J-K, Zhao X-M, Chen L. Conditional mutual inclusive information enables accurate quantification of associations in gene regulatory networks. Nucleic Acids Res. 2015;43(5):e31. pmid:25539927
- 53. Jacob Y, Winetraub Y, Raz G, Ben-Simon E, Okon-Singer H, Rosenberg-Katz K, et al. Dependency Network Analysis (DEPNA) Reveals Context Related Influence of Brain Network Nodes. Sci Rep. 2016;6:27444. pmid:27271458
- 54. Monti F, Stewart D, Surendra A, Alecu I, Nguyen-Tran T, Bennett SAL, et al. Signed Distance Correlation (SiDCo): an online implementation of distance correlation and partial distance correlation for data-driven network analysis. Bioinformatics. 2023;39(5):btad210. pmid:37137236
- 55. Käding C, Runge J. Distinguishing cause and effect in bivariate structural causal models: A systematic investigation. Journal of Machine Learning Research. 2023;24(278):1–144.
- 56. Varando G, Catsis S, Diaz E, Camps-Valls G. Pairwise causal discovery with support measure machines. Applied Soft Computing. 2024;150:111030.
- 57.
Niu W, Gao Z, Song L, Li L. Comprehensive review and empirical evaluation of causal discovery algorithms for numerical data. Journal of Machine Learning Research. 2024;(23):1–78.
- 58. Ronen M, Rosenberg R, Shraiman BI, Alon U. Assigning numbers to the arrows: parameterizing a gene regulation network by using accurate expression kinetics. Proc Natl Acad Sci U S A. 2002;99(16):10555–60. pmid:12145321
- 59. Karlebach G, Shamir R. Modelling and analysis of gene regulatory networks. Nat Rev Mol Cell Biol. 2008;9(10):770–80. pmid:18797474
- 60. Novák B, Tyson JJ. Design principles of biochemical oscillators. Nat Rev Mol Cell Biol. 2008;9(12):981–91. pmid:18971947
- 61. Goodwin BC. Oscillatory behavior in enzymatic control processes. Adv Enzyme Regul. 1965;3:425–38. pmid:5861813
- 62. Ruoff P, Vinsjevik M, Monnerjahn C, Rensing L. The Goodwin oscillator: on the importance of degradation reactions in the circadian clock. J Biol Rhythms. 1999;14(6):469–79. pmid:10643743
- 63. Zeiser S, Müller J, Liebscher V. Modeling the Hes1 oscillator. J Comput Biol. 2007;14(7):984–1000. pmid:17803375
- 64. François P, Despierre N, Siggia ED. Adaptive temperature compensation in circadian oscillations. PLoS Comput Biol. 2012;8(7):e1002585. pmid:22807663
- 65. Nitzan M, Casadiego J, Timme M. Revealing physical interaction networks from statistics of collective dynamics. Sci Adv. 2017;3(2):e1600396. pmid:28246630
- 66. Kuate PDK, Ito H, Fossi JT, Zhao M, Fozin TF, Fotsin HB, et al. From connectome to silicon: a biologically-inspired complex network of CMOS chaotic oscillators for analog brain emulation. Nonlinear Dyn. 2025;113(15):20227–52.
- 67. Scholkopf B, Locatello F, Bauer S, Ke NR, Kalchbrenner N, Goyal A, et al. Toward Causal Representation Learning. Proc IEEE. 2021;109(5):612–34.
- 68. Kalainathan D, Goudet O, Dutta R. Causal discovery toolbox: uncovering causal relationships in python. Journal of Machine Learning Research. 2020;21(37):1–5.
- 69. Buuren S van, Groothuis-Oudshoorn K. mice: Multivariate Imputation by Chained Equations inR. J Stat Soft. 2011;45(3).
- 70. Troyanskaya O, Cantor M, Sherlock G, Brown P, Hastie T, Tibshirani R, et al. Missing value estimation methods for DNA microarrays. Bioinformatics. 2001;17(6):520–5. pmid:11395428
- 71. Barabasi A, Albert R. Emergence of scaling in random networks. Science. 1999;286(5439):509–12. pmid:10521342
- 72. Albert R. Scale-free networks in cell biology. J Cell Sci. 2005;118(Pt 21):4947–57. pmid:16254242
- 73. Khanin R, Wit E. How scale-free are biological networks. J Comput Biol. 2006;13(3):810–8. pmid:16706727
- 74. Barabási A-L. Scale-free networks: a decade and beyond. Science. 2009;325(5939):412–3. pmid:19628854
- 75. Schaffter T, Marbach D, Floreano D. GeneNetWeaver: in silico benchmark generation and performance profiling of network inference methods. Bioinformatics. 2011;27(16):2263–70. pmid:21697125
- 76. Liu F, Zhang S-W, Guo W-F, Wei Z-G, Chen L. Inference of Gene Regulatory Network Based on Local Bayesian Networks. PLoS Comput Biol. 2016;12(8):e1005024. pmid:27479082
- 77. Foo M, Kim J, Bates DG. Modelling and Control of Gene Regulatory Networks for Perturbation Mitigation. IEEE/ACM Trans Comput Biol Bioinform. 2019;16(2):583–95. pmid:29994499
- 78. Zhang F, Macshane B, Searcy R, Huang Z. Mathematical Models for Cholesterol Metabolism and Transport. Processes. 2022;10(1):155.
- 79. Hoyer P, Janzing D, Mooij JM, Peters J, Schölkopf B. Nonlinear causal discovery with additive noise models. Advances in Neural Information Processing Systems. 2008;21.
- 80.
Peters J, Mooij JM, Janzing D, Schölkopf B. Causal discovery with continuous additive noise models. 2014.
- 81.
Fonollosa JAR. Conditional Distribution Variability Measures for Causality Detection. The Springer Series on Challenges in Machine Learning. Springer International Publishing. 2019. p. 339–47. https://doi.org/10.1007/978-3-030-21810-2_12
- 82.
Daniusis P, Janzing D, Mooij J, Zscheischler J, Steudel B, Zhang K, Bernhard Schoelkopf Inferring deterministic causal relations. Conference: UAI 2010, Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence, Catalina Island, CA, USA; 2010. https://arxiv.org/abs/1203.3475
- 83. Blöbaum P, Janzing D, Washio T, Shimizu S, Schölkopf B. Analysis of cause-effect inference by comparing regression errors. PeerJ Comput Sci. 2019;5:e169. pmid:33816822
- 84. Leibovich N. Determining interaction directionality in complex biochemical networks from stationary measurements. Scientific Reports. 2025;15:3004.
- 85. Goudet O, Kalainathan D, Caillou P, Guyon I, Lopez-Paz D, Sebag M. Learning functional causal models with generative neural networks. Explainable and interpretable models in computer vision and machine learning. 2018. p. 39–80.
- 86.
Lopez-Paz D, Nishihara R, Chintala S, Scholkopf B, Bottou L. Discovering Causal Signals in Images. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017. 58–66. https://doi.org/10.1109/cvpr.2017.14
- 87.
Lopez-Paz D, Muandet K, Schölkopf B, Tolstikhin I. In: International Conference on Machine Learning, 2015. 1452–61. https://doi.org/10.48550/arXiv.1502.02398
- 88.
Yuan Y, Ding X, Bar-Joseph Z. Causal inference using deep neural networks. In: 2020. https://arxiv.org/abs/2011.12508
- 89. Ton J-F, Sejdinovic D, Fukumizu K. Meta Learning for Causal Direction. AAAI. 2021;35(11):9897–905.