Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Pairwise causal discovery in biochemical networks: A survey on directionality inference within complex networks from stationary observations

  • Nava Leibovich ,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Nava.Leibovich@nrc-cnrc.gc.ca

    Affiliation National Research Council of Canada, NRC-Fields Mathematical Sciences Collaboration Centre, Toronto, Ontario, Canada

  • Miroslava Cuperlovic-Culf

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Writing – original draft, Writing – review & editing

    Affiliations Digital Technologies Research Centre, National Research Council Canada, Ottawa, Ontario, Canada, Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine University of Ottawa, Ottawa, Canada, Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Canada

Abstract

Metabolic networks map complex biochemical reactions within organisms, which is crucial for understanding cellular processes and metabolite flow. This study focuses on inferring the directionality of interactions in metabolomics networks. Given the challenge of using steady-state data, we benchmark various methods, including statistical scores and neural network approaches, on synthetic yet realistic biological models. Our findings highlight the relative success of a few methods in some cases where the interaction mechanism is known, whereas other methods show limited effectiveness.

Introduction

Metabolic networks function as intricate maps of biochemical reactions within an organism, illustrating the complex web of interactions that govern cellular processes. Understanding the directionality of these reactions is crucial for tracking the flow of metabolites through pathways and illuminating the functional dynamics of cellular metabolism. This information is relevant across various fields, including systems biology, biotechnology, and drug discovery [15]. In this context, we focus on the interpretation of the metabolomics network.

The metabolome consists of hundreds of correlated compounds, offering substantial information within the metabolomics network that extends beyond the mere levels of individual metabolites [68]. Focusing solely on individual metabolites can obscure potential intervention targets due to collinearity and confounding factors; however, network models can effectively incorporate these interrelationships within the metabolome [610]. The significance of biological networks in human diseases has been widely acknowledged [11,12], particularly in their role in the identification of causal associations [13].

Interaction networks are typically inferred using time series measurements or pseudo-time trajectories, employing statistical tools such as Bayesian inference and maximum likelihood alongside machine learning algorithms [1424]. Notably, a wide range of recent studies have explored methods for inferring networks from temporal data, as highlighted in the partial list as follows [2535]. However, it is important to note that these methods require the records of synchronized ordered data, which is often unmeasurable in various observational contexts.

Here, we aim to infer the direction of interactions from stationary data associated with the analyzed dataset, see Fig 1. In general, the interaction network can be curated from prior biological knowledge, or structured from experimental data [3639]. The experimental networks are directly derived from data, for example, untargeted metabolomics generated from mass spectrometry [3639]. One can derive the functional connectivity, which provides valuable insights into the statistical dependencies arising from the collective dynamics of interactions between pairs of units [4046]. These statistical dependencies among components can be characterized by well-known measures such as correlation, mutual information, or their analyses using silencing methods [4448]. However, these measures are symmetric and consequently do not capture the direction of interactions, resulting in the inference of non-directed networks. In contrast, one may consider non-symmetric metrics, such as partial correlations or dependency analyses. Specifically, partial correlation and various kernel-based metrics have been proposed to analyze associations between variables while accounting for confounders within the functional connectivity network [4954]. However, these approaches require recording data from all other interacting variables in the network, including any suspected confounders. Such comprehensive data is often unavailable in many systems.

thumbnail
Fig 1. The main goal is to infer the direction of interaction between two variables within a metabolic network.

Our focus is on methods designed to uncover the directional influence between these two variables. However, determining the orientation of the network from static data poses a great challenge, necessitating keen analytical and computational techniques.

https://doi.org/10.1371/journal.pone.0349617.g001

In particular, we recognize that the complete metabolic network under consideration has not been fully characterized. As a result, neither the overall network topology nor other confounding variables have been adequately described or measured. Consequently, we focus on methods designed to infer the pairwise causal direction between two correlated variables. Furthermore, we emphasize that earlier benchmark analyzes of pairwise inference methods [5557] evaluate performance with respect to various causal mechanisms between pairs of variables, without taking into account the context of interaction networks in biological processes. Such networks can introduce intricate influences due to signal propagation, even when the two variables do not directly interact with one another.

Determining the network orientation from steady-state data is therefore challenging. First, we examine various methods for deciphering interaction directions using steady-state data and highlight their domains of failure and success. We survey both statistical scores, which allow inference from small data sets, and methods based on machine learning and neural network approaches, which require large data sets for training the models. We tested these methods on multiple interaction mechanisms, with and without additional noise.

Main results

As mentioned, our aim is to infer the direction of interaction within a partially observed interaction network between biochemical components to gain insights into some biological processes. To do so, we benchmark existing methods to infer directionality from pairwise data. Our contribution lies in the following. Previous comparisons between methods collate various interaction mechanisms, combining linear and non-linear effects, and additive and multiplicative noise. Nevertheless, these previously examined mechanisms do not necessarily capture the complexity of biochemical reaction networks; which possess multiple confounders that may remain unknown, including biochemical cycles and interaction loops, complex signal propagation throughout the network, and which the pairwise interactions mechanism is unknown.

Benchmarking methods for analyzing synthetic data

We use several interaction mechanisms within complex networks. Specifically, we use the Michaelis-Menten model, which describes gene interaction networks [5860], as well as networks of coupled Rössler oscillators, and coupled Goodwin oscillators, which represent a prototypical biological oscillator that characterizes various biological processes such as circadian clocks [6165], and brain function [66]. These synthetic models are commonly used in literature for describing biological processes. More details on these models can be found in S1 Appendix.

Generally, the investigated interactions can be derived from either knowledge-based or experimental networks [3639]. In our synthetic data, all examined edges correspond to mathematically defined interactions, which means they are knowledge-based reactions. We analyze both random Erdős–Rényi networks, characterized by a binomial degree distribution, and Barabási–Albert scale-free networks, which exhibit a power-law degree distribution. We start with considering the Erdős–Rényi networks, where the results for Barabási–Albert networks are given in Subsect. Network structure dependence: Scale-Free vs Erdős–Rényi Models below. Examination of these two network models enables a comprehensive evaluation of network-dependent behaviors relevant to our assessment.

First, we consider random interaction networks following the Erdös–Rényi model. In Fig 2 we show the observations of pairs for the interaction mechanisms considered. Given that both variables are affected by additional influences from other network variables, along with inherent noise for each variable, which also propagates throughout the network, determining the direction of the interaction is highly challenging [67].

thumbnail
Fig 2. Realizations of interacting variables’ pairs yi, yj for different interaction mechanisms; Goodwin, Rössler, and Michaelis-Menten models (left to right correspondingly).

For consistent visualization, we present the direction of interaction yi → yj for all panels, where for each we sample N=102 data points.

https://doi.org/10.1371/journal.pone.0349617.g002

We investigate a range of methods, including those grounded in statistical scores, stacking techniques, and artificial neural networks (NNs). The statistical scores we analyzed include the Additive Noise Model (ANM), Conditional Distribution Similarity (CDS), Information Geometric Causal Inference (IGCI), and Regression-Error-based Causal Inference (RECI). Additionally, we examined various stacking methods, such as voting, eXtreme Gradient Boosting (XGBoost), and a recently introduced Causal Ensemble Learning method based on Support Measure Machines (CEMM). Furthermore, we explore several proposed deep neural network architectures, such as the Neural Causation Coefficient (NCC) and Randomize Causation Coefficient (RCC). A comprehensive overview and description of these inference methods can be found in Table 1 and in S1 Appendix. The implementations were based on [68] wherever applicable.

thumbnail
Table 1. Pairwise interaction orientation methods.

https://doi.org/10.1371/journal.pone.0349617.t001

The techniques that have been tested provide a rich composition of methods, each incorporating different approaches to inference. This variety offers a solid foundation for benchmarking assessment. However, we have not examined all possible methods that have been proposed in the literature. Certain methods were excluded for several reasons, including limitations in computational capacity, the unavailability of published code from the relevant articles, and a lack of applicability to our data. Specifically, some methods require information that is inaccessible or involve data types that do not align with the data of our interest. As mentioned, the methods are reviewed in Table 1 and S1 Appendix.

Performance on random synthetic networks

We show in Fig 3 that for the biochemical reaction networks we examined, most of the methods we assessed struggle with direction determination, as no method presents excellent accuracy, say beyond 0.9. This somewhat inferior accuracy is expected, especially for the statistical scores, since they rely on strong assumptions about the statistical characteristics of the data, which are not necessarily complied with within our systems. In general, an accuracy of around 0.5 indicates a random choice between two equally probable cases, while an accuracy lower than that implies systematic errors in the statistical scores from their unfulfilled assumptions.

thumbnail
Fig 3. Results for the tested directionality inference methods for various interaction mechanisms.

Here the interaction networks are from Erdős–Rényi type.

https://doi.org/10.1371/journal.pone.0349617.g003

However, we found that some methods perform better than others. In particular, the NCC method yields results that surpass the baseline of random choice located at 0.5, with accuracies ranging between 0.7 and 0.9. We find that the NCC results exceed other methods for various interaction mechanisms and noise scales that were considered. We further investigate insights from these results in the following subsections.

Interestingly, for the coupled Rössler oscillators, we found that the CEMM, XGBoost, and CDS methods produced relatively successful results, albeit falling short compared to the NCC method (see Fig 3). Among stacking methods, XGBoost and CEMM demonstrated accuracy that places them second to NCC, achieving an approximate accuracy of 0.7. Additionally, for the Rössler coupled oscillators, the statistical score of CDS exhibits similar accuracy for larger values of N. In contrast, for the other interaction mechanisms that were examined, the CDS, CEMM, and XGBoost stacking methods, along with the other inference approaches tested—excluding NCC—failed to accurately infer the direction of the interaction.

In addition, our findings indicate some dependence on the noise level. We tested the performance for three noise levels. For data synthesized with the Goodwin mechanism, a higher noise level seems preferable, and success was found for the NCC method. Other dynamic mechanisms show comparable results for all noise levels tested. Additionally, the finite number of sampled data points, N, induced additional measurement noise from the finite sampling itself. For small N, most methods show an accuracy close to 0.5, which signifies the chance accuracy of a dummy classifier for the two possible directions. As N increases, some methods improve their accuracy, while for others the accuracy remains at the random chance or even decreases below it, due to systematic errors that emerged from the statistical assumption. Given that observations are typically of the size of samples, we do not consider a larger set of data points. Furthermore, we examine the results when varying the level of missing data using imputation. We have tested multiple imputation methods: replacement of the missing values with the mean, iteratively predicting missing values using the MICE (Multiple Imputation by Chained Equations) method [69], and the imputation using the K-Nearest Neighbors (KNN) approach [70]. The dependence of varying the rate of missing values on the accuracy of the methods for inference of the interactions’ direction shows similar results, where the accuracy of NCC model exceeds the random classifier results, see S1 Appendix.

Further inspection of the NCC model

To expand the analyses even further, we have tested the NCC method as a candidate for a directionality inference method. We first examine the performance of the model trained on a given dataset drawn from one mechanism, to infer interaction direction from another mechanism, see Fig 4. As expected, each NCC model performs better on the data mechanism it was trained on. Obviously, for a trained model to be effective, the training data must be similar to the data aimed to infer. Nevertheless, in the biological processes under question, one does not necessarily have such information.

thumbnail
Fig 4. Testing data vs trained NCC models.

As expected, using training and testing datasets from different mechanisms is ineffective.

https://doi.org/10.1371/journal.pone.0349617.g004

We also trained an additional model using simulated data generated with randomly selected interaction mechanisms, referred to as “mixed.” Unlike before, we did not specify particular mechanisms. This new model yielded slightly improved results, exceeding the random accuracy for two different mechanisms we examined.

The analysis considers Kendall’s and Spearman’s correlation coefficients, as well as the mutual information between variables, to evaluate their observed dependence. Higher values of these metrics indicate that the dynamics of one variable are influenced by the dynamics of the other. Our findings show that a strong dependence is associated with a higher probability of correctly determining the direction of interaction, as shown in Fig 5. In our synthetic data, all examined edges correspond to defined interactions in which one variable categorically influences the other. In other words, we consider the edge to be given from prior knowledge on the process [3638]. Nevertheless, weak interactions may be obtained empirically due to sampling errors, noise propagation from other network regions, or confounding factors affecting both variables. Fig 5 shows that edges with weakly measured interactions are less likely to be inferred correctly. Most of these interactions, however, would have likely been omitted in an experimental network since their pairwise dependence would be lower than the required threshold. As was previously noted, integration of knowledge-based networks with experimental networks would help to improve the identification and interpretation in biologically relevant contexts [3639].

thumbnail
Fig 5. Dependence quantities influence the probability for meaningful training of the NCC model and thus to a successful inference.

Results for the Kendall coefficient (left), Spearman coefficient (middle), and mutual information (right) show that stronger dependence between the two variables corresponds to increased success rates in determining the direction of interaction between them.

https://doi.org/10.1371/journal.pone.0349617.g005

Network structure dependence: Scale-Free vs Erdös–Rényi models

Up to this point, we have presented results for edges within random Erdös–Rényi-type networks. However, scale-free networks are frequently reported in complex biological systems [7174]. A key property of these networks is their degree distribution, which follows a power-law pattern, where k is the degree and the exponent typically ranges between 2 and 3. Fig 6 compares random networks generated using the Erdös–Rényi model (Fig 6A) and the Barabási–Albert scale-free model (Fig 6B). Although both networks have the same mean degree, their topologies differ substantially. In particular, Barabási–Albert network exhibits ‘hubs’ where some nodes have high degrees, while the vast majority of nodes have a degree of one and, as such, are connected to only a single node. Fig 6C shows the performance of directional inference methods for interactions governed by Michaelis–Menten dynamics, where nodes represent variables in scale-free networks. The results show a consistent pattern: among the tested methods, the NCC approach demonstrates superior performance over the others.

thumbnail
Fig 6. Comparison between Erdös–Rényi and Barabási–Albert network structures and corresponding inference performance.

(A) Example of a random network generated using the Erdös–Rényi model and its degree distribution, which follows a binomial pattern. (B) Example of a scale-free network generated using the Barabási–Albert model, exhibiting a power-law degree distribution. Both networks share the same mean degree but display markedly different topologies. (C) Performance of directional inference methods applied to reactions (edges) drawn from scale-free interaction networks. Simulations were performed using Michaelis–Menten dynamics with a noise scale of 0.5 and a dataset size of N = 103.

https://doi.org/10.1371/journal.pone.0349617.g006

Model performance for realistic biological systems

We test the applicability of the above models for inference of the direction of the interaction within realistic biological processes. To do so, we simulate these biological processes and test the performance of the inferred interaction direction. Specifically, we first use a previously proposed biologically realistic system – a model that is based on the E. coli gene regulatory network (GNR) provided in the DREAM challenge [7577]. In addition, we simulate the Bile Acid Synthesis Pathway (BASP) model specified in [78] to describe a part of cholesterol metabolism. Finally, we simulate sphingolipid metabolism using computationally determined enzyme kinetic parameters and a detailed metabolic network based on KEGG: map00600. The details of all models are provided in the S1 Appendix.

Testing the models on synthetic, yet realistic, biological process data enables us to evaluate the anticipated performance of the inference method in a controlled environment before applying it to actual data of interest. We utilize our pre-trained NCC models and apply them to the biological processes mentioned above, see Fig 7. Our simulation results indicate that the inference approaches explored demonstrate limited success in elucidating the interactions within the examined biological network.

thumbnail
Fig 7. Performance for Realistic Biological Systems.

The models used for direction inference are NCC with training data as specified in each panel. Models are detailed in (S1 Appendix).

https://doi.org/10.1371/journal.pone.0349617.g007

Discussion and summary

Inferring the direction of interaction between two components within a complex biochemical network from time-independent observations is an important yet challenging task. One key significance of inferring directed biochemical networks in human health research lies in the ability to control and recover metabolic performance in faulty or suboptimally operating cells, such as those with mutations [5]. Additionally, it is essential to understand how signal transduction networks influence cancer cell behavior, including proliferation, survival, invasiveness, and drug resistance [3]. Therefore, the study of biological interaction networks, and in particular the inference of the direction of biochemical reactions, is essential as it may eventually lead to new therapeutic approaches and personalized treatments.

We examined various methods to determine the directionality of the interaction while considering the characteristics of the biological observations of metabolites; the small data set, many variables that are unknown or unobserved, the multiplicity of confounders, the propagation of noise through the network, the unknown dynamics mechanism between the variables, and more. Since many variables are undetermined, we focused on pairwise directionality inference. The directionality inference approaches discussed in the paper focus on determining a single direction between a pair of variables. We note that the analysis excludes bidirectional edges, which describe reversible reactions. Considering this type of interaction requires expansion of the methods beyond their current capacity of binary classification of direction. Further examination of whether the raw scores for each direction, or other computational approaches, may be utilized to distinguish reversible reactions is left for future work.

We compared various approaches and found that they are limited in determining the direction of influence effectively. This limitation may stem from unfulfilled assumptions, such as the presence of directed acyclic graphs or the introduction of additional noise. Furthermore, complexity can increase due to different interaction mechanisms, multiple undetermined confounders, and various sources of noise. Therefore, we suggest that methods for determining the orientation of interaction should be tailored specifically to the system being studied whenever possible.

The inference of pairwise directionality from multiple snapshots, especially in the absence of temporal information, should be approached with caution. In particular, one must be wary of interpreting the interaction directionality as indicative of ’causality.’ This term refers to scenarios where direct changes or interventions in one variable influence the state of another variable. Consequently, ’causality’ is typically discussed in the context of temporal data, where direct interventions and their subsequent effects can be observed over time. However, while some argue that ’causality’ should only apply to temporal data, others extend its definition to include an interpretation based on static observations – this dissension remains open for further research.

References

  1. 1. Cornelius SP, Kath WL, Motter AE. Realistic control of network dynamics. Nat Commun. 2013;4:1942. pmid:23803966
  2. 2. Timme M, Casadiego J. Revealing networks from dynamics: an introduction. J Phys A: Math Theor. 2014;47(34):343001.
  3. 3. Kolch W, Halasz M, Granovskaya M, Kholodenko BN. The dynamic control of signal transduction networks in cancer cells. Nat Rev Cancer. 2015;15(9):515–27. pmid:26289315
  4. 4. Karrer TM, Kim JZ, Stiso J, Kahn AE, Pasqualetti F, Habel U, et al. A practical guide to methodological considerations in the controllability of structural brain networks. J Neural Eng. 2020;17(2):026031. pmid:31968320
  5. 5. Motter AE, Gulbahce N, Almaas E, Barabási A-L. Predicting synthetic rescues in metabolic networks. Mol Syst Biol. 2008;4:168. pmid:18277384
  6. 6. Weckwerth W. Metabolomics in systems biology. Annu Rev Plant Biol. 2003;54:669–89. pmid:14503007
  7. 7. Gao J, Tarcea VG, Karnovsky A, Mirel BR, Weymouth TE, Beecher CW, et al. Metscape: a Cytoscape plug-in for visualizing and interpreting metabolomic data in the context of human metabolic networks. Bioinformatics. 2010;26(7):971–3. pmid:20139469
  8. 8. Karnovsky A, Weymouth T, Hull T, Tarcea VG, Scardoni G, Laudanna C, et al. Metscape 2 bioinformatics tool for the analysis and visualization of metabolomics and gene expression data. Bioinformatics. 2012;28(3):373–80. pmid:22135418
  9. 9. Weckwerth W, Morgenthal K. Metabolomics: from pattern recognition to biological interpretation. Drug Discov Today. 2005;10(22):1551–8. pmid:16257378
  10. 10. Perez De Souza L, Alseekh S, Brotman Y, Fernie AR. Network-based strategies in metabolomics data analysis and interpretation: from molecular networking to biological interpretation. Expert Rev Proteomics. 2020;17(4):243–55. pmid:32380880
  11. 11. Barabási A-L, Oltvai ZN. Network biology: understanding the cell’s functional organization. Nat Rev Genet. 2004;5(2):101–13. pmid:14735121
  12. 12. Vidal M, Cusick ME, Barabási A-L. Interactome networks and human disease. Cell. 2011;144(6):986–98. pmid:21414488
  13. 13. Schadt EE, Lamb J, Yang X, Zhu J, Edwards S, Guhathakurta D, et al. An integrative genomics approach to infer causal associations between gene expression and disease. Nat Genet. 2005;37(7):710–7. pmid:15965475
  14. 14. Su EY, Spangler A, Bian Q, Kasamoto JY, Cahan P. Reconstruction of dynamic regulatory networks reveals signaling-induced topology changes associated with germ layer specification. Stem Cell Reports. 2022;17(2):427–42. pmid:35090587
  15. 15. Guillén-Gosálbez G, Miró A, Alves R, Sorribas A, Jiménez L. Identification of regulatory structure and kinetic parameters of biochemical networks via mixed-integer dynamic optimization. BMC Syst Biol. 2013;7:113. pmid:24176044
  16. 16. Michailidis G, d’Alché-Buc F. Autoregressive models for gene regulatory network inference: sparsity, stability and causality issues. Math Biosci. 2013;246(2):326–34. pmid:24176667
  17. 17. Schmidt H, Cho K-H, Jacobsen EW. Identification of small scale biochemical networks based on general type system perturbations. FEBS J. 2005;272(9):2141–51. pmid:15853799
  18. 18. Kirst C, Timme M, Battaglia D. Dynamic information routing in complex networks. Nat Commun. 2016;7:11061. pmid:27067257
  19. 19. Kim J, Bates DG, Postlethwaite I, Heslop-Harrison P, Cho K-H. Least-squares methods for identifying biochemical regulatory networks from noisy measurements. BMC Bioinformatics. 2007;8:8. pmid:17212835
  20. 20. Pang J, Wang A. Iterative Structural Inference of Directed Graphs. In: Advances in Neural Information Processing Systems 35, 2022. 8717–30. https://doi.org/10.52202/068431-0634
  21. 21. Gong H, Klinger J, Damazyn K, Li X, Huang S. A novel procedure for statistical inference and verification of gene regulatory subnetwork. BMC Bioinformatics. 2015;16 Suppl 7(Suppl 7):S7. pmid:25952938
  22. 22. Larvie JE, Sefidmazgi MG, Homaifar A, Harrison SH, Karimoddini A, Guiseppi-Elie A. Stable Gene Regulatory Network Modeling From Steady-State Data. Bioengineering (Basel). 2016;3(2):12. pmid:28952574
  23. 23. Warne DJ, Baker RE, Simpson MJ. Simulation and inference algorithms for stochastic biochemical reaction networks: from basic concepts to state-of-the-art. J R Soc Interface. 2019;16(151):20180943. pmid:30958205
  24. 24. Yu D, Righero M, Kocarev L. Estimating topology of networks. Phys Rev Lett. 2006;97(18):188701. pmid:17155589
  25. 25. Wang L, Huang S, Wang S, Liao J, Li T, Liu L. A survey of causal discovery based on functional causal model. Engineering Applications of Artificial Intelligence. 2024;133:108258.
  26. 26. Gong C, Zhang C, Yao D, Bi J, Li W, Xu Y. Causal Discovery from Temporal Data: An Overview and New Perspectives. ACM Comput Surv. 2024;57(4):1–38.
  27. 27. Bhaskar D, Magruder DS, Morales M, De Brouwer E, Venkat A, Wenkel F, et al. Inferring Dynamic Regulatory Interaction Graphs from Time Series Data with Perturbations. Proc Mach Learn Res. 2024;231:22. pmid:40678737
  28. 28. Theocharous A, Gregoriou GG, Sapountzis P, Kontoyiannis I. Temporally Causal Discovery Tests for Discrete Time Series and Neural Spike Trains. IEEE Trans Signal Process. 2024;72:1333–47.
  29. 29. Wan J, Kataoka J, Sivakumar J, Peña E, Che Y, Sayama H, et al. Sparse Bayesian Learning for Sequential Inference of Network Connectivity From Small Data. IEEE Trans Netw Sci Eng. 2024;11(6):5892–902.
  30. 30. Casadiego J, Maoutsa D, Timme M. Inferring Network Connectivity from Event Timing Patterns. Phys Rev Lett. 2018;121(5):054101. pmid:30118266
  31. 31. Rouillard L, Ambrogioni L, Wassermann D. Robust and highly scalable estimation of directional couplings from time-shifted signals. In: 2024. https://arxiv.org/abs/240602545
  32. 32. Yang J, Huang J, Chen Q. Construction of time series causal network based on partial rank correlation. Knowledge-Based Systems. 2024;295:111865.
  33. 33. Irribarra N, Michell K, Bermeo C, Kristjanpoller W. A multi-head attention neural network with non-linear correlation approach for time series causal discovery. Applied Soft Computing. 2024;165:112062.
  34. 34. Casadiego J, Nitzan M, Hallerberg S, Timme M. Model-free inference of direct network interactions from nonlinear collective dynamics. Nat Commun. 2017;8(1):2192. pmid:29259167
  35. 35. Mizera A, Pang J, Tong TP, Wang A. Benchmarking Structural Inference Methods for Interacting Dynamical Systems with Synthetic Data. In: Advances in Neural Information Processing Systems 37, 2024. 135129–85. https://doi.org/10.52202/079017-4295
  36. 36. Ma’ayan A. Introduction to network analysis in systems biology. Sci Signal. 2011;4(190):tr5. pmid:21917719
  37. 37. Zhao C, Dong J, Deng L, Tan Y, Jiang W, Cai Z. Molecular network strategy in multi-omics and mass spectrometry imaging. Curr Opin Chem Biol. 2022;70:102199. pmid:36027696
  38. 38. Amara A, Frainay C, Jourdan F, Naake T, Neumann S, Novoa-Del-Toro EM, et al. Networks and Graphs Discovery in Metabolomics Data Analysis and Interpretation. Front Mol Biosci. 2022;9:841373. pmid:35350714
  39. 39. Maizels RJ, Briscoe J. Gene regulatory networks: from correlative models to causal explanations. Nat Rev Genet. 2026;27(6):485–98. pmid:41803457
  40. 40. Pearl J. Causal inference in statistics: An overview. Statistics Surveys. 2009;3:96–146.
  41. 41. Liu F, Chan L. Causal Inference on Discrete Data via Estimating Distance Correlations. Neural Comput. 2016;28(5):801–14. pmid:26890344
  42. 42. Marx A, Vreeken J. Telling cause from effect using MDL-based local and global regression. In: 2017 IEEE International Conference on Data Mining (ICDM), 2017. 307–16.
  43. 43. Marx A, Vreeken J. Telling cause from effect by local and global regression. Knowl Inf Syst. 2018;60(3):1277–305.
  44. 44. Rosato A, Tenori L, Cascante M, De Atauri Carulla PR, Martins Dos Santos VAP, Saccenti E. From correlation to causation: analysis of metabolomics data using systems biology approaches. Metabolomics. 2018;14(4):37. pmid:29503602
  45. 45. Peel L, Peixoto TP, De Domenico M. Statistical inference links data and theory in network science. Nat Commun. 2022;13(1):6794. pmid:36357376
  46. 46. Cliff OM, Bryant AG, Lizier JT, Tsuchiya N, Fulcher BD. Unifying pairwise interactions in complex dynamics. Nat Comput Sci. 2023;3(10):883–93. pmid:38177751
  47. 47. Barzel B, Barabási A-L. Network link prediction by global silencing of indirect correlations. Nat Biotechnol. 2013;31(8):720–5. pmid:23851447
  48. 48. Glymour C, Zhang K, Spirtes P. Review of Causal Discovery Methods Based on Graphical Models. Front Genet. 2019;10:524. pmid:31214249
  49. 49. Palus M, Vejmelka M. Directionality of coupling from bivariate time series: how to avoid false causalities and missed connections. Phys Rev E Stat Nonlin Soft Matter Phys. 2007;75(5 Pt 2):056211. pmid:17677152
  50. 50. Eichler M, Dahlhaus R, Sandkühler J. Partial correlation analysis for the identification of synaptic connections. Biol Cybern. 2003;89(4):289–302. pmid:14605893
  51. 51. Aghdam R, Ganjali M, Zhang X, Eslahchi C. CN: a consensus algorithm for inferring gene regulatory networks using the SORDER algorithm and conditional mutual information test. Mol Biosyst. 2015;11(3):942–9. pmid:25607659
  52. 52. Zhang X, Zhao J, Hao J-K, Zhao X-M, Chen L. Conditional mutual inclusive information enables accurate quantification of associations in gene regulatory networks. Nucleic Acids Res. 2015;43(5):e31. pmid:25539927
  53. 53. Jacob Y, Winetraub Y, Raz G, Ben-Simon E, Okon-Singer H, Rosenberg-Katz K, et al. Dependency Network Analysis (DEPNA) Reveals Context Related Influence of Brain Network Nodes. Sci Rep. 2016;6:27444. pmid:27271458
  54. 54. Monti F, Stewart D, Surendra A, Alecu I, Nguyen-Tran T, Bennett SAL, et al. Signed Distance Correlation (SiDCo): an online implementation of distance correlation and partial distance correlation for data-driven network analysis. Bioinformatics. 2023;39(5):btad210. pmid:37137236
  55. 55. Käding C, Runge J. Distinguishing cause and effect in bivariate structural causal models: A systematic investigation. Journal of Machine Learning Research. 2023;24(278):1–144.
  56. 56. Varando G, Catsis S, Diaz E, Camps-Valls G. Pairwise causal discovery with support measure machines. Applied Soft Computing. 2024;150:111030.
  57. 57. Niu W, Gao Z, Song L, Li L. Comprehensive review and empirical evaluation of causal discovery algorithms for numerical data. Journal of Machine Learning Research. 2024;(23):1–78.
  58. 58. Ronen M, Rosenberg R, Shraiman BI, Alon U. Assigning numbers to the arrows: parameterizing a gene regulation network by using accurate expression kinetics. Proc Natl Acad Sci U S A. 2002;99(16):10555–60. pmid:12145321
  59. 59. Karlebach G, Shamir R. Modelling and analysis of gene regulatory networks. Nat Rev Mol Cell Biol. 2008;9(10):770–80. pmid:18797474
  60. 60. Novák B, Tyson JJ. Design principles of biochemical oscillators. Nat Rev Mol Cell Biol. 2008;9(12):981–91. pmid:18971947
  61. 61. Goodwin BC. Oscillatory behavior in enzymatic control processes. Adv Enzyme Regul. 1965;3:425–38. pmid:5861813
  62. 62. Ruoff P, Vinsjevik M, Monnerjahn C, Rensing L. The Goodwin oscillator: on the importance of degradation reactions in the circadian clock. J Biol Rhythms. 1999;14(6):469–79. pmid:10643743
  63. 63. Zeiser S, Müller J, Liebscher V. Modeling the Hes1 oscillator. J Comput Biol. 2007;14(7):984–1000. pmid:17803375
  64. 64. François P, Despierre N, Siggia ED. Adaptive temperature compensation in circadian oscillations. PLoS Comput Biol. 2012;8(7):e1002585. pmid:22807663
  65. 65. Nitzan M, Casadiego J, Timme M. Revealing physical interaction networks from statistics of collective dynamics. Sci Adv. 2017;3(2):e1600396. pmid:28246630
  66. 66. Kuate PDK, Ito H, Fossi JT, Zhao M, Fozin TF, Fotsin HB, et al. From connectome to silicon: a biologically-inspired complex network of CMOS chaotic oscillators for analog brain emulation. Nonlinear Dyn. 2025;113(15):20227–52.
  67. 67. Scholkopf B, Locatello F, Bauer S, Ke NR, Kalchbrenner N, Goyal A, et al. Toward Causal Representation Learning. Proc IEEE. 2021;109(5):612–34.
  68. 68. Kalainathan D, Goudet O, Dutta R. Causal discovery toolbox: uncovering causal relationships in python. Journal of Machine Learning Research. 2020;21(37):1–5.
  69. 69. Buuren S van, Groothuis-Oudshoorn K. mice: Multivariate Imputation by Chained Equations inR. J Stat Soft. 2011;45(3).
  70. 70. Troyanskaya O, Cantor M, Sherlock G, Brown P, Hastie T, Tibshirani R, et al. Missing value estimation methods for DNA microarrays. Bioinformatics. 2001;17(6):520–5. pmid:11395428
  71. 71. Barabasi A, Albert R. Emergence of scaling in random networks. Science. 1999;286(5439):509–12. pmid:10521342
  72. 72. Albert R. Scale-free networks in cell biology. J Cell Sci. 2005;118(Pt 21):4947–57. pmid:16254242
  73. 73. Khanin R, Wit E. How scale-free are biological networks. J Comput Biol. 2006;13(3):810–8. pmid:16706727
  74. 74. Barabási A-L. Scale-free networks: a decade and beyond. Science. 2009;325(5939):412–3. pmid:19628854
  75. 75. Schaffter T, Marbach D, Floreano D. GeneNetWeaver: in silico benchmark generation and performance profiling of network inference methods. Bioinformatics. 2011;27(16):2263–70. pmid:21697125
  76. 76. Liu F, Zhang S-W, Guo W-F, Wei Z-G, Chen L. Inference of Gene Regulatory Network Based on Local Bayesian Networks. PLoS Comput Biol. 2016;12(8):e1005024. pmid:27479082
  77. 77. Foo M, Kim J, Bates DG. Modelling and Control of Gene Regulatory Networks for Perturbation Mitigation. IEEE/ACM Trans Comput Biol Bioinform. 2019;16(2):583–95. pmid:29994499
  78. 78. Zhang F, Macshane B, Searcy R, Huang Z. Mathematical Models for Cholesterol Metabolism and Transport. Processes. 2022;10(1):155.
  79. 79. Hoyer P, Janzing D, Mooij JM, Peters J, Schölkopf B. Nonlinear causal discovery with additive noise models. Advances in Neural Information Processing Systems. 2008;21.
  80. 80. Peters J, Mooij JM, Janzing D, Schölkopf B. Causal discovery with continuous additive noise models. 2014.
  81. 81. Fonollosa JAR. Conditional Distribution Variability Measures for Causality Detection. The Springer Series on Challenges in Machine Learning. Springer International Publishing. 2019. p. 339–47. https://doi.org/10.1007/978-3-030-21810-2_12
  82. 82. Daniusis P, Janzing D, Mooij J, Zscheischler J, Steudel B, Zhang K, Bernhard Schoelkopf Inferring deterministic causal relations. Conference: UAI 2010, Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence, Catalina Island, CA, USA; 2010. https://arxiv.org/abs/1203.3475
  83. 83. Blöbaum P, Janzing D, Washio T, Shimizu S, Schölkopf B. Analysis of cause-effect inference by comparing regression errors. PeerJ Comput Sci. 2019;5:e169. pmid:33816822
  84. 84. Leibovich N. Determining interaction directionality in complex biochemical networks from stationary measurements. Scientific Reports. 2025;15:3004.
  85. 85. Goudet O, Kalainathan D, Caillou P, Guyon I, Lopez-Paz D, Sebag M. Learning functional causal models with generative neural networks. Explainable and interpretable models in computer vision and machine learning. 2018. p. 39–80.
  86. 86. Lopez-Paz D, Nishihara R, Chintala S, Scholkopf B, Bottou L. Discovering Causal Signals in Images. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017. 58–66. https://doi.org/10.1109/cvpr.2017.14
  87. 87. Lopez-Paz D, Muandet K, Schölkopf B, Tolstikhin I. In: International Conference on Machine Learning, 2015. 1452–61. https://doi.org/10.48550/arXiv.1502.02398
  88. 88. Yuan Y, Ding X, Bar-Joseph Z. Causal inference using deep neural networks. In: 2020. https://arxiv.org/abs/2011.12508
  89. 89. Ton J-F, Sejdinovic D, Fukumizu K. Meta Learning for Causal Direction. AAAI. 2021;35(11):9897–905.