Skip to main content
Advertisement
  • Loading metrics

MR2G: A novel framework for causal network inference using GWAS summary data

  • Zhaotong Lin,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Software, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Statistics, Florida State University, Tallahassee, Florida, United States of America

  • Wei Pan,

    Roles Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Writing – original draft, Writing – review & editing

    Affiliation Division of Biostatistics and Health Data Science, School of Public Health, University of Minnesota, Minneapolis, Minnesota, United States of America

  • Haoran Xue

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Project administration, Visualization, Writing – original draft, Writing – review & editing

    xue.hr@cityu.edu.hk

    Affiliation Department of Biostatistics, City University of Hong Kong, Hong Kong, China

Abstract

Inferring a causal network among multiple traits is essential for unraveling complex biological relationships and informing interventions. Mendelian randomization (MR) has emerged as a powerful tool for causal inference, utilizing genetic variants as instrumental variables (IVs) to estimate causal effects. However, when the directions of causal relationships among traits are unknown, reconstructing the underlying causal network becomes challenging. In particular, the presence of cycles or feedback loops, which are common in biological systems, poses additional challenges for causal network inference, and remains largely under-studied with standard MR approaches and existing IV-based network inference methods. To address these issues, we introduce MR2G, a new statistical framework that enables robust inference of causal networks, including those with cycles, directly from GWAS summary statistics. MR2G is built on a formally defined recursive causal graph model that rigorously links direct causal effects to (univariable) MR estimands. It recovers a biologically interpretable causal network from pairwise MR effect estimates, while incorporating a network-informed IV screening strategy to reduce pleiotropic bias and improve robustness. Through realistic simulations, MR2G demonstrates superior accuracy and robustness in recovering complex causal structures, including those involving feedback loops. We apply MR2G to GWAS summary statistics for six complex diseases and nine cardiometabolic risk factors. MR2G not only recovers well-established causal pathways but also uncovers multiple feedback relationships, highlighting its utility in disentangling complex and biologically plausible causal networks from large-scale genetic data.

Author summary

Unraveling how complex traits causally influence each others is essential for understanding disease mechanisms and guiding effective interventions. However, inferring such a causal network from observational data remains challenging due to hidden confounding, and most existing methods assume that the underlying network has no feedback loops, which is often unrealistic in biological systems. In this work, we developed MR2G, a statistical framework that infers a causal network of multiple traits directly from GWAS summary statistics, while allowing for cycles and feedback loops. Our approach is based on a recursive causal model that formally links direct causal effects to Mendelian randomization estimands, and incorporates a novel instrumental variable screening strategy to reduce bias from horizontal pleiotropy. We applied MR2G to 15 complex traits including cardiometabolic risk factors and diseases, and identified biologically supported causal pathways, such as the direct effects of blood pressure and LDL cholesterol on coronary artery disease, and a bidirectional relationship between atrial fibrillation and coronary artery disease.

Introduction

Understanding how complex human traits causally influence one another is critical for uncovering disease mechanisms and identifying effective intervention targets. For example, traits like blood pressure, body mass index (BMI), and lipid levels are known to be interrelated and collectively influence the risk of cardiometabolic diseases. Modeling these interdependencies as a causal network allows researchers to move beyond isolated pairwise effects and instead capture the broader architecture of trait–trait relationships. However, inferring such networks from observational data is challenging due to confounding, reverse causation, and the presence of feedback loops.

Mendelian randomization (MR) provides a powerful tool for causal inference in the presence of unmeasured confounders by leveraging genetic variants, mostly single-nucleotide polymorphisms (SNPs), as instrumental variables (IVs) [1]. Traditional MR methods, both univariable (i.e., one exposure) [26] and multivariable (i.e., multiple exposures) [711], assume the directions of effects are known from exposures to outcomes. When causal direction between a pair of traits is unclear, bidirectional MR methods have been proposed to infer causal effects in both directions [1215], but they do not account for complex causal relationships among a set of interrelated traits.

The MR framework has considerable potential to be extended to causal network inference among traits [16,17], and several methods have been proposed for this purpose. Early efforts focused on mediation or two-step MR frameworks to test whether one trait mediates the effect of another on an outcome [18,19], but these approaches are limited to networks with only three (or a few more) traits, and require pre-specified directionality. More recent methods combine genetic instruments with structural learning algorithms to infer larger networks. For example, [20] developed BN-GWAS to construct a network that links multiple genes to an outcome. [21] introduced a linear structural equation model (SEM)-based method to infer a causal protein network without cycles. [22] proposed MrDAG, a Bayesian approach to estimate causal relationships among multiple exposures and outcomes under the assumption of no feedback loops. However, these approaches typically assume a directed acyclic graph (DAG) structure and/or often require the causal directions between exposures and outcomes to be specified a priori, limiting their applicability in settings where feedback or cyclic relationships are biologically plausible. For example, in type 2 diabetes (T2D), insulin resistance leads to elevated blood glucose, which in turn can exacerbate insulin resistance, forming a positive feedback cycle. Graph-MRcML [23] relaxed the DAG assumption by first estimating a total-(causal-)effect network using bidirectional MR, then followed by network deconvolution to reconstruct the direct-(causal-)effect network [24]. However, Graph-MRcML lacks a formal causal model, and the equivalence between the total-(causal-)effect network and the MR-estimated network may break down for cyclic networks (as to be discussed later).

To address the limitations of existing approaches, we introduce MR2G, a new framework for inferring the causal network among multiple complex traits using MR with GWAS summary statistics. MR2G is built upon a recursive model, which captures the propagation of causal effects through the (potentially cyclic) direct-(causal-)effect network of interest denoted by G. Under appropriate assumptions, this dynamic system converges to an equilibrium state, allowing us to define the MR-(causal-)effect network denoted by which can be estimated by MR, and derive an analytical expression linking G and . With this insight, MR2G first estimates using bidirectional MR, and then directly reconstructs the direct-effect network G from the estimated , without relying on structural learning or the heuristic network deconvolution algorithm. To further improve robustness, we introduce a novel IV screening strategy ScreenAug when performing MR. Unlike standard Steiger-type filtering [25] commonly used in bidirectional MR analysis, ScreenAug incorporates the inferred network structure to further exclude SNPs that may exert correlated pleiotropic effects on the outcome through alternative pathways after conditioning on the exposure.

We apply MR2G to investigate causal relationships among nine cardiometabolic risk factors and six diseases, including four major cardiometabolic diseases. MR2G successfully identifies multiple plausible and biologically meaningful causal relationships, including the independent effects of systolic blood pressure (SBP) and low-density lipoprotein (LDL) cholesterol on coronary artery disease (CAD), while traits such as BMI and triglycerides (TG) appear to exert more indirect effects. It also reveals BMI and height as direct contributors to atrial fibrillation (AF), a bidirectional causal link between AF and CAD, and AF as a major upstream driver of stroke. These findings highlight the ability of MR2G to robustly recover interpretable causal networks from large-scale genetic data, even in the presence of complex interdependencies and feedback loops.

Results

Method overview

MR2G is a two-stage framework for inferring direct-(causal-)effect networks from GWAS summary statistics. It is based on a recursive model that allows for feedback loops and cycles, capturing how traits influence each other over time. Specifically, we consider a dynamic system where trait values evolve over discrete time points: (1) At the initial time point, the trait values are determined by SNPs (with their direct effects on traits denoted by matrix ) and random errors; (2) At each subsequent time point, the trait values are updated by combining carry-over effects from the previous time point together with effects propagated through the direct-(causal-)effect network denoted by matrix G (Fig 1A). Under mild conditions on the spectral radius of G, this system converges to an equilibrium state, at which the total effects of SNPs on traits (denoted by matrix B) are jointly determined by G and . For each ordered pair of traits, with SNPs being their valid IVs (see Definition 1), we can define an estimable MR-(causal-)effect using total SNP effects from B, and eventually derive an analytical relationship between G and the MR-(causal-)effect network as shown in Eq (7). More details about the model are in the section Causal network model with cyclic structures.

thumbnail
Fig 1. Illustrations of the proposed cyclic causal network model and the MR2G method.

Panel A shows that in a causal network of three traits, values of traits evolve over time and can be decomposed as the sum of values from the previous time point (i.e., carry-over effects) and changes due to causal effects (i.e., propagated effects). In panel B, for a causal network of four traits, MR2G first estimates an MR-effect network and then constructs the direct-effect network from the estimated MR-effect network.

https://doi.org/10.1371/journal.pgen.1012144.g001

MR2G operates in two stages (Fig 1B). In Stage 1, we obtain the estimated MR-effect network via a robust bidirectional (univariable) MR method across all trait pairs. Specifically, we adopt our previously developed method, (univariable) MRcML [6], which accommodates invalid IVs exhibiting both correlated and uncorrelated pleiotropy under a plurality condition [6,12,23]. In Stage 2, based on , MR2G estimates the direct-effect network as using the connection between G and in Eq (7). To enable robust statistical inference on the entries of , we employ a data perturbation (equivalent to parametric bootstrapping) procedure based on a matrix normal distribution that captures both the linkage disequilibrium (LD) structure among the genetic instruments and the correlation structure of GWAS summary statistics. Further implementation details are provided in the section MR2G: estimating causal networks from GWAS summary statistics.

To further enhance robustness in MR estimation and network inference, we develop a novel IV selection strategy, ScreenAug, which leverages the estimated causal network to refine instrument selection (see the section IV screening procedure). It is applied after an initial candidate IV set being selected with ScreenPair and ScreenMax. Briefly, ScreenPair implements Steiger’s filtering by selecting a SNP as an instrument for the trait with which it has a stronger marginal association than that with the other trait [25]. However, this pairwise screening rule can lead to overlapping IVs across trait pairs, increasing the risk of using SNPs with horizontal pleiotropy. As demonstrated in our simulations, this issue becomes more pronounced with increasing GWAS sample sizes, as weaker effects become more likely to pass the selection threshold. To mitigate this issue, ScreenMax globally prunes the candidate IV set by retaining only the strongest SNP–trait association across all traits and assigns the SNP as an IV exclusively for that trait. For example, in Fig 2A, SNPs in the set Z3 are associated with both traits Y1 and Y2, yet are not valid IVs for either of them due to the shared confounder Y3. While ScreenPair may incorrectly use Z3 to infer either or based on marginal association strengths, ScreenMax will only use them as IVs for Y3, as associations of Z3 with Y3 are stronger than with Y1 and Y2. However, ScreenMax can also exclude valid instruments. To balance stringency and sensitivity, ScreenAug augments the ScreenMax-filtered set by reintroducing a subset of excluded SNPs if they are deemed valid based on the estimated causal network. Specifically, a SNP is re-included as an IV for a trait pair only if the inferred network indicates no direct path through which the SNP could influence the outcome after conditioning on the exposure.

thumbnail
Fig 2. True direct-effect networks used in simulation studies.

In panels A and B, Y1, Y2, Y3 are three traits, and represent three sets of SNPs exerting direct effects on the corresponding traits. Panel C is estimated based on real data application.

https://doi.org/10.1371/journal.pgen.1012144.g002

In summary, MR2G combines a theoretically sound cyclic causal network model, robust (univariable) MR estimation via MRcML with a network-aware IV screening procedure, and a closed-form graph recovery algorithm to enable robust and interpretable causal network inference from GWAS summary data while allowing for feedback loops and cycles, invalid instruments, and complex trait interdependencies.

ScreenAug outperforms other IV selection strategies by leveraging the inferred network structure

To evaluate the impact of IV screening on causal network estimation, we first conducted simulation studies under simple settings with three traits (Fig 2A and Fig 2B). We compared three IV selection strategies, ScreenPair, ScreenMax, and the proposed ScreenAug (see the sections Method overview and IV screening procedure), each applied within the MR2G framework.

This simulation involved three traits and a set of 30 SNPs, among which 8 SNPs directly affected Y1 (), 7 SNPs directly affected Y2 (), and 15 SNPs directly affected Y3 (). In the first scenario of Fig 2A, when the sample size was N = 50000, MR2G coupled with all three IV screening methods worked similarly well, with well-controlled type-I error (Fig 3A left panel) and almost no bias (Figure A in S1 Appendix). However, it might seem surprising that when the sample size increased to N = 200 000 (Fig 3A right panel), the performance of ScreenPair dropped dramatically with highly inflated type-I error and substantial bias (Figure A in S1 Appendix). This is because when the sample size increased, SNPs in Z3 reached the significance threshold for marginal associations with Y1 (and Y2). When performing a bi-directional MR analysis on Y1 and Y2, ScreenPair incorrectly selected the SNPs in Z3 as IVs for Y1, due to their stronger correlations with Y1 than that with Y2 (as ). However, these SNPs in Z3 were all invalid IVs for Y1 in this scenario as they exhibited pleiotropic effects on Y2. Furthermore, since Z3 constituted the largest group (), the plurality assumption required by MRcML was violated, leading to poor estimation of the MR effect . It is also noted that, although ScreenPair could correctly estimate the MR effect of , the spurious effect of effectively accounted for the true edge through the indirect pathway , causing the estimated direct effect of to shrink toward zero and loss of power (Figure A in S1 Appendix). In contrast, ScreenMax demonstrated robust performance by selecting SNPs as IVs only for the trait with which they had the strongest marginal association. As a result, only SNPs in Z1 were used as IVs for Y1 in the analysis, successfully recovering the true direct-effect network. ScreenAug performed similarly to ScreenMax, as the initial direct-effect network estimated using ScreenMax ensured that SNPs in Z3 were not added into the IV set for Y1 when inferring . This is because, after removing Y1, these SNPs still had a path to Y2 via Y3.

thumbnail
Fig 3. Proportions of rejections by MR2G across three IV-selection approaches in (A) the acyclic setting (Fig 2A) and (B) the cyclic setting (Fig 2B).

Each cell reports the rejection proportion for the effect from the column trait to the row trait. Within each panel, the left column corresponds to N = 50,000 and the right column to N = 200,000. Power is indicated in blue and type I error in red.

https://doi.org/10.1371/journal.pgen.1012144.g003

In the second scenario Fig 2B, all three screening methods performed well with well-controlled type-I error (Fig 3B) and almost unbiased estimates (Figure B in S1 Appendix). Unlike in Fig 2A, ScreenPair did not have the same issue when the sample size was large. This is because, in this scenario, SNPs in Z3 became valid IVs for Y1 when inferring . Notably, ScreenAug achieved higher power than ScreenMax for inferring when N = 50 000 (0.75 v.s. 0.50). This is because ScreenMax’s conservative global pruning assigned SNPs in Z3 exclusively to Y3, even though they were also valid IVs for Y1 in this setting. ScreenAug recovered these valid IVs by verifying that, after removing Y1 from the reference graph, no alternative path existed from these SNPs to Y2, thereby augmenting the IV set and improving power. This illustrates the advantage of ScreenAug that it retains the robustness of ScreenMax while recovering valid IVs that ScreenMax unnecessarily excludes.

Furthermore, we also applied the Graph-MRcML approach [23] for comparison, which used ScreenPair for IV selection and the network deconvolution algorithm for estimating direct-(causal-)effect network G. The results are provided in Figure C in S1 Appendix. Overall, Graph-MRcML had performance similar to the performance of ScreenPair in Fig 3. It is worth noting that, in scenario Fig 2B, although all selected IVs were valid, it yielded a slightly biased estimate of the direct-effect network. This is because Graph-MRcML [23] approximated the off-diagonal elements of the total-effect network as defined in Eq (9) by pairwise MR. As shown in the section Relationship with network deconvolution, this approximation became inaccurate in the presence of cycles ( in this case), leading to a biased estimate of the direct-effect network.

We conducted several additional sensitivity analyses for the three-trait simulation. First, we varied the AR(1) residual correlation to assess robustness to confounding strength (Section 3.3.2 in S1 Appendix). Second, we reduced the direct-effect sizes and controlled per-trait SNP heritability at different levels to evaluate the performance (Section 3.3.3 in S1 Appendix). Third, we introduced an explicit latent confounder U with direct effects on all three traits to examine robustness beyond correlated residuals (Section 3.3.4 in S1 Appendix). Across all settings, the main conclusions remained consistent that ScreenMax and ScreenAug maintained well-controlled type-I error, while ScreenPair exhibited inflated false positives in the acyclic setting when indirect associations became detectable. Additionally, we compared MR2G with conventional univariable MR methods (IVW, MR-Egger, and MR-PRESSO) to illustrate the limitations of standard approaches in the network setting (Section 3.3.5 in S1 Appendix).

MR2G accurately reconstructs complex causal networks in simulations designed to mimic real-world GWAS data

To evaluate the performance of MR2G under realistic genetic architectures, we designed a simulation using six traits and an underlying direct-effect network estimated from real GWAS data (Fig 2C, Fig 4A). We directly simulated the GWAS summary statistics of 1056 SNPs across 392 LD blocks that closely mimicked the real data analysis and preserved realistic LD patterns (see the section Simulation based on real GWAS summary data for details).

thumbnail
Fig 4. Simulation results based on real data application in Fig 2C.

Panel A shows the true direct-effect network G used in the simulation. Panel B shows the mean of estimated by MR2G across 200 replicates (values are rounded to three decimal places). Panels C and D show the rejection rate of MR2G and Graph-MRcML, respectively. Power is indicated by a blue colour scale; type I error is indicated by a red colour scale.

https://doi.org/10.1371/journal.pgen.1012144.g004

MR2G successfully recovered the direct-effect network, achieving well-controlled type-I error (Fig 4C) and nearly unbiased estimates (Fig 4B). In contrast, Graph-MRcML, which is also a two-stage approach that allows for cycles, exhibited substantially inflated type-I error for several pathways involving Stroke (Fig 4D). The unsatisfactory performance of Graph-MRcML is primarily attributable to its IV screening step, where ScreenPair tends to include invalid instruments with pleiotropic effects, as discussed next.

We examined the other two IV screening strategies used by MR2G. ScreenPair exhibited highly inflated type-I error rates in some pairs, particularly for the pathways involving Stroke (Figure D in S1 Appendix). This inflation was primarily due to the poor estimation of the MR effects for the pathways CAD → Stroke and Stroke → AF (Figure E in S1 Appendix). ScreenMax produced results similar to ScreenAug but with a slightly higher root mean squared error (Table A in S1 Appendix). We further investigated the poor performance of ScreenPair. When inferring the causal relationship CAD → Stroke, ScreenPair selected many SNPs that had a direct effect on SBP, which in turn had an indirect effect on Stroke that did not mediate through CAD. A similar issue arose when inferring the relationship Stroke → AF. These findings are consistent with the simpler setting in the previous section and further confirm the robustness of ScreenAug in the presence of realistic polygenic architectures and complex causal network structure with indirect pathways.

In summary, the proposed method MR2G reliably recovered complex causal networks with accurate effect estimates and well-controlled false positives. These results highlight the effectiveness of combining a theoretically sound network recovery framework with robust MR estimation and the ScreenAug strategy, which improves IV selection in the presence of horizontal pleiotropy and trait interdependencies. To further assess robustness, we repeated this simulation under weaker causal effects by scaling the direct-effect matrix G in Fig 4A by (Section 3.4.1 in S1 Appendix). The results were consistent with the above conclusions, with ScreenMax and ScreenAug maintaining well-controlled type-I error across all scaling factors.

MR2G reveals biologically plausible network among 15 traits with large-scale GWAS summary data

We applied MR2G to infer a causal network among 15 complex traits, including 9 common risk factors — body mass index (BMI), height, systolic blood pressure (SBP), low-density lipoprotein (LDL) cholesterol, high-density lipoprotein (HDL) cholesterol, triglycerides (TG), fasting glucose (FG), smoking, and alcohol drinking — and six major diseases: coronary artery disease (CAD), stroke, atrial fibrillation (AF), type 2 diabetes (T2D), Alzheimer’s disease (AD), and asthma. Fig 5A and Fig 5B show the MR-effect and direct-effect networks inferred by MR2G (with 1000 data perturbations/bootstrap samples) using IVs selected by ScreenAug. Edges represent significant effects passing Bonferroni corrected p-value threshold, with green and red indicating positive and negative effects, respectively. The estimated effect sizes with their standard errors are shown in the Figures G and H in S1 Appendix.

thumbnail
Fig 5. Networks among 15 complex traits inferred by MR2G with GWAS summary data.

Panel A is the estimated MR-effect network , and panel B is the estimated direct-effect network . Edge color indicates the sign of an estimated effect (green for positive and red for negative), and edge width is proportional to its magnitude.

https://doi.org/10.1371/journal.pgen.1012144.g005

First, we discuss several key findings on the refined causal landscape of the three cardiovascular diseases. For CAD, the estimated MR-effect network indicated multiple risk factors, including BMI, height, FG, SBP, and lipid traits (LDL and TG). However, after accounting for potential indirect effects mediated by other traits, only SBP and LDL remained as independent direct causal factors. This is consistent with findings from large-scale studies such as the Cholesterol Treatment Trialists’ Collaboration [26], which demonstrated LDL’s direct causal role in CAD, and several large studies [27,28], which highlighted SBP as a robust, independent predictor of CAD risk. Other factors including BMI, height, FG, and TG often exert their effects indirectly through mediators such as hypertension, dyslipidemia, or insulin resistance [2931]. The direct-effect network also highlighted the role of SBP as a key determinant of stroke. Furthermore, while the MR-effect network suggested a marginally significant Alcohol → Stroke relationship with an effect size of 0.26 (standard error = 0.09, p-value = 0.0018), this association completely disappeared in the direct-effect network with an effect size of 0.03 and p-value greater than 0.7, indicating that alcohol’s influence on stroke risk is largely mediated through increases in SBP. The direct-effect graph also provided important insights into AF and its relationship with CAD and stroke. BMI and height were identified as strong independent risk factors for AF, consistent with the well-established roles of obesity in atrial remodeling [32] and the emerging evidence that taller individuals have a greater predisposition to AF due to larger atrial dimensions and prolonged conduction pathways [33,34]. Furthermore, a bidirectional causal relationship between CAD and AF was observed, suggesting a complex interplay between these conditions [35]. This supports previous studies indicating that atherosclerosis may contribute to AF via systemic inflammation and endothelial dysfunction [36], while AF may exacerbate CAD risk through altered hemodynamics and increased thromboembolic burden [37,38]. Additionally, AF was confirmed as a direct causal risk factor for stroke, reinforcing its role as a major contributor to cardioembolic stroke [39,40]. Interestingly, the direct CAD → stroke relationship was only marginally significant in the direct-effect graph, suggesting that effect of CAD on stroke is partially mediated through AF or other vascular mechanisms rather than a direct causal pathway.

Beyond findings related to the three cardiovascular diseases, the FG ↔ T2D ↔ BMI pathway provided insights into the metabolic interdependence of diabetes and obesity. The bidirectional relationship between fasting glucose and T2D is expected, as elevated FG levels contribute to diabetes onset, while T2D exacerbates glycemic dysregulation through insulin resistance and beta-cell dysfunction. Interestingly, while obesity is a well-established risk factor for T2D (BMI → T2D), a negative causal effect of T2D on BMI was discovered, suggesting that the onset of diabetes may lead to weight loss. This is consistent with clinical observations where unintended weight loss is one of the symptoms of diabetes [41], often resulting from dehydration, metabolic inefficiency, increased glucose excretion, and muscle breakdown [42,43]. Further studies are warranted to explore the underlying mechanisms and potential clinical implications of this bidirectional relationship.

The direct-effect network also revealed the causal relationship between height and AD, suggesting that taller individuals may have a reduced risk of AD. This finding aligns with recent observational studies [44] and MR studies [45,46] reporting height as a protective factor of AD. Moreover, the effect sizes of height → AD remained similar between the MR-effect and direct-effect graphs, suggesting that the effect was not strongly influenced by mediation through other pathways considered in this analysis. Lastly, as a negative control, asthma was included to assess the specificity of the inferred causal effects. Consistent with what is expected, no significant causal relationship was identified between asthma and key metabolic or cardiovascular traits.

We also evaluated the impact of different IV screening methods, ScreenAug, ScreenMax, and ScreenPair, on the inferred direct-effect network structure (Figure A in S1 Appendix). The estimated networks were broadly consistent across all IV screening approaches, with only minor differences due to p-values falling slightly below the Bonferroni-adjusted threshold. For example, ScreenPair failed to detect the bidirectional relationship CAD ↔ AF after Bonferroni correction, despite these associations being nearly significant. ScreenAug and ScreenMax obtained similar direct-effect networks, but ScreenAug provided more precise causal effect estimates with smaller standard errors in over 75% of the estimated causal effects.

Discussion

Understanding how complex traits influence one another is essential for uncovering disease mechanisms and designing effective interventions. In this study, we introduce MR2G, a novel statistical framework that enables robust inference of direct-(causal-)effect networks among multiple traits from GWAS summary statistics, even in the presence of feedback loops/cycles and horizontal pleiotropy of SNPs. MR2G consists of three key components: (i) a recursive dynamic model that accommodates cyclic effects among traits, (ii) a closed-form graph reconstruction algorithm based on pairwise MR estimates, and (iii) a new IV selection strategy, ScreenAug, which uses the estimated network structure to mitigate bias from correlated pleiotropy.

Many existing approaches for causal network inference from GWAS data rely on structural equation models or Bayesian frameworks that assume a DAG structure. While these methods have enabled progress in disentangling trait relationships, they are fundamentally limited in settings where feedback loops and bidirectional influences are biologically plausible. In contrast, MR2G is built upon a recursive causal model that naturally accommodates cyclic networks and provides a theoretically sound link between direct effects and marginal MR effects. Compared to our previous Graph-MRcML framework [23], which also allows for feedback loops, MR2G replaces the heuristic network deconvolution step with a closed-form graph recovery algorithm, improving both the theoretical validity and computational efficiency. Additionally, MR2G extends Steiger’s filtering used in Graph-MRcML to ScreenAug for IV selection, which leverages the estimated causal structure to exclude instruments with potential pleiotropic effects, leading to more robust causal estimates.

It is also important to distinguish MR2G from conventional MR methods. Univariable MR methods such as IVW [47], MR-Egger [2], and MR-PRESSO [48], along with MRcML [6], estimate the pairwise MR effect , which captures both direct and indirect causal effects, but cannot disentangle the two. MR2G addresses this by recovering the direct-effect network G from the MR-effect network via Eq (7). We have further shown that the resulting direct effects are the same as those from multivariable MR [7], which estimates the effect of each exposure on an outcome conditional on other exposures. However, unlike MVMR, MR2G does not require a pre-specified exposure–outcome direction and recovers the entire direct-effect network simultaneously. There may also be other technical challenges with application of MVMR, such as the exacerbated weak IV bias with multiple correlated exposures as often encountered [10]. MR2G can be regarded as another way to achieve the goal of MVMR (and beyond) through applying simpler univariable MR.

Simulation studies demonstrate that MR2G achieved well-calibrated type-I error control and unbiased estimation, even in challenging scenarios with dense horizontal pleiotropy and complex indirect pathways. The proposed ScreenAug procedure outperformed alternative IV screening strategies, offering greater robustness and improved precision in causal effect estimates. Although the benefits of ScreenAug were more modest in real data analyses with current GWAS sample sizes, simulation studies suggest that its advantages will become increasingly pronounced as sample sizes grow and weaker GWAS effects become more detectable. A sensitivity analysis with five traits further demonstrated that ScreenAug is robust to moderate misspecification of the reference graph used for IV augmentation, provided that edges involved in confounder-like motifs are captured (Section 3.5 in S1 Appendix). When applied to 15 complex traits, MR2G recovered biologically supported relationships, such as the direct effects of SBP and LDL cholesterol on CAD, the feedback loop between AF and CAD, and feedback loops among T2D, FG, and BMI.

The proposed MR2G framework still has certain limitations, and there are several promising extensions could be pursued in future work. First, although the recursive model guarantees convergence under a commonly adopted spectral radius condition, this constraint is not enforced during estimation, making it possible that the spectral radius of the estimated causal network is greater than 1. Second, although ScreenAug empirically enhanced IV selection, its performance depends on the accuracy of the inferred direct-effect graph and thus inherits assumptions required by MRcML, such as the plurality condition. In practice, other robust UVMR methods, such as MR-APSS [49], can also be used in the MR2G framework, while their performance need to be further evaluated. In addition, ScreenAug cannot fully eliminate invalid IVs with (correlated) horizontal pleiotropy mediated by traits outside the analyzed network. Third, when the number of available IVs is limited, the significance threshold for IV selection could be relaxed (e.g., from to ) to include more candidate IVs. While this strategy increases the number of IVs, it may also introduce more weak IVs; some existing methods, such as MR-RAPS [50] (and likely MRcML [10] as we are currently studying), are robust to weak IVs, and thus can be used. Fourth, MR2G could be extended to incorporate correlated SNPs as IVs following existing methods such as cis-MRcML [51], which can increase the number of usable IVs (especially for molecular traits) and improve statistical efficiency. Fifth, while MR2G is built on a recursive causal model, Algorithm 1 is algebraically equivalent to the formula of [52]. In principle, MR2G can be extended to leverage interventional data (e.g., Perturb-seq) by using interventions instead of SNPs as IVs to infer gene regulatory networks, as in recent works [52,53]. Finally, future extensions to high-dimensional traits through penalization or regularization techniques will further enhance the utility of the MR2G framework.

In summary, MR2G provides a powerful and robust tool for elucidating causal relationships among complex traits using GWAS summary data. As biobank-scale datasets continue to grow, we expect that MR2G will facilitate more accurate construction of causal networks, enabling deeper understanding of the biological mechanisms underlying human traits and diseases.

Materials and methods

Causal network model with cyclic structures

Recursive model.

Suppose we have T traits of interest denoted as , and S SNPs as IVs denoted as . To accommodate potential feedback loops/cyclic causal relationships among traits, we model the system dynamically, allowing trait values to evolve over time. Denote as a series of discrete time points, the value of trait Yi at time t is denoted as , and . For the convenience of notation, we denote Y(0) as a vector of 0’s with length T. We adopt the following recursive model

(1)(2)

Eq (1) is the initialization of the model at t = 1. Here is a T × S matrix, with its element in the ith row and kth column being the direct effect from Zk to Yi. The random errors are correlated, and their correlations account for unmeasured confounders. Eq (2) represents a second-order difference equation, which describes the recursive process. It consists of two components: a carry-over term Y(t−1) and a propagated change , which captures how recent changes in trait values propagate through the direct-(causal-)effect network G. Therefore, the matrix is the causal network of our main interest, where each entry gij encodes the direct causal effect from Yj to Yi. More specifically, if the value of Yj changes by from time t − 2 to t − 1, then this change will cause Yi to change at time t (comparing to time t − 1) by an amount of . This second-order formulation reflects how changes in one trait can have (delayed) effects on other traits, a feature especially important for modeling feedback and cyclic relationships. The example in Fig 6 is an illustration of the recursive model.

thumbnail
Fig 6. An illustrative example for the recursive model with three traits.

The figure on the left shows the causal model with cycle and bidirectional causal effects between Y1 and Y3, effect sizes are marked on the directed edges. As shown by the equations on the right, values of traits at time t are recursively determined by values at times t − 1 and t − 2.

https://doi.org/10.1371/journal.pgen.1012144.g006

Denote , with I being the identity matrix. From Eqs (1) and (2), for any t ≥ 1 we have

(3)

We make the following Assumption 1 about the causal network G.

Assumption 1 We have (1) the spectral radius of G (i.e., the largest absolute value of all real and complex eigenvalues of G) is less than 1, and (2) all diagonal elements of G (i.e., gii’s) are 0. In the Assumption 1, part (1) about the spectral radius (also commonly adopted in previous studies such as [54]) guarantees the dynamic model will converge to an equilibrium state, and part (2) requires that any trait cannot have direct effect on itself (i.e., there is no self-loop). With part (1) of Assumption 1, the inverse always exists, and we can calculate the sum in (3) as

(4)

Marginal model.

In most real-world applications, it is (almost) impossible to measure Y(t) at different times t’s, and thus the observed values of traits are treated as fixed. With part (1) of Assumption 1, we have Gt converges to a matrix of 0’s as . Therefore, we have with the marginal model

(5)

In the marginal model (5), is a T × S matrix with its element in the ith row and kth column being the total (or marginal) effect from Zk to Yi. And is the vector of marginal random errors. For the convenience of notation, we denote , and wij is the element in the ith row and jth column of W.

Notably, Eq (5) coincides with the commonly-used linear structural equation model (SEM):

(6)

which describes the relationship among endogenous variables (i.e., traits Y) and exogenous variables (i.e. IVs Z). While this SEM representation is static, our recursive modeling provides a dynamic interpretation of how changes in one trait propagate through the causal network over time. In particular, it reveals how the system evolves toward a steady state (or equilibrium) described in Eq (6).

MR-(causal-)effect network.

Suppose we have an ordered sequence of M variables with an integer M ≥ 2, such that Ql has non-zero direct effect on Ql+1 for . Since an IV can have direct effects on traits and cannot be directly affected by a trait or another IV, we have Q1 being an IV or a trait (i.e., ) and Ql being a trait (i.e., ) for l ≥ 2. Therefore, is a directed path with M variables and of length (M − 1), and we denote this directed path as . Note that, different Ql’s can be the same trait due to existence of cycles in the causal network. With the concept of directed path, we can define valid IV as follows.

Definition 1. For an IV Zk and two traits Yi and Yj, if for any directed path QM from Zk to Yi (i.e., Q1 = Zk and QM = Yi), we always have Yj is on the directed path (i.e., there is some 1 < l < M such that Ql = Yj), then Zk is a valid IV for Yj to Yi. Furthermore, if Zk is a valid IV for Yj to Yi for all i ≠ j (which is equivalent to and for all ), then Zk is a valid IV for Yj.

Definition 1 can be viewed as an extension of the original definition of valid IV in the univariable MR framework [1], in which a valid IV cannot affect the outcome through a path that is not mediated through the exposure. Note that, IV validity from Definition 1 is not conditional on the modeled network. If we add/exclude some variables to/from the network containing Yj and Yi, the pleiotropic effects from Zk to Yi through directed paths Q’s including these variables can be separated from/merged into the effects from Zk to Yi through other existing directed paths, and the total pleiotropic effect does not change. As shown by Theorem 1, we can define the MR effect of one trait on another with valid IVs.

Theorem 1. Suppose Zk is a valid IV for Yj to Yi, we have the ratio of the two total (marginal) effects as . Therefore, is independent of Zk and completely determined by W and thus by G. We call as the MR effect from Yj to Yi.

The proof of Theorem 1 is in the Section 1.1 in S1 Appendix. To get some intuition, consider a simple case that Zk is a valid IV for Yj, i.e., and for all . Recall that, from Eq (5), the total effects of SNPs on traits B are jointly determined by the direct-(causal-)effect network G and direct effects of SNPs on traits as with , we have and , therefore the ratio is independent of Zk. Denote as the T × T matrix representing the MR-(causal-)effect network with its element in the ith row and jth column as , from Theorem 1 we have the following key connection between and W as

(7)

where and is the T × T diagonal matrix with diagonal elements as w.

MR2G: estimating causal networks from GWAS summary statistics

Estimating the MR-effect network via MRcML.

Model (5) describes marginal associations between SNPs and traits, therefore we can obtain estimates of elements in B from GWAS summary data. Denote estimate of as for 1 ≤ i ≤ T and 1 ≤ k ≤ S, and standard error of as . For any two traits Yj and Yi, Theorem 1 allows us to combine multiple valid IVs to estimate the MR effect . However, a critical issue in MR is that SNPs being used as IVs may be invalid due to the widespread horizontal pleiotropy, leading to biased results. A number of MR methods have been developed to deal with invalid IVs [25], and one robust and powerful method is the constrained maximum likelihood-based MR method (MRcML) [6]. Under some mild conditions, including the plurality assumption for model identifiability and the ratio of sample sizes of the two traits being bounded, MRcML can consistently select invalid IVs to achieve unbiased inference of the causal effect. More details about MRcML can be found in [6]. For an ordered pair of traits (Yj, Yi) with 1 ≤ i ≠ j ≤ T, we apply MRcML to get an estimate of MR effect with a set of independent SNPs as candidate IVs. The strategy of IV selection and screening is described in the following Section. With all ’s, we can get the estimated MR-effect network denoted as .

IV screening procedure.

In the conventional MR analysis, the directions of causal effects are assumed from given exposures to outcomes. Therefore, SNPs being significantly associated with an exposure (typically with p-values less than 5 × 10−8) will be selected as candidate IVs for the exposure. However, when the directions of causal effects between two traits are unknown, as discussed in [12], selecting IVs based on p-values can lead to biased results. For example, suppose trait Y1 has a non-zero causal effect on trait Y2 but not the other way around, and Z is a valid IV for Y1 with effect . Therefore, Z has non-zero effect on Y2 that is mediated through Y1. When the sample size of GWAS for Y2 is large enough, Z is significantly associated with Y2 and will be used as an IV for Y2 as well, falsely estimating the causal effect from Y2 to Y1 as .

A simple IV screening rule based on Steiger’s method was proposed in [12], which can effectively alleviate this problem when estimating bi-directional causal relationships between two traits. The same screening rule was also adopted in the Graph-MRcML method [23] when estimating total causal effects between traits. Specifically, for bi-directional MR between Yi and Yj, if a SNP is significantly associated with both traits, it is only used as an IV for the trait with which it has a stronger correlation. This approach has demonstrated good performance in examining bi-directional causal relationships by alleviating violation of MR assumptions required by many robust MR methods (e.g., the InSIDE assumption, plurality assumption), particularly in scenarios involving bi-directional causality [12]. We refer to this screening approach as ScreenPair.

While ScreenPair is effective in bi-directional MR analysis, it does not fully account for the complex causal network structure. For instance, consider the causal graph among 3 traits shown in Fig 2A, SNPs in the set Z3 are marginally associated with all three traits. Depending on the effect sizes of and (and consequently, the correlations of SNPs in Z3 with Y1 and Y2), the ScreenPair approach may select these SNPs as IVs for Y1 when performing MR of , provided their correlations with Y1 are stronger, and vice versa. However, all SNPs in Z3 are invalid IVs for investigating the bi-directional relationship between Y1 and Y2 due to (correlated) horizontal pleiotropy. The following Assumption 2 is a generalization of the assumption in [12] to the causal network model.

Assumption 2 For an IV Zk and a trait Yj, if for all , then .

Assumption 2 states that, if Zk has the strongest correlation with Yj, then it has a non-zero direct effect on Yj. This assumption is reasonable in practice given that causal effects between complex traits are typically small. Note that, it does not exclude direct effects of Zk on other traits. With Assumption 2, we can construct a set of candidate IVs for Yj as . If a SNP is significantly associated with multiple traits, it will only be used as an IV for the trait with the strongest correlation. We refer to this approach as ScreenMax. Under ScreenMax, SNPs in will only be used as IVs for Y1, Y2 and Y3 respectively, thereby avoiding the issue caused by ScreenPair in Fig 2A.

However, ScreenMax may be overly restrictive as it may exclude some valid IVs. For example, in Fig 2B, SNPs in Z3 are indeed valid IVs for Y1 when performing MR of . Accordingly, we can augment the IV set selected by ScreenMax by adding back some potentially valid IVs selected by ScreenPair. The procedure is summarized as follows:

  1. For each pair of traits Yj and Yi, we first identify a set of independent SNPs, with each SNP being significantly associated with either/both of the two traits. Then we use ScreenPair to construct IV sets for performing MR , and for performing MR .
  2. Next we use ScreenMax to refine these sets. Specifically, for , we retain only the IVs that exhibit the strongest correlation with Yj among all other traits with which they are significantly associated, denoting this set as . Similarly, for , we retain only the IVs that have the strongest correlation with Yi, denoting this set as . We then construct an estimated direct-effect graph using IVs selected by ScreenMax.
  3. Based on the estimated (with Bonferroni adjustment for multiple testing), we augment IV sets as follows: for each SNP , consider each trait Yl that is significantly associated with Zk and has a stronger correlation with Zk than Yj. If removing Yj from , results in Yl having no path to Yi, then Zk is added back to the IV set . A similar procedure is also applied to .

We refer to the above procedure as ScreenAug. This procedure balances the strengths of both methods: it reduces bias and false discoveries by excluding potentially invalid IVs and enhances power by reintroducing potentially valid IVs excluded due to overly stringent criteria.

Algorithm to estimate the direct-(causal-)effect network from a MR-(causal-)effect network.

Based on the key connection (7), we have , which suggests that is a scaled version of with column scaling matrix . Therefore, we have

(8)

To ensure zero diagonal elements in G as required by Assumption 1 (i.e., no self-loop), which is equivalent to having the diagonal elements of being 1, we propose the following algorithm to estimate wjj and G given the estimated .

Algorithm 1. Estimation of G given

Input:

return

The following Theorem 2 provides theoretical guarantees for inference on the causal network G, and the conditions it requires and the proof can be found in Section 1.2 in S1 Appendix.

Theorem 2. Under suitable regularity conditions, is a consistent estimator of G and follows an asymptotic normal distribution.

However, since the asymptotic covariance of is analytically complex and does not account for uncertainty arising from model selection in MRcML under finite samples, we adopt a data perturbation strategy (i.e., a parametric bootstrap [55]) for inference, similar to that in our previous work [23], as detailed below.

Specifically, we jointly perturb the entire matrix of GWAS effect estimates across all traits to account for both inter-GWAS correlations and LD structure among SNPs. Letting denote the matrix of GWAS Z-scores, with denoting the element-wise matrix division, where S is the matrix of standard errors. We generate perturbed datasets from a matrix normal distribution , with R and P representing the SNP LD matrix and trait correlation matrix (due to sample overlap), respectively. The perturbed GWAS effect estimates are then reconstructed as , with ⊙ denoting the element-wise matrix multiplication. In our application, LD matrices are computed within 1703 approximately independent LD blocks [56] using 1000 Genomes Phase 3 [57] European population via TwoSampleMR [58]. For each perturbed dataset, we apply bidirectional MRcML to get estimated and then apply Algorithm 1 to get estimated . Repeating this procedure B times, we compute the element-wise mean and standard deviation of to obtain point estimates and standard errors, and calculate p-values using a standard normal distribution.

Relationship with other methods

Relationship with network deconvolution.

[24] proposed a network deconvolution formula

(9)(10)

where the second equality holds under the assumption that the spectral radius of G is less than 1. The definition of a total-(causal-)effect graph Gtot in Eq (9) can be motivated by repeatedly substituting the right-hand side of Eq (6) for Y(∞) in the right-hand side [59]:

where the matrix Gk represents the indirect effects of length k for traits Y on each other. For example, if we denote be the element in Gk, then represents the indirect effect of length 2 from Yj to Yi through one node Yl. Similarly represents the indirect effect of length 3 from Yj to Yi through two nodes Yl and Yv, and so on. Then the total effect can be defined by the sum , and k can get arbitrarily large due to possible cycles among Y.

The above infinite summation Eq (9) converges to if and only if the spectral radius of G is less than 1, otherwise may not be well defined. In particular, under the spectral radius assumption, we have , which gives us an insightful relationship between the total-effect network and the MR-effect network

(11)

where and tjj is the jth diagonal element of . In other words, MR estimates a partial total effect without accounting for cyclic effects if cycles are present. Cyclic effects manifest as self-loops in the total-effect graph, i.e., tjj. One immediate result is that, if there are no cycles in the direct-effect graph, there will be no self-loops in the total-effect graph (i.e., tjj = 0), making .

Relationship with multivariable MR.

MVMR [7] estimates the direct effect of each exposure on an outcome, conditional on the other exposures. A natural question is whether the direct effects in G coincide with the corresponding MVMR estimands. In S1 Appendix Section 3.2 Proposition 1, we prove that, when all other traits are included as MVMR exposures for outcome Yi, the MVMR estimand equals , the i-th row of G.

Simulation setups

Simulation with three traits.

In this section, we considered two simple scenarios in Fig 2A and Fig 2B to illustrate the idea of IV screening. We generated individual-level data based on the marginal model (5),

(12)

where Y was a 3 × N matrix representing three traits, Z was a 30 × N matrix of 30 SNPs, each independently generated from a binomial distribution with a minor allele frequency of 0.3. was a 3 × 30 matrix, in which the first 8 SNPs had non-zero effects only on Y1, the next 7 SNPs had non-zero effects only on Y2, and the remaining 15 SNPs had non-zero effects only on Y3, and the non-zero effect sizes () were drawn from a uniform distribution over . In other words, . The random errors were drawn from , where had a first-order autoregressive structure with correlation 0.4 to reflect the unmeasured confounder effect. For Fig 2A, , and for Fig 2B, . Under this setup, the average proportion of phenotypic variance explained by the genetic component across replicates was approximately 7.1%, 6.0%, and 12.8% for the three traits in the acyclic setting (Fig 2A), and 6.3%, 6.0%, and 8.9% in the cyclic setting (Fig 2B).

We considered sample sizes of N = 50 000 and N = 200 000. After generating individual-level data, we regressed rows of Y on Z to obtain the GWAS estimates for the three traits. We then applied MR2G to infer the direct-effect graph using IVs selected by the three different IV selection approaches, ScreenPair, ScreenMax and ScreenAug, with the p-value threshold of . Additionally, we also compared the results of Graph-MRcML, which uses IVs selected via ScreenPair and the network deconvolution algorithm proposed in [23].

For each scenario, we conducted 200 simulation replicates. We used the p-value threshold of 0.05 to evaluate type-I error and a Bonferroni-corrected threshold of 0.05/6 to evaluate power.

Simulation based on real GWAS summary data.

To evaluate the performance of the proposed method, we designed simulation studies closely following the structure of our real data application. We focused on a subset of six traits, including three risk factors (BMI, LDL and SBP) and three cardiovascular diseases (CAD, AF and stroke) in the simulation.

We first extracted the 1056 SNPs across 392 LD blocks that were used in the real data analysis for these 6 traits, ensuring that the simulated data retains a realistic LD pattern. We directly generated GWAS summary statistics for the 1056 SNPs as follows.

  1. We first selected SNPs that were genome-wide significant in the real GWAS datasets to have non-zero direct effect on the corresponding trait Yj, and the effect sizes were drawn from a normal distribution ;
  2. The GWAS effect size matrix B was then generated using the relationship . G was based on the real data application on the 6 traits (Fig 2C).
  3. Finally, the observed GWAS estimates were generated as , where E follows a matrix normal distribution . Here, S represents the standard errors of the GWAS estimates, directly extracted from the real data. R is the (block-diagonal) LD-matrix of SNPs, derived from the 1000 Genomes Phase 3 [57] European population. P is the correlation matrix among the 6 GWAS data, which is an identity matrix in this case.

With the simulated GWAS summary statistics, we applied the same preprocessing step as in the real data analysis to obtain a set of approximately LD-independent SNPs for each pair of traits. We then applied MR2G (with B = 100 data perturbations) using IVs selected by the three different IV selection approaches, ScreenPair, ScreenMax, and ScreenAug, to estimate direct-effect network among traits. We conducted 200 simulation replicates. We used the p-value threshold of 0.05 to evaluate type-I error and a Bonferroni-corrected threshold of 0.05/30 to evaluate power.

Real data application

We applied MR2G to infer the direct-effect network among 9 cardiometabolic risk factors and 6 diseases. The 9 risk factors were triglycerides (TG), low-density lipoprotein (LDL) cholesterol, high-density lipoprotein (HDL) cholesterol, height, body-mass index (BMI), systolic blood pressure (SBP), fasting glucose (FG), smoke (cigarette per day), and alcohol (alcoholic drinks per week). The 6 diseases were coronary artery disease (CAD), stroke, type 2 diabetes (T2D), asthma (more as a negative control), atrial fibrillation (AF), and Alzheimer’s disease (AD). The sample sizes for the 15 GWAS datasets ranged from 10 083–1 030 836, with a median of 256 879. All GWAS summary statistics are publicly available.

We pre-processed the GWAS summary statistics following the procedure described in [12] and [23]. Specifically, for each pair of traits, we used the TwoSampleMR package [60] to obtain a set of approximately LD-independent SNPs that were significantly associated with at least one of the traits (using a p-value cutoff of ). We then aggregated these SNPs across all trait pairs and used ieugwasr::ld_matrix function from the ieugwasr package [61], with the 1000 Genomes Phase 3 [57] European population as the reference panel, to extract the LD matrix R among all selected SNPs. The correlation matrix P among the 15 GWAS data was estimated by bivariate LD score regression [62], and for the pairs with small correlations , we set them to be 0 as before [23].

Supporting information

S1 Appendix. Supplementary file with proofs, details of methods, additional real data analysis results and additional simulation results.

https://doi.org/10.1371/journal.pgen.1012144.s001

(PDF)

Acknowledgments

The computing resources were provided by the Minnesota Supercomputing Institute at the University of Minnesota.

References

  1. 1. Sanderson E, Glymour MM, Holmes MV, Kang H, Morrison J, Munafò MR, et al. Mendelian randomization. Nat Rev Methods Primers. 2022;2:6. pmid:37325194
  2. 2. Bowden J, Davey Smith G, Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int J Epidemiol. 2015;44(2):512–25. pmid:26050253
  3. 3. Qi G, Chatterjee N. Mendelian randomization analysis using mixture models for robust and efficient estimation of causal effects. Nat Commun. 2019;10(1):1941. pmid:31028273
  4. 4. Morrison J, Knoblauch N, Marcus JH, Stephens M, He X. Mendelian randomization accounting for correlated and uncorrelated pleiotropic effects using genome-wide summary statistics. Nat Genet. 2020;52(7):740–7. pmid:32451458
  5. 5. Burgess S, Foley CN, Allara E, Staley JR, Howson JMM. A robust and efficient method for Mendelian randomization with hundreds of genetic variants. Nat Commun. 2020;11(1):376. pmid:31953392
  6. 6. Xue H, Shen X, Pan W. Constrained maximum likelihood-based Mendelian randomization robust to both correlated and uncorrelated pleiotropic effects. Am J Hum Genet. 2021;108(7):1251–69. pmid:34214446
  7. 7. Burgess S, Thompson SG. Multivariable Mendelian randomization: the use of pleiotropic genetic variants to estimate causal effects. Am J Epidemiol. 2015;181(4):251–60. pmid:25632051
  8. 8. Sanderson E, Davey Smith G, Windmeijer F, Bowden J. An examination of multivariable Mendelian randomization in the single-sample and two-sample summary data settings. Int J Epidemiol. 2019;48(3):713–27. pmid:30535378
  9. 9. Lu J, Li H. Hypothesis testing in high-dimensional instrumental variables regression with an application to genomics data. Stat Sinica. 2023.
  10. 10. Lin Z, Xue H, Pan W. Robust multivariable Mendelian randomization based on constrained maximum likelihood. Am J Hum Genet. 2023;110(4):592–605. pmid:36948188
  11. 11. Lorincz-Comi N, Yang Y, Li G, Zhu X. MRBEE: a bias-corrected multivariable Mendelian randomization method. Hum Genet Genom Adv. 2024;5(3):100290. pmid:38582968
  12. 12. Xue H, Pan W. Robust inference of bi-directional causal relationships in presence of correlated pleiotropy with GWAS summary data. PLoS Genet. 2022;18(5):e1010205. pmid:35576237
  13. 13. Xie F, Yao Z, Xie L, Zeng Y, Geng Z. Identification and Estimation of the Bi-Directional MR with Some Invalid Instruments. arXiv preprint arXiv:240707933. 2024.
  14. 14. Li W, Duan R, Li S. Discovery and inference of possibly bi-directional causal relationships with invalid instrumental variables. arXiv preprint arXiv:240711646. 2024.
  15. 15. Li S, Ye T. A focusing framework for testing bi-directional causal effects in Mendelian randomization. J R Stat Soc Series B Stat Methodol. 2024;87(2):529–48. pmid:40225200
  16. 16. Richmond RC, Hemani G, Tilling K, Davey Smith G, Relton CL. Challenges and novel approaches for investigating molecular mediation. Hum Mol Genet. 2016;25(R2):R149–56. pmid:27439390
  17. 17. Yazdani A, Yazdani A, Mendez-Giraldez R, Samiei A, Kosorok MR, Schaid DJ. From classical mendelian randomization to causal networks for systematic integration of multi-omics. Front Genet. 2022;13:990486. pmid:36186433
  18. 18. Relton CL, Davey Smith G. Two-step epigenetic Mendelian randomization: a strategy for establishing the causal role of epigenetic processes in pathways to disease. Int J Epidemiol. 2012;41(1):161–76. pmid:22422451
  19. 19. Burgess S, Daniel RM, Butterworth AS, Thompson SG, EPIC-InterAct Consortium. Network Mendelian randomization: using genetic variants as instrumental variables to investigate mediation in causal pathways. Int J Epidemiol. 2015;44(2):484–95. pmid:25150977
  20. 20. Yin L, Feng Y, Shi Y, Lau A, Qiu J, Sham P-C, et al. Estimation of causal effects of genes on complex traits using a Bayesian-network-based framework applied to GWAS data. Nat Mach Intell. 2024;6(10):1231–44.
  21. 21. Zilinskas R, Li C, Shen X, Pan W, Yang T. Inferring a directed acyclic graph of phenotypes from GWAS summary statistics. Biometrics. 2024;80(1):ujad039. pmid:38470257
  22. 22. Zuber V, Cronjé T, Cai N, Gill D, Bottolo L. Bayesian causal graphical model for joint Mendelian randomization analysis of multiple exposures and outcomes. Am J Hum Genet. 2025;112(5):1173–98. pmid:40179887
  23. 23. Lin Z, Xue H, Pan W. Combining Mendelian randomization and network deconvolution for inference of causal networks with GWAS summary data. PLoS Genet. 2023;19(5):e1010762. pmid:37200398
  24. 24. Feizi S, Marbach D, Médard M, Kellis M. Network deconvolution as a general method to distinguish direct dependencies in networks. Nat Biotechnol. 2013;31(8):726–33. pmid:23851448
  25. 25. Hemani G, Tilling K, Davey Smith G. Orienting the causal relationship between imprecisely measured traits using GWAS summary data. PLoS Genet. 2017;13(11):e1007081. pmid:29149188
  26. 26. Cholesterol Treatment Trialists’ (CTT) Collaboration, Baigent C, Blackwell L, Emberson J, Holland LE, Reith C, et al. Efficacy and safety of more intensive lowering of LDL cholesterol: a meta-analysis of data from 170,000 participants in 26 randomised trials. Lancet. 2010;376(9753):1670–81. pmid:21067804
  27. 27. Lewington S, Clarke R, Qizilbash N, Peto R, Collins R, Prospective Studies Collaboration. Age-specific relevance of usual blood pressure to vascular mortality: a meta-analysis of individual data for one million adults in 61 prospective studies. Lancet. 2002;360(9349):1903–13. pmid:12493255
  28. 28. Yamada MH, Fujihara K, Kodama S, Sato T, Osawa T, Yaguchi Y, et al. Associations of systolic blood pressure and diastolic blood pressure with the incidence of coronary artery disease or cerebrovascular disease according to glucose status. Diabetes Care. 2021;44(9):2124–31. pmid:34035075
  29. 29. Emerging Risk Factors Collaboration, Wormser D, Kaptoge S, Di Angelantonio E, Wood AM, Pennells L, et al. Separate and combined associations of body-mass index and abdominal adiposity with cardiovascular disease: collaborative analysis of 58 prospective studies. Lancet. 2011;377(9771):1085–95. pmid:21397319
  30. 30. Sarwar N, Danesh J, Eiriksdottir G, Sigurdsson G, Wareham N, Bingham S, et al. Triglycerides and the risk of coronary heart disease: 10,158 incident cases among 262,525 participants in 29 Western prospective studies. Circulation. 2007;115(4):450–8. pmid:17190864
  31. 31. Powell-Wiley TM, Poirier P, Burke LE, Després J-P, Gordon-Larsen P, Lavie CJ, et al. Obesity and cardiovascular disease: a scientific statement from the American Heart Association. Circulation. 2021;143(21):e984–1010. pmid:33882682
  32. 32. Sha R, Baines O, Hayes A, Tompkins K, Kalla M, Holmes AP, et al. Impact of obesity on atrial fibrillation pathogenesis and treatment options. J Am Heart Assoc. 2024;13(1):e032277. pmid:38156451
  33. 33. Sohail H, Hassan SM, Yaqoob U, Hassan Z. The height as an independent risk factor of atrial fibrillation: a review. Indian Heart J. 2021;73(1):22–5. pmid:33714405
  34. 34. Hartley A, Shalhoub J, Ng FS, Krahn AD, Laksman Z, Andrade JG, et al. Size matters in atrial fibrillation: the underestimated importance of reduction of contiguous electrical mass underlying the effectiveness of catheter ablation. Europace. 2021;23(11):1698–707. pmid:33948648
  35. 35. Liang F, Wang Y. Coronary heart disease and atrial fibrillation: a vicious cycle. Am J Physiol Heart Circ Physiol. 2021;320(1):H1–12. pmid:33185113
  36. 36. Corban MT, Toya T, Ahmad A, Lerman LO, Lee HC, Lerman A. Atrial fibrillation and endothelial dysfunction: a potential link? In: Mayo Clinic Proceedings. vol. 96. Elsevier; 2021. pp. 1609-21.
  37. 37. Zaromytidou M, Siasos G, Coskun AU, Lucier M, Antoniadis AP, Papafaklis MI, et al. Intravascular hemodynamics and coronary artery disease: new insights and clinical implications. Hellenic J Cardiol. 2016;57(6):389–400. pmid:27894949
  38. 38. Wendelboe AM, Raskob GE. Global burden of thrombosis: epidemiologic aspects. Circ Res. 2016;118(9):1340–7. pmid:27126645
  39. 39. Elsheikh S, Hill A, Irving G, Lip GYH, Abdul-Rahim AH. Atrial fibrillation and stroke: state-of-the-art and future directions. Curr Probl Cardiol. 2024;49(1 Pt C):102181. pmid:37913929
  40. 40. Murtagh B, Smalling RW. Cardioembolic stroke. Curr Atheroscler Rep. 2006;8(4):310–6. pmid:16822397
  41. 41. Mayo Clinic. Type 2 Diabetes. [cited 2025 Jun 10]. https://www.mayoclinic.org/diseases-conditions/type-2-diabetes/symptoms-causes/syc-20351193
  42. 42. Perry BD, Caldow MK, Brennan-Speranza TC, Sbaraglia M, Jerums G, Garnham A, et al. Muscle atrophy in patients with Type 2 Diabetes Mellitus: roles of inflammatory pathways, physical activity and exercise. Exerc Immunol Rev. 2016;22:94–109. pmid:26859514
  43. 43. Park SW, Goodpaster BH, Lee JS, Kuller LH, Boudreau R, de Rekeneire N, et al. Excessive loss of skeletal muscle mass in older adults with type 2 diabetes. Diabetes Care. 2009;32(11):1993–7. pmid:19549734
  44. 44. Jørgensen TSH, Okholm GT, Christensen K, Sørensen TI, Osler M. Body height in young adult men and risk of dementia later in adult life. Elife. 2020;9:e51168. pmid:32041683
  45. 45. Taylor JC, Brenowitz WD, Yaffe K, Andrews SJ. Height and intracranial volume as protective factors against Alzheimer’s disease: a Mendelian randomization analysis. Alzheimer’s Dementia. 2023;19(S23).
  46. 46. Huang J, Zhang L, Bodimeade C, Nassan M, Gill D, Cronjé HT, et al. Unravelling the relationship between height, lean mass, Alzheimer’s disease and cognition through Mendelian randomization. Genes (Basel). 2025;16(2):113. pmid:40004442
  47. 47. Burgess S, Butterworth A, Thompson SG. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet Epidemiol. 2013;37(7):658–65. pmid:24114802
  48. 48. Verbanck M, Chen C-Y, Neale B, Do R. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat Genet. 2018;50(5):693–8. pmid:29686387
  49. 49. Hu X, Zhao J, Lin Z, Wang Y, Peng H, Zhao H, et al. Mendelian randomization for causal inference accounting for pleiotropy and sample structure using genome-wide summary statistics. Proc Natl Acad Sci U S A. 2022;119(28):e2106858119. pmid:35787050
  50. 50. Zhao Q, Wang J, Hemani G, Bowden J, Small DS. Statistical inference in two-sample summary-data Mendelian randomization using robust adjusted profile score. Ann Statist. 2020;48(3).
  51. 51. Lin Z, Pan W. A robust cis-Mendelian randomization method with application to drug target discovery. Nat Commun. 2024;15(1):6072. pmid:39025905
  52. 52. Brown BC, Tokolyi A, Morris JA, Lappalainen T, Knowles DA. Large-scale causal discovery using interventional data sheds light on gene network structure in k562 cells. Nat Commun. 2025;16(1):9628. pmid:41173850
  53. 53. Sun Z, Kang H, Keles S. Causal gene regulatory network inference from Perturb-seq via adaptive instrumental variable modeling. bioRxiv. 2026;2026:2026–02.
  54. 54. Hyttinen A, Eberhardt F, Hoyer PO. Learning linear cyclic causal models with latent variables. J Mach Learn Res. 2012;13(1):3387–439.
  55. 55. Lin Z, Deng Y, Pan W. Combining the strengths of inverse-variance weighting and Egger regression in Mendelian randomization using a mixture of regressions model. PLoS Genet. 2021;17(11):e1009922. pmid:34793444
  56. 56. Berisa T, Pickrell JK. Approximately independent linkage disequilibrium blocks in human populations. Bioinformatics. 2016;32(2):283–5. pmid:26395773
  57. 57. 1000 Genomes Project Consortium, Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, et al. A global reference for human genetic variation. Nature. 2015;526(7571):68–74. pmid:26432245
  58. 58. Hemani G, Zheng J, Elsworth B, Wade KH, Haberland V, Baird D, et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife. 2018;7:e34408. pmid:29846171
  59. 59. Bentler PM, Freeman EH. Tests for stability in linear structural equation systems. Psychometrika. 1983;48(1):143–5.
  60. 60. Hemani G, Zheng J, Elsworth B, Wade KH, Haberland V, Baird D, et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife. 2018;7:e34408. pmid:29846171
  61. 61. Hemani G, Elsworth B, Palmer T, Rasteiro R. Ieugwasr: Interface to the “OpenGWAS” Database API. R package version. 2024.
  62. 62. Bulik-Sullivan B, Finucane HK, Anttila V, Gusev A, Day FR, Loh P-R, et al. An atlas of genetic correlations across human diseases and traits. Nat Genet. 2015;47(11):1236–41. pmid:26414676