MVMRmode: Introducing an R package for plurality valid estimators for multivariable Mendelian randomisation

Benjamin Woolf; Dipender Gill; Andrew J. Grant; Stephen Burgess

doi:10.1371/journal.pone.0291183

Abstract

Background

Mendelian randomisation (MR) is the use of genetic variants as instrumental variables. Mode-based estimators (MBE) are one of the most popular types of estimators used in univariable-MR studies and is often used as a sensitivity analysis for pleiotropy. However, because there are no plurality valid regression estimators, modal estimators for multivariable-MR have been under-explored.

Methods

We use the residual framework for multivariable-MR to introduce two multivariable modal estimators: multivariable-MBE, which uses IVW to create residuals fed into a traditional plurality valid estimator, and an estimator which instead has the residuals fed into the contamination mixture method (CM), multivariable-CM. We then use Monte-Carlo simulations to explore the performance of these estimators when compared to existing ones and re-analyse the data used by Grant and Burgess (2021) looking at the causal effect of intelligence, education, and household income on Alzheimer’s disease as an applied example.

Results

In our simulation, we found that multivariable-MBE was generally too variable to be much use. Multivariable-CM produced more precise estimates on the other hand. Multivariable-CM performed better than MR-Egger in almost all settings, and Weighted Median under balanced pleiotropy. However, it underperformed Weighted Median when there was a moderate amount of directional pleiotropy. Our re-analysis supported the conclusion of Grant and Burgess (2021), that intelligence had a protective effect on Alzheimer’s disease, while education, and household income do not have a causal effect.

Conclusions

Here we introduced two, non-regression-based, plurality valid estimators for multivariable MR. Of these, “multivariable-CM” which uses IVW to create residuals fed into a contamination-mixture model, performed the best. This estimator uses a plurality of variants valid assumption, and appears to provide precise and unbiased estimates in the presence of balanced pleiotropy and small amounts of directional pleiotropy.

Citation: Woolf B, Gill D, Grant AJ, Burgess S (2024) MVMRmode: Introducing an R package for plurality valid estimators for multivariable Mendelian randomisation. PLoS ONE 19(5): e0291183. https://doi.org/10.1371/journal.pone.0291183

Editor: Suyan Tian, The First Hospital of Jilin University, CHINA

Received: March 30, 2023; Accepted: August 22, 2023; Published: May 7, 2024

Copyright: © 2024 Woolf et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All data produced in the present study are available from DOI 10.17605/OSF.IO/8DZKU.

Funding: Benjamin Woolf is funded by an Economic and Social Research Council (ESRC) South West Doctoral Training Partnership (SWDTP) 1+3 PhD Studentship Award (ES/P000630/1). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Background

Mendelian randomisation (MR) is an increasingly popular method for causal inference in epidemiology which uses the random assignment of genetic variants at birth to justify the assumptions of an Instrumental variables analysis [1, 2]. In a traditional MR study, genetic variants (typically single-nucleotide polymorphisms, SNPs) which robustly associate (typically at genome-wide significance) with an exposure of interest are selected as instruments [3]. Because of the easy accessibility of Genome-Wide Association Study (GWAS) summary statistics for many epidemiological traits, MR is often implemented using summary data, in a so-called ‘two-sample MR’ analysis [4]. In such a setting, the effect of the exposure on the outcome is estimated using a Wald ratio as the variant-outcome association divided by the genotype-exposure association. When there are multiple variants, their effects are generally combined using an inverse variance weighted (IVW) meta-analysis.

On top of requiring a robust genotype-exposure association, instrumental variables analysis requires that there are no variant-outcome confounders, and that the variant can only cause the outcome via the exposure. The first of these assumptions is justified by Mendel’s laws of independent and random segregation. However, the second assumption is less plausible due to pleiotropy (the association of most variants with multiple traits). Pleiotropy can occur for two reasons: Firstly, if the exposure causes many other traits, then the genetic variants which associate with it should also associate with these other traits. This type of pleiotropy (often called vertical pleiotropy) is required for MR to work. However, the second type of pleiotropy (horizontal pleiotropy) occurs when the genetic variants independently cause two phenotypes. A second advantage of two-sample MR is that it allows for the implementation of ‘pleiotropy robust’ estimators [5]. These methods generally allow for some variants to be pleiotropic by modifying the assumptions of the instrumental variables framework. One of the first methods proposed for doing this is MR-Egger. IVW can be conceptualised as a weighted intercept-free regression of the variant-outcome associations on the variant-exposure associations. MR-Egger fits the same model as IVW but with an intercept. This model is robust to pleiotropy if the instrument strength is independent of the strength of the direct, pleiotropic, effect (called the InSIDE assumption) [6].

A recent systematic review of two-sample MR studies found that the most frequently implemented pleiotropy robust estimators were MR-Egger, weighted median, and weighted mode [7]. Weighted Median will provide valid estimates if at least half the variants are valid instruments, and so is called a ‘majority valid’ estimator. Weighted mode makes the ZEro Modal Pleiotropy Assumption (ZEMPA), i.e. that there is zero pleiotropy in the modal estimand of the causal effect [8]. ZEMPA is plausible because we should expect the causal effects for variants which are valid instruments to be similar, but each invalid variant to have its own unique pleiotropic bias [9]. If the unique paths are independent of each other, then so too should the biases they exert on invalid variants. Thus, valid variants should have clustered effect estimates, while invalid variants should create heterogeneity. Hence, in settings where there are some valid instruments, we should expect the most common effect estimated to be the valid causal parameter. Here in, we call this type of estimator, which will produce valid estimates when a plurality of SNPs are valid, ‘plurality valid’ estimators.

Estimating modes directly from observed data can be difficult because no two estimates are ever exactly equal. Therefore, the most common observation at a given level of precision may be very different from the true mode. Traditional MBEs avoid this dilemma by smoothing the observed distribution using a parametric kernel-density-smoothed function. This converts the observed estimates into a probability density distribution, and then select the mode of this distribution. An alternative plurality valid estimator comes from the contamination mixture method [10].

The contamination mixture method uses a maximum likelihood approach, assuming the variant specific Wald ratios are normally distributed [10]. It produces a consistent estimator of the causal effect under the plurality valid (ZEMPA) assumption. The advantages of the contamination mixture method are that it does not require the parametric assumptions of the kernel-density function, is more computationally efficient, and generally produces more precise estimates with potentially asymmetric confidence intervals [10].

Multivariable MR (MVMR) is an extension of MR to allow for the simultaneous modelling of the effect of multiple exposures on an outcome [11]. The effects of each exposure in an MVMR model are the direct effects of the exposure on the outcome conditional on the other exposures. This has resulted in MVMR being applied as a method for mediation analyses [12], but it is also used to adjust for known biases in an MR model [13–15]. MVMR modifies the three instrumental Variables assumptions so that the variant is a valid instrument if: 1) the variant is robustly associated with at least one exposure, 2) there are no variant-outcome confounders, 3) the variant can only cause the outcome via one or more of the exposures.

MVMR was originally introduced using a residual-based framework, in which the effect of a second exposure on the outcome was removed from the variant-outcome association, and the effect of the second exposure on the exposure was removed from the variant-exposure association [14]. These modified associations were then used as the input to a traditional MR estimator. However, given the analogy between IVW and weighted regression, two-sample MVMR is typically implemented as a type of multiple regression, in which the variant-outcome associations for the variants which associate with either exposure of interest are regressed on the variant-exposure associations in an intercept-free linear regression, inversely weighted by the variance in the variant-outcome association. MR-Egger can also be implemented by allowing for a non-zero regression intercept, and weighted median can be implemented using weighted quantile regression [16].

However, we are not aware of an existing estimators for doing mode-based regression, and hence MVMR which make a plurality valid-type assumption like ZEMPA have been underexplored. The multivariable constrained maximum likelihood (MVMR-cML) method provides consistent estimates under a plurality-valid assumption by maximizing a constrained likelihood function subject a maximum number of invalid instruments [17]. The MVMR-Horse method provides estimates under the same model as MVMR-cML in a Bayesian framework, using horseshoe priors for identification [18]. Finally, the Genome-wide mR Analysis under Pervasive PLEiotropy (GRAPPLE) method is a multivariable method that can provide robust estimates in the presence of invalid instruments using profile likelihood [19]. Here, we, introduce and validate a further framework for implementing plurality valid estimators in two-sample MVMR.

Methods

Theoretical background

Notation and assumptions.

We assume a set of genetic variants that are independently distributed are being proposed as instruments in an MR analysis. We shall denote with subscript i the ith element of any vector, which relates to the ith genetic variant. Let β_y,i be the genetic variant-outcome association for the ith genetic variant and β_x,i be the genetic variant-exposure association for the ith variant. We represent the causal effect of the exposure on the outcome using the scalar θ. We also assume that the exposure-outcome relationship is linear and unaffected by effect modification. We let α_i represent pleiotropic effects of the ith variant on the outcome. Thus, when α_i = 0, the ith variant is a valid instrument.

Suppose the ith variant-exposure and variant-outcome associations are related according to the model proposed by Bowden et al. [20]: 1)

Now suppose we have estimates for two exposures, denoted by x₁ and x₂. and are the ith variant’s associations with the first and second exposure, respectively. Likewise, θ₁ and θ₂ are the causal effects of the first and second exposure, respectively, on the outcome. We can now extend (1) as follows: 2) Where represents pleiotropic effects of the ith variant on the outcome which do not pass via x₁ or x₂.

Statistical framework.

In practice, we do not observe β_y, , or . However, we may obtain estimates, for example from GWAS. We denote the vectors of association estimates by , and . Thus, in traditional multivariable-IVW we can estimate θ₁ and θ₂ using the following linear model: 3)

Given the data structure in Eqs (2), (3) will provide a consistent estimator when for all i (i.e., all variants are valid instruments), or when and is independent of and for all i (i.e., pleiotropy is balanced and the InSIDE assumption is met). A plurality valid estimator, on the other hand, should be consistent provided that a plurality of the are zero, i.e. under the ZEMPA assumption.

Let be the residuals from regressing on (without an intercept), and let be the residuals from regressing on (without an intercept). We can now estimate θ₁ using the linear model: 4)

Let be the residuals from regressing a vector of the pleiotropic effects on (without an intercept). Because we have now reformulated the equation for the variant-outcome association so that it is in terms of a univariable regression model, and can be used as the inputs to a traditional univariable mode-based estimator. When more than one exposure is of interest, then this process can be iterated for each exposure. It follows that a plurality valid estimator for θ₁ using the residuals in this way will produce a valid estimate provided that a plurality of the values are zero. This seems likely to be the case if a plurality of the values are zero and the non-zero elements are distributed around zero (i.e., balanced pleiotropy).

In settings with only two exposures, the residuals could be obtained through univariable MR of the outcome on the second exposure, and of the first exposure on the second exposure. Where there are more than two exposures, an existing multivariable MR method could be used instead to create residuals. This general framework could be implemented using a variety of estimators. Here we explore two types of plurality valid estimators. Firstly, we explore an estimator which uses a regression model to create the residuals fed into a traditional mode-based estimator (MBE) [8], which we dub ‘multivariable-MBE’. This regression model could be created using any of the existing MVMR-estimators. Here we model the residuals using IVW (i.e. intercept-free linear regression).

Although ultimately arbitrary, we focused on IVW, rather than another type of MR estimator, because it provides the most intuitive way to understand validity conditions: using IVW to create residuals means that pleiotropic effects in the residual creation step are passed forwards to the MR analysis. Hence, the estimator should produce valid estimates if a plurality of SNP effects are valid instruments. On the other hand, if weighted median was used in the first step then this would require that at least 50% of these variants would be valid. It is not obvious how the identification assumptions for the two steps would interact when defining which settings the estimator would be valid in. In addition, MBE are known to be much less precise than other estimators, and IVW is currently the most efficient multivariable estimator. Using other estimators to create residuals could exacerbate this issue.

Since the contamination mixture method has several advantages, discussed above, we also implemented this framework using both the contamination mixture method. This ‘multivariable-CM’ estimator uses IVW to create residuals fed into a contamination mixture model.

Our estimators are therefore algorithmic rather than model-based in the sense that we are not starting by precisely defining a statistical model, and then deriving conclusion from the assumptions of the model. But, instead, using an algorithm (taking the mode of the distribution) to convert genetic data in MR estimates. The likely trade-off for the conceptual simplicity of this approach will not optimise statistical efficiency.

Deriving a standard error multivariable-MBE and multivariable-CM.

Assuming we have strong instruments (i.e. the first MR assumption is valid) we can use the first order approximation for the standard error of the Wald ratio that is typically used in two-sample MR studies. In a traditional univariable model this is defined as: 5) Where SE_y,i is the standard error of the ith variant-outcome association estimate.

In effect, this standard error is assuming that the variant-exposure association is measured with sufficient precision that we can assume that it contributes no error to the estimate of the causal effect. Under this assumption, the process of creating residuals will not increase the random error in the standard error of the Wald ratio. Hence, we model the standard error of the ith Wald ratio estimate as: 6)

Simulation study

We report our simulation study using the ADEMP (aims, data-generating mechanisms, estimands, methods, and performance measures) approach [21].

Aims.

We ran a simple simulation study to assess the performance of our plurality valid estimators when compared to other MVMR estimators.

Data-generating mechanisms.

We broadly simulate a setting in which there are two putative causal exposures for a single outcome. In the primary simulation we explore a setting in which the second exposure is pleiotropic (Fig 1), and where either both or neither of the exposures have a causal association with the outcome. We then explore how well the methods do under varying amounts of balanced and directional pleiotropy.

Download:

Fig 1. Directed acyclic graphs of the simulation data generative models.

E and E2 are the first and second exposures respectively, GRS is the genetic liability to the exposures, and O is the outcome, and C is a confounder.

https://doi.org/10.1371/journal.pone.0291183.g001

More formally, we simulated 200 single nucleotide polymorphisms (SNPs, which are common genetic variants) as independent and identically distributed binomial variables with the following parameters:

We additionally simulated the SNP effects on the exposures as independent and identically distributed normal variables

The beta values and allele frequencies here were chosen to be loosely based on the effect sizes for the genome wide significant SNPs in the Wootton et al. UK Biobank GWAS smoking [22].

For settings in which we simulated pleiotropy (Fig 1.2A and 1.2B), the pleiotropic SNP effects were simulated as:

Each simulation was repeated with BETA being set to either 0 or -0.03 to represent balanced and directional pleiotropy respectively. SE was always set to 0.1.

We then simulated a confounder as a normally distributed variable with the following parameters: C ~ N(0, 1²)

We then defined the first exposure as: where ε is an error term such that ε₁ ~ N(0, 1²).

The second exposure was defined as: where ε is an error term such that ε₃ ~ N(0, 1²).

When both exposures had null effects on the outcome (Fig 1.1B and 1.2B), the outcome was defined as: where ε is an error term such that ε₄ ~ N(0, 1²). p could take the value of 0, 20, 40, or 80 to represent pleiotropic effects for 0, 10%, 20% or 40% of SNPs.

When both exposures had non-null effects on the outcome (Fig 1.1A and 1.2A), the outcomes were defined as: Where p takes the same definition as it had for O_N;P.

The phenotypic beta values chosen were chosen arbitrarily. However, biases are often more visible with larger effect estimates. By choosing realistically large betas we hoped to clearly illustrate the possible strengths and limitations of the different methods. While the specific results of our simulation may not be applicable to any specific applied setting, more general trends should be.

GWAS summary statistics for each exposure variable were estimated from linear regression models. Each genetic association with each exposure, and the outcome, were estimated from a unique sample of 200,000 participants with no sample overlap with the other GWASs.

Estimands.

The causal effects of each exposure on the outcome.

Methods. We compare five methods for estimating the causal effect of the exposure on the outcome: multivariable IVW (intercept free multiple regression of the variant-outcome associations on the variant-exposure associations weighted by the inverse variance in the variant-outcome association), multivariable MR-Egger (multiple regression of the variant-outcome associations on the variant-exposure associations weighted by the inverse variance in the variant-outcome association), multivariable Weighted Median (quantile regression of the variant-outcome associations on the variant-exposure associations weighted by the inverse variance in the variant-outcome association), multivariable-MBE (using IVW to create the residuals and an MBE to estimate the causal effect), and multivariable-CM (using IVW to create the residuals and the contamination mixture method estimate the causal effect). IVW, MR-Egger, and weighted median were chosen because they appear to be some of the most widely used estimators which use different assumptions.

Performance measure.

The primary performance measures were mean bias, 95% CI width, and the percentage of times that the confidence intervals include zero. When there is no causal effect, the latter will represent the type-2 error rate. When there is a causal effect, it measures one minus the type-1 error rate. In additional analyses we also explore the standard deviation of the effect estimate (overall 1000 simulations), and coverage for the causal effect of the exposure on the outcome over the 1000 iterations. Bias was defined as the estimate minus the true causal effect. Thus, in the null settings, bias was the effect estimate. In the non-null settings, bias was the effect estimate of E₁ minus 0.3 and the estimate of E_Pl minus 0.4. Coverage was defied as the percentage of times that the 95% confidence interval included the causal effect (or zero). 95% CI width was operationalised as difference between the upper 95% CI limit and the lower 95% CI limit.

Applied example

We re-analysed the applied example (on the effect of intelligence, education, and household income on Alzheimer’s disease) from Grant and Burgess’ (2021) paper on pleiotropy robust estimators for MMVR [23]. This had previously been studied by Davies et al. and Anderson et al. [24, 25]. Anderson et al., in particular, had shown that a multivariable model was important for accounting for the collinearity between intelligence and education. Grant and Burgess then added household income to explore how the models worked with an additional risk factor.

Here we re-analysed the data used by Grant and Burgess (2021). They used 213 genetic variants from Davies et al. as instruments. These instruments had been clumped to ensure independence from each other and all had F statistics greater than 10, although the mean conditional F statistics ranged between 1.5 and 2.5. They used the Hill et al. GWAS of intelligence (n = 199,242 male and female European ancestry individuals) [26], Okbay et al. GWAS of years of education (n = 293,723 male and female European ancestry individuals) [27], and the Neale Lab UK Biobank GWAS of household income (n = 337,199 male and female European ancestry individuals) as sources of exposure data [28]. Since household income is an ordinal categorical variable, the genetic variant associations represent the increase in log odds of being in a higher income category per extra effect allele. Grant and Burgess (2021) additionally used Lambert et al. as a source of Alzheimer’s data (n = 74,046 male and female European ancestry individuals) [29]. More information on the data sources can be found in the original publications.

We implemented our two novel estimators, as well as IVW, MR-Egger, and MR-Median. Since the genetic associations with education and intelligence were in the same direction, the MR-Egger estimates can be interpreted as being oriented in the direction of either of these exposures.

Results

Simulation

Table 1 presents the results for the primary performance measures (bias and 95% CI width) of the simulations from the settings in which both exposures cause the outcome, while in Table 2 neither exposure exerts a causal effect on the outcome. The mean conditional F statistic for Exposure 1 was around 197, and 186 for Exposure 2.

Download:

Table 1. Primary results for setting where both exposures cause the outcome, and exposure 2 is pleiotropic.

https://doi.org/10.1371/journal.pone.0291183.t001

Download:

Table 2. Primary results for setting where neither exposure causes the outcome, and exposure 2 is pleiotropic.

https://doi.org/10.1371/journal.pone.0291183.t002

Bias.

In both Tables 1 and 2, all estimators performed well in the no-bias setting. The small amount of bias observed (0.1% - 0.5%) is explicable by weak instrument bias and the variability in the estimates (S1 and S2 Tables). When there was balanced pleiotropy, the multivariable-MBE seemed to underperform the non-plurality valid estimators while the multivariable -CM estimator appeared to do slightly better. Multivariable-CM was comparatively unbiased by even large amounts of balanced pleiotropy. However, moderate amounts of directional pleiotropy were sufficient to bias estimates more than the Median estimator. For example, in the setting where both exposures are causal and there was 40% directional pleiotropy, the first and second exposure estimates were biased by -0.055 and -0.008 respectively for the Median estimator, but 0.073 and 0.054 for multivariable-CM. Multivariable-MBE was more biased than multivariable-CM in all settings. For example, using the same simulation as above, multivariable-MBE was biased by 0.253 and -0.113 in the estimates for exposure 1 and 2 respectively.

95% CI width.

The multivariable-MBE had the widest 95% CIs of all the estimators. For example, in the no bias simulation, the 95% CI widths were five to ten time larger than for the other estimators. The non-plurality valid estimators generally had similarly wide 95% CI. Multivariable-CM generally had tighter 95% CI than the other estimators.

Coverage and power.

Since it had wide 95% CI, multivariable-MBE unsurprisingly had a low type-1 error rate (the 95% CI included the null in all settings > 98% when there was no association), but a high type-2 (the 95% CI included the null up to 35% of the time in settings where there was a true association). Multivariable-CM conversely had a very low type-2 error rate (the 95% CI never included the null when there was a true association). Multivariable-CM had a type-1 error rate at the nominal level (5%) for the 0% and 10% balance pleiotropy scenarios. In contrast, the Median estimator had type-1 error rates well below the nominal level in these scenarios. The type-1 error rates for Multivariable-CM were above the nominal level from 20% balanced pleiotropy, and for all levels of directional pleiotropy.

Additional outcomes.

Standard deviation of the effect estimates across the 1000 simulations: The SD of effect estimates between the multivariable-CM estimator and the non-plurality valid estimators were similar in the no-bias setting and when there was balanced pleiotropy (S1 and S2 Tables). However, multivariable-MBE had much wider SD, possibly because MBE produces less precise estimates than the contamination mixture method. In addition, all the plurality valid estimators had larger standard deviations when there was directional pleiotropy.

Coverage. Although all the estimators achieved 95% coverage when neither exposure was causal and there was no bias (S2 Table), surprisingly, except for Weighted Median and Multivariable-MBE, most estimators did not achieve at least 95% coverage when both exposures were causal (S1 Table). This might be because Weighted Median and Multivariable-MBE had the widest CI width (Tables 1 and 2) and all estimators were being effected by weak-instrument bias.

Applied example

As with Grant and Burgess (2021), the pleiotropy robust estimators provided consistent estimates of the effects of education, intelligence, and household income on Alzheimer’s disease (Table 3). All estimators concluded a null effect of education on Alzheimer’s, conditional on the other exposures. However, they all implied a negative effect on intelligence, although the 95% CI for MR-Egger and multivariable-MBE included the null hypotheses. All estimators estimated a log odds ratio of household income around 0.3, but again with 95% CI which included zero. As the original study concluded “[t]he consistency of the findings give strength to the assertion that intelligence has a causally protective effect on Alzheimer’s disease, conditional on years of education and household income. However, there is no evidence of a direct effect of years of education or household income on Alzheimer’s disease.”

Download:

Table 3. Results of the applied example exploring the effect to education and intelligence on Alzheimer’s disease.

https://doi.org/10.1371/journal.pone.0291183.t003

Discussion

Here we introduce two plurality valid estimators for multivariable Mendelian randomisation. Unlike most existing estimators, these use residual framework rather than multivariable regression models to produce the final effect estimates. We then used simulations with varying amounts of directional and balanced pleiotropy, as well as a re-analysis of the effect of intelligence, years of education, and household income on Alzheimer’s disease to compare the relative performance of our estimators with each other and existing estimators for MVMR.

As with previous analyses, our estimators implied that intelligence has a protective effect on Alzheimer’s disease, while years of education and household income do not. This has two important implications, firstly that as the years of mandatory education increase, there should not be a corresponding increase in Alzheimer’s. Secondly, our results imply that public health interventions to boost intelligence, beyond additional years of education, may be useful in reducing the burden of Alzheimer’s, although further research would be needed to confirm this hypothesis.

Of the two plurality valid estimators considered here, multivariable-CM, which uses IVW to create the residuals fed into a contamination mixture model, overall performed the best. It generally performed at least as well, if not better, than MR-Egger and IVW in terms of bias and precision in all settings. Indeed, when there was balanced pleiotropy, it was both more precise and less biased than IVW. However, in settings with moderate-to-high amounts of directional pleiotropy it was a lot more biased than Weighted median. Indeed, the high precision of the CM estimates is probably detrimental in this setting as it resulted in lower coverage than the other estimators. The divergence in performance between balanced and directional settings is probably, as discussed in the methods section, due to the multivariable-CM method assuming balanced pleiotropy. Hence, we would expect the estimator to perform better under situations where the distribution of Wald ratios with directional pleiotropy is similar to the assumed model with balanced pleiotropy, such as when the absolute amount of directional bias is small. The MR-Egger intercept and funnel plots have both been suggested as methods for exploring the presence of directional pleiotropy, and therefore may be useful additional analyses when employing the multivariable-CM estimator [30]. Thus, while we think it can help triangulate results between a univariate and multivariable setting by allowing the use of a plurality valid estimator in both analyses, or between multiple multivariable estimators, we cannot recommend using it alone unless there is a priori evidence that there should be no directional pleiotropy.

Multivariable-MBE was sufficiently imprecise that it is likely to be uninformative in practice, and we would therefore suggest that, when needed, researchers use another robust multivariable method instead. The poorer performance of the MV-MBE estimator is probably due to the greater uncertainty in the estimates produced by the mode-based estimator [5]: in Tables 1 and 2, the bias remains meaningfully smaller than half of the 95% CI width, despite often being more than ten times greater than the bias for the other estimators.

Our simulations are not without limitations. Firstly, although pleiotropy can vary continuously between studies, we explore only discrete amounts of this biases. This could potentially mask non-linearities in the performance of pleiotropy robust estimators for MVMR in the presence of these biases. In addition, all our simulations assume linearity and homogeneity (i.e. no effect modification or interaction) of the effects of the risk factors on the outcomes. A further limitation of this work is that we have only considered the scenario with two exposures in our simulation study. However, the framework we introduce in this paper does naturally extend to consider more than two exposures by using multivariable IVW in the first stage. Finally, although multivariable-CM and multivariable-MBE could be implemented using estimates other than IVW to create residuals, here we have implemented it explicitly using IVW because the interpretation of the validity assumption using the other estimators is unclear.

In summary, here we introduce a framework for implementing plurality valid estimators for multivariable Mendelian randomisation in the absence of modal regression. Of these, the multivariable-CM estimator, which uses IVW to create residuals then fed into a contamination mixture method, appeared to perform the best. Although it performed very well with large amounts of balanced pleiotropy, it underperformed estimators like Weighted median when there was directional pleiotropy. We hope these estimators (available from https://github.com/bar-woolf/MVMRmode/wiki) will further enable the future triangulation of univariable MR studies which have used plurality valid estimators with multivariable MR designs.

Supporting information

S1 Table. Results for additional outcomes when both exposures cause the outcome, and exposure 2 is pleiotropic.

https://doi.org/10.1371/journal.pone.0291183.s001

(DOCX)

S2 Table. Results for additional outcomes when neither exposure cause the outcome, and exposure 2 is pleiotropic.

https://doi.org/10.1371/journal.pone.0291183.s002

(DOCX)

Acknowledgments

This work was carried out using the computational facilities of the Advanced Computing Research Centre, University of Bristol - http://www.bris.ac.uk/acrc/.

References

1. Davey Smith G, Hemani G. Mendelian randomization: genetic anchors for causal inference in epidemiological studies. Human Molecular Genetics. 2014;23: R89–R98. pmid:25064373
- View Article
- PubMed/NCBI
- Google Scholar
2. Davey Smith G, Holmes MV, Davies NM, Ebrahim S. Mendel’s laws, Mendelian randomization and causal inference in observational data: substantive and nomenclatural issues. Eur J Epidemiol. 2020;35: 99–111. pmid:32207040
- View Article
- PubMed/NCBI
- Google Scholar
3. Burgess S, Small DS, Thompson SG. A review of instrumental variable estimators for Mendelian randomization. Stat Methods Med Res. 2017;26: 2333–2355. pmid:26282889
- View Article
- PubMed/NCBI
- Google Scholar
4. Hartwig FP, Davies NM, Hemani G, Smith GD. Two-sample Mendelian randomization: avoiding the downsides of a powerful, widely applicable but potentially fallible technique. International Journal of Epidemiology. 2016;45: 1717–1726. pmid:28338968
- View Article
- PubMed/NCBI
- Google Scholar
5. Slob EAW, Burgess S. A comparison of robust Mendelian randomization methods using summary data. Genetic Epidemiology. 2020;44: 313–329. pmid:32249995
- View Article
- PubMed/NCBI
- Google Scholar
6. Bowden J, Davey Smith G, Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. International Journal of Epidemiology. 2015;44: 512–525. pmid:26050253
- View Article
- PubMed/NCBI
- Google Scholar
7. Woolf B, Di Cara N, Moreno-Stokoe C, Skrivankova V, Drax K, Higgins JPT, et al. Investigating the transparency of reporting in two-sample summary data Mendelian randomization studies using the MR-Base platform. International Journal of Epidemiology. 2022; dyac074. pmid:35383846
- View Article
- PubMed/NCBI
- Google Scholar
8. Hartwig FP, Davey Smith G, Bowden J. Robust inference in summary data Mendelian randomization via the zero modal pleiotropy assumption. Int J Epidemiol. 2017;46: 1985–1998. pmid:29040600
- View Article
- PubMed/NCBI
- Google Scholar
9. Bowden J, Hemani G, Davey Smith G. Invited Commentary: Detecting Individual and Global Horizontal Pleiotropy in Mendelian Randomization—A Job for the Humble Heterogeneity Statistic? Am J Epidemiol. 2018;187: 2681–2685. pmid:30188969
- View Article
- PubMed/NCBI
- Google Scholar
10. Burgess S, Foley CN, Allara E, Staley JR, Howson JMM. A robust and efficient method for Mendelian randomization with hundreds of genetic variants. Nat Commun. 2020;11: 376. pmid:31953392
- View Article
- PubMed/NCBI
- Google Scholar
11. Sanderson E, Spiller W, Bowden J. Testing and correcting for weak and pleiotropic instruments in two-sample multivariable Mendelian randomization. Statistics in Medicine. 2021;40: 5434–5452. pmid:34338327
- View Article
- PubMed/NCBI
- Google Scholar
12. Carter AR, Sanderson E, Hammerton G, Richmond RC, Davey Smith G, Heron J, et al. Mendelian randomisation for mediation analysis: current methods and challenges for implementation. Eur J Epidemiol. 2021;36: 465–478. pmid:33961203
- View Article
- PubMed/NCBI
- Google Scholar
13. Schooling CM, Lopez PM, Yang Z, Zhao JV, Au Yeung SL, Huang JV. Use of Multivariable Mendelian Randomization to Address Biases Due to Competing Risk Before Recruitment. Frontiers in Genetics. 2021;11. Available: https://www.frontiersin.org/article/10.3389/fgene.2020.610852 pmid:33519914
- View Article
- PubMed/NCBI
- Google Scholar
14. Burgess S, Thompson SG. Multivariable Mendelian randomization: the use of pleiotropic genetic variants to estimate causal effects. Am J Epidemiol. 2015;181: 251–260. pmid:25632051
- View Article
- PubMed/NCBI
- Google Scholar
15. Woolf B. mesrument error and MR. 2021 [cited 23 Apr 2022].
- View Article
- Google Scholar
16. Rees JMB, Wood AM, Burgess S. Extending the MR-Egger method for multivariable Mendelian randomization to correct for both measured and unmeasured pleiotropy. Stat Med. 2017;36: 4705–4718. pmid:28960498
- View Article
- PubMed/NCBI
- Google Scholar
17. Lin Z, Xue H, Pan W. Robust multivariable Mendelian randomization based on constrained maximum likelihood. The American Journal of Human Genetics. 2023;110: 592–605. pmid:36948188
- View Article
- PubMed/NCBI
- Google Scholar
18. Grant AJ, Burgess S. A Bayesian approach to Mendelian randomization using summary statistics in the univariable and multivariable settings with correlated pleiotropy. bioRxiv; 2023. p. 2023.05.30.542988.
- View Article
- Google Scholar
19. Wang J, Zhao Q, Bowden J, Hemani G, Smith GD, Small DS, et al. Causal inference for heritable phenotypic risk factors using heterogeneous genetic instruments. PLOS Genetics. 2021;17: e1009575. pmid:34157017
- View Article
- PubMed/NCBI
- Google Scholar
20. Bowden J, Del Greco M F, Minelli C, Davey Smith G, Sheehan N, Thompson J. A framework for the investigation of pleiotropy in two-sample summary data Mendelian randomization. Stat Med. 2017;36: 1783–1802. pmid:28114746
- View Article
- PubMed/NCBI
- Google Scholar
21. Morris TP, White IR, Crowther MJ. Using simulation studies to evaluate statistical methods. Statistics in Medicine. 2019;38: 2074–2102. pmid:30652356
- View Article
- PubMed/NCBI
- Google Scholar
22. Wootton RE, Richmond RC, Stuijfzand BG, Lawn RB, Sallis HM, Taylor GMJ, et al. Evidence for causal effects of lifetime smoking on risk for depression and schizophrenia: a Mendelian randomisation study. Psychol Med. 2020;50: 2435–2443. pmid:31689377
- View Article
- PubMed/NCBI
- Google Scholar
23. Grant AJ, Burgess S. Pleiotropy robust methods for multivariable Mendelian randomization. Stat Med. 2021;40: 5813–5830. pmid:34342032
- View Article
- PubMed/NCBI
- Google Scholar
24. Anderson EL, Howe LD, Wade KH, Ben-Shlomo Y, Hill WD, Deary IJ, et al. Education, intelligence and Alzheimer’s disease: evidence from a multivariable two-sample Mendelian randomization study. International Journal of Epidemiology. 2020;49: 1163–1172. pmid:32003800
- View Article
- PubMed/NCBI
- Google Scholar
25. Davies NM, Hill WD, Anderson EL, Sanderson E, Deary IJ, Davey Smith G. Multivariable two-sample Mendelian randomization estimates of the effects of intelligence and education on health. Teare MD, Franco E, Burgess S, editors. eLife. 2019;8: e43990. pmid:31526476
- View Article
- PubMed/NCBI
- Google Scholar
26. Hill WD, Marioni RE, Maghzian O, Ritchie SJ, Hagenaars SP, McIntosh AM, et al. A combined analysis of genetically correlated traits identifies 187 loci and a role for neurogenesis and myelination in intelligence. Mol Psychiatry. 2019;24: 169–181. pmid:29326435
- View Article
- PubMed/NCBI
- Google Scholar
27. Okbay A, Beauchamp JP, Fontana MA, Lee JJ, Pers TH, Rietveld CA, et al. Genome-wide association study identifies 74 loci associated with educational attainment. Nature. 2016;533: 539–542. pmid:27225129
- View Article
- PubMed/NCBI
- Google Scholar
28. Rapid GWAS of thousands of phenotypes for 337,000 samples in the UK Biobank. In: Neale lab [Internet]. [cited 18 Jul 2022]. Available: http://www.nealelab.is/blog/2017/7/19/rapid-gwas-of-thousands-of-phenotypes-for-337000-samples-in-the-uk-biobank
29. Lambert J-C, Ibrahim-Verbaas CA, Harold D, Naj AC, Sims R, Bellenguez C, et al. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nat Genet. 2013;45: 1452–1458. pmid:24162737
- View Article
- PubMed/NCBI
- Google Scholar
30. Haycock PC, Burgess S, Wade KH, Bowden J, Relton C, Davey Smith G. Best (but oft-forgotten) practices: the design, analysis, and interpretation of Mendelian randomization studies. Am J Clin Nutr. 2016;103: 965–978. pmid:26961927
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Davey Smith G, Hemani G. Mendelian randomization: genetic anchors for causal inference in epidemiological studies. Human Molecular Genetics. 2014;23: R89–R98. pmid:25064373
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Davey Smith G, Holmes MV, Davies NM, Ebrahim S. Mendel’s laws, Mendelian randomization and causal inference in observational data: substantive and nomenclatural issues. Eur J Epidemiol. 2020;35: 99–111. pmid:32207040
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Burgess S, Small DS, Thompson SG. A review of instrumental variable estimators for Mendelian randomization. Stat Methods Med Res. 2017;26: 2333–2355. pmid:26282889
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Hartwig FP, Davies NM, Hemani G, Smith GD. Two-sample Mendelian randomization: avoiding the downsides of a powerful, widely applicable but potentially fallible technique. International Journal of Epidemiology. 2016;45: 1717–1726. pmid:28338968
View Article
PubMed/NCBI
Google Scholar

[14] View Article

[15] PubMed/NCBI

[16] Google Scholar

[ref5] 5. Slob EAW, Burgess S. A comparison of robust Mendelian randomization methods using summary data. Genetic Epidemiology. 2020;44: 313–329. pmid:32249995
View Article
PubMed/NCBI
Google Scholar

[18] View Article

[19] PubMed/NCBI

[20] Google Scholar

[ref6] 6. Bowden J, Davey Smith G, Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. International Journal of Epidemiology. 2015;44: 512–525. pmid:26050253
View Article
PubMed/NCBI
Google Scholar

[22] View Article

[23] PubMed/NCBI

[24] Google Scholar

[ref7] 7. Woolf B, Di Cara N, Moreno-Stokoe C, Skrivankova V, Drax K, Higgins JPT, et al. Investigating the transparency of reporting in two-sample summary data Mendelian randomization studies using the MR-Base platform. International Journal of Epidemiology. 2022; dyac074. pmid:35383846
View Article
PubMed/NCBI
Google Scholar

[26] View Article

[27] PubMed/NCBI

[28] Google Scholar

[ref8] 8. Hartwig FP, Davey Smith G, Bowden J. Robust inference in summary data Mendelian randomization via the zero modal pleiotropy assumption. Int J Epidemiol. 2017;46: 1985–1998. pmid:29040600
View Article
PubMed/NCBI
Google Scholar

[30] View Article

[31] PubMed/NCBI

[32] Google Scholar

[ref9] 9. Bowden J, Hemani G, Davey Smith G. Invited Commentary: Detecting Individual and Global Horizontal Pleiotropy in Mendelian Randomization—A Job for the Humble Heterogeneity Statistic? Am J Epidemiol. 2018;187: 2681–2685. pmid:30188969
View Article
PubMed/NCBI
Google Scholar

[34] View Article

[35] PubMed/NCBI

[36] Google Scholar

[ref10] 10. Burgess S, Foley CN, Allara E, Staley JR, Howson JMM. A robust and efficient method for Mendelian randomization with hundreds of genetic variants. Nat Commun. 2020;11: 376. pmid:31953392
View Article
PubMed/NCBI
Google Scholar

[38] View Article

[39] PubMed/NCBI

[40] Google Scholar

[ref11] 11. Sanderson E, Spiller W, Bowden J. Testing and correcting for weak and pleiotropic instruments in two-sample multivariable Mendelian randomization. Statistics in Medicine. 2021;40: 5434–5452. pmid:34338327
View Article
PubMed/NCBI
Google Scholar

[42] View Article

[43] PubMed/NCBI

[44] Google Scholar

[ref12] 12. Carter AR, Sanderson E, Hammerton G, Richmond RC, Davey Smith G, Heron J, et al. Mendelian randomisation for mediation analysis: current methods and challenges for implementation. Eur J Epidemiol. 2021;36: 465–478. pmid:33961203
View Article
PubMed/NCBI
Google Scholar

[46] View Article

[47] PubMed/NCBI

[48] Google Scholar

[ref13] 13. Schooling CM, Lopez PM, Yang Z, Zhao JV, Au Yeung SL, Huang JV. Use of Multivariable Mendelian Randomization to Address Biases Due to Competing Risk Before Recruitment. Frontiers in Genetics. 2021;11. Available: https://www.frontiersin.org/article/10.3389/fgene.2020.610852 pmid:33519914
View Article
PubMed/NCBI
Google Scholar

[50] View Article

[51] PubMed/NCBI

[52] Google Scholar

[ref14] 14. Burgess S, Thompson SG. Multivariable Mendelian randomization: the use of pleiotropic genetic variants to estimate causal effects. Am J Epidemiol. 2015;181: 251–260. pmid:25632051
View Article
PubMed/NCBI
Google Scholar

[54] View Article

[55] PubMed/NCBI

[56] Google Scholar

[ref15] 15. Woolf B. mesrument error and MR. 2021 [cited 23 Apr 2022].
View Article
Google Scholar

[58] View Article

[59] Google Scholar

[ref16] 16. Rees JMB, Wood AM, Burgess S. Extending the MR-Egger method for multivariable Mendelian randomization to correct for both measured and unmeasured pleiotropy. Stat Med. 2017;36: 4705–4718. pmid:28960498
View Article
PubMed/NCBI
Google Scholar

[61] View Article

[62] PubMed/NCBI

[63] Google Scholar

[ref17] 17. Lin Z, Xue H, Pan W. Robust multivariable Mendelian randomization based on constrained maximum likelihood. The American Journal of Human Genetics. 2023;110: 592–605. pmid:36948188
View Article
PubMed/NCBI
Google Scholar

[65] View Article

[66] PubMed/NCBI

[67] Google Scholar

[ref18] 18. Grant AJ, Burgess S. A Bayesian approach to Mendelian randomization using summary statistics in the univariable and multivariable settings with correlated pleiotropy. bioRxiv; 2023. p. 2023.05.30.542988.
View Article
Google Scholar

[69] View Article

[70] Google Scholar

[ref19] 19. Wang J, Zhao Q, Bowden J, Hemani G, Smith GD, Small DS, et al. Causal inference for heritable phenotypic risk factors using heterogeneous genetic instruments. PLOS Genetics. 2021;17: e1009575. pmid:34157017
View Article
PubMed/NCBI
Google Scholar

[72] View Article

[73] PubMed/NCBI

[74] Google Scholar

[ref20] 20. Bowden J, Del Greco M F, Minelli C, Davey Smith G, Sheehan N, Thompson J. A framework for the investigation of pleiotropy in two-sample summary data Mendelian randomization. Stat Med. 2017;36: 1783–1802. pmid:28114746
View Article
PubMed/NCBI
Google Scholar

[76] View Article

[77] PubMed/NCBI

[78] Google Scholar

[ref21] 21. Morris TP, White IR, Crowther MJ. Using simulation studies to evaluate statistical methods. Statistics in Medicine. 2019;38: 2074–2102. pmid:30652356
View Article
PubMed/NCBI
Google Scholar

[80] View Article

[81] PubMed/NCBI

[82] Google Scholar

[ref22] 22. Wootton RE, Richmond RC, Stuijfzand BG, Lawn RB, Sallis HM, Taylor GMJ, et al. Evidence for causal effects of lifetime smoking on risk for depression and schizophrenia: a Mendelian randomisation study. Psychol Med. 2020;50: 2435–2443. pmid:31689377
View Article
PubMed/NCBI
Google Scholar

[84] View Article

[85] PubMed/NCBI

[86] Google Scholar

[ref23] 23. Grant AJ, Burgess S. Pleiotropy robust methods for multivariable Mendelian randomization. Stat Med. 2021;40: 5813–5830. pmid:34342032
View Article
PubMed/NCBI
Google Scholar

[88] View Article

[89] PubMed/NCBI

[90] Google Scholar

[ref24] 24. Anderson EL, Howe LD, Wade KH, Ben-Shlomo Y, Hill WD, Deary IJ, et al. Education, intelligence and Alzheimer’s disease: evidence from a multivariable two-sample Mendelian randomization study. International Journal of Epidemiology. 2020;49: 1163–1172. pmid:32003800
View Article
PubMed/NCBI
Google Scholar

[92] View Article

[93] PubMed/NCBI

[94] Google Scholar

[ref25] 25. Davies NM, Hill WD, Anderson EL, Sanderson E, Deary IJ, Davey Smith G. Multivariable two-sample Mendelian randomization estimates of the effects of intelligence and education on health. Teare MD, Franco E, Burgess S, editors. eLife. 2019;8: e43990. pmid:31526476
View Article
PubMed/NCBI
Google Scholar

[96] View Article

[97] PubMed/NCBI

[98] Google Scholar

[ref26] 26. Hill WD, Marioni RE, Maghzian O, Ritchie SJ, Hagenaars SP, McIntosh AM, et al. A combined analysis of genetically correlated traits identifies 187 loci and a role for neurogenesis and myelination in intelligence. Mol Psychiatry. 2019;24: 169–181. pmid:29326435
View Article
PubMed/NCBI
Google Scholar

[100] View Article

[101] PubMed/NCBI

[102] Google Scholar

[ref27] 27. Okbay A, Beauchamp JP, Fontana MA, Lee JJ, Pers TH, Rietveld CA, et al. Genome-wide association study identifies 74 loci associated with educational attainment. Nature. 2016;533: 539–542. pmid:27225129
View Article
PubMed/NCBI
Google Scholar

[104] View Article

[105] PubMed/NCBI

[106] Google Scholar

[ref28] 28. Rapid GWAS of thousands of phenotypes for 337,000 samples in the UK Biobank. In: Neale lab [Internet]. [cited 18 Jul 2022]. Available: http://www.nealelab.is/blog/2017/7/19/rapid-gwas-of-thousands-of-phenotypes-for-337000-samples-in-the-uk-biobank

[ref29] 29. Lambert J-C, Ibrahim-Verbaas CA, Harold D, Naj AC, Sims R, Bellenguez C, et al. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nat Genet. 2013;45: 1452–1458. pmid:24162737
View Article
PubMed/NCBI
Google Scholar

[109] View Article

[110] PubMed/NCBI

[111] Google Scholar

[ref30] 30. Haycock PC, Burgess S, Wade KH, Bowden J, Relton C, Davey Smith G. Best (but oft-forgotten) practices: the design, analysis, and interpretation of Mendelian randomization studies. Am J Clin Nutr. 2016;103: 965–978. pmid:26961927
View Article
PubMed/NCBI
Google Scholar

[113] View Article

[114] PubMed/NCBI

[115] Google Scholar

Figures

Abstract

Background

Methods

Results

Conclusions

Background

Methods

Theoretical background

Notation and assumptions.

Statistical framework.

Deriving a standard error multivariable-MBE and multivariable-CM.

Simulation study

Aims.

Data-generating mechanisms.

Estimands.

Performance measure.

Applied example

Results

Simulation

Bias.

95% CI width.

Coverage and power.

Additional outcomes.

Applied example

Discussion

Supporting information

S1 Table. Results for additional outcomes when both exposures cause the outcome, and exposure 2 is pleiotropic.

S2 Table. Results for additional outcomes when neither exposure cause the outcome, and exposure 2 is pleiotropic.

Acknowledgments

References