Profiling compliers and noncompliers for instrumental variable analysis with covariates: A weighting approach

Byeong Yeob Choi

doi:10.1371/journal.pone.0283223

Abstract

Instrumental variable (IV) estimation is an essential tool to estimate the causal effect of a treatment in randomized experiments when noncompliance exists. In such studies, standard statistical approaches can be biased because compliers and noncompliers can differ in unmeasured ways that affect both the compliance behavior and outcome. Based on a few assumptions such as monotonicity, the IV estimand represents the causal effect of compliers. Profiling compliers and noncompliers has important implications because the IV estimand applies only to compliers. A method for estimating the covariate means for compliers and noncompliers has recently been proposed in political sciences literature. However, this approach requires an assumption that the instrument is randomly assigned, which confines the application of this approach to randomized experiments. In this study, we present two weighting methods for profiling compliers and noncompliers when the instrument and compliance behavior are confounded by several covariates. The proposed approach can be used for both experimental and nonexperimental studies, and hence is more broadly applicable. For the development, an instrumental propensity score is adopted to account for confounded instruments. We demonstrate the utility of the proposed methods based on simulation and real data experiments.

Citation: Choi BY (2023) Profiling compliers and noncompliers for instrumental variable analysis with covariates: A weighting approach. PLoS ONE 18(6): e0283223. https://doi.org/10.1371/journal.pone.0283223

Editor: Guangyu Tong, Yale University, UNITED STATES

Received: July 7, 2022; Accepted: March 3, 2023; Published: June 15, 2023

Copyright: © 2023 Byeong Yeob Choi. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the paper and its Supporting information files.

Funding: BYC is supported in part by the National Cancer Institute for the Mays Cancer Center (P30CA054174) at the UT Health San Antonio. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. There was no additional external funding received for this study.

Competing interests: The author has declared that no competing interests exist.

Introduction

Instrumental variable (IV) analysis is used to estimate the causal effect of a treatment in the presence of unmeasured confounders. Angrist et al. [1] demonstrated that when the treatment assignment is randomized, the IV estimand represents the average causal effect of compliers, who comply with the assigned treatment. This estimand is called the local average treatment effect (LATE). Marbach and Hangartner [2] presented identification results to profile compliers and noncompliers, the required assumptions, and the estimators. Their work is important because the LATE addresses only the causal effect on the subpopulation of compliers, but not on the whole population. We will refer to their approach as the MH approach.

The MH approach is useful for studying the profiles of compliers and noncompliers with unconfounded instruments. However, instruments are often confounded, which means that they are random only if several confounders are controlled. For example, the IV analysis of Abadie [3] studied the effects of 401(k) participation on savings, and family income was controlled in the IV model to make 401(k) eligibility become a valid IV. Card [4] used two-stage least squares (2SLS) to estimate the effect of education on earnings, where IV was an indicator of whether an individual grew up near a 4-year college. His 2SLS model included some covariates such as age, race, and family backgrounds to make college proximity a valid IV.

Before the MH approach, few researchers devised methods for profiling compliers with confounded instruments. For example, Baiocchi et al. [5] identified the complier mean as a ratio of two quantities, similar to our approach. However, their approach involves integrating a conditional compliance probability with respect to the conditional joint probability of all measured covariates, given the covariate to be profiled. Thus, a practical method to perform this integration calculation is needed to apply the result. Hangartner et al. [6] used matching to control for IV confounding and applied the MH approach based on that the IV is approximately as-if randomized in the matched data set. However, matching does not guarantee a complete removal of covariate imbalance particularly when there are many matching factors.

After the landmark papers of Imbens and Angrist [7], and Angrist, Imbens, and Rubin [1], several researchers developed IV methods with covariates. For example, Abadie [3], Frolich [8], and Tan [9] presented IV methods based on regression, matching, and weighting for adjustments of the confounders of the instrument. These authors commonly adopted an instrumental propensity score (IPS), which is the probability of being encouraged toward the active treatment conditional on measured covariates, to identify the characteristics of compliers. In this study, we adopt the IPS to develop a weighting approach that is more generally applicable than the MH approach. Our approach is based on inverse probability weighting (IPW) and augmented IPW (AIPW).

The remainder of this article is organized as follows. We briefly review the MH approach, and then present our weighting approach that accounts for confounded IVs. Based on simulations, we evaluate the proposed IPW and AIPW methods in data with confounded IVs by some covariates. The practical utility of the proposed methods is illustrated by a study of education on future earnings. We end the article with concluding remarks.

Profiling approach for unconfounded instruments

We introduce some notations and assumptions to define the profiles of compliers and noncompliers. Let Z ∈ {0, 1} be the binary instrument, with 1 indicating encouragement toward treatment and 0 indicating encouragement toward control. Let D(z) ∈ {0, 1} be the binary potential treatment value that would be observed if Z = z. The observed treatment is defined as D = ZD(1) + (1 − Z)D(0). We define a column vector of covariates X. To refer to the variables for subject i among N subjects, we add a subscript “i” to these notations. In other words, {Z_i, D_i(0), D_i(1), X_i} denote the instrument, potential treatment values, and vector of covariates for subject i, where X_i = (X_i1, …, X_ip)^T and X_ij is the jth covariate for subject i.

Angrist et al. [1] classified the population into four groups based on the values of D(1) and D(0): compliers if D(1) > D(0), always-takers if D(1) = D(0) = 1, never-takers if D(1) = D(0) = 0, and defiers if D(1) < D(0). Because defiers are excluded under Assumption 1 below, noncompliers in IV analysis include only the always-takers and never-takers.

Marbach and Hangartner [2] presented two assumptions to identify the profiles of compliers and noncompliers for IV analysis:

Assumption 1 (Monotonicity) D(1) > D(0).

Assumption 2 (Independence of the instrument) {D(0), D(1), X} ⊥ Z.

Assumption 1 excludes defiers. Assumption 2 means that Z is randomly assigned without controlling any covariates, which occurs in randomized experiments. Based on Assumptions 1 and 2, Marbach and Hangartner [2] identified the covariate means for always-takers, never-takers, and compliers. However, these identification results are only applicable when Assumption 2 is satisfied. The results of Marbach and Hangartner [2] imply that the means of X_ij for always-takers and never-takers can be identified by The covariate mean for compliers can be identified by

The approach of Marbach and Hangartner [2] can be directly extended to the cases with confounded instruments when the covariates to be profiled are distinct from those that must be controlled to satisfy Assumption 2. Suppose that there exists a vector of covariates, W, such that Assumption 2 holds if this vector is adjusted for. In other words, {D_i(0), D_i(1), X_ij} ⊥ Z_i|W_i. Then, the means of X_ij for always-takers and never-takers are identified by Clearly, these equations are not much helpful if W_i contains X_ij. Based on these equations, the mean of X_ij for compliers can be identified by adding W_i to the equation as conditioning variables: E[X_ij|D_i(1) > D_i(0), W_i]. However, this extension has very limited applicability, which provides motivation for our weighting approach.

Weighting approach for confounded instruments

In this section, we propose two weighting methods to identify the profiles of compliers and noncompliers when the IV and potential treatment values are confounded by several observed covariates.

Inverse probability weighting (IPW)

To develop an IPW method, we first replace Assumption 2 with the following, which is adopted from Abadie [3], Frolich [8], and Tan [9]:

Assumption 3 (Conditional independence of the instrument) {D(0), D(1)} ⊥ Z|X.

Assumption 3 means that the instrument is independent of the potential treatment variables conditional on X. Therefore, Z and {D(0), D(1)} can be confounded by X. We address this confounding by adopting an IPS, e(X) = P[Z = 1|X]. The IPSs are unknown unless the study is a randomized experiment. In addition to Assumption 3, we impose an assumption that the IPS for every subject is strictly between 0 and 1.

Assumption 4 (Positivity) 0 < e(X) < 1 for every subject.

We introduce additional notations as follows. Let f_j(x) = P[X_ij = x] represent the probability mass function or probability density function of X_ij. To refer to the densities of X_j for compliers, always-takers, and never-takers, we add superscripts “c”, “a”, and “n” to f_j(x), respectively. In other words, these densities are denoted by The complier mean of X_ij can be expressed as By Bayes’ theorem, Therefore, can be written as (1)

Under monotonicity, the denominator of Eq (1) can be written as P[D_i(1) > D_i(0)] = E[D_i(1) − D_i(0)]. This proportion of compliers can be identified by E[D_i(1) − D_i(0)] = E[D_i|Z_i = 1] − E[D_i|Z_i = 0] if Assumption 2 holds. However, when the instrument is confounded by X, this identification result does not hold because E[D_i(z)] ≠ E[D_i|Z_i = z] for z = 0, 1. To identify E[D_i(1) − D_i(0)], we express the expectation of each potential treatment as the IPW mean based on the result of Lunceford and Davidian [10]:

Note that under monotonicity, Hence, the numerator of Eq (1) can be written as which represents the average causal effect of Z_i on X_ijD_i. As a result, can be expressed as

To identify E[X_ijD_i(1)], it is worthwhile to note that e_i = E[Z_i|D_i(1), X_i] because Z_i and D_i(1) are independent conditional on X_i under Assumption 3. Then, it follows that (2)

Based on that D_i(1)Z_i = D_iZ_i and the law of total expectation, Eq (2) becomes (3)

In a similar way, E[X_ijD_i(0)] can be identified by (4) Therefore, can be identified as described below in Theorem 1, which summarizes the nonparametric identification results for profiling compliers and noncompliers via IPW. The means of X_j for always-takers and never-takers are defined as The proofs for the IPW representations of and are provided in Appendix A.

Theorem 1 (IPW representations) Under Assumptions 1, 3 and 4, the means of X_ij for compliers, always-takers, and never-takers are identified by the following IPW representations, respectively:

For estimation, we assume that the IPS is parameterized with a regression parameter vector β: e_i = e(X_i, β). In practice, we use a logistic regression model to estimate e(X_i, β): The parameter β is estimated by maximum likelihood (ML). Let denotes the ML estimator. Then, Theorem 1 suggests immediately the following IPW estimators: where is the estimated IPS for subject i, Note that the normalization terms in , and , converge in probability to 1. The normalization terms are canceled out for and .

In Theorem 2, we present the asymptotic properties of the proposed IPW estimators. Their asymptotic variances are presented in Appendix B.

Theorem 2 (Asymptotic distributions of the IPW estimators) Under Assumptions 1, 3 and 4, the random quantities , , and converge in distribution to mean-zero normal random variables.

Augmented IPW

Robins et al. [11] proposed an augmented IPW (AIPW) estimator for the average treatment effect (ATE). This estimator is consistent for the ATE when either the treatment or the outcome model is correctly specified. In addition, the AIPW estimator is semiparametrically efficient if both models are correctly specified. We adopt this AIPW estimator for profiling compliers and noncompliers to obtain potential accuracy and precision gains, when compared to the proposed IPW estimators.

To develop the AIPW estimators, we augment treatment models into the IPW estimators. Let m_1i and m_0i denote the mean models of D for subject i if that subject had been assigned to Z = 1 and Z = 0 groups: Under Assumptions 3, m_1i = E[D_i(1)|X_i] and m_0i = E[D_i(0)|X_i]. We augment these treatment models into the IPW representations in Theorem 1, which gives the following theorem for the doubly robustness of the AIPW representations.

Theorem 3 (Doubly robustness of the AIPW representations) Under Assumptions 1, 3 and 4, the following AIPW representations correctly identify the means of X_ij for compliers, always-takers, and never-takers if either the IPS or treatment model is correctly specified: The Proof for Theorem 3 is presented in Appendix C.

For estimation, we assume that m_1i and m_0i are parameterized with regression parameter vectors, α₁ and α₀, as follows: In practice, we use logistic regression models to estimate the treatment expectations: for z = 0, 1, . We denote by and the ML estimators for α₁ and α₀, respectively. These ML estimators are obtained by fitting the logistic regression models of D on X to the groups of Z = 1 and Z = 0, respectively. We denote by and the estimated treatment means. Based on Theorem 3, we suggest the following AIPW estimators:

In Theorem 4 below, we present the asymptotic properties of the AIPW estimators. Their asymptotic variances are presented in Appendix D.

Theorem 4 (Asymptotic distributions of the AIPW estimators) Under Assumptions 1, 3 and 4, the random quantities , , and converge in distribution to mean-zero normal random variables.

Simulation

The proposed IPW and AIPW estimators were evaluated based on Monte Caro simulations with unconfounded and confounded IVs. Because the MH approach is valid for unconfounded IVs, we also included the MH estimators in the simulations with an unconfounded instrument. Two covariates, X₁ and X₂, were generated from a multivariate normal distribution with a zero mean vector and a compound symmetric covariance matrix with unit variance and a correlation coefficient of 0.5. Once these normal covariates were generated, X₂ was dichotomized to be a binary variable: I(X₂ > 0).

The instrument Z was generated from the following logistic regression function: To simulate different degrees of confounding for the instrument Z, we used the following values for (β₁, β₂): (β₁, β₂) = (0, 0) for no confounding, (β₁, β₂) = (log 1.2, log 1.5) for mild confounding, and (β₁, β₂) = (log 1.5, log 2) for moderate confounding. The intercept term β_o was selected to have the marginal mean of Z be approximately equal to 0.30 for all three confounding scenarios for the instrument.

A compliance class U was defined such that U = 0, 1 and 2 indicated a never-taker, an always-taker, and a complier, respectively. The probability of being a complier was a logistic regression function: The probabilities of a never-taker and an always-taker were the same as (1 − P[U = 2|X])/2. Then, D was a deterministic function of Z and U: D = ZI(U = 2) + UI(U ≠ 2). These probability models for compliers and noncompliers were used across all confounding scenarios for the instrument. Our simulation models yielded approximately P[U = 2] = 0.32 and P[U = 0] = P[U = 1] = 0.34.

By construction of our simulations, the true treatment model was not exactly a logistic function; hence there was a model misspecification for the treatment model because logistic models were used to estimate m_1i and m_0i. However, our simulation results demonstrated that the AIPW estimators were robust to this mild model misspecification. The reason the true treatment model was not a logistic model is as follows. The stratum (Z = 1, D = 1) contains compliers and always takers, and therefore m_1i = P[U_i = 2|X_i] + P[U_i = 1|X_i]. The stratum (Z = 0, D = 1) contains only always takers, and hence m_0i = P[U_i = 1|X_i]. Because P[U = 0|X] = P[U = 1|X] = (1 − P[U = 2|X])/2 under our simulation models, we have m_1i = (1 + P[U_i = 2|X_i])/2 and m_0i = (1 − P[U_i = 2|X_i])/2, which are not logistic functions.

We presented the performance of the profiling estimators for continuous X₁ and dichotomous X₂. The true means of X₁ and X₂ for compliers and noncompliers were calculated based on a very large sample size with several iterations. These true means are determined by the probability models for the compliance classes; hence they were the same for all confounding scenarios for the instrument. The true means of X₁ for compliers, always-takers and never-takers were 0.133, -0.064 and -0.064, respectively. The true means of X₂ for those compliance classes were 0.457, 0.520 and 0.520, respectively. The sample sizes were from 500 to 6000. The number of simulations was 500 for a given sample size.

To compare different methods, we calculated the bias, empirical standard error (ESE), and 95% confidence interval coverage rate (CR). These performance measures were calculated based on the 500 simulated data sets under each simulation scenario at a given sample size. Bias was obtained by taking the absolute value of the average of the 500 values of (estimate—true value). ESE was the empirical Monte Carlo standard error of the 500 estimates. The standard error estimates of the MH estimators were obtained by using the R package ivdesc [12]. Those of the IPW and AIPW estimators were obtained based on Theorems 2 and 4. CR was the empirical coverage of 95% confidence intervals using the estimated standard errors across the 500 simulated data sets.

In general, the IPW and AIPW estimators performed similarly to the MH estimators when the instrument was randomly assigned (Figs 1–3), except that the IPW and AIPW estimators of X₁ and X₂ for never-takers were slightly less variable than the corresponding MH estimators (Fig 2). All the methods yielded valid estimators to profile compliers and noncompliers (Figs 1–3): the bias reduced to zero, and the coverage rates converged to the target coverage rate of 95% as the sample size increased.

Download:

Fig 1. Bias of the MH, IPW, and AIPW estimators for continuous X₁ and dichotomous X₂ when the instrument is not confounded with the compliance behavior.

https://doi.org/10.1371/journal.pone.0283223.g001

Download:

Fig 2. Empirical standard errors (ESE) of the MH, IPW, and AIPW estimators for continuous X₁ and dichotomous X₂ when the instrument is not confounded with the compliance behavior.

https://doi.org/10.1371/journal.pone.0283223.g002

Download:

Fig 3. Coverage rates of the MH, IPW, and AIPW estimators for continuous X₁ and dichotomous X₂ when the instrument is not confounded with the compliance behavior.

https://doi.org/10.1371/journal.pone.0283223.g003

For the scenarios with mildly (Figs 4–6) and moderately (Figs 7–9) confounded IVs, we compared only the IPW and AIPW estimators because the MH estimators were not valid in such settings. The bias of the IPW and AIPW estimators reduced to zero (Figs 4 and 7), and the coverage rates of those converged to the target coverage rate of 95% as the sample size increased (Figs 6 and 9), which implied that the proposed point and standard error estimates addressed the confounded instruments appropriately. The AIPW method yielded coverage rates that were slightly closer to the target rate than the IPW method in estimating the means of X₁ and X₂ for never-takers when the instrument was mildly confounded (Fig 6). In addition, the AIPW estimator was slightly less variable than the IPW estimator in estimating the mean of X₁ for compliers when the instrument was moderately confounded (Fig 8). Except those cases, in general, the IPW and AIPW estimators performed similarly.

Download:

Fig 4. Bias of the IPW and AIPW estimators for continuous X₁ and dichotomous X₂ when the instrument is mildly confounded with the compliance behavior.

https://doi.org/10.1371/journal.pone.0283223.g004

Download:

Fig 5. Empirical standard errors (ESE) of the IPW and AIPW estimators for continuous X₁ and dichotomous X₂ when the instrument is mildly confounded with the compliance behavior.

https://doi.org/10.1371/journal.pone.0283223.g005

Download:

Fig 6. Coverage rates of the IPW and AIPW estimators for continuous X₁ and dichotomous X₂ when the instrument is mildly confounded with the compliance behavior.

https://doi.org/10.1371/journal.pone.0283223.g006

Download:

Fig 7. Bias of the IPW and AIPW estimators for continuous X₁ and dichotomous X₂ when the instrument is moderately confounded with the compliance behavior.

https://doi.org/10.1371/journal.pone.0283223.g007

Download:

Fig 8. Empirical standard errors (ESE) of the IPW and AIPW estimators for continuous X₁ and dichotomous X₂ when the instrument is moderately confounded with the compliance behavior.

https://doi.org/10.1371/journal.pone.0283223.g008

Download:

Fig 9. Coverage rates of the IPW and AIPW estimators for continuous X₁ and dichotomous X₂ when the instrument is moderately confounded with the compliance behavior.

https://doi.org/10.1371/journal.pone.0283223.g009

Application

We applied the proposed methodology to the data for the causal effects of education on earnings. The data set included 3010 observations between 14 and 24 years in 1996 from the National Longitudinal Survey of Young Men (NLSYM), originally analyzed by Card [4], and afterward by several statistical articles [9, 13, 14]. In his analysis, the outcome was log-transformed wages in the survey year (1976), the treatment was years of education, and the IV was a binary indicator for growing up near a 4-year college in 1966. In our analysis, instead of years of education, we used education beyond high school (EBH) as the treatment, as in Tan [9]. In other words, the treatment variable D was an indicator for whether the years of education were more than 12 years. Our outcome Y was log wages in cents per hour in 1976. The instrument was proximity to a 4-year college in 1966. The compliers in our analysis were those who would have attained EBH if they had lived close to a 4-year college. There were 68.2% who grew up near a 4-year college and 50.5 had EBH. The covariates X to adjust for confounding included age, age squared, race, an indicator for living with both mother and father at age 14, an indicator for living with single mom at age 14, an indicator for living with stepparents at age 14, and residence in the South in 1966.

We first estimated the LATE of EBH on log wages without adjusting for any covariates. Based on Angrist et al. [1], this LATE was the intent-to-treat effect of the IV on log wages divided by that of the IV on EBH. Using the R package ivmodel [15], the LATE was calculated as 1.28 with a 95% confidence interval of (0.84, 1.72). Next, we estimated the LATE of EBH with covariates X using the approach of Abadie [3], which uses the kappa weights based on the IPS of college proximity. We fitted a logistic regression model of college proximity on the covariates X to estimate the IPS. The estimated IPS model revealed that residence in the South was strongly associated with the IV with an odds ratio of 0.38. Using the R package LARF [16], the calculated LATE was 0.87 with a 95% confidence interval of (0.49, 1.26). Therefore, both LATE estimates indicated that EBH increased future wages.

To interpret these LATE estimates of EBH, it would be reasonable to characterize the compliers and noncompliers because they can be different in important ways, and the LATE addresses the causal effect only for the compliers. To this end, we used the MH approach and our weighing approach to estimate the covariate means of the compliers and noncompliers. Different profiling results were observed between the MH and our approaches for some covariates. The IPW and AIPW results were very similar, and thus we displayed only the MH (Fig 10) and AIPW (Fig 11) estimates and the corresponding 95% confidence intervals. Because of the strong correlation between residence in the South and the IV, the MH and AIPW methods gave very different profiling results for that covariate: the MH estimates implied that the compliers were significantly less likely to live in the South in 1996 than the noncompliers, but the compliers appeared to live in the South with a similar proportion to those of the noncompliers based on the AIPW estimates. Even though the MH and weighting approaches gave different results for some covariates, both approaches demonstrated that the compliers and noncompliers were different. The compliers were older than the noncompliers, less likely to be black than the never-takers, and more likely to live with single mother at age 14 than the always-takers.

Download:

Fig 10. MH estimates of the means and their 95% confidence intervals for the characteristics of the complier and noncomplier subpopulations in the NLSYM data.

C, A, and N on the Y-axis indicate the subpopulations of compliers, always-takers, and never-takers, respectively. Momdad14 is an indicator for living with both mother and father at age 14. Smom14 is an indicator for living with single mom at age 14. Step14 is an indicator for living with stepparents at age 14. South is an indicator for residence in the South in 1966.

https://doi.org/10.1371/journal.pone.0283223.g010

Download:

Fig 11. AIPW estimates of the means and their 95% confidence intervals for the characteristics of the complier and noncomplier subpopulations in the NLSYM data.

C, A, and N on the Y-axis indicate the subpopulations of compliers, always-takers, and never-takers, respectively. Momdad14 is an indicator for living with both mother and father at age 14. Smom14 is an indicator for living with single mom at age 14. Step14 is an indicator for living with stepparents at age 14. South is an indicator for residence in the South in 1966.

https://doi.org/10.1371/journal.pone.0283223.g011

Conclusion

In this study, we proposed weighting estimators to profile compliers and noncompliers when the instrument is confounded with the compliance behavior. The proposed estimators are based on IPW and AIPW, which are frequently used for estimation of causal effects. The proposed estimators are also valid for instruments that are randomly assigned, and hence are more generally applicable than the MH estimators. The simulation study demonstrated that the proposed methods correctly account for confounded instruments. It also showed that the use of the proposed estimators can bring some precision gains in estimating the covariate means of noncompliers in randomized experiments when compared to the MH estimators. We applied our profiling approach to the data set from Card [4], where the compliers were those who would have attained EBH if they had lived near a 4-year college. Because residence in the South was strongly associated with EBH, the MH and our weighting approaches gave significantly different results for that covariate, which implies that accounting for significant correlations between the covariates and the IV is important to profile the compliers and noncompliers accurately.

Among the two proposed methods, the AIPW method yielded slightly better coverage rates and smaller standard errors than the IPW method for some simulation scenarios. In the analysis of the NLSYM data, the two methods gave almost identical estimates and standard errors for the baseline mean characteristics of compliers and noncompliers. Based on these observations, the AIPW method would be preferable to the IPW method, unless the treatment models for the AIPW are not severely misspecified.

We used ML to estimate the IPSs for the proposed IPW and AIPW estimators. This allowed us to derive their asymptotic distributions based on standard asymptotic theory. Based on the score equation, ML finds the parameter values such that the average of prediction errors is minimized. However, estimation of the IPS does not necessarily rely on ML. Because the PS is a balancing score [17], we can adopt estimation methods based on balancing covariates. One of those is the covariate balancing propensity score (CBPS) by Imai and Ratkovic [18], which estimates a parametric PS model by solving covariate balancing moment conditions. Adopting the CBPS into the IPS estimation involves estimating the regression parameters of the parametric IPS model by solving the covariate balancing moment conditions. Simulation studies demonstrated that CBPS can yield less biased and more stable PS weighting estimators than ML when either model misspecification or limited overlap exists for the PS [19–21]. Based on these simulation results, adopting CBPS would improve the robustness of the proposed weighting estimators.

A recent study demonstrated that adopting machine learning techniques such as lasso [22] and super learner [23] for outcome regression enhances the robustness of the AIPW estimator for the ATE to model misspecification by reducing bias and standard error [24]. To further improve the performance of the AIPW profiling estimators, we can adopt these machine learning techniques to estimate the treatment models. However, the development of the associated inferential procedures is complicated and merits further investigation.

Appendix A: Proof of Theorem 1

The always-taker mean of X_ij is . Based on Bayes’ theorem, Therefore, can be written as (5) Under monotonicity, D_i(0) = 1 implies D_i(1) = 1. This leads to (6) and P[D_i(1) = D_i(0) = 1] = E[D_i(0)]. As a result, the denominator of Eq (5) can be identified by E[(1 − Z_i)D_i/(1 − e_i)]. Based on Eq (6), the numerator of Eq (5) can be written as which is identified by Eq (4).

The never-taker mean of X_ij is . Based on Bayes’ theorem, Therefore, can be written as (7) Under monotonicity, D_i(1) = 0 implies D_i(0) = 0. This leads to (8) and P[D_i(1) = D_i(0) = 0] = E[1 − D_i(1)]. Because E[Z_i/e_i] = 1, we can identify the denominator of (7) by

Based on Eq (8), the numerator of Eq (7) can be written as (9) It is worthwhile to note that Then, based on Eq (3), Eq (9) can be identified by

Appendix B: Proof of Theorem 2

We can express the IPW estimator for as , where Then, the estimator of can be viewed as a solution to the following estimating equations: where S_β(X_i, β) is the likelihood score equation for the IPS model. The last element of , obtained as a solution to , is . Therefore, is an M-estimator, which has an asymptotic normal distribution [25]. The variance of is estimated by N⁻¹A⁻¹BA^−T, with and . The estimated variance of is the last diagonal element of N⁻¹A⁻¹BA^−T. The asymptotic normal distributions of and can be demonstrated in a similar way.

Appendix C: Proof of Theorem 3

We demonstrate that the AIPW representation of is doubly robust. The doubly robustness of the AIPW representations of and can be shown in a similar way, and hence the proofs of those are omitted. The AIPW representation for the denominator of in Theorem 3 is known to be doubly robust for E[D_i(1) − D_i(0)] [10, 26, 27], which means that it is equal to E[D_i(1) − D_i(0)] if either the IPS or treatment model is correctly specified. Therefore, it is sufficient to show that the AIPW representation for the numerator of in Theorem 3 is doubly robust for E[X_ijD(1)] − E[X_ijD(0)].

We can express the first term in the numerator of in Theorem 3 as (10) If the IPS is correctly specified, then [(Z_i − e_i)X_ijm_1i/e_i] = 0, and hence the above equation becomes E[X_ijD(1)]. In addition, we can express Eq (10) as (11) If m_1i is correctly specified, then E[Z_i(D_i − m_1i)X_ij/e_i] = 0 and E[X_ijm_1i] = E[E[X_ijD_i(1)|X_i]] = E[X_ijD_i(1)]; hence Eq (11) becomes E[X_ijD_i(1)].

In a similar way, we can demonstrate that the second term in the numerator of in Theorem 3, (12) is doubly robust for E[X_ijD_i(0)]. Eq (12) can be expressed as (13) If the IPS is correctly specified, then E[(Z_i − e_i)X_ijm_0i/(1 − e_i)] = 0, and hence Eq (13) becomes E[X_ijD_i(0)]. In addition, we can express Eq (12) as (14) If m_0i is correctly specified, then E[(1 − Z_i)X_ij(D_i − m_0i)/{1 − e_i}] = 0 and E[X_ijm_0i] = E[E[X_ijD_i(0)|X_i]] = E[X_ijD_i(0)]; hence, Eq (14) becomes E[X_ijD_i(0)]. This completes the proof.

Appendix D: Proof of Theorem 4

We can express the AIPW estimator for as , where Then, the estimator of can be viewed as a solution to the following estimating equations: where S₁(D_i, X_i, α₁) and S₀(D_i, X_i, α₀) are the likelihood score equations for m_1i(α₁) and m_0i(α₀), respectively. The last element of , obtained as a solution to , is . Therefore, is an M-estimator, which has an asymptotic normal distribution [25]. The variance of is estimated by N⁻¹C⁻¹DC^−T, with and . The estimated variance of is the last diagonal element of N⁻¹C⁻¹DC^−T. The asymptotic normal distributions of and can be demonstrated in a similar way.

Supporting information

S1 Data.

https://doi.org/10.1371/journal.pone.0283223.s001

(CSV)

References

1. Angrist JD, Imbens GW, Rubin DB. Identification of causal effects using instrumental variables. Journal of the American Statistical Association. 1996;91:444–55.
- View Article
- Google Scholar
2. Marbach M, Hangartner D. Profiling Compliers and Noncompliers for Instrumental-Variable Analysis. Political Analysis. 2020;28(3):435–444.
- View Article
- Google Scholar
3. Abadie A. Semiparametric instrumental variable estimation of treatment response models. Journal of Econometrics. 2003;113:231–63.
- View Article
- Google Scholar
4. Card D. Using geographic variation in college proximity to estimate the return to schooling. In: Fagerberg J, Mowery DC, Nelson RR, editors. Aspects of Labour Market Behaviour: Essays in Honour of John Vanderkamp. Toronto: University of Toronto Press; 1995. p. 201–222.
5. Baiocchi M, Cheng J, Small DS. Instrumental variable methods for causal inference. Statistics in Medicine. 2014;33(13):2297–2340. pmid:24599889
- View Article
- PubMed/NCBI
- Google Scholar
6. Hangartner D, Marbach M, Henckel L, Maathuis MH, Kelz RR, Keele L. Profiling Compliers in Instrumental Variables Designs; 2021. Available from: https://arxiv.org/abs/2103.06328.
7. Imbens GW, Angrist JD. Identification and estimation of local average treatment effects. Econometrica. 1994;62(2):467–475.
- View Article
- Google Scholar
8. Frölich M; Ctr Microdata Methods & Practice. Nonparametric IV estimation of local average treatment effects with covariates. Journal of Econometrics. 2007;139(1):35–75.
- View Article
- Google Scholar
9. Tan Z. Regression and weighting methods for causal inference using instrumental variables. Journal of the American Statistical Association. 2006;101(476):1607–1618.
- View Article
- Google Scholar
10. Lunceford J, Davidian M. Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study. Statistics in Medicine. 2004;23(19):2937–2960. pmid:15351954
- View Article
- PubMed/NCBI
- Google Scholar
11. Robins J, Rotnitzky A, Zhao L. Estimation of regression-coefficients when some regressors are not always observed. Journal of the American Statistical Association. 1994;89(427):846–866.
- View Article
- Google Scholar
12. Marbach M. ivdesc: Profiling Compliers and Non-Compliers for Instrumental Variable Analysis; 2021. Available from: https://CRAN.R-project.org/package=ivdesc.
13. Okui R, Small DS, Tan Z, Robins JM. Doubly robust instrumental variable regression. Statistica Sinica. 2012;22(1):173–205.
- View Article
- Google Scholar
14. Wang L, Tchetgen Tchetgen E. Bounded, efficient and multiply robust estimation of average treatment effects using instrumental variables. Journal of the Royal Statistical Society: Series B. 2018;80(3):531–550. pmid:30034269
- View Article
- PubMed/NCBI
- Google Scholar
15. Kang H, Jiang Y, Zhao Q, Small D. ivmodel: Statistical Inference and Sensitivity Analysis for Instrumental Variables Model; 2021. Available from: https://CRAN.R-project.org/package=ivmodel.
16. An W, Wang X. LARF: Instrumental variable estimation of causal effects through local average response functions. Journal of Statistical Software. 2016;71(1):1–13.
- View Article
- Google Scholar
17. Rosenbaum P, Rubin D. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):41–55.
- View Article
- Google Scholar
18. Imai K, Ratkovic M. Covariate balancing propensity score. Journal of the Royal Statistical Society: Series B. 2014;76(1):243–263.
- View Article
- Google Scholar
19. Choi BY, Wang CP, Michalek J, Gelfond J. Power comparison for propensity score methods. Computational Statistics. 2019;34:743–761.
- View Article
- Google Scholar
20. Li F, Thomas LE, Li F. Addressing extreme propensity score via the overlap weights. American Journal of Epidemiology. 2019;188(1):250–257. pmid:30189042
- View Article
- PubMed/NCBI
- Google Scholar
21. Wyss R, Ellis AR, Brookhart MA, Girman CJ, Funk MJ, LoCasale R, et al. The role of prediction modeling in propensity score estimation: an evaluation of logistic regression, bCART, and the covariate-balancing propensity score. American Journal of Epidemiology. 2014;180(6):645–655. pmid:25143475
- View Article
- PubMed/NCBI
- Google Scholar
22. Tibshirani R. Regression Shrinkage and Selection via the Lasso. Journal of the Royal Statistical Society Series B (Methodological). 1996;58(1):267–288.
- View Article
- Google Scholar
23. van der Laan MJ, Polley EC, Hubbard AE. Super Learner. Statistical Applications in Genetics and Molecular Biology. 2007;6(1). pmid:17910531
- View Article
- PubMed/NCBI
- Google Scholar
24. Choi BY, Wang CP, Gelfond J. Machine learning outcome regression improves doubly robust estimation of average causal effects. Pharmacoepidemiology and Drug Safety. 2020; p. 1–14. pmid:32716126
- View Article
- PubMed/NCBI
- Google Scholar
25. Stefanski L, Boos D. The calculus of M-estimation. American Statistian. 2002;56(1):29–38.
- View Article
- Google Scholar
26. Bang H, Robins JM. Doubly robust estimation in missing data and causal inference models. Biometrics. 2005;61(4):962–973. pmid:16401269
- View Article
- PubMed/NCBI
- Google Scholar
27. Tao Y, Fu H. Doubly robust estimation of the weighted average treatment effect for a target population. Statistics in Medicine. 2019;38:315–325. pmid:30302780
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Angrist JD, Imbens GW, Rubin DB. Identification of causal effects using instrumental variables. Journal of the American Statistical Association. 1996;91:444–55.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Marbach M, Hangartner D. Profiling Compliers and Noncompliers for Instrumental-Variable Analysis. Political Analysis. 2020;28(3):435–444.
View Article
Google Scholar

[5] View Article

[6] Google Scholar

[ref3] 3. Abadie A. Semiparametric instrumental variable estimation of treatment response models. Journal of Econometrics. 2003;113:231–63.
View Article
Google Scholar

[8] View Article

[9] Google Scholar

[ref4] 4. Card D. Using geographic variation in college proximity to estimate the return to schooling. In: Fagerberg J, Mowery DC, Nelson RR, editors. Aspects of Labour Market Behaviour: Essays in Honour of John Vanderkamp. Toronto: University of Toronto Press; 1995. p. 201–222.

[ref5] 5. Baiocchi M, Cheng J, Small DS. Instrumental variable methods for causal inference. Statistics in Medicine. 2014;33(13):2297–2340. pmid:24599889
View Article
PubMed/NCBI
Google Scholar

[12] View Article

[13] PubMed/NCBI

[14] Google Scholar

[ref6] 6. Hangartner D, Marbach M, Henckel L, Maathuis MH, Kelz RR, Keele L. Profiling Compliers in Instrumental Variables Designs; 2021. Available from: https://arxiv.org/abs/2103.06328.

[ref7] 7. Imbens GW, Angrist JD. Identification and estimation of local average treatment effects. Econometrica. 1994;62(2):467–475.
View Article
Google Scholar

[17] View Article

[18] Google Scholar

[ref8] 8. Frölich M; Ctr Microdata Methods & Practice. Nonparametric IV estimation of local average treatment effects with covariates. Journal of Econometrics. 2007;139(1):35–75.
View Article
Google Scholar

[20] View Article

[21] Google Scholar

[ref9] 9. Tan Z. Regression and weighting methods for causal inference using instrumental variables. Journal of the American Statistical Association. 2006;101(476):1607–1618.
View Article
Google Scholar

[23] View Article

[24] Google Scholar

[ref10] 10. Lunceford J, Davidian M. Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study. Statistics in Medicine. 2004;23(19):2937–2960. pmid:15351954
View Article
PubMed/NCBI
Google Scholar

[26] View Article

[27] PubMed/NCBI

[28] Google Scholar

[ref11] 11. Robins J, Rotnitzky A, Zhao L. Estimation of regression-coefficients when some regressors are not always observed. Journal of the American Statistical Association. 1994;89(427):846–866.
View Article
Google Scholar

[30] View Article

[31] Google Scholar

[ref12] 12. Marbach M. ivdesc: Profiling Compliers and Non-Compliers for Instrumental Variable Analysis; 2021. Available from: https://CRAN.R-project.org/package=ivdesc.

[ref13] 13. Okui R, Small DS, Tan Z, Robins JM. Doubly robust instrumental variable regression. Statistica Sinica. 2012;22(1):173–205.
View Article
Google Scholar

[34] View Article

[35] Google Scholar

[ref14] 14. Wang L, Tchetgen Tchetgen E. Bounded, efficient and multiply robust estimation of average treatment effects using instrumental variables. Journal of the Royal Statistical Society: Series B. 2018;80(3):531–550. pmid:30034269
View Article
PubMed/NCBI
Google Scholar

[37] View Article

[38] PubMed/NCBI

[39] Google Scholar

[ref15] 15. Kang H, Jiang Y, Zhao Q, Small D. ivmodel: Statistical Inference and Sensitivity Analysis for Instrumental Variables Model; 2021. Available from: https://CRAN.R-project.org/package=ivmodel.

[ref16] 16. An W, Wang X. LARF: Instrumental variable estimation of causal effects through local average response functions. Journal of Statistical Software. 2016;71(1):1–13.
View Article
Google Scholar

[42] View Article

[43] Google Scholar

[ref17] 17. Rosenbaum P, Rubin D. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):41–55.
View Article
Google Scholar

[45] View Article

[46] Google Scholar

[ref18] 18. Imai K, Ratkovic M. Covariate balancing propensity score. Journal of the Royal Statistical Society: Series B. 2014;76(1):243–263.
View Article
Google Scholar

[48] View Article

[49] Google Scholar

[ref19] 19. Choi BY, Wang CP, Michalek J, Gelfond J. Power comparison for propensity score methods. Computational Statistics. 2019;34:743–761.
View Article
Google Scholar

[51] View Article

[52] Google Scholar

[ref20] 20. Li F, Thomas LE, Li F. Addressing extreme propensity score via the overlap weights. American Journal of Epidemiology. 2019;188(1):250–257. pmid:30189042
View Article
PubMed/NCBI
Google Scholar

[54] View Article

[55] PubMed/NCBI

[56] Google Scholar

[ref21] 21. Wyss R, Ellis AR, Brookhart MA, Girman CJ, Funk MJ, LoCasale R, et al. The role of prediction modeling in propensity score estimation: an evaluation of logistic regression, bCART, and the covariate-balancing propensity score. American Journal of Epidemiology. 2014;180(6):645–655. pmid:25143475
View Article
PubMed/NCBI
Google Scholar

[58] View Article

[59] PubMed/NCBI

[60] Google Scholar

[ref22] 22. Tibshirani R. Regression Shrinkage and Selection via the Lasso. Journal of the Royal Statistical Society Series B (Methodological). 1996;58(1):267–288.
View Article
Google Scholar

[62] View Article

[63] Google Scholar

[ref23] 23. van der Laan MJ, Polley EC, Hubbard AE. Super Learner. Statistical Applications in Genetics and Molecular Biology. 2007;6(1). pmid:17910531
View Article
PubMed/NCBI
Google Scholar

[65] View Article

[66] PubMed/NCBI

[67] Google Scholar

[ref24] 24. Choi BY, Wang CP, Gelfond J. Machine learning outcome regression improves doubly robust estimation of average causal effects. Pharmacoepidemiology and Drug Safety. 2020; p. 1–14. pmid:32716126
View Article
PubMed/NCBI
Google Scholar

[69] View Article

[70] PubMed/NCBI

[71] Google Scholar

[ref25] 25. Stefanski L, Boos D. The calculus of M-estimation. American Statistian. 2002;56(1):29–38.
View Article
Google Scholar

[73] View Article

[74] Google Scholar

[ref26] 26. Bang H, Robins JM. Doubly robust estimation in missing data and causal inference models. Biometrics. 2005;61(4):962–973. pmid:16401269
View Article
PubMed/NCBI
Google Scholar

[76] View Article

[77] PubMed/NCBI

[78] Google Scholar

[ref27] 27. Tao Y, Fu H. Doubly robust estimation of the weighted average treatment effect for a target population. Statistics in Medicine. 2019;38:315–325. pmid:30302780
View Article
PubMed/NCBI
Google Scholar

[80] View Article

[81] PubMed/NCBI

[82] Google Scholar

Figures

Abstract

Introduction

Profiling approach for unconfounded instruments

Weighting approach for confounded instruments

Inverse probability weighting (IPW)

Augmented IPW

Simulation

Application

Conclusion

Appendix A: Proof of Theorem 1

Appendix B: Proof of Theorem 2

Appendix C: Proof of Theorem 3

Appendix D: Proof of Theorem 4

Supporting information

S1 Data.

References