Implications of acute change in estimated Glomerular Filtration Rate (eGFR) for the effect of sodium-glucose cotransporter-2 inhibitors (SGLT-2i) on long-term endpoints

Ransmond O. Berchie; Lesley A. Inker; Hiddo J. L. Heerspink; Ben Haaland; Tom Greene

doi:10.1371/journal.pone.0347741

Abstract

In randomized trials, the primary analysis often estimates the average treatment effect on a clinical endpoint. Some treatments also lead to early changes in a biomarker that is prognostic for the clinical endpoint, prompting investigators to explore how these acute biomarker changes might inform the treatment’s effect on long-term clinical outcomes. A naive analysis that directly examines treatment-by-biomarker-change interactions may lead to biased estimates because it fails to account for the fact that biomarker changes are influenced by the treatment and post-randomization factors. A key statistical challenge is that we do not know whether the observed biomarker change in an individual patient truly reflects a treatment-induced effect or whether the change would have occurred under placebo as well. This uncertainty makes it difficult to disentangle the causal effect of the treatment from natural biomarker variability. We apply principal stratification with a normal copula governed by the correlation ρ between the potential acute biomarker changes under treatment and placebo. A flexible model for the conditional distribution of the clinical endpoint given the biomarker change enables estimation of the conditional average treatment effect on the clinical endpoint, given the acute biomarker change under treatment, as a function of ρ. We illustrate the method by determining how knowledge of acute change in estimated glomerular filtration rate modifies the expected effect of sodium-glucose cotransporter-2 inhibitors (SGLT-2i) on clinical endpoints in patients with chronic kidney disease.

Citation: Berchie RO, Inker LA, Heerspink HJL, Haaland B, Greene T (2026) Implications of acute change in estimated Glomerular Filtration Rate (eGFR) for the effect of sodium-glucose cotransporter-2 inhibitors (SGLT-2i) on long-term endpoints. PLoS One 21(4): e0347741. https://doi.org/10.1371/journal.pone.0347741

Editor: Nestor Oliva Damaso, Hospital Costa del Sol, SPAIN

Received: March 4, 2025; Accepted: April 3, 2026; Published: April 29, 2026

Copyright: © 2026 Berchie et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All data used in the analysis were obtained by the CKD-EPI CT group through third parties (CREDENCE - Janssen Research and Development; DAPA-CKD – AstraZeneca; EMPA-REG OUTCOME - Boehringer Ingelheim (BI) and Eli Lilly). As per data use agreements, CKD-EPI CT does not have the right to share or legally distribute third-party data. Interested researchers may apply for access through the following portals: the Vivli platform (https://search.vivli.org/) for CREDENCE (NCT02065791) and EMPA-REG OUTCOME (NCT01131676) datasets, and the AstraZeneca Clinical Study Data Request portal (https://www.astrazenecaclinicaltrials.com/) for DAPA-CKD (NCT03036150) dataset. The authors confirm they had no special access privileges. Findings can be replicated by obtaining these datasets and following the methods section of this paper.

Funding: Support for CKD-EPI CT was obtained from multi-sponsor funding to the National Kidney Foundation (NKF). The NKF received consortium support from the following companies: Alexion, AstraZeneca, Bayer, Boehringer Ingelheim, CSL Behring, Novartis, Novo Nordisk A/S, ProKidney, Roche/Genentech, and Travere. This work also received support from the Utah Study Design and Biostatistics Center, with funding in part from the National Center for Advancing Translational Sciences of the National Institutes of Health under Award Number UL1TR002538. The support and resources from the Center for High Performance Computing at the University of Utah are also gratefully acknowledged. The funder provided support in the form of salaries for authors [TG, LI, HJLH], but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.

Competing interests: LA Inker reports funding from National Institutes of Health (NIH), National Kidney Foundation (NKF), Omeros, Dialysis Clinics, Inc., and Reata Pharmaceuticals for research and contracts to Tufts Medical Center; consulting agreements to Tufts Medical Center with Tricida and HealthLogistics Interactive; and consulting agreements to Diamerix. HJL Heerspink reports funding from the NKF to his institute, and is a consultant for AbbVie, AstraZeneca, Bayer, Boehringer Ingelheim, Chinook, CSL Behring, Dimerix, Eli Lilly, Gilead, GoldFinch, Janssen, Merck, Novo Nordisk A/S, and Travere Pharmaceuticals. T Greene reports grant support from the NKF, the Collaborative Study Group, the National Institutes of Health, the Patient-Centered Outcomes Research Institute (PCORI), Janssen Pharmaceuticals, Durect Corporation, and Pfizer; and statistical consulting for AstraZeneca, CSL, and Boehringer Ingelheim. B Haaland reports employment for Pentara Corporation, and consults for the NKF, Proxima Clinical Research, Value Analytics Health, and Guidepoint Global. RO Berchie reports employment for Genentech. This does not alter our adherence to PLOS ONE policies on sharing data and materials.

1 Introduction

Randomized clinical trials (RCTs) often evaluate causal treatment effects by employing an intent-to-treat approach, which asks, “What is the average effect of treatment assignment across the study population?” In certain cases, treatments may cause potentially adverse changes in a prognostic biomarker for a clinically relevant endpoint shortly after treatment initiation. In these cases, clinicians may question whether they should change the treatment.

The motivation of this paper stems from recent randomized clinical trials which have demonstrated that treatment with sodium-glucose cotransporter-2 inhibitors (SGLT-2i) slows disease progression and delays the onset of kidney failure in patients with chronic kidney disease (CKD) [1–5]. These studies also reported that some participants experience an acute reduction in estimated glomerular filtration rate (eGFR), a measure of renal function, within approximately 1 month after initiating treatment with SGLT-2i [5–7]. Although this drug class has been shown to slow the average progression of kidney disease at the population level, there is concern that this benefit may be reduced or even reversed in patients with unusually large acute eGFR declines.

From a biological perspective, initial eGFR declines after initiating SGLT-2i and certain other classes of medications may result from mechanisms that slow CKD progression in the long run. This decline is thought to be driven primarily by reductions in intraglomerular pressure, which occur due to hemodynamic changes such as afferent arteriole constriction and efferent arteriole dilation. These adaptations help preserve long-term kidney function by reducing hyperfiltration and mitigating glomerular damage, ultimately slowing CKD progression [8]. In addition, recent epidemiologic analyses suggest that in patients with type 2 diabetes, initial reductions in eGFR following the initiation of SGLT-2i are not associated with an increased risk of adverse events [9,10]. However, such analyses may be subject to bias. For instance, if patients with large eGFR declines are more likely to have their treatment discontinued, the observed relationship may underestimate any potential risks associated with acute eGFR declines. Hence, the implications of such epidemiologic associations for the causal question of whether larger than average acute eGFR declines are indicative of enhanced or reduced long-term benefit are unclear. Our paper seeks insights into the implications of acute eGFR changes under SGLT-2i treatment for the expected causal effect of SGLT-2i treatment on long-term clinical outcomes.

Empirical evaluation of the implications of acute eGFR changes following treatment initiation is challenging. Progress in understanding how treatment effects are modified by post-treatment factors has been made through the framework of principal stratification. In this framework, causal effects are evaluated after conditioning on subsets of patients defined by counterfactual (potential) values of an intermediate variable that would have occurred under designated levels of the treatment [11]. Because principal strata are determined by counterfactuals, they can be viewed as defined at baseline, prior to the treatment, allowing average causal effects to be defined on those subgroups. In this case, our goal is to estimate the average causal effect of treatment with SGLT-2 inhibition within the principal strata defined by designated early changes in eGFR that would occur with the treatment. These principal strata are empirically determined only for patients assigned to the SGLT-2i intervention; they are not observed for patients assigned to the placebo. Thus, the challenge of using potential outcomes in principal stratification arises from their partial observability in a parallel-arm RCT. To estimate treatment effects within these principal strata, we must rely on additional identifying assumptions based on subject matter considerations that cannot be verified from the data. Often, a monotonicity assumption is applied in similar problems, usually in conjunction with a sensitivity analysis to evaluate how the average causal effect of interest depends on non-identified parameters. In our context, a common form of the monotonicity assumption would posit that for all patients, the decline in eGFR that would be observed with SGLT-2i treatment is at least as large as the decline in eGFR that would be observed under placebo. For complex diseases such as CKD, the assumption that the treatment modifies the eGFR change in the same direction for all patients is hard to defend. Treatment with SGLT-2i is likely to decrease eGFR by some mechanisms but increase eGFR by others; while a greater mean eGFR decline is observed in the short term after initiating SGLT-2i, it is difficult to rule out the possibility that the treatment may lead to short-term eGFR increases, at least by small amounts, in a subset of patients. A recent cross-over study [12] illustrates the limitations of the conventional monotonicity assumption by showing its inapplicability when potential outcomes under both interventions can be observed in a controlled setting.

Most approaches proposed to evaluate the causal effect in the principal stratification framework have focused on binary or discrete post-treatment variables [13–20]. However, in real-world applications, biomarkers with acute responses to treatments, including eGFR in response to SGLT-2i, are often continuous. When dealing with continuous post-treatment variables, the number of potential principal strata becomes infinite, complicating both inference and interpretation. Several approaches have been proposed in the causal inference literature to address these challenges. One approach is to dichotomize the post-treatment variable [21,22], though this method discards information provided by the continuous scale of the biomarker and introduces issues with inference due to the arbitrary selection of thresholds. Another strategy assumes strong parametric models for both the outcome and the post-treatment variable and jointly models them with informative priors for the parameters [23,24]. A different line of development relaxes these parametric assumptions by using a Dirichlet process mixture model within a Bayesian framework [25]. Two recent developments extend nonparametric modeling in the Bayesian framework; one does so by incorporating Gaussian processes in the case of continuous treatment [26] and the other allows for information sharing across treatment groups in determining principal strata membership [27]. While these methods offer potential solutions for handling continuous post-treatment variables, they often require strong assumptions or produce estimands that are difficult to interpret.

An alternative line of research relaxes the restrictive monotonicity assumption by employing copula functions [28] to model the bivariate joint distribution of potential values for continuous post-treatment variables across the treatment and placebo interventions. This copula-based approach has been proposed for modeling principal strata across various contexts (e.g., [23,24,29–31]).

These copula-based approaches show promise but may produce unstable estimates when identifiability is not assured [24]. In this paper, we replace the assumption of strict monotonicity by positing a bivariate normal copula for the joint distribution of the counterfactual acute eGFR changes that would be observed with and without the treatment, while treating the correlation between these changes as a sensitivity parameter. We use the Johnson family of distributions [32] to flexibly model the marginal distributions of the acute eGFR changes, and adjust for baseline covariates to address possible confounding. We apply a competing risks framework to characterize, to the extent possible from the data, how the expected effect of SGLT-2i on the time to kidney failure (with death as a competing event) is modified by knowledge of the acute eGFR change following SGLT-2i initiation, while also articulating the uncertainty resulting from the dependence of this conditional expectation on causal assumptions that extend beyond the observable data. The paper is organized as follows: Sect 2 provides a heuristic overview of key concepts. Sect 3 introduces our method, detailing the estimand of interest, the identifying causal assumptions, the analytical approach, and the estimation algorithm. Sect 4 provides additional details of our approach for the specific application to clinical trials of SGLT-2i in patients with chronic kidney disease. We provide the results in Sect 5, and Sect 6 is devoted to discussing these findings and conclusions.

2 Heuristic overview

Before beginning our formal development, we first give a heuristic overview of the framework. Panels 1A and 1B of Fig 1 illustrate two possible models for the association between the percent change in eGFR on placebo (W(0)) and the percent change in eGFR on SGLT-2i (W(1)). In both panels, we set the mean percent changes under placebo and SGLT-2i to –0.6% and –6.4%, respectively, and the standard deviation in both arms to 12.7%, to correspond roughly to the data presented later in the manuscript. However, the correlation ρ between W(0) and W(1) cannot be determined empirically because the two quantities cannot be observed simultaneously. As a result, we cannot know whether the potential outcomes are tightly correlated, as in Panel 1A, or weakly correlated, as in Panel 1B.

Fig 1. Heuristic Overview of the framework.

The figure illustrates how the acute effect W(1) − W(0) relates to the potential outcomes W(0) and W(1) for lower (ρ = 0.5) and higher (ρ = 0.95) values of the correlation ρ, for a simplified scenario in which the potential outcomes follow a bivariate normal distribution. It also describes the implications of this relationship for the longer term treatment effect on a clinical endpoint. Panel 1 compares the joint distribution of W(0) and W(1) between the two values of ρ. Panel 2 shows how ρ influences the conditional average treatment effect on the clinical endpoint given an observed acute eGFR decline of 35%, and Panel 3 demonstrates the relationship between ρ and the variability of the acute effect.

https://doi.org/10.1371/journal.pone.0347741.g001

Next, consider a scenario in which a 35% eGFR reduction is observed one month after initiating SGLT-2i therapy, and in which SGLT-2i reduces the 2-year risk of kidney failure by one-third at every value of the observed acute eGFR change. Larger declines are assumed to be associated with higher risk of kidney failure, as in Panels 2A and 2B. Under a strong correlation (ρ = 0.95), the expected value of the placebo change W(0) corresponding to a 35% decline on SGLT-2i is only slightly attenuated to –27.8%, and the distribution of possible W(0) values is relatively narrow (Panel 2A). In contrast, under a weaker correlation (ρ = 0.5), the expected W(0) shifts more substantially to –14.9%, and the distribution becomes much wider (Panel 2B). Below we show that, under a no-confounding assumption, and for any specified ρ, the average treatment effect conditional on an observed 35% eGFR decline after starting SGLT-2i can be estimated by comparing (i) the 2-year probability of kidney failure with SGLT-2i to (ii) the average 2-year probability of kidney failure with placebo across the conditional distribution of W(0) given the observed SGLT-2i change W(1). This latter quantity corresponds approximately to the vertical coordinate of the blue curve in Panels 2A or 2B at –27.8% (Panel 2A) or –14.9% (Panel 2B). In the illustrated scenario, an observed 35% decline is associated with a beneficial expected treatment effect when the potential outcomes are strongly correlated (Panel 2A), but with a harmful expected effect when they are weakly correlated (Panel 2B). Thus, the data alone cannot determine whether such a decline indicates benefit or harm; the answer depends on the correlation ρ between W(0) and W(1), which is not identifiable from the data.
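The conditional means quoted in this scenario follow from the standard bivariate normal formula. The sketch below reproduces them from the means (–0.6%, –6.4%) and standard deviation (12.7%) given in the text; the correlation values 0.95 and 0.5 are assumptions, chosen because they reproduce the quoted –27.8% and –14.9%, and the function name is illustrative only.

```python
import math

def conditional_w0_given_w1(w1, rho, mean0=-0.6, mean1=-6.4, sd0=12.7, sd1=12.7):
    """Mean and SD of W(0) given W(1) = w1 under a bivariate normal model.

    Defaults are the heuristic values from the text; rho is the
    non-identifiable correlation between W(0) and W(1).
    """
    cond_mean = mean0 + rho * (sd0 / sd1) * (w1 - mean1)
    cond_sd = sd0 * math.sqrt(1.0 - rho ** 2)
    return cond_mean, cond_sd

# Assumed correlations for the "strong" and "weak" scenarios.
for rho in (0.95, 0.5):
    m, s = conditional_w0_given_w1(-35.0, rho)
    print(f"rho={rho}: E[W(0) | W(1)=-35] = {m:.1f}%, SD = {s:.1f}%")
```

The conditional standard deviation grows as the correlation falls, which is why the distribution of plausible placebo changes is much wider in the weak-correlation panel.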

However, subject-matter knowledge may help constrain plausible values of the correlation ρ. Lower correlations imply larger variability in the acute SGLT-2i effect given by the difference W(1) − W(0), and therefore imply that substantial fractions of patients have large positive acute effects (Panel 3). If physiological considerations preclude substantial positive acute effects in a large share of patients, this would imply a lower bound on ρ, potentially ruling out the harmful-effect scenario.
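As a worked illustration of this point, assume the simplified bivariate normal scenario of this section, with a common standard deviation σ = 12.7% and arm means of –0.6% and –6.4%. The correlation values 0.95 and 0.5 below are assumptions chosen for illustration, and the probabilities are rounded approximations.

$$\operatorname{Var}\{W(1) - W(0)\} = \sigma^2 + \sigma^2 - 2\rho\sigma^2 = 2\sigma^2(1-\rho), \qquad W(1) - W(0) \sim N\big(-5.8,\; 2\sigma^2(1-\rho)\big)$$

$$\rho = 0.95:\quad \mathrm{SD} = 12.7\sqrt{0.10} \approx 4.0, \qquad P\{W(1) - W(0) > 0\} = 1 - \Phi(5.8/4.0) \approx 0.07$$

$$\rho = 0.5:\quad \mathrm{SD} = 12.7\sqrt{1.0} = 12.7, \qquad P\{W(1) - W(0) > 0\} = 1 - \Phi(5.8/12.7) \approx 0.32$$

Under these assumptions, lowering the correlation from 0.95 to 0.5 raises the implied share of patients whose eGFR would rise acutely because of treatment from roughly 7% to roughly 32%.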

We next describe our formal framework, which extends this heuristic overview by allowing non-normal distributions of the acute eGFR changes and incorporating covariate adjustment to account for possible confounding.

3 Methods

3.1 Notations and estimand of interest

Consider a randomized trial where participants are randomized to SGLT-2i treatment or placebo. For the purposes of this manuscript, our primary objective is to understand the implications of acute eGFR change for the effect of the treatment on the occurrence of kidney failure and all-cause mortality, where kidney failure and all-cause mortality are competing events. For the subsequent development, unless stated otherwise, we refer to kidney failure (with all-cause mortality prior to kidney failure considered as a competing risk) as the primary clinical endpoint. This reflects the central clinical concern that acute effects on eGFR may hasten progression of kidney disease. Acute eGFR changes may also affect deaths from cardiovascular disease and other causes, but the mechanisms are less clear. Hence, all-cause mortality and the composite of kidney failure and all-cause mortality will be the secondary outcomes.

To formalize this framework statistically, let T and C respectively denote the failure and censoring times for the composite of kidney failure or death, and let E indicate whether the observed event is kidney failure (E = 1) or all-cause mortality (E = 2). We define the acute eGFR change as the percentage change in eGFR from baseline to month 1, using the 2009 CKD-EPI equation [33]. For each subject i, the observed data consist of min(T_i, C_i), the time to either an event or censoring; I(T_i ≤ C_i), a binary indicator of whether the event was observed; A_i, the randomized treatment assignment (1 for SGLT-2i and 0 for placebo); and X_i, a vector of baseline covariates. In a competing risks framework, we additionally observe E_i for those who experience either event, such that the full data structure is (min(T_i, C_i), I(T_i ≤ C_i), E_i, X_i, A_i). When the interest lies in modeling the cumulative incidence function for kidney failure conditional on covariates, the inference centers on P(T ≤ t, E = 1 | X).

To address causal effects using potential outcomes, let T_i(1) and T_i(0) denote the time to the clinical endpoint for the i^th subject under SGLT-2i and placebo, respectively. Similarly, let W_i(1) and W_i(0) denote the acute eGFR change for the i^th subject under SGLT-2i and placebo. Note that we observe W_i(1) and T_i(1) only for patients randomized to the treatment, and we observe W_i(0) and T_i(0) only for patients randomized to the placebo. It is important to distinguish between W_i(1), which represents the acute change in eGFR after initiating SGLT-2i, and W_i(1) − W_i(0), which represents the acute effect of SGLT-2 inhibition. We are able to observe W_i(1) in patients assigned to SGLT-2i and W_i(0) in placebo patients, but we cannot observe the acute effect W_i(1) − W_i(0) in the same patient. Also let E_i(1) and E_i(0) be potential event type indicators for the i^th subject under SGLT-2i and placebo, respectively.

We express the treatment effect estimand for the composite outcome of kidney failure and all-cause mortality as a difference in cumulative risks at time k:

$$P[T_i(1) \le k \mid W_i(1) = w_1] - P[T_i(0) \le k \mid W_i(1) = w_1] \tag{1}$$

For the competing outcomes (kidney failure and all-cause mortality), the estimand is:

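$$P[T_i(1) \le k,\, E_i(1) = e \mid W_i(1) = w_1] - P[T_i(0) \le k,\, E_i(0) = e \mid W_i(1) = w_1], \quad e \in \{1, 2\} \tag{2}$$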

Because a patient must survive without kidney failure for 1 month for the acute eGFR change to be observed, more precise expressions for the estimands in (1) and (2) must also condition on survival without kidney failure to 1 month. Thus, a more formal expression for the second estimand is

As only 0.02% of patients experienced events within the first month, for simplicity we subsequently omit the formal conditioning on patients surviving without kidney failure from our notation.

3.2 Analytic approach and causal assumptions

Focusing on the estimand in expression (2) above, the first term, P[T_i(1) ≤ k, E_i(1) = e | W_i(1) = w₁], can be estimated directly from the data of patients randomized to SGLT-2i, but the comparator, P[T_i(0) ≤ k, E_i(0) = e | W_i(1) = w₁], cannot be identified solely from the observed data since we never observe W_i(1) and T_i(0) in the same patient. Thus, to make progress, we make causal assumptions from subject matter considerations that go beyond our actual data. To do this, we employ a bivariate normal copula to characterize the joint distribution of the counterfactual values, W_i(0) and W_i(1).

3.2.1 Transformation with Normal copula.

Let F₍₀₎ and F₍₁₎ denote the cumulative distribution functions for W_i(0) and W_i(1), respectively, and let F₍₀₎⁻¹ and F₍₁₎⁻¹ denote their inverses. We use the Johnson family of distributions to model F₍₀₎ and F₍₁₎; this family includes the normal distribution as a special case but also allows for non-zero skewness and excess kurtosis. Let Φ denote the cumulative distribution function for the standard normal distribution with mean 0 and standard deviation 1. Define Y_i(0) = Φ⁻¹(F₍₀₎(W_i(0))) and Y_i(1) = Φ⁻¹(F₍₁₎(W_i(1))). Then Y_i(0) and Y_i(1) each have standard normal distributions. Further, since Φ has a known form, we can directly estimate both Y_i(0) and Y_i(1) from the data once F₍₀₎ and F₍₁₎ have been fitted.
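A minimal sketch of this normal-score transformation for one member of the Johnson family, the Johnson SU distribution, whose cumulative distribution function is Φ(γ + δ·asinh((w − ξ)/λ)). The parameter values below are hypothetical placeholders rather than fitted values from the trials, and the function names are illustrative only.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()  # provides Phi (cdf) and its inverse (inv_cdf)

def johnson_su_cdf(w, gamma, delta, xi, lam):
    """Johnson SU cdf: F(w) = Phi(gamma + delta * asinh((w - xi) / lam))."""
    return STD_NORMAL.cdf(gamma + delta * math.asinh((w - xi) / lam))

def normal_score(w, gamma, delta, xi, lam):
    """Y = Phi^{-1}(F(w)); for Johnson SU this reduces to a closed form."""
    return gamma + delta * math.asinh((w - xi) / lam)

def inverse_normal_score(y, gamma, delta, xi, lam):
    """W = F^{-1}(Phi(y)): map a standard normal value back to the % change scale."""
    return xi + lam * math.sinh((y - gamma) / delta)

# Hypothetical Johnson SU parameters for one treatment arm (not from the paper).
params = dict(gamma=0.3, delta=1.8, xi=-3.0, lam=20.0)
y = normal_score(-35.0, **params)
print(y, inverse_normal_score(y, **params))
```

Because the Johnson SU transform is itself defined through Φ, the normal score can be computed without numerically inverting Φ, which keeps the round trip between the two scales exact up to floating-point error.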

Identification of the second term in expression (2) requires additional assumptions.

3.2.2 Identifiability.

To obtain causal identification of the second term in expression (2), we make the following assumptions.

Assumption 1: Y_i(0) and Y_i(1) follow a bivariate normal distribution.

Assumption 2: The correlation between Y_i(0) and Y_i(1) is equal to a value ρ, which will serve as a sensitivity parameter.

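Assumption 3: (T_i(0), E_i(0)) ⊥ W_i(1) | W_i(0); that is, given W_i(0), the potential outcome under placebo is independent of W_i(1), and hence of the acute effect W_i(1) − W_i(0).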

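Assumption 3 essentially states that knowing the acute effect for a patient provides no additional information on the patient’s time to kidney failure or death in the absence of SGLT-2i after accounting for the patient’s acute eGFR change W_i(0) in the absence of SGLT-2i. In Sect 3.3, we show that the plausibility of this assumption can be strengthened by incorporating adjustment for potential confounders that could jointly influence the acute effect and CKD progression under the placebo. From Assumptions 1 and 2, it follows that the conditional distribution of Y_i(0) given Y_i(1) = y is N(ρy, 1 − ρ²), where ρ is the correlation in Assumption 2. Hence

$$W_i(0) \mid \{W_i(1) = w_1\} \;\overset{d}{=}\; F_{(0)}^{-1}\!\left(\Phi\!\left(\rho\, y_1 + \sqrt{1-\rho^2}\, Z\right)\right), \qquad y_1 = \Phi^{-1}\!\left(F_{(1)}(w_1)\right), \quad Z \sim N(0, 1), \tag{3}$$

where Φ denotes the standard normal cumulative distribution function.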

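In what follows, we concentrate on the causal identification of the second term in (2). Specifically, using the assumptions above, we can rewrite this term as

$$\begin{aligned} P[T_i(0) \le k,\, E_i(0) = e \mid W_i(1) = w_1] &= \int P[T_i(0) \le k,\, E_i(0) = e \mid W_i(0) = w_0,\, W_i(1) = w_1]\, f(w_0 \mid w_1)\, dw_0 \\ &= \int P[T_i(0) \le k,\, E_i(0) = e \mid W_i(0) = w_0]\, f(w_0 \mid w_1)\, dw_0, \end{aligned}$$

where the first equation follows by the law of total probability and the last equation follows by invoking Assumption 3. Here f(w₀ | w₁) is the conditional density of W_i(0) given W_i(1) = w₁.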

To implement the integration in the final expression above, note that

$$\int P[T_i(0) \le k,\, E_i(0) = e \mid W_i(0) = w_0]\, f(w_0 \mid w_1)\, dw_0 \tag{4}$$

is the average of the conditional probabilities P[T_i(0) ≤ k, E_i(0) = e | W_i(0) = w₀] over the conditional distribution of W_i(0) given that W_i(1) = w₁. Thus, we can approximate the integral in (4) as:

$$\frac{1}{J} \sum_{j=1}^{J} P[T_i(0) \le k,\, E_i(0) = e \mid W_i(0) = w_{0,i,j}],$$

where the w_{0,i,j}, j = 1, 2, …, J, are simulated randomly from the conditional distribution of W_i(0) given W_i(1) = w₁. We can make the approximation as accurate as desired by increasing the number of draws from the conditional distribution.
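The Monte Carlo approximation above can be sketched as follows. For brevity the sketch assumes normal marginals for the acute changes (the heuristic values from Sect 2) instead of Johnson distributions, and `placebo_risk` is a hypothetical logistic risk curve standing in for a fitted Fine-Gray or Cox model; none of these names or coefficients come from the paper.

```python
import math
import random

def draw_w0_given_w1(w1, rho, rng, mean0=-0.6, mean1=-6.4, sd0=12.7, sd1=12.7):
    """One draw of W(0) given W(1) = w1 under a normal copula with normal marginals."""
    y1 = (w1 - mean1) / sd1                               # normal score of w1
    y0 = rng.gauss(rho * y1, math.sqrt(1.0 - rho ** 2))   # Y(0) | Y(1) = y1
    return mean0 + sd0 * y0                               # back to the % change scale

def placebo_risk(w0):
    """Hypothetical 2-year risk under placebo; larger declines imply higher risk."""
    return 1.0 / (1.0 + math.exp(-(-2.0 - 0.05 * w0)))

def mc_placebo_risk_given_w1(w1, rho, n_draws=20000, seed=1):
    """Average the placebo risk over J = n_draws simulated values of W(0)."""
    rng = random.Random(seed)
    total = sum(placebo_risk(draw_w0_given_w1(w1, rho, rng)) for _ in range(n_draws))
    return total / n_draws

for rho in (1.0, 0.8, 0.5):
    print(rho, round(mc_placebo_risk_given_w1(-35.0, rho), 3))
```

With a correlation of 1 every draw equals the same value of W(0), so the average collapses to a single evaluation of the risk curve; smaller correlations pull the draws toward the placebo mean and widen their spread.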

3.3 Covariate adjustment

As described above, Assumption 3 stipulates that a patient’s acute effect provides no additional information on the patient’s time to kidney failure or death without SGLT-2i after accounting for the patient’s acute eGFR change in the absence of SGLT-2i. This is a type of no-confounding assumption between the acute effect of the treatment and the long-term rate of CKD progression. Assumption 3 may be more plausible with adjustment for baseline covariates that subject matter considerations suggest may be related to both the acute effect and rate of CKD progression. We therefore modify Assumptions 1–3 to incorporate covariate adjustment.

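Let X_i denote a collection of baseline covariates. Let W_i(1) = E[W_i(1) | X_i] + r_i(1) and W_i(0) = E[W_i(0) | X_i] + r_i(0), where the conditional means E[W_i(1) | X_i] and E[W_i(0) | X_i] are estimated by regressing the acute eGFR change on X_i separately within the SGLT-2i and placebo groups, and r_i(1) and r_i(0) are the residuals in the two regressions. Consider the transformed residuals Φ⁻¹(F_r(0)(r_i(0))) and Φ⁻¹(F_r(1)(r_i(1))), where Φ is the standard normal cdf and F_r(0) and F_r(1) are the cdf’s of r(0) and r(1), respectively. We can then replace Assumptions 1–3 with similar assumptions expressed in terms of these transformed residuals.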

Assumption 1B: The transformed residuals Φ⁻¹(F_r(0)(r_i(0))) and Φ⁻¹(F_r(1)(r_i(1))) follow a bivariate normal distribution.

Assumption 2B: The correlation between these transformed residuals is equal to a value ρ that we treat as a sensitivity parameter.

Note that the difference between Assumptions 1B and 2B vs. 1 and 2 is that the transformed variables are now the transformed residuals from regressing each of W_i(0) and W_i(1) on the covariates within the corresponding treatment groups.

Assumption 3B: (T_i(0), E_i(0)) ⊥ W_i(1) | W_i(0), X_i.

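By the law of total probability, the estimands (1) and (2) can be expressed as:

$$\int \big\{ P[T_i(1) \le k \mid W_i(1) = w_1,\, X_i = x] - P[T_i(0) \le k \mid W_i(1) = w_1,\, X_i = x] \big\}\, f(x \mid w_1)\, dx \tag{5}$$

$$\int \big\{ P[T_i(1) \le k,\, E_i(1) = e \mid W_i(1) = w_1,\, X_i = x] - P[T_i(0) \le k,\, E_i(0) = e \mid W_i(1) = w_1,\, X_i = x] \big\}\, f(x \mid w_1)\, dx \tag{6}$$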

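The second term of the estimand in (6) can be identified by another application of the law of total probability and invoking Assumption 3B as follows:

$$\int P[T_i(0) \le k,\, E_i(0) = e \mid W_i(1) = w_1,\, X_i = x]\, f(x \mid w_1)\, dx = \int\!\!\int P[T_i(0) \le k,\, E_i(0) = e \mid W_i(0) = w_0,\, X_i = x]\, f(w_0 \mid w_1, x)\, f(x \mid w_1)\, dw_0\, dx,$$

where f(w₀ | w₁, x_i) is the conditional probability density function that defines the conditional distribution of W_i(0) given W_i(1) = w₁ and X_i = x_i. This conditional distribution is equivalent to the distribution of the sum of E[W_i(0) | X_i = x_i] and a random variable whose distribution follows the conditional distribution of r_i(0) given r_i(1) and x_i.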

The quantity f(x | w₁) represents the conditional probability density function that defines the conditional distribution of X_i given W_i(1) = w₁. To approximate this conditional distribution, we employ the Sampling-Importance-Resampling (SIR) algorithm [34]. This approach provides a more robust and flexible framework for approximating f(x | w₁) than attempting to fit a multivariate regression of X on w₁, which would have to account for disparate types of variables (e.g., continuous and categorical) as well as the relationships among these variables. Using Bayes’ theorem, we express the conditional density as f(x | w₁) = f(w₁ | x) f(x) / ∫ f(w₁ | x) f(x) dx, where f(w₁ | x) is the conditional pdf of w₁ given X and f(x) is the unconditional pdf of the covariates X. The denominator serves as the normalizing constant and is obtained by integrating the joint density over all possible values of X. To implement the SIR algorithm, we first generate proposal samples x from a multivariate normal distribution stratified by any dichotomous covariates. Each sample is then assigned an importance weight based on the likelihood of observing W₁ = w₁, thereby adjusting for differences between the proposal and target distributions. The weights are computed from f(w₁ | x) (approximated via regression of w₁ on x) and the unconditional pdf f(x) (estimated from the observed data), with normalization ensuring they sum to one. Finally, we resample from the proposal draws using these importance weights, effectively generating a new set of samples that more accurately reflects f(x | w₁).
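The SIR steps described above can be sketched for a single continuous covariate. Everything numeric here is a hypothetical placeholder: the covariate is imagined as baseline eGFR with f(x) normal, the proposal is a wider normal, and f(w₁ | x) is a normal linear regression with made-up coefficients. The paper's implementation uses a multivariate proposal stratified by dichotomous covariates, which this sketch does not attempt.

```python
import random
from statistics import NormalDist

def sir_sample_x_given_w1(w1, n_proposal=20000, n_keep=2000, seed=7):
    """Approximate draws from f(x | w1) by sampling-importance-resampling."""
    rng = random.Random(seed)
    prior = NormalDist(45.0, 12.0)      # hypothetical f(x)
    proposal = NormalDist(45.0, 15.0)   # hypothetical proposal, wider than f(x)
    a, b, s = -12.0, 0.12, 12.0         # hypothetical regression: w1 | x ~ N(a + b*x, s)

    # 1. Sample candidates from the proposal distribution.
    xs = [rng.gauss(proposal.mean, proposal.stdev) for _ in range(n_proposal)]
    # 2. Importance weights: target f(w1 | x) * f(x) divided by proposal density.
    weights = [NormalDist(a + b * x, s).pdf(w1) * prior.pdf(x) / proposal.pdf(x)
               for x in xs]
    total = sum(weights)
    weights = [w / total for w in weights]  # normalize so the weights sum to one
    # 3. Resample candidates with probability proportional to their weights.
    return rng.choices(xs, weights=weights, k=n_keep)

draws = sir_sample_x_given_w1(-35.0)
print(sum(draws) / len(draws))
```

In this conjugate toy setting the target distribution is itself normal, so the resampled mean can be checked against the closed-form posterior mean; with mixed continuous and categorical covariates no such closed form is available, which is the situation SIR is meant for.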

We are now able to define Algorithm 1, which provides the analytical steps needed to estimate the quantities in 6.

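Algorithm 1: Estimate the conditional treatment effect in (6) with covariate adjustment

Step 1: Fit the regression models W_i(1) = E[W_i(1) | X_i] + r_i(1) in the SGLT-2i group and W_i(0) = E[W_i(0) | X_i] + r_i(0) in the placebo group, recording estimates of the coefficients and residuals r_i(1), r_i(0). Define a grid of w₁ values.

Step 2: Fit a Fine-Gray regression [35] (for competing outcomes) or Cox regression (non-competing outcomes) relating T_i(1) jointly to W_i(1) and X_i.

2a: Do the following for each w₁ value in the grid of w₁ values:

2ai: Using the SIR algorithm, draw x_i, i = 1, 2, …, n, from f(x | w₁).

2aii: Apply the Breslow method [36] to the model in Step 2 to provide estimates of P[T_i(1) ≤ k, E_i(1) = e | W_i(1) = w₁, X_i = x_i].

Step 3: Fit a Fine-Gray (or Cox) regression relating T_i(0) jointly to W_i(0) and X_i.

3a: Do the following for each w₁ value in the grid of w₁ values:

3ai: Compute the residual r_i(1) = w₁ − E[W_i(1) | X_i = x_i], using the fitted conditional mean from Step 1.

3aii: Compute y_i(1) = Φ⁻¹(F_r(1)(r_i(1))), where F_r(1) is estimated using the residuals from the regression in the SGLT-2i group.

3aiii: Simulate a random sample of J values y_{i,j}(0), j = 1, 2, …, J, for each i = 1, 2, …, n from the Normal distribution N(ρ y_i(1), 1 − ρ²).

3aiv: Compute r_{i,j}(0) = F_r(0)⁻¹(Φ(y_{i,j}(0))), where F_r(0) is estimated using the residuals from the regression in the placebo group.

3av: Compute w_{0,i,j} = E[W_i(0) | X_i = x_i] + r_{i,j}(0), using the fitted conditional mean from Step 1.

3avi: Apply the Breslow method to the model in Step 3 to provide estimates of P[T_i(0) ≤ k, E_i(0) = e | W_i(0) = w_{0,i,j}, X_i = x_i].

3b: Estimate the treatment effect at w₁ as the difference between the average over i of the estimates in 2aii and the average over i and j of the estimates in 3avi.

Step 4: Bootstrap the whole process to estimate pointwise 95% CIs for the estimated treatment effect at each w₁.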
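The bootstrap step can be sketched generically. The text does not state which type of bootstrap interval was used, so the percentile method below is an assumption, as are the function name and the `estimator` callback, which stands in for re-running Steps 1 to 3 on a resampled dataset.

```python
import random

def percentile_bootstrap_ci(data, estimator, n_boot=1000, alpha=0.05, seed=11):
    """Percentile bootstrap CI for estimator(data), resampling rows with replacement.

    `estimator` is a hypothetical callback that refits the whole pipeline on a
    resampled dataset and returns a single number (e.g. the effect at one w1).
    """
    rng = random.Random(seed)
    n = len(data)
    stats = sorted(
        estimator([data[rng.randrange(n)] for _ in range(n)])
        for _ in range(n_boot)
    )
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return lower, upper

# Toy usage: CI for a sample mean of made-up numbers.
toy = [float(v) for v in range(100)]
print(percentile_bootstrap_ci(toy, lambda rows: sum(rows) / len(rows)))
```

For pointwise intervals over a grid of w₁ values, the same resampled datasets would be reused across the grid so that each bootstrap replicate yields a whole curve.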

3.4 Bound on ρ

To evaluate plausible bounds on the correlation ρ, we used a simulation-based procedure to reconstruct the joint distribution of the acute eGFR changes under placebo and SGLT-2i while preserving the observed marginal distributions within each treatment arm.

We first performed separate regressions of the acute percent changes in eGFR on baseline eGFR, log-proteinuria, age, systolic blood pressure, sex, and diabetes status in the SGLT-2i and placebo groups. This yielded arm-specific conditional mean functions E[W(a) | X], where W(a) denotes the acute change under treatment a ∈ {0, 1}. Residuals were extracted from each model to represent unexplained variation in acute change after adjustment for measured covariates. We then fit Johnson SU distributions separately to the residuals in the placebo and SGLT-2i arms to map the residual distributions to standard normal distributions.

To construct a joint distribution for (W(0), W(1)) under a candidate value of the correlation ρ, we simulated pairs of standardized residuals from a bivariate normal distribution with mean zero, unit variances, and correlation ρ. These latent normal draws were then transformed to the original residual scale using the inverse Johnson SU cumulative distribution functions for each arm, preserving the empirically estimated marginal residual distributions while imposing the specified cross-arm dependence structure.

We constructed potential acute changes by adding the simulated residuals to the predicted mean acute change evaluated at representative covariate values (continuous covariates fixed at their sample means, with sex and diabetes indicators fixed at prespecified values). This yielded simulated draws of (W(0), W(1)) consistent with both the observed regression structure and the hypothesized residual correlation ρ.

For each value of the correlation ρ, we generated 15,000 simulated joint draws and computed the distribution of the individual acute effect W(1) − W(0). We then evaluated the proportion of simulated patients whose acute effect exceeded a prespecified threshold (scaled to percentage change relative to mean baseline eGFR). By examining how this probability varied across values of ρ, we sought to determine the smallest correlation that is biologically plausible.
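A compact sketch of this simulation, assuming hypothetical Johnson SU residual parameters (symmetric, with a standard deviation near 12.7), hypothetical predicted means of –0.6% and –6.4% at the representative covariate values, and a hypothetical threshold of 10 percentage points. None of these numbers are fitted values from the trials.

```python
import math
import random

def inv_johnson_su(y, gamma, delta, xi, lam):
    """Inverse Johnson SU transform: maps a standard normal value to the residual scale."""
    return xi + lam * math.sinh((y - gamma) / delta)

def prop_acute_effect_above(rho, threshold, n_draws=15000, seed=3):
    """Share of simulated patients with W(1) - W(0) above `threshold`."""
    rng = random.Random(seed)
    placebo = dict(gamma=0.0, delta=2.0, xi=0.0, lam=22.3)  # hypothetical fit
    active = dict(gamma=0.0, delta=2.0, xi=0.0, lam=22.3)   # hypothetical fit
    mean0, mean1 = -0.6, -6.4                               # hypothetical predicted means
    count = 0
    for _ in range(n_draws):
        # Correlated standard normal pair with correlation rho.
        z0 = rng.gauss(0.0, 1.0)
        z1 = rho * z0 + math.sqrt(1.0 - rho ** 2) * rng.gauss(0.0, 1.0)
        w0 = mean0 + inv_johnson_su(z0, **placebo)
        w1 = mean1 + inv_johnson_su(z1, **active)
        count += (w1 - w0) > threshold
    return count / n_draws

for rho in (0.4, 0.6, 0.8, 0.95):
    print(rho, prop_acute_effect_above(rho, threshold=10.0))
```

Scanning the printed proportions against a clinically motivated ceiling on how many patients could plausibly experience such a large treatment-induced rise is one way to read off a lower bound for the correlation.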

4 Data analysis

We included participants from three landmark SGLT-2i trials pivotal in demonstrating the benefits of SGLT-2i in slowing kidney disease progression.

The DAPA-CKD trial enrolled 4,304 CKD patients, with and without diabetes, randomized to dapagliflozin or placebo [3]. Dapagliflozin significantly reduced the risk of the primary outcome – a composite of 50% eGFR decline, kidney failure, death due to kidney failure, or cardiovascular mortality. Benefits were seen in both diabetic and nondiabetic groups, with similar adverse event rates in both treatment arms.

The CREDENCE trial involved 4,401 patients with type 2 diabetes and CKD, comparing canagliflozin to placebo [37]. Canagliflozin significantly lowered the risk of a primary composite outcome of 57% eGFR decline, kidney failure, death due to kidney failure, or cardiovascular mortality.

The EMPA-REG OUTCOME trial assessed 7,020 type 2 diabetes patients with cardiovascular disease, evaluating empagliflozin against placebo for three-point major adverse cardiovascular events (cardiovascular death, nonfatal myocardial infarction, or nonfatal stroke) and kidney disease progression [5]. Empagliflozin significantly reduced the risk of an exploratory endpoint of kidney failure and 57% eGFR decline.

In our analysis, we focused on three clinical endpoints: (1) kidney failure (with all-cause mortality as a competing event), (2) all-cause mortality (with kidney failure as a competing event), and (3) a composite of kidney failure and all-cause mortality. We defined the acute eGFR change as the percentage change in eGFR from baseline to the measurement taken at the first post-randomization visit. We pooled individual-level data from the three trials for analysis. Trial participants were then classified as receiving SGLT-2i vs. placebo based on the individual trial protocol definitions.

We estimated the estimand of interest (the treatment effect at k = 2 years) by implementing the methods outlined in Algorithm 1 in each of the 3 studies and overall. We confirmed the suitability of the proportional hazards assumption using the modified weighted Schoenfeld residuals [38]. For the kidney failure and all-cause mortality outcomes, we used the Fine-Gray model to relate each clinical outcome to the acute eGFR change (with subdistribution hazards as a function of the acute eGFR change modeled using restricted cubic splines with 4 knots). We used a similar Cox proportional hazards model to relate the composite outcome to the acute eGFR change. We included baseline eGFR and log baseline proteinuria, which are markers of severity and aggressiveness of chronic kidney disease, as covariates. Based on subject matter considerations, we also adjusted for baseline SBP, age, sex, and diabetes status (in the DAPA-CKD analysis). We estimated the treatment effect in the three trials separately and, for each clinical outcome, pooled the effect estimates across the 3 studies using an average of the trials’ effect estimates weighted by relative sample size. We report the overall estimated treatment effects (and their 95% confidence intervals from 1000 bootstrap estimates) for all 3 clinical outcomes when the acute eGFR change is 0% and when there is a decline of 20%, for correlation values of 0.4, 0.6, 0.8, and 1.
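
The pooling and interval steps can be illustrated as follows. This is a sketch under assumptions rather than the authors' implementation: the trial-level estimates and bootstrap replicates are invented, the weights use the analysis-cohort sizes of the three trials, and a percentile interval is assumed because the text does not say which bootstrap interval was used.

```python
import numpy as np


def pool_by_sample_size(estimates, sizes):
    """Average trial-level estimates with weights proportional to sample size."""
    sizes = np.asarray(sizes, dtype=float)
    weights = sizes / sizes.sum()
    return float(np.dot(weights, np.asarray(estimates, dtype=float)))


def percentile_interval(boot_estimates, level=0.95):
    """Percentile bootstrap interval (one common choice; assumed here)."""
    alpha = 1.0 - level
    lo, hi = np.quantile(boot_estimates, [alpha / 2, 1 - alpha / 2])
    return float(lo), float(hi)


# Analysis-cohort sizes for CREDENCE, DAPA-CKD, and EMPA-REG OUTCOME.
sizes = [4273, 3959, 6869]
# Hypothetical 2-year risk reductions for one outcome (illustration only).
estimates = [0.020, 0.012, 0.002]
pooled = pool_by_sample_size(estimates, sizes)

# Hypothetical bootstrap replicates standing in for the 1000 re-estimates.
rng = np.random.default_rng(1)
boot = pooled + rng.normal(0.0, 0.002, size=1000)
print(pooled, percentile_interval(boot))
```

Weighting by relative sample size gives the largest trial the most influence on the pooled estimate, regardless of how many events each trial contributed.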

We visualized the 2-year effect estimates for the kidney failure outcome analysis for correlation values ranging from 0 to 1 and across a broad range of acute eGFR changes (−20% to 20%). We provide the same analyses for all-cause mortality and the composite outcome in the supplementary material.

5 Results

Across the three included landmark trials, EMPA-REG OUTCOME, DAPA-CKD, and CREDENCE, a total of 15,725 participants were initially enrolled. Of these, 470 participants were excluded: 349 due to missing key baseline variables (including creatinine, albumin-to-creatinine ratio [ACR], age, or sex) or privacy concerns, and 121 who lacked a post-randomization eGFR measurement within the first month. This resulted in a combined cohort of 15,255 individuals. To address extreme outliers in the distribution of acute eGFR changes, the cohort was further trimmed at the 0.5th and 99.5th percentiles, leaving a final analysis cohort of 15,101 participants (CREDENCE: 4,273; DAPA-CKD: 3,959; and EMPA-REG OUTCOME: 6,869).
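
The trimming step can be sketched as below. The data are simulated stand-ins, the function name is hypothetical, and the inclusive cut at the pooled sample quantiles is an assumption; the text does not state whether trimming was applied to the pooled data or within each trial.

```python
import numpy as np


def trim_extremes(values, lower=0.005, upper=0.995):
    """Keep observations between the lower and upper sample quantiles (inclusive)."""
    values = np.asarray(values, dtype=float)
    lo, hi = np.quantile(values, [lower, upper])
    keep = (values >= lo) & (values <= hi)
    return values[keep], keep


# Hypothetical acute % changes standing in for the 15,255 pre-trimming records.
rng = np.random.default_rng(2)
acute_change = rng.normal(-3.9, 14.0, size=15_255)
trimmed, keep = trim_extremes(acute_change)
print(len(acute_change), len(trimmed))
```

With 15,255 distinct values and NumPy's default linear-interpolation quantiles, this rule drops 77 records in each tail, which is consistent with the reported reduction from 15,255 to 15,101 participants.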

The characteristics of participants included in the analysis are shown by study and overall in Table 1. Overall, the mean baseline eGFR was 61.9 ± 22.3 ml/min/1.73 m², with participants from EMPA-REG having the highest baseline eGFR on average (76.4 ± 19.8 ml/min/1.73 m²). Across studies, EMPA-REG participants had the lowest median baseline proteinuria (17.7 mg/g vs. DAPA-CKD: 900.0 mg/g, CREDENCE: 928.0 mg/g). Overall, the mean eGFR change from baseline was −3.9 ± 12.7% and ranged from −42.7% to 54.5%. CREDENCE recorded the highest percentage (6.4%) of kidney failure events, and EMPA-REG recorded the lowest percentage (0.4%). Overall, there were 954 (6.0%) deaths and 537 (4.0%) kidney failure events.

Table 1. Table of Characteristics by Study and Overall.

https://doi.org/10.1371/journal.pone.0347741.t001

Combining data from all trials, the placebo group shows minimal acute change in eGFR (mean: −0.57%), while the SGLT-2i group has a mean change of −6.35% (Fig 2). This consistent pattern across the trials indicates that SGLT-2i treatment is associated with a substantial acute decline in eGFR, while placebo generally shows smaller or even positive eGFR changes.

Fig 2. Kernel density of the acute eGFR change by trial and overall.

The figure illustrates that SGLT-2i treatment induces a significant acute decline in eGFR compared to placebo across all trials, which aligns with the known hemodynamic effects of these medications.

https://doi.org/10.1371/journal.pone.0347741.g002

Fig 3 displays the cubic spline regressions relating the sub-distribution hazard for kidney failure to the acute eGFR change by randomized group with adjustment for the covariates, using the median acute eGFR change in the placebo group as the reference. The hazard ratios were consistently less than 1 across the full range of acute eGFR changes in the SGLT-2i group.

Fig 3. Empirical relationship between the acute eGFR change and kidney failure.

This shows the adjusted relationship between the acute eGFR change and the sub-distribution hazard of kidney failure, pooling across the three studies. The effect of the acute eGFR change was modeled using a restricted cubic spline with 4 knots placed at the default locations of the rcs function in the rms package in R. We allowed for an interaction between the restricted cubic spline terms for the acute eGFR change at 1 month and the treatment variable. We set the median acute eGFR change in the placebo group as the reference point. The models were adjusted for baseline eGFR, proteinuria, SBP, age, sex, and diabetes status. The shaded ribbons represent the 95% CI. The left y-axis provides the adjusted sub-distribution hazard ratio for every value of the acute eGFR change compared to the reference point. The histogram of the acute eGFR change is shown with the density on the right y-axis.

https://doi.org/10.1371/journal.pone.0347741.g003

Fig 4 shows the estimated reduction in the risk of kidney failure by 2 years between patients with vs. without SGLT-2i given the observed acute eGFR change after starting SGLT-2i, using the approach defined in Algorithm 1. In contrast to Fig 3, which depicts the directly observed relationship between the sub-distribution hazard for kidney failure and the acute eGFR change, the estimated treatment effects in Fig 4 depend on the sensitivity parameter, which cannot be determined from the data. However, across all values of the sensitivity parameter, we find no evidence that a negative acute eGFR change after SGLT-2i initiation indicates either an adverse effect or a reduction in the benefit of SGLT-2i on kidney failure. Moreover, for larger values of the sensitivity parameter, we observe a trend suggesting that more negative acute changes signify a greater benefit of SGLT-2i on kidney failure. A similar pattern was observed for the composite of kidney failure and death, and an overall benefit of SGLT-2i on the competing risk of mortality free of kidney failure was seen irrespective of the assumed value of the sensitivity parameter (see Table 2, S2 Fig, and S4 Fig). To put the treatment effects displayed in Fig 4 into context, an absolute risk reduction of 0.010 at 2 years corresponds to a relative risk reduction of approximately 30% compared to the 2-year risk of kidney failure in the control group of 0.034.
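
The conversion from absolute to relative risk reduction quoted above is a single division; the abbreviations ARR and RRR are introduced here for convenience and are not the paper's notation.

```latex
\[
\text{RRR} \;=\; \frac{\text{ARR}}{\text{risk}_{\text{control}}}
\;=\; \frac{0.010}{0.034} \;\approx\; 0.29,
\qquad
\text{risk}_{\text{SGLT-2i}} \;=\; 0.034 - 0.010 \;=\; 0.024 .
\]
```

A ratio of about 0.29 is what the text rounds to approximately 30%.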

Table 2. Estimated Risk Reduction for Clinical Outcomes Given the Observed eGFR Decline After Starting SGLT-2i.

https://doi.org/10.1371/journal.pone.0347741.t002

Fig 4. Implications of the acute eGFR change on the estimated effect of SGLT-2i on the risk of kidney failure by 2 years.

The figure displays the implications of the observed acute eGFR change after starting SGLT-2i on the estimated effect of SGLT-2i on the risk of kidney failure within two years for alternative values of the sensitivity parameter. The analysis adjusts for baseline eGFR, log-transformed proteinuria, age, systolic blood pressure (SBP), sex, and diabetes status. The relationship between the acute eGFR change and kidney failure (with all-cause mortality as a competing event) was modeled using a restricted cubic spline with four knots. The y-axis shows the 2-year reduction in risk of kidney failure for patients treated with SGLT-2i compared to those without, using the steps outlined in Algorithm 1. The bottom axis represents the observed acute eGFR change after starting SGLT-2i, expressed as % change from baseline. The top axis indicates the corresponding estimated mean acute eGFR change with placebo under the assumed value of the sensitivity parameter. The steel blue line shows the estimated reduction in the risk of kidney failure under the assumed value of the sensitivity parameter given the observed acute eGFR change following SGLT-2i initiation, with 95% pointwise confidence intervals.

https://doi.org/10.1371/journal.pone.0347741.g004

We also formally addressed whether the conditional average treatment effects on the clinical endpoints at 2 years varied across different observed acute eGFR changes following SGLT-2i initiation. This was achieved by estimating the difference between the conditional average treatment effect at each acute eGFR change and the conditional average treatment effect when the acute eGFR change is 0 (see S1 Fig, S3 Fig, and S5 Fig). For kidney failure alone (S1 Fig) and for the composite of kidney failure and death (S5 Fig), larger acute declines in eGFR after starting SGLT-2i were associated with increased treatment benefit, with the trend becoming more pronounced as the sensitivity parameter increases. However, the 95% confidence intervals for these differences largely overlap with zero across the spectrum examined, indicating that the trends for increased benefit for larger eGFR reductions did not reach statistical significance.

Table 3 shows the implications of the correlation parameter for the variability in the acute effect of SGLT-2i and the associated predicted proportion of patients with an acute effect greater than the prespecified threshold. In this analysis, we converted the acute effect expressed as a percent to an absolute difference in eGFR levels to facilitate judgments as to the plausible size of the acute effect. As in the heuristic example displayed in Fig 1, smaller values of the correlation parameter imply greater variation in the acute effect, with a consequent increase in the proportion of patients with positive acute effects greater than the threshold.

Table 3. Estimated bounds on the sensitivity parameter.

https://doi.org/10.1371/journal.pone.0347741.t003

For values of , more than 16% of patients would exhibit an acute increase in eGFR greater than the prespecified threshold within one month of treatment initiation. SGLT-2i could cause short-term increases in eGFR in some patients if their beneficial effect in slowing CKD progression produced a larger short-term effect on eGFR change than their hemodynamic reduction in eGFR. However, eGFR reduction associated with CKD progression in the investigated populations is typically between 2 and 6 ml/min/1.73 m² per year, or no more than 0.5 ml/min/1.73 m² within 1 month. Thus, we are not aware of a plausible biological mechanism that would drive such a widespread improvement in kidney function within 1 month after starting SGLT-2i in a large proportion of patients. If widespread positive acute effects greater than the threshold are considered unlikely, this analysis would suggest that the correlation parameter is likely to be relatively high, probably ≥ 0.80.

6 Discussion

Nephrologists have often speculated that acute declines in eGFR following initiation of SGLT-2i and other medications that produce similar acute eGFR declines are benign and reversible [4,5,39]. Nonetheless, concerns about treatment safety and efficacy persist among clinicians and patients. This paper evaluates the implications of the observed acute eGFR change for the treatment’s longer-term effects. Recognizing that the true acute effect is unobserved, we use the framework of principal stratification to make explicit the inherent limits in what the data reveal. We characterize the key uncertainty in our understanding through the sensitivity parameter, which defines the correlation between suitably transformed potential acute eGFR changes with and without the treatment.

Notably, irrespective of the value of the sensitivity parameter (ranging from 0 to 1), we found no evidence that a larger-than-average eGFR decline indicates a reduction in SGLT-2i’s long-term benefits. In fact, for higher values of the sensitivity parameter, our findings suggest a potential benefit associated with more substantial eGFR declines, suggesting that such a decline might signify a protective mechanism contributing to long-term benefit.

It is a natural human response to attribute changes in a biomarker after starting a new treatment to the treatment itself. This tendency is reflected in the common use of the expression “treatment response” to describe changes in biomarkers and other outcomes observed soon after treatment initiation. Similarly, clinicians and patients may assume that acute eGFR declines reflect effects of the treatment. However, as is evident from the substantial variation in acute eGFR changes in the placebo groups of the studies in our analysis (see Fig 2), where no new treatment was initiated, eGFR changes following treatment initiation must reflect multiple factors. These include measurement error and underlying variability in disease progression in addition to any true variation in the acute treatment effect. Distinguishing acute changes from actual causal acute effects is challenging because acute effects represent a comparison between outcomes with and without treatment, and such comparisons are unobservable. This limitation also prevents empirical determination of the sensitivity parameter. Assumptions regarding the value of the sensitivity parameter belong to the class of so-called “cross-world” assumptions, which are recognized as difficult to justify in causal inference.

As illustrated in Fig 1, the value of the sensitivity parameter quantifies the amount of heterogeneity in the acute effect. Values close to 1 indicate that SGLT-2i produce a uniform acute eGFR reduction across patients, implying that essentially all the variation in acute eGFR changes following SGLT-2i initiation is unrelated to the treatment. In contrast, values close to 0 imply substantial heterogeneity in the acute effect, implying that variation in acute effects could meaningfully contribute to observed variation in early eGFR changes after starting the treatment. As illustrated in Fig 1, the uncertainty in the value of the sensitivity parameter has implications for the treatment effect on longer-term clinical endpoints. In settings where larger acute eGFR declines are associated with increased risk of clinical events, large negative acute eGFR changes on the treatment could signify an attenuation or reversal in an overall beneficial effect if the sensitivity parameter is close to 0, but not if it is close to 1.
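
The link between the correlation and heterogeneity in the acute effect can be made concrete with a simplified calculation. The sketch below assumes, purely for illustration, that the two potential acute changes are bivariate normal on the original scale with a common standard deviation σ, correlation ρ, and mean acute effect δ; the paper instead applies the correlation to transformed changes, and the symbols σ, ρ, δ, and t are notation introduced here.

```latex
\[
\operatorname{Var}\bigl(W(1) - W(0)\bigr)
= \sigma^2 + \sigma^2 - 2\rho\,\sigma^2
= 2\sigma^2 (1 - \rho),
\]
\[
\Pr\bigl(W(1) - W(0) > t\bigr)
= 1 - \Phi\!\left(\frac{t - \delta}{\sigma\sqrt{2(1 - \rho)}}\right),
\qquad 0 \le \rho < 1 .
\]
```

As ρ approaches 1 the variance of the acute effect shrinks to zero, so every patient has the same acute effect δ. For a threshold t above δ, the probability of exceeding t grows as ρ decreases, which is why small correlations imply that many patients would have positive acute effects.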

We observed only a negligible increase in the variability of the acute eGFR changes in the SGLT-2i groups compared to placebo in the studies of our analysis. This could suggest that any heterogeneity in the acute effect of SGLT-2i must have been small, consistent with a large value of the sensitivity parameter (see Fig 2). However, similar variability in treatment and control groups could still occur under a heterogeneous acute effect if the acute effect were inversely correlated with acute eGFR changes with placebo. A stronger argument supporting a large value is provided by noting that smaller values, and the consequent higher heterogeneity in the acute effect, would imply that a substantial proportion of the study population must experience substantial positive acute effects (see Panel 3 of Fig 1 and Table 3). We are unaware of any mechanism that could lead to positive acute effects greater than 1 ml/min/1.73 m² in more than a very small fraction of patients. This suggests a large value for the sensitivity parameter, probably greater than 0.8.

Ultimately, across the entire spectrum of possible values for the sensitivity parameter, our data confirm that the observed initial eGFR change does not signal a reduction in the expected benefits for kidney failure or all-cause mortality. Further, for the higher and more plausible values of the sensitivity parameter, we observed a trend suggesting that more substantial acute declines may signify a protective hemodynamic mechanism leading to enhanced long-term preservation of kidney function, although the 95% confidence intervals for the treatment-by-acute-eGFR-change interaction effects indicated that this trend did not reach formal statistical significance. Overall, our results suggest that the magnitude of the acute eGFR change should not be viewed as a signal to de-escalate therapy, but rather as an expected physiologic response to treatment that is compatible with significant long-term clinical benefit.

The demonstration that acute eGFR declines do not diminish the long-term protective benefits of SGLT-2i has immediate implications for clinical decision making and patient communication. Clinicians should proactively counsel patients that an initial decline in estimated kidney function (via eGFR), commonly up to 20% within the first month of treatment, represents an expected and generally benign effect that reflects some combination of the variation that would have occurred without the treatment and consistent benign acute effects resulting from reduced intra-glomerular pressure. Because our analysis indicates that the substantial long-term relative risk reduction for kidney failure persists even in patients experiencing such acute declines, treatment should not be prematurely discontinued based solely on these early biomarker fluctuations. By utilizing a principal stratification framework, our methodology helps differentiate drug-induced hemodynamic changes from underlying disease progression, providing reassurance that even larger-than-average acute declines do not signal a reduction in therapeutic efficacy. Ultimately, these findings provide an evidence-based framework for clinicians to initiate SGLT-2i with confidence across a broad range of patient profiles, shifting the focus of clinical monitoring from transient early eGFR changes to the sustained prevention of kidney failure and mortality.

A potential limitation of our analysis is its reliance on the assumption that no unmeasured factor jointly influences both the acute effect and CKD progression under placebo. While we adjust for known confounders, which are strong predictors of CKD progression, it remains possible that an unobserved variable could introduce bias. Although we cannot entirely rule out this possibility, the confounders included in our analysis are well-established determinants of CKD progression. A further conceptual limitation is that the principal stratification approach used in this paper expresses the plausible range of treatment effects for clinical endpoints as a function of the acute eGFR change, which can be observed only 1 month after the treatment has already been initiated. Because the acute eGFR change is not known at the time of treatment initiation, it is not possible, in general, to use it to guide decisions regarding treatment initiation without additional assumptions. However, in this case, we saw no evidence for an adverse effect of SGLT-2i treatment across the full range of the distribution of acute eGFR changes, providing reassurance of no adverse effect of the treatment irrespective of any plausible eGFR declines that may subsequently be observed.

A further limitation of the current analysis is that it does not explicitly model the joint relationship between acute eGFR changes and early changes in albuminuria, such as the albumin-to-creatinine ratio (ACR). Reductions in albuminuria are a hallmark of SGLT-2i therapy and are strongly associated with long-term kidney protection. Biologically, the acute eGFR decline and the reduction in proteinuria are often coupled, as both are consequences of the treatment’s ability to reduce intraglomerular pressure through tubuloglomerular feedback. By focusing solely on the eGFR decline, our study demonstrates that this specific biomarker change does not indicate a loss of benefit, even without accounting for concurrent ACR improvements. However, incorporating acute changes in ACR into our principal stratification framework would likely provide deeper mechanistic insights. For instance, it would be valuable to determine whether the long-term benefit of SGLT-2i is even more pronounced in patients who experience both a significant acute eGFR decline and a substantial reduction in albuminuria.

Future research could extend our approach by considering the eGFR slope following the 1-month acute effect as the outcome, providing a more specific assessment of long-term kidney function decline beyond the initial acute effects. Building on the need for deeper mechanistic insights, our framework could be adapted to a multi-biomarker approach that incorporates the albumin-to-creatinine ratio, a key indicator of kidney disease progression and treatment response. Exploring the joint interplay between acute changes in ACR and eGFR would allow for a more granular assessment of long-term outcomes, potentially refining patient stratification and further distinguishing benign hemodynamic shifts from sustained renal protection. Beyond kidney disease, the methodology used in this report could be extended to other disease settings where early biomarker changes are predictive of long-term outcomes.

In conclusion, our analysis shows no evidence that a larger eGFR decline diminishes SGLT-2i’s benefits. For higher values of the sensitivity parameter, a more substantial acute decline might even be associated with greater treatment efficacy.

Supporting information

S1 Fig. Estimated interaction effect on kidney failure at 2 years.

The figure displays the difference between the estimated 2-year risk reduction in kidney failure given the observed acute eGFR change after starting SGLT-2i and the 2-year risk reduction in kidney failure if the acute eGFR change = 0, across different values of the sensitivity parameter. The acute eGFR changes are expressed as % changes.

https://doi.org/10.1371/journal.pone.0347741.s001

(DOCX)

S2 Fig. Implications of the acute eGFR change on the estimated effect of SGLT-2i on the risk of death prior to kidney failure by 2 years.

The figure displays the implications of the observed acute eGFR change after starting SGLT-2i on the estimated effect of SGLT-2i on the risk of death prior to kidney failure within two years for alternative values of the sensitivity parameter. The analysis adjusts for baseline eGFR, log-transformed proteinuria, age, systolic blood pressure (SBP), sex, and diabetes status. The relationship between the acute eGFR change and death (with kidney failure as a competing event) was modeled using a restricted cubic spline with four knots. The y-axis shows the 2-year reduction in risk of death prior to kidney failure for patients treated with SGLT-2i compared to those without, using the steps outlined in Algorithm 1. The bottom axis represents the observed acute eGFR change after starting SGLT-2i, expressed as % change from baseline. The top axis indicates the corresponding estimated mean acute eGFR change with placebo under the assumed value of the sensitivity parameter. The steel blue line shows the estimated reduction in the risk of death prior to kidney failure under the assumed value of the sensitivity parameter given the observed acute eGFR change following SGLT-2i initiation, with 95% pointwise confidence intervals.

https://doi.org/10.1371/journal.pone.0347741.s002

(DOCX)

S3 Fig. Estimated interaction effect on death prior to kidney failure at 2 years.

The figure displays the difference between the estimated 2-year risk reduction in death prior to kidney failure given the observed acute eGFR change after starting SGLT-2i and the 2-year risk reduction in death prior to kidney failure if the acute eGFR change = 0, across different values of the sensitivity parameter. The acute eGFR changes are expressed as % changes.

https://doi.org/10.1371/journal.pone.0347741.s003

(DOCX)

S4 Fig. Implications of the acute eGFR change on the estimated effect of SGLT-2i on the composite of kidney failure or death by 2 years.

The figure displays the implications of the observed acute eGFR change after starting SGLT-2i on the estimated effect of SGLT-2i on the risk of kidney failure or death within two years for alternative values of the sensitivity parameter. The analysis adjusts for baseline eGFR, log-transformed proteinuria, age, systolic blood pressure (SBP), sex, and diabetes status. The relationship between the acute eGFR change and the composite of kidney failure or death was modeled using a restricted cubic spline with four knots. The y-axis shows the 2-year reduction in risk of kidney failure or death for patients treated with SGLT-2i compared to those without, using the steps outlined in Algorithm 1. The bottom axis represents the observed acute eGFR change after starting SGLT-2i, expressed as % change from baseline. The top axis indicates the corresponding estimated mean acute eGFR change with placebo under the assumed value of the sensitivity parameter. The steel blue line shows the estimated reduction in the risk of kidney failure or death under the assumed value of the sensitivity parameter given the observed acute eGFR change following SGLT-2i initiation, with 95% pointwise confidence intervals.

https://doi.org/10.1371/journal.pone.0347741.s004

(DOCX)

S5 Fig. Estimated interaction effect on the composite of kidney failure or death at 2 years.

The figure displays the difference between the estimated 2-year risk reduction in the composite of kidney failure or death given the observed acute eGFR change after starting SGLT-2i and the 2-year risk reduction in the composite of kidney failure or death if the acute eGFR change = 0, across different values of the sensitivity parameter. The acute eGFR changes are expressed as % changes.

https://doi.org/10.1371/journal.pone.0347741.s005

(DOCX)

References

1. Mosenzon O, Wiviott SD, Cahn A, Rozenberg A, Yanuv I, Goodrich EL, et al. Effects of dapagliflozin on development and progression of kidney disease in patients with type 2 diabetes: an analysis from the DECLARE-TIMI 58 randomised trial. Lancet Diabetes Endocrinol. 2019;7(8):606–17. pmid:31196815
2. Neuen BL, Young T, Heerspink HJL, Neal B, Perkovic V, Billot L, et al. SGLT2 inhibitors for the prevention of kidney failure in patients with type 2 diabetes: a systematic review and meta-analysis. Lancet Diabetes Endocrinol. 2019;7(11):845–54. pmid:31495651
3. Heerspink HJ, Stefánsson BV, Correa-Rotter R, Chertow GM, Greene T, Hou F, et al. Dapagliflozin in patients with chronic kidney disease. N Engl J Med. 2020;383(15):1436–46.
4. Perkovic V, Jardine M, Neal B, Bompoint S, Heerspink H, Charytan D, et al. Canagliflozin and renal outcomes in type 2 diabetes and nephropathy. N Engl J Med. 2019;380:2295.
5. Wanner C, Inzucchi SE, Lachin JM, Fitchett D, von Eynatten M, Mattheus M, et al. Empagliflozin and progression of kidney disease in type 2 diabetes. N Engl J Med. 2016;375(4):323–34. pmid:27299675
6. Heerspink HJL, Desai M, Jardine M, Balis D, Meininger G, Perkovic V. Canagliflozin slows progression of renal function decline independently of glycemic effects. J Am Soc Nephrol. 2017;28(1):368–75. pmid:27539604
7. Heerspink HJL, Johnsson E, Gause-Nilsson I, Cain VA, Sjöström CD. Dapagliflozin reduces albuminuria in patients with diabetes and hypertension receiving renin-angiotensin blockers. Diabetes Obes Metab. 2016;18(6):590–7. pmid:26936519
8. Meraz-Muñoz AY, Weinstein J, Wald R. eGFR Decline after SGLT2 Inhibitor Initiation: the Tortoise and the Hare Reimagined. Kidney360. 2021;2(6):1042–7. pmid:35373074
9. Oshima M, Jardine MJ, Agarwal R, Bakris G, Cannon CP, Charytan DM, et al. Insights from CREDENCE trial indicate an acute drop in estimated glomerular filtration rate during treatment with canagliflozin with implications for clinical practice. Kidney Int. 2021;99(4):999–1009. pmid:33316282
10. Kraus BJ, Weir MR, Bakris GL, Mattheus M, Cherney DZI, Sattar N, et al. Characterization and implications of the initial estimated glomerular filtration rate “dip” upon sodium-glucose cotransporter-2 inhibition with empagliflozin in the EMPA-REG OUTCOME trial. Kidney Int. 2021;99(3):750–62. pmid:33181154
11. Frangakis CE, Rubin DB. Principal stratification in causal inference. Biometrics. 2002;58(1):21–9. pmid:11890317
12. Qu Y, Lipkovich I, Ruberg SJ. Assessing the commonly used assumptions in estimating the principal causal effect in clinical trials. Statistics in Biopharmaceutical Research. 2023;15(4):812–9.
13. Cheng J, Small DS. Bounds on causal effects in three-arm trials with non-compliance. Journal of the Royal Statistical Society Series B: Statistical Methodology. 2006;68(5):815–36.
14. Imai K. Sharp bounds on the causal effects in randomized experiments with “truncation-by-death”. Statistics & Probability Letters. 2008;78(2):144–9.
15. Ding P, Geng WY, Zhou XH. Identifiability and estimation of causal effects by principal stratification with outcomes truncated by death. Journal of the American Statistical Association. 2011;106(496):1578–91.
16. Mattei A, Mealli F. Augmented designs to assess principal strata direct effects. Journal of the Royal Statistical Society Series B: Statistical Methodology. 2011;73(5):729–52.
17. Mealli F, Pacini B. Using secondary outcomes to sharpen inference in randomized experiments with noncompliance. Journal of the American Statistical Association. 2013;108(503):1120–31.
18. Mealli F, Pacini B, Stanghellini E. Identification of principal causal effects using additional outcomes in concentration graphs. Journal of Educational and Behavioral Statistics. 2016;41(5):463–80.
19. Jiang Z, Ding P, Geng Z. Principal causal effect identification and surrogate end point evaluation by multiple trials. Journal of the Royal Statistical Society Series B: Statistical Methodology. 2015;78(4):829–48.
20. Mattei A, Ding P, Ballerini V, Mealli F. Assessing causal effects in the presence of treatment switching through principal stratification. Bayesian Anal. 2025;20(3).
21. Sjölander A, Humphreys K, Vansteelandt S, Bellocco R, Palmgren J. Sensitivity analysis for principal stratum direct effects, with an application to a study of physical activity and coronary heart disease. Biometrics. 2009;65(2):514–20. pmid:18759834
22. Jiang Z, Yang S, Ding P. Multiply robust estimation of causal effects under principal ignorability. Journal of the Royal Statistical Society Series B: Statistical Methodology. 2022;84(4):1423–45.
23. Conlon ASC, Taylor JMG, Elliott MR. Surrogacy assessment using principal stratification when surrogate and outcome measures are multivariate normal. Biostatistics. 2014;15(2):266–83. pmid:24285772
24. Lu S, Jiang Z, Ding P. Principal stratification with continuous post-treatment variables: nonparametric identification and semiparametric estimation. 2024. https://arxiv.org/abs/2309.12425
25. Schwartz L, Mealli F. A Bayesian semiparametric approach to intermediate variables in causal inference. Journal of the American Statistical Association. 2011;106(496):1331–44.
26. Antonelli J, Mealli F, Beck B, Mattei A. Principal stratification with continuous treatments and continuous post-treatment variables. 2023. https://arxiv.org/abs/2309.14486
27. Zorzetto D, Canale A, Mealli F, Dominici F, Bargagli-Stoffi FJ. Bayesian nonparametrics for principal stratification with continuous post-treatment variables. 2024. https://arxiv.org/abs/2405.17669
28. Nelsen RB. An Introduction to Copulas. 2nd ed. New York, NY, USA: Springer; 2006.
29. Bartolucci F, Grilli L. Modeling partial compliance through copulas in a principal stratification framework. Journal of the American Statistical Association. 2011;106(494):469–79.
30. Daniels MJ, Roy JA, Kim C, Hogan JW, Perri MG. Bayesian inference for the causal effect of mediation. Biometrics. 2012;68(4):1028–36. pmid:23005030
31. Yang F, Ding P. Using survival information in truncation by death problems without the monotonicity assumption. Biometrics. 2018;74(4):1232–9.
32. Johnson NL. Systems of frequency curves generated by methods of translation. Biometrika. 1949;36(Pt. 1-2):149–76. pmid:18132090
33. Inker LA, Schmid CH, Tighiouart H, Eckfeldt JH, Feldman HI, Greene T, et al. Estimating glomerular filtration rate from serum creatinine and cystatin C. N Engl J Med. 2012;367(1):20–9.
34. Doucet A, De Freitas N, Gordon NJ. Sequential Monte Carlo methods in practice. Springer; 2001.
35. Fine JP, Gray RJ. A proportional hazards model for the subdistribution of a competing risk. Journal of the American Statistical Association. 1999;94(446):496–509.
36. Breslow NE. Contribution to the discussion of paper by D. R. Cox. Journal of the Royal Statistical Society: Series B. 1972;34:216–7.
37. Perkovic V, Jardine MJ, Neal B, Bompoint S, Heerspink HJL, Charytan DM, et al. Canagliflozin and renal outcomes in type 2 diabetes and nephropathy. N Engl J Med. 2019;380(24):2295–306. pmid:30990260
38. Zhou B, Fine J, Laird G. Goodness-of-fit test for proportional subdistribution hazards model. Stat Med. 2013;32(22):3804–11. pmid:23625840
39. Neal B, Perkovic V, Mahaffey KW, de Zeeuw D, Fulcher G, Erondu N, et al. Canagliflozin and cardiovascular and renal events in type 2 diabetes. N Engl J Med. 2017;377(7):644–57. pmid:28605608
