Skip to main content
Advertisement
  • Loading metrics

Trastuzumab in early curative breast cancer: A target trial emulation benchmarked against two randomized clinical trials

Abstract

Background

Benchmarking an observational analysis against a randomized trial can increase confidence in the use of observational data to complement inferences made in trials. Until now, few examples of benchmarking have been within oncology. However, benchmarking trials of a cancer treatment poses a unique set of challenges, such as defining composite outcomes like disease-free survival.

Methods and findings

We designed a target trial with a protocol as similar as possible to the B-31 and N9831 randomized trials, which estimated the effect of adjuvant trastuzumab plus chemotherapy compared with chemotherapy alone in individuals with early human epidermal growth factor receptor 2-positive breast cancer. We then carried out an observational analysis by emulating the target trial using routinely collected data from Swedish registries to understand if we can estimate a similar effect of trastuzumab as the trial. The primary endpoint was the composite of disease-free survival consisting of the earliest of (1) local or regional recurrences, (2) distant recurrences, (3) contralateral breast cancer, (4) other second primary cancer, or (5) death from any cause. Individuals who had data compatible with both treatment strategies at baseline were cloned and one copy was assigned to each arm. We applied inverse probability weights to adjust for baseline and time-varying confounding (e.g., age and hematological events like neutropenia). Our observational analysis included 1,578 women, with a median age of 59 years, and who were diagnosed between 2008 and 2015. We estimated a similar effect after five years of follow-up (RR: 0.54, 95% CI [0.44, 0.67]) for the composite endpoint of disease-free survival as the two jointly analyzed B-31 and N9831 trials (HR: 0.48, 95% CI [0.39, 0.59]). While the comparability of results increases confidence in our estimates, there remains a risk of residual and unmeasured confounding, as is the case with all observational analyses.

Conclusions

We successfully benchmarked an observational analysis against the B-31 and N9831 trials. By aligning protocols and using appropriate methodological approaches, we show that observational data can be used to estimate similar results as randomized trials of cancer treatments, like trastuzumab. This opens the door to using observational data to complement results from randomized trials of cancer treatments which can provide quick, cheap, and robust evidence to support decision-making where trials leave evidence gaps.

Author summary

Why was this study done?

  • Observational analyses can complement results from randomized controlled trials by providing evidence on research questions that they could not answer.
  • For example, observational data can fill gaps in knowledge for vulnerable people, such as older individuals, but this is often hampered by a lack of trust in these data.
  • In addition, oncological studies come with their own set of challenges, such as defining complex outcomes like disease-free survival, aligning a unique time point for treatment initiation in sequences of treatment (e.g., surgery and first-line treatment), and assigning individuals to a treatment when the classification is not distinguishable from the observational data.

What did the researchers do and find?

  • We conducted an observational analysis that closely mimicked the design of two jointly analyzed trials (B-31 and N9831) on the effect of trastuzumab in early curative breast cancer and compared results, i.e., benchmarking.
  • Our observational analysis included 1,578 women diagnosed between 2008 and 2015.
  • We estimated a similar effect after 5 years of follow-up for the composite endpoint of disease-free survival as the two trials. That is, both analyses agreed that trastuzumab substantially increases the 5-year disease-free survival.

What do these findings mean?

  • Of course, we cannot rule out the possibility that unmeasured factors, such as frailty, may have influenced our findings, and that the observed similarity is a chance finding.
  • However, the comparability of results increases confidence in our estimates, and we also used other indirect methods (e.g., negative control outcome) that could have indicated this possibility, but instead confirmed our findings
  • Therefore, the findings show that observational data can be used to estimate similar results as randomized trials of cancer treatments, like trastuzumab.
  • This opens the door to using observational data to complement results from randomized trials of cancer treatments which can provide quick, cheap, and robust evidence to support decision-making where trials leave evidence gaps.

Introduction

Benchmarking an observational analysis against a randomized trial increases confidence in the use of observational data to complement inferences made in trials [1]. If one can first replicate the results of a randomized trial using observational data, clinicians and other decision makers may be more confident in the use of these data to address questions not answered by the trials, e.g., the effect of treatment in populations excluded or underrepresented in the trials [24]. And, when benchmarking fails and observational data cannot reliably estimate similar results to a trial, we enhance the understanding of the limitations of observational data to answer causal questions.

Until now, few examples of benchmarking have been within oncology [5,6]. However, benchmarking an observational analysis against a trial of a cancer treatment poses a unique set of challenges. First, composite outcomes such as disease-free survival are commonly used, which can be hard to define in observational data. Second, correct alignment of time zero in sequences of treatment (e.g., surgery and first-line treatment) is crucial, as treatment delays may lead to a worse prognosis [7]. Third, intended treatment strategies are not always clearly distinguishable at baseline in observational data, thus the risk of selection bias due to immortal time is high without appropriate study design considerations (i.e., cloning) [7].

Here, we design a target trial [2,4,8,9] with a protocol as similar as possible to the B-31 and N9831 trials [1012], which estimated the effect of adjuvant trastuzumab plus chemotherapy compared with chemotherapy alone on disease-free and overall survival in individuals with early human epidermal growth factor receptor 2 (HER2)-positive breast cancer. We then emulate the target trial to understand if we can undertake an observational analysis that successfully replicates estimates of the effect of trastuzumab on both overall and disease-free survival using the Swedish National Quality Register for Breast Cancer and the routinely collected data from the Swedish population-based registers.

Methods

Index randomized trials: B-31 and N9831

The protocol used for benchmarking is outlined in Table 1. It essentially merges the B‑31 and N9831 trials (S1 Table), as they were similar and analyzed in a joint analysis approved by the Food and Drug Administration and the National Cancer Institute before any data cut-off [1014]. Both trials were open-label, multicenter, randomized, controlled clinical trials in individuals with early HER2-positive breast cancer who underwent surgery for complete resection of the primary tumor (Table 1). Participants in both studies were randomly assigned to receive either an initial anthracycline-based combination chemotherapy for 12 weeks followed by a taxane-based chemotherapy plus trastuzumab, or the same chemotherapy regimen without trastuzumab. The primary endpoint was the composite of disease-free survival consisting of the earliest of (1) local or regional recurrences, (2) distant recurrences, (3) contralateral breast cancer, (4) other second primary cancer, or (5) death from any cause. Secondary endpoints were the individual components of the composite outcome.

thumbnail
Table 1. Summary of protocol used for benchmarking, the target trial and its emulation to compare trastuzumab plus chemotherapy with chemotherapy, NKBC and seven further Swedish registers, 2008–2015.

https://doi.org/10.1371/journal.pmed.1004661.t001

After approximately five years in each study (B-31: February 2000 to February 2005; N9831: May 2000 to March 2005), an intention-to-treat analysis was performed [12]. A total of 3,676 individuals were randomized, of whom 3,351 had at least one follow-up examination. 1,672 individuals were assigned to the trastuzumab plus chemotherapy arm (B-31: 864; N9831: 808) and 1,679 to the chemotherapy only arm (B-31: 872; N9831: 807). The 5-year disease-free survival proportion was 92.0% (1539/1672) in the trastuzumab group and 84.5% (1418/1679) in the no trastuzumab group (hazard ratio [HR]: 0.48, 95% confidence interval [CI] [0.39, 0.59]). Overall survival proportion at 5 years were 96.3% (1610/1672) and 94.5% (1587/1679), respectively (HR: 0.67, 95% CI [0.48, 0.93]).

The observational analysis

Target trial protocol

To compare the index trial with an observational analysis that attempts to answer the same clinical question, we first specified the protocol of a target trial, with deviations from the index trial protocol only when the observational data did not correspond to the information collected in the trial (Table 1). We outline the main differences herein.

The target trial would not exclude individuals with a life expectancy of less than 10 years, and relevant eligibility criteria would be identified using prior clinical events and drug use rather than biomarker levels (e.g., left ventricular ejection fraction [LVEF]). The treatment strategies would be: (1) initiation of trastuzumab within 12 weeks (plus a 2-week grace period to consider delays in treatment) after receiving chemotherapy, and (2) no initiation of trastuzumab over follow-up while receiving the same chemotherapy regime as in the intervention arm. The type or substances of chemotherapy would not be restricted. Based on previous studies which found interchangeable effects [1517], we would also not restrict the order of chemotherapy administration (e.g., anthracyclines or taxanes first). The outcomes would be the same as in the B-31 and N9831 trials, and everyone would be followed from the time of assignment (i.e., day 1 of first chemotherapy received) until the earliest of the outcome of interest, loss to follow-up, or five years. Since only 1% of individuals declined chemotherapy after randomization in both the B-31 and N9831trial [12], the time zero in our target trial would be the same as that in the trials. The causal contrasts would be the intention-to-treat and per-protocol effect.

The 5-year survival in each arm for each endpoint would be estimated parametrically using a pooled logistic regression model with an indicator for assigned strategy, time (as linear and quadratic terms), and a product term between the assignment indicator and time. Prognostic factors unbalanced between groups would be used to estimate inverse probability weights if necessary (truncated at their 99th percentile to prevent an influence of outliers). The intention-to-treat analysis would estimate the effect of being assigned to trastuzumab plus chemotherapy versus chemotherapy on the outcome of interest. The per-protocol effect would estimate the effect of receiving and adhering to the assigned strategy of either trastuzumab plus chemotherapy or chemotherapy on the outcomes of interest. To estimate the per-protocol effect, individuals would be censored when they deviate from their assigned treatment strategy, and inverse probability weighting would be used to adjust for baseline and time-varying prognostic factors associated with non-adherence. For both analyses, risks would then be estimated using one minus survival and contrasted to calculate risk differences (RDs) and risk ratios (RRs). Ideally, we would compare the RDs and RRs at five years from the observational analyses with those from the trials. However, the main effect estimate in the trials was the HR, which can greatly vary between studies due to differences in the length of follow-up and the distribution of censoring [18]. Therefore, instead of comparing the effect measures quantitatively, the focus would be on qualitatively evaluating the direction of effects and overall conclusions to decide whether benchmarking was successful.

The observational emulation of the target trial

We used data from the Swedish National Quality Registry for Breast Cancer to emulate the target trial. The Breast Cancer registry was launched in 2008 to facilitate nationwide collection of data for all primary invasive and in situ breast cancer cases across Sweden [19]. It comprises data on pre-defined diagnostics, therapeutic procedures, and death during follow-up. Individual healthcare providers have the responsibility of reporting to the Registry. Further monitoring of these data is performed by the six Regional Cancer Centers in Sweden [20]. To obtain information on disease recurrence, comorbid conditions, drug use, social, and emigration status, we linked the Breast Cancer Quality Registry to seven nationwide registers for which reporting is regulated by Swedish laws: the National Cancer Register, the National Patient Register (i.e., inpatient and outpatient), the National Cause of Death Register, the National Prescribed Drug Register; the Longitudinal Integration Database for Health Insurance and Labor Market Studies (LISA), and the Total Population Register [2129]. The analyses are thus based on a total of linked eight registers.

Each component of the target trial protocol was emulated as closely as possible using the above data (see Table 1 for details). Active contraindications were operationalized as any code suggestive of the contraindication within 90 days before baseline. Outcomes were defined as follows (see also S3 Table): local recurrence of breast cancer was identified using surgical procedure codes (rather than a code for diagnosis of local or regional recurrence) from the Inpatient Register; distant recurrences using codes for secondary cancers from the Inpatient and Outpatient Register; contralateral breast cancer from the Breast Cancer Quality Registry; other second primary cancers using codes for malignancies except breast cancer and secondary cancers in the National Cancer Register; and death from the Cause of Death Register. Chemotherapy treatment was identified through the Breast Cancer Quality Registry, in which information was available on whether any type of chemotherapy was administered and if it contained anthracyclines. The start date of chemotherapy was captured only once, when the first substance type was initiated. We, therefore, made the assumption that trastuzumab was only administered concomitant to taxanes, which was based on expert opinion regarding normal practice in Sweden. There were missing data for start date of chemotherapy for 20% of individuals. Imputation was used to impute the time from primary surgery to this date using a Bayesian linear regression model with age, number of additional surgeries, and death as predictors. Data on start date of trastuzumab was complete among individuals who initiated trastuzumab.

As each treatment strategy was not distinguishable at baseline in the observational data, the 3-step clone, censor, and weighting procedure was used [30,31]. The first step was to clone each individual if their data were compatible with both treatment strategies at baseline. Then, when individuals deviated from their assigned strategy, they were censored. That is, in the trastuzumab plus chemotherapy arm individuals were censored at week 14 (i.e., week 12 plus 2 weeks grace period) if they had not initiated trastuzumab; in the chemotherapy only arm individuals were censored at the week they initiated trastuzumab. Time-varying inverse probability weights were then used to account for potential baseline and time-varying confounding and selection bias due to informative censoring. Further details on our modeling approach are in S1 Data. The baseline variables were: age, marital status, family income, menopausal status, year of diagnosis, size of tumor at diagnosis, lymph node involvement, estrogen receptor status, progesterone receptor status, size of the largest tumor at surgery, histological grade, number of tumors in breast, and number of visits to healthcare professional within 5 years prior to baseline. The time-varying variables were: drug intake against cardiac disease, mild to moderate cardiac disease, severe cardiac disease, liver function disorder, renal function disorder, dyspnea, hematological events (e.g., neutropenia), severe infections, neurologic disorders, and gastrointestinal disorders. Time-varying variables were modeled using the week of occurrence and the subsequent week. S4 Table shows full variable definitions and operationalizations. S1 Appendix shows the coefficients from the pooled logistic regression model for the denominator of the inverse probability of treatment weights. Nonparametric bootstrapping with 500 samples was used to calculate 95% confidence intervals.

Sensitivity analyses

We performed three sensitivity analyses. First, we restricted analyses to individuals who initiated anthracyclines at baseline and who received trastuzumab between weeks 8 and 16. Second, we only included individuals with a known date for start of chemotherapy (later referred to as complete case analysis). Third, we estimated the effect on survival specifically from breast cancer to understand if non-breast cancer deaths led to a distortion of results. All analyses were conducted in R version 4.3.1.

This study was approved by the Regional Ethical Review Board in Stockholm (2011/634-31, 2019-05224). As the data were register-based and pseudonymized, the Ethical Review Board waived the requirement for participant consent. The reporting follows the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guideline [32] (S1 Checklist).

Results

Fig 1 depicts a flowchart of patient selection for the target trial emulation. The baseline characteristics of all 1,578 eligible individuals are shown in Table 2. All but 308 individuals were assigned to both strategies at baseline (i.e., they were only given chemotherapy, and it was not clear if they would go on to be given trastuzumab). Compared with those in the B-31 and N9831 trials, individuals in the target trial emulation were older, but slightly fewer had a tumor larger than 5 cm in diameter. For the overall survival analysis, there were 159,972 and 61,733 person-weeks of follow-up in the trastuzumab plus chemotherapy and chemotherapy arm, respectively.

thumbnail
Table 2. Baseline characteristics of individuals included in the emulation of a target trial comparing trastuzumab plus chemotherapy with chemotherapy, NKBC, and seven further Swedish registers, 2008–2015.

https://doi.org/10.1371/journal.pmed.1004661.t002

thumbnail
Fig 1. Flowchart of patient selection into the emulation of a target trial comparing trastuzumab plus chemotherapy with chemotherapy, NKBC, and seven further Swedish registers, 2008-2015.

https://doi.org/10.1371/journal.pmed.1004661.g001

Table 3 shows the estimated survival, RDs, and RRs adjusted for baseline and time-varying variables, and Figs 2, S1, and S2 show the inverse probability weighted survival curves for all outcomes. For the primary analysis 74.7% (95% CI [70.7%, 78.5%]) were disease-free and alive at year 5 in the trastuzumab plus chemotherapy group and 53.5% (95% CI [46.6%, 60.3%]) in the chemotherapy group, resulting in an RD of −21.2% (95% CI [−29.2%, −13.8%]) and an RR of 0.54 (95% CI [0.44, 0.67]). S5 Table shows the distribution of final inverse probability weights.

thumbnail
Table 3. Survival, risk differences, and risk ratios at 5 years from baseline estimated in the observational emulation of a target trial comparing trastuzumab plus chemotherapy with chemotherapy, NKBC, and seven further Swedish registers, 2008–2015 (main analysis based on 1578 individuals).

https://doi.org/10.1371/journal.pmed.1004661.t003

thumbnail
Fig 2. Inverse-probability weighted survival curve of disease-free survival to compare adjuvant chemotherapy + trastuzumab with chemotherapy from an observational emulation of a target trial, NKBC, and seven further Swedish registers, 2008–2015 (main analysis based on 1,578 individuals).

https://doi.org/10.1371/journal.pmed.1004661.g002

Results for the individual components of disease-free survival were as follows. Overall survival at 5 years was 90.7% (95% CI [88.1%, 93.2%) in the trastuzumab plus chemotherapy group and 67.5% (95% CI [60.1%, 74.0%]) in the chemotherapy group, resulting in an RD of −23.2% (95% CI [−30.8%, −16.0%]) and an RR of 0.29 (95% CI [0.20, 0.40]). The 5-year distant recurrence free survival was 80.7% (95% CI [76.9%, 84.1%]) in the trastuzumab plus chemotherapy group and 73.8% (95% CI [68.2%, 79.0%]) in the chemotherapy group, resulting in an RD of −6.9% (95% CI [−13.6%, −0.9%]) and an RR of 0.74 (95% CI [0.56, 0.96]). There were few events for the remaining components of disease-free survival (local recurrence, contralateral breast cancer, other second primary cancer). All results from sensitivity analyses were similar to the main analysis (see S6, S7, and S8 Tables, and S3, S4, S5, S6, S7, and S8 Figs).

Discussion

We used routinely collected data from Swedish population-based registers to emulate a target trial that asked a similar clinical question to the B-31 and N9831 randomized trials. Both the trials and observational analysis estimated similar effects of trastuzumab on disease-free survival and overall survival in women with HER2-positive breast cancer, i.e., a clear increase in survival. Data from both the trials and observational analyses, therefore, support the recommendation of trastuzumab.

Symmetry of conclusions between our observational analysis and the index trials, i.e., successful benchmarking, increases confidence in our observational analysis. However, this does not mean there are no differences between the studies. In fact, several differences affecting estimates in different directions may average out creating a false impression of successful benchmarking. Therefore, we will consider important differences and their potential impact.

A central challenge was to define the individual cancer-specific components of the disease-free survival outcome in the available routinely collected data using definitions similar to the trials. Consider the following two examples: (1) we defined distant recurrence using International Classification of Diseases, Tenth Revision, Tenth Revision (ICD-10) codes that capture secondary cancers, which, in fact, could be a local, regional or distant recurrence; and (2) we defined local recurrences using surgical procedure codes, meaning a local recurrence that was not removed is missed. As there are several unavoidable misclassification problems with these definitions of local and distant recurrences, the survival estimates for trastuzumab of these specific outcomes are difficult to interpret. However, for the composite outcome of disease-free survival, the effect of this misclassification is less problematic. This is because misclassification occurs between the individual components (i.e., capturing local recurrence instead of distant recurrence) or a missing diagnosis for one component could be captured in another component (i.e., missing local recurrences can be captured within the distant recurrence definition). However, the absolute risks for disease-free survival may be lower if any diagnoses of the cancer-specific components are not recorded in the registers.

A differential distribution of treatment effect modifiers between the trial and observational analysis could also be a reason for different estimated effects. In our observational analysis, individuals were, on average, older than those in the trials, and the proportion of individuals with lymph node involvement was not comparable between studies. However, it is unlikely that this contributes to a meaningful difference between effect estimates for disease-free survival as it has been suggested that the effect of trastuzumab on recurrence risk does not differ by age or nodal status [33]. Notwithstanding, an older age is often accompanied by further comorbidities, which we might not have captured in our analyses, and could lead to a poorer response to treatment and/or more treatment-unrelated deaths than in the randomized trials, explaining the more pronounced absolute risk estimates for overall survival.

As with all observational analyses, there is a risk of residual and unmeasured confounding. Our analysis required accounting for both baseline reasons individuals would or would not receive trastuzumab, and post-baseline factors associated with adhering to the assigned treatment strategy (i.e., whether or not an individual initiated trastuzumab over follow-up). We relied on diagnoses of clinical events and drug use to account for this potential confounding. Ideally, we would also use information on biomarkers (e.g., laboratory or radiology parameters), as changes in these recordings could be a reason someone would not receive trastuzumab, but such test results are only routinely captured in the register data for selected conditions. Also, we would need information on treatment discontinuation and dose modifications, which is not captured in the registers. However, if we consider the null result for other second primary cancers as a negative control to assess the impact of potential unmeasured factors (i.e., confounding), there is enhanced certainty that cancer-specific outcomes are not affected.

One notable result from our observational analysis is the large estimated effect of trastuzumab on overall survival, which is considerably larger than the effect estimated in the trials. This difference could be a consequence of unmeasured confounding in the observational analysis, i.e., there were factors associated with overall survival (but not particularly associated with disease-free survival) not captured in the data, such as frailty, that were also reasons someone was withheld trastuzumab. However, there were very few non-breast cancer-related deaths, and the estimated effect on breast cancer-specific survival is similar to the effect on overall survival. So, it is likely that the effect estimates for overall survival reflect a difference in deaths due to the breast cancer.

Another factor contributing to the dissimilarity in the effect estimate for overall survival may be the estimation of the analog of the per-protocol effect in the observational analysis and the estimation of the intention-to-treat effect in the trials. If individuals assigned to the chemotherapy arm in the trials switched to additionally receive trastuzumab, this could lead to an attenuation of the effect estimate on overall survival in an intention-to-treat analysis. It is then plausible that the effect estimate derived from a per-protocol analysis would be larger than that obtained from an intention-to-treat analysis. As the trials did not conduct a per-protocol analysis, the comparison of effect estimates from the observational analysis and the trials should focus more on the direction and the definitiveness of the superiority of trastuzumab plus chemotherapy irrespective of the data source.

A limitation of the observational analyses was that there was 20% missing data for the start date of chemotherapy. To overcome this, we imputed the start date for those with missing data. Although such imputation relies on the untestable missing at random assumption, we also carried out a sensitivity analysis in which we excluded all individuals with missing data, which returned similar results. The consistency between these two approaches to deal with missing data enhances the certainty in our results.

Of note, the person-weeks of follow-up was higher in the trastuzumab plus chemotherapy arm compared with the chemotherapy arm. This is a consequence of more individuals receiving trastuzumab plus chemotherapy than chemotherapy alone (i.e., at week 15, there were 1,050 individuals in the trastuzumab plus chemotherapy arm and 518 in the chemotherapy only arm) and the survival being considerably higher in the trastuzumab group. But this is not expected to bias effect estimates, particularly because we adjust for time-varying factors related to censoring.

If investigators using routinely collected data can successfully benchmark analyses that estimate the effect of cancer treatments on both overall and disease-free survival against a randomized trial, it would increase confidence in the ability of these data to ask questions that could not be answered in the initial trial. For example, routinely collected data can provide evidence more relevant to everyday clinical practice situations. Individuals who are regularly underrepresented or excluded from randomized trials often represent a majority of affected individuals [34,35]; routinely collected data, therefore, provide a unique opportunity to explore heterogeneity in treatment effectiveness across subgroups.

We successfully benchmarked an observational analysis of trastuzumab on disease-free and overall survival against the B-31 and N9831 trials. By aligning protocols and using appropriate methodological approaches, we show that observational data can be used to estimate similar results as randomized trials of cancer treatments like trastuzumab. This opens the door to using observational data to complement results from randomized trials of cancer treatments which can provide quick, cheap, and robust evidence to support decision-making where trials leave evidence gaps.

Supporting information

S1 Table. Summary of selected protocol components of the RCTs B-31 and N9831 based on ClinicalTrials.gov last updated on 2021-04-29 (B-31) and 2020-08-14 (N9831).

https://doi.org/10.1371/journal.pmed.1004661.s002

(DOCX)

S2 Table. Operationalizations of eligibility criteria identified using codes in the emulation of a target trial comparing trastuzumab plus chemotherapy with chemotherapy, NKBC and seven further Swedish registers, 2008–2015.

https://doi.org/10.1371/journal.pmed.1004661.s003

(DOCX)

S3 Table. Overview of research aims, endpoints and operationalizations in the emulation of a target trial comparing trastuzumab plus chemotherapy with chemotherapy, NKBC and seven further Swedish registers, 2008–2015.

https://doi.org/10.1371/journal.pmed.1004661.s004

(DOCX)

S4 Table. Definitions and operationalizations of covariates used for all analyses in the emulation of a target trial comparing trastuzumab plus chemotherapy with chemotherapy, NKBC and seven further Swedish registers, 2008–2015.

https://doi.org/10.1371/journal.pmed.1004661.s005

(DOCX)

S5 Table. Final inverse probability of treatment weights truncated at the 99th percentile in the emulation of a target trial comparing trastuzumab plus chemotherapy with chemotherapy (main analysis), NKBC and seven further Swedish registers, 2008–2015.

https://doi.org/10.1371/journal.pmed.1004661.s006

(DOCX)

S6 Table. Survival, risk differences, and risk ratios at 5 years from baseline estimated in the observational emulation of a target trial comparing trastuzumab plus chemotherapy with chemotherapy, NKBC and seven further Swedish registers, 2008–2015 (complete case analysis based on 1,327 individuals).

https://doi.org/10.1371/journal.pmed.1004661.s007

(DOCX)

S7 Table. Survival, risk differences, and risk ratios at 5 years from baseline estimated in the observational emulation of a target trial comparing trastuzumab plus chemotherapy with chemotherapy, NKBC and seven further Swedish registers, 2008–2015 (sensitivity analysis restricted to individuals who initiated anthracyclines at baseline based on 940 individuals).

https://doi.org/10.1371/journal.pmed.1004661.s008

(DOCX)

S8 Table. Survival from breast cancer, risk differences, and risk ratios at 5 years from baseline estimated in the observational emulation of a target trial comparing trastuzumab plus chemotherapy with chemotherapy, NKBC and seven further Swedish registers, 2008–2015 (main analysis based on 1,578 individuals).

https://doi.org/10.1371/journal.pmed.1004661.s009

(DOCX)

S1 Fig. Inverse-probability weighted survival curve of overall survival to compare adjuvant chemotherapy + trastuzumab with chemotherapy from an observational emulation of a target trial, NKBC and seven further Swedish registers, 2008–2015 (main analysis based on 1,578 individuals).

https://doi.org/10.1371/journal.pmed.1004661.s010

(EPS)

S2 Fig. Inverse-probability weighted survival curves of the single recurrence components of the composite endpoint disease-free survival to compare adjuvant chemotherapy trastuzumab with chemotherapy from an observational emulation of a target trial, NKBC and seven further Swedish registers, 2008–2015 (main analysis based on 1,578 individuals).

(A) Distant recurrence. (B) Local recurrence. (C) Contralateral breast cancer. (D) Other second primary cancer.

https://doi.org/10.1371/journal.pmed.1004661.s011

(EPS)

S3 Fig. Inverse-probability weighted survival curve of disease-free survival to compare adjuvant chemotherapy + trastuzumab with chemotherapy from an observational emulation of a target trial, NKBC and seven further Swedish registers, 2008–2015 (complete case analysis based on 1,327 individuals).

https://doi.org/10.1371/journal.pmed.1004661.s012

(EPS)

S4 Fig. Inverse-probability weighted survival curve of overall survival to compare adjuvant chemotherapy + trastuzumab with chemotherapy from an observational emulation of a target trial, NKBC and seven further Swedish registers, 2008–2015 (complete case analysis based on 1,327 individuals).

https://doi.org/10.1371/journal.pmed.1004661.s013

(EPS)

S5 Fig. Inverse-probability weighted survival curves of the single recurrence components of the composite endpoint disease-free survival to compare adjuvant chemotherapy + trastuzumab with chemotherapy from an observational emulation of a target trial, NKBC and seven further Swedish registers, 2008–2015 (complete case analysis based on 1,327 individuals).

(A) Distant recurrence. (B) Local recurrence. (C) Contralateral breast cancer. (D) Other second primary cancer. a. non-informative due to low event numbers in the chemotherapy arm.

https://doi.org/10.1371/journal.pmed.1004661.s014

(EPS)

S6 Fig. Inverse-probability weighted survival curve of disease-free survival to compare adjuvant chemotherapy + trastuzumab with chemotherapy from an observational emulation of a target trial, NKBC and seven further Swedish registers, 2008–2015 (sensitivity analysis restricted to individuals who initiated anthracyclines at baseline based on 940 individuals).

https://doi.org/10.1371/journal.pmed.1004661.s015

(EPS)

S7 Fig. Inverse-probability weighted survival curve of overall survival to compare adjuvant chemotherapy + trastuzumab with chemotherapy from an observational emulation of a target trial, NKBC and seven further Swedish registers, 2008–2015 (sensitivity analysis restricted to individuals who initiated anthracyclines at baseline based on 940 individuals).

https://doi.org/10.1371/journal.pmed.1004661.s016

(EPS)

S8 Fig. Inverse-probability weighted survival curves of the single recurrence components of the composite endpoint disease-free survival to compare adjuvant chemotherapy + trastuzumab with chemotherapy from an observational emulation of a target trial, NKBC and seven further Swedish registers, 2008–2015 (sensitivity analysis restricted to individuals who initiated anthracyclines at baseline based on 940 individuals).

(A) Distant recurrence. (B) Local recurrence. (C) Contralateral breast cancer. (D) Other second primary cancer. a. non-informative due to low event numbers in the chemotherapy arm.

https://doi.org/10.1371/journal.pmed.1004661.s017

(EPS)

S1 Appendix. Coefficients from pooled logistic regression model for the denominator of the inverse probability of treatment weights (main analysis).

https://doi.org/10.1371/journal.pmed.1004661.s018

(DOCX)

S1 Checklist. STROBE Statement checklist of items that should be included in reports of observational studies.

https://doi.org/10.1371/journal.pmed.1004661.s019

(PDF)

Acknowledgments

We thank Mats Talbäck for his aid in data management.

The information reported in the manuscript was previously presented at the 2024 Conference of the Society for Epidemiologic Research.

References

  1. 1. Dahabreh IJ, Robins JM, Hernán MA. Benchmarking observational methods by comparing randomized trials and their emulations. Epidemiology. 2020;31(5):614–9. pmid:32740470
  2. 2. Matthews AA, Szummer K, Dahabreh IJ, Lindahl B, Erlinge D, Feychting M, et al. Comparing effect estimates in randomized trials and observational studies from the same population: an application to percutaneous coronary intervention. J Am Heart Assoc. 2021;10(11):e020357. pmid:33998290
  3. 3. Lodi S, Phillips A, Lundgren J, Logan R, Sharma S, Cole SR, et al. Effect estimates in randomized trials and observational studies: comparing apples with apples. Am J Epidemiol. 2019;188(8):1569–77. pmid:31063192
  4. 4. Matthews AA, Dahabreh IJ, Fröbert O, Lindahl B, James S, Feychting M, et al. Benchmarking observational analyses before using them to address questions trials do not answer: an application to coronary thrombus aspiration. Am J Epidemiol. 2022;191(9):1652–65. pmid:35641151
  5. 5. Boyne DJ, Cheung WY, Hilsden RJ, Sajobi TT, Batra A, Friedenreich CM, et al. Association of a shortened duration of adjuvant chemotherapy with overall survival among individuals with stage III colon cancer. JAMA Netw Open. 2021;4(3):e213587. pmid:33783516
  6. 6. Bacic J, Liu T, Thompson RH, Boorjian SA, Leibovich BC, Golijanin D, et al. Emulating target clinical trials of radical nephrectomy with or without lymph node dissection for renal cell carcinoma. Urology. 2020;140:98–106. pmid:32142726
  7. 7. Garcia-Albeniz X, Stattin P, Westerberg M, Weinrib R, Sandstrom P, Vassilev Z. Design considerations for observational studies of drugs for advanced cancer. Pharmacoepidemiol Drug Saf. 2023;32(7):812–5. pmid:36920321
  8. 8. Hernán MA. A good deal of humility: cochran on observational studies. Obs Stud. 2015;2015(7):194–5. pmid:26688836
  9. 9. Hernán MA, Robins JM. Using big data to emulate a target trial when a randomized trial is not available. Am J Epidemiol. 2016;183(8):758–64. pmid:26994063
  10. 10. National Surgical Adjuvant Breast and Bowel Project (NSABP) Foundation. Doxorubicin and cyclophosphamide plus paclitaxel with or without trastuzumab in treating women with node-positive breast cancer that overexpresses HER2. 2021 (updated 29 April) [Internet] [cited 4 Nov 2023]. Available from: https://clinicaltrials.gov/study/NCT00004067
  11. 11. North Central Cancer Treatment Group (NCCTG). Doxorubicin Hydrochloride, Cyclophosphamide, and Pacltaxel With or Without Trastuzumab in Treating Women With HER2-Positive Node-Positive or High-Risk Node-Negative Breast Cancer. 2020 (updated 14 Aug) [Internet] [cited 4 Nov 2023. ]. Available from: https://clinicaltrials.gov/study/NCT00005970
  12. 12. Romond EH, Perez EA, Bryant J, Suman VJ, Geyer CE Jr, Davidson NE, et al. Trastuzumab plus adjuvant chemotherapy for operable HER2-positive breast cancer. N Engl J Med. 2005;353(16):1673–84. pmid:16236738
  13. 13. Perez EA, Romond EH, Suman VJ, Jeong J-H, Davidson NE, Geyer CE Jr, et al. Four-year follow-up of trastuzumab plus adjuvant chemotherapy for operable human epidermal growth factor receptor 2-positive breast cancer: joint analysis of data from NCCTG N9831 and NSABP B-31. J Clin Oncol. 2011;29(25):3366–73. pmid:21768458
  14. 14. Perez EA, Romond EH, Suman VJ, Jeong J-H, Sledge G, Geyer CE Jr, et al. Trastuzumab plus adjuvant chemotherapy for human epidermal growth factor receptor 2-positive breast cancer: planned joint analysis of overall survival from NSABP B-31 and NCCTG N9831. J Clin Oncol. 2014;32(33):3744–52. pmid:25332249
  15. 15. Bines J, Earl H, Buzaid AC, Saad ED. Anthracyclines and taxanes in the neo/adjuvant treatment of breast cancer: does the sequence matter?. Ann Oncol. 2014;25(6):1079–85. pmid:24625452
  16. 16. Early Breast Cancer Trialists’ Collaborative Group (EBCTCG). Anthracycline-containing and taxane-containing chemotherapy for early-stage operable breast cancer: a patient-level meta-analysis of 100 000 women from 86 randomised trials. Lancet. 2023;401(10384):1277–92. pmid:37061269
  17. 17. Zaheed M, Wilcken N, Willson ML, O’Connell DL, Goodwin A. Sequencing of anthracyclines and taxanes in neoadjuvant and adjuvant therapy for early breast cancer. Cochrane Database Syst Rev. 2019;2(2):CD012873. pmid:30776132
  18. 18. Hernán MA. The hazards of hazard ratios. Epidemiology. 2010;21(1):13–5. pmid:20010207
  19. 19. Stockholm Gotland Regional Cancer Center. National Quality Registry for Breast Cancer (NKBC) [Nationellt Kvalitetsregister för bröstcancer]. 2025 (updated 5 May) [Internet] [cited 31 Oct 2022]. Available from: https://cancercentrum.se/diagnosbehandling/cancerdiagnoser/brost/kvalitetsregister.7359.html
  20. 20. Löfgren L, Eloranta S, Krawiec K, Asterkvist A, Lönnqvist C, Sandelin K, et al. Validation of data quality in the Swedish National Register for Breast Cancer. BMC Public Health. 2019;19(1):495. pmid:31046737
  21. 21. Brooke HL, Talbäck M, Hörnblad J, Johansson LA, Ludvigsson JF, Druid H, et al. The Swedish cause of death register. Eur J Epidemiol. 2017;32(9):765–73. pmid:28983736
  22. 22. Ludvigsson JF, Almqvist C, Bonamy A-KE, Ljung R, Michaëlsson K, Neovius M, et al. Registers of the Swedish total population and their use in medical research. Eur J Epidemiol. 2016;31(2):125–36. pmid:26769609
  23. 23. Ludvigsson JF, Andersson E, Ekbom A, Feychting M, Kim J-L, Reuterwall C, et al. External review and validation of the Swedish national inpatient register. BMC Public Health. 2011;11:450. pmid:21658213
  24. 24. Ludvigsson JF, Svedberg P, Olén O, Bruze G, Neovius M. The longitudinal integrated database for health insurance and labour market studies (LISA) and its use in medical research. Eur J Epidemiol. 2019;34(4):423–37. pmid:30929112
  25. 25. Socialstyrelsen. National Cancer Register. 2022 (updated 14 Jan) [Internet] [cited 8 Nov 2023]. Available from: https://www.socialstyrelsen.se/en/statistics-and-data/registers/national-cancer-register/
  26. 26. Socialstyrelsen. National Patient Register. 2023 (updated 4 Dec) [Internet] [cited 26 May 2024. ]. Available from: https://www.socialstyrelsen.se/en/statistics-and-data/registers/national-patient-register/
  27. 27. Socialstyrelsen. National Prescribed Drug Register. 2024 (updated 8 Feb) [Internet] [cited 7 Nov 2024]. Available from: https://www.socialstyrelsen.se/en/statistics-and-data/registers/national-prescribed-drug-register/
  28. 28. Socialstyrelsen. National Cause of Death Register. 2024 (updated 9 Feb) [Internet] [cited 7 Nov 2024]. Available from: https://www.socialstyrelsen.se/en/statistics-and-data/registers/national-cause-of-death--register/
  29. 29. Wettermark B, Hammar N, Fored CM, Leimanis A, Otterblad Olausson P, Bergman U, et al. The new Swedish Prescribed Drug Register—opportunities for pharmacoepidemiological research and experience from the first six months. Pharmacoepidemiol Drug Saf. 2007;16(7):726–35. pmid:16897791
  30. 30. Cain LE, Robins JM, Lanoy E, Logan R, Costagliola D, Hernán MA. When to start treatment? A systematic approach to the comparison of dynamic regimes using observational data. Int J Biostat. 2010;6(2):Article 18. pmid:21972433
  31. 31. Hernán MA. How to estimate the effect of treatment duration on survival outcomes using observational data. BMJ. 2018;360:k182. pmid:29419381
  32. 32. von Elm E, Altman DG, Egger M, Pocock SJ, Gøtzsche PC, Vandenbroucke JP, et al. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies. PLoS Med. 2007;4(10):e296. pmid:17941714
  33. 33. Early Breast Cancer Trialists’ Collaborative group (EBCTCG). Trastuzumab for early-stage, HER2-positive breast cancer: a meta-analysis of 13 864 women in seven randomised trials. Lancet Oncol. 2021;22(8):1139–50. pmid:34339645
  34. 34. He J, Morales DR, Guthrie B. Exclusion rates in randomized controlled trials of treatments for physical conditions: a systematic review. Trials. 2020;21(1):228. pmid:32102686
  35. 35. U.S. Food & Drug Administration (FDA). Public workshop: evaluating inclusion and exclusion criteria in clinical trials—workshop report. 2018 (updated 16 April) [Internet] [cited 26 May 2024]. Available from: https://www.fda.gov/media/134754/download