Figures
Abstract
Background
Reverse causation is a challenge in many drug-cancer associations, where the cancer symptoms are potentially mistaken for drug indication symptoms. However, tools to assess the magnitude of this type of bias are currently lacking. We used a simulation-based approach to investigate the impact of reverse causation on the association between the use of topical tacrolimus and cutaneous T-cell lymphoma (CTCL) in a multinational, population-based study using topical corticosteroids (TCS) as comparator.
Methods
We used a multistate model to simulate patients’ use over time of a first- (TCS) and second-line treatment (topical tacrolimus), onset of atopic dermatitis (indication for drugs) and CTCL (the studied outcome). We simulated different scenarios to mimic real-life use of the two treatments. In all scenarios, it was assumed that there was no causal effect of the first- or second-line treatment on the occurrence of CTCL. Simulated data were analysed using Cox proportional hazards models.
Results
The simulated hazard ratios (HRs) of CTCL for patients treated with tacrolimus vs. TCS were consistently above 1 in all 9 settings in the main scenario. In our main analysis, we observed a median HR of 3.09 with 95% of the observed values between 2.11 and 4.69.
Conclusions
We found substantial reverse causation bias in the simulated CTCL risk estimates for patients treated with tacrolimus vs. TCS. Reverse causation bias may result in a false positive association between the second-line treatment and the studied outcome, and this simulation-based framework can be adapted to quantify the potential reverse causation bias.
Citation: Øland CB, Ranch LS, Skaaby T, Delvin T, Jakobsen HB, Pipper CB (2024) Reverse causation bias: A simulation study comparing first- and second-line treatments with an overlap of symptoms between treatment indication and studied outcome. PLoS ONE 19(7): e0304145. https://doi.org/10.1371/journal.pone.0304145
Editor: Hideo Kato, Mie University Hospital: Mie Daigaku Igakubu Fuzoku Byoin, JAPAN
Received: February 14, 2024; Accepted: May 7, 2024; Published: July 12, 2024
Copyright: © 2024 Øland et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: This study was financially supported by LEO Pharma (https://www.leo-pharma.com/) in the form of salary for all authors and covering the cost associated with running the simulations. The specific roles of all authors are articulated in the ‘author contributions’ section. No additional external funding was received for this study. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: All authors are or have been employees of LEO Pharma A/S and may be shareholders of LEO Pharma A/S. LEO Pharma A/S owns and sells topical tacrolimus/Protopic®.
Introduction
Reverse causation bias, also known as protopathic bias, arises in a drug-disease association if the drug under study is prescribed for symptoms of the outcome under study before this is correctly diagnosed. Reverse causation bias is a challenge in many drug-cancer associations, where the cancer symptoms are mistaken for drug indication symptoms, e.g., in the association of alpha-adrenoreceptor antagonists and prostate cancer;[1] respiratory drugs and lung cancer;[2] and proton pump inhibitors and colon cancer [1]. Studies of these may show a positive association between the drug and the specific cancer, even in the absence of a true association.
Another example is the association between topical tacrolimus, an ointment used to treat atopic dermatitis (AD), and cutaneous T-cell lymphoma (CTCL) [3]. AD is the most common type of eczema and is approximately 2,000 times more common than CTCL [4–6]. Since AD and CTCL may present similar symptoms, CTCL may initially be misdiagnosed as atopic dermatitis (AD) [3]. The first-line treatment for AD is topical corticosteroids (TCS) and the first-line treatment in early-stage CTCL is high-potency TCS [7]. If TCS cannot control the AD symptoms, second-line treatments (for example, topical tacrolimus) may be prescribed. In some cases, the delay from the first symptoms to the correct CTCL diagnosis may be years (for Mycosis fungoides, the most common type of CTCL, the median time from symptom onset to diagnosis was 3 to 4 years) [8, 9], during which the cutaneous symptoms are frequently treated as other skin conditions (for example, AD). Misclassification of CTCL as AD could give a false positive association between topical tacrolimus (second-line treatment) and CTCL [3], because patients with undiagnosed CTCL, misclassified as AD, are more likely to fail first-line treatment and receive second-line treatment for AD. This pattern could emerge in any comparison of first- and second-line treatments where there is an overlap in symptoms between indication and the outcome under study. Receiving the second-line treatment will appear associated with the outcome under study.
Reverse causation bias can sometimes be minimized by including a time-lag in the analysis where the cancer events that occur in a specified period after initiating medication (usually 1 or 2 years) are ignored or reclassified [2]. However, for diseases with overlapping symptoms where the time to correct diagnosis is longer than the commonly used lag time, this approach is not likely to solve the problem. On one hand, using a lag time shorter than the time to correct diagnosis, the approach is unlikely to sufficiently remove the reverse causation bias; on the other hand, extending the lag time even further increases the risk of removing any true drug-cancer association.
To our knowledge, the impact of reverse causation bias in studies comparing a second-line treatment to a first-line treatment with overlap in symptoms between the indication for the drug and the studied outcome has not been investigated. To investigate the potential impact of reverse causation bias in the association between the use of topical tacrolimus (second-line treatment) and CTCL in a large, multinational, population-based study [10], we used a simulation-based approach.
Methods
The JOELLE study
The Protopic® Joint European Longitudinal Lymphoma and Skin Cancer Evaluation (JOELLE) study was a population-based cohort study conducted in Denmark, Sweden, the Netherlands, and the United Kingdom to evaluate the risk of skin cancer and lymphoma associated with use of tacrolimus ointment [10]. This simulation study uses data from the JOELLE tacrolimus ointment cohorts (adults and children) and its matched TCS cohorts. The JOELLE study used electronic health care data from 2002–2017: 32,605 children and 126,908 adult ‘new’ users of tacrolimus ointment matched to 117,592 children and 452,996 adult users of topical corticosteroids. Of the patients using tacrolimus ointment, 19.4% of adults and 31.9% children were followed for more than 10 years, and the median follow-up period was 5.0 years for adults and 5.7 years for children.
The multistate model.
In a survival analysis study, the patient can be simulated with two ‘states’ (alive and dead) and one possible transition (from being alive to dead). In this simulation study, the ‘alive’ state is partitioned into two or more intermediate (transient) states, each of which correspond to a stage of illness. Models linking progression through multiple states, also known as multistate models, can be used to simulate the transition of patients through the various states [11]. We used a multistate approach to simulate the transition of patients through the ‘healthy’, ‘TCS-treated’, ‘tacrolimus-treated’, and ‘CTCL’ states as illustrated in Fig 1.
TCS: Topical corticosteroids; CTCL: cutaneous T-cell lymphoma.
All patients were born as ‘healthy’. The patient later diagnosed with AD was assumed to be treated with either TCS or tacrolimus and moved to either of these states. If treated with TCS initially, the patient could switch treatment and progress to the tacrolimus state but not the opposite: patients stayed in the tacrolimus state until censoring or diagnosed with CTCL. This is in line with the JOELLE study. A diagnosis of CTCL was considered a terminal state from which no transition to other states occurred.
Specification of time to event variables
Age at onset of AD was defined as the time from birth to the time of transition to one of the treatment states. We simulated age at onset of AD according to a distribution with a cumulative risk of AD of 6–7% at 18 years of age and 10–11% at 100 years of age (see Table 1 and S1 Fig for more details).
Features of this distribution were chosen to mimic the anticipated marginal age at onset of AD distribution in the general population from which patients were sampled for the JOELLE study.
Age at onset of CTCL was simulated assuming a constant incidence rate of 6 events per 1,000,000 person-years [12]. Following four million patients for 100 years or until death, whichever came first, we would expect approximately 1,574 CTCL cases. The proportion of CTCL patients initially diagnosed correctly with CTCL was determined by a probability p1 (Table 1).
Age at diagnosis of AD was defined as either the age at onset of AD or age at onset of CTCL. If age at onset of AD was prior to age at onset of CTCL, the patient was diagnosed with AD at age at onset of AD. Conversely, if age at onset of CTCL was prior to age at onset of AD, the patient was either misdiagnosed with AD at age at onset of CTCL with probability 1-p1, or correctly diagnosed with AD at age at onset of AD with probability p1.
The age at diagnosis of CTCL was defined as either the age at onset of CTCL or, the time at which the correct CTCL diagnosis is given if the patient was misdiagnosed with AD at onset of CTCL. In the latter case, the time between AD diagnosis and CTCL diagnosis was simulated as the maximum of two independent draws from the same Weibull distribution: shape parameter was fixed at 0.9, and scale parameter (λ, the time taken to correctly diagnose CTCL) was varied according to different scenarios. In our main setting, λ was set to 3.5 for a median duration of 4.4 years (inter-quartile range: 2.3–5.7 years) until CTCL was correctly diagnosed (S4 Fig).
The follow-up period of the patient mimicked the design of the JOELLE study. A 12-year study period was used: The patient was included in the cohort from a randomly chosen age (0–100 years) or age at AD diagnosis, whichever came last, and then followed for 12 years after the randomly chosen age or until diagnosis of CTCL or death, whichever came first. The distribution of age at death was chosen to reflect that of the general population (Netherlands, UK, Denmark, and Sweden) from which patients were sampled for the JOELLE study (see S2 Fig).
The proportion of patients treated with TCS as first-line treatment was defined as probability p2. Note: 1-p2 is the proportion of patients treated with tacrolimus as first-line treatment.
The proportion of correctly diagnosed AD patients switching to treatment with tacrolimus was defined as probability p3; the proportion of incorrectly diagnosed AD patients switching to treatment with tacrolimus was defined as probability p4. The time from initiation of TCS treatment to switching treatment was simulated in two steps: First step, we randomly selected 40% of the patients (the re-evaluation was assumed to be delayed for the remaining 60%) and simulated their duration using (U4 follows a uniform distribution in the range zero to one) to ensure that the patient had a duration on TCS before switching to tacrolimus below one year. Second step, for the remaining 60% of the patients, the duration was simulated according to an exponential distribution with rate 3 and shifted by 1 year (to ensure simulated durations above 1 year for these patients) (S3 Fig). p3 and p4 determined the magnitude and direction of the reverse causation bias. p3 was low to reflect a good overall response to TCS and was varied to reflect the possibility of different methods of use in the four countries in the JOELLE study. p4 was set lower than p3 to reflect a lower response rate to TCS in patients with CTCL who were misdiagnosed with AD initially.
Simulation specification
Scenarios
In the simulations, the variables p1,p2,p3,p4, and λ (governing the transition through states of the multi-state model) were varied to induce 3 scenarios: main, maximum use, and discovery.
In the main scenario, we assigned values to reflect the JOELLE study [10] (Table 2). It shows the predominant use of TCS as first line treatment (p2 = 0.95) and few true AD patients switching to tacrolimus (p3 = 0.05) from TCS. For misdiagnosed CTCL patients, the risk of switching to tacrolimus is higher (p4 = 0.10; 0.25; 0.40), and the time to correct diagnosis is long (λ = 3.5).
In the maximum use scenario, we investigated a scenario where more patients used tacrolimus (to reflect potential country differences) by varying p2, p3, and p4 (Table 3).
In the discovery scenario, we explored a scenario where misdiagnosed CTCL patients were diagnosed correctly with CTCL faster by varying λ (Table 4).
Settings
We evaluated 9 different settings in the main scenario, 3 in the maximum use scenario, and 2 in the discovery scenario. For each setting, 1,000 replicate AD cohorts were simulated. Each replicate AD cohort was sampled as the subset of patients that were diagnosed with AD prior to or during follow-up from a simulated base population of 4 million patients. The resulting cohort size is approximately equal to the adult cohort in the JOELLE study.
Statistical analyses
Data was simulated and analysed using the open-source statistical software, R (v. 3.6.1) and the add-on packages “mstate” (v. 0.2.12) and “survival” (v. 3.1–8) [13, 14]. Code used for the simulations and their analysis can be found in S1 File, along with the output of each simulation. Statistical analyses were carried out using Cox regression and Poisson regression.
Tacrolimus was categorised according to the ever-use principle in the JOELLE study: [10] once the patient had received tacrolimus, they would add risk-time to the tacrolimus cohort until end of follow-up. If the patient was prescribed TCS before tacrolimus, the patient would add risk time to the TCS cohort until first tacrolimus use. To quantify the magnitude and direction of the potential bias due to reverse causation, cohorts were simulated assuming no causal effect of TCS or tacrolimus on onset of CTCL, analysing only patients with diagnosed AD.
For each simulated cohort, we used Cox regression to analyse time from AD treatment initiation to CTCL (accounting for left truncation and right censoring due to follow-up period or death) and including a time dependent treatment variable in the regression. The resulting hazard ratio (the relative change in hazard rate after start of tacrolimus treatment compared to TCS treatment) was estimated by maximizing the Cox partial likelihood [15]. Treatment-specific incidence rates were estimated by Poisson regression with a log link (including a treatment variable in the regression) and using log person years at risk within each treatment group as offset. Within each simulation setting, the sample distribution of point estimates from the 1,000 simulations is shown by a histogram and numerically by median and a 95% reference interval (RI) defined as the range from the 2.5% to the 97.5% percentile.
Results
The 3 simulation scenarios (main scenario, maximum use scenario and discovery scenario) assumed no causal effect of tacrolimus vs. TCS treatment on CTCL, that means any deviations from 1 of the simulated hazard ratios would indicate bias due to reverse causation.
Main scenario
The simulated estimated hazard ratios of CTCL for patients treated with tacrolimus vs. TCS were above 1 in all 9 settings (Table 5).
For the primary setting (setting A1: characterised by a low chance of correctly diagnosing CTCL initially [p1 = 0.25]), simulations showed a median estimated incidence rate of 0.145 (95% RI: 0.102, 0.205) CTCL cases per 1,000 person years during tacrolimus treatment and 0.047 (95% RI: 0.038, 0.056) CTCL cases per 1,000 person years during TCS treatment (Table 6).
The corresponding median hazard ratio of CTCL was 3.09 (95% RI: 2.11, 4.69) (Table 5 and Fig 2) for tacrolimus vs. TCS. In 99.7% of the simulations in this setting there was evidence in favour of a hazard ratio above 1.
Table 7 contains information explaining settings A1-9.
When increasing the likelihood of correctly diagnosing CTCL (settings A2 and A3 with p1 = 0.5 and p1 = 0.75, respectively), the simulated hazard ratios were closer to 1 but still revealed a substantial bias (Table 5 and Fig 2). When decreasing the use of TCS as the first-line treatment (setting A5, p2 = 0.90), the bias was also reduced. Conversely, when increasing the use of TCS as the first-line treatment (setting A4, p2 = 0.99), the simulated hazard ratio and thus the bias was substantially higher.
When decreasing the difference in likelihood of switching to tacrolimus between true AD patients and misdiagnosed CTCL patients either by increasing the likelihood of switching for true AD patients (setting A6 and A7, p3 = 0.10, 0.20; p4 = 0.25) or by decreasing the likelihood of switching for misdiagnosed CTCL patients (setting A8, p3 = 0.05, p4 = 0.10) the resulting bias is still present but lower. When increasing the difference (setting A9), the simulated bias was markedly larger (Table 5 and Fig 2).
Maximum use scenario
In the maximum use scenario, we considered three settings with more use of tacrolimus compared to the main scenario by applying two actions: (1) increasing the frequency of first-line use of tacrolimus by decreasing p2; and (2) increasing the frequency of second-line use of tacrolimus in true AD patients by increasing p3 and simultaneously increasing the frequency of second-line use of tacrolimus in misdiagnosed CTCL patients by increasing p4.
Applying both actions (1) and (2) by doubling the use of tacrolimus both as first-line and second-line treatment (setting B1) compared to the primary setting of the main scenario, the median hazard ratio was 1.86 (95% RI: 1.32, 2.59) (Table 5 and Fig 3).
Table 8 contains information explaining settings B1-9.
If incorrectly interpreting the hazard ratio as an increased risk of CTCL due to tacrolimus treatment compared to TCS treatment, we would declare a false positive association in 93.4% of the 1,000 simulations (Table 6). Although less pronounced than the primary setting of the main scenario, the bias induced by reverse causation in this setting is still substantial.
If, in addition to the doubling of tacrolimus treatment, we increase the chance of correctly diagnosing CTCL initially (settings B2: p1 = 0.5; B3: p1 = 0.75), the bias is further reduced (for example, the median hazard ratio is 1.66 [95% RI: 0.93–2.93] in setting B3 [p1 = 0.75]) (Table 5 and Fig 3).
Discovery scenario
In the discovery scenario, the expected time to a correct CTCL diagnosis is decreased (settings C1: λ = 1.2; C2: λ = 0.5) and thus CTCL patients are less likely to initiate second-line treatment with tacrolimus. Simulation results from this scenario show hazard ratios closer to 1, that is a decreased bias (Table 5 and Fig 4).
Table 9 contains information explaining settings A1, C1, and C2.
Discussion
We investigated the association between the use of topical tacrolimus (second-line treatment) and CTCL using TCS (first-line treatment) as the comparator in a setting that assumed no association between treatment and CTCL. In the simulations, the calculated CTCL risk estimates for patients treated with tacrolimus vs. TCS, the reverse causation bias was larger when: 1) increasing the use of the first line treatment; 2) increasing the difference in likelihood of switching to second-line treatment for true patients and misdiagnosed patients; 3) decreasing the chance of correctly diagnosing patients with the event of interest initially, thus increasing the number of misdiagnosed patients; 4) increasing the expected duration until correct diagnosis of the event of interest; and 5) decreasing the general use of second-line treatment, and vice versa.
The difference in the likelihood of switching to the second-line treatment governs the difference in the perceived risk profiles for the two patient groups. A first-line versus second-line treatment strategy in which the chance of switching to the second-line treatment is higher with more refractory symptoms in misdiagnosed CTCL patients, reverse causation is induced. Indeed, when switching to second-line treatment becomes more likely for misdiagnosed patients than true patients, reverse causation is induced.
Specifically, we observed an increase in bias with increased difference in likelihood of switching to tacrolimus between true AD patients and misdiagnosed CTCL patients. The bias dictated a higher risk of diagnosis of CTCL in tacrolimus users only because such users are more likely to have been misdiagnosed with AD instead of CTCL initially.
The average incidence rates of CTCL during TCS treatment vary from 0.019 to 0.063 cases per 1,000 person-years in our simulations, and are comparable to the estimated crude incidence rate of CTCL for adults during TCS treatment (0.04 [95% CI: 0.032, 0.05] per 1,000 person-years) in the JOELLE study [10]. In the adult UK cohort of the JOELLE study, the crude and adjusted incidence rate ratios and 95% CI were 3.11 (95% CI: 1.38, 6.91) and 3.12 (95% CI: 1.49, 6.53), respectively [10] which is comparable to the ratio between the median simulated incidence rates of 0.145/0.047 = 3.09 for CTCL. In comparison, the JOELLE study found smaller effect sizes for the Danish, Swedish, and Dutch cohorts with adjusted incidence rate ratios of 1.28 (95% CI: 0.63–2.63), 1.75 (95% CI: 0.94, 3.25), and 1.33 (95% CI: 0.50–3.52), respectively. This may be due to greater use of tacrolimus as first- and second-line treatment as simulated in the max use scenario or a better ability to diagnose CTCL.
The strengths of the current simulation study include the multiple setting approach where key parameters in the multistate model used for simulation were varied to induce specific settings of interest. The model-based simulation approach made it possible to evaluate sources and magnitude of reverse causation bias in simulated cohort data closely aligned to that of the JOELLE [10].
The limitations of the current simulation study include the lack of confounders in the simulation model that may bias the results. However, the crude and adjusted estimates in the JOELLE study were comparable, indicating that the magnitude of confounding is small. Although our simulation results show that reverse causation may explain the observed associations between tacrolimus and CTCL in the JOELLE study, this does not exclude a potential causal association between tacrolimus and CTCL, however small.
We have used data from the JOELLE study to provide a framework to quantify the potential reverse causation bias when comparing first- and second-line treatments with an overlap in symptoms between the indication for the drug and the studied outcome. In the current simulation study, we found substantial reverse causation bias in the simulated CTCL risk estimates for patients treated with topical tacrolimus vs. TCS. Reverse causation bias may thus result in a false positive association between the second-line treatment and the studied outcome. The simulation-based framework can be adapted to other studies comparing first- and second-line treatments with an overlap in symptoms between the indication for the drug and the studied outcome. It allows an a priori exploration of the impact of various potential scenarios and can quantify the potential reverse causation bias.
Supporting information
S1 Fig. Example simulation of the age at onset of AD aka. time from birth to AD variable.
https://doi.org/10.1371/journal.pone.0304145.s001
(PDF)
S2 Fig. Example simulation of the age at death aka. time from birth to censoring variable.
https://doi.org/10.1371/journal.pone.0304145.s002
(PDF)
S3 Fig. Example simulation of the time from initiation of TCS treatment to switching treatment variable.
https://doi.org/10.1371/journal.pone.0304145.s003
(PDF)
S4 Fig. Example simulation of the time between AD diagnosis and CTCL diagnosis variable.
https://doi.org/10.1371/journal.pone.0304145.s004
(PDF)
S1 File. R code used for the simulations, output dataset of each simulation, and R code used in analysis.
https://doi.org/10.1371/journal.pone.0304145.s005
(ZIP)
Acknowledgments
We thank Ross S. Baird for his assistance in preparing the manuscript; Alexandre Abramavicus for helping with dermatological insights and simulation assumptions; RTI Health Solutions for information and feedback about the JOELLE study and reverse causation; and Metadata for preparing the simulations.
References
- 1. Pottegård A, Hallas J. New use of prescription drugs prior to a cancer diagnosis: Use of Prescription Drugs Prior to Cancer Diagnosis. Pharmacoepidemiology and Drug Safety. 2017;26(2):223–7.
- 2. Suissa S, Dell’Aniello S. Time-related biases in pharmacoepidemiology. Pharmacoepidemiol Drug Saf. 2020;29(9):1101–10. pmid:32783283
- 3. Legendre L, Barnetche T, Mazereeuw-Hautier J, Meyer N, Murrell D, Paul C. Risk of lymphoma in patients with atopic dermatitis and the role of topical treatment: A systematic review and meta-analysis. Journal of the American Academy of Dermatology. 2015;72(6):992–1002. pmid:25840730
- 4. Phan A, Veldman R, Lechowicz MJ. T-cell Lymphoma Epidemiology: the Known and Unknown. Curr Hematol Malig Rep. 2016;11(6):492–503. pmid:27995419
- 5. Bispo JAB, Pinheiro PS, Kobetz EK. Epidemiology and Etiology of Leukemia and Lymphoma. Cold Spring Harb Perspect Med. 2020;10(6). pmid:31727680
- 6. Bylund S, von Kobyletzki LB, Svalstedt M, Svensson A. Prevalence and Incidence of Atopic Dermatitis: A Systematic Review. Acta Derm Venereol. 2020;100(12):adv00160. pmid:32412646
- 7. Hristov AC, Tejasvi T, Wilcox RA. Mycosis fungoides and Sezary syndrome: 2019 update on diagnosis, risk-stratification, and management. American journal of hematology. 2019.
- 8. Wilcox RA. Cutaneous T-cell lymphoma: 2016 update on diagnosis, risk-stratification, and management. American journal of hematology. 2016;91(1):151–65. pmid:26607183
- 9. Kim EJ, Hess S, Richardson SK, Newton S, Showe LC, Benoit BM, et al. Immunopathogenesis and therapy of cutaneous T cell lymphoma. J Clin Invest. 2005;115(4):798–812. pmid:15841167
- 10. Arana A, Pottegard A, Kuiper JG, Booth H, Reutfors J, Calingaert B, et al. Long-Term Risk of Skin Cancer and Lymphoma in Users of Topical Tacrolimus and Pimecrolimus: Final Results from the Extension of the Cohort Study Protopic Joint European Longitudinal Lymphoma and Skin Cancer Evaluation (JOELLE). Clin Epidemiol. 2021;13:1141–53. pmid:35002327
- 11.
Lawless. RJCaJF. Multistate Models for the Analysis of Life History Data. Chapter 1. Chapman and Hall/CRC. 2018.
- 12. Criscione VD, Weinstock MA. Incidence of Cutaneous T-Cell Lymphoma in the United States, 1973–2002. Archives of Dermatology. 2007;143(7):854–9. pmid:17638728
- 13. Therneau T. A Package for Survival Analysis in R. R package version 3.2–7 - https://cran.r-project.org/web/packages/survival/index.html. 2020.
- 14. de Wreede LC, Fiocco M, Putter H. The mstate package for estimation and prediction in non- and semi-parametric multi-state and competing risks models. Comput Methods Programs Biomed. 2010;99(3):261–74. pmid:20227129
- 15. Cox DR. Regression Models and Life-Tables. Journal of the Royal Statistical Society Series B (Methodological). 1972;34(2):187–220.