The evaluation of HIV treatment programs is generally based on an estimation of survival among patients receiving antiretroviral treatment (ART). In large HIV programs, loss to follow-up (LFU) rates remain high despite active patient tracing, which is likely to bias survival estimates and survival regression analyses.
We compared uncorrected survival estimates derived from routine program data with estimates obtained by applying six correction methods that use updated outcome data by a field survey targeting LFU patients in a rural HIV program in Malawi. These methods were based on double-sampling and differed according to the weights given to survival estimates in LFU and non-LFU subpopulations. We then proposed a correction of the survival regression analysis.
Among 6,727 HIV-infected adults receiving ART, 9% were LFU after one year. The uncorrected survival estimates from routine data were 91% in women and 84% in men. According to increasing sophistication of the correction methods, the corrected survival estimates ranged from 89% to 85% in women and 82% to 77% in men. The estimates derived from uncorrected regression analyses were highly biased for initial tuberculosis mortality ratios (RR; 95% CI: 1.07; 0.76–1.50 vs. 2.06 to 2.28 with different correction weights), Kaposi sarcoma diagnosis (2.11; 1.61–2.76 vs. 2.64 to 3.9), and year of ART initiation (1.40; 1.17–1.66 vs. 1.29 to 1.34).
Citation: Henriques J, Pujades-Rodriguez M, McGuire M, Szumilin E, Iwaz J, Etard J-F, et al. (2012) Comparison of Methods to Correct Survival Estimates and Survival Regression Analysis on a Large HIV African Cohort. PLoS ONE 7(2): e31706. doi:10.1371/journal.pone.0031706
Editor: Matthew P. Fox, Boston University, United States of America
Received: May 10, 2011; Accepted: January 17, 2012; Published: February 23, 2012
Copyright: © 2012 Henriques et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work has been supported only by the institutional funds of Hospices Civils de Lyon and Médecins Sans Frontières. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
In the last five years, antiretroviral treatment (ART) programs have scaled-up in Sub-Saharan Africa to provide ART to millions of people. One of the main indicators used to evaluate the effectiveness of these programs is survival after ART initiation; however, this indicator is often biased because of underreporting and unrecorded deaths .
Several approaches have been proposed to minimise the attrition of study cohorts and ascertain the vital status of lost to follow-up (LFU) patients –. However, high rates of loss to follow-up represent a persistent challenge in program evaluation  and, there is yet no validated and commonly accepted method to analyse program data taking into account loss to follow-up data in the absence of vital registration systems.
Generally, program evaluations use estimates of the probability of remaining in care considering deaths and losses to follow-up as program failures . These estimates may be difficult to interpret. Indeed, the death rate is frequently higher among LFU than among non-LFU patients for several reasons –: i) death can be the cause of the loss to follow-up; ii) patients prone to loss to follow-up might be frailer than the others and have higher risks of death; and, iii) after a few weeks or months without treatment, LFU patients become frailer than the others.
The traditional (uncorrected) approach considers that loss to follow-up is equivalent to administrative censoring at loss to follow-up date. Whenever human and financial resources are available, a double-sampling approach is used to ascertain the outcome of all or a subset of LFU patients and to correct the death rates either by updating the routine information or by applying a weighted average of the death rates observed among LFU and non-LFU patients . In the absence of resources, external data from meta-analyses may be used to estimate the death rate among LFU patients . More recent and sophisticated correction methods based on double-sampling have been shown optimal and provided unbiased estimations under some assumptions , . Whenever no correction or a non-optimal correction is made, the magnitude of the bias may be significant.
Within the context of cohort attrition, some methods for regression analysis  are able to provide unbiased results when the probability of loss to follow-up does not change over the follow-up duration. An alternative method used herein provides also unbiased results even when the probability of loss to follow-up changes over time after follow-up onset.
In the present study, we use six different methods to correct data from a large HIV program in rural Malawi and compare the corresponding estimates of one-year survival to quantify bias reduction. We also propose a method to correct survival regression analyses.
The HIV program in Chiradzulu and data sources
Since 2001, Médecins Sans Frontières (MSF) and the Malawian Ministry of Health and Population have provided free ART to HIV-infected persons in Chiradzulu district, Malawi. Individual basic socio-demographic, clinico-immunological, and treatment data were collected at each clinical visit.
Later, a survey was conducted, as part of an internal audit of programme activities, to trace LFU patients and determine their outcomes and reasons for care discontinuation. All patients with an available address in Chiradzulu district and followed-up in one of MSF-supported health facilities were traced. Eleven district-wide catchment areas were defined according to the access to the facilities where care was provided. One survey worker from the community was hired within each catchment area and paid during the whole survey period. The survey workers had to cover the catchment areas by teams of two (on bicycles with car support when necessary) and each worker, in turn, was the team leader in his catchment area. All the workers were supervised by a long-term MSF worker. LFU patients were traced at least three times before the end of the ten-day search allotted for each catchment area. Various findings from this survey have been reported elsewhere .
In the present study, we analysed routine monitoring data on 6,727 ART-naive patients initiated on ART in the Chiradzulu program between July 2004 and July 2007 and aged 15 years or more at ART initiation. To facilitate comparisons of our results with previous literature, we limited the analysis to the first year after ART initiation. Thus, patient follow-up started at ART initiation and was censored at the earliest of the following dates: house moving, last clinical visit, or end of a one-year follow-up. A patient was considered LFU if he (she) missed a scheduled appointment by more than one month. The same censoring, one year after ART initiation, was used for traced patients.
During the study period, 610 (9.1%) patients died, 583 (8.7%) were LFU, and 413 (6.1%) were transferred out of the program. Among LFU patients, 305 (52%) could be traced and the vital status was ascertained for 202 (66%) (Figure 1).
The characteristics of LFU and non-LFU patients were first described and compared using proportions and chi-square tests for categorical variables and medians, interquartile ranges (IQRs), and Kruskal-Wallis tests for continuous variables.
Patient survival estimates obtained using six different data correction methods were compared to the estimates obtained from uncorrected monitoring-program data. The six correction methods were: 1) the updated-dataset approach, 2) the stratified Kaplan-Meier approach, 3) the nomogram approach, 4) the time-dependent stratified Kaplan-Meier approach, 5) the time- and frailty-dependent stratified Kaplan-Meier approach, and, 6) a regression method corrected for missing information on death (Table 1). Survivals were estimated separately for men and women. Table 2 provides a list of assumptions necessary to obtain unbiased estimates.
For the updated-dataset approach, the data on outcome were updated using the data collected during the survey ignoring that a subset of LFU patients cannot be traced or found. Moreover, a sensitivity analysis was carried to investigate the way in which the results would be modified if all LFU patients were assumed to die immediately after the last visit.
For the stratified Kaplan-Meier approach, mortality was first calculated separately for non-LFU patients and LFU patients who were traced and had their vital status ascertained. The two mortalities were then combined applying weights that correspond to the proportions LFU and non-LFU patients recorded at one year after ART initiation.
With the nomogram approach, graphical estimates of the death rates were obtained taking into account mortality in LFU and non-LFU patients up to time t  and the proportion of LFU patients at end of follow-up.
The time-dependent stratified Kaplan-Meier approach was used to account for changes in the probability of loss of follow-up over time . The estimated death rate at each time point was calculated using the updated weighted averages of the death rates among non-LFU patients and among LFU patients who were traced and had their vital status ascertained. The weights were, respectively, the proportions of non-LFU and LFU patients at each time point t. This method ignores a crucial heterogeneity between the patients at time t; actually, LFU patients could be frailer than regularly followed-up ones.
The time- and frailty-dependent stratified Kaplan-Meier approach was proposed by An et al.  to account for that frailty. It was applied using weighted averages within strata of subjects having the same propensity to be lost-to-follow-up.
Regression methods were finally used to compensate for the lack of information on death. In a first step, a logistic regression was applied to a dataset that included only deceased patients. In this logistic regression, the dependent variable was coded 1 when death was ascertained from the routine program data and 0 when death was ascertained after tracing LFU patients. This logistic regression provided an estimate of the probability for a death to be recorded in routine program data for each subgroup of patients. This probability may be considered as the sensitivity of the routine program to detect death; i.e., this sensitivity corresponds to the likelihood of a death to be recorded in routine program data. In the model, we included sex, age (as a continuous variable), tuberculosis and Kaposi sarcoma diagnosis, CD4 cell count at ART initiation (≤ or >150 cells/mm3), the year of therapy start (2004–2005 or 2006–2007), and the delay between ART initiation and death. Note that the national ART program was implemented in 2004 in centralized hospitals throughout Malawi (approximately 55 sites). By the end of 2007, 109 sites were providing care. The Chiradzulu program scaled up the decentralization of HIV services in 2006. This was the reason for the choice of 2006 as a threshold.
Three weighting options were applied to the logistic regression to take into account that only a subgroup of LFU patients were traced: 1) weight set to one (wt1), ignoring that only a subset of LFU patients were traced and had their vital status ascertained; 2) weight set to the inverse of the proportion of the above-mentioned subset among LFU patients (wt2); and 3) weight set to the inverse of the time-specific proportion of the above-mentioned subset among LFU patients (wt3), allowing for changes in this proportion over time. Poisson regression models were then fitted after splitting individual patient follow-ups into successive one-month time periods. Observed deaths in routine program data were assumed to follow a Poisson distribution whose mean was equal to the product of three factors: number of patients at risk×rate of death×sensitivity. The results are presented as adjusted mortality ratios (RR) with 95% confidence intervals (CI).
All the analyses were conducted using Stata 10 (StataCorp LP, College Station, TX, USA), except the time- and frailty-dependent stratified Kaplan-Meier approach for which a specific program, developed in R software, was provided by the author of the original publication .
The protocols of the Chiradzulu project were approved within the framework of formal agreements between MSF and the Malawian Ministry of Health. The present observational study was conducted under the supervision of the Malawi National Health Science Research Committee with an agreement on collection and use of routine programmatic data for monitoring and evaluation. The study type did not require a formal submission for ethical approval.
Patient characteristics at ART initiation
From July 2004 to July 2007, 6,727 patients aged 15 years or older were initiated on ART in the Chiradzulu program. At initiation, the median patient age was 35.2 years (IQR: 30.0–43.0), 65% of the patients were women, 5% had tuberculosis, and 4.7% had Kaposi sarcoma (Table 3). Among the 3,271 patients with available CD4 cell count at ART initiation, 24.9% had less than 150 cells/mm3.
The proportion of men was higher among LFU than among non-LFU patients (66% vs. 54.4%; p-value = 0.0001). Compared to patients in care, LFU patients were younger (median age 34.3 vs. 35.2, p-value = 0.0138) and more frequently diagnosed with tuberculosis (8.1% vs. 4.9%; p-value = 0.0001) or Kaposi sarcoma (12.2% vs. 4%; p-value = 0.0001) at ART start.
Sixty percent of the 202 LFU patients who were traced and had their vital status ascertained during the survey were reported dead whereas just 10% of non-LFU patients died during the first year of ART (p-value = 0.0001).
Within the context of these comparisons, it should be noted that the differences between LFU and non-LFU patients should probably not be evaluated with p-values because these differences are due to selection not to sampling error.
Comparison between survival estimates with and without correction
As expected, with the uncorrected method based on the use of routine program data, the highest one-year survival estimates were 84% in men and 91% in women (Figure 2).
The upper and lower packs of line identifiers correspond respectively to the visibly grouped graphs or methods. SKM = stratified Kaplan-Meier method, wt = weight.
With the updated-dataset method and the regression method with wt1 (i.e., ignoring that only a subset of LFU patients were traced and had their vital status ascertained), the corrected one-year estimates were 2% lower in men and women. The estimates obtained with the time-dependent stratified Kaplan-Meier approach as with the survival regression were 7% and 6% lower than the uncorrected estimates in men and women, respectively. With the nomogram method, the results were intermediate; i.e., the corrected estimates were about 3% and 2.5% lower than the uncorrected ones in men and women, respectively.
With the updated-dataset method, supposing that, for the sensitivity analysis, all LFU patients died immediately after their last visit, the one-year survival estimates were 72% in men and 82% in women. These estimates were lower than the estimates obtained using alternative methods.
Comparisons between regression methods with and without correction
Table 4 shows the results of the regressions of the death rate on the studied factors, with and without correction for lack of information on death.
Uncorrected mortality ratios decreased with time of follow-up (uncorrected RR = 0.80, 95% CI; 0.77–0.88 per month of follow-up). These ratios were lower in women vs. men (0.62; 0.53–0.73) and lower in patients with high vs. low CD4 cell counts (0.64; 0.48–0.86 for >150 vs. ≤150 cells/mm3). These ratios were also higher in patients with vs. without an initial diagnosis of Kaposi sarcoma (2.11; 1.61–2.76) and in those who started ART more recently (1.40; 1.17–1.66 for start in 2006–07 vs. 2004–05).
Compared to the uncorrected estimates, the estimates obtained with correction showed slower decreases in mortality over time (RR ranged from 0.85, 95% CI; 0.82–0.87 with wt1 to 0.87; 0.84–0.90 with wt2 regression) and lower mortality ratios in patients who started ART more recently (RR ranged from 1.29; 1.08–1.54 with wt3 to 1.34; 1.12–1.60 with wt1 regression, for 2006–07 vs. 2004–05). Higher ratios were observed in patients with vs. without initial diagnosis of Kaposi sarcoma (RR ranged from 2.64; 2.01–3.48 with wt3 to 3.09; 2.35–4.06 with wt2 regression) and in patients diagnosed with vs. without tuberculosis at ART start (RR ranged from 2.06; 1.46–2.90 with wt3 to 2.28; 1.62–3.22 with wt1 regression). In contrast, mortality ratios in patients with high vs. low CD4 cell counts were similar before and after correction (RR ranged from 0.66; 0.49–0.87 with wt3 to 0.69; 0.51–0.92 with wt1 regression, for >150 vs. ≤150 cells/mm3).
A number of previous studies have raised concerns regarding poor patient retention in HIV care and warned that ART program effectiveness could thus be overstated , . Our findings illustrate the ways in which different statistical methods based on double-sampling can be used to correct survival estimates and how the reduction of bias varies according to the method used. In this large HIV treatment cohort in rural Malawi where survival estimates were 91% in women and 84% in men, the corrected estimates ranged from 89% and 82%, respectively, with the simplest methods, to 85% and 77%, respectively with the most sophisticated ones. These findings highlight the importance of accounting for the unknown mortality in LFU patients and the benefit of using sophisticated methods .
Broadly speaking, there are two ways of obtaining survival estimates. The first is “retrospective”; it uses outreach data –not collected for weighting– through double sampling but adjusts patient retention to be as representative as possible. The second is “prospective”; it identifies the sampling frame ahead of time, which ensures the representativeness of the double-sampled data by design. Each way has its proper use, but two advantages of the prospective approach are that it requires fewer assumptions and subject only to sampling error.
As in previous studies, we observed a higher risk of death among LFU patients than among patients retained in care . The use of uncorrected survival estimates would have therefore led to an overestimation of the program performance. As described in a South African study, linking patient data from HIV programs to vital registers would be the ideal way to obtain unbiased estimates of patient survival , . However, in many Sub-Saharan Africa settings, vital registration is not available, which requires the use of alternative correction methods. In the Chiradzulu program, the use of data from a survey conducted to ascertain the vital status of LFU patients and update the routine information did not succeed in correcting for loss of follow-up because of the small proportion of LFU patients who were traced and had their vital status ascertained.
The survival estimates given by the updated-dataset and the regression methods corrected for lack of information on death using the first weight option were only slightly lower that those obtained without correction. This is because these two methods ignore that a subset of LFU patients were not traced or their vital status not ascertained. Higher estimates were obtained using the nomogram and the stratified Kaplan-Meier approaches by applying to all LFU patients the estimated death rate observed among LFU patients who were traced and had their vital status ascertained. Two advantages of the nomogram are that it is easy to implement and does not require prior double-sampling because it uses external data. Nomogram-derived estimates can be considered as a preliminary correction; however, caution is necessary in the absence of double-sampling data.
The methods that take into account the proportion of LFU patients at each time point during follow-up gave lower survival estimates than the above-cited ones and may be considered as gold-standards .
The regression method with the weighting option 2 (wt2) proposed in the present study is close to another method proposed recently within the same context . The regression method with the weighting option 3 (wt3) is a variant that accounts for changes in the probability of loss to follow-up over time . In our cohort, the regression method proposed to correct survival estimates taking into account the uneven likelihood of outcome ascertainment among patients gave similar results to the uncorrected regression methods but only for some factors. The uncorrected survival regression analysis was sufficient to estimate the effects of factors such as the initial CD4 cell count; this highlights the importance of this predictor of mortality independently of patient gender or other factors. However, the mortality ratios related to tuberculosis doubled and the ratios related to Kaposi sarcoma diagnosis at ART start increased by 25–40% when corrected regression methods were used. These higher ratios may be explained by the expected increased risk of loss to follow-up and death among the patients presenting severe tuberculosis and Kaposi sarcoma disease. This finding highlights the need to correct survival regression models for the missing information on death within the context of program evaluation in large HIV cohorts with high loss to follow-up rates. However, this approach might be too complicated to be implemented in routine programs. Simpler statistical methods, such as the simulation-extrapolation method (SIMEX) , , could be good alternatives but they require double-sampling.
Survival regression assumed that the sensitivity to correctly identify deaths in the program was known and equal to the observed estimation; however, this assumption may be wrong. Structural equation modelling would avoid the need for separate estimations of sensitivity and the death rate and provide accurate confidence intervals.
The present study has several limitations. In our cohort, the proportion of patients who were traced and had their vital status ascertained represented only one third of all LFU patients. A study by Anglaret et al. suggested that unfound LFU patients are at higher risk of death than found ones . This means that the approaches that assume that traced LFU patients are a representative sample of all LFU patients would overestimate survival. A selection process could have affected the distribution of the outcomes in the traced sample. Patients with no recorded residence could be more vulnerable to mortality or have poor access to health services. Patients living outside Chiradzulu district may also differ from the others; however, there was no evidence to suggest that a geographical selection would bias the results. Finally and interestingly, one out of six patients traced was not found. A number of recorded addresses were probably not correct but no address checking was carried out neither at inclusion nor when patients moved out of the district. It should also be noted that Chiradzulu district was one of the first places in Malawi to provide ART. As the access to treatment increased, a number of patients may have moved out of the district to receive care closer to their homes or home villages.
Some relevant patient characteristics, such as the WHO stage or the adherence to treatment, could not be used in the regression models because of missing or unavailable information. For example, the CD4 cell counts were not available in half of the patients. The lack of information on these counts is known to be frequent among patients with advanced HIV disease at ART start. This is why a category for missing CD4 count was included in the regression analysis even though this practice is not generally advisable. Finally, a comparison between long-term survival estimates (one year after ART start) obtained with various correction methods should be performed because the proportion of deaths among LFU patients and the factors associated with outcome ascertainment in a program are likely to change with the increase of time on ART.
In conclusion, evaluations of HIV programs with high loss to follow-up rates should be based on corrected survival estimates. Though all the correction methods proposed in this article may be carried out using standard statistical software, their routine use in field surveys is not straightforward. The future development of a simple application should help program managers use these methods in routine program evaluation.
Conceived and designed the experiments: MP MM ES. Analyzed the data: JH RE. Wrote the paper: JH MP MM JI RE. Managed the data: MP MM ES. Interpreted the results: MP MM RE. Coordinated the project: JE.
- 1. Lanoy E, Lewden C, Lièvre L, Tattevin P, Boileau J, et al. (2009) How does loss to follow up influence cohort findings on HIV infection? A joint analysis of the French hospital database on HIV, Mortalité 2000 survey and death certificates. HIV Medicine 10: 236–245.
- 2. Lester RT, Mills EJ, Kariri A, Ritvo P, Chung M, et al. (2009) The HAART cell phone adherence trial (WelTel Kenya1): a randomized controlled trial protocol. Trials 10: 87.
- 3. Anglaret X, Toure S, Gourvellec G, Tchehy A, Zio L, et al. (2004) Impact of Vital Status Investigation Procedures on Estimates of Survival in Cohorts of HIV-Infected Patients From Sub-Saharan Africa. JAIDS 35: 320–323.
- 4. Bisson GP, Gaolathe T, Gross R, Rollins C, Bellamy S, et al. (2008) Overestimates of Survival after HAART: Implications for Global Scale-up Efforts. PLoS One 3: e1725.
- 5. McGuire M, Munyenyembe T, Szumilin E, Heinzelmann A, Le Paih M, et al. (2010) Vital Status of pre-ART an ART patients defaulting from care in rural Malawi. Trop Med Int Health 15: 55–62.
- 6. Brinkhof MWG, Pujades-Rodriguez M, Egger M (2009) Mortality of Patients Lost to Follow-Up in Antiretroviral Treatment Programmes in Resource-Limited Settings: Systematic Review and Meta-Analysis. PLoS One 4: e5790.
- 7. Lowrance DW, Makombe S, Harries AD, Shiraishi RW, Hochgesang M, et al. (2008) A Public Health Approach to Rapid Scale-Up of Antiretroviral Treatment in Malawi During 2004–2006. JAIDS 49: 287–293.
- 8. Touloumi G, Pocok SJ, Babiker AG, Darbyshire JH (2002) Impact of missing data due to selective dropouts in cohort studies and clinical trials. Epidemiology 13: 347–355.
- 9. De Gruttola V, Tu XM (1994) Modelling progression of CD4-lymphocyte count and its relationship to survival time. Biometrics 50: 1003–1014.
- 10. Wu L (2007) HIV viral dynamic models with dropouts and missing covariates. Stat Med 26: 3342–3357.
- 11. Geng EH, Emenyonu N, Bwana MB, Glidden DV, Martin JN (2008) Sampling-based approach to determining outcomes of patients lost to follow-up in antiretroviral therapy scale-up programs in Africa. JAMA 300: 506–507.
- 12. Egger M, Spycher BD, Sidle J, Weigel R, Geng EH, et al. (2011) Correcting Mortality for Loss to Follow-Up: A Nomogram Applied to Antiretroviral Treatment Programmes in Sub-Saharan Africa. PLoS Med 8: e1000390.
- 13. Frangakis CE, Rubin DB (2001) Addressing an Idiosyncrasy in Estimating Survival - Curves Using Double Sampling in the Presence of Self-selected Right Censoring. Biometrics 57: 333–342.
- 14. An MW, Frangakis CE, Musick BS, Yiannoutsos CT (2009) The Need for Double-Sampling Designs in Survival Studies: An Application to Monitor PEPFAR. Biometrics 65: 301–306.
- 15. Geng EH, Glidden DV, Emenyonu N, Musinguzi N, Bwana MB, et al. (2010) Tracking a sample of patients lost to follow-up has a major impact on understanding determinants of survival in HIV-infected patients on antiretroviral therapy in Africa. Trop Med Int Health 15(Suppl 1): 63–69.
- 16. Rosen S, Fox MP, Gill CJ (2007) Patient retention in antiretroviral therapy programs in Sub-Saharan Africa: a systematic review. PLoS Med 4: e298.
- 17. Bisson GP, Gaolathe T, Gross R, Rollins C, Bellamy S, et al. (2008) Overestimates of survival after HAART: implications for global scale-up efforts. PLoS One 3: e1725.
- 18. Yiannoutsos CT, An MW, Frangakis CE, Musick BS, Braitstein P, et al. (2008) Sampling-Based Approaches to Improve Estimation of Mortality among Patient Dropouts: Experience from a Large PEPFAR-Funded Program in Western Kenya. PLoS One 3: e3843.
- 19. Boulle A, Van Cutsem G, Hilderbrand K, Cragg C, Abrahams M, et al. (2010) Seven-year experience of a primary care antiretroviral treatment programme in Khayelitsha, South Africa. AIDS 24: 563–572.
- 20. Fox MP, Brennan A, Maskew M, PacPhail P, Sanne I (2010) Using vital registration data to update mortality among patients lost to follow-up from ART programmes: evidence from the Themba Lethu Clinic, South Africa. Trop Med Int Health 15: 405–413.
- 21. Cook J, Stefanski L (1994) Simulation-extrapolation estimation in parametric measurement error models. J Am Stat Assoc 89: 1314–1328.
- 22. Carroll R, Küchenhoff H, Lombard F, Stefanski L (1996) Asymptotics for the SIMEX estimator in structural measurement error models. J Am Stat Assoc 91: 242–250.