HIV incidence estimates are used to monitor HIV-1 infection in the United States. Use of laboratory biomarkers that distinguish recent from longstanding infection to quantify HIV incidence rely on having accurate knowledge of the average time that individuals spend in a transient state of recent infection between seroconversion and reaching a specified biomarker cutoff value. This paper describes five estimation procedures from two general statistical approaches, a survival time approach and an approach that fits binomial models of the probability of being classified as recently infected, as a function of time since seroconversion. We compare these procedures for estimating the mean duration of recent infection (MDRI) for two biomarkers used by the U.S. National HIV Surveillance System for determination of HIV incidence, the Aware BED EIA HIV-1 incidence test (BED) and the avidity-based, modified Bio-Rad HIV-1/HIV-2 plus O ELISA (BRAI) assay. Collectively, 953 specimens from 220 HIV-1 subtype B seroconverters, taken from 5 cohorts, were tested with a biomarker assay. Estimates of MDRI using the non-parametric survival approach were 198.4 days (SD 13.0) for BED and 239.6 days (SD 13.9) for BRAI using cutoff values of 0.8 normalized optical density and 30%, respectively. The probability of remaining in the recent state as a function of time since seroconversion, based upon this revised statistical approach, can be applied in the calculation of annual incidence in the United States.
Citation: Hanson DL, Song R, Masciotra S, Hernandez A, Dobbs TL, Parekh BS, et al. (2016) Mean Recency Period for Estimation of HIV-1 Incidence with the BED-Capture EIA and Bio-Rad Avidity in Persons Diagnosed in the United States with Subtype B. PLoS ONE 11(4): e0152327. https://doi.org/10.1371/journal.pone.0152327
Editor: Hiroshi Nishiura, The University of Tokyo, JAPAN
Received: December 4, 2015; Accepted: March 11, 2016; Published: April 11, 2016
This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: The authors have no support or funding to report.
Competing interests: The authors have declared that no competing interests exist.
Measurement of the number of new HIV-1 infections per year and the annual rate at which incident infections occur is important for tracking HIV and monitoring transmission, for evaluation of preventive interventions, and for resource allocation. Application of laboratory methods to distinguish recent from non-recent HIV infection, based upon characteristics of the antibody response early after seroconversion, has led to a less costly approach in the estimation of HIV incidence . Since the publication of a sensitive/less-sensitive testing algorithm in 1998 , a number of bioassays have been developed to detect recent infections [3–6]. Many of these assays have recently received a formal evaluation of performance as an incidence assay by the Consortium for the Evaluation and Performance of HIV Incidence Assays .
For future biomarker-based HIV incidence estimates using the stratified extrapolation approach , the U.S. National HIV Surveillance System (NHSS) will transition from using the Aware BED EIA HIV-1 Incidence test (BED) to the Bio-Rad avidity index (BRAI), a modification of the HIV-1/HIV-2 plus O ELISA test, to classify infections as recent vs. non-recent. The accuracy of new incidence estimates will depend on having an accurate estimate of the mean duration of recent infection (MDRI) and an accurate estimate of the distribution of recency duration from a representative sample of U.S. subtype B incident HIV infections. It is important to estimate MDRI for both BED and BRAI using the same statistical procedures to facilitate bridging between NHSS historic and future trends in HIV incidence.
A critical parameter in the estimation of HIV incidence, MDRI for a given bioassay such as BED or BRAI, is defined as the average length of time, over a fixed time T, that persons with newly acquired infection are classified by the bioassay as having recently acquired infection. MDRI is expressed as , where PR(t) is the probability of obtaining a recent result and t is the time since detectable infection. T was chosen to capture most of the dynamic range of a bioassay; i.e., PR ≈ 0 for t>T . Estimation of ΩT consists of analyzing longitudinal data from HIV seroconverters, where standard methods, such as survival analysis or binomial regression, must be carefully adapted to consider fluctuations of HIV-infected persons in and out of the bioassay-defined state of recent infection and imprecisely known seroconversion times. This report details two general approaches used to estimate ΩT and provides estimates of the ΩT incidence parameter for both BED and BRAI for persons infected with subtype B, the predominant subtype in the United States. We also describe the distribution of recency duration for application to NHSS HIV incidence data.
Materials and Methods
Biomarker data were generated from longitudinal specimens of therapy-naïve individuals from the United States infected with HIV-1 subtype B who had known dates of last negative and first positive HIV antibody test dates less than 365 days apart collected as part of various cohort studies, including: plasma samples from the HIVNET HIV network for prevention trials , the AIDSVAX B/B candidate vaccine phase III trial (Vax004) , the multicenter AIDS cohort study (MACS) , and the Seroconversion Incident Panel Project (SIPP) in collaboration with SeraCare Life Sciences, Inc. in Milford, MA. In addition, biomarker data were generated from commercially available seroconversion panels obtained from SeraCare Life Sciences (previously known as Boston Biomedical Inc. (BBI)) (Table 1). Specimens were unlinked from personal identifiers and determined not to be human subjects research by the Centers for Disease Control and Prevention.
The median and interquartile range of distributions are given for the HIV-negative seroconversion interval, i.e., time between last HIV-negative and first HIV-positive tests; the HIV-positive follow-up, reflecting the total duration of observation after testing positive; the number of HIV-positive samples; and the HIV-positive sampling intervals or times between consecutive samples.
The principles of the BED-capture enzyme immunoassay, measuring the proportion of immunoglobulin G that is specific to HIV, and the Bio-Rad GS HIV-1/HIV-2 Plus O EIA, measuring the avidity of maturing antibodies to bind less strongly to the antigen early after infection, have been described in detail elsewhere [4, 13–14]. The thresholds used to classify recent vs. long-standing infection were 0.8 normalized optical density (OD) for BED and 30% for BRAI [15–16].
Two general approaches were used to estimate ΩT: (i) survival time methods, and (ii) binomial models of the probability of testing recent as a function of time since seroconversion. The fitted models were used to calculate ΩT at the specified recency thresholds and within the dynamic range of each bioassay, at T = 2 years post seroconversion. Differences in survival time distributions by cohort were assessed using a log-rank test with a p-value adjustment for multiple comparisons.
In the first approach, the Kaplan-Meier estimator of the survival function describing the probability of being in the recent state as a function of time since seroconversion was estimated by a step function using a maximum likelihood approach, after approximating entry (seroconversion) and exit (transition from recent to non-recent) times for each subject. To use the Kaplan-Meier estimator, the time spent in the recent state must be known or right- censored (have a lower bound). Before application of the Kaplan-Meier estimator, seroconversion times were approximated by the midpoints of seroconversion intervals, i.e., the time between last HIV-negative and first HIV-positive tests. This approximation assumes the seroconversion times were uniformly distributed within the seroconversion intervals. The times of transitions from recent to non-recent infection were estimated using linear interpolation or regression (see below) between measurement readings. If a person did not have a non-recent result, then the time in the recent state was right-censored; i.e., time in the state was greater than the time from estimated seroconversion to the subject’s last visit (lower bound for survival time). Given the resulting set of distinct times spent in the recent state after seroconversion, the estimator for the probability of being in the recent state at time t post-seroconversion is given by , where di is the number of transitions from recent to non-recent occurring at time ti post-seroconversion, and ni is the number of subjects in the sample at time ti after seroconversion (the number of subjects who have not yet transitioned and not been censored by time ti). for t<t1. ΩT was estimated by . The survival probability function was computed from the Kaplan-Meier step function, with probabilities given at each day between 0 and T days. Gaps between days were filled with the probability for the last day before the gap, and for those days with an estimated probability that included a fraction of a day, a weighted sum of probabilities was computed for the day, with weights based upon the duration per fraction. A correction to the Kaplan-Meier estimator was applied by classification of the maximum observation to an event if it was censored . In addition to non-parametric survival, survival times spent in the recent state, as defined above, were modelled parametrically by the Weibull distribution. The probability distribution function for the Weibull distribution is given by , where β is the Weibull scale parameter or spread of the distribution and α shape parameter.
Neglected in our previous estimates of MDRI , subjects observed often enough during the phase when the bioassay was near the recent/non-recent threshold may exhibit multiple fluctuations back and forth between the two states. For these subjects, predicted exit times were computed from linear regression of the bioassay measurement values on approximated days since seroconversion using all data between the observation prior to the time when the threshold was first reached and either the first observation above after the last value below the threshold or the last observation for those individuals not observed above the threshold at the end of their follow-up. If the prediction slope was negative, time was right-censored at the observation prior to when the threshold was first reached.
In the second adapted approach , the probability of testing recent was modeled by fitting a binary value for the observed non-recent/recent classifications, instead of the original OD or avidity value for BED and BRAI, respectively. A linear binomial regression model, assuming a logit or inverse sigmoidal logistic parametric form for the probability of testing recent as a function of time since seroconversion, i.e. g(p) = log(p/(1 − p)), was fit to the data using a maximum likelihood approach. Model goodness of fit was assessed using the AIC criterion . The model form was g(PR(t)) = Σβt, where g(.) was the logit link function and Σβt the linear predictor. The seroconversion times were approximated by the midpoints of the seroconversion intervals prior to model fitting. ΩT was estimated by using the trapezoidal rule for 731 days (0 to T). The predictor was log-transformed time since seroconversion. Bootstrapping was performed by sampling from subjects; 1000 replicates were computed. To account for potential subject level clustering effect on estimates of MDRI, a random intercept binomial model, with the logit parametric form was fit to the data using a pseudo-likelihood technique.  The linear predictor is of the form Σβt + γ, where γ is the subject-specific random effect. The minimum and maximum cluster size per subjects was 1 and 15, respectively.
In addition to this parametric approach, a generalized additive model (GAM) was fit to the binomial data with a logit link. The additive model generalizes the linear binomial model by modeling the expectation of testing recent as g(PR(t)) = ΣΨ(t), where ΣΨ(t) was a smooth function of time since seroconversion, estimated non-parametrically with flexible spline terms for the predictor . The degrees of freedom determine the amount of smoothing. A cross-validation method was used to optimize the effective number of parameters defined by the degrees of freedom . The ‘log-transformed time since seroconversion’ term was fit by using a univariate smoothing spline with four degrees of freedom; of these, one was taken up by the linear portion of the fit and three by the nonlinear spline portion. As with the parametric binomial model, bootstrapping was performed to derive measures of accuracy.
SAS software, version 9.3 (SAS Institute Inc., Cary, NC, USA) was used to implement all steps of the estimation methods.
Collectively, 953 specimens from 220 HIV-1 subtype B seroconverters taken from 5 cohorts were tested using either BED, BRAI, or both (Table 1, S1 and S2 Tables). The median time between last HIV-negative and first HIV-positive specimens, based upon Western blot diagnostic testing criteria, was 180 (interquartile range (IQR) 114, 192) days. The median duration of follow-up following HIV-positive diagnosis was 208 (IQR 65, 510) days; there were a median of 4 (IQR 3, 5) specimens per subject sampled approximately every 51 (IQR 28, 95) days. The percentage of subjects who did not enter the non-recent state during the course of their follow-up, as defined by the bioassay threshold, was 29% (61/209) for BED and 38% (62/162) for BRAI. There were not significant differences in the BED or BRAI probability distributions for remaining in the recent state by cohort (p>0.05).
Most subjects show increases in HIV antibody levels over time, though there was heterogeneity among individual responses (Fig 1) for both BED and BRAI. The estimates for ΩT range from 198.4 to 215.7 days for BED and 239.3 to 253.6 for BRAI avidity, differences of approximately 2–3 weeks between estimation methods (Table 2). The relative standard error (RSE), i.e. standard error divided by the mean, ranged from 5.8 to 8.0% for BED and 5.4 to 7.0% for BRAI, with the lower RSEs associated with the survival procedures or the binomial random intercept models, suggestive of slightly better fit, but with similar dispersion observed for the two bioassays.
Model predicted probabilities from five estimation methods are given; blue lines represent the survival methods, green dashed lines the binomial logit, and red dotted line the GAM method. Subject-specific increasing trends in normalized OD or avidity are graphed in the background on the secondary vertical axis. The horizontal dashed line represents the bioassay threshold for classification of recency/non-recency (BED threshold 0.8; BRAI threshold 30%).
In this report we have used five estimation procedures based on two statistical approaches to estimate the MDRI, a key parameter in estimation of population level HIV incidence, and the distribution of time from seroconversion to non-recent classification which can be used by NHSS to estimate the number of new HIV-1 infections for monitoring of trends in incidence at the national and state levels. Resulting estimates of ΩT ranged from 198.4 to 215.7 days for BED and 239.3 to 253.6 days for Bio-Rad avidity. These results constitute revised estimates for the BED mean duration of recency used in the HIV-1 incidence algorithm  and are consistent with other recently published estimates of BED MDRI [19, 24]. Differences in ΩT due to revisions will be inversely proportional to differences in estimated incidence when ΩT is applied to NHSS incidence data because this parameter is in the denominator of the incidence algorithm [20, 25–26], and will result in lower estimates of HIV-1 incidence than previously published [27, 28]. Improved estimation methods that account for the wide variability in individual response to antibody maturation, and specifically the potential fluctuations around the bioassay threshold, have been implemented. In addition, we have developed a random intercept model for the binomial approach to appropriately account for varying subject-level cluster sizes. The resulting MDRIs based upon our data were 1–2 weeks shorter when estimated from the random effects models relative to the logit models which assumed independence of observations.
Estimation of ΩT in general, and our estimation work in particular, have important limitations. Though we restricted our analyses to study cohorts and commercially available seroconverter panels of data from the United States for estimation of ΩT, a key difference from previously published results, these data may not be representative of all newly diagnosed HIV-1 subtype B infections in the United States. In addition, new diagnostic testing recommendations may lengthen MDRI by shortening the duration of time between transmission and detected infection. Future estimates will require an adjustment to MDRI or use of new seroconversion panel data from subjects diagnosed consistent with new HIV diagnostic testing algorithms. An important assumption is that of uniformly distributed seroconversion times in the seroconversion interval, the time from last HIV-negative to first HIV-positive tests. The assumption of uniformly distributed seroconversion times may be reasonable for individuals enrolled in studies that schedule visits. However, it may otherwise be possible to observe lack of independence between testing and seroconversion time for persons motivated by suspicion of infection or with concomitant needs for clinical care, e.g., diagnoses of STDs or pregnancy [29–32].
Results from simulation studies demonstrated that the estimation methods described in this report were robust to most data issues, e.g., duration of time between last HIV-negative and first HIV-positive tests, frequency of HIV-positive sampling, and loss-to-follow-up . However, when data issues were compounded, e.g., when both HIV-negative and HIV-positive sampling frequency was lengthy, absolute bias increased. Although a number of methods can be used to estimate ΩT, there were differences in accuracy and precision depending upon the data quality. For example, models we implemented in estimation of MDRI that relied on a parametric assumption or ignored within-subject correlation in the presence of varying cluster sizes (measurements per subject) did not perform as well as the non-parametric survival approach in the presence of increasing measurement noise with T = 2 years. Although the estimates reported herein will require validation from other studies of seroconverters, the MDRI estimates based upon the revised survival approach, 198.4 days for BED and 239.6 days for BRAI, and more specifically, the probability of remaining in the recent state as a function of time since seroconversion, will be applied in calculation of annual incidence in the United States as described previously [25, 27].
S1 Table. Spreadsheet with BED OD values at times since estimated time of seroconversion, assayed for 858 specimens from 209 HIV-1 subtype B seroconverters, taken from 4 data cohorts.
Disclaimer: The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of the Centers for Disease Control and Prevention.
Conceived and designed the experiments: DLH BSP SMO TAG. Performed the experiments: SM TLD. Analyzed the data: DLH RS. Contributed reagents/materials/analysis tools: DLH BSP SMO SM TLD. Wrote the paper: DLH RS AH SMO TAG.
- 1. Brookmeyer R. Measuring the HIV/AIDS epidemic: approaches and challenges. Epidemiol Rev 2010; 32: 26–37. pmid:20203104
- 2. Janssen RS, Satten GA, Stramer SL, Rawal BD, O'Brien TR, Weiblen BJ, et al. New testing strategy to detect early HIV-1 infection for use in incidence estimates and for clinical and prevention purposes. JAMA 1998; 280: 42–48. [Erratum appears in JAMA 1999; 281(20): 1893.] pmid:9660362
- 3. Duong YT, Qiu M, De AK, Jackson K, Dobbs T, Kim AA, et al. Detection of recent HIV-1 infection using a new limiting-antigen avidity assay: potential for HIV-1 incidence estimates and avidity maturation studies. PLoS One 2012; 7(3): e33328. pmid:22479384
- 4. Parekh BS, Kennedy MS, Dobbs T, Pau CP, byers R, Green T, et al. Quantitative detection of increasing HIV type 1 antibodies after seroconversion: a simple assay for detecting recent HIV infection and estimating incidence. AIDS Res Hum Retroviruses 2002; 18: 295–307. pmid:11860677
- 5. Keating SM, Hanson D, Lebedeva M, Laeyendecker O, Ali-Napo NL, Owen SM, et al. Lower-sensitivity and avidity modifications of the Vitros Anti-HIV 1+2 Assay for detection of recent HIV infections and incidence estimation. J Clin Microbiol 2012; 50: 3968–3976. pmid:23035182
- 6. Mastro TD, Kim AA, Hallett T, Rehle T, Welte A, Laeyendecker O, et al. Estimating HIV Incidence in populations using tests for recent infection: issues, challenges and the way forward. J HIV AIDS Surveill Epidemiol 2010; 2(1): 1–14. pmid:21743821
- 7. Consortium for the Evaluation and Performance of HIV Incidence Assays (CEPHIA). Available: http://www.incidence-estimation.org/page/cephia-overview.
- 8. Karon JM, Song R, Brookmeyer R, Kaplan EH, Hall HI. Estimating HIV incidence in the United States from HIV/AIDS surveillance data and biomarker HIV test results. Stat Med 2008; 27(23): 4617–4633. pmid:18833636
- 9. Kassenjee R, McWalter TA, Barnighausen T, Welte A. A new bioassay-based incidence estimator. Epidemiology 2012; 23(5): 721–728. pmid:22627902
- 10. Koblin BA, Heagerty P, Sheon A, Buchbinder S, Celum C, Douglas JM, et al. Readiness of high-risk populations in the HIV Network for Prevention Trials to participate in HIV vaccine efficacy trials in the United States. AIDS 1998; 12: 785–93. pmid:9619811
- 11. The rgp 120 HIV Vaccine Study Group. Placebo-controlled phase 3 trial of a recombinant glycoprotein 120 vaccine to prevent HIV-1 infection. J Infect Dis 2005; 191: 654–665. pmid:15688278
- 12. Polk B, Fox R, Brookmeyer R, Kanchanaraksa S, Kaslow R, Visscher B, et al. Predictors of the acquired immunodeficiency syndrome developing in a cohort of seropositive homosexual men. N Engl J Med 1987; 316: 61–66. pmid:3024007
- 13. Dobbs T, Kennedy S, Pau CP, McDougal JS, Parekh BS. Performance characteristics of the immunoglobulin G-capture BED-enzyme immunoassay, an assay to detect recent human immunodeficiency virus type 1 seroconversion. J Clin Microbiol 2004; 42: 2623–2628. pmid:15184443
- 14. CEPHIA Project Team. Evaluation of Bio-Rad GS HIV-1/HIV-2 PLUS O EIA Avidity Assay. Available: http://www.incidence-estimation.com/content/filedl/biorad-assay-cdc-owen-protocol.
- 15. McDougal JS, Parekh BS, Peterson ML, Branson BM, Dobbs T, Ackers M, et al. Comparison of HIV Type 1 incidence observed during longitudinal follow-up with incidence estimated by cross-sectional analysis using the BED Capture Enzyme Immunoassay. AIDS Res Hum Retroviruses 2006; 22(10): 945–952. pmid:17067263
- 16. Hauser A, Santos-Hoevener C, Meixenberger K, Zimmermann R, Somogyi S, Fiedler S, et al. Improved testing of recent HIV-1 Infections with the BioRad avidity assay compared to the limiting antigen avidity assay and BED capture enzyme immunoassay: evaluation using reference sample panels from the German seroconverter cohort. PLoS One 2014; 9(6): e98038. pmid:24892795
- 17. Efron B. The two sample problem with censored data. In: Proceedings of the Fifth Berkeley Symposium on Mathematical and Statistical probability, Vol. 4. Prentice-Hall, New York, 1967. Pp. 831–853.
- 18. Parekh BS, Hanson DL, Hargrove J, Branson B, Green T, Dobbs T, et al. Determination of mean recency period for estimation of incidence with the BED-capture EIA in persons infected with diverse HIV-1 subtypes. AIDS Res Hum Retroviruses 2011; 27: 265–273. pmid:20954834
- 19. Kassanjee R, Pilcher CD, Keating SM, Facente SN, Mckinney E, Price MA, et al. Independent assessment of candidate HIV incidence assays on specimens in the CEPHIA repository. AIDS 2014; 28(16): 2439–49. pmid:25144218
- 20. Akaike H. A new look at the statistical model identification. IEEE Trans Automat Contr 1974; 19 (6): 716–723.
- 21. Breslow NE, Clayton DG. Approximate Inference in Generalized Linear Mixed Models. JASA 1993; 88: 9–25.
- 22. Hastie TJ, Tibshirani RJ. Generalized Additive Models. New York: Chapman and Hall 1990.
- 23. Hurvich CM, Simonoff JS, Tsai CL. Smoothing parameter selection in nonparametric regression using an improved Akaike information criterion. J R Stat Soc Series B Stat Methodol 1998; 60: 271–293.
- 24. Hargrove J, Eastwood H, Mahiane G, van Schalkwyk C. How should we best estimate the mean recency duration for the BED method? PLoS One 2012; 7(11): e49661. pmid:23166743
- 25. Hall HI, Song R, Rhodes P, Prejean J, An Q, Lee LM, et al. Estimation of HIV incidence in the United States. JAMA 2008; 300(5): 520–529. pmid:18677024
- 26. Brookmeyer R, Konikoff J, Laeyendecker O, Eshleman SH. Estimation of HIV incidence using multiple biomarkers. Am J Epidemiol 2013; 177(3): 264–272. pmid:23302151
- 27. Prejean J, Song R, Hernandez A, Ziebell R, Green T, Walker F, et al. Estimated HIV incidence in the United States, 2006–2009. PLoS One 2011; 6(8): e17502. pmid:21826193
- 28. Centers for Disease Control and Prevention. Estimated HIV incidence in the United States, 2007–2010. In: HIV/AIDS Surveillance Supplemental Report 2012; 17(No. 4).
- 29. Centers for Disease Control and Prevention. HIV testing survey, 2002. Atlanta: U.S. Department of Health and Human Services, Centers for Disease Control and Prevention; 2004: [1–28]. HIV/AIDS Special Surveillance Report 5.
- 30. Burchell AN, Calzavara L, Ramuscak N, Myers T, Major C, Rachlis A, et al. Symptomatic primary HIV infection or risk experiences? Circumstances surrounding HIV testing and diagnosis among recent seroconverters. Int J STD AIDS 2003; 14: 601–608. pmid:14511496
- 31. Centers for Disease Control and Prevention. HIV Testing among men who have sex with men—21 cities, United States, 2008. MMWR Morb Mortal Wkly Rep 2011; 60(21): 694–699. pmid:21637183
- 32. Lansky A, Drake A, DiNenno E, Lee CW. HIV behavioral surveillance among the U.S. general population. Public Health Rep 2007; 122(Suppl 1): 24–31. pmid:17354524
- 33. HIV Modelling Consortium. Available: http://www.hivmodelling.org/.