Skip to main content
Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Cumulative viral load as a predictor of CD4+ T-cell response to antiretroviral therapy using Bayesian statistical models

  • Joseph B. Sempa ,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation South African Department of Science and Technology (DST)/ National Research Foundation (NRF) Centre of Excellence in Epidemiological Modelling and Analysis (SACEMA), Stellenbosch University, Stellenbosch, South Africa

  • Theresa M. Rossouw,

    Roles Investigation, Writing – review & editing

    Affiliation Institute for Cellular and Molecular Medicine, Department of Immunology, University of Pretoria, Pretoria, South Africa

  • Emmanuel Lesaffre,

    Roles Methodology, Supervision, Writing – review & editing

    Affiliation Interuniversity Institute for Biostatistics and Statistical Bioinformatics, KU Leuven, Leuven, Belgium

  • Martin Nieuwoudt

    Roles Data curation, Funding acquisition, Supervision, Visualization, Writing – original draft, Writing – review & editing

    Current address: Institute for Biomedical Engineering (IBE), Mechanical and Mechatronic Engineering, Stellenbosch University, Western Cape, South Africa

    Affiliation South African Department of Science and Technology (DST)/ National Research Foundation (NRF) Centre of Excellence in Epidemiological Modelling and Analysis (SACEMA), Stellenbosch University, Stellenbosch, South Africa


16 Jan 2020: Sempa JB, Rossouw TM, Lesaffre E, Nieuwoudt M (2020) Correction: Cumulative viral load as a predictor of CD4+ T-cell response to antiretroviral therapy using Bayesian statistical models. PLOS ONE 15(1): e0228218. View correction



There are Challenges in statistically modelling immune responses to longitudinal HIV viral load exposure as a function of covariates. We define Bayesian Markov Chain Monte Carlo mixed effects models to incorporate priors and examine the effect of different distributional assumptions. We prospectively fit these models to an as-yet-unpublished data from the Tshwane District Hospital HIV treatment clinic in South Africa, to determine if cumulative log viral load, an indicator of long-term viral exposure, is a valid predictor of immune response.


Models are defined, to express ‘slope’, i.e. mean annual increase in CD4 counts, and ‘asymptote’, i.e. the odds of having a CD4 count ≥500 cells/μL during antiretroviral treatment, as a function of covariates and random-effects. We compare the effect of using informative versus non-informative prior distributions on model parameters. Models with cubic splines or Skew-normal distributions are also compared using the conditional Deviance Information Criterion.


The data of 750 patients are analyzed. Overall, models adjusting for cumulative log viral load provide a significantly better fit than those that do not. An increase in cumulative log viral load is associated with a decrease in CD4 count slope (19.6 cells/μL (95% credible interval: 28.26, 10.93)) and a reduction in the odds of achieving a CD4 counts ≥500 cells/μL (0.42 (95% CI: 0.236, 0.730)) during 5 years of therapy. Using informative priors improves the cumulative log viral load estimate, and a skew-normal distribution for the random-intercept and measurement error results is a better fit compared to using classical Gaussian distributions.


We demonstrate in an unpublished South African cohort that cumulative log viral load is a strong and significant predictor of both CD4 count slope and asymptote. We argue that Bayesian methods should be used more frequently for such data, given their flexibility to incorporate prior information and non-Gaussian distributions.


The rollout of effective combination antiretroviral therapy (ART) has markedly improved the survival of sub-Saharan African (SSA) Human Immunodeficiency Virus (HIV)-infected populations [1]. However, resource-limited settings are characterised by the late initiation of ART [2] and limited ongoing immunologic or virologic monitoring [3,4]. The difficulty of recovering CD4 count increases with due to decreasing thymic CD4+ T-cell production [5], and also depends on direct and indirect effects of chronic viral replication [6]. Further, in part due to its recent roll-out in SSA and limited monitoring, the long-term immunological consequences of ART remain incompletely understood [79].

There are challenges in the statistical modelling of the relationship of HIV viral load (VL) to CD4 counts. As a result, mechanistic models, which make particular assumptions regarding underlying biological processes, have commonly been used for this purpose [10,11]. A previous review of statistical models of immune response to ART in SSA revealed an immense diversity in the methodologies employed [12]. However, particularly two main types of statistical models of immune response following ART initiation were identified. Namely, those quantifying, ‘Slope’ of change in CD4 counts within a specified time after initiation, and ‘Asymptote’, which quantify achieving or not a predefined minimal ‘normal’ threshold of CD4 count within a specified time. There was also a sub-type of the above, namely ‘Time-to’ achieving a particular threshold [12]. In this study, we investigate slope and asymptote models.

Cumulative VL (cVL) is a readily definable indicator of long-term in vivo exposure to detectable virus [13] and positively associated with an increase in mortality [13]. Few studies have investigated the association of cVL with immune response to ART. Marconi et al [14] used the summated area under a patient’s ‘raw’ VL curve which was then subsequently log-transformed (cVL1), as a predictor of immune response in the data of a North American cohort. However, as previously discussed, the methods they employed have statistical weaknesses and the relevance of their findings to resource-constrained SSA settings may be limited [13]. In this study, we use cumulative log VL (cVL2), i.e. the summation of the area under the initially log-transformed VL curve above the limit of detection.

Previous statistical models have usually assumed normality for CD4 count distributions despite studies demonstrating that this is not justified [12,1517]. This may have been due to frequentist modelling software requiring this assumption. Bayesian methods, in combination with Markov Chain Monte Carlo (MCMC) computational techniques, allow the use of alternatives to Gaussian distributions, the incorporation of prior information and considerable flexibility with respect to estimation. For a general overview of the Bayesian methodology refer to Lesaffre and Lawson [18] and Gelman et al. [19].

To encourage model reproducibility and standardization, which has been historically lacking, we employ slope and asymptote statistical models based on a consensus of those previously systematically reviewed [12]. Bayesian MCMC mixed-effects methods are used to incorporate priors and examine the effect of different distributional assumptions. We then prospectively apply these models to the data of an as-yet-unpublished SSA HIV treatment cohort and determine if cVL2 is a valid predictor of immune response.


Ethics statement

This study was approved by the Research Ethics Committee of the Faculty of Health Sciences at the University of Pretoria (13/2010) and the Tshwane Metsweding Region Research Committee (TMREC 2011/05) to retrospectively capture, and analyse routinely collected data at the Tshwane District Hospital in Gauteng, South Africa. The Research Ethics Committee waived the need for informed consent since the data was de-identified during capture.

Study setting and patient population

The comprehensive demographic and long-term treatment data of the first, consecutive, 963 patients older than 18 years of age who presented for ART at the Tshwane District Hospital in Gauteng, South Africa during 2004 and 2005 were selected for analysis. All patients started ART after 2004 as part of the South African national HIV treatment plan and were treated according to the National Department of Health HIV guidelines (2004) operative at the time, i.e. eligibility for ART was CD4 <200 cells/μL or WHO stage 4 disease regardless of CD4 count. Treatment was initiated using a standardized triple-drug regimen consisting of two nucleoside reverse transcriptase inhibitors, mostly d4T and 3TC, and one non-nucleoside reverse transcriptase inhibitor, either NVP or EFV. CD4 and HIV-1 VL monitoring was performed at treatment initiation ‘baseline’ and then 6-monthly, according to the national protocol. Demographic, anthropometric, clinical, ART and 5-year longitudinal treatment response data were collected. We excluded all second-line ART visits for all patients who were switched to second-line therapy.

Data collection and inclusion criteria

Data was collated in Microsoft Excel spreadsheets. Of an initial 963 patients, 213 patients were excluded as they only had a baseline visit, preventing the calculation of cVL2. Of the remaining 750 patients, 59 patients had missing data for either sex, baseline CD4 count, baseline log VL or longitudinal CD4 counts. The Bayesian models included imputation for missing data in both the CD4 outcomes and the covariates, during each model iteration [18]. For the ‘slope’ model the data of 750 patients was analysed. In the ‘asymptote’ model, a further 5 patients were excluded as they had CD4 counts ≥500 cells/μL at baseline, leaving 745 patients, see Figure A in S2 Appendix.

Statistical methods

Model definitions.

In this study, two types of longitudinal statistical models were investigated. These were defined based on a consensus from a prior systematic review [12]. The utility of including cVL2 as a covariate was examined in both cases:

  1. The ‘Slope’ model, in which the outcome was the mean annual increase in absolute CD4 count measured at 6-monthly intervals following ART initiation. In this case, the post-ART immune trajectory was the metric of interest; and,
  2. The ‘Asymptote’ model, in which the outcome was the odds of achieving a CD4 count ≥500 cells/μL at any particular time during the 5 years of ART. If at any visit following ART initiation there was a CD4 count ≥500 cells/μL it was coded as ‘1’, or ‘0’, if below, i.e. a binary outcome. In this case, achieving or not a specified ‘normal’ CD4 count threshold was the metric of interest. Note, a 500 cells/μL threshold was chosen as it is the described lower end of the healthy reference range of CD4 counts for the SSA population [20]. Patients were also excluded when their baseline CD4 count was ≥500 cells/μL, as prior studies have shown that when such people initiate ART they have an almost equal life expectancy to the general population [21,22].

The lower detection limit for the VL assay used in this cohort was 50 copies/mL. However, a value of 400 copies/mL was used in the calculation of cVL2 [13] as doing so enabled the exclusion of smaller ‘viral blips’ (50–400 copies/mL) that are unrelated to subsequent viral rebound and failure [23,24]. Transitory values of below 1000 copies/mL have been shown to have no quantitative difference in estimates of opportunistic infections and mortality compared to a threshold of 400 copies/mL during treatment [13].

Thus, cVL2, measured as copy-year/mL, for the i-th patient at their j-th visit is given by, (1) where ti,k and Vi,k represent the time (in years) and log VL measurement of the k-th visit, respectively. cVL2 was set to 0 at baseline. All visits where VL was missing (49% of 8250 visits see Figure B in S2 Appendix) were excluded during estimation of cVL2. The cVL2 for patients with missing VL intervals was estimated with a longer inter-visit duration i.e. (ti,kti,k−1) as seen in Eq 1.


These included: sex (‘0’ for males and ‘1’ for females), baseline age (years), baseline CD4 count, baseline log VL (log10 copies/mL), time on treatment (years) as per prior model consensus [12], and cVL2 (log10 copy-year/mL) as described above (Eq 1). cVL2 coefficient was multiplied with a binary variable (0 –if the patient was lost to follow-up and 1 –if the patient was still in care) (see model implementation in S1 Appendix).

Model structure

Also as per prior consensus [12], we used linear mixed-effects models, with random intercept and slope on time on treatment, as these models have most often been used to study CD4 count response while on ART [2527]. Absolute CD4 counts are known to vary within and between patients [8,28] and CD4 measurements further apart in time were found to be less correlated than those closely adjacent. Nash et al. 2008 [22] have demonstrated, in patients initiating ART with <200 cells/μL, that the largest gains in CD4 count were seen in the first 6 months of treatment after which it slowed down considerably. The majority (91.7%) of our cohort had baseline CD4 counts <200 cells/μL.

We define 9 models, each with two scenarios, namely, unadjusted (i.e. without cVL2) and adjusted (i.e. with cVL2). These are labelled as follows:

  • Model 1: slope model second-degree polynomial model where errors and random effects are normally distributed;
  • Model 2: slope model where errors and random effects are normally distributed and has cubic splines with 3 inner knots;
  • Model 3: slope model where errors and random effects are normally distributed and has cubic splines with 5 inner knots;
  • Model 4: slope model where random-effects are SN distributed;
  • Model 5: slope model where measurement error and random-effects are SN distributed.
  • Model 6: Asymptote model second-degree polynomial model where random effects are normally distributed;
  • Model 7: Asymptote model where random effects are skew-normally distributed;
  • Model 8: Asymptote model where random-effects are normally distributed with cubic splines and 3 inner knots; and
  • Model 9: Asymptote model where random-effects are normally distributed with cubic splines and 5 inner knots.

Slope models (Eq 2) were initially defined as second-degree polynomials, i.e. a quadratic term of time on treatment with a random intercept and slope. This was referred to as model 1 and defined as follows: (2) for i = (1,…,n) patients. The CD4 count response is given by an mi-dimensional vector, , β = (β0,β1,…,βp)T is a (p+1)−dimensional vector of unknown regression coefficients, is a (mi x(p+1)) matrix of p fixed effects, is a (mi x q) design matrix for the ith, q dimensional vector of random effects. The measurement error is a mi−dimensional normally distributed vector with mean zero and covariance, , and assumed to be independent of the random effects. bi is assumed to have a bivariate Gaussian distribution with mean zero and (q x q) covariance matrix Σb.

The model for the response of the i-th subject at the j-th time point is then given by,

We also examined replacing 2nd degree polynomials with cubic B-splines with 3 inner knots. This procedure slices the data around important time points as previously described by Corbeau et al 2011 [29], at (1.250, 2.5, 3.751), or 5 inner knots, at (0.833, 1.666, 2.5, 3.334, 4.167), that latter of which is simply a more extreme case of the 3 inner knots. For more details, refer to the ‘Spline models’ model 2 and model 3 in S1 Appendix.

Distributional considerations

Models such as the above assume that measurement errors and random effects are normally distributed, meaning that responses must also be normally distributed given the covariates. Most prior studies have assumed that CD4 count response is either Gaussian normally distributed or have applied log or square-root variable transformations to achieve normalization [30,31]. In isolated cases, the Poisson distribution has also been applied [32]. However, the latter assumes equal mean and variance, which is invariably unjustified. In this case, these parameters differed dramatically, 366 cells/μL versus 56481.3 cells/μL, respectively. Further, at any particular time during treatment, the distribution of CD4 count values are usually skewed [33]. Thus, inferences based on using such methods might result in bias.

For this reason, in addition to the Gaussian normal, we also investigated Skew-normal (SN) distributions. The SN distribution is an extension of the former, consisting of a Gaussian component and a skewness component. Initially, only the random-effects were assumed to have a SN distribution (Eq 3), i.e. bi~SNq(0,Σbb), where each ith individual has a 2-dimensional random-effects covariance matrix, Σb. The deviation from normality is given by the skewness part, here chosen the same for all random effects, namely Δb = diag(δ,…,δ). Note that for δ = 0 we obtain the Gaussian distribution. For more details, refer to [33,34].


Thereafter, SNs were assumed for both random effects and measurement error (Eq 4). Namely, and random-effects, bi~SNq(0,Σbb) where and Δb are the respective skewness terms. An unknown (mix mi) skewness diagonal matrix, with skewness parameter vector was used. Thus: (4)

For more details, refer to the ‘SN models’ model 4 and model 5 in S1 Appendix

The asymptote models (Eq 5) were also initially defined as second-degree polynomials, and the same variables used in the slope model were adjusted for. This was referred to as model 6 and defined as follows: (5)

For πij = prob(yij = 1) if CD4 count ≥500 cells/μL and ‘0’ otherwise for patient ‘i’ at visit ‘j’. is an mi-dimensional vector, with a Bernoulli distribution. The model for the ith subject at the j-th time point is then written as,

A SN distribution for the random-effects (Eq 6) was also investigated, see ‘SN model’ model 7 S1 Appendix. (6) where elements of bi are defined in the same way as those in Eq 3. Further, we also investigated the use of cubic B-splines with 3 and 5 inner knots, as described for Eq 2, refer to ‘Spline models’ model 8 and model 9 in S1 Appendix.

Missing data imputation

In the Bayesian framework, missing data imputation uses the data augmentation algorithm [35]. Given dataset D, with missing data MD, and parameters K, the algorithm determines, with each iteration, plausible values by sampling MD from a conditional distribution P(MD | D, K), and then imputes these values. It then samples K from a conditional distribution of the posterior P(K | D, MD) [18]. For missing data on baseline CD4 count and baseline VL we estimated P(MD | D, K) using multiple linear regression. For imputation of sex we used multiple logistic regression.

The linear regression imputation models were defined as: (7)

In Eq 7 the response, CD4 count is an n-dimensional vector yi = (y1,y2,…,yn)T, assumed to have a linear relationship with xi = (1,x1,x2)T an (n x 3) matrix of 2 fixed effects of sex and baseline age. ϕT = (ϕ0,ϕ1,ϕ2) is a 3-dimensional vector of regression coefficients and ε an n×1 vector of random errors, for additional details refer to S1 Appendix.


In Eq 8, for πi = prob(yi = 1) if sex is female and ‘0’ otherwise for patient ‘i’. πi = (π1,π2,…,πn)T is an mi-dimensional vector, with a Bernoulli distribution. ϕT = (ϕ0,ϕ1,ϕ2,ϕ3) a 4-dimensional vector of regression coefficients and xi = (1,x1,x2,x3)T an (n x 4) matrix of 3 fixed effects of baseline age, baseline CD4 count, and baseline log VL. Note, baseline CD4 count and baseline log VL, which had MD, was used to impute missing sex. For this reason we used Eq 7 to impute the MD for the first two variables, and then imputed a value for the missing sex.


Non-informative priors: For slope of CD4 count models in Eqs 2, 3 and 4, the unknown parameters are: , where for those not defined in the preceding sections, ρ is the correlation for the random-effects, τε is a vector for the scale parameter for the imputed models, σ0 and σ1 are the respective standard deviations of the random intercept and slope, ϑb is a vector of parameters that make up the skewness parameter for the random effects in the SN models, γb is a vector of shape parameters for the random-effects in the covariance matrix of the SN models, and α is a vector of coefficients on the random-effects part in both types of SN models (i.e. Eqs 3 and 4). For more details regarding these priors, please refer to the section on priors for each of the models in S1 Appendix.

For asymptote models in Eqs 5 and 6, the unknown parameters are: {β,ρ,ϕ,θ,τε,σ0,σ1,αb,ϑb, γb}, where ρ is the correlation for the random-effects, τε is a vector for the scale parameter for the imputed models, σ0 and σ1 are the respective standard deviations of the random intercept and slope, ϑb is a vector of parameters that make up the skewness parameter for the random effects in the SN models, γb is a vector of shape parameters for the random-effects in the covariance matrix in the SN models and α is a vector of coefficients on the random-effects part in of SN random-effects model. For details about these priors please refer to the section on priors for each of the models in S1 Appendix.

Vague Gaussian, uniform, inverse-gamma, and inverse-Wishart prior distributions were used for the regression coefficients, the error variance and the covariance matrix of the random effects, respectively.

Informative priors: For the regression coefficients of sex, baseline log VL, and polynomial terms, the mean and variances were estimated from prior studies [25,36]. Meaning, these were only possible for polynomial models in which the random effects had Gaussian distributions. SN distributions were not applied in the reviewed prior studies [12]. Simple linear mixed-effect models were generated for each of the covariates—for example, CD4 count as the outcome adjusted for sex as the only covariate, resulting in a posterior estimate for sex (i.e. 36.17). The change in the posterior estimates from the full model, model 1 (i.e. 24.1) to that in the simple linear mixed-effect model (i.e. 36.17) was obtained using this method. This change was then added (i.e. -12.04) to the historical study estimate (i.e. 35.2) [36] to get to 23.16, which was then used as an informative prior mean for sex, see Table 1. The precision was estimated from the variance, s2, using the formula for the upper 95% confidence intervals (uCI): (9) where is the mean estimate and ‘n’ the particular historical study size. Then, solving for s2 in Eq 9: (10)

Table 1. Estimating informative priors used for slope of CD4 count model.

For example, in [36] the simple regression coefficient for female-sex and the uCI were 35.2 and 66.5 respectively, with a sample size n = 459. Substituting this into Eq 10 yields s2 = 117054.8, and a precision of 8.543007E-06 (Table 1). In the asymptote models, historical studies used different CD4 count thresholds [12], meaning that informative priors were not possible.

Data analysis

R version 3.2.2 (R Foundation for Statistical Computing, Vienna, Austria, and OpenBUGS ( were used. Except for time on treatment, all continuous covariates were standardized to a mean of zero and standard deviation equal to one. Three MCMC chains were employed in both models. For the second-degree polynomial slope models, a burn-in of 20K followed by a further 45K iterations was used. For other variations of the slope and asymptote models, a burn-in of 80K followed by 100K iterations was used. The SN random-effects slope model had 250K burn-in followed by 250K iterations while that for asymptote model employed 500K burn-in followed by 500K iterations. We ensured that the models ran until the Monte Carlo error was <5% of the posterior standard deviation. For confirmation of convergence, the Brooks-Gelman-Rubin (BGR) diagnostic was used.

Models were selected based on the conditional Deviance Information Criterion (cDIC) [37]. Unless models differ by at least 5, no clear model choice can be expected. Moreover, Quintero and Lesaffre (2018) showed that the marginal DIC (mDIC) should be chosen, based on the marginal specification of the linear mixed-effects model [38]. Unfortunately, there is no software yet available to compute the mDIC in the general case considered here. Hence, we considered only a clear model choice if the cDIC’s differ by approximately 10 [38]. Parameter means and 95% equal-tail credible intervals (95%CI) are reported below.


Descriptive results

213 patients were excluded from analysis as they only had a baseline visit. This group had higher median baseline log VLs, i.e. 5.3 log10 copies/mL (IQR: 4.76, 5.54) vs 5.1 log10 copies/mL (IQR: 4.64, 5.48), (p = 0.003), and were older, i.e. 39 years (IQR: 34, 47) vs 36 years (IQR: 31, 42), (p<0.001).

In the data of the 750 patients analysed, 69.6% (n = 522) were female, with a median baseline age of 36 years (IQR: 31, 42). The median baseline CD4 count was 89 cells/μL (IQR: 43, 143) and median baseline log VL, 5.1 log10 copies/mL (IQR: 4.64, 5.48). The overall median increase in cVL2 after 5 years on ART was 0.3 log10 copy-year/mL (IQR: 0.30, 0.43). Patients initiated on ART with baseline CD4 count >200 cells/μL (n = 43) experienced a 278 cells/μL median increase, starting at 261 cells/μL (IQR: 219, 298) and ending at 529 cells/μL (IQR: 420, 633) during 5-years of follow-up. Those with CD4 count 101–200 cells/μL (n = 291) at baseline experienced a 241 cells/μL median increase during 5-years of follow-up on ART from 143 cells/μL (IQR: 120, 173) to 386 cells/μL (IQR: 253, 525). Patients with ≤100 cells/μL (n = 416) at baseline experienced a 270 cells/μL median increase in CD4 count during 5-years of follow-up on ART from 47 cells/μL (IQR: 20, 72) to 317 cells/μL (IQR: 188, 496).

Patients with baseline log VL >5 log10 copies/mL (n = 411) experienced a CD4 count median increase of 268 cells/μL during follow-up, from 78 cells/μL (IQR: 41, 131) to 367 cells/μL (IQR: 232, 515), while those with ≤5 log10 copies/mL (n = 339) had a 237 cells/μL median increase in CD4 count from 103 cells/μL (IQR: 47, 161) to 365 cells/μL (IQR: 226, 512). The overall median increase in cVL2 from ART start by baseline CD4 count category was 0.6 log10 copy-year/mL (IQR: 0.44, 0.82), 0.7 log10 copy-year/mL (IQR: 0.50, 0.90), and 0.7 log10 copy-year/mL (IQR: 0.55, 1.28) for patients with baseline CD4 count >200, 101–200, and ≤100 cells/μL, respectively. Only 33.8% of the patients in the data used in the asymptote model ever reached a CD4 count ≥500 cells/μL.

Slope models

After adjusting for sex, baseline age, baseline CD4 count, baseline log VL and time on treatment an increase in cVL2 is associated with a -19.6 cells/μL (95%CI: -28.25, -10.93) mean annual decrease in absolute CD4 counts over 5-years of follow up, Table 2. All other estimates of the regression coefficients were associated with CD4 count response as their 95%CIs do not include zero. The adjusted, i.e. with cVL2, model has a substantially lower cDIC compared to the crude, i.e. without cVL2 (47480 vs 47510, respectively), Table H in S2 Appendix. The median predicted CD4 count response, generated from the cVL2 adjusted model 1, among patients with baseline CD4 count ≤100 cells/μL is lower compared to those with >200 cells/μL, especially within the first 2 years of ART. The trend is similar but still lower compared to those in the 101–200 cells/μL category. This is more pronounced towards the 5th year of ART, Fig 1A. Females demonstrate a marginally higher median predicted increase in CD4 counts than males, Fig 1B. There is no difference in median predicted CD4 count increase between stratified groups of baseline age (≤50 vs >50 years) or baseline log10 VL (≤5 vs >5 copies/mL) during follow-up, Fig 1C and 1D.

Fig 1. Predicted posterior median CD4 counts trajectory by covariate strata for the slope of CD4 count model, with 95% prediction intervals.

A. baseline CD4 count (cells/μL), B. sex, C. baseline age (years), D. baseline log10 VL (copies/mL). Median predicted CD4 counts from model 1, is plotted. The red, black or blue dots represent predicted median CD4 count at different time points, and the bars are the whiskers.

Table 2. Posterior means and 95% credible intervals of the regression coefficients for the slope of CD4 count models.

Asymptote models

After adjusting for sex, baseline age, baseline CD4 count, time on treatment and baseline log VL, an increase in cVL2 reduces the odds for having a CD4 count ≥500 cells/μL at any time during 5 years of follow-up, i.e. 0.42 (95%CI: 0.236, 0.730), Table 3. For none of the variables, other than baseline log VL, the 95%CI for the posterior mean odds ratio includes one. The cVL2 adjusted compared to the unadjusted model 6 has a substantially lower cDIC (1695 versus 1716, respectively), Table H in S2 Appendix. Adjusting for cVL2 clearly provides a better fit to the data. Except for patients whose baseline CD4 count was >200 cells/μL and 101–200 cells/ μL, the median predicted probability of having a CD4 count ≥500 cells/μL is always less than 0.5 throughout the 5 years of ART, Fig 2A. Females demonstrate a higher median predicted probability of having a CD4 count ≥500 cells/μL compared to males, Fig 2B, especially during the first 2.5 years of treatment. Patients with a baseline age ≤50 years old have similar median predicted probabilities of a CD4 count ≥500 cells/μL compared to those >50 years, Fig 2C. It seems more likely for ≤50-year-old patients to reach at least 0.5 probability of achieving CD4 count ≥500 cells/μL (at 3 years of ART) compared to those >50 years who never reach this during 5 years of follow-up. Patients with higher baseline log10 VL, ≤5 copies/mL, have a similar median predicted probability compared to those with baseline log VL >5 copies/mL of having CD4 counts ≥500 cells/μL for 5 years, Fig 2D.

Fig 2. Predicted posterior median probability of having ≥500 cells/μL CD4 counts by covariate strata, with 95% prediction intervals.

A) baseline CD4 count (cells/μL), B) sex, C) Baseline age (years), D) baseline log10 viral load (copies/mL). Median predicted probability of having CD4 count ≥500 cells/μL from model 6. The red, black or blue dots represent predicted probability of having CD4 count ≥500 cells/μL at different time points, and the bars are the whiskers.

Table 3. Posterior mean odds ratios and 95% credible intervals of the regression coefficients for the binary longitudinal models with response: CD4 counts ≥500 cells/μL.

Refinements of initial models

In the slope model, changing model structure from a second-degree polynomial to cubic splines, model 2 and model 3, provides significantly higher cDIC’s (Table H in S2 Appendix), indicating that the polynomial model, i.e. model 1 is a better fit. The random effects are not convincingly normally distributed, Figure C in S2 Appendix, prompting further investigation into SN distributions. The SN distribution of the random-effects yields a significantly lower cDIC for both the unadjusted and adjusted, model 4, compared to Gaussian models. This is more pronounced in the unadjusted versus adjusted in model 4 (i.e. 47270 vs 47480, respectively). However, for model 5 both unadjusted and adjusted models (i.e. with SNs for both random-effects and measurement errors) have the smallest cDIC’s. Except for the spline models, cVL2 remains a significant covariate in determining the slope of CD4 count, Table H in S2 Appendix and Table 4.

Table 4. The effect of changing from a Gaussian to a skew-normal on the estimated regression coefficients, with 95% credible intervals, in the slope model.

Using informative priors

This improved estimation of covariate effects in model 1*. For example, the mean annual effect of cVL2 on immune response increases from -19.6 cell/μL (95%CI: -28.25, -10.93) in model 1, i.e. without informative priors, to -19.7 cell/μL (95%CI: -28.35, -11.10) in model 1*, i.e. with informative priors. For additional details refer to Table E in S2 Appendix.

In the asymptote model, the distribution of the random-effects of the polynomial version is visibly skewed, Figure D in S2 Appendix, implying that the use of a Gaussian normal distribution for the random effects is inappropriate. Using a SN yields a significantly lower cDIC compared to a Gaussian distribution, model 7 = 829 vs model 6 = 1695 in the adjusted model and in the unadjusted, model 7 = 1108 vs model 6 = 1716, Table H in S2 Appendix. The adjusted SN model still has the lowest cDIC among all asymptote models. Overall, the SN model has slightly smaller CIs compared to the normal random effects model, Table 5. cVL2 remains a significant covariate in the asymptote model, Table 5.

Table 5. The effect of changing from a normal to a skew-normal distribution on the random-effects on the regression coefficients, with 95% credible intervals, in the asymptote model.


Proper adherence to ART, in the absence of HIV drug resistance, is sufficient to suppress HIV VL in plasma. When this is not achieved ongoing viral replication causes progressive depletion of CD4+ T-cells through direct and indirect mechanisms [8,29,39]. However, the extent of damage by a detectable VL in the short-term may not be readily apparent as CD4 counts are inherently variable within and between people [1517].

Using prospectively defined models on an unpublished SSA HIV treatment cohort, we demonstrated that long-term consequences of a history of detectable VL, i.e. the cVL2 biomarker, on immune response while on ART. cVL2 is strongly associated with, and provides better model fits for, both slope and asymptote CD4 count models. These combined CD4 count-VL dynamics have not previously been demonstrated in causal statistical models and are in apparent agreement with the findings of biologically ‘mechanistic’ mathematical models [11]. After adjusting for sex, baseline age, baseline CD4 count, baseline log10 VL and time on treatment, each log10 copy-year/mL increase in cVL2 was associated with a decrease in slope of CD4 counts and lower odds of having a CD4 count of ≥500 cells/μL while on ART. The median probability of having a CD4 count ≥500 cells/μL was still < 0.5 among patients with a baseline CD4 count ≤100 cells/μL after 5 years of follow-up. Having a CD4 count ≥500 cells/μL while on ART was previously associated with a similar life expectancy to that in the normal population [21]. Our findings provide evidence for increased benefits of early ART initiation, as the probability of having a CD4 counts ≥500 cells/μL is then increased.

In addition to baseline log VL, increasing baseline age, associated with a reduction in the size and functionality of the thymus [5,29], and lower baseline CD4 counts [22,40,41] are associated with ineffective CD4 count recovery. Lawn et al. [42] have shown that late ART initiators, with baseline CD4 count <200 cells/ μL, equivalent to 94% of our cohort, often take longer than 5 years of treatment before they reach CD4 count ≥500 cells/μL. In our cohort and elsewhere [22], patients with baseline CD4 count ≤ 200 cells/μL never reached a 0.5 median predicted probability of having a CD4 count ≥500 cells/μL. Further, such patients are at risk of developing suboptimal immune response, i.e. failure to reach CD4 count >350 cells/μL after 5 years of ART [43]. Patients with >200 cells/μL at baseline experienced a decline in the median predicted probability of having CD4 count ≥500 cells/μL after 3 years, Fig 2A. It is possible that patients initiated on ART at >200 cells/μL quickly reach a CD4 count ceiling which, according to Williams et al 2006 and Kulkarni et al 2011 is related to pre-infection and immediate post-infection CD4 count levels [44,45]. Our findings of better CD4 count recovery in females compared to males is consistent with prior findings [8,22,25,29,41], even though we had fewer males than females a trend which is common in clinical settings in sub-Saharan Africa [46].

Results from randomized controlled trials such as the ‘Strategies for Management of Antiretroviral Therapy’ (SMART) [47], ‘Strategic Timing of Antiretroviral Therapy (START)’ [48] and ‘Trial of Early Antiretrovirals and Isoniazid Preventive Therapy in Africa’ (TEMPRANO) [49], indicated benefits of early initiation of ART to reduce AIDS and non-AIDS infection in HIV patients. Currently, the World Health Organization recommendations are to initiate ART in all newly diagnosed HIV patients, regardless of their CD4 count a diagnosis. However, many people living with HIV are still initiated late on ART, reflected by low baseline CD4 counts [50], possibly due to programmatic bottlenecks, e.g. drug stock-outs, limited clinic space, late diagnosis or patient decisions to defer ART. Our findings are relevant resource-limited settings in highly HIV burdened sub-Saharan Africa.

cVL2 attempts to capture cumulative HIV antigenic exposure due to detectable viral load (>400 copies/mL) during ART. However, cVL2 estimation improves with narrower sampling intervals [13,51]. Increasing VL sampling frequency in resource-limited settings is unrealistic as it is expensive. Mechanistic models have shown that the HIV virus has a short half-life of approximately 2 days [52], which implies that during ART, an HIV VL test shows the current VL rather than a time-averaged measure across the interval or up to a few weeks before the sampling visit. Considering this, interpreting VL tests may not be as informative as time-averaged cVL2, which may be a relevant biomarker for cumulative antigenic pressure.

Our models do not adjust for duration of HIV infection prior to ART initiation as this information was not available. This may have resulted in biased model estimates for cVL2. However, we did control for baseline CD4 count which, although perhaps not ideal due to inter-patient variability in the rate of CD4 decline prior to ART [53], is frequently used as a surrogate marker for duration of infection [54]. Further, approaches for adjusting for pre-ART VLs, as summarized in a prior study [13], may be of limited value as they are dependent on an individual’s innate viral controlling ability. In the future, adjusting similar models with data for HIV ‘recency’ assay results [55,56] if available, may serve as an additional indication for duration of infection.

A Bayesian approach was helpful in that it enabled us to apply informative prior estimates from historical models, although this was limited to only those that assume Gaussian normality [12,25,36]. Assuming normal distributions for random effects that are intrinsically not normally distributed is questionable and likely to produce biased results [57,58]. We found that models with both SN distributions for the random-effects and measurement errors produced substantially smaller cDIC’s, i.e. the best model fits. The predicted distributions for the random-effects in both slope and asymptote models were observably skew, as can be seen in Figure C in S2 Appendix and even more so in Figure D in S2 Appendix. Thus, using SN distributions is likely to have improved the description of the dataset as a whole. Having said that, the β-parameters and 95%CIs obtained after using non-informative and informative priors were similar, likely owing to the relatively large size of the dataset [59].

The considered models all assumed that when patients have an incomplete follow-up that either: (1) the reason for incompleteness is not related to the outcome (Missing completely at random) or (2) could have been predicted from past history (missing at random). Of course, it can also be that the incompleteness is related in a more complicated manner to the outcome (missing not at random). However, there is no clear solution on how to deal with this case. Some possible approaches are joint modelling, sensitivity analyses assuming different mechanisms for the incompleteness, etc. Addressing missing not at random missingness mechanisms thus involve a lot of extra considerations. The possible impact of this missingness mechanism on our data will be the topic of future research.

This study has limitations. As for most Bayesian models based on sizeable datasets, running them was computationally intensive. Estimation of cVL2 may also have been affected by sampling frequency, as previously alluded to [13]. VL was measured bi-annually in the current cohort; thus, the effect of detectable VL might have excluded unmeasured viral rebounds occurring between sampling intervals. This cohort included patients who were initiated on ART at very low median CD4 counts, i.e. 87 cells/μL (IQR: 43, 144), and may consequently have been predisposed to poor CD4 count responses, Fig 1, as compared to cohorts with early ART initiation in resource-rich settings. Our results may be prone to a ‘survival’ bias as 213 patients, who were excluded from this study because they only had a baseline visit, were older and had a higher baseline VL.

Taken together, the results of this study show promising use of the cVL2 metric to describe CD4 count response in ART patients. Models were prospectively defined and tested on the data of an as yet undescribed SSA HIV treatment cohort. Bayesian methods enabled the incorporation of prior information from historical studies. All assumptions were explicitly defined facilitating reproducibility and standardization. In the future, quantifying the effect of viral rebounds occurring between sampling intervals and duration of HIV infection may help improve our understanding of the effect of cVL2.

Supporting information

S1 Appendix. WinBugs/OpenBugs code for all the models analyzed.



  1. 1. Palella FJ, Delaney KM, Moorman AC, Loveless MO, Fuhrer J, Satten GA, et al. Declining morbidity and mortality among patients with advanced human immunodeficiency virus infection. HIV Outpatient Study Investigators. N Engl J Med. 1998;338: 853–60. pmid:9516219
  2. 2. Siedner MJ, Ng CK, Bassett I V., Katz IT, Bangsberg DR, Tsai AC. Trends in CD4 count at presentation to care and treatment initiation in Sub-Saharan Africa, 2002–2013: A meta-analysis. Clin Infect Dis. 2015;60: 1120–1127. pmid:25516189
  3. 3. Reynolds SJ, Sendagire H, Newell K, Castelnuovo B, Nankya I, Kamya M, et al. Virologic versus immunologic monitoring and the rate of accumulated genotypic resistance to first-line antiretroviral drugs in Uganda. BMC Infect Dis. 2012;12: 381. pmid:23270482
  4. 4. Castelnuovo B, Sempa J, Agnes KN, Kamya MR, Manabe YC. Evaluation of WHO Criteria for Viral Failure in Patients on Antiretroviral Treatment in Resource-Limited Settings. AIDS Res Treat. 2011;2011: 736938. pmid:21541225
  5. 5. Haynes BF, Markert ML, Sempowski GD, Patel DD, Hale LP. The role of the thymus in immune reconstitution in aging, bone marrow transplantation, and HIV-1 infection. Annu Rev Immunol. 2000;18: 529–60. pmid:10837068
  6. 6. Okoye AA, Picker LJ. CD4(+) T-cell depletion in HIV infection: mechanisms of immunological failure. Immunol Rev. 2013;254: 54–64. pmid:23772614
  7. 7. Abbas AK, Lichtman AH, Pillai S. Basic Immunology: Functions and Disorders of the Immune System. 4th ed. Basic Immunology: Functions and Disorders of the Immune System. 4th ed. Elsevier USA; 2012. pp. 303–305.
  8. 8. Pinzone MR, Di Rosa M, Cacopardo B, Nunnari G. HIV RNA suppression and immune restoration: can we do better? Clin Dev Immunol. 2012;2012: 515962. pmid:22489250
  9. 9. Semeere AS, Lwanga I, Sempa J, Parikh S, Nakasujja N, Cumming R, et al. Mortality and immunological recovery among older adults on antiretroviral therapy at a large urban HIV clinic in Kampala, Uganda. J Acquir Immune Defic Syndr. 2014;67: 382–9. pmid:25171733
  10. 10. Greenland S, Pearl J, Robins JM. Causal diagrams for epidemiologic research. Epidemiology. 1999;10: 37–48. Available: pmid:9888278
  11. 11. Prague M, Commenges D, Gran JM, Ledergerber B, Young J, Furrer H, et al. Dynamic models for estimating the effect of HAART on CD4 in observational studies: Application to the Aquitaine Cohort and the Swiss HIV Cohort Study. Biometrics. 2016;61: 899–911. pmid:27461460
  12. 12. Sempa JB, Ujeneza EL, Nieuwoudt M. Systematic review of statistically-derived models of immunological response in HIV-infected adults on antiretroviral therapy in Sub-Saharan Africa. PLoS One. 2017;12: e0171658. pmid:28199360
  13. 13. Sempa JB, Dushoff J, Daniels MJ, Castelnuovo B, Kiragga AN, Nieuwoudt M, et al. Reevaluating Cumulative HIV-1 Viral Load as a Prognostic Predictor: Predicting Opportunistic Infection Incidence and Mortality in a Ugandan Cohort. Am J Epidemiol. 2016. pmid:27188943
  14. 14. Marconi VC, Grandits G, Okulicz JF, Wortmann G, Ganesan A, Crum-Cianflone N, et al. Cumulative viral load and virologic decay patterns after antiretroviral therapy in HIV-infected subjects influence CD4 recovery and AIDS. PLoS One. 2011;6: e17956. pmid:21625477
  15. 15. Gordon CL, Cheng AC, Cameron PU, Bailey M, Crowe SM, Mills J. Quantitative Assessment of Intra-Patient Variation in CD4+ T Cell Counts in Stable, Virologically-Suppressed, HIV-Infected Subjects. PLoS One. 2015;10: e0125248. pmid:26110761
  16. 16. van Rood Y, Goulmy E, Blokland E, Pool J, van Rood J, van Houwelingen H. Month-related variability in immunological test results; implications for immunological follow-up studies. Clin Exp Immunol. 1991;86: 349–54. pmid:1834381
  17. 17. Raboud JM, Haley L, Montaner JS, Murphy C, Januszewska M, Schechter MT. Quantification of the variation due to laboratory and physiologic sources in CD4 lymphocyte counts of clinically stable HIV-infected individuals. J Acquir Immune Defic Syndr Hum Retrovirol. 1995;10 Suppl 2: S67–73. Available:
  18. 18. Lesaffre E, Lawson AB. Bayesian Biostatistics. Chichester, UK: John Wiley & Sons, Ltd; 2012.
  19. 19. Gelman A, Andrew . Bayesian data analysis. 3rd Edition. Chapman & Hall/CRC texts in statistical science. CRC Press; 2014.
  20. 20. Lawrie D, Coetzee LM, Becker P, Mahlangu J, Stevens W, Glencross DK. Local reference ranges for full blood count and CD4 lymphocyte count testing. S Afr Med J. 2009;99: 243–8. Available: pmid:19588777
  21. 21. Lewden C, Chene G, Morlat P, Raffi F, Dupon M, Dellamonica P, et al. HIV-infected adults with a CD4 cell count greater than 500 cells/mm3 on long-term combination antiretroviral therapy reach same mortality rates as the general population. J Acquir Immune Defic Syndr. 2007;46: 72–7. pmid:17621240
  22. 22. Nash D, Katyal M, Brinkhof MWG, Keiser O, May M, Hughes R, et al. Long-term immunologic response to antiretroviral therapy in low-income countries: a collaborative analysis of prospective studies. AIDS. 2008;22: 2291–302. pmid:18981768
  23. 23. Grennan JT, Loutfy MR, Su D, Harrigan PR, Cooper C, Klein M, et al. Magnitude of virologic blips is associated with a higher risk for virologic rebound in HIV-infected individuals: a recurrent events analysis. J Infect Dis. 2012;205: 1230–8. pmid:22438396
  24. 24. Easterbrook PJ, Ives N, Waters A, Mullen J, O’Shea S, Peters B, et al. The natural history and clinical significance of intermittent viraemia in patients with initial viral suppression to < 400 copies/ml. AIDS. 2002;16: 1521–7. Available: pmid:12131190
  25. 25. Maman D, Pujades-Rodriguez M, Subtil F, Pinoges L, McGuire M, Ecochard R, et al. Gender differences in immune reconstitution: a multicentric cohort analysis in sub-Saharan Africa. PLoS One. 2012;7: e31078. pmid:22363550
  26. 26. Reda AA, Biadgilign S, Deribew A, Gebre B, Deribe K. Predictors of change in CD4 lymphocyte count and weight among HIV infected patients on anti-retroviral treatment in Ethiopia: a retrospective longitudinal study. PLoS One. 2013;8: e58595. pmid:23573191
  27. 27. Maskew M, MacPhail AP, Whitby D, Egger M, Fox MP. Kaposi sarcoma-associated herpes virus and response to antiretroviral therapy: a prospective study of HIV-infected adults. J Acquir Immune Defic Syndr. 2013;63: 442–8. pmid:23614996
  28. 28. Sax PE. Editorial commentary: can we break the habit of routine CD4 monitoring in HIV care? Clin Infect Dis. 2013;56: 1344–6. pmid:23315314
  29. 29. Corbeau P, Reynes J. Immune reconstitution under antiretroviral therapy: the new challenge in HIV-1 infection. Blood. 2011;117: 5582–90. pmid:21403129
  30. 30. Awoke T, Worku A, Kebede Y, Kasim A, Birlie B, Braekers R, et al. Modeling outcomes of first-line antiretroviral therapy and rate of CD4 counts change among a cohort of HIV/AIDS patients in Ethiopia: A retrospective cohort study. PLoS One. 2016;11: 1–18. pmid:27997931
  31. 31. De Beaudrap P, Etard J-F, Diouf A, Ndiaye I, Guèye NF, Guèye PM, et al. Modeling CD4+ cell count increase over a six-year period in HIV-1-infected patients on highly active antiretroviral therapy in Senegal. Am J Trop Med Hyg. 2009;80: 1047–53. Available: pmid:19478274
  32. 32. Sarfo FS, Sarfo MA, Kasim A, Phillips R, Booth M, Chadwick D. Long-term effectiveness of first-line non-nucleoside reverse transcriptase inhibitor (NNRTI)-based antiretroviral therapy in Ghana. J Antimicrob Chemother. 2014;69: 254–61. pmid:24003181
  33. 33. Chen R. Bayesian Inference on Mixed-effects Models with Skewed Distributions for HIV longitudinal Data. University of South Florida. 2012. Available:
  34. 34. Sahu SK, Dey DK, Branco MD. A New Class of Multivariate Skew Distributions with Applications to Bayesian Regression. Source Can J Stat / La Rev Can Stat. 2003;31: 129–150. Available:
  35. 35. Tanner MA, Hung Wong W. The Calculation of Posterior Distributions by Data Augmentation. Source J Am Stat Assoc. 1987. Available:
  36. 36. Boullé C, Kouanfack C, Laborde-Balen G, Carrieri MP, Dontsop M, Boyer S, et al. Task shifting HIV care in rural district hospitals in Cameroon: evidence of comparable antiretroviral treatment-related outcomes between nurses and physicians in the Stratall ANRS/ESTHER trial. J Acquir Immune Defic Syndr. 2013;62: 569–76. pmid:23337365
  37. 37. Spiegelhalter DJ, Best NG, Carlin BP, van der Linde A. Bayesian measures of model complexity and fit. J R Stat Soc Ser B (Statistical Methodol. 2002;64: 583–639.
  38. 38. Quintero A, Lesaffre E. Comparing hierarchical models via the marginalized deviance information criterion. 2018; 2440–2454. pmid:29579777
  39. 39. Abbas AK, Lichtman AH, Pillai S. Basic Immunology: Functions and Disorders of the Immune System. 4th ed. Basic Immunology: Functions and Disorders of the Immune System. Elsevier USA; 2012.
  40. 40. Gras L, Kesselring AM, Griffin JT, van Sighem AI, Fraser C, Ghani AC, et al. CD4 cell counts of 800 cells/mm3 or greater after 7 years of highly active antiretroviral therapy are feasible in most patients starting with 350 cells/mm3 or greater. J Acquir Immune Defic Syndr. 2007;45: 183–92. pmid:17414934
  41. 41. Sempa JB, Kiragga AN, Castelnuovo B, Kamya MR, Manabe YC. Among patients with sustained viral suppression in a resource-limited setting, CD4 gains are continuous although gender-based differences occur. PLoS One. 2013;8: e73190. pmid:24013838
  42. 42. Lawn SD, Myer L, Bekker L-G, Wood R. CD4 cell count recovery among HIV-infected patients with very advanced immunodeficiency commencing antiretroviral treatment in sub-Saharan Africa. BMC Infect Dis. 2006;6: 59. pmid:16551345
  43. 43. Nakanjako D, Kiragga AN, Musick B, Yiannoutsos C, Wools-Kaloustian K, Diero L, et al. Frequency and impact of suboptimal immune recovery on first-line antiretroviral therapy (ART) within the IeDEA-East Africa cohort. AIDS. 2016. pmid:26959510
  44. 44. Kulkarni H, Okulicz JF, Grandits G, Crum-Cianflone NF, Landrum ML, Hale B, et al. Early postseroconversion CD4 cell counts independently predict CD4 cell count recovery in HIV-1-postive subjects receiving antiretroviral therapy. J Acquir Immune Defic Syndr. 2011;57: 387–95. pmid:21546844
  45. 45. Williams BG, Korenromp EL, Gouws E, Schmid GP, Auvert B, Dye C. HIV infection, antiretroviral therapy, and CD4+ cell count distributions in African populations. J Infect Dis. 2006;194: 1450–8. pmid:17054076
  46. 46. Sia D, Onadja Y, Hajizadeh M, Heymann SJ, Brewer TF, Nandi A. What explains gender inequalities in HIV/AIDS prevalence in sub-Saharan Africa? Evidence from the demographic and health surveys. BMC Public Health. 2016;16: 1136. pmid:27809824
  47. 47. Strategies for Management of Antiretroviral Therapy (SMART) Study Group, El-Sadr WM, Lundgren JD, Neaton JD, Gordin F, Abrams D, et al. CD4+ count-guided interruption of antiretroviral treatment. N Engl J Med. 2006;355: 2283–96. pmid:17135583
  48. 48. INSIGHT START Study Group, Lundgren JD, Babiker AG, Gordin F, Emery S, Grund B, et al. Initiation of Antiretroviral Therapy in Early Asymptomatic HIV Infection. N Engl J Med. 2015;373: 795–807. pmid:26192873
  49. 49. TEMPRANO ANRS 12136 Study Group, Danel C, Moh R, Gabillard D, Badje A, Le Carrou J, et al. A Trial of Early Antiretrovirals and Isoniazid Preventive Therapy in Africa. N Engl J Med. 2015;373: 808–22. pmid:26193126
  50. 50. IeDEA and COHERE Cohort Collaborations. Global Trends in CD4 Cell Count at the Start of Antiretroviral Therapy: Collaborative Study of Treatment Programs. Clin Infect Dis. 2018;66: 893–903. pmid:29373672
  51. 51. Fung ICH, Gambhir M, van Sighem A, de Wolf F, Garnett GP. The clinical interpretation of viral blips in HIV patients receiving antiviral treatment: are we ready to infer poor adherence? J Acquir Immune Defic Syndr. 2012;60: 5–11. pmid:22267019
  52. 52. Wei X, Ghosh SK, Taylor ME, Johnson VA, Emini EA, Deutsch P, et al. Viral dynamics in human immunodeficiency virus type 1 infection. Nature. 1995;373: 117–122. pmid:7529365
  53. 53. Patrikar S, Basannar DR, Bhatti VK, Kotwal A, Gupta RM, Grewal RS. Rate of decline in CD4 count in HIV patients not on antiretroviral therapy. Med J Armed Forces India. 2014;70: 134–138. pmid:24843201
  54. 54. Siedner MJ. START or SMART? Timing of Antiretroviral Therapy Initiation and Cardiovascular Risk for People With Human Immunodeficiency Virus Infection. Open forum Infect Dis. 2016;3: ofw032. pmid:26989755
  55. 55. Kassanjee R, Pilcher CD, Keating SM, Facente SN, McKinney E, Price MA, et al. Independent assessment of candidate HIV incidence assays on specimens in the CEPHIA repository. Aids. 2014;28: 2439–2449. pmid:25144218
  56. 56. Grebe E, Welte A, Hall J, Keating SM, Facente SN, Marson K, et al. Infection Staging and Incidence Surveillance Applications of High Dynamic Range Diagnostic Immuno-Assay Platforms. JAIDS J Acquir Immune Defic Syndr. 2017;76: 547–555. pmid:28914669
  57. 57. Verbeke G, Lesaffre E. A Linear Mixed-Effects Model With Heterogeneity in the Random-Effects Population. Source J Am Stat Assoc. 1996;91: 217–221. Available:
  58. 58. Ghosh P, Branco MD, Chakraborty H. Bivariate random effect model using skew-normal distribution with application to HIV-RNA. Stat Med. 2007;26: 1255–1267. pmid:16998836
  59. 59. Komárek A, Lesaffre E. Generalized linear mixed model with a penalized Gaussian mixture as a random effects distribution. Comput Stat Data Anal. 2008;52: 3441–3458.