Figures
Abstract
The development of a safe and efficacious vaccine that provides immunity against all four dengue virus serotypes is a priority, and a significant challenge for vaccine development has been defining and measuring serotype-specific outcomes and correlates of protection. The plaque reduction neutralisation test (PRNT) is the gold standard assay for measuring serotype-specific antibodies, but this test cannot differentiate homotypic and heterotypic antibodies and characterising the infection history is challenging. To address this, we present an analysis of pre- and post-infection antibody titres measured using the PRNT, collected from a prospective cohort of Thai children. We applied four machine learning classifiers and multinomial logistic regression to the titre data to predict the infecting serotype. The models were validated against the true infecting serotype, identified using RT-PCR. Model performance was calculated using 100 bootstrap samples of the train and out-of-sample test sets. Our analysis showed that, on average, the greatest change in titre was against the infecting serotype. However, in 53.4% (109/204) of the subjects, the highest titre change did not correspond to the infecting serotype, including in 34.3% (11/35) of dengue-naïve individuals (although 8/11 of these seronegative individuals were seropositive to Japanese encephalitis virus prior to their infection). The highest post-infection titres of seropositive cases were more likely to match the serotype of the highest pre-infection titre than the infecting serotype, consistent with antigenic seniority or cross-reactive boosting of pre-infection titres. Despite these challenges, the best performing machine learning algorithm achieved 76.3% (95% CI 57.9–89.5%) accuracy on the out-of-sample test set in predicting the infecting serotype from PRNT data. Incorporating additional spatiotemporal data improved accuracy to 80.6% (95% CI 63.2–94.7%), while using only post-infection titres as predictor variables yielded an accuracy of 71.7% (95% CI 57.9–84.2%). These results show that machine learning classifiers can be used to overcome challenges in interpreting PRNT titres, making them useful tools in investigating dengue immune dynamics, infection history and identifying serotype-specific correlates of protection, which in turn can support the evaluation of clinical trial endpoints and vaccine development.
Author summary
Dengue is a viral infection transmitted by mosquitoes that has rapidly spread worldwide over the past 50 years. It is caused by four distinct virus serotypes, and developing a vaccine that protects against all of them is a key goal. However, accurately measuring serotype-specific immune responses and identifying the serotype responsible for past infections remains a significant challenge. The plaque reduction neutralisation test (PRNT) is the gold standard method for measuring serotype-specific antibody responses. However, its cross-reactivity, combined with the complex short-term cross-protective nature of dengue antibodies, currently makes the identification of the infecting serotype challenging. To tackle this, we analysed antibody data from Thai children, collected both before and after dengue infection. By applying machine learning models to this data, we predicted the infecting serotype with an average accuracy of 71% to 80%. This approach improves our understanding of how the immune system responds to different dengue serotypes and has the potential to inform the evaluation of vaccine efficacy in future trials.
Citation: Cracknell Daniels B, Buddhari D, Hunsawong T, Iamsirithaworn S, Farmer AR, Cummings DAT, et al. (2024) Predicting the infecting dengue serotype from antibody titre data using machine learning. PLoS Comput Biol 20(12): e1012188. https://doi.org/10.1371/journal.pcbi.1012188
Editor: Leah C. Katzelnick, National Institute of Allergy and Infectious Diseases, UNITED STATES OF AMERICA
Received: May 22, 2024; Accepted: November 28, 2024; Published: December 23, 2024
Copyright: © 2024 Cracknell Daniels et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Data and code are provided to fit the models using all titre data (scenario A) and using all titre data plus year of infection and age-group (modified scenario B). School and age in years are not published to ensure anonymity. Data and code are available at: https://github.com/bnc19/dengue-serotype-classification-public.
Funding: ID acknowledges funding by the Wellcome Trust (213494/Z/18/Z, https://wellcome.org/grant-funding). BCD acknowledges PhD funding from the UK Medical Research Council (https://www.ukri.org/). ID and BCD acknowledge funding from the MRC Centre for Global Infectious Disease Analysis (reference MR/X020258/1), funded by the UK Medical Research Council (MRC). This UK funded award is carried out in the frame of the Global Health EDCTP3 Joint Undertaking. KA was supported by a grant from the US NIH (1R01AI175941, https://www.nih.gov/grants-funding). DATC was supported by a grant from the US NIH (R01AI175495, https://www.nih.gov/grants-funding). The original cohort study was funded by NIH grant P01 A1034533 and the Military Infectious Disease Research Program (https://www.nih.gov/grants-funding). The funders played no role in study design, data collection and analysis, decision to published or preparation of the manuscript.
Competing interests: I have read the journal’s policy and the authors of this manuscript have the following competing interests: AF declares travel support from Janssen Pharmaceuticals to attend an industry-sponsored symposium related to dengue. DATC declares Merck and Pfizer contracts, US NIH grants and a US NSF grant to his institution for unrelated work. ID declares consultancy to the WHO and Gavi the vaccine alliance.
Introduction
Dengue is an arboviral infection that has expanded globally in the last 50 years, with an estimated 105 million cases annually (95% confidence interval (CI) 95–114) [1]. Despite this, there are currently no specific antiviral treatments or vaccines in widespread use. Dengue is caused by four antigenically distinct virus serotypes (DENV-1-4), which interact immunologically. Infection results in protective and durable homotypic immunity [2–4], although homotypic reinfections may occur [5–7]. Conversely, heterotypic immunity following a primary infection is temporary, and secondary infections are associated with the potential for disease enhancement due to antibody dependent enhancement [8–12] which increases viral replication [13], and antigenic seniority where the secondary immune responses are skewed towards the primary infecting serotype [14]–which is different from the concept of original antigenic sin that has recently evolved from its original meaning [15] to refer to a model in which the immune response is not able to mount a significant de novo response upon infection with a virus that is related to the primary infection. Despite the higher likelihood of disease, a secondary infection also induces a broadly neutralising heterotypic response, to the extent that tertiary and quaternary infections are rarely severe [16,17].
To avoid enhancement, both the dengue vaccines licenced to date, Dengvaxia and Qdenga, aim to induce balanced immunity against all four serotypes. However, achieving and measuring this has proven challenging. While homotypic antibodies correlate with protection [5], heterotypic antibodies may be broadly protective (following the secondary infection) or enhance disease (following the primary infection), due to differences in their avidity, antigenic targets, and concentration [11,12]. The plaque reduction neutralisation test (PRNT), the gold standard for neutralising antibody measurement, cannot differentiate between homotypic and heterotypic antibodies. Consequently, immunogenicity endpoints in vaccine trials rely on total serotype-specific neutralising antibody titres and rates of tetravalent seroconversion, measured using the PRNT or similar neutralisation assays. However, these endpoints have limitations; neutralising titres may be associated with enhancement [9,10] and the commonly used seroconversion threshold (PRNT titre >10) does not correlate with disease protection [10]. Critically, the exact titre for protection will depend on the assay used (including assay conditions), the infecting serotype, and an individual’s prior exposure to dengue and other flaviviruses like Japanese encephalitis virus (JEV) [18–20]
This challenge was highlighted by the phase III trial of Qdenga. In the third year, tetravalent seropositivity was 81% in baseline seronegatives, suggesting broad protective immunity against all serotypes in most recipients [21]. However, vaccine efficacies in year 3 against DENV-1 to -4 were 17%, 85%, 10% and -99%, respectively. A study of the vaccine’s functional response revealed that most of the serotype-specific antibodies targeted DENV-2 [22], suggesting that seronegative protection against the other serotypes might partly arise from a transient heterotypic response against the DENV-2 component of the vaccine. Additionally, the first licensed vaccine, Dengvaxia, is not in widespread use due to the risk of enhanced disease in subjects who were dengue-naïve (seronegative) at the time of vaccination [23]. These results emphasise the need to better understand serotype-specific immunity. While novel assays have been developed to measure serotype-specific antibodies [22,24], they are infeasible for large-scale phase III trials. As such, serology remains central to inferring subclinical infections and estimating vaccine efficacy against infection [25], defining vaccine immunogenicity [26], identifying correlates of protection [27], and studying dengue dynamics in population-level studies [28].
We therefore aimed to test the utility of machine learning classifiers in identifying the infecting serotype from PRNT data collected from a prospective cohort of schoolchildren over five years (1998–2002) in Kamphaeng Phet, a largely rural province in northern Thailand where all four serotypes circulate endemically [29,30]. During the study period, three serotypes (DENV-1, DENV-2, and DENV-3) were observed in 1998, 1999, and 2001, all four serotypes were observed in 2002, and only DENV-2 was observed in 2000. JEV also circulates in this region and JEV vaccination coverage is high [31]. Our analysis of PRNT antibody titres shows that cross-reactive titres, even in DENV seronegative individuals, can complicate the interpretation of serotype-specific titres. Building on work by van Panhuis et al. [32], we trained machine learning classifiers to predict the infecting serotype from pre- and post-infection PRNT titres with high accuracy. We compared these approaches with earlier methods based on multinomial logistic regression (MLR) [32].
Results
Study characteristics
Antibody response data were collected from a prospective cohort study of children across 12 schools in Kamphaeng Phet in northern Thailand, a hyperendemic transmission setting with all four serotypes circulating [30,33]. 204 symptomatic RT-PCR confirmed dengue cases were recorded between 1998 and 2002 from all 12 schools (S1 Fig). Of these, 35 cases were in individuals classified as DENV seronegative prior to infection (pre-infection titres < 10 for all four DENV serotypes), although only four of these cases were classified as primary cases according to their ratio of IgM/IgG antibodies (S1A Fig). The remaining 169 cases were in DENV seropositive individuals (pre-infection titres ≥ 10 to at least one DENV serotype) (S1B Fig). 48.5% of cases had pre-infection titres ≥10 to JEV (including 23/35 of the DENV seronegative individuals). The age of infected children ranged from 7–15 years, with a mean age of 10 years (S1C–S1D Fig).
All four DENV serotypes were observed during the study, with DENV-2 being the most prevalent according to case definition (49%), followed by DENV-3 (32%), DENV-1 (17%) and DENV-4 (2%) (S1 Fig). DENV-3 was the dominant serotype in year one of the study, with DENV-2 dominating after that. Except for DENV-4, which was only observed in 2002 in school 3, the circulation of each serotype was not concentrated to a specific year or school. Multiple serotypes were observed each year, except for 2000 when only DENV-2 was observed. Equally, multiple serotypes were observed in each of the 12 schools, apart from schools 1 and 2 where only DENV-2 was observed (S1 Fig).
Antibody dynamics
Fig 1 shows the average pre- and post-infection log10 reciprocal neutralising PRNT titres, stratified by the infecting serotype. On average, the highest change in titre was against the infecting serotype. Consistent with antibody cross-reactivity, there were rises in neutralising titres against all serotypes, with similar post-infection PRNT titre values against DENV-1, DENV-2 and DENV-3, irrespective of the infecting serotype. Of the 35 cases in DENV seronegative individuals, only one, four or nine had monotypic responses, depending on whether a monotypic response was defined as having a PRNT titre ≥10 to only one serotype [17], a PRNT titre ≥10 to more than one serotype but a PRNT titre ≥80 to only one serotype [29], or a PRNT titre ≥10 to more than one serotype but a PRNT titre five-fold higher to only one serotype [34]. The average post-infection titre in the DENV seronegative group was lower than the DENV seropositive group (mean and standard deviation 1.89, 1.14 versus 2.44, 1.09).
PRNT: plaque reduction neutralisation test. Dengue virus (DENV-1: red, DENV-2: blue, DENV-3: green, DENV-4: dark blue. Japanese encephalitis virus (JEV): orange. RT-PCR: reverse transcriptase polymerase chain reaction. Seronegative individuals were defined as pre-infection titres < 10 for all four DENV serotypes. Seropositive individuals were defined as those with a pre-infection titre ≥ 10 for at least one DENV serotype.
Among the seropositive cases, 29 (17.2%) had the highest neutralising titre tied between two or more serotypes, due to right-censoring of antibody titres at either 2560 or 10240 (Fig 2A). For the remaining seropositive cases, 76.3% had the highest post-infection titre against a non-infecting serotype, and in 75.5% of these, the serotype with the highest post-infection titre was also the one with the highest pre-infection titre (Fig 2A). Thus, in over half of the seropositive cases without tied highest titres, the highest post-infection titre corresponded to the highest pre-infection titre, rather than to the infecting serotype.
(a) The frequency of subjects whose highest post-infection titre was against the infecting serotype, a non-infecting serotype or whose highest titre was tied between one or more serotypes due to maximum PRNT dilution. (b) The percentage of subjects whose highest pre- and post-infection titres were against each serotype and JEV, stratified by the infecting serotype. Tied highest titres are split. Seropositive individuals were defined as those with a pre-infection titre ≥ 10 for at least one DENV serotype. PRNT: plaque reduction neutralisation test. RT-PCR: reverse transcriptase polymerase chain reaction. DENV: dengue virus. JEV: Japanese encephalitis virus.
The mean of the log10 highest pre-infection reciprocal titre was significantly higher (Student’s T test, p value = 0.01) in cases with a non-infecting serotype as the highest post-infection titre (mean and standard deviation 2.6 and 0.7) compared to those whose highest post-infection titre matched the infecting serotype (mean and standard deviation 2.2, 0.6). Fig 2B shows the frequency of the highest neutralised serotype, pre- and post-infection (cases where the highest titre is against the infecting serotype are along the diagonal). Except for a DENV-3 infection, the highest pre-infection titre was never against the infecting serotype (left hand panel). Notably, a high percentage of DENV-2 infections had highest pre- and post-infection titres against DENV-1 and DENV-3. Additionally, a higher frequency of DENV-3 cases had a highest titre against DENV-1 following the DENV-3 infection than before (Fig 2B).
Fig 3 shows the highest PRNT titre change for each case, stratified by the infecting serotype. In 57.4% of seropositive individuals the highest titre changes were against a non-infecting serotype (87/169) or JEV (10/169) (Fig 3A). In seropositive individuals infected with DENV-3 the highest change in titre was equally likely to be against DENV-2 or DENV-3. Surprisingly, in 11/35 DENV seronegative cases, the highest titre change did not correspond with the infecting serotype (Fig 3B), although 8/11 of these DENV seronegative cases were seropositive to JEV prior to infection. Of the 11 cases, 4 DENV-1 infections had the greatest titre change against DENV-3, 5 DENV-2 infections had the greatest titre change against DENV-3, and 2 DENV-3 infections had the greatest titre change against DENV-1. A single DENV-3 infection also had a tied highest post-infection titre against DENV-2 and DENV-3. All other infections in seronegative individuals (i.e., 5/9, 8/13 and 10/13 DENV-1, DENV-2, and DENV-3 infections) had the greatest titre change against the infecting serotype. S1 and S2 Tables present the pre-infection JEV titre and post-infection titres of DENV seronegative cases, showing that many individuals had high post-infection titres to more than one serotype. Together, these findings suggest that although on average, the greatest change in titre is against the infecting serotype (Fig 1), this is frequently not the case, even in individuals who were seronegative to all four DENV serotypes prior to infection.
Seronegative individuals were defined as pre-infection titres < 10 for all four DENV serotypes. Seropositive individuals were defined as those with a pre-infection titre ≥ 10 for at least one DENV serotype. The plot area is proportional to the number of seropositive / seronegative cases. Points indicate a single case. PRNT: plaque reduction neutralisation test. DENV: dengue virus. JEV: Japanese encephalitis virus. RT-PCR: reverse transcriptase polymerase chain reaction.
Relationship between timing of titre measurement and change in PRNT titre
The time between measurement of pre- and post-infection titres and the date of infection varied within the study, ranging from 9–281 days (i.e., 0–9 months), with a single outlier of 584 days between measurement of the pre-infection titre and a DENV-3 infection (S2 Fig). The median time from pre-infection titre measurement to infection was 5 months, while the median time from infection to post-infection titre was 6 months. S3A Fig shows how the average change in log10 PRNT titre tends to increase as the time between measurement of the pre-infection titre and infection increases, suggesting the decay of antibodies induced against a past infection is associated with a greater increase in antibody titre against the subsequent infection. Conversely, the average change in titre decreased as time between infection and measurement of post-infection titre increased, consistent with decaying titres (S3B Fig).
Predicting the infecting serotype
Fig 4 shows an overview of the algorithm used to develop and validate the performance of four machine learning classifiers (random forest RF, gradient boosting machines GBM, artificial neural network ANN, and support vector machines SVM) compared to MLR in predicting the infecting serotype from neutralising antibody titres (including the pre- and post-infection titres, Fig 1, the change in titre, Fig 3, and the number of days between infection and titre measurement, S3 Fig). These machine learning classifiers were chosen given their ability to model complex-non-linear relationships making them well suited for cross-reactive antibody dynamics data. All machine learning classifiers and MLR were trained and validated on the same set of predictor variables. We considered four scenarios: scenario A used all titre measurements as predictor variables and predicted all cases; scenario B used all titre measurements plus year of infection and the age, and school of the individuals and predicted all cases; scenario C used post-infection DENV titres as predictor variables and predicted all cases; and scenario D used all titre measurements as predictor variables and predicted only DENV seropositive cases (pre-infection titre ≥10 to at least one serotype) (S3 Table).
For each scenario and each classifier (1), the data were randomly split into 90% train and 10% test sets (2). The train data were pre-processed and the model hyperparameters were tuned using leave-one-out-cross-validation (3–4). Performance on the train data was calculated (5) and the set of hyperparameters with the highest kappa statistic were chosen for the final model (6). This model was then applied to the test data (7) and the test performance metrics were calculated (8). Steps 2–8 were repeated 100 times using bootstrap samples of the test and train sets, and the mean and 95% CI were calculated for test and train performance metrics (9). Scenario A: all titre predictor variables (pre- and post-infection PRNT titres and change in titre against all four dengue virus serotypes and Japanese encephalitis virus, and the number of days between measurement of the pre- and post-infection titres and the date of infection). Scenario B: all predictor variables (titre predictor variables plus age, year, and school). Scenario C: post-infection PRNT titre of the dengue serotypes were predictor variables. Scenario D: all titre predictor variables but only predicted seropositive cases. Seropositive individuals were defined as those with a pre-infection titre ≥ 10 for at least one DENV serotype. CI: confidence interval. PRNT: plaque reduction neutralisation test.
Fig 5 shows the mean and 95% CI of the performance of each classification model and MLR across all four scenarios. In scenario A, which utilised all titre predictor variables (S3 Table), the best-performing classifier was GBM, achieving an accuracy of 76.3% (95% CI 57.9–89.5%) on the test set. The kappa statistic was 0.59 (95% CI 0.25–0.83). Training performance was similar (accuracy 74.6%, 95% CI 72.7–77.8% and kappa statistic 0.58, 95% CI 0.54–0.64), suggesting that the models are not overfitting to the training data (Fig 5). The most important predictor variables were the change in DENV-1 titre pre- and post-infection, the post-infection DENV-1 and DENV-3 titres and the change in DENV-2 and DENV-3 titres pre- and post-infection (S4A Fig).
(a) Performance of each model in predicting the infecting serotype across four scenarios using a 90/10 train/test split of the data. Scenario A and scenario D predictor variables were pre- and post-infection PRNT titres against DENV 1–4 and JEV, change in PRNT titre and number of days between infection and measurement of pre- and post-infection titres. Scenario B used the same predictor variables as A and C, plus year, age, and school. Scenario C predictor variables were post-infection PRNT against DENV 1–4. Scenario’s A, B, and C predicted all cases, scenario D predicted cases in seropositive individuals only. Seropositive individuals were defined as those with a pre-infection titre ≥ 10 for at least one DENV serotype. (b) Comparison of different train/test proportions (90/10, 80/20, 70/30, 60/40, 50/50) for predicting the infecting dengue virus serotype (scenario A), using the GBM classification model. For each performance metric, the mean (point) and 95% confidence interval (error bar) were calculated using 100 bootstrap samples of the test and train sets. Test performance was calculated by predicting on the hold-out-sample. Train performance was calculated using leave-one-out cross validation. ANN: artificial neural network. MLR: multinomial logistic regression. SVM: support vector machine. GBM: gradient boosting machine. RF: random forest. PRNT: plaque reduction neutralisation test. DENV: dengue virus.
The accuracy and kappa statistic of MLR on the training set were 65.8% (95% CI 63.0–68.4%) and 0.46 (95% CI 0.41–0.50), significantly lower than all the machine learning classifiers training set performances (Fig 5). The performance of the machine learning classifiers on the test set was also better than MLR, however this difference was not significant, due to the wide 95% CI of the estimates of model performance (Fig 5A). To investigate the wide 95% CI estimates of model performance on the test set, we fit the GBM model to the scenario A predictors using different train/test proportions (Fig 5B). Increasing the size of the test set from 10% to 50% reduced uncertainty in the performance estimates, however, to maximise the data available to train the models given the small size of the dataset, a 90%/10% train and test split was used in the main analysis.
In scenario B, we investigated whether the predictive performance could be further improved by including spatiotemporal variables in the model (i.e., the subjects age and school, and the year of infection). In this scenario, the best performing classifier was again GBM, with a test accuracy and kappa statistic of 80.6% (95% CI 63.2–94.7%) and 0.67 (95% CI 0.36–0.91), respectively (Fig 5A). The train kappa statistic was 0.70 (95% CI 0.66–0.75), a significant increase compared to the train kappa statistics in scenario A. The most important predictor variables in scenario B were the year, change in DENV-1, DENV-2, and DENV-3 titre pre- and post-infection and the post-infection DENV-1 titre (S4B Fig).
In scenario C, only the post-infection DENV PRNT titres were used as predictor variables (Fig 5A). As expected, the overall test performance was lower than when using all predictors. The ANN classifier performed the best, with a test accuracy and kappa statistic of 71.7% (95% CI 57.9–84.2) and 0.48 (95% CI 0.19–0.72), respectively. Again, the machine learning classifiers outperformed MLR on the training and test sets, despite the large uncertainty in the performance estimates on the test set. Finally, the performance in scenario D was slightly lower than in scenario A and subject to greater uncertainty, reflecting the smaller dataset (N = 169) and prediction of only seropositive cases. Across all four scenarios, the GBM classifier performed the best.
Table 1 shows the class performance of MLR and the machine learning models on the test set for predicting the infecting serotype in scenario A. Notably, the model sensitivity was higher for DENV-2 and DENV-3 than DENV-1, which is likely linked with the serotype-specific prevalence of infection observed in the test set, equal to 52.6%, 31.6%, and 15.8%, respectively. The SVM had the highest sensitivity against DENV-1 (65%), followed by MLR (61%). Conversely, the sensitivity of MLR against DENV-3 was notably lower than the machine learning models (49% vs. 54–66%). The highest positive predictive values (PPV) were achieved by GBM and ANN for DENV-1, SVM for DENV-2, and RF and GBM for DENV-3. The negative predictive values (NPV) were generally similar across classifiers, for each serotype (Table 1). The inclusion of the spatio-temporal variables (scenario B) increased the predictive performance against all serotypes. For instance, the sensitivity of GBM increased from 53%, 90%, and 66% to 58%, 93%, and 71% for DENV-1, DENV-2, and DENV-3, respectively (S4 Table). Therefore, it will be important to tailor the classification method and variables used to the specific setting.
For each performance metric, the mean and 95% confidence interval were calculated using 100 bootstrap samples of the hold-out-sample (10%). There were no DENV4 cases in the test set. ANN: artificial neural network. MLR: multinomial logistic regression. SVM: support vector machine. GBM: gradient boosting machine. RF: random forest. PRNT: plaque reduction neutralisation test. DENV: dengue virus. JEV: Japanese encephalitis virus. PPV: positive predictive value. NPV: negative predictive value.
To further understand the class imbalance, Fig 6 presents the predicted and true infecting serotype in cases misclassified by the GBM model in scenario A. The most frequent misclassification combination was a DENV-3 infection predicted to be DENV-2, which may be because the greatest change in PRNT titre following a DENV-3 infection was equally likely to be DENV-2 as DENV-3 (Fig 2). DENV-3 cases misclassified as DENV-2 had lower pre-infection DENV-2 titres compared to the correctly identified cases, which resulted in a greater average DENV-2 titre increase following the DENV3 infection. 11.1% of misclassifications occurred in cases where the highest post-infection titre was tied between the predicted and true infecting serotype. Of the remaining misclassifications, the titre of the infecting serotype was higher than the predicted serotype in 65.4% of the misclassified cases (Fig 6).
N = the number of times a case was misclassified out of 100 bootstrap samples of the 10% test set. Predictor variables were pre- and post-infection PRNT titres against DENV-1-4 and JEV, change in PRNT titre and number of days between infection and measurement of pre- and post-infection titres (scenario A). PRNT: plaque reduction neutralisation test. DENV: dengue virus. JEV: Japanese encephalitis virus.
Sensitivity analysis
In a sensitivity analysis, we set any titre value above 2560 to 2560, to investigate a discrepancy in the maximum PRNT dilution. Although the distribution of titres was similar to the main analysis (Figs 1 and S5A), 47 seropositive cases had tied highest titre values, compared to 29 in the main analysis (Figs 2A and S5B). Notably, this led to a reduction in classification performance, with a test kappa statistic of 0.47 (95% CI 0.12–0.78) in scenario A (S5C Fig).
Discussion
Despite the development of multiple dengue vaccines, the lack of standardised interpretation of neutralising antibodies still hinders the identification of correlates of protection. Whilst the primary endpoint of dengue vaccine trials has so far focussed on virologically confirmed disease or hospitalisation, the availability of methods that can identify subclinical infections can pave the way to the assessment of serotype-specific protection against infection. Here, we trained machine learning classification models to predict the infecting DENV serotype from pre- and post-infection neutralising antibody titres, achieving a mean accuracy of 76.3% using the GBM classifier. When additionally including information on the individual’s school, age and year of infection, the accuracy increased to 80.6% on average.
Consistent with antigenic seniority [14] and/or cross-reactive boosting of pre-infection titres, we found that 57.6% of the highest post-infection titres of seropositive cases correspond with the serotype of the highest pre-infection titre, rather than to the infecting serotype. Despite this, a classification model trained only on post-infection DENV titres was able to predict the infecting serotype with 71.7% accuracy, on average. This is particularly relevant to epidemiological studies where these models can be used to identify the serotype of secondary infections using post-infection serological samples only.
Very few of the cases in DENV seronegative individuals exhibited a monotypic response [17,29,34], implying that a higher specificity (e.g., PRNT-70 or -90) may be needed to disentangle cross-reactive antibodies, even in primary cases. However, the highest post-infection antibody titres of 11/35 DENV seronegative individuals were against a non-infecting serotype. Most (8/11) of these cases were seropositive to JEV prior to infection, implying that pre-existing JEV antibodies, previously associated with an increased risk of symptomatic dengue [31], can also complicate the interpretation of serotype-specific PRNT titres [35,36]. Other explanations include the antigenic overlap between serotypes, for instance the highest titre following multiple DENV-1 infections was DENV-3 and vice-versa, which are phylogenetically the most similar [7]. However, previous work suggests that the highest neutralising response following infection in DENV seronegative individuals is against the infecting serotype [15]. An alternative explanation is that the seronegative individuals had prior dengue exposure, but their antibody levels decayed below the limit of detection (seroreversion), causing their misclassification as seronegative. In support of this, only 4/35 cases classified as DENV seronegative prior to infection were also defined as primary infections based on their IgM/IgG ratio.
The neutralisation assay conditions and assay variability may also impact the measurement of serotype-specific antibody titres [18]. Ensuring that the reference DENV strain used in the PRNT matches the circulating strains, as well as the use of mature virions and cells grown in appropriate substrates, will improve the measurement of serotype-specific neutralising antibodies [19,37] and hence the performance of the classification models presented in this study.
The classification models developed in this study can be extended to investigate the association between sequences of serotype-specific infections and disease protection or enhancement [38]. Furthermore, these models could be used to identify subclinical infections within vaccine trials using serology data, allowing the estimation of serotype-specific vaccine efficacy against infection which has the potential to alter serotype transmission dynamics [39]. Additionally, they could be used to investigate the prevalence of homotypic reinfection, which may be more frequent than previously realised, especially if asymptomatic [6,40,41].
In general, the most important predictor variables were the post-infection titre and change in titre following infection against DENV-1, DENV-2, and DENV-3. We further found that inclusion of the infection year, participant age, and school as predictor variables increased the predictive performance of the classification models, with year being the most important predictor variable when included. As these predictor variables are specific to the current cohort, their inclusion in the model and the use of a single dataset reduces the generalisability, highlighting the need for more individual-level serological data from epidemiological studies and vaccine trials. Spatiotemporal variables may also be less informative in urban settings with high levels of human movement, both nationally and internationally, compared to the largely rural setting of this current study. However, the high performance of the classifiers in this study’s setting, where all four serotypes are endemic, suggests that these classifiers will also perform well in other high transmission settings. Additionally, they may perform even better in settings where fewer serotypes are circulating.
This study builds on previous work by Van Panhuis et al. [32] who used MLR to infer the infecting serotype from neutralising antibody titre data. We found that machine learning classifiers perform better on average than MLR, reflecting the ability of classifiers like GBM and ANN to model the complex, non-linear biological interactions that influence antibody dynamics. However, this comes with the cost of reduced model simplicity and interpretability compared to the MLR model. Additionally, there was wide variability of the test set performance, attributable to the small sample size of the dataset. Machine learning is expected to outperform MLR on larger datasets, owing to its flexibility in describing complex relationships, but this analysis also shows that to draw definitive conclusions, machine learning algorithms require more data, which is a challenge in the context of collecting serotype-specific serological data. MLR and the classifiers showed variable performance against each serotype, reflecting the class imbalance of the data. Future work predicting the infecting serotype from larger or multiple data sets with more equal representation of each serotype is likely to improve overall performance, as well as the performance against each serotype.
A further limitation is the inconsistent maximum dilutions used for the PRNT. We addressed this in a sensitivity analysis by setting any titre values >2560 to 2560, which resulted in a loss of predictive performance. To capture the full range of antibody dynamics, especially in seropositive infections, serial dilutions should therefore be performed to at least 1:10240 or criteria pre-specified, dictating the sequence with which different dilutions should be completed. Finally, the date between measurement of pre- and post-infection titres and the date of infection ranged from 0–9 months. According to our current understanding of the duration of cross-protection against infection, which varies between 2–3 months according to heterotypic challenge studies conducted by Sabin et al. [3], or estimated at ~2 years by Reich et al. (against infection or clinical disease) [42] it is possible that subclinical infections or boosting of titres may have occurred for some children within that time, further complicating the interpretation of PRNT titres. We included time from titre measurement to infection as predictor variables, increasing performance, but could not address this further.
This analysis compared the performance of four machine learning classifiers and MLR in predicting the infecting serotype from neutralising antibody titres. Machine learning classifiers predicted the infecting serotype with high accuracy, overcoming challenges in interpreting cross-reactive PRNT titres, in both seropositive and seronegative cases. If applied to longitudinal antibody titre data, the models developed here can be used to predict sequential infections, subclinical infections, and homotypic reinfections, making them important tools for our understanding of dengue immunity, disease dynamics and to help identify serotype-specific immune correlates of vaccine induced protection.
Methods
Data collection
Antibody response data were obtained from a prospective cohort study conducted on children in grades 1–5 in Kamphaeng Phet, Thailand, as previously described [43]. Briefly, the study enrolled children from 12 schools in 1998, with new first graders enrolled each January until 2002. Active case surveillance for acute dengue illness ran from June to November, during which three blood samples (June, August, and November) were collected for dengue serology. Village healthcare workers visited absent students to check for fever within seven days or a temperature >38°C, and a referral to the public health clinic was made, if necessary, where an acute illness blood sample was obtained. Children who reported directly to the public health clinic or were admitted to the hospital were also evaluated for acute dengue illness.
Acute dengue infection was confirmed through dengue serology (immunoassays against DENV-1-4 and Japanese encephalitis virus) or detection of the virus [43]. Confirmation of the infecting serotype was performed through RT-PCR within six days of acute DENV illness. PRNT titres were measured from pre- and post-infection blood samples of children with confirmed dengue infection. All PRNT titres were performed in the same lab using the same protocol [43]. A monolayer of LLC-MK2 cells were infected with a constant amount of DENV, in the presence of four-fold serial dilutions of heat-inactivated plasma from patients (1:10–1:2560 or 1:10–1:10240). The viral strains used in the PRNT were DENV-1 (16007), DENV-2 (16681), DENV-3 (16562), and DENV-4 (1036), and JEV (Vaccine strain; SA 14-14-2). PRNT50 were calculated using probit regression and reported as the reciprocal titre.
Informed assent and consent were obtained from all participants or guardians. Only de-identified secondary data was used in this present study.
Model development and validation
We compared the performance of four non-parametric classifiers (RF, GBM, ANN, and SVM) to MLR in predicting the infecting DENV serotype from neutralising antibody titres.
RF and GBM are ensemble decision trees, hierarchical structures which break down the classification process into a series of sequential conditions. RF builds many trees from bootstrap training data samples, so each tree is grown on random subsets of the data, with a random subset of variables considered at each decision node. Each tree provides a class vote to classify observations. The class with the majority votes is selected for the final prediction. By combining the predictions of many de-correlated trees, RF reduces variance and the risk of overfitting, improving predictive performance. Whereas RF builds many trees simultaneously, GBM builds decision trees sequentially. Starting with a weak learner (i.e., a tree with only a few splits), GBM aims to increase the performance with each new tree, by fitting to observations misclassified by the previous trees. The final model is a linear combination of many trees, that provide a class vote, like RF. GBM can reduce both the bias and variance of single decision trees; however, they may require more data than RF.
Support vector classifiers create linear boundaries, or hyperplanes, to divide the data into different classes. The optimal hyperplane is found by optimising the margin between the plane and the nearest training observations. SVM extends this, by using kernels to handle non-linear decision boundaries, which enlarge the feature space into many dimensions. This makes SVM effective in high dimensional space, although they can underperform with noisy data. A one-versus-one approach is used to classify more than two classes.
ANNs are inspired by the neural networks in the brain and compromise of node layers, which are analogous to neurons. The output layer (the classification) is modelled from the input layer (the predictor variables) via an intermediary layer of hidden nodes. In this analysis, a single hidden layer neural network was implemented. The hidden layer derives non-linear transformations from the input layer. Weight decay is used to regularise ANN models, which are otherwise prone to over-fit due to the high number of estimated parameters (intercepts and weights). Fitting multiple independent models and aggregating their predictions also reduces variance.
The outcome variable was the infecting serotype (DENV-1-4), identified through RT-PCR. The titre predictor variables were the pre- and post-infection PRNT titres, change in titre and time between infection and measurement of titres (S3 Table). Additionally, the year of infection and the age and school of the participant were considered as predictor variables. Titres recorded as “<10”, “>2560” or “>10240” were set to 0, 2560 and 10240, respectively. All PRNT titres were log10 transformed after adding 1, to obtain a normal distribution. For SVM and ANN, the log-transformed titres were centred and scaled.
Fig 4 presents an overview of model development and validation. For each classifier, four modelling scenarios were investigated. Scenario A used all titre measurements as predictor variables and predicted all cases, scenario B used all titre measurements plus year of infection and the age, and school of the individuals and predicted all cases, scenario C used post-infection DENV titres as predictor variables and predicted all cases, and scenario D used all titre measurements as predictor variables and predicted only seropositive cases (pre-infection titre ≥10 to at least one serotype) (S3 Table). We did not consider feature selection due to the small number of predictor variables.
Model performance was evaluated on out-of-sample datasets by splitting the dataset, so the model was trained on 90% of the data (train set) and validated on 10% of the data (test set), while preserving the overall serotype distribution within both sets. The training set was used to tune the hyperparameters of each classifier using leave-one-out-cross-validation (S5 Table). The hyperparameter combination that gave the best performance, measured using the kappa statistic, was chosen to predict the infecting serotypes of the test set. In total, 100 estimates of the train and test performance were obtained using 100 bootstrap samples of the test and train sets, from which we calculated the mean and 95% CI (2.5–97.5 percentiles). The analysis was conducted through the caret package [44], using R studio version 1.3.1093 [45]. The packages for each classifier are nnet [46], kernlab [47], gbm [48], ranger [49].
Sensitivity analyses
A sensitivity analysis was conducted to ascertain the impact of different train and test splits on model performance and uncertainty. To this end, the GBM algorithm was applied to scenario A using the following test and train splits: 10%/90% (baseline), 20%/80%, 30%/70%, 40%/60%, 50%/50%.
Finally, the maximum dilution for the PRNT of some antibody samples was 1:2560, whilst for others it was 1:10240. No samples collected following a DENV-2 infection were diluted above 1:2560. To investigate whether the different dilutions impacted the classification performance, we conducted a sensitivity analysis where we set any sample greater than 2560 to 2560.
Supporting information
S1 Fig. Distribution of cases by serotype observed in Kamphaeng Phet, Thailand across study year, school and participants age, in seronegative individuals (a, c and e), and seropositive individuals (b, d and f).
Seropositive individuals are defined as those with a pre-infection titre ≥ 10 for at least one DENV serotype. DENV: dengue virus. RT-PCR: reverse transcriptase polymerase chain reaction.
https://doi.org/10.1371/journal.pcbi.1012188.s001
(TIFF)
S2 Fig. Distribution of time (months) between infection and the measurement of a) pre-infection PRNT and b) post-infection PRNT, and c) between pre-infection titre measurement and post-infection titre measurement.
Boxes represent the 25th to the 75th percentile of each group’s distribution of values, the horizonal line represents the median, and the upper and lower whiskers denote the most extreme values within 1.5 interquartile range of the 25th and 75th percentile of each group. Data beyond the whiskers are dented outliers and are plotted individually as points. PRNT: plaque reduction neutralisation test.
https://doi.org/10.1371/journal.pcbi.1012188.s002
(TIF)
S3 Fig. Change in log PRNT as a function of the number of days from the measurement of a) pre-infection PRNT to infection and b) infection to post-infection PRNT.
Points are the individual changes in titre (N = 204) and line and shaded region are the mean and 95% confidence interval of a linear regression, fit using the least squares method. A single outlier of 584 days between measurement of the pre-infection titre and a DENV-3 infection was removed to aid visualisation. PRNT: plaque reduction neutralisation test. DENV: dengue virus. RT-PCR: reverse transcriptase polymerase chain reaction.
https://doi.org/10.1371/journal.pcbi.1012188.s003
(TIF)
S4 Fig. Relative importance (0–100) of each predictor variable in predicting the infecting serotype using the GBM and RF classification models (colours) in (a) modelling scenario A and (b) modelling scenario B.
Variable importance of each model in predicting the infecting serotype was calculated using leave-one-out cross validation on the full dataset. Scenario A: all titre predictor variables (pre- and post-infection PRNT titres and change in titre against all four-dengue virus (DENV) serotypes and Japanese encephalitis virus (JEV), and the number of days between measurement of the pre- and post-infection titres and the date of infection). Scenario B: titre predictor variables plus age, year of infection, and school. GBM: gradient boosting machine. RF: random forest. PRNT: plaque reduction neutralisation test.
https://doi.org/10.1371/journal.pcbi.1012188.s004
(TIF)
S5 Fig. Impact of a maximum PRNT dilution of 1:2560 on antibody dynamics and model predictive performance.
For any samples where the maximum dilution was 1:10240, rather than 1:2560, titre values with a reciprocal titre >2560 were set to 2560. (a) Mean (point) ± standard deviation (error bar) of the pre- and post-infection log PRNT titre, by infecting DENV serotype as quantified by RT-PCR. (b) The frequency of subjects whose highest post-infection titre was against the infecting serotype, a non-infecting serotype or whose highest titre was tied between one or more serotypes due to maximum PRNT dilution. (c) Predictive performance of the gradient boosting machine for predicting the infecting DENV serotype. Scenario A (left panel) predictor variables are pre- and post-infection PRNT titres against DENV 1–4 and JEV, change in PRNT titre and number of days between infection and measurement of pre- and post-infection titres. Scenario C (right panel) predictor variables post-infection PRNT against DENV 1–4. The mean (point) and 95% confidence interval (error bar) of each performance metric are calculated from 100 bootstrap samples. Test performance is calculated by predicting on the hold-out-sample. Train performance is calculated using leave-one-out cross validation. PRNT: plaque reduction neutralisation test. DENV: dengue virus. JEV: Japanese encephalitis virus. RT-PCR: reverse transcriptase polymerase chain reaction.
https://doi.org/10.1371/journal.pcbi.1012188.s005
(TIF)
S1 Table. Individual pre- (JEV) and post-infection neutralising antibody titres in DENV seronegative children whose highest post-infection titre did not match the infecting serotype.
Note, DENV seronegative children are by definition those with a pre-infection titre <10 for all four serotypes. DENV: dengue virus. JEV: Japanese encephalitis virus.
https://doi.org/10.1371/journal.pcbi.1012188.s006
(DOCX)
S2 Table. Individual pre- (JEV) and post-infection neutralising antibody titres in DENV seronegative children whose highest post-infection titre matched the infecting serotype.
Note, DENV seronegative children are by definition those with a pre-infection titre <10 for all four serotypes. DENV: dengue virus. JEV: Japanese encephalitis virus.
https://doi.org/10.1371/journal.pcbi.1012188.s007
(DOCX)
S3 Table. Modelling scenarios investigated.
Seropositive individuals are classified as those with pre-infection titre ≥ 10 for at least one DENV serotype. DENV: dengue virus. JEV: Japanese encephalitis virus. PRNT: plaque reduction neutralisation test.
https://doi.org/10.1371/journal.pcbi.1012188.s008
(DOCX)
S4 Table. Mean (95% confidence interval) test class performance of regression and machine learning models for predicting the infecting DENV serotype using all titre predictor variables, plus age, school, and year of infection (Scenario B).
For each performance metric, the mean and 95% confidence interval were calculated using 100 bootstrap samples of the hold-out-sample (10%). There were no DENV4 cases in the test set. ANN: artificial neural network. MLR: multinomial logistic regression. SVM: support vector machine. GBM: gradient boosting machine. RF: random forest. DENV: dengue virus.
https://doi.org/10.1371/journal.pcbi.1012188.s009
(DOCX)
S5 Table. Description of the hyperparameters used to tune each machine learning classifier.
https://doi.org/10.1371/journal.pcbi.1012188.s010
(DOCX)
Acknowledgments
Material has been reviewed by the Walter Reed Army Institute of Research. There is no objection to its presentation and/or publication. The opinions or assertions contained herein are the private views of the author, and are not to be construed as official, or as reflecting true views of the Department of the Army or the Department of Defense. The investigators have adhered to the policies for protection of human subjects as prescribed in AR 70–25.
References
- 1. Cattarino L, Rodriguez-Barraquer I, Imai N, Cummings DAT, Ferguson NM. Mapping global variation in dengue transmission intensity. Science Translational Medicine. 2020;12(528):1–11. pmid:31996463
- 2. Guzman MG, Alvarez M, Rodriguez-Roche R, Bernardo L, Montes T, Vazquez S, et al. Neutralizing antibodies after infection with dengue 1 virus. Emerging Infectious Diseases. 2007;13(2):282–6. pmid:17479892
- 3. Sabin AB. Research on dengue during World War II. The American Journal of Tropical Medicine and Hygiene. 1952 Jan 1;1(1):30–50. pmid:14903434
- 4. Imrie A, Meeks J, Gurary A, Suhkbaatar M, Truong TT, Cropp CB, et al. Antibody to dengue 1 detected more than 60 years after infection. Viral Immunol. 2007 Dec;20(4):672–5. pmid:18158740
- 5. Buddhari D, Aldstadt J, Endy TP, Srikiatkhachorn A, Thaisomboonsuk B, Klungthong C, et al. Dengue virus neutralizing antibody levels associated with protection from infection in Thai cluster studies. PLOS Neglected Tropical Diseases. 2014 Oct 16;8(10):e3230. pmid:25329173
- 6. Waggoner JJ, Balmaseda A, Gresh L, Sahoo MK, Montoya M, Wang C, et al. Homotypic dengue virus reinfections in Nicaraguan children. The Journal of Infectious Diseases. 2016 Oct 1;214(7):986–93. pmid:26984144
- 7. Katzelnick LC, Fonville JM, Gromowski GD, Arriaga JB, Green A, James SL, et al. Dengue viruses cluster antigenically but not as discrete serotypes. Science. 2015 Sep 18;349(6254):1338–43. pmid:26383952
- 8. Halstead S O’Rourke E. Dengue viruses and mononuclear phagocytes. I. Infection enhancement by non-neutralizing antibody. J Exp Med. 1977 Jul 1;146(1):201–17.
- 9. Katzelnick LC, Gresh L, Halloran ME, Mercado JC, Kuan G, Gordon A, et al. Antibody-dependent enhancement of severe dengue disease in humans. Science. 2017 Nov 17;358(6365):929–32. pmid:29097492
- 10. Salje H, Cummings DAT, Rodriguez-Barraquer I, Katzelnick LC, Lessler J, Klungthong C, et al. Reconstruction of antibody dynamics and infection histories to evaluate dengue risk. Nature. 2018 May;557(7707):719–23. pmid:29795354
- 11. Dejnirattisai W, Jumnainsong A, Onsirisakul N, Fitton P, Vasanawathana S, Limpitikul W, et al. Cross-reacting antibodies enhance dengue virus infection in humans. Science. 2010;328:745–8. pmid:20448183
- 12. De Alwis R, Williams KL, Schmid MA, Lai CY, Patel B, Smith SA, et al. Dengue viruses are enhanced by distinct populations of serotype cross-reactive antibodies in human immune sera. PLOS Pathogens. 2014;10(10):1004386. pmid:25275316
- 13. Zellweger RM, Prestwood TR, Shresta S. Enhanced infection of liver sinusoidal endothelial cells in a mouse model of antibody-induced severe dengue disease. Cell Host & Microbe. 2010 Feb 18;7(2):128–39.
- 14. St. John AL, Rathore APS. Adaptive immune responses to primary and secondary dengue virus infections. Nat Rev Immunol. 2019 Apr;19(4):218–30. pmid:30679808
- 15. Midgley CM, Bajwa-Joseph M, Vasanawathana S, Limpitikul W, Wills B, Flanagan A, et al. An in-depth analysis of original antigenic sin in dengue virus infection. Journal of Virology. 2011 Jan;85(1):410–21. pmid:20980526
- 16. Gibbons RV, Kalanarooj S, Jarman RG, Nisalak A, Vaughn DW, Endy TP, et al. Analysis of repeat hospital admissions for dengue to estimate the frequency of third or fourth dengue infections resulting in admissions and dengue hemorrhagic fever, and serotype Sequences. American Journal of Tropical Medicine and Hygiene. 2007;77(5):910–3. pmid:17984352
- 17. Olkowski S, Forshey BM, Morrison AC, Rocha C, Vilcarromero S, Halsey ES, et al. Reduced risk of disease during postsecondary dengue virus infections. The Journal of Infectious Diseases. 2013 Sep 15;208(6):1026–33. pmid:23776195
- 18. Salje H, Rodríguez-Barraquer I, Rainwater-Lovett K, Nisalak A, Thaisomboonsuk B, Thomas SJ, et al. Variability in dengue titer estimates from plaque reduction neutralization tests poses a challenge to epidemiological studies and vaccine development. PLOS Neglected Tropical Diseases. 2014;8(6):8–10. pmid:24967885
- 19. Bos S, Graber AL, Cardona-Ospina JA, Duarte EM, Zambrana JV, Ruíz Salinas JA, et al. Protection against symptomatic dengue infection by neutralizing antibodies varies by infection history and infecting serotype. Nat Commun. 2024 Jan 9;15(1):382. pmid:38195666
- 20. Thomas SJ, Nisalak A, Anderson KB, Libraty DH, Kalayanarooj S, Vaughn DW, et al. Dengue plaque reduction neutralization test (PRNT) in primary and secondary dengue virus infections: How alterations in assay conditions impact performance. Am J Trop Med Hyg. 2009 Nov;81(5):825–33.
- 21. Rivera L, Biswal S, Sáez-Llorens X, Reynales H, López-Medina E, Borja-Tabora C, et al. Three-year efficacy and safety of Takeda’s dengue vaccine candidate (TAK-003). Clinical Infectious Diseases. 2022 Jul 1;75(1):107–17. pmid:34606595
- 22. White LJ, Young EF, Stoops MJ, Henein SR, Adams EC, Baric RS, et al. Defining levels of dengue virus serotype-specific neutralizing antibodies induced by a live attenuated tetravalent dengue vaccine (TAK-003). PLOS Neglected Tropical Diseases. 2021 Mar 12;15(3):e0009258. pmid:33711074
- 23. Sridhar S, Luedtke A, Langevin E, Zhu M, Bonaparte M, Machabert T, et al. Effect of dengue serostatus on dengue vaccine safety and efficacy. N Engl J Med. 2018 Jul 26;379(4):327–40. pmid:29897841
- 24. Henein S, Adams C, Bonaparte M, Moser JM, Munteanu A, Baric R, et al. Dengue vaccine breakthrough infections reveal properties of neutralizing antibodies linked to protection. J Clin Invest. 2021;131(13):e147066. pmid:34003796
- 25. Salje H, Alera MT, Chua MN, Hunsawong T, Ellison D, Srikiatkhachorn A, et al. Evaluation of extended efficacy of Dengvaxia vaccine against symptomatic and subclinical dengue infection. Nat Med. 2021 Aug;27(8):1395–400.
- 26. Walsh MCR, Alam MS, Pierce KK, Carmolli M, Alam M, Dickson DM, et al. Safety and durable immunogenicity of the TV005 tetravalent dengue vaccine, across serotypes and age groups, in dengue-endemic Bangladesh: a randomised, controlled trial. The Lancet Infectious Diseases. 2024 Feb 1;24(2):150–60. pmid:37776876
- 27. Moodie Z, Juraska M, Huang Y, Zhuang Y, Fong Y, Carpp LN, et al. Neutralizing antibody correlates analysis of tetravalent dengue vaccine efficacy trials in Asia and Latin America. Journal of Infectious Diseases. 2018 Feb 14;217(5):742–53. pmid:29194547
- 28. Hamins-Puértolas M, Buddhari D, Salje H, Cummings DAT, Fernandez S, Farmer A, et al. Household immunity and individual risk of infection with dengue virus in a prospective, longitudinal cohort study. Nat Microbiol. 2024 Jan;9(1):274–83. pmid:38110699
- 29. Endy TP, Nisalak A, Chunsuttitwat S, Vaughn DW, Green S, Ennis FA, et al. Relationship of preexisting dengue virus (DV) neutralizing antibody levels to viremia and severity of disease in a prospective cohort study of DV infection in Thailand. The Journal of Infectious Diseases. 2004 Mar 15;189(6):990–1000. pmid:14999601
- 30. Bhoomiboonchoo P, Gibbons RV, Huang A, Yoon IK, Buddhari D, Nisalak A, et al. The spatial dynamics of dengue virus in Kamphaeng Phet, Thailand. PLOS Neglected Tropical Diseases. 2014 Sep 11;8(9):e3138. pmid:25211127
- 31. Anderson KB, Gibbons RV, Thomas SJ, Rothman AL, Nisalak A, Berkelman RL, et al. Preexisting Japanese Encephalitis Virus Neutralizing Antibodies and Increased Symptomatic Dengue Illness in a School-Based Cohort in Thailand. PLOS Neglected Tropical Diseases. 2011 Oct 4;5(10):e1311. pmid:21991398
- 32. Van Panhuis WG, Gibbons RV, Endy TP, Rothman AL, Srikiatkhachorn A, Nisalak A, et al. Inferring the serotype of dengue virus infections based on pre- and post-infection neutralizing antibody titers. J Infect Dis. 2010 Oct 1;202(7):1002–10.
- 33. Endy TP, Nisalak A, Chunsuttiwat S, Libraty DH, Green S, Rothman AL, et al. Spatial and temporal circulation of dengue virus serotypes: A prospective study of primary school children in Kamphaeng Phet, Thailand. American Journal of Epidemiology. 2002 Jul 1;156(1):52–9. pmid:12076888
- 34. Sasmono RT, Taurel AF, Prayitno A, Sitompul H, Yohan B, Hayati RF, et al. Dengue virus serotype distribution based on serological evidence in pediatric urban population in Indonesia. PLoS Negl Trop Dis. 2018 Jun;12(6):e0006616. pmid:29953438
- 35. Uhuami AO, Lawal N, Bello MB, Imam MU. Flavivirus cross-reactivity: Insights into e-protein conservancy, pre-existing immunity, and co-infection. The Microbe. 2024 Sep 1;4:100105.
- 36. Zambrana JV, Hasund CM, Aogo RA, Bos S, Arguello S, Gonzalez K, et al. Primary exposure to Zika virus is linked with increased risk of symptomatic dengue virus infection with serotypes 2, 3, and 4, but not 1. Sci Transl Med. 2024 May 29;16(749):eadn2199. pmid:38809964
- 37. Katzelnick L, Odio C, Daag J, Crisostomo MV, Voirin C, Escoto AC, et al. Dengue virus IgG and serotype-specific neutralizing antibody titers measured with standard and mature viruses are associated with protection. Res Sq. 2024 Apr 12;rs.3.rs–4145863. pmid:38659845
- 38. Aguas R, Dorigatti I, Coudeville L, Luxemburger C, Ferguson NM. Cross-serotype interactions and disease outcome prediction of dengue infections in Vietnam. Scientific Reports [Internet]. 2019;9(1). Available from: pmid:31253823
- 39. WHO. Meeting of the Immunization and Vaccine Related Implementation Research Advisory Committee (IVIR-AC), September 2023 [Internet]. Geneva; 2024. Available from: https://iris.who.int/bitstream/handle/10665/375749/WER9903-eng-fre.pdf
- 40. Forshey BM, Reiner RC, Olkowski S, Morrison AC, Espinoza A, Long KC, et al. Incomplete protection against dengue virus type 2 re-infection in Peru. PLOS Neglected Tropical Diseases. 2016;10(2):1–17.
- 41. Sáez-Llorens X, Biswal S, Borja-Tabora C, Fernando L, Liu M, Wallace D, et al. Effect of the tetravalent dengue vaccine TAK-003 on sequential episodes of symptomatic dengue. The American Journal of Tropical Medicine and Hygiene. 2023 Apr 5;108(4):722–6. pmid:36878213
- 42. Reich NG, Shrestha S, King AA, Rohani P, Lessler J, Kalayanarooj S, et al. Interactions between serotypes of dengue highlight epidemiological impact of cross-immunity. Journal of the Royal Society Interface. 2013;10(86). pmid:23825116
- 43. Endy TP, Chunsuttiwat S, Nisalak A, Libraty DH, Green S, Rothman AL, et al. Epidemiology of inapparent and symptomatic acute dengue virus infection: A prospective study of primary school children in Kamphaeng Phet, Thailand. American Journal of Epidemiology. 2002 Jul 1;156(1):40–51. pmid:12076887
- 44. Kuhn M. Building predictive models in R using the caret package. Journal of Statistical Software. 2008 Nov 10;28:1–26.
- 45. RStudio Team.,. RStudio: Integrated Development for R [Internet]. 2020. Available from: http://www.rstudio.com/
- 46.
Ripley B, Venables W. Modern applied statistics with S [Internet]. New York: Springer; 2002 [cited 2024 Mar 4]. Available from: https://www.stats.ox.ac.uk/pub/MASS4/
- 47. Karatzoglou A, Smola A, Hornik K, Zeileis A. kernlab—An S4 package for kernel methods in R. Journal of Statistical Software. 2004 Nov 2;11(9):1–20.
- 48. Rigdgeway G. Generalized Boosted Models: A guide to the gbm package. 2006.
- 49. Wright M, Ziegler A. ranger: A fast implementation of random forests for high dimensional data in C++ and R. J Stat Softw. 2015;77(1).