
Severity Index for Suspected Arbovirus (SISA): Machine learning for accurate prediction of hospitalization in subjects suspected of arboviral infection

  • Rachel Sippy ,

    Roles Conceptualization, Formal analysis, Methodology, Software, Supervision, Visualization, Writing – original draft, Writing – review & editing

sippyr@upstate.edu (RS); astewart@dir.iai.int (AMSI)

    Affiliations Institute for Global Health and Translational Science, SUNY Upstate Medical University, Syracuse, New York, United States of America, Quantitative Disease Ecology and Conservation Lab, Department of Geography, University of Florida, Gainesville, Florida, United States of America, Emerging Pathogens Institute, University of Florida, Gainesville, Florida, United States of America

  • Daniel F. Farrell,

    Roles Conceptualization, Data curation, Funding acquisition, Project administration, Writing – original draft, Writing – review & editing

    Current address: Penn Medicine Lancaster General Health Family and Community Residency Program, Lancaster General Health, Lancaster, Pennsylvania, United States of America

    Affiliation College of Medicine, MD Program, SUNY Upstate Medical University, Syracuse, New York, United States of America

  • Daniel A. Lichtenstein,

    Roles Data curation, Writing – original draft, Writing – review & editing

    Affiliation College of Medicine, MD Program, SUNY Upstate Medical University, Syracuse, New York, United States of America

  • Ryan Nightingale,

    Roles Data curation, Writing – original draft, Writing – review & editing

    Affiliation College of Medicine, MD Program, SUNY Upstate Medical University, Syracuse, New York, United States of America

  • Megan A. Harris,

    Roles Data curation, Writing – original draft, Writing – review & editing

    Affiliation College of Medicine, MD Program, SUNY Upstate Medical University, Syracuse, New York, United States of America

  • Joseph Toth,

    Roles Data curation, Writing – original draft, Writing – review & editing

    Affiliation College of Medicine, MD Program, SUNY Upstate Medical University, Syracuse, New York, United States of America

  • Paris Hantztidiamantis,

    Roles Data curation, Writing – original draft, Writing – review & editing

    Affiliation College of Medicine, MD Program, SUNY Upstate Medical University, Syracuse, New York, United States of America

  • Nicholas Usher,

    Roles Formal analysis, Writing – original draft, Writing – review & editing

    Affiliation Office of Undergraduate Biology, Cornell University, Ithaca, New York, United States of America

  • Cinthya Cueva Aponte,

    Roles Data curation

    Affiliation Institute for Global Health and Translational Science, SUNY Upstate Medical University, Syracuse, New York, United States of America

  • Julio Barzallo Aguilar,

    Roles Conceptualization, Data curation, Project administration, Resources, Writing – review & editing

    Affiliation Teófilo Dávila Hospital, Ministry of Health, Machala, El Oro Province, Ecuador

  • Anthony Puthumana,

    Roles Data curation, Investigation, Writing – review & editing

    Affiliation College of Medicine, MD Program, SUNY Upstate Medical University, Syracuse, New York, United States of America

  • Christina D. Lupone,

    Roles Conceptualization, Project administration, Writing – review & editing

    Affiliation Institute for Global Health and Translational Science, SUNY Upstate Medical University, Syracuse, New York, United States of America

  • Timothy Endy,

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Supervision, Writing – review & editing

    Affiliations Institute for Global Health and Translational Science, SUNY Upstate Medical University, Syracuse, New York, United States of America, Department of Medicine, SUNY Upstate Medical University, Syracuse, New York, United States of America

  • Sadie J. Ryan,

    Roles Funding acquisition, Writing – review & editing

    Affiliations Quantitative Disease Ecology and Conservation Lab, Department of Geography, University of Florida, Gainesville, Florida, United States of America, Emerging Pathogens Institute, University of Florida, Gainesville, Florida, United States of America

  • Anna M. Stewart Ibarra

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Supervision, Writing – review & editing

sippyr@upstate.edu (RS); astewart@dir.iai.int (AMSI)

    Current address: Inter-American Institute for Global Change Research, Montevideo, Department of Montevideo, Uruguay

    Affiliations Institute for Global Health and Translational Science, SUNY Upstate Medical University, Syracuse, New York, United States of America, Department of Medicine, SUNY Upstate Medical University, Syracuse, New York, United States of America

Abstract

Background

Dengue, chikungunya, and Zika are arboviruses of major global health concern. Decisions regarding the clinical management of suspected arboviral infection are challenging in resource-limited settings, particularly when deciding on patient hospitalization. The objective of this study was to determine if hospitalization of individuals with suspected arboviral infections could be predicted using subject intake data.

Methodology/Principal findings

Two prediction models were developed using data from a surveillance study in Machala, a city in southern coastal Ecuador with a high burden of arboviral infections. Data were obtained from subjects who presented at sentinel medical centers with suspected arboviral infection (November 2013 to September 2017). The first prediction model—called the Severity Index for Suspected Arbovirus (SISA)—used only demographic and symptom data. The second prediction model—called the Severity Index for Suspected Arbovirus with Laboratory (SISAL)—incorporated laboratory data. These models were selected by comparing the prediction ability of seven machine learning algorithms; the area under the receiver operating characteristic curve from the prediction of a test dataset was used to select the final algorithm for each model. After eliminating those with missing data, the SISA dataset had 534 subjects, and the SISAL dataset had 98 subjects. For SISA, the best prediction algorithm was the generalized boosting model, with an AUC of 0.91. For SISAL, the best prediction algorithm was the elastic net with an AUC of 0.94. A sensitivity analysis revealed that SISA and SISAL are not directly comparable to one another.

Conclusions/Significance

Both SISA and SISAL were able to predict arbovirus hospitalization with a high degree of accuracy in our dataset. These algorithms will need to be tested and validated on new data from future patients. Machine learning is a powerful prediction tool and provides an excellent option for new management tools and clinical assessment of arboviral infection.

Author summary

Patient triage is a critical decision for clinicians. Patients with suspected arbovirus infection are difficult to diagnose as symptoms can be vague and molecular testing can be expensive or unavailable. Determining whether these patients should be hospitalized or not can be challenging, especially in resource-limited settings. Our study included data from 543 subjects with a diagnosis of suspected dengue, chikungunya, or Zika infection. Using a machine learning approach, we tested the ability of seven algorithms to predict hospitalization status based on the signs, symptoms, and laboratory data that would be available to a clinician at patient intake. Using only signs and symptoms, we were able to predict hospitalization with high accuracy (94%). Including laboratory data also resulted in highly accurate prediction of hospitalization (92%). This tool should be tested in future studies with new subject data. Upon further development, we envision a simple mobile application to aid in the decision-making process for clinicians in areas with limited resources.

Introduction

Undifferentiated febrile illness is a common clinical scenario in tropical medicine, with a long list of potential pathogens sharing similar symptoms. Arthropod-borne viruses (arboviruses), including dengue virus (DENV), chikungunya virus (CHIKV), and Zika virus (ZIKV), share common mosquito vectors (Aedes aegypti and Ae. albopictus) and often present with fever, rash, myalgias, and arthralgias. Dengue virus is endemic in the tropical Americas, and the emergence of CHIKV in 2013 in Saint Martin and ZIKV in 2015 in Brazil has brought these arboviruses to the forefront of international attention [1–3]. There were over 400,000 cases of dengue fever in Andean Latin America in 2013 [4], with transmission risk expected to increase sharply over the next 50 years [5]. Ecuador in particular has a high burden of arboviral illness, with 86,306 total cases of dengue from 2014–2018 [6–8]. There is also a high prevalence of asymptomatic DENV infections and infections with other arboviruses in coastal Ecuador [1,9]. In 2014, CHIKV was introduced to Ecuador, with 35,555 cases from 2014–2018, followed by the introduction of ZIKV in 2016, with 5,304 cases from 2016–2018 [6–8].

Clinical decision-making in the context of arboviral infection is particularly challenging in resource-limited settings such as Ecuador. For instance, there may be too few healthcare professionals relative to the high disease burden, which may impact the ability to provide optimal patient care. Ecuador has 22 physicians for every 10,000 people, though this ranges by province from 13 to 32 physicians per 10,000 people [10]. This is above the World Health Organization (WHO) benchmark of 1 physician per 1,000 people [11], but physicians are likely concentrated in urban areas. Moreover, molecular diagnostics are often unavailable outside of large urban centers. Of Ecuador's 4,168 healthcare establishments, only 1,045 (25.1%) have a clinical laboratory [10], leaving many healthcare providers in Ecuador without crucial diagnostic tools (i.e. PCR or ELISA). These infrastructural limitations create a challenging clinical environment, especially as healthcare providers need to determine whether a patient with suspected arboviral illness should be hospitalized or not. Efficient and effective triaging is essential for good clinical care in resource-limited settings [12].

Patients with DENV, CHIKV, or ZIKV infections often present with similar symptoms. Fever, lethargy, and arthralgia are common [1,13] and acute febrile illness is a typical manifestation for many patients. In Latin America, DENV, CHIKV, and ZIKV are the three most common infections among acute febrile illness patients [14]. Moreover, co-infection is common if multiple viruses are circulating [13]. Current practice in Ecuador is to hospitalize subjects suspected of dengue infection when they exhibit any of the WHO 2009 dengue warning signs, any signs of shock, or severe thrombocytopenia [15]. While treatment for dengue is supportive, proper inpatient management of severe dengue can reduce mortality dramatically [16,17]. Dengue has a wide spectrum of clinical presentations, with the majority of patients recovering following a self-limited clinical course, and a small percentage progressing to severe disease characterized by plasma leakage. In the latter cases, prompt intravenous rehydration can reduce the case fatality rate to less than 1% [16]. Similarly, management of CHIKV and ZIKV infections is largely supportive, but both infections may result in potentially serious complications, such as adverse neonatal effects [18–21]. Deciding whether to hospitalize a subject with a suspected DENV, CHIKV, or ZIKV infection is thus an important clinical decision, which often must be made before a clear diagnosis has been determined. This decision has other non-clinical and indirect consequences, including the utilization of hospital resources that could otherwise be used for other patients, as well as increasing the financial cost of the case when compared to less costly outpatient care. Globally, an estimated 18% of dengue cases are admitted to the hospital, with 48% managed as outpatients and 34% not seeking medical attention [22].

Machine learning is a tool that combines statistics with computer science to make efficient use of massive data sets [23]. It differs from traditional statistical modeling (e.g. regression models) in that there are fewer assumptions about the underlying distribution of the data and the relationships between variables. While model interpretability is often a goal of traditional statistical models, it is typically a secondary concern in machine learning; the primary goal is to create highly accurate predictions of an outcome of interest, often using as many variables as possible [24,25]. In modeling relationships with a machine learning approach, the computer can exploit connections that are not obvious to human observers to successfully predict an outcome of interest. Machine learning is applicable in many fields and has previously been used in medical applications to estimate clinical risk, guide triage, or diagnose disease [23,26–28]. Clinical applications of machine learning for arboviral illnesses, specifically, have included analysis of patient genomes for dengue prognosis [29], scanning of patient sera for DENV [30] or Zika diagnosis [31], thermal image scanning for detection of hemodynamic shock [32], analysis of body temperature patterns for diagnosis of undifferentiated fever etiology [33], and analysis of patient data for dengue fever diagnosis [27]. No studies have yet attempted to use machine learning to predict hospitalization among arboviral illness or undifferentiated fever patients, although it has been used to predict critical care and hospitalization outcomes based on emergency department triage data in children and adults [34,35].

The objective of this study was to determine if the hospitalization of individuals with suspected arboviral infections could be predicted using subject intake data. This information (i.e. initial clinical details and no diagnostic testing data) replicates the information available to clinicians making the decision whether to hospitalize a patient or not. In this study, we take a retrospective view of arboviral infection management in a tropical city in southern coastal Ecuador using data from an ongoing prospective surveillance study. Using actual clinical practice as a guide, we assessed the ability of seven machine learning algorithms to determine hospitalization using basic symptom and demographic data that was collected via standard intake of subjects with suspected DENV, CHIKV, or ZIKV infections. The machine learning approach and algorithms developed here could potentially support physicians faced with complex clinical management decisions in areas where multiple arboviruses co-circulate, such as Ecuador.

Methods

Ethics statement

This study protocol was reviewed and approved by the Institutional Review Boards at the State University of New York (SUNY) Upstate Medical University, Cornell University, and the Luis Vernaza Hospital in Guayaquil, Ecuador, the Human Research Protection Office of the U.S. Department of Defense, and the Ecuadorean Ministry of Health (MoH). Clinical and demographic data from study subjects were obtained following written informed consent and/or assent (as applicable) as per the study protocol (described previously) [1]. For subjects unable to participate in the consent and/or assent process, an adult representative documented consent. Parents signed a written informed consent for children aged 6 months to 17 years, and children aged 7 to 17 additionally signed a written assent.

Study design and data source

We conducted a retrospective analysis of data from a prospective arbovirus surveillance study, which included subjects (age ≥6 months) recruited from Ecuadorean MoH clinical sites from November 2013 to September 2017 in the city of Machala, Ecuador. Subjects were identified as part of an ongoing, multi-year arbovirus surveillance project, a description of which has been published previously [1]. Briefly, subjects were invited to enroll in the study if they presented at the reference hospital or one of four outpatient clinics and were diagnosed with arboviral infection by MoH physicians. In 2014 and 2015, we recruited subjects who were clinically diagnosed with dengue fever by MoH physicians based on their individual clinical suspicion for DENV infection. We assume that diagnostic standards for each respective infection were similar across study sites, as all physicians receive the same training from the MoH. Following the local emergence of CHIKV (2015) and ZIKV (2016), the inclusion criteria in 2016 and 2017 were expanded to include subjects clinically diagnosed with DENV, CHIKV, or ZIKV infection. At the time of enrollment, subject demographic information, clinical history, and symptoms present during the current illness were collected using a questionnaire administered by trained study personnel. Subjects were asked about symptoms in the past 7 days, including the following: headache, anorexia or nausea, muscle or joint pain, rash, bleeding (defined as bleeding from respiratory, digestive, or genitourinary mucosa), rhinorrhea, vomiting, lethargy or drowsiness, cough, abdominal pain, diarrhea, and retro-orbital pain. Conjunctivitis was later added to the enrollment survey after the emergence of ZIKV but was not included in this analysis. Laboratory data (hematocrit, white blood cell count, neutrophils, lymphocytes, and platelet count) were collected at the time of enrollment from copies of recent laboratory evaluations (for outpatients) or from the first laboratory results on hospital admission (for hospitalized subjects). Additional laboratory data were available for hospitalized subjects, but analysis was limited to the aforementioned parameters, as these were consistently available among a subset of the non-hospitalized subjects. Laboratory arboviral diagnostic data were not included, as these data are often not available at the time a physician decides whether or not to hospitalize a patient, and we utilized only the data that would realistically be available. Data from enrollment surveys were used for the analysis of the non-hospitalized outpatients in the current study. Laboratory data on the hospitalized subjects were verified by review of medical records and managed using REDCap software [36] hosted at SUNY Upstate Medical University.

Exclusion criteria

Hospitalized subjects whose physical medical records could not be located and subjects with incomplete enrollment survey data (i.e. missing hospitalization status, symptom survey questions) were excluded. The subset of the non-hospitalized subjects who had available laboratory data were included in a second analysis with the same hospitalized cohort, all of whom had available laboratory data.

Statistical analysis

The outcome variable was hospitalization status. Variables of interest included demographic data, presenting symptoms, past medical history, and laboratory data (hematocrit, white blood cell count, neutrophils, lymphocytes, and platelet count). A prediction algorithm was developed using demographic and symptom data only (28 total predictors), called the Severity Index for Suspected Arbovirus (SISA, in Spanish the Severidad de Infecciones Sospechosas por Arbovirus). A second prediction algorithm was developed using demographic, symptom, and laboratory data (33 total predictors), called the Severity Index for Suspected Arbovirus with Laboratory (SISAL, in Spanish the Severidad de Infecciones Sospechosas por Arbovirus con datos del Laboratorio). Characteristics for hospitalized and non-hospitalized subjects among these subject groups were compared using a two-sample t-test (continuous) or Fisher’s exact test (categorical).
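As a point of reference, the baseline group comparisons described above can be run with base R functions. This is a minimal sketch, assuming a hypothetical data frame `sisa` with a two-level outcome `hospitalized` and example columns `temperature` and `vomiting`; these names are illustrative placeholders, not the study's variable names.

```r
# Minimal sketch of the baseline comparisons, assuming a data frame `sisa`
# with a two-level factor `hospitalized` ("yes"/"no"), a numeric column
# `temperature`, and a yes/no column `vomiting`; names are illustrative.

# Continuous variable: Welch two-sample t-test (the default for t.test)
t.test(temperature ~ hospitalized, data = sisa)

# Categorical variable: Fisher's exact test on the 2 x 2 contingency table
fisher.test(table(sisa$vomiting, sisa$hospitalized))
```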

In machine learning, 10-fold cross-validation with holdout data provides an unbiased estimate of model validity and accuracy [24,37]. Our datasets were therefore divided by random sampling into training and testing (holdout) datasets. For SISA, the training set was 85% of the full dataset and the testing set was the remaining 15%. For SISAL, the training set was 70% of the full dataset and the testing set was the remaining 30% (the SISAL testing set was allotted a greater percentage to allow for a sufficient sample size). With the training dataset, we used repeated 10-fold cross-validation to estimate the ability of six algorithms with diverse statistical approaches to predict hospitalization: bagged trees (bag) [38], k nearest neighbors regression (knn) [39], random forest [40], elastic net regression [41], generalized boosting models (gbm) [42], and neural networks [43]. Because we had no prior assumptions about the nature of the relationship between the available predictors and the outcome, we used a variety of statistical approaches to improve the likelihood of finding an algorithm that works well with these data. Following a published criticism of machine learning prediction compared to logistic regression [44], we added logistic regression to our list of algorithms (seven total algorithms; model descriptions in S1 Table). For models with tuning parameters (knn, random forest, elastic net, and gbm), tuning was performed using another layer of repeated 10-fold cross-validation [45]. The final model for each algorithm was created using all training data. Each algorithm was then used to predict hospitalization outcomes in the holdout testing dataset, yielding the final performance measure for the model. Model predictions were probabilities that a given subject was hospitalized, with 0.5 used as the probability cut-off for classification as hospitalized.

Measures of discrimination [46], including accuracy, Cohen’s kappa, and the area under the curve (AUC) for the receiver operating characteristic (ROC), were calculated to determine the performance of each algorithm by comparing each algorithm’s classifications to the true outcomes. Accuracy is the percentage of correct classifications out of all classifications made; Cohen’s kappa (potential values: 0–1) also reflects this percentage but compares each algorithm’s performance to classifications made by random guessing [47]. The AUC (potential values: 0–1) considers both true and false positive predictions, with a higher AUC indicating a high true positive rate and a low false positive rate (i.e. the algorithm is sensitive and specific) [46]. For each fold of the cross-validation, performance measures were calculated and averaged across all folds and repetitions (100 preliminary models for algorithms with no tuning parameters), resulting in a mean cross-validation performance measure that estimates how the algorithm will perform on a new dataset. The best algorithm for SISA/SISAL was chosen based on the highest AUC as calculated from the holdout test set. A flow chart of the entire approach is available in S1 Fig. Model residual plots were examined. The relative contribution of each variable to the model (i.e. variable influence or influence on prediction) was calculated using model-specific or non-model-specific methods as appropriate (see the caret [39] documentation for details).
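The split, cross-validation, and holdout workflow described above maps directly onto the caret package used for this analysis. The sketch below illustrates it for one candidate algorithm under assumed names (a data frame `sisa` with a factor outcome `hospitalized`); it is not the authors' published code, which is available in the GitHub repository referenced below.

```r
library(caret)
library(pROC)

set.seed(123)

# Hypothetical inputs: a data frame `sisa` with one row per subject, a factor
# outcome `hospitalized` (levels "no"/"yes"), and the demographic/symptom
# predictors. Object and column names are assumptions, not the study's code.
idx       <- createDataPartition(sisa$hospitalized, p = 0.85, list = FALSE)
train_dat <- sisa[idx, ]
test_dat  <- sisa[-idx, ]

# Repeated 10-fold cross-validation, summarized by ROC (AUC)
ctrl <- trainControl(method = "repeatedcv", number = 10, repeats = 10,
                     classProbs = TRUE, summaryFunction = twoClassSummary)

# Fit one candidate algorithm (generalized boosting model); the others are
# fit the same way by changing `method` (e.g. "treebag", "knn", "rf",
# "glmnet", "nnet", "glm")
fit_gbm <- train(hospitalized ~ ., data = train_dat, method = "gbm",
                 metric = "ROC", trControl = ctrl, verbose = FALSE)

# Final performance on the holdout test set: predicted probabilities with a
# 0.5 cut-off, then accuracy/kappa and AUC
probs <- predict(fit_gbm, newdata = test_dat, type = "prob")[, "yes"]
preds <- factor(ifelse(probs > 0.5, "yes", "no"), levels = c("no", "yes"))
confusionMatrix(preds, test_dat$hospitalized, positive = "yes")
auc(roc(test_dat$hospitalized, probs))
```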
Calibration plots provide a method to graphically evaluate the predictive ability of a prediction model [48]. Subjects in the holdout test set were separated into deciles (SISA) or quintiles (SISAL) and the mean predicted hospitalization probability and proportion of actual hospitalizations were calculated for each decile/quintile. These values were plotted to create calibration plots; the distance of the points from the diagonal (perfect prediction) shows whether the prediction model is over- or under-predicting among certain risk groups [48]. Data analysis and visualization were performed using SAS version 9.2 (SAS Institute, Cary, NC) and R version 3.2.2 (R Foundation for Statistical Computing, Vienna, Austria) in RStudio (RStudio, Inc., Boston, MA) including packages haven [49], caret [39], MASS [43], ipred [38], randomForest [40], elasticnet [41], gbm [42], nnet [43], mgcv [50,51], kernlab [52], glmnet [53], and pROC [54]. Code for the machine learning analyses is available at https://github.com/rsippy/SISASISAL.
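As a rough illustration of how such a calibration plot can be constructed, the sketch below bins holdout predictions into deciles and plots observed against predicted risk; it reuses the hypothetical `probs` and `test_dat` objects from the previous sketch and is not the authors' plotting code.

```r
# Bin holdout subjects into deciles of predicted risk, then compare the mean
# predicted probability with the observed proportion hospitalized in each bin.
bins      <- cut(rank(probs, ties.method = "first"), breaks = 10, labels = FALSE)
pred_mean <- tapply(probs, bins, mean)
obs_prop  <- tapply(test_dat$hospitalized == "yes", bins, mean)

plot(pred_mean, obs_prop, xlim = c(0, 1), ylim = c(0, 1),
     xlab = "Mean predicted probability of hospitalization",
     ylab = "Observed proportion hospitalized")
abline(0, 1, lty = 2)  # dashed diagonal = perfect calibration
```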

We compared the prediction abilities of SISA versus SISAL to assess whether laboratory data could improve our ability to predict subject hospitalization status. Because there may be some selection bias for subjects with available laboratory data (e.g. more severe symptoms, more similar subject data, or different socioeconomic status compared to typical patients with a clinical arboviral diagnosis), the subject groups in SISA and SISAL may not be exchangeable [55]. We performed a sensitivity analysis to determine whether the selected algorithm and prediction ability of SISA are the same when using all SISA subjects or only the SISAL subjects (without laboratory data) for the training and testing steps.

Results

General characteristics

Between November 20, 2013 and September 13, 2017, 592 subjects were recruited into the arboviral surveillance study. After exclusions (Fig 1), 534 subjects were included in the dataset for SISA, of which 59 were hospitalized and 475 were not hospitalized. The SISA training dataset included 455 subjects and the holdout test dataset included 79 subjects. The SISAL dataset included 98 subjects, of which 59 were hospitalized and 39 were outpatients. The SISAL training dataset included 70 subjects and the holdout test dataset included 28 subjects. Demographics and symptoms for the two datasets are shown in Table 1. Presenting temperature was significantly higher, and mucosal bleeding, vomiting, and abdominal pain were significantly more common, among hospitalized subjects in the SISA dataset.

Fig 1. Flow diagram of subject selection.

Subjects clinically diagnosed with arboviral (dengue, chikungunya, Zika) infections were recruited from sentinel clinics in Machala, Ecuador. A subset of subjects were selected to test the Severity Index for Suspected Arbovirus (SISA) and Severity Index for Suspected Arbovirus with Laboratory (SISAL) machine learning algorithms.

https://doi.org/10.1371/journal.pntd.0007969.g001

Table 1. Overview of demographics, symptoms, and laboratory values for subjects enrolled in the study (n = 543).

Numerical data are shown as means and were analyzed with Welch's two-sample t-test. Categorical data are shown as percentages and were analyzed with Fisher's exact test.

https://doi.org/10.1371/journal.pntd.0007969.t001

Prediction of hospitalization status

Accuracy, Cohen’s kappa, and AUC for the training set (from repeated 10-fold cross-validation) and the holdout test set (final performance) are shown in Fig 2. For SISA, using only symptoms and demographics, the generalized boosting model, elastic net, neural networks, and logistic regression performed well on the test set (accuracy: 89.8–96.2%, Cohen’s kappa: 0.00–0.77, AUC: 0.50–0.91). The generalized boosting model had the best final AUC (0.91) on the test dataset and was the second-best algorithm in the training set. The sensitivity for this algorithm was 95.8% and the specificity was 87.5% when predicting hospitalization of subjects in the test dataset. The variables with the greatest influence on the final SISA model were drowsiness, bleeding, vomiting, and temperature. The calibration plot for this prediction is in Fig 3; the SISA model shows under-prediction of hospitalization risk among low-risk groups and over-prediction among high-risk groups.
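The sensitivity, specificity, and variable-influence values reported here can be extracted from a fitted caret model. This short sketch reuses the hypothetical `fit_gbm`, `preds`, and `test_dat` objects from the Methods sketch and is illustrative only.

```r
# Relative influence of each predictor in the final boosted model
# (caret rescales the gbm relative-influence scores to 0-100)
varImp(fit_gbm)

# Sensitivity and specificity of the holdout classifications
cm <- confusionMatrix(preds, test_dat$hospitalized, positive = "yes")
cm$byClass[c("Sensitivity", "Specificity")]
```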

Fig 2. Results for SISA dataset.

Accuracy (blue), Cohen’s kappa (red), and AUC (green) were calculated for the repeated 10-fold cross validation (left) and the holdout test dataset (right) for prediction of hospitalization status in clinically diagnosed dengue, chikungunya or Zika virus infections. bag = bagged trees, knn = k nearest neighbors, rf = random forest, gbm = generalized boosting models, enet = elastic net, nnet = neural networks, log = logistic regression.

https://doi.org/10.1371/journal.pntd.0007969.g002

Fig 3. Calibration plot for SISA prediction.

For the final SISA algorithm (generalized boosting model), the predicted probability of hospitalization is compared to the proportion of actual hospitalization among deciles of subjects in the holdout test set. Perfect predictive ability is represented by the dashed diagonal line.

https://doi.org/10.1371/journal.pntd.0007969.g003

Results for SISAL, where laboratory parameters were included along with symptoms and demographics, are shown in Fig 4. All models except neural networks and k nearest neighbors performed well on the test set (accuracy: 64.3–92.6%, Cohen’s kappa: 0.25–0.85, AUC: 0.62–0.94). The elastic net algorithm had the best final AUC (0.94) on the test dataset and was the third-best algorithm in the training set (model details available in S2 Table). The sensitivity for SISAL was 100% and the specificity was 88.2% when predicting hospitalization of subjects in the test dataset. The variables with the greatest influence on the final SISAL model were drowsiness, orbital pain, and platelet count. The calibration plot for this prediction is in Fig 5; the SISAL model shows under-prediction of hospitalization risk among low-risk groups and over-prediction among high-risk groups.

Fig 4. Results for SISAL dataset.

Accuracy (blue), Cohen’s kappa (red), and AUC (green) were calculated for the repeated 10-fold cross validation (left) and the holdout test dataset (right) for prediction of hospitalization status in clinically diagnosed dengue, chikungunya or Zika virus infections. bag = bagged trees, knn = k nearest neighbors, rf = random forest, gbm = generalized boosting models, enet = elastic net, nnet = neural networks, log = logistic regression.

https://doi.org/10.1371/journal.pntd.0007969.g004

Fig 5. Calibration plot for SISAL prediction.

For the final SISAL algorithm (elastic net regression), the predicted probability of hospitalization is compared to the proportion of actual hospitalization among quintiles of subjects in the holdout test set. Perfect predictive ability is represented by the dashed diagonal line.

https://doi.org/10.1371/journal.pntd.0007969.g005

The results for SISA when trained with the SISAL subjects (without laboratory data) are shown in S2 Fig. All models except k nearest neighbors and logistic regression performed well (test set accuracy: 53.5–92.9%, Cohen’s kappa: 0.04–0.86, AUC: 0.51–0.94). The bagged trees, random forest, generalized boosting model, and elastic net algorithms had identical final AUC values (0.94). The sensitivity was 100% and the specificity was 88.2% when predicting hospitalization of subjects in the test dataset. If the SISA and SISAL subjects were exchangeable, we would expect the SISAL subject group (without laboratory data) to produce the same results as the SISA analysis. Because these results differ from those obtained in the SISA analysis, we conclude that the SISAL subjects are not exchangeable with the SISA subjects.

Discussion

Suspected arboviral infections impose large health and financial burdens on populations in which the diseases are endemic. In 2013, the estimated global cost of dengue illness was US $8–9 billion [22]. In many arbovirus endemic regions, DENV, CHIKV, and ZIKV infections are diagnosed based on clinical presentation and basic laboratory results, which can be difficult due to nonspecific symptoms and limited availability of definitive diagnostic tools [56]. In this study, we demonstrate that our machine learning algorithms were able to predict hospitalization status among our cohort of subjects with suspected arboviral illness with up to 96% accuracy using only symptom and demographic data. We thus describe the early development of a new tool, SISA/SISAL, which in the future may be utilized by clinicians in resource-limited settings when triaging subjects with suspected arboviral illness.

The final SISA model used the generalized boosting model. These models are also called stochastic gradient boosting or gradient boosting machines and were developed by Jerome Friedman [57,58]. This ensemble-type model is built from a sequentially grown series of simple classification trees, and its final predictions are based on the collective ensemble of trees, with some trees weighted more heavily than others [59]. Generalized boosting models are particularly adept at handling hard-to-predict observations; the “boosting” component forces the model to improve these predictions (i.e. reduce prediction error) by building additional trees that focus on previously misclassified observations. The final SISA model makes predictions from a weighted set of 150 weak (single-split) trees and included information from 23 of the original 28 predictors (all except history of hypertension, history of asthma, history of diabetes, and history of dengue), with drowsiness, bleeding, vomiting, and temperature providing most of the predictive information (i.e. highest variable influence). Calibration plots showed that the SISA model under-predicted hospitalization risk among low-risk groups, and over-predicted hospitalization among high-risk groups. Because the holdout test set for SISA was relatively small (n = 79), it is unclear whether these prediction trends would hold in a larger validation set of subjects.
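For readers unfamiliar with boosted stumps, the sketch below shows how an ensemble of this shape can be fit directly with the gbm package. The 150 trees and depth of 1 follow the description above, while the data frame `sisa_train`, the 0/1 outcome `hosp01`, the holdout frame `sisa_test`, and the learning rate are illustrative assumptions rather than values taken from the study.

```r
library(gbm)

# Hypothetical: `sisa_train` holds the SISA predictors plus a 0/1 outcome
# `hosp01`, and `sisa_test` holds new subjects. interaction.depth = 1 gives
# single-split trees ("stumps"); 150 trees and bernoulli loss suit a binary
# hospitalization outcome.
fit <- gbm(hosp01 ~ ., data = sisa_train, distribution = "bernoulli",
           n.trees = 150, interaction.depth = 1, shrinkage = 0.1)

# Relative influence of each predictor across the ensemble
summary(fit)

# Predicted probability of hospitalization for new subjects
predict(fit, newdata = sisa_test, n.trees = 150, type = "response")
```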

The final SISAL model was an elastic net regression. This is a type of penalized regression that applies penalty terms (controlled by the parameters alpha and lambda) to the regression coefficients, resulting in some coefficients being set to zero (i.e. eliminated) and others being “shrunk” (i.e. reduced in magnitude), particularly coefficients from highly correlated predictors [60]. The final SISAL model was an elastic net regression with an alpha of 0.5 and a lambda of 0.25 and included information from three of the original 33 predictors (drowsiness, retro-orbital pain, and platelet count). The SISAL model showed the same under- and over-prediction trends as SISA, and like SISA, the holdout test set was small (n = 28). The prediction trends of SISAL should be assessed with a larger validation set to determine if there are prediction weaknesses for the model among specific patient groups.
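Similarly, a penalized fit of this kind can be sketched with the glmnet package. The alpha of 0.5 and lambda of 0.25 follow the text, but the predictor matrix `x`, the 0/1 outcome `y`, and `x_new` are placeholders, and glmnet's parameterization is used here as an approximation rather than the study's exact code.

```r
library(glmnet)

# Hypothetical inputs: `x` is a numeric matrix of SISAL predictors (one
# column per predictor) and `y` is a 0/1 hospitalization outcome.
fit <- glmnet(x, y, family = "binomial", alpha = 0.5, lambda = 0.25)

# Coefficients: most are shrunk exactly to zero, leaving a sparse model
# (the final SISAL model retained drowsiness, retro-orbital pain, and
# platelet count)
coef(fit)

# Predicted probability of hospitalization for new subjects in `x_new`
predict(fit, newx = x_new, type = "response")
```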

In our cohort, we found that hospitalized cases had statistically significant, though clinically insignificant, elevations in temperature at presentation in both SISA and SISAL. This demonstrates an algorithm's ability to make use of small differences in the data. In the SISA dataset, mucosal bleeding, vomiting, and abdominal pain were more common in hospitalized subjects than in outpatients. In the SISAL dataset, while hospitalized subjects experienced more mucosal bleeding and vomiting than outpatients, the presence of abdominal pain did not differ between groups. This could suggest that the outpatient subjects who were sent for laboratory testing represented cases of serious concern, as abdominal pain may qualify those with suspected or confirmed dengue for hospital admission [15,16]. For SISAL, hospitalized subjects had lower hematocrit and platelet counts than non-hospitalized subjects; lower platelet counts are to be expected in hospitalized dengue cases.

Our sensitivity analysis revealed that SISA produced different results when its training/testing dataset was restricted to those subjects with laboratory data available. This result is unsurprising, as we expect that selection bias contributed to the subjects available for the SISA and SISAL datasets. All hospitalized subjects had laboratory data available, and we would additionally expect that subjects with laboratory data had some signs or symptoms that prompted the attending physician to request laboratory diagnostics, setting them apart from subjects without laboratory data. These signs and symptoms are also likely linked to whether subjects were eventually hospitalized, meaning these groups of subjects are not directly comparable to one another. When we used the SISA approach (symptoms and demographics only) for a dataset composed of the SISAL group of subjects (without laboratory data), we found that the AUC was identical to the AUC from the SISAL approach. This suggests that we were unable to improve our prediction of hospitalization status by using subject laboratory data, though a study implementing laboratory tests in a general population could potentially find that they provide an added benefit for predicting hospitalization status. In our dataset, the AUC was higher for the SISAL group of subjects, but this improvement is likely due to fundamental differences between the SISA and SISAL groups of subjects. These patient groups should continue to be analyzed with separate algorithms.

This is the first use of machine learning to predict hospitalization status of subjects with clinically diagnosed arboviral infections. Our models exhibit high accuracy, sensitivity, and specificity in a region with a high burden of co-circulating DENV, CHIKV, and ZIKV. These algorithms, particularly SISA, use information that could easily be obtained in resource-limited settings, suggesting the potential to develop a useful tool for clinicians. Our models' accuracy is consistent with tools previously reported in the literature. Past predictive modeling of disease with a machine learning approach has been efficacious in the diagnosis of pneumonia (95% sensitivity, 96.5% specificity), dengue (70% sensitivity, 80% specificity), hepatitis (96% accuracy), and tuberculosis (95% accuracy) using clinical and laboratory parameters [27,61–63].

There has been criticism regarding the use of machine learning in prediction models. A recent systematic review found that machine learning predictions had no advantage over logistic regression predictions, on average [44]. Christodoulou et al. do an excellent job of outlining common missteps in the use of machine learning for prediction and the somewhat alarming lack of transparency in many published machine learning prediction models. We agree with many of the assertions made by the authors and strive to improve reporting and validation in our own work accordingly. However, in this specific study, we did not find that logistic regression performed better than other algorithms. Our overall approach differs from that of most machine learning papers in that we did not assume that one particular algorithm would have superior prediction abilities for our data. We rigorously compared multiple algorithms with the goal of finding an algorithm that functions well with our predictors and outcome of interest, to be further validated with a new dataset in future research. We have no illusions about the potential lack of generalizability of our data and caution against any strong conclusions about the future utility of SISA/SISAL in predicting hospitalization status for future patients. In the current study, we present preliminary yet promising results in the development of a future tool that will need additional, rigorous validation using new sets of subject data before use in the real world.

Numerous studies have looked at clinical and laboratory findings specific to certain arbovirus diagnoses, yet few have proposed tools that can aid in the management of unconfirmed febrile illness [64–67]. A study in Puerto Rico of acute febrile illness emergency room cases found the tourniquet test and leukopenia to be predictive of dengue diagnosis, yet dengue was confirmed in only 11% of their total 284 cases [68]. In Thailand, fever, a positive tourniquet test, and leukopenia differentiated confirmed dengue from other febrile illnesses initially suspected as dengue [69]. Also in Thailand, among a sample of 172 children with acute fever without obvious cause, those with dengue had several laboratory parameters that differentiated them from those with other febrile illnesses [56]. While these studies were able to distinguish dengue from other acute febrile illnesses, they highlight the large proportion of cases that do not receive a confirmed diagnosis, and most studies have not moved beyond initial reports to demonstrate predictive abilities. With SISA/SISAL, the approach is more empirical. Clinical diagnosis of DENV, CHIKV, or ZIKV infection was the starting point for the machine learning used here. Given that timely laboratory diagnostics may not be available, grouping these suspected subjects reflects the reality that physicians face in the clinic in arbovirus-endemic regions. That such a model can accurately predict hospitalization outcome suggests that SISA/SISAL could be expanded to undifferentiated febrile illness. The ability of machine learning models to predict hospital admission outcomes using only emergency department triage data lends support to expanding our approach to undifferentiated fever [34,35]. Of the suspected arboviral cases analyzed here, approximately 54% were confirmed as acute or recent DENV infection, 17% had acute CHIKV infection, and 29% were negative for DENV, CHIKV, or ZIKV (based on analysis of subjects in 2014 and 2015) [1]. Results of the 2016 and 2017 subject samples are pending, but preliminary PCR testing suggests a predominance of CHIKV in 2016 and ZIKV in 2017.

Clinicians rely on tools to help make decisions about patient management, and simple tools can benefit physicians in limited-resource settings [70,71]. Smartphones are commonly used in Ecuador, and mobile health tools are a practical option for physicians, with several popular apps, such as MDCalc, offering various triage rules and scores [72]. After further development and validation of our algorithmic approach, and evaluation of its potential benefit in the clinic, we envision its inclusion in a user-friendly mobile application to aid in the decision to hospitalize patients with undifferentiated fever.

Limitations

The variables with the greatest influence on the final SISA model were drowsiness, temperature, and nausea; for the SISAL model they were drowsiness, orbital pain, and platelet count. An important caveat inherent to the nature of machine learning is that the exact weight of each variable in the final prediction model is difficult to assess and interpret, thus we cannot propose a causal relationship or correlation between these variables and our outcome of hospitalization.

The SISA/SISAL models are presented here in the first iteration of their use. They have not yet been validated beyond the current datasets, but the use of holdout data and 10-fold cross-validation provides us with an unbiased estimate of model validity as well as prediction accuracy. An external validation of these algorithms with a new dataset is ongoing, as well as the testing of fewer prediction variables with the eventual goal of an easy-to-use online or mobile app for use in the clinic.

In this study, we used the outcome of subject hospitalization for both prediction models. The sensitivity and specificity of SISA/SISAL rely on the assumption that the subjects in this dataset were correctly hospitalized. It is possible that some subjects were treated as outpatients when they should have been hospitalized, or that some subjects were hospitalized unnecessarily. For subjects who were incorrectly treated as outpatients, it is likely that they would return to a clinic to receive care, as their symptoms would likely drive them to do so. Because our collection of medical records was retrospective, we were able to capture subject hospitalization at any point, even if subjects were initially treated as outpatients. Hospital Teófilo Dávila is the reference MoH hospital in the province, and it is unlikely that these subjects would have sought care at a hospital elsewhere. It is possible that some subjects were hospitalized unnecessarily; we have no way of identifying these subjects or truly knowing whether it was safe for them to have been treated as outpatients. As a result, our algorithms could recommend hospitalization unnecessarily. Although hospitalization could place undue financial burden on some patients and the health system, failure to hospitalize a serious case could result in grave consequences, and we prefer to take a cautious approach in hospitalization decision-making. Moreover, these algorithms are merely intended as a tool to inform clinical judgement, not to replace important clinical triage decisions [73].

The time period during which our data were collected (2014–2017) included the emergence of two important new arboviruses—CHIKV and ZIKV. The MoH provided training to its personnel (including those working at the hospital and clinics in this study) to identify and diagnose these patients. For patients, the potential severity of these infections and their novelty may have increased the number of patients willing to be hospitalized or to seek healthcare in the first place. With ZIKV infection, physicians may have been more likely to hospitalize pregnant women. This may limit the generalizability of SISA/SISAL in future subject datasets, though as viral diseases continue to emerge globally, it is important to test the ability of decision-making tools to function under these dynamic scenarios. For new diseases with clear warning signs for potentially severe disease, we would expect SISA/SISAL to work well.

Conclusions

Clinicians in resource-limited settings commonly encounter subjects with a suspected diagnosis of DENV, CHIKV, or ZIKV infection and often have limited tools at their disposal. A subject may be unable or unwilling to provide a laboratory specimen, and diagnostic testing may not always be available. The SISA/SISAL models are promising clinical tools, given the high sensitivity and specificity for both models. Machine learning, if used thoughtfully, can be a powerful method for building such prediction models, making the best use of real-world available clinical data.

Supporting information

S1 Table. Classification algorithms used for prediction.

Predictors and outcomes are the actual data that are put into the model; for this manuscript the predictors are the variables from each subject and the outcome is whether the subject was hospitalized or not. The final prediction is determined by each algorithm, i.e. the algorithm predicts whether the subject was hospitalized based on the predictor variable values. These final predictions are compared to the true outcome to determine how well each algorithm performed.

https://doi.org/10.1371/journal.pntd.0007969.s001

(DOCX)

S2 Table. Final SISAL model.

The final SISAL model was an elastic net regression model with an alpha value of 0.5, a lambda value of 0.25, and three coefficients (all other coefficients were reduced to zero).

https://doi.org/10.1371/journal.pntd.0007969.s002

(DOCX)

S1 Fig. Flow chart of approach.

This chart shows the algorithm development, training and testing processes and the flow of data, using an example algorithm with no tuning parameters with the SISA dataset. Repeated 10-fold cross-validation is used for algorithm development to produce an estimate of the final model performance (Mean CV-AUC). The final performance for the algorithm is calculated from the holdout test data. This process was repeated for each algorithm.

https://doi.org/10.1371/journal.pntd.0007969.s003

(DOCX)

S2 Fig. SISA analysis of SISAL dataset.

Accuracy (blue), Cohen’s kappa (red), and AUC (green) were calculated for the repeated 10-fold cross validation (left) and the holdout test dataset (right) for prediction of hospitalization status in clinically diagnosed DENV, CHIKV or ZIKV infections. bag = bagged trees, knn = k nearest neighbors, rf = random forest, gbm = generalized boosting models, enet = elastic net, nnet = neural networks, log = logistic regression, DENV = dengue virus, CHIKV = chikungunya virus, ZIKV = Zika virus

https://doi.org/10.1371/journal.pntd.0007969.s004

(DOCX)

Acknowledgments

Many thanks to the Ministry of Health of Ecuador and SUNY Upstate’s Institute for Global Health and Translational Science, as well as the Upstate team in Machala.

References

  1. Stewart-Ibarra AM, Ryan SJ, Kenneson A, King CA, Abbott M, Barbachano-Guerrero A, et al. The Burden of Dengue Fever and Chikungunya in Southern Coastal Ecuador: Epidemiology, Clinical Presentation, and Phylogenetics from the First Two Years of a Prospective Study. Am J Trop Med Hyg. 2018;98: 1444–1459. pmid:29512482
  2. Staples JE, Fischer M. Chikungunya Virus in the Americas—What a Vectorborne Pathogen Can Do. N Engl J Med. 2014;371: 887–889. pmid:25184860
  3. Zanluca C, de Melo VCA, Mosimann ALP, dos Santos GIV, dos Santos CND, Luz K. First report of autochthonous transmission of Zika virus in Brazil. Mem Inst Oswaldo Cruz. 2015;110: 569–572. pmid:26061233
  4. Stanaway JD, Shepard DS, Undurraga EA, Halasa YA, Coffeng LE, Brady OJ, et al. The global burden of dengue: an analysis from the Global Burden of Disease Study 2013. Lancet Infect Dis. 2016;16: 712–723. pmid:26874619
  5. Ryan SJ, Carlson CJ, Mordecai EA, Johnson LR. Global expansion and redistribution of Aedes-borne virus transmission risk with climate change. PLoS Negl Trop Dis. 2019;13: e0007213. pmid:30921321
  6. Ministerio de Salud Publica, Dirección Nacional de Vigilancia Epidemiológica. Enfermedades Trasmitidas por Vectores 1994–2017. Dirección Nacional de Vigilancia Epidemiológica, Ministerio de Salud Publica. Available: https://public.tableau.com/profile/vvicentee80#!/vizhome/EnfermeddaesTropicales_vectoriales-2014/ANUARIO
  7. Ministerio de Salud Publica, Dirección Nacional de Vigilancia Epidemiológica. Enfermedades Trasmitidas por Vectores SE 52/2018. Available: https://www.salud.gob.ec/wp-content/uploads/2019/01/GACETA-VECTORES-SE-52.pdf
  8. Ministerio de Salud Publica, Dirección Nacional de Vigilancia Epidemiológica. Enfermedades Trasmitidas por Vectores SE 52/2017. Available: https://www.salud.gob.ec/wp-content/uploads/2017/07/Gaceta-Vectorial-SE52.pdf
  9. Farrell DF, Lupone CD, Kenneson A, Cueva C, Heydari N, Barzallo Aguilera JH, et al. Case Report: An Acute Chikungunya Infection and a Recent Secondary Dengue Infection in a Peripartum Case in Ecuador. Am J Trop Med Hyg. 2018;98: 838–840. pmid:29363451
  10. Instituto Nacional de Estadística y Censos. Actividades y Recursos de Salud [Internet]. [cited 28 Mar 2019]. Available: http://www.ecuadorencifras.gob.ec/actividades-y-recursos-de-salud/
  11. WHO. Density of physicians (total number per 1000 population, latest available year) [Internet]. [cited 11 Apr 2019]. Available: https://www.who.int/gho/health_workforce/physicians_density/en/
  12. Agyeman-Duah JNA, Theurer A, Munthali C, Alide N, Neuhann F. Understanding the barriers to setting up a healthcare quality improvement process in resource-limited settings: a situational analysis at the Medical Department of Kamuzu Central Hospital in Lilongwe, Malawi. BMC Health Serv Res. 2014;14: 1. pmid:24382312
  13. Azeredo EL, Dos Santos FB, Barbosa LS, Souza TMA, Badolato-Corrêa J, Sánchez-Arcila JC, et al. Clinical and laboratory profile of Zika and dengue infected patients: lessons learned from the co-circulation of dengue, Zika, and chikungunya in Brazil. PLoS Curr. 2018;10. pmid:29588874
  14. Moreira J, Bressan CS, Brasil P, Siqueira AM. Epidemiology of acute febrile illness in Latin America. Clin Microbiol Infect. 2018;24: 827–35. pmid:29777926
  15. Dengue, Guías de atención para enfermos en la región de las Américas. Ministerio de Salud Pública [Internet]. [cited 3 Apr 2019]. Available: https://www.salud.gob.ec/dengue-guias-de-atencion-para-enfermos-en-la-region-de-las-americas/
  16. WHO. Dengue guidelines for diagnosis, treatment, prevention and control: new edition [Internet]. [cited 8 Jul 2017]. Available: http://www.who.int/rpc/guidelines/9789241547871/en/
  17. Dengue and severe dengue [Internet]. [cited 28 Mar 2019]. Available: https://www.who.int/news-room/fact-sheets/detail/dengue-and-severe-dengue
  18. Gérardin P, Barau G, Michault A, Bintner M, Randrianaivo H, Choker G, et al. Multidisciplinary Prospective Study of Mother-to-Child Chikungunya Virus Infections on the Island of La Réunion. PLOS Med. 2008;5: e60. pmid:18351797
  19. Hoz JM de la, Bayona B, Viloria S, Accini JL, Juan-Vergara HS, Viasus D. Fatal cases of Chikungunya virus infection in Colombia: Diagnostic and treatment challenges. J Clin Virol. 2015;69: 27–29. pmid:26209372
  20. Moore CA, Staples JE, Dobyns WB, Pessoa A, Ventura CV, Fonseca EB da, et al. Characterizing the Pattern of Anomalies in Congenital Zika Syndrome for Pediatric Clinicians. JAMA Pediatr. 2017;171: 288–295. pmid:27812690
  21. Barbeito-Andrés J, Schuler-Faccini L, Garcez PP. Why is congenital Zika syndrome asymmetrically distributed among human populations? PLOS Biol. 2018;16: e2006592. pmid:30142150
  22. Shepard DS, Undurraga EA, Halasa YA, Stanaway JD. The global economic burden of dengue: a systematic analysis. Lancet Infect Dis. 2016;16: 935–941. pmid:27091092
  23. Deo RC. Machine Learning in Medicine. Circulation. 2015;132: 1920–1930. pmid:26572668
  24. Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2nd ed. New York: Springer-Verlag; 2009. Available: https://www.springer.com/us/book/9780387848570
  25. Breiman L. Statistical Modeling: The Two Cultures. Stat Sci. 2001;16: 199–215.
  26. Margolin AA, Bilal E, Huang E, Norman TC, Ottestad L, Mecham BH, et al. Systematic analysis of challenge-driven improvements in molecular prognostic models for breast cancer. Sci Transl Med. 2013;5: 181re1. pmid:23596205
  27. Sa-ngamuang C, Haddawy P, Luvira V, Piyaphanee W, Iamsirithaworn S, Lawpoolsri S. Accuracy of dengue clinical diagnosis with and without NS1 antigen rapid test: Comparison between human and Bayesian network model decision. PLoS Negl Trop Dis. 2018;12. pmid:29912875
  28. Haddawy P, Kasantikul R, Hasan AHMI, Rattanabumrung C, Rungrun P, Suksopee N, et al. Spatiotemporal Bayesian Networks for Malaria Prediction: Case Study of Northern Thailand. Stud Health Technol Inform. 2016;228: 773–777. pmid:27577491
  29. Davi CCM, Pastor A, Oliveira T, Lima Neto FB, Braga-Neto U, Bigham A, et al. Severe Dengue Prognosis Using Human Genome Data and Machine Learning. IEEE Trans Biomed Eng. 2019. pmid:30716030
  30. Khan S, Ullah R, Khan A, Wahab N, Bilal M, Ahmed M. Analysis of dengue infection based on Raman spectroscopy and support vector machine (SVM). Biomed Opt Express. 2016;7: 2249–2256. pmid:27375941
  31. Melo CFOR, Navarro LC, de Oliveira DN, Guerreiro TM, Lima E de O, Delafiori J, et al. A Machine Learning Application Based in Random Forest for Integrating Mass Spectrometry-Based Metabolomic Data: A Simple Screening Method for Patients With Zika Virus. Front Bioeng Biotechnol. 2018;6. pmid:29696139
  32. Nagori A, Dhingra LS, Bhatnagar A, Lodha R, Sethi T. Predicting Hemodynamic Shock from Thermal Images using Machine Learning. Sci Rep. 2019;9: 91. pmid:30643187
  33. Dakappa PH, Prasad K, Rao SB, Bolumbu G, Bhat GK, Mahabala C. A Predictive Model to Classify Undifferentiated Fever Cases Based on Twenty-Four-Hour Continuous Tympanic Temperature Recording. Journal of Healthcare Engineering. 2017 [cited 9 Apr 2019]. pmid:29359037
  34. Goto T, Camargo CA, Faridi MK, Freishtat RJ, Hasegawa K. Machine Learning-Based Prediction of Clinical Outcomes for Children During Emergency Department Triage. JAMA Netw Open. 2019;2: e186937. pmid:30646206
  35. Raita Y, Goto T, Faridi MK, Brown DFM, Camargo CA, Hasegawa K. Emergency department triage prediction of clinical outcomes using machine learning models. Crit Care. 2019;23. pmid:30795786
  36. Harris PA, Taylor R, Thielke R, Payne J, Gonzalez N, Conde JG. Research electronic data capture (REDCap)–A metadata-driven methodology and workflow process for providing translational research informatics support. J Biomed Inform. 2009;42: 377–381.
  37. Boulesteix A-L, Schmid M. Machine learning versus statistical modeling. Biom J. 2014;56: 588–593. pmid:24615669
  38. Peters A. ipred: Improved Predictors. 2017.
  39. Kuhn M. caret: Classification and Regression Training. 2017.
  40. Liaw A, Wiener M. Classification and regression by randomForest. R News. 2002;2(3): 18–22.
  41. Zou H, Hastie T. elasticnet: Elastic-Net for Sparse Estimation and Sparse PCA. 2012.
  42. Ridgeway G. gbm: Generalized Boosted Regression Models. 2017.
  43. Venables W, Ripley B. Modern Applied Statistics with S. 4th ed. New York: Springer-Verlag; 2002.
  44. Christodoulou E, Ma J, Collins GS, Steyerberg EW, Verbakel JY, Van Calster B. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J Clin Epidemiol. 2019;110: 12–22. pmid:30763612
  45. Kuhn M. Building Predictive Models in R Using the caret Package. J Stat Softw. 2008;28: 1–26.
  46. Steyerberg EW, Vickers AJ, Cook NR, Gerds T, Gonen M, Obuchowski N, et al. Assessing the Performance of Prediction Models: A Framework for Traditional and Novel Measures. Epidemiology. 2010;21: 128–138. pmid:20010215
  47. Banerjee M, Capozzoli M, McSweeney L, Sinha D. Beyond kappa: A review of interrater agreement measures. Can J Stat. 1999;27: 3–23.
  48. Steyerberg EW, Vergouwe Y. Towards better clinical prediction models: seven steps for development and an ABCD for validation. Eur Heart J. 2014;35: 1925–31. pmid:24898551
  49. Wickham H, Miller E. haven: Import and Export “SPSS”, “Stata” and “SAS” Files. R package version 1.1.0. 2017.
  50. Wood SN. Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models. J R Stat Soc Ser B Stat Methodol. 2011;73: 3–36.
  51. Wood SN. Thin Plate Regression Splines. J R Stat Soc Ser B. 2003;65: 95–114.
  52. Karatzoglou A, Smola A, Hornik K, Zeileis A. kernlab—An S4 Package for Kernel Methods in R. J Stat Softw. 2004;11: 1–20.
  53. Friedman J, Hastie T, Tibshirani R. Regularization Paths for Generalized Linear Models via Coordinate Descent. J Stat Softw. 2010;33: 1–22. pmid:20808728
  54. Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez J, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. 2011;12: 77. pmid:21414208
  55. Greenland S, Robins JM. Identifiability, exchangeability, and epidemiological confounding. Int J Epidemiol. 1986;15: 413–419. pmid:3771081
  56. Kalayanarooj S, Vaughn DW, Nimmannitya S, Green S, Suntayakorn S, Kunentrasai N, et al. Early clinical and laboratory indicators of acute dengue illness. J Infect Dis. 1997;176: 313–321. pmid:9237695
  57. Friedman JH. Greedy function approximation: a gradient boosting machine. Ann Stat. 2001;29: 1189–1232.
  58. Friedman JH. Stochastic gradient boosting. Comput Stat Data Anal. 2002;38: 367–378.
  59. Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2nd ed. New York: Springer-Verlag; 2009.
  60. Hastie T, Tibshirani R, Wainwright M. Statistical Learning with Sparsity: The Lasso and Generalizations. Chapman & Hall/CRC; 2015.
  61. Aronsky D, Haug PJ. Diagnosing community-acquired pneumonia with a Bayesian network. Proc AMIA Symp. 1998; 632–636. pmid:9929296
  62. Sartakhti JS, Zangooei MH, Mozafari K. Hepatitis disease diagnosis using a novel hybrid method based on support vector machine and simulated annealing (SVM-SA). Comput Methods Programs Biomed. 2012;108: 570–579. pmid:21968203
  63. Er O, Temurtas F, Tanrikulu AÇ. Tuberculosis Disease Diagnosis Using Artificial Neural Networks. J Med Syst. 2008;34: 299–302. pmid:20503614
  64. Lee VJ, Chow A, Zheng X, Carrasco LR, Cook AR, Lye DC, et al. Simple Clinical and Laboratory Predictors of Chikungunya versus Dengue Infections in Adults. PLoS Negl Trop Dis. 2012;6. pmid:23029573
  65. Lee I-K, Liu J-W, Yang KD. Clinical and laboratory characteristics and risk factors for fatality in elderly patients with dengue hemorrhagic fever. Am J Trop Med Hyg. 2008;79: 149–153. pmid:18689614
  66. Thein T-L, Leo Y-S, Fisher DA, Low JG, Oh HML, Gan VC, et al. Risk Factors for Fatality among Confirmed Adult Dengue Inpatients in Singapore: A Matched Case-Control Study. PLOS ONE. 2013;8: e81060. pmid:24278377
  67. Lee I-K, Liu J-W, Yang KD. Clinical characteristics, risk factors, and outcomes in adults experiencing dengue hemorrhagic fever complicated with acute renal failure. Am J Trop Med Hyg. 2009;80: 651–655. pmid:19346394
  68. Gregory CJ, Lorenzi OD, Colón L, Sepúlveda García A, Santiago LM, Cruz Rivera R, et al. Utility of the Tourniquet Test and the White Blood Cell Count to Differentiate Dengue among Acute Febrile Illnesses in the Emergency Room. PLoS Negl Trop Dis. 2011;5. pmid:22163057
  69. Sawasdivorn S, Vibulvattanakit S, Sasavatpakdee M, Iamsirithavorn S. Efficacy of Clinical Diagnosis of Dengue Fever in Paediatric Age Groups as Determined by WHO Case Definition 1997 in Thailand. 2001 [cited 11 Oct 2018]. Available: http://apps.who.int/iris/handle/10665/163627
  70. Boillat-Blanco N, Mbarack Z, Samaka J, Mlaganile T, Mamin A, Genton B, et al. Prognostic value of quickSOFA as a predictor of 28-day mortality among febrile adult patients presenting to emergency departments in Dar es Salaam, Tanzania. PLoS ONE. 2018;13. pmid:29902174
  71. Seymour CW, Liu VX, Iwashyna TJ, Brunkhorst FM, Rea TD, Scherag A, et al. Assessment of Clinical Criteria for Sepsis. JAMA. 2016;315: 762–774. pmid:26903335
  72. MDCalc—Medical calculators, equations, algorithms, and scores [Internet]. [cited 11 Apr 2019]. Available: https://www.mdcalc.com/
  73. Cabitza F, Rasoini R, Gensini GF. Unintended Consequences of Machine Learning in Medicine. JAMA. 2017;318: 517–518. pmid:28727867