The emergence of artemisinin-resistant Plasmodium falciparum malaria is a major threat to malaria elimination. New tools for supporting the surveillance of artemisinin resistance are critical for current and future malaria control and elimination strategies. We have developed an open-access, user-friendly, web-based tool to analyse parasite clearance half-life data of P. falciparum infected patients after treatment with artemisinin derivatives, so that resistance to artemisinin can be identified. The tool can be accessed at bit.ly/id_artemisinin_resistance.
Citation: Tun STT, Lubell Y, Dondorp AM, Fieldman T, Tun KM, Celhay O, et al. (2017) Identifying artemisinin resistance from parasite clearance half-life data with a simple Shiny web application. PLoS ONE 12(5): e0177840. https://doi.org/10.1371/journal.pone.0177840
Editor: Michelle Louise Gatton, Quensland University of Technology, AUSTRALIA
Received: November 11, 2016; Accepted: May 3, 2017; Published: May 22, 2017
Copyright: © 2017 Tun et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The web application can be accessed from this link: bit.ly/id_artemisinin_resistance (full web address: https://malaria.shinyapps.io/id_artemisinin_resistance/). The source code for the web application can be found at bit.ly/White-2015-shiny-code (full web address: https://github.com/SaiTheinThanTun/mixmodelpaperfinal).
Funding: This work was funded by Bill and Melinda Gates Foundation Investment number OPP1110500 awarded to LJW. It was also a part of the Wellcome-Trust Major Overseas Programme in SE Asia (grant number 106698/Z/14/Z). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
The prospect of malaria elimination is under threat due to the emergence of parasites resistant to artemisinin derivatives, the most efficacious drugs against Plasmodium falciparum malaria . The therapeutic efficacy of artemisinin-based combination therapy (ACT) has to be monitored every two years in order to respond to resistance . With improving methods for genotyping, such as the K13 molecular marker, the working definition of artemisinin resistance has been gradually adapted over the past few years . In the latest WHO recommendations , the cut-off value of ≥ 10% of patients with a half-life of parasite clearance slope ≥ 5 hours after treatment with ACT or artesunate monotherapy, is used as one of the definitions of “suspected endemic artemisinin resistance”. White et al  developed a model to analyse data on parasite half-life as distributions of artemisinin-sensitive and artemisinin-resistant populations, which could help in the surveillance of artemisinin resistance. The model has been validated using parasite half-life data from studies on the Thai-Myanmar border and in Western Cambodia.
The source code of the model, developed in the R programming language by White et al., is available on Github . Interested parties have been using the model to analyse their own parasite half-life data. Until now, this has been limited to modellers proficient in the R programming language. We have developed a user-friendly interface which will make this model available to a wider audience.
Methods and implementation
For a given set of half-life measurements, mixture models can be used to detect and describe subgroups of individuals or subpopulations, if there are any, without having any information on the membership of each individual observation . White et al.  used this mixture modelling approach to estimate the geometric mean, standard deviation, and the relative contributions of each subpopulation, given the observed population’s half-life data. Mixture models with a range of one to five components were fitted to the data, and the most suitable model was chosen. By applying this method to parasite clearance half-life data from the Thai-Myanmar border and Western Cambodia, two subpopulations, which represent artemisinin sensitive and artemisinin resistant populations, were detected. This method was also validated by comparison with the parasite half-life distributions of clonal parasite infections. For details of this method, please refer to the original paper .
The source code of the model, written in the R programming language , was provided in the original paper . We extracted the specific code for the mixture models with log-normal distribution. Shiny , an R package used to build interactive web applications directly from R scripts, was utilized to modify the code to produce an interactive user interface for the mixture model from White et al. The other R packages used were mixtools , pROC , MASS , shinythemes , and shinydashboard . The workflow of the web application is described in Fig 1.
There are three main sections on the user interface of the web application. The Introduction describes why this tool should be used. It also provides two hypothetical examples which users can toggle by clicking on the respective buttons. In the “Simulation” section, users may change the parameters to generate the multimodal composite distributions themselves. This will help users to understand more about the model and the multimodal distributions. Part of the user interface can be seen in Fig 2.
In the "Use your data" section, a simulated dataset can be downloaded as an example of the format of the data which is required for running the model. This dataset can also serve as a template onto which users can overwrite (copy and paste) their own half-life data. The input dataset must be in comma-separated values (CSV) file format, with a single column of half-life clearance data and without column and row names. Once the users have their half-life data in the correct format, it can be uploaded with the "Upload File" button. The model will process the data in the background, which will take from a few seconds to a few minutes depending on the size of the data input. For example, the simulated dataset we have provided has 388 rows of data and it takes approximately 20 seconds to process. Users will be notified once the process is completed or if the input dataset is not in a correct format. Fig 3 shows the “Use your data” section with the result of the model from processing the simulated dataset. There are two download buttons at the bottom of the page: one to download the graph and another one to download the results as a table in CSV file format. If the model estimates two subpopulations, where the one with shorter half-lives represents the sensitive population and the other with longer half-lives represents the resistant population, then a Receiver Operating Characteristic (ROC) curve will be plotted. From this, it can be discerned how well a particular cut-off value of half-life performs in detecting resistance, given the assumption that each unimodal distribution represents infections with a subpopulation of parasites.
We host the web application on the Shiny server at this web address (bit.ly/id_artemisinin_resistance) and the corresponding source code at (bit.ly/White-2015-shiny-code). The web application was tested and found to be working on the following internet browsers: Mozilla Firefox (v48.0.2), Google Chrome (v53.0), Internet Explorer (v11.0), Safari (v9.0.1), Opera (v39.0), Maxthon (v4.9.3), and the mobile version of Google Chrome (v52.0). The only requirement to use this web application is access to the internet.
Results and discussion
We developed an open-access interactive web application based on the model by White et al , which can be found at this address: bit.ly/id_artemisinin_resistance. It can analyse parasite clearance half-life data as distributions of drug-sensitive and drug-resistant populations. This is a potentially useful tool for the surveillance of artemisinin resistance, and could be used alongside other methods such as the identification of molecular markers. It can also be used to measure the probability of a parasite strain being artemisinin resistant for an infection with a particular parasite clearance half-life.
We also demonstrated how a complicated model could be adapted into a user-friendly web application. Since an internet connection is the only requirement, there is significant potential for such modelling tools to be used by and presented to a wider audience.
All of the assumptions from the original paper  are applied here. The clearance half-lives of infections with a particular sensitivity are assumed to follow unimodal distributions of log-normal type. The model’s accuracy will be reduced with an increasing number of true components in the distribution and decreasing sample size. For instance, with a sample size of 1,000, the model will correctly predict 96%, 91%, 70%, 46% and 21% for the input mixture distributions of 1, 2, 3, 4 and 5 components respectively. From a sample size of 50, the model will be able to differentiate between subpopulations of geometric mean half-lives with a difference of 3 or more hours. From a sample size of 1000, the model will be able to differentiate subpopulations whose geometric mean half-lives differ by only 0.5 hours.
The web application will not work if the input data is not in an appropriate format. In that case, it will display an error message. At the time of preparation of this manuscript, the web application has been hosted on a Starter version of Shiny server, which allows 100 hours of active usage per month with premium support. Since the Shiny server is easily scalable, further upgrades can be considered in case of heavier traffic to the web application. We have committed to maintain this web application over the next 5 years, after which time its access may be limited to 25 active hours per month, which is standard for a free Shiny account.
The authors would like to thank Yu Nandar Aung, Naowarat Saralamba and Phetsavanh Chanthavilay for testing out the web-application.
- Conceptualization: STTT YL AMD LJW.
- Data curation: STTT TF OC.
- Formal analysis: STTT LJW.
- Methodology: STTT LJW.
- Software: STTT TF OC.
- Supervision: LJW.
- Validation: STTT KMT XHC SS.
- Visualization: STTT OC SS.
- Writing – original draft: STTT.
- Writing – review & editing: YL TF KMT XHC LJW.
- 1. Dondorp AM, Fairhurst RM, Slutsker L, Macarthur JR, Breman JG, Guerin PJ, et al. The threat of artemisinin-resistant malaria. N Engl J Med. 2011;365(12):1073–5. pmid:21992120.
- 2. World Health Organization. Update on artemisinin and ACT resistance—October 2016. 2016.
- 3. White LJ, Flegg JA, Phyo AP, Wiladpai-ngern JH, Bethell D, Plowe C, et al. Defining the in vivo phenotype of artemisinin-resistant falciparum malaria: a modelling approach. PLoS Med. 2015;12(4):e1001823. pmid:25919029.
- 4. R Core Team. R: A Language and Environment for Statistical Computing 2016 [cited 2016 Sep 14]. https://www.R-project.org/.
- 5. White LJ. PDpheno-mixture [Source code]. 2015 [cited 2016 Aug 30]. https://github.com/lisawhite100/PDpheno-mixture.
- 6. Benaglia T, Chauveau D, Hunter DR., Young DS.. mixtools: An R Package for Analyzing Finite Mixture Models. Journal of Statistical Software. 2009;32(6):1–29.
- 7. Chang W, Cheng J, Allaire J, Xie Y, McPherson J. shiny: Web Application Framework for R 2016 [cited 2016 Sep 14]. version 0.13.2:[R package]. http://CRAN.R-project.org/package=shiny.
- 8. Xavier R, Natacha T, Alexandre H, Natalia T, Frédérique L, Jean-Charles S, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. 2011;12:77. pmid:21414208
- 9. Venables WN, Ripley BD. Modern Applied Statistics with S. Fourth ed. New York: Springer; 2002.
- 10. Chang W. shinythemes: Themes for Shiny 2015 [cited 2016 Sep 14]. version 1.0.1:[R package]. https://CRAN.R-project.org/package=shinythemes.
- 11. Chang W. shinydashboard: Create Dashboards with 'Shiny' 2015 [cited 2016 Sep 14]. version 0.5.1:[R package]. https://CRAN.R-project.org/package=shinydashboard.