Figures
Abstract
Mitigating the threat of insecticide resistance in African malaria vector populations requires comprehensive information about where resistance occurs, to what degree, and how this has changed over time. Estimating these trends is complicated by the sparse, heterogeneous distribution of observations of resistance phenotypes in field populations. We use 6,423 observations of the prevalence of resistance to the most important vector control insecticides to inform a Bayesian geostatistical ensemble modelling approach, generating fine-scale predictive maps of resistance phenotypes in mosquitoes from the Anopheles gambiae complex across Africa. Our models are informed by a suite of 111 predictor variables describing potential drivers of selection for resistance. Our maps show alarming increases in the prevalence of resistance to pyrethroids and DDT across sub-Saharan Africa from 2005 to 2017, with mean mortality following insecticide exposure declining from almost 100% to less than 30% in some areas, as well as substantial spatial variation in resistance trends.
Citation: Hancock PA, Hendriks CJM, Tangena J-A, Gibson H, Hemingway J, Coleman M, et al. (2020) Mapping trends in insecticide resistance phenotypes in African malaria vectors. PLoS Biol 18(6): e3000633. https://doi.org/10.1371/journal.pbio.3000633
Academic Editor: Andrew Fraser Read, The Pennsylvania State University, UNITED STATES
Received: December 16, 2019; Accepted: May 11, 2020; Published: June 25, 2020
Copyright: © 2020 Hancock et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The predictive maps of the mean prevalence of resistance are available to download from Figshare (10.6084/m9.figshare.9912623) and will be available to visualise on the Malaria Atlas Project website (https://map.ox.ac.uk/explorer/#). The susceptibility test data is available to download (https://doi.org/10.1101/582510 [8]). Sets of susceptibility test data and predictor variable data in the form used by the statistical modelling analyses are available from GitHub. Numerical data corresponding to the manuscript figures are available to download from Figshare (10.6084/m9.figshare.9912623). R code for implementing the extreme gradient boosting, random forest, and boosted generalized additive models and the R-INLA geostatistical models is available on GitHub at 10.5281/zenodo.3751786 (Hancock, 2020).
Funding: This work was funded by Wellcome Trust Grant 108440/Z/15/Z (CLM). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Abbreviations: BGAM, boosted generalized additive model; CI, credible interval; GMRF, Gaussian Markov random field; IHS, inverse hyperbolic sine; IRS, indoor residual spraying; ITN, insecticide-treated net; LLIN, long-lasting insecticide-treated net; MAE, mean absolute error; PET, potential evapotranspiration; PIT, probability integral transform; RF, random forest model; RMSE, root mean square error; Vgsc, Voltage-gated sodium channel; XGB, extreme gradient boosting model
Introduction
Insecticide resistance in African malaria vector populations has serious consequences for malaria prevention. Long-lasting insecticide-treated nets (LLINs) have achieved substantial reductions in malaria prevalence thus far in Africa [1], but the number of insecticides currently available for use in LLINs is very limited. Until recently, pyrethroids were the only class approved for use in LLINs, and recently launched new generation nets still use pyrethroids in combination with either an insect growth regulator, a pyrrole, or a synergist that inhibits the primary metabolic mechanism of pyrethroid resistance within mosquitoes [2,3]. A wider range of options is available for indoor residual spraying (IRS), but pyrethroids are less expensive than many alternatives and are still used for IRS in malaria-endemic sub-Saharan African countries [4,5].
Although there is evidence that pyrethroid resistance in African malaria vector populations is increasing [6,7], the wide array of field studies that are available do not provide a spatially comprehensive time series of resistance trends [8]. Quantifying these trends will improve our understanding of the historical spread of resistance and assist in designing insecticide resistance management strategies [9]. Comprehensive spatiotemporal analyses of resistance are also necessary to facilitate its inclusion in epidemiological models of malaria that inform decision-making at national and global levels [9]. Efforts to estimate trends in insecticide resistance are impeded by limitations associated with the available observations of resistance phenotypes in field mosquito populations. Observations from standardized susceptibility tests, which indicate the prevalence of phenotypic resistance in field populations, cover a wide geographic area and span several decades [8,10]. However, the spatial coverage of these data is sparse and heterogeneous, and resistance has rarely been monitored consistently over time, meaning that very few time series are available [9]. Moreover, these susceptibility tests have a large measurement error, and replication is required to robustly estimate resistance phenotypes.
Several studies have reported spatial variation in resistance within a country, but these studies haven’t analysed the spatial trends behind this variation [11,12]. In addition, time series of resistance data have shown changes over time at the location of sentinel sites [13–20]; however, only one study has investigated temporal trends across Africa [7]. This study generated a single trend over time for the whole continent, for each species and insecticide, and did not investigate differences in trends between locations. No previous study has analysed spatial variation across multiple countries or investigated spatiotemporal trends within these regions.
Our capacity to understand and predict insecticide resistance can benefit from considering the variables that may influence selection for resistance. Sources of insecticides in the environment include the application of insecticide-based vector control interventions for public health—such as LLINs and IRS—and the application of agricultural insecticides, which include the same insecticide classes as those used in vector control [21]. LLIN coverage increased markedly across Africa from 2005 in response the global Roll Back Malaria initiative [22, 23], while IRS coverage has been restricted to much smaller areas [4]. Early insecticide-treated nets (ITNs) used permethrin or deltamethrin, whereas α-cypermethrin is now the most commonly used pyrethroid in LLINs. Over the last 20 years, deltamethrin, λ-cyhalothrin, and DDT have been used for IRS, with α-cypermethrin first used in mass campaigns in 2003. Since 2015, deltamethrin has been the only pyrethroid reported to be used in mass IRS campaigns along with DDT and other non-pyrethroid insecticides [4]. Several studies have demonstrated a local increase in insecticide resistance in field mosquito populations following the implementation of LLINs, IRS, or both [19,20,24–26] although in other locations evidence of higher resistance after the introduction of these interventions was not found [24,27]. Associations between agricultural pesticide use and insecticide resistance have also been found [21,28], and there is evidence that pesticide contamination of water bodies is a source of selection pressure for resistance acting on mosquito larvae [29]. Relationships between resistance and drivers of selection will, however, vary geographically depending on population structure [30,31]. Genetic mechanisms of resistance also differ across mosquito species [25,30], and even closely related mosquito species have different ecological niches [32,33], as well as different blood feeding behaviour and preferences, meaning that they are likely to experience differences in insecticide exposure [34].
Pyrethroid resistance has not spread outwards from a single origin, and there are several known resistance mechanisms. The mechanisms that have been identified can be classed into target site mutations, up-regulation of detoxifying enzymes, and cuticular thickening [35–38]. Each class is associated with multiple genetic variants that confer resistance, and there is evidence that at least some of these have arisen independently multiple times across Africa [39]. Once a new variant has arisen, increases in its frequency are driven by selection pressures that differ from place to place, for example, through the spatially varying coverage of pyrethroid use in public health and agriculture [4,22,40]. The spread of these genetic variants to new locations is driven by gene flow that also varies with location [30,41–45]. Here, we consider a species complex, and the “spread” of resistance is further influenced by introgression of resistance genes from one species to another [46–51]. The geospatial patterns in the pyrethroid resistance phenotype that we are studying here are, therefore, the result of a complex combination of multiple origins of multiple genetic variants that have then spread through gene flow and introgression.
There is evidence that some mechanisms of pyrethroid resistance differ in the level of resistance they confer to different compounds; for example, different depletion rates were found across 6 pyrethroid compounds when they were metabolised by 8 Anopheles cytochrome P450 enzymes under controlled conditions [52]. These differences have not been investigated for all mechanisms of resistance, and there is no evidence that the differences seen for individual resistance mechanisms tested in the laboratory translate into diverging patterns of resistance in the field.
To develop predictive models of insecticide resistance in field populations that can represent variable, nonlinear interactions with environmental, biological, and genetic variables, we utilise an ensemble modelling approach. The approach exploits the multifaceted strengths of different modelling methodologies, using machine-learning methods to extract predictive power from a set of covariates and then allowing a Bayesian geostatistical Gaussian process to model the autocorrelated residual variation [53]. Bayesian geostatistical models provide a robust model of residual autocorrelation that can be applied to spatiotemporal data with a heterogeneous sampling distribution [54]. Their application to observations from insecticide susceptibility tests conducted over a range of locations across Africa has previously demonstrated broad-scale associations between resistance to different types of pyrethroids, as well as the organochlorine DDT [55]. The models developed in this study exploit these associations in resistance across different insecticides to improve resistance predictions for individual insecticide types.
Using a database containing the results of standard insecticide susceptibility tests performed on mosquito samples collected throughout Africa [8], we extracted the results of 6,423 tests conducted on samples from the A. gambiae species complex, which are among the most important African malaria vectors. We used this data set in our model ensemble to quantify variation in the prevalence of resistance to pyrethroids and DDT over the period 2005–2017 by developing a series of predictive maps. Our models are informed by a suite of potential explanatory variables describing the coverage of insecticide-based vector control interventions, agriculture and other types of land cover, climate, processes determining the environmental fate of pesticides, and the distribution of the sibling species that make up the A. gambiae complex. Our results show dramatic changes in insecticide resistance phenotypes in malaria vector populations across Africa over a 13-year period and identify variables that were important in shaping these predictions.
Results
Spatiotemporal trends in the prevalence of insecticide resistance
Pyrethroid resistance.
We investigated spatiotemporal trends in the prevalence of phenotypic resistance in the A. gambiae complex to 4 pyrethroids: deltamethrin, permethrin, λ-cyhalothrin, and α-cypermethrin. Due to the lack of observations from central Africa, we partitioned the data into 2 separate spatial regions covering western and eastern parts of the continent and analysed each data subset independently by fitting separate models (Fig 1 and “Methods”). In West Africa, predicted mean prevalence of resistance to all pyrethroids increased dramatically over the period 2005–2017 (Figs 2 and 3 and S1 Fig, S2 Fig and S3 Fig). Predicted mean proportional mortality to deltamethrin was below 0.9 (the WHO threshold for confirmed resistance) across 15% (95% credible interval [CI] 13%–17%) of the west region in 2005, and across 98% (CI 96.6%–98.7%) of the region in 2017 (Fig 3). These changes in resistance were spatially heterogeneous (Fig 2). Increases in resistance to deltamethrin over the period—in terms of the reductions in the predicted mean proportional mortality—were greatest in northern Liberia (Fig 2D, line A); central Cote d’Ivoire (Fig 2D, line B); the area surrounding the border between Burkina Faso, Cote d’Ivoire, and Ghana (Fig 2D, line C); southern Ghana (Fig 2D, line D); and northern Gabon (Fig 2D, line E). In these regions, resistance to deltamethrin in 2017 was particularly high (with a mean proportional mortality below 0.3 (CI < 0.4).
(A) The number of susceptibility test observations for each year and insecticide type. Numerical values are provided in S1 Data (10.6084/m9.figshare.9912623). (B) The sampling locations of the susceptibility test observations in panel A.
(A) 2005, (B) 2010, (C) 2015, and (D) 2017. See 10.6084/m9.figshare.9912623.
The proportion of the area with a predicted mean mortality to deltamethrin of less than 0.9, for the west region (red line) and the east region (blue line). Red and blue shaded areas indicate the 95% CI of the predicted proportion of pixels for the west and east regions, respectively. Numerical values are provided in S2 Data (10.6084/m9.figshare.9912623). CI, credible interval.
In East Africa, the prevalence of pyrethroid resistance also increased over the period 2005–2017, albeit at a lesser rate than that in the west region (Figs 2 and 3). Predicted mean proportional mortality to deltamethrin was below 0.9 across 9% (CI 3%–17%) of the east region in 2005 and across 45% (CI 38%–51%) of the region in 2017 (Fig 3). The greatest increases in pyrethroid resistance over the period occurred in the northern part of the region, in the area from central Ethiopia (Fig 2D, line F) westward across most of South Sudan (Fig 2D, line G), and extending into southern Sudan (Fig 2D, line H) and northern Uganda (Fig 2D, line I). Across most of this area, mean mortality to deltamethrin in 2017 was below 0.5 (CI < 0.75). Resistance to deltamethrin increased to a lesser extent in central and southern Uganda, western Kenya, eastern Ethiopia, and coastal Tanzania, with predicted mean mortalities of between 0.6 and 0.8 in these areas in 2017. In areas farther south, differences in predicted resistance over the time period were relatively slight, with mean mortalities changing by less than 0.15 from 2005 to 2017 within Malawi, Mozambique, Zimbabwe, and those parts of Zambia, Botswana, and South Africa that were included in the model. Similar spatiotemporal trends across the west and east regions occurred in predicted mean resistance to permethrin, λ-cyhalothrin, and α-cypermethrin (S1 Fig, S2 Fig and S3 Fig).
DDT resistance.
Predicted mean resistance to DDT at the start of the period (in 2005) was more widespread in comparison to pyrethroid resistance and also increased throughout the region from 2005 to 2017 (Figs 4 and 5). In the west region, predicted mean proportional mortality to DDT was below 0.9 across 53% (CI 47%–59%) of the west region in 2005, and across 97% (CI 92.7%–99%) of the region in 2017 (Fig 5). Increases in resistance to DDT over the period were greatest in the area surrounding the border between Liberia and Guinea (Fig 4D, line A), southern Mali (Fig 4D, line B), and central Burkina Faso (Fig 4D, line C). The east region showed a weaker increase in predicted mean resistance to DDT over the period 2005–2017 in comparison to that occurring in the west region. Predicted mean proportional mortality was below 0.9 across 32% (CI 21%–44%) of the east region in 2005, and across 45% (CI 39%–51%) in 2017. Increases in DDT resistance over the period were greatest in central South Sudan (Fig 4D, line D).
(A) 2005, (B) 2010, (C) 2015, and (D) 2017. See 10.6084/m9.figshare.9912623.
The proportion of the area with a predicted mean mortality to DDT of less than 0.9, for the west region (red line) and the east region (blue line). Red and blue shaded areas indicate the 95% CI of the predicted proportion of pixels for the west and east regions, respectively. Numerical values are provided in S2 Data (10.6084/m9.figshare.9912623). CI, credible interval.
Assessing prediction accuracy
We performed 10-fold out-of-sample validation on the model ensemble (re-running the model 10 times, withholding a different 10% portion of the data each time; see “Methods” and paper by Gelman and colleagues [56]) to assess the accuracy of predicted mean prevalence of resistance. Across all bioassay observations for pyrethroid insecticides, we obtained a root mean square error (RMSE) [57] of 0.179 (mean absolute error [MAE] = 0.127; S2 Table) across all the (out-of-sample) predictions of mean proportional mortality for the west and east regions combined (S5 Fig). Across all DDT bioassay observations, the corresponding out-of-sample RMSE was 0.167 (MAE = 0.111; S3 Table). We note that susceptibility tests have a high measurement error, which is reflected by the high values of the data noise parameter estimated by the fitted model [58] (S1 Table) and that the model aims to distinguish spatiotemporal variation in mean resistance that is independent of data noise.
The individual model constituents of our ensemble included 3 machine-learning models: an extreme gradient boosting model (XGB), a random forest model (RF), and a boosted generalized additive model (BGAM). We confirmed that our model ensemble showed improved predictive performance relative to each of these constituent models, as expected (S2 Table and S3 Table). Of the 3 machine-learning models, XGB had the lowest out-of-sample prediction error followed by RF and then BGAM. The fitted mean model weights given by the ensemble model were higher for models with lower out-of-sample prediction error, as expected (S4 Table).
In order to quantify prediction uncertainty across space and time, we produced maps of the 95% CIs of the posterior distributions of predicted mean mortality (Fig 6 and S7 Fig). We again performed 10-fold out-of-sample validation to assess the accuracy of the predicted CIs and found the coverage of the CIs to be accurate when the measurement error associated with the data—estimated by the fitted model [58]—was accounted for (S6 Fig). Prediction error is heterogeneous across space and time, with the 95% CIs of predicted mean mortality being higher in the east compared with the west region and with particularly high CIs in the northwestern part of the east region. This reflects the more sparse distribution of bioassay sampling locations in the east region, particularly in South Sudan and much of southern Sudan (Fig 1).
See 10.6084/m9.figshare.9912623. CI, credible interval.
Regional and local resistance trends.
National temporal trends in the predicted prevalence of resistance are qualitatively similar to those occurring in the broader east and west regions, particularly for countries in the west region, which all show a monotonic increase over time in the proportional area with a mean mortality to deltamethrin of <0.9 (Fig 7). Predicted temporal trends are more variable across the countries in the east region, with trends in Kenya, Malawi, Rwanda, and Tanzania suggesting either a plateau or a decline in resistance levels towards the end of the time period (Fig 7). Moreover, predicted temporal trends across point locations show greater variability than regional or national trends, and nonmonotonicity in point trends is common, with declines in resistance occurring following earlier increases (S8 Fig and S9 Fig). Attenuations and declines in resistance may reflect fitness costs of resistance, or they may also arise due to shifts in the composition of the sibling species that make up the A. gambiae complex, or immigration of mosquitoes from areas where resistance is lower (see “Discussion”).
The proportion of the area with a predicted mean mortality to deltamethrin of less than 0.9 within each country that is fully contained within the west region (red lines) and the east region (blue lines). Red and blue shaded areas indicate the 95% CI of the predicted proportion of pixels for the west and east regions, respectively. Numerical values are provided in S3 Data and S4 Data (10.6084/m9.figshare.9912623). CI, credible interval.
Influential predictor variables
Our models used over 100 potential explanatory variables (see the “Methods” section), and our results show which of these variables were most influential to the predictions of mean prevalence of resistance. We obtained measures of variable importance for each of the 3 constituent machine-learning models (XGB, RF, and BGAM). Variable importance measures describe the influence of a variable on model predictions relative to the other predictor variables, but they can be hard to interpret when predictor variables are correlated (see S1 Text), and they do not identify causal relationships (see “Methods” and “Discussion”). For each model, the importance of each variable is expressed as a fraction of the total importance across all predictor variables. In ranking variable importance, we weighted the importance of each variable given by each model by the model’s weight obtained from the Gaussian process meta-model for pyrethroids (S4 Table). This increasingly weights those variables that were more important to models that performed better and thus made a higher relative contribution to the predictions made by the ensemble (Fig 8). Thus, the variable importance values given by XGB and RF are up-weighted relative to those given by BGAM. The original variable importance values produced by each model are given in S5 Table and S6 Table, and a description of each predictor variable is given in S9 Table.
(A) West Africa; (B) East Africa. Stacked bars show the relative variable importance given by XGB (blue), RF (green), and BGAM (grey), weighted by the fitted weight for each model given by the Gaussian process meta-model (see text). Variables are ranked by the total height of the stacked bars across the 3 models, and the top 20 variables are shown. The original variable importance values produced by each model are given in S5 Table and S6 Table, and definitions of each predictor variable are given in S9 Table. Variable name suffixes (-1), (-2), and (-3) denote time lags of 1, 2, and 3 years, respectively. One, two, and three asterisks denote the first, second, and third principal component, respectively, for variables available on a monthly time step (see “Methods”). BGAM, boosted generalized additive model; IRS, indoor residual spraying; ITN, insecticide-treated net; PET, potential evapotranspiration; RF, random forest model; XGB, extreme gradient boosting model.
For the west region, variables describing the coverage of ITNs had the highest importance value for each of the 3 models. For XGB and RF, the 3-year lag of ITN coverage had the highest importance value. For BGAM, non-lagged ITN coverage had the highest importance value, and the 3-year lag of ITN coverage had the second highest importance value (Fig 8 and S5 Table). Outside the top two, variables describing climate processes and those describing the area of harvested crops are highly ranked (within the top 20 most important variables) for all 3 models (Fig 8 and S5 Table). For the east region, variables describing ITN coverage and rainfall were ranked in the top 10 most important variables for all 3 models (Fig 8 and S6 Table). More broadly, variables describing climate processes were highly ranked by all 3 models. Our ability to quantitatively compare differences in importance across our set of predictor variables is, however, inhibited by differences in the definition of variable importance used in the different machine-learning approaches that we have employed (see “Methods”).
Discussion
Here, we have quantified spatial and temporal trends in insecticide resistance in the A. gambiae species complex in East and West Africa, showing marked increases in the prevalence of resistance to pyrethroids and DDT in recent years, as well as geographic expansion. These results highlight the urgency of identifying and implementing effective resistance management strategies. Our predictive maps of mean prevalence of resistance are available to visualise alongside the latest susceptibility test data on the insecticide resistance mapper website (http://www.irmapper.com) and can guide decisions about resistance management at regional and local levels. In making recommendations, our results will need to be considered in combination with (i) data from resistance monitoring of field samples, including other malaria vector species such as A. funestus; (ii) data on the presence of underlying mechanisms of resistance; and (iii) analyses of the expected impacts of resistance management strategies on malaria prevalence [9,59]. Decision-making frameworks also need to explicitly incorporate predictive uncertainty, which is facilitated by our out-of-sample validation results and our mapped Bayesian CIs. Our predictions are not a substitute for ongoing resistance monitoring requirements but highlight areas with particularly high levels of predictive uncertainty, such as parts of South Sudan, southern Sudan, and the Democratic Republic of Congo (Fig 6D). In these areas, field sampling to measure resistance is the only means of informing resistance management decisions.
Our results show substantial variation in resistance trends between East and West Africa, as well as within these two regions. Interestingly, ITN coverage was identified as a relatively influential predictor in our models, which is consistent with other studies that have found significant, but spatially variable, increases in pyrethroid resistance associated with the introduction of ITNs [24]. However, in several areas of the central and southern parts of East Africa, such as west Tanzania, ITN coverage has been relatively high (>50%) from 2012 to 2017 [22], but predicted pyrethroid resistance in 2017 was relatively low (Fig 2D). This may be influenced by the locations where resistance mechanisms first emerged, patterns of subsequent gene flow (including restricted flow across the Rift Valley) [41,60,61], and differences among the sibling species within the A. gambiae complex [25,30]. For example, the distribution of A. arabiensis extends further than other species in the complex [33], and this species is known to be more plastic in its feeding behaviour, biting outdoors and feeding on cattle [33]. Therefore, it is possible that selection for resistance in this species lags behind other members of the complex [62–64]. Our predictions of the prevalence of resistance are based on susceptibility tests that often do not identify the sibling species composition of the A. gambiae complex sample that was tested. Our analysis only includes test results that are representative of the original sample collected [8, 55], and our predictions cannot directly represent variation in the prevalence of resistance due to variation in the composition of sibling species [33,65]. Routine identification of the composition of sibling species in tested samples—and the provision of species-specific mortality values—would improve the capacity of susceptibility test data to inform prediction of resistance.
The coverage of pyrethroid IRS was not among the most influential predictors in our models, but only a small fraction of the areas that we modelled (<5% of the west region and <15% of the east region) received pyrethroid IRS between 2005 and 2017 [4]. Thus, our results do not imply that IRS is not important in driving the selection of resistance. IRS can, however, be a useful tool to prevent the spread of resistance and mitigate its effects because the number of options available for IRS mean chemical classes can be rotated through time, applied in a mosaic in space, or combined for use in the same place and time [9].
It is also important to note that, while our models included over 100 potential predictor variables that may influence selection for resistance, it is unlikely that we have captured the full set of causal variables underlying selection. In particular, data on the quantities of insecticides used in agriculture, and where they were applied, were not available [66]. Such information would better inform predictive relationships between resistance and agricultural insecticide use. We note that the relationships between insecticide resistance and the predictor variables represented in our models do not prove causality. Each variable interacts with other variables (S10 Fig and S11 Fig) and possibly with variables not included in our analysis. For example, variables describing climate processes were ranked as influential predictors (Fig 8), but these may delineate broad areas where resistance trends are similar as a result of an unmeasured process, such as mosquito population structure or species composition. More extensive data on the presence of resistance mechanisms—including a wider coverage of voltage-gated sodium channel (Vgsc) allele frequencies, as well as metabolic resistance markers [67]—in field populations will aid in predicting and interpreting resistance trends. The similarity in predicted spatiotemporal patterns in resistance across the 4 pyrethroids and DDT (e.g., Figs 2 and 4) suggests common underlying resistance mechanisms [55].
In some areas, predicted temporal trends indicate plateaus or declines in the prevalence of resistance following an increase. Interestingly, the areas that showed the strongest interannual declines in resistance over the time period often experienced strong increases in earlier years (S9 Fig). These oscillatory dynamics may be the result of fitness costs associated with the introduction of novel resistance mutations, but the extent of such costs in field A. gambiae populations is poorly understood [68]. Several resistance mechanisms exist in field mosquito populations, and they are complex polygenic processes that vary geographically and are continuing to evolve [68,69]. Declines in the prevalence of resistance may also result from a shift in the composition of sibling species; for example, increases in the proportion of A. arabiensis have been observed following the instigation of LLIN interventions that predominantly target A. gambiae, which bite humans indoors [70]. A future focus on African regions where resistance levels are still relatively low may be deliver new insights into how spread is initiated and how it can be mitigated.
While our analysis focuses on pyrethroids, insecticides from other classes such as carbamates and organophosphates are being increasingly used in IRS interventions [4]. The number of available susceptibility test results for insecticides from these classes is relatively low [8], and spatiotemporal analyses of resistance would benefit greatly from increasing the frequency and spatial coverage of sampling and testing. Susceptibility test data are also more limited for A. funestus, a major malaria vector in Africa that is widespread and among the dominant vector species [71]. While the available susceptibility test data for A. funestus are insufficient to support a geospatial analysis of the kind performed here, we note that the data indicate higher levels of pyrethroid resistance than we have seen for A. gambiae s.l. in southern parts of East Africa (Fig 9).
(A) The proportional mortality to pyrethroids in susceptibility tests performed on A. funestus in the years 1998–2017 (n = 692). Raw data are available in Moyes et al. 2019 [8]. (B) the locations of the samples in panel A and the recorded proportional mortality.
In summary, our results provide an Africa-wide perspective on recent trends in pyrethroid and DDT resistance in A. gambiae complex malaria vectors, demonstrating increasingly high prevalence of resistance to the main insecticides used in malaria control. The rapid spread of resistance across large parts of sub-Saharan Africa signals an urgent need to quantify the efficacy of different resistance management strategies and to understand the impact of resistance on malaria transmission and control. Relationships between insecticide resistance and malaria prevalence are currently poorly understood, but there is evidence that resistance can reduce the efficacy of standard pyrethroid-treated LLINs [3], which have played a key role in achieving reductions in malaria prevalence in Africa over 2000–2015 [1]. Our maps show marked broad-scale spatial heterogeneity in resistance, motivating the implementation and assessment of a wide range of strategies that target different insecticide resistance and malaria transmission settings, such as next-generation LLINs [3,9] as well as rotating insecticide use across different insecticide classes [9].
Methods
Data
Insecticide resistance bioassay data.
Insecticide resistance bioassay data were obtained from a published database [8], which is an updated version of the data used by Hancock and colleagues [55] that includes samples tested up until the end of 2017. The data record the number of mosquitoes in the sample and the proportional sample mortality resulting from the bioassay, as well as variables describing the mosquitoes tested, the sample collection site, and the bioassay conditions and protocol. We used this information to select a subset of records for inclusion in our study (S1 Text). In summary, we include bioassay results for which standard WHO susceptibility tests or CDC bottle bioassays using any one of the 4 pyrethroid types (deltamethrin, permethrin, λ-cyhalothrin, and α-cypermethrin) or the organochlorine DDT was performed on mosquito samples belonging to the A. gambiae species complex. We include results from bioassays conducted over the period 2005–2017. Due to spatial heterogeneity in the sampling distribution, we confine our analysis to samples collected from within 2 separate geographic (west and east) regions of sub-Saharan Africa (see Fig 1 and S1 Text). We excluded Madagascar from our analysis, as our models of resistance on the mainland may not generalize well to island populations. The final number of proportional mortality observations across all insecticide types was 6,423 across 1,466 locations, with 3,515 and 2,908 observations in the west region and east region, respectively (S7 Table and S8 Table).
Vgsc allele frequency data.
The Vgsc is the target site for both pyrethroids and DDT, and mutations in this channel confer resistance. Our analysis used data on the frequency of Vgsc mutations in mosquito samples belonging to the A. gambiae species complex collected from within the west and east regions over the period 2005–2017 [8,55]. These data record the combined frequency of the single point mutations L1014F and L1014S with respect to the wild-type allele L1014L and comprise 316 observations (215 observations for the west region and 101 observations for the east regions; S7 Table). As described subsequently, we incorporated these data into machine-learning models in order to inform prediction of phenotypic resistance to DDT and pyrethroids by exploiting the positive association between the frequency of Vgsc mutations and the prevalence of these resistance phenotypes [55].
Potential predictor variables.
Our set of predictors includes 111 variables describing environmental characteristics that could potentially be related to the development and spread of insecticide resistance in populations of A. gambiae complex mosquito species (described in S9 Table and S1 Text). These variables describe the coverage of insecticide-based vector control interventions, agricultural land use [72,73], and the environmental fate of agricultural insecticides [66], other types of land use [72,74–76], climate [72,77,78], and relative species abundance. Our vector control intervention data include a variable estimating the yearly coverage of ITNs [22,79] and a variable estimating the coverage of IRS with either pyrethroids or DDT year [4]. Relative species abundance is represented by a variable estimating the abundance of A. arabiensis relative to the abundance of A. gambiae and A. coluzzii [65]. For all variables, we obtained spatially explicit data on a grid with a 2.5 arc-minute resolution (which is approximately 5 km at the equator) covering sub-Saharan Africa. For variables for which temporal data were available on an annual resolution, we included time-lagged representations with lags of 0, 1, 2, and 3 years.
Gaussian process stacked generalization ensemble modelling approach
Stacked generalization is a method of combining an ensemble of models to produce a meta-model, with the aim of achieving better predictive performance than the individual model constituents [80,81]. Here, we adopt a stacking design whereby a set of individual models that make up the first layer, referred to as the “level 0 models,” feed into a single meta-model on the second layer, referred to as the “level 1 model.” We use the Gaussian process stacked generalization approach developed by Bhatt and colleagues [53], which uses Gaussian process regression as the level 1 model that combines weighted out-of-sample predictions from a set of multiple level 0 models derived from machine-learning methods. The approach exploits the known strengths of these different methodologies, using machine-learning methods to extract as much predictive power from the covariates as possible, and then allowing the Gaussian process to model the spatiotemporal error covariance structure, aiming to further improve prediction. Bhatt and colleagues [53] showed that, under the (restrictive) assumption that the true function is a part of the model’s function space, the use of the Gaussian process model of residual variation improves prediction accuracy compared with a standard constrained weighted mean across the ensemble predictions.
Machine-learning models.
Our set of level 0 models consists of 3 different types of machine-learning model that predict insecticide resistance, using our bioassay mortality observations as the label and our suite of intervention, agriculture, and environmental covariates as features. The machine-learning approaches employed include an XGB model (implemented using the R package xgboost), an RF model (implemented using the R package randomForest), and a BGAM model (implemented using the R package mboost). We chose these methods because of their demonstrated high predictive performance, particularly in previous applications of Gaussian process stacked generalization to spatial processes [53]. The label for the level 0 models was the proportional mortality observations from bioassays conducted using the 4 pyrethroid types (deltamethrin, permethrin, λ-cyhalothrin and α-cypermethrin), the proportional mortality observations for bioassays conducted using DDT, and the observations of the combined frequency of the Vgsc mutations L1014F and L1014S. We included in the label our data on the observed combined frequency of Vgsc mutations in mosquito samples, because these observations are significantly associated with the prevalence of resistance to DDT and pyrethroids [55] and can therefore inform prediction of these mortality values. Before performing parameter tuning on the level 0 models, we applied 2 data transformations to the label, the empirical logit transformation followed by the inverse hyperbolic sine (IHS) transformation [82].
The features used in the models included the 111 environmental predictor variables together with the 1-, 2-, and 3-year lags for those variables that vary temporally (on a yearly time step). A factor variable grouping the label according to the type of observation was also included as a feature, assigning a different group to bioassay observations depending on type of insecticide used and whether a WHO or CDC susceptibility test was used. This factor variable also assigned the Vgsc allele frequency observations to a separate group. Finally, the year in which the bioassay and allele frequency samples were collected was also included as a feature.
For each level 0 model, parameter tuning was performed using K-fold out-of-sample validation based on subdividing the data into K training and validation subsets (see S1 Text). In applying the XGB method, we used the DART boosting methodology to avoid overfitting [83].
Model stacking and Gaussian process regression.
Let gA(si,t) denote the (empirical logit and IHS-transformed) proportional mortality record for a bioassay using insecticide type A conducted on a sample collected at geographic coordinates si and sampling time t. To implement Gaussian process stacked generalization, we model the transformed observations, denoted gA(si,t), using a Gaussian process regression formulation:
(1)
in which wA is a constant vector,
is a design matrix, fA(s,t) is a Gaussian process modelled by a spatiotemporal Gaussian Markov random field (GMRF) [84], and eA is Gaussian white noise
. We define a Bayesian hierarchical formulation for the model (Eq 1) using a vector of prior probability distributions for the hyperparameters θA = [wA,ψA,σA] in which ψA are the parameters of fA(s,t) (see S1 Text). To fit the model, the elements of the design matrix
are set to the out-of-sample predictions of the level 0 models derived from K-fold cross-validation, i.e.,
, in which
is the prediction of the iith withheld (transformed) observation gA(si,t) given by the pth level 0 model. Validation folds were randomly selected from the full data set. Posterior distributions of θA and fA(s,t) are then estimated by fitting the model (Eq 1) using the R-INLA package (www.r-inla.org) [85]. The posterior mean of the vector wA contains the fitted weights for each model, representing the relative contribution of each model to the predictions made by the model ensemble. Our implementation of Gaussian process regression (Eq 1) constrains each weight to be positive (wp≥0,∀p) [86]. Once the parameter estimation has been performed, the final set of predictions,
, given by the stacked model are obtained by replacing the elements of
with the in-sample predictions of the l0 models obtained by fitting each of these models to all the data (all the labels and the corresponding sets of features) [53] (S1 Text).
Posterior validation.
We performed posterior validation of the stacked model using 10-fold out-of-sample cross-validation, whereby the data were divided into 10 subsets (or “test” sets, using random sampling without replacement), and 10 successive model fits were performed, each withholding a different test set. Each test set was withheld from both the level 0 and level 1 models. We used these out-of-sample predictions to assess the accuracy of the predicted means of the observations as well as their predicted CIs (S1 Text). We also assessed the suitability of our assumed data-generating process using probability integral transform (PIT) histograms on out-of-sample data (S1 Text).
Predictor variable importance.
We calculated measures of the importance of each predictor variable for each of the machine-learning models used in our model ensemble. For XGB, we used the gain measure calculated for each variable using the xgboost package [87], which is the fractional total reduction in the training error gained across all of that variable’s splits. For RF, we used the permutation importance measure calculated using the randomForest package [88], which is the fractional change in the out-of-bag error when the variable is randomly permuted. In the case of BGAM, we used the mboost package [89] to calculate variable importance as the total reduction in the training error across all boosting iterations in which that variable was chosen as the base learner. For each model, we express the importance of a single variable as a fraction of the total importance across all predictor variables in that model.
Code availability
R code for implementing the XGB, RF, and BGAM models, as well as the R-INLA models for Gaussian process stacked generalisation, is available on GitHub at 10.5281/zenodo.3751786 [90].
Supporting information
S1 Fig. Predicted mean proportional mortality to permethrin across the west and east regions.
(A) 2005, (B) 2010, (C) 2015, and (D) 2017. See 10.6084/m9.figshare.9912623.
https://doi.org/10.1371/journal.pbio.3000633.s001
(TIF)
S2 Fig. Predicted mean proportional mortality to λ-cyhalothrin across the west and east regions.
(A) 2005, (B) 2010, (C) 2015, and (D) 2017. See 10.6084/m9.figshare.9912623.
https://doi.org/10.1371/journal.pbio.3000633.s002
(TIF)
S3 Fig. Predicted mean proportional mortality to α-cypermethrin across the west and east regions.
(A) 2005, (B) 2010, (C) 2015, and (D) 2017. See 10.6084/m9.figshare.9912623.
https://doi.org/10.1371/journal.pbio.3000633.s003
(TIF)
S4 Fig. Histograms of the approximate cross-validated PIT values comparing observations and cumulative predictive densities across all susceptibility test observations for pyrethroids.
Numerical values are provided in S5 Data (10.6084/m9.figshare.9912623).
https://doi.org/10.1371/journal.pbio.3000633.s004
(TIF)
S5 Fig. Predictions of mean proportional mortality from 10-fold out-of-sample validation performed on the Gaussian process regression meta-model.
The vertical axis shows the corresponding value observed from the bioassay. Values for all data points for all pyrethroid types (deltamethrin, permethrin, λ-cyhalothrin, and α-cypermethrin) for the west region (red markers) and the east region (blue markers) are shown. The RMSE across all data values is 0.179 (RMSE = 0.191 for the data within the west region and RMSE = 0.166 for the data within the east region). Numerical values are provided in S6 Data (10.6084/m9.figshare.9912623). RMSE, root mean square error.
https://doi.org/10.1371/journal.pbio.3000633.s005
(TIF)
S6 Fig. The proportion of withheld data points that fell within the predicted CIs, based on 10-fold out-of-sample validation, when accounting for the estimated measurement error (see S1 Text).
Numerical values are provided in S7 Data (10.6084/m9.figshare.9912623). CI, credible interval.
https://doi.org/10.1371/journal.pbio.3000633.s006
(TIF)
S7 Fig. The prediction error (95% CI) associated with predicted mean mortality to DDT.
See 10.6084/m9.figshare.9912623. CI, credible interval.
https://doi.org/10.1371/journal.pbio.3000633.s007
(TIF)
S8 Fig.
The predicted mean proportional mortality to deltamethrin over time for the point locations in the east (A) and west (B) regions that experienced the greatest overall increase in resistance from 2005 to 2017 (Fig 2; locations A, B, C, D, E, F, G, H, and I). Dashed lines show the 95% CIs of the predicted mean mortality. Numerical values are provided in S8 Data (10.6084/m9.figshare.9912623). CI, credible interval.
https://doi.org/10.1371/journal.pbio.3000633.s008
(TIF)
S9 Fig.
The maximum interannual change in the predicted mean mortality to deltamethrin over the time period 2005–2017 at each location within the west and east regions: (A) the maximum interannual decrease, (B) the maximum interannual increase. Interannual increases and decreases in predicted mortality are calculated as the difference in predictions between 2 consecutive years, for all years 2005 to 2017. See 10.6084/m9.figshare.9912623.
https://doi.org/10.1371/journal.pbio.3000633.s009
(TIF)
S10 Fig. The Pearson correlation coefficient between each of the 20 variables with the highest weighted variable importance value for the models fitted to the West Africa data set.
https://doi.org/10.1371/journal.pbio.3000633.s010
(TIF)
S11 Fig. The Pearson correlation coefficient between each of the 20 variables with the highest weighted variable importance value for the models fitted to the East Africa data set.
https://doi.org/10.1371/journal.pbio.3000633.s011
(TIF)
S1 Table. Fitted parameters of the Bayesian Gaussian process regression models.
Numbers in brackets are the 95% CIs. CI, credible interval.
https://doi.org/10.1371/journal.pbio.3000633.s012
(DOCX)
S2 Table. The RMSE given by 10-fold out-of-sample validation performed on the Gaussian process regression meta-model fitted to the bioassay records for the four pyrethroid insecticides (deltamethrin, permethrin, λ-cyhalothrin, and α-cypermethrin) and each of the 3 machine-learning model constituents.
The unit of the transformed RMSE values corresponds to the (empirical logit and IHS-transformed) observations to which the models were fitted. IHS, inverse hyperbolic sine; RMSE, root mean square error.
https://doi.org/10.1371/journal.pbio.3000633.s013
(DOCX)
S3 Table. The RMSE given by 10-fold out-of-sample validation performed on the Gaussian process regression meta-model fitted to the bioassay records for DDT and each of the 3 machine-learning model constituents.
The unit of the transformed RMSE values corresponds to the (empirical logit and IHS-transformed) observations to which the models were fitted. IHS, inverse hyperbolic sine; RMSE, root mean square error.
https://doi.org/10.1371/journal.pbio.3000633.s014
(DOCX)
S4 Table. The fitted weights for each constituent model included in the Gaussian process regression meta-model.
https://doi.org/10.1371/journal.pbio.3000633.s015
(DOCX)
S5 Table. Variable importance values for predictor variables given by each machine-learning model included in the model ensemble for the west region.
The 30 variables that were most highly ranked by XGB are shown. Definitions of each predictor variable are given in S9 Table. Variable name suffixes (-1), (-2) and (-3) denote time lags of 1, 2, and 3 years, respectively. One, two, and three asterisks denote the first, second, and third principal component, respectively, for variables available on a monthly time step. XGB, extreme gradient boosting model.
https://doi.org/10.1371/journal.pbio.3000633.s016
(DOCX)
S6 Table. Variable importance values for predictor variables given by each machine-learning model included in the model ensemble for the east region.
The 30 variables that were most highly ranked by XGB are shown. Definitions of each predictor variable are given in S9 Table. Variable name suffixes (-1), (-2) and (-3) denote time lags of 1, 2, and 3 years, respectively. One, two and three asterisks denote the first, second, and third principal component, respectively, for variables available on a monthly time step. XGB, extreme gradient boosting model.
https://doi.org/10.1371/journal.pbio.3000633.s017
(DOCX)
S7 Table. Number of bioassay records for each insecticide type and number of Vgsc allele frequency observations.
https://doi.org/10.1371/journal.pbio.3000633.s018
(DOCX)
S8 Table. Number of bioassay records for each year for each insecticide class.
https://doi.org/10.1371/journal.pbio.3000633.s019
(DOCX)
S9 Table. Descriptions of each potential explanatory variable used in the ensemble model.
If the data layer was obtained from an online repository, the URL and date accessed are given. If the data layer has a citation, then this is given.
https://doi.org/10.1371/journal.pbio.3000633.s020
(DOCX)
S1 Text. Supplementary information about the modelling methodology.
https://doi.org/10.1371/journal.pbio.3000633.s022
(PDF)
Acknowledgments
The authors are extremely grateful to the many people who contributed unpublished data sets and to the authors who provided additional information linked to their published works.
References
- 1. Bhatt S, Weiss DJ, Cameron E, Bisanzio D, Mappin B, Dalrymple U, et al. The effect of malaria control on Plasmodium falciparum in Africa between 2000 and 2015. Nature. 2015;526(7572):207–2011. pmid:26375008
- 2. Corbel V, Chabi J, Dabire RK, Etang J, Nwane P, Pigeon O, et al. Field efficacy of a new mosaic long-lasting mosquito net (PermaNet (R) 3.0) against pyrethroid-resistant malaria vectors: a multi centre study in Western and Central Africa. Malar J. 2010;9. pmid:20423479
- 3. Protopopoff N, Mosha JF, Lukole E, Charlwood JD, Wright A, Mwalimu CD, et al. Effectiveness of a long-lasting piperonyl butoxide-treated insecticidal net and indoor residual spray interventions, separately and together, against malaria transmitted by pyrethroid-resistant mosquitoes: a cluster, randomised controlled, two-by-two factorial design trial. Lancet. 2018;391(10130):1577–88. pmid:29655496
- 4. Tangena J-A, Hendricks CJM, Devine M, Tammaro M, Trett AE, de Pina A, et al. Indoor residual spraying for malaria control in Sub-Saharan Africa 1997 to 2017: an adjusted retrospective analysis. Malar J. 2020;19:150. pmid:32276585
- 5. Sherrard-Smith E, Griffin JT, Winskill P, Corbel V, Pennetier C, Djenontin A, et al. Systematic review of indoor residual spray efficacy and effectiveness against Plasmodium falciparum in Africa. Nat Commun. 2018;9. pmid:30478327
- 6. Edi CAV, Koudou BG, Bellai L, Adja AM, Chouaibou M, Bonfoh B, et al. Long-term trends in Anopheles gambiae insecticide resistance in Cote d'Ivoire. Parasit Vectors. 2014;7. pmid:25429888
- 7. Ranson H, Lissenden N. Insecticide resistance in African Anopheles mosquitoes: A worsening situation that needs urgent action to maintain malaria control. Trends Parasitol. 2016;32(3):187–96. pmid:26826784
- 8. Moyes CL, Wiebe A, Gleave K, Trett A, Hancock PA, Padonou GG, et al. Analysis-ready datasets for insecticide resistance phenotype and genotype frequency in African malaria vectors. Sci Data. 2019;6(1):121. pmid:31308378
- 9.
World Health Organization. Global Plan for Insecticide Resistance Management in Malaria Vectors. Geneva: World Health Organization, 2012.
- 10. Coleman M, Hemingway J, Gleave KA, Wiebe A, Gething PW, Moyes CL. Developing global maps of insecticide resistance risk to improve vector control. Malar J. 2017;16. pmid:28222727
- 11. Camara S, Koffi AA, Alou LPA, Koffi K, Kabran JPK, Kone A, et al. Mapping insecticide resistance in Anopheles gambiae (s.l.) from Cote d'Ivoire. Parasit Vectors. 2018;11. pmid:29310704
- 12. Wat'senga F, Manzambi EZ, Lunkula A, Mulumbu R, Mampangulu T, Lobo N, et al. Nationwide insecticide resistance status and biting behaviour of malaria vector species in the Democratic Republic of Congo. Malar J. 2018;17. pmid:29580247
- 13. Badolo A, Traore A, Jones CM, Sanou A, Flood L, Guelbeogo WM, et al. Three years of insecticide resistance monitoring in Anopheles gambiae in Burkina Faso: resistance on the rise? Malar J. 2012;11. pmid:22799568
- 14. Abdalla H, Wilding CS, Nardini L, Pignatelli P, Koekemoer LL, Ranson H, et al. Insecticide resistance in Anopheles arabiensis in Sudan: temporal trends and underlying mechanisms. Parasit Vectors. 2014;7. pmid:24886129
- 15. Kabula B, Tungu P, Malima R, Rowland M, Minja J, Wililo R, et al. Distribution and spread of pyrethroid and DDT resistance among the Anopheles gambiae complex in Tanzania. Med Vet Entomol. 2014;28(3):244–52. pmid:24192019
- 16. Hakizimana E, Karema C, Munyakanage D, Iranzi G, Githure J, Tongren JE, et al. Susceptibility of Anopheles gambiae to insecticides used for malaria vector control in Rwanda. Malar J. 2016;15. pmid:27905919
- 17. Foster GM, Coleman M, Thomsen E, Ranson H, Yangalbe-Kalnone E, Moundai T, et al. Spatial and temporal trends in insecticide resistance among malaria vectors in Chad highlight the importance of continual monitoring. PLoS ONE. 2016;11(5). pmid:27228026
- 18. Messenger LA, Shililu J, Irish SR, Anshebo GY, Tesfaye AG, Ye-Ebiyo Y, et al. Insecticide resistance in Anopheles arabiensis from Ethiopia (2012–2016): a nationwide study for insecticide resistance monitoring. Malar J. 2017;16. pmid:29151024
- 19. Mandeng SE, Awono-Ambene HP, Bigoga JD, Ekoko WE, Binyang J, Piameu M, et al. Spatial and temporal development of deltamethrin resistance in malaria vectors of the Anopheles gambiae complex from North Cameroon. PLoS ONE. 2019;14(2). pmid:30779799
- 20. Ismail BA, Kafy HT, Sulieman JE, Subramaniam K, Thomas B, Mnzava A, et al. Temporal and spatial trends in insecticide resistance in Anopheles arabiensis in Sudan: outcomes from an evaluation of implications of insecticide resistance for malaria vector control. Parasit Vectors. 2018;11. pmid:29499751
- 21. Chouaibou M, Etang J, Brevault T, Nwane P, Hinzoumbe CK, Mimpfoundi R, et al. Dynamics of insecticide resistance in the malaria vector Anopheles gambiae s.l. from an area of extensive cotton cultivation in Northern Cameroon. Trop Med Int Health. 2008;13(4):476–86. pmid:18248566
- 22. Bhatt S, Weiss DJ, Mappin B, Dalrymple U, Cameron E, Bisanzio D, et al. Coverage and system efficiencies of insecticide-treated nets in Africa from 2000 to 2017. eLife. 2015;4. pmid:26714109
- 23. Noor AM, Mutheu JJ, Tatem AJ, Hay SI, Snow RW. Insecticide-treated net coverage in Africa: mapping progress in 2000–07. Lancet. 2009;373(9657):58–67. pmid:19019422
- 24. Cook J, Tomlinson S, Kleinschmidt I, Donnelly MJ, Akogbeto M, Adechoubou A, et al. Implications of insecticide resistance for malaria vector control with long-lasting insecticidal nets: trends in pyrethroid resistance during a WHO-coordinated multi-country prospective study. Parasit Vectors. 2018;11. pmid:30348209
- 25. Mathias DK, Ochomo E, Atieli F, Ombok M, Bayoh MN, Olang G, et al. Spatial and temporal variation in the kdr allele L1014S in Anopheles gambiae s.s. and phenotypic variability in susceptibility to insecticides in Western Kenya. Malar J. 2011;10. pmid:21235783
- 26. Vontas J, Grigoraki L, Morgan J, Tsakireli D, Fuseini G, Segura L, et al. Rapid selection of a pyrethroid metabolic enzyme CYP9K1 by operational malaria control activities. Proc Natl Acad Sci U S A. 2018;115(18):4619–24. pmid:29674455
- 27. Rakotoson JD, Fornadel CM, Belemvire A, Norris LC, George K, Caranci A, et al. Insecticide resistance status of three malaria vectors, Anopheles gambiae (s.l.), An. funestus and An. mascarensis, from the south, central and east coasts of Madagascar. Parasit Vectors. 2017;10. pmid:28835269
- 28. Reid MC, McKenzie FE. The contribution of agricultural insecticide use to increasing insecticide resistance in African malaria vectors. Malar J. 2016;15. pmid:26895980
- 29. Hien AS, Soma DD, Hema O, Bayili B, Namountougou M, Gnankine O, et al. Evidence that agricultural use of pesticides selects pyrethroid resistance within Anopheles gambiae s.l. populations from cotton growing areas in Burkina Faso, West Africa. PLoS ONE. 2017;12(3). pmid:28253316
- 30. Miles A, Harding NJ, Botta G, Clarkson CS, Antao T, Kozak K, et al. Genetic diversity of the African malaria vector Anopheles gambiae. Nature. 2017;552(7683):96–+. pmid:29186111
- 31. Weedall GD, Mugenzi LMJ, Menze BD, Tchouakui M, Ibrahim SS, Amvongo-Adjia N, et al. A cytochrome P450 allele confers pyrethroid resistance on a major African malaria vector, reducing insecticide-treated bednet efficacy. Sci Transl Med. 2019;11. pmid:30894503
- 32. Simard F, Ayala D, Kamdem GC, Pombi M, Etouna J, Ose K, et al. Ecological niche partitioning between Anopheles gambiae molecular forms in Cameroon: the ecological side of speciation. BMC Ecol. 2009;9. pmid:19460146
- 33. Wiebe A, Longbottom J, Gleave K, Shearer FM, Sinka ME, Massey NC, et al. Geographical distributions of African malaria vector sibling species and evidence for insecticide resistance. Malar J. 2017;16:85. pmid:28219387
- 34. Massey NC, Garrod G, Wiebe A, Henry AJ, Huang Z, Moyes CL, et al. A global bionomic database for the dominant vectors of human malaria. Sci Data. 2016;3:160014. pmid:26927852
- 35. Balabanidou V, Kefi M, Aivaliotis M, Koidou V, Girotti JR, Mijailovsky SJ, et al. Mosquitoes cloak their legs to resist insecticides. Proc R Soc Lond B Biol Sci. 2019;286(1907). pmid:31311476
- 36. David JP, Ismail HM, Chandor-Proust A, Paine MJI. Role of cytochrome P450s in insecticide resistance: impact on the control of mosquito-borne diseases and use of insecticides on Earth. Philos Trans R Soc Lond B Biol Sci. 2013;368(1612). pmid:23297352
- 37. Liu NN. Insecticide Resistance in Mosquitoes: Impact, Mechanisms, and Research Directions. In: Berenbaum MR, editor. Annual Review of Entomology, Vol 60. Annual Review of Entomology. 602015. p. 537–59. pmid:25564745
- 38. Lynd A, Oruni A, van't Hof AE, Morgan JC, Naego LB, Pipini D, et al. Insecticide resistance in Anopheles gambiae from the northern Democratic Republic of Congo, with extreme knockdown resistance (kdr) mutation frequencies revealed by a new diagnostic assay. Malar J. 2018;17. pmid:30400885
- 39. Lucas ER, Rockett KA, Lynd A, Essandoh J, Grisales N, Kemei B, et al. A high throughput multi-locus insecticide resistance marker panel for tracking resistance emergence and spread in Anopheles gambiae. Sci Rep. 2019;9. pmid:31527637
- 40.
Christiaensen L, Demery L, Dillon B, Barrett CB, Deininger KW, Kaminski J. Agriculture in Africa: telling myths from facts. Washington D.C.: World Bank, 2018.
- 41. Donnelly MJ, Simard F, Lehmann T. Evolutionary studies of malaria vectors. Trends Parasitol. 2002;18(2):75–80. pmid:11832298
- 42. Lehmann T, Licht M, Elissa N, Maega BTA, Chimumbwa JM, Watsenga FT, et al. Population structure of Anopheles gambiae in Africa. J Hered. 2003;94(2):133–47. pmid:12721225
- 43. Neafsey DE, Lawniczak MKN, Park DJ, Redmond SN, Coulibaly MB, Traore SF, et al. SNP genotyping defines complex gene-flow boundaries among African malaria vector mosquitoes. Science. 2010;330(6003):514–7. pmid:20966254
- 44. Pinto J, Egyir-Yawson A, Vicente JL, Gomes B, Santolamazza F, Moreno M, et al. Geographic population structure of the African malaria vector Anopheles gambiae suggests a role for the forest-savannah biome transition as a barrier to gene flow. Evol Appl. 2013;6(6):910–24. pmid:24062800
- 45. Barnes KG, Irving H, Chiumia M, Mzilahowa T, Coleman M, Hemingway J, et al. Restriction to gene flow is associated with changes in the molecular basis of pyrethroid resistance in the malaria vector Anopheles funestus. Proc Natl Acad Sci U S A. 2017;114(2):286–91. pmid:28003461
- 46. Weetman D, Wilding CS, Steen K, Pinto J, Donnelly MJ. Gene flow-dependent genomic divergence between Anopheles gambiae M and S forms. Mol Biol Evol. 2012;29(1):279–91. pmid:21836185
- 47. Lee Y, Marsden CD, Norris LC, Collier TC, Main BJ, Fofana A, et al. Spatiotemporal dynamics of gene flow and hybrid fitness between the M and S forms of the malaria mosquito, Anopheles gambiae. Proc Natl Acad Sci U S A. 2013;110(49):19854–9. pmid:24248386
- 48. Clarkson CS, Weetman D, Essandoh J, Yawson AE, Maslen G, Manske M, et al. Adaptive introgression between Anopheles sibling species eliminates a major genomic island but not reproductive isolation. Nat Commun. 2014;5. pmid:24963649
- 49. Ochomo E, Subramaniam K, Kemei B, Rippon E, Bayoh NM, Kamau L, et al. Presence of the knockdown resistance mutation, Vgsc-1014F in Anopheles gambiae and An. arabiensis in western Kenya. Parasit Vectors. 2015;8. pmid:26626424
- 50. Caputo B, Pichler V, Mancini E, Pombi M, Vicente JL, Dinis J, et al. The last bastion? X chromosome genotyping of Anopheles gambiae species pair males from a hybrid zone reveals complex recombination within the major candidate 'genomic island of speciation'. Mol Ecol. 2016;25(22):5719–31. pmid:27661465
- 51. Vicente JL, Clarkson CS, Caputo B, Gomes B, Pombi M, Sousa CA, et al. Massive introgression drives species radiation at the range limit of Anopheles gambiae. Sci Rep. 2017;7. pmid:28417969
- 52. Yunta C, Hemmings K, Stevenson B, Koekemoer LL, Matambo T, Pignatelli P, et al. Cross-resistance profiles of malaria mosquito P450s associated with pyrethroid resistance against WHO insecticides. Pestic Biochem Physiol. 2019;161:61–7. pmid:31685198
- 53. Bhatt S, Cameron E, Flaxman SR, Weiss DJ, Smith DL, Gething PW. Improved prediction accuracy for disease risk mapping using Gaussian process stacked generalization. Journal of the Royal Society Interface. 2017;14(134). pmid:28931634
- 54.
Blangiardo M, Cameletti M. Spatial and Spatio-Temporal Bayesian Models with R-INLA2015. 1–308 p.
- 55. Hancock PA, Wiebe A, Gleave KA, Bhatt S, Cameron E, Trett A, et al. Associated patterns of insecticide resistance in field populations of malaria vectors across Africa. Proc Natl Acad Sci U S A. 2018;115(23):5938–43. pmid:29784773
- 56. Gelman A, Hwang J, Vehtari A. Understanding predictive information criteria for Bayesian models. Stat Comput. 2014;24(6):997–1016.
- 57. Gneiting T, Raftery AE. Strictly proper scoring rules, prediction, and estimation. J Am Stat Assoc. 2007;102(477):359–78.
- 58. Lindgren F, Rue H, Lindstrom J. An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach. J R Stat Soc B. 2011;73:423–98.
- 59. Churcher TS, Lissenden N, Griffin JT, Worrall E, Ranson H. The impact of pyrethroid resistance of the efficacy and effectiveness of bednets for malaria control in Africa. eLife. 2016;5. pmid:27547988
- 60. Kamau L, Mukabana WR, Hawley WA, Lehmann T, Irungu LW, Orago AAS, et al. Analysis of genetic variability in Anopheles arabiensis and Anopheles gambiae using microsatellite loci. Insect Mol Biol. 1999;8(2):287–97. pmid:10380112
- 61. Lehmann T, Blackston CR, Besansky NJ, Escalante AA, Collins FH, Hawley WA. The Rift Valley complex as a barrier to gene flow for Anopheles gambiae in Kenya: The mtDNA perspective. J Hered. 2000;91(2):165–8. pmid:10768135
- 62. Gericke A, Govere JM, Durrheim DN. Insecticide susceptibility in the South African malaria mosquito Anopheles arabiensis (Diptera: Culicidae). S Afr J Sci. 2002;98(3–4):205–8.
- 63. Killeen GF, Kiware SS, Okumu FO, Sinka ME, Moyes CL, Massey NC, et al. Going beyond personal protection against mosquito bites to eliminate malaria transmission: population suppression of malaria vectors that exploit both human and animal blood. BMJ Glob Health. 2017;2(2). pmid:28589015
- 64. Protopopoff N, Matowo J, Malima R, Kavishe R, Kaaya R, Wright A, et al. High level of resistance in the mosquito Anopheles gambiae to pyrethroid insecticides and reduced susceptibility to bendiocarb in north-western Tanzania. Malar J. 2013;12. pmid:23638757
- 65. Sinka ME, Golding N, Massey NC, Wiebe A, Huang Z, Hay SI, et al. Modelling the relative abundance of the primary African vectors of malaria before and after the implementation of indoor, insecticide-based vector control. Malar J. 2016;15. pmid:26945997
- 66. Hendriks CJM, Gibson H, Trett A, Python A, Weiss DJ, Vrieling A, et al. Mapping geospatial processes affecting the environmental fate of agricultural pesticides in Africa. Int J Environ Res Public Health. 2019;16(3523): https://doi.org/10.3390/ijerph16193523.
- 67. Weetman D, Wilding CS, Neafsey DE, Muller P, Ochomo E, Isaacs AT, et al. Candidate-gene based GWAS identifies reproducible DNA markers for metabolic pyrethroid resistance from standing genetic variation in East African Anopheles gambiae. Sci Rep. 2018;8. pmid:29440767
- 68. Lucas ER, Miles A, Harding NJ, Clarkson CS, Lawniczak MKN, Kwiatkowski DP, et al. Whole-genome sequencing reveals high complexity of copy number variation at insecticide resistance loci in malaria mosquitoes. Genome Res. 2019;29(8):1250–61. pmid:31345938
- 69. The Anopheles gambiae 1000 Genomes Consortium. Genetic diversity of the African malaria vector Anopheles gambiae. Nature. 2017;96(552).
- 70. Bayoh MN, Mathias DK, Odiere MR, Mutuku FM, Kamau L, Gimnig JE, et al. Anopheles gambiae: historical population decline associated with regional distribution of insecticide-treated bed nets in western Nyanza Province, Kenya. Malar J. 2010;9. pmid:20187956
- 71. Sinka ME, Bangs MJ, Manguin S, Rubio-Palis Y, Chareonviriyaphap T, Coetzee M, et al. A global map of dominant malaria vectors. Parasit Vectors. 2012;5. pmid:22475528
- 72. Friedl M, Sulla-Menashe D. MCD12Q1 MODIS/Terra+Aqua Land Cover Type Yearly L3 Global 500m SIN Grid V006. NASA EOSDIS Land Processes DAAC; 2015.
- 73.
You L, Wood-Sichra U, Fritz S, Guo Z, See L, Koo J. Spatial production allocation model (SPAM) 2005 v2.0: mapspam.info. Available from: mapspam.info.
- 74. Tatem AJ. WorldPop, open data for spatial demography. Sci Data. 2017;4.
- 75. Sulla-Menashe D, Gray JM, Abercrombie SP, Friedl MA. Hierarchical mapping of annual global land cover 2001 to present: The MODIS Collection 6 Land Cover product. Remote Sensing of Environment. 2019;222:183–94.
- 76. Esch T, Heldens W, Hirner A, Keil M, Marconcini M, Roth A, et al. Breaking new ground in mapping human settlements from space—The Global Urban Footprint. ISPRS J Photogramm Remote Sens. 2017;134:30–42.
- 77. Funk C, Peterson P, Landsfeld DP, Verdin J, Shukla S, Husak G, et al. The climate hazards infrared precipitation with stations—a new environmental record for monitoring extremes. Sci Data. 2. 2015. pmid:26646728
- 78. Trabucco A, Zomer RJ. Global Aridity Index (Global-Aridity) and Global Potential Evapo-Transpiration (Global-PET) Geospatial Database. CGIAR-CSI GeoPortal; 2009.
- 79. Weiss DJ, Lucas TCD, Nguyen M, Nandi A, Bisanzio D, Battle KE, et al. The global landscape of Plasmodium falciparum prevalence, incidence and mortality 2000–2017. Lancet. 2019;accepted.
- 80.
Ting KM, Witten IH. Stacked generalization: when does it work? In: Pollack ME, editor. Ijcai-97—Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, Vols 1 and 2. International Joint Conference on Artificial Intelligence1997. p. 866–71.
- 81. Wolpert DH. Stacked generalization. Neural Networks. 1992;5(2):241–59.
- 82. Burbidge JB, Magee L, Robb AL. Alternative transformations to handle extreme values of the dependent variable. J Am Stat Assoc. 1988;83(401):123–7.
- 83.
Vinayak RK, Gilad-Bachrach R. DART: Dropouts meet Multiple Additive Regression Trees. In: Guy L, Vishwanathan SVN, editors. Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics; Proceedings of Machine Learning Research: PMLR; 2015. p. 489–97.
- 84. Cameletti M, Lindgren F, Simpson D, Rue H. Spatio-temporal modeling of particulate matter concentration through the SPDE approach. Asta-Advances in Statistical Analysis. 2013;97(2):109–31.
- 85.
Rue H, Held L. Gaussian Markov Random Fields: Theory and Applications. Boca Raton, FL: Chapman & Hall.
- 86. Breiman L. Stacked regressions. Machine Learning. 1996;24(1):49–64.
- 87.
Chen T, Guestrin C. XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGDD International Conference on Knowledge Discovery and Data Mining. KDD ‘16. New York, NY, USA: ACM; 2016. p. 785–94.
- 88. Liaw A, Wiener M. Classification and regression by randomForest. R News. 2002;2(3):18–22.
- 89. Hothorn T, Buehlmann P, Kneib T, Schmid M, Hofner B. mboost: Model-Based Boosting. R package version 2.9–1 ed. https://CRAN.R-project.org/package = mboost: R package version 2.9–1; 2018.
- 90. Hancock PA. pahanc/Mapping-insecticide-resistance: Insecticide resistance model ensemble. 2020.