Skip to main content
Advertisement
  • Loading metrics

Hotspots and super-spreaders: Modelling fine-scale malaria parasite transmission using mosquito flight behaviour

  • Luigi Sedda ,

    Roles Conceptualization, Formal analysis, Methodology, Writing – original draft, Writing – review & editing

    l.sedda@lancaster.ac.uk (LS); chris.jones@lstmed.ac.uk (CMJ)

    Affiliation Lancaster Ecology and Epidemiology Group, Lancaster Medical School, Lancaster University, United Kingdom

  • Robert S. McCann,

    Roles Data curation, Methodology, Writing – review & editing

    Affiliations Laboratory of Entomology, Wageningen University & Research, Wageningen, The Netherlands, School of Global and Public Health, Kamuzu University of Health Sciences, Blantyre, Malawi, Center for Vaccine Development and Global Health, University of Maryland School of Medicine, Baltimore, Maryland, United States of America

  • Alinune N. Kabaghe,

    Roles Data curation, Writing – review & editing

    Affiliation School of Global and Public Health, Kamuzu University of Health Sciences, Blantyre, Malawi

  • Steven Gowelo,

    Roles Data curation, Writing – review & editing

    Affiliations Laboratory of Entomology, Wageningen University & Research, Wageningen, The Netherlands, School of Global and Public Health, Kamuzu University of Health Sciences, Blantyre, Malawi, MAC Communicable Diseases Action Centre, Kamuzu University of Health Sciences, Blantyre, Malawi

  • Monicah M. Mburu,

    Roles Data curation, Writing – review & editing

    Affiliations Laboratory of Entomology, Wageningen University & Research, Wageningen, The Netherlands, School of Global and Public Health, Kamuzu University of Health Sciences, Blantyre, Malawi

  • Tinashe A. Tizifa,

    Roles Data curation, Writing – review & editing

    Affiliations School of Global and Public Health, Kamuzu University of Health Sciences, Blantyre, Malawi, Center for Tropical Medicine and Travel Medicine, University of Amsterdam, The Netherlands

  • Michael G. Chipeta,

    Roles Data curation, Writing – review & editing

    Affiliations School of Global and Public Health, Kamuzu University of Health Sciences, Blantyre, Malawi, Malawi-Liverpool-Wellcome Trust Clinical Research Programme, Blantyre, Malawi

  • Henk van den Berg,

    Roles Data curation, Writing – review & editing

    Affiliation Laboratory of Entomology, Wageningen University & Research, Wageningen, The Netherlands

  • Willem Takken,

    Roles Data curation, Writing – review & editing

    Affiliation Laboratory of Entomology, Wageningen University & Research, Wageningen, The Netherlands

  • Michèle van Vugt,

    Roles Data curation, Writing – review & editing

    Affiliation Center for Tropical Medicine and Travel Medicine, University of Amsterdam, The Netherlands

  • Kamija S. Phiri,

    Roles Data curation, Writing – review & editing

    Affiliation School of Global and Public Health, Kamuzu University of Health Sciences, Blantyre, Malawi

  • Russell Cain,

    Roles Software

    Affiliation Lancaster Ecology and Epidemiology Group, Lancaster Medical School, Lancaster University, United Kingdom

  • Julie-Anne A. Tangena,

    Roles Data curation, Formal analysis, Investigation, Writing – review & editing

    Affiliation Vector Biology Department, Liverpool School of Tropical Medicine, Liverpool, United Kingdom

  • Christopher M. Jones

    Roles Conceptualization, Funding acquisition, Project administration, Writing – review & editing

    l.sedda@lancaster.ac.uk (LS); chris.jones@lstmed.ac.uk (CMJ)

    Affiliations Malawi-Liverpool-Wellcome Trust Clinical Research Programme, Blantyre, Malawi, Vector Biology Department, Liverpool School of Tropical Medicine, Liverpool, United Kingdom

Abstract

Malaria hotspots have been the focus of public health managers for several years due to the potential elimination gains that can be obtained from targeting them. The identification of hotspots must be accompanied by the description of the overall network of stable and unstable hotspots of malaria, especially in medium and low transmission settings where malaria elimination is targeted. Targeting hotspots with malaria control interventions has, so far, not produced expected benefits. In this work we have employed a mechanistic-stochastic algorithm to identify clusters of super-spreader houses and their related stable hotspots by accounting for mosquito flight capabilities and the spatial configuration of malaria infections at the house level. Our results show that the number of super-spreading houses and hotspots is dependent on the spatial configuration of the villages. In addition, super-spreaders are also associated to house characteristics such as livestock and family composition. We found that most of the transmission is associated with winds between 6pm and 10pm although later hours are also important. Mixed mosquito flight (downwind and upwind both with random components) were the most likely movements causing the spread of malaria in two out of the three study areas. Finally, our algorithm (named MALSWOTS) provided an estimate of the speed of malaria infection progression from house to house which was around 200–400 meters per day, a figure coherent with mark-release-recapture studies of Anopheles dispersion. Cross validation using an out-of-sample procedure showed accurate identification of hotspots. Our findings provide a significant contribution towards the identification and development of optimal tools for efficient and effective spatio-temporal targeted malaria interventions over potential hotspot areas.

Author summary

The dispersal of infectious Anopheles mosquitoes is critical to determining the geographical range over which malaria parasites are transmitted between human hosts and mosquito vectors. Malaria rates in the human population vary over space and time and are often characterised by hotspots, where disproportionately few hosts or individuals contribute to malaria transmission. Here, we present an approach to determine the location of malaria hotspots and super spreader houses based on modelling infectious mosquito movements from house to house using infection and wind data collected as part of a wider malaria study in southern Malawi. From our model, we show that it is possible to determine key components of malaria transmission including the identification of stable and unstable malaria hotspots (including super spreader houses), how quickly malaria spreads between households, quantify the importance of village configuration on malaria spread and identify the most important wind types in the local ecological setting. We conclude that it is possible to determine networks of mosquito-borne infection from combining infection and wind data. The identification of malaria hotspots presents an opportunity to target malaria control efforts in areas where malaria is disproportionately high.

Introduction

Targeted malaria control is an important and potentially cost-effective approach [1,2]. Developing effective strategies for implementing malaria interventions on a targeted basis is particularly important in an era where countries are striving to achieve malaria elimination targets using limited funding and when progress in reducing malaria cases has stalled over the last 5 years [3]. One critical component of a targeted control strategy is the accurate delineation of entomological and epidemiological hotspots (areas where vector/s or disease rates exceed a local average value [4]), that in hypoendemic and mesoendemic malaria regions are often characterised by a combination of stable hotspots (persisting across time), unstable hotspots (seasonally occurring but not consistent geographically) and low transmission clusters (persistent across time but not reaching hotspot threshold levels) within different spatial scales. Therefore, defining and delineating hotspots according to available vector and malaria data is difficult and this is compounded by a lack of information on mosquito and people movement and the interaction between vectors and hosts [5], and consequently requires fast and flexible malaria surveillance systems with digital tools for optimising sampling locations, collection and analysis of key malaria data [3,5].

For these reasons, understanding and quantifying mosquito movement is of key importance for identifying hotspot networks, forecasting fine-scale malaria transmission and for geographically targeted interventions, especially in countries where there is focal and heterogeneous transmission [57]. Super-spreaders are characterised by temporal and spatial scales intrinsically associated to the stability and size of hotspots areas. Hence, super-spreaders need to be considered in any rational design of vector control programmes. There is not a conventional definition of super-spreaders, but it is generally accepted that super-spreaders are those epidemiological units (e.g. individuals, vectors, households) that account for a high proportion of the transmission, sometimes based on the 80–20 “Pareto” rule [6]. In this study a super-spreader is a house infecting two or more other houses. This means that super-spreading, as defined in our study is dependent of location and house characteristics (and the characteristics of people living in the house). It is dependent on location because the spreading is influenced by the environment where mosquito moves. When an area contains more than one super-spreader for a certain length of time then the area is defined as hotspot.

In this work we aim to identify stable and unstable hotspots and malaria super-spreader houses by explicitly modelling infected mosquito movements from house to house. In densely populated areas the distance adult anophelines travel in search of a blood-meal are expected to be relatively short, but in sparsely populated settings, mosquitoes are likely to disperse up to several kilometres, depending upon host availability and landscape structure [8]. Household location, occupancy, demography and characteristics (e.g. building materials and entry points) influence indoor vector densities, and consequently mosquito directional movement and dispersal, the distribution of human-biting risk and malaria transmission within communities [9,10]. These factors can impact the effectiveness of interventions over a range of geographical scales.

Mosquitoes, as with many other insect vectors, use wind-assisted movement to disperse or migrate with major implications for pathogen spread and gene flow [11,12]. Over relatively short distances (hundreds of metres) mosquitoes use the wind to seek and guide themselves towards odour plumes emitted from hosts although there is little consensus under which prevalent flight strategy (e.g. upwind, downwind, random) and optimal wind speed they make such flights [1315].

Modelling disease transmission using vector flight behaviour and wind data are a powerful means to quantify and characterise vector dispersal, determine environmental risk factors for transmission, and predict the development of an epidemic, leading to the possibility of short, medium and large spatial scale estimations of disease spread [16,17]. There are only a limited number of models explicitly accounting for flight behaviour and flight parameters when predicting Anopheles dispersion [18,19], and none joining Anopheles dynamics or mosquito wind-aided dispersion and human malaria transmission (for a review see [20]).

In this work, we have extended the biologically-informed Spatial-temporal Wind-Outbreak Trajectory Simulation model (SWOTS) to the malaria disease system (MALSWOTS). SWOTS was originally developed for describing the wind spread of midge-borne diseases (Culicoides sp.) in livestock [21,22] based on midges’ flight capabilities and wind conditions. SWOTS estimated intrinsic and extrinsic incubation periods that corresponded with those found by laboratory analyses [22], confirming the validity of SWOTS results. Compared to the original SWOTS model, in MALSWOTS the mosquito movements between locations: (i) are weighted on the survival probabilities of mosquitoes based on the environmental characteristics of departing and arriving households and (ii) inform the inferential process by accounting for uninfected households. This provides estimations of the probability that a connection between two locations takes place by integrating likelihood-based approaches with mechanistic simulations allowing for predictions and uncertainty in those predictions. SWOTS and MALSWOTS are different from other wind-based models such as HYDREMATS [23]. The latter combines life-stage population dynamics with environmental conditions in a fully mechanistic process, while MALSWOTS allows for the probabilistic treatment of source (infected mosquito departing) and sink (infected mosquito reaching an uninfected house), and inclusion of downwind and upwind mosquito movement capabilities with or without a random component.

We apply MALSWOTS to identify super-spreader houses and clusters of these super-spreaders as hotspots in the villages surrounding the Majete Wildlife Reserve, a malaria endemic area in the Chikwawa District of southern Malawi, part of the Majete Malaria Project (MMP) [24,25]. Malaria endemicity in the study area is high but heterogeneous with identifiable hotspots of transmission [26]. Transmission of the malaria parasite Plasmodium falciparum occurs year-round, with annual peaks typically following the rainy season [27]. From 2015 to 2018 the prevalence of P. falciparum infection in children aged 6–59 mo. during repeated cross-sectional surveys varied within and between years, from a low of 5% in December 2016 to a high of 60% in June 2015 [28]. The Chikwawa District Health Office implements malaria interventions in the district for the National Malaria Control Programme (NMCP). This includes provision of ITNs to pregnant women and children under 5 years old, mass distribution campaigns of ITNs, intermittent preventative therapy for pregnant women, and malaria case diagnosis and treatment with artemisinin-based combination therapy. The only mass distribution of ITNs during the study period was in April 2016. PermaNet 2.0 (Vestergaard Frandsen, Lausanne, Switzerland), Olyset Net (Sumitomo Chemical Company, Tokyo, Japan), and Royal Sentry (Disease Control Technologies, USA) were distributed in the district.

We provide an overview of the MALSWOTS algorithm, its application to the MMP data, present various outputs and indexes returned by MALSWOTS such as identification of super-spreaders, stable and unstable hotspots, velocity of parasite transmission, spatial and temporal scales of parasite transmission. In addition, we validate the model with an out-of-sample procedure and describe how MALSWOTS outputs can support and contribute to spatially targeted malaria intervention strategies.

Methods

Ethics statement

Written informed consent was sought from individual participants (and their parents or guardians when appropriate) during household surveys. The College of Medicine Research and Ethics Committee in Malawi approved this study (Proposal Number P.05/15/1731).

Study area and period

Data were collected as part of the MMP in Chikwawa district, southern Malawi. The MMP assessed multidisciplinary and community-based approaches to malaria control from 2014 to 2019 in communities living within 10 km of Majete Wildlife Reserve [25,26]. MMP studies primarily covered three separate regions, referred to as focal areas A, B and C, with a total of 25,000 inhabitants living in 6,600 houses across 65 villages [26] (Fig 1). The MMP included a cluster-randomized controlled trial from May 2016 to May 2018. Of the 65 villages, 12 were excluded from the treatment arm allocation to reduce the risk of contamination between different treatment arms. The remaining villages were assigned to one of four groups: a control arm; house improvement (HI); larval source management (LSM); and HI+LSM. NMCP interventions and community engagement were used in all arms.

thumbnail
Fig 1. Majete Malaria Project focal areas.

Map was made using ArcGIS Pro 2.7.0 (https://www.esri.com/en-us/arcgis/products/arcgis-pro/resources). Source settlements data: High Resolution Settlement Layer—Facebook Connectivity Lab and Center for International Earth Science Information Network—CIESIN—Columbia University.

https://doi.org/10.1371/journal.ppat.1010622.g001

Chikwawa sits within the lower Shire Valley which, compared with the rest of the country, has its own unique climate characterised by hot/dry periods (September to December) and comparatively low, sporadic rainfall. The higher temperatures and presence of the Shire river promotes Anopheles mosquito proliferation. The dominant vectors in the region are Anopheles funestus and An. arabiensis with a small proportion of An. gambiae s.s. [24]. Mosquito population size is dependent on seasonal rainfall [29].

Most of the households practice subsistence farming with maize, millet and beans as staple crops [30]. Many residents also keep domestic animals, with cattle marketing forming an important economic activity in the district.

To measure malaria parasite prevalence, transmission and vector densities, a series of repeated cross-sectional surveys were conducted from 2015 to 2018. The period covered by the current analysis is 1st July 2016 to 12th May 2018 (680 days), coinciding with the collection of wind data as described below (Table 1).

thumbnail
Table 1. Environmental and epidemiological characteristics of focal areas for the period under study (1st July 2016 to 12th May 2018).

https://doi.org/10.1371/journal.ppat.1010622.t001

Climate and wind data

Climate data were obtained from a HOBO weather station (Onset Computer Corporation, Massachusetts, USA) at each focal area with geographic coordinates (latitude and longitude in WGS84 coordinate system): -15.849767 and 34.505821 (Focal area A), -15.90287 and 34.74525 (Focal area B); and -15.97293 and 34.465394 (Focal area C). The weather stations recorded total hourly rainfall in mm; hourly average temperature in degrees Celsius (°C); hourly average relative humidity as a percentage; hourly average soil water content in m3/m3; hourly average wind speed in m/s; maximum 3-second gust speed at hourly intervals in m/s; and hourly average wind direction in degrees (Table 1).

Epidemiological data

Epidemiological data were collected using repeated cross-sectional surveys adapted from the malaria indicator survey (MIS) [26]. Every two months a sample of 270 households (90 households per focal area) was selected as described previously [24] from a demographic database covering the study area, and surveys were conducted in these households over a 6- to 8-week period in each round with short breaks between rounds (5 to 15 days). Surveys were conducted in parallel in each focal area, and 11 rounds were performed between 1st July 2016 to 12th May 2018. During each visit to a household, all children aged 6–59 months and women aged 15–49 years present during the survey were tested for the presence of P. falciparum antigens using a Rapid Diagnostic Test (RDT) (SD BIOLINE Malaria Ag P.f. HRP-II, Standard Diagnostics, Yongin-si, Republic of Korea). In this study, a household was defined as “a social group made up of people eating from the same pot”, whereby the members of a household could occupy one or multiple houses (i.e. buildings with distinct structural features and geographic locations). For households with multiple houses, eligible children and women from all houses were invited to participate in the study. The geographic unit of infection is therefore the house and not the household, which enables us to capture fine scale vector-biological, geographical and ecological conditions related to mosquito movement. For houses presenting more than one infection during the surveyed period, we assumed that once the first infection was recorded, any subsequent infections occurring one or more months later were independent from the initial one (i.e. a new infectious mosquito had reached the house). This is also coherent with the fact that participants presenting a positive RDT were treated according to Malawi national treatment guidelines.

During the study period, 502 P. falciparum infections were confirmed out of 3,271 tests. The number of malaria infections, infected houses (with at least one malaria infection), uninfected houses and houses with reoccurring infections in each focal area are presented in Table 1. Note that we are defining infected or uninfected houses based only on children from 6 to 59 months and women from 15 to 49 years old. In this study, a large proportion of RDT positive participants were asymptomatic at the time of the RDT (and did not report symptoms in the last 48 hours). Since asymptomatically infected individuals do not actively seek antimalarial treatment, their infections may last longer than symptomatic episodes and this may contribute to the further spread of malaria (Drakeley, Gonçalves et al. 2018).

Entomological data

For each round of data collection (i.e. every two months), 72% of the households selected for epidemiological data collection (i.e. 65 of 90 households in each focal area) were randomly selected for mosquito sampling. Maximum operationally feasible sample sizes for mosquito sampling (n = 65 households per focal area) and epidemiological data collection (n = 90 households per focal area) were determined during baseline rounds in 2015. A total of 4302 adult mosquito collections were carried out (1520 in Focal area A, 1407 in Focal area B and 1375 in Focal area C) averaging 65 trapping nights per month per focal area. Adult mosquitoes were sampled using the Suna trap (Biogents AG, Regensburg, Germany), which uses a chemical bait and carbon dioxide to attract mosquitoes at night [31]. Previous work in this study area has shown that the Suna trap can be considered a good proxy for host seeking mosquitoes [32]. At each house, the trap was set one night indoors and one night outdoors consecutively, with the order of indoor/outdoor determined randomly. Mosquitoes were collected from the traps each morning. All collected mosquitoes were preserved using a desiccant and identified to species using standard morphological and molecular techniques [3335]. qPCR was used to assess the presence of P. falciparum sporozoites in the heads and thoraces of female Anopheles mosquitoes [36]. Full description of the surveillance methods are described elsewhere [26].

Overall, the number of Anopheles malaria vectors collected was low (S1 Table). Only one Anopheles, which was not positive for P. falciparum, was found in Focal area A. On average, in Focal area B and C one female Anopheles was collected every 10 and 14 trapping nights, respectively, with one P. falciparum positive Anopheles collected per 200 trapping nights in Focal area C and per 350 trapping nights in Focal area B. The survival probabilities were based on the number of female Anopheles per trap per day.

Mosquito flight parameters

To control for mosquito flight characteristics and to quantify the influence of wind and mosquito flight on malaria transmission in the MMP focal areas, mosquito flight parameters were obtained from the literature. The parameters considered in this work are provided in the form of ranges (r), thresholds (t) and fixed values (f). They are: number of days prior to a new human infection (DPI), that has been set between day 21 and day 1 before the date of the house survey (r); number of days of flight (DoF), 1 to 18 days (r); numbers of hours of continuous flight, one hour (f); maximum distance of flight in one day, 1 km (t); distance around the house, 15 m (t); upwind and downwind flight speed, 0.1 m/s (f); distance at which the mosquito can detect the host, 50 m (t); wind limits for random, upwind and downwind flight, 11 m/s (t); and tolerance around wind direction, 45 degrees (t). DPI accounts for the full intrinsic incubation period, uncompleted extrinsic incubation period (if not completed during the days of flight) and uncertainty in RDT test (S1 Fig). Full details on the chosen values are provided in the S1 File.

MALSWOTS considers three mosquito flight movements and their combinations: downwind when the mosquito flight direction is coincident to that of the carrying wind (passive flight); upwind when the mosquito flight against the wind flow and usually in response to host odours (active flight); and random, i.e. irrespective of the current wind speed and direction, or the presence of wind turbulence (active flight).

MALSWOTS

We define a wind trajectory as the imaginary polyline a particle will follow when carried by the winds. When a trajectory describes the geographic path of an infected mosquito from an infected house to an uninfected house, and subsequently causing infection, the trajectory is called ‘connection’. We have simplified the jargon, defining an ‘infected house’ as a house with at least one child (6 to 59 months old) or a woman (15 to 49 years old) detected positive with a RDT test and, vice versa, an ‘uninfected house’ with a negative RDT test.

The Spatio-temporal Wind-Outbreak Trajectories Simulation (SWOTS) described in [21] has been adapted to malaria mosquitoes and developed further to include constraints such as mosquito survival probabilities and uninfected locations. As in SWOTS, MALSWOTS assumes that each infected house contains mosquitoes that are (or will develop to) infected while moving to other houses. The number of mosquitoes moving from house to house is independent from the mosquito catches (otherwise it is not possible to model areas without or scarce entomological information, such as Focal area A). MALSWOTS assumes that at least one mosquito departs from each infected house each day. Therefore, the entomological data is used only for calculating the survival probabilities by time (not space since not enough data has been collected) for each area under study (depending on local environmental characteristics). Mosquito survival probabilities are obtained from a Poisson log-linear model based on exposure time and climate. The survival probability is then measured as expected survival rates per exposure time and climate (full model specification described in S1 File).

In MALSWOTS we abandoned the ‘best fit’ approach (for which exact parameters are obtained) and added an averaging component that allows us to consider house infections as results of different connections at different spatial and temporal scales. For this study we averaged the top 5% of the models ranked by largest correlation. As in SWOTS, MALSWOTS includes five types of mosquitoes’ movement (downwind, upwind, random, downwind and random, and upwind and random). These are then used in conjunction with probabilities of survival to obtain the probability of (i) a house being infected from all surrounding infectious houses (since one house is likely to be infected by mosquitoes arriving from multiple houses with different movement types), (ii) a house being infected by a specific house, and (iii) a house being infected by a type of mosquito movement (S2 Fig).

In brief, MALSWOTS algorithm has four steps. Firstly, identification of parameter values (such as for DPI, DoF and time of the day considered) that maximise the correlations between mosquito trajectories and house infections trajectories at each house for each plausible temporal scale. Once these parameters are found, the top 5% of the models, ranked by the largest correlation, are averaged (second step). In the third step, MALSWOTS simulates the outbreak by re-calculating the mosquito trajectories according to the averaged parameters and a stochastic wind field. Each infected house in each interval is associated with many different possible infected mosquito trajectories going from it. Some of these trajectories will end up at uninfected houses and eventually infect them. In the last step, MALSWOTS examines the associations between suspected movement of infected mosquitoes from infected houses to susceptible houses and identifies the most likely (highest probability) source and route of infection to each susceptible house, providing the probabilities described above.

The algorithm is written in R-language [37]. The full detailed algorithmic steps are described in S1 File. All calculations are performed in a grid for each focal area, with all grids having a resolution of 1/3 of a kilometre per cell. Specifically, in a grid of 31 by 25 nodes (775 grid nodes, 86km2) for Focal area A; 31 by 61 (1891 grid nodes, 210km2) for Focal area B; and 34 by 34 (1156 grid nodes, 128km2) for Focal area C. Temperature, precipitation, wind speeds and directions are assumed identical at all the grid nodes in each area, since only one weather station is available per focal area. Triangulation between the three weather stations to estimate wind speed and direction at all the nodes in a grid is not feasible due to the large distance between focal areas (from 18km between Focal area A and C, 28km between Focal area B and C, and 29km between Focal area A and B).

Limitations described in the epidemiological data section are accounted for by considering temporal periods (number of days prior to a new human infection + number of days of flight) instead of exact dates. In general, issues related to temporal uncertainty are partly alleviated by the fact that MALSWOTS explores different time and spatial scales independently from each other, which can potentially balance the time-dependent and spatial-dependent RDT uncertainties within the individual-based level in a repeated randomised cross-sectional study [38].

Houses spatial randomness

Environmentally constrained infectious disease spread is also dependent on the spatial configuration of the area and its units (e.g. houses) [9]. To estimate potential clustering in house locations (independently from their infection status), complete spatial randomness was tested by employing a K-function test which is independent from the shape of the study region and takes into account the density of the events. The latter allows for comparison among groups regardless the event prevalence [39]. In this study the K-function is transformed into an L-function to simplify interpretability. In fact, for the L-function a value of 0 means the process is completely random, while increasing positive values describe increasing spatial clustering [40]. The L-function significance is evaluated by computing its envelopes based on 999 point pattern simulations.

Hotspot and super-spreader definitions

Our definition of a super-spreader is ‘a house that is the source of infection for two or more other houses within the study period (07/2016 to 05/2018). Hotspots are usually defined statistically [28,41] although no strict definition exists. Here we simply considered a hotspot an arbitrary area of 1 km2 (as the geographic size for hotspots described in [42]) containing at least two super-spreaders. We defined ‘stable hotspot’ if it occurred at least once a year in the same 1 km2 over the three calendar years, otherwise it is defined as an ‘unstable hotspot’. Low transmission areas are those that contain unclustered spreaders and super-spreaders.

Investigating risk factors of super-spreader houses

The number of ‘infected houses’ originating from each house was modelled using a Poisson generalised linear mixed model. This model was not corrected for spatial autocorrelation because the outcome number of infected houses originating from each house was affected by a strong nugget effect (weak functional structure in the semivariance, see S3 Fig) [43]. For each house the following predictors and/or strata were considered: the total number of people in the household; the presence of any ITNs in the household; the mean wealth score for the household based on assets owned; the number of ITNs per person; treatment arm in the cluster-randomized controlled trial; the number of women, children under 15 years old, and children under 5 years old in the household; the number of rooms used for sleeping; the number of cattle, goats, sheep, chickens, and pigs owned by the household; the number of ITNs in the household; the material of the roof, wall, and floor; whether the eaves were open or not; whether the household owned any cattle, goats, sheep, chickens, or pigs; the number of people who slept under an ITN the previous night; whether the individual tested by RDT slept under an ITN the previous night; and the proportion of people sleeping under an ITN the previous night; and the trial treatment arm, as described in [24]. For villages with the same interventions as the control but which were excluded from the trial analysis because of proximity to other villages, treatment was defined as “excluded”. Finally, the unconstrained population counts (because of the low accuracy of building satellite information for the Majete rural area) for the year 2016 at a resolution of 3arc (approximately 100m at the equator) from WorldPop [44], has been included as covariate.

The best fitting model was obtained from a model selection analysis by testing all the possible combinations of predictors with different stratification. The AICc criteria is recommended for small sample sizes and when the number of parameters among the compared models are different. The model with the lowest AICc value is usually considered to be the best data fitting model [45]. Due to computational issues and the small sample size (for example the number of infected houses in Focal area A, n = 86), we excluded from the model selection the interaction terms between predictors.

Variable selection was performed in R-cran software (4.0.4) with packages MuMIn (model selection and averaging) and stepcAIC.

MALSWOTS validation

The algorithm was evaluated by cross-validation on 10% of houses for each focal area. In particular, we ran MALSWOTS for 90% of houses and calculated the probabilities for the remaining 10% to be infected (equivalent to the probability (i) described in the MALSWOTS section above) during the estimated DPI+DoF period prior to the RDT positive test, the period prior to DPI+DoF and the period after DPI+DoF (S1 Fig). Probability calculations are described in the section above. We define a ‘correct prediction’ if the probability of a house which gets infected during the DPI+DoF period is larger than the probability before or after the DPI+DoF period. The before period is defined as between 2*(DPI+DoF) and (DPI+DoF) days prior to a RDT positive test and the after period is defined as between (DPI+DoF) and 2*(DPI+DoF) days after a RDT positive test. For the uninfected houses, since these are always uninfected during the study period, we have not calculated probabilities for the period before, around or after a RDT positive test. Instead, we have calculated the probability to be infected during the whole period spanning from 2*(DPI+DoF) days before and after a RDT positive test related to the closest infected house. From this probability we have produced the number of false positive when considering probabilities of connections above 0.025 and 0.25.

Results

In the surveyed area (Fig 1), 26% of surveyed houses were infected at some point during the two-year study period, and only 3.6% had recurring cases of malaria. For the latter only 1 house in each area experienced recurring cases within 30 to 60 days after the previous infection, while the rest of the recurring cases were after 121 days from previous infection (Table 1).

The MALSWOTS model found that in all focal areas, winds associated with the 6pm to 10pm time window are highly associated with the geographic spread of malaria parasites between houses surrounding the Majete Wildlife Reserve. In fact, winds within this timeframe accounted for between 39% to 57% of the top 5% of the strongest correlations between mosquito movements and malaria spread from house to house. It is worth noting that we explored 9,000 scenarios, each one based on a different combination of temporal and flight movement assumptions. For the rest of the time periods considered in this analysis, the second most important timeframe was 11pm to 3am in Focal areas A and C and 4am to 8am in Focal area B. Accounting for the time period as a whole (6pm-8am) did not generate the majority of highest correlations (Table 2). The modal number of days for a new infection to arise (that combines DPI and DoF) was in the range of 17 to 29 days (during which the mosquito took an infected blood meal, the mosquito flew to a different house, the extrinsic incubation period was completed, the mosquito took a blood meal from a new person, and the intrinsic incubation period was completed).

thumbnail
Table 2. MALSWOTS estimated optimal parameters and transmission summaries obtained from the top 5% of the models ranked by largest correlation.

^ confidence intervals. All parameters are calculated over the whole study period.

https://doi.org/10.1371/journal.ppat.1010622.t002

Wind-aided mosquito movements explained 91%, 85% and 71% of the possible house-to-house malaria transmissions in Focal area A, B and C respectively, with a probability above 0 (for example if this probability was 0.01, out of 100 infected mosquito wind-based trajectories departing from an infected house, at least 1 would reach an uninfected house and would subsequently become infected). Considering a more conservative probability of 0.25 (i.e. out of 100 infected mosquito wind-based trajectories departing from an infected house, at least 25 reach the uninfected house which subsequently becomes infected), these proportions reduced to 56%, 76% and 37%. A value of 25% of connections roughly equates to thousands of successful connections, and therefore the lower probabilities are still epidemiologically important.

In all focal areas, a mixture of wind-aided flight (downwind and downwind + random movements, upwind and upwind + random movements) were implicated in the majority of house-to-house transmissions: 96% in Focal area A, 58% in Focal area B, and 86% in Focal area C. Purely random movement (independent from wind) accounted for the majority of the connections in Focal area B only (around 42% of connections, Table 2). Importantly, active flight (upwind and upwind + random) is responsible for more than a third of connections in Focal area A and B and the majority of connections in Focal area C, supporting the epidemiological importance of upwind flight for mosquito dispersal and subsequent parasite transmission [46,47].

More than a third of infected houses infected at least one other house and more than a third infected two or more houses (Fig 2); the latter we define as ‘super-spreaders’. The maximum observed number of infected houses resulting from a single house was 13 in Focal area C. The proportion of super-spreaders is between 5 and 13% of all houses (Table 2). The locations of the super-spreaders are shown in Fig 3. Often, these super-spreaders tend to be localised, forming hotspots (Fig 3). 92% of super-spreaders in Focal area A were within a 1 km2 of an additional super-spreader (a condition we used for hotspot definition) while in the other two focal areas, all super-spreader houses are within 1 km of each other (Fig 3). Additionally, more than half of all super-spreader houses (57% to 62%) were stable; occurring at least once a year in the same 1 km2 (in each cluster) over the three calendar years. The low transmission areas are relatively small in Focal area C (Fig 3). On average each super-spreader infected 3 other houses and in comparison to non-super-spreaders they tend to have connections which are longer in time but not in distance (Fig 4). The Poisson Generalised Linear Model analysis on the super-spreaders showed that houses are likely to infect additional houses if they contain a larger number of positive rapid diagnostic test (RDT) results, a larger number of children under five, do not belong to the trial (‘treatment–excluded’) and if goats were present (Table 3). Overall, the model explained 67% of the total variance. The primary material of the wall structure is associated with a decreased risk of infecting other houses (bricks and mud compared to other materials including iron walls), but this predictor could only be used as a random effect in the model because it was not significant as fixed effect. Population counts were not significantly associated to super-spreaders (OR 1.06, 95% confidence interval: 0.98 and 1.15), and the model explained 55% of the total variance when this predictor was included (therefore this predictor was removed from the model described in Table 3).

thumbnail
Fig 2. Spreaders for each focal area obtained by MALSWOTS for the whole study period.

Each plot shows the number of new houses (from 1 to 15 houses in bins of 1) predicted to have been infected by x original houses. For example, 14 houses are likely to have infected 2 other houses each in Focal area A.

https://doi.org/10.1371/journal.ppat.1010622.g002

thumbnail
Fig 3. Location of super-spreader houses and hotspots (stable and unstable) within each focal area.

Each colour denotes the number of ‘infected houses’ arising from the spreader. A black cross denotes the location of an uninfected house. Allocation of the hotspots has been performed manually capturing the largest number of super-spreaders within each hotspot square.

https://doi.org/10.1371/journal.ppat.1010622.g003

thumbnail
Fig 4. MALSWOTS time and distance analyses between house connections within each focal area.

The ‘average distance connections’ refers to successful connections only. The ‘finding houses’ measure (central histogram) is the ratio between the length of the connection between two houses in which a mosquito has flown and the straight line (Euclidean) distance between the same houses (which can be shorter than the connection distance due to the zig-zag movement of the mosquito). The ‘time connections’ shows the temporal distance between infecting and infected houses in terms of RDT positive test. This time can be shorter than the days of flight (Table 2) due to the possibility that the mosquito left the infecting house prior malaria detection in the house.

https://doi.org/10.1371/journal.ppat.1010622.g004

thumbnail
Table 3. Odds ratios (OR) associated with selected variables from generalised linear mixed model for risk of being a super-spreader house.

*If yes, the village has the same interventions as the control village but it is excluded from the trial analysis because of proximity to other villages.

https://doi.org/10.1371/journal.ppat.1010622.t003

Successful connections are composed of relatively short daily movements that span from around 200m in Focal area B to between 400-500m in Focal area A and C. The median successful distance flown by an infected mosquito to reach an uninfected house within the optimal timespan of days prior to infection + days of flight (DPI+DoF, Table 1 for their values for each Focal area) was around 340m in Focal area B to 5 km in Focal area A (Table 2). These distances are consistently longer than the median Euclidian distance of infection connections (the straight-line distance between infecting and infected house) (Table 2). The common feature among the focal areas is that the majority of the connections are all under 5km of length (Table 2). The geometry of Focal area A differs from the other two focal areas since houses are spread much more homogeneously (see geometry parameters in Table 1 and locations in Fig 1) leading to longer connections (Fig 4). We interpret these longer distances as a result of the longer paths flown by mosquitoes to find houses given the low clustering (Table 1) under larger wind speeds recorded in the focal area (see Focal area A wind rose diagram on Fig 5, right top panel). There is an agreement between the directions of the connections (S4 Fig) and the wind directions (Fig 5 left panels) in Focal area B and C, less for Focal area A where the upwind and downwind mixed movements have a strong random component (Table 2). In Focal area A, 90% of connections are up to 20 times longer than the corresponding straight distances between the infecting and infected houses. This value is greater than that in Focal area B (90% are up to 3 times) and Focal area C (90% are up to 6 times) (Table 2).

thumbnail
Fig 5. Successful mosquito connections (left panels) are shown as arrows departing from an infecting house (red circle) and arriving to an uninfected (and subsequently infected) house (blue circle).

Unexplained (black circle) are houses that are not analysed because they are outside the spatial and temporal scales considered by the best models. The wind rose panels on the right show the frequency of wind directions and angular sector. The line colours identify the average speed of the winds. Calm refers to winds below 0.5 m/s.

https://doi.org/10.1371/journal.ppat.1010622.g005

The relative risk of becoming infected during the study period was not large, even in locations occupied by super-spreaders (Fig 6, S1 Video) since this risk is adjusted for the number of days ‘spreading infected mosquitoes’. For example, a low number of super-spreaders that consistently infect other houses over time will generate a larger relative risk than a large number of super-spreaders that infect other houses for just limited periods.

thumbnail
Fig 6. Relative risk of getting infected per day (left) and risk exceedance (right) maps for each focal area.

https://doi.org/10.1371/journal.ppat.1010622.g006

Finally, the cross-validation analysis showed that MALSWOTS (Table 4) correctly predicted that the majority of those houses used for validation (10% of the original data) were infected during the DPI+DoF time before and after an RDT positive test. False positives for uninfected houses were around a third in focal areas A and B but half for Focal area C. However, these values reduced to 0 when considering a more conservative probability of success of infection (Table 5). More importantly, all the super-spreaders (10% of validation houses) were correctly predicted, enhancing the effectiveness of the MALSWOTS algorithm as an epidemiological forecasting tool.

Discussion

Insect dispersal is recognised as a key determinant in the spatial distribution of vector borne disease but empirical estimates of its impact on transmission dynamics and vector control are severely lacking [5,17]. Here, we developed MALSWOTS to incorporate our current understanding of anopheline dispersal ecology to provide information on malaria transmission networks relevant to disease control.

Based on data from the MMP study, MALSWOTS identified the optimal parameter values that describe the correlation between wind data and the spread of malaria infection (from all previously infected to uninfected houses over plausible timescales) under a set of environmental conditions influencing the survival of the mosquitoes. Cross-validation demonstrated that MALSWOTS can forecast all super-spreaders and the majority of the validation houses (Table 4) once optimal parameters were identified for each area and when wind estimates for the next 16–17 days were available (Table 2). In addition, false positive uninfected houses were absent when using probability of connections above 0.25 (equivalent to 1 successful connection for every 4 potential connections).

We also explored the effect of removing survival probabilities from MALSWOTS. In terms of parameters, the major difference is the shorter DPI for Focal area A (S2 Table) than in the model with survival probabilities. Validation results show that the proportion of corrected infected houses and the probability of infection are worse in Focal area A and C but unchanged in Focal area B (S3 Table) when mosquito survival probabilities are not included. This shows that the inclusion of mosquito survival probabilities increases the accuracy of the predictions. However, this seems not the case for Focal area B, probably due to the shortest time of ‘Days to new infection’ among the three areas (Table 2).

One of the most important outputs from MALSWOTS is the identification of super-spreader houses from pre-existing infection data and their hotspot formation. These super-spreaders are the cause of 59% and 62% of all infections for focal areas A and C respectively, and 80% of infections for Focal area B; the latter satisfying the ‘Pareto rule’ [6]. The difference in super-spreader house locations across the focal areas drives a marked and seasonal spatial heterogeneity in the relative risk of malaria transmission [48] (S1, S2 and S3 Videos). Super-spreading can happen if some hosts are more infectious than others, if some hosts are more exposed to vectors (due to differences in behaviour, e.g. staying outside late, house structure, access to interventions), or because a host is preferentially bitten by more vectors [6,49]. Several biological, demographic and economic factors will determine whether members of a household are considered super-spreaders. From the MMP data available, we identified the number of positive cases in a single household, the number of children living in the house, location of the village relative to neighbouring villages (treatment excluded) and the presence of livestock (potentially associated to Anopheles abundance [50]) as significant risk factors for super-spreader houses, each of which, may have influenced the vectorial capacity of the departing and arriving infected mosquitoes [5153]. By contrast, population counts were not significantly associated to super-spreaders, although as shown in different studies, this is linked to malaria risk [54]. The increased risk for super-spreaders in the villages defined as ‘treatment excluded’ is not clear but may be due to the fact that they are centrally located within each focal area and therefore more subject to cumulative process effects (more mosquitoes reaching the centre instead of the periphery).

The importance of hotspots as intervention targets is based on three questions [42]: ‘are hotspots stable over time?, ‘do hotspots seed transmission to the rest of the focus of malaria transmission? and ‘at what geographical resolution can hotspots be detected?. During our period of study (2016–2018), we found that super-spreaders tend to cluster together within 1 km2 (Fig 3). This aggregation may be a conservative estimate since we found connections of up to 3 km in a day. In fact, dispersion capabilities over 1 km are the reasons why large clusters of infected houses will look like a geographic composite of hotspots (as in focal area A and C) instead of a single one.

Approximately 60% of super-spreaders were part of stable hotspots and this may support the effective implementation of more targeted control measures. In terms of elimination strategies, however, interventions must consider the full network of connections between stable, unstable hotspots and low transmission regions. In practice, however, targeting hotspots is currently difficult in most malaria transmission settings. In the few empirical studies conducted so far, there is no conclusive evidence on the real epidemiological gains obtained by targeting hotspots with, for example, either combined vector control and mass drug administration [41] or reactive case detection [5,41,52,55]. We found stable hotspots to be part of a complex network that includes unstable hotspots, and clustered and un-clustered infections (not belonging to hotspots); a system common in other hypo- and mesoendemic transmission settings [5,6]. From simulation studies, targeting malaria hotspots in systems containing connected hotspot and not-hotspot areas may not be sufficient for sustained long-term transmission reduction, and reinforcement of interventions in adjacent non-hotspots areas is potentially required [7]. Here, we describe the presence of a multitude of connected hotspots and non-hotspots in each focal area which may require larger and un-targeted interventions instead of more localised control.

The outputs from MALSWOTS provide parameters that are essential for any malaria transmission model accounting for vector movements and behaviour. Anopheles activity peaks at dusk and decreases around dawn/early morning in Malawi [56,57]. Historically An. gambiae, An. arabiensis and An. funestus were typically considered late-night biters, but evidence over the last 20 years has shown that this varies due to regional ecological differences and the influence of ITNs [5860]. The MALSWOTS results from the MMP data show that 6pm to 10pm is the time of the day where winds are mostly correlated with temporal connection of infections, though wind dynamics at other times of the day (between 6pm and 8am) were also important. It is important to point out that 6pm to 10pm is similarly the time when communities commonly engage in outdoor evening activities and this may facilitate transmission in the proximity of the house [61,62].

The average speed of the spread of malaria parasite ranged from around 200 to 500 m/day among focal areas. This value combines prior information (e.g. flight speed and maximum distance) with wind conditions and epidemiological data. It is therefore encouraging that this spread is similar to the Anopheles dispersion distances reported in mark-release-recapture studies [8,63,64], and well below the limits set in the MALSWOTS algorithm for maximum distance flown in a single day if a suitable house is not found (1 km). The length of successful connections averaged a few kilometres which, again, fits with the overall dispersal capacity measured during mark-release-recapture experiments [6567]. Downwind as well as upwind movements combined with a random component were responsible for most of the connections as shown in other studies [47]. Finally, the average time necessary to establish a successful connection between two houses (17 to 29 days) is sufficient for the completion of the extrinsic (in the mosquito) and intrinsic incubation period (in the human host) [68]. A great proportion of this time is composed by the mosquito movements.

The spatial configuration of villages surrounding Majete had a notable impact on connections between houses. Focal area C produced fewer connections despite similar output parameters to focal areas A and B. Higher clustering was observed in Focal area C promoting a larger significant component of upwind + random movement and this was effective for clustered houses over short distances (see the average distance between infected houses and the L-test for clustering in Table 1). While clustering promotes quick infections, Focal area C spatial configuration is characterised by a series of villages along the main road, requiring East-South-East winds or West-North-West winds for successful connections, and this itself may have reduced the number of connected houses. This highlights the importance of village configuration for the spread of infections [69].

The main limitation of our study is the population sampled for diagnosis with RDT, which can potentially bias the results in the case of the existence of age-related geographic patterns. In fact, the unit of infection was the house, and this was deemed ‘infected’ only when a child aged 6–59 months or a woman aged 15–49 years was positive for a malaria RDT. Anyone outside of these demographic categories was excluded from RDT testing. Secondly, missing infections due to untested residents of the house, low sensitivity of the RDT (limit of detection of parasite load is 50–200 parasites/μl [70]) and uncertainty around the exact moment of P. falciparum infection [71,72] could cause an increased spatial sparsity of infected houses (in a demographically homogenous scenario) and therefore increase the number of days and distance between connections, although cross-validation has shown accurate forecast of infected houses. Missing RDT tests for house members not included in the target population could have caused the underestimation of malaria persistence over time within a house. MALSWOTS explored persistence up to a month after previous infection at each infected house and identified best models with events happening at temporal scales lower than a month (Table 1). It is therefore unlikely that the persistence of malaria for 1 or 2 months in the same house may have played a central role in the malaria spread, while longer recurrences (after 4 months from previous infections) are likely to be independent from the previous infection.

We also recognise the limitation in the use of only one weather station for focal area and the limited number of variables used for modelling the survival of the mosquitoes. These may have affected the capacity of the model to explain connections with very large probabilities. In addition, climate (e.g. daily temperature fluctuation) does not only influence mosquito survival but also parasite growth and development and mosquito behaviour [73,74], hence further development of MALSWOTS may include additional probabilistic components (in addition to the existing mosquito survival probability and mosquito movement probability) related to these factors. Finally, we have not investigated potential other causes of the strong spatial epidemiological and entomological heterogeneity in the three focal areas which may depend on demographics, housing characteristics, immunity, malaria interventions, mosquito behaviour (related to ecological conditions [23]) and human movements [7577]. In particular by not considering immunological factors it is not possible to predict the stability of hotspots over time, since immunity may cause a stable hotspot to become unstable or to terminate. MALSWOTS analyses thousands of potential connections within different time and spatial scales independently from each other. This can balance time-dependent or spatially-dependent uncertainties originating from the factors not considered in this analysis and control for biases and confounding [78]. The study sites are treated as islands but migration of infected mosquitoes or people from one area to another, including areas not considered in the study, cannot be excluded. In the current model, the fully random movement component (independent from wind) may have captured some of the human movements themselves (Table 2).

MALSWOTS provided a quantitative assessment of the influence of wind on malaria transmission in a rural African setting. The model estimated the spatial-temporal pattern of malaria transmission according to the average distance and direction between infected houses as well as the percentage of Anopheles spp. movements due to wind-type. MALSWOTS quantified heterogeneity at a small scale including the identification of stable hotspots (at the house level) which were supported by cross-validation. At the same time, we identified a more complex network within the study area, composed of unstable hotspots and non-hotspots areas in addition to dispersal ranges varying from small to large.

MALSWOTS is a parsimonious and efficient model to forecast local transmission by mosquito movement when forecasted environmental (wind and climate) data is available. In this scenario we could simulate transmission patters assuming infections starting from a specific house, or if known which houses are infectious MALSWOTS can identify the most likely infected houses by mosquito movements. In the same way, the relative risk (as shown in Fig 6) for the entire area can be forecasted. However, the parameterisation for one area cannot be used for other areas given the intrinsic differences in urban and environmental compositions and structures in each area as shown in this study for the three focal areas analysed. Therefore, each area will require a separate parameters’ inference (as described in stages 1 and 2 in the MALSWOTS algorithm flow in S1 File) before running the simulations (stage 3 and 4). To improve accuracy and predictions, MALSWOTS will be developed further in order to account for field experiment data (e.g. mark-release-recapture experiments), immunological factors, interventions, insecticide resistance and parasite genomics that will allow the identification of routes and trajectories of malaria transmission across various ecological settings.

Finally, the outputs from MALSWOTS will complement ongoing efforts to improve geospatial modelling of malaria transmission, for which information on the role of vector dispersal is largely lacking. Maintaining the effectiveness and coverage of core interventions will be key to sustaining the gains made in malaria control, but supplementary targeted vector control may become increasingly important in countries such as Malawi as malaria prevalence continues to decline and become more heterogeneous. In this scenario, accurate delineation of parasite transmission networks will become a priority for any intervention. Once parameterised, MALSWOTS can identify prevalent routes of infected mosquitoes which may impact the way we describe and delineate clusters, hotspots and not-hotspots areas. The model could also help identify any buffer zone around an outbreak area that should be targeted to ensure most of the infective mosquitoes are removed from the environment.

Supporting information

S1 File. A full explanation of the mosquito flight parameters and MALSWOTS algorithm flow.

https://doi.org/10.1371/journal.ppat.1010622.s001

(PDF)

S1 Table. Entomological collection summaries for each focal area.

Data representation on the form m(q,s) where ‘m’ is mean number of mosquitoes collected per night in one trap x 1000, ‘q’ is the standard deviation x 1000, and ‘s’ is the sum x 1000. s.l. refers to specimens that did not amplify in PCR, therefore species identification is based only on morphology. (-) and (+) indicate results of the qPCR for P. falciparum sporozoites.

https://doi.org/10.1371/journal.ppat.1010622.s002

(DOCX)

S2 Table. MALSWOTS (without mosquito survival probabilities) estimated optimal parameters and transmission summaries obtained from the top 5% of the models ranked by largest correlation.

^ confidence intervals. All parameters are calculated over the whole study period.

https://doi.org/10.1371/journal.ppat.1010622.s003

(DOCX)

S3 Table. Cross validation results for infected houses using MALSWOTS without mosquito survival probabilities.

https://doi.org/10.1371/journal.ppat.1010622.s004

(DOCX)

S1 Fig. Timeline optimisation in MALSWOTS. During the number of days prior to infection (DPI) + days of flight (DoF) timeframe, we assume the mosquito fed on an infected individual, became infectious before arriving at the next house and then transmitted the parasite to the next house.

https://doi.org/10.1371/journal.ppat.1010622.s005

(TIF)

S2 Fig. Graphical representation of the three types of probabilities estimated by MALSWOTS.

Assuming mosquito survival probabilities and grid probability (of presence of infected houses) are the same for houses A, B and C, and given the fact that the number of simulations are the same for each house, and assuming the potential connections to be 10, the probability that house Z is infected from all surrounding infectious houses is 6/10 (1 connection from A, 3 connections from B and 2 connections from C); the probability that house Z is infected by house B is 3/10; and finally the probability that house Z is infected by downwind movement is 3/10.

https://doi.org/10.1371/journal.ppat.1010622.s006

(TIF)

S3 Fig. Experimental variogram for the number of ‘infected houses’ originating from each house in the three areas.

This outcome has been modelled using a Poisson generalised linear mixed model. The Poisson generalised linear mixed model was not corrected for spatial autocorrelation since the experimental variograms show a large nugget effect, i.e. the semivariance does not increase with distance but tends to be stable or decrease (which means that locations further apart looks more similar that locations close to each other).

https://doi.org/10.1371/journal.ppat.1010622.s007

(TIF)

S4 Fig. House to house connections by mosquito flight movements for each focal area. Each graph shows the directions of the movements, their overall proportions and the proportion by covered distance.

https://doi.org/10.1371/journal.ppat.1010622.s008

(TIF)

S1 Video. Spatio-temporal relative risk videos for focal area A.

Video also available at https://doi.org/10.17635/lancaster/researchdata/524.

https://doi.org/10.1371/journal.ppat.1010622.s009

(AVI)

S2 Video. Spatio-temporal relative risk videos for focal area B.

Video also available at https://doi.org/10.17635/lancaster/researchdata/524.

https://doi.org/10.1371/journal.ppat.1010622.s010

(AVI)

S3 Video. Spatio-temporal relative risk videos for focal area C.

Video also available at https://doi.org/10.17635/lancaster/researchdata/524.

https://doi.org/10.1371/journal.ppat.1010622.s011

(AVI)

Acknowledgments

We thank Dr Monica Staniek and Dr Rod Dillon for the useful discussions at the onset of this work.

References

  1. 1. Bath D, Cook J, Govere J, Mathebula P, Morris N, Hlongwana K, et al. Effectiveness and cost-effectiveness of reactive, targeted indoor residual spraying for malaria control in low-transmission settings: a cluster-randomised, non-inferiority trial in South Africa. The Lancet. 2021;397(10276):816–27. pmid:33640068
  2. 2. Carter R, Mendis KN, Roberts D. Spatial targeting of interventions against malaria. Bulletin of the World Health Organization. 2000;78:1401–11. pmid:11196487
  3. 3. WHO. World Malaria Report World Health Organization, Programme GM; 2020.
  4. 4. Keegan LT, Dushoff J. Estimating finite-population reproductive numbers in heterogeneous populations. J Theor Biol. 2016;397:1–12. WOS:000375168300001. pmid:26891919
  5. 5. Stresman G, Bousema T, Cook J. Malaria Hotspots: Is There Epidemiological Evidence for Fine-Scale Spatial Targeting of Interventions? Trends Parasitol. 2019;35(10):822–34. Epub 2019/09/03. pmid:31474558.
  6. 6. Cooper L, Kang SY, Bisanzio D, Maxwell K, Rodriguez-Barraquer I, Greenhouse B, et al. Pareto rules for malaria super-spreaders and super-spreading. Nat Commun. 2019;10(1):3939. Epub 2019/09/04. pmid:31477710; PubMed Central PMCID: PMC6718398.
  7. 7. Tun STT, Parker DM, Aguas R, White LJ. The assembly effect: the connectedness between populations is a double-edged sword for public health interventions. Malaria Journal. 2021;20(1). WOS:000641737100002. pmid:33865392
  8. 8. Thomas CJ, Cross DE, Bogh C. Landscape Movements of Anopheles gambiae Malaria Vector Mosquitoes in Rural Gambia. Plos One. 2013;8(7). WOS:000324146200025. pmid:23874719
  9. 9. Debebe Y, Hill SR, Tekie H, Dugassa S, Hopkins RJ, Ignell R. Malaria hotspots explained from the perspective of ecological theory underlying insect foraging. Scientific Reports. 2020;10(1). WOS:000600132400053. pmid:33293574
  10. 10. Guelbeogo WM, Goncalves BP, Grignard L, Bradley J, Serme SS, Hellewell J, et al. Variation in natural exposure to anopheles mosquitoes and its effects on malaria transmission. Elife. 2018;7. Epub 2018/01/24. pmid:29357976; PubMed Central PMCID: PMC5780040.
  11. 11. Chapman JW, Reynolds DR, Wilson K. Long-range seasonal migration in insects: mechanisms, evolutionary drivers and ecological consequences. Ecology Letters. 2015;18(3):287–302. pmid:25611117
  12. 12. Huestis DL, Dao A, Diallo M, Sanogo ZL, Samake D, Yaro AS, et al. Windborne long-distance migration of malaria mosquitoes in the Sahel. Nature. 2019;574(7778):404–+. WOS:000490988300065. pmid:31578527
  13. 13. Cummins B, Cortez R, Foppa IM, Walbeck J, Hyman JM. A Spatial Model of Mosquito Host-Seeking Behavior. Plos Computational Biology. 2012;8(5). WOS:000305964600011. pmid:22615546
  14. 14. Takken W, Dekker T, Wijnholds YG. Odor-mediated flight behavior of Anopheles gambiae Giles Sensu Stricto and A-stephensi Liston in response to CO2, acetone, and 1-octen-3-ol (Diptera: Culicidae). J Insect Behav. 1997;10(3):395–407. WOS:A1997XJ85400007.
  15. 15. Cribellier A. Biomechanics of flying mosquitoes during capture and escape: University of Wageningen; 2021.
  16. 16. Eze M, Labadin J, Lim T. Mosquito Flight Model and Applications in Malaria Control. Third International Conference on Computer Engineering and Technology (Iccet 2011). 2011:59–64. WOS:000320340300010.
  17. 17. Florio J, Veru LM, Dao A, Yaro AS, Diallo M, Sanogo ZL, et al. Diversity, dynamics, direction, and magnitude of high-altitude migrating insects in the Sahel. Scientific Reports. 2020;10(1). WOS:000596296000010. pmid:33239619
  18. 18. Endo N, Eltahir EAB. Modelling and observing the role of wind in Anopheles population dynamics around a reservoir. Malaria Journal. 2018;17. WOS:000423314000001. pmid:29370803
  19. 19. Matthys B, Koudou BG, N’Goran EK, Vounatsou P, Gosoniu L, Kone M, et al. Spatial dispersion and characterisation of mosquito breeding habitats in urban vege-production areas of Abidjan, Cote d’Ivoire. Ann Trop Med Parasit. 2010;104(8):649–66. WOS:000285292100004. pmid:21144184
  20. 20. Odhiambo JN, Kalinda C, Macharia PM, Snow RW, Sartorius B. Spatial and spatio-temporal methods for mapping malaria risk: a systematic review. BMJ Glob Health. 2020;5(10). Epub 2020/10/08. pmid:33023880; PubMed Central PMCID: PMC7537142.
  21. 21. Sedda L, Brown HE, Purse BV, Burgin L, Gloster J, Rogers DJ. A new algorithm quantifies the roles of wind and midge flight activity in the bluetongue epizootic in northwest Europe. Proceedings of the Royal Society B-Biological Sciences. 2012;279(1737):2354–62. WOS:000303888500009. pmid:22319128
  22. 22. Sedda L, Rogers DJ. The influence of the wind in the Schmallenberg virus outbreak in Europe. Scientific Reports. 2013;3. WOS:000327535200006. pmid:24285292
  23. 23. Endo N, Eltahir EAB. Prevention of malaria transmission around reservoirs: an observational and modelling study on the effect of wind direction and village location. The Lancet Planetary Health. 2018;2(9):e406–e13. pmid:30177009
  24. 24. McCann RS, Kabaghe AN, Moraga P, Gowelo S, Mburu MM, Tizifa T, et al. The effect of community-driven larval source management and house improvement on malaria transmission when added to the standard malaria control strategies in Malawi: a cluster-randomized controlled trial. Malaria Journal. 2021;20(1). WOS:000657758600003. pmid:34022912
  25. 25. van den Berg H, van Vugt M, Kabaghe AN, Nkalapa M, Kaotcha R, Truwah Z, et al. Community-based malaria control in southern Malawi: a description of experimental interventions of community workshops, house improvement and larval source management. Malaria Journal. 2018;17. WOS:000438821100003.
  26. 26. McCann RS, van den Berg H, Diggle PJ, van Vugt M, Terlouw DJ, Phiri KS, et al. Assessment of the effect of larval source management and house improvement on malaria transmission when added to standard malaria control strategies in southern Malawi: study protocol for a cluster-randomised controlled trial. BMC Infect Dis. 2017;17(1):639. Epub 2017/09/25. pmid:28938876; PubMed Central PMCID: PMC5610449.
  27. 27. Kabaghe AN, Chipeta MG, McCann RS, Phiri KS, van Vugt M, Takken W, et al. Adaptive geostatistical sampling enables efficient identification of malaria hotspots in repeated cross-sectional surveys in rural Malawi. Plos One. 2017;12(2). WOS:000394423900065. pmid:28196105
  28. 28. Amoah B, McCann RS, Kabaghe AN, Mburu M, Chipeta MG, Moraga P, et al. Identifying Plasmodium falciparum transmission patterns through parasite prevalence and entomological inoculation rate. Elife. 2021;10:2021.01.14.426709. Epub 2021/10/22. pmid:34672946; PubMed Central PMCID: PMC8530514.
  29. 29. Mzilahowa T, Hastings IM, Molyneux ME, McCall PJ. Entomological indices of malaria transmission in Chikhwawa district, Southern Malawi. Malaria Journal. 2012;11. WOS:000313240200001. pmid:23171123
  30. 30. Gowelo S, Chirombo J, Koenraadt CJM, Mzilahowa T, van den Berg H, Takken W, et al. Characterisation of anopheline larval habitats in southern Malawi. Acta Trop. 2020;210. WOS:000563497800007. pmid:32485166
  31. 31. Hiscox A, Otieno B, Kibet A, Mweresa CK, Omusula P, Geier M, et al. Development and optimization of the Suna trap as a tool for mosquito monitoring and control. Malaria Journal. 2014;13. WOS:000339576100001. pmid:24998771
  32. 32. Mburu MM, Zembere K, Hiscox A, Banda J, Phiri KS, van den Berg H, et al. Assessment of the Suna trap for sampling mosquitoes indoors and outdoors. Malaria Journal. 2019;18. WOS:000459403900003. pmid:30795766
  33. 33. Gillies M, Coetzee M. A supplement to the Anophelinae of Africa South of the Sahara. Publ S Afr Inst Med Res. 1987;55:1–143.
  34. 34. Koekemoer LL, Kamau L, Hunt RH, Coetzee M. Cocktail polymerase chain reaction assay to identify members of the Anopheles funestus (Diptera: Culicidae) group. Am J Trop Med Hyg. 2002;66(6):804–11. WOS:000177660700031. pmid:12224596
  35. 35. Scott JA, Brogdon WG, Collins FH. Identification of Single Specimens of the Anopheles-Gambiae Complex by the Polymerase Chain-Reaction. Am J Trop Med Hyg. 1993;49(4):520–9. WOS:A1993MD96400018. pmid:8214283
  36. 36. Perandin F, Manca N, Calderaro A, Piccolo G, Galati L, Ricci L, et al. Development of a real-time PCR assay for detection of Plasmodium falciparum, Plasmodium vivax, and Plasmodium ovale for routine clinical diagnosis. J Clin Microbiol. 2004;42(3):1214–9. WOS:000220376900043. pmid:15004078
  37. 37. R Core Team. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2021.
  38. 38. Morin J, Olsson C, Atikcan EÖ. Research Methods in the Social Sciences: an A-Z of Key Concepts: Oxford University Press; 2021.
  39. 39. Pfeiffer DU, Robinson TP, Stevenson M, Stevens KB, Rogers DJ, Clements AC. Spatial analysis in epidemiology: OUP Oxford; 2008.
  40. 40. Baddeley A, Diggle PJ, Hardegen A, Lawrence T, Milne RK, Nair G. On tests of spatial pattern based on simulation envelopes. Ecol Monogr. 2014;84(3):477–89. WOS:000340081900006.
  41. 41. Bousema T, Stresman G, Baidjoe AY, Bradley J, Knight P, Stone W, et al. The Impact of Hotspot-Targeted Interventions on Malaria Transmission in Rachuonyo South District in the Western Kenyan Highlands: A Cluster-Randomized Controlled Trial. Plos Med. 2016;13(4). WOS:000375096900005. pmid:27071072
  42. 42. Bousema T, Griffin JT, Sauerwein RW, Smith DL, Churcher TS, Takken W, et al. Hitting Hotspots: Spatial Targeting of Malaria for Control and Elimination. Plos Med. 2012;9(1). WOS:000300419600013. pmid:22303287
  43. 43. Oliver MA, Webster R. A tutorial guide to geostatistics: Computing and modelling variograms and kriging. Catena. 2014;113:56–69. WOS:000328237500006.
  44. 44. Tatem AJ. Comment: WorldPop, open data for spatial demography. Sci Data. 2017;4. WOS:000393026200001.
  45. 45. Burnham KP, Anderson DR. Model Selection and Multimodel Inference2004.
  46. 46. Frei J, Krober T, Troccaz M, Starkenmann C, Guerin PM. Behavioral response of the malaria mosquito, Anopheles gambiae, to human sweat inoculated with axilla bacteria and to volatiles composing human axillary odor. Chem Senses. 2017;42(2):121–31. WOS:000397065700142. pmid:27789516
  47. 47. Midega JT, Smith DL, Olotu A, Mwangangi JM, Nzovu JG, Wambua J, et al. Wind direction and proximity to larval sites determines malaria risk in Kilifi District in Kenya. Nature Communications. 2012;3. WOS:000302060100015. pmid:22334077
  48. 48. Woolhouse MEJ, Dye C, Etard JF, Smith T, Charlwood JD, Garnett GP, et al. Heterogeneities in the transmission of infectious agents: Implications for the design of control programs. P Natl Acad Sci USA. 1997;94(1):338–42. WOS:A1997WC34700062. pmid:8990210
  49. 49. Martin LB, Addison B, Bean AGD, Buchanan KL, Crino OL, Eastwood JR, et al. Extreme Competence: Keystone Hosts of Infections. Trends Ecol Evol. 2019;34(4):303–14. WOS:000461826200007. pmid:30704782
  50. 50. Mayagaya VS, Nkwengulila G, Lyimo IN, Kihonda J, Mtambala H, Ngonyani H, et al. The impact of livestock on the abundance, resting behaviour and sporozoite rate of malaria vectors in southern Tanzania. Malar J. 2015;14:17. Epub 2015/01/22. pmid:25604150; PubMed Central PMCID: PMC4311485.
  51. 51. Smith DL, Musiime AK, Maxwell K, Lindsay SW, Kiware S. A New Test of a Theory about Old Mosquitoes. Trends Parasitol. 2021;37(3):185–94. Epub 2020/12/01. pmid:33250441; PubMed Central PMCID: PMC7878312.
  52. 52. Kang SY, Battle KE, Gibson HS, Cooper LV, Maxwell K, Kamya M, et al. Heterogeneous exposure and hotspots for malaria vectors at three study sites in Uganda. Gates Open Res. 2018;2:32. Epub 2019/09/25. pmid:30706054; PubMed Central PMCID: PMC6350504.
  53. 53. Kigozi SP, Pindolia DK, Smith DL, Arinaitwe E, Katureebe A, Kilama M, et al. Associations between urbanicity and malaria at local scales in Uganda. Malar J. 2015;14:374. Epub 2015/09/30. pmid:26415959; PubMed Central PMCID: PMC4587721.
  54. 54. Kabaria CW, Gilbert M, Noor AM, Snow RW, Linard C. The impact of urbanization and population density on childhood Plasmodium falciparum parasite prevalence rates in Africa. Malaria Journal. 2017;16. WOS:000392762000003. pmid:28125996
  55. 55. Sturrock HJ, Novotny JM, Kunene S, Dlamini S, Zulu Z, Cohen JM, et al. Reactive case detection for malaria elimination: real-life experience from an ongoing program in Swaziland. PLoS One. 2013;8(5):e63830. Epub 2013/05/24. pmid:23700437; PubMed Central PMCID: PMC3658965.
  56. 56. Duffield GE, Acri DJ, George GF, Sheppard AD, Beebe NW, Ritchie SA, et al. Diel flight activity of wild-caught Anopheles farauti (s.s.) and An. hinesorum malaria mosquitoes from northern Queensland, Australia. Parasite Vector. 2019;12. WOS:000456457700003. pmid:30670073
  57. 57. Mburu MM, Mzilahowa T, Amoah B, Chifundo D, Phiri KS, van den Berg H, et al. Biting patterns of malaria vectors of the lower Shire valley, southern Malawi. Acta Trop. 2019;197:105059. Epub 2019/06/14. pmid:31194960.
  58. 58. Bayoh MN, Walker ED, Kosgei J, Ombok M, Olang GB, Githeko AK, et al. Persistently high estimates of late night, indoor exposure to malaria vectors despite high coverage of insecticide treated nets. Parasite Vector. 2014;7. WOS:000341609300001. pmid:25141761
  59. 59. Huho B, Briet O, Seyoum A, Sikaala C, Bayoh N, Gimnig J, et al. Consistently high estimates for the proportion of human exposure to malaria vector populations occurring indoors in rural Africa. Int J Epidemiol. 2013;42(1):235–47. WOS:000316699300029. pmid:23396849
  60. 60. Seyoum A, Sikaala CH, Chanda J, Chinula D, Ntamatungiro AJ, Hawela M, et al. Human exposure to anopheline mosquitoes occurs primarily indoors, even for users of insecticide-treated nets in Luangwa Valley, South-east Zambia. Parasite Vector. 2012;5. WOS:000308375700001. pmid:22647493
  61. 61. Moshi IR, Manderson L, Ngowo HS, Mlacha YP, Okumu FO, Mnyone LL. Outdoor malaria transmission risks and social life: a qualitative study in South-Eastern Tanzania. Malar J. 2018;17(1):397. Epub 2018/10/31. pmid:30373574; PubMed Central PMCID: PMC6206631.
  62. 62. Monroe A, Mihayo K, Okumu F, Finda M, Moore S, Koenker H, et al. Human behaviour and residual malaria transmission in Zanzibar: findings from in-depth interviews and direct observation of community events. Malar J. 2019;18(1):220. Epub 2019/07/03. pmid:31262306; PubMed Central PMCID: PMC6604484.
  63. 63. Charlwood JD. Studies on the bionomics of male Anopheles gambiae Giles and male Anopheles funestus Giles from southern Mozambique. J Vector Ecol. 2011;36(2):382–94. WOS:000298017700017. pmid:22129410
  64. 64. Minakawa N, Seda P, Yan GY. Influence of host and larval habitat distribution on the abundance of African malaria vectors in western Kenya. Am J Trop Med Hyg. 2002;67(1):32–8. WOS:000178069600006. pmid:12363061
  65. 65. Gillies MT. Studies on the dispersion and survival of Anopheles gambiae Giles in East Africa, by means of marking and release experiments. B Entomol Res. 1961;52(1):99–127.
  66. 66. Verdonschot PFM, Besse-Lototskaya AA. Flight distance of mosquitoes (Culicidae): A metadata analysis to support the management of barrier zones around rewetted and newly constructed wetlands. Limnologica. 2014;45:69–79. WOS:000330502500009.
  67. 67. Midega JT, Mbogo CM, Mwambi H, Wilson MD, Ojwang G, Mwangangi JM, et al. Estimating dispersal and survival of Anopheles gambiae and Anopheles funestus along the Kenyan coast by using mark-release-recapture methods. Journal of Medical Entomology. 2007;44(6):923–9. WOS:000251034200003. pmid:18047189
  68. 68. Stopard IJ, Churcher TS, Lambert B. Estimating the extrinsic incubation period of malaria using a mechanistic model of sporogony. PLoS Comput Biol. 2021;17(2):e1008658. Epub 2021/02/17. pmid:33591963; PubMed Central PMCID: PMC7909686.
  69. 69. Jatta E, Jawara M, Bradley J, Jeffries D, Kandeh B, Knudsen JB, et al. How house design affects malaria mosquito density, temperature, and relative humidity: an experimental study in rural Gambia. The Lancet Planetary Health. 2018;2(11):e498–e508. pmid:30396441
  70. 70. Chiodini PL. Malaria diagnostics: now and the future. Parasitology. 2014;141(14):1873–9. Epub 2014/09/13. pmid:25216364.
  71. 71. Briggs J, Teyssier N, Nankabirwa JI, Rek J, Jagannathan P, Arinaitwe E, et al. Sex-based differences in clearance of chronic Plasmodium falciparum infection. Elife. 2020;9. Epub 2020/10/28. pmid:33107430; PubMed Central PMCID: PMC7591246.
  72. 72. Buchwald AG, Sorkin JD, Sixpence A, Chimenya M, Damson M, Wilson ML, et al. Association Between Age and Plasmodium falciparum Infection Dynamics. Am J Epidemiol. 2019;188(1):169–76. Epub 2018/09/27. pmid:30252032; PubMed Central PMCID: PMC6321803.
  73. 73. Suh E, Grossman MK, Waite JL, Dennington NL, Sherrard-Smith E, Churcher TS, et al. The influence of feeding behaviour and temperature on the capacity of mosquitoes to transmit malaria. Nat Ecol Evol. 2020;4(7):940–51. Epub 2020/05/06. pmid:32367033; PubMed Central PMCID: PMC7334094.
  74. 74. Paaijmans KP, Blanford S, Bell AS, Blanford JI, Read AF, Thomas MB. Influence of climate on malaria transmission depends on daily temperature variation. Proc Natl Acad Sci U S A. 2010;107(34):15135–9. Epub 2010/08/11. pmid:20696913; PubMed Central PMCID: PMC2930540.
  75. 75. Traore DF, Sagna AB, Adja AM, Zoh DD, Adou KA, Lingue KN, et al. Exploring the heterogeneity of human exposure to malaria vectors in an urban setting, Bouake, Cote d’Ivoire, using an immuno-epidemiological biomarker. Malar J. 2019;18(1):68. Epub 2019/03/13. pmid:30857543; PubMed Central PMCID: PMC6413440.
  76. 76. Nelson CS, Sumner KM, Freedman E, Saelens JW, Obala AA, Mangeni JN, et al. High-resolution micro-epidemiology of parasite spatial and temporal dynamics in a high malaria transmission setting in Kenya. Nat Commun. 2019;10(1):5615. Epub 2019/12/11. pmid:31819062; PubMed Central PMCID: PMC6901486.
  77. 77. Searle KM, Lubinda J, Hamapumbu H, Shields TM, Curriero FC, Smith DL, et al. Characterizing and quantifying human movement patterns using GPS data loggers in an area approaching malaria elimination in rural southern Zambia. R Soc Open Sci. 2017;4(5):170046. Epub 2017/06/03. pmid:28573009; PubMed Central PMCID: PMC5451810.
  78. 78. Thommes EW, Mahmud SM, Young-Xu Y, Snider JT, van Aalst R, Lee JKH, et al. Assessing the prior event rate ratio method via probabilistic bias analysis on a Bayesian network. Stat Med. 2020;39(5):639–59. WOS:000499819900001. pmid:31788843