Skip to main content
Advertisement
  • Loading metrics

Livestock, pathogens, vectors, and their environment: A causal inference-based approach to estimating the pathway-specific effect of livestock on human African trypanosomiasis risk

  • Julianne Meisner ,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    meisnerj@uw.edu

    Affiliations Center for One Health Research, University of Washington, Seattle, Washington, United States of America, Department of Global Health, University of Washington, Seattle, Washington, United States of America

  • Agapitus Kato ,

    Contributed equally to this work with: Agapitus Kato, Marshall Lemerani, Erick Mwamba Miaka, Acaga Ismail Taban

    Roles Conceptualization, Formal analysis, Project administration, Resources, Supervision, Writing – review & editing

    Affiliation Uganda Virus Research Institute, Entebbe, Uganda

  • Marshall Lemerani ,

    Contributed equally to this work with: Agapitus Kato, Marshall Lemerani, Erick Mwamba Miaka, Acaga Ismail Taban

    Roles Conceptualization, Data curation, Methodology, Project administration, Supervision, Writing – review & editing

    Affiliation Trypanosomiasis Control Program, Ministry of Health, Lilongwe, Malawi

  • Erick Mwamba Miaka ,

    Contributed equally to this work with: Agapitus Kato, Marshall Lemerani, Erick Mwamba Miaka, Acaga Ismail Taban

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Project administration, Supervision, Writing – review & editing

    Affiliation Programme National de Lutte contre la Trypanosomiase Humaine Africaine, Kinshasa, Democratic Republic of Congo

  • Acaga Ismail Taban ,

    Contributed equally to this work with: Agapitus Kato, Marshall Lemerani, Erick Mwamba Miaka, Acaga Ismail Taban

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Visualization, Writing – review & editing

    Affiliation HISP South Sudan, Juba, South Sudan

  • Jonathan Wakefield,

    Roles Conceptualization, Formal analysis, Methodology, Software, Supervision, Validation, Visualization, Writing – review & editing

    Affiliations Department of Biostatistics, University of Washington, Seattle, Washington, United States of America, Department of Statistics, University of Washington, Seattle, Washington, United States of America

  • Ali Rowhani-Rahbar,

    Roles Conceptualization, Investigation, Methodology, Supervision, Validation, Writing – review & editing

    Affiliation Department of Epidemiology, University of Washington, Seattle, Washington, United States of America

  • David M. Pigott,

    Roles Formal analysis, Methodology, Supervision, Validation, Visualization, Writing – review & editing

    Affiliation Department of Health Metrics Sciences, University of Washington, Seattle, Washington, United States of America

  • Jonathan Mayer,

    Roles Conceptualization, Methodology, Supervision, Visualization, Writing – review & editing

    Affiliation Department of Epidemiology, University of Washington, Seattle, Washington, United States of America

  • Peter M. Rabinowitz

    Roles Conceptualization, Methodology, Project administration, Resources, Supervision, Visualization, Writing – review & editing

    Affiliations Center for One Health Research, University of Washington, Seattle, Washington, United States of America, Department of Global Health, University of Washington, Seattle, Washington, United States of America, Department of Environmental and Occupational Health Sciences, University of Washington, Seattle, Washington, United States of America

Abstract

Livestock are important reservoirs for many zoonotic diseases, however the effects of livestock on human and environmental health extend well beyond direct disease transmission. In this retrospective ecological cohort study we use pre-existing data and the parametric g-formula, which imputes potential outcomes to quantify mediation, to estimate three hypothesized mechanisms by which livestock can influence human African trypanosomiasis (HAT) risk: the reservoir effect, where infected cattle and pigs are a source of infection to humans; the zooprophylactic effect, where preference for livestock hosts exhibited by the tsetse fly vector of HAT means that their presence protects humans from infection; and the environmental change effect, where livestock keeping activities modify the environment in such a way that habitat suitability for tsetse flies, and in turn human infection risk, is reduced. We conducted this study in four high burden countries: at the point level in Uganda, Malawi, and Democratic Republic of Congo (DRC), and at the county level in South Sudan. Our results indicate cattle and pigs play a reservoir role for the rhodesiense form (rHAT) in Uganda (rate ratio (RR) 1.68, 95% CI 0.84, 2.82 for cattle; RR 2.16, 95% CI 1.18, 3.05 for pigs), however zooprophylaxis outweighs this effect for rHAT in Malawi (RR 0.85, 95% CI 0.68, 1.00 for cattle, RR 0.38, 95% CI 0.21, 0.69 for pigs). For the gambiense form (gHAT) we found evidence that pigs may be a competent reservoir (RR 1.15, 95% CI 0.92, 1.72 in Uganda; RR 1.25, 95% CI 1.11, 1.42 in DRC). Statistical significance was reached for rHAT in Malawi (pigs and cattle) and Uganda (pigs only) and for gHAT in DRC (pigs and cattle). We did not find compelling evidence of an environmental change effect (all effect sizes close to 1).

Introduction

Domestic and wild animals—in particular bovids such as cattle, bushbuck, and African buffalo, and suids such as domestic pigs, bushpigs, and warthogs—are known to be important reservoirs for the acute form of human African trypanosomiasis, caused by Trypanosoma brucei rhodesiense(rHAT) [14], and there is some evidence that domestic pigs may also be reservoirs for the chronic form, caused by T. b. gambiense (gHAT) [5]. Hereafter, we use the term “livestock” to refer to cattle and domestic pigs.

However, livestock may also influence the distribution of HAT beyond their role as reservoir hosts. HAT is transmitted by tsetse flies, and several species which are important HAT vectors have been documented to prefer animal hosts, and to only bite humans during times of stress [6]. Also appreciated with the malaria vector Anopheles arabiensis [79], this preference may shield humans from tsetse bites and thus HAT exposure when favored animal hosts are nearby, termed zooprophylaxis. The fly biting behaviors that favor a zooprophylactic role versus a reservoir role are distinct, with strongly zoophilic tsetse being unlikely to be competent HAT vectors, and animal reservoirs unlikely to be important where the predominant HAT vector is strongly anthrophilic. Furthermore, zoonotic transmission requires host switching—whereby a given fly feeds on an animal host and, subsequently, a human host—while zooprophylaxis does not [10]. Tsetse preference for animal hosts is thus harmful (up to a threshold) for HAT control in the presence of an animal reservoir, and helpful in its absence. Consistent with this, gHAT models have found that in the absence of animal reservoirs tsetse biting preference for animals had a negative effect on HAT risk, while in the presence of animal reservoirs biting preference for humans became more influential [10]. More recently, mathematical modeling using data from DRC detected statistical support for animal reservoirs of gHAT, leading the authors to conclude that while animal reservoirs would be unlikely to maintain transmission on their own, they could stall progress towards elimination targets in some settings [11].

Livestock grazing can bring about ecosystem degradation, in the form of desertification (the conversion of fertile land to desert) in arid climates, increased woody plant cover in semi-arid rangelands, and deforestation in humid climates [12]. As early as 1988, it was noted that overgrazing could potentially change local and global weather systems [13]. Insect vectors of public health importance, including tsetse fly vectors of HAT, are sensitive to environmental factors [14]. In Burkina Faso and Ghana, cattle keeping, crop farming, and other climactic and anthropogenic factors have altered riverine landscapes, fragmenting gallery forests and influencing tsetse distribution [15]. Similarly, in Ethiopia, where animal African trypanosomiasis (AAT) is endemic but HAT is not, regulatory changes leading to expansion in agricultural land was documented to alter the distribution of tsetse flies [16]. Furthermore, the clearing of bush by subsistence livestock agriculture in pre-colonial Africa is credited with small-scale control of tsetse, termed “agricultural prophylaxis,” and the resurgence of bush following catastrophic livestock losses to rinderpest (“cattle plague”) is credited with the sleeping sickness outbreaks of the early 20th century [13].

Livestock may thus come to bear on HAT distribution through three hypothesized pathways: disease reservoir, zooprophylaxis, and environmental change. While One Health approaches to HAT control target the reservoir pathway, the environmental change pathway was successfully employed for HAT and AAT control in pre-colonial Africa through agricultural prophylaxis, and in colonial Africa through environmentally-traumatic approaches such as brush clearing and widespread use of persistent organochlorine insecticides. Furthermore, vector control strategies that target livestock, such as insecticide treatment of cattle (ITC), exploit the close proximity of cattle to humans and their need to travel daily to the riverine habitats favored by gHAT vectors for water, and thus zooprophylaxis [17, 18].

Estimation of pathway-specific effects can narrow knowledge gaps surrounding the epidemiology of HAT, facilitate the targeted deployment of control measures, and provide key parameters for mathematical modeling. Previous authors have estimated elements of these effects in select foci, including the predictive ability of remote-sensed environmental data for tsetse distribution in Nuguruman, Kenya [19], and the effects of trypanocide mass-treatment of cattle on rHAT risk in Uganda [20], of habitat fragmentation on rHAT risk in Eastern Zambia [21], and of animal husbandry and climate on AAT risk in Burkina Faso [22]. Mediation analysis represents a powerful addition to this literature by quantifying the contribution of various mechanistic pathways to the total effect of livestock on HAT risk, allowing the estimation of the epidemiological significance of each pathway relative to the others. Such an approach is critical to addressing the significant uncertainty that remains surrounding the role of pigs as reservoirs of gHAT, and the ability of cattle to bring about environmental changes that in turn affect the distribution of HAT.

In this study, we build on our previous efforts to estimate the total effect of livestock density on HAT risk using the parametric g-formula [23], by applying the mediational g-formula to estimate the magnitude to which this effect is mediated by environmental change, in three high-burden countries: Malawi (rHAT), Uganda (rHAT and gHAT), and Democratic Republic of Congo (DRC, gHAT). We accomplished this using remote-sensed data, HAT surveillance data collated by the WHO Atlas of HAT, and our new time-series of high resolution livestock density maps [24]. We define livestock density as the ratio of livestock (separately for cattle and pigs) to humans, and fit all models separately for each country, for gHAT and rHAT, and for two mediators: normalized difference vegetation index (NDVI), a measure of vegetation cover, and land surface temperature (LST), both of which may be feasibly altered by livestock keeping or by the presence of livestock themselves. Additionally, in South Sudan (gHAT) we use a spatial extension of the regression approach to mediation analyses described by VanderWeele [25], and a single county-level map of livestock density produced for 2008 [24].

Materials and methods

Our study is a retrospective ecological cohort study with cluster-year as the unit of analysis in Malawi, Uganda, and DRC, and county (administrative level 2) as the unit of analysis in South Sudan. This distinction was necessary as cluster-level geolocation was not available for livestock data in South Sudan [24]. Cluster is defined as a 0.017° pixel.

Ethical approvals

As data collected were already de-identified, the University of Washington Institutional Review Board determined this work did not constitute engagement with human subjects (STUDY00004648). Similarly, no ethical review was required in DRC. Approvals were granted from the National Health Sciences Research Committee in Malawi (approval No. 2166), the South Sudan Research Ethics Committee (MOH/ERB21/2018), and the Uganda National Council of Science and Technology (ADM 194/212/01).

Study population

In Uganda, DRC, and Malawi, our study population included all clusters within five hours’ travel time of a fixed health facility capable of HAT diagnosis [26], defined separately for gHAT and rHAT. In South Sudan all counties which reported one or more HAT cases or conducted active surveillance during the study period comprised the study area.

The study period was defined separately for each country, on the basis of availability of HAT and livestock data: 2006–2018 for Uganda, 2003–2014 for Malawi, and 2010–2013 for DRC. In South Sudan the study period was restricted to 2008 alone as only the 2008 census provided geolocated (county-level) livestock ownership data for this country [24].

DAG formulation

In all study countries we adopted the counterfactual approach to mediation analysis, identifying confounders by a priori subject knowledge, which we encoded in a directed acyclic graph (DAG; Fig 1). Index events are natural disasters or armed conflicts in both rHAT and gHAT models, and proximity to protected areas—a proxy for density of wildlife reservoirs of HAT—in rHAT models only. Time-varying variables are indexed by t ∈ {0, 1, …T}, and variables in the minimally-sufficient adjustment set are boxed: {elevation, index events, LST, NDVI, wealth, vector control}.

thumbnail
Fig 1. Final time-varying directed acyclic graph (DAG), restricted to two time points.

Index events are natural disasters or armed conflicts. Protected areas pertains to rHAT models only. Exposure and outcome of interest are bolded, and confounders in the minimally-sufficient set are denoted by solid boxes.

https://doi.org/10.1371/journal.pgph.0002543.g001

Our DAG encodes our beliefs about the underlying temporal structure of these causal relationships. We believe LST, NDVI, and wealth exhibit exposure-confounder feedback, where LST, NDVI, and wealth at time t are a cause of livestock density at time t, which in turn is a cause of LST, NDVI and wealth at time t + 1. In the presence of exposure-confounder feedback confounding cannot be controlled using adjustment in regression models. Here we apply the mediational g-formula, one of Robin’s generalized methods (“g-methods”), which allows estimation of causal effects in the face of time-varying confounding with exposure-confounder feedback [2729]. As we did not have longitudinal data in South Sudan this issue did not arise; in this country, we applied the regression approach to mediation [27].

Note that while ethnicity, educational attainment, and religion are generally not time-varying at an individual-level, in our application these represent group-level variables (i.e., proportion receiving higher than a given level of education, proportion belonging to the majority ethnic group, and proportion belonging to Muslim, Christian, and other religions) and thus do vary in time.

The candidate confounders presented in this DAG do not represent a complete census of all factors that influence HAT distribution, as only variables which are either a cause of the exposure, or share a common cause with it, meet the necessary criteria to be a confounder [30]. For instance while medical interventions are critical determinants of HAT risk, they are not causes of livestock density, and their common causes are expected to be sufficiently approximated through wealth.

Measures

Exposure.

Exposure is livestock density, parameterized as the ratio of livestock (cattle and pigs, separately) to humans using maps we detail in a separate publication [24]. In Malawi, Uganda, and DRC these were a time-series of continuous (raster) maps. Due to limited data availability, in South Sudan we produced a single areal (county-level) map for 2008.

Outcome data.

Outcome data included all new cases of HAT diagnosed in a given year and given cluster (Malawi, Uganda, DRC) or county (South Sudan) in the WHO Atlas of HAT [31]. Clusters which did not appear in the Atlas were either assigned 0 cases if they were at least 1km from any clusters with reported cases, and removed from all analyses otherwise. Denominator data came from the University of Southampton’s WorldPop project [32].

Confounders.

Confounders are those in the minimum sufficient adjustment set, defined above (DAG formulation). We estimated wealth using an exploratory factor analysis approach modeled after the DHS Wealth Index [33] but excluding livestock-related measures. We then used spatial modeling to generate continuous annual maps in Uganda, Malawi, and DRC, and a single areal map for 2008 for South Sudan (detailed in S1 Appendix).

Natural disasters and armed conflicts were identified using the EM-DAT and UCDP/PIRO Armed Conflict Databases, respectively [34, 35]. NDVI data came from the Land Long Term Data Record 5 (LTDR5) [36]; LST data came from MODIS/Terra MOD21 for 2000–2002 [37], and from MODIS/Aqua MYD21 for 2003–2014 [38]. Finally, elevation data came from GMTED2010 [39], with the median product used.

We did not attempt to adjust for vector control as it is not practical to parameterize farmer-led efforts on the scale of this study, and top-down vector control efforts tend to be deployed across an entire epidemic focus. This means every location within an epidemic focus would have the same same value for every location (e.g., across the entire modeled Uganda gHAT focus [40]; Fig 2), and no adjustment can be performed.

thumbnail
Fig 2. National maps with administrative areas represented in the study data highlighted in red.

(a) Malawi: traditional authority (administrative level 3); (b-d) Uganda and South Sudan: county (administrative level 2); (e) DRC: territory (administrative level 2). All base maps were obtained from GADM (Malawi: https://geodata.ucdavis.edu/gadm/gadm4.1/shp/gadm41MWIshp.zip; Uganda: https://geodata.ucdavis.edu/gadm/gadm4.1/shp/gadm41UGAshp.zip; South Sudan: https://geodata.ucdavis.edu/gadm/gadm4.1/shp/gadm41SSDshp.zip; DRC: https://geodata.ucdavis.edu/gadm/gadm4.1/shp/gadm41CODshp.zip).

https://doi.org/10.1371/journal.pgph.0002543.g002

Causal estimands

The potential outcomes framework for causal inference defines causal effects in terms of counterfactual outcomes. Under this framework’s cannonical notation, Y refers to outcome (here, HAT), M refers to mediator (here, NDVI and LST), A refers to exposure (here, livestock density), and superscripts refer to counterfactual outcomes. Thus is the counterfactual outcome that would be observed with exposure fixed at value a and the mediator taking the value that would be observed with exposure fixed to a (note here that counterfactuals are defined both for Y and for M, indicated by M also carrying a superscript).

Mediation is defined through direct and indirect effects. Together, the indirect and direct effects comprise the total effect, which is the result one would get if no mediation analysis was conducted. The direct effect is the part of the total effect that is not mediated, while the indirect effect is the part of the total effect that “goes through” the mediator.

In the potential outcomes framework, estimands are the natural direct effect and the natural indirect effect. The word “natural” is used because the mediator is not fixed at set value m. Rather, the investigator fixes the exposure level at a set value a, and then the mediator takes the value it would naturally take under that exposure level. Thus, this mediator value is also a counterfactual, and the superscript notation is used: Ma.

To calculate each of these two estimands, two counterfactuals must be compared. For the natural direct effect, the comparison is between two counterfactuals in which exposure changes from a* to a, but the mediator does not change (it is Ma* for both counterfactuals). For the natural indirect effect it is the opposite: the comparision is between two counterfactuals in which the exposure is fixed at a, but the mediator changes from Ma to Ma*. In both cases, the estimand is represented as a population average effect: for the natural direct effect (mediator fixed, exposure changed) and for the natural indirect effect (mediator changed, exposure fixed).

In order for the population-average effect estimated from observed data to represent the true causal mediation effect, several criteria must be met, called the “identifiability criteria.” These are (conditional) exchangeability, positivity, and stable unit treatment value assumption (SUTVA). SUTVA is comprised of consistency and no interference, detailed in S2 Appendix. Briefly, conditional exchangeability can be understood as no unmeasured confounding, and no interference can be understood as statistical independence between observations. Positivity and SUTVA have no corollaries outside of the potential outcomes framework, so we refer the reader to S2 Appendix for more detail.

The no interference assumption is likely violated in our study, the outcome is an infectious disease. We instead assume partial interference, in which each unit’s potential outcome is independent of the rest conditional on a specified interference set χi = {i1, i2, …} [41, 42]. Effectively, statistical independence can be assumed to hold outside of a buffer zone around each cluster. Reflecting the range of tsetse flies and grazing ranges of livestock, we define the interference set as all other clusters within a 5km radius [43, 44].

Mediational g-formula

The mediational g-formula uses model-based standardization to derive these estimands; its implementation, along with the identifiability criteria, are detailed in S2 Appendix.

Briefly, the g-formula uses standardization to estimate counterfactual outcomes, which the parametric g-formula extends to settings with numerous or high-dimensional confounders using parametric models [29]. To implement the parametric g-formula, regression models (glm models in R, in this case) are fit separately for each time-varying confounder, mediator, and outcome, and for each year. Starting with the first year of the study, the investigator then uses the fitted models to predict the value of the time-varying confounder or outcome that would have been observed under the exposure value that they specify (i.e., if every cluster received exposure level a). These predictions then become inputs for prediction at the next time point, and so on through all of the years of the study. The DAG is used to encode each regression model: which predictors to include, appropriate lags, etc. This is conducted separately for NDVI and LST, and separately for each counterfactual which needs to be modeled to generate the desired causal estimands (natural direct and natural indirect effects, in this case). For instance, as the natural direct effect requires estimation of , we fixed exposure at a when predicting out from the outcome and confounder models, and at a* when predicting out from the mediator models. The result is a distribution of outcome across all clusters and years. Uncertainty bounds are produced through non-parametric bootstrapping (100 simulations).

We defined a* as mean livestock density, and a as mean × 1.5, and specify our causal estimand on the ratio scale (S2 Appendix), generating a rate ratio analog corresponding to a 50% increase in exposure. Our targets of inference are: where s indexes simulation (100 in total), t indexes year, i indexes cluster, and is the one-year cumulative incidence of HAT for a given livestock species, under A = a.

Validation of the mediational g-formula is conducted through implementation of the “natural course” model, whereby exposure (livestock density) is modeled as for the time-varying confounders. If the modeled distribution is close to the observed distribution, model performance is deemed adequate.

Regression approach

Due to the absence of longitudinal exposure data in South Sudan, we applied a regression approach to our mediation analysis in this country. Under this approach, two models are fit—one for outcome, and one for mediator—and causal estimands are derived from linear combinations of parameters from these models.

Consistent with our DAG formulation, we assume that wealth measured in the 2008 South Sudan census is upstream of livestock density ascertained in the same census. Thus, wealth at time t1 constitutes a mediator-outcome confounder which is downstream of exposure (as livestock density is a cause of wealth), precluding a simple regression approach to mediation analysis [27]. However, wealth was only measured at time 0 (2008), thus we proceeded with the regression approach and accept that our results will not have a causal interpretation.

The outcome (Poisson) model was as follows, where c indexes county: where

  • Yc is the number of cases in county c in 2012
  • mc is the mediator value in county c in 2010, m ∈ {NDVI, LST}
  • μc is estimated livestock density in county c in 2008
  • α is a vector of coefficients
  • bc is a vector of exposure-outcome, exposure-mediator, and mediator-outcome confounders, b ∈ {2008 wealth; 2007 NDVI, LST, conflicts, disasters; 2009 conflicts, disasters}
  • Sc are county-level structured random effects which follow the ICAR model with marginal variance
  • ne(c) denotes neighbors (shared boundary) of county c
  • nc is the number of neighbors of county c
  • ϵc are unstructured (iid) county-level spatial random effects
  • Pc is the offset, given as population in county c

and the mediator (linear) model was: where

  • Mc is the mediator distribution in county c, M ∈ {2010 NDVI, 2010 LST}
  • γ is a vector of coefficients
  • μc, bc, Sc, and ϵc are as defined above

and ICAR is the intrinsic conditional autoregressive model, which smooths each county’s random effect to that of its neighbors.

These models do not contain an exposure × mediator interaction term (imposing the assumption of linearity in the direct and indirect effects of exposure on outcome across levels of the mediator), thus the direct effect is then calculated as: and the indirect effect is calculated as: where μμ* are the two values of livestock density we are comparing [27]. We have parameterized exposure such that this contrast corresponds to a 50% increase in livestock density, allowing direct comparison with mediational g-formula results in the other countries. We used penalized complexity (PC) priors for the smoothing model, with Pr(σϵ) > 1 = Pr(σs) > 1 = 0.01. This yields a posterior 99% credible interval for each random effect’s residual rate ratio of (0.36, 2.71) [45, 46].

We first fit naive models which did not account for measurement error in livestock density or wealth arising from the fact that these variables were estimated. We subsequently re-fit these specifying livestock density and wealth as random effects following a classical measurement error (MEC) model. We detail implementation of this model in S3 Appendix.

No off-the-shelf software is available to calculate standard errors for these estimands in the presence of spatial structure, thus we derived uncertainty estimates by taking 1,000 draws from the posterior distribution of each parameter in the above models, and taking the 2.5% and 97.5% percentiles of this distribution as the posterior 95% credible interval.

Data and code availability

HAT outcome data can be requested from the WHO (https://www.who.int/trypanosomiasis_african/country/foci_AFRO/en/) and livestock density maps can be downloaded from GitHub https://github.com/JulianneMeisnerUW/LivestockMaps. Mediator and confounder data are available for download from the websites linked in our References. All analyses were performed in R, and all code are available in the same GitHub repository.

Results

Descriptive statistics

After removing clusters which did not meet the inclusion criteria, we were left with 3,982 clusters/year in Malawi, 5,746 clusters/year for gHAT models in Uganda, 13,570 clusters/year for rHAT models in Uganda, and 247,205 clusters/year in DRC. In South Sudan, the study area was comprised of 17 counties. General study areas are presented in Fig 2.

Malawi did not have any armed conflicts during the study period, and South Sudan did not have any natural disasters. Descriptive statistics are presented in Tables 1 and 2, with additional descriptive statistics figures presented in S4 Appendix.

thumbnail
Table 1. Descriptive statistics, Malawi, Uganda, and DRC.

https://doi.org/10.1371/journal.pgph.0002543.t001

Mediational g-formula

Validation results indicated very strong agreement between the natural course model and reported HAT cases with the exception of 2010 and 2012 for the pig-gHAT models in Uganda. We removed these years from all subsequent analyses, meaning that they contributed to the models but not to the effect estimates presented herein as we do not trust their validity. Across the remaining years and models, the mean of the squared difference between observed livestock density and that predicted by the “natural course” model across cluster-years was <1 × 10−4.

Results are presented in Tables 35. In Uganda, Malawi, and DRC, we found no evidence of an indirect effect across both mediators, livestock species, and HAT types (all effect sizes close to 1).

In Uganda, a 50% increase in cattle density was associated with a 62% higher (95% CI 15% lower, 185% higher) risk of rHAT beyond the effect mediated by NDVI; a 68% higher (95% CI 16% lower, 182% higher) risk of rHAT beyond the effect mediated by LST; a 11% lower (95% CI 4% lower, 65% higher) risk of gHAT beyond the effect mediated by NDVI; and a 20% lower (95% CI 47% lower, 33% higher) risk of gHAT beyond the effect mediated by LST.

For pigs in Uganda, a 50% increase in density was associated with a 114% higher (95% CI 13% higher, 198% higher) risk of rHAT beyond the effect mediated by NDVI; a 116% higher (95% CI 18% higher, 205% higher) risk of rHAT beyond the effect mediated by NDVI; a 15% higher (95% CI 8% lower, 72% higher) risk of gHAT beyond that mediated by NDVI; and a 40% higher (95% CI 15% lower, 58% higher) risk of gHAT beyond that mediated by LST.

Thus results in Uganda were positive for pigs in both gHAT and rHAT foci, positive for cattle in rHAT foci, and negative for cattle in gHAT foci, reaching significance only for the pig-rHAT effect (Table 3).

In Malawi, we did not run LST models due to large amounts of missingness, and we removed protected area, elevation, and disaster from all models as very few HAT cases were observed during the study period leading to concerns regarding overfitting. For cattle, a 50% increase in density was associated with a 15% lower rHAT risk (95% CI 32% lower, null) beyond that mediated by NDVI. For pigs, a 50% increase in density was associated with a 62% lower (95% CI 79% lower, 31% lower) risk of rHAT, beyond that mediated by NDVI. Only pig results reached statistical significance (Table 4).

In DRC, a 50% increase in cattle density was associated with a 10% higher (95% CI 0% higher, 22% higher) risk of gHAT beyond the effect mediated by NDVI, and an 18% higher (95% CI 5% higher, 30% higher) risk of gHAT beyond the effect mediated by LST.

For pigs in DRC, a 50% increase in density was associated with a 21% higher (95% CI 7% higher, 33% higher) risk of gHAT beyond the effect mediated by NDVI, and a 25% higher (95% CI 11% higher, 42% higher) risk of gHAT beyond the effect mediated by LST.

DRC results are consistent with the gHAT results for pigs in Uganda, but discordant with cattle results; all DRC results reached statistical significance (Table 5).

Regression

As the main models are expected to be biased, we will only interpret the MEC models here. A 50% increase in cattle density was associated with a 50% lower risk of gHAT (95% CI 91% lower, 54% higher) for the effect mediated by LST, however the effect mediated by NDVI was very close to 1, indicating no evidence of an indirect effect. Direct effects for cattle were also negative, with a 50% increase in density associated with a 78% lower (95% CI 87% lower, 55% lower) risk of gHAT beyond the effect mediated by NDVI, and a 50% increase in density associated with a 72% lower (95% CI 83% lower, 55% lower) risk of gHAT beyond the effect mediated by LST. Both direct effects reached significance, however neithter indirect effect was significant (Table 6).

For pigs, a 50% increase in density was associated with a 38% lower (95% CI 91% lower, 35% higher) risk of gHAT for the effect mediated by NDVI, and 43% lower (95% CI 87% lower, 115% higher) risk of gHAT for the effect mediated by LST. The direct effect not mediated by LST was similarly negative, with a 50% increase in density associated with a 14% lower (95% CI 56% lower, 68% higher) risk of gHAT, however the direct effect not mediated by NDVI was positive, with a 50% increase in density associated with a 55% higher risk of gHAT (95% CI 48% lower, 268% higher). All uncertainty bounds were wide for pigs, with none of these results reaching statistical significance (Table 6).

Discussion

In Malawi, Uganda, and DRC, we did not find evidence that the effect of livestock on HAT risk is mediated by NDVI or LST. In South Sudan we did detect strong evidence of a mediation effect by LST for both cattle and pigs, and by NDVI for pigs. These results indicate the environmental pathway mediated by NDVI and LST is weak—if present—in rHAT and gHAT foci in Malawi, Uganda, and DRC. While point estimates were strong in South Sudan, uncertainty was generally large in this country, and these findings may furthermore be due to ecological bias (discussed in detail below).

For reservoirs, our direct (non-mediated effects) indicate that cattle and pigs may play a reservoir role for rHAT in Uganda and gHAT in DRC, and that pigs but not cattle may play a reservoir role for gHAT in Uganda. For zooprophylaxis, our results indicate pigs may play a zooprophylactic role for rHAT in Malawi, and cattle may play a zooprophylactic role for gHAT in Uganda. These results are largely consistent with our published total effect estimates [23], a finding that supports the validity of both sets of results as in the absence of an indirect effect, the direct effect is expected to resemble the total effect. In South Sudan we detected evidence of a zooprophylactic effect for cattle, but mixed evidence for pigs: the effect of pigs not mediated by NDVI was positive (reservoir), but the effect not mediated by LST was negative (zooprophylactic).

It is important to consider that these results are presented on the ratio scale, and thus should not be interpreted as a measure of impact. A risk factor for a disease with a very small (yet positive) effect size can still have substantial population-level impact if exposure to that risk factor is common, and conversely, a risk factor with a large effect size can have a very small population-level impact if exposure to that risk factor is very rare. Thus these results alone cannot tell us whether pigs or cattle play a large role in HAT transmission or maintenance, but rather whether there is evidence that such a role exists.

Where livestock play both a zooprophylactic and reservoir role, direct effect estimates reflect the balance between these opposing effects. For rHAT, our results provide evidence of a livestock reservoir effect in Uganda but not Malawi, where the zooprophylactic effect predominates. Our findings in Malawi are not necessarily discordant with what is known mechanistically—i.e., cattle and pigs are rHAT reservoirs—if this reservoir poses little risk to humans, allowing the zooprophylactic effect to dominate. This could arise if wildlife are the dominant source of human infection—due to host abundance, tsetse fly preference, contact patterns between hosts, adequate control of the domestic reservoir through insecticides or trypanocides, or other means. The spatial distribution of HAT in Malawi, where most active transmission occurs near protected areas, indicates this scenario may be consistent with HAT epidemiology in this setting [47]. A further mediation analysis examining an indicator of zooprophylaxis could be used to disentangle the zooprophylaxis and reservoir pathways, for instance using an indicator of tsetse preference and host-tsetse contact. For gHAT, our results point to a reservoir role for pigs in both Uganda and DRC and provide some evidence that pigs are a reservoir in South Sudan. These results also provide equivocal evidence that cattle may serve as a gHAT reservoir: while there is evidence of a slight reservoir effect for gHAT in DRC, in Uganda and South Sudan a zooprophylactic effect was detected.

As with any imperfect study—interventional or observational, randomized or not—our results may also be the result of bias. Unmeasured confounding due to omission of vector control from our models, and the need to drop several confounders from our Malawi model due to overfitting concerns, may lead to residual confounding. As a result we encourage caution in interpreting the Malawi results. Vector control efforts could not be included in our model as we had no measure of farmer-led efforts, and top-down efforts did not have any variance within each epidemic focus modeled (i.e., would have the same value at every location and thus cannot be modeled and do not constitute confounders). Further, proximity to protected areas is an imperfect proxy for density of wildlife hosts, as wildlife can and do exist outside of protected areas.

Selection bias may arise if clusters or counties excluded from the study are actually at risk of reporting a HAT case, and differ systematically in their distribution of the exposure (livestock density), mediators (LST and NDVI) or confounders (index events, elevation, wealth, vector control). In South Sudan, this could arise if underreporting in the outcome data (discussed below) results in exclusion of an at-risk county from our analyses completely.

One of the major sources of bias in our study is underreporting in the outcome data. If such underreporting is non-differential with respect to HAT risk (i.e., clusters or counties at high risk of HAT are no more likely to underreport than those at low risk, even if high-risk individuals are less likely to enter the reporting system than low-risk individuals), effect estimates will be biased to the null in expectation. While weak surveillance infrastructure may drive an association between cluster- or county-level HAT risk and HAT reporting, we expect this mechanism to largely arise via civil unrest (conflict), which we have attempted to control for.

With the exception of South Sudan, we have not addressed uncertainty in livestock density and wealth in our estimates, and in South Sudan our implementation of the measurement error model assumes errors in wealth and livestock density are not correlated in space, which is likely violated. Measurement error may also arise in our confounders, driving residual confounding, and in our mediators (NDVI and LST), biasing direct and indirect effects.

In a country as large as DRC, there is likely subnational variation in the effect of livestock density on HAT risk. This could be captured by fitting models separately for subnational regions, for instance province. However given that all models need to be run four times (for all combinations of cattle, pigs, NDVI, and LST), this would markedly increase hypotheses tested. As a result, each country-level effect estimate we present here should be interpreted as a weighted average of locally-varying effects.

We have included conflict as a confounder since conflict can influence transmission of HAT through displacement and migration, and detection of HAT due to either weakened public health infrastructure, or enhanced surveillance by NGOs in migrant settlements. If these impacts are contained to the years in which conflicts were defined in our dataset, we do not expect our results to be biased. However the impacts of conflicts nearly always linger after the conflict, likely resulting in residual confounding during post-conflict years in the affected locations.

Finally, ecological bias arises from aggregation of variables, the presence of an unmeasured individual-level confounder whose association with exposure differs across groups, or the presence of an unmeasured individual-level effect modifier whose effect or distribution differs across groups [4850]. As ecological bias is sensitive to grouping definition, we are particularly concerned about its effects in South Sudan, where spatial units were large and irregular.

Conclusion

Concerns about overfitting and residual confounding in the Malawi analyses, ecological bias in the South Sudan analyses, and underreporting in the outcome data for all countries aside, our study takes a novel approach to estimating the pathway-specific impact of cattle and pigs on gHAT and rHAT risk. By conducting our study across four high-burden countries, our findings lend strong supporting evidence in favor of a reservoir effect of pigs for gHAT. Our results also indicate that interventions on domestic animal reservoirs may be more likely to yield positive results in Uganda than Malawi, suggesting efforts should instead be targeted to vector control and strengthening passive surveillance systems. Control measures targeting the environmental change pathway are unlikely to be of much utility in these countries, their logistical and ethical challenges notwithstanding. Broad agreement with our previously-published total effect estimates [23] supports the validity of both results.

In conjunction with high-resolution livestock maps [24] these results can be interpreted on a policy-relevant scale, allowing policymakers and other stakeholders to identify priority areas for implementing domestic animal AAT control in a coordinated One Health framework.

Supporting information

S1 Appendix. Wealth mapping.

Methods and results.

https://doi.org/10.1371/journal.pgph.0002543.s001

(PDF)

S2 Appendix. Motivation and implementation of the mediational g-formula.

https://doi.org/10.1371/journal.pgph.0002543.s002

(PDF)

S3 Appendix. Implementation of the measurement error model in South Sudan.

https://doi.org/10.1371/journal.pgph.0002543.s003

(PDF)

Acknowledgments

The authors wish to express sincere appreciation to the World Health Organization’s Department of Control of Neglected Tropical Diseases for use of their Atlas of HAT data, and to the HAT research community, in particular Prof. Eric Fevré and Dr. Richard Selby.

References

  1. 1. Fevre EM, Coleman P, Odiit M, Magona JW, Welburn SC, Woolhouse MEJ. The origins of a new Trypanosoma brucei rhodesiense sleeping sickness outbreak in eastern Uganda. Lancet. 2001;358(625-628). pmid:11530149
  2. 2. Hamill LC, Kaare MT, Welburn SC, Picozzi K. Domestic pigs as potential reservoirs of human and animal trypanosomiasis in northern Tanzania. Parasites and Vectors. 2013;6:322. pmid:24499540
  3. 3. Selby R, Bardosh K, Picozzi K, Waiswa C, Welburn SC. Cattle movements and trypanosomes: restocking efforts and the spread of Trypanosoma brucei rhodesiense sleeping sickness in post-conflict Uganda. Parasites and Vectors. 2013;6:281. pmid:24289452
  4. 4. Picozzi K, Fevre E, Odiit M, Carrington M, Eisler MC, Maudlin I, et al. Sleeping sickness in Uganda: a thin line between two fatal diseases. Briths Medical Journal. 2005;331:1238–1241. pmid:16308383
  5. 5. Buscher P, Bart JM, Boelaert M, Bucheton B, Cecchi G, Chitnis N, et al. Do Cryptic Reservoirs Threaten Gambiense-Sleeping Sickness Elimination? Trends Parasitol. 2018;34(3):197–207. pmid:29396200
  6. 6. Rogers DJ. Satellites, space, time and the African trypanosomiases. Adv Parasitol. 2000;47:129–71. pmid:10997206
  7. 7. Lunde TM, Lintjorn B. Cattle and climate in Africa: How climate variability has influenced national cattle holdings from 1961–2008. PeerJ. 2013;. pmid:23638393
  8. 8. Lunde TM, Balkew M, Korecha D, Gebre-Michael T, Massebo F, Sorteberg A, et al. A dynamic model of some malaria-transmitting anopheline mosquitoes of the Afrotropical region. I. Model description and sensitivity analysis. Malaria Journal. 2013;12(28). pmid:23342980
  9. 9. Lunde TM, Balkew M, Korecha D, Gebre-Michael T, Massebo F, Sorteberg A, et al. A dynamic model of some malaria-transmitting anopheline mosquitoes of the Afrotropical region. II. Validation of species distribution and seasonal variations. Malaria Journal. 2013;12(78). pmid:23442727
  10. 10. Stone CM, Chitnis N. Implications of Heterogeneous Biting Exposure and Animal Hosts on Trypanosomiasis brucei gambiense Transmission and Control. PLoS Comp Bio. 2015. pmid:26426854
  11. 11. Crump RE, Huang CI, Spencer SEF, Brown PE, Shampa C, Mwamba Miaka E, et al. Modelling to infer the role of animals in gambiense human African trypanosomiasis transmission and elimination in the DRC. PLoS Negl Trop Dis. 2022;16:e0010599. pmid:35816487
  12. 12. Asner GP, Elmore AJ, Olander LP, Martin RE, Harris AT. Grazing systems, ecosystem responses, and global change. Annu Rev Environ Resour. 2004;29:261–99.
  13. 13. Rogers DJ, Randolph SE. Tsetse flies in Africa: bane or boon? Conservation Biology. 1988;2(1):57–65.
  14. 14. Bouyer J, Dicko AH, Cecchi G, Ravel S, Guerrini L, Solano P, et al. Mapping landscape friction to locate isolated tsetse populations that are candidates for elimination. Proceedings of the National Academy of Sciences. 2015;112(47):14575–14580. pmid:26553973
  15. 15. Dicko AH, Percoma L, Sow A, Adam Y, Mahama C, Sidibe I, et al. A spatio-temporal model of African animal trypanosomiasis risk. PLoS Neg Trop Dis. 2015;9(7):e0003921.
  16. 16. Duguma R, Tasew S, Olani A, Damena D, Alemu D, Mulatu T, et al. Spatial distribution of Glossina sp. and Trypanosoma sp. in south-western Ethiopia. Parasites and Vectors. 2015;8(430). pmid:26286484
  17. 17. Shaw APM, Torr SJ, Waiwa C, Cecchi G, Wint GRW, Mattioli RC, et al. Estimating the costs of tsetse control options: an example for Uganda. Preventative Veterinary Medicine. 2013;110:290–303. pmid:23453892
  18. 18. Bauer B, Amsel-de Lafosse S, Calusen PH, Kabore I, Petrich-Bauer J. Successful application of deltamethrin pour on to cattle in a campaign against tsetse flies (Glossina spp.) in the pastoral zone of Samorogouan, Burkina Faso. Trop Med Parasitol. 1995;36(3):183–9.
  19. 19. Lin S, DeVisser MH, Messina JP. An agent-based model to simulate tsetse fly distribution and control techniques: a case study in Nguruman, Kenya. Ecol Modell. 2015;314:80–89. pmid:26309347
  20. 20. Hamill L, Picozzi K, Fyfe J, von Wissmann B, Wastling S, Wardrop N, et al. Evaluating the impact of targeting livestock for the prevention of human and animal trypanosomiasis, at village level, in districts newly affected with T. b. rhodesiense in Uganda. Infectious Diseases of Poverty. 2017;6(1):16. pmid:28162093
  21. 21. Mweempwa C, Marcotty T, De Pus C, Penzhorn BL, Dicko AH, Bouyer J, et al. Impact of habitat fragmentation on tsetse populations and trypanosomosis risk in Eastern Zambia. Parasit Vectors. 2015;8:406. pmid:26238201
  22. 22. Pagabeleguem S, Sangaré M, Bengaly Z, Akoudjin M, Belem AM, Bouyer J. Climate, cattle rearing systems and African Animal Trypanosomosis risk in Burkina Faso. PLoS One. 2012;7(11):e49762. pmid:23166765
  23. 23. Meisner J, Kato A, Lemerani M, Mwamba Miaka E, Ismail Taban A, Wakefield J, et al. The effect of livestock density on Trypanosoma brucei gambiense and T. b. rhodesiense: A causal inference-based approach. PLoS Negl Trop Dis. 2022;18(8):e0010155.
  24. 24. Meisner J, Kato A, Lemerani M, Mwamba Miaka E, Ismail AT, Wakefield J, et al. A time-series approach to mapping livestock density using household survey data. Scientific Reports. 2022;12:13310. pmid:35922452
  25. 25. VanderWeele TJ, Tchetgen Tchetgen EJ. Mediation analysis with time varying exposures and mediators. J R Stat Soc Series B Stat Methodol. 2017;79(3):917–938. pmid:28824285
  26. 26. Franco JR, Cecchi G, Priotto G, Paone M, Diarra A, Grout L, et al. Monitoring the elimination of human African trypanosomiasis: Update to 2014. PLoS Negl Trop Dis. 2017;11(5):e0005585. pmid:28531222
  27. 27. VanderWeele TJ. 2. In: Mediation: Introduction and Regression-Based Approaches. New York, NY: Oxford University Press; 2015.
  28. 28. VanderWeele TJ. Direct and indirect effects for neighborhood-based clustered and longitudinal data. Social Methods Res. 2010;38(4):515–544. pmid:25473138
  29. 29. Robins J, Hernan M. In: Estimation of the causal effects of time-varying exposures. Boca Raton, FL: Chapman and Hall; 2009. p. 553–599.
  30. 30. Hernan MA, Hernandez-Diaz S, Werler MM, Mitchell AA. Causal knowledge as a prerequisite for confounding evaluation: an application to birth defects epidemiology. Am J Epidemiol. 2002;155(2):176–84. pmid:11790682
  31. 31. World Health Organization. Mapping the distribution of human African trypanosomiasis: Uganda, 2000-2014;. Available from: https://www.who.int/trypanosomiasis_african/country/foci_AFRO/en/.
  32. 32. Stevens FR, Gaughan AE, Linard C, Tatem AJ. Disaggregating census data for population mapping using random forests with remote-sensed and ancillary data. PLOS ONE. 2015;10(2):e0107042. pmid:25689585
  33. 33. Wealth Index Construction;. https://www.dhsprogram.com/topics/wealth-index/Wealth-Index-Construction.cfm.
  34. 34. EM-DAT: The International Disaster Database;. http://www.emdat.be.
  35. 35. UCDP/PRIO Armed Conflict Dataset;. https://www.prio.org/Data/Armed-Conflict/UCDP-PRIO/.
  36. 36. LAADS DAAC. Products N_19AVH13C1, N_18AVH13C1, and N_16AVH13C1;. https://ladsweb.modaps.eosdis.nasa.gov.
  37. 37. MOD21A2 v006. MODIS/Terra Land Surface Temperature/3-Band Emissivity 8-Day L3 Global 1 km SIN Grid;. https://lpdaac.usgs.gov/products/mod21a2v006/.
  38. 38. MYD21A2 v006. MODIS/Aqua Land Surface Temperature/3-Band Emissivity 8-Day L3 Global 1 km SIN Grid;. https://lpdaac.usgs.gov/products/myd21a2v006/.
  39. 39. U.S. Geological Survey. Digital Eelevation—Global Multi-resolution Terrain Elevation Data 2010 (GMTED2010);. Available from: https://www.usgs.gov/land-resources/eros/coastal-changes-and-impacts/gmted2010.
  40. 40. Bessell PR, Esterhuizen J, Lehane MJ, Longbottom J, Mugenyi A, Selby R, et al. Estimating the impact of Tiny Targets in reducing the incidence of Gambian sleeping sickness in the North-west Uganda focus. Parasites Vectors. 2021;14(410). pmid:34407867
  41. 41. Liu L, Hudgens MG, Becker-Dreps S. On inverse probability-weighted estimators in the presence of interference. Biometrika. 2016;103(4):829–842. pmid:29422692
  42. 42. Saul BC, Hudgens MG, Mallin MA. Upstream Causes of Downstream Effects. ArXiv e-prints. 2017;.
  43. 43. Turner MD, Schlecht E. Livestock mobility in sub-Saharan Africa: A critical review. Pastoralism. 2019;9:13.
  44. 44. Torr SJ, Vale GA. Is the even distribution of insecticide-treated cattle essential for tsetse control? Modelling the impact of baits in heterogeneous environments. PLoS Negl Trop Dis. 2011;5(10):e1360. pmid:22028944
  45. 45. Simpson D, Rue H, Riebler A, Martins TG, Sorbye SH. Penalising Model Component Complexity: A Principled, Practical Approach to Constructing Priors. Stat Sci. 2017;32(1):1–28.
  46. 46. Fuglstad GA, Simpson D, Lindgren F, Rue H. Constructing Priors that Penalize the Complexity of Gaussian Random Fields. J Am Stat Assoc. 2019;114(525):445–452.
  47. 47. Franco JR, Cecchi G, Priotto G, Paone M, Diarra A, Grout L, et al. Monitoring the elimination of human African trypanosomiasis at continental and country level: Update to 2018. PLoS Neglected Tropical Diseases. 2020;14(5):1–18. pmid:32437391
  48. 48. Greenland S, Morgenstern H. Ecological Bias, Confounding, and Effect Modification. International Journal of Epidemiology. 1989;18(1):269–274. pmid:2656561
  49. 49. Greenland S, Robins JM. Invited commentary: ecological studies– biases, misconceptions, and counterexamples. Am J Epidemiol. 1994;139:45–59. pmid:8178788
  50. 50. Greenland S, Robins J. Accepting the limits of ecologic studies: Drs. Greenland and Robins reply to Drs. Piantadosi and Cohen. Am J Epidemiol. 1994;139(8):769–771.