Skip to main content
Advertisement
  • Loading metrics

Spatial variation in housing construction material in low- and middle-income countries: A Bayesian spatial prediction model of a key infectious diseases risk factor and social determinant of health

  • Josh M. Colston ,

    Roles Conceptualization, Data curation, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Writing – original draft, Writing – review & editing

    josh.colston@virginia.edu

    Affiliations Department of Medicine, Division of Infectious Disease and International Health, School of Medicine, University of Virginia, Charlottesville, Virginia, United States of America, Department of Public Health Sciences, University of Virginia School of Medicine, Charlottesville, Virginia, United States of America

  • Bin Fang,

    Roles Data curation, Formal analysis, Visualization, Writing – original draft

    Affiliation Department of Civil and Environmental Engineering, University of Virginia, Charlottesville, Virginia, United States of America

  • Malena K. Nong,

    Roles Visualization

    Affiliation College of Arts and Sciences, University of Virginia, Charlottesville, Virginia, United States of America

  • Pavel Chernyavskiy,

    Roles Conceptualization, Formal analysis, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Public Health Sciences, University of Virginia School of Medicine, Charlottesville, Virginia, United States of America

  • Navya Annapareddy,

    Roles Data curation, Formal analysis

    Affiliation School of Data Science, University of Virginia, Charlottesville, Virginia, United States of America

  • Venkataraman Lakshmi,

    Roles Conceptualization, Funding acquisition, Supervision, Writing – review & editing

    Affiliation Department of Public Health Sciences, University of Virginia School of Medicine, Charlottesville, Virginia, United States of America

  • Margaret N. Kosek

    Roles Conceptualization, Funding acquisition, Supervision, Writing – original draft, Writing – review & editing

    Affiliations Department of Medicine, Division of Infectious Disease and International Health, School of Medicine, University of Virginia, Charlottesville, Virginia, United States of America, Department of Public Health Sciences, University of Virginia School of Medicine, Charlottesville, Virginia, United States of America

Abstract

Housing infrastructure and quality is a major determinant of infectious disease risk and other health outcomes in regions where vector borne, waterborne and neglected tropical diseases are endemic. It is important to quantify the geographical distribution of improvements to dwelling components to identify and target resources towards populations at risk. This study aimed to model the sub-national spatial variation in housing materials using covariates with quasi-global coverage and use the resulting estimates to map predicted coverage across the world’s low- and middle-income countries. Data on materials used in dwelling construction were sourced from nationally representative household surveys conducted since 2005. Materials used for construction of flooring, walls, and roofs were reclassified as improved or unimproved. Households lacking location information were georeferenced using a novel methodology. Environmental and demographic spatial covariates were extracted at those locations for use as model predictors. Integrated nested Laplace approximation models were fitted to obtain, and map predicted probabilities for each dwelling component. The dataset compiled included information from households in 283,000 clusters from 350 surveys. Low coverage of improved housing was predicted across the Sahel and southern Sahara regions of Africa, much of inland Amazonia, and areas of the Tibetan plateau. Coverage of improved roofs and walls was high in the Central Asia, East Asia and Pacific and Latin America and the Caribbean regions. Improvements in all three components, but most notably floors, was low in Sub-Saharan Africa. The strongest determinants of dwelling component quality related to urbanization and economic development, suggesting that programs should focus on supply-side interventions, providing resources for housing improvements directly to the populations that need them. These findings are made available to researchers as files that can be imported into a GIS for integration into relevant analyses to derive improved estimates of preventable health burdens attributed to housing.

Introduction

The United Nations’ Sustainable Development Goals (SDGs) include ambitious commitments to fight communicable diseases (target 3.3) and provide adequate, safe and affordable housing (target 11.1) throughout its member states [1]. Although they fall under separate goals, housing quality has long been recognized as a social determinant of health and epidemiological evidence is now elucidating the mechanisms by which this relationship operates [2]. Many endemic infectious diseases of global public health concern, including several named in SDG3, are transmitted within and between households with the majority of infections occurring while the susceptible individual is at home [3], and consequently features of the built peridomestic environment and infrastructure play a role in promoting or impeding the spread of pathogens and their insect vectors [4]. This is particularly true in tropical and rural regions of Africa, Asia and Latin America where numerous vector borne and neglected tropical diseases circulate and where dwellings are often constructed using locally available, naturally occurring materials and traditional techniques such as wattle and daub, dried or burnt bricks, adobe, woven reed or bamboo and thatch [4]. These construction methods often require great skill and community mobilization to implement and are adapted over generations to suit local climate, ecology and topography, however numerous disease-causing insects and microbes are also well adapted to take advantage of the ecological niches that such buildings provide [5,6].

Infants and young children are particularly vulnerable to the health effects of housing construction material due to the high proportion of time spent in the family dwelling and behaviors common to early life such as crawling or playing on the floor [79]. Floors that are finished with wood, tiles or cement may protect against transmission of some diarrhea-causing enteric pathogens compared to those made of packed earth or sand either because they are easier to clean, or because they are less hospitable to pathogen survival outside the host [9]. Finished floors have been associated with decreases of 0.89 in Log10 E. coli contamination in Peru [10], 78%, 43% and 27% in soil-transmitted helminthiasis prevalence respectively in Mexican, Bangladeshi, and Kenyan children [11,12], and 9% for diarrheal disease risk, 11% for both enteric bacteria and enteric protozoa risk [8], and 17% for Shigella spp. infection probability in meta-analyses of children under 5 years across multiple LMIC surveillance sites [13]. Traditional roof material has also been shown to be associated with childhood diarrhea [14], even after adjusting for floor material [15]. Pooled analyses of household survey data from multiple countries have found associations of living in improved housing on numerous child health outcomes, including cognitive and social-emotional development [7], and nutritional status [16], in addition to malaria infection [17,18]. Additionally, there is evidence of increased acute respiratory illness (ARI) in children in Pakistan, with unimproved flooring increasing ARI risk by 18%, and unimproved walling materials also increasing the risk of ARI in children under the age of five [19]. These findings are supported by similar findings with different studies in India, Nigeria, and Lao PDR [2022].

As childhood mortality continues to decline globally, becoming concentrated in subnational hotspots it will be increasingly necessary to target interventions ever more specifically both geographically and to particular causes [23]. Several household-level determinants of health have been mapped at continental or global scale using survey data and spatial interpolation methods including water source and sanitation facility type [24], crowded living space [25], educational attainment [26], and relative wealth [27]. Tusting and colleagues have applied a similar approach to mapping houses built with finished materials across Sub-Saharan Africa for the years 2000 and 2015, defining such households as those having at least two out of three of the materials for the walls, roof and floor being finished, though they did not separate out these three components in their main analysis [28]. Building on these efforts, the aim of this study, a project of the Planetary Child Health & Enterics Observatory (Plan-EO, www.planeo.earth) [29], is to provide estimates of the coverage of improvements in each of these three dwelling components that can be used to more effectively target infectious disease control measures and other health interventions. To achieve this, we model the sub-national spatial variation in housing materials using covariates with quasi-global coverage and use the resulting estimates to map the predicted coverage across low- and middle-income countries (LMICs). The guiding hypothesis was that coverage of improved housing materials varies spatially as a function of environmental, and socio-demographic factors in a way that can be modelled using publicly available global datasets and state-of-the-art geostatistical methods.

Materials and methods

Objective and scope

The objective of this analysis was to estimate the percent coverage of each category of materials used in dwelling component construction at all locations throughout the world’s LMICs (as defined by the Organisation for Economic Co-operation and Development [30], excluding those in Europe).

Outcome variables

The categories of housing materials used in this analysis were those proposed by Florey and Taylor [18], who classify materials used for construction of flooring, walls, and roofs into natural, rudimentary, and finished types, and then further into improved and unimproved. Data relating to these variables were compiled from nationally representative, population-based household surveys with two-stage cluster-randomized sample designs such as the Demographic and Health Surveys (DHS) [31], the Multiple Indicator Cluster Surveys (MICS) [32] and others from sources provided in S1 Data. These programs collect information on coverage of health and development indicators and make the resulting microdata publicly available through their websites. All Standard DHSs, Malaria and AIDS Indicator Surveys (MIS and AIS) and MICS dating back to 2005 that collected information on housing material from any LMICs were included. For countries where no such surveys were available, either similar surveys from the 2000–2004 period or country-specific surveys were sourced where available. The unit of analysis was the household, and these were classified into three, mutually exclusive categories (natural, rudimentary, and finished) based on the housing material recorded by the survey interviewer for each of the three dwelling components (floors, walls, and roof) as shown in Table 1.

thumbnail
Table 1. Classification of construction materials for the three components of the dwelling used as three-category outcome variables (adapted from Florey and Taylor 2016 [18]).

https://doi.org/10.1371/journal.pgph.0003338.t001

Georeferencing households.

For this spatial analysis it was necessary to assign coordinates to each household representing its approximate location. Cluster-randomized surveys have a hierarchical design such that households are nested within clusters, the census enumeration areas that serve as the primary sampling unit, which are in turn nested within survey strata (sub-national region and urban/rural status). Typically, 25–30 households are sampled per cluster [33]. The DHS Program provides coordinates of the cluster centroids for most of the surveys they carry out [34] (though these are randomly “displaced”–systematically shifted up to a certain distance to preserve confidentiality [35]). However, these are not available for all clusters and surveys and equivalent coordinates have been made available only for a handful of MICS and no country-specific surveys. For this analysis, households were georeferenced to their displaced cluster centroid coordinates where available, otherwise their clusters were randomly assigned to populated settlement locations taken from the Humanitarian OpenStreetMap database [36] that fell within the same survey stratum (sub-national region and urban/rural status) with probability proportional to the population density of the settlement (extracted from the WorldPop [37] database at settlement coordinates). OpenStreetMap settlements were reclassified such that cities and towns were categorized as urban, and villages, hamlets, and isolated dwellings as rural. This novel cluster location assignment process was automated in ArcGIS Pro ModelBuilder [38] and Stata 18 [39].

Covariates

A suite of time-static environmental and demographic spatial covariates available in raster format were compiled based on their hypothesized associations with the outcome variables. Definitions and sources of each covariate are shown in Table 2. Variable values were extracted at the georeferenced cluster locations in Python. In addition, time was calculated in continuous months since January 1st, 2005, based on the date of survey interview and log transformed. This choice was informed by the assumption that changes in material used for housing tend to be infrequent and unidirectional–i.e. households seldom change the material used in their construction and when they do, it is generally from unfinished to finished and not the other way. Countries were grouped into the six regions used for administrative purposes by the World Bank [40], and this categorical variable was also treated as a covariate so that, for countries with no available survey data, estimates would be based partly on regional averages.

thumbnail
Table 2. Definitions and sources of variables included as covariate predictors in the model.

https://doi.org/10.1371/journal.pgph.0003338.t002

Analysis

To reduce the database size and computational demands, and to neutralize the issue of within-cluster correlation, one household with non-missing outcome value was randomly sampled per cluster and retained for analysis (this selection was done separately for each of the three outcomes). This also ensured that each sampled location had just one value for the outcome variable. Due to the computational demands of performing geospatial analysis at the global scale, we recoded all outcomes to be binary, by collapsing two of the response categories together (“rudimentary” was grouped with “natural”) to give “improved” / “unimproved” response categories as shown in Table 1, and in a modification of the schema proposed by Florey and Taylor (those authors grouped rudimentary and finished walls and roofs into the improved category, but not floors, however we opted for a consistent categorization across components to facilitate comparison between outcome variables [18]).

Exploratory spatial data analysis.

We first assessed the presence of spatial autocorrelation by generating semi-variograms of the Pearson residuals from a non-spatial logistic regression that included all explanatory variables listed in Table 2 (S1 Text). We fit spherical spatial correlation models to each semi-variogram and estimated the nugget, range, and sill for each outcome. The semi-variograms and respective models were estimated using the gstat R package [41]. Together with the nugget:sill ratio and the estimated range, we determined that an explicitly spatial modeling approach was required to account for the non-trivial spatial correlation in the Pearson residuals.

Model fitting.

Given the massive spatial scale of the database, with hundreds of thousands of points spanning most of the globe, incorporating spatial correlation into the models presented computational challenges. We used the inlabru R package to implement an integrated nested Laplace approximation (INLA) modeling approach in which all locations are projected onto a coarsened grid or “mesh” containing several thousand vertices that carry the spatial information and can be reprojected onto the observed data [42,43]. INLA models approximate Bayesian models by constructing the posterior distribution and then applying Laplace approximations, thus bypassing the need for time-consuming Markov chain Monte Carlo sampling and making global-scale computation feasible. All coordinates were transformed via the Mollweide projection and scaled into kilometers prior to analysis. The mesh used for modelling had 18,352 vertices, placed within continental boundaries and spatial correlation was specified using a stationary Matérn covariance function. Further details on the implementation of the INLA model are provided in S1 Text.

Model predictions.

Predicted probabilities for each outcome were made for all locations in the domain of interest (the LMICs) at 0.05 decimal degree2 resolution and exported in Georeferenced Tag Image File format (GeoTIFF). The spatial covariates from Table 2 along with the time variable were used to generate predicted logistic distribution probability of the finished class of each building material from the INLA model. A value for time corresponding to the first of January 2023 was used for making predictions. Missing pixel values were filled by performing imputation using k-Nearest Neighbors method by Python Scikit-learn package [44].

Model evaluation.

The predictive performance of the spatial models was assessed by calculating common metrics of recall (sensitivity), precision (positive predictive value), accuracy (the proportion correctly classified), F1-score (mean of precision and recall), and area under the receiver operating characteristic curve (ROC-AUC). For each performance metric, two multiclass averaging metrics (macro and weighted average) were calculated, including macro averaging and weighted macro averaging, given by: (1) (2) Where Pri is the precision calculated from the multiple class predictions and Obsi is the number of observations of one class. n is the total number of observations of all classes. To assess the relative contribution of each spatial covariate to the models, feature importance values were approximated by taking the absolute value of the ratio between the mean fixed effect estimates and their standard deviations, a Bayesian analog of the standardized coefficient, which were then plotted for each model.

Ethics statement

All human subject information used in this analysis was anonymized, publicly available secondary data, and therefore ethical approval was not required or sought. For data provided by the DHS Program, data access requests (including for the displaced cluster coordinates) were submitted and authorized through the Program’s website. A completed checklist of Guidelines for Accurate and Transparent Health Estimates Reporting (GATHER [45]) is included in S1 Text.

Results

350 nationally representative household surveys (together containing data from more than 6 million households in 283,000 clusters) met the inclusion criteria, reported information on construction material types for one or more of the dwelling components and were included in the model training dataset. Fig 1 shows the number of surveys contributed by each LMIC, while S1 Data gives the national level distribution of each of the three housing construction variables in each survey (before within-cluster sub-sampling, and without sample weights applied). All eligible surveys included information on floor material; however, wall and roof material information were only available from 328 and 324 surveys respectively. No relevant data from household surveys could be found for several LMICs with large geographies and populations, most notably China, Iran, Venezuela, Libya, and Malaysia, as well as the smaller countries of Eritrea, North Korea, Lebanon, Equatorial Guinea, and numerous island nations such as Sri Lanka.

thumbnail
Fig 1. Number of nationally representative household surveys included in input dataset by country for included LMICs (small countries represented by circles).

Base map compiled from shapefiles obtained from U.S. Department of State—Humanitarian Information Unit [63] and Natural Earth free vector map data @ naturalearthdata.com that are made available in the public domain with no restrictions.

https://doi.org/10.1371/journal.pgph.0003338.g001

Fig 2 shows the geographical distribution of the coverage of improved materials predicted by the INLA models for each of the three binary dwelling component variables across the domain of included LMICs. These predictions are also provided as raster TIFF files available on the Dryad data repository (https://doi.org/10.5061/dryad.cjsxksnf8). There are some similarities across the variables, with low coverage predicted for all three across a wide belt of the Sahel and southern Sahara regions of Africa, much of inland Amazonia, and areas of the Tibetan plateau, as well as individual countries including the Democratic Republic of the Congo, Mozambique, Madagascar, Pakistan, and Papua New Guinea. High coverage of all three improved components coincided across much of the Middle East, Mediterranean North Africa, the coast of the Bight of Benin, the Caribbean, sub-Amazonian Brazil, southern Argentina, and South Africa. However, divergence in coverage of the three variables is evident over many locations. Across Kazakhstan, Mongolia, Azerbaijan, Cambodia and Laos, low coverage of improved floors, but high coverage of walls and roofs were predicted, while in Afghanistan, the reverse was the case. Yemen has mostly high improved floor coverage predicted, but low improved roof and mixed improved wall coverage, while on the island of Borneo, that pattern is reversed. Importantly, sub-national patterns are clearly visible, for example, with respect to improved floors, walls, and roofs in India, China, Mexico, and Brazil.

thumbnail
Fig 2.

Coverage of improved material for three dwelling components—a. floors, b. walls, c. roofs–in LMICs predicted by integrated nested Laplace approximation (INLA) models fitted to household survey data with inset maps showing details at a smaller zoom extent. Base maps compiled from shapefiles obtained from U.S. Department of State—Humanitarian Information Unit [63] and Natural Earth free vector map data @ naturalearthdata.com that are made available in the public domain with no restrictions.

https://doi.org/10.1371/journal.pgph.0003338.g002

Fig 3 shows ridge plots visualizing the distribution of predicted values for the coverage of improved status for each of the three dwelling components and stratified by the six world regions. The distribution of improved roofs was highly concentrated at values very close to 100% in the Central Asia region, findings which are borne out by the input data, in which most surveys recorded a coverage of finished roofs greater than 97% (S1 Data). This was true to a far lesser extent for other regions—with the exception of Sub-Saharan Africa, which had predicted values much more evenly dispersed along the range of values–and for improved walls, though the South Asia region had a much more dispersed, bimodal distribution for the latter variable. For improved floors, predicted values were highly concentrated at the low extreme of Sub-Saharan Africa.

thumbnail
Fig 3. Distribution of values predicted for coverage of improved dwelling components by INLA models, stratified by component and world region.

https://doi.org/10.1371/journal.pgph.0003338.g003

Fig 4 visualizes the feature importance values for each spatial covariate in each of the three models, including for the separate categories of the four factor variables (with time excluded to facilitate comparison between the geographic variables). There were strong similarities in the importance rankings of the variables between the three components, with nighttime light, human footprint and accessibility exhibiting high importance and categories of land cover and use–notably the “built up” and “cropland” classifications–and urbanicity—peri-urban and urban areas–also contributing considerably. Many of the environmental and hydroclimatic variables–notably climate zone, but also EVI, potential evapotranspiration, temperature range, and aridity–showed only modest or negligible feature importance.

thumbnail
Fig 4. Feature importance for each of the variables and their categories included in the final model for each of the dwelling components (excluding time.

HDI–Human Development Index; EVI–Enhanced Vegetation Index; LST–Land Surface Temperature; ET–Evapotranspiration; GDP–Gross Domestic Product; MENA–Middle East and North Africa; LAC–Latin America and the Caribbean; SA–South Asia; EAP–East Asia and Pacific; SSA–Sub-Saharan Africa. Comparison categories for factor variables are Land cover/use–tree cover; Climate zone–tropical; Region–Europe and Central Asia; Urbanicity—remote).

https://doi.org/10.1371/journal.pgph.0003338.g004

Table 3 gives statistics that evaluate the models’ performance in classifying household construction material types for the three dwelling components. Across the whole database, floors were the dwelling component for which coverage of improved construction material was lowest at 57.9%, the equivalent coverage for walls and roofs being 67.1% and 80.3% respectively. While precision, recall and F1-score statistics were generally high for the unimproved category in all models, they varied considerably for the improved category, particularly for the roofs model, for which recall, and F1-score were just 0.4 and 0.5 respectively. However, the roofs model was the one with the highest weighted average for those three statistics (a precision of 0.84, recall of 0.85 and F1-score of 0.83, compared with 0.78, 0.79, and 0.78 respectively for the walls and 0.77 for all three statistics for the floors model). All three models demonstrated similarly strong discriminatory power and performance in distinguishing between households with improved and unimproved construction materials in the respective dwelling components, with ROC-AUC statistics of 0.85–0.87.

thumbnail
Table 3. Evaluation statistics for models of construction materials for three dwelling components.

https://doi.org/10.1371/journal.pgph.0003338.t003

Discussion

Housing infrastructure and quality are major determinants of infectious disease risk and other health outcomes, particularly in regions of the world where vector borne, waterborne and neglected tropical diseases are endemic. Although, the nature of this relationship is complex and multifaceted and varies depending on the specific pathogen and vector species, it highlights the importance of targeting interventions to mitigate these adverse health outcomes, particularly in LMICs where the overwhelming majority of childhood mortality occurs. As attention turns to improving housing quality in low-resource settings as a strategy for controlling infectious diseases, it is important to quantify the geographical distribution of improvements to the major dwelling components to identify and target resources towards populations at risk. This study is the first attempt to meet this objective.

The findings indicate that the use of improved material in housing construction varies markedly by geography and at different scales. Each world region exhibits a distinct distribution of predicted coverage, and within countries themselves, as well as between neighboring countries, the prevalence of improved dwelling components often varies widely (as is visible for Nigeria and central Africa on the inset maps in Fig 2). There are several possible explanations for what drives this variation. It is superficially plausible that differences in the prevailing environmental and climatological conditions from one location to another might determine preferences among the population to and local adaptations of construction methods for the use of particular materials that provide optimal ventilation, insulation or imperviousness to moisture. For example, some populations in Sub-Saharan Africa express a preference for traditional thatched roofs over metal replacements, because they allow for open eaves that provide ventilation during the daytime heat [4,64]. However, none of the climatological variables included in this analysis (temperature range, aridity, evapotranspiration), proved to be important predictors of these outcomes, suggesting that this explanation may be misplaced. Alternatively, in places where certain raw materials are naturally abundant it may make more practical sense to forage for these rather than transporting improved materials from elsewhere. Areas with dense vegetation cover may be more conducive to building with wattle, wicker or thatch. The relative importance of land cover and use, which includes both tree cover and short vegetation as categories, to all three models might lend credence to this explanation, though EVI, a measure of vegetation density, contributed little to any of the models. However, the overwhelming importance of urbanicity and related variables (built up land use, nighttime lights, accessibility) as determinants of material used in all three dwelling components might suggest that a lack of economic integration and market access is the major constraint on further gains in housing improvement. Under this hypothesis the observed variation would be driven by differences in supply of rather than demand for improvements in housing material, with supply in turn constrained by economic connections, resources and infrastructure.

In their study mapping changes in housing in sub-Saharan Africa, Tusting and colleagues found broadly similar patterns to this study for that region using only DHS surveys, with coverage of finished construction materials lowest in central Africa, and highest along the West African coast and in the countries bordering South Africa. However, by extending our analytical domain to include other regions, we find important areas of divergence and differences in distributions between the three dwelling components. This could be due to geographical differences in cultural preferences or policy priorities that determine the order in which resources are invested in the three components as human development improves.

This study is subject to several limitations. Our characterization of housing was constrained by the availability of data from household surveys, which generally only ask about just three components, and don’t include questions about other relevant features of the built household environment, such as screens covering openings [65] elevation of sleeping areas or improvements to windows and ventilation [66]. Although the variables were originally in three-class ordinal categorical format, we had to combine categories and model them as dichotomous, because there is currently no way to address adjacent categories and parallel odds using the INLA modeling approach. Additionally, our spatial models assume a stationary (i.e., global) covariance structure that does not vary across the globe. This is likely an oversimplification of the latent spatial effects; however, estimating a non-stationary spatial model at the global scale falls outside the scope of the current article and presents a worthwhile future direction. Likewise, improving the precision of the mesh used by INLA may improve predictions, but with ROC-AUC values already relatively high, this is likely to yield only marginal gains.

Despite these limitations, this study fills an important knowledge gap for targeting geographic areas where housing improvements can be made a priority.. Mosquitoes that transmit malaria (Anopheles spp.), dengue (Aedes spp.), filariasis and Japanese encephalitis (Culex spp.) often enter the home through eaves and other openings [67] and rest on walls and ceilings after ingesting a blood meal (the basis behind indoor residual spraying [IRS] of these surfaces as a malaria control intervention). Indeed, in Africa, 80% of malaria transmission occurs indoors [3] and houses constructed of natural material provide more points of entry [67,68] and preferred resting places [69] for malaria-transmitting mosquitoes, putting housing improvements on the research agendas as potential disease control strategies [66,68]. In rural Gambia, reductions in intradomiciliary mosquito vector abundance and survival were found through installing plywood ceilings [70], closing eaves in thatched roofs [71,72], and replacing thatch with ventilated metal roofing [73]. In rural Uganda, living in a house constructed of traditional materials was associated with increased clinical malaria incidence [74] and parasitemia in children [75] and pregnant women [76], and decreased effectiveness of IRS in reducing Anopheles biting rates [75]. Similar protective effects against malaria outcomes have been documented in Burkina Faso [77], Ethiopia [78], Laos [79], Malawi [80], South Africa [81], and Tanzania [82]. Aside from mosquito-borne illnesses, living in households with walls made of mud or thatch carries an increased risk of leishmaniasis infection and indoor abundance of sandfly vectors [83], while in the Americas, Chagas Disease vectors (triatomine bugs) are drawn to houses with thatched palm roofs and mud walls [84]. In a Guatemalan community, for example, the odds of triatomine presence were 3.85 times higher in houses with walls that lacked plastering [85], while in rural Paraguay, an intervention to provide houses with smooth, flat and crack-free walls, reduced triatomine infestation by 96.4%, a comparable effect to that of fumigation [86].

Conclusions

In conclusion, this study applies a spatially explicit modeling approach to a very large dataset, representative of but standardized across diverse geographies, and collected through rigorous and standardized methodologies. The findings allow us to assess the predictive performance of the models as well as the relative contribution of particular covariate variables, and the resulting predictions are made available in a format that’s readily useable by researchers, program planners and other stakeholders (available from https://datadryad.org/stash/dataset/doi:10.5061/dryad.cjsxksnf8). The prevalence of improved roofs and walls is high in the Central Asia, East Asia and Pacific and Latin America and the Caribbean regions, while coverage of improvements in all three components, but most notably floors, is low in Sub-Saharan Africa. The strongest determinants of dwelling component quality, tend to be those relating to urbanization and economic integration, suggesting that housing improvement programs should focus on supply-side interventions that provide the resources for these improvements directly to the populations that need them rather than generating broad based demand. The analytical approach can be repurposed with minimal adaptation for other markers of disease risk measured by household surveys such as water and sanitation access, livestock ownership, and childhood nutrition indicators.

Supporting information

S1 Text. Supplementary methods and guideline compliance.

https://doi.org/10.1371/journal.pgph.0003338.s001

(PDF)

S1 Data. National level distribution of each of the three housing construction variables in each survey.

https://doi.org/10.1371/journal.pgph.0003338.s002

(XLSX)

S2 Data. Spatial variation in housing construction material in low- and middle-income countries–raster files of prevalence estimates and standard errors.

Available at https://doi.org/10.5061/dryad.cjsxksnf8.

https://doi.org/10.1371/journal.pgph.0003338.s003

(ZIP)

References

  1. 1. United Nations. 17 Goals—Learn About the SDGs. [cited 26 Oct 2018]. http://17goals.org/.
  2. 2. Krieger J, Higgins DL. Housing and health: time again for public health action. Am J Public Health. 2002;92: 758–768. pmid:11988443
  3. 3. Haines A, Bruce N, Cairncross S, Davies M, Greenland K, Hiscox A, et al. Promoting health and advancing development through improved housing in low-income settings. J Urban Health. 2013;90: 810–831. pmid:23271143
  4. 4. Knudsen J, von Seidlein L. Healthy homes in tropical zones: improving rural housing in Asia and Africa. London: Axel Menges; 2014.
  5. 5. Costantini C, Sagnon N, della Torre A, Coluzzi M. Mosquito behavioural aspects of vector-human interactions in the Anopheles gambiae complex. Parassitologia. 1999;41: 209–217. pmid:10697859
  6. 6. Trpis M, Hausermann W. Genetics of house-entering behaviour in East African populations of Aedes aegypti (L.) (Diptera: Culicidae) and its relevance to speciation. Bulletin of Entomological Research. 1978;68: 521–532.
  7. 7. Gao Y, Zhang L, Kc A, Wang Y, Zou S, Chen C, et al. Housing environment and early childhood development in sub-Saharan Africa: A cross-sectional analysis. PLoS Med. 2021;18: e1003578. pmid:33872322
  8. 8. Colston JM, Faruque ASG, Hossain MJ, Saha D, Kanungo S, Mandomando I, et al. Associations between Household-Level Exposures and All-Cause Diarrhea and Pathogen-Specific Enteric Infections in Children Enrolled in Five Sentinel Surveillance Studies. International Journal of Environmental Research and Public Health. 2020;17: 8078. pmid:33147841
  9. 9. Schiaffino F, Rengifo Trigoso D, Colston JM, Paredes Olortegui M, Shapiama Lopez WV, Garcia Bardales PF, et al. Associations among Household Animal Ownership, Infrastructure, and Hygiene Characteristics with Source Attribution of Household Fecal Contamination in Peri-Urban Communities of Iquitos, Peru. Am J Trop Med Hyg. 2021;104: 372–381. pmid:33146117
  10. 10. Exum NG, Olórtegui MP, Yori PP, Davis MF, Heaney CD, Kosek M, et al. Floors and Toilets: Association of Floors and Sanitation Practices with Fecal Contamination in Peruvian Amazon Peri-Urban Households. Environmental Science & Technology. 2016;50: 7373–7381. pmid:27338564
  11. 11. Cattaneo M, Galiani S, Gertler P, Martinez S, Titiunik R. Housing, Health and Happiness. CEDLAS, Working Papers. 2008 [cited 25 Apr 2019]. https://ideas.repec.org/p/dls/wpaper/0074.html.
  12. 12. Benjamin-Chung J, Crider YS, Mertens A, Ercumen A, Pickering AJ, Lin A, et al. Household finished flooring and soil-transmitted helminth and Giardia infections among children in rural Bangladesh and Kenya: a prospective cohort study. Lancet Glob Health. 2021;9: e301–e308. pmid:33607029
  13. 13. Badr HS, Colston JM, Nguyen N-LH, Chen YT, Ali SA, Rayamajhi A, et al. Spatiotemporal variation in risk of Shigella infection in childhood: a global risk mapping and prediction model using individual participant data. medRxiv; 2022. p. 2022.08.04.22277641.
  14. 14. Getachew A, Tadie A, G.Hiwot M, Guadu T, Haile D, G.Cherkos T, et al. Environmental factors of diarrhea prevalence among under five children in rural area of North Gondar zone, Ethiopia. Italian Journal of Pediatrics. 2018;44: 95. pmid:30115077
  15. 15. Paul P. Socio-demographic and environmental factors associated with diarrhoeal disease among children under five in India. BMC Public Health. 2020;20: 1886. pmid:33287769
  16. 16. Tusting LS, Gething PW, Gibson HS, Greenwood B, Knudsen J, Lindsay SW, et al. Housing and child health in sub-Saharan Africa: A cross-sectional analysis. PLoS Med. 2020;17: e1003055. pmid:32203504
  17. 17. Tusting LS, Bottomley C, Gibson H, Kleinschmidt I, Tatem AJ, Lindsay SW, et al. Housing Improvements and Malaria Risk in Sub-Saharan Africa: A Multi-Country Analysis of Survey Data. von Seidlein L, editor. PLOS Medicine. 2017;14: e1002234. pmid:28222094
  18. 18. Florey L, Taylor C. Using household survey data to explore the effects of improved housing conditions on malaria infection in children in Sub-Saharan Africa. Rockville, Maryland, USA: ICF International; 2016 Aug. https://dhsprogram.com/publications/publication-AS61-Analytical-Studies.cfm.
  19. 19. Aftab A, Noor A, Aslam M. Housing quality and its impact on Acute Respiratory Infection (ARI) symptoms among children in Punjab, Pakistan. PLOS Glob Public Health. 2022;2: e0000949. pmid:36962576
  20. 20. Mengersen K, Morawska L, Wang H, Murphy N, Tayphasavanh F, Darasavong K, et al. The effect of housing characteristics and occupant activities on the respiratory health of women and children in Lao PDR. Sci Total Environ. 2011;409: 1378–1384. pmid:21300397
  21. 21. Islam F, Sarma R, Debroy A, Kar S, Pal R. Profiling Acute Respiratory Tract Infections in Children from Assam, India. J Glob Infect Dis. 2013;5: 8–14. pmid:23599611
  22. 22. Akinyemi JO, Morakinyo OM. Household environment and symptoms of childhood acute respiratory tract infections in Nigeria, 2003–2013: a decade of progress and stagnation. BMC Infect Dis. 2018;18: 296. pmid:29970043
  23. 23. Perin J, Mulick A, Yeung D, Villavicencio F, Lopez G, Strong KL, et al. Global, regional, and national causes of under-5 mortality in 2000–19: an updated systematic analysis with implications for the Sustainable Development Goals. The Lancet Child & Adolescent Health. 2022;6: 106–115. pmid:34800370
  24. 24. Deshpande A, Miller-Petrie MK, Lindstedt PA, Baumann MM, Johnson KB, Blacker BF, et al. Mapping geographical inequalities in access to drinking water and sanitation facilities in low-income and middle-income countries, 2000–17. The Lancet Global Health. 2020;8: e1162–e1185. pmid:32827479
  25. 25. Chipeta MG, Kumaran EPA, Browne AJ, Hamadani BHK, Haines-Woodhouse G, Sartorius B, et al. Mapping local variation in household overcrowding across Africa from 2000 to 2018: a modelling study. The Lancet Planetary Health. 2022;6: e670–e681. pmid:35932787
  26. 26. Graetz N, Woyczynski L, Wilson KF, Hall JB, Abate KH, Abd-Allah F, et al. Mapping disparities in education across low- and middle-income countries. Nature. 2019. pmid:31875853
  27. 27. Data For Good at Meta. Relative Wealth Index. 2022 [cited 2 Sep 2022]. https://dataforgood.facebook.com/dfg/tools/relative-wealth-index#methodology.
  28. 28. Tusting LS, Bisanzio D, Alabaster G, Cameron E, Cibulskis R, Davies M, et al. Mapping changes in housing in sub-Saharan Africa from 2000 to 2015. Nature. 2019 [cited 17 Oct 2019]. pmid:30918405
  29. 29. Colston JM, Chernyavskiy P, Gardner LM, Fang B, Houpt E, Swarup S, et al. The Planetary Child Health and Enterics Observatory (Plan-EO): a Protocol for an Interdisciplinary Research Initiative and Web-Based Dashboard for Climate-Informed Mapping of Enteric Infectious Diseases and their Risk Factors and Interventions in Low- and Middle-Income Countries. 18 Apr 2023 [cited 24 Apr 2023]..
  30. 30. Organisation for Economic Co-operation and Development. DAC List of ODA Recipients. In: OECD [Internet]. 2020 [cited 10 Dec 2021]. https://www.oecd.org/dac/financing-sustainable-development/development-finance-standards/daclist.htm.
  31. 31. ICF International. Demographic and Health Surveys (various, 2000–2021). Rockville, Maryland, USA: ICF International; 2021.
  32. 32. UNICEF. Multiple Indicator Cluster Surveys (various, 2000–2021). New York, NY: UNICEF; 2021.
  33. 33. Croft TN, Allen CK, Zachary BW. Guide to DHS Statistics. Rockville, Maryland, USA: ICF International; Report No.: 8.
  34. 34. Perez-Haydrich C, Warren JL, Burgert CR, Emch ME. Guidelines on the use of DHS GPS data. 2013 [cited 22 Jul 2022]. https://dhsprogram.com/publications/publication-SAR8-Spatial-Analysis-Reports.cfm.
  35. 35. Burgert C, Colston J, Roy T, Zachary B. Geographic displacement procedure and georeferenced data release policy for the Demographic and Health Surveys. Calverton, MD, USA: MEASURE DHS; 2013 Sep. http://dhsprogram.com/publications/publication-SAR7-Spatial-Analysis-Reports.cfm.
  36. 36. OpenStreetMap contributors. Humanitarian OpenStreetMap Populated Places Datasets (OpenStreetMap Export). 2022 [cited 22 Jul 2022]. https://data.humdata.org/organization/hot.
  37. 37. Tatem AJ. WorldPop, open data for spatial demography. Scientific Data. 2017;4: 170004. pmid:28140397
  38. 38. ESRI. ArcGIS Desktop: Release 10.8. Redlands, CA: Environmental Systems Research Institute; 2019. https://desktop.arcgis.com/en/desktop/.
  39. 39. StataCorp. Stata Statistical Software: Release 18. College Station, TX: StataCorp LLC; 2023.
  40. 40. The World Bank. The world by region. In: SDG Atlas 2017 [Internet]. 2017 [cited 10 Dec 2021]. https://datatopics.worldbank.org/sdgatlas/archive/2017/the-world-by-region.html.
  41. 41. Pebesma EJ. Multivariable geostatistics in S: the gstat package. Computers & Geosciences. 2004;30: 683–691.
  42. 42. Bachl FE, Lindgren F, Borchers DL, Illian JB. inlabru: an R package for Bayesian spatial modelling from ecological survey data. Methods in Ecology and Evolution. 2019;10: 760–766.
  43. 43. Lindgren F, Bachl FE, Borchers DL, Simpson D, Scott-Howard L, Andy S, et al. inlabru: Bayesian Latent Gaussian Modelling using INLA and Extensions. 2023. https://cran.r-project.org/web/packages/inlabru/index.html.
  44. 44. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research. 2011;12: 2825–2830.
  45. 45. Stevens GA, Alkema L, Black RE, Boerma JT, Collins GS, Ezzati M, et al. Guidelines for Accurate and Transparent Health Estimates Reporting: the GATHER statement. The Lancet. 2016;388: e19–e23. pmid:27371184
  46. 46. Weiss DJ, Nelson A, Gibson HS, Temperley W, Peedell S, Lieber A, et al. A global map of travel time to cities to assess inequalities in accessibility in 2015. Nature. 2018;553: 333–336. pmid:29320477
  47. 47. Trabucco A, Zomer R J. Global Aridity Index and Potential Evapo-Transpiration (ET0). CGIAR Consortium for Spatial Information (CGIAR-SCI); 2018. Report No.: 2. https://cgiarcsi.comunity.
  48. 48. Beck HE, Zimmermann NE, McVicar TR, Vergopolan N, Berg A, Wood EF. Present and future Köppen-Geiger climate classification maps at 1-km resolution. Sci Data. 2018;5: 180214. pmid:30375988
  49. 49. Ramankutty N, Evan AT, Monfreda C, Foley JA. Farming the planet: 1. Geographic distribution of global agricultural lands in the year 2000. Global Biogeochemical Cycles. 2008;22: n/a–n/a.
  50. 50. Natural Earth. Rivers and Lakes Centerlines 4.1.0. 2021. https://www.naturalearthdata.com/downloads/10m-physical-vectors/10m-rivers-lake-centerlines/.
  51. 51. Hastings DA, Dunbar PK. Global Land One-kilometer Base Elevation (GLOBE) Digital Elevation Model, Documentation, Volume 1.0. Boulder, Colorado: National Oceanic and Atmospheric Administration, National Geophysical Data Center; 1999.
  52. 52. Kummu M, Taka M, Guillaume JHA. Gridded global datasets for Gross Domestic Product and Human Development Index over 1990–2015. Sci Data. 2018;5: 180004. pmid:29406518
  53. 53. U.S. Geological Survey. Landsat Enhanced Vegetation Index. In: Landsat Missions [Internet]. 2021 [cited 14 Dec 2021]. https://www.usgs.gov/landsat-missions/landsat-enhanced-vegetation-index.
  54. 54. The Food and Agriculture Organization (FAO), International Institute of Applied Systems Analysis. Global Agro-ecological Zones (GAEZ v3.0). Rome, Italy and Laxenburg, Austria: FAO & IIASA; 2012. http://www.fao.org/nr/gaez.
  55. 55. Wildlife Conservation Society, Center for International Earth Science Information Network—CIESIN. Last of the Wild Project, Version 2, 2005 (LWP-2): Global Human Footprint Dataset (Geographic). Palisades, NY: NASA Socioeconomic Data and Applications Center (SEDAC); 2005. https://sedac.ciesin.columbia.edu/data/set/wildareas-v2-human-footprint-geographic.
  56. 56. Siebert S, Döll P, Hoogeveen J, Faures J-M, Frenken K, Feick S. Development and validation of the global map of irrigation areas. Hydrology and Earth System Sciences. 2005;9: 535–547.
  57. 57. Potapov P, Hansen MC, Pickens A, Hernandez-Serna A, Tyukavina A, Turubanova S, et al. The Global 2000–2020 Land Cover and Land Use Change Dataset Derived From the Landsat Archive: First Results. Front Remote Sens. 2022;3.
  58. 58. Hulley, Glynn, Hook, Simon. MOD21A1D MODIS/Terra Land Surface Temperature/3-Band Emissivity Daily L3 Global 1km SIN Grid Day V006. NASA EOSDIS Land Processes DAAC; 2017.
  59. 59. Hulley, Glynn, Hook, Simon. MOD21A1N MODIS/Terra Land Surface Temperature/3-Band Emissivity Daily L3 Global 1km SIN Grid Night V006. NASA EOSDIS Land Processes DAAC; 2017.
  60. 60. Román MO, Wang Z, Sun Q, Kalb V, Miller SD, Molthan A, et al. NASA’s Black Marble nighttime lights product suite. Remote Sensing of Environment. 2018;210: 113–143.
  61. 61. Running, Steve, Mu, Qiaozhen, Zhao, Maosheng. MOD16A2 MODIS/Terra Net Evapotranspiration 8-Day L4 Global 500m SIN Grid V006. NASA EOSDIS Land Processes DAAC; 2017.
  62. 62. Pesaresi M, Ehrlich D, Stefano F, Florcyk A, Freire SMC, Halkia S, et al. Operating procedure for the production of the Global Human Settlement Layer from Landsat data of the epochs 1975, 1990, 2000, and 2014 | EU Science Hub. Publications Office of the European Union; 2016. https://ec.europa.eu/jrc/en/publication/operating-procedure-production-global-human-settlement-layer-landsat-data-epochs-1975-1990-2000-and.
  63. 63. U.S. Department of State—Humanitarian Information Unit. Global LSIB Polygons Detailed—Humanitarian Data Exchange. Humanitarian Data Exchange (HDX); https://data.humdata.org/dataset/global-lsib-polygons-detailed.
  64. 64. Knudsen JB, Pinder M, Jatta E, Jawara M, Yousuf MA, Søndergaard AT, et al. Measuring ventilation in different typologies of rural Gambian houses: a pilot experimental study. Malar J. 2020;19: 273. pmid:32736629
  65. 65. Furnival-Adams J, Olanga EA, Napier M, Garner P. House modifications for preventing malaria. Cochrane Database Syst Rev. 2020;10: CD013398. pmid:33058136
  66. 66. Mshamu S, Mmbando A, Meta J, Bradley J, Bøjstrup TC, Day NPJ, et al. Assessing the impact of a novel house design on the incidence of malaria in children in rural Africa: study protocol for a household-cluster randomized controlled superiority trial. Trials. 2022;23: 519. pmid:35725486
  67. 67. Lwetoijera DW, Kiware SS, Mageni ZD, Dongus S, Harris C, Devine GJ, et al. A need for better housing to further reduce indoor malaria transmission in areas with high bed net coverage. Parasites Vectors. 2013;6: 57. pmid:23497471
  68. 68. Kua KP, Lee SWH. Randomized trials of housing interventions to prevent malaria and Aedes-transmitted diseases: A systematic review and meta-analysis. PLoS One. 2021;16: e0244284. pmid:33417600
  69. 69. Msugupakulya BJ, Kaindoa EW, Ngowo HS, Kihonda JM, Kahamba NF, Msaky DS, et al. Preferred resting surfaces of dominant malaria vectors inside different house types in rural south-eastern Tanzania. Malar J. 2020;19: 22. pmid:31941508
  70. 70. Lindsay SW, Jawara M, Paine K, Pinder M, Walraven GEL, Emerson PM. Changes in house design reduce exposure to malaria mosquitoes. Tropical Medicine & International Health. 2003;8: 512–517. pmid:12791056
  71. 71. Jatta E, Jawara M, Bradley J, Jeffries D, Kandeh B, Knudsen JB, et al. How house design affects malaria mosquito density, temperature, and relative humidity: an experimental study in rural Gambia. The Lancet Planetary Health. 2018;2: e498–e508. pmid:30396441
  72. 72. Kirby MJ, West P, Green C, Jasseh M, Lindsay SW. Risk factors for house-entry by culicine mosquitoes in a rural town and satellite villages in The Gambia. Parasites & Vectors. 2008;1: 41. pmid:18939969
  73. 73. Lindsay SW, Jawara M, Mwesigwa J, Achan J, Bayoh N, Bradley J, et al. Reduced mosquito survival in metal-roof houses may contribute to a decline in malaria transmission in sub-Saharan Africa. Sci Rep. 2019;9: 7770. pmid:31123317
  74. 74. Snyman K, Mwangwa F, Bigira V, Kapisi J, Clark TD, Osterbauer B, et al. Poor Housing Construction Associated with Increased Malaria Incidence in a Cohort of Young Ugandan Children. The American Journal of Tropical Medicine and Hygiene. 2015;92: 1207–1213. pmid:25870429
  75. 75. Rek JC, Alegana V, Arinaitwe E, Cameron E, Kamya MR, Katureebe A, et al. Rapid improvements to rural Ugandan housing and their association with malaria from intense to reduced transmission: a cohort study. The Lancet Planetary Health. 2018;2: e83–e94. pmid:29615240
  76. 76. Okiring J, Olwoch P, Kakuru A, Okou J, Ochokoru H, Ochieng TA, et al. Household and maternal risk factors for malaria in pregnancy in a highly endemic area of Uganda: a prospective cohort study. Malaria Journal. 2019;18: 144. pmid:31014336
  77. 77. Yé Y, Hoshen M, Louis V, Séraphin S, Traoré I, Sauerborn R. Housing conditions and Plasmodium falciparum infection: protective effect of iron-sheet roofed houses. Malar J. 2006;5: 8. pmid:16451729
  78. 78. Ghebreyesus TA, Haile M, Witten KH, Getachew A, Yohannes M, Lindsay SW, et al. Household risk factors for malaria among children in the Ethiopian highlands. Transactions of the Royal Society of Tropical Medicine and Hygiene. 2000;94: 17–21. pmid:10748890
  79. 79. Hiscox A, Khammanithong P, Kaul S, Sananikhom P, Luthi R, Hill N, et al. Risk factors for mosquito house entry in the Lao PDR. PLoS One. 2013;8: e62769. pmid:23700411
  80. 80. Wolff CG, Schroeder DG, Young MW. Effect of improved housing on illness in children under 5 years old in northern Malawi: cross sectional study. BMJ. 2001;322: 1209–1212. pmid:11358772
  81. 81. Coleman M, Coleman M, Mabaso MLH, Mabuza AM, Kok G, Coetzee M, et al. Household and microeconomic factors associated with malaria in Mpumalanga, South Africa. Transactions of The Royal Society of Tropical Medicine and Hygiene. 2010;104: 143–147. pmid:19732924
  82. 82. Liu JX, Bousema T, Zelman B, Gesase S, Hashim R, Maxwell C, et al. Is Housing Quality Associated with Malaria Incidence among Young Children and Mosquito Vector Numbers? Evidence from Korogwe, Tanzania. PLOS ONE. 2014;9: e87358. pmid:24505285
  83. 83. Calderon-Anyosa R, Galvez-Petzoldt C, Garcia PJ, Carcamo CP. Housing Characteristics and Leishmaniasis: A Systematic Review. Am J Trop Med Hyg. 2018;99: 1547–1554. pmid:30382013
  84. 84. Peña-García VH, Gómez-Palacio AM, Triana-Chávez O, Mejía-Jaramillo AM. Eco-Epidemiology of Chagas Disease in an Endemic Area of Colombia: Risk Factor Estimation, Trypanosoma cruzi Characterization and Identification of Blood-Meal Sources in Bugs. Am J Trop Med Hyg. 2014;91: 1116–1124. pmid:25331808
  85. 85. Bustamante DM, Monroy C, Pineda S, Rodas A, Castro X, Ayala V, et al. Risk factors for intradomiciliary infestation by the Chagas disease vector Triatoma dimidiatain Jutiapa, Guatemala. Cad Saúde Pública. 2009;25: S83–S92. pmid:19287870
  86. 86. Rojas de Arias A, Ferro EA, Ferreira ME, Simancas LC. Chagas disease vector control through different intervention modalities in endemic localities of Paraguay. Bull World Health Organ. 1999;77: 331–339. pmid:10327712