Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Community-level externalities in child health: Evidence from Sub-Saharan Africa

Abstract

We use data from twenty countries from the harmonized versions of the Demographic and Health Surveys (DHS) for countries in Sub-Saharan Africa to investigate the extent to which contextual factors such as the level of women’s schooling in the community and the fraction of households in the community with improved water and improved sanitation facilities have an impact on child health in the short-term and in the long-term. Short-term child health is measured by the incidence of reported fever and diarrhea in the two weeks preceding the survey, and long-term child health is measured by child height for age z-scores. We employ the hybrid modeling approach that combines the strengths of random-and fixed-effects models, and considers the role of unobserved heterogeneity at the community level, so far ignored in the literature, to account for all the potential synergies in externalities at the community level. We test the sensitivity of the estimates of externalities to alternative specifications for cluster-level unobserved heterogeneity. The empirical analysis confirms the role of externalities at the community level and shows that the size of these externalities varies between rural and urban areas and by outcome considered. Our results reaffirm the paramount role of externalities in maternal education on child height-for-age and on the incidence of fever and diarrhea in both urban and rural areas. No significant externalities are associated with the fraction of households in the community having improved water, though the fraction of households in the cluster with improved toilets are shown to be significantly associated with a lower probability of fever and diarrhea and improved child height-for-age, with some of these effects varying between urban and rural areas.

Introduction

The positive contribution of maternal education and access to improved water and sanitation facilities on the nutritional status of children is well established in the literature. Numerous studies based on observational data have shown that maternal education, water quality, sanitation, and handwashing (WASH) in a household are strongly associated with linear growth of children living in the same household [16]. Understandably, maternal education and WASH constitute essential components of the underlying causes and determinants of child undernutrition in the UNICEF conceptual framework [7].

However, the question of whether the investments of any household in the community have broader social and economic impacts that extend beyond the health and nutritional impacts on household’s own children has received rather uneven attention in literature. Much of the focus has been on community level externalities associated with maternal education [3, 814]. Mothers with less or no education, for example, appear to be able to benefit from information and advice provided, and even the cultural attitudes exhibited by more educated women in the community. Thus, the impacts from a mother’s education diffuse across neighbors.

Such externalities and contextual effects may also be at work with individual household investments in water and sanitation. Household investments in sanitation can have an impact across the wider neighborhood. Such investments may reduce the proliferation of and exposure to pathogens in the neighborhood’s air and soil and thus indirectly contribute to the nutritional and health status of children in neighboring households who may not have proper sanitation facilities. Individual investments improving access to water, in terms of water quality, water quantity, and/or water collection times by any given household may also result in improved overall community child nutrition. If water that is cleaner or more available increases the frequency of food preparation then pathogens may have less time to multiply sufficiently to cause disease. However, the extent to which individual investments in improved water impact the nutritional status of neighborhood children without improved water facilities needs empirical validation.

The extant empirical evidence on the externalities across households from investments in improved basic services at the neighborhood level is rather scarce. Studies generally focus on the potential spillover effects of one policy variable of interest, such as maternal education or sanitation, at a time and ignoring any potential interlinking and synergies that may arise at the community level. For example, Andres et al. [15] and Larsen et al. [16] focus on estimating potential externalities in community level sanitation without considering the externalities that might be present with higher maternal education or higher incidence of uncontaminated water at the community level. This practice of focusing on only one variable at a time may exaggerate the externalities from improved sanitation, because it attributes to sanitation, incorrectly, externalities that might be due to synergies with other community-level characteristics. For example, in communities with better sanitation, a higher fraction of mothers may have completed primary schooling, or a higher fraction of households may have access to improved water. Educated mothers are more likely to utilize sanitation infrastructure effectively and teach adequate hygienic practices and thus it is not only the improved sanitation infrastructure but also the better educated population that in fact improves the outcomes of interest. To the extent that maternal education, improved sanitation and improved water services at the community level are correlated, the omission of any of these factors is likely to result in biased estimates of the externalities under consideration due to omitted variable bias. To our knowledge the study by Alderman et al. [8] is the only exception that investigates community level externalities in maternal education, sanitation and water services at the same time.

Our study focuses on Sub-Saharan Africa (SSA). It employs data from twenty countries from the harmonized versions of the Demographic and Health Surveys (DHS) to investigate the extent to which contextual community factors impact child health and nutrition. Specifically, we estimate the impacts of women’s schooling, access to improved water and access to improved sanitation facilities at the community level on child health in the short-term and in the long-term. Short-term child health is measured by the incidence of reported fever and diarrhea in the two weeks preceding the survey, and long-term child health is measured by child height for age z-scores We use DHS clusters to define community level measures.

Inferences regarding externalities at the community level are by necessity based on estimates of the coefficients of community-level variables such as the fraction of females in the cluster with a certain level of education or the proportion of households with improved sanitation and/or improved water facilities. Random-effects (or two-level hierarchical) models, that decompose additively the regression error term into a community-wide component and a household-specific or idiosyncratic component, allow researchers to estimate the effects of community-level variables of interest on child nutrition. Inferences, however, can be unbiased only under the questionable assumption that the community level random-effects (community-level unobserved factors) are uncorrelated with the observed household and observed community-level covariates. For example, the quality of schools in the community is not observable, but may be correlated with the educational attainment variables of mothers and the community. In contrast, the within-community or within-cluster (W) or fixed -effects method that is typically the preferred method for eliminating unobserved heterogeneity at the community level, does not allow estimation of the effect of community-level variables on child nutrition. This is because the within community method models out variance across communities and makes any correlations between any unobserved heterogeneity at the community-level and covariates irrelevant.

This paper contributes to the topic of community level externalities and investments in human capital and basic services in two ways. First, the analysis pays close attention to the potential role of unobserved heterogeneity at the community level by employing an estimation strategy that addresses this problem directly. The available estimates of externalities in the literature are based on the OLS method that essentially assumes on an ad-hoc basis that there are no unobserved community factors in the regression residual that may be correlated with the included regressors. The strategy adopted here separates the effects of variables into two associations, one at the child level or within-community effects and one at the community level or between-community effects and thus combines the strengths of random- and fixed-effects models [1719]. Notable examples of this estimation strategy are the hybrid model [20] and the correlated random-effects (CRE) model [2123]. Such models yield fixed-effects (or within-cluster) estimates of the child or household level (level one) covariates that are unbiased by community level unobserved heterogeneity, while at the same time yielding unbiased estimates of the effects of community-level (level two) covariates and thus more reliable estimates of the community-level externalities on child nutrition. The hybrid/CRE model also allows consideration of all the potential synergies in externalities that might influence child nutrition and the incidence of illness through the interactions of community-level maternal education, improved sanitation and water services. Second, the method employed offers the opportunity to investigate the sensitivity of the estimates of externality effects to alternative specifications of the community/cluster-level unobserved heterogeneity. Different specifications of the relationship between cluster-level unobservables and the cluster mean values of some or all level one variables can shed light on the extent to which the estimates of externality effects are robust.

Empirical evidence on the significant role of externalities among households informs public policy decisions on the amounts of public investments in education and basic water and sanitation infrastructure, the extent to which the costs of such investments need to be subsidized so that more households can afford the service, and whether public resources should be targeted to the community as opposed to the household. Empirical evidence can also shed light on the question of whether these public investments have the potential of acting as substitutes for or complements to the private investments that individual households may be able (or unable) to undertake towards improving child health and nutrition. When there are significant positive externalities in maternal education, there is more scope for public investment as the benefits not only accrue to the mother’s own children but also other children in her community.

Materials and methods

The analysis is based on data from 20 countries in SSA for children aged zero to 59 months of age. The data were extracted from the harmonized versions of DHS surveys. This study analyzed secondary data that are publicly available and which fully de-identified. We did not have access to information that could identify individual participants during or after data collection. The research did not involve direct interaction with human participants or animals and the use of these data does not constitute human subjects research. The complete data for all the countries analyzed in our paper, are publicly accessible through the IPUMS-DHS website https://www.idhsdata.org/idhs/ with prior registration with DHS [24]. The data were first accessed on April 2, 2021. Table 1 below lists the countries covered and the years of the surveys used in the analysis.

thumbnail
Table 1. List of harmonized DHS surveys (IPUMS) analyzed.

https://doi.org/10.1371/journal.pone.0341218.t001

The child health-related variables we analyze are the reported incidence of fever and diarrhea and child heigh-for-age. The incidence of child diarrhea and fever as reported by mothers, is collected by the DHS surveys for the 2 weeks prior to the date of the survey. Post-natal linear growth as measured by height-for-age (HAZ) compares the child’s height (or length if measured lying down) to the height (length) distribution of healthy children of the same age and sex, such that HAZ measures the child’s height in terms of standard deviations from the median height. Thus, a child whose height corresponds to the median height of healthy children of the same age and sex will have a HAZ of zero. A child taller than the “median” child will have a positive value and those shorter than the “median” child will have a negative HAZ value.

The three policy-related variables of interest are the years of education of the child’s mother, the household’s access to an improved water source and the household’s access to an improved and non-shared toilet facility. Both of the infrastructure variables are constructed as binary variables at the household level and defined based on WHO/UNICEF Joint Monitoring Program guidelines. We also control for other factors which consistently available for our set of surveys and are commonly included to explain child nutrition or health [6,814,16]. Additional variables used as controls include: the child’s age in months (7 categories), the child’s gender, whether the child is mother’s first child, whether mother’s height is more than 160 cm, whether the house has a high quality roof, total number of household members, total number of children under 5 in the household, travel time to urban center with at least 20,000 people, household asset index (5 quintiles), binary variables for year of survey, and binary variables for month of survey. Table 2 presents the mean and standard deviation of all the variables in the urban and rural samples used to analyze the incidence of fever in our analysis. The sample used for the analysis of child diarrhea is practically identical, while the sample used to analyze child HAZ is a smaller subset of this sample, due to missing or outlier observations on HAZ.

thumbnail
Table 2. Means (std. deviations) of variables in the urban and rural sample used to analyze the reported incidence of fever among children 0 to 60 months of age.

https://doi.org/10.1371/journal.pone.0341218.t002

To date, the literature on externalities associated with maternal education and basic service coverage, has relied on the following stylized econometric model

(1)

where is the outcome variable of interest such as the Height-for-Age z-score (HAZ) of child i in community j, denotes policy variables of interest, such as the level of a mother’s education, or whether the household has improved water or improved sanitation, and is other child and household control variables such as child age, and household size. The error term in the regression summarizes the influence of all other unobserved variables at the individual and at the community level on the outcome variable.

An implicit and untested assumption of the existing studies focusing on community level externalities is that the error term consists of two components

(2)

where is a community level error term that can be accounted for by including the community level mean value of the main policy variable of interest, i.e., , such as the community average years of education of mothers, or the fraction of household in the community with improved water or toilets. Based on this specification, equation (3) below is estimated using OLS, i.e.,

(3)

Community/cluster-level externalities are investigated by focusing on the sign and significance level of the estimate of the parameter , the coefficient of the community-level average of maternal years of schooling or the fraction of households in the community with improved water and/or improved toilet facilities. It is important to bear in mind that the standard practice is to exclude from the regression model the community level means of the other policy variables and child characteristics and household control variables in . The key assumption for any credible evidence on the significant role of externalities based on this approach is that , which implies that there are no omitted community-level confounders, meaning that is a perfect proxy for all the community-level unobservables denoted by in equation (2) above. If any omitted confounders are correlated with the covariates, i.e., , then the estimates of the model obtained by ordinary least squares (OLS) are likely to be biased.

In contrast, the hybrid method [20] employed in this paper is motivated by the hierarchical structure underlying the fixed-effect (FE) and random effects (RE) models. It is based on the principle that the total effect of any variable at the individual level (level 1) is comprised of two components, the between-cluster and within-cluster components that can have different effects on an outcome variable. The specification of the hybrid model allows for separate within- and between-cluster effects for level 1 and level 2 variables:

(4)

where is a community level error term, is the “within-cluster” effect and the “between-cluster” effect of . The within-cluster effect assesses how on average a within-cluster change in is associated with a within-cluster change in outcome and since it is estimated using only within-cluster variation it is identical to the estimate obtained based on the fixed-effect method applied at the cluster level [25, 21]. The between-cluster effect , assesses how a unit change in the average of between clusters, i.e., , is associated with a change in the average value of between clusters, i.e., .

The hybrid model in equation (4) is estimated as a standard random intercept model. A key assumption of the random intercept model is that residuals are independent of the covariates, i.e.,

. Specifically, the assumption requires that there be no omitted community-level confounders, or no unobserved heterogeneity at the cluster level. This assumption may be considered as more defensible given that the cluster mean values of all the covariates , and not just the cluster mean of the policy variable of interest, are already included in the regression. Nevertheless, if the cluster-level error term is correlated with the covariates, i.e., , then the effect estimates of a RE model will be biased. In contrast, the FE estimator does not require any assumption on the distribution of , since it relies exclusively on variation within clusters.

The correlated random-effects model (CRE) [22], sometimes called the Mundlak model [21], is mathematically equivalent to the hybrid model in (4). The main difference is that it includes the level one variable directly into the regression equation instead of the demeaned form used in the hybrid model (i.e., :

(5)

In contrast to the standard random intercept model, the CRE model relaxes the assumption of zero correlation between the level two error, , and the level 1 variables. Specifically, it assumes that depends on the cluster mean values of all the level-1 covariates, i.e.,

(6)

Substituting in equation (5) by yields

(7)

where . In equation (7), is an estimate of the “within-cluster” effect, given that including the mean of a level one variable in a random effects model is an alternative to cluster mean centering [26], and is the “contextual” effect that explicitly models the difference between the “between” and “within” effects, i.e.,. and

(8)

Equation (8) can be estimated using the random effects method and has the advantage that it is not subject to cluster-level unobserved heterogeneity and thus able to yield unbiased estimates of the variables varying within clusters as well as of those variables that do not vary within clusters but vary only between clusters, such as and .

The hybrid/CRE approach outlined above offers two main advantages. First, it provides a fresh and clear interpretation of the estimates of the coefficients of the individual and community level variables in the literature. The studies on externalities published to date rely on the standard OLS approach discussed above without any explicit recognition of the differences in the variation of variables or their effects within-clusters and between-clusters or the potential role of unobserved heterogeneity at the community level in biasing OLS estimates. Second, the framework provides the opportunity to assess the sensitivity of the estimates of externalities on the outcomes of interest to different assumptions regarding which or how many community level variables to include in the analysis of externalities. The majority of the studies on externalities focus on one policy variable of interest generated by averaging out individual level (level 1) data at the cluster level (level 2). Such variables are usually the years of education of a mother or access to improved sanitation. The assumption of the hybrid/CRE model that depends on the cluster mean values of all the level-one covariates, instead of just the one policy variable of interest, allows for externalities arising from other covariates at the cluster level and provides the opportunity to examine how this potential synergy could affect the estimate of the externality associated with the policy variable of interest. In the results section of our paper, we also investigate whether the estimate of the externality associated with a policy variable of interest, such as maternal education, is higher or lower when the community average of the fraction of households with improved toilets and with improved water and/or other community level covariates are also included in the model, in comparison to the externality estimate associated with the policy variable of interest when community level covariates are excluded from the regression.

An additional advantage is the ability to test the equality of within and between cluster effects, that can also be used as an alternative to the Hausman specification test [27]. If between and within effects are the same, i.e., then (8) collapses to the simple random-intercept model (3). A purely empirically-driven approach to the CRE model could be to assume that depends on the community average of all the level one variables in the model, test the random effects hypothesis for each level one variable, and then re-estimate the CRE model including the community level means of the variables for which the test rejects the RE hypothesis. The hybrid modeling approach can also be extended to include random slopes allowing the effects of child-level variables to vary between clusters.

One important issue associated with the construction of community-level (level 2) variables by averaging out individual-level (level one) variables in the community/cluster is the number of household observations in a cluster. A small number of households/individuals sampled in any given cluster gives rise to errors in the variable, a problem that leads to attenuation bias, and thus, towards an inappropriate failure to reject the null hypothesis (that is, towards finding that community effects are not important, when, in fact, they are). This problem is exacerbated by the practice of using non-self means derived by summing the variable of interest over the sample cluster and then subtracting the observation for the household and dividing this difference by the number of households in the cluster. Schunk explores the consequences of small cluster size in linear multilevel models of level 2 variables (such as cluster means) that are functions of level one variables [28]. Using simulations, he reports that small cluster sizes can cause severe downward bias in estimated regression coefficients of aggregated level two variables and that the bias does not decrease if the number of clusters (i.e., level 2 units) increases. In our analysis we calculate community level means based on the sample of households with children under 5 years of age because the hybrid modeling approach is applicable only to the sample being analyzed. Studies following the standard ad-hoc approach include in the community mean observations from households in the sampled cluster that do not have children below five years of age. (e.g., see [8]). Ultimately, the extent to which the use of non-self-means or the use of all households sampled in the cluster has any substantial impact on the estimates of the size of the estimated externalities is an empirical question.

The preceding discussion has argued that estimates of externalities based on OLS and ignoring synergies may be biased due to cluster-level unobserved heterogeneity that may be correlated with the included regressors. The next section examines the size of this bias at the empirical level and addresses the question of whether such bias leads to misleading conclusions about the role externalities at the community level. We have also explored the option of analyzing possible externalities associated with having simultaneous access to improved water and sanitation at the household level. Unfortunately, the sample sizes were not sufficient to support such an analysis.

Results

Tables 3– report the estimates of externalities associated with OLS and hybrid models on the incidence of fever and diarrhea based on a linear probability model (LPM) in urban and rural areas respectively. We have also estimated logit models and confirmed that the main results were identical regarding the presence of externalities. In addition to the three policy variable of interests, the control variables used in the analysis included: the child’s age in months (7 categories), the child’s gender, whether the child is mother’s first child, whether mother’s height is more than 160 cm, whether the house has a high quality roof, total number of household members, total number of children under 5 in the household, travel time to urban center with at least 20,000 people, household asset index (5 quintiles) at the county level, binary variables for the year of survey, and binary variables for the month of survey.

thumbnail
Table 3. Effects of household-level and community/cluster level variables on child fever in urban areas, children 0 to 60 months.

https://doi.org/10.1371/journal.pone.0341218.t003

thumbnail
Table 4. Effects of household-level and community-level variables on child fever in rural areas, children 0 to 60 months.

https://doi.org/10.1371/journal.pone.0341218.t004

thumbnail
Table 5. Effects of household-level and community-level variables on child diarrhea in urban areas, children 0 to 60 months.

https://doi.org/10.1371/journal.pone.0341218.t005

thumbnail
Table 6. Effects of household-level and community-level variables on child diarrhea in rural areas, children 0 to 60 months.

https://doi.org/10.1371/journal.pone.0341218.t006

In each of these tables, columns (1) to (3) present the estimates of the external effects obtained using the typical OLS approach used in the literature, whereby potential externalities are investigated by including the cluster level average of just one of the policy variables of interest, such as the average years of education of mothers in the cluster or the fraction of households in the cluster with improved water or improved toilets. Such a specification ignores the potential of any synergies at the cluster level among policy variables of other household observed characteristics. To the extent that the policy variable of interest such as higher maternal education at the community-level interacts with other community level variable such as the fraction of households with access to improved water or improved sanitation that are omitted from the regression model, then the estimate of the external effect of maternal education is likely to be biased.

In column (4) of Tables 3 through 8, the specification Hybrid-3 includes only the three policy variables of interest by restricting potential synergies with any other household variables at the cluster level. In column (5) the specification Hybrid-All is less restrictive since it takes into consideration that there may be synergies not only among the three policy variables but also other individual observed characteristics at the cluster level (level 1). The likelihood ratio tests comparing the estimates of the restricted model Hybrid-3 (using the cluster average of only the three policy variables of interest) with the unrestricted model Hybrid-All (using the cluster average of all the control variables at the individual level) indicated that the Hybrid-All model fitted the data better in all cases.

In sum, three main findings emerge from the preferred estimates in columns (5) and (6) of Tables 3 through 6 regarding the role of externalities on the incidence of fever and diarrhea. First, the Hybrid-All estimates in column (5) of Tables 3-6 reveal that there are significant externalities associated with maternal education at the community level with respect to the incidence of fever and diarrhea. Ceteris paribus, clusters with a higher average in years of education of mothers are significantly associated with a lower probability of incidence of child fever and diarrhea in both urban and rural areas. The faction of households with improved toilets is also significantly associated with a lower probability of incidence of child fever in both urban and rural areas and a lower probability of incidence of diarrhea in urban areas.

Second, a comparison of the estimates with the standard OLS model and the Hybrid-All model (col 5) reveals that externality estimates based on the standard OLS approach may be quite misleading. Depending on the outcome and the context, when synergies are considered for the effect size of community level externalities is lower. For example, the between-cluster coefficient of maternal education on child fever in urban areas is significantly lower than that obtained using just the cluster mean of maternal education (compare col 2 with col 5 in Tables 3 and 4). Thus, the traditional approach to estimating the presence of external effects is likely to result in an overestimate of the effect of the external effects.

Third, there are instances where within-cluster effect of maternal education on the reported incidence of child fever is opposite in sign than the between-cluster effect. For example, the within-cluster effect of maternal education on child fever in urban and rural areas (col 5 in Tables 3 and 4) is positive, whereas the between-cluster effect is negative. This suggests that ceteris paribus, mothers with more education may be more likely to report their children having fever, but at the cluster level, a higher average education is associated with a lower incidence of child fever.

In Tables 7 and 8 we present the respective estimates using child HAZ as an outcome. In contrast to the incidence of fever and diarrhea, that provide short-term measures of child health, child height for age (HAZ) is a measure of long-term child health and development. As such child HAZ may be better able to reflect the effects of synergies between maternal education at the community level and the fraction of households with improved water and/ or improved toilets.

thumbnail
Table 7. Effects of household-level and community-level variables on child HAZ in urban areas, children 0 to 60 months.

https://doi.org/10.1371/journal.pone.0341218.t007

thumbnail
Table 8. Effects of household-level and community-level variables on child haz in rural areas, children 0 to 60 months.

https://doi.org/10.1371/journal.pone.0341218.t008

Maternal education at the household level as well as at the cluster level has a strong positive effect on child HAZ. In both urban and rural areas, accounting for synergies among the three policy variables and other household characteristics policy reveal externality effects associated with maternal education that are greater than those obtained from the standard OLS approach (compare estimates in col. 1–3 in Table 7 and 8 with those col. 5). The Hybrid-All estimates reveal that there are also substantial externalities at the community level associated with the fraction of households having access to improved toilets (Table 7, col. 5) in urban areas and with improved water in rural areas (Table 8, col. 5). Thus, OLS-based studies of externalities using the community average of the policy variable of interest alone, are likely to result in misleading estimates of the effects of other level one variables on the outcome of interest. This problem does not arise with the CRE approach whereby the within-cluster estimates of level one variables are independent of how many community level variables are included in the regression.

However, it is important to point out one associated with the estimates of improved water and improved toilets in rural areas. Estimates in rural areas seem to be affected by collinearity between the within and between variation. Such indications of collinearity arise from the reversal of the sign of the coefficient of improved water when the fraction of households in the cluster with improved water is included in the regression (the coefficient of 0.051 in Table 8 col 1, changes to −0.030 in Table 8 col 2). A similar pattern emerges for the improved toilets variable (the coefficient of −0.009 in Table 8 col 1, changes to 0.010 in Table 8 col 3). Bearing these important caveats in mind, in rural areas, access to improved toilets is associated with a positive effect on child height within clusters, but there appear to be no externalities arising from a higher fraction of households with improved toilets between clusters. The opposite pattern emerges for access to improved water. Ceteris paribus, access to improved water at the household level within communities is associated with lower child HAZ, but a higher fraction of households with access to improved water in the cluster is associated with a higher average child HAZ.

Table 9 provides a qualitative summary of the estimates of the between-cluster estimates (or externality effects of maternal education, improved water and improved sanitation on the three outcome variables analyzed in this paper.

thumbnail
Table 9. Summary of between-cluster (community level) effects by policy variable, and child health outcome for urban and rural areas in SSA.

https://doi.org/10.1371/journal.pone.0341218.t009

The analysis so far has considered only a linear specification for externalities. In Table 10 we relax this assumption and use a quadratic specification for the cluster means of all the level 1 variables based on the hybrid model summarized in equation (4). The likelihood ratio tests comparing the estimates of the restricted model Hybrid-3 (using the quadratic specification for the cluster average of only the three policy variables of interest) with the unrestricted model Hybrid-All (using the quadratic specification of the cluster average of all the control variables at the individual level) indicated that the quadratic Hybrid-All model fitted the data better in all cases.

thumbnail
Table 10. Nonlinearities in effects of community-level variables.

https://doi.org/10.1371/journal.pone.0341218.t010

The estimates reveal that the quadratic term for maternal education at the community level is statistically significant in most instances such as for child height-for-age in urban areas, and for fever and diarrhea in rural areas. Also, the coefficients of improved toilets for fever in rural areas (Table 10, col 2) also suggest that there are increasing returns associated with a higher fraction of households with improved toilets at the community level. The current estimates suggest that the threshold value beyond which decreasing returns set in (or the marginal returns of improved sanitation equal zero), is when 59% of the households in the community have improved toilets. Given that this is almost three times the current mean fraction of households with improved toilets in rural areas (about (20%; see Table 2) this implies that positive externalities associated with improved toilets in rural areas will continue to be at work for a while in the foreseeable future.

Concluding remarks and policy considerations

In this paper we used data from twenty countries from the harmonized versions of the DHS surveys for countries in Sub-Saharan Africa to investigate the extent to which contextual community factors impact the nutrition and health of children less than 60 months of age. We used the level of women’s schooling in the community and the fraction of households in the community with improved water and the fraction with improved sanitation facilities as the community factors. Long-term child health was measured by child height for age z-scores, and short-term child health by the incidence of reported fever and diarrhea in the two weeks preceding the survey.

We employed the hybrid modeling approach that combines the strengths of random-and fixed-effects models. First, it allows for the accounting of unobserved heterogeneity at the community level which has been largely ignored in the literature up to now. Second, it also allows for the inclusion of all the potential synergies in externalities at the community level. Third, it permits testing the sensitivity of the estimates of externalities to alternative specifications of the cluster-level unobserved heterogeneity.

The empirical analysis confirms the role of externalities at the community level and shows that the size of these externalities varies between rural and urban areas and by outcome considered. The preferred hybrid approach reaffirmed the paramount role of externalities in maternal education on child height-for-age (in both urban and rural areas), in the incidence of fever (in urban areas) and the incidence of diarrhea (in both urban and rural areas). Significant externalities also appear to be associated with the fraction of households in the community having improved toilets reducing fever (in both urban and rural areas) and diarrhea in urban areas as well as in increasing child HAZ.

An important advantage of the hybrid model is that it also yields estimates of the within-cluster effects of the same policy variables. These effects are independent of the potential influence of unobserved heterogeneity at the community level. Again, both mother’s education and improved toilets have significant effects within clusters.

Our findings inform public policy decisions on the amounts of public investments in education and basic water and sanitation infrastructure, whether costs of such investments need to be subsidized as part of broader development strategy, and whether public resources should be targeted to the community as opposed to the household level. Solely from the lens of child health outcomes (both in terms of nutrition and the incidence of illness) investments in improved water do not appear to improve HAZ or reduce the incidence of fever or diarrhea in our sample of children less than 60 months of age. However, given data limitations, the measure of improved water may not indicate safe water, especially if the improved water is non-piped [29]. But, increasing the years of education of mothers and increasing household-specific access to improved toilets through public policy will have not only private returns to the family with more educated mothers and improved infrastructure but also benefits to other children in the community. The clearest case where public funding provides benefits outside of private returns to the beneficiary of the policy is in education. Additional investments in having educated mothers not only accrues health benefits to the mother’s own children but also to the other children in the community. While changing the educational attainment of current mothers is more challenging, investing in the education of girls, future mothers, is an investment in the health of all the children in their community.

References

  1. 1. Burger S, Esrey S. Water and sanitation: Health and nutrition benefits. In: Pinstrup-Andersen P, Pelletier D, Alderman H, editors. Child growth and nutrition in developing countries: Priorities for action. Ithaca, NY: Cornell University Press. 1994.
  2. 2. Caldwell J. How is greater maternal education translated into lower child mortality?. Health Transition Review. 1994;4:224–9.
  3. 3. Cleland JG, Van Ginneken JK. Maternal education and child survival in developing countries: the search for pathways of influence. Soc Sci Med. 1988;27(12):1357–68. pmid:3070762
  4. 4. Desai S, Alva S. Maternal education and child health: is there a strong causal relationship?. Demography. 1998;35(1):71–81. pmid:9512911
  5. 5. Fink G, Günther I, Hill K. The effect of water and sanitation on child health: evidence from the demographic and health surveys 1986-2007. Int J Epidemiol. 2011;40(5):1196–204. pmid:21724576
  6. 6. Thomas D, Strauss J, Henriques M-H. How Does Mother’s Education Affect Child Height?. J Human Res. 1991;26(2):183.
  7. 7. UNICEF UNICEF. Strategy for Improved Nutrition of Women and Children in Developing Countries. New York, NY: UNICEF. 1990.
  8. 8. Alderman H, Hentschel J, Sabates R. With the help of one’s neighbors: externalities in the production of nutrition in Peru. Soc Sci Med. 2003;56(10):2019–31. pmid:12697194
  9. 9. Andrzejewski CS, Reed HE, White MJ. Does where you live influence what you know? Community effects on health knowledge in Ghana. Health Place. 2009;15(1):228–38. pmid:18603464
  10. 10. Kravdal Ø. Child mortality in India: the community-level effect of education. Popul Stud (Camb). 2004;58(2):177–92. pmid:15204252
  11. 11. Kravdal O. The importance of community education for individual mortality: a fixed-effects analysis of longitudinal multilevel data on 1.7 million Norwegian women and men. J Epidemiol Community Health. 2010;64(12):1029–35. pmid:19854750
  12. 12. Moestue H, Huttly S. Adult education and child nutrition: the role of family and community. J Epidemiol Community Health. 2008;62(2):153–9. pmid:18192604
  13. 13. Osorio AM, Bolancé C, Madise N, Rathmann K. Social determinants of child health in Colombia: can community education moderate the effect of family characteristics?. XREAP WP. 2013.
  14. 14. Parashar S. Moving beyond the mother-child dyad: women’s education, child immunization, and the importance of context in rural India. Soc Sci Med. 2005;61(5):989–1000. pmid:15955401
  15. 15. Andrés L, Briceño B, Chase C, Echenique JA. Sanitation and externalities: evidence from early childhood health in rural India. J Water Sanitation Hygiene for Development. 2017;7(2):272–89.
  16. 16. Larsen DA, Grisham T, Slawsky E, Narine L. An individual-level meta-analysis assessing the impact of community-level sanitation access on child stunting, anemia, and diarrhea: Evidence from DHS and MICS surveys. PLoS Negl Trop Dis. 2017;11(6):e0005591. pmid:28594828
  17. 17. Bell A, Jones K. Explaining fixed effects: random effects modeling of time-series cross-sectional and panel data. PSRM. 2014;3(1):133–53.
  18. 18. Schunck R. Within and between estimates in random-effects models: advantages and drawbacks of correlated random effects and hybrid models. The Stata J: Promoting communications on statistics and Stata. 2013;13(1):65–76.
  19. 19. Schunck R, Perales F. Within- and between-cluster effects in generalized linear mixed models: a discussion of approaches and the xthybrid command. The Stata J: Promoting communications on statistics and Stata. 2017;17(1):89–115.
  20. 20. Allison PD. Fixed Effects Regression Models. Thousand Oaks, CA: Sage. 2009.
  21. 21. Mundlak Y. On the Pooling of Time Series and Cross Section Data. Econometrica. 1978;46(1):69.
  22. 22. Wooldridge JM. Econometric Analysis of Cross Section and Panel Data. 2nd ed. Cambridge, MA: MIT Press. 2010.
  23. 23. Cameron AC, Trivedi PK. Microeconometrics: methods and applications. Cambridge: Cambridge University Press. 2005.
  24. 24. Boyle EH, King M, Sobek M. IPUMS-Demographic and Health Surveys: Version 8. Minnesota Population Center and ICF International. 2020.
  25. 25. Hsiao H. Analysis of panel data. 2nd ed. Cambridge: Cambridge University Press. 2003.
  26. 26. Halaby CN. Panel Models in Sociological Research: Theory into Practice. Annu Rev Sociol. 2004;30(1):507–44.
  27. 27. Baltagi BH. Econometric Analysis of Panel Data. 5th ed. New York: Wiley; 2013.
  28. 28. Schunck R. Cluster size and aggregated level 2 variables in multilevel models: A cautionary note. Methods Data Anal. 2016;10:97–108.
  29. 29. Kumpel E, Peletz R, Bonham M, Khush R. Assessing drinking water quality and water safety management in sub-saharan africa using regulated monitoring data. Environ Sci Technol. 2016;50(20):10869–76. pmid:27559754