Figures
Abstract
To meet the SDG requirement of spatial disaggregation of indicators, several methods have been developed to generate estimates of under-five mortality at the sub-national level. The reliability of sub-national mortality estimates in children aged 5-14 with the available survey data has not been evaluated so far. We generate Admin-1 sub-national estimates of the risk of dying in children aged less than five (5q0) and those aged 5 to 14 years old (10q5). We use 96 Demographic and Health Surveys (DHS) in 20 Sub-Saharan countries having at least 3 surveys designed to be representative at a sub-national level. The estimates account for the complex sample design of DHS and HIV-related biases in young children. A Bayesian space-time model previously developed for under-five mortality is used to smooth estimates across space and time in both age groups to reduce problems associated with data sparsity. The posterior distributions of the probability 10q5 are used to compute coefficients of variation and assess precision. Sufficiently precise estimates are retained to study the sub-national relationship between age-specific mortality rates (5q0 and 10q5), accounting for uncertainty in sub-national levels. Out of 1,132 space-time estimates, 62.3% are considered sufficiently precise with high heterogeneity across countries. Across all periods, sub-national estimates of mortality in children aged 0-4 are highly correlated with those in older children and young adolescents but this correlation is largely driven by the mortality decline. Within specific periods of time, it is often impossible to assess the relationship between mortality rates in the two age groups at the sub-national level, except in Nigeria, Ethiopia, Cameroon, Senegal and Zambia. As increased attention is devoted to survival after age 5, more research is needed to ensure that sub-national areas with specific interventions required for older children can be correctly identified.
Citation: Schlüter B-S, Masquelier B (2021) Space-time smoothing of mortality estimates in children aged 5-14 in Sub-Saharan Africa. PLoS ONE 16(1): e0245596. https://doi.org/10.1371/journal.pone.0245596
Editor: José Antonio Ortega, University of Salamanca, SPAIN
Received: November 5, 2020; Accepted: January 4, 2021; Published: January 19, 2021
Copyright: © 2021 Schlüter, Masquelier. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The data are publicly available Demographic and Health Surveys that can be downloaded from https://www.dhsprogram.com/data/available-datasets.cfm.
Funding: This work was supported by a FSR research grant from Université Catholique de Louvain (UCLouvain): IACS FSR19 MASQUE, Mr. Benjamin-Samuel Schlüter.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Under-five mortality is one of the most important health indicators and is regularly targeted by global health policies. It has been the focus of intense research in low- and middle-income countries, with studies developing robust national and sub-national estimates from multiple sources [1–3], identifying the main determinants of child survival [4–6] and pinning down the most effective public health interventions to prevent child deaths [7, 8]. In comparison, mortality in older children and young adolescents (5 to 14 years old) has attracted less attention, presumably because risks of dying are much lower at these ages. According to the UN Inter Agency Group for Child Mortality Estimation (IGME), 5.2 million children under 5 years of age died in 2019, compared to 0.9 million children aged 5-14 [9]. Children and young adolescents aged 5-14 face the lowest risk of dying observed across the life course, estimated at 7 deaths per 1000 children in 2019 globally. Yet, from the perspective of child health and development, this age group is of particular importance as it marks the start of formal education and the onset of adolescence [10]. Deaths in children aged 5-14 also deserve more attention as they are predominantly caused by preventable conditions (including lower respiratory tract infections, diarrhoeal diseases, drowning, meningitis and road injuries) [11]. Since the 2000s, the decline in under-five mortality has greatly accelerated, while that in mortality in the age range 5-14 has not, raising concerns that older children might not be benefiting from recent efforts to improve child survival as much as neonates and children aged 1-59 months [12].
Sub-Saharan Africa has by far the highest rate of mortality in children aged 5-14 years, at 17 per 1000 children, but this regional average conceals wide variations across countries [9]. In Niger for example, the probability of a child dying between her fifth and fifteen birthday (10q5) was estimated at 30 per thousand in 2019, six times the risk in South Africa (at 5 per thousand). There could also be substantial heterogeneity at the sub-national level but to date no study has attempted to disaggregate mortality indicators for this age group. Within-country differences in mortality in younger children are striking, suggesting sub-national variation in 5-14 year old mortality too; they accounted for 74-78% of overall variation in under-5 mortality across space and over time in Sub-Saharan Africa in the 1980s, 1990s and 2000s [13]. Hence progress towards child survival goals needs to be tracked at the sub-national level, a need recognized in the Sustainable Development Goals (SDGs) resolution [14]. Sub-national estimates help inform effective resource allocation to reduce child health inequities within countries by targeting areas where interventions are most needed [15]. Health programme planning and implementation typically occur at the level of low administrative units, such as regions, provinces and districts [2].
In recent years, the need for sub-national estimates of under-five mortality has been met thanks to the increased sample sizes of Demographic and Health surveys [16] and the development of new statistical methods [2, 3, 17]. Subnationally, the number of deaths and the corresponding population at risk in any unit of space × time are often too small to produce reliable estimates of the underlying risk without “borrowing strength” from neighbouring units [18]. Different strategies have been employed by various research groups for under-five mortality. Most notably, Golding et al. [2] used a Bayesian geostatistical analytical framework to produce estimates of under-5 and neonatal mortality at a resolution of 5 × 5 km grid cells across 46 countries in Africa. The study team combined full birth histories (FBH) collected in large-scale household surveys with summary birth histories (SBH) collected in censuses, where women report only on the children ever born and surviving. They showed that many sub-national areas would not meet the SDG 3.2 target by 2030 as this would require reducing under-five mortality by at least 8.8% per year between 2015 and 2030, an unrealistic target. However, their approach had at least three limitations. First, their modelling strategy did not appropriately account for the complex survey design that is used in sample surveys, potentially introducing biases in the uncertainty around space-time estimates. Second, Golding et al. [2] did not correct for the potential underestimation of mortality in countries with large HIV epidemics due to mother-to-child transmission of the virus. As birth histories are collected from mothers, they do not represent the experience of children whose mothers have died. In the absence of treatment, children born to HIV-positive mothers face higher risks of death and are also less likely to appear in the sample. Third, while their continuous-spatial model permits inference at various level of aggregation (districts, provinces, regions), it was less robust than other discrete spatial alternatives [3].
Another small area estimation (SAE) method was developed by Mercer and colleagues [17]. SAE methods leverage space-time similarity to construct a Bayesian smoothing model where spatial and temporal dependence are virtues that the estimation process can exploit [19, 20]. Mercer et al. [17] used hierarchical models performing spatio-temporal smoothing that fully accounted for the effects of the survey design on the uncertainty in place-time-specific estimates. In data-sparse situations, when one expects similarity (in space, time or both) this approach allows to disentangle signal from sampling variability. This can also be seen as a penalization, where large deviations from “neighbors” are discouraged. This method is of particular interest to study geographical heterogeneity in data-sparse contexts. It was first applied in a study of under-five mortality in regions of Tanzania [17] and has since been applied by Li et al. (2019) to generate estimates for Admin-1 sub-national areas in 35 countries in Africa [3].
In this article, we replicate the Li et al. (2019) study but focus on mortality in older children and adolescents. Spatial variations in children aged 5-14 might not be strongly associated with those in children younger than 5 years, due to different cause-of-death patterns or risk factors. There might be a need to target areas with specific interventions for older children to reach those most at need in this age range. However, data sparsity is a more serious problem for older children, as fewer data sources provide mortality measures for this age group. For example, indirect demographic techniques cannot be used to estimate the probability 10q5 from summary birth histories collected in censuses. In addition, mortality rates are much lower in older children and confidence intervals are larger. For example, in the 2018 DHS conducted in Zambia, the under-five mortality rate for the period 0-4 years before the survey was 58.1 per thousand at the national level, with a standard error of 3.6 per thousand (coefficient of variation = 6.3%). The corresponding risk of dying in the age group 5-14 was 10.0 per thousand, with a standard error of 1.4 (CV = 14.0%). For this recent period, women aged 15-49 in the survey reported more than 7 times more deaths among children aged 0-4 years than among children aged 5-14 years, for a relatively comparable exposure time (1.5 times more person-years in children aged 5-14). In this context, our study addresses two questions. First, we assess whether the method developed by Mercer et al. [17] can yield sufficiently precise mortality estimates of 10q5 at the first sub-national level when applied to Demographic and Health surveys (DHS). Second, for countries and periods for which we obtain estimates with enough precision, we evaluate if there is a strong relationship between under-five mortality and mortality in children aged 5-14 at the first sub-national level.
Materials and methods
Data sources
This study is based solely on DHS. DHS surveys use a stratified two-stage cluster sampling design. Samples are typically stratified by regions and urban/rural areas within each region. In each stratum, enumeration areas (EAs) are selected with a probability proportional to their population size, using a sampling frame usually obtained from the most recent census. Households are selected in each of the clusters. Within each household, all women aged 15-49 are selected for in-depth individual interviews. In recent surveys, geospatial information is collected to identify the central point of each cluster. To ensure that respondent confidentiality is maintained, coordinates are displaced by up to 2 km in urban clusters, 5 km in 99% of rural clusters, and 10 km in a random sample of 1% of rural clusters [21].
For this study, we retained only surveys conducted in Sub-Saharan countries that had at least 3 DHS surveys representative at a sub-national level, with a consistent set of sub-national boundaries across DHS surveys. This requirement was necessary to pool information from different surveys across time. When sub-national boundaries changed across DHS surveys, we used the GPS location of clusters to re-locate them into coherent sub-national units over time. This led us to select 96 surveys (see S1 Table in S1 Appendix) conducted in 20 Sub-Saharan African countries between 1990 and 2018. Our study covers the following countries: Benin, Burkina Faso, Cameroon, Ethiopia, Ghana, Guinea, Kenya, Lesotho, Madagascar, Malawi, Mali, Namibia, Niger, Nigeria, Rwanda, Senegal, Tanzania, Uganda, Zambia and Zimbabwe. The population-weighted average of the risk of dying in the age group 5-14 was 20 per 1000 in 2019 in these 20 countries, which is slightly higher than the regional estimate for Sub-Saharan Africa (16 per 1000) [9]. In 2019, 65% of all deaths that occurred in Sub-Saharan Africa in the age group 5-14 where in these 20 countries. The selected surveys contained data on about 11 million child-years of exposure and 376,015 deaths of children under the age of 5 years, against 9.2 million child-years of exposure and 20,605 deaths in the age group 5-14.
Estimation of sub-national mortality rates
We used the complete birth histories of women included in the DHS to obtain child-month data where the event of interest was the death of the child. We followed the methodology developed in Mercer et al. (2017) [17] and Li et al. (2019) [3] to estimate sub-national-period specific mortality rates, both for children aged 5 to 14 and for those aged less than five. The methodology consists of two steps. First, we obtained place-time probabilities of dying for both age groups. Then, we used these as inputs in a space-time Bayesian model to smooth estimates across space and time.
In the first step, we used discrete time survival analysis (DTSA) [22] to estimate age-specific monthly probabilities of dying using weighted logistic regression [23]. For under-five mortality, we used 6 age bands: [0, 1) to identify neonatal deaths, [1, 12), [12, 24), [24, 36), [36, 48) and [48, 60]. For children aged 5-14, we used 10 one-year age groups: [60, 72), [72, 84), … until [168, 180) months. The DHS sample weights associated with each mother account for the sampling probability and a non-response adjustment. Hence, the survey design and the related uncertainty is propagated through to the estimates and their associated standard error. This resulted in survey-place-time-age-specific estimated monthly probabilities of dying with variance estimates that fully account for the design of the survey. In our context, place consisted of Admin 1 level, which corresponds to administrative boundaries at the first sub-national level. DHS surveys are powered such that most indicators can be considered representative at this aggregation level. The time dimension of our mortality estimates was defined as a series of 4-year periods, for which multiple surveys could provide information. Because we need to combine surveys whose fieldwork dates have varied over time, we decided to let the most recent survey defines the time breaks (see S1 Appendix). For each survey, we went backwards in time by periods of 4 years. Information contained in full birth histories and referring to a period located more than 24 years before data collection was discarded when estimating under-five mortality. Similarly, this period was restricted to a maximum of 12 years for mortality in children aged 5-14 to reduce biases associated with the excess mortality of first-born children [12]. In other words, a survey contributed to a maximum of 6 periods for 5q0 and 3 periods for 10q5. Each country has its own series of 4-year periods. Note that if a survey provided only 1 year of exposure in a given 4- year period, it did not inform estimates in this period. We constructed the estimate of interest , in Admin 1 i, time period t and survey s using standard life table techniques:
(1)
The β’s are obtained from the weighted logistic regressions as follows:
(2)
After estimation of parameters in Eq (2), it can be rewritten as
(3)
for x ∈ [xa, xa + na], a = 1, …, 10, which consists of the 10 age intervals. Variances of the
were obtained via the delta method approach, using the estimated covariance matrices available from the weighted logistic regression models [17]. Under-five mortality rates were estimated in a similar way.
Each of our selected countries had a least 3 DHS, thus for a given period, and a given Admin 1, and
were potentially derived from multiple surveys. We combined estimates from different surveys into a single estimate for each age group using weighted average, where weights were given by the inverse of their variances. Resulting estimates can be seen as standard meta-analysis estimates. Similarly to Li et al. (2019) [3], we adjusted the estimates of
to account for downward biases introduced by mother-to-child transmission of HIV in Kenya, Lesotho, Tanzania, Zambia, Zimbabwe, Malawi, Namibia and Rwanda [24]. We used the correction factors made publicly available by Li and colleagues [3]. We did not correct for HIV-related biases in
as these biases has not been evaluated in older children and there is currently no method to adjust for these.
In a second step, we modeled the entire collection of place-time specific logit() and logit(
) for each country, using a Bayesian space-time smoothing model with appropiate design-based estimator of the variance,
, obtained in the previous step. The smoothing allows sharing information between near neighbors in both space and time, leading to estimates that are characterized by a smaller variance and higher signal to noise ratio [3]. The model can be expressed as follows:
(4)
where the components are:
- A constant overall level μ with a flat prior.
- An unstructured spatial term with prior
.
- A smooth spatial term [ϕ1, …, ϕn] where its prior is expressed as an intrinsic conditional autoregression (ICAR) model [25]. This model is a generalization of the random walk of order 1 (RW1) to the space dimension.
- An unstructured temporal term where the prior is expressed as
.
- A smooth temporal term [γ1, …, γT] where its prior is a random walk of order 2. A random walk of order 2 (RW2) model generally leads to more smoothing than a random walk of order 1 (RW1) model. Indeed, in the later, each value depends on its nearest neighbors while in the former, it also depends on neighbors’ nearest neighbors.
- δit, an interaction term (type IV) where we assumed that temporal trends differ between areas and that they are more likely to be similar for adjacent areas [18]. Its associated prior is expressed as the product of the random walk and the intrinsic conditional autoregression.
Hyperpriors on the dipersion parameters have Gamma(a,b) distribution. For a detailed explanation of how a and b are specified, we refer the reader to [17, 18]. As emphasized by Li et al. (2019) [3], for both space and time dimensions, there are two terms capturing different sources of variation. The first term refers to unobserved risk factors that vary smoothly (ϕi and γt), the other reflects random shocks that are specific to that place or time only (θi and αt). The Bayesian estimation process takes place separately for each country. The amount of smoothing is determined by the variances of ϕi and γt that are directly estimated from the data. The space-time interaction term (δit) allows the trend in time to vary across spatial areas. It was modeled assuming that the temporal (RW2) and spatial (ICAR) structured effects interact [18]. In summary, each (or
) depends on its nearest neighbors in space and its nearest two neighbors in both the past and future in the time dimension.
In the S1 Appendix, we compare the estimates of 10q5 and their associated precision before and after space-time smoothing (that is, after the first and second step). This comparison gives a sense of the reduction in uncertainty induced by the smoothing model. We also compare our final estimates at the national level with those provided by the UN Inter-agency Group for Child Mortality Estimation (UN IGME), using other data series and methods [9].
Precision of sub-national estimates of 
Our first objective is to assess if Admin 1 estimates are sufficiently precise for children aged 5-14 when derived from multiple DHS. In the frequentist framework, the coefficient of variation (i.e. the standard deviation divided by the mean) is regularly employed to decide on the usability of estimates [16]. Hansen et al. (1953) and Kish (1995) consider that a coefficient of variation larger than 0.2 is a sign of unstable estimate [26, 27]. Working in a Bayesian framework, we used the posterior distributions of sub-national mortality estimates and computed a coefficient of variation by dividing their standard deviations by their means. We multiplied these ratios by 100 to obtain percentages. Dong and Wakefield (2004) showed that there is a direct relationship between the posterior CV obtained in this way and the ratio of higher end to lower end of the posterior credible interval [28]. If the coefficient of variation was higher or equal than 20%, we considered that the associated sub-national mortality estimate for children aged 5-14 was not sufficiently precise.
Correlation between
and 
Our second objective is to examine the relationship between and
at the sub-national level. To do so, we retained only the periods for which at least three quarters of the sub-national mortality estimates of
were considered as sufficiently precise according to the CV. This threshold was set according to the countries with fewer Admin 1 entities. In the case of Malawi (consisting of three Admin 1 entities), applying a 75% rule, a period could not be retained if one sub-national estimate within this period was not robust. For Rwanda (characterized by five Admin 1 entities), applying the same rule, if more than one sub-national estimate in a given period was not robust, this period was not retained. Having our set of selected periods for each country, we first computed Pearson correlation coefficients between
and
by country. We further stratified by period. Working in a Bayesian inference setup,
and
were associated to their respective posterior distributions. This allowed us to obtain 95% credible intervals around Pearson correlation coefficients, accounting for the uncertainty in our sub-national estimates.
Analyses were run with the R software version 3.6.1. We used the SUMMER package [29] that uses the svyglm function from the survey package [30] and the R-INLA package [31] to fit the models. All the R codes used to perform the analysis is available on Github: https://github.com/benjisamschlu/Subnational-5-14-MR.
Results
Sub-national mortality estimates for children aged 0-4 and 5-14 years old
The estimates presented in this section are obtained after space-time smoothing of meta-analysis estimates. Figs 1 and 2 present Admin 1 mortality rates and their associated 95% credible intervals for Nigeria and Kenya. The corresponding graphs for other countries are the S1 Appendix. For most countries and Admin 1 units, mortality rates declined in young children aged 0-4 over the periods analysed. There were fewer cases with a significant decline over the period of observation in older children and young adolescents (40.2% of admin-1 units over the 20 countries against 81.7% in under-five mortality, when comparing the first and last periods). However, the low percentage in 10q5 is also a consequence of the wide confidence intervals for the first and last periods. The sub-national patterns observed in under-five mortality cannot be readily generalised to older children. There is substantial heterogeneity in the 10q5-to-5q0 relationship.
The uncertainty around sub-national mortality rates for children aged 5-14 is much higher than for children aged less than 5. First, deaths counts are subject to more stochastic variation as the mortality rates for older children are smaller. Second, in the older age group, mortality estimates associated to a period are generally informed by less DHS surveys. This comes from the fact that 3 and 6 periods in the past are retained from each DHS in the case of older and younger children, respectively. Uncertainty is particularly large for the first and last periods. This is because these estimates are built on fewer DHS surveys and oldest surveys are characterized by smaller sample sizes. When the Admin 1 refers to a capital city, estimates tend to be associated with large credible intervals due to the smaller sample size and smaller count of deaths in this area. This can be observed for Nairobi in Fig 2.
Precision of sub-national mortality estimates of children aged 5-14
Fig 3 displays the coefficient of variation for the probability estimated for each Admin 1-period combination. These CVs are compared to the threshold value of 20%. All countries exhibit a U-shape, as sub-national mortality estimates for the first and last periods are less precise than the ones in the middle. Out of the 1,132 space-time estimates of
, 62.3% fall below the 20% threshold. By contrast, this is the case of 99.3% space-time estimates of under-five mortality (see S1 Appendix). There is important heterogeneity in the amount of sub-national estimates that fulfill our criterion. In Zimbabwe, Kenya, Ghana, Lesotho and Namibia, sub-national estimates of the probability
are not sufficiently precise as most values of the CV fall above the 20% threshold. These countries had generally small sample sizes and a large number of Admin 1 entities which increases the amount of noise in the data.
Fig 4 highlights the uncertainty in the sub-national estimates of for two countries. In Ethiopia (top panel), a majority of coefficient of variations are above 20% for the period 2008-2011. This is due to wide posterior distributions for the sub-national mortality estimates, making it impossible to detect variations in mortality in older children within the country. On the contrary, sub-national estimates for Nigeria in the period 2007-2010 are all considered robust, which is due to much narrower posterior distributions, allowing detection of significant variations in
. This is less so for the period 2015-2018 as it consists of the last period, and hence, estimates are built on fewer data points.
In order to study the relationship between and
, we retained periods where at least 75% of the
had a coefficient of variation below 20% for a given period. We also checked how many periods were considered sufficiently precise when using a threshold of 15% (see S61 Fig in S1 Appendix).
The 10q5-to-5q0 relationship at the sub-national level
Fig 5 displays the estimates against the
values, each with their associated 95% credible intervals (note that the scale is defined specifically for each country and that different point shapes for a country corresponds to different periods). This Figure is based only on the periods for which we have sufficient precision in estimates of
according to our criterion. Except Lesotho and Zimbabwe, each other retained country has at least 4 periods considered as sufficiently precise. The figure allows to compare uncertainty in sub-national
estimates across countries. It shows the long term improvement in mortality for both age groups over time.
Pooling all sub-national estimates across retained periods, there is a clear positive and statistically significant Pearson correlation for all countries except Burkina Faso and Niger (see Fig 6). The correlation coefficient ranges from 0.14 in Niger (95% CI: -0.02—0.31) to 0.92 in Malawi (95% CI: 0.86—0.96). However, One should not conclude from these high coefficients that the spatial variation in mortality in older children is well reflected by that in under-five mortality. Indeed, the relationship captured here is driven by the long-term improvement in mortality in both age groups. Stratifying by period, this relationship is much less obvious. Out of 82 retained periods, 46 have a Pearson correlation where its 95% credible interval crosses zero. In other words, in the majority of periods, the Admin 1 characterized by the highest under-five mortality rate are not the ones facing the highest mortality rate in older children. We can differentiate three groups of country. The first group consists of Cameroon, Ethiopia, Nigeria, Senegal and Zambia where correlations are most of the time positive and significantly different from zero. The second group contains Madagascar, Mali, Niger, Rwanda, Tanzania and Uganda when occasionally, a period is characterized by a positive and significant correlation. Finally, in the last group encompassing Benin, Burkina Faso, Ghana and Guinea, we did not find any significant correlation. We did not compute the Pearson correlation estimates stratifying by period for Malawi as the country consists of only three Admin 1 entities. Zimbabwe and Lesotho are not plotted as they only have one period considered sufficiently precise. Correlations associated with these periods are not significantly different from zero.
Comparison of space and time heterogeneity among age groups
The total variance in (or
) can be split into the variance of the five components of the model (see Eq (4)). Pooling countries together, the variance associated to the RW2 time trend (
) makes up for an average of 58.86% and 46.44% of the total variance for
and
, respectively (see Table 1). Countries with specifically high values for
such as Benin, Rwanda, Ethiopia and Malawi, are the ones where all Admin 1 experienced a similar temporal trend, explaining the majority of the variation in sub-national estimates. Lesotho, Namibia and Niger are countries where
for
is below 10% which, in the case of Lesotho and Niger, is in contrast with what is found for
.
The average share of total variance accounted for by the ICAR structured space term () is 26.9% and 12.54% for
and
, respectively. In all countries but Madagascar, Malawi, Uganda and Zambia, our estimates of
for children under the age of five are higher or equal than for older children, sometimes by a large margin (e.g. Ghana, Guinea, Kenya, Niger and Zimbabwe). This is a sign that sub-national patterns of mortality declines that are consistent over time account for less variance in sub-national mortality estimates for children aged 5 to 14 in comparison to children aged < 5. It seems that most of the “lost” variation from this component in the older age group is now accounted for by the time×space interaction (RW2xICAR). The average percentage of variability captured by
equals 8.48% in children under the age of 5 and 33.40% in older children and young adolescents. This suggests that particular Admin 1 within a given country had mortality estimates that deviated from levels expected based on the secular trend (applying to all Admin 1) and the sub-national pattern (applying to all periods). However, we have to be cautious as most countries with the highest values in
are also the ones that had few precise sub-national estimates (Burkina Faso, Ghana, Lesotho, Namibia and Zimbabwe). Hence, this might also comes from stronger noise in sub-national estimates for theses countries. Finally, the variability explained by the unstructured terms (
) is within 1-5%.
Discussion
The monitoring of progress in child survival is increasingly moving from the national to the sub-national level, or even to finer spatial resolutions. At the same time, increasing attention is being paid to survival beyond the fifth birthday to ensure that older children can benefit from health programs and policies as much as young children [32]. In this context, it is important to assess whether spatial disparities in risks of dying in the early years of life mirror those that characterize mortality in older children. Otherwise, the targeting of public health interventions based solely on under-five mortality might leave behind some older children and young adolescents in need.
In this study, we showed that sub-national mortality estimates can be derived from DHS surveys for children aged 5-14. We used a method which has been validated for children aged 0-4 years [17], and obtained sub-national estimates of the probability 10q5. Working in a Bayesian inference setup allowed us to reflect and account for uncertainty in our sub-national estimates through their associated posterior distributions. The uncertainty correctly accounted for the complex survey design of DHS.
We obtained precise estimates for about two-thirds of period-specific sub-national units, defined as a CV below 20%. The sample size seemed to have the greater influence on the capacity of obtaining robust sub-national estimates than the number of surveys. Having these two criteria fulfilled, we observed that the age-specific risks of dying were highly and positively correlated. However, stratifying by period, in most cases, the Pearson correlations coefficients were not statistically significantly different from zero. Thus, for a given period, the Admin 1 entities with high 5q0 were not necessarily those with high 10q5. Hence, at the country level, the long term improvement in mortality drives the positive relationship between 5q0 and 10q5 but it is currently not possible to conclude that at the sub-national level, this relationship holds, expect in Cameroon, Ethiopia, Nigeria, Senegal and Zambia.
When decomposing the total variance associated with logit(5q0) and logit(10q5), we showed that the two age groups have a similar share of variation coming from the time trend (approximately half of the total variability). In contrast, the proportion of variation coming from the structured space term is approximately two times higher in 5q0 compared to 10q5. This reflects that sub-national variation within countries over all periods is higher for the younger age group. Burkina Faso, Ghana, Lesotho, Namibia, Niger and Zimbabwe are countries where more than 50% of their total variance is explained by the space×time interaction term. These countries also have a high share of sub-national estimates that are considered as imprecise. This suggests that in these cases, the method used does not allow to disentangle the signal from the noise. Hence, sub-national estimates for these countries have to be interpreted with caution.
This research is subject to some limitations. First, while we adjusted for bias related to the vertical transmission of HIV in under-five mortality rates, we did not account for such bias in older children and young adolescents. This bias should be smaller in older children than in children under age 5, as most children vertically infected will die before age 5 in the absence of treatment. Second, we worked with a somewhat arbitrary threshold for precision. To evaluate how the number of retained periods changed according to a more conservative threshold, we performed a sensitive analysis. Obviously, a more stringent criterion results in retaining fewer periods (91 periods with 20%, reduced to 50 periods with 15%). However, since the 95% CI around Pearson correlation coefficients accounted for uncertainty in sub-national estimates, we present uncertainty around correlation measures and the only change is the number of periods used, not the correlation estimate for a given period. Third, this study relied on a limited number of countries as 20 countries fulfilled our selection criteria (i.e having at least 3 DHS representative at the sub-national level). Finally, we do not provide estimates at a finer geographical scale than the Admin 1 level. It is clear from our results that 10q5 estimates at a finer level would be highly uncertain. There is a trade-off between geographical scale and precision of estimates [28].
In order to improve the measurement of mortality in older children at the sub-national level, other types of data sources are required, such as Health and Demographic Surveillance Systems (HDSS) (to study the relationship between 5q0 and 10q5 in detail), or reports on recent household deaths in censuses and summary birth history data (to reduce the uncertainty around space-time estimates). However, the methodology used here would need to be adapted as it was developed to be used on full birth histories. Another possibility that seems promising is to revise the approach and measure the 10q5-to-5q0 ratio at the sub-national level, and model this ratio, instead of modelling the age-specific estimates as if they were independent and analyzing their relationship in a second step. The ratio of 5q0 to 10q5 could be modelled as a function of to inform estimates at the sub-national level in a similar way as Alexander and Alkema [33] model the ratio of neonatal mortality over post-neonatal mortality.
To conclude, our study assessed the feasibility of obtaining sub-national mortality estimates for children aged 5-14 with DHS surveys and a recent space-time smoothing method. We showed that sub-national estimates for this age group are surrounded by considerable uncertainty, even when combining surveys within countries. In Cameroon, Ethiopia, Nigeria, Senegal and Zambia, our study suggests that the spatial variations in mortality found in the under-5 age group mirror those in the 5-14 age group. A more extensive use of census data should help in establishing whether this is also the case in other countries.
Acknowledgments
We thank Ashira Menashe-Oren and two anonymous reviewers for their valuable comments during the preparation of this manuscript.
References
- 1. Alkema L, New JR, Pedersen J, You D. Child mortality estimation 2013: an overview of updates in estimation methods by the United Nations Inter-agency Group for Child Mortality Estimation. PloS one. 2014 Jul 11;9(7):e101112. pmid:25013954
- 2. Golding N, Burstein R, Longbottom J, Browne AJ, Fullman N, Osgood-Zimmerman A. Mapping under-5 and neonatal mortality in Africa, 2000–15: a baseline analysis for the Sustainable Development Goals. The Lancet. 2017 Nov 11;390(10108):2171–82. pmid:28958464
- 3. Li Z, Hsiao Y, Godwin J, Martin BD, Wakefield J, Clark SJ, with support from the United Nations Inter-agency Group for Child Mortality Estimation and its technical advisory group. Changes in the spatial distribution of the under-five mortality rate: Small-area analysis of 122 DHS surveys in 262 subregions of 35 countries in Africa. PloS one. 2019 Jan 22;14(1):e0210645. pmid:30668609
- 4. Chao F, You D, Pedersen J, Hug L, Alkema L. National and regional under-5 mortality rate by economic status for low-income and middle-income countries: a systematic assessment. The Lancet Global Health. 2018 May 1;6(5):e535–47. pmid:29653627
- 5. Günther I, Harttgen K. Deadly cities? Spatial inequalities in mortality in sub-Saharan Africa. Population and Development Review. 2012 Sep;38(3):469–86.
- 6. Victora CG, Barros AJ, Blumenberg C, Costa JC, Vidaletti LP, Wehrmeister FC. Association between ethnicity and under-5 mortality: analysis of data from demographic surveys from 36 low-income and middle-income countries. The Lancet Global Health. 2020 Mar 1;8(3):e352–61. pmid:32087172
- 7. Akachi Y, Steenland M, Fink G. Associations between key intervention coverage and child mortality: an analysis of 241 sub-national regions of sub-Saharan Africa. International journal of epidemiology. 2018 Jun 1;47(3):740–51. pmid:29309582
- 8. Moucheraud C, Owen H, Singh NS, Ng CK, Requejo J, Lawn JE, et al. Countdown to 2015 country case studies: what have we learned about processes and progress towards MDGs 4 and 5?. BMC public health. 2016 Sep;16(2):33–49. pmid:27633919
- 9.
United Nations Inter-agency Group for Child Mortality Estimation (UN IGME). Levels & Trends in Child Mortality: Report 2020, Estimates developed by the United Nations Inter-agency Group for Child Mortality Estimation. United Nations Children’s Fund, New York, 2020.
- 10. Sawyer SM, Afifi RA, Bearinger LH, Blakemore SJ, Dick B, Ezeh AC. Adolescence: a foundation for future health. The lancet. 2012 Apr 28;379(9826):1630–40. pmid:22538178
- 11.
World Health Organization. Global Health Estimates 2016: Deaths by Cause, Age, Sex by Country and by Region 2000-2016. 2018. World Health Organization. 2019.
- 12. Masquelier B, Hug L, Sharrow D, You D, Hogan D, Hill K. Global, regional, and national mortality trends in older children and young adolescents (5–14 years) from 1990 to 2016: an analysis of empirical data. The Lancet Global Health. 2018 Oct 1;6(10):e1087–99. pmid:30223984
- 13. Burke M, Heft-Neal S, Bendavid E. Sources of variation in under-5 mortality across sub-Saharan Africa: a spatial analysis. The Lancet Global Health. 2016 Dec 1;4(12):e936–45. pmid:27793587
- 14.
United Nations General Assembly. Transforming our world: The 2030 agenda for sustainable development; 2015. Resolution adopted by the General Assembly on 25 September 2015: 70/1.
- 15. Yourkavitch J, Burgert-Brucker C, Assaf S, Delgado S. Using geographical analysis to identify child health inequality in sub-Saharan Africa. PLoS One. 2018 Aug 29;13(8):e0201870. pmid:30157198
- 16. Pedersen J, Liu J. Child mortality estimation: appropriate time periods for child mortality estimates from full birth histories. PLoS Med. 2012 Aug 28;9(8):e1001289. pmid:22952435
- 17. Mercer LD, Wakefield J, Pantazis A, Lutambi AM, Masanja H, Clark S. Space-time smoothing of complex survey data: small area estimation for child mortality. The annals of applied statistics. 2015 Dec;9(4):1889. pmid:27468328
- 18. Knorr-Held L. Bayesian modelling of inseparable space-time variation in disease risk. Statistics in medicine. 2000 Sep 15;19(17-18):2555–67. pmid:10960871
- 19. Dwyer-Lindgren L, Kakungu F, Hangoma P, Ng M, Wang H, Flaxman AD. Estimation of district-level under-5 mortality in Zambia using birth history data, 1980–2010. Spatial and spatio-temporal epidemiology. 2014 Oct 1;11:89–107. pmid:25457599
- 20. Pfeffermann D. New important developments in small area estimation. Statistical Science. 2013;28(1):40–68.
- 21.
Burgert CR, Colston J, Roy T, Zachary B. Geographic displacement procedure and georeferenced data release policy for the Demographic and Health Surveys. ICF International; 2013.
- 22. Clark SJ, Kahn K, Houle B, Arteche A, Collinson MA, Tollman SM. Young children’s probability of dying before and after their mother’s death: a rural South African population-based surveillance study. PLoS Med. 2013 Mar 26;10(3):e1001409. pmid:23555200
- 23. Binder DA. On the variances of asymptotically normal estimators from complex surveys. International Statistical Review/Revue Internationale de Statistique. 1983 Dec 1:279–92.
- 24. Walker N, Hill K, Zhao F. Child mortality estimation: methods used to adjust for bias due to AIDS in estimating trends in under-five mortality. PLoS Med. 2012 Aug 28;9(8):e1001298. pmid:22952437
- 25. Besag J, York J, Mollié A. Bayesian image restoration, with two applications in spatial statistics. Annals of the institute of statistical mathematics. 1991 Mar 1;43(1):1–20.
- 26.
Hansen MH, Hurwitz WN, Madow WG. Sample survey methods and theory. Vol. I. Methods and applications, 1953
- 27. Kish L. Methods for design effects. Journal of official Statistics. 1995 Mar 1;11(1):55.
- 28.
Dong TQ, Wakefield J. Modeling and presentation of vaccination coverage estimates using data from household surveys. arXiv preprint arXiv:2004.03127. 2020 Apr 7.
- 29.
Martin BD, Li ZR, Hsiao Y, Godwin J, Wakefield J, Clark SJ. SUMMER: Spatio-Temporal Under-Five Mortality Methods for Estimation. R package version 0.1. 0. 2017.
- 30.
Lumley T. Package “survey”: Analysis of complex survey samples. R Package Version. 2018;3:33-2.
- 31. Rue H, Martino S, Chopin N. Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations. Journal of the royal statistical society: Series b (statistical methodology). 2009 Apr;71(2):319–92.
- 32. Bundy DA, de Silva N, Horton S, Patton GC, Schultz L, Jamison DT. Investment in child and adolescent health and development: key messages from Disease Control Priorities. The Lancet. 2018 Feb 17;391(10121):687–99. pmid:29153316
- 33. Alexander M and Alkema L. Global Estimation of Neonatal Mortality using a Bayesian Hierarchical Splines Regression Model Demographic Research. 2018;38:335–372.