Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Predicting nutritional status for women of childbearing age from their economic, health, and demographic features: A supervised machine learning approach

  • Md. Mohsan Khudri,

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Writing – original draft, Writing – review & editing

    Affiliation Department of Economics, Fogelman College of Business and Economics, The University of Memphis, Memphis, Tennessee, United States of America

  • Kang Keun Rhee,

    Roles Data curation, Formal analysis, Methodology, Writing – original draft

    Affiliation Department of Economics, Fogelman College of Business and Economics, The University of Memphis, Memphis, Tennessee, United States of America

  • Mohammad Shabbir Hasan ,

    Contributed equally to this work with: Mohammad Shabbir Hasan, Karar Zunaid Ahsan

    Roles Formal analysis, Visualization, Writing – review & editing

    Affiliation Department of Computer Science, Virginia Tech, Blacksburg, VA, United States of America

  • Karar Zunaid Ahsan

    Contributed equally to this work with: Mohammad Shabbir Hasan, Karar Zunaid Ahsan

    Roles Methodology, Supervision, Writing – original draft, Writing – review & editing

    zunaid@email.unc.edu

    Affiliation Public Health Leadership Program, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America

Abstract

Background

Malnutrition imposes enormous costs resulting from lost investments in human capital and increased healthcare expenditures. There is a dearth of research focusing on the prediction of women’s body mass index (BMI) and malnutrition outcomes (underweight, overweight, and obesity) in developing countries. This paper attempts to fill out this knowledge gap by predicting the BMI and the risks of malnutrition outcomes for Bangladeshi women of childbearing age from their economic, health, and demographic features.

Methods

Data from the 2017–18 Bangladesh Demographic and Health Survey and a series of supervised machine learning (SML) techniques are used. Additionally, this study circumvents the imbalanced distribution problem in obesity classification by utilizing an oversampling approach.

Results

Study findings demonstrate that the support vector machine and k-nearest neighbor are the two best-performing methods in BMI prediction based on the coefficient of determination (R2), root mean square error (RMSE), and mean absolute error (MAE). The combined predictor algorithms consistently yield top specificity, Cohen’s kappa, F1-score, and AUC in classifying the malnutrition status, and their performance is robust to alternative standards. The feature importance ranking based on several nonparametric and combined predictors indicates that socioeconomic status, women’s age, and breastfeeding status are the most important features in predicting women’s nutritional outcomes. Furthermore, the conditional inference trees corroborate that those three features, along with the partner’s educational attainment and employment status, significantly predict malnutrition risks.

Conclusion

To the best of our knowledge, this is the first study that predicts BMI and one of the pioneer studies to classify all three malnutrition outcomes for women of childbearing age in Bangladesh, let alone in any lower-middle income country, using SML techniques. Moreover, in the context of Bangladesh, this paper is the first to identify and rank features that are critical in predicting nutritional outcomes using several feature selection algorithms. The estimators from this study predict the outcomes of interest most accurately and efficiently compared to other existing studies in the relevant literature. Therefore, study findings can aid policymakers in designing policy and programmatic approaches to address the double burden of malnutrition among Bangladeshi women, thereby reducing the country’s economic burden.

Introduction

Malnutrition is a critical global concern that requires a comprehensive and continuous intervention strategy. Almost one-third of the population worldwide experience at least one type of malnutrition: underweight, overweight, or obesity [1, 2]. Nearly 1.9 billion adults globally are either overweight or obese, and about 462 million adults are underweight [2]. By controlling malnutrition, the society can reduce healthcare costs and increase productivity, which would result in saving money and economic prosperity in the long run. Obesity and overweight, resulting from abnormal or excessive fat accumulation, are behavioral and non-communicable risk factors affecting people of all ages, races, ethnicity, and gender. They contribute to critical health conditions, including cardiovascular disease, type 2 diabetes, and cancer which often result in death. In contrast, underweight is associated with lower economic productivity and contributes to higher morbidity and mortality rates [3]. Underweight, overweight, and obesity are major global health problems in developing countries [4]. Many lower-middle-income countries have experienced a high rise in the prevalence of overweight and obesity because of economic growth, industrialization, rapid urbanization, improved transportation system, and a nutrition transition to high-calorie food intake over the last couple of decades.

A study found that improvement in maternal body mass index (BMI) in the past 15 years is accompanied by a diminution in malnutrition status of under-five children [5]. Therefore, this paper focuses on the women of childbearing age as they can play a crucial role in the country’s economic development and substantially improve a child’s nutritional status. Though the socioeconomic, demographic, and health-protective factors of adult malnutrition are widely known, there is a paucity of information on the prediction of BMI and risks of being underweight, overweight, and obese among women of childbearing age based on those factors. Hence, the likelihood of these risky health behaviors needs to be evaluated and predicted at regular intervals. The study has therefore set three primary objectives. First, this study aims to evaluate the performance of multiple supervised machine learning (SML) algorithms along with the traditional regression algorithms and find the one that best predicts the BMI as well as the nutritional status (underweight, overweight, and obesity) among Bangladeshi women of childbearing age ranging from 18–49 years old. Second, the study ranks key socioeconomic, health, and demographic features related to each outcome based on different SML models. Thirdly, this study employs conditional inference tree (CT) to predict the outcome based on the interactions among important features and used classification and regression trees (CART), i.e., regression tree (RT) for BMI and decision tree (DT), as a robustness check. The findings will help enhance our understanding of malnutrition, make better predictions of nutritional status, and help policymakers design interventions that will reduce malnutrition at the national and global levels.

There are several economic aspects to justify the significance of this research. Firstly, a child whose mother is obese and has other health complications is likely to suffer from health concerns that may lead to a loss of human capital. Secondly, prenatal and postpartum health care costs for obese mothers and their infants are significantly higher than those with a normal weight [6]. Thirdly, malnutrition significantly contributes to increased absenteeism and unsatisfactory scholastic performance among school-aged children, which is detrimental to the economic growth of a nation [7]. Furthermore, the losses of individuals from insufficient nutrient intake are estimated to be around 10% or more of lifetime earnings in low-income countries [8]. Lastly, job absenteeism complicated by obesity even in developed nations yields an output loss of 4.3 billion dollars each year, and this brings an additional burden to the employers [9, 10]. Lack of skilled human capital is harmful for any developing nation because the cost in terms of productivity loss due to malnutrition can amount to 3 to 16 percent of the GDP [11]. Therefore, a comprehensive investigation to study and predict the risks of being underweight, overweight, and obese in women of childbearing age can potentially inform policy and practice in a developing country like Bangladesh.

Country background

Bangladesh has been experiencing remarkable progress in economic development for the last three decades. The annual GDP per capita growth reached 5.9% in 2021 compared to 1.11% in 1991, and the population living below the national poverty line has been decreasing [12]. Furthermore, the country has placed fewer restrictions on international trade, reformed the financial sector, and boosted infrastructure investments in geographically delimitated economic zones to increase country exports.

The country has also made improvements in socioeconomic and health aspects, such as child and maternal mortality and primary education participation. Child mortality decreased from 146.1 deaths to 29.1 per 1,000 live births between 1990 and 2020, whereas maternal mortality declined from 569 to 176 deaths per 100,000 live births between 1990 and 2015 [13]. In the past decade, enrollment in elementary schools increased from 100% to 120% [12]. Despite all these positive changes, Bangladesh is still struggling to control rising non-communicable diseases due to its fragile health system along with lifestyle and environmental challenges. This alone places a considerable strain on all available resources provided by the government and non-governmental organizations.

In terms of nutrition, Bangladesh is showing a reduction in underweight but an upward trend in overweight and obesity. The cases of underweight in ever-married women have been decreasing over time from 52% in 1996–97 to 30% in 2007 and 12% in 2017–18 [14]. In contrast, the percentage of overweight or obesity in the same cohort has increased to 12% in 2007 and 32% in 2017–18 from 3% in 1996–97 [14]. Past policies on maternal nutrition were skewed towards undernutrition, and this caused a problem to address the rising cases of overweight and obesity. Rural-urban differences in unhealthy BMI categories are also a big concern. The recent 2017–2018 Bangladesh Demographic and Health Survey (BDHS) shows that 13% of rural women are underweight compared to 9% of urban women [14]. On the other hand, 43% of urban women are either overweight or obese whereas 28% of rural women are so. The recent literature argued that women are more likely to experience overnutrition compared to their male counterparts [15, 16], and this is a serious problem as it is strongly liked to breast and ovarian cancers [17]. Therefore, women of childbearing age (18–49 years) are prone to chronic energy deficiency and malnutrition, and these can bring unwanted birth outcomes. Bangladesh has implemented nutritional programs such as Bangladesh Integrated Nutrition Project and National Nutrition Project to address these concerns. Still, the receptions were mixed and did not show a substantial impact on nutrition outcomes. All these indeed highlight the absence of a strategically built nutrition policy. Our study can shed light on policy managers by providing the necessary information to design a behavioral intervention program.

Literature review

Economic and health consequences of malnutrition.

According to the BMI database of the World Health Organization (WHO), on average, adult women are more overweight and obese than their male counterparts. Few other studies also report that women are more likely to be underweight, overweight, and obese than men [1820]. In addition, married women are more exposed to overweight and obesity than single women [2123]. Overweight and obesity are linked to a higher risk of gestational diabetes, cesarean birthing, and maternal and early neonatal death during pregnancy [24, 25]. In contrast, the health risks associated with being underweight include weakened immune function, vitamin deficiencies, anemia, increased risk for complications from surgery, fragility, increased perioperative complications, mortality in trauma surgery, etc.

The burdens of being obese or underweight pose critical cost containment conflicts to the health system as well as the efficiency for individuals to develop healthier lifestyles [26]. For example, 7% of the total healthcare costs in the European Union come from obesity [27]. BMI is considered one of the attributes in the literature on physical appearance [28]. In some jobs, e.g., sales, physical appearance is likely to be directly related to productivity. Besides, a common perception is obese people are likely to be lazier and less intellectually skilled than their non-obese counterparts [29, 30]. The existing literature provides evidence of potential discrimination made by employers against obese workers [3133]. It also appears that wages of white females go down due to weight increases [34]. He also observes a negative correlation between weight and wages for other gender-ethnic groups. All these provide evidence of the economic impacts of overweight and obesity. Several studies focus on obesity in developed countries [3537], however, only a few studies pay attention to this epidemic from developing countries’ perspectives [38, 39]. There is a lack of research focusing on the prediction of the nutritional outcomes among women of childbearing age in developing countries. This paper attempts to fill out this knowledge gap by predicting BMI and the risks of being underweight, overweight, and obesity from socioeconomic, demographic, and health features using robust SML models. Our study will aid policymakers in designing the appropriate health policies and strategies, thereby significantly reducing healthcare and economic costs.

Machine learning in applied economics and public health.

There is an increasing interest among applied economists and public health researchers in using the machine learning (ML) approach to predict human behavior and health outcomes, forecast energy and financial variables, and solve complex real-world problems [4044]. Although slowly, ML methods are gradually becoming the topic of interest in a rapidly growing methodological literature in economics [45, 46]. The reason is its great potential for extending an econometrician’s toolbox, which solves a different problem. ML methods are a subset of artificial intelligence that combine traditional econometric techniques and improve their performance over time, learning the underlying pattern of the data. These techniques are particularly robust in estimating human behavior and sentiment due to the increased heterogeneity of the underlying data [47]. Although ML does not focus on some formal properties that economists are concerned such as large sample properties of estimators, consistency, normality, efficiency, and identifying causal relationships, its demonstrated ability to outperform conventional methods on specific data sets with respect to out-of-sample (OOS) predictive power is valuable in practice.

A number of economists discuss the benefits of SML techniques for regression, point out the existing problems in economics where ML methods are apposite and emphasize the recent developments in adapting ML for addressing causal questions and policy implications in economics and technical econometrics research [4851]. Recently, one study adopted eXtreme gradient boosting (XGB), logistic regression (LTR), and random forest (RF) to predict bank failures in the United States [52]. In another study, a variety of SML methods were applied to predict the demand for salty snacks based on sales data, and findings suggested that SML can perform superbly in demand prediction [53]. Besides, there is a debate that ML can be useful in the first-stage estimation of instrumental variables because the ordinary least squares tend to overfit a model [54]. LASSO linear regression (LLR) was used by Wüthrich and Zhu to check whether it could mitigate the omitted variable bias [51]. One study suggested that ML could investigate the treatment effect with respect to observable variables [46]. An economic policy has both causal and prediction aspects. However, there are also occasions in which causal inference is unnecessary. Teacher employment was employed as an example by Mullainathan and Spiess to back this claim [49]. The study indicated that, even if the main goal is to estimate the impact of hiring an additional teacher, the type of teacher a school should pursue is based on prediction, which requires ML.

Another study area is to compare the performance between ML algorithms and regression methods. A study attempting to predict women’s height based on the socioeconomic status (SES) of 66 low-and-middle-income countries reported that its best ML algorithm performed slightly better than regression methods and exhibited a lack of nonlinearity between height and SES [55]. Another study employed the regularization feature of LASSO to construct an index called the Abridged Women’s Empowerment in Nutrition Index (A-WENI) [56]. They reduced the original 33 categories to 20 and showed that estimates from A-WENI are like those from the original Women’s Empowerment in Nutrition Index (WENI) and suggested that the shorter index can help measure nutritional empowerment.

Recent studies in public health prefer ML to traditional methods because of its leverage of data characterization and prediction simultaneously [57]. SML techniques have been extensively used in obesity research because of their capacity to predict with unprecedented precision [5861]. Using three SML algorithms: support vector machine (SVM), RF, and XGB, a study predicted obesity based on food sales data and documented that baked goods and flour followed by cheese and carbonated drinks are strongly associated with obesity [62]. Using the same three algorithms mentioned above, one study endeavored to predict the mortality from COVID-19 [63]. The study showed that age, oxygen levels, and the type of patient encounter are strongly related to mortality from COVID-19. With a small sample, one study based in Bangladesh employed nine SML algorithms to predict the risks of obesity [64]. Their experimental results show that LTR achieves the highest accuracy out of the nine classifiers. Another study from Bangladesh reported that an RF-based classifier outperformed the other four algorithms [65].

Differences between econometrics and supervised machine learning.

There are key diofferences between traditional econometrics and SML. First, traditional econometrics focuses on finding a variable that is significant in predicting an outcome. On the other hand, SML algorithms take two steps: finding a model to use and regularizing the model of choice based on cross-validation [49, 55]. One of the strengths of SML is that it predicts the outcome of interest using a variety of methods. There is no specific rule for following a particular conditional distribution to predict an outcome [46]. Unlike linear regression (LR), some SML techniques do not make parametric assumptions about the variables and allow more flexibility in modeling relationships. Therefore, this will enable researchers the freedom to choose the algorithm of their choice. Therefore, establishing a causal relationship is not the primary goal of supervised learning. Instead, our interest is in finding an algorithm that yields the maximum prediction accuracy and the smallest prediction error.

Second, while traditional econometric techniques can indicate variables associated with a certain health condition, it does not automatically yield a possible interaction between two or more variables related to the disease. In contrast, it is possible to find an interaction between predictors and how they predict the likelihood of exposure to a critical health condition with either decision or RT. Therefore, identifying those variables and their interaction is crucial in building an appropriate intervention strategy.

The third difference lies in the data mechanism. In ML, the data are split into an in-sample portion to train a model and an OOS portion to test its performance to find an algorithm with the smallest prediction error. The performance of the algorithm is evaluated based on the latter because SML algorithms seek to identify functions that can perform well with the OOS dataset. This approach is different from microeconometrics as it does not split the data set into training and test sample for analysis. The focus is more geared toward reducing the overall bias of the model [66].

The fourth difference between these two approaches is data-driven modeling. SML is more concerned about overfitting a model. More features can be inserted into an estimation model to increase the predictive power captured by the R2. However, this is not recommended as it can result in an overfitted model with serious concerns. An overfit model tends to have a low bias, and the in-sample error value could get close to zero. However, the OOS error and variance will be large as the predictions based on OOS will react sensitively to a small change of regressors. In contrast, a model with less complexity is not overfitted and tends to have a large in-sample bias. Nevertheless, the OOS prediction error will not be big due to the small variability of the model. SML algorithms can be tuned by imposing restrictions, known as regularization, to avoid overfitting in the first place. Regularization imposes a penalty on the model when additional variables are introduced to make the prediction efficient. It can take the form of variable selection, tree size, learning rate, number of neighborhoods, and so forth [49]. The amount of regularization a model requires is decided by cross-validation [55]. As mentioned earlier, SML methods split the data into in-sample and OOS portions. However, a single split is not recommended as it cannot generalize how the model performs with the comprehensive data. Therefore, K-fold cross-validation is implemented. The first step is splitting the data set into K equal-sized portions. Then, K-1 random folds of the data will be used as the training (in-sample) set, and the remaining fold becomes the test (OOS) set. These two steps are performed K times, whereas econometricians would use all observations from the data without making such a split.

Finally, unlike econometrics, most tree-based SML methods such as regression and decision trees along with RF provide a unique feature called variable importance. It ranks the variables that are important in predicting the outcome and provides insight into variables that are strongly correlated with the outcome, whereas the goal of traditional econometrics is to establish a causal relationship between the outcome and the regressors based on significance measures.

Methods

Data source and sampling design

This study uses data from the most recent wave of 2017–18 BDHS from the Demographic and Health Survey (DHS) website. 2017–18 BDHS is the eighth wave of the survey, conducted between October 2017 and March 2018. One of the challenges of the DHS data is that it neither contains information on income nor on consumption expenditures of households. We overcome this limitation by employing the data on household assets and housing characteristics [67]. Using these data, an asset index is constructed that proxies for wealth and implicitly long-run economic status. The survey also collected information on the household population’s demographic and socioeconomic characteristics, including fertility, mortality, educational attainment, employment status, asset ownership, housing characteristics, maternal and child nutritional status, and access to essential services. The survey contains several data files for different units of analysis. The unit of analysis means who or what is being studied. Depending on the data file, the unit of analysis of BDHS can be households, women, men, or children. This paper uses the data of ever-married childbearing women (de jure population) aged from 18 to 49 years old for the empirical analysis. Though the original BDHS collect data on women starting from age 15, we consider from 18 years old as girls transition into adulthood then [68].

The survey adopted a two-stage stratified sampling technique to select households and used the sampling frame from the list of enumeration areas (EAs) of the 2011 Population and Housing Census of the People’s Republic of Bangladesh, given by the Bangladesh Bureau of Statistics (BBS). Bangladesh comprises eight administrative divisions, such as Barisal, Chittagong, Dhaka, Khulna, Mymensingh, Rajshahi, Rangpur, and Sylhet. The sampling process involved stratifying the sample into regions and breaking down parts further into urban and rural areas. The EAs, also known as clusters and primary sampling units, for the survey are selected in the first stage employing the probability proportional to size sampling technique. All clusters are approximately of equal size in terms of area. The BDHS 2017–18 includes 672 EAs. A household listing prepared in all selected clusters is utilized as a sampling frame to select households in the second stage. Using an equal probability systematic sampling technique, 30 households, on average, are drawn from each selected cluster [14].

Variable description

The primary outcome variable of this study is the BMI. The survey computes the BMI using reported data on height and weight of individuals by dividing the weight in kilograms by the square of the height in meters. It is a standardized method for determining whether the body weight and the amount of body fat anyone has been in a healthy range. Following the definition of the WHO [69], we create three categorical outcomes classifying the respondents as being underweight (BMI<18.5 kg/m2), overweight (25≤BMI≤29.99 kg/m2) and obese (BMI≥30 kg/m2). Overweight and obesity variables are also generated following the Asian BMI cutoffs for robustness analysis. Asians tend to have a higher incidence of cardiovascular diseases and diabetes even at lower BMI levels, and the recommended overweight and obesity cutoffs for Asians by WHO expert consultation are 23−27.49 kg/m2 and ≥27.5 kg/m2 [70]. The socioeconomic features we consider in this paper include the household SES, educational attainments of women and their partners (years), partners’ occupation, and the current employment status of women. The household’s SES is measured by the wealth index and classified into three groups: low, middle, and high. The wealth index is a living standard measure constructed using principal component analysis. Selected household assets and characteristics: cars, bicycle, TV, usable land, floor type, source of drinking water, type of fuel, electricity, sanitation facilities, and construction materials are used to create a wealth index [71]. The value of the wealth index relies on the assets mentioned above owned by households and housing characteristics. However, this study does not suggest asset index as a measure of current welfare or poverty in this paper. Instead, we use it as a determinant of malnutrition among women, which depends on the long run as well as current SES. Households are unlikely to respond to temporary shocks by women’s malnutrition. The asset index is considered a proxy for wealth, hence for SES in the long run. The educational attainments take four equivalent categories: received no education, finished primary education, completed secondary education, and higher education degree such as college or graduate degree. The occupation of women’s partners is categorized into three groups: unemployed, blue-collar jobs, and white-collar jobs. Those jobs are defined as blue-collar that require physical labor, e.g., agriculture, skilled and unskilled manual work; and white-collar jobs consider all desk jobs, e.g., professional, technical, managerial, and business. The unemployed category includes unemployed, student, and retired women. Lastly, employment status shows whether a woman currently has a job or not.

This paper also controls for marital status, age of women, age during their first childbirth, age when they first lived together with their partners, partner’s age, contraceptive method, region, residence, religion, breastfeeding practice, abstaining status, amenorrheic status, access to piped water, open defecation, medical assistance, the total number of children ever born, access to media, family planning service, and the number of household members. A woman is defined as currently single if she is divorced, separated, or widowed during the interview. Women’s age is categorized into three groups: 18–25, 26–39, and 40+. The partner’s age is divided into three categories: 18–30, 31–49, and 50+. The contraceptive method variable asks the respondents if they use any contraceptive methods or not. An indicator variable is created to point out whether a woman used any of these methods or not. The eight administrative divisions of Bangladesh are classified into three regions based on their geographical locations following [72]: west (Rangpur, Rajshahi, and Khulna), center (Mymensingh, Dhaka, and Barishal), and east (Sylhet and Chittagong). The residence variable looks at whether a woman is from an urban or a rural area. Breastfeeding practice is a dummy variable created by looking into the entries in the maternity history for children born in the last three to five years and if any child is still being breastfed. If no child was born in the previous three to five years, the respondent is assumed not to be breastfeeding. Abstaining status is an indicator variable created from the maternity history by checking if the respondent had been abstaining from sexual relations since the last birth or not. Amenorrheic status indicates if the respondent is currently postpartum amenorrheic. The variable is constructed from the maternity history by checking if the period returned after the last birth. Religion specifies a woman’s religious affiliation, and our sample shows most are Muslim. We have three variables for getting medical assistance, and each has a specific question: getting permission to go, receiving financial assistance needed for treatment, and whether the distance to a health facility is manageable or not. The media variable is created by merging three variables: watching TV, listening to Radio, and reading the newspaper. If a woman has access to any of them, we consider that she has access to media. Lastly, family planning is a variable that shows whether a household heard any information about family planning through media.

Data pre-processing and SML algorithms

Variables of interest are extracted from the BDHS 2017–18 wave. In the pre-processing stage, 5,365 women are removed out of 20,127 from the analysis. These women met at least one of the following criteria: (1) women aged 15–17 (n = 651) (2) pregnant during the survey (n = 1,013); (3) missing information on BMI and other covariates (n = 3624) and (4) irrelevant observations and outliers (n = 77). Before prediction, the relationships between categorical features and nutritional outcomes are evaluated using the Pearson χ2 statistic corrected for the complex survey design with the second-order correction of Rao and Scott [73, 74].

For the continuous outcome, the following methods are applied: LR, LLR, RT, RF, CT, XGB, k-nearest neighbors (KNN), nonlinear SVM, and neural network (NN). One of our categorial outcomes, obesity, shows an imbalanced distribution of its underlying classes. A major problem of an imbalanced category is that it could yield either inaccurate or biased estimates of the measures such as accuracy and precision. This issue is circumvented by applying oversampling technique suggested by Lunardon and colleagues [75]. For the categorical outcomes, the following SML algorithms are implemented: LTR, LASSO logistic regression (LLTR), DT, RF, CT, adaptive boosting (ADB), XGB, KNN, nonlinear SVM, NN, and Naïve Bayes (NB).

The approaches under SML fall into four groups of predictors: parametric, nonparametric, mixed, and combined. LR, LLR, LTR, and LLTR estimate coefficients using parametric functions to predict an outcome. Nonparametric approaches include KNN and tree-based methods such as RT and DT that predict outcomes from a particular threshold of a variable. NB and SVM can be considered either parametric or nonparametric. The former can be regarded as parametric as it requires covariates to be conditionally independent and normally distributed. But it can be nonparametric, taking into account that a probability distribution does not need to be specified [76, 77]. SVM is generally treated as a parametric approach as the hyperplane has a functional form to separate the classes. However, it can be nonparametric depending on the kernel choice [78]. NN is a mixed predictor algorithm as each regressor is connected to a hidden layer to predict an outcome. Lastly, RF and ADB are combined predictors since they are weighted combinations of tree-based predictors. A brief description of all the algorithms used in this study is provided in Supplement A in S1 File.

Throughout the analysis, 80% of the data are used as the IS data and the remaining 20% as the OOS. Nevertheless, splitting the data into two parts may result in random sampling errors [49]. To optimize OOS prediction, both 5-fold and 10-fold cross-validations are used that allow us to find the optimal parameters of each model based on in-sample data and test the performance subsequently on the OOS data. The methodology of this paper is delineated in Fig 1.

Evaluation metrics

Since this study consists of both regression and classification tools, a variety of evaluation measures are employed. The root mean squared error (RMSE), the mean absolute error (MAE), and the R2 values are used to evaluate the model performance for regression models. Both RMSE and MAE can be considered standard errors of the prediction. It is preferable to have a lower value for those measures as they indicate that regression predictions are close to the actual outcome values. R2 is the variability of the outcome explained by changes in regressors and therefore captures the predictive power of a model. All these measures are applied to the OOS data and determine the algorithm that performs the best.

For the classification setting, traditional measures such as specificity, Cohen’s kappa, area under the repository curve (AUC), and F1-score are used in the ML literature. Same as the evaluation metrics, all these classification measures are analyzed based on the OOS data. Specificity indicates the percentage of true negatives over the sum of true negatives and false positives. A higher specificity value implies a lower number of false positives in the classification. Cohen’s kappa is the reliability measure of a classification model, and it is desirable to have it between 0.21 and 1 [79]. A Cohen’s kappa value of 0.21 indicates that a classification model is fairly classifying the instances. A value of 1 suggests a perfect classification. The area under the repository curve (AUC) shows the trade-off between the false positive and true positive rates at different threshold values. It is desirable to have an AUC value greater than 0.5 because an AUC of 0.5 implies that a classification model is no better than a fair random guessing of two classes. Lastly, F1-score is a weighted average of precision and recall. It is useful when there is an unbalanced distribution of the cases. The details on all evaluation measures of our study are provided in the Supplement A in S1 File.

Ethics approval

Demographic and Health Survey Data collection procedures for the 2017–18 BDHS were approved by the Institutional Review Boards of the ICF International, Rockville, MD, USA, and Bangladesh Medical Research Council, Dhaka, Bangladesh. Informed consent was obtained from all respondents in the survey before asking questions, and separately before obtaining biomarker and anthropometric measurements. Respondents who did not provide consent were excluded from the analysis for the current study.

Results

Descriptive statistics of study participants

Table 1 shows the descriptive statistics (mean or percentage) and bivariate analysis results of women’s BMI, underweight, overweight, and obesity status by their socioeconomic, health, and demographic characteristics from BDHS 2017–18. Estimates are reported incorporating sampling weights from the survey and adjusting for strata and clusters in the sample. The average BMI of women with higher education, higher socioeconomic levels, white-collar jobs, highly educated partners, piped water, or living in urban areas is higher than that of women with other characteristics. On average, the sampled women have a BMI of 23.5 kg/m2, ranging between 22.0 and 25.4 kg/m2. Bivariate analyses show a significant difference (p-value < 0.001) in average BMI across all features except contraception.

thumbnail
Table 1. Bivariate analysis between ever-married women’s nutritional outcomes, socioeconomic, health, and demographic characteristics from BDHS 2017–18.

https://doi.org/10.1371/journal.pone.0277738.t001

Overall, about 27% of the sampled women appear to be overweight. This class is significantly (p<0.05) associated with all features except for current contraception use, permission to go to medical, exposure to family planning messages, and religion. Striking disparities are observed in the prevalence of underweight across different subgroups, such as women with higher education (34%) and no education (22%), those from high SES (36%) and low SES (18%), and breastfeeding (16%) versus non-breastfeeding women (30%). Most of the features considered in this study have a statistical association with obesity. Nevertheless, obesity does not appear to be associated with contraception, family planning, or religion, as is the case in overweight. In addition, obesity is not significantly associated with the number of family members, abstinence, and amenorrhea. Obesity is more prevalent among women with higher education (11%) and those from wealthy families (12%). Women from low socioeconomic backgrounds and those without any education have a lower prevalence (4% and 3%, respectively). A higher percentage of urban women are overweight (34%) and obese (11%) compared to their rural counterparts (24% and 5%, respectively). Conversely, rural women tend to be more underweight than their urban counterparts (12% versus 8%).

Machine learning algorithm specifications

Table 2 summarizes the ML algorithms used in this study. This study determines the optimal parameter for the algorithms using 5-fold and 10-fold cross-validations to avoid overfitting.

The following sections emphasize performance evaluation measures obtained from the OOS 10-fold cross-validation since the 5-fold results are almost identical. All 5-fold cross-validation results are reported in Supplement B in S1 File (see SB-1 to SB-6 Tables in S1 File).

Evaluation measures

BMI prediction.

Table 3 presents the BMI prediction results from all the regression methods investigated in this study. This study uses R2, RMSE, and MAE, which are widely used metrics to evaluate prediction error rates and model performance of regression methods. Table 3 suggests that SVM and KNN are the best-performing models for BMI prediction as they rank first and second in the aforementioned measures. Specifically, the R2 values of SVM and KNN exceed that of LR by 8.6% and 5.5%, respectively. These findings justify the use of SML in this study to predict women’s BMI.

thumbnail
Table 3. Performance of different regression algorithms in BMI prediction using 10-fold cross-validation.

https://doi.org/10.1371/journal.pone.0277738.t003

Classification of nutritional status.

Fig 2 compares the predictive performance of classification algorithms for nutritional statuses based on the following four metrics: specificity, Cohen’s kappa, AUC, and F1-score. The combined predictor methods, namely ADB, XGB, and RF, show the best performance across all three nutritional status categories by yielding the highest values for all performance metrics considered in this study. Among these three, ADB produces the highest values in the following performance categories: F1-score, Cohen’s kappa, and specificity. RF achieved the highest AUC across all three classes while achieving the second-highest score across the remaining three metrics. XGB ranks third by producing the second-highest AUC as well as the third-best scores for the remaining three metrics. In underweight classification, the F1-scores of combined predictors outperform LTR by 42.3%, 33.3%, and 17.4%, respectively. Besides those, CT outperforms other methods in most instances except for producing slightly lower specificity values in overweight and obesity classifications. DT and SVM outperform CT in specificity by 0.8% and 6.4%, respectively, in overweight and underweight classifications.

thumbnail
Fig 2. Performance evaluation of different classification algorithms in predicting nutritional status using 10-fold cross-validation.

Here, LTR = Logistic regression; LLTR = LASSO logistic regression; DT = Decision tree; RF = Random forest; CT = Conditional inference tree; ADB = Adaptive boosting; XGB = eXtreme Gradient Boosting; KNN = K- nearest neighbor; SVM = Support vector machine; NN = Neural network; NB = Naïve Bayes.

https://doi.org/10.1371/journal.pone.0277738.g002

Sensitivity analysis.

A sensitivity analysis is conducted using the Asian BMI cutoffs for malnutrition outcomes defined in the variable description. This analysis excludes the underweight class since the standard is the same as WHO. Fig 3 illustrates that the performance rankings of the classifiers are the same for Asian and WHO standards, although the overweight and obesity cutoffs are different. Therefore, the ensemble methods examined in this study demonstrate robust predictive performance against different malnutrition benchmarks.

thumbnail
Fig 3. Performance evaluation of different classification algorithms in predicting nutritional status based on Asian cutoff using 10-fold cross-validation.

Here, LTR = Logistic regression; LLTR = LASSO logistic regression; DT = Decision tree; RF = Random forest; CT = Conditional inference tree; ADB = Adaptive boosting; XGB = eXtreme Gradient Boosting; KNN = K- nearest neighbor; SVM = Support vector machine; NN = Neural network; NB = Naïve Bayes.

https://doi.org/10.1371/journal.pone.0277738.g003

Variable importance

Table 4 shows the ranking of variables in terms of their importance in predicting women’s nutritional outcomes based on the specified SML algorithms. The features from each method are assigned a rank based on the variable importance summary from the respective algorithm. Any variable with a score of 1 is the most important feature in predicting an outcome. In the last column of the table, the average scores are reported for each variable. The lower the average score, the greater the importance. Considering that not all methods rank the same features, the left-hand side of the table lists more than five features for each nutritional outcome. For example, the variable importance from RT suggests that SES, partner’s educational attainment, age, residence, and breastfeeding practice are the top five features in predicting BMI. However, SES, breastfeeding practice, media access, residence, and entries in the immunization roster appear to be the top five features from the RF. Each algorithm has a unique method for determining the order of importance, which is why they differ in rank. Thus, while the breastfeeding practice of women is the second most important variable in predicting BMI from RF, it is ranked tenth in LR. Nevertheless, breastfeeding practice is an essential feature for predicting BMI as it ranks third in the average value.

thumbnail
Table 4. Feature importance ranking from the SML algorithms.

https://doi.org/10.1371/journal.pone.0277738.t004

Based on average scores, SES, women’s age, breastfeeding practice, partner’s educational attainment, and media access are the top five important features in predicting BMI among women. Similarly, the prediction for the underweight class is influenced by SES, age, financial assistance to medical facilities, contraceptive methods, and breastfeeding. Additionally, SES, breastfeeding practice, partner’s age, partner’s educational attainment, and women’s age are the five factors determining whether a woman will be overweight. Lastly, SES, media access, residence, partner’s employment, and women’s age are the top five variables associated with obesity. In summary, the SES, the age of the woman, and the breastfeeding practice of the woman influence the woman’s nutritional status.

Conditional inference trees

In this section, we discuss CTs generated in BMI prediction and nutritional status classification. For this, CT is chosen over CART because it provides statistical significance for both prediction and categorization. However, the visualization of the tree from the CART is presented as supplementary figures (see Supplement C in S1 File). Each CT begins with a binary split of SES. This indicates that SES is the most important feature, which is supported by Table 4.

The CT in Fig 4 shows how socioeconomic, demographic, and health variables interact to predict BMI. Nodes 14 and 15 show that the partner’s educational attainment determines the BMI of women at least 26 years old from high SES. The latter node states that the average BMI is estimated to be around 26 kg/m2 if their partners finished secondary or higher education and around 24 kg/m2 otherwise. Breastfeeding practices also influence women’s BMI, and breastfeeding women’s BMI is generally lower than those of non-breastfeeding women. Additionally, their BMI are also affected by age, as nodes 7 and 8 indicate that the BMI of women from low or middle SES who are between 18 and 25 years old is lower than those over 26 years old. Furthermore, non-breastfeeding women from low SES are expected to have a lower BMI than women from middle SES.

thumbnail
Fig 4. Conditional inference tree for BMI prediction using 10-fold cross-validation.

Each terminal node of this algorithm provides the average predicted value, the number of observations used, and the sum of squared error. NE, PE, SE, and HE represent no education, primary education, secondary education, and higher degree (such as college), respectively.

https://doi.org/10.1371/journal.pone.0277738.g004

The CT based on underweight classification is shown in Fig 5. Nodes 4 and 5 indicate that women from low SES are classified as underweight regardless of whether they use contraception. The breastfeeding practice of women from middle and high SES determines their underweight status, as seen in nodes 7, 8, 11, and 12. The latter group consists of people between the ages of 18 and 25. Breastfeeding women are categorized as underweight, whereas non-breastfeeding women are the opposite. Finally, women over 26 years from high SES are not classified as underweight, regardless of the number of children.

thumbnail
Fig 5. Conditional inference decision tree for underweight using 10-fold cross-validation.

Each terminal node of this algorithm provides the underweight status (Yes or No), the number of observations used, and the classification error.

https://doi.org/10.1371/journal.pone.0277738.g005

Fig 6 shows the CT for overweight classification. According to nodes 4, 5, 7, and 8, women from low SES are not classified as overweight regardless of whether they breastfeed or their educational level. Women from middle or high SES are classified as overweight if they live with partners at least 31 years old. Additionally, the educational attainment of their partner does not influence the classification outcome.

thumbnail
Fig 6. Conditional inference decision tree for overweight using 10-fold cross-validation.

The categories of the education variable are the same as those of partner’s education. Each terminal node of this algorithm provides the overweight status (Yes or No), the number of observations used, and the classification error.

https://doi.org/10.1371/journal.pone.0277738.g006

Fig 7 illustrates the CT for obesity classification. Nodes 4, 5, 7, and 8 show that women from middle and low SES are not categorized as obese regardless of their media access or breastfeeding practices. According to node 12, women whose partners hold either white-collar jobs or are unemployed are classified as obese, while women whose partners have blue-collar jobs are classified oppositely. The remaining two terminal nodes state that women over the age of 26 from high SES are likely to be obese.

thumbnail
Fig 7. Conditional inference decision tree for obesity using 10-fold cross-validation.

On the partner’s employment variable, BC, WC, and UNEMP respectively represent blue-collar job, white-collar-job, and unemployed. Each terminal node of this algorithm provides the obesity status (Yes or No), the number of observations used, and the classification error.

https://doi.org/10.1371/journal.pone.0277738.g007

Discussion

Our initial descriptive statistics suggest that the BMI values of sampled women range between 22.0 and 25.4 kg/m2, and the average is 23.5 kg/m2. Our bivariate analyses also indicate that BMI is significantly and positively associated with age and educational attainment (of both women and their partners), which is in line with recent studies from Bangladesh [8082]. We find that breastfeeding women have a lower BMI than their non-breastfeeding counterparts, which (i.e., the protective effect of breastfeeding against high BMI) is also in line with studies in Bangladesh and elsewhere [8386].

Currently, about 27% of the sampled women appear to be overweight and about 7% obese, compared with only 11% being underweight. This nutrition transition among women of childbearing age in Bangladesh from undernutrition to overnutrition was identified in the mid-2010s [87, 88], and now it is established that the proportion of women being overweight/obese in Bangladesh surpassed the proportion of women underweight in around 2010 [89]. Both from nationally representative BDHS data and findings from other recent studies support this transition [9092]. From our bivariate analyses, the estimated prevalence of overweight is nearly double among women from high SES, women who did not breastfeed their children, and older (i.e., age 26 years or over) compared to women from low SES, women who breastfeed their children, and age 25 years or less, respectively. In addition, the prevalence of obesity is nearly four times higher among women from high SES compared to women from low SES. Women with higher education and older (i.e., aged 26 years or above) have a three times higher prevalence of obesity compared to those without education and aged 25 years or less, respectively. The percentage of obesity among women who do not breastfeed their children is twice that of those who do. Recent studies in Bangladesh also confirm that age, SES, and educational attainment are statistically significant predictors of overweight and obesity among women of childbearing age [80, 82, 9395].

In our BMI prediction, the top two algorithms are SVM and KNN in terms of R2, RMSE, and MAE. Specifically, the R2 values of SVM and KNN exceed that of LR by 8.6% and 5.5%, respectively. In underweight, overweight, and obesity classifications, the combined predictor algorithms consistently yield top specificity, Cohen’s kappa, F1-score, and AUC, and their performance is robust across different outcome standards. We compare our results with those from previous studies that applied SML to classify different health outcomes (see SD-1 Table in S1 File). Our classification models for underweight and overweight outperform those of other studies in terms of accuracy values (94% and 84%). Additionally, the AUCs from all three classifications (96.2%, 89.3%, and 98.3%) of this paper based on RF exceed those of other studies reported in SD-1 Table in S1 File. All these results verify the strength of our models. Figueroa and Flores achieved a slightly better accuracy (97.36%) in obesity classification based on electronic medical records using SVM [61]. Although their study used a much larger sample size and a different cohort, the accuracy of the classification model is very comparable, demonstrating the generalization of the training model and the representative power of the samples.

Feature importance ranking based on selected algorithms suggests that SES, women’s age, and breastfeeding status are strongly associated with nutritional outcomes. Specifically, SES is the most important as it ranks first across all methods. The CT for BMI prediction using 10-fold cross-validation demonstrates that the average predicted BMI value among women of childbearing age in Bangladesh increases significantly (p<0.001) with SES (i.e., low vs. middle and low+middle vs. high), age (i.e., 18–25 vs. 25+ years), and breastfeeding practices (i.e., women who breastfeed their babies vs. who do not). Also, among the women from high SES, their partner’s educational attainment (i.e., no or primary education vs. secondary education or higher) significantly increases the predicted BMI value (p<0.001). CTs in our study have shown that SES strongly influences nutritional outcomes among women of childbearing age in Bangladesh. Additionally, both the age of women and breastfeeding practice appeared consistently on the trees, indicating that these variables are also important in predicting nutrition outcomes. Furthermore, the partner’s educational attainment influences BMI prediction and overweight classification. Women’s obesity risk is associated with their partner’s employment and media access. These findings correspond with the results from previous studies in Bangladesh and elsewhere [15, 82, 96].

Strength and limitations

This study makes several notable contributions to the existing literature on BMI and malnutrition outcomes in women of childbearing age. First, unlike most previous studies that utilized only standard LR or ordinal categorical response models to estimate the associations of socioeconomic determinants with malnutrition outcomes in women, this study implements a variety of SML methods to predict the BMI value as well as nutritional status, namely underweight, overweight, and obesity. To our knowledge, as of now, only two studies attempted to predict the risks of malnutrition outcomes in adults in Bangladesh [64, 65]. Nevertheless, neither predicted the BMI outcome nor defined how features interact and lead to prediction, aside from evaluating the performance of models. Additionally, the study by Ferdowsy and colleagues [64] had limited generalizability due to its small sample size (n = 1,100), and the study by Islam and colleagues [65] used data from an older round (i.e., 2014) of BDHS [64].,. We overcome these issues by utilizing data from the most recent wave of BDHS (i.e., 2017–18), a nationally representative survey with about 15,000 observations, and by exploring how hidden interactions among features influence the prediction of key outcomes using tree-based approaches. Hence our findings are generalizable at the national level and provide updated information for policy and programmatic considerations.

Second, the feature selection used in our study eliminates unimportant regressors, shortens the computational time and cost, and strengthens the accuracy of the predictive model. A number of studies identified the important risk factors for underweight and overweight/obesity status in women from traditional logistic or multinomial logistic regression [65, 97, 98]. They selected those features that appeared significant in the corresponding regression models. Unlike them, this study pins important features and ranks them using SML algorithms. SML feature selection algorithms are deemed robust for choosing features in the model [99, 100].

Third, due to a small number of observations under the obesity category, all previous studies using BDHS data modeled overweight and obesity as a single indicator variable [15, 65, 82, 101103]. This study is the first that attempts to model them separately to unmask the risk associated with obesity and distinguish it from that of being overweight. We overcome the issue of an unbalanced distribution of the obesity category (i.e., the percentage of obese women in this study sample is only about 7%, which can potentially lead to biased and unreliable results while using SML) by generating a synthetic balanced sample proposed by Menardi and colleagues [75, 104].

Lastly, this study favors CT over CART since the former has some relative advantages. CART is the typical approach to generating a tree that maps the features to the predicted value. CT builds the tree using a conditional distribution, which captures the relationship between features and outcomes. Unlike CART, this approach creates a tree by conducting the variable independence test. By examining the conditional distribution, this test identifies whether a particular feature significantly predicts both continuous and categorical outcomes.

In terms of limitations, the scope of our analyses relies on variables included in the BDHS cross-sectional dataset. The performance of our predictions and feature importance rankings can potentially be enhanced by adding relevant covariates (e.g., dietary practices by women of childbearing age), which are currently not in the BDHS dataset. Also, the wealth index used in this analysis as a socioeconomic feature can potentially cause misclassification bias in the measurement of SES. Several studies have discussed the limitations of the wealth index in the DHS and its potential for misclassification bias [105107]. The wealth index is constructed based on household assets and amenities [108]. While it is a well-accepted and useful proxy for wealth and economic status, it may not capture other important dimensions of socioeconomic status, such as income, education, and occupation. Finally, this paper focuses on Bangladeshi childbearing women aged from 18 to 49 years. For this reason, we excluded 5,365 sampled respondents from our empirical analyses who were either under the age of 18, or with missing data, or considered irrelevant observations/outliers based on Cook’s distance to prevent the loss of prediction strength captured by the R2 [109]. All observations with Cook’s distance greater than 4/n (n is the number of rows) after running a linear regression are taken out from the analysis as they are considered outliers [110]. The regression result is improved after eliminating them as the R2 enhances to 22% from 17.5%, which was the R2 value with outliers. Exclusion of the sampled respondents, particularly for missing values, can potentially cause selection bias [111].

Conclusion

To the best of our knowledge, this is the first study that predicts BMI and one of the pioneer studies to classify all three malnutrition outcomes for women of childbearing age in Bangladesh, let alone in any lower-middle income country, using SML techniques. Moreover, our paper is the first to identify and rank a set of socioeconomic, health, and demographic features that are critical in predicting nutritional measures using several SML feature selection algorithms. The estimators from this study predict the outcomes of interest most accurately and efficiently compared to other existing studies in the relevant literature. Therefore, study findings can aid policymakers in designing policy and programmatic approaches to address the double burden of malnutrition among Bangladeshi women, thereby reducing the country’s economic burden. Moreover, our findings would be valuable for identifying women at risk for any form of malnutrition to design and implement tailored interventions in the future.

The policy documents on women’s health and nutrition in Bangladesh largely focus on the undernutrition aspect of malnutrition [89]. The National Nutrition Policy 2015 identified ‘reducing maternal overweight’ to be critical for achieving optimal nutrition, and the National Strategy for Maternal Health 2019–2030 acknowledged the increasing trends in overweight and obesity, but neither provided any specific programmatic directions for addressing overweight and obesity among women of childbearing age in the country [112]. The Second National Plan of Action for Nutrition (NPAN-2) 2016–2025 has outlined some activities for prevention and control of obesity and non-communicable diseases, without specifying the target groups for each of these activities [89, 113]. Based on the findings of this study and the ‘best practices’ for reducing overweight and obesity in other countries from South and Southeast Asia, we conclude with the following policy and programmatic implications:

  1. Scaling up breastfeeding: the benefits of breastfeeding for a healthier weight and postpartum weight loss have been well-documented [86, 114, 115]. However, breastfeeding practice is still not optimum in Bangladesh, and women face a range of barriers at the individual, societal, and system levels that include adverse effects of breastfeeding on maternal health, nutrition, and physical appearance [116]. Proven interventions for scaling up breastfeeding in Bangladesh and other Asian countries, which combined intensified interpersonal counseling, mass media campaigns, and community mobilization interventions [117], need to be implemented not only for ensuring nutrition for children but also as an effective approach for healthy BMI for women of childbearing age.
  2. Designing culturally adapted community-based physical activity interventions for women of childbearing age: existing evidence indicates that culturally adapted physical activity interventions tended to be more accepted by the target community, particularly for women, and be effective in reducing weight among South Asian adults [118, 119]. Using participatory approaches to identify relevant cultural attitudes and norms to inform the design of community-based physical activity interventions would improve physical activity uptake and/or adherence for older, better-off women. The government’s National Nutrition Services can consider building partnership with the private/non-governmental sector for urban areas and recruiting community health workers in rural areas to promote and sustain physical activities among women of childbearing age since studies found that trained local facilitators help overcome the barriers to physical intervention uptake in among Bangladeshi and other South Asian descendants [118, 120, 121].
  3. Implementing behavior change communications for diet and lifestyle modifications for urban, high SES adults: studies indicate that routine consultations and lifestyle advice from physicians are ineffective in modifications in diet and lifestyle that result in viable weight loss [122]. However, a study on individuals of South Asian descent living in the United Kingdom found that family-based lifestyle intervention consisting of about 15 visits from a dietitian over three years resulted in significantly greater weight loss than did annual contact and simple lifestyle advice from a dietitian [123]. In our analyses, both RT and RF highlighted high SES and living in urban areas to be among the top five features in predicting BMI. For this reason, family-based lifestyle intervention involving dieticians can be explored for urban, affluent women.
  4. Implementing recent legislations on nutrition: the Government of Bangladesh has published several regulations to promote healthy diets (viz., Food Safety [Labelling] Regulations 2017, Breastmilk Substitute [BMS] Rules 2017, Limiting Trans Fatty Acids in Foodstuffs Regulations 2021) in the recent years. These regulations need to be implemented to control the marketing of junk/packaged foods and sugary beverages and their consumption.
  5. Setting up an effective nutrition surveillance system: both the National Nutrition Policy 2015 and NPAN-2 acknowledged the need for setting up a nutrition surveillance system to track nutritional status and identify locations for targeted nutrition programs [113, 124]. The government and other relevant stakeholders in the country need to work together and set up a community-based surveillance system with adequate coverage without delay. We agree with Khan and colleagues that tracking malnutrition is particularly crucial for adolescent girls, who remain vulnerable to both forms of malnutrition [89].

Acknowledgments

We acknowledge the institutions and individuals who made the 2017–18 Bangladesh Demographic and Health Survey possible. The 2017–18 BDHS was implemented under the authority of the National Institute of Population Research and Training (NIPORT) of the Government of the People’s Republic of Bangladesh with financial assistance from USAID/Bangladesh. The 2017–18 BDHS data were collected and processed by Mitra & Associates. Lastly, the authors would like to thank the participants of the Bangladesh Demographic and Health Surveys who participated in this study.

References

  1. 1. di Cesare M, Bentham J, Stevens GA, Zhou B, Danaei G, Lu Y, et al. Trends in adult body-mass index in 200 countries from 1975 to 2014: A pooled analysis of 1698 population-based measurement studies with 19.2 million participants. The Lancet. 2016;387: 1377–1396. pmid:27115820
  2. 2. World Health Organization (WHO). The double burden of malnutrition: Policy Brief. Geneva; 2016.
  3. 3. Black RE, Victora CG, Walker SP, Bhutta ZA, Christian P, de Onis M, et al. Maternal and child undernutrition and overweight in low-income and middle-income countries. The Lancet. Elsevier B.V.; 2013. pp. 427–451. pmid:23746772
  4. 4. Waxman A. WHO global strategy on diet, physical activity and health. Food Nutr Bull. 2004;25: 292–302. pmid:15460274
  5. 5. Hasan MT, Magalhães RJS, Williams GM, Mamun AA. Long-term changes in childhood malnutrition are associated with long-term changes in maternal BMI: Evidence from Bangladesh, 1996–2011. American Journal of Clinical Nutrition. 2016;104: 1121–1127. pmid:27581468
  6. 6. Moran PS, Wuytack F, Turner M, Normand C, Brown S, Begley C, et al. Economic burden of maternal morbidity–A systematic review of cost-of-illness studies. PLoS One. 2020;15. pmid:31945775
  7. 7. Shree A, Narayana Murthy MR. Impact of malnutrition on scholastic performance among school children in Mysuru. Clin Epidemiol Glob Health. 2021;11: 1–4.
  8. 8. Shekar M, Heaver Richard, Lee Y-Kyoung. Repositioning Nutrition as Central to Development: A Strategy for Large-Scale Action. Washington, DC: World Bank; 2005.
  9. 9. Cawley J, Meyerhoefer C. The medical care costs of obesity: An instrumental variables approach. J Health Econ. 2012;31: 219–230. pmid:22094013
  10. 10. Fitzgerald S, Kirby A, Murphy A, Geaney F. Obesity, diet quality and absenteeism in a working population. Public Health Nutr. 2016;19: 3287–3295. pmid:27230727
  11. 11. Hoddinott J. The economics of reducing malnutrition in Sub-Saharan Africa. Ithaca, NY; 2016.
  12. 12. World Bank. Databank. In: World Development Indicators [Internet]. 2021 [cited 19 Oct 2022]. Available: https://data.worldbank.org/
  13. 13. World Health Organization (WHO). Trends in maternal mortality: 1990 to 2015: Estimates by WHO, UNICEF, UNFPA, World Bank Group and the United Nations Population Division. Geneva; 2015.
  14. 14. National Institute of Population Research and Training (NIPORT), ICF. Bangladesh Demographic and Health Survey 2017–18: Key Indicators. Dhaka, Bangladesh, and Rockville, Maryland, USA; 2019. Available: http://www.niport.gov.bd;
  15. 15. Sarma H, Saquib N, Hasan MM, Saquib J, Rahman AS, Khan JR, et al. Determinants of overweight or obesity among ever-married adult women in Bangladesh. BMC Obes. 2016;3. pmid:26962459
  16. 16. Kamal SMM, Hassan CH, Alam GM. Dual burden of underweight and overweight among women in Bangladesh: Patterns, prevalence, and sociodemographic correlates. J Health Popul Nutr. 2015;33: 92–105. Available: https://www.researchgate.net/publication/277082739 pmid:25995726
  17. 17. Hu FB. Overweight and Obesity in Women: Health Risks and Consequences. J Womens Health. 2003;12: 163–172. pmid:12737715
  18. 18. Lash MM, Armstrong A. Impact of obesity on women’s health. Fertil Steril. 2009;91: 1712–1716. pmid:18410940
  19. 19. Deshmukh VL, Jadhav M, Yelikar K. Impact of HIGH BMI on Pregnancy: Maternal and Foetal Outcome. Journal of Obstetrics and Gynecology of India. 2016;66: 192–197. pmid:27651602
  20. 20. Wells JC, Sawaya AL, Wibaek R, Mwangome M, Poullas MS, Yajnik CS, et al. The double burden of malnutrition: aetiological pathways and consequences for health. The Lancet. Lancet Publishing Group; 2020. pp. 75–88. pmid:31852605
  21. 21. Kahn HS, Wliamson DF, Stevens JA. Race and Weight Change in US Women: The Roles of Socioeconomic and Marital Status. Am J Public Health. 1991;81: 319–323. pmid:2036117
  22. 22. Janghorbani M, Amini M, Willett WC, Gouya MM, Delavari A, Alikhani S, et al. First Nationwide Survey of Prevalence of Overweight, Underweight, and Abdominal Obesity in Iranian Adults. 2007.
  23. 23. Malcolm M, Kaya I. Selection works both ways: BMI and marital formation among young women. Rev Econ Househ. 2016;14: 293–311.
  24. 24. Freire WB, Waters WF, Román D, Jiménez E, Burgos E, Belmont P. Overweight, obesity, and food consumption in Galapagos, Ecuador: A window on the world. Global Health. 2018;14. pmid:30208909
  25. 25. Whitaker RC. Predicting preschooler obesity at birth: the role of maternal obesity in early pregnancy. Pediatrics. 2004;114. pmid:15231970
  26. 26. Costa-Font J, Fabbri D, Gil J. Decomposing Bodymass Index gaps between Mediterranean countries: A Counterfactual Quantile Regression Analysis*. 2008. Available: http://www.york.ac.uk/res/herc/research/hedg/wp.htm
  27. 27. Davies S. Promoting healthy diets and physical activity: a European dimension for the prevention of overweight, obesity and chronic diseases. Luxembourg; 2006.
  28. 28. Lindeboom M, Lundborg P, van der Klaauw B. Assessing the impact of obesity on labor market outcomes. Econ Hum Biol. 2010;8: 309–319. pmid:20864420
  29. 29. Sobal J. Sociological analysis of the stigmatization of obesity. 2nd Edition. In: Germov J, Williams L, editors. A Sociology of Food and Nutrition: The Social Appetite. 2nd Edition. Melbourne: Oxford University Press; 2004.
  30. 30. Puhl R, Brownell KD. Bias, Discrimination, and Obesity. Obes Res. 2001;9: 788–805. pmid:11743063
  31. 31. Hamermesh DS, Biddle JE. Beauty and the Labor Market. Am Econ Rev. 1994;84: 1174–1194.
  32. 32. Baum CL, Ford WF. The wage effects of obesity: A longitudinal study. Health Econ. 2004;13: 885–899. pmid:15362180
  33. 33. Rooth D-O. Obesity, Attractiveness, and Differential Treatment in Hiring: A Field Experiment. J Hum Resour. 2009;44: 710–735. Available: https://www.jstor.org/stable/20648914
  34. 34. Cawley J. The Impact of Obesity on Wages. J Hum Resour. 2004;39: 451–474.
  35. 35. Baum CL. The effects of cigarette costs on BMI and obesity. Health Econ. 2009;18: 3–19. pmid:18241088
  36. 36. French MT, Norton EC, Fang H, Maclean JC. Alcohol consumption and body weight. Health Econ. 2010;19: 814–832. pmid:19548203
  37. 37. Cawley J. An economy of scales: A selective review of obesity’s economic causes, consequences, and solutions. J Health Econ. 2015;43: 244–268. pmid:26279519
  38. 38. Abdulai A. Socio-economic characteristics and obesity in underdeveloped economies: Does income really matter? Appl Econ. 2010;42: 157–169.
  39. 39. Sahn DE. Weights on the rise: Where and for whom? J Econ Inequal. 2009;7: 351–370.
  40. 40. Crane-Droesch A. Technology diffusion, outcome variability, and social learning: Evidence from a field experiment in Kenya. Am J Agric Econ. 2018;100: 955–974.
  41. 41. dos Santos BS, Steiner MTA, Fenerich AT, Lima RHP. Data mining and machine learning techniques applied to public health problems: A bibliometric analysis from 2009 to 2018. Comput Ind Eng. 2019;138.
  42. 42. Ghoddusi H, Creamer GG, Rafizadeh N. Machine learning in energy economics and finance: A review. Energy Econ. 2019;81: 709–727.
  43. 43. März A, Klein N, Kneib T, Mußhoff O. Analysing farmland rental rates using Bayesian geoadditive quantile regression. European Review of Agricultural Economics. 2016; 663–98. https://doi.org/10.1093/erae/jbv028
  44. 44. Yu L, Zhao Y, Tang L. A compressed sensing based AI learning paradigm for crude oil price forecasting. Energy Econ. 2014;46: 236–245.
  45. 45. Athey S. The Impact of Machine Learning on Economics. In: Agrawal A, Gans J, Goldfarb A, editors. The Economics of Artificial Intelligence: An Agenda. Chicago, IL: University of Chicago Press; 2019. pp. 507–547.
  46. 46. Athey S, Imbens GW. Machine Learning Methods that Economists Should Know About. Annu Rev Econom. 2019;11: 685–725.
  47. 47. Kleinberg J, Lakkaraju H, Leskovec J, Ludwig J, Mullainathan S. Human decisions and machine predictions. Q J Econ. 2018;133: 237–293. 10.1093/qje/qjx032/4095198/Human-Decisions-and-Machine-Predictions pmid:29755141
  48. 48. Athey S. Beyond prediction: Using big data for policy problems. Science. 3 Feb 2017355: 483–485. pmid:28154050
  49. 49. Mullainathan S, Spiess J. Machine learning: An applied econometric approach. Journal of Economic Perspectives. American Economic Association; 2017. pp. 87–106.
  50. 50. Varian HR. Big data: New tricks for econometrics. Journal of Economic Perspectives. 2014;28: 3–28.
  51. 51. Wuthrich K, Zhu Y. Omitted variable bias of Lasso-based inference methods: A finite sample analysis. Rev Econ Stat. 2020; 1–47. Available: http://arxiv.org/abs/1903.08704
  52. 52. Carmona P, Climent F, Momparler A. Predicting failure in the U.S. banking sector: An extreme gradient boosting approach. International Review of Economics and Finance. 2019;61: 304–323.
  53. 53. Bajari P, Nekipelov D, Ryan SP, Yang M. Machine learning methods for demand estimation. American Economic Review. 2015;105: 481–485.
  54. 54. Angrist JD, Frandsen B. Machine Labor. J Labor Econ. 2022;40: S97–S140.
  55. 55. Daoud A, Kim R, Subramanian S v. Predicting women’s height from their socioeconomic status: A machine learning approach. Soc Sci Med. 2019;238. pmid:31470245
  56. 56. Saha S, Narayanan S. A simplified measure of nutritional empowerment using machine learning to abbreviate the Women’s Empowerment in Nutrition Index (WENI). World Dev. 2022;154. pmid:35664956
  57. 57. DeGregory KW, Kuiper P, DeSilvio T, Pleuss JD, Miller R, Roginski JW, et al. A review of machine learning in obesity. Obesity Reviews. Blackwell Publishing Ltd; 2018. pp. 668–685. pmid:29426065
  58. 58. Lee BJ, Kim KH, Ku B, Jang JS, Kim JY. Prediction of body mass index status from voice signals based on machine learning for automated medical applications. Artif Intell Med. 2013;58: 51–61. pmid:23453267
  59. 59. Pascali MA, Giorgi D, Bastiani L, Buzzigoli E, Henriquez P, Matuszewski BJ, et al. Face morphology: Can it tell us something about body weight and fat? Comput Biol Med. 2016;76: 238–249. pmid:27504744
  60. 60. Nguyen QC, Li D, Meng HW, Kath S, Nsoesie E, Li F, et al. Building a national neighborhood dataset from geotagged twitter data for indicators of happiness, diet, and physical activity. JMIR Public Health Surveill. 2016;2. pmid:27751984
  61. 61. Figueroa RL, Flores CA. Extracting Information from Electronic Medical Records to Identify the Obesity Status of a Patient Based on Comorbidities and Bodyweight Measures. J Med Syst. 2016;40. pmid:27402260
  62. 62. Dunstan J, Aguirre M, Bastías M, Nau C, Glass TA, Tobar F. Predicting nationwide obesity from food sales using machine learning. Health Informatics J. 2020;26: 652–663. pmid:31106648
  63. 63. Yadaw AS, chak Li Y, Bose S, Iyengar R, Bunyavanich S, Pandey G. Clinical features of COVID-19 mortality: development and validation of a clinical prediction model. Lancet Digit Health. 2020;2: e516–e525. pmid:32984797
  64. 64. Ferdowsy F, Rahi KSA, Jabiullah MI, Habib MT. A machine learning approach for obesity risk prediction. Current Research in Behavioral Sciences. 2021;2: 100053.
  65. 65. Islam MM, Rahman MJ, Islam MM, Roy DC, Ahmed NAMF, Hussain S, et al. Application of machine learning based algorithm for prediction of malnutrition among women in Bangladesh. International Journal of Cognitive Computing in Engineering. 2022;3: 46–57.
  66. 66. MacKinnon JG, Smith AA. Approximate bias correction in econometrics. J Econom. 1998;85: 205–230.
  67. 67. Filmer D, Pritchett LH. Estimating wealth effects without expenditure data—or tears: an application to educational enrollments in states of India. Demography. 2001;38: 115–132. pmid:11227840
  68. 68. Vespa J. The Changing Economics and Demographics of Young Adulthood: 1975–2016. Washinton, DC; 2017. Available: www.census.gov/programs-surveys/cps.html
  69. 69. World Health Organization (WHO). Obesity: Preventing and managing the global epidemic. Geneva; 2000.
  70. 70. Nishida C, Barba C, Cavalli-Sforza T, Cutter J, Deurenberg P, Darnton-Hill I, et al. Appropriate body-mass index for Asian populations and its implications for policy and intervention strategies. The Lancet. 2004;363: 157–163. pmid:14726171
  71. 71. Rutstein SO, Johnson KB. The DHS Wealth Index. Calverton, MD; 2004.
  72. 72. Hossain MI, Alam NE, Akter S, Suriea U, Aktar S, Shifat SK, et al. Knowledge, awareness and preventive practices of dengue outbreak in Bangladesh: A countrywide study. PLoS One. 2021;16. pmid:34111157
  73. 73. Rao JN, Scott AJ. On chi-squared tests for multiway contingency tables with cell proportions estimated from survey data. The Annals of Statistics. 1984;12: 46–60.
  74. 74. Rao JNK, Scott AJ. The Analysis of Categorical Data From Complex Sample Surveys: Chi-Squared Tests for Goodness of Fit and Independence in Two-Way Tables. J Am Stat Assoc. 1981;76: 221–230.
  75. 75. Lunardon N, Menardi G, Torelli N. ROSE: A Package for Binary Imbalanced Learning. R J. 2014;6: 79–89.
  76. 76. John GH, Langley P. Estimating Continuous Distributions in Bayesian Classifiers. In: Besnard P, Hanks S, editors. Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence. San Mateo: Morgan Kaufmann Publishers; 1995. pp. 338–345. Available: http://robotics.stanford.edu/-gjohn/
  77. 77. Bouckaert RR. Naive bayes classifiers that perform well with continuous variables. Australasian Joint Conference on Artificial Intelligence. Heidelberg: Springer; 2004. pp. 1089–1094.
  78. 78. al Daoud E, Turabieh H. New empirical nonparametric kernels for support vector machine classification. Applied Soft Computing Journal. 2013;13: 1759–1765.
  79. 79. Landis JR, Koch GG. The Measurement of Observer Agreement for Categorical Data. Biometrics. 1977;33: 159–174. Available: https://about.jstor.org/terms pmid:843571
  80. 80. Tanwi TS, Chakrabarty S, Hasanuzzaman S, Saltmarsh S, Winn S. Socioeconomic correlates of overweight and obesity among ever-married urban women in Bangladesh. BMC Public Health. 2019;19: 1–7. pmid:31253123
  81. 81. Islam MR, Hossain MS, Khan MMA, Rahman MS. Trends in body mass index among ever-married Bangladeshi women, 2004–14: Evidence from nationally representative population-based surveys. J Nutr Sci. 2021;10: 1–8. pmid:34094509
  82. 82. Hossain S, Khudri MM, Banik R. Regional education and wealth-related inequalities in malnutrition among women in Bangladesh. Public Health Nutr. 2022;25: 1639–1657. pmid:34482847
  83. 83. Ayesha U, Mamun ASMA, Sayem MA, Hossain MG. Factors associated with duration of breastfeeding in Bangladesh: evidence from Bangladesh demographic and health survey 2014. BMC Public Health. 2021;21: 1–9. pmid:34565370
  84. 84. Islam MA, Mamun ASMA, Hossain MM, Premananda B, Saw A, Lestrel PE, et al. Prevalence and factors associated with early initiation of breastfeeding among Bangladeshi mothers: A nationwide cross-sectional study. PLoS One. 2019;14: e0215733. pmid:31022237
  85. 85. Lucas R, Judge M, Sajdlowska J, Cong X, Mcgrath JM, Brandon D. Effect of Maternal Body Mass Index on Infant Breastfeeding Behaviors and Exclusive Direct Breastfeeding. Journal of Obstetric, Gynecologic, and Neonatal Nursing. 2015;44: 772–783. pmid:26402851
  86. 86. Mantzorou M, Papandreou D, Vasios GK, Pavlidou E, Antasouras G, Psara E, et al. Exclusive breastfeeding for at least four months is associated with a lower prevalence of overweight and obesity in mothers and their children after 2–5 years from delivery. Nutrients. 2022;14: 1–11. pmid:36079855
  87. 87. Khan SH, Talukder SH. Nutrition transition in Bangladesh: Is the country ready for this double burden. Obesity Reviews. 2013;14: 126–133. pmid:24102686
  88. 88. Khan SH, Ahsan KZ, Arifeen S el, Streatfield PK. Overweight surpassing underweight: Would Bangladesh be able to achieve the global voluntary target of halting obesity. Obesity Reviews. 2016;17: 195–196.
  89. 89. Khan S, Haider MM, Jamil K, Ahsan KZ, Angeles G. The double burden of malnutrition among Bangladeshi women: Rethinking the country’s maternal and child health programs and policies. Chapel Hill, NC; 2022.
  90. 90. al Muktadir MH, Islam MA, Amin MN, Ghosh S, Siddiqui SA, Debnath D, et al. Nutrition transition–Pattern IV: Leads Bangladeshi youth to the increasing prevalence of overweight and obesity. Diabetes and Metabolic Syndrome: Clinical Research and Reviews. 2019;13: 1943–1947. pmid:31235119
  91. 91. Rahman MA, Rahman MM, Rahman MM, Jesmin SS. The double burden of under- And overnutrition among Bangladeshi women: Socioeconomic and community-level inequalities. PLoS One. 2019;14: 1–24. pmid:31344076
  92. 92. Khanam M, Osuagwu UL, Sanin KI, Haque MA, Rita RS, Agho KE, et al. Underweight, overweight and obesity among reproductive Bangladeshi women: A nationwide survey. Nutrients. 2021;13: 1–16. pmid:34959960
  93. 93. Hasan E, Khanam M, Shimul SN. Socio-economic inequalities in overweight and obesity among women of reproductive age in Bangladesh: a decomposition approach. BMC Womens Health. 2020;20. pmid:33243211
  94. 94. Popkin BM, Corvalan C, Grummer-Strawn LM. Dynamics of the double burden of malnutrition and the changing nutrition reality. The Lancet. Lancet Publishing Group; 2020. pp. 65–74. pmid:31852602
  95. 95. Ferdausi F, Al-Zubayer MA, Keramat SA, Ahammed B. Prevalence and associated factors of underweight and overweight/obesity among reproductive-aged women: A pooled analysis of data from South Asian countries (Bangladesh, Maldives, Nepal and Pakistan). Diabetes and Metabolic Syndrome: Clinical Research and Reviews. 2022;16. pmid:35219260
  96. 96. Kamruzzaman M, Mamun ASMA, Bakar SMA, Saw A, Kamarul T, Islam MN, et al. Multilevel linear regression analysis of factors influencing body mass index among Bangladeshi married non-pregnant women. J Biosoc Sci. 2017;49: 498–508. pmid:27866490
  97. 97. Islam F, Kathak RR, Sumon AH, Molla NH. Prevalence and associated risk factors of general and abdominal obesity in rural and urban women in Bangladesh. PLoS One. 2020;15. pmid:32470026
  98. 98. Rahman A, Sathi NJ. Sociodemographic Risk Factors of Being Underweight Among Ever-Married Bangladeshi Women of Reproductive Age: A Multilevel Analysis. Asia Pac J Public Health. 2021;33: 220–226. pmid:33291961
  99. 99. Wang H, Li G, Jiang G. Robust regression shrinkage and consistent variable selection through the LAD-lasso. Journal of Business and Economic Statistics. 2007;25: 347–355.
  100. 100. Saeys Y, Abeel T, van de Peer Y. Robust Feature Selection Using Ensemble Feature Selection Techniques. Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Heidelberg: Springer; 2008. pp. 313–325.
  101. 101. Tanwi TS, Chakrabarty S, Hasanuzzaman S. Double burden of malnutrition among ever-married women in Bangladesh: A pooled analysis. BMC Womens Health. 2019;19. pmid:30704454
  102. 102. Shafique S, Akhter N, Stallkamp G, de Pee S, Panagides D, Bloem MW. Trends of under- and overweight among rural and urban poor women indicate the double burden of malnutrition in Bangladesh. Int J Epidemiol. 2007;36: 449–457. pmid:17242024
  103. 103. Hashan MR, Rabbi MF, Haider SS, Gupta R das. Prevalence and associated factors of underweight, overweight and obesity among women of reproductive age group in the Maldives: Evidence from a nationally representative study. PLoS One. 2020;15. pmid:33119696
  104. 104. Menardi G, Torelli N. Training and assessing classification rules with imbalanced data. Data Min Knowl Discov. 2014;28: 92–122.
  105. 105. Poirier MJP, Grépin KA, Grignon M. Approaches and Alternatives to the Wealth Index to Measure Socioeconomic Status Using Survey Data: A Critical Interpretive Synthesis. Soc Indic Res. 2020;148: 1–46.
  106. 106. Howe LD, Hargreaves JR, Huttly SRA. Issues in the construction of wealth indices for the measurement of socio-economic position in low-income countries. Emerg Themes Epidemiol. 2008;5. pmid:18234082
  107. 107. Howe LD, Hargreaves JR, Ploubidis GB, De Stavola BL, Huttly SRA. Subjective measures of socio-economic position and the wealth index: A comparative analysis. Health Policy Plan. 2011;26: 223–232. pmid:20817696
  108. 108. Gwatkin DR, Rutstein S, Johnson K, Suliman E, Wagstaff A, Amouzou A. Socio-economic differences in health, nutrition, and population within developing countries: An overview. Washington, DC; 2007. pmid:18293634
  109. 109. Cook RD. Detection of Influential Observation in Linear Regression. Technometrics. 1977;19: 15–18.
  110. 110. Belsley DA, Kuh E, Welsch RE. Regression diagnostics: Identifying influential data and sources of collinearity. Hoboken, NJ: John Wiley & Sons; 1980.
  111. 111. Finaret AB, Hutchinson M. Missingness of height data from the Demographic and Health Surveys in Africa between 1991 and 2016 was not Random but is unlikely to have major implications for biases in estimating stunting prevalence or the determinants of child height. Journal of Nutrition. 2018;148: 781–789. pmid:30053281
  112. 112. Government of Bangladesh. Bangladesh National Strategy for Maternal Health 2019–2030. Dhaka; 2019.
  113. 113. Government of Bangladesh. Second National Plan of Action for National (2016–2025). Dhaka; 2017.
  114. 114. López-Olmedo N, Hernández-Cordero S, Neufeld LM, García-Guerra A, Mejía-Rodríguez F, Méndez Gómez-Humarán I. The associations of maternal weight change with breastfeeding, diet and physical activity during the postpartum period. Matern Child Health J. 2016;20: 270–280. pmid:26525558
  115. 115. da Silva M da CM, Oliveira Assis AM, Pinheiro SMC, de Oliveira LPM, da Cruz TRP. Breastfeeding and maternal weight changes during 24 months post-partum: A cohort study. Matern Child Nutr. 2015;11: 780–791. pmid:23941254
  116. 116. Uddin MF, Jabeen I, Islam MA, Rahman M, Chisti MJ, Ahmed T, et al. Barriers to breastfeeding are shaped by sociocultural context: an exploratory qualitative study in Bangladesh. J Health Popul Nutr. 2022;41: 1–13. pmid:35964118
  117. 117. Menon P, Nguyen PH, Saha KK, Khaled A, Kennedy A, Tran LM, et al. Impacts on Breastfeeding Practices of At-Scale Strategies That Combine Intensive Interpersonal Counseling, Mass Media, and Community Mobilization: Results of Cluster-Randomized Program Evaluations in Bangladesh and Viet Nam. PLoS Med. 2016;13: e1002159. pmid:27780198
  118. 118. Horne M, Tierney S, Henderson S, Wearden A, Skelton DA. A systematic review of interventions to increase physical activity among South Asian adults. Public Health. Elsevier B.V.; 2018. pp. 71–81. pmid:29990615
  119. 119. Brown T, Smith S, Bhopal R, Kasim A, Summerbell C. Diet and physical activity interventions to prevent or treat obesity in south asian children and adults: A systematic review and meta-analysis. International Journal of Environmental Research and Public Health. MDPI; 2015. pp. 566–594. pmid:25584423
  120. 120. Islam NS, Wyatt LC, Patel SD, Shapiro E, Tandon SD, Mukherji BR, et al. Evaluation of a Community Health Worker Pilot Intervention to Improve Diabetes Management in Bangladeshi Immigrants With Type 2 Diabetes in New York City. Diabetes Educ. 2013;39: 478–493. pmid:23749774
  121. 121. Islam NS, Zanowiak JM, Wyatt LC, Kavathe R, Singh H, Kwon SC, et al. Diabetes prevention in the new york city sikh asian Indian community: A pilot study. Int J Environ Res Public Health. 2014;11: 5462–5486. pmid:24852392
  122. 122. Chan RSM, Woo J. Prevention of overweight and obesity: How effective is the current public health approach. International Journal of Environmental Research and Public Health. MDPI; 2010. pp. 765–783. pmid:20617002
  123. 123. Bhopal RS, Douglas A, Wallia S, Forbes JF, Lean MEJ, Gill JMR, et al. Effect of a lifestyle intervention on weight change in south Asian individuals in the UK at high risk of type 2 diabetes: A family-cluster randomised controlled trial. Lancet Diabetes Endocrinol. 2014;2: 218–227. pmid:24622752
  124. 124. Government of Bangladesh. Bangladesh National Nutrition Policy 2015: Nutrition is the foundation for development. Dhaka; 2015.