Figures
Abstract
Background
Food insecurity and cigarette smoking are significant public health issues that disproportionately affect vulnerable populations. Previous research often overlooks daily smoking patterns, focusing instead on smoking status or cessation. This study seeks to fill that gap, hypothesizing that food insecurity is positively associated with higher daily cigarette consumption.
Methods
This study investigates the non-linear association between food security status and daily cigarette consumption using data from the 2022 National Health Interview Survey (NHIS). It introduces a novel statistical approach—the COM-Poisson Generalized Additive Model (CMP-GAM)—to capture the non-linear dynamics between these variables,
Results
The CMP-GAM model was applied to a sample of 819 adults, controlling for key variables like age, body mass index (BMI), and mental health conditions (anxiety and depression severity). The results indicated that while food security status did not significantly affect daily cigarette consumption, the age of smoking initiation was a strong predictor, with earlier smoking onset leading to higher daily cigarette use. Subgroup analyses among participants with cardiovascular disease and diabetes also did not reveal a significant relationship between food insecurity and smoking frequency.
Significance
The findings suggest that targeted interventions aimed at preventing early smoking initiation may be more effective in reducing smoking prevalence and improving public health outcomes than addressing food security alone. This study highlights the need for further research into the complex interplay between socioeconomic factors and health behaviors like smoking.
Citation: Umam S, Razzak RB, Munni MY, Rahman A (2025) Exploring the non-linear association of daily cigarette consumption behavior and food security- An application of CMP GAM regression. PLoS One 20(7): e0328109. https://doi.org/10.1371/journal.pone.0328109
Editor: Ziqian Zeng, Chengdu Medical College, CHINA
Received: September 24, 2024; Accepted: June 25, 2025; Published: July 17, 2025
Copyright: © 2025 Umam et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The data underlying the results presented in the study are available from figshare: https://figshare.com/s/cdd56229a1c2cba5d50a.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors confirm that no known conflicts of interest are associated with this publication. Specifically, this research utilizes secondary data that has been fully anonymized and poses no direct or indirect benefit to any of the researchers involved. There have been no financial or personal relationships with other people or organizations that could inappropriately influence our work. This declaration is made to ensure full transparency and to uphold the integrity of the scientific process.
1 Introduction
In 2022, around 2.4 billion people (29.6 percent of the global population) were moderately to severely food insecure, among which 900 million (11.3 percent) faced severe food insecurity [1]. Food insecurity, characterized by limited or uncertain access to adequate food, impacts beyond nutrition and physical health, affecting various aspects of mental and social well-being. Individuals experiencing food insecurity are at higher risk of health problems such as diabetes, hypertension, mental illness, and increased mortality [2]. Moreover, Studies suggest that food insecurity affects mental health, with evidence suggesting associations with depression, anxiety, and other psychological distress [3,4]. Smoking, on the other hand, remains one of the leading causes of preventable diseases and deaths globally. Smoking has been associated with a range of health conditions, such as cardiovascular diseases, neoplasms, chronic respiratory diseases, respiratory infections, tuberculosis, musculoskeletal disorders, as well as diabetes and kidney diseases [5]. Over an extended period, smoking has a detrimental impact on an individual’s health, diminishes productive workforce participation, and exerts influence on personal financial stability [6]. Understanding the factors that influence smoking behavior, particularly the number of cigarettes smoked per day, is essential for developing effective public health interventions. Several studies have investigated the connections between food security and smoking in an effort to identify underlying factors and preventive actions.
Previous research has shown that socioeconomic factors, including food security, are closely linked to smoking behavior. An earlier study among young adults observed that smoking prevalence is significantly higher for young adults who reported being food insecure [7]. The effect of food insecurity on smoking behavior is particularly noticeable among low-income and marginalized populations. Low-income adults experiencing food insecurity have been found to have a disproportionately high prevalence of cigarette smoking [8]. The impact of food insecurity on smoking behavior is multifaceted. Studies have shown that food insecurity is a barrier to smoking cessation [9]. Previous studies have discovered that facing food insecurity is associated with a higher likelihood of starting smoking for those who do not smoke [10]. Furthermore, Baek et al. noted that shifts from food security to insecurity were associated with almost a twofold rise in the likelihood of smoking behavior [11]. Individuals experiencing food insecurity may use smoking as a coping mechanism to manage stress and anxiety associated with their precarious food situation [12]. Canales et al. (2015) delineated that worrying about running out of food is an important indicator of food security and has direct causal implications on one’s perceptions of stress or anxiety [13]. Whereas cigarette smoking is a common way to relieve tension and stress [14], suggesting an association between the intensity of smoking and food insecurity. Individuals experiencing food insecurity may turn to smoking as a way to manage stress or suppress appetite, which can further complicate efforts to quit smoking [15]. Additionally, financial constraints can lead to prioritizing immediate relief through smoking over long-term health investments [16]. Psychological distress exacerbated by food insecurity may promote smoking for stress relief, creating a cycle that hinders cessation [7]. Moreover, a dose-response relationship has been observed between tobacco product use and levels of food insecurity, indicating that higher levels of smoking-induced financial deprivation can worsen food insecurity [6]. Nicotine intake reduces appetite and suppresses hunger [17], which increases the likelihood of smokers using smoking as a coping mechanism to reduce hunger related to food scarcity. This dynamic exacerbates health disparities and significantly burdens public health systems. Taking these into account, there is a growing recognition of the need for more comprehensive assessments to understand the dynamics between tobacco use and food insecurity [14].
Numerous statistical models have been utilized in research studies to explore the association between food insecurity and smoking behaviors. In most of the research studies logistic regression analysis was used to examine how food insecurity is associated with smoking status [7,12,18]. Longitudinal logistic and Tobit regressions analyzed the link between food security and smoking behaviors in low-income women, focusing on the longitudinal impact of food insecurity on smoking habits [19]. A prior study used multinomial logistic regression to assess and characterize the relationship between food security and tobacco product use [20]. Another study utilized multinomial logistic regression to examine the association between transitions of food insecurity with smoking cessation and shifts in the consumption pattern over two years among adults [21]. However, most studies only considered a linear association between food security and cigarette smoking behavior. Nevertheless, the non-linear nature of the association requires further investigation. To our knowledge, there is no study that has been conducted to capture the non-linear association between those variables using an appropriate count data model.
Considering a non-linear association between food security and cigarette smoking behavior is important for several reasons. The relationship between food security and smoking behavior may not be straightforward. Non-linear models can capture more complex interactions and thresholds that linear models might miss. For example, the impact of food insecurity on smoking behavior might be more pronounced at certain levels of food insecurity [7]. Food insecurity and smoking behavior can influence each other in multiple ways. For instance, food insecurity might lead to increased stress and smoking as a coping mechanism, while smoking can exacerbate financial strain, worsening food insecurity [14]. Understanding non-linear relationships can help in designing more effective policies and interventions. For example, interventions might need to be targeted differently at various levels of food insecurity to be effective. Incorporating non-linear associations enriches the literature by providing a more nuanced understanding of the dynamics between food security and smoking. It challenges the oversimplified view of linear relationships and encourages further research into the underlying mechanisms. Overall, by considering non-linear associations, researchers can uncover more detailed insights and develop more targeted strategies to address the intertwined issues of food insecurity and smoking behavior.
Poisson and Negative-Binomial regression are the most popular models for cross-sectional count data [22]. Besides these, the Conway-Maxwell-Poisson (CMP) or COM-Poison regression model introduced by Conway and Maxwell [23] is increasingly gaining popularity for complex association in count data as it can account for both under-dispersion and over-dispersion [22]. The CMP regression is an extension of the traditional Poisson and logistic regression models [24]. While modeling count data, the Poisson regression is limited with its variance assumption of equal mean and variance, and Negative-binomial regression can only model over-dispersed count data. While the CMP regression model is not limited to these assumptions and provide a flexible approach for modeling count data with varying dispersion levels [24], which can be helpful in allowing for a more accurate representation of the relationship between food security and the quantity of daily smoking. However, CMP regression is limited when dealing with complex nonlinear relationships [22]. Very recently, Chatla and Shmueli [22] introduced a model which overcome the limitation of traditional CMP-regression model and they referred this model as COM-Poisson Generalized additive model (CMP-GAM). In that study, they considered an iterative reweighted least square (IRLS) estimation framework and extended the CMP model with additive components using a penalized spline approach makes this model better at capturing complex non-linear relationships [22]. Considering the advance features of capturing non-linear association by this model, we consider the CMP-GAM to understand the association between food security and the number of cigarettes a person smokes in a day.
The association between food insecurity and smoking extends beyond individual behaviors and has broader implications for public health interventions [25]. Moreover, the impact of cigarette smoking behavior on food security needs to be investigated further due to its potential to inform targeted interventions. However, to date, the majority of research studies focus on analyzing smoking behaviors and cessation in relation to food security, but they often overlook the daily consumption patterns as a factor in this association. Therefore, this study aims to understand the non-linear association between food security and daily smoking behavior using CMP-GAM. Our primary hypothesis is that food security status is associated with the number of cigarettes smoked per day. Specifically, we hypothesize that individuals experiencing food insecurity will smoke more cigarettes daily compared to those with secure food access.
2 Methods
Data and study design
This study used the cross-sectional study design using data from the 2022 National Health Interview Survey (NHIS) conducted by the CDC’s National Center for Health Statistics (NCHS). Information from “Sample adult Interviews” was used for the study, where the interview participants were US adults aged 18 years and above. The NHIS is an annual household survey representing the civilian non-institutionalized US population across the 50 states and the District of Columbia. It collects data from individuals residing in households and noninstitutional group quarters, including homeless shelters, rooming houses, and group homes. Individuals temporarily living in student dormitories or temporary housing are sampled within their permanent households [26]. The survey excludes those without a fixed household address, active-duty military personnel and civilians on military bases, residents of long-term care institutions (such as nursing homes and chronic care hospitals), individuals in correctional facilities, and U.S. nationals living abroad [26]. However, civilians residing with military personnel in non-military housing remain eligible for inclusion. The NHIS employs a geographically clustered sampling design to select dwelling units, ensuring national representativeness [26]. Based on the 2010 decennial census, the sampling process partitions the United States into 1,689 geographic areas, comprising counties, county equivalents, or contiguous county groups, ensuring they remain within state boundaries [26]. The sample is structured in a way that monthly data is nationally representative [26]. Usually, interviews for the NHIS are conducted in-person modules and followed up via telephone. For over 50 years, the U.S. Census Bureau has sent interviewers to American homes to inquire about various health topics. Data collection is continuous, spanning from January to December each year, allowing for timely and comprehensive health surveillance [26]. The NHIS collects and analyzes data on various health topics to monitor the health of the US population. Survey results are instrumental for providing information and tracking health status, healthcare access, and advancement toward achieving national health objectives.
Participants
For the NHIS 2022, more than 30,000 confidential interviews were conducted during the year with the help of field staff trained by the U.S. Census Bureau. Around 27,651 adult participants participated in the “Sample Adult Interviews” in NHIS 2022. The response rate for the “Sample adults interviews” was 47.7% [27].
The inclusion and exclusion criteria for this study were determined based on the variables of interest and their alignment with the research objectives. From the 27,651 responses, (n = 1,267) data were excluded as missing food security information. Additionally, non-smokers were excluded from the study (n = 25,565). Out of 27,651 respondents, only 819 met the inclusion criteria based on the specified variables. The final selection was guided by the study’s research objectives to ensure relevance and accuracy in the analysis.
Variables of interest
2.1.1 Dependent variable.
The daily cigarette smoking rate is the primary outcome of this analysis. A self-reported survey item was used to identify the number of cigarettes consumed in a day. Participants were asked, ‘On average, about how many cigarettes do you NOW smoke a day?’ Following the responses to this question, the number of cigarettes consumed in a day is determined.
2.1.2 Independent variable.
Household food security was measured using the U.S. Department of Agriculture’s (USDA) 10-item-based food security module with a recall period of 30 days [28]. Based on the responses for the 10 items, household food security status was classified into four categories: High food security (scores 0), Marginal food security (scores 1–2), Low food security (scores 3–5), and very low food security (scores 6–10) [29].
2.1.3 Covariates.
The study utilized various sociodemographic factors and disease status as covariates to account for their potential to confound the relationship between food security and the daily consumption of cigarettes. Demographic covariates included are sex (male/female), age, marital status (married but spouse is present, married but spouse is not present, married but spouse presence unknown, widowed, divorced, separated, never married, and living with partner). Health and disease-related factors included as covariates are the severity of depression symptoms, the severity of anxiety symptoms, experiencing high cholesterol (yes/no), the presence of hypertension (yes/no), diabetics (yes/no), cardiovascular disease (yes/no), and Body Mass Index (BMI) (Underweight, healthy weight, overweight, obese). The Generalized Anxiety Disorder (GAD-7) [30] questionnaire was used to assess the severity of anxiety symptoms, and the Public Health Questionnaire (PHQ-8) [31] was used to assess the severity of depression symptoms within the last two weeks. Behavioral factors included in this analysis were age at starting smoking (continuous variable) and average sleep hours (continuous variable).
Statistical analysis
The CMP regression is particularly suited for count data, effectively handling overdispersion and underdispersion. Considering a count variable as the response variable, namely, the number of cigarettes smoked in a day, this study used a Conway-Maxwell-Poisson (CMP) regression extended to incorporate additive components using penalized splines. The model considers applicable parametric and non-parametric components for estimation, which are given below.
Parametric components –– Respondents’ food security status, sex, the severity of anxiety, the severity of depression, experiencing any cardiovascular disease, high cholesterol, and hypertension.
Non-parametric components—The non-parametric components of the model are the respondents’ age, age at starting smoking, average sleep hours, and Body Mass Index (BMI).
When dealing with cross-sectional count data, the most commonly used regression models are Poisson and negative-binomial regression [22]. Taking this into account, results from this model were compared with outcomes from the Poisson and Negative binomial generalized additive models to investigate the nature of the association further.
2.1.4 Sub-group analysis.
To investigate the association further, this study conducted two separate models to examine how various health conditions and disease status affect the relationship between daily cigarette consumption and household food security. The categorizations for the sub-groups were based on whether the respondents had diabetes or cardiovascular diseases. The model for cardiovascular disease was based on 114 observations, while the model for diabetes was based on 172 observations.
All statistical analyses were conducted using R (4.4.0).
Model specification and estimation
2.1.5 Generalized additive model.
As, additive models have the advantage to capture complicated relationship than mixed methods or normal models [32,33]. Generalized Additive Model (GAM) are used to incorporate nonlinear terms which is an extension of Generalized Linear Models (GLM). Linear form of the covariates are being replaced by the smooth functions in the GAM model. A generalized additive model is a generalized linear model with a linear predictor involving a sum of smooth functions of covariates which has the formulation [34],
, Xi* is a row of the model matrix of any strictly parametric model components, θ is the parametric component, fi is the smooth functions of the covariates and g(.) is a smooth monotonic and twice differentiable link function. To estimate smooth functions there exist multiple methods [34–36]. θi is the smoothing parameter that control the trade-off between fit and smoothness. Cubic spline smoother, backfitting algorithm, penalized spline approaches are widely used. Backfitting algorithm uses a regression type fitting algorithm, which serves as baseline method for estimating fi. It has the flexibility to incorporate wide variety of smoothing methods for components. Convergence can be found in the paper to follow [35,36].
GAMs employ smoothing techniques to model non-linear relationships between variables. Commonly used smoothing methods include regression splines (like B-splines and P-splines), local regression (loess), and smoothing splines. Regression splines are particularly favored in practice due to their computational efficiency and representation as linear combinations of basis functions, making them well-suited for estimation and prediction. [37,38] A smoothing spline with two predictors can be used for fitting a wiggly surface. Splines can take more than two arguments, in which case wiggly hypersurfaces are modeled. Given a linear predictor with appropriate smooths, this linear predictor can be used to model Gaussian response variables, or Poisson or binomial responses. GAMs can also accommodate ordinal responses as well as multinomial responses.
The penalized splines approach is a popular method for fitting additive models. It represents each function in the model using spline-type basis expansions. To prevent overfitting, a penalty term is added to the likelihood function, which is then maximized. This maximization process is carried out using penalized iteratively re-weighted least squares (P-IRLS) [35]. In essence, the GAM is fitted by iteratively minimizing a penalized least squares problem,
denotes the adjusted response variable and
denotes the weights at the kth iteration od the P-IRLS algorithm.
measures the roughness of
, which are matrices of known coefficients
. The
are smoothing parameters that control the trade-off between fit and smoothness and their selection can be achieved by minimizing the Generalized Cross Validation (GCV) score, AIC, or another criterion [35,39–42].
The Generalized Additive model for the CMP regression, Poisson and Negative Binomial is used to compare the results.
2.1.6 Poisson generalized additive model.
The Poisson GLM is described as,
The log functions for Poisson link is,
The conditional expectations can be formulated as,
is the smoothing parameter. The GAM consist of a random component, an additive component, and a link function relating the two components. The response variable assuming to follow exponential family,
Where is the natural parameter and
is the scale parameter. The quantity is defined as,
Where the function are the smooth functions defines the additive component and the relationship between
and
. A smoother is a tool for summarizing the trend of a response measurement Y as a function of one or more predictor measurements
. It produces an estimate of the trend that is less variable than Z itself. An important property of a smoother is its nonparametric nature. It doesn't assume a rigid form for the dependence of Z on
. In this study, we focus on only a cubic smoothing spline that can be used with GAM. A cubic smoothing spline is the solution to the following optimization problem: among all functions
with two continuous derivatives, find one that minimizes the penalized least square:
Where is a fixed constant and
. The first term measures closeness to the data while the second term penalizes curvature in the function. It can be shown that there exists an explicit, unique minimizer and that minimizer is a natural cubic spline with knots at the unique values of xi. The parameter λ is the smoothing parameter. Further estimation process with AIC description illustrated in this paper [43–45].
2.1.7 Negative binomial generalized additive model.
Negative binomial (NB) regression is the most common full-likelihood method for analyzing count data exhibiting overdispersion with respect to the Poisson distribution [46]. The log-likelihood for NB model for known K is,
To fit the NB model for Z given covariates X, various link functions are possible. Methodology describe in [34] requires the known value of k. And for a known , estimation of k reduces to an ordinary maximum likelihood problem which maximizes the log-likelihood. The structure of the algorithm is elaborately described in [47], for which further estimations aren’t presented here.
2.1.8 The CMP regression.
To model count response variable, Poisson regression is widely used [48]. Though its flexibility always in question for the usage of a single parameter in the model. Overdispersion and underdispersion is one of the common issues that can not be addressed through the Poisson model [23]. The Conway-Maxwell Poisson (COM-Poisson) have proven powerful in capturing wide range of dispersion [49]. Originated by [50] is a two-parameter generalization of the Poisson distribution that allows for different levels of dispersion. Though originated by Conway, the statistical properties are revived by [23]. As, the COM Poisson have the flexibility to capture overdispersion and underdispersion, the COM Poisson distribution is a two-parameter model, generalized by Poisson, Bernouli and Geometric distributions. [23,50] The probability mass function (p.m.f) has the form,
Where Z is a random variable that follows CMP distribution with the parameters, is the generalization of the Poisson rate parameter and
is the dispersion parameter. The normalizing constant is denoted by
.
Three common distributions e.g., Poisson , Geometric
, Bernoulli (
with probability
is included as special cases in the CMP distribution. The CMP distribution addresses both over dispersion
and under dispersion
. The Mean and variance of the distribution is approximated as,
The detailed approximation and other distributional properties of the distribution can be found in the papers. [51,52]
The COM-Poisson regression was developed by the model formulation of these papers [53,54]. The regression form to describe the relationship between the explanatory and response variable can be stated with log link function
Which models the indirect association between E(Z) and X. The loglinear link,
The loglink function is being used for reducing the varying dispersion in Poisson and logistic regression. □ and □ are the regression coefficients for the centering link function and the shape link function.
2.1.9 CMP generalized additive model.
For the restrictive nature of the CMP distribution in modeling nonlinear relationships or time series data additive model for CMP regression is being developed in the paper [22]. CMP generalized model has the formulation,
Where i = 1,…,n where and
are the parameter vectors the parametric part of ln(λi) and ln(νi)
are the smooth functions for the covariates xj and zj and are subject to identifiability constraints. For further model estimation readers are encouraged to go through the paper [22].
2.1.10 Model Estimation.
The revived properties of CMP distribution by [23] estimate the log-likelihood by
Summing over n observations, the log-likelihood can be written as,
These equations uses the maximizing equation under constraint . In [23] Generalized linear model is used to obtain the maximum likelihood estimates. As ν models the dispersion, to control it covariates need to be included which is demonstrated by two level of group data [55]. The log-likelihood can be described as,
Resulting in the score equations:
The standardized generalization linear models can not be used to solve the equations. So, mostly gradient based methods are being used in CMP regression than IRLS. The detail formulation of gradient based mixed methods using GLM can be found in the paper, readers are encouraged to read the paper for the demonstration [24].
2.1.10.1 Iterative reweighted least squares (IRLS) framework.
Because of the advantage of not affecting outliers and not being bound to use the information matrix, the blank in the field of CMP regression of not using the IRLS method has been filled by [22] to capture the complexity of nonlinear relationships. The estimation of CMP regression in this paper follows this method to generate the result. The framework of estimation is developed by five steps. First step includes the calculations the cumulants. Several authors have used different approaches to calculate cumulants [23,56,57]. Asymptotic approximation improved by including two lower terms [58]. To make this approximation close to the true value, ,
the cumulants are being calculated from the asymptotic terms and for other values p.m.f is used with bounding error [22]. In the second step, IRLS equations are being updated for parameters □ and □ which can be easily estimated with WLS method. In the following three steps proof of convergence of the two step method, practical issues and Inferences been described [22].
Ethics statement
The Research Ethics Review Board (ERB) of the National Center for Health Statistics (NCHS) approved the content and methods of the National Health Interview Survey (NHIS) to protect the study participants [59]. Prior to taking part in the NHIS study, all participants provided their verbal consent. Potential NHIS respondents were informed about their rights regarding survey participation and were assured of the confidentiality of their responses, as well as the protection of their data [60]. All data in the publicly available dataset are fully anonymized prior to release. All authors declare they have no competing interests.
3 Results
The study used data from the National Health Interview Survey (NHIS) of 2022, a cross-sectional household survey of the US noninstitutionalized civilian population. Among which the adult module has been used.
Among 27,561 surveyed adults, 819 adults have been selected for our study, where the number of daily cigarette consumption has been used as the count variable for the application of CMP regression. As we have filtered the dataset with the variables of our interest, there was no missing data in our final dataset. The variables of our interest were Age, Age when started smoking, severity of depression, severity of anxiety, presence of cardiovascular disease, presence of cholesterol, diabetics, hypertension, BMI, and marital status. For our two distinct models, we have selected two diseases, cardiovascular and diabetics. With the numerical functions of age, BMI, smoking age, and weight with the categorical co-variates of food security, gender, hypertension, diabetics, anxiety, and depression severity for the cardiovascular model/lambda model. For the nu model, the numerical functions are the same with the additional categorical variable of the presence of cholesterol. In both models, the count variable was the number of cigarettes consumed in a day.
Demographic characteristics of the selected variables been represented in Table 1.
The median age of the sample population was 61 years, with smoking initiation occurring at a median age of 17 years (p < 0.001 and p = 0.003, respectively). Regarding cigarette consumption, individuals with high food security reported a median daily intake of 11 cigarettes, while those with marginal, low, and very low food security consumed a median of 10, 20, and 15 cigarettes per day, respectively. Daily cigarette consumption varied by food security status, with higher consumption among those with lower food security with a level of significance 0.040.
Among the 819 participants, males outnumbered females (432 vs. 387). The majority of both males (55%) and females (45%) had high food security; however, this difference was not statistically significant (p = 0.13). Severe depression was more prevalent among individuals with very low food security (28%), compared to those with marginal (4.5%) and low food security (15%), demonstrating a significant association (p < 0.001). Similarly, severe anxiety was significantly higher among participants with very low food security (27%) compared to those with high food security (7%) (p < 0.001).
In terms of physical health conditions, 8.9% of participants with very low food security had cardiovascular disease (p = 0.2). The prevalence of hypertension was similar among those with high (87%) and very low food security (84%) (p = 0.8). Among participants with diabetes, the prevalence was equal (22%) for those with high and very low food security (p = 0.5). Additionally, 42% of participants with low food security were obese, though this association was not statistically significant (p = 0.8). The marital status of participants showed mixed results with p-value of <0.001.
The plot (Fig 1) of No. of daily cigarette consumption indicates dispersion of data. By comparing the value of mean and variance it’s verified, the count variable has overdispersion.
We have presented our model for analyzing the presence of two types of diseases, such as Cardiovascular and Diabetes, within the overall dataset. After filtering the data for the presence of cardiovascular disease, we found 114 observations, and 172 for diabetes. We have focused on reporting the most significant and common findings in both models since we are considering two diseases. Additionally, to compare how well CMP-GAM captures the over and under dispersion of data, we compared the model with Poisson-GAM and Negative Binomial-GAM.
For Cardiovascular disease
We’ve considered the following to model the CMP with the presence of cardiovascular disease.
The coefficient significance results are described in Tables 2 and 3. The non-parametric components are Age, Smoking age, Sleep hours, BMI and the parametric terms are Anxiety and Depression scale, Hypertension, Food Security.
Looking at the results of the parametric components presented in Table 2, it is evident that food security, gender of participants, Hypertension, and the severity of Anxiety and depression have no significant association with daily cigarette consumption. The parametric components across all the subgroup analysis provides similar results, for this reason we’ve provided only the parametric values for cardiovascular disease.
From the non-parametric components of Table 3, smoking age shows significance in CMP-GAM. In Poisson-GAM, Age, smoking age, and sleep hours tend to have a significant effect on daily cigarette consumption. However, in Negative-GAM, no variable shows significance.
For diabetics data
To assess diabetics data we’ve considered the following model with parameter,
The coefficient significance results are described in Table 4. The non-parametric components include Age, Smoking age, Sleep hours, and BMI, while the parametric terms consist of Food security, sex, Hypertension, Anxiety and Depression scale, and Cholesterol.
Like the cardiovascular dataset, no parametric variable was found to have significance in the diabetics model.
For overall data
We’ve considered the following model for overall dataset:
The coefficient significance results are described in Table 5. The non-parametric components are Age, Smoking age, Sleep hours, and BMI, and the parametric terms are Food security, sex, Hypertension, Anxiety and Depression scale, Cardiovascular disease, Diabetic disease, and Cholesterol.
None of the parametric components yield significant results in any of the three models. Conversely, in the non-parametric components, smoking age is significant in CMP-GAM. For both the Poisson GAM and Negative Binomial GAM models, we obtain results similar to those of the diabetic model, only differing in the numeric value of the non-parametric coefficients.
Out of the three models, the Poisson GAM yields significant results for almost all smooth variables. On the other hand, only the age of smoking is a significant variable in CMP-GAM and NBGAM, supporting the reference by Chatla and Shmueli [22] that the Poisson GAM has more inference error than the other two models due to excessive dispersion.
Considering the overall dataset with an almost equal distribution of parametric and non-parametric components, CMP GAM and NB GAM are better with less inference error, despite our data only exhibiting overdispersion. Although we were unable to check for overdispersion, this could be useful for further research in the field of health science.
4 Discussion
This study used National Health Interview Survey 2022 data to understand the association between the daily consumption of cigarettes and household food security status among adults in the US. Our study revealed that the median daily consumption of cigarettes was higher for households with low to very low food security status. However, this study explored that for the overall population, households’ food security status was not statistically significantly associated with daily cigarette consumption. Moreover, this association was unchanged for the two sub-groups such as cardiovascular and diabetic patients. Yet, research in the past has discovered that this connection can go both ways. For example, smoking could worsen food insecurity by using up financial resources, while food insecurity could cause psychological stress that encourages smoking and makes it harder to quit [61]. Our study was the first to use the CMP-GAM model to disentangle the association of daily cigarettes consumptions with household food security status.
This study observed a significant association between the age at which people started smoking and the number of cigarettes smoked per day, which is also prevalent in findings from Poisson-GAM and negative binomial-GAM that support some of the prior research findings. Individuals who initiated smoking at an early age tend to smoke more cigarettes daily [62]. Furthermore, beginning to smoke at a young age has been recognized as a strong indicator of nicotine dependence, suggesting that starting smoking early is linked to an increased probability of developing an addiction to nicotine and smoking a greater number of cigarettes each day [63]. At the same time, the prevalence of smoking has been found to be higher among food-insecure households compared to food-secure households.5 This finding sheds additional light on the barely uncovered association between household food security status and smoking frequency among young adults.
Following the findings from sub-group analysis with diabetic’s patients, results of Poisson-GAM and Negative-binomial-GAM observed an association between the age of the respondents and household food security status among patients with both cardiovascular and diabetics disease. Based on an earlier study it was found that the age of household heads was inversely related to household food security status [64]. Another study conducted in Eastern Cape Province, South Africa, found that age, household income, access to credit, and gender have a statistically significant impact on the food security status of households [65]. Additional research may be necessary to determine whether the age of the household head could have a direct impact on the household’s food security status, especially for the households with diabetics patients.
Food security may also have an impact on the increased risk of depression and anxiety. Almost half of the participants in the study experiencing moderate to severe anxiety (50.8%) and depression (48.6%) also report low to very low household food security. Earlier evidence consistently indicates that food insecurity is significantly related to increased risks of depression, anxiety, and stress. The mental health impacts of food insecurity vary by region, with North American households experiencing increased stress and stress [66]. Based on earlier research on Canadian adults, it was observed that food insecurity had been linked to adverse mental health outcomes for adults [67].
Higher cigarette consumption can be one of the crucial risk factors in the development and management of diabetes. One study showed that men who smoked 25 or more cigarettes per day had nearly double the risk of developing type 2 diabetes compared to non-smokers [68]. However, with the sub-group analysis of diabetic patients, this study did not find any significant relationship between the number of cigarettes smoked by respondents in a day and their household food security status. Likewise, this study found that the number of cigarettes smoked is not significantly associated with demographic variables like gender, BMI, sleep hours, presence of hypertension, or socioeconomic status. Aligned with this, earlier research with university students in Montenegro did not find a significant impact on cigarette smoking [69]. However, studies have found that BMI can be a factor that may influence smoking behavior. [70,71] Exploring this linkage further can be beneficial in planning and designing public health intervention aimed at reducing cigarette consumption.
Limitations
This study has several limitations. The information is self-reported, which means there could be some loss of information due to potential biases related to recall and social desirability. The NHIS is a cross-sectional study and cannot track changes in health status or behaviors over time. Thus, it is essential to conduct repeat cross-sectional data or longitudinal research to determine the prevalence of cigarette smoking in this population. Besides this, our study was limited to assessing the association in the presence of only two types of diseases; exploring further with other diseases and considering the effects of comorbidities could have been guided into better construct in understanding the association. Additionally, this study did not conduct confounding or mediation analyses, which could have provided a deeper understanding of the pathways through which the exposure influences the outcome. Future research should consider incorporating these analyses to assess these relationships more comprehensively.
5 Conclusion
The study did not find any significant relationship between food security status and daily cigarette consumption both for overall and sub-categories of populations. However, it identified a significant association between daily cigarette consumption and the age of smoking initiation. This underscores the need for strengthening targeted intervention aims at preventing early smoking while promoting programs that educate young people about the risks of smoking and raising awareness of healthier lifestyle choices and risks of early smoking.
References
- 1.
Fao I. The State of Food Security and Nutrition in the World 2023 [Internet]. FAO; IFAD; UNICEF; WFP; WHO; 2023 [cited 2024 Sep 8. ]. https://openknowledge.fao.org/handle/20.500.14283/cc3017en
- 2. Camacho-Rivera M, Islam JY, Rodriguez DR, Vidot DC, Bailey Z. Food insecurity disparities and mental health impacts among cancer survivors during the COVID-19 pandemic. Health Equity. 2022;6(1):729–37.
- 3. Reeder N, Tolar-Peterson T, Bailey RH, Cheng W-H, Evans MW Jr. Food Insecurity and Depression among US Adults: NHANES 2005-2016. Nutrients. 2022;14(15):3081. pmid:35956257
- 4.
Umam S, Razzak RB. Linguistic disparities in mental health services: Analyzing the impact of spanish language support availability in saint louis region, Missouri. In: APHA 2024 Annual Meeting and Expo; 2024.
- 5. Sultan Y, Salman Z, Alzaatreh M, Edilbi A, Alani R, Sultan I. Smoking-related disease impact in the eastern Mediterranean region: A comprehensive assessment using global burden of disease data. Asian Pacific Journal of Cancer Prevention. 2024;25(2):495–505.
- 6. Kang SY, Cho HJ. Association between the use of tobacco products and food insecurity among South Korean adults. Int J Public Health. 2022;67:1604866.
- 7. Kim JE, Tsoh JY. Cigarette Smoking Among Socioeconomically Disadvantaged Young Adults in Association With Food Insecurity and Other Factors. Prev Chronic Dis. 2016;13:E08.
- 8. Kim-Mozeleski JE, Castele MC, Nambiar P, Chagin KM, Pike Moore S, Hardy P. A Randomized Trial to Address Food Insecurity and Promote Smoking Cessation Among Low-Income Adults. J Prim Care Community Health. 2024;15:21501319241245275.
- 9. Kim-Mozeleski JE, Seligman HK, Yen IH, Shaw SJ, Buchanan DR, Tsoh JY. Changes in food insecurity and smoking status over time: Analysis of the 2003 and 2015 Panel Study of Income Dynamics. Am J Health Promot. 2019;33(5):698–707.
- 10. Balinda IG, Sugrue DD, Ivers LC. More than malnutrition: a review of the relationship between food insecurity and tuberculosis. Open Forum Infectious Diseases. 2019;6(4):ofz102.
- 11. Baek SU, Lee YM, Won JU, Yoon JH. Association of food insecurity with the use of tobacco products and urine cotinine-measured smoking intensity: evidence from a population-based study in South Korea, 2019–2021. Tobacco Control. 2024.
- 12. Robson SM, Lozano AJ, Papas M, Patterson F. Food Insecurity and Cardiometabolic Risk Factors in Adolescents. Prev Chronic Dis. 2017;14:E110.
- 13. Canales MK, Coffey N, Moore E. Exploring Health Implications of Disparities Associated with Food Insecurity Among Low-Income Populations. Nurs Clin North Am. 2015;50(3):465–81. pmid:26333604
- 14. Kim-Mozeleski JE, Pandey R. The Intersection of Food Insecurity and Tobacco Use: A Scoping Review. Health Promot Pract. 2020;21(1_suppl):124S-138S. pmid:31908208
- 15. van der Velde LA, Nyns CJ, Engel MD, Neter JE, van der Meer IM, Numans ME, et al. Exploring food insecurity and obesity in Dutch disadvantaged neighborhoods: a cross-sectional mediation analysis. BMC Public Health. 2020;20(1):569. pmid:32345298
- 16. van den Brand FA, Candel MJJM, Nagelhout GE, Winkens B, van Schayck CP. How financial incentives increase smoking cessation: a two-level path analysis. Nicotine Tob Res. 2020;23(1):99–106.
- 17. Jo Y-H, Talmage DA, Role LW. Nicotinic receptor-mediated effects on appetite and food intake. J Neurobiol. 2002;53(4):618–32. pmid:12436425
- 18. Kim-Mozeleski JE, Chagin KM, Sehgal AR, Misak JE, Fuehrer SM. Food insecurity, social needs, and smoking status among patients in a county hospital system. Preventive Medicine Reports. 2022;29:101963.
- 19. Sheira LA, Frongillo EA, Hahn J, Palar K, Riley ED, Wilson TE, et al. Relationship between food insecurity and smoking status among women living with and at risk for HIV in the USA: a cohort study. BMJ Open. 2021;11(9):e054903. pmid:34489299
- 20. Mayer M, Gueorguieva R, Ma X, White MA. Tobacco use increases risk of food insecurity: An analysis of continuous NHANES data from 1999 to 2014. Preventive Medicine. 2019;126:105765.
- 21. Bergmans RS. Food insecurity transitions and smoking behavior among older adults who smoke. Preventive Medicine. 2019;126:105784.
- 22. Chatla SB, Shmueli G. Efficient estimation of COM–Poisson regression and a generalized additive model. Computational Statistics & Data Analysis. 2018;121:71–88.
- 23. Shmueli G, Minka TP, Kadane JB, Borle S, Boatwright P. A useful distribution for fitting discrete data: Revival of the Conway–Maxwell–Poisson distribution. Journal of the Royal Statistical Society Series C: Applied Statistics. 2005;54(1):127–42.
- 24. Sellers KF, Shmueli G. A Flexible Regression Model for Count Data. The Annals of Applied Statistics. 2010;4(2):943–61.
- 25. Berkowitz SA, Seligman HK, Rigdon J, Meigs JB, Basu S. Supplemental Nutrition Assistance Program (SNAP) Participation and Health Care Expenditures Among Low-Income Adults. JAMA Intern Med. 2017;177(11):1642–9. pmid:28973507
- 26.
National Center for Health Statistics. National Health Interview Survey, 2022 survey description. [Internet]. National Center for Health Statistics. 2023. https://ftp.cdc.gov/pub/Health_Statistics/NCHS/Dataset_Documentation/NHIS/2021/srvydesc-508.pdf
- 27. Seidenberg AB, Moser RP, West BT. Preferred Reporting Items for Complex Sample Survey Analysis (PRICSSA). Journal of Survey Statistics and Methodology. 2023;11(4):743–57.
- 28.
Children Living in Households That Experienced Food Insecurity: United States, 2019-2020 [Internet]. National Center for Health Statistics (U.S.); 2022 Feb [cited 2024 Sep 4. ]. https://stacks.cdc.gov/view/cdc/113966
- 29. Coleman-Jensen A, Nord MU.S. Adult Food Security Survey Module.
- 30. Sapra A, Bhandari P, Sharma S, Chanpura T, Lopp L. Using Generalized Anxiety Disorder-2 (GAD-2) and GAD-7 in a Primary Care Setting. Cureus. 2020;12(5):e8224. pmid:32582485
- 31. Kroenke K, Strine TW, Spitzer RL, Williams JBW, Berry JT, Mokdad AH. The PHQ-8 as a measure of current depression in the general population. J Affect Disord. 2009;114(1–3):163–73. pmid:18752852
- 32. Stieb DM, Judek S, Burnett RT. Meta-analysis of time-series studies of air pollution and mortality: update in relation to the use of generalized additive models. J Air Waste Manag Assoc. 2003;53(3):258–61. pmid:12661685
- 33. Dominici F, McDermott A, Zeger SL, Samet JM. On the use of generalized additive models in time-series studies of air pollution and health. Am J Epidemiol. 2002;156(3):193–203. pmid:12142253
- 34. Hastie T, Tibshirani R. Generalized Additive Models. Statist Sci. 1986;1(3).
- 35.
Wood SN. Generalized Additive Models. Chapman and Hall/CRC. 2017. https://doi.org/10.1201/9781315370279
- 36. Wood SN. Stable and Efficient Multiple Smoothing Parameter Estimation for Generalized Additive Models. Journal of the American Statistical Association. 2004;99(467):673–86.
- 37.
de Boor C. A Practical Guide to Splines. Springer New York. 1978. https://doi.org/10.1007/978-1-4612-6333-3
- 38. Wood SN. Modelling and smoothing parameter estimation with multiple quadratic penalties. Journal of the Royal Statistical Society Series B: Statistical Methodology. 2000;62(2):413–28.
- 39. Wood SN. On p-values for smooth components of an extended generalized additive model. Biometrika. 2012;100(1).
- 40. Wood SN. Fast Stable Restricted Maximum Likelihood and Marginal Likelihood Estimation of Semiparametric Generalized Linear Models. Journal of the Royal Statistical Society Series B: Statistical Methodology. 2010;73(1):3–36.
- 41. Wood SN. Fast stable direct fitting and smoothness selection for generalized additive models. Journal of the Royal Statistical Society Series B: Statistical Methodology. 2008;70(3):495–518.
- 42. Wood SN. On confidence intervals for generalized additive models based on penalized regression splines. Aus NZ J of Statistics. 2006;48(4):445–64.
- 43.
Zuur AF, Ieno EN, Walker NJ, Saveliev AA, Smith GM. GLM and GAM for Count Data. Statistics for Biology and Health. Springer New York. 2009. 209–43. https://doi.org/10.1007/978-0-387-87458-6_9
- 44. Barry SC, Welsh AH. Generalized additive modelling and zero inflated count data. Ecological Modelling. 2002;157(2–3):179–88.
- 45. Rothery P, Roy DB. Application of generalized additive models to butterfly transect count data. Journal of Applied Statistics. 2001;28(7):897–909.
- 46. Yee TW. The VGAM package for negative binomial regression. Aus NZ J of Statistics. 2020;62(1):116–31.
- 47. Thurston SW, Wand MP, Wiencke JK. Negative binomial additive models. Biometrics. 2000;56(1):139–44.
- 48. Coxe S, West SG, Aiken LS. The analysis of count data: a gentle introduction to poisson regression and its alternatives. J Pers Assess. 2009;91(2):121–36. pmid:19205933
- 49. Abdella GM, Kim J, Al-Khalifa KN, Hamouda AM. Penalized Conway-Maxwell-Poisson regression for modelling dispersed discrete data: The case study of motor vehicle crash frequency. Safety Science. 2019;120:157–63.
- 50. R C. A queueing model with state dependent service rate. Journal of Industrial Engineering. 1961;12:132. https://cir.nii.ac.jp/crid/1371132819349358976
- 51. Daly F, Gaunt RE. The Conway-Maxwell-Poisson distribution: Distributional theory and approximation. Latin American Journal of Probability and Mathematical Statistics. 2016;13(1):635.
- 52. Sellers KF, Premeaux B. Conway–Maxwell–Poissonregression models for dispersed count data. WIREs Computational Stats. 2020;13(6).
- 53. Sellers KF, Borle S, Shmueli G. The COM‐Poisson model for count data: a survey of methods and applications. Appl Stoch Models Bus & Ind. 2011;28(2):104–16.
- 54.
Morris D, Sellers K, Menger A. Fitting a flexible model for longitudinal count data using the NLMIXED procedure. In: SAS Global Forum Paper; 2017. 1–6.
- 55. Sellers KF, Shmueli G. Data Dispersion: Now You See It… Now You Don’t. Communications in Statistics - Theory and Methods. 2013 Sep 2;42(17):3134–47.
- 56.
Olver F. Asymptotics and Special Functions [Internet]. A K Peters/CRC Press; 1997. Available from: http://dx.doi.org/10.1201/9781439864548
- 57.
Minka TP, Shmeuli G, Kadane JB, Borle S, Boatwright P. Computing with the COM-Poisson distribution. 2003.
- 58. Gaunt RE, Iyengar S, Olde Daalhuis AB, Simsek B. An asymptotic expansion for the normalizing constant of the Conway–Maxwell–Poisson distribution. Ann Inst Stat Math. 2017;71(1):163–80.
- 59. National Center for Health Statistics. Advance letter. https://ftp.cdc.gov/pub/Health_Statistics/NCHS/Survey_Questionnaires/NHIS/2022/advance-letter-2022.pdf
- 60. National Center for Health Statistics. National Health Interview Survey 2022 Capi Manual for NHIS Field Representatives. 2021 [cited 2024 Oct 3. ]. https://ftp.cdc.gov/pub/Health_Statistics/NCHS/Survey_Questionnaires/NHIS/2022/frmanual-508.pdf
- 61. Kim-Mozeleski JE, Pandey R, Tsoh JY. Psychological distress and cigarette smoking among U.S. households by income: Considering the role of food insecurity. Prev Med Rep. 2019;16:100983. pmid:31516816
- 62. Breslau N. Daily cigarette consumption in early adulthood: age of smoking initiation and duration of smoking. Drug Alcohol Depend. 1993;33(3):287–91. pmid:8261892
- 63. Mahmoud MA, Rashad H, Hashish A. Assessment of nicotine dependence and its potential predictors among a cohort of current Egyptian cigarette smokers - a preparatory phase of a workplace anti-smoking initiative: Part I. Int J Health Sci Res. 2023;13(6):173–82.
- 64. Haddabi AS, Ndehfru NJ, Aliyu A. Analysis of food security status among rural farming households in Mubi North local government area of Adamawa State, Nigeria. Int J Res Granthaalayah. 2019;7(7):226–46.
- 65. Cheteni P, Khamfula Y, Mah G. Exploring Food Security and Household Dietary Diversity in the Eastern Cape Province, South Africa. Sustainability. 2020;12(5):1851.
- 66. Pourmotabbed A, Moradi S, Babaei A, Ghavami A, Mohammadi H, Jalili C, et al. Food insecurity and mental health: a systematic review and meta-analysis. Public Health Nutr. 2020;23(10):1778–90. pmid:32174292
- 67. Jessiman-Perreault G, McIntyre L. Household Food Insecurity Narrows the Sex Gap in Five Adverse Mental Health Outcomes among Canadian Adults. Int J Environ Res Public Health. 2019;16(3):319. pmid:30678378
- 68. Meisinger C, Döring A, Thorand B, Löwel H. Association of cigarette smoking and tar and nicotine intake with development of type 2 diabetes mellitus in men and women from the general population: the MONICA/KORA Augsburg Cohort Study. Diabetologia. 2006;49(8):1770–6.
- 69. Bošković V, Vlaović-Bošković J, Ljaljević A, Babović B, Slović Ž, Ćorović S, et al. Cigarette consumption among University students in Montenegro: The prevalence and smoking habits. konzumiranje Cigareta među studentima univerziteta Crne gore: prevalenCija i pušačke navike [Internet]. 2022 [cited 2024 Aug 22. ]. https://scidar.kg.ac.rs/handle/123456789/15261
- 70. Kwon JA, Jeon W, Park EC, Kim JH, Kim SJ, Yoo KB. Effects of Disease Detection on Changes in Smoking Behavior. Yonsei Med J. 2015;56(4):1143.
- 71. Freathy RM, Kazeem GR, Morris RW, Johnson PCD, Paternoster L, Ebrahim S, et al. Genetic variation at CHRNA5-CHRNA3-CHRNB4 interacts with smoking status to influence body mass index. Int J Epidemiol. 2011;40(6):1617–28. pmid:21593077