Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

An ecological examination of early adolescent e-cigarette use: A machine learning approach to understanding a health epidemic

Abstract

E-cigarette use among adolescents is a national health epidemic spreading faster than researchers can amass evidence for risk and protective factors and long-term consequences associated with use. New technologies, such as machine learning, may assist prevention programs in identifying at risk youth and potential targets for intervention before adolescents enter developmental periods where e-cigarette use escalates. The present study utilized machine learning algorithms to explore a wide array of individual and socioecological variables in relation to patterns of lifetime e-cigarette use during early adolescence (i.e., exclusive, or with tobacco cigarettes). Extant data was used from 14,346 middle school students (Mage = 12.5, SD = 1.1; 6th and 8th grades) who participated in the Utah Prevention Needs Assessment. Students self-reported their substance use behaviors and related risk and protective factors. Machine learning algorithms examined 112 individual and socioecological factors as potential classifiers of lifetime e-cigarette use outcomes. The elastic net algorithm achieved outstanding classification for lifetime exclusive (AUC = .926) and dual use (AUC = .944) on a validation test set. Six high value classifiers were identified that varied in importance by outcome: Lifetime alcohol or marijuana use, perception of e-cigarette availability and risk, school suspension(s), and perceived risk of smoking marijuana regularly. Specific classifiers were important for lifetime exclusive (parent’s attitudes regarding student vaping, best friend[s] tried alcohol or marijuana) and dual use (best friend[s] smoked cigarettes, lifetime inhalant use). Our findings provide specific targets for the adaptation of existing substance use prevention programs to address early adolescent e-cigarette use.

Introduction

E-cigarette use among adolescents is a national epidemic [1]. The popularity of these devices has spread faster than health researchers could amass evidence for the potential deleterious effects of e-cigarette use. As the prevalence of traditional cigarette smoking among adolescents in the United States (U.S.) has declined, e-cigarette use, or vaping, has become the most commonly used form of nicotine uptake among youth [2]. Adolescents’ decisions to engage in e-cigarette use may be understood through an ecological framework that accounts for complex interactions between spheres of influence [3]. Research is underway to identify individual and socioecological risk factors associated with e-cigarette use [411]. However, this literature has prominently focused on high school samples resulting in a dearth of knowledge regarding e-cigarette risk factors during early adolescence. Identifying factors associated with the emergence of e-cigarette use during early adolescence may facilitate intervention prior to developmental periods where use escalates (i.e., middle to late adolescence [12]). These efforts may be bolstered by new methodologies that allow researchers to efficiently explore the importance of a wide range of variables in relation to e-cigarette use [13]. In this study, we use machine learning algorithms to simultaneously consider a large number of individual and socioecological factors in relation to patterns of e-cigarette usage among middle school students [7].

The use of e-cigarettes has been touted as a healthier alternative to tobacco cigarettes, despite their delivery of nicotine and other potentially harmful chemicals [14]. A major concern of nicotine consumption during early adolescence is the possible alteration of functioning in the brain’s reward systems at a sensitive developmental period, in ways that can increase risk for other substance use, mood disorders, and difficulties with concentration and learning [14]. In addition to nicotine-related risks, other carcinogenic agents are found in chemicals in the e-liquid as well as those produced in the vaporizing product or even associated with the e-cigarette materials (i.e., nickel, chromium, cadmium; [14]). Chemicals used to flavor e-liquid have also been found to have sufficiently high toxicity to warrant medical concerns [15] or even cause death [16]. The possible harms of e-cigarettes go well beyond exposure to nicotine.

Researchers have documented complex relationships between individual (e.g., academic performance, substance use, perceptions of use) and socioecological (e.g., access, advertisements, peer and parental factors) influences implicated in e-cigarette use during middle to late adolescence [79]. Less is known about e-cigarette use risk factors during early adolescence. The early adolescent e-cigarette literature has predominately focused on the prevalence and reasons for use, or factors associated with susceptibility rather than initiation [12, 1720]. Studies examining adolescent e-cigarette use have also had a narrow focus when considering potential individual and socioecological influences on adolescent e-cigarette use. For example, studies commonly focus on specific adolescent attitudes (e.g., the perceived danger of e-cigarettes and tobacco cigarettes), substance use behaviors (e.g., alcohol, tobacco, marijuana), aspects of the environment (e.g., access, advertisements), and social influences (e.g., peer and parental e-cigarette or cigarette use [79, 12, 19]). Research is needed to examine these risk factors in conjunction with a broader array of influences traditionally associated with early substance use (e.g., anti-social behavior; parenting practices; school involvement, performance, environment; community attachment, norms, drug use, and delinquency; [2125]).

It is also important to consider patterns of use when identifying correlates of early adolescent vaping. Research on high school students (M = 14.6 years, SD = 0.7; 9th and 12th graders) suggests risk factors vary between youth who have utilized e-cigarettes exclusively and those who have used them in combination with tobacco cigarettes, with dual use being associated with greater behavioral problems and substance use (i.e., lifetime alcohol, marijuana, drug use prescription drug misuse; [7, 26]). Exclusive use may represent adolescents using e-cigarettes as a “safer” alternative to traditional tobacco cigarettes [19]. Dual use may be associated with tobacco cessation or recreational use in conjunction with other substances [19, 26]. Research has yet to determine whether differences in risk factors for exclusive or dual e-cigarette use exist during early adolescence.

Methodological challenges may explain the limited number of studies examining a broad array of correlates of e-cigarette use during early adolescence. For example, lifetime e-cigarette is a low base rate behavior during early adolescence relative to later developmental periods [26]. Prevalence rates are even lower when researchers examine exclusive and dual e-cigarette use relative to general lifetime use [7]. Furthermore, limitations associated with traditional statistical methodologies may pose a barrier to examining the broad array of potential factors implicated in early adolescent e-cigarette use (e.g., statistical power issues; multicellularity; family-wise error rate [13]). These limitations can be addressed with large datasets, however, meeting statistical assumptions for multicollinearity and reducing family-wise error may limit the number of potential risk factors that can be simultaneously considered in relation to early adolescent e-cigarette use.

Machine learning may facilitate the examination of factors associated with early adolescent e-cigarette use. Machine learning provides an efficient method of simultaneously examining large numbers of variables representing youth individual and socioecological factors to determine their importance in classifying substance use [13]. Within the context of machine learning, variable importance refers to the relative ability of variables to reduce the error in predictions of group membership (e.g., exclusive e-cigarette user or non-user) compared to other covariates in the model [27]. Elastic net, random forest, k-nearest neighbors, and neural networks are examples of common algorithms that are capable of providing superior accuracy in classifying lifetime substance use relative to traditional logistic regression [13, 28]. Each of these algorithms approaches classification with contrasting linear (i.e., elastic net) and nonlinear (i.e., random forest, k-nearest neighbors, neural networks) methods, providing the opportunity to identify the algorithm that best performs the classification task for each outcome [27].

Identifying high value correlates of e-cigarette initiation during early adolescence could improve our ability to identify at-risk youth prior to developmental periods where the prevalence and frequency of vaping escalate (i.e., middle to late adolescence; [12]). While machine learning may provide an additional tool for informing substance use prevention efforts [29], few studies have utilized machine learning to identify factors associated with patterns of early e-cigarette use (i.e., lifetime exclusive use, dual use with tobacco cigarettes) or determined which method provides the best classification accuracy. Prior applications of machine learning have predominately focused on unstructured data (i.e., pictures, text) to classify e-cigarette use [30, 31]. While a recent study trained machine learning algorithms on survey data collected on older teens (Mage = 15.36 years old; SD = 1.85), this research had a narrow focus on tobacco related substance use predictors that precludes a broader understanding of factors associated with early vaping initiation (i.e., LASSO and Random Forest; [32]). Thus, our aim was to (a) explore a wide array of factors using machine learning to identify important classifiers of lifetime exclusive and dual e-cigarette use within a sample of middle school students, (b) and identify the algorithm that best performs the classification task. These analyses can help quantify the relative importance of predictors and establish the extent to which e-cigarette use can be classified by individual and socioecological factors.

Materials and method

The present study utilized data from the Utah Student Health and Risk Prevention (SHARP) survey project, which has been collecting and disseminating information on substance use prevalence and related behaviors since 2007 (Utah Department of Human Services [UDHS]; [33]). SHARP was developed as a collaboration between multiple state agencies to assess risk and protective factors for problem behaviors among Utah middle and high school students. Students complete the Utah Prevention Needs Assessment (PNA) survey biannually, during the spring of odd numbered years, as a part of the SHARP survey project. The PNA survey gathers statewide data on substance use and individual/socioecological factors that influence the use of alcohol, tobacco, and other drugs. PNA surveys are implemented and used to inform statewide prevention policy and programming across the U.S. Surveys are completed in schools and are self-administered using paper and pencil. The present study used data collected during the Spring of 2017 (i.e., March-June 2017) as part of a PNA survey in Utah. Parents provided written consent for their child to participate in the survey. Youths whose parents did not consent to their participation in the PNA were not administered the survey. The students also provided verbal assent prior to participating in the PNA survey. Participation in the survey was voluntary and students could opt to participate in an alternative activity or discontinue at any time. The Utah State University Institutional Review Board approved secondary analyses of the 2017 Utah PNA survey data as non-human subjects research as participants could not be re-identified (protocol #10108). Previous research has utilized similar statewide school-based samples to identify factors associated with e-cigarette use among adolescents across the U.S. (e.g., Hawaii, Texas, Connecticut, New Jersey; [69, 20, 34]).

The pesent study focused on 14,346 middle school students (i.e., 6th and 8th grade) who participated in the 2017 Utah PNA survey. Participants were 12 and half years old on average (M = 12.5; SD = 1.1), were relatively balanced on sex (girls; n = 7,532; 52.5%) and grade (6th; n = 7,473, 52.1%), and were predominantly White (10,191; 71%). Nearly a third of students attended school within Salt Lake County (n = 4,173; 29.1%) in Utah. Youths in this sample reported a 9.4% (n = 1,343) prevalence of lifetime e-cigarette use and 5.4% (n = 784) tobacco cigarette use. Students largely reported abstaining from both tobacco cigarette and e-cigarette use (90.6%; n = 13,003) and reported greater lifetime exclusive e-cigarette use (5.5%; n = 791) relative to exclusive tobacco cigarettes use (1.6%; n = 232). Within the sample, 3.8% (n = 552) of students reported dual lifetime use of tobacco cigarettes and e-cigarettes. See Table 1 for sample demographic information by outcomes.

thumbnail
Table 1. Student demographics by lifetime e-cigarette use groups.

https://doi.org/10.1371/journal.pone.0287878.t001

Measures

Individual and socioecological variables.

The measures utilized in the present study have been traditionally used and reported by the SHARP survey project as individual items [33]. Variables examined in the present study have been identified as being theoretically and/or empirically important factors in the substance use literature. We decided to examine individual items to provide a nuanced understanding of e-cigarette use risk factors [13]. Variables included a wide array of factors representing individual (i.e., antisocial behaviors/attitudes, rebelliousness, academic performance, perceived risk of drug use, intentions for adulthood substance use, lifetime substance use), community (i.e., attachment, prosocial involvement and reward, drug use consequences and antisocial behavior, perceived availability of substances), school (i.e., learning environment perceptions, enjoyment, commitment, benefits of learning, truancy), home (i.e., parenting practices, family history of substance abuse, rewards for prosocial behavior, parental attitudes regarding antisocial behavior and drug use, relationship quality with parents), and social (i.e., best friends engaged in antisocial behavior, tried alcohol or drugs, exhibited prosocial behavior; social rewards for antisocial and prosocial behaviors) influences. See S1 Table for all items examined in the current study.

Outcome.

Students reported whether they ever tried electronic cigarettes or e-cigarettes (i.e., yes or no). They also reported whether they had ever tried tobacco cigarettes, even just a puff (i.e., yes or no). Two dichotomous outcome variables were created from these items to represent lifetime exclusive e-cigarette use and dual use (i.e., tobacco cigarette and e-cigarette use). The comparison group for each outcome was students who abstained from using either substance.

Analytic plan

In our sample, 47% (n = 6,744) of participants were missing at least one covariate. Prior to imputation, data was randomly resampled into training (70%; n = 9,657 e-cigarette; n = 9,490 dual) and testing sets (30%; n = 4,237 e-cigarette; n = 4,065 dual). We then used mode imputation, wherein missing values were replaced with the mode for each variable to address missingness independently for training and testing sets. Mode imputation is commonly utilized within the context of machine learning for classification tasks [28]. As algorithms can struggle to predict low base rate outcomes, a method known as downsampling was used to randomly resample and reduce the negative class (i.e., those that did not use e-cigarettes or tobacco cigarettes) until it was equal to the positive class within the training set [27]. Thus, rates of lifetime use and non-use were equal for each outcome within the resampled training sets. The training sets were n = 1,108 for exclusive e-cigarette use and n = 774 for dual use.

Five dissimilar machine learning algorithms-elastic net, random forest, neural networks, k-nearest neighbors, and logistic regression–were then fitted to the training set to create classification models for each outcome [35]. Each classification algorithm drew information from 112 variables representing student individual and socioecological factors. 5-fold cross-validation was used to identify variables that improved classification accuracy across random subsets of data within the training set [28]. Model performance was assessed on a test set using the Area Under (AUC) the Receiving Operator Characteristic (ROC) curve, which represents the ability of a model to classify outcomes across all possible cut points [27]. The top performing classification algorithm on the test set was selected for each outcome (i.e., AUC; sensitivity, specificity; [27]). Variable importance figures reflect results from the best performing algorithms for each outcome. High value classifiers were then identified through visual inspection of the relative importance figures. Variables that demonstrate a large increase in relative importance over subsequent covariates were said to be high value classifiers [28]. High value classifiers were examined using a cross-tabulation visualization to determine the nature of the relationship between each variable and the corresponding outcome [13].

Results

Patterns of lifetime e-cigarette use differed by demographic variables within the current sample. Chi-square tests of independence suggest e-cigarette usage was significantly (p < .001) associated with student sex, grade, and race/ethnicity. Boys, 8th graders, Latinxs, Native Americans, and mixed-race students reported the greatest proportion of use across outcomes. Exclusive and dual use were generally associated with a greater proportion of lifetime use across substances relative to non-users. See Table 1 for demographic variables by outcomes.

Exclusive use

Algorithmic performance on the exclusive e-cigarette use classification task ranged from good to outstanding (AUC = .787 - .926) on the test set. See S1 Fig in S1 File for ROCs for classification algorithms. Elastic net was the best performing algorithm in classifying exclusive e-cigarette use (AUC = .926, sensitivity = .857, specificity = .848). In contrast, logistic regression was the worst performing algorithm in classifying lifetime e-cigarette use (AUC = .787, sensitivity = .768, specificity = .806). Elastic net identified perceived availability of e-cigarettes, lifetime alcohol use, parents’ attitudes regarding their use of vape products, school suspension, perceived risk of e-cigarette use, lifetime marijuana use, best friend(s) tried alcohol, best friend(s) used marijuana, and perceived risk of smoking marijuana regularly as the best discriminators between lifetime exclusive e-cigarette users and non-users. See Fig 1 for variable importance. Visual inspection of cross-tabulation mosaics suggests that perceived availability of e-cigarettes (i.e., sort of hard, very easy, sort of easy), lifetime substance use (i.e., alcohol, marijuana), school suspensions (i.e., 1 or more), lower levels of perceived risk associated with e-cigarette use (i.e., none to moderate), best friend(s) tried alcohol or used marijuana (i.e., 1 or more), and less perceived risk associated with smoking marijuana regularly were all associated with a greater proportion of lifetime e-cigarette use. Students who reported that their parents would view their use of vape products as “very wrong” had the lowest proportion of use relative to other levels of approval (i.e., wrong to not wrong at all). See S3-S11 Figs in in S1 File for cross-tabulation visualizations.

thumbnail
Fig 1. Top 20 variables with the highest relative importance in classifying lifetime e-cigarette use.

Results represent validation on a separate test dataset.

https://doi.org/10.1371/journal.pone.0287878.g001

Dual use

Algorithmic performance on the dual tobacco cigarette and e-cigarette use classification task also ranged from excellent to outstanding (AUC = .725 - .944) on the test set. See S2 Fig in S1 File for ROCs for classification algorithms. Elastic net and random forest had the same AUC score (.944). However, elastic net (sensitivity = .824, specificity = .947) outperformed random forest (sensitivity = .818, specificity = .939) based on sensitivity and specificity. Logistic regression was the worst performing algorithm in classifying lifetime dual use (AUC = .725, sensitivity = .630, specificity = .779). Elastic net identified lifetime alcohol use, lifetime marijuana use, perceived availability of e-cigarettes, best friend(s) cigarette use, perceived risk of e-cigarette use, lifetime inhalant use, school suspensions, and perceived risk of smoking marijuana regularly as the best discriminators between lifetime dual users and non-users. See Fig 2 for variable importance. Visual inspection of cross-tabulation mosaics suggests that lifetime substance use (i.e., alcohol, marijuana, inhalants), higher levels of perceived availability of e-cigarettes (i.e., very easy, sort of easy), best friend(s) that have smoked cigarettes (i.e., 1 or more), school suspensions (i.e., 1 or more), lower levels of perceived risk associated with e-cigarette use and using marijuana regularly (i.e., none to moderate) were all associated with a greater proportion of lifetime dual use. See S12-S19 Figs in S1 File for cross-tabulation visualizations.

thumbnail
Fig 2. Top 20 variables with the highest relative importance in classifying lifetime dual use.

Results represent validation on a separate test dataset.

https://doi.org/10.1371/journal.pone.0287878.g002

Discussion

The present study expands the literature through the simultaneous exploration of established correlates of e-cigarette initiation and traditional factors associated with substance use in relation to early adolescent vaping. Algorithms utilizing information regarding students’ individual characteristics and socioecological context demonstrated high levels of classification accuracy for both lifetime exclusive and dual e-cigarette use. Elastic net generally outperformed other algorithms in classification accuracy. While the order of importance of classifiers differed by the outcome, elastic net consistently identified six high value classifiers across usage groups: lifetime alcohol or marijuana use, perception of e-cigarette availability and risk, school suspension(s), and perceived risk of smoking marijuana regularly. Several high value classifiers differed between youth who reported lifetime exclusive (i.e., parent’s attitudes regarding their use of vaping products, best friend[s] tried alcohol, best friend[s] used marijuana) and dual e-cigarette use (i.e., best friend[s] smoked cigarettes, lifetime inhalants use). These findings highlight important commonalities and differences in risk profiles between lifetime exclusive and dual e-cigarette users.

Research using high school samples has documented higher rates of life substance use among dual versus exclusive e-cigarette users [7]. Within our sample, rates of substance use were generally higher among exclusive and dual users relative to those who abstained from both. Consistent with prior research, the greatest portion of substance use was found among youth who had reported dual use [26]. However, only lifetime alcohol and tobacco cigarette use were found to be important classifiers of both e-cigarette use outcomes among middle school students, which is consistent with prior findings in high school samples [7]. Our findings also extend prior work through the identification of inhalant use as a novel risk factor specifically related to lifetime dual use during early adolescence. It is possible that dual users may access a wide variety of substances recreationally and may utilize inhalants as they are easy to access within the home [26, 36]. Further research is needed to understand the relationship between lifetime inhalant and dual use.

Our findings confirm that the availability of e-cigarettes is an important influence on early adolescent vaping [19]. Despite a Utah state law that restricts the sale of these products to individuals under the age of 20 when this survey took place, accessibility was especially important to exclusive e-cigarette use, which is concerning as this may translate to future traditional cigarette use among adolescents who may have otherwise abstained from tobacco cigarette use [8]. Consistent with prior research, students who reported lower perceived danger of using e-cigarettes reported a greater proportion of lifetime use [12]. Our findings suggest a need to consider perceptions regarding the danger of other substances, such as marijuana, when assessing the risk for early adolescent e-cigarette use. While recent findings have highlighted the importance of school-based factors in assessing risk for e-cigarette use among high school samples (i.e., truancy and poor academic performance; [10]), our findings suggest that student suspensions were the most relevant aspect of school in relation to early adolescent e-cigarette use. It is possible school suspensions may be associated with an increased risk for e-cigarette use as a potential proxy for rule breaking behaviors and/or through greater unsupervised time outside of school [7, 37]. Further research is needed to elucidate the relationship between school suspensions and early adolescent e-cigarette use.

It is important to mention that factors traditionally associated with substance use such as adolescent and peer delinquency, community substance use norms, and school involvement were not important predictors of lifetime exclusive and dual e-cigarette use within the current sample. These findings may signal potential differences between factors underlying e-cigarette and other forms of substance use. Additionally, factors identified by prior research as relevant predictors of vaping (e.g., parenting practices, perceived risk of smoking tobacco cigarettes) were not relevant correlates of patterns of e-cigarette use within the present sample [7, 12]. It is possible that when competing against other variables within a machine learning approach, these important predictors are truly of lesser importance relative to high value classifiers identified in the current study.

Implications

Our findings support addressing early adolescent vaping through prevention programs aiming to address substance use broadly (i.e., alcohol, tobacco, marijuana, inhalants, e-cigarettes). There is ample evidence for the efficacy of prevention programs whether focused on a single substance or multiple ones [38]. Substance use prevention programs may benefit from adding components that equip parents to effectively communicate their disapproval of their child’s use of e-cigarettes (i.e., attitudes) and teach youth refusal skills to resist peer influences [39, 40]. Targeted prevention programs are already supported by research documenting the perception of risk associated with vaping as a consistent predictor of e-cigarette use [7, 8] and our findings highlight the importance of also considering perceptions of danger regarding marijuana use during early adolescence. Specific to programming, youth’s perceptions of risk do not align with research evidence providing an important point of content for preventive interventions [41]. Altering youth’s perception of e-cigarette accessibility, however, may require intervention at a broader social level (e.g., public media campaigns). Alternatively, decreasing accessibility to e-cigarettes may be achieved by actions external to youths such as strong enforcement of laws regarding consumption for underage users and/or those selling e-cigarette products to them, or by way of increasing prices for goods associated with e-cigarette use.

Machine learning appears to be a promising screening tool for the identification of risk factors that can accelerate the development of the e-cigarette knowledge base needed to curb the rapid spread of vaping among adolescents nationally. Algorithms were able to efficiently explore a wide range of factors in association with early adolescent e-cigarette use, which confirmed findings from later developmental stages and identified several novel risk factors (i.e., inhalant use, perceived risk of marijuana, school suspensions). An important consideration in this research is that the tools utilized to identify e-cigarette use classifiers are publicly available. R offers open-access statistical packages for machine learning. Additionally, there are substantial training materials available for free online. These tools can provide an accessible and replicable method of generating and disseminating scientific knowledge regarding e-cigarette use classifiers nationally. We encourage researchers to apply machine learning algorithms to their data to draw new insight regarding factors contributing to a variety of e-cigarette use outcomes among adolescents. Examining cross-sectional markers of e-cigarette use could also identify important variables that can be examined longitudinally in prospective research. Machine learning algorithms have many exciting applications when applied to longitudinal data, including identifying context specific predictors of service use, specific targets for substance use prevention programs, and ensuring that important factors are not excluded from causal models examining mechanisms underpinning early vaping initiation.

Limitations

Results from algorithms used in the present study do not necessarily imply causal mechanisms explaining patterns of lifetime e-cigarette use but rather identify factors that are strong correlates of group membership (i.e., use or non-use). Longitudinal research is needed to establish causal links. The present study examined lifetime substance use that may range from experimentation to habitual use. Future research may consider using machine learning as a method of identifying youth at risk for habitual e-cigarette use. Although a large number of the Utah adolescent population was captured, the PNA survey does not include students in private schools, correctional facilities, or treatment centers. Additionally, students who were not in attendance, declined to participate, or did not return parental consent forms were not represented in the survey. Furthermore, findings may not generalize to students in other states. Future research should replicate our findings in different contexts and developmental periods. Lastly, there were few youths who reported exclusive tobacco cigarette use within the current sample, which limited our ability to compare differences in correlates between e-cigarette use outcomes and tobacco cigarette use with machine learning algorithms. Future research should use a sample with greater representation of exclusive tobacco cigarette users to determine whether corelates differ from e-cigarette use outcomes during early adolescence.

Conclusions

The present study utilized a machine learning approach to efficiently identify high value correlates of early adolescent lifetime e-cigarette use. This approach identified several shared risk factors for exclusive and dual e-cigarette use such as lifetime use of specific substances (i.e., alcohol, marijuana), perception of e-cigarette availability and risk, school suspension(s), and perceived risk of smoking marijuana regularly. Several differences were also identified between youth who reported lifetime exclusive (i.e., parent’s attitudes regarding their use of vaping products, best friend[s] tried alcohol or used marijuana) and dual use (i.e., best friend[s] smoked cigarettes, lifetime inhalants use) relative to non-users. This information provides a first step towards identifying youth at risk for e-cigarette use during early adolescence. Further research is needed to examine high value classifiers identified by the present study using explanatory models and longitudinal data to understand the mechanisms underlying their importance in accounting for differences in risk profiles between e-cigarette usage groups during early adolescence.

Acknowledgments

Representatives of the Department of Human Services supported this work through the provision of information regarding the PNA survey methods and sampling procedure.

References

  1. 1. Office of the Surgeon General. Surgeon General’s Advisory on E-cigaretteUse Among Youth. 2019. https://www.cdc.gov/tobacco/basic_information/e-cigarettes/surgeon-general-advisory/index.html
  2. 2. Centers for Disease Control and Prevention. Tobacco product use among middle and high school students—United States, 2011–2017. Morb Mortal Wkly Rep. 2018;63: 629–633. pmid:29879097
  3. 3. Bronfenbrenner U. Toward an experimental ecology of human development. Am Psychol. 1977;32: 513–531.
  4. 4. Barrington-Trimis JL, Berhane K, Unger JB, Cruz TB, Urman R, Chou P, et al. The E-cigarette Social Environment, E-cigarette Use, and Susceptibility to Cigarette Smoking. 2016 [cited 26 Jun 2019].
  5. 5. Bold KW, Morean ME, Kong G, Simon P, Camenga DR, Cavallo DA, et al. Early age of e-cigarette use onset mediates the association between impulsivity and e-cigarette use frequency in youth. Drug Alcohol Depend. 2017;181: 146–151. pmid:29055268
  6. 6. Giovenco DP, Casseus M, Duncan DT, Coups EJ, Lewis MJ, Delnevo CD. Association Between Electronic Cigarette Marketing Near Schools and E-cigarette Use Among Youth. J Adolesc Heal. 2016;59: 627–634. pmid:27720358
  7. 7. Wills TA, Knight R, Williams RJ, Pagano I, Sargent JD. Risk factors for exclusive e-cigarette use and dual e-cigarette use and tobacco use in adolescents. Pediatrics. 2015;135: e43–e51. pmid:25511118
  8. 8. Wills TA, Sargent JD, Knight R, Pagano I, Gibbons FX. E-cigarette Use and Willingness to Smoke in a Sample of Adolescent Nonsmokers. 2016;2: 1–16.
  9. 9. Simon P, Camenga DR, Morean ME, Kong G, Bold KW, Cavallo DA, et al. Socioeconomic status and adolescent e-cigarette use: The mediating role of e-cigarette advertisement exposure. Prev Med (Baltim). 2018;112: 193–198. pmid:29673887
  10. 10. McCabe SE, West BT, Veliz P, Boyd CJ. E-cigarette Use, Cigarette Smoking, Dual Use, and Problem Behaviors Among U.S. Adolescents: Results From a National Survey. J Adolesc Heal. 2017;61: 155–162. pmid:28391965
  11. 11. Rocheleau GC, Vito AG, Intravia J. Peers, Perceptions, and E-Cigarettes: A Social Learning Approach to Explaining E-Cigarette Use Among Youth. J Drug Issues. 2020;50: 472–489.
  12. 12. Kwon E, Seo DC, Lin HC, Chen Z. Predictors of youth e-cigarette use susceptibility in a U.S. nationally representative sample. Addict Behav. 2018;82: 79–85. pmid:29501011
  13. 13. Vázquez AL, Domenech Rodríguez MM, Barrett TS, Schwartz S, Amador Buenabad NG, Bustos Gamiño MN, et al. Innovative Identification of Substance Use Predictors: Machine Learning in a National Sample of Mexican Children. Prev Sci. 2020;21: 171–181. pmid:31960262
  14. 14. National Institute on Drug Abuse. What are electronic cigarettes? 2018.
  15. 15. Tierney PA, Karpinski CD, Brown JE, Luo W, Pankow JF. Flavour chemicals in electronic cigarette fluids. Tob Control. 2016;25: e10–e15. pmid:25877377
  16. 16. Centers for Disease Control and Prevention. Outbreak of lung injury associated with the use of e-cigarette, or vaping, products. 2020.
  17. 17. Carey FR, Rogers SM, Cohn EA, Harrell MB, Wilkinson A V, Perry CL. Understanding susceptibility to e-cigarettes: A comprehensive model of risk factors that influence the transition from non-susceptible to susceptible among e-cigarette naïve adolescents. Addict Behav. 2019;91: 68–74. pmid:30241775
  18. 18. Cullen KA, Gentzke AS, Sawdey MD, Chang JT, Anic GM, Wang TW, et al. e-Cigarette Use among Youth in the United States, 2019. JAMA—J Am Med Assoc. 2019;322: 2095–2103. pmid:31688912
  19. 19. Kong G, Morean ME, Cavallo DA, Camenga DR, Krishnan-Sarin S. Reasons for electronic cigarette experimentation and discontinuation among adolescents and young adults. Nicotine Tob Res. 2015;17: 847–854. pmid:25481917
  20. 20. Krishnan-Sarin S, Morean ME, Camenga DR, Cavallo DA, Kong G. E-cigarette use among high school and middle school adolescents in Connecticut. Nicotine Tob Res. 2015;17: 810–818. pmid:25385873
  21. 21. Eitle DJ, Eitle TM. School and county characteristics as predictors of school rates of drug, alcohol, and tobacco offenses. J Health Soc Behav. 2004;45: 408–421. pmid:15869113
  22. 22. Evans WD, Powers A, Hersey J, Renaud J. The influence of social environment and social image on adolescent smoking. Heal Psychol. 2006;25: 26–33. pmid:16448295
  23. 23. Mason MJ, Zaharakis NM, Rusby JC, Westling E, Light JM, Mennis J, et al. A longitudinal study predicting adolescent tobacco, alcohol, and cannabis use by behavioral characteristics of close friends. Psychol Addict Behav. 2017;31: 712–720. pmid:28703615
  24. 24. Novak SP, Clayton RR. The influence of school environment and self-regulation on transitions between stages of cigarette smoking: A multilevel analysis. Heal Psychol. 2001;20: 196–207. pmid:11403217
  25. 25. Voelkl KE, Frone MR. Predictors of substance use at school among high school students. J Educ Psychol. 2000;92: 583–592.
  26. 26. Curran KA, Burk T, Pitt PD, Middleman AB. Trends and Substance Use Associations With E-Cigarette Use in US Adolescents. Clin Pediatr (Phila). 2018;57: 1191–1198. pmid:29652181
  27. 27. Kuhn M, Johnson K. Applied Predictive Modeling with Applications in R. 2013. http://appliedpredictivemodeling.com/s/Applied_Predictive_Modeling_in_R.pdf
  28. 28. Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning. Second. New York: Springer; 2013.
  29. 29. Barenholtz E, Fitzgerald ND, Hahn WE. Machine-learning approaches to substance-abuse research: emerging trends and their implications. Curr Opin Psychiatry. 2020;33: 334–342. pmid:32304429
  30. 30. Ketonen V, Malik A. Characterizing vaping posts on instagram by using unsupervised machine learning. Int J Med Inform. 2020;141: 104223. pmid:32623330
  31. 31. Malik A, Khan MI, Karbasian H, Nieminen M, Ammad-Ud-Din M, Khan SA. Modeling Public Sentiments About JUUL Flavors on Twitter Through Machine Learning. Nicotine Tob Res. 2021; 1–11. pmid:33991191
  32. 32. Choi J, Jung H-T, Ferrell A, Woo S, Haddad L. Machine Learning-Based Nicotine Addiction Prediction Models for Youth E-Cigarette and Waterpipe (Hookah) Users. J Clin Med. 2021;10: 972. pmid:33801175
  33. 33. Utah Department of Human Services. Student Health and Risk Prevention (SHARP) survey reports. 2019.
  34. 34. Temple JR, Shorey RC, Lu Y, Torres E, Stuart GL, Le VD. E-cigarette use of young adults motivations and associations with combustible cigarette alcohol, marijuana, and other illicit drugs. Am J Addict. 2017;26: 343–348. pmid:28370717
  35. 35. James G, Witten D, Hastie T, Tibshirani R. An Introduction to Statistical Learning. Current Medicinal Chemistry. Springer; 2013.
  36. 36. Medina-Mora María E, Real T. Epidemiology of inhalant use. Curr Opin Psychiatry. 2008;21: 247–251. pmid:18382222
  37. 37. Lee KTH, Vandell DL. Out-of-School Time and Adolescent Substance Use. J Adolesc Heal. 2015;57: 523–529. pmid:26306790
  38. 38. Das JK, Salam RA, Arshad A, Finkelstein Y, Bhutta ZA. Interventions for Adolescent Substance Abuse: An Overview of Systematic Reviews. J Adolesc Heal. 2016;59: S61–S75. pmid:27664597
  39. 39. Marsiglia FF, Kulis S, Yabiku ST, Nieri TA, Coleman E. When to Intervene: Elementary School, Middle School or Both? Effects of keepin’ It REAL on Substance Use Trajectories of Mexican Heritage Youth. Prev Sci. 2011;12: 48–62. pmid:21128119
  40. 40. Patterson G. The next generation of PMTO models. Behav Ther. 2005;28: 25–32.
  41. 41. Cheeta S, Halil A, Kenny M, Sheehan E, Zamyadi R, Williams AL, et al. Does perception of drug-related harm change with age? A cross-sectional online survey of young and older people. BMJ Open. 2018;8. pmid:30401725