We conduct a detailed investigation of the relationship among the obesity rate of urban areas and expressions of happiness, diet and physical activity on social media. We do so by analyzing a massive, geo-tagged data set comprising over 200 million words generated over the course of 2012 and 2013 on the social network service Twitter. Among many results, we show that areas with lower obesity rates: (1) have happier tweets and frequently discuss (2) food, particularly fruits and vegetables, and (3) physical activities of any intensity. Additionally, we provide evidence that each of these results offer different and unique insight into the variation of the obesity rate in urban areas within the United States. Our work shows how the contents of social media may potentially be used to estimate real-time, population-scale measures of factors related to obesity.
Citation: Gore RJ, Diallo S, Padilla J (2015) You Are What You Tweet: Connecting the Geographic Variation in America’s Obesity Rate to Twitter Content. PLoS ONE 10(9): e0133505. doi:10.1371/journal.pone.0133505
Editor: David Meyre, McMaster University, CANADA
Received: January 16, 2015; Accepted: June 3, 2015; Published: September 2, 2015
Copyright: © 2015 Gore et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: The authors have no support or funding to report.
Competing interests: The authors have declared that no competing interests exist.
Obesity is becoming increasingly problematic and common in the United States population [1, 2]. More than one-third of U.S. adults are obese resulting in an annual medical cost of over $150 billion dollars [1, 3, 4]. These medical costs occur because obese people are significantly more prone to the leading causes of preventable death including: heart disease, stroke and type 2 diabetes . Obesity is defined by a Body-Mass Index (BMI) which reflects an individual’s weight divided by square of their height. Obese individuals have a BMI of 30 kg m2 or greater. Obesity rate is defined as the percentage of the people in a Metropolitan Statistical Area (MSA) who have a BMI of 30 kg m2 or greater [2, 6].
Despite the prevalence of obesity in the U.S. it is not problematic to the same degree across the country. According to the 2012–2013 Gallup-Healthways Wellness Survey (GHWS) the obesity rate of U.S. MSAs ranges from 12.4% (Boulder, CO) to 39.5% (Huntington, WV). The lack of uniformity in the obesity rate has motivated researchers to identify the factors that can affect obesity and offer insight into the variation in the data .
While the GHWS and other approaches to quantifying the well being of a city rely almost exclusively on survey data, there are now a range of complementary, remote-sensing methods available to researchers. The explosion in the amount and availability of data relating to social media in the past 10 years has driven a rapid increase in the application of data-driven techniques to the social sciences and other analyses of large-scale populations.
Our overall aim in this paper is to investigate how the obesity rate of an urban geographic area correlates with the contents of geo-tagged tweets in that area. Here, tweets refer to 140 character microblogs expressed on the social media platform www.twitter.com and urban areas reflect the 189 MSAs defined by the U.S. Office of Management and Budget . In particular we ask four research questions using geo-tagged tweets from 2012–2013:
- How is the average happiness of the tweets in an urban area related to the population’s obesity rate?
- How is the overall discussion of food consumption on Twitter, and the nutritional density of the food discussed, in related an urban area related to the population’s obesity rate?
- How is the overall discussion of physical activity on Twitter, and the intensity of the activity discussed, in an urban area related to the population’s obesity rate?
- To what extent do the measures used to answer these questions offer unique insight and how well does each correlate with a MSA-level survey measure of a similar variable?
Our methodology for answering the first question uses word frequency distributions collected from a large corpus of geo-tagged tweets posted on Twitter, with individual words scored for their happiness independently by users of Amazon’s Mechanical Turk service . This measure was introduced by Dodds and Danforth , tested for robustness and sensitivity , and employed by Mitchell et. al in a similar pursuit .
In answering questions 2 and 3 we explore the extent to which the level of granularity needed to answer the first question is required for the second and third question. To answer the final question we compute the correlations among the measures used to answer the first three questions to gauge how much unique insight they provide. We also evaluate how well each of our derived Twitter measures correlates with a MSA-level survey measure of a similar variable. This analysis helps determine if the measures actually capture the intended variables (happiness, diet and physical activity) as opposed to other unrelated variables.
The answers to these questions are not always intuitive and provide significant insight into the health-related habits of Twitter users in different urban areas. Ultimately, they show how social media may potentially be used to estimate population-scale measures of factors related to obesity.
The remainder of the paper is structured as follows. In the Methods section, we describe the data sets in our study and our measures of happiness, diet and physical activity derived from tweets. In the Results section we demonstrate that obesity rate and happiness have a similar relationship in 2012 and 2013 as the two variables did in 2011. Next, we explore the relationship between the discussion of food consumption on Twitter and the obesity rate in urban areas. Then, we shift our focus to discussions of physical activity. Finally, we explore the extent to which these measures: (1) contain unique insight and (2) match MSA-level survey measures of similar variables. We conclude with a discussion of the validity and limitations of our study along with directions for future work.
The obesity rates of the MSAs are provided by the 2012–2013 Gallup Healthways Wellbeing Survey. While other sources of geographic obesity rates exist (i.e. BRFSS and NHANES)[13, 14] we use the GHWS because its data was collected during the same time frame (2012–2013) as our Twitter corpus and (2) it measures other MSA-level variables related to happiness, diet and physical activity which allow us to evaluate additional aspects of our work (i.e. Question 4).
The relationship between these datasets is examined using six measures derived from our Twitter corpus: (a) one related to happiness, (b) three related to diet and (c) two related to physical activity. We define each of these measures next.
Measure of Happiness
To quantify the happiness of a tweet we employ Mitchell et al.’s measure havg which reflects the happiness of a tweet. In previous work Mitchell et al. showed that the happiness of tweets are correlated with several population-scale measures including household income, education levels and the 2011 obesity rate in MSAs .
The happiness of a tweet is measured using the Language Assessment by Mechanical Turk (LabMT) word list, assembled by combining the 5,000 most frequent words occurring in each of four text sources: Google Books (English), music lyrics, the New York Times and Twitter. Ten thousand of these individual words have been scored by users of Amazon’s Mechanical Turk service on a scale of 1 (sad) to 9 (happy), resulting in a measure of happiness, h, for each given word . For example, ‘rainbow’ is one of the happiest words in the list with a score of 8.10, while ‘earthquake’ is one of the saddest, with a score of 1.90. Neutral words like ‘the’ or ‘thereof’ tend to score in the middle of the scale, with h(the) = 4.98 and h(thereof) = 5.00 respectively.
In Eq 1, fi is the frequency of the ith word wi in T for which we have a happiness value h(wi) and is the normalized frequency of the word wi.
Measures of Diet
To quantify the dietary content of the foods one tweets about we explore three different measures at varying degrees of granularity. Each of these three measures require that we partition our corpus of tweets using the following binary criteria: if a tweet contains a word(s) describing at least one food in the USDA National Nutrient Database (USDANDB)  it is placed in the Food Tweets set FT; otherwise it is placed in the Non-Food Tweets set NFT.
Given this partitioning, the Food Tweet % (FT%) of a MSA, is the ratio of Food Tweets in the MSA compared to the total number of tweets within the MSA. This reflects our first measure of diet and is shown in Eq 2. (2)
While, the FT% of a MSA quantifies how frequently people tweet about food, it does not offer any insight into the actual food about which people tweet. To measure how nutritious each food included in each tweet is we measure the average nutrient density, ndavg, of the tweet by using the Nutrient-Rich Foods Index (NRF) formula .
While other formulae to determine the nutrient density of foods exist, we use the NRF because its’ scores have been shown to be highly correlated with the recommendations of the USDA’s Healthy Eating Index  and diets featuring high nutrient dense foods on the NRF have been been shown to reduce obesity, while diets consisting of low nutrient dense foods increase the prevalence of obesity [18, 19]. Furthermore the NRF is not restricted to any subset of foods. It is generalizable to any food in the USDANDB .
Nutrient density in the NRF is determined by computing the daily recommended intake value of protein, dietary fiber, vitamin A, vitamin C, vitamin E, calcium, magnesium, iron and potassium provided per 100 kCals of a given food and then subtracting the daily recommended intake values for saturated fat, sodium and added sugars in 100 kCals of the food. Using this formula, fruits and vegetables are some of the most nutrient dense foods (nrf(spinach) = 694.8; nrf(strawberries) = 375.9) while soda is one of the least (nrf(soda) = −55.8). For a given tweet T containing N unique foods we calculate the average nutrient density ndavg using Eq 3. (3)
The calculation of ndavg in Eq 3 is similar to the calculation of havg. In Eq 3 fi is the frequency of the ith food foodi in T with NRF value nrf(foodi) and is the normalized frequency of the food foodi. The result is a measure of the average nutrient density of the foods mentioned in a single tweet.
There is a significant difference between the level of granularity in our first measure (FT%) and our second (ndavg). To bridge this gap we formulate one more measure of the diet of an MSA: Produce % (Prod%). Prod% marries together the nutritional aspects of ndavg with the coarse granularity of FT%.
Recall, fruits and vegetables are among the most nutritionally dense items on the NRF Index. Any tweet that mentions at least one food listed in either Fruits and Fruit Juices or Vegetable and Vegetable Products sections of the USDANDB is in set Prod. Given this partitioning, Prod% is the ratio of tweets in set Prod the compared to the total number of tweets in the MSA. This measure is shown in Eq 4. (4)
Measures of Physical Activity
Along with happiness and diet, research has shown that the physical activity level of individuals affects obesity [21–23]. With this foundation we explore two different measures to quantify discussions of physical activity within our Twitter data set. Each of these measures require that we partition our corpus of tweets into those that discuss physical activities and those that do not. To do this partition we use a binary criteria similar to our food tweet criteria. If a tweet contains a word(s) discussing at least one physical activity in the guidelines for exercise testing published by the American College of Sports Medicine (ACSM) and the Center for Disease Control and Prevention (CDC)  it is placed in the Physical Activity Tweets set PA; otherwise it is placed in the Non-Physical Activity Tweets set NPA. While the guidelines for exercise published by the ACSM and CDC are not exhaustive and do not contain every possible physical activity descriptor we employ them in our work because they list over 400 activities and are well established. They been used by the American Heart Association , national cross-sectional studies  and public health recommendations .
Our first physical activity metric, Physical Activity % (PA%) is shown in Eq 5. It measures the ratio of Physical Activity Tweets compared to the total number of tweets. (5)
The guidelines of physical activities from the ACSM and CDC divides activities into two categories which serve as the basis for our second measure. The two categories of activities are: (1) moderately intense activities that burn 3.5 kCals a minute and (2) strenuously intense activities that burn 7.0 kCals a minute. Moderately intense physical activities include yoga, walking and stretching while strenuously intense physical activities include jogging, mountain climbing and aerobics. For a given tweet T discussing M moderately intense physical activities and S strenuously intense physical activities we calculate, paweighted in Eq 6. paweighted is the weighted number of calories burned by participating in all the physical activities discussed in the tweet for one minute. (6)
Objectivity and Limitations
All of the measures in Eqs 2–6 make no attempt to take the context of words or the meaning of a tweet into account. While this may limit the ability of our measures to appropriately score tweets containing only a few words, previous researchers have employed this approach and obtained reliable results. Furthermore, by ignoring the context of words we gain a degree of impartiality. We are not the one’s deciding a priori whether a given word, food or activity is associated with obesity. This strategy reduces experimental bias and maintains objectivity.
Happiness and Obesity Rate
The first measure we explore is the happiness conveyed in individual words from tweets. Mitchell et al. showed that the happiness of tweets are correlated with the 2011 obesity rate in MSAs . To validate this result we explore the correlation between the happiness of a tweet and the obesity rate of MSAs in our random sample of Twitter data. Recall, our Twitter data contains ∼ 25 million tweets collected during 2012 and 2013 while Mitchell et al.’s data contains ∼ 10 million tweets collected during 2011. Also Mitchell et al. used GHWS obesity rates collected during 2011 while we use obesity rates collected during 2012 and 2013.
Fig 1 shows the correlation of havg and the obesity rate in all the MSAs for: (a) 2011 (Mitchell et al.) and (b) 2012–2013 (our work). The data shows that the happiness people express in tweets generally decreases as the obesity rate increases. This result holds true in 2011 as well as in 2012–2013. Furthermore, the strength of the relationship and the subtleties of the data points are similar. For example, Boulder, CO is the city with the lowest obesity rate and is among the three most happy cities each year. Furthermore Beaumont, TX is in the top 10 MSAs in terms of obesity rate in both data sets and bottom five happiest cities. The Spearman correlation coefficients are similar (r = -0.339 in 2011, r = -0.318 in 2012–2013) and each have p-values far below.001 indicating that the negative correlations are statistically significant. Next, we explore the relationship of five measures of other factors affecting obesity (diet and physical activity) that can be gleamed from Twitter data in a manner similar to the happiness metric, havg.
Dietary Health and Obesity Rate
Research has shown that diet influences obesity [28, 29]. However, the happiness metric, havg, does not account for diet. Many foods that are widely considered unhealthy have high happiness values (h). For example, the term cake has a h value = 7.58 Also, healthy foods can have relatively low happiness values. The term vegan has a h value of 4.82 despite reflecting a diet featuring fruits and vegetables. Furthermore, many healthy and unhealthy foods are not included in the list of terms scored for happiness. As a result, they are completely ignored in the previous analysis.
To gather insight into the relationship between the foods one tweets about and obesity we explore the correlation between three different measures of the dietary content of a tweet and the obesity rate of MSAs. The first measure we explore is ndavg shown in Eq 3. Recall, ndavg reflects the average nutrient density of a tweet. The twitter data we use for this analysis includes more than two million tweets from 2012–2013 mentioning more than six hundred of the 8,000 different foods listed in the USDANDB. The Spearman correlation between ndavg and obesity rate in all MSAs over 2012–2013 is shown in Fig 2.
Fig 2 shows that there is not a statistically significant relationship between the nutrient density of the foods people discuss in their tweets and obesity rate. This result is unexpected. Given our previous result related to the happiness of tweets and the established relationship between diet and obesity, we anticipated a statistically significant negative correlation. We pursue an explanation by identifying the ten foods that are most strongly negatively and positively correlated with obesity. These results are shown in Table 1.
Table 1 elucidates several insights into the set of tweets that discuss food. The first is that areas with lower obesity rates do not exclusively discuss foods that are nutritionally dense. Similarly areas with high obesity rates discuss a mix of nutritionally dense and non-nutritionally dense foods. Specifically, both lists contain multiple foods with positive and negative NRF Index values and the food with the highest NRF Index value (collards) is correlated with high obesity rates.
It is important to note that our nutrient density metric ignores the quantity and preparation of the food consumed in the tweet. These limitations could explain the lack of a significant relationship between the nutrient density of foods people discuss in tweets and their obesity rate. However, the correlation coefficients and p-values in Table 1 reveal that tweets that discuss food, regardless of their nutritional density, are more likely to be negatively correlated with obesity rate than positively correlated. The absolute value of the correlation coefficient of the food tenth most negatively correlated with obesity is ∼ 25% larger than the absolute value of the correlation coefficient for the food most positively correlated with obesity rate. The p-values in Table 1 also reflect this trend. The relationship between all the foods negatively correlated with obesity rate are statistically significant (p <.05) while only the top four foods positively correlated with obesity rate are statistically significant.
Given these two observations we explore the data to see if the frequency with which individuals tweet about food, regardless of its nutritional density, is correlated with obesity. We use the same twitter data as our previous analysis. However, in this version we measure the ratio of Food Tweets compared to the total number of tweets. This metric, FT% is shown in Eq 2. The Spearman correlation between FT% and obesity over all MSAs for each is shown in Fig 3.
Fig 3 shows that the frequency with which people discuss foods in tweets generally decreases as obesity rate increases. For example, San Francisco, CA is the MSA with one of the highest FT% and is among the ten MSAs with the lowest obesity rate. Similarly, several of the MSAs with top twenty obesity levels (Flint, MI; Mobile, AL; Rockford, IL) are amongst the bottom twenty MSAs in terms of FT%. However, the negative correlation between FT% and obesity rate is not as strong as the negative correlation between havg and obesity rate. Furthermore, the negative correlation between FT% and obesity is not immediately obvious. There is not a quorum of established evidence that shows that the more people discuss food the less obese they are.
In order to examine our data further we explore our final measure of the diet of a MSA: Produce % (Prod%). Recall, Prod% marries together the nutritional aspects of ndavg with the coarse granularity of FT%. It reflects the percentage of total tweets that discuss at least one of the foods listed in either the Fruits and Fruit Juices or Vegetable and Vegetable Products sections of the USDANDB. The twitter data we use for this analysis includes more than one million tweets from 2012–2013 mentioning more than 150 different fruits, vegetables or fruit/vegetable related products. The Spearman correlation between Prod% and obesity rate over all MSAs is shown in Fig 4.
Fig 4 shows that the Prod% metric reconciles the trends we saw in our previous explorations with the measures ndavg and FT%. The frequency with which people tweet about fruits, vegetables or related products increases as obesity decreases.
Intuitively this makes sense. Fruits and vegetables are some of the highest scoring items on the NRF Index, so eating them regularly should decrease the obesity rate. The previous measure, ndavg, attempted to account for this but over penalized tweeters for mentioning average and below average foods on the NRF Index. The FT% metric offered a much coarser level of granularity but did not consider the nutritional density of the foods being discussed in a tweet at all. By including nutritional density at a coarse level of granularity we are able to reveal a correlation with obesity rate (r = -0.344) that is similar in magnitude to the correlation between havg and obesity rate. Next, we investigate the discussion of physical activity levels on Twitter and their relationship to the obesity rate in MSAs.
Physical Activity Level and Obesity Rate
Along with happiness and diet, research has shown that the physical activity level of individuals affects obesity [21–23]. However, none of our previously explored measures (havg, ndavg, FT% and Prod%) account for discussions of physical activities within tweets. As a result, we explore two different measures of discussions of physical activity within our Twitter data set.
Our first physical activity measure, Physical Activity % (PA%) measures the ratio of Physical Activity related tweets compared to the total number of tweets. Our second measure weights physical activities according to the intensity levels published by guidelines of the ACSM and CDC. These two measures are shown in Eqs 5 and 6. The Spearman correlation between PA% and obesity rate and paweighted and obesity rate in all MSAs over 2012 and 2013 is shown in Fig 5(a) and 5(b).
The twitter data we use for this analysis includes more than three million tweets from 2012 and 2013 mentioning more than eighty of the physical activities listed by the ACSM and CDC. Almost two million tweets discuss forty-eight different activities of moderate intensity and more than one million tweets discuss thirty-six different activities of strenuous intensity.
The paweighted values of the tweets in our data set vary. The minimum is zero, which reflects a tweet that does not discuss any physical activities from the list published by the ACSM and CDC. The maximum paweighted observed in our data set is 24.5. However, over 99% of the tweets in our data set have paweighted values of either: 0, 3.5 or 7.
Fig 5 shows that there is a statistically significant negative correlation between both PA% and paweighted and the obesity rate in MSAs. However, the relationship between PA% and obesity rate is stronger (r = -0.330) than the relationship between paweighted (r = -0.190) and obesity rate. This result may seem unexpected. The paweighted metric offers the capability to combine the calories burned from multiple activities based on their intensity level. Given these additional capabilities one might expect it to correlate better with obesity rate than the basic PA% metric. To gather additional insight we calculate the activities most positively and negatively correlated with obesity rate in Table 2. Table 2 only includes five activities in each column because there are so few physical activities that have a positive statistically significant correlation with obesity rate.
Table 2 shows that areas with low obesity and areas with high obesity engage in twitter discussions of a mixture of moderately and strenuously intense activities. Both lists include three moderately intense activities and two strenuously intense activities. However, Table 2 also shows that areas with lower obesity rates simply tweet more about physical activities than areas with high obesity rates. The absolute value of the correlation coefficient for the fifth most negatively correlated activity is higher than the absolute value of the correlation coefficient for the activity most positively correlated with obesity.
It is important to note that our physical activity measures ignore if an individual’s discussion of an activity reflects them physically engaging in it or merely witnessing it in some manner. The inability to make this distinction could explain the lack of a more significant relationship between the intensity levels of physical activities and obesity rate.
However, these insights do reveal similarities between the measures: (1) ndavg and Prod% and (2) paweighted and PA%. In both cases adding too much detail to the measure derived from tweets diluted the relationship between the quantities of interest. This is a valuable lesson learned. Given the complexity of Mitchell et. al.’s happiness metric, havg, we assumed we would need measures of discussions of food and physical activities with a similar structure. However, this is not the case. The more coarse measures Prod% and PA% had a stronger relationship to obesity rate than the nuanced measures ndavg and paweighted. Next, we explore the extent to which these measures provide different insight about the obesity rate of a MSA and evaluate the extent to which each correlates with a a MSA-level survey measure of a similar variable.
Evaluation of Measures
The results we have presented thus far demonstrate that three measures (havg, Prod% and PA%) which can be obtained from geo-tagged tweets have a statistically significant negative correlation with the obesity rate of a MSA and that correlation is on the order of -0.30. However, we have not presented any results which show that: (1) the three measures (havg, Prod% and PA%) have unique relationships with the obesity rate of a MSA and (2) the measures actually quantify the happiness, diet and physical activity level of a MSA.
We address both of these questions by computing the correlation among seven variables. Three of the seven variables are the measures of happiness, diet, and physical activity that can be gleamed from Twitter discussions within a MSA and are most correlated with obesity rate: havg, Prod% and PA%. The other four variables reflect MSA-level data collected by the GHWS survey data. These variables are the: (1) obesity rate of a MSA, (2) percentage of individuals in a MSA who report that they eat a healthy diet, (3) percentage of individuals in a MSA who report that they exercise frequently and (4) Well-Being Index of a MSA. The Well-Being Index is computed by aggregating the responses from participants to five statements. Each participant rates their agreement with each statement on a 0 (very strong disagreement) -10 (very strong agreement) scale. The statements are :
- I am satisfied with my present life situation and anticipated life situation.
- My daily feelings and mental state are healthy.
- I have the physical ability to live a full life.
- The behaviors I engage in positively affect my physical health.
- Within my community I feel safe, satisfied and optimistic.
Fig 6 visualizes the lower triangle of a matrix of the Spearman correlations among the seven variables. The blue boxes in Fig 6 reflect a positive correlation, red boxes reflect a negative correlation. This data shows that each of the measures we computed from Twitter discussions within a MSA (havg, Prod% and PA%) are more correlated with the obesity rate of a MSA than they are correlated with any of the other measures computed from Twitter data. This provides evidence that each of the three measures reflect different factors which are correlated with the obesity rate of a MSA. In other words, these three measures are not simply different methods of quantifying the same variable.
Blue boxes reflect a positive correlation, red boxes reflect a negative correlation.
Furthermore, each of the three measures gleamed from our Twitter corpus is more correlated with the MSA-level measure of a similar variable from GHWS than any other variable. To help elucidate this trend we have underlined the correlation coefficient of the variables with the strongest correlation to happiness, Prod% and PA%. While, this trend does not completely rule out the existence of confounders within our Twitter-level measures, it provides evidence that havg, Prod% and PA% are actually reflecting the level of happiness/well-being, diet/healthy-eating and physical activity/exercise within a MSA as opposed to three completely unrelated variables. Next, we review related work, discuss the validity and limitations of our results and provide directions for future work.
We are not the first researchers to explore modeling human behavior with content from Twitter. Emotions have been accurately captured at different levels of granularity from tweets by using hashtags  and sentiment analysis [31, 32]. Given these classification capabilities other researchers have used Twitter data to explore the emotional states individuals go through in a 24 hour period  and while watching sporting events .
Tweets have also been used to model consumer confidence  and identify major news events that cause breaking points in public opinion . They have served as a platform to explore the unique characteristics of astrophysicists  and been analyzed to characterize varieties of the Spanish dialect on a global scale . However, the two studies most related to our research are Broniatowski et al.’s work on modeling the spread of influenza through tweets  and Mitchell et al.’s exploration of the relationship between the happiness of a tweet and its geographic origin .
Since we have already reviewed and validated Mitchell et al.’s work, we only focus on Broniatowski et al. here. Broniatowski et al identified measures that distinguish tweets relevant to influenza from other tweets. In this paper we adopt this strategy to identify measures related to the variation in obesity rate of MSAs from 2012–2013.
We have identified three measures which can be gleamed from Twitter content related to happiness, diet and physical activities. Each of these measures has a statistically significant negative correlation with obesity on the order of -0.30. Furthermore, we have provided evidence that these measures reflect different variables associated with obesity and that these variables actually reflect the happiness, diet and physical activity levels of MSAs. Ultimately, this work has furthered the research effort in understanding obesity by providing a new path through social media data for the development of population-scale measures of factors related to obesity.
Despite these results, internal and external validity threats affect our study. Threats to internal validity arise when factors affect the dependent variables without the evaluators’ knowledge. It is possible that some flaws in the implementation of our metrics could have affected the results of the evaluation. However, the algorithms we used to compute the metrics passed several internal code reviews and the strength of the relationship between our implementation of the happiness metric, havg, and the obesity rate in MSAs is similar to previously published results . Threats to external validity occur when the results of the evaluation cannot be generalized. Although the evaluation was performed for two years of data over 189 MSAs the results cannot be generalized to: (1) other urban areas, (2) during different years or (3) different Twitter data sets.
Furthermore, there are issues that must be addressed with how well a geo-tagged Twitter data set can represent the obesity rate of a population. Only 15% of online adults regularly use Twitter, and 18–29 year-olds and minorities tend to be more highly represented on Twitter than in the general population . Furthermore, on Twitter, 95% of users never geo-tag a single tweet and only ∼ 1% of users geo-tag the majority of the tweets they post. Also, the extent to which the individual ‘tweeter’ is represented in our Twitter corpus is biased. Very passive users (< 50 tweets per year) and very active users (> 1000 tweets per year) geo-tag a smaller percentage of tweets than moderate users (50–1000 tweets per year) . Finally, we collected only a random sample of all tweets during 2012–2013. Ultimately, these limitations mean that the data set which informed our study is a non-uniform subsample of statements made by a non-representative portion of MSA populations.
Even with these limitations and validity threats we have only scratched the surface of what is possible using social media datasets. In particular, Tables 1 and 2 could be very illuminating. One can observe that the top foods and physical activities positively (espresso, yoga) and negatively (french fries, hunting) correlated with obesity rate may have social and cultural underpinnings (i.e. income and education levels).
This would not be unexpected. Recall, previous work showed that the happiness of a MSA, which correlates with our diet and physical activities measures, has statistically significant positive correlations with: (a) the percentage of households with median income levels and (b) the percentage of the individuals living in an area who have obtained a bachelor’s degree. Also, happiness has a statistically significant negative correlation with families living below the poverty line. In future work, we plan to use the census data for 2012 to investigate how different demographics across urban areas are correlated with our measures of diet (Prod%) and physical activity level (PA%).
Additionally, we have not examined whether or not these methods have any predictive power. Future work will look at how observed changes in the measures which can be gleamed from Twitter data, predict changes in the obesity rate of MSAs. We plan to pursue this in future work using content from Twitter and the GHWS data collected in 2014 and 2015.
S1 Dataset. Dataset for havg over all MSAs for 2012 & 2013.
S2 Dataset. Dataset for ndavg over all MSAs for 2012 & 2013.
S3 Dataset. Dataset for FT% over all MSAs for 2012 & 2013.
S4 Dataset. Dataset for Prod% over all MSAs for 2012 & 2013.
S5 Dataset. Dataset for paweighted over all MSAs in 2012 & 2013.
S6 Dataset. Dataset for PA% over all MSAs in 2012 & 2013.
S1 Table. Foods Most Negatively Correlated With Obesity Rate.
S2 Table. Foods Most Positively Correlated With Obesity Rate.
S3 Table. Physical Activities Most Negatively Correlated With Obesity Rate.
S4 Table. Physical Activities Most Positively Correlated With Obesity Rate.
The authors gratefully acknowledge the support from their colleagues within the Virginia, Modeling, Analysis Center at Old Dominion University.
Conceived and designed the experiments: RJG SD JP. Performed the experiments: RJG. Analyzed the data: RJG. Contributed reagents/materials/analysis tools: RJG. Wrote the paper: RJG SD JP.
- 1. Tsai AG, Williamson DF, Glick HA. Direct medical cost of overweight and obesity in the USA: a quantitative systematic review. Obesity Reviews. 2011;12(1):50–61. doi: 10.1111/j.1467-789X.2009.00708.x. pmid:20059703
- 2. Ogden CL, for Health Statistics (US) NC, et al. Prevalence of obesity in the United States, 2009–2010. US Department of Health and Human Services, Centers for Disease Control and Prevention, National Center for Health Statistics; 2012.
- 3. Cawley J, Meyerhoefer C. The medical care costs of obesity: an instrumental variables approach. Journal of health economics. 2012;31(1):219–230. doi: 10.1016/j.jhealeco.2011.10.003. pmid:22094013
- 4. Trogdon JG, Finkelstein EA, Feagan CW, Cohen JW. State-and Payer-Specific Estimates of Annual Medical Expenditures Attributable to Obesity. Obesity. 2012;20(1):214–220. doi: 10.1038/oby.2011.169. pmid:21681222
- 5. Finkelstein EA, Khavjou OA, Thompson H, Trogdon JG, Pan L, Sherry B, et al. Obesity and severe obesity forecasts through 2030. American journal of preventive medicine. 2012;42(6):563–570. doi: 10.1016/j.amepre.2011.10.026. pmid:22608371
- 6. Shah NR, Braverman ER. Measuring adiposity in patients: the utility of body mass index (BMI), percent body fat, and leptin. PLoS One. 2012;7(4):e33308. doi: 10.1371/journal.pone.0033308. pmid:22485140
- 7. Gallup-Healthways Well Being Index 2011–2014;. Accessed: 2014-11-24. http://info.healthways.com/wellbeingindex.
- 8. Huberman BA, Romero DM, Wu F. Social networks that matter: Twitter under the microscope. Available at SSRN 1313405. 2008;.
- 9. Turk AM. Best Practices Guide. Amazon Web Services. 2011;.
- 10. Dodds PS, Danforth CM. Measuring the happiness of large-scale written expression: Songs, blogs, and presidents. Journal of Happiness Studies. 2010;11(4):441–456. doi: 10.1007/s10902-009-9150-9.
- 11. Dodds PS, Harris KD, Kloumann IM, Bliss CA, Danforth CM. Temporal patterns of happiness and information in a global social network: Hedonometrics and Twitter. PloS one. 2011;6(12):e26752. doi: 10.1371/journal.pone.0026752. pmid:22163266
- 12. Mitchell L, Frank MR, Harris KD, Dodds PS, Danforth CM. The geography of happiness: Connecting Twitter sentiment and expression, demographics, and objective characteristics of place. PloS one. 2013;8(5):e64417. doi: 10.1371/journal.pone.0064417. pmid:23734200
- 13. Carwile JL, Michels KB. Urinary bisphenol A and obesity: NHANES 2003–2006. Environmental research. 2011;111(6):825–830. doi: 10.1016/j.envres.2011.05.014. pmid:21676388
- 14. National Center for Health Statistics (US). Plan and operation of the third National Health and Nutrition Examination Survey, 1988–94. 32. Natl Ctr for Health Statistics; 1994.
- 15. Food U. Nutrient Database for Dietary Studies, 11.0. Agricultural Research Service, Food Surveys Research Group, Beltsville, MD. 2014;.
- 16. Fulgoni VL, Keast DR, Drewnowski A. Development and validation of the nutrient-rich foods index: a tool to measure nutritional quality of foods. The Journal of nutrition. 2009;139(8):1549–1554. doi: 10.3945/jn.108.101360. pmid:19549759
- 17. Variyam JN, Blaylock JR, Smallwood DM, Basiotis PP. USDA’s Healthy Eating Index and nutrition information. United States Department of Agriculture, Economic Research Service; 1998.
- 18. Fuhrman J, Sarter B, Glaser D, Acocella S. Changing perceptions of hunger on a high nutrient density diet. Nutrition journal. 2010;9(1):393–399. doi: 10.1186/1475-2891-9-51.
- 19. Drewnowski A. Obesity and the food environment: dietary energy density and diet costs. American journal of preventive medicine. 2004;27(3):154–162. doi: 10.1016/j.amepre.2004.06.011. pmid:15450626
- 20. Guenther PM, Reedy J, Krebs-Smith SM. Development of the healthy eating index-2005. Journal of the American Dietetic Association. 2008;108(11):1896–1901. doi: 10.1016/j.jada.2008.08.016. pmid:18954580
- 21. Wing RR. Physical activity in the treatment of the adulthood overweight and obesity: current evidence and research issues. Medicine and science in sports and exercise. 1999;31(11 Suppl):S547–52. pmid:10593526
- 22. Ross R, Janssen I, Dawson J, Kungl AM, Kuk JL, Wong SL, et al. Exercise-induced reduction in obesity and insulin resistance in women: a randomized controlled trial. Obesity research. 2004;12(5):789–798. doi: 10.1038/oby.2004.95. pmid:15166299
- 23. Weltman A, Weltman JY, Watson Winfield DD, Frick K, Patrie J, Kok P, et al. Effects of continuous versus intermittent exercise, obesity, and gender on growth hormone secretion. The Journal of Clinical Endocrinology & Metabolism. 2008;93(12):4711–4720. doi: 10.1210/jc.2008-0998.
- 24. USD of Health. Physical activity and health: a report of the Surgeon General. DIANE Publishing; 1996.
- 25. Haskell WL, Lee IM, Pate RR, Powell KE, Blair SN, Franklin BA, et al. Physical activity and public health: updated recommendation for adults from the American College of Sports Medicine and the American Heart Association. Circulation. 2007;116(9):1081. doi: 10.1161/CIRCULATIONAHA.107.185649. pmid:17671237
- 26. Arriaza Jones D, Ainsworth BE, Croft JB, Macera CA, Lloyd EE, Yusuf HR. Moderate leisure-time physical activity: who is meeting the public health recommendations? A national cross-sectional study. Archives of Family Medicine. 1998;7(3):285. doi: 10.1001/archfami.7.3.285.
- 27. Weyer C, Linkeschowa R, Heise T, Giesen H, Spraul M. Implications of the traditional and the new ACSM physical activity recommendations on weight reduction in dietary treated obese subjects. International journal of obesity and related metabolic disorders: journal of the International Association for the Study of Obesity. 1998;22(11):1071–1078. doi: 10.1038/sj.ijo.0800728.
- 28. Togo P, Osler M, Sørensen T, Heitmann B. Food intake patterns and body mass index in observational studies. International journal of obesity and related metabolic disorders: journal of the International Association for the Study of Obesity. 2001;25(12):1741–1751. doi: 10.1038/sj.ijo.0801819.
- 29. Newby PK, Muller D, Hallfrisch J, Qiao N, Andres R, Tucker KL. Dietary patterns and changes in body mass index and waist circumference in adults. The American journal of clinical nutrition. 2003;77(6):1417–1425. pmid:12791618
- 30. Mohammad SM, Kiritchenko S. Using hashtags to capture fine emotion categories from tweets. Computational Intelligence. 2014;.
- 31. Mohammad SM, Kiritchenko S, Zhu X. NRC-Canada: Building the state-of-the-art in sentiment analysis of tweets. arXiv preprint arXiv:13086242. 2013;.
- 32. Balabantaray R, Mohammad M, Sharma N. Multi-class twitter emotion classification: A new approach. International Journal of Applied Information Systems. 2012;4(1):48–53. doi: 10.5120/ijais12-450651.
- 33. Bollen J, Mao H, Pepe A. Modeling public mood and emotion: Twitter sentiment and socio-economic phenomena. In: ICWSM; 2011.
- 34. Sintsova V, Musat CC, Pu Faltings P. Fine-grained emotion recognition in olympic tweets based on human computation. In: 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis. EPFL-CONF-197185; 2013.
- 35. O’Connor B, Balasubramanyan R, Routledge BR, Smith NA. From tweets to polls: Linking text sentiment to public opinion time series. ICWSM. 2010;11:122–129.
- 36. Akcora CG, Bayir MA, Demirbas M, Ferhatosmanoglu H. Identifying breakpoints in public opinion. In: Proceedings of the First Workshop on Social Media Analytics. ACM; 2010. p. 62–66.
- 37. Holmberg K, Bowman TD, Haustein S, Peters I. Astrophysicists’ Conversational Connections on Twitter. PloS one. 2014;9(8):e106086. doi: 10.1371/journal.pone.0106086. pmid:25153196
- 38. Gonçalves B, Sánchez D. Crowdsourcing Dialect Characterization through Twitter. PloS one. 2014;9(11):e112074. doi: 10.1371/journal.pone.0112074. pmid:25409174
- 39. Broniatowski DA, Paul MJ, Dredze M. National and local influenza surveillance through twitter: An analysis of the 2012–2013 influenza epidemic. PloS one. 2013;8(12):e83672. doi: 10.1371/journal.pone.0083672. pmid:24349542
- 40. Smith A, Brenner J. Twitter use 2012. Pew Internet & American Life Project. 2012;p. 4.