Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Collective Attention and Stock Prices: Evidence from Google Trends Data on Standard and Poor's 100

Abstract

Today´s connected world allows people to gather information in shorter intervals than ever before, widely monitored by massive online data sources. As a dramatic economic event, recent financial crisis increased public interest for large companies considerably. In this paper, we exploit this change in information gathering behavior by utilizing Google query volumes as a "bad news" indicator for each corporation listed in the Standard and Poor´s 100 index. Our results provide not only an investment strategy that gains particularly in times of financial turmoil and extensive losses by other market participants, but reveal new sectoral patterns between mass online behavior and (bearish) stock market movements. Based on collective attention shifts in search queries for individual companies, hence, these findings can help to identify early warning signs of financial systemic risk. However, our disaggregated data also illustrate the need for further efforts to understand the influence of collective attention shifts on financial behavior in times of regular market activities with less tremendous changes in search volumes.

Introduction

In the past decade connections of people all around the globe have dramatically increased due to technological innovations related to the internet. The ongoing worldwide computerization and integration provides great opportunities for scientists to enhance our understanding of the complex systems in which humans live today. Increasing availability of massive social media data abets efforts trying to explain collective behavior with methods stemming from the natural sciences [17], allowing to transfer knowledge about mechanisms already found in, for instance, complex ecological systems [810].

Given the impact of the recent financial crisis on economic wealth, political decisions and personal fortunes, special interest bestow researchers upon patterns in modern financial markets [1123]. For an approximation of collective financial behavior diverse online sources have already been used that yield complementary results: strong correlations are reported between trading volumes of securities and the frequency brand names appear on Twitter [24], and the number of daily search queries on Yahoo [25], respectively. Editing activity in Wikipedia is linked to critical events in the near future [26], and the text content of daily tweets is analyzed in respect to its mood and found to be predictive of changes in the values of the Dow Jones Industrial Average [27]. Moreover, the Bitcoin crypto-currency and its price dynamics have been shown to exhibit, besides more fundamental and technical drivers [28], strong relationships with the number of new users, Wikipedia page views and search queries provided by Google Trends data [29,30]. The latter, publicly available service seems to be especially fruitful for scientists to comprehend collective financial behavior. Therein, Google provides access to aggregated information on the volume of queries for specific search terms over time. Although mostly capturing the attention of uninformed investors [31], those online search query data have delivered useful information to predict trading volumes [32], to diversify portfolio risks [33], and to quantify trading behavior with given keywords [34] or with semantic topics derived by a latent Dirichlet allocation (“topic modelling") of Wikipedia articles [35].

To explain collective financial actions, it is helpful to remind Herbert Simon´s [36] famous notion that decisions of economic actors start with the gathering of information, yet, that the attention of those actors is rather limited compared to the amount of available information. His observation stems from the 1950s, but seems to be more valid than ever in modern societies. Real-time information supply from countless online sources makes selection processes for investors increasingly important and for scientists an even richer research area. However, a series of problems clouds the possibilities of social media data [3739]. In addition to more general issues of adequate methodological standards for analyzing large social media data discussed by Ruths and Pfeffer [40], we can add, in accordance with Sun and others [41], that the cited studies concerned with collective human behavior and financial markets mainly focus on the prediction of composite indices. In contrast, the influence of collective attention shifts on individual stock price movements is so far a widely unexplored question.

This paper tries to fill this gap by investigating not only aggregate compositions but individual stock prices and their connection to firm-specific volumes of Google search queries. The disaggregated data set allows the examination of the direct relationship between stock prices and company search volumes on different levels of aggregation. By including company-level information about sectoral affiliations we can study how diverse ways of doing business in different branches inspire different information gathering strategies. Moreover, we can also investigate individual company performances. The connection between stock markets and Google search volume becomes evident on all levels during recent financial turmoil; a period, in which returns of a Google Trend based strategy by far exceed average market developments. Our findings are consistent with the intuition that recessions are highly suitable for market predictions made by collective behavior indicators, since such downturns draw mass attention to economic issues in general and therein to the most affected business sectors in particular.

Results

For our analysis we gathered the search volumes provided by Google Trends for all companies listed in the Standard & Poor´s 100 (SPY) in August, 2014 (see Material section for further details and S1 File for the actual data) [42]. The SPY index composition is based on one hundred large and well established “blue chips” and represents ten major economic branches defined by the Global Industry Classification Standard [43]. We used the full corporate name in combination with “company” as search term to avoid semantic ambiguities. The scores produced by Google Trends consist of the volume of each search query relative to the total number of searches carried out at each point in time. The results are reported weekly and the subsequent data set was collected for the period between 4 January, 2004 and 4 January, 2014. For five companies we could not find any Google Trends scores with the described search terms, they are excluded from further analysis (an overview of the missing data is provided in S2 File). The following results are therefore based on the remaining 95 stocks, and a stricter sample (yielding similar results) is discussed in the Material section.

In order to identify shifts in collective information retrieval behavior we calculate relative changes in search volumes for each stock: Δni (tt) = ni (t) − Ni (t– 1,Δt), with, Ni (t– 1,Δt) = (ni (t– 1) + ni (t– 2)+…+ni (t − Δt)) / Δt, where ni is the relative search volume for stock i and Δt is set to three weeks, as done by Preis and colleagues [34]. To analyze the average change for each week over all company search queries we use and for companies belonging to sector S, respectively.

Fig 1 shows for company-based queries of the SPY components in the period 2004–2014 that collective information gathering behavior is especially high in bearish markets during the collapse of Lehman Brothers and, to a lower degree, during the unfolding Euro-crisis in the beginning of 2010. Large attention increases occur only during those slumps, whereas declining numbers of search queries are more likely in rising markets.

thumbnail
Fig 1. Company search volumes and stock market movement.

Closing prices p(t) of the Standard and Poor´s 100 index are plotted as a function of time for the first day of every week in the period between 4 January, 2004 and 4 January, 2014. The subsequent curve is dyed by a color code corresponding to the average change of search volumes for each of the Standard and Poor´s 100 companies. Red sections indicate increasing collective attention in terms of search volumes and green parts illustrate declining number of search queries. The average for each week over all company search queries is calculated by , with Δt being 3 weeks. Please see S3 File for different time windows of Δt.

https://doi.org/10.1371/journal.pone.0135311.g001

Utilizing sectoral information of each stock underlines the negative relationship of searches and markets but differs substantially across industries, as shown in Fig 2. The largest attention shifts take place with regard to financial corporations during the Subprime crisis. Additionally, the company-level data reveal some more specific intersections with real world events. Within the Materials sector, for instance, search queries see large changes in 2004 and 2005 due to regulatory inspections about DuPont´s involvement in the release of perfluorooctanoic acid (PFOA, also known as C8) into drinking water [44].

thumbnail
Fig 2. Sectoral search volumes and stock market movement.

For each sector of the companies contained in the Standard and Poor´s 100 the development of closing prices p(t) of the index is drawn as a function of time for the first day of every week in the period between 4 January, 2004 and 4 January, 2014. Each index curve is dyed by a color code corresponding to the average change of search volumes for the respective companies of each sector S. The average change for each week over all company search queries for sector S is given by , with Δt being 3 weeks.

https://doi.org/10.1371/journal.pone.0135311.g002

Despite these peaks in public attention, Figs 1 and 2 provide only “weak” (i.e. descriptive) evidence. Using a simple regression model with a basic control variable in terms of the S&P 500 volatility index reveals that the statistical relationship between the development of the SPY index and the average change over all (and sectoral) company search queries is indeed negative, as expected, but not significant (Table 1). This non-significance is true on a general level and for each considered sector. Hence, during regular market developments, and with it the better part of the observation period, no big changes exist in the public interest for large corporations; only through extraordinary events people seem to modify their search behavior in this respect.

thumbnail
Table 1. Influence of company search queries on the S&P 100 index development.

https://doi.org/10.1371/journal.pone.0135311.t001

Although there is no “hard” evidence for a general connection between the average (resp. sectoral) shift in search queries and the SPY index development, peak times of financial turmoil are visibly accompanied by increased collective attention. The main part of this paper tries to exploit these collective attention shifts and investigate their relationship with individual stock prices. For this purpose, we implement a hypothetical trading strategy based on company-level Google Trends scores. The intuition behind our strategy is to take an investment position that utilizes collective attention as an indicator for “bad news” and treats an increase in collective search queries as a signal to go “short”, as it was successfully done in [33,34]. Following this approach, we first set all portfolios to an arbitrary value of 1. We implement the proposed strategy by selling a certain stock i at closing price pi(t) on the first trading day of week t and buying it back at closing price pi(t + 1) of the first trading day of the consecutive week, if the relative change in search queries is higher than the weekly average (i.e. Δni(t– 1,Δt)>0). The cumulative return Ri of this “short position” is then changing by log(pi(t)) − log(pi(t+1)) If, in contrast,Δni(t– 1,Δt)<0, the relative change in search volume indicates no “bad news”, but neither an immediate incentive to buy stock i regarding to changes in collective attention. In this case, we rely on the SPY index as a general indicator for collective financial behavior. We identify shifts in SPY prices by calculating relative changes as shown above with Google Trend scores. Thus, the “long position” is taken if the SPY index at the beginning of a trading week is higher than its average over three preceding weeks (i.e. Δnspy(t– 1,Δt)>0). The cumulative return Ri changes then by log(pi(t+1)) − log(pi(t)). If the index is below average at the beginning of a week we are going “short”. As before, the cumulative return Ri of this “short position” is then aggregated by log(pi(t)) − log(pi(t+1)) In summary, our investment strategy utilizes, on the one hand, Google Trends as an indicator to “short-sell” certain stocks with high attention scores, which appear especially around large market movements. On the other hand, our strategy follows the SPY index development and its mapping of general financial behavior.

In Fig 3, the mean performance of the Google Trends strategy for all stocks contained in the Standard & Poor´s 100 is illustrated by a blue line. Each company has thereby the same weight in the constructed portfolio. To get an approximation of the average market evolution we depicted the development of the index itself as a red line. Their difference is dyed blue. As baseline we implement a random strategy, in which investment decisions are generated in an uncorrelated manner by buying and selling the SPY index randomly. This simulation was executed 10,000 times and the reported results are the mean for each point in time of these procedure. The dashed lines indicate the standard deviation of the simulations. Applied for all stocks of the SPY index in the period between 2004 and 2014, the accumulated return of the Google Trends strategy is over 175%. This (hypothetical) profit outperforms the average market development indicated by the SPY index considerably, particularly in times of crisis. During financial turmoil in 2008 and 2009 the Google Trends strategy generated large profits in an extremely negative market environment. The US subprime crisis shifted mass attention—and with it Google search volumes—in the direction of certain companies. Our strategy utilizes this mechanism by “short-selling” such stocks. Otherwise it relies on general market trends, i.e. following general financial behavior. Combining both collective attention indicators, the strategy outperforms the SPY by circa 140%. Note hereby that the SPY index is already an exclusive aggregation of high-performance shares.

thumbnail
Fig 3. Cumulative performance of an investment strategy based on company-level Google Trends data.

Cumulative profits and losses of an investment strategy based on collective attention shifts are plotted over time (blue line). The strategy utilizes company-level Google Trends data as an indicator for “bad news” and the movement of the Standard and Poor´s 100 index as an approximation of general financial behavior. To compare its performance, the cumulative return of a Standard and Poor´s 100 based “buy and hold” strategy is depicted as a red line. The difference between both lines is dyed blue. As a baseline measure we draw the mean of a random investment strategy (green line) and its standard deviation (dashed lines). The proposed investment strategy based on collective attention shifts yields a cumulative profit of circa 175%, compared to around 36% gained by Standard and Poor´s 100 index between 4 January, 2004 and 4 January, 2014. It outperforms the well-established index for the entire observation period.

https://doi.org/10.1371/journal.pone.0135311.g003

Moreover, the company-level data allows us to examine the Google Trends strategy for each of the 10 sectors included in the Standard & Poor´s 100. These results are shown in Fig 4. To compare our hypothetical investment strategy we implement a “buy and hold” strategy based on the stocks of each sector to get an “Industry based strategy” by buying all shares of an industry in the beginning and selling it at the end of each week. The mean cumulative returns of all stocks of one sector are used as sector specific baseline and depicted as red lines. Fig 4 draws a more differentiated picture of the performance of the Google Trends strategy, whereby it outperforms in every sector the industry average. However, for some sectors the Google Trends strategy produces significant higher returns than in others. Clearly, the most profound attention shifts occur in Financials. The strategy increases the value of a hypothetical portfolio that trades financial stocks over 320%. For branches relatively unaffected by the financial crisis (e.g. Health Care or Telecommunication Services) our Google Trends strategy generates lower profits, yet, still considerable higher gains than the average of the subsequent industries. This cross-sectoral pattern is consistent with the above reasoning that collective attention shifts are a major influence factor of stock price formations, since the strategy performs particularly well in times of large market movements and with companies that are in the center of massive public interest in the period between 2004 and 2014.

thumbnail
Fig 4. Sectoral disaggregation of cumulative returns of an investment strategy based on company-level Google Trends data.

For each sector of the Standard and Poor´s 100, cumulative profit and loss of a Google Trends based investment strategy is depicted as a function of time (blue line). The hypothetical strategy aggregates company-level Google Trends data for 10 sectors and utilizes search volumes above a three weekly average as a “bad news” indicator combined with the movement of the Standard and Poor´s 100 index as an approximation of general financial behavior. To compare each sectoral performance we delineate the average of a “buy and hold” strategy based on the stocks for each industry (red line). The difference is dyed blue. A random investment strategy (green line) and its standard deviation (dashed lines) illustrate performance baselines. The highest profits could have been gained with stocks from the Financial sector (circa 325%), the lowest profits would have been realized within Telecommunication Services (around 99%). Nevertheless, for each sector the proposed investment strategy outperforms the average sectoral development between 4 January, 2004 and 4 January, 2014.

https://doi.org/10.1371/journal.pone.0135311.g004

Finally, we can investigate the effect of the proposed Google Trends strategy for individual companies. In Fig 5 we present the performance of the 10 highest weighted constituents of the Standard & Poor´s 100 [44]. Generally, the same pattern of success is visible. For all companies the Google Trend based strategy generated considerable gains, and in all but one cases (Johnson & Johnson) the cumulative profits were higher than those 36% of the index. Similar to the observations on the sectoral level, the results are related with the way of doing business and the industry affiliation, respectively. The companies benefiting most from the investment strategy are banks and other financial institutions (e.g. JP Morgan Chase), since they were in the center of the financial turmoil during the Subprime crisis. In contrast, firms with businesses that are not directly connected with the financial market (e.g. Procter & Gamble), are less eligible for the investment strategy. Again, collective attention shifts as a negative trading signal apply especially well for corporations that were hit hard by recent financial crisis.

thumbnail
Fig 5. Individual cumulative returns of an investment strategy based on company-level Google Trends data.

Cumulative profit and loss for each of the 10 highest weighted corporations in the Standard and Poor´s 100 index, based on a Google Trends investment strategy (blue line). To compare each performance, the average index development (~36% between 2004 and 2014) is depicted as a red line. The difference is dyed blue. A random investment strategy (green line) and its standard deviation (dashed lines) illustrate performance baselines. The results range from very high profits in the case of banks and other financial institutions (e.g. JP Morgan Chase) to rather low gains (e.g. Procter & Gamble), which highlights the eligibility of the proposed investment strategy during market upheavals.

https://doi.org/10.1371/journal.pone.0135311.g005

Discussion

Our results demonstrate that publicly available data on collective information gathering behavior can help investors in times of financial turmoil to hedge portfolio values and even extend their profits. In the period between January 2004 and January 2014 we investigated Google Trends data for companies listed in the Standard & Poor´s 100 index and detected increases in search volumes during the large market slumps in 2008 and, to a lower degree, in 2010. Investors who would have utilized these company-level search queries as an indicator for “bad news” could have gained considerable profits, particularly for sectors that were in the center of the Subprime crisis.

Interpreting Google Trends as an approximation for collective informational needs, the results may represent a general pattern in modern investment decision making regarding the importance of collective attention shifts for stock price formations. In times of large changes and great uncertainty the necessity to collect information about investment assets like stocks seems to be especially high [12,15,17,18,33,34]. In today´s world this means to “google” such companies one is interested in. At the same time, economic issues see a rise in media coverage during such market upheavals [13] and, hence, are more likely incorporated into everyday-life conversation topics. This (and many more conceivable) micro social processes become manifest in increasing search volumes for affected companies and are succeeded by decreasing stock prices. Therefore, changes in collective attention are an important reason for the particular success of our trading strategy during recent crisis and within afflicted sectors and companies; an explanation that follows directly Herbert Simon´s suggestion about attention as a scarce commodity which enhances the gathering of information especially in times of economic uncertainty [36].

However, the pattern applies particularly well if the state of the economy is turning drastically [20]. This means collective attention in terms of Google Trends data serve especially well as an indicator for “bad news” and subsequent falling prices, which can be exploited by going “short” for those assets. In contrast, people seem not inclined to search in great numbers for corporations that are presenting, for instance, respectable annual reports or announce noteworthy sales numbers. Thus, collective attention for large companies may follow the general media logic that “good news is no news”, or in our case more precisely, that “only bad news is relevant news” in order to use it as an investment signal. In this way, the interplay between collective information gathering and financial behavior may even contribute to the overreaction of investors during financial crises and the subsequent magnification of economic slumps.

As a consequence, Google Trends data offer mainly a possibility to investigate collective financial behavior and search queries in negative economic contexts. Instead, regular market environments with steadily rising prices seem less connected to collective attention shifts, as far as current evidence tells us. Nevertheless, we are convinced that in the near future more disaggregated data will be available, so that regular market contexts can also be investigated in greater depths and advance our knowledge about complex social systems.

Material

We retrieved search volume data from Google Trends website (http://www.google.com/trends) on every day between August 23, 2014 and August 29, 2014 for all companies in the Standard and Poor´s 100 index. The index composition is taken as of August 29, 2014 and has not changed since then. Search volume data are restricted to requests of users localized in the USA, the home location of all companies contained in the Standard and Poor´s 100 index. The series are reported weekly on a Sunday to Saturday frequency. Since only five search terms can be looked up simultaneously, we retrieved the data for each company separately. To avoid semantic ambiguities we used the full company name plus the word “company”. The search volumes are normalized by Google with a maximum of 100, serving as a scaling factor for the rest of the series. Due to this normalization Google Trends results are dependent from the time of observation. Therefore, we provide in S1 File the original data used in this article to facilitate the reproduction of our results.

However, there are missing values of different degrees within the dataset. For five companies we could not find any Google Trends scores with the described search terms. They have no values for all 522 weeks and are therefore not included in the analysis. A detailed distribution of the missing values can be found in S2 File. For 71 companies we have “complete” data in the sense that not more than four weeks are missing for the whole observation period. Calculating the results only for those companies, Fig 6 shows a very similar development and increases the performance of the proposed investment strategy slightly. Thus, the results for the entire dataset represent a lower boundary for possible profits, i.e. that stricter data cleaning can even improve profits.

thumbnail
Fig 6. Cumulative returns of an investment strategy based on company-level Google Trends data for 71 corporations of the Standard and Poor´s 100.

Cumulative profit and loss for all corporations in the dataset that have no more than 4 missing values over the whole observation period between 4 January, 2004 and 4 January, 2014. Profits and losses are based on a Google Trends investment strategy (blue line). The development of the curve is very similar to the results displayed in Fig 3, which uses all available data. However, using only the 71 corporations that meet the stricter condition increases the overall profits.

https://doi.org/10.1371/journal.pone.0135311.g006

To further support the reliability of the proposed trading strategy, we calculated another possible approach: We reversed the whole trading strategy by buying (instead of selling) if search volumes are above average. If this case is not applying, we are trusting the SPY index if its below its mean (i.e. Δnspy(t– 1,Δt)<0) and are selling otherwise, which is for both elements of the strategy the exact opposite as initially suggested. The subsequent results are shown in S4 File and represent the inversion of the curve presented in Fig 3. Thus, if an investor would have applied this (hypothetical) trading strategy and used Google Trends scores as a buying signal, she would have generated a huge loss.

Furthermore, the stock prices were downloaded from Yahoo Finance (http://finance.yahoo.com) on a daily basis for every trading day between January 4, 2004 and January 4, 2014. Closing prices pi(t) at the first trading day of a week are matched to Google Trends data of the previous week, when information would be available for hypothetical investors. In a regular trading week, for instance, closing price pi(t) on Monday would correspond to Google search queries for i in the previous week, which is available on the preceding Saturday.

All steps of the analysis described above were conducted with the open-source language Python. All Figures were drawn with the R-Package “ggplot2”.

Supporting Information

S1 File. Google Trends data for Standard and Poor´s 100 companies.

https://doi.org/10.1371/journal.pone.0135311.s001

(ZIP)

S2 File. Missing values of the Google Trends data for Standard and Poor´s 100 companies.

https://doi.org/10.1371/journal.pone.0135311.s002

(CSV)

S3 File. Different time windows for the proposed Google Trends investment strategy.

https://doi.org/10.1371/journal.pone.0135311.s003

(TIF)

S4 File. Cumulative performance of a strategy that reverses the proposed investment mechanism.

The reversed trading strategy is buying (instead of selling) if search volumes are above average and trusting the SPY index if its below its mean and selling otherwise, which is for both elements the opposite as suggested in Fig 3.

https://doi.org/10.1371/journal.pone.0135311.s004

(TIF)

Acknowledgments

I would like to gratefully acknowledge many discussions with Jan Riebling, Alexandra Rudolph and Christina Heller.

Author Contributions

Conceived and designed the experiments: RHH. Performed the experiments: RHH. Analyzed the data: RHH. Contributed reagents/materials/analysis tools: RHH. Wrote the paper: RHH.

References

  1. 1. Goel S, Hofman JM, Lahaie S, Pennock DM, Watts DJ. Predicting consumer behavior with Web search. Proc Natl Acad Sci. 2010;107: 17486–17490. pmid:20876140
  2. 2. Preis T, Moat HS, Stanley HE, Bishop SR. Quantifying the Advantage of Looking Forward. Sci Rep. 2012;2.
  3. 3. Golder SA, Macy MW. Diurnal and Seasonal Mood Vary with Work, Sleep, and Daylength Across Diverse Cultures. Science. 2011;333: 1878–1881. pmid:21960633
  4. 4. Buldyrev SV, Parshani R, Paul G, Stanley HE, Havlin S. Catastrophic cascade of failures in interdependent networks. Nature. 2010;464: 1025–1028. pmid:20393559
  5. 5. Lazer D, Pentland A, Adamic L, Aral S, Barabási A-L, Brewer D, et al. Computational Social Science. Science. 2009;323: 721–723. pmid:19197046
  6. 6. Schweitzer F, Fagiolo G, Sornette D, Vega-Redondo F, Vespignani A, White DR. Economic networks: The new challenges. Science. 2009;325: 422. pmid:19628858
  7. 7. King G. Ensuring the Data-Rich Future of the Social Sciences. Science. 2011;331: 719–721. pmid:21311013
  8. 8. May RM, Levin SA, Sugihara G. Complex systems: Ecology for bankers. Nature. 2008;451: 893–895. pmid:18288170
  9. 9. May RM. Networks and webs in ecosystems and financial systems. Philos Transact A Math Phys Eng Sci. 2013;371: 20120376.
  10. 10. Heiberger RH. Stock network stability in times of crisis. Phys Stat Mech Its Appl. 2014;393: 376–381.
  11. 11. Gabaix X, Gopikrishnan P, Plerou V, Stanley HE. A theory of power-law distributions in financial market fluctuations. Nature. 2003;423: 267–270. pmid:12748636
  12. 12. Andersen JV, Nowak A, Rotundo G, Parrott L, Martinez S. “Price-Quakes” Shaking the World’s Stock Exchanges. PLoS ONE. 2011;6: e26472. pmid:22073168
  13. 13. Piškorec M, Antulov-Fantulin N, Novak PK, Mozetič I, Grčar M, Vodenska I, et al. Cohesiveness in Financial News and its Relation to Market Volatility. Sci Rep. 2014;4.
  14. 14. Lillo F, Farmer JD, Mantegna RN. Econophysics: Master curve for price-impact function. Nature. 2003;421: 129–130. pmid:12520292
  15. 15. Casarin R, Squazzoni F. Being on the Field When the Game Is Still Under Way. The Financial Press and Stock Markets in Times of Crisis. PLoS ONE. 2013;8: e67721. pmid:23861791
  16. 16. Preis T, Schneider JJ, Stanley HE. Switching processes in financial markets. Proc Natl Acad Sci. 2011;108: 7674–7678. pmid:21521789
  17. 17. Preis T, Kenett DY, Stanley HE, Helbing D, Ben-Jacob E. Quantifying the Behavior of Stock Correlations Under Market Stress. Sci Rep. 2012;2.
  18. 18. Quax R, Kandhai D, Sloot PMA. Information dissipation as an early-warning signal for the Lehman Brothers collapse in financial time series. Sci Rep. 2013;3.
  19. 19. Gontis V, Kononovicius A. Consentaneous Agent-Based and Stochastic Model of the Financial Markets. PLoS ONE. 2014;9.
  20. 20. Münnix MC, Shimada T, Schäfer R, Leyvraz F, Seligman TH, Guhr T, et al. Identifying States of a Financial Market. Sci Rep. 2012;2.
  21. 21. Ivanov PC, Yuen A, Perakakis P. Impact of Stock Market Structure on Intertrade Time and Price Dynamics. PLoS ONE. 2014;9: e92885. pmid:24699376
  22. 22. Sun X-Q, Shen H-W, Cheng X-Q, Wang Z-Y. Degree-Strength Correlation Reveals Anomalous Trading Behavior. PLoS ONE. 2012;7: e45598. pmid:23082114
  23. 23. Alanyali M, Moat HS, Preis T. Quantifying the Relationship Between Financial News and the Stock Market. Sci Rep. 2013;3.
  24. 24. Mathiesen J, Angheluta L, Ahlgren PTH, Jensen MH. Excitable human dynamics driven by extrinsic events in massive communities. Proc Natl Acad Sci. 2013;110: 17259–17262. pmid:24101482
  25. 25. Bordino I, Battiston S, Caldarelli G, Cristelli M, Ukkonen A, Weber I. Web Search Queries Can Predict Stock Market Volumes. PLoS ONE. 2012;7: e40014. pmid:22829871
  26. 26. Moat HS, Curme C, Avakian A, Kenett DY, Stanley HE, Preis T. Quantifying Wikipedia Usage Patterns Before Stock Market Moves. Sci Rep. 2013;3.
  27. 27. Bollen J, Mao H, Zeng X. Twitter mood predicts the stock market. J Comput Sci. 2011;2: 1–8.
  28. 28. Kristoufek L. What Are the Main Drivers of the Bitcoin Price? Evidence from Wavelet Coherence Analysis. PLoS ONE. 2015;10: e0123923. pmid:25874694
  29. 29. Garcia D, Tessone CJ, Mavrodiev P, Perony N. The digital traces of bubbles: feedback cycles between socio-economic signals in the Bitcoin economy. J R Soc Interface. 2014;11: 20140623. pmid:25100315
  30. 30. Kristoufek L. BitCoin meets Google Trends and Wikipedia: Quantifying the relationship between phenomena of the Internet era. Sci Rep. 2013;3.
  31. 31. Bank M, Larch M, Peter G. Google search volume and its influence on liquidity and returns of German stocks. Financ Mark Portf Manag. 2011;25: 239–264.
  32. 32. Preis T, Reith D, Stanley HE. Complex dynamics of our economic life on different scales: insights from search engine query data. Philos Trans R Soc Lond Math Phys Eng Sci. 2010;368: 5707–5719.
  33. 33. Kristoufek L. Can Google Trends search queries contribute to risk diversification? Sci Rep. 2013;3.
  34. 34. Preis T, Moat HS, Stanley HE. Quantifying Trading Behavior in Financial Markets Using Google Trends. Sci Rep. 2013;3.
  35. 35. Curme C, Preis T, Stanley HE, Moat HS. Quantifying the semantics of search behavior before stock market moves. Proc Natl Acad Sci. 2014;111: 11600–11605. pmid:25071193
  36. 36. Simon HA. A Behavioral Model of Rational Choice. Q J Econ. 1955;69: 99–118.
  37. 37. Lazer D, Kennedy R, King G, Vespignani A. The Parable of Google Flu: Traps in Big Data Analysis. Science. 2014;343: 1203–1205. pmid:24626916
  38. 38. Tufekci Z. Big Questions for Social Media Big Data: Representativeness, Validity and Other Methodological Pitfalls. Eighth International AAAI Conference on Weblogs and Social Media. 2014. Available: http://www.aaai.org/ocs/index.php/ICWSM/ICWSM14/paper/view/8062
  39. 39. Cohen R, Ruths D. Classifying Political Orientation on Twitter: It’s Not Easy! Seventh International AAAI Conference on Weblogs and Social Media. Palo Alto, CA; 2013. pp. 91–99. Available: http://www.aaai.org/ocs/index.php/ICWSM/ICWSM13/paper/view/6128
  40. 40. Ruths D, Pfeffer J. Social media for large studies of behavior. Science. 2014;346: 1063–1064. pmid:25430759
  41. 41. Sun X-Q, Shen H-W, Cheng X-Q. Trading Network Predicts Stock Price. Sci Rep. 2014;4.
  42. 42. S&P 100—S&P Dow Jones Indices [Internet]. [cited 16 Feb 2015]. Available: http://us.spindices.com/indices/equity/sp-100
  43. 43. GICS Structure [Internet]. [cited 29 May 2013]. Available: http://www.msci.com/products/indices/sector/gics/gics_structure.html
  44. 44. Cortese A. DuPont, Now in the Frying Pan. The New York Times. 8 Aug 2004. Available: http://www.nytimes.com/2004/08/08/business/dupont-now-in-the-frying-pan.html. Accessed 5 Mar 2015.