Identifying Key Drivers of Return Reversal with Dynamical Bayesian Factor Graph

Shuai Zhao; Yunhai Tong; Zitian Wang; Shaohua Tan

doi:10.1371/journal.pone.0167050

Abstract

In the stock market, return reversal occurs when investors sell overbought stocks and buy oversold stocks, reversing the stocks’ price trends. In this paper, we develop a new method to identify key drivers of return reversal by incorporating a comprehensive set of factors derived from different economic theories into one unified dynamical Bayesian factor graph. We then use the model to depict factor relationships and their dynamics, from which we make some interesting discoveries about the mechanism behind return reversals. Through extensive experiments on the US stock market, we conclude that among the various factors, the liquidity factors consistently emerge as key drivers of return reversal, which is in support of the theory of liquidity effect. Specifically, we find that stocks with high turnover rates or high Amihud illiquidity measures have a greater probability of experiencing return reversals. Apart from the consistent drivers, we find other drivers of return reversal that generally change from year to year, and they serve as important characteristics for evaluating the trends of stock returns. Besides, we also identify some seldom discussed yet enlightening inter-factor relationships, one of which shows that stocks in Finance and Insurance industry are more likely to have high Amihud illiquidity measures in comparison with those in other industries. These conclusions are robust for return reversals under different thresholds.

Citation: Zhao S, Tong Y, Wang Z, Tan S (2016) Identifying Key Drivers of Return Reversal with Dynamical Bayesian Factor Graph. PLoS ONE 11(11): e0167050. https://doi.org/10.1371/journal.pone.0167050

Editor: Wei-Xing Zhou, East China University of Science and Technology, CHINA

Received: May 24, 2016; Accepted: November 8, 2016; Published: November 28, 2016

Copyright: © 2016 Zhao et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the paper and its Supporting Information files.

Funding: This work is supported by the Consulting Project of the Chinese Academy of Engineering numbered as 2016-XY-12, and the Project of Beijing Technology Plan numbered as Z161100001116060.

Competing interests: The authors have declared that no ompeting interests exist.

Introduction

In the stock market, return reversal occurs when investors sell overbought stocks and buy oversold stocks, reversing the stocks’ price trends. Apart from the extensive research on market price analysis [1–7], return reversal has also attracted lots of attention. Bondt and Thaler [8, 9] initially documented long-term return reversal in the US stock market, indicating that stocks performed well in the past three to five years tend to have low future returns. Lehmann [10] and Jegadeesh [11] first recorded that short-term return reversal, specifically, weekly and monthly reversals, also exist among US stocks. Not only in US, researchers have found the phenomenon worldwide [12–16]. For instance, Chang et al. [16] provided empirical evidence on return reversal in the Japanese stock market.

A critical question about return reversal is: what are the driving forces? There are many theories proposed, however, no unanimous conclusions have been reached. Among the theories, the following ones received more supports compared with the others: 1) overreaction hypothesis. Overreaction hypothesis [8, 17–22] states that investors tend to overreact to recent economic developments, leading to extreme movements in stock prices in the short-run and price movements in the opposite direction in the long-run. 2) liquidity effect. Liquidity effect [23–27] states that the non-informational trading demanding for immediacy would drive the market prices to deviate from the fundamentals. When the non-informational trading is absorbed by liquidity suppliers, return reversal happens. 3) January effect. According to Tax-loss selling hypothesis [28–30], January effect [27, 29, 31, 32] states that investors tend to intensively sell stocks in December, especially those performing badly, to offset taxable realized capital gains, which induces a decline in stock prices. In January, the selling pressure disappears and investors tend to repurchase stocks, driving stock prices up. Other theories including inventory imbalances [27], lead-lag effect between large and small firms [33], and microstructure features of the stock market such as bid-ask spread [15, 34–36] have also been proposed as drivers of return reversal.

Although existing studies have generated some enlightening conclusions, they face some typical problems. First, they usually analyzed a limited number of driving factors corresponding to just one or two economic theories [24, 25, 37], leaving other driving factors deliberately neglected. Second, they generally assumed that analyzed factors are linearly correlated, which is a strong abstraction of the non-linear and time-varying characteristics of real-world market [38–43]. Third, to the best of our knowledge, they have seldom mentioned the relationships among the driving factors, which are likewise important for a more complete understanding of the mechanism in deep.

Motivated by the analysis, in this paper, we develop a new method to identify key drivers of return reversal, which is based on dynamical Bayesian factor graph. The basic structure of dynamical Bayesian factor graph is a Bayesian factor graph [44], which is a subclass of Bayesian network [45]. As a systematic non-linear and data-driven causal discovery method, Bayesian factor graph can deal with multiple factors in a unified framework, and is quite effective in uncovering factor relationships.

Fig 1 serves as an example of Bayesian factor graph, which is associated with a small factor set F = {Industry, Capitalization, Volume, Return} and depicts the influential factors of the return of a stock. Industry stands for the industry category the stock is in. Capitalization, Volume and Return represent the market capitalization, trading volume and return of the stock respectively.

Download:

Fig 1. An example of Bayesian factor graph.

The edge from Industry to Capitalization makes the former a parent of the latter, and the latter a child of the former. The rule is also applied to all the other edges.

https://doi.org/10.1371/journal.pone.0167050.g001

In Fig 1, the nodes correspond to the factors, while the edges indicate influential relationships among the factors. According to Fig 1, Return is causally dependent on Capitalization and Volume. Industry is relevant to Capitalization. In addition, Industry and Return are conditionally independent given Capitalization.

To adapt Bayesian factor graph to complex time-varying systems, we compute a time series of emergent factor graphs over a specific period of time, and term them as dynamical Bayesian factor graph [39], with which the evolution of factor relationships can be captured.

As related studies [22, 24], we merely focus on stocks with large capitalization to avoid microstructure concerns of the stock market. Additionally, we analyze the stock returns that are adjusted by the Fama-French three-factor model [46] instead of raw returns, since Blitz et al. [22] found that return reversals cleared of influences from the Fama-French factors have greater chances to make profits.

In specific, our work consists of four steps: 1) a comprehensive set of potential driving factors of return reversal corresponding to various economic theories are constructed, and the class factor indicating whether return reversal arises is defined. 2) dynamical Bayesian factor graph is applied to generate a global picture depicting factor relationships as well as their evolution. 3) from each member graph of the dynamical structure, the factors in the Markov blanket [47] of the class factor are identified as key drivers of return reversal, as conditioned on those factors the class factor is independent of any other factors. Moreover, through the mechanism of inference, the marginal probabilities and conditional probability table (CPT) of the class factor given the key drivers are calculated. By comparing the marginal and conditional probabilities, how the key drivers function in specific can be revealed. 4) some representative potential driving factors that consistently influence key reversal drivers are selected, and their influential effects on the reversal drivers are systematically studied.

The contribution of our work lies in three aspects. First, we employ dynamical Bayesian factor graph to capture the relationships as well as the dynamics of the relationships among various factors related to return reversal, from which we get a clearer picture of the mutual interaction of the financial factors. Second, based on dynamical Bayesian factor graph, we propose some definitions that are quite useful in analyzing key drivers of return reversal. These definitions aim at dealing with three problems, including evaluating the credibility of generated graphs, locating key drivers of a specific factor and capturing key driver dynamics, and figuring out how the key drivers function from a quantitative perspective. Third, through extensive experiments on the US market, we conclude that the liquidity factors consistently emerge as key drivers of return reversal, which is in line with the theory of liquidity effect. Specifically, we find that stocks with high turnover rates or high Amihud illiquidity measures [48] have a greater probability of experiencing return reversals. Apart from the consistent drivers, we find other drivers of return reversal that generally change from year to year, and they serve as important characteristics to evaluate the trends of stock returns. Besides, we also learn that among all the potential driving factors, those corresponding to overreaction hypothesis and stock industry impose most consistent influential effects on the liquidity factors. One of the influential effects shows that stocks in Finance and Insurance industry are more likely to have high Amihud illiquidity measures compared with those in other industries. These conclusions are robust for return reversals under different thresholds and provide insights in estimating future return reversals.

Note that there is a kind of investment strategy called contrarian strategy, the essence of which is to selectively buy stocks performing badly and sell stocks performing well with the purpose of taking advantage of return reversals to make profits [11, 13, 21, 22, 27, 49, 50]. By accurately identifying key drivers of return reversal in advance, our research can hopefully help design more profitable contrarian strategies.

The rest of the paper is organized as follows. The next section describes dynamical Bayesian factor graph and the way of identifying key drivers of return reversal in detail, followed by two sections introducing our research data and presenting empirical results based on stocks in the US market, respectively. The last section concludes the paper.

1 Methods

In this section, we first describe the concepts of Bayesian factor graph, then introduce dynamical Bayesian factor graph and related definitions, and finally give the way of identifying key drivers of return reversal.

1.1 Bayesian factor graph

Bayesian factor graph, as a subclass of Bayesian network, is a probabilistic qualitative model which is designed to uncover relationships among a set of financial factors. The edges in a factor graph reflect inter-factor relationships, including causality, relevance and conditional independence.

Described formally, a Bayesian factor graph is a directed acyclic graph in which the joint distribution of d factors, X = {X₁,X₂,…,X_d}, is encoded. Nodes of the graph stand for the factors, while the graph structure reveals the qualitative information among the factors. Two unconnected nodes imply that corresponding factors are conditionally independent. If there exists an edge from node X_i to X_j, then X_i is called a parent of X_j, and X_j is a child of X_i. The conditional probabilities of the nodes given their parents are the quantitative information of the graph.

With the parent node set of X_i denoted as , the whole factor graph can be represented as G = {G₁, G₂, …, G_d}, and the joint probability of X given G can be represented as: (1)

The structure of G is initially unknown, and needs to be learned based on the observations of X. In this paper, we employ incremental association Markov blanket (IAMB) algorithm [51], which is one of the optimized derivatives of inductive causation algorithm [52], to learn graph structures. The learning procedure generally comprises three steps [53]:

First, the undirected structure of a factor graph is learned by detecting the Markov blankets of factors.

During the detection process, we use the mutual information of two factors, computed by Eq (2), as the measure of factor association. Besides, we adopt Chi-square test to judge whether two factors are conditionally independent. The significance level of the independence test is set to 5%. We use the P-value of the test to measure the strength of the corresponding edge. The smaller the P-value is, the stronger the strength is. (2)

In Eq (2), p_X,Y,Z(x, y, z), p_X,Z(x, z), p_Y,Z(y, z) stand for the joint probability distribution functions of factor X, Y and Z, X and Z, Y and Z, respectively. p_Z(z) is the marginal probability density function of Z. It is always true that MI(X; Y|Z) ≥ 0 and MI(X; Y|Z) = MI(Y; X|Z).

Second, set the directions of edges which are part of a V − structure in light of the d-separation criterion [54]. There are three kinds of basic structures in a acyclic directed graph: X → Y → Z, X ← Y → Z, and X → Y ← Z. The former two represent the same constraints of conditional independence that X and Z are conditionally independent given Y. According to Pearl and Verma [52], they are equivalent and indistinguishable based on observational data. The latter one, which is referred to as a V-structure, indicates that X is marginally independent of Z. Therefore, it is not equivalent to the former two structures and can be uniquely identified.

Third, add directions to other edges to meet the acyclic restriction of a factor graph.

1.2 Dynamical Bayesian factor graph and related definitions

Given a data set D = {x_τ: T₀ < T₁ ≤ τ ≤ T₂} for a factor set X = {X₁,X₂,…,X_d} that covers the period of [T₀, T₂], in order to model the dynamics of the relationships among the d factors over [T₁, T₂], one Bayesian factor graph G_t can be built for each t (T₁ ≤ t ≤ T₂) based on the subset of data D_t = {x_τ: t₀ ≤ τ < t}. In this way, a series of discrete time t_i, (i = 1, 2, …, n, T₁ ≤ t_i ≤ T₂) will lead to a time series of Bayesian factor graphs , and we term these dynamical Bayesian factor graph, which is seen as a dynamical model for X during [T₁, T₂].

Suppose we have dynamical Bayesian factor graph , where N is the node set of G_t, and E_i is the edge set of member graph . To measure how credible and G_t are, we propose Definition 1.

Definition 1. The credibility of and G_t are calculated by Eqs (3) and (4) respectively, (3) (4) where e represents an edge in E_i, |E_i| equals the number of edges and P_s(e) stands for the P-value of the independence test for the two factors that e links. As the equations show, is the average of the P-values for E_i, while Cred(G_t) is the average of . From an overall perspective, and G_t with smaller credibility values reflect more credible factor relationships.

Given the Markov blanket of a node in a factor graph, which includes its parents, its children and the children’s other parents, the node is independent of any other nodes. In other words, the Markov blanket provides all the needed information to forecast the behavior of the node. In light of the fact, we introduce Definition 2.

Definition 2. The nodes in the Markov blanket of node m (m ∈ N) in terms of are termed key drivers of m for t_i, and denoted by . Given two member graphs , , the similarity between and is computed by Eq (5), which is similar to the Jaccard-index. (5)

In Eq (5), represents the number of mutual nodes of and .

The similarity measure, which varies between 0 and 1, reflects the dynamics of the key drivers. In specific, similarity = 0 indicates that two key drivers sets are totally different, while similarity = 1 means that the two sets are identical. The larger similarity is, the more similar the two sets are.

For a node in a factor graph, its marginal probabilities indicate the chances that the node takes possible values given no information, while its CPT conditioned on its key drivers indicates the chances that the node takes those values given the information provided by the key drivers. Suppose we concern what values of the key drivers would more likely lead to the node taking a specific value. We can first locate the marginal and conditional probabilities corresponding with the value, and then look up the conditional probabilities that are higher than the marginal probability. In this way, the desired key drivers values can be intuitively revealed. Based on the discussion, we propose the following definition.

Definition 3. Focusing on the situation that node m = m_d (m ∈ N), where m_d is a specific value that m can take, the marginal probability of m = m_d for t_i, denoted by , is termed free probability, and the conditional probabilities of m = m_d for t_i given , denoted by , are termed driving probabilities. Let (6) (7) where represents the set of values that can take and k is one of . and are termed desired probability and desired values, respectively.

The desired probability indicates the greatest probability that m = m_d for t_i given the key drivers, while the desired values show corresponding key drivers values. After calculating the various probabilities in Definition 3 for t_i (1 ≤ i ≤ n), we obtain a time series of free probabilities , and a series of desired probabilities . Through comparing the statistics such as the mean values of the two series, we can get an overall picture about how the key drivers of m function.

1.3 Dynamical Bayesian factor graph in identifying key drivers of return reversal

We first introduce the definition of return reversal, and then describe the way of identifying key drivers of return reversal using dynamical Bayesian factor graph.

1.3.1 The definition of return reversal.

Denoting the price, raw return, trading volume and publicly held shares of stock i at month t as P_it, r_it, V_it and respectively, and calculating r_it through Eq (8), we define the class factor IsReversal_it, which equals 1 or -1, to indicate whether return reversal would happen to i at t + 1 or not. For clarity, an instance where IsReversal_it equals 1 is termed a reversal instance hereafter. (8)

To determine the value of IsReversal_it, we first adjust the raw return through the Fama-French three-factor model in Eq (9): (9) where RM_t and RF_t are market return and risk-free return at t respectively, and (RM_t − RF_t) is the market risk factor. SMB_t and HML_t are the company size factor and value factor respectively, with SMB standing for “Small (market capitalization) Minus Big” and HML for “High (book-to-market ratio) Minus Low”. The factor values can be found at Kenneth French’s web site (http://www.mba.tuck.dartmouth.edu/pages/faculty/ken.french/data_library.html). α_i, β_r, β_s and β_h are parameters to be estimated, and rs_it is the adjusted return of i at t. As related research [22], we train the model using the data in [t − 36, t − 1] to compute rs_it.

IsReversal_it will equal 1 as long as the following two restrictions are satisfied, or equal -1 otherwise. (10) (11)

The first restriction guarantees that the return of i will reverse in subsequent month, while the second one rules out stochastic return reversals with threshold r_th.

1.3.2 The way of identifying key drivers of return reversal.

We go through four steps to identify key drivers of return reversal.

First, we build dynamical Bayesian factor graph involving IsReversal_it and potential driving factors of return reversal (introduced in the next section) over a specific period of time.

Second, we evaluate the credibility of the dynamical structure and each of its member graphs.

Third, we identify the key drivers of IsReversal_it from the member graphs, and calculate similarity between the key drivers sets for different time to capture the dynamics of the reversal drivers.

Fourth, in terms of each member graph, we compute the free probability, driving probabilities and desired probability of IsReversal_it = 1, and identify corresponding desired values. After getting the time series of the free and desired probabilities, we compare the mean values of the two series to figure out how the key drivers affect return reversal from an overall perspective.

2 Data

Table 1 shows the set of potential driving factors of return reversal as well as their corresponding economic theories, types of values and short descriptions. We give some detailed explanations of the factors in S1 File.

Download:

Table 1. The potential driving factors of return reversal.

https://doi.org/10.1371/journal.pone.0167050.t001

Each year from 2000 to 2010, we select 100 stocks with largest capitalization from NYSE and AMEX markets, and collect the data of IsReversal_it and the potential driving factors for each of the stocks. Specifically, we collect earning announcement dates of firms from the Institutional Brokers’ Estimate System, and the rest of the data from Center for Research in Security Prices. Totally, we build a panel data set containing around 13,000 instances.

Table 2 lists yearly maximum and minimum capitalization and corresponding stocks’ tickers in our data set. The unit of the capitalization is one billion dollars.

Download:

Table 2. Some descriptive information of our data set.

https://doi.org/10.1371/journal.pone.0167050.t002

To show how the quantify of reversal instances changes with r_th, we choose five values for the threshold: 2%, 4%, 6%, 8%, and 10%, and plot corresponding proportions of reversal instances in Fig 2.

Download:

Fig 2. The proportions of reversal instances under different values of r_th.

As a r_th that is too small will lead to many random fluctuations among reversal instances, the value of the threshold should be carefully chosen. The figure can help decide what values of r_th should be experimented with.

https://doi.org/10.1371/journal.pone.0167050.g002

Fig 2 shows that when r_th ≤ 4%, over 40 percent of the instances are considered as reversal instances. When experimenting with those instances, we find no key drivers of IsReversal_it for several years, suggesting that there are many random fluctuations among the reversal instances. As a result, we focus on return reversals with r_th = 6%, 8% and 10% in succeeding experiments.

3 Results

In this section, we build dynamical Bayesian factor graph of IsReversal_it and the potential driving factors to identify key drivers of return reversal.

Setting T₀, T₁ and T₂ to be the year of 2000, 2005 and 2011 respectively, we use the period [T₀, T₁) as the in-sample period, and [T₁, T₂] as the out-of-sample period. As related research [44, 55], we discretize all the continuous factors into two levels, namely high and low levels (represented by 1 and 0), through equal-frequency method.

For each year T in [T₁, T₂], a Bayesian factor graph is learned on the basis of data in [T − 5, T − 1]. During the learning process, we ban the edges pointing from IsReversal_it to the other factors, in that IsReversal_it indicates the future trends of stock returns, and thus would impossibly influence the values of the other factors at current month.

We conduct experiments using R language, and use the package ‘bnlearn’ [53] to generate network structures.

To begin with, we analyze the results of experiments for return reversal with r_th = 6%, and then check whether obtained conclusions are robust for return reversals under different thresholds.

3.1 The results of experiments with r_th = 6%

Fig 3 shows the generated dynamical Bayesian factor graph , where the factor circled in red ellipse is the class factor and the weights on the edges are the P-values of corresponding independence tests. To make the graphs neater, we ignore the subscripts of all the factors. Besides, we replace the weights that are below 1e-15 (indicating edges with quite strong strength) by 0 as such weights would take too much space and intersect, making the graphs hard to read.

Download:

Fig 3. Dynamical Bayesian factor graph with r_th = 6%: G^r6.

The arrows indicate the time order of the member graphs. As a whole, the dynamical structure captures the mutual influential relationships as well as the dynamics of these relationships among all the factors.

https://doi.org/10.1371/journal.pone.0167050.g003

In Table 3, we list the credibility values of G^r6 and all of its member graphs, which show that the average P-values of the graph edges are obviously below the significance level of 5%.

Download:

Table 3. The credibility values of G^r6 and its member graphs.

https://doi.org/10.1371/journal.pone.0167050.t003

Next, we summarize the conclusions drawn from Fig 3, in regards of identifying key drivers of return reversal, applying inference on the generated graphs and learning relationships among the potential driving factors.

3.1.1 Identifying key drivers of return reversal.

First, we identify the key drivers of IsReversal from each member graph in Fig 3, and tag them in red font. In order to capture the dynamics of the key drivers, we calculate the similarity between the key drivers sets for two adjacent years, denoted by (t_i = 2006, 2007, …, 2011). Fig 4 shows the similarity measures for the out-of-sample years.

Download:

Fig 4. The similarity measures for the out-of-sample years.

https://doi.org/10.1371/journal.pone.0167050.g004

From Figs 3 and 4 we learn that: 1) there are always mutual factors among the key drivers sets for two adjacent years, as all the similarity measures are above 0. As a matter of fact, the liquidity factors consistently appear as the key drivers, which supports the theory of liquidity effect. 2) all the similarity measures, except that for year 2011, are below 1, reflecting that apart from the consistent drivers, other drivers of return reversal generally change from year to year. It is worth noting that for year 2007, the factors related to three economic theories including liquidity, industry and market effects appear as the key drivers. 3) the factors corresponding to theories including January and consistency effects are irrelevant to return reversal during the out-of-sample period.

3.1.2 Applying inference on the generated graphs.

In terms of each member graph in Fig 3, the local probability distribution of each factor and the joint probability distribution of all the factors can be determined. Based on these probability distributions, we can perform various probabilistic inferences. The basic idea of inference is to monitor posterior distributions of factors of interest given some evidence factors [44] whose states are already known. There are some commonly used exact inference algorithms for small-scale Bayesian networks, such as variable elimination and junction tree algorithm [56].

With the inference mechanism, we calculate the free and driving probabilities of IsReversal = 1 (as we care more the situation that return reversal happens) by setting evidences to null and all possible states of the key drivers respectively. Via comparing the free and driving probabilities, how the key drivers function in specific can be intuitively revealed. We adopt the junction tree algorithm implemented in the R package ‘gRain’ [57] to achieve the inferences.

Regarding year 2011 as an example, Table 4 shows the free probability and driving probabilities , where . The row in bold indicates the desired value and desired probability .

Download:

Table 4. The probabilities of IsReversal = 1 for year 2011.

https://doi.org/10.1371/journal.pone.0167050.t004

Table 4 suggests that stocks with the following features: high turnover rates, high Amihud illiquidity measures and prices that are not near to 5-year high (HighNear = 0), experience return reversals with the highest probability, around 9% higher than the free probability.

For the other out-of-sample years, we summarize the free and desired probabilities of IsReversal = 1 given corresponding desired values in Table 5.

Download:

Table 5. The probabilities of IsReversal = 1 for the other out-of-sample years.

https://doi.org/10.1371/journal.pone.0167050.t005

Table 5 reveals the following conclusions. First, for each year, high turnover rate or high Amihud illiquidity measure signals return reversals with a greater probability. Second, for year 2007, Efficiency = 1 shows as part of the desired values, confirming that return reversal is more likely to happen when the market is comparatively inefficient. Third, on average, the desired probabilities are around 5% higher than the free probabilities, indicating that the key drivers we identified have great potential in predicting return reversals.

Assuming that investors have access to all the values of the key drivers, they can make some effective evaluations about the trends of stock returns based on the results in Tables 4 and 5. However, it is common that sometimes the key drivers values cannot be fully gained. In this situation, investors can still make some estimations about IsReversal using the inference mechanism. Let’s take the following scenario for example.

At month t in year 2007, investors possess evidence e_i regarding stock i shown in Table 6, and want to obtain some clues on whether i will experience return reversal at t + 1. Note that e_i does not cover Illiquidity which is in . However, it contains factors including HighNear and VolGrowth which can directly or indirectly affect Illiquidity.

Download:

Table 6. Evidence e_i.

https://doi.org/10.1371/journal.pone.0167050.t006

To get the clues, we adopt the junction tree algorithm to calculate the free probability and the posterior probability . Table 7 gives the results, showing that the posterior probability is 5% higher than the free probability. In other words, conditioned on e_i stock i is more likely to experience return reversal compared with the case that no evidence is provided.

Download:

Table 7. The inference results for year 2007.

https://doi.org/10.1371/journal.pone.0167050.t007

As Fig 3 displays, factor relationships might change over time. As a result, a same piece of evidence might lead to different inference results in different years. For instance, if we change the year in the scenario to 2011, the inference results would become those in Table 8, which imply that e_i leads to i going through return reversal with probability 5% lower than the free probability.

Download:

Table 8. The inference results for year 2011.

https://doi.org/10.1371/journal.pone.0167050.t008

Although the inference results are not accurate enough for prediction purpose, they can provide investors with insights on what their current knowledge indicates with respect to future return reversals, and hence help them design investment strategies.

3.1.3 Learning relationships among the potential driving factors.

Besides key drivers of return reversal, relationships among the potential driving factors can also be conveniently studied based on Fig 3.

As we have learned that the liquidity factors consistently perform as key drivers of return reversal, subsequently we make some explorations about how the other potential driving factors influence the liquidity factors.

First, we select the representative factors that interact closely with both Turnover and Illiquidity by summarizing the intersection of the key drivers (except IsReversal) of the both factors over the out-of-sample years. Fig 5 displays the frequencies of the intersection factors.

Download:

Fig 5. The frequencies of the intersection key drivers of Illiquidity and Turnover.

The higher the frequency is, the more consistent the interplay between the corresponding factor and the liquidity factors is.

https://doi.org/10.1371/journal.pone.0167050.g005

Fig 5 shows that HighNear, LowNear as well as Industry which respectively represent the theories of overreaction hypothesis and industry effect impose influential effects on the liquidity factors over the whole out-of-sample period, and thus are chosen as the representative factors.

Second, we use the junction tree algorithm to calculate the free and conditional probabilities of Turnover and Illiquidity, conditioned respectively on (HighNear, LowNear) and Industry. As high turnover rate or high Amihud illiquidity measure is more likely to signal return reversal, we merely focus on the free and highest conditional probabilities of Turnover = 1 and Illiquidity = 1. Tables 9 and 10 show the results given (HighNear, LowNear), while Tables 11 and 12 list those given Industry.

Download:

Table 9. The free and highest conditional probabilities of Turnover = 1 given (HighNear, LowNear).

https://doi.org/10.1371/journal.pone.0167050.t009

Download:

Table 10. The free and highest conditional probabilities of Illiquidity = 1 given (HighNear, LowNear).

https://doi.org/10.1371/journal.pone.0167050.t010

Download:

Table 11. The free and highest conditional probabilities of Turnover = 1 given Industry.

https://doi.org/10.1371/journal.pone.0167050.t011

Download:

Table 12. The free and highest conditional probabilities of Illiquidity = 1 given Industry.

https://doi.org/10.1371/journal.pone.0167050.t012

From Tables 9 and 10, we learn that: (1) stocks whose prices are neither near to 5-year high nor near to 5-year low tend to have high turnover rates with greater probabilities, averagely around 12% higher than the free probabilities. (2) for all the out-of-sample years except 2008 and 2009, stocks whose prices are near to 5-year low are more likely to have high Amihud illiquidity measures, with probabilities around 6% higher than the free probabilities on average.

Tables 11 and 12 suggest that: (1) stocks in other industries (not Manufacturing and Finance and Insurance) tend to have high turnover rates with greater probabilities, around 13% higher than the free probabilities averagely. (2) stocks in Finance and Insurance industry are more likely to have high Amihud illiquidity measures, with probabilities around 10% higher than the free probabilities on average.

To the best of our knowledge, few efforts have been made to systematically study the inter-factor relationships above. These relationships can be quite helpful for investors in estimating future return reversals when the values of the liquidity factors are unavailable.

3.2 Robustness check

To check the robustness of the following two main conclusions in the previous subsection, in this subsection, we experiment on return reversals with r_th = 8%, 10% respectively.

Conclusion 1. The liquidity factors consistently serve as key drivers of return reversal, and other drivers generally change from year to year.

Conclusion 2. Stocks with high turnover rates or high Amihud illiquidity measures experience return reversals with a greater probability.

S1 and S2 Figs give corresponding dynamical Bayesian factor graphs, while S1 and S2 Tables show the credibility of the dynamical structures and their member graphs. From the tables we learn that the factor relationships reflected by the graphs are quite credible. The key drivers of IsReversal are tagged in red font. S3 and S4 Figs display the similarity measures between the key drivers sets for two adjacent years.

S1, S2, S3 and S4 Figs intuitively confirm that Conclusion 1 still stands for return reversals with r_th = 8%, 10%. Moreover, it is worth noting that the key drivers under a lower r_th generally constitute part, or all of the key drivers under a higher r_th. For example, for year 2009, the key drivers only include Turnover when r_th = 6%, which extend to Turnover and Industry when r_th = 8%, and to Turnover, Industry and Illiquidity when r_th = 10%.

S3 and S4 Tables show the free and desired probabilities of IsReversal = 1 given corresponding desired values for the out-of-sample years. It is obvious that both tables support Conclusion 2. What’s more, the tables also indicate that with r_th raised, the mean values of both the free and desired probabilities turn lower, which appears reasonable, whereas the differences between them go up. This phenomenon implies that for return reversals under more restrictive conditions, the influential effects imposed by the key drivers become more obvious.

As to relationships among the potential driving factors, many such relationships in the previous subsection, such as those between Industry and the liquidity factors, vary little, in that all the factor values except those of IsReversal used in this subsection remain unchanged. For clarity, we do not give detailed analyses.

In summary, we conclude that the main conclusions for return reversal with r_th = 6% are robust for return reversals with r_th = 8%, 10%.

4 Conclusions

In this paper, we employ dynamical Bayesian factor graph to identify key drivers of return reversal. Our empirical results demonstrate that liquidity factors consistently emerge as key drivers of return reversal, supporting the theory of liquidity effect. In specific, stocks with high turnover rates or high Amihud illiquidity measures experience return reversals with a greater probability. Apart from liquidity factors, other drivers of return reversal generally change from year to year. We also learn that factors corresponding to overreaction hypothesis and stock industry impose most consistent influential effects on liquidity factors. One of the influential effects shows that stocks in Finance and Insurance industry are more likely to have high Amihud illiquidity measures compared with those in other industries. These conclusions are robust for return reversals under different thresholds. Our work reveals the drivers of return reversal from a more comprehensive perspective and sheds light on designing more profitable contrarian investment strategies.

Although our research has generated some enlightening results, there is room for improvements. Currently, we only study stocks in the US market based on discretized factors. In the coming research, we would study stocks in international markets with continuous factors analyzed.

Supporting Information

S1 Fig. Dynamical Bayesian factor graph with r_th = 8%: G^r8.

https://doi.org/10.1371/journal.pone.0167050.s001

(EPS)

S2 Fig. Dynamical Bayesian factor graph with r_th = 10%: G^r10.

https://doi.org/10.1371/journal.pone.0167050.s002

(EPS)

S3 Fig. The similarity measures for the out-of-sample years with r_th = 8%.

https://doi.org/10.1371/journal.pone.0167050.s003

(EPS)

S4 Fig. The similarity measures for the out-of-sample years with r_th = 10%.

https://doi.org/10.1371/journal.pone.0167050.s004

(EPS)

S1 Table. The credibility of G^r8 and its member graphs.

https://doi.org/10.1371/journal.pone.0167050.s005

(PDF)

S2 Table. The credibility of G^r10 and its member graphs.

https://doi.org/10.1371/journal.pone.0167050.s006

(PDF)

S3 Table. The probabilities of IsReversal = 1 with r_th = 8% for the out-of-sample years.

https://doi.org/10.1371/journal.pone.0167050.s007

(PDF)

S4 Table. The probabilities of IsReversal = 1 with r_th = 10% for the out-of-sample years.

https://doi.org/10.1371/journal.pone.0167050.s008

(PDF)

S1 File. Some detailed explanations of the potential driving factors of return reversal.

https://doi.org/10.1371/journal.pone.0167050.s009

(PDF)

S1 Data. The capitalization of the stocks in our data set.

https://doi.org/10.1371/journal.pone.0167050.s010

(XLSX)

S2 Data. The panel data set that is used in our experiments.

https://doi.org/10.1371/journal.pone.0167050.s011

(XLSX)

Acknowledgments

We thank Yinan Yu from the University of Hong Kong for helping us collect the research data.

Author Contributions

Conceptualization: SZ YT.
Formal analysis: SZ ZW.
Funding acquisition: YT ST.
Methodology: SZ ZW ST.
Writing – review & editing: SZ YT ZW ST.

References

1. Heiberger RH. Collective Attention and Stock Prices: Evidence from Google Trends Data on Standard and Poor’s 100. PloS one. 2015;10(8):e0135311. pmid:26258498
- View Article
- PubMed/NCBI
- Google Scholar
2. Ticknor JL. A Bayesian regularized artificial neural network for stock market forecasting. Expert Systems with Applications. 2013;40(14):5501–5506.
- View Article
- Google Scholar
3. de Oliveira FA, Nobre CN, Zárate LE. Applying artificial Neural Networks to prediction of stock price and improvement of the directional prediction index-Case study of PETR4, Petrobras, Brazil. Expert Systems with Applications. 2013;40(18):7596–7606.
- View Article
- Google Scholar
4. Patel J, Shah S, Thakkar P, Kotecha K. Predicting Stock Market Index using Fusion of Machine Learning Techniques. Expert Systems with Applications. 2014;42(4):2162–2172.
- View Article
- Google Scholar
5. Patel J, Shah S, Thakkar P, Kotecha K. Predicting stock and stock price index movement using Trend Deterministic Data Preparation and machine learning techniques. Expert Systems with Applications. 2015;42(1):259–268.
- View Article
- Google Scholar
6. Rather AM, Agarwal A, Sastry V. Recurrent neural network and a hybrid model for prediction of stock returns. Expert Systems with Applications. 2014;42(6):3234–3241.
- View Article
- Google Scholar
7. Żbikowski K. Using volume weighted support vector machines with walk forward testing and feature selection for the purpose of creating stock trading strategy. Expert Systems with Applications. 2014;42(4):1797–1805.
- View Article
- Google Scholar
8. Bondt WF, Thaler RH. Does the stock market overreact? The Journal of finance. 1985;40(3):793–805.
- View Article
- Google Scholar
9. Bondt WF, Thaler RH. Further evidence on investor overreaction and stock market seasonality. The Journal of finance. 1987;42(3):557–581.
- View Article
- Google Scholar
10. Lehmann BN. Fads, martingales, and market efficiency. The Quarterly Journal of Economics. 1990;105(1):1–28.
- View Article
- Google Scholar
11. Jegadeesh N. Evidence of predictable behavior of security returns. The Journal of finance. 1990;45(3):881–898.
- View Article
- Google Scholar
12. Patel J. Profit from Prices: All You Need for Profit in Stock Trading Is Stock Prices. 1st ed. CreateSpace Independent Publishing Platform; 2007.
13. Tang GY, Zhang H. Stock return reversal and continuance anomaly: new evidence from Hong Kong. Applied Economics. 2014;46(12):1335–1349.
- View Article
- Google Scholar
14. Subrahmanyam A. Distinguishing Between Rationales for Short-Horizon Predictability of Stock Returns. Financial Review. 2005;40(1):11–35.
- View Article
- Google Scholar
15. Cooper M. Filter rules based on price and volume in individual security overreaction. Review of Financial Studies. 1999;12(4):901–935.
- View Article
- Google Scholar
16. Chang RP, McLeavey D, Rhee SG. Short-term abnormal returns of the contrarian strategy in the Japanese stock market. Journal of Business Finance & Accounting. 1995;22(7):1035–1048.
- View Article
- Google Scholar
17. Hong H, Stein JC. A unified theory of underreaction, momentum trading, and overreaction in asset markets. The Journal of finance. 1999;54(6):2143–2184.
- View Article
- Google Scholar
18. Bondt WF, Thaler RH. Do security analysts overreact? The American Economic Review. 1990;80(2):52–57.
- View Article
- Google Scholar
19. Da Z, Liu Q, Schaumburg E. Decomposing short-term return reversal. Staff Report, Federal Reserve Bank of New York; 2011.
20. Hirschey M. Extreme return reversal in the stock market. The Journal of Portfolio Management. 2003;29(3):78–90.
- View Article
- Google Scholar
21. De Groot W, Huij J, Zhou W. Another look at trading costs and short-term reversal profits. Journal of Banking & Finance. 2012;36(2):371–382.
- View Article
- Google Scholar
22. Blitz D, Huij J, Lansdorp S, Verbeek M. Short-term residual reversal. Journal of Financial Markets. 2013;16(3):477–504.
- View Article
- Google Scholar
23. Da Z, Liu Q, Schaumburg E. A closer look at the short-term return reversal. Management Science. 2013;60(3):658–674.
- View Article
- Google Scholar
24. Hameed A, Huang J, Mian GM. Industries and stock return reversals. Journal of Financial and Quantitative Analysis. 2015;50(1–2):89–117.
- View Article
- Google Scholar
25. Avramov D, Chordia T, Goyal A. Liquidity and autocorrelations in individual stock returns. The Journal of finance. 2006;61(5):2365–2394.
- View Article
- Google Scholar
26. Nagel S. Evaporating liquidity. Review of Financial Studies. 2012;25(7):2005–2039.
- View Article
- Google Scholar
27. Jegadeesh N, Titman S. Short-horizon return reversals and the bid-ask spread. Journal of Financial Intermediation. 1995;4(2):116–132.
- View Article
- Google Scholar
28. George TJ, HWANG CY. Long-Term Return Reversals: Overreaction or Taxes? The Journal of finance. 2007;62(6):2865–2896.
- View Article
- Google Scholar
29. D’Mello R, Ferris SP, Hwang CY. The tax-loss selling hypothesis, market liquidity, and price pressure around the turn-of-the-year. Journal of Financial Markets. 2003;6(1):73–98.
- View Article
- Google Scholar
30. Givoly D, Ovadia A. Year-End Tax-Induced Sales and Stock Market Seasonality. The Journal of finance. 1983;38(1):171–185.
- View Article
- Google Scholar
31. Grinblatt M, Keloharju M. Tax-loss trading and wash sales. Journal of Financial Economics. 2004;71(1):51–76.
- View Article
- Google Scholar
32. Roll R. A simple implicit measure of the effective bid-ask spread in an efficient market. The Journal of finance. 1984;39(4):1127–1139.
- View Article
- Google Scholar
33. Boudoukh J, Richardson MP, Whitelaw R. A tale of three schools: Insights on autocorrelations of short-horizon stock returns. Review of Financial Studies. 1994;7(3):539–573.
- View Article
- Google Scholar
34. Kaul G, Nimalendran M. Price reversals: Bid-ask errors or market overreaction? Journal of Financial Economics. 1990;28(1):67–93.
- View Article
- Google Scholar
35. Gutierrez RC, Kelley EK. The Long-Lasting Momentum in Weekly Returns. The Journal of finance. 2008;63(1):415–447.
- View Article
- Google Scholar
36. Conrad J, Gultekin MN, Kaul G. Profitability of short-term contrarian strategies: Implications for market efficiency. Journal of Business & Economic Statistics. 1997;15(3):379–386.
- View Article
- Google Scholar
37. Otchere I, Chan J. Short-term overreaction in the Hong Kong stock market: can a contrarian trading strategy beat the market? The Journal of Behavioral Finance. 2003;4(3):157–171.
- View Article
- Google Scholar
38. Gupta A, Dhingra B. Stock market prediction using hidden Markov models. In: Engineering and Systems (SCES), 2012 Students Conference on. IEEE; 2012. p. 1–4.
39. Wang L, Wang Z, Zhao S, Tan S. Stock market trend prediction using dynamical Bayesian factor graph. Expert Systems with Applications. 2015;42(15):6267–6275.
- View Article
- Google Scholar
40. Qian XY, Song FT, Zhou WX. Nonlinear behaviour of the Chinese SSEC index with a unit root: Evidence from threshold unit root tests. Physica a-Statistical Mechanics and Its Applications. 2008;387(2–3):503–510.
- View Article
- Google Scholar
41. Qian XY, Liu YM, Jiang ZQ, Podobnik B, Zhou WX, Stanley HE. Detrended partial cross-correlation analysis of two nonstationary time series influenced by common external forces. Physical Review E. 2015;91(6):062816. pmid:26172763
- View Article
- PubMed/NCBI
- Google Scholar
42. Choudhry T, Osoble BN. Nonlinear Interdependence between the US and Emerging Markets’ Industrial Stock Sectors. International Journal of Finance & Economics. 2015;20(1):61–79.
- View Article
- Google Scholar
43. Chu XJ, Wu CF, Qiu JY. A nonlinear Granger causality test between stock returns and investor sentiment for Chinese stock market: a wavelet-based approach. Applied Economics. 2016;48(21):1915–1924.
- View Article
- Google Scholar
44. Wang Z, Wang L, Tan S. Emergent and spontaneous computation of factor relationships from a large factor set. Journal of Economic Dynamics and Control. 2008;32(12):3939–3959.
- View Article
- Google Scholar
45. Pearl J. Causality: models, reasoning, and inference. Econometric Theory. 2003;19:675–685.
- View Article
- Google Scholar
46. Fama EF, French KR. Common risk factors in the returns on stocks and bonds. Journal of Financial Economics. 1993;33(1):3–56.
- View Article
- Google Scholar
47. Pearl J. Probabilistic reasoning in intelligent systems: networks of plausible inference. Morgan Kaufmann; 2014.
48. Amihud Y. Illiquidity and stock returns: cross-section and time-series effects. Journal of Financial Markets. 2002;5(1):31–56.
- View Article
- Google Scholar
49. Shi HL, Jiang ZQ, Zhou WX. Profitability of contrarian strategies in the Chinese stock market. PloS one. 2015;10(9):e0137892. pmid:26368537
- View Article
- PubMed/NCBI
- Google Scholar
50. Malin M, Bornholt G. Long-term return reversal: Evidence from international market indices. Journal of International Financial Markets, Institutions and Money. 2013;25:1–17.
- View Article
- Google Scholar
51. Tsamardinos I, Aliferis CF, Statnikov AR, Statnikov E. Algorithms for Large Scale Markov Blanket Discovery. In: FLAIRS Conference. vol. 2; 2003. p. 376–380.
52. Verma T, Pearl J. Equivalence and Synthesis of Causal Models. Uncertainty in Articial Intelligence. 1991;6:255–268.
- View Article
- Google Scholar
53. Scutari M. Learning Bayesian networks with the bnlearn R package. Journal of Statistical Software. 2010;35(3):1–22.
- View Article
- Google Scholar
54. Geiger D, Verma T, Pearl J. Identifying Independence in Bayesian Networks. Networks. 1990;20(5):507–534.
- View Article
- Google Scholar
55. Zhang X, Hu Y, Xie K, Wang S, Ngai E, Liu M. A causal feature selection algorithm for stock prediction modeling. Neurocomputing. 2014;142(1):48–59.
- View Article
- Google Scholar
56. Lauritzen SL, Spiegelhalter DJ. Local computations with probabilities on graphical structures and their application to expert systems. Journal of the Royal Statistical Society Series B (Methodological). 1988; p. 157–224.
- View Article
- Google Scholar
57. Højsgaard S. Bayesian networks in R with the gRain package. Rel téc Aalborg University. 2015; p. 1–15.

[ref1] 1. Heiberger RH. Collective Attention and Stock Prices: Evidence from Google Trends Data on Standard and Poor’s 100. PloS one. 2015;10(8):e0135311. pmid:26258498
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Ticknor JL. A Bayesian regularized artificial neural network for stock market forecasting. Expert Systems with Applications. 2013;40(14):5501–5506.
View Article
Google Scholar

[6] View Article

[7] Google Scholar

[ref3] 3. de Oliveira FA, Nobre CN, Zárate LE. Applying artificial Neural Networks to prediction of stock price and improvement of the directional prediction index-Case study of PETR4, Petrobras, Brazil. Expert Systems with Applications. 2013;40(18):7596–7606.
View Article
Google Scholar

[9] View Article

[10] Google Scholar

[ref4] 4. Patel J, Shah S, Thakkar P, Kotecha K. Predicting Stock Market Index using Fusion of Machine Learning Techniques. Expert Systems with Applications. 2014;42(4):2162–2172.
View Article
Google Scholar

[12] View Article

[13] Google Scholar

[ref5] 5. Patel J, Shah S, Thakkar P, Kotecha K. Predicting stock and stock price index movement using Trend Deterministic Data Preparation and machine learning techniques. Expert Systems with Applications. 2015;42(1):259–268.
View Article
Google Scholar

[15] View Article

[16] Google Scholar

[ref6] 6. Rather AM, Agarwal A, Sastry V. Recurrent neural network and a hybrid model for prediction of stock returns. Expert Systems with Applications. 2014;42(6):3234–3241.
View Article
Google Scholar

[18] View Article

[19] Google Scholar

[ref7] 7. Żbikowski K. Using volume weighted support vector machines with walk forward testing and feature selection for the purpose of creating stock trading strategy. Expert Systems with Applications. 2014;42(4):1797–1805.
View Article
Google Scholar

[21] View Article

[22] Google Scholar

[ref8] 8. Bondt WF, Thaler RH. Does the stock market overreact? The Journal of finance. 1985;40(3):793–805.
View Article
Google Scholar

[24] View Article

[25] Google Scholar

[ref9] 9. Bondt WF, Thaler RH. Further evidence on investor overreaction and stock market seasonality. The Journal of finance. 1987;42(3):557–581.
View Article
Google Scholar

[27] View Article

[28] Google Scholar

[ref10] 10. Lehmann BN. Fads, martingales, and market efficiency. The Quarterly Journal of Economics. 1990;105(1):1–28.
View Article
Google Scholar

[30] View Article

[31] Google Scholar

[ref11] 11. Jegadeesh N. Evidence of predictable behavior of security returns. The Journal of finance. 1990;45(3):881–898.
View Article
Google Scholar

[33] View Article

[34] Google Scholar

[ref12] 12. Patel J. Profit from Prices: All You Need for Profit in Stock Trading Is Stock Prices. 1st ed. CreateSpace Independent Publishing Platform; 2007.

[ref13] 13. Tang GY, Zhang H. Stock return reversal and continuance anomaly: new evidence from Hong Kong. Applied Economics. 2014;46(12):1335–1349.
View Article
Google Scholar

[37] View Article

[38] Google Scholar

[ref14] 14. Subrahmanyam A. Distinguishing Between Rationales for Short-Horizon Predictability of Stock Returns. Financial Review. 2005;40(1):11–35.
View Article
Google Scholar

[40] View Article

[41] Google Scholar

[ref15] 15. Cooper M. Filter rules based on price and volume in individual security overreaction. Review of Financial Studies. 1999;12(4):901–935.
View Article
Google Scholar

[43] View Article

[44] Google Scholar

[ref16] 16. Chang RP, McLeavey D, Rhee SG. Short-term abnormal returns of the contrarian strategy in the Japanese stock market. Journal of Business Finance & Accounting. 1995;22(7):1035–1048.
View Article
Google Scholar

[46] View Article

[47] Google Scholar

[ref17] 17. Hong H, Stein JC. A unified theory of underreaction, momentum trading, and overreaction in asset markets. The Journal of finance. 1999;54(6):2143–2184.
View Article
Google Scholar

[49] View Article

[50] Google Scholar

[ref18] 18. Bondt WF, Thaler RH. Do security analysts overreact? The American Economic Review. 1990;80(2):52–57.
View Article
Google Scholar

[52] View Article

[53] Google Scholar

[ref19] 19. Da Z, Liu Q, Schaumburg E. Decomposing short-term return reversal. Staff Report, Federal Reserve Bank of New York; 2011.

[ref20] 20. Hirschey M. Extreme return reversal in the stock market. The Journal of Portfolio Management. 2003;29(3):78–90.
View Article
Google Scholar

[56] View Article

[57] Google Scholar

[ref21] 21. De Groot W, Huij J, Zhou W. Another look at trading costs and short-term reversal profits. Journal of Banking & Finance. 2012;36(2):371–382.
View Article
Google Scholar

[59] View Article

[60] Google Scholar

[ref22] 22. Blitz D, Huij J, Lansdorp S, Verbeek M. Short-term residual reversal. Journal of Financial Markets. 2013;16(3):477–504.
View Article
Google Scholar

[62] View Article

[63] Google Scholar

[ref23] 23. Da Z, Liu Q, Schaumburg E. A closer look at the short-term return reversal. Management Science. 2013;60(3):658–674.
View Article
Google Scholar

[65] View Article

[66] Google Scholar

[ref24] 24. Hameed A, Huang J, Mian GM. Industries and stock return reversals. Journal of Financial and Quantitative Analysis. 2015;50(1–2):89–117.
View Article
Google Scholar

[68] View Article

[69] Google Scholar

[ref25] 25. Avramov D, Chordia T, Goyal A. Liquidity and autocorrelations in individual stock returns. The Journal of finance. 2006;61(5):2365–2394.
View Article
Google Scholar

[71] View Article

[72] Google Scholar

[ref26] 26. Nagel S. Evaporating liquidity. Review of Financial Studies. 2012;25(7):2005–2039.
View Article
Google Scholar

[74] View Article

[75] Google Scholar

[ref27] 27. Jegadeesh N, Titman S. Short-horizon return reversals and the bid-ask spread. Journal of Financial Intermediation. 1995;4(2):116–132.
View Article
Google Scholar

[77] View Article

[78] Google Scholar

[ref28] 28. George TJ, HWANG CY. Long-Term Return Reversals: Overreaction or Taxes? The Journal of finance. 2007;62(6):2865–2896.
View Article
Google Scholar

[80] View Article

[81] Google Scholar

[ref29] 29. D’Mello R, Ferris SP, Hwang CY. The tax-loss selling hypothesis, market liquidity, and price pressure around the turn-of-the-year. Journal of Financial Markets. 2003;6(1):73–98.
View Article
Google Scholar

[83] View Article

[84] Google Scholar

[ref30] 30. Givoly D, Ovadia A. Year-End Tax-Induced Sales and Stock Market Seasonality. The Journal of finance. 1983;38(1):171–185.
View Article
Google Scholar

[86] View Article

[87] Google Scholar

[ref31] 31. Grinblatt M, Keloharju M. Tax-loss trading and wash sales. Journal of Financial Economics. 2004;71(1):51–76.
View Article
Google Scholar

[89] View Article

[90] Google Scholar

[ref32] 32. Roll R. A simple implicit measure of the effective bid-ask spread in an efficient market. The Journal of finance. 1984;39(4):1127–1139.
View Article
Google Scholar

[92] View Article

[93] Google Scholar

[ref33] 33. Boudoukh J, Richardson MP, Whitelaw R. A tale of three schools: Insights on autocorrelations of short-horizon stock returns. Review of Financial Studies. 1994;7(3):539–573.
View Article
Google Scholar

[95] View Article

[96] Google Scholar

[ref34] 34. Kaul G, Nimalendran M. Price reversals: Bid-ask errors or market overreaction? Journal of Financial Economics. 1990;28(1):67–93.
View Article
Google Scholar

[98] View Article

[99] Google Scholar

[ref35] 35. Gutierrez RC, Kelley EK. The Long-Lasting Momentum in Weekly Returns. The Journal of finance. 2008;63(1):415–447.
View Article
Google Scholar

[101] View Article

[102] Google Scholar

[ref36] 36. Conrad J, Gultekin MN, Kaul G. Profitability of short-term contrarian strategies: Implications for market efficiency. Journal of Business & Economic Statistics. 1997;15(3):379–386.
View Article
Google Scholar

[104] View Article

[105] Google Scholar

[ref37] 37. Otchere I, Chan J. Short-term overreaction in the Hong Kong stock market: can a contrarian trading strategy beat the market? The Journal of Behavioral Finance. 2003;4(3):157–171.
View Article
Google Scholar

[107] View Article

[108] Google Scholar

[ref38] 38. Gupta A, Dhingra B. Stock market prediction using hidden Markov models. In: Engineering and Systems (SCES), 2012 Students Conference on. IEEE; 2012. p. 1–4.

[ref39] 39. Wang L, Wang Z, Zhao S, Tan S. Stock market trend prediction using dynamical Bayesian factor graph. Expert Systems with Applications. 2015;42(15):6267–6275.
View Article
Google Scholar

[111] View Article

[112] Google Scholar

[ref40] 40. Qian XY, Song FT, Zhou WX. Nonlinear behaviour of the Chinese SSEC index with a unit root: Evidence from threshold unit root tests. Physica a-Statistical Mechanics and Its Applications. 2008;387(2–3):503–510.
View Article
Google Scholar

[114] View Article

[115] Google Scholar

[ref41] 41. Qian XY, Liu YM, Jiang ZQ, Podobnik B, Zhou WX, Stanley HE. Detrended partial cross-correlation analysis of two nonstationary time series influenced by common external forces. Physical Review E. 2015;91(6):062816. pmid:26172763
View Article
PubMed/NCBI
Google Scholar

[117] View Article

[118] PubMed/NCBI

[119] Google Scholar

[ref42] 42. Choudhry T, Osoble BN. Nonlinear Interdependence between the US and Emerging Markets’ Industrial Stock Sectors. International Journal of Finance & Economics. 2015;20(1):61–79.
View Article
Google Scholar

[121] View Article

[122] Google Scholar

[ref43] 43. Chu XJ, Wu CF, Qiu JY. A nonlinear Granger causality test between stock returns and investor sentiment for Chinese stock market: a wavelet-based approach. Applied Economics. 2016;48(21):1915–1924.
View Article
Google Scholar

[124] View Article

[125] Google Scholar

[ref44] 44. Wang Z, Wang L, Tan S. Emergent and spontaneous computation of factor relationships from a large factor set. Journal of Economic Dynamics and Control. 2008;32(12):3939–3959.
View Article
Google Scholar

[127] View Article

[128] Google Scholar

[ref45] 45. Pearl J. Causality: models, reasoning, and inference. Econometric Theory. 2003;19:675–685.
View Article
Google Scholar

[130] View Article

[131] Google Scholar

[ref46] 46. Fama EF, French KR. Common risk factors in the returns on stocks and bonds. Journal of Financial Economics. 1993;33(1):3–56.
View Article
Google Scholar

[133] View Article

[134] Google Scholar

[ref47] 47. Pearl J. Probabilistic reasoning in intelligent systems: networks of plausible inference. Morgan Kaufmann; 2014.

[ref48] 48. Amihud Y. Illiquidity and stock returns: cross-section and time-series effects. Journal of Financial Markets. 2002;5(1):31–56.
View Article
Google Scholar

[137] View Article

[138] Google Scholar

[ref49] 49. Shi HL, Jiang ZQ, Zhou WX. Profitability of contrarian strategies in the Chinese stock market. PloS one. 2015;10(9):e0137892. pmid:26368537
View Article
PubMed/NCBI
Google Scholar

[140] View Article

[141] PubMed/NCBI

[142] Google Scholar

[ref50] 50. Malin M, Bornholt G. Long-term return reversal: Evidence from international market indices. Journal of International Financial Markets, Institutions and Money. 2013;25:1–17.
View Article
Google Scholar

[144] View Article

[145] Google Scholar

[ref51] 51. Tsamardinos I, Aliferis CF, Statnikov AR, Statnikov E. Algorithms for Large Scale Markov Blanket Discovery. In: FLAIRS Conference. vol. 2; 2003. p. 376–380.

[ref52] 52. Verma T, Pearl J. Equivalence and Synthesis of Causal Models. Uncertainty in Articial Intelligence. 1991;6:255–268.
View Article
Google Scholar

[148] View Article

[149] Google Scholar

[ref53] 53. Scutari M. Learning Bayesian networks with the bnlearn R package. Journal of Statistical Software. 2010;35(3):1–22.
View Article
Google Scholar

[151] View Article

[152] Google Scholar

[ref54] 54. Geiger D, Verma T, Pearl J. Identifying Independence in Bayesian Networks. Networks. 1990;20(5):507–534.
View Article
Google Scholar

[154] View Article

[155] Google Scholar

[ref55] 55. Zhang X, Hu Y, Xie K, Wang S, Ngai E, Liu M. A causal feature selection algorithm for stock prediction modeling. Neurocomputing. 2014;142(1):48–59.
View Article
Google Scholar

[157] View Article

[158] Google Scholar

[ref56] 56. Lauritzen SL, Spiegelhalter DJ. Local computations with probabilities on graphical structures and their application to expert systems. Journal of the Royal Statistical Society Series B (Methodological). 1988; p. 157–224.
View Article
Google Scholar

[160] View Article

[161] Google Scholar

[ref57] 57. Højsgaard S. Bayesian networks in R with the gRain package. Rel téc Aalborg University. 2015; p. 1–15.

Figures

Abstract

Introduction

1 Methods

1.1 Bayesian factor graph

1.2 Dynamical Bayesian factor graph and related definitions

1.3 Dynamical Bayesian factor graph in identifying key drivers of return reversal

1.3.1 The definition of return reversal.

1.3.2 The way of identifying key drivers of return reversal.

2 Data

3 Results

3.1 The results of experiments with rth = 6%

3.1.1 Identifying key drivers of return reversal.

3.1.2 Applying inference on the generated graphs.

3.1.3 Learning relationships among the potential driving factors.

3.2 Robustness check

4 Conclusions

Supporting Information

S1 Fig. Dynamical Bayesian factor graph with rth = 8%: Gr8.

S2 Fig. Dynamical Bayesian factor graph with rth = 10%: Gr10.

S3 Fig. The similarity measures for the out-of-sample years with rth = 8%.

S4 Fig. The similarity measures for the out-of-sample years with rth = 10%.

S1 Table. The credibility of Gr8 and its member graphs.

S2 Table. The credibility of Gr10 and its member graphs.

S3 Table. The probabilities of IsReversal = 1 with rth = 8% for the out-of-sample years.

S4 Table. The probabilities of IsReversal = 1 with rth = 10% for the out-of-sample years.

S1 File. Some detailed explanations of the potential driving factors of return reversal.

S1 Data. The capitalization of the stocks in our data set.

S2 Data. The panel data set that is used in our experiments.

Acknowledgments

Author Contributions

References

3.1 The results of experiments with r_th = 6%

S1 Fig. Dynamical Bayesian factor graph with r_th = 8%: G^r8.

S2 Fig. Dynamical Bayesian factor graph with r_th = 10%: G^r10.

S3 Fig. The similarity measures for the out-of-sample years with r_th = 8%.

S4 Fig. The similarity measures for the out-of-sample years with r_th = 10%.

S1 Table. The credibility of G^r8 and its member graphs.

S2 Table. The credibility of G^r10 and its member graphs.

S3 Table. The probabilities of IsReversal = 1 with r_th = 8% for the out-of-sample years.

S4 Table. The probabilities of IsReversal = 1 with r_th = 10% for the out-of-sample years.