How Volatilities Nonlocal in Time Affect the Price Dynamics in Complex Financial Systems

What is the dominating mechanism of the price dynamics in financial systems is of great interest to scientists. The problem whether and how volatilities affect the price movement draws much attention. Although many efforts have been made, it remains challenging. Physicists usually apply the concepts and methods in statistical physics, such as temporal correlation functions, to study financial dynamics. However, the usual volatility-return correlation function, which is local in time, typically fluctuates around zero. Here we construct dynamic observables nonlocal in time to explore the volatility-return correlation, based on the empirical data of hundreds of individual stocks and 25 stock market indices in different countries. Strikingly, the correlation is discovered to be non-zero, with an amplitude of a few percent and a duration of over two weeks. This result provides compelling evidence that past volatilities nonlocal in time affect future returns. Further, we introduce an agent-based model with a novel mechanism, that is, the asymmetric trading preference in volatile and stable markets, to understand the microscopic origin of the volatility-return correlation nonlocal in time.

From the perspective of physicists, a financial market is regarded as a dynamic system, and the price dynamics, i.e. the time evolution of stock prices, can be characterized by temporal correlation functions, which describe how one variable statistically changes with another.It is well-known that the price volatilities are long-range correlated in time, which is called volatility clustering.Many activities have been devoted to the study of the collective behaviors related to volatility clustering in stock markets [3,5,6,38,39].However, our understanding on the movement of the price return itself is very much limited.The autocorrelating time of returns is extremely short, that is, on the order of minutes [3,38].As to higher-order time correlations, it is discovered that the return-volatility correlation is negative -in other words, past negative returns enhance future volatilities [4,9,23,40,41].This is the so-called leverage effect.As far as we know, all stock markets in the world exhibit the leverage effect except for the Chinese stock market, which unexpectedly shows an anti-leverage effect, i.e., the correlation between past returns and future volatilities is positive [9,41].Returns represent the price changes, and volatilities measure the fluctuations of the price movement.The leverage and anti-leverage effects characterize how price changes induce fluctuations.At this stage, one may ask what affects the return itself.It has been discovered that future returns can be predicted by the dividend-price ratio [42,43], which is corroborated by subsequent studies.However, the predictive power of the dividend-price ratio is sensitive to the selection of the sample period [44,45].Recently, price extrema are found to be linked with peaks in the volume time series [13].Moreover, it is reported that massive data sources, such as Google Trends and Wikipedia, contain early signs of market moves.The argument is that these "big data" capture investors' attempts to gather information before decisions are made [35,46,47].These researches provide insight into the price dynamics.
What is the dominating mechanism of the price dynamics is highly complicated.The problem how volatilities affect the price dynamics has drawn much attention.Although many efforts have been made, it remains enormously challenging.According to a hypothesis known as the volatility feedback effect, an anticipated increase in volatility would raise the required return in the future.To allow for higher future returns, the current stock price decreases [24,25].Based on this hypothesis, various models, such as Generalized AutoRegressive Conditional Heteroskedasticity (GARCH) model [48] and Exponential GARCH (EGARCH) model [49] have been applied to examine the correlation between past volatilities and future returns, and the results are controversial.The correlation is discovered to be positive in some researches [24,25], while negative in others [49][50][51].Often the coefficient linking past volatilities to future returns is statistically insignificant [26].On the other hand, the volatility-return correlation function can be used to characterize the correlation between past volatilities and future returns.If the hypothesis of the volatility feedback effect is valid, the volatility-return correlation function should be non-zero.However, it typically fluctuates around zero [4,9].Such a volatility-return correlation function can only characterize the correlation local in time.In fact, the scenario in financial markets may be more complicated.Interactions, and thus correlations could be nonlocal in time.
In this study, we construct a class of dynamic observables nonlocal in time to explore the volatilityreturn correlation, based on the empirical data of hundreds of individual stocks in the New York and Shanghai stock exchanges, as well as 25 stock market indices in different countries.Strikingly, the correlation is discovered to be non-zero, with an amplitude of a few percent and a duration over two weeks.This result provides compelling evidence that past volatilities nonlocal in time affect future returns.Further, we introduce an agent-based model with a novel mechanism, that is, the asymmetric trading preference in volatile and stable markets, to understand the microscopic origin of the volatility-return correlation nonlocal in time.

Materials
We collect the daily closing prices of 200 individual stocks in the New York Stock Exchange (NYSE), 200 individual stocks in the Shanghai Stock Exchange (SSE) and 25 stock market indices in different countries.The time periods of the individual stocks and stock market indices are presented in Table 1.All these data are obtained from Yahoo! Finance (finance.yahoo.com).To keep the time periods for all stocks exactly the same and as long as possible, we select 200 stocks in the SSE, most of which are large-cap stocks.For comparison, 200 stocks in the NYSE are collected.

Asymmetric conditional probability in volatile and stable markets
To explore the volatility-return correlation in stock markets, we construct a class of observables, including conditional probabilities and correlation functions.We first discuss the conditional probabilities.The price of a financial index or individual stock at time t is denoted by Y (t ), and the logarithmic return is defined as R (t ) ≡ ln Y (t ) − ln Y (t − 1).For comparison of different indices or stocks, we introduce the normalized return Here • • • represents the average over time t .In other words, R (t ) = [ n i=1 R(i)] /n is the average of the time series R (t ), where n denotes the total number of the data points of R (t ), and σ = [ R 2 − R 2 ] 1/2 is the standard deviation of R (t ).There may be various definitions of volatility, a simplified one is which measures the magnitude of the price fluctuation.One may compute temporal correlation functions to investigate the dynamic correlations.The usual volatility-return correlation function is defined as f (t) = v (t ) • r (t + t) with t > 0, and it characterizes how the volatility at t influences the return at t + t.However, this correlation function fluctuates around zero [4,9].It is noteworthy that such a kind of f (t) is local in time, while interactions such as information exchanges in financial markets may be more complicated, leading to correlations nonlocal in time.
To explore the correlations nonlocal in time, we first define an average volatility at t over a past period of time T , To evaluate whether the average fluctuation in a short time period T 1 is strong or weak, we compare it with a background fluctuation, which is defined over a much longer period of time T 2 in the past.Therefore, we introduce the difference of the average volatilities in two different time windows, with T 2 T 1 .T 1 and T 2 are called the short window and long window, respectively.When ∆v (t ) > 0, the stock market in the time window T 1 is volatile; otherwise, it is relatively stable.
Next, we compute the conditional probability P + (t)| ∆v(t )>0 , which is the probability of r(t + t) > 0 on the condition of ∆v(t ) > 0.Here we consider only t > 0. Correspondingly, the conditional probability P + (t)| ∆v(t )<0 is the probability of r(t + t) > 0 for ∆v(t ) < 0. We do not observe any r(t + t) equal to 0 in the normalized return series.Thus, the conditional probability of r(t + t) < 0 is 1 − P + (t)| ∆v(t )>0 and 1 − P + (t)| ∆v(t )<0 , respectively.In a time series of returns, the total number of positive returns is generally different from that of negative ones.Let us denote the unconditional probability that the return is positive by P 0 (t), which is the percentage of the positive returns in all returns without any condition.
The specific calculations for P + (t)| ∆v(t )>0 , P + (t)| ∆v(t )<0 and P 0 (t) are described in S1 Appendix.If past volatilities and future returns do not correlate with each other, both P + (t)| ∆v(t )>0 and P + (t)| ∆v(t )<0 should be equal to P 0 (t).In other words, if P + (t)| ∆v(t )>0 and P + (t)| ∆v(t )<0 are different from P 0 (t), i.e., if the conditional probability of returns is asymmetric in volatile and stable markets, there exists a non-zero volatility-return correlation and such a correlation is nonlocal in time.In this case, it can be proven that if P + (t)| ∆v(t )>0 > P 0 (t), we have P + (t)| ∆v(t )<0 < P 0 (t), otherwise, we have P + (t)| ∆v(t )<0 > P 0 (t) (see S1 Appendix).To describe the asymmetric conditional probability in volatile and stable markets, we introduce ∆P (t It is important that the probability difference ∆P (t) relies on ∆v(t ), thereby depending on the time windows T 1 and T 2 .Even though the volatility-return correlation function local in time is zero, the nonlocal observable ∆P (t) can be non-zero.We call a pair of T 1 and T 2 at which ∆P (t) is non-zero an effective pair.At the time windows T 1 = 24 and T 2 = 205, for instance, we compute ∆P (t) for 200 stocks in the NYSE and take an average over these stocks.As displayed in Fig. 1(a), the average ∆P (t) remains positive for over 20 days with an amplitude of 1 percent.The result indicates that the past volatilities nonlocal in time enhance the positive returns in the future.For comparison, three curves for ∆P (t) averaged over 150, 100 and 50 randomly chosen stocks are also displayed.Within fluctuations, these three curves are consistent with that for ∆P (t) averaged over 200 stocks.We take the average over many stocks for the purpose of exploring the collective behavior of stocks.For a single stock, the price dynamics is much more complicated, and ∆P (t) fluctuates more strongly.Then we perform the same computation for 200 stocks in the SSE at the time windows T 1 = 10 and T 2 = 210.As displayed in Fig. 1(b), ∆P (t) remains positive for about 40 days and the amplitude is about 5 percent.Compared with the results for the NYSE, the amplitude and duration of ∆P (t) for the SSE are respectively much larger and longer.The reason may be that the US stock market is highly developed, with large market size and diversified investment philosophies, while the Chinese stock market is emerging and of small market size, in which the investment philosophies of investors resemble each other.For the validation of our methods, each point of ∆P (t) in Fig. 1 is analyzed by performing Student's t-test.In general, a p-value less than 0.01 is considered statistically significant.For the NYSE, the smallest p-value is in the order of 10 −12 , and all the p-values for 1 t 19 are less than 0.01.For the SSE, the p-values are even smaller, and less than 0.01 for 1 t 52.
Actually the definition of volatility in Eq. ( 2) is a simplified one.A more standard definition of volatility at t is where m represents a relatively small time window, which may be set to be 5 days, i.e., the number of the trading days in a week.Given that these two definitions v(t ) and v 1 (t ) may lead to different results in extreme volatility regimes, we consider both of them in our calculations.For v 1 (t ), the average volatility at t over a past period of time , with T m.Thus, the difference of the average volatilities in two different time windows is ∆v For further comparison, one may also define the average volatility at t over a past period of time T as and For the NYSE and SSE respectively,we compute ∆P 1 (t) and ∆P 2 (t), and take an average over individual stocks.The time windows are the same as those for ∆P (t) in Fig. 1.As displayed in Fig. 2, the curves for ∆P (t), ∆P 1 (t) and ∆P 2 (t) overlap each other within fluctuations.In the following calculations, we mainly consider ∆P (t) and ∆P 1 (t).Effective pairs of T 1 and T 2 In the calculation of ∆P (t), the time windows T 1 and T 2 are crucial.T 1 represents the recent period of time, and investors measure the current fluctuation of prices according to the volatility averaged over T 1 .Thus, T 1 should be relatively small.In our calculations, T 1 ranges from 1 to 44 days.Here 44 is the number of trading days in two months.T 2 stands for the period of time in which one estimates the background of volatilities in the past.Theoretically, T 2 should be much larger than T 1 .On the other hand, T 2 should not be arbitrarily large either: firstly, the memory of investors may not last very long; secondly, maybe more importantly, T 2 reflects the long-term fluctuation of stock markets, which should be reasonably fixed.In our calculations, T 2 ranges from 45 to 250 days.Here 250 is the number of trading days in a year.In fact, T 2 is more crucial to ∆P (t) than T 1 .If T 2 were equal to the total length of the volatility series, v (t ) T2 would be a constant, and ∆P (t) would become a local observable, which is just a volatility-return correlation function local in time but averaged over a T 1 -day moving window.In Fig. 1(a) and (b), we display ∆P (t) computed with a specific effective pair of T 1 and T 2 .Actually, the effective pair of T 1 and T 2 is not unique, and exists in a particular region.Therefore, we compute ∆P (t) with each pair of T 1 and T 2 , and identify the effective pairs at which ∆P (t) is significantly non-zero.Since ∆P (t) needs to be computed in a large region of T 1 and T 2 , it is inefficient to observe the behavior of ∆P (t) by eyes.Besides, due to the fluctuation of ∆P (t), the visual observation could be difficult in some cases.Therefore, we propose technical criteria to efficiently discriminate the non-zero ∆P (t).
The schematic diagram of the criteria is displayed in Fig. 3.The criteria comprise four steps: Figure 3.A schematic graph of the criteria for identifying non-zero ∆P .The red line represents ∆P (t),which is the 3-point smoothed ∆P (t).t 1 is the day when the sign of ∆P (t) changes for the first time.We define ∆P (t) in the range of 1 ≤ t ≤ t 1 − 1 as the first part, and that in the range of t 1 ≤ t ≤ t 1 + τ − 1 as the second part.The average absolute values for the first and second parts are denoted by AP 1 and AP 2 , respectively.
(1) ∆P (t) is smoothed with a 3-day moving window and the result is denoted by ∆P (t).
(2) Supposing ∆P (t) changes sign for the first time at t 1 , we define ∆P (t) in the range of 1 ≤ t ≤ t 1 −1 as the first part, and that in the range of t 1 ≤ t ≤ t 1 − 1 + τ as the second part.A non-zero ∆P (t) would remain positive or negative in the first part, while fluctuate around zero in the second part.We set τ to be 44, i.e., the number of the trading days in two months, which is long enough to confirm whether the second part of ∆P (t) fluctuates around zero.
(3) we calculate the average absolute values for the first and second parts of ∆P (t), denoted by AP 1 and AP 2 respectively.
(4) For a non-zero ∆P (t), it has to be satisfied that (i)∆P (t) > AP 2 for 1 t t 0 , and t 0 > 10; (ii) each value of ∆P (t) in the second part is smaller than AP 1 .With these conditions, we sift out the non-zero ∆P (t) preliminarily.To measure how significantly ∆P (t) differs from zero, we calculate the average value of ∆P (t) for 1 t t 0 , which is denoted by AP 0 .Actually, the larger |AP 0 | is, the more significantly ∆P (t) differs from zero.The average of non-zero AP 0 over different pairs of T 1 and T 2 is denoted by AP 0 .To consolidate our results, we identify those non-zero ∆P (t), which meet an additional requirement: (iii) |AP 0 | > |AP 0 |.AP 0 is set to 0 unless ∆P (t) satisfies all the requirements above.Now we compute ∆P (t) for the individual stocks in the NYSE with each pair of T 1 and T 2 .∆P (t) is averaged over 200 stocks, and the corresponding AP 0 is calculated.The landscape of AP 0 is displayed in Fig. 4(a).The result indicates that the effective pairs of T 1 and T 2 do exist in a particular region, and both T 1 and T 2 are characteristics of the stock markets.In Fig. 4(a), the effective pairs of T 1 and T 2 are basically adjacent to each other, suggesting that ∆P (t) locally is not very sensitive to T 1 and T 2 .This is somehow expected, since ∆P (t) is computed from the volatilities averaged over T 1 and T 2 , and a little alteration in T 1 or T 2 would not dramatically change ∆P (t).From this perspective, the gaps between the disconnected regions in Fig. 4(a) probably result from the fluctuations, especially taking into account the relatively small amplitude of non-zero ∆P (t) for the NYSE.Next, we perform a parallel analysis on the 200 stocks in the SSE, and the landscape for the amplitude of ∆P (t) is displayed in Fig. 4(b).Similar with the result for the NYSE, a large region of non-zero ∆P (t) is observed for the SSE.At a single pair of T 1 and T 2 , ∆P (t) averaged over 200 stocks would generally be non-zero, if ∆P (t) of some stocks is non-zero.Moreover, the region of non-zero ∆P (t) varies from one stock to another.Therefore, the average ∆P (t) of the individual stocks is non-zero in a relatively large region for both the NYSE and SSE.Additionally, as displayed in Fig. 4(b), there exists only one connected region of non-zero ∆P (t) for the SSE, with the amplitude dwindling from the center to the edge.Compared with the result for the NYSE in Fig. 4(a), the region of non-zero AP 0 in Fig. 4(b) is broader, without gaps, and the value of AP 0 is almost an order of magnitude larger.The reason may be traced back to the fact that the Chinese stock market is emerging, and less efficient than the US stock market.To further validate our methods, we perform Student's t-test on each point of non-zero ∆P (t) in Fig. 4. A p-value less than 0.01 is considered statistically significant.At an effective pair of T 1 and T 2 , ∆P (t) is confirmed to be non-zero, if all the p-values are less than 0.01 for 1 t 10.All non-zero ∆P (t) are confirmed except for a few ones at very small T 1 .
We also compute ∆P 1 (t) with different pairs of T 1 and T 2 for the NYSE and SSE.Since m in Eq. ( 6) is set to be 5, T 1 should not be smaller than 5.The landscapes for the amplitude of ∆P 1 (t) are almost the same as those for the amplitude of ∆P (t).
Further, we compute ∆P (t) for the 25 stock market indices in different countries.The volatility-return correlation is positive for 16 indices, and the corresponding effective pairs of T 1 and T 2 , as well as the maximum AP 0 , are given in Table 1.For most of these indices, the maximum AP 0 is over 2 percent, indicating that the correlation is rather prominent.In Fig. 5(a), we display the regions of effective pairs of T 1 and T 2 for 5 representative indices including the Brazil, Shanghai, Mexico, Spanish and S&P 500 indices.For other 7 indices, nonzero ∆P (t) could not be detected for almost all pairs of T 1 and T 2 .Exceptionally, the Australia and Japan indices exhibit a negative volatility-return correlation, i.e., the volatilities in a past period of time enhance the negative returns in future times.The effective pairs of T 1 and T 2 , as well as the maximum AP 0 for these two indices, are also presented in Table 1.We also compute ∆P 1 (t) for the 5 representative indices, and the regions of effective pairs of T 1 and T 2 are shown in Fig. 5(b).Compared with Fig. 5(a), the regions of the effective pairs of T 1 and T 2 in Fig. 5(b) change slightly.The reason may be that the fluctuation of ∆P (t) and ∆P 1 (t) for indices is stronger than that for the individual stocks.Specifically, navy stands for the Brazil Index, orange stands for the Shanghai Index, red stands for the Mexico Index, yellow stands for the Spanish Index and crimson stands for the S&P 500 Index.For clarity, we display only one color at the overlapping regions, given that these regions are small.Some scattered points are also omitted.
To confirm that the nonlocal volatility-return correlation is indeed a nontrivial dynamic property of the stock markets, we randomly shuffle the time series of returns, i.e., randomize the time order of the returns, and perform the same calculation.In this case, ∆P (t) just fluctuates around zero.The result provides evidence that the correlation does originate from the interactions between past volatilities and future returns.

Volatility-return correlation functions nonlocal in time
Up to now, we have only concerned with the signs of ∆v(t ) and r(t + t) in computing ∆P (t).Actually, the magnitudes of ∆v(t ) and r(t + t) should also be important to both theoretical analysis and practical applications.Taking into account the magnitudes of ∆v(t ) and r(t + t), we may explicitly construct a correlation function nonlocal in time to describe the volatility-return correlations, Both ∆P (t) and F (t) reflect the asymmetric behavior of r(t + t) in volatile and stable markets, but ∆P (t) should be more fundamental.When ∆P (t) is non-zero, F (t) would be zero only if the contributions of r(t + t) > 0 and r(t + t) < 0 happen to cancel each other.We compute F (t) with different pairs of T 1 and T 2 for the 200 stocks in the NYSE and SSE respectively, and identify the non-zero ones with the same criteria for the non-zero ∆P (t).We also introduce AF 0 to describe how significantly F (t) differs from zero, of which the definition is the same as AP 0 for ∆P (t).F (t) is averaged over 200 stocks, and the landscape of the corresponding AF 0 is displayed in Fig. 6.The dynamic behavior of F (t) is qualitatively the same as that of ∆P (t) but quantitatively different.Both the amplitude of F (t) and the region of effective pairs of T 1 and T 2 are smaller than those of ∆P (t).The fluctuation of F (t) is also somewhat stronger.Student's t-test is performed on the non-zero F (t) and almost all of them are confirmed to be non-zero.We also compute F 1 (t), which is defined as F 1 (t) = ∆v 1 (t ) • r(t + t) , with each pair of T 1 and T 2 for the NYSE and SSE, and the results are almost the same as those for F (t).In fact, ∆P (t) can be expressed as the correlation function G(t) = sgn(∆v(t )) • sgn(r(t + t)) .Here sgn(x) represents the sign of x.G(t) behaves almost the same way as ∆P (t) does.Additionally, one may also define another volatility-return correlation function H(t) = sgn(∆v(t )) • r(t + t) .Since only the magnitude of r(t + t) is taken into consideration, H(t) is less fluctuating than F (t), whereas the result looks qualitatively similar.
There have been many researches with different methods focusing on how volatilities affect returns in financial markets.A direct way is to calculate the usual volatility-return correlation function, which is defined as f (t) = v (t ) • r (t + t) with t > 0. However, the result fluctuates around zero [4,9].In the past years, various GARCH-like models are applied to investigate the correlation between past volatilities and future returns.In these models, the future returns are assumed to be correlated with the past volatilities, and there are coefficients quantifying the correlation.The results are controversial.The correlation is discovered to be positive in some researches [24,25], but negative in others [49][50][51].More often, the coefficient linking past volatilities and future returns is statistically insignificant [26].From our perspective, these studies only characterize the volatility-return correlation local in time.In our work, however, both ∆P (t) and F (t) are nonlocal in time, which are constructed based on the difference between the average volatilities in two different time windows.The correlation characterized by ∆P (t) and F (t) is more complicated and of higher-order.

Agent-based model with asymmetric trading preference
We construct an agent-based model to investigate the microscopic origin of the nonlocal volatility-return correlation.Agent-based modeling is a promising approach in complex systems, and has been applied successfully to study the fundamental properties in financial markets, such as the fat-tail distribution of returns, the long-range temporal correlation of volatilities, and the leverage and anti-leverage effects [15,18,27,39,[52][53][54][55][56][57].
The basic structure of our model is borrowed from the models in refs.[15,18], which is built on agents' daily trading, i.e., buying, selling and holding stocks.Since the information for investors is highly incomplete, an agent's decision of buying, selling or holding is assumed to be random.Due to the lack of persistent intraday trading in the empirical trading data, we consider that only one trading decision is made by each agent in a single day.In our model, there are N agents and each agent only operates one share of stock each day.On day t, we denote the trading decision of agent i by The probability of buying, selling and holding decisions are denoted by P buy , P sell and P hold , respectively.Assuming that the price is determined by the difference between the demand and supply of the stock, we define the return R(t) as Next, we introduce the investment horizon based on the fact that investors make decisions according to the previous market performance of different time horizons.It is found that the relative portion γ i of investors with i days investment horizon follows a power-law decay γ i ∝ i −η with η = 1.12.With the condition of where M is the maximum investment horizon.Considering different investment horizons of various agents, we introduce a weighted average return R (t) to describe the integrated investment basis of all agents.Specifically, R (t) is defined as where k is a proportional coefficient.We set k = 1/( M i=1 M j=i γ j ), so that |R (t)| max = N = |R(t)| max .According to ref. [18], the maximum investment horizon M is set to 150.
Herding behavior is an important collective behavior in financial markets.We define a herding degree D(t) to describe the clustering degree of the herding behavior, On day t + 1, the average number of agents in each group is N • D(t + 1), and therefore we divide all agents into 1/D(t + 1) groups.The agents in a same group make a same trading decision with the same trading probability.In ref. [15], it is assumed that the probabilities of buying and selling are equal, i.e., P buy = P sell = p, and p is a constant estimated to be 0.0154.Therefore the trading probability is P trade = P buy + P sell = 2p and the holding probability is P hold = 1 − 2p.In our model, the trading probability is also kept to be 2p and remains constant during the dynamic evolution.Now we introduce a novel mechanism in our model, that is, the asymmetric trading preference in volatile and stable markets.In financial markets, the market behaviors of buying and selling are not always in balance [58].Hence, P buy and P sell are not always equal to each other.They are affected by previous volatilities, and the more volatile the market is, the more P buy differs from P sell .
For an agent with i days investment horizon, the average volatility over previous i days is taken into account, which is defined as Then we define the background volatility as V M (t), where M is the maximum investment horizon.On day t, the agent with i days investment horizon estimates the volatility of the market by comparing V i (t) with V M (t).Therefore, the integrated perspective of all agents on the recent market volatility is defined as Thus, we define the probabilities of buying and selling as Here the parameter c measures the degree of agents' asymmetric trading preference in volatile and stable markets.Compared with the model in ref. [18], c is the only new parameter added in our model.We speculate that c can be determined from the trade and quote data of stock markets.Unfortunately, the data are currently not available to us.
To judge from the amplitude of the volatility-return correlation, c should be a small number.Let us set c to be 1/80.The total number of the agents, N , is 10000.The returns of the initial 150 time steps are set to be random values following a standard Gaussian distribution.On day t, we randomly divide N agents into 1/D(t) groups.The agents in a same group make a same trading decision with the same probability.After each agent makes his decision, the return R(t) can be computed.Repeating the procedure we produce 20000 data points of R(t) in each simulation, and abandon the first 15000 data points for equilibration.Thus we obtain a sample with 5000 data points.
After the time series R(t) generated from our model is normalized to r(t), we compute ∆P (t) with the time windows T 1 = 3 and T 2 = 150.The result is averaged over 100 samples and displayed in Fig. 7(a).∆P (t) is significantly non-zero with an amplitude of 3 percent, lasting for about 20 days.For comparison, three curves for ∆P (t) averaged over 75, 50 and 25 randomly chosen samples are also displayed.Within fluctuations, these four curves are consistent with each other and in agreement with the empirical results.We also perform the simulation with c = 1/40 and c = 1/160, respectively, to investigate the dependence of ∆P (t) on c.As displayed in the inset of Fig. 7(a), the amplitude of ∆P (t) increases with c, i.e., the magnitude of c determines the amplitude of the volatility-return correlation.For c = 1/40, the amplitude of ∆P (t) is about 6 percent, which is in the order of that for the SSE and other markets with a strong volatility-return correlation.For c = 1/160, the amplitude of ∆P (t) is close to that for the S&P500 index, of which the volatility-return correlation is relatively weak.Therefore, with c ranging from 1/160 to 1/40, our model produces the volatility-return correlation consistent with the empirical results.Additionally, if c is negative, the volatility-return correlation will be negative, i.e., the sign of c fixes the correlation to be positive or negative.
Next, we compute ∆P (t) with different pairs of T 1 and T 2 , and determine the region of effective pairs of T 1 and T 2 .∆P (t) is averaged over 100 samples, and the landscape of AP 0 is shown in Fig. 7(b).A single region with non-zero ∆P (t) is observed.For T 2 smaller than 120, for example, ∆P (t) is almost zero.In other words, the effective pairs of T 1 and T 2 exist in a particular region, which is consistent with the empirical results.

Discussion
We construct a class of dynamic observables nonlocal in time to explore the correlation between past volatilities and future returns in stock markets.Strikingly, the volatility-return correlation is discovered to be non-zero, with an amplitude of a few percent and a duration of over two weeks.The result indicates that past volatilities nonlocal in time affect future returns.Both the nonlocal dynamic observables ∆P (t) and F (t) rely on two time windows T 1 and T 2 .The effective pairs of T 1 and T 2 exist in a particular region, suggesting that both T 1 and T 2 are the characteristics of the stock markets.
Our results are robust for not only individual stocks but also stock market indices.The volatility-return correlation nonlocal in time is detected to be positive for individual stocks in the New York and Shanghai stock exchanges, as well as 16 stock indices.For other 7 indices, ∆P (t) fluctuates around zero.However, we suppose there may exist some higher-order correlations between volatilities and returns for these indices, which could be described by more complicated nonlocal observables.Exceptionally, other 2 indices exhibit a negative volatility-return correlation.
To investigate the microscopic origin of the volatility-return correlation, we construct an agent-based model with a novel mechanism, that is, the asymmetric trading preference in volatile and stable markets.Accordingly, a parameter c is introduced to describe the degree of the asymmetric trading preference.The simulation results exhibit a positive correlation which is in agreement with the empirical ones.More importantly, the effective pairs of T 1 and T 2 for simulation results exist in a particular region, which is also consistent with the empirical ones.Actually, our model can also produce a negative correlation by changing the sign of c.The results reveal that both the positive and negative correlations arise from the asymmetric trading preference in volatile and stable markets.In our model, the nonlocality arises from the interaction between the integrated perspective on the recent market volatility and the probabilities of buying and selling.
Our results provide new insight into the price dynamics.Contrary to the assumptions in various models, the rise and fall of prices turn out to be far from random.To the best of our knowledge, the volatility-return correlation nonlocal in time is the only property concerning the control of the price dynamics, given that the autocorrelating time of returns is extremely short.This non-zero volatility-return correlation implies that there may exist higher-order correlations of returns, which deserves further investigation in the future, especially for those 7 indices with ∆P (t) fluctuating around zero.Furthermore, our results indicate that nonlocality is an intrinsic characteristic in the financial markets, which is more important than we thought before.Besides the volatility-return correlation in the stock markets, many other nonlocal correlations in financial systems are to be explored, which serves as our future agenda.

Figure 1 .
Figure 1.The probability difference for the individual stocks.The probability difference ∆P (t) for (a) 200 individual stocks in the SSE and (b) 200 individual stocks in the NYSE.The black line shows ∆P (t) averaged over 200 stocks with error bars.The other lines represent ∆P (t) averaged over 150, 100, and 50 randomly chosen stocks.The time windows are T 1 = 24 and T 2 = 205 for the NYSE, and T 1 = 10 and T 2 = 210 for the SSE.

Figure 2 .
Figure 2. The probability differences for three definitions of volatility.The results are averaged over individual stocks.For the NYSE, the time windows are T 1 = 24 and T 2 = 205.The result for the SSE is displayed in the inset, with the time windows T 1 = 10 and T 2 = 210.

Figure 4 .
Figure 4.The landscape for the amplitude of ∆P (t).The amplitude AP 0 of ∆P (t) at different time windows T 1 and T 2 for (a) 200 individual stocks in the NYSE and (b) 200 individual stocks in the SSE.T 1 ranges from 1 day to 44 days, and the increment is 1 day.T 2 is from 45 to 250 days, with an increment of 5 days.The larger AP 0 is, the more significantly ∆P (t) differs from zero.For ∆P (t) fluctuating around zero, AP 0 = 0.

Figure 5 .
Figure 5.The effective pairs of time windows for five representative indices.The probability difference (a) ∆P (t) and (b) ∆P 1 (t) for five stock market indices.Different colors represent the regions of effective pairs of T 1 and T 2 for different indices.Specifically, navy stands for the Brazil Index, orange stands for the Shanghai Index, red stands for the Mexico Index, yellow stands for the Spanish Index and crimson stands for the S&P 500 Index.For clarity, we display only one color at the overlapping regions, given that these regions are small.Some scattered points are also omitted.

Figure 6 .
Figure 6.The landscape for the amplitude of F (t).The amplitude AF 0 of F (t) at different time windows T 1 and T 2 for (a) 200 individual stocks in the NYSE and (b) 200 individual stocks in the SSE.

Figure 7 .
Figure 7.The simulation results of the agent-based model.(a) The probability difference ∆P (t) computed with the simulated returns at the time windows T 1 = 3 and T 2 = 150.The parameter c is 1/80.The black line represents ∆P (t) averaged over 100 samples with error bars and the other lines stand for ∆P (t) averaged over 75, 50 and 25 randomly chosen samples.∆P (t) for different values of c are displayed in the inset, with T 1 = 3 and T 2 = 150.(b) The amplitude AP 0 of ∆P (t) at different time windows T 1 and T 2 .Each ∆P (t) is averaged over 100 samples.The larger AP 0 is, the more significantly ∆P (t) differs from zero.For ∆P (t) fluctuating around zero, AP 0 = 0.

Table 1 .
The time period, effective pair of time windows and maximum AP 0 .The time period, effective pair of T 1 and T 2 , and maximum AP 0 for the individual stocks in the NYSE and SSE, as well as 18 stock indices.The volatility-return correlation nonlocal in time is positive for all these indices and stocks, except for the Australia and Japan indices, which exhibit a negative volatility-return correlation.For other 7 indices, nonzero ∆P (t) could not be detected for almost all pairs of T 1 and T 2 .