A new test of multivariate nonlinear causality

The multivariate nonlinear Granger causality developed by Bai et al. (2010) (Mathematics and Computers in simulation. 2010; 81: 5-17) plays an important role in detecting the dynamic interrelationships between two groups of variables. Following the idea of Hiemstra-Jones (HJ) test proposed by Hiemstra and Jones (1994) (Journal of Finance. 1994; 49(5): 1639-1664), they attempt to establish a central limit theorem (CLT) of their test statistic by applying the asymptotical property of multivariate U-statistic. However, Bai et al. (2016) (2016; arXiv: 1701.03992) revisit the HJ test and find that the test statistic given by HJ is NOT a function of U-statistics which implies that the CLT neither proposed by Hiemstra and Jones (1994) nor the one extended by Bai et al. (2010) is valid for statistical inference. In this paper, we re-estimate the probabilities and reestablish the CLT of the new test statistic. Numerical simulation shows that our new estimates are consistent and our new test performs decent size and power.


Introduction
After the pioneering work of Granger [1], Granger causality tests have been developed into a set of useful methods to detect causal relations between time series in economics and finance. Linear Granger causality tests within the linear autoregressive model class have been developed in many directions, e.g., Hurlin et al. [2] proposed a procedure for causality tests with panel data, Ghysels et al. [3] test for Granger causality with mixed frequency data based on the multiple-horizon framework established by Dufour and Renault [4] and Dufour et al. [5]. Though linear tests of Granger causality have been investigated very deeply, they are limited in their capability to detect nonlinear causality.
The real world is "almost certainly nonlinear" as Granger [6] notes, so it is more important to test the nonlinear causality. Baek and Brock [7] develop a nonlinear Granger causality test which is modified by Hiemstra and Jones [8] later on to study the bivariate nonlinear causal relationship between two series. Among the various tests of nonlinear Granger causality, the Hiemstra-Jones test (hereafter, the HJ test) proposed by Hiemstra and Jones [8] is the most cited by scholars and the most frequently applied by practitioners in economics and finance. There were over 1100 Google Scholar hits by September 2016, which illustrates its significance in the economics and finance literatures. Bai et al. [9] extend the HJ test from bivariate setting to multivariate setting catering to the practical needs that economic and financial factors usually move together and influence others in groups. This extension encourages a large amount of applications. For example, Lam et al. [10] suggest to use such technics to make better investment decisions. Zheng and Chen [11] proposed a complete double selection method in identifying external influential factors for a particular stock market. Choudhry et al. [12] investigates the nonlinear dynamic co-movements between gold returns, stock market returns and stock market volatility during the recent global financial crisis. Choudhry et al. [13] investigate the relationship between stock market volatility and the business cycle in four major economies US, Canada, Japan and the UK.
However, several works note that counterintuitive results are obtained from the HJ test, Diks and Panchenko ( [14,15]) find that the HJ test is seriously over-rejecting in simulation studies. In accordance with the evidence presented by Diks and Panchenko ([14,15]), Bai et al. [16] reinvestigate the HJ test and reveal some of the underlying reasons for the questionable performance of HJ test. They find that the estimators of the probabilities in the definition are not U-statistics as Hiemstra and Jones [8] claimed and the central limit theorem of the test statistics is not valid. Bai et al. [16] propose a set of consistent estimators of the probabilities in the definition of Hiemstra and Jones [8] and provide a new test statistic with its asymptotic distribution.
Considering the significant importance of the multivariate nonlinear Granger causality test, there is an urgent need to reinvestigate Bai et al. [9] and extend Bai et al. [16] to multivariate setting. The remainder of this paper is organized as follows. In Section 2, we simply review the procedure of the multivariate nonlinear Granger causality test (here after BWZ test) extended by Bai et al. [9]. In Section 3, we re-estimate the probabilities in the definition of Bai et al. [9] and establish the asymptotic distribution of the new test statistics. Simulation results are presented in Section 4. Finally, we provide some concluding remarks in Section 5.

Definition 1 The vector time series {Y t } does not strictly Granger cause another vector time
where P(Á|Á) denotes conditional probability.
Using the notation Bai, et al. [9] re-express Eq (1) as For two sets of simultaneous samples {x i,t , i = 1, Á Á Á, n 1 , t = 1, Á Á Á, T} and {y i,t , i = 1, Á Á Á, n 2 , t = 1, Á Á Á, T}, they propose the following test statistic Remark: Following the instruction of Hiemstra and Jones [8], Bai et al. [9] take C j (Ã, n)s as multivariate U-statistic estimators of their counterparts C j (Ã)s and apply the asymptotic property of U-statistic to show the limiting results for the test statistics (3). However the C j (Ã, n)s are not U-statistics, because the expectations of the general terms are not the same. Moreover, the C j (Ã)s are related to the indices t and s (in fact, related to |t − s| for strongly stationary processes), while the C j (Ã, n)s were independent of t and s for summing up over them. Therefore, the C j (Ã, n) estimators are neither consistent nor asymptotic normal estimators of their counterparts C j (Ã).

A new multivariate nonlinear causality test
We first remind the reader that the pair (s, t) (in fact, |t − s| for strongly stationary processes) in Eq (1) of Definition 1 is a key parameter of the probabilities C j (Ã). In fact, both Hiemstra and Jones [8] and Bai et al. [9] note this, and there is no problem in Eq (1) of Definition 1. However, it seems that they overlooked this fact in their proposed estimation of C j (Ã). The improper estimators C j (Ã, n) thus lead to an invalid asymptotic distribution of the test statistic.
We now begin to state the procedure for our new test.
share a common standard deviation 1, and thereby share a common scale parameter. For any given pair (s, t), we denote Under the assumption of the stationary, for the given pair ( , which does not depend on t, so we can write , the same to the others. So under the assumption of strictly stationary, for each l > 0, we examine whether there is nonlinear Granger causality from {Y t } to {X t } by testing the following hypothesis If we consider two sets of simultaneous samples (To implement the test, each series is standardized at first so that all series share a common standard deviation, and thereby share a common scale parameter.) {x i,t , i = 1, Á Á Á, n 1 , t = 1, Á Á Á, T} and {y j,t , j = 1, Á Á Á, n 2 , t = 1, Á Á Á, T}, we first provide the consistent estimators of C 1 (M x + L x , L y , e; l), C 2 (L x , L y , e; l), C 3 (M x + L x , e; l) and C 4 (L x , e; l) arê Iðy L y j j;tÀ L y j ; y L y j j;tþlÀ L y j ; eÞ; Iðy L y j j;tÀ L y j ; y L y j j;tþlÀ L y j ; eÞ; The consistency of our proposed estimators can be shown straightforwardly and the detail of the proof is omitted. We use a simple numerical study to show that our estimators are consistent whereas those of Bai et al. [9] are not. Let where {a 1,t }, {a 2,t } are i.i.d. and mutually independent random variables generated from N(0, 1), while {Y t } could be any stationary sequence. Let l = 1, L x = L y = M x = 1. We can calculate the exact values of C 4 (L x , e; l), which are 0.2709 and 0.5057, respectively, when e = 1 and e = 1.5. For simplicity, we denote the true value of C 4 (L x , e; l), the estimate proposed by Bai et al. [9] and our new estimate as C 4 ,Ĉ BWZ 4 andĈ 4 , respectively, in Table 1. Additionally, Table 1 provides the estimated values with their corresponding relative estimation errors in brackets when T = 1000, 2000 and 4000. It is obvious thatĈ BWZ 4 is not consistent. Now, we propose as the test statistic, and we derive the following asymptotic distribution of T n for the Granger causality test. Theorem 1 Stationary sequences {x i,t , i = 1, Á Á Á, n 1 , t = 1, Á Á Á, T} and {y j,t , j = 1, Á Á Á, n 2 , t = 1, Á Á Á, T} are strong mixing, with mixing coefficients satisfying the conditions of Lemma 1 presented in Appendix, for given values of l, L x , L y , M x and e > 0, under the null hypothesis that {Y t } does not strictly Granger cause {X t }, then the test statistic is defined in (5) The asymptotic variance σ 2 (M x , L x , L y , e, l) with its consistent estimatorŝ 2 ðM x ; L x ; L y ; e; lÞ and the proof of theorem 1 are given in the Appendix (With some basic definitions from [17]). The hypothesis H 0 defined in (4) is rejected at α if jT n j=ŝðM x ; L x ; L y ; e; lÞ > z a=2 ; where z α/2 is the up α/2 quantile of the standard normal distribution. In this situation, we will conclude that there exists nonlinear Granger causality from {Y t } to {X t }.
There are several possible methods to estimate the asymptotic variance σ 2 (M x , L x , L y , e, l). A model-based approach uses known laws of {X t } and {Y t } to calculate the expectations in the formula given in the Appendix and simply substitutes C j (Ã), j = 1, 2, 3, 4 with their corresponding

Simulaiton
In this subsection, we perform numerical studies using simulations to illustrate the applicability and superiority of the new multivariate nonlinear Granger causality test developed in Section 3. Let R be the times of rejecting the null hypothesis that Y t does not strictly Granger cause X t nonlinearly in 10,000 replications at the α level, and thus, the empirical power is R/10,000. In our simulation, the level α = 0.05, we standardized the series and chose the same lag length and lead length: L x = L y = M x = 1. We set three situations of l and two situations of e: l = 1, l = 2, l = 3 and e = 1, e = 1.5. Consider the following model: where {(Y 1,t , Y 2,t ) 0 } are i.i.d. and mutually independent random variables generated from standard normal distribution N(0, 1), {ε t } is Gaussian white noise generated from N(0, 0.1) and independent of {Y 1,t }, {Y 2,t }. There is no nonlinear Granger causality from Y t to X t when β = 0, and causality strengthens when β increases. From the results displayed in Table 2, we conclude first that our test possesses decent size, as we can see when β = 0 the empirical size are all closed to the test level 0.05 for different settings of parameters and sample size. Second, our test possesses very appropriate power, as we see that empirical power increases as β increases, especially when sample size is 500 the empirical power sharply increase to 1. Further, we find that different settings of e may influence the test results. Though the influence is little in our simulation, we still suggest that practitioners choose a couple of different values of e.

Application
Now we apply our new method to detect the nonlinear causality from returns to exchanging volumes on China's stock market. To eliminate the potential influence of asymmetry information caused by language barriers, different accounting standards and foreign currencies exchanging, we consider China's A shares which are denominated in the local currency the Yuan (also called RMB) and traded among China's citizens. A shares listed on the Shanghai Stock Exchange are named SHA while the ones listed on the Shenzhen Stock Exchange are named SZA.
We denote the prices of SHA and SZA at t as P SH t and P SZ t and denote the according trading volumes as V SH t and V SZ t . The daily stock returns of SHA and SZA (from Jan. 1, 2010 to Dec. 30, 2016) are expressed as Y 1t ¼ ln ðP SH t =P SH tÀ 1 Þ and Y 2t ¼ ln ðP SZ t =P SZ tÀ 1 Þ respectively, the percentage changes in trading volume are expressed as X 1t ¼ ln ðV SH t =V SH tÀ 1 Þ and X 2t ¼ ln ðV SZ t =V SZ tÀ 1 Þ. We apply our new multivariate causality test proposed in section 2.2 to detect the causalities between (Y 1t , Y 2t ) 0 and (X 1t , X 2t ) 0 , we let L x 1 = L x 2 = L y 1 = L y 2 = m x 1 = m x 2 = 1 and consider l = 1, l = 2 and l = 3, e = 1 and e = 1.5. Table 3 presents some descriptive statistics of Y 1t , Y 2t , X 1t and X 2t . It is worth noting that the standard deviations of Y 1t , Y 2t , X 1t and X 2t are 0.014843, 0.017552, 0.195853 and 0.171018 respectively. To implement our proposed test, Y 1t , Y 2t , X 1t and X 2t need to be standardized at first so that all series share a common standard deviation 1. To avoid tedious notation we also denote the standardized sequences (Y 1t − Mean(Y 1t ))/SD(Y 1t ), (Y 2t − Mean(Y 2t ))/SD(Y 2t ), (X 1t − Mean(X 1t ))/SD(X 1t ) and (X 2t − Mean(X 2t ))/SD(X 2t ) as Y 1t , Y 2t , X 1t and X 2t . Table 4 reports the results of detecting causality from (Y 1t , Y 2t ) 0 to (X 1t , X 2t ) 0 as well as two pairs of subset, Y 1t to X 1t and Y 2t to X 2t . Generally speaking, there exists causality from stock returns to volume changes, as we can see on the row which testing no causality from (Y 1t , Y 2t ) 0 ! (X 1t , X 2t ) 0 , p values are all much smaller than level 0.05. Further, we try to dig which subset possess the dynamic causal explanatory ability. Results on the last row of Table 3 strongly suggest that the returns of SZA nonlinearly cause the trading volume of SZA. However, the causal explanatory ability is weak from the returns of SHA to the trading volume of SHA. We think these results coincide with practical situation, noticing that companies listed on the Shanghai Stock Exchange are likely to be big state-owned companies, while those listed on the Shenzhen Stock Exchange tend to be smaller export-oriented companies. Investors who invest on big stateowned companies are tend to be value investment and would like to hold their stocks with in a long time but pay less attention on the price changing comparing to these invest on Shenzhen A shares.

Conclusion and remarks
In this paper, we reinvestigate the multivariate nonlinear Granger causality test extended by Bai et al. [9] which attempt to uncover significant nonlinearities in the dynamic interrelationships between two groups of variables. We find that Bai et al. [9] as well as Hiemstra and Jones [8] take the estimators of the probabilities in their definition as U-statistics and establish a CLT of the test statistic by applying the asymptotic property of U-statistics. After revealing that the estimators proposed by Bai et al. [9] is not U-statistics, we show that their estimators are also not consistent. The procedure of our new test begins with presenting consistent estimators of probabilities in the definition. Numerical study supports that our estimators are consistent, further our new test possesses admirable properties both in size and power.
There are still amounts of appealing aspects in nonlinear Granger causality test. It is worth noting that Diks and Wolski [18] extend the test in Diks and Panchenko [15] which highlight a need for substitutions for the relationship tested in the HJ test.
Appendix A1: Central limit theorems for strong mixing stationary sequence fðZ t ; F t Þ; À 1 < t < 1g is a stochastic process defined on the probability space ðO; F ; PÞ. The history and the future of Z t are σ-algebras M 1 t ¼ fF s ; s > tg and σ-algebras EðZ 0 Z j Þ < 1, and if σ 6 ¼ 0, then lim n!1 Readers can be referred to Ibragimov [17] for a proof and detailed discussion.