## Figures

## Abstract

The problem of controlling stationarity involves an important aspect of forecasting, in which a time series is analyzed in terms of levels or differences. In the literature, non-parametric stationary tests, such as the Kwiatkowski-Phillips-Schmidt-Shin (KPSS) tests, have been shown to be very important; however, they are affected by problems with the reliability of lag and sample size selection. To date, no theoretical criterion has been proposed for the lag-length selection for tests of the null hypothesis of stationarity. Their use should be avoided, even for the purpose of so-called ‘confirmation’. The aim of this study is to introduce a new method that measures the distance by obtaining each numerical series from its own time-reversed series. This distance is based on a novel stationary ergodic process, in which the stationary series has reversible symmetric features, and is calculated using the Dynamic Time-warping (DTW) algorithm in a self-correlation procedure. Furthermore, to establish a stronger statistical foundation for this method, the F-test is used as a statistical control and is a suggestion for future statistical research on resolving the problem of a sample of limited size being introduced. Finally, as described in the theoretical and experimental documentation, this distance indicates the degree of non-stationarity of the times series.

**Citation: **Poulos M (2016) Determining the Stationarity Distance via a Reversible Stochastic Process. PLoS ONE 11(10):
e0164110.
https://doi.org/10.1371/journal.pone.0164110

**Editor: **Zhong-Ke Gao, Tianjin University, CHINA

**Received: **April 17, 2016; **Accepted: **September 20, 2016; **Published: ** October 20, 2016

**Copyright: ** © 2016 Marios Poulos. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

**Data Availability: **All relevant data are within the paper.

**Funding: **The author received no specific funding for this work

**Competing interests: ** The author has declared that no competing interests exist

## Introduction

A time series is a sequence of data points collected over time, and the analysis of time series is a very important issue in economic and engineering forecasting. In the engineering literature, state-space methods have been developed for sequential data analysis. Several researchers have attempted to bridge the gap between engineering methods and statistics [1, 2]. Recently, two advanced research areas have been combined using this approach: the methods of nonlinear time-series analysis and the theory of complex networks. Multivariate time-series analysis is used to model and explain the interactions and co-movements among a group of time-series variables. Specifically, Gao et al. [3–6] recently developed some important multivariate time series analysis methods, especially complex network-based methods, to uncover complicated flow mechanism underlying industrial multiphase flow system. Time-frequency (TF) analysis is considered to be one of the most significant active fields with respect to this issue. Because such data are essentially non-stationary and correlated and often have periodic patterns, multivariate time series are difficult to analyze and forecast [1].

In time-series clustering, time series data are partitioned into groups based on similarity or distance so that time series in the same cluster are similar. For time-series clustering, the first step is identifying an appropriate distance/similarity metric. Then, in the second step, existing clustering techniques, such as k-means, hierarchical and density-based clustering or subspace clustering, are used to find clustering structures. Furthermore, with modern methods, such as multivariate multi-scale complex network (MMCN) analysis [6], mapping a multivariate time series into a MMCN can be achieved. The goal of this study is to investigate the inherent properties of multivariate time series from the perspective of complex-network and multi-scale analysis. One significant property of complex network analysis is stationarity.

The stationarity property of a time series can be used to obtain significant sample statistics, such as variances, means, and correlations with other variables. However, such statistics are valuable as descriptors of forthcoming behavior only if the series is stationary. For instance, if the series reliably increases over time, the sample mean and variance will increase with the sample size but will always undervalue the mean and variance in forthcoming periods.

A stationary process is a process whose statistical properties (mean and standard deviation) do not vary according to the place or the time (at which the function is calculated) [7]. There are two main types of methods for examining stationarity: the parametric method and the nonparametric method [8]. Parametric approaches are generally used by researchers working in the time domain, such as economists, who make certain assumptions about the nature of their data. The bases of these approaches are unit-root tests (beginning with the classic Dickey-Fuller [DF] test, with numerous modifications, and Perron-type tests), which have as a null hypothesis the existence of a unit root in the series. These approaches use parametric auto-regression to approximate the autoregressive-moving-average (ARMA) structure of the errors in the test regression. The alternative to stationarity is a combined hypothesis. There are many types of Kwiatkowski-Phillips-Schmidt-Shin (KPSS) tests, such as the popular ones used for testing integration, which take null stationarity as a simple hypothesis [9]. The difference between these tests is that the KPSS test uses a nonparametric method to correct a series for serial correlation, whereas the DF and Perron tests use parametric corrections [9].

Given a choice between parametric and nonparametric tests, scientists choose the latter because the assumptions made in KPSS tests are more general, which makes them applicable to a wider class of processes. However, in both cases, problems remain unsolved. One application of KPSS tests is the adaptive detection of bandwidth through an observation process [10]. Additionally, in nonparametric methods, choosing the appropriate order (lag) for describing the time series is a critical step. This choice requires lag selection that satisfies stationary control, such as a KPSS test. Furthermore, the order (lag) selection is extremely difficult when the experiment focuses on only a few sample data; in this case, the default order is zero (0).

Additionally, an increase in sample size does not help reduce the occurrence of rejections; in fact, it increases the rejection rate. This finding runs counter to the large-sample theory on which stationarity tests typically rely: Asymptotic approximations yield higher accuracies as the sample size increases. The existence of such a size problem immediately calls into question the credibility of examining empirical evidence with stationarity tests, such as KPSS. Indeed, it is well understood that many observed time series in empirical macroeconomics and international finance exhibit robust persistence and, thus, fall into the problematic parameter zone [11].

Furthermore, the results of experiments conducted on Indian macroeconomic variables revealed that the aforementioned tests, including KPSS tests, are prohibitively sensitive to the choice of lag length [6]. Since, as of now, no theoretical criterion exists for the lag-length selection for tests in which the null hypothesis is stationarity, their use should be avoided, even for so-called ‘confirmation’ [12].

In conclusion, parametric stationary tests, such as KPSS tests, are very important but suffer from problems with reliability in the lag and sample size selection that limit their applicability.

The aim of this study is to introduce a new method that measures the distance in terms of the measurement error between mirror time series. This error corresponds to the theoretical distance that must be investigated to determine whether a time series is stationary. Using this approach eliminates the aforementioned problems with lag and sample size selection. Furthermore, the distance calculated via this method could be used as a specific property in a multi-scale complex network.

This distance is based on a stationary ergodic process, which relies on the following three (3) considerations:

- Any reversible process is stationary, and any time reversal of a reversible process is also stationary [13–15].
- If {
*X*_{n},*n*∈*R*} is stationary, then the time-reversed process is also stationary. - The possible metric deviation (distance) between the unpredictable series can be used as a measure of the degree of irreversible progress; this is implemented using the Dynamic Time-warping (DTW) technique. DTW is adopted because it is considered to be the better of the two time-series methods [16] because its metric is superior to the classic Euclidean distance metric. This metric, which corresponds to the degree of irreversibility, is called the “Stationarity Distance”, and it is a measure of the non-stationary characteristics of a time series and is used in the above-described process.

To corroborate these assertions, a series of statistical procedures is applied using the proposed method to analyze a specific data set. Additionally, a KPSS test (with a null hypothesis of stationarity) is also applied to the same data [17], and the reliability of the proposed method is evaluated on suitable (with stationary and non-stationary properties) time-series data. For further verification, the selected (non-)stationary data segments are visually inspected by plotting. More details on this procedure are presented in the Experimental Procedure section.

Finally, this study can be broadly divided into four sections. In the first section, Methodology, the definitions of the reversible stochastic process (RSP) and KPSS methods are given. In the second section, Experimental Procedure, the experimental methods and the results obtained with both methods are presented. In the third section, the statistical evaluation of the RSP method is analyzed. Finally, in the fourth section, the conclusions and plans for future research on the RSP method are described.

## Formulation

### The RSP Method

This method is based on the following formulation:

A discrete time-stationary process {*X*_{N},*i* = 1,…,*N*} is time reversible if, for every positive integer N [14],
(1)
and a discrete time series produces a corresponding mirror time series
(2)
(3)
Thus, given Eq 2, the proposed algorithm is based on the following:
(4)

If the error is zero (*error* = 0), then the time series consists of a stationary process, as defined in Eq 1, where the estimated error is based on the dissimilarity measure between the discrete time series and the reversible time series . The physical meaning of this distance is the process’s degree of stationarity.

### Error calculation using DTW

Numerous time series dissimilarity measures have been proposed [18]. However, the most common measures, which were proposed long ago, remain the most competitive ones. The most-used metric distance function is the Euclidean distance, which obeys the metric properties: *triangle inequality*, *non-negativity*, *identity*, and *symmetry*. This function remains amazingly competitive [19] with other, more complicated metrics, particularly for large data sets [20]. Euclidean distance and its variants have several drawbacks that make their use inappropriate in certain applications, such as sensitivity to noise, shifting and scaling [19,21]

DTW lends robustness to the similarity computation. Using this method, time series of different lengths can be compared because DTW replaces the one-to-one point comparison, used in Euclidean distance, with a many-to-one (and vice versa) comparison [19]. Thus, one of the main conclusions of comparison studies [22,23] is that, despite newly proposed methods, the Euclidean and DTW [20,24] dissimilarity measures remain two of the most robust, simple, generic, and capable measures. Additionally, to measure the shape similarity in the sign data set, alignment-based distances (such as DTW) are more suitable; in contrast, the Euclidean distance is not robust to distortions in time and other noise [25]. Because the DTW algorithm is more efficient than the Euclidean distance with respect to noise sensitivity [20,24,25], the DTW is selected for use in the error calculation based on Eq 3 to obtain more reliable measurements than would be possible using other, similar techniques.

Assume that the local dissimilarity of function {*f*} is defined between any pair of elements *x*_{i} ∧ *y*_{i} under the constraint
(5)

Then, if a given path is the lowest-cost path between two series, the corresponding technique (DTW) [26] lays the warping curve *φ*(*k*), ∀ *k* = 1,2,…,*T*:
(6)

The warping functions *φ*_{x} ∧ *φ*_{y} remap the time indices of *X* ∧ *Y* correspondingly. Given *φ*, the average accumulated distortion between the warped time series *X* ∧ *Y* can be calculated [26] as follows.
(7)
where *m*_{φ}(*k*) is a per-step weighting coefficient, and *M*_{φ} is the corresponding normalization constant, which confirms that the accumulated distortions are comparable along different paths. To ensure reasonable warps, constraints on *φ* are usually imposed. The idea underlying DTW is finding the optimal alignment *φ* such that
(8)
Therefore, one chooses the distortion of the time axes of *X* ∧ *Y* that brings the two time series as near to each other as possible.

### Procedure formulation

The measure of the degree of non-stationarity is calculated according to the scaling of , keeping in mind that a time series {*X*_{N}} is stationary when .

Then, according to these definitions, the implementation of this method can be analyzed as follows (Fig 1):

### Comparison with the adaptive KPSS test

According to the KPSS model [8,9], the null hypothesis of the stationarity trend corresponds to the hypothesis that the variance of the random walk (r_{t}) equals zero, which is expressed by the following equations.
(9)
(10)
and *t* = 1,2,3,…,*T*, which are expressed in terms of the number of observations.

Thus, if we suppose that the null hypothesis is determined to be *ξ*_{t} = *ξ*_{0}, i.e., is constant or , this hypothesis defines the stationary process against the hypothesis, which defines the nonstationary process. assuming that *e*_{t} are the estimated errors, which are computed as the residuals of regression {*y*_{t}} and are given by , then the value of KPSS is calculated using Eq (11).
(11)
where p is the order (lag) of an autoregressive AR(1) model. In other words, the partial autocorrelation at lag *p* is equal to the estimated AR coefficient in an autoregressive model with *p* coefficients [8]. The parameters of Eq 11 are defined as follows.
(12)
where *σ*^{2} is the long-run variance of *e*_{t},
(13)
(14)
is the consistent estimator of *σ*^{2}, which can be constructed from the residuals *e*_{t},

where *p* is the truncation lag and
(15)
is the optional weighing function that corresponds to the specific choice of window [10].

## Experimental Results

For the input data, a long data set *y*_{t} of sample size T = 1560 was selected from the Time Series Data Library [17, 27]. This data set contains internet traffic data (in bits) from an internet service provider (ISP) and aggregated traffic in the backbone of the United Kingdom academic network. It was collected between 19 November 2004 (09:30 hours) and 27 January 2005 (at 11:11 hours) as hourly data [17, 27]. This interval was chosen because this time series contains segments with (non-)stationary features (Fig 2).

Additionally, for calibration, we tested both methods on a classic stationary-shaped function [28,29] called the “Mexican Hat”, which was implemented in the numerical series to investigate the reliability and sensitivity of the tested methods with respect to the detection of a classic simulated stationary series (Fig 3 and Table 1). Thus, these simulated data were generated into T = [20, 40, 60, 100, 500] values to investigate the abilities of both methods to recognize this classic simulated stationary series as being stationary (Table 1). Similarly, the methods were also tested on a classic non-stationary function [28,29] called the “sinusoidal-shape” using simulated data with T = [32, 63, 126, 629, 1527] values (Fig 4 and Table 2).

Next, the processing of these data is implemented using two processing methods: RSP and KPSS. The above-described data set [16] is segmented into data sets of various sizes ranging from 42 and 301 (Tables 3 and 4). This segmentation was performed by applying an empirical criterion. The selected weak stationary data and non-stationary segments were obtained by visually inspecting the data using the plots. Specifically, the weak stationary data were segmented using visual criteria, such as stable mean and variance. These segments have sinusoidal shapes with visually apparent random phases. This selection was adopted because these data represent ergodic and weak stationary processes [30].

Furthermore, this segmented time series was verified using the KPSS criterion, and the null hypothesis results of the first two lags (Tables 3 and 4) were investigated. Finally, the statistical results and the series diagrams are depicted in Table 1.

### KPSS processing

The KPSS processing was implemented via the kpsstest.m function in Matlab. Via this procedure, we investigate the possible stationary positions of the candidate segment for two specific tests—0- and 1-autocovariance lags in the Newey-West estimator of the long-run variance, each conducted at a 0.1 level of significance—using a significance level of a = 0.01. Then, we calculate the KPSS value according to Eq (8) and the exact probability value (p-value). The p-value of a statistical hypothesis test indicates the probability of obtaining a value of the test statistic that is as extreme as or more extreme than that observed by chance alone. If the null hypothesis H_{0} is true, then the p-value determined by a KPSS test will be low, indicating an increased probability of rejecting the hypothesis of stationarity. In this case, the null hypothesis H_{o} (hp = 0) is accepted when the p-value, which is calculated using the kpss.m function of Matlab, is less than or equal to 0.01. For each candidate segment, two p-values are calculated for each lag (0 and 1) (Tables 2 and 3), whereas for the Mexican Hat, nine lags (0–9) are used (Table 1) [9].

### RSP processing

The RSP processing is implemented in 3 steps according to the procedure depicted in Fig 1. Specifically, in the first step, the selected time series is processed using a reversible procedure, according to the first and second definitions (Eqs (1) and (2)). For example, for a given time series (series 14 and Fig 5), using the reversible procedure, the reverse time series can be produced (series 15 and Fig 6).

In the second step, DTW = 0.3661 is calculated between the two time series using Eq (6), which is implemented via the dwt.m function of Matlab (Fig 7).

Finally, in the third step, the result of the characterization of the investigated time series X_{n} is extracted according to the evaluation procedure; this result is analyzed in Section 4.

## Results

As seen in Table 1, the decision hypothesis with H_{0} = 0, i.e., the acceptance decision for stationarity, depends on the sample size (Table 5) because the minimum-order lag increases dramatically according to a proportional relation. In contrast, for the RSP distance, the decision hypothesis depends on the sample size that consistently yields a zero RSP distance. Furthermore, in Tables 2 and 6, the decision hypothesis with H_{0} = 1, i.e., the rejection decision for stationarity, is presented for all five cases from the first lag. However, the rejection is very strong for segments with sizes exceeding 100. In contrast, in the RSP method, the calculated distances are much greater than one, indicating clear differentiation between the two investigated series types (Mexican Hat and sinusoid). Furthermore, in the sinusoid case (Table 6), the RSP distance is half of the corresponding sample size.

Furthermore, the results shown in Tables 3 and 4 indicate that, in the non-stationary decision, KPSS is very sensitive to the calculation of the orders of the lags. The characterized case of the error decision H_{0} = 0 (i.e., sample segment n.4 in Table 3, which is referred to as segment (200:300) in the time series in the ISP database). In contrast, the RSP distance yields stationary progress <10^{−2}, whereas the non-stationary progress is >10^{−1}. The statistical verification of this remark is provided in Section 4.

The most obvious method of estimating the similarity or dissimilarity between time series is to calculate a metric distance directly. In this case, the DTW between the original series is selected. For a small data set, this method may be feasible. However, for large data sets, it is problematic because the time complexity is O(n*N), where n is the number of features that must be obtained for each time series, and N is the number of time series in the data set. To calculate the similarity and index efficiently, many techniques for dimensionality reduction, such as the discrete Fourier transform (DFT), the piecewise aggregate approximation (PAA), and the discrete wavelet transform (DWT), have been proposed. These techniques allow a time series X of arbitrary length n to be represented with a time series of length w, where w< n [18].

The PAA is a very simple dimensionality-reduction method for time-series mining. For the time series *X*_{N} (Eq (1)), a new reduction time series *X*_{M} is obtained, which is calculated using the following equation:
(18)

This reduces the dimensionality from N to M by first dividing the original time series into M equally sized frames and then calculating the mean values for each frame. The sequence assembled from the mean values is the PAA approximation (i.e., transform) of the original time series. As shown in [31], the complexity of the PAA transform can be reduced from O(n*N) to O(n*M), where n is the number of frames.

In this case, the proposed method is tested using the PAA method to examine the efficiency of this method with respect to dimensionality reduction. Then, eight examples from Tables 3 and 4 are tested by attempting to reduce their dimensionality by ~50%, and the results are presented in Table 7. Furthermore, the results presented in Tables 1 and 2 demonstrate the efficiency of this dimensionality reduction because the stationary (Table 1) and non-stationary (Table 2) cases were tested using the same curves (Mexican hat and sinusoidal) with different sizes (20–1257) and yielded stable decisions. In Table 1 (stationary case), the reduction of the dimensionality from 500 to 20 yielded a value of zero in every case, unlike the KPSS method, which showed sensitivity to this variation. The same stable results were obtained for the results shown in Table 2 (non-stationary case), with dimensionality reduction from 1257 to 32. This case, the obtained distances are >15, and the KPSS method suffered from the same problems with the decision as mentioned for Table 1.

As shown by the results in Table 7, the error of the distances between the low-dimensional data and the original time series is very small (<0.005) and, in every case, the decisions on non-stationarity or weak stationarity remain the same, as indicated in Tables 3 and 4.

### Statistical processing

In this section, the populations of the variances, means and medians of the calculated distances (Tables 3 and 4), which were extracted separately for both classes (stationary and non-stationary), are investigated (Table 7). This investigation focuses on the following two queries:

- Why is the diversity higher than those of the two aforementioned populations?
- What is the sample size required for the calculated diversity to achieve statistical accuracy?

Answering these queries could involve accurately measuring of the classification of the time series, i.e., stationary or non-stationary. To this end, the two-sample F-test for equal variances (homogeneity) is adopted because the hypothesis of the homogeneity of variance is known to be significant in the classification stage of discrimination inquiries. Implementing this process returns a test decision for the null hypothesis H_{0} according to the data in vectors x and y, which states that x and y are drawn from normal distributions with the same variance [32]. The alternative hypothesis H_{A} is that they come from normal distributions with different variances. The result h will be 1 if the test rejects the null hypothesis at the 5% significance level and 0 otherwise. Thus, by using this test, the possible diversity of variances for the aforementioned populations is obtained. The F-test is calculated according to Eq (19):
(19)

Under the null hypothesis, the test statistic *F* has an *F*-distribution with degrees of freedom (equal to *n*_{1}−1) in the numerator and degrees of freedom (equal to *n*_{2}−1) in the denominator, where *n*_{1} and *n*_{2} are the sample sizes of the two data sets (x and y). Using the data in Table 4, the coefficient can be calculated via Eq (13): F = 6.4240e-04 with p = 4.4234e-26; the confidence intervals for the x and y vectors are 0.0003 and 0.0016, respectively, and DF = 19. Thus, H_{0} is strongly rejected because according to a two-tailed test.

Answering the second query could solve crucial issues regarding the collection of difficult-to-collect sampling data, such as medical signals. One important aspect of designing an experiment is determining how many observations are needed to draw conclusions with sufficient accuracy and confidence. The required sample size depends on many factors, including the type of experiment being contemplated, how it will be conducted, the available resources, and the desired sensitivity and confidence. Thus, the required sample sizes for the aforementioned two populations (stationary and non-stationary) are identified via an iterative procedure, resulting in a series of successively improving estimates of the required n [32]. To this end, the following equation is used:
(20)
(21)
and SS is the sum of the squares of the deviations from the mean. This values is called the sum of squares and is defined as and with *v*_{1} = *n*_{1}−1 and *v*_{2} = *n*_{2}−1.

Then, if we set *d* equal to half the width of the confidence interval [8], the interval limits for the difference between the two “populations means” can be estimated, and *d* is approximately with 2(*n*−1) degrees of freedom.

Suppose the difference between the populations is *μ*_{1}−*μ*_{2}. For 95% confidence, the interval must be no wider than . Therefore, *SS*_{1} = 6.1009e-05, *SS*_{2} = 0.0950, and the following quantity is determined using Eq (12):

Then, let us suppose that a sample size of 50 is necessary, with 2(50−1) = 98 degrees of freedom. Thus, *t*_{0.05(2),98} = 1.984. According to Eq (14), the limiting sample size n can be calculated using an iterative procedure [16]. First, we calculate

Next, it is possible to determine the estimate using *n* = 16, for which *t*_{0.05(2),30} = 2.042.

Repeating the procedure allows estimates that n = 17, for which *t*_{0.05(2),32} = 2.037.

According to the proposed convergence procedure [8], the value of the sequential iteration is less than 0.1. Therefore, a sample of size at least 17 (i.e., more than 16) should be obtained from each of the two populations to achieve the specified confidence interval (see Table 8).

## Conclusion

This paper investigates the measurement of the stationarity distance of a stochastic series. The proposed method is based on the assumption that each series deviates from a stationary state and that the series would itself be stationary if it satisfied a particular condition. This particular condition is expressed by a reversible property, which contains the stationary series. The estimated deviation of this condition is called the stationarity distance or measurement error between mirror time series. This distance is based on a novel stationary ergodic process, in which the stationary series has reversible symmetric features and is calculated using the DTW algorithm via a self-correlation procedure.

Additionally, for verification, this method is compared with the KPSS test. The results of this comparison indicate that the proposed method solves several problems associated the KPSS test, including those relating to the sample size and the prediction of the order lag on which the null hypothesis is based.

Furthermore, to obtain additional statistical evidence supporting the utility of this method, as a statistical control, the F-test was used. Resolving the problem of the introduction of a sample of limited size is a topic for future research. The testing results for both methods regarding the simulated stationary series (Mexican Hat) and non-stationary (sinusoidal) series revealed the superiority of the RSP method, particularly for large sample sizes >100 (Tables 1, 2, 5 and **6**). Additionally, the test performed using real data verified the weaknesses of KPSS, particularly for some cases (e.g., cases 4 and 17; Table 4) in which the null hypothesis is accepted for a visually verified non-stationary time series. Furthermore, in both tests, the RSP showed good agreement with the expected results. Additionally, the results of the F-test showed that the RSP-distance testing groups (weak-stationary in Table 3 and non-stationary in Table 4) have non-homogeneous properties, i.e., the distances of each group are drawn from normal distributions with different variances. This finding indicates that the selected distance populations can be differentiated from each other at the 95% confidence level. Furthermore, the selected sample size of each group (20) was found to be sufficient for this statistical analysis because it is greater than 17, the value calculated in Section 4.

## Future Research

This method could be applied to additional time-series difference data to investigate the variation in the stationarity distance over time. This research could be valuable for making predictions using diagnostic medical data. For example, electroencephalography (EEG) is a noninvasive and accessible method that is widely used to measure brain function and make inferences about regional brain activity. The stationarity of EEG has been studied by many researchers, but the stationarity of EEG segments with event-related potentials (ERPs) remains concerning in many abnormal cases. Thus, the proposed method could provide a practical solution for measuring the stationarity of EEG segments very quickly and accurately.

Furthermore, this method could be used in directed weighed-complex networks. For example, in a typical study [33], the weighed-complex network is constructed using the phase-space distance calculation. This model could be modified to use RSP instead of the phase-space distance to spatially depict the stationarity properties of the investigated times series, which could have very important applications, such as EEG and seismography.

## Author Contributions

**Conceptualization:**MP.**Data curation:**MP.**Formal analysis:**MP.**Funding acquisition:**MP.**Investigation:**MP.**Methodology:**MP.**Project administration:**MP.**Resources:**MP.**Software:**MP.**Supervision:**MP.**Validation:**MP.**Visualization:**MP.**Writing – original draft:**MP.**Writing – review & editing:**MP.

## References

- 1.
Harvey AC. Forecasting, structural time series models and the Kalman filter. Cambridge: Cambridge University Press; 1990.
- 2. Leite MCA, Petrov NP, Weng E. Stationary distributions of semistochastic processes with disturbances at random times and with random severity. Nonlinear Anal Real World Appl. 2012;13: 497–512.
- 3. Gao ZK, Yang YX, Zhai LS, Ding MS, Jin ND. Characterizing slug to churn flow transition by using multivariate pseudo Wigner distribution and multivariate multiscale entropy. Chem Eng J. 2016;291: 74–81.
- 4. Gao ZK, Fang PC, Ding MS, Jin ND. Multivariate weighted complex network analysis for characterizing nonlinear dynamic behavior in two-phase flow. Exp Therm Fluid Sci. 2015;60: 157–164.
- 5. Gao Z, Yang Y, Zhai L, Jin N, Chen G. A four-sector conductance method for measuring and characterizing low-velocity oil-water two-phase flows. IEEE Trans Instrum Meas. 2016;65: 1690–1697.
- 6. Gao ZK, Yang YX, Zhai LS, Dang WD, Yu JL, Jin ND. Multivariate multiscale complex network analysis of vertical upward oil-water two-phase flow in a small diameter pipe. Scientific Rep. 2016;6: 20052.
- 7.
Hyndman RJ, Athanasopoulos G. Forecasting: principles and practice. Heathmont, Vic.: OTexts; 2014.
- 8. Hobijn B, Franses PH, Ooms M. Generalizations of the KPSS-test for stationarity. Stat Neerl. 2004;58: 483–502.
- 9.
Syczewska EM. Empirical power of the Kwiatkowski-Phillips-Schmidt-Shin test. 2010. Available: https://ideas.repec.org/p/wse/wpaper/45.html.
- 10.
Dahlhaus R. Locally stationary processes. In: Rao TS, Rao CR, editors. Handbook of statistics. Amsterdam: North-Holland; 2012. pp. 351–412.
- 11.
Kuo BS, Tsong CC. Bootstrap Inference for Stationarity. Helsinki Center for Economic Research. Discussion Paper, no. 50. 2005. Available: http://ethesis.helsinki.fi/julkaisut/eri/hecer/disc/50/bootstra.pdf.
- 12.
Virmani V. Unit Root Tests: Results from Some Recent Tests Applied to Select Indian Macroeconomic Variables. 2004. Available: http://www.iimahd.ernet.in/publications/data/2004-02-04vineet.pdf.
- 13.
Kumar A, Manjunath D, Kuri J. Communication networking: an analytical approach. Amsterdam: Elsevier; 2004.
- 14. Sharifdoost M, Mahmoodi S, Pasha E. A statistical test for time reversibility of stationary finite state Markov chains. Applied Math Sci. 2009;52: 2563–2574.
- 15. Weiss G. Time-reversibility of linear stochastic processes. J Appl Probab. 1975;12: 831–836.
- 16. Aach J, Church GM. Aligning gene expression time series with time warping algorithms. Bioinformatics. 2001;17: 495–508. pmid:11395426
- 17.
Hyndman RJ. Time Series Data Library. Available: http://data.is/TSDLdemo.
- 18.
Serrà J, Arcos JL. A competitive measure to assess the similarity between two time series. In: International conference on case-based reasoning. Berlin, Heidelberg: Springer; 2012. pp. 414–427.
- 19.
Cassisi C, Montalto P, Aliotta M, Cannata A, Pulvirenti A. Similarity measures and dimensionality reduction techniques for time series data mining. Advances in Data Mining Knowledge Discovery and Applications. 2012. Available: http://www.earth-prints.org/handle/2122/8082
- 20. Sakoe H, Chiba S. Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans Acoust. 1978;26: 43–49.
- 21.
Perng CS, Wang H, Zhang SR, Parker DS. Landmarks: a new model for similarity-based pattern querying in time series databases. In: Proc2000 ICDE, 2000; San Diego, CA. 2000. pp. 33–42.
- 22. Keogh E, Kasetty S. On the need for time series data mining benchmarks: a survey and empirical demonstration. Data Min Knowl Discov. 2003;7: 349–371.
- 23. Wang X, Mueen A, Ding H, Trajcevski G, Scheuermann P, Keogh E. Experimental comparison of representation methods and distance measures for time series data. Data Min Knowl Discov. 2013;26: 275–309.
- 24.
Rabiner LR, Juang BH. Fundamentals of speech recognition. Upper Saddle River, NJ: PTR Prentice Hall; 1993.
- 25.
Zhang Z, Huang K, Tan T. Comparison of similarity measures for trajectory clustering in outdoor surveillance scenes. In: 18th International Conference on Pattern Recognition (ICPR'06), 2006 IEEE; 2006; pp. 1135–1138.
- 26. Giorgino T. Computing and visualizing dynamic time warping alignments in R: the DTW package. J Stat Softw. 2009;31: 1–24.
- 27. Cortez P, Rio M, Rocha M, Sousa P. Multi-scale internet traffic forecasting using neural networks and time series methods. Expert Systems. 2012;29: 143–155.
- 28. Kang K, Shelley M, Sompolinsky H. Mexican hats and pinwheels in visual cortex. Proc Nat Acad Sci U S A. 2003;100: 2848–2853.
- 29. Ermentrout GB, Cowan JD. A mathematical theory of visual hallucination patterns. Biol Cybern. 1979;34: 137–150. pmid:486593
- 30.
http://ocw.mit.edu/courses/mechanical-engineering/2-017j-design-of-electromechanical-robotic-systems-fall-2009/course-text/MIT2_017JF09_ch04.pdf.
- 31. Wang Q, Megalooikonomou V. A dimensionality reduction technique for efficient time series similarity analysis. Information Systems. 2008;33: 115–132. pmid:18496587
- 32.
Zar JH. Biostatistical Analysis. 4th ed. Upper Saddle River, NJ: Prentice Hall; 1999.
- 33. Gao ZK, Jin ND. A directed weighted complex network for characterizing chaotic dynamics from time series. Nonlinear Anal Real World Appl. 2012;13: 947–952.