Figures
Abstract
Non-parametric Mann-Kendall tests for autocorrelated data rely on the assumption that the distribution of the normalized Mann-Kendall tau is Gaussian. While this assumption holds asymptotically for stationary autoregressive processes of order 1 (AR(1)) and simple moving average (SMA) processes when sampling over an increasingly long period, it often fails for finite-length time series. In such cases, the empirical distribution of the Mann-Kendall tau deviates significantly from the Gaussian distribution. To assess the validity of this assumption, we explore an alternative asymptotic framework for AR(1) and SMA processes. We prove that, along upsampling sequences, the distribution of the normalized Mann-Kendall tau does not converge to a Gaussian but instead to a bounded distribution with strictly positive variance. This asymptotic behavior suggests scaling laws which determine the conditions under which the Gaussian approximation remains valid for finite-length time series generated by stationary AR(1) and SMA processes. Using Shapiro-Wilk tests, we numerically confirm the departure from normality and establish simple, practical criteria for assessing the validity of the Gaussian assumption, which depend on both the autocorrelation structure and the series length. Finally, we illustrate these findings with examples from existing studies.
Citation: Gamot T, Thibeau--Sutre N, Van Dooren TJM (2026) On the Gaussian distribution of the Mann-Kendall tau in the case of autocorrelated data. PLoS One 21(2): e0333224. https://doi.org/10.1371/journal.pone.0333224
Editor: Sheng Du, China University of Geosciences, CHINA
Received: September 8, 2025; Accepted: January 24, 2026; Published: February 19, 2026
Copyright: © 2026 Gamot et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The code for reproducing figures is accessible at https://doi.org/10.5281/zenodo.18414688.
Funding: AAP2023 FIRE mini-grant.
Competing interests: The authors have declared that no competing interests exist.
1 Introduction
The Mann-Kendall test is a non-parametric statistical method designed to assess whether a time series exhibits a monotonic trend, based on the Mann-Kendall tau statistic. Initially introduced by Henry B. Mann [1] and later refined by Maurice G. Kendall [2], the test leverages the fact that for independent and identically distributed (i.i.d.) data, the distribution of the Mann-Kendall tau normalized by its variance converges asymptotically to a Gaussian distribution.
To address scenarios where the data are not i.i.d., extensions have been developed for autocorrelated datasets [3,4]. These modifications incorporate adjustments to account for serial correlation; however, they do not establish asymptotic normality and instead proceed under the assumption that this property remains valid.
When considering autoregressive processes of order 1 (AR(1)) and simple moving average processes of order q (SMA(q)) with fixed parameters, classical results on the Central Limit Theorem for U-statistics applied to α-mixing processes [5] and m-dependent processes [6] establish that the normalize’s asymptotic distribution is Gaussian. Yet, when the lag-1 autocorrelation parameter or the order of the moving average is high relative to the finite length of a time series, the empirical distribution of the normalized Mann-Kendall statistic is far from Gaussian (see for example [7]). This is a key observation as, in practice, the family of Mann-Kendall tests are applied to autocorrelated time series of finite length and Gaussian distributions are used as an approximation. For instance, modified versions of the original test for autocorrelated data are widely used in hydrological studies that typically involve time series with several dozen to hundreds of data points [3,4,7]. So how can one determine whether the Gaussian approximation is justified? Since the Mann-Kendall tau is based on pairwise comparisons of all random variables in a sequence, this question requires analyzing asymptotic regimes where the density of pairs with non-negligible dependence is non-zero. To explore this, we examine specific sequences of time series of increasing length generated by AR(1) or SMA(q) processes, as defined in Sect 2. The relevance of asymptotic results remains a point of discussion in statistics (e.g. [8]) and exact and Monte Carlo methods have been proposed as general remedies for hypothesis testing on finite samples (e.g. [9,10]). However, in the context of time series, there is scope for different asymptotic results than the ones usually envisaged. Here we will investigate sequences differing from the ones investigated previously where the limits converged to Gaussian distributions. Each time series in the investigated sequences is generated by a stationary process whose parameters depend on the series length and, therefore, on its position within the sequence. For the AR(1) case, this sequence of time series corresponds to refining the sampling of a continuous Ornstein-Uhlenbeck process within a fixed time window - i.e. upsampling - which increases autocorrelation. Regarding the SMA(q) process, these sequences amount to averaging increasingly larger samples of a white noise process while maintaining constant relative window size. Such averages are used to construct statistics given the null model when testing for the presence of critical transitions in time series - also known as early warning signals [11]. In Sect 3, we prove that the asymptotic distribution of the normalized Mann-Kendall tau of these two types of times series cannot be Gaussian. These proofs suggest natural scalings for deciding whether the Gaussian approximation is suitable for time series of finite length generated by AR(1) or SMA(q) processes. Finally, Sect 4 numerically illustrates the departure from Gaussian behavior using the Shapiro-Wilk test for normality and confirms that these scalings are appropriate. We can therefore provide easy-to-implement criteria, for given values of parameters and time series length, to decide whether the Gaussian approximation - and hence the Mann-Kendall tests - may be appropriate or not for a time series.
2 Assumptions and examples
Let X be a random variable, and Xi, , be n random variables having the same distribution as X. The Mann-Kendall tau for the time series
is defined as [1,12]:
where and
.
We assume that the distribution of X is such that there are no ties. For simplicity, we denote the Mann-Kendall tau for the sequence by
. This non-parametric statistic is a special case of Kendall’s rank correlation coefficient and is used for detecting monotonic trends. It ranges from -1 (strictly decreasing trend) to +1 (strictly increasing trend).
If the Xi are independent, Kendall [2] proved that:
where is the variance of the random variable
,
is the standard Gaussian random distribution and
stands for convergence in distribution. Let us stress that it is the normalized random variable
which converges in distribution towards a Gaussian and not
(which is bounded between –1 and +1).
Let us now consider the case where the Xi are not independent and are Gaussian random variables. The variance of the Mann-Kendall tau simplifies as follows using Greiner’s equality [13]:
where is the expectation of the random variable
and
denotes the Pearson correlation coefficient.
In this paper, we only consider sequences of identically distributed Gaussian random variables (for a fixed n) that verify the following property:
Assumption 1 (Correlation function). such that,
, the sequence
satisfies
We then call the autocorrelation function of the sequence
. Note that, if
,
and
, then Assumption 1 means that the sequence is weak-sense (or wide-sense) stationary. In the case of Gaussian random variables, weak stationarity is equivalent to strict stationarity. This condition is satisfied by all examples considered in this article.
In the remainder of this section, we introduce an assumption on the existence of a renormalized asymptotic autocorrelation in a sequence of time series. This enables us to derive an easy-to-handle expression for the asymptotic variance of the Mann-Kendall tau of these time series.
Assumption 2 (Correlation function renormalization). such that,
and
, the sequence of sequences
satisfies
where, for all positive integers is a sequence of n random variables.
In this article, we choose sequences of identically distributed Gaussian random variables whose correlation depends on the length n of the sequence, hence the superscript (n). Then, random variables from two sequences and
of length n and m (with
) have different correlation functions. Hereafter we introduce the two classes of sequences that we have in mind for considering Assumptions 1 and 2:
Example 1 (Autoregressive process of order 1 AR(1)). Let such that:
where, for a given n, the are independent and identically distributed Gaussian random variables:
.
Then for the sequence of sequences , Assumptions 1 and 2 are true. In particular,
. Note that we only consider k > 0 with this renormalisation.
For instance, if , the first elements of the sequence of sequences are given by:
where
,
where
and
where
,
where
,
, and
, where
,
where
,
,
, and
, where
,
- etc.
Example 1 presents sequences of time series of length n generated by AR(1) processes with increasing autocorrelation at lag-1 . The time series are comparable to upsampling an Ornstein–Uhlenbeck process over [0,1] as described in [14] (§5) with the sequence
representing a regular subdivision of this process with steps of
. For example, starting with two values sampled at the edges of the interval, that is at time 0 and 1, the autocorrelation between these is equal to
. If one subdivides the interval into n > 2 uniformly spaced samples, then the autocorrelation between two successive values has to be
so that the autocorrelation between the first and last values remains
. So this case arises naturally when increasing the sampling of the same experiment of finite duration.
Example 2 (Simple moving average SMA). Let , where
is the floor function, and
such that:
where the are independent and identically distributed Gaussian random variables:
.
Then for the sequence , Assumptions 1 and 2 are true. In particular,
.
A standard parameter characterizing moving average processes is the relative window size. As an example, starting with a dataset of N points and averaging by groups of q contiguous points creates a moving average dataset of length with a relative window size
. Then, if the window size q depends on n and
, the relative window size is asymptotically
.
Example 2 presents a sequence of time series generated by SMA processes whose relative window size tends towards a constant when the length of the time series goes to infinity. This type of time series arises naturally when considering averaging over windows of fixed relative size. For example, when testing for the presence of critical transitions in time series (also known as early warning signals), a methodology involves averaging over a constant fraction of the time series [11].
Finally, following the definition by [15], the Mann-Kendall tau can be defined as a U-statistic with a non-symmetric kernel of degree 2 [16]. Classical results on the Central Limit Theorem for U-statistics applied to α-mixing processes [5] and m-dependent processes [6] establish that the asymptotic distribution of the Mann-Kendall tau is Gaussian for AR(1) and MA(q) processes with fixed lag-1 autocorrelation parameter k for AR(1) processes and order q of SMA processes. However, in Examples 1 and 2, we are considering sequences of time series generated by processes whose parameters depends on the length of the time series. For the example cases, we will prove that the distribution of the normalized Mann-Kendall tau cannot be asymptotically Gaussian.
3 Asymptotic variance of the Mann-Kendall tau for renormalized ARMA process
In this section, we delve deeper into the two classes of examples introduced, building upon Assumptions 1 and 2 to derive a key lemma for calculating the variance of the Mann-Kendall tau. All proofs are provided in S1 Appendix.
As before, let’s consider a sequence of identically distributed Gaussian random variables. Then, under Assumption 1, we obtain:
Lemma 1 (Renormalization). Suppose Assumptions 1 and 2 hold for a sequence of sequences . For a given n, the
are identically distributed Gaussian random variables.
Let and
. Then:
where is the Mann-Kendall tau of the
sequence and
.
In the field of time series analysis, stationary autoregressive moving average (ARMA) processes are often considered. They are composed of an autoregressive part and a moving average part. Since these latter processes are the examples we are interested in, we introduce the more general class of ARMA processes.
Definition 1 (ARMA process). An autoregressive moving average process of order (p,q) (ARMA(p,q)) is a discrete temporal process () such that:
where kj and are the parameters of the model and the
are the error terms (white noise).
An autoregressive process (p) is an
(p,0).
A moving average (q) is an
(0,q).
In the following, we will only consider p = 1, 0 < k < 1, i.i.d. Gaussian random variables and all the
equal to one (the simple moving average, SMA).
In the definition of the ARMA process, the noise is chosen as Gaussian and so the ARMA process is a Gaussian process. Moreover, we choose the variance of
such that for the AR(1) process,
.
Lemma 2 (Correlation for the ARMA process). Let () follow an ARMA(1,q–1) process such that:
Let and d = j − i, then:
- If d < q–1
- If
Let 0 < k < 1, a > 0, , and let
follow an ARMA(1,qn) process of parameter kn with
and
. Then Assumption 2 holds, and we can use Lemma 1 and Lemma 2 to obtain the following theorem:
Theorem 1 (Renormalized correlation function for the ARMA process). Let a > 0 and . For
, let
follow an ARMA(1,qn) process of parameter kn and such that
, then:
From Theorem 1, we deduce results for the AR(1) and SMA(q) cases.
Corollary 1.1 (Asymptotic variance for the AR(1) process). Let be a sequence of random variables following an AR(1) process of parameter kn such that
with
. Then,
and:
Corollary 1.2 (Asymptotic variance for the SMA(q) process). Let be a sequence of random variables following a SMA(qn) process with
. Then,
.
- If
- If
In particular, in Corollaries 1.1 and 1.2, cannot converge in distribution to a Gaussian. Indeed, let’s assume that
. Then, as
is bounded between –1 and +1, the support of the distribution of
is uniformly bounded for all
and so the normalized Mann-Kendall tau cannot converge to a Gaussian.
A key result regarding a specific sum of arcsin terms, which is essential for calculating the variance of the Mann-Kendall tau in the context of SMA processes, is presented in the following Proposition:
Proposition 1.
In this section, we demonstrated that the asymptotic (as ) variance of the Mann-Kendall tau is strictly positive for sequences of time series of length n generated by the following processes:
- an AR(1) process with autocorrelation at lag-1 parameter kn increasing towards 1 as
in the following manner:
with
independent of n.
- an SMA(qn) process with parameter
, which corresponds to an asymptotic relative window size
.
Consequently, the normalized Mann-Kendall tau of these time series cannot converge to a Gaussian.
4 Checking for non-normality
In practical applications, one deals with finite time series produced by AR(1) or SMA(q) processes with usually constant parameter values k or q. In that case, we have seen at the end of Sect 2 that the asymptotic distribution of the Mann-Kendall tau is Gaussian. However, this asymptotic result is less relevant when it is needed to decide whether a statistic calculated on a finite time series would approximately follow a Gaussian distribution or not. It is possible to determine the parameters or α so that the time series would have been produced by processes described in Examples 1 and 2. For such processes we know that these parameters determine the asymptotic variance associated with increasing upsampling. For example, a time series of length n produced by an AR(1) of correlation at lag-1 k can be identified as a time series of length n described by Example 1, where
. If
is asymptotically (as
) non-zero, we have proved that the distribution of the Mann-Kendall tau of this time series cannot converge to a Gaussian distribution. Consequently, our results suggest that, for time series of finite length, assuming normality of the Mann-Kendall statistic is not appropriate when autocorrelation is strong relative to the length of the time series, and this also limits the applicability of the modified Mann-Kendall tests. Therefore, it is essential to understand how the distribution of the Mann-Kendall tau deviates from normality for specific parameter values and time series lengths.
Deriving Berry-Esseen bounds for the Mann-Kendall tau across different types of autocorrelated processes would help us understanding how the distribution of the Mann-Kendall tau deviates from normality for specific parameter values and time series lengths, as these bounds quantify the accuracy of the Gaussian approximation [17]. For instance, uniform and non-uniform bounds have been established for U-statistics with symmetric kernels in the case of independent samples [18] and weakly dependent samples [19]. However, these bounds depend on both sample size n and a constant term influenced by autocorrelation, which lacks a clear expression. It makes it impractical as a theoretical criterion for when to use the family of Mann-Kendall tests for autocorrelated processes such as the AR(1) process. Furthermore, while the Kendall tau is a U-statistic with a symmetric kernel, the Mann-Kendall tau has a non-symmetric kernel [16], limiting the applicability of many classical results. Therefore, we conducted a numerical investigation on the Mann-Kendall tau distribution for time series of length n generated by AR(1) and SMA(q) processes, with varied levels of autocorrelation (parameter k) and window size (parameter q). We investigated whether isolines of the values of parameters or α on the spaces delineate regions where the distribution of the Mann-Kendall tau is close to Gaussian or not.
4.1 Numerical investigation
For each combination of parameters and length of time series, we computed the Mann-Kendall tau of 102 different time series to find the empirical distribution of tau. To evaluate if these empirical distributions are roughly Gaussian, we used the Shapiro-Wilk test, which tests the null hypothesis that the population is an i.i.d. Gaussian sample (with unknown expectation and variance) [20]. We compared the rejection rate (computed over 104 p-values) of this null hypothesis to the predetermined significance level. If the true distribution of our simulated tau-values is, indeed, Gaussian, then the proportion of rejections should converge to the significance level as sample size (i.e., the number of tau-values) increases. We used the Shapiro-Wilk test because it is more powerful than other classic normality tests [21], utilizing the stats.shapiro Python implementation from the SciPy package [22].
4.2 For the AR(1) process
First of all, we studied time series of length n produced by AR(1) processes with autocorrelation at lag-1 k, for values of n ranging from 5 to 100 and k from 0.23 to 0.99. Fig 1A shows the proportion at which the Shapiro-Wilk test rejects normality at the 5% significance level depending on n and k.
Empirical proportion of rejections of the null hypothesis of the Shapiro-Wilk test of normality at the 5% significance level for AR(1) processes. p-values are calculated over 102 values of the Mann-Kendall statistic, and rejection proportions are based on 104 p-values. A: Rejection rate of time series of length generated by AR(1) processes of autocorrelation at lag-1
. Examples of lines for which
is constant are in black, for four values of
. B: Rejection rate of time series of length n generated by AR(1) processes for several values of
. The dotted line is the 5% significance threshold. If the true distribution of the Mann-Kendall tau is Gaussian for a given
, then the proportion of rejection should converge to the significance level as sample size (i.e., the number of tau-values) increases. Fig C-H present examples of empirical distributions of the normalized Mann-Kendall tau
for n = 50 and several values of
.
It can be seen that, for a fixed time series of length n, the distribution of the Mann-Kendall tau is not Gaussian if the autocorrelation parameter k of the generating AR(1) is too close to 1. Furthermore, the closer k is to 1, the larger n needs to be for the distribution of the Mann-Kendall tau to remain (approximately) Gaussian. This suggests that, for each k, there exists a minimum n above which the distribution of Mann-Kendall tau can be considered Gaussian.
As proved in Sect 3, the distribution of the Mann-Kendall tau of time series where is asymptotically non-zero cannot converge to a Gaussian. Lines where
is constant are added in black on Fig 1A for four examples of
values. Furthermore, the ranges of values for discriminating empirical rejection rates have been chosen to match the proportion of rejection associated with each
. Then, we see that the lines where
is constant are also the lines where the proportion of rejection of the Shapiro-Wilk test is constant. Thus,
is the right scaling to decide in practice whether the Gaussian approximation is justified.
Moreover, the closer is to 1 (
), the further the normality test rejection rate is from the significance level. This is the expected effect: larger autocorrelation drives the Mann-Kendall tau distribution away from the Gaussian distribution. However, if
is small enough, the proportion of rejection is approximately the significance level. This confirms that for small ktot, the distribution of the Mann-Kendall tau is well approximated by a Gaussian. Fig 1B presents rejection rates of the null hypothesis of the Shapiro-Wilk test depending on n, for several values of
. It is clear that the rejection rates are approximately constant for a given value of ktot, for any n. We also see that the rejection rates are very close to their final values for small values of n (typically for n > 10), making these results useful for short time series.
This numerical study validates the theoretical scaling obtained in Sect 3, but also provides practical values of for which the Gaussian approximation is not adequate. As the Shapiro-Wilk test is slightly conservative, rejection rates converge to values which are slightly below 5% when
goes to 0. Here, we see on Fig 1A and 1B that the proportion of rejection is equal to the significant threshold for
. Then, we propose to chose this as a criterion to decide whether the Gaussian approximation is justified. Examples of empirical distributions of the Mann-Kendall tau for several
values are shown in Fig 1C–1H. We see that for
, the Gaussian approximation does not seem justified. If
is close enough to 1, the empirical distribution is bimodal. We note that the distribution for intermediate values of
(see Fig 1E for example) is very similar to the one found by Hamed [7], who proposed the Beta distribution as a more accurate approximation.
Anyone who wants to use a modified Mann-Kendall test for autocorrelated data on a time series from a real system, and assumes that the underlying process is an AR(1) process with autocorrelation at lag-1 k can therefore estimate the total autocorrelation parameter and know if the Gaussian approximation is justified. This allows to decide whether modified Mann-Kendall test can be applied or not on the Mann-Kendall tau of the time series. For example, let’s consider a time series of length n = 20 produced by an AR(1) process with autocorrelation at lag-1 parameter k = 0.5. Then,
. According to the previous criterion,
is clearly too high to consider that the distribution of the Mann-Kendall tau of the time series of interest is Gaussian at the 5% significance level. Therefore, it is not reasonable to assume that the distribution of the Mann-Kendall tau is Gaussian and therefore to apply a test from the family of modified Mann-Kendall tests for autocorrelated data as they rely on this Gaussian assumption. In this case, these tests are not suitable for reliably detecting trends.
4.2.1 Case study.
We apply the practical criterion presented in the last section - that is to compare the value of to 10−8 - to time series from the literature. In particular, we focus on the three papers that have proposed modified Mann-Kendall tests for autocorrelated data [3,4,7]. The analyzed time series are taken from the hydrological literature.
The existence of a trend in a time series alters the estimate of the autocorrelation parameter [4]. Therefore, we follow the method proposed by Yue and Wang [4] and first remove any potential trend, using the non-parametric method of Theil [23] and Sen [24]. Then, the lag-1 autocorrelation coefficient is estimated on the detrended time series
[25]:
with the sample average.
As the results from Sect 3 concern AR(1) processes with positive autocorrelation parameters, we only keep time series which satisfy this condition. The length of the time series n, the estimated autocorrelation at lag-1 parameter , the upper bound of the 90% confidence interval (upper bound of the confidence interval for a one-sided test at the significance threshold of 5%)
, as well as the estimated total autocorrelation
of the upper bound, and the identifiers of the stations where the measurements were taken (see the cited articles for the full identification of the time series) are shown in Table 1. See the data availability section for the python implementation.
Comparing with the 10−8 threshold, we conclude that the Gaussian approximation for the Mann-Kendall tau is suitable for seven out of the fourteen time series. The time series concerned are indicated by a tick in the last column of Table 1. For the remaining seven time series, this implies that a portion of the
95% confidence interval does not satisfy the previously established criterion.
The length of the time series n, the estimated autocorrelation at lag-1 parameter , the upper bound of the 90% confidence interval (upper bound of the confidence interval for a one-sided test at the significance threshold of 5%)
, as well as the estimated total autocorrelation
of the upper bound
4.3 For the SMA process
We applied the same methodology to study time series of length n produced by SMA processes of order q, for values of n ranging from 5 to 150 and q from 5 to 50. Fig 2A shows the proportion of time series for which the Shapiro-Wilk test rejects normality at the 5% significance level as a function of n and q.
A: Empirical proportion of rejections of the null hypothesis of the Shapiro-Wilk test of normality at the 5% significance level for time series of length n and initial length N (see Example 2) generated by SMA processes of order q. Examples of lines for which the relative window size is constant are in black, for four examples of α. B: Rejection rate of time series of initial length N generated by SMA processes for several values of α. The dotted line is the 5% significance threshold. If the true distribution of the Mann-Kendall tau is Gaussian for a given α, then the proportion of rejection should converge to the significance level as sample size (i.e., the number of tau-values) increases. Fig C–F present examples of empirical distributions of the normalized Mann-Kendall tau
for several values of α when N = 100.
It can be seen that, for a time series of fixed length n, the distribution of the Mann-Kendall statistic is not Gaussian if the order q of the generating SMA process is too high. Furthermore, the higher q is, and the larger n needs to be, for the distribution of the Mann-Kendall tau to remain (approximately) Gaussian. This suggests that, for each q, there exists a minimum n above which the distribution of tau can be considered Gaussian.
As proved in Sect 3, the distribution of the Mann-Kendall tau of time series where the relative window size is asymptotically non-zero cannot converge to a Gaussian (see Example 2 for notations). Lines for which α is constant are added in black on Fig 2A for four α values.
Furthermore, the ranges of rejection rates for each colour band have been chosen to minimize the distances from the values of α plotted. Then, we see that the lines where α is constant are also the lines where the proportion of rejection of the Shapiro-Wilk test is constant. Thus, α is the right scaling to decide in practice whether the Gaussian approximation is justified.
Moreover, the closer α is to 1 (), the further the normality test rejection rate is from the significance level. This is the expected effect: larger window sizes introduce more autocorrelation, thus driving the Mann-Kendall tau distribution away from the Gaussian distribution. However, if α is small enough, the proportion of rejection is approximately the significance level. This confirms that for small α, the distribution of the Mann-Kendall tau is well approximated by a Gaussian.
Fig 2B presents rejection rates of the null hypothesis of the Shapiro-Wilk test depending on N, for several values of α. It is clear that the rejection rates are approximately constant for all values of N if α is fixed. Rejection rates would similarly be constant for all values of n if α is fixed. We also see that the rejection rates are very close to their final values for small values of N (typically for N > 10), making these results useful for short time series.
Therefore, this numerical study validates the theoretical scaling obtained in Sect 3, but also provides practical values of α for which the Gaussian approximation is not adequate. As the Shapiro-Wilk test is slightly conservative, rejection rates converge to values slightly below 5% (which would be the expected rate for independent data) when α goes to 0. Here, we see on Fig 2A and 2B that the proportion of rejection is equal to the significant threshold for %. Then, we propose to use this as a criterion to decide whether the Gaussian approximation is justified. Note that, for a relative window size of 10% or less, the Shapiro-Wilk test does not reject normality, but it does not prove either that the distribution is normal (see Discussion).
Examples of empirical distributions of the Mann-Kendall tau for several α values are shown in Fig 2C–2F. We see that for , the Gaussian approximation does not seem justified visually. If α is high enough, the empirical distribution is bimodal, very far from the Gaussian distribution. Distributions of the Mann-Kendall statistic are very similar to the one found for the autoregressive process, see Fig 1.
Anyone who wants to use a modified Mann-Kendall test on data produced by the averaging on windows of size q of an initial time series of length N from a real system can therefore estimate the relative window size and know if the Gaussian approximation is justified. This allows to decide whether modified Mann-Kendall test can be applied or not on the Mann-Kendall tau of the time series. For example, if considering the averaging of a time series of length N = 50 over rolling windows of size q = 20, then, the relative window size
and according to the previous criterion,
is too high to consider that the distribution of the Mann-Kendall tau of the time series of interest is Gaussian. Therefore, it is not reasonable to apply a test from the family of modified Mann-Kendall tests for autocorrelated data to reliably detect trends.
From the previous section, we conclude that the distribution of the Mann-Kendall tau for time series generated by SMA processes is not approximately Gaussian for relative window sizes of more than 10%, independently of the time series length.
Discussion
In this article, we demonstrate the existence of sequences of time series generated by stationary autoregressive AR(1) and simple moving average (SMA) processes for which the normalized Mann-Kendall tau distribution cannot be asymptotically Gaussian. Instead, it converges to a bounded distribution with strictly positive variance. This result suggests that the non-Gaussian nature of the distribution should emerge noticeably in finite-length time series with sufficient autocorrelation. We found in a numerical investigation that the parameters determining the variance in our asymptotic results are indeed the ones which determine whether the distribution of the Mann Kendall tau will be close to Gaussian or not in finite-length time series. To guide practical application, we provided easy-to-implement criteria which clarify when the Gaussian approximation is appropriate for tests applied to real data. Our numerical investigations indicate that these criteria remain relevant when applied to time series of a relatively small number of points.
For time series of length n generated by an AR(1) process with lag-1 parameter k, we showed theoretically that emerges naturally as the right scaling between k and n to reject the asymptotic Gaussian approximation. Based on these theoretical foundations, we selected the Shapiro-Wilk test for normality to numerically check that
is the correct scaling. We also proposed the practical threshold
to decide whether the distribution of the Mann-Kendall tau of time series is Gaussian or not. It corresponds to the (k,n) contour where the null hypothesis of a Gaussian distribution is rejected in 5% of tests. The criteria on autocorrelation of a timeseries can be checked using the code shared in the data accessibility Section.
Regarding the SMA process of order q, the correct scaling is which can naturally be interpreted as the relative window size of the moving average. Numerically, using the Shapiro-Wilk test for normality, we find that if α is larger than 10%, then a Gaussian distribution of the Mann-Kendall tau is rejected in over 5% of the tests. Therefore, we propose
% as a practical threshold to decide whether the distribution of the Mann-Kendall tau of time series is Gaussian or not. For real datasets, if one assumes that the data are resampled using a moving average but does not know the parameter α, one could investigate the shape of the autocorrelation function. Theoretically, its slope could be used to retrieve the value of the relative window size. However, further analysis is required to evaluate the power of this approach to detect the true value of the relative window size.
We illustrated these results with empirical distributions of the normalized Mann-Kendall tau for several values of and α. The distributions are very similar for all n, depending mainly on the value of
and α, which once again underlines the fact that the proposed scalings can be used to determine whether a distribution is Gaussian or not in finite-length time series. These findings fill a gap between intermediate-length observations, as discussed by [7], and different asymptotic results, which predict a Gaussian distribution in cases where autocorrelation is present in the data but
converges to zero in long time series.
Nevertheless, our results should be treated with some caution. We simulated the stochastic processes assuming error terms and initial observations sampled from independent Gaussian variables. This does not need to be the case for observed time series. We have not studied the degree to which the various Mann-Kendall tests might be robust with respect to such deviations from the normality assumptions on error terms and initial observations. Future work could investigate whether our results hold when relaxing the assumption that the underlying process is Gaussian, which was necessary for Eq (4). Additionally, deriving Berry-Esseen bounds for the Mann-Kendall tau across different types of autocorrelated processes would enhance understanding, as these bounds quantify the accuracy of the Gaussian approximation. In time series where in the case of AR(1) processes and where
for SMA processes, however, we don’t immediately see how such deviations might suddenly generate statistics which follow a Gaussian distribution. In time series where
and where
, this might lead to additional cases where statistics are not Gaussian. Furthermore, the sample size for the Shapiro-Wilk test was arbitrarily set to 102 time series. Changing the sample size could slightly change the rejection rates of the null hypothesis; however, an exploratory analysis showed that this has minimal impact on our results and the identified thresholds.
Although Lemma 1 is broadly applicable, our focus has been on ARMA processes due to their widespread use in scientific fields such as ecology [26], hydrology [25], and finance [27]. Autoregressive models, particularly the AR (1) process, are fundamental for time series analysis, modeling, and forecasting, with frequent applications in hydrology [4]. In contrast, moving-average models are often used for noise reduction in time series, as well as for modeling purposes. Among different statistical techniques for forecasting and trend detection involving regression methods, time series methods and stochastic processes (e.g., [28–30]), the Mann-Kendall statistic can be used to test the null hypothesis of no trend without fitting a specific time series or non-linear model. Mann-Kendall statistics with Gaussian distributions were recently recommended to this end [31]. We believe that our findings could encourage preliminarily testing the normality assumption before applying Mann-Kendall tests for trend detection. Consequently, our results on the importance of autocorrelations may have broader relevance, particularly when testing for the presence of critical transitions in time series (also known as early warning signals), where methodology involves averaging, which may introduce artificial autocorrelation. We plan to explore these implications further in a forthcoming paper.
Acknowledgments
The authors are very grateful to Michael Kopp and Jean-René Chazottes for helpful discussions and for providing useful comments on various versions of this manuscript.
References
- 1. Mann HB. Nonparametric tests against trend. Econometrica: Journal of the Econometric Society. 1945;245–59.
- 2.
Kendall MG. Rank correlation methods. 4th ed. Griffin; 1975.
- 3. Hamed KH, Ramachandra Rao A. A modified Mann-Kendall trend test for autocorrelated data. Journal of Hydrology. 1998;204(1–4):182–96.
- 4. Yue S, Wang C. The Mann-Kendall test modified by effective sample size to detect trend in serially correlated hydrological series. Water Resources Management. 2004;18(3):201–18.
- 5. Yoshihara K. Limiting behavior of U-statistics for stationary, absolutely regular processes. Z Wahrscheinlichkeitstheorie verw Gebiete. 1976;35(3):237–52.
- 6. Sen PK. On the properties of U-statistics when the observations are not independent: part one estimation of non-serial parameters in some stationary stochastic process. Calcutta Statistical Association Bulletin. 1963;12(3):69–92.
- 7. Hamed KH. Exact distribution of the Mann–Kendall trend test statistic for persistent data. Journal of Hydrology. 2009;365(1–2):86–94.
- 8. Lindsey J. Some statistical heresies (with discussion). Journal of the Royal Statistical Society Series D: The Statistician. 1999;48(1):1–40.
- 9. RA F. The logic of inductive inference (with discussion). Journal of the Royal Statistical Society. 1935;98:39–82.
- 10. Dufour J-M, Kiviet JF. Exact inference methods for first-order autoregressive distributed lag models. Econometrica. 1998;66(1):79.
- 11. Dakos V, Carpenter SR, Brock WA, Ellison AM, Guttal V, Ives AR, et al. Methods for detecting early warnings of critical transitions in time series illustrated using simulated ecological data. PLoS One. 2012;7(7):e41010. pmid:22815897
- 12. Kendall MG. A new measure of rank correlation. Biometrika. 1938;30(1/2):81.
- 13. Greiner R. Uber das fehlersystem der kollektivmasslehre. Zeitschrift für Mathematik und Physik. 1909;57:121–58.
- 14. Phillips PCB. Towards a unified asymptotic theory for autoregression. Biometrika. 1987;74(3):535–47.
- 15. Hoeffding W. A class of statistics with asymptotically normal distribution. Ann Math Statist. 1948;19(3):293–325.
- 16.
Chen X. Kendall’s Tau as the test for trend in time series data. Acadia University; 2012.
- 17. Berry AC. The accuracy of the Gaussian approximation to the sum of independent variates. Trans Amer Math Soc. 1941;49(1):122–36.
- 18.
Chen LH, Shao QM. Normal approximation for nonlinear statistics using a concentration inequality approach. 2007.
- 19.
Bentkus V, Götze F, Tikhomirov A. Berry-Esseen bounds for statistics of weakly dependent samples. 1997.
- 20. Shapiro SS, Wilk MB. An analysis of variance test for normality (complete samples). Biometrika. 1965;52(3/4):591.
- 21. Razali NM, Wah YB. Power comparisons of Shapiro-Wilk, Kolmogorov-Smirnov, Lilliefors and Anderson-Darling tests. Journal of Statistical Modeling and Analytics. 2011;2(1):21–33.
- 22. Royston P. Remark AS R94: a remark on algorithm AS 181: the W-test for normality. Applied Statistics. 1995;44(4):547.
- 23. Theil H. A rank-invariant method of linear and polynomial regression analysis. Indagationes mathematicae. 1950;12(85).
- 24. Sen PK. Estimates of the regression coefficient based on Kendall’s Tau. Journal of the American Statistical Association. 1968;63(324):1379–89.
- 25.
Salas JD. Applied modeling of hydrologic time series. Water Resources Publication; 1980.
- 26. Ives AR, Abbott KC, Ziebarth NL. Analysis of ecological time series with ARMA(p,q) models. Ecology. 2010;91(3):858–71. pmid:20426343
- 27.
Tsay RS. Analysis of financial time series. Wiley; 2005.
- 28. Lehnertz K. Time-series-analysis-based detection of critical transitions in real-world non-autonomous systems. Chaos. 2024;34(7):072102. pmid:38985967
- 29. Taylor SJ, Letham B. Forecasting at scale. The American Statistician. 2018;72(1):37–45.
- 30. Tsay RS. Testing and modeling threshold autoregressive processes. Journal of the American Statistical Association. 1989;84(405):231–40.
- 31. Chen S, Ghadami A, Epureanu BI. Practical guide to using Kendall’s τ in the context of forecasting critical transitions. R Soc Open Sci. 2022;9(7):211346. pmid:35911200