On the Gaussian distribution of the Mann-Kendall tau in the case of autocorrelated data

Tristan Gamot; Nils Thibeau--Sutre; Tom J. M. Van Dooren

doi:10.1371/journal.pone.0333224

Abstract

Non-parametric Mann-Kendall tests for autocorrelated data rely on the assumption that the distribution of the normalized Mann-Kendall tau is Gaussian. While this assumption holds asymptotically for stationary autoregressive processes of order 1 (AR(1)) and simple moving average (SMA) processes when sampling over an increasingly long period, it often fails for finite-length time series. In such cases, the empirical distribution of the Mann-Kendall tau deviates significantly from the Gaussian distribution. To assess the validity of this assumption, we explore an alternative asymptotic framework for AR(1) and SMA processes. We prove that, along upsampling sequences, the distribution of the normalized Mann-Kendall tau does not converge to a Gaussian but instead to a bounded distribution with strictly positive variance. This asymptotic behavior suggests scaling laws which determine the conditions under which the Gaussian approximation remains valid for finite-length time series generated by stationary AR(1) and SMA processes. Using Shapiro-Wilk tests, we numerically confirm the departure from normality and establish simple, practical criteria for assessing the validity of the Gaussian assumption, which depend on both the autocorrelation structure and the series length. Finally, we illustrate these findings with examples from existing studies.

Citation: Gamot T, Thibeau--Sutre N, Van Dooren TJM (2026) On the Gaussian distribution of the Mann-Kendall tau in the case of autocorrelated data. PLoS One 21(2): e0333224. https://doi.org/10.1371/journal.pone.0333224

Editor: Sheng Du, China University of Geosciences, CHINA

Received: September 8, 2025; Accepted: January 24, 2026; Published: February 19, 2026

Copyright: © 2026 Gamot et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The code for reproducing figures is accessible at https://doi.org/10.5281/zenodo.18414688.

Funding: AAP2023 FIRE mini-grant.

Competing interests: The authors have declared that no competing interests exist.

1 Introduction

The Mann-Kendall test is a non-parametric statistical method designed to assess whether a time series exhibits a monotonic trend, based on the Mann-Kendall tau statistic. Initially introduced by Henry B. Mann [1] and later refined by Maurice G. Kendall [2], the test leverages the fact that for independent and identically distributed (i.i.d.) data, the distribution of the Mann-Kendall tau normalized by its variance converges asymptotically to a Gaussian distribution.

To address scenarios where the data are not i.i.d., extensions have been developed for autocorrelated datasets [3,4]. These modifications incorporate adjustments to account for serial correlation; however, they do not establish asymptotic normality and instead proceed under the assumption that this property remains valid.

When considering autoregressive processes of order 1 (AR(1)) and simple moving average processes of order q (SMA(q)) with fixed parameters, classical results on the Central Limit Theorem for U-statistics applied to α-mixing processes [5] and m-dependent processes [6] establish that the normalize’s asymptotic distribution is Gaussian. Yet, when the lag-1 autocorrelation parameter or the order of the moving average is high relative to the finite length of a time series, the empirical distribution of the normalized Mann-Kendall statistic is far from Gaussian (see for example [7]). This is a key observation as, in practice, the family of Mann-Kendall tests are applied to autocorrelated time series of finite length and Gaussian distributions are used as an approximation. For instance, modified versions of the original test for autocorrelated data are widely used in hydrological studies that typically involve time series with several dozen to hundreds of data points [3,4,7]. So how can one determine whether the Gaussian approximation is justified? Since the Mann-Kendall tau is based on pairwise comparisons of all random variables in a sequence, this question requires analyzing asymptotic regimes where the density of pairs with non-negligible dependence is non-zero. To explore this, we examine specific sequences of time series of increasing length generated by AR(1) or SMA(q) processes, as defined in Sect 2. The relevance of asymptotic results remains a point of discussion in statistics (e.g. [8]) and exact and Monte Carlo methods have been proposed as general remedies for hypothesis testing on finite samples (e.g. [9,10]). However, in the context of time series, there is scope for different asymptotic results than the ones usually envisaged. Here we will investigate sequences differing from the ones investigated previously where the limits converged to Gaussian distributions. Each time series in the investigated sequences is generated by a stationary process whose parameters depend on the series length and, therefore, on its position within the sequence. For the AR(1) case, this sequence of time series corresponds to refining the sampling of a continuous Ornstein-Uhlenbeck process within a fixed time window - i.e. upsampling - which increases autocorrelation. Regarding the SMA(q) process, these sequences amount to averaging increasingly larger samples of a white noise process while maintaining constant relative window size. Such averages are used to construct statistics given the null model when testing for the presence of critical transitions in time series - also known as early warning signals [11]. In Sect 3, we prove that the asymptotic distribution of the normalized Mann-Kendall tau of these two types of times series cannot be Gaussian. These proofs suggest natural scalings for deciding whether the Gaussian approximation is suitable for time series of finite length generated by AR(1) or SMA(q) processes. Finally, Sect 4 numerically illustrates the departure from Gaussian behavior using the Shapiro-Wilk test for normality and confirms that these scalings are appropriate. We can therefore provide easy-to-implement criteria, for given values of parameters and time series length, to decide whether the Gaussian approximation - and hence the Mann-Kendall tests - may be appropriate or not for a time series.

2 Assumptions and examples

Let X be a random variable, and X_i, , be n random variables having the same distribution as X. The Mann-Kendall tau for the time series is defined as [1,12]:

(1)

where and .

We assume that the distribution of X is such that there are no ties. For simplicity, we denote the Mann-Kendall tau for the sequence by . This non-parametric statistic is a special case of Kendall’s rank correlation coefficient and is used for detecting monotonic trends. It ranges from -1 (strictly decreasing trend) to +1 (strictly increasing trend).

If the X_i are independent, Kendall [2] proved that:

(2)

where is the variance of the random variable , is the standard Gaussian random distribution and stands for convergence in distribution. Let us stress that it is the normalized random variable which converges in distribution towards a Gaussian and not (which is bounded between –1 and +1).

Let us now consider the case where the X_i are not independent and are Gaussian random variables. The variance of the Mann-Kendall tau simplifies as follows using Greiner’s equality [13]:

(3)

(4)

where is the expectation of the random variable and denotes the Pearson correlation coefficient.

In this paper, we only consider sequences of identically distributed Gaussian random variables (for a fixed n) that verify the following property:

Assumption 1 (Correlation function). such that, , the sequence satisfies

(5)

We then call the autocorrelation function of the sequence . Note that, if , and , then Assumption 1 means that the sequence is weak-sense (or wide-sense) stationary. In the case of Gaussian random variables, weak stationarity is equivalent to strict stationarity. This condition is satisfied by all examples considered in this article.

In the remainder of this section, we introduce an assumption on the existence of a renormalized asymptotic autocorrelation in a sequence of time series. This enables us to derive an easy-to-handle expression for the asymptotic variance of the Mann-Kendall tau of these time series.

Assumption 2 (Correlation function renormalization). such that, and , the sequence of sequences satisfies

(6)

where, for all positive integers is a sequence of n random variables.

In this article, we choose sequences of identically distributed Gaussian random variables whose correlation depends on the length n of the sequence, hence the superscript ⁽ⁿ⁾. Then, random variables from two sequences and of length n and m (with ) have different correlation functions. Hereafter we introduce the two classes of sequences that we have in mind for considering Assumptions 1 and 2:

Example 1 (Autoregressive process of order 1 AR(1)). Let such that:

where, for a given n, the are independent and identically distributed Gaussian random variables: .

Then for the sequence of sequences , Assumptions 1 and 2 are true. In particular, . Note that we only consider k > 0 with this renormalisation.

For instance, if , the first elements of the sequence of sequences are given by:

where ,
where and where ,
where , , and , where ,
where , , , and , where ,
etc.

Example 1 presents sequences of time series of length n generated by AR(1) processes with increasing autocorrelation at lag-1 . The time series are comparable to upsampling an Ornstein–Uhlenbeck process over [0,1] as described in [14] (§5) with the sequence representing a regular subdivision of this process with steps of . For example, starting with two values sampled at the edges of the interval, that is at time 0 and 1, the autocorrelation between these is equal to . If one subdivides the interval into n > 2 uniformly spaced samples, then the autocorrelation between two successive values has to be so that the autocorrelation between the first and last values remains . So this case arises naturally when increasing the sampling of the same experiment of finite duration.

Example 2 (Simple moving average SMA). Let , where is the floor function, and such that:

where the are independent and identically distributed Gaussian random variables: .

Then for the sequence , Assumptions 1 and 2 are true. In particular, .

A standard parameter characterizing moving average processes is the relative window size. As an example, starting with a dataset of N points and averaging by groups of q contiguous points creates a moving average dataset of length with a relative window size . Then, if the window size q depends on n and , the relative window size is asymptotically .

Example 2 presents a sequence of time series generated by SMA processes whose relative window size tends towards a constant when the length of the time series goes to infinity. This type of time series arises naturally when considering averaging over windows of fixed relative size. For example, when testing for the presence of critical transitions in time series (also known as early warning signals), a methodology involves averaging over a constant fraction of the time series [11].

Finally, following the definition by [15], the Mann-Kendall tau can be defined as a U-statistic with a non-symmetric kernel of degree 2 [16]. Classical results on the Central Limit Theorem for U-statistics applied to α-mixing processes [5] and m-dependent processes [6] establish that the asymptotic distribution of the Mann-Kendall tau is Gaussian for AR(1) and MA(q) processes with fixed lag-1 autocorrelation parameter k for AR(1) processes and order q of SMA processes. However, in Examples 1 and 2, we are considering sequences of time series generated by processes whose parameters depends on the length of the time series. For the example cases, we will prove that the distribution of the normalized Mann-Kendall tau cannot be asymptotically Gaussian.

3 Asymptotic variance of the Mann-Kendall tau for renormalized ARMA process

In this section, we delve deeper into the two classes of examples introduced, building upon Assumptions 1 and 2 to derive a key lemma for calculating the variance of the Mann-Kendall tau. All proofs are provided in S1 Appendix.

As before, let’s consider a sequence of identically distributed Gaussian random variables. Then, under Assumption 1, we obtain:

(7)

(8)

Lemma 1 (Renormalization). Suppose Assumptions 1 and 2 hold for a sequence of sequences . For a given n, the are identically distributed Gaussian random variables.

Let and . Then:

(9)

where is the Mann-Kendall tau of the sequence and .

In the field of time series analysis, stationary autoregressive moving average (ARMA) processes are often considered. They are composed of an autoregressive part and a moving average part. Since these latter processes are the examples we are interested in, we introduce the more general class of ARMA processes.

Definition 1 (ARMA process). An autoregressive moving average process of order (p,q) (ARMA(p,q)) is a discrete temporal process () such that:

(10)

where k_j and are the parameters of the model and the are the error terms (white noise).

An autoregressive process (p) is an (p,0).

A moving average (q) is an (0,q).

In the following, we will only consider p = 1, 0 < k < 1, i.i.d. Gaussian random variables and all the equal to one (the simple moving average, SMA).

In the definition of the ARMA process, the noise is chosen as Gaussian and so the ARMA process is a Gaussian process. Moreover, we choose the variance of such that for the AR(1) process, .

Lemma 2 (Correlation for the ARMA process). Let () follow an ARMA(1,q–1) process such that:

(11)

Let and d = j − i, then:

If d < q–1

(12)

If

(13)

Let 0 < k < 1, a > 0, , and let follow an ARMA(1,q_n) process of parameter k_n with and . Then Assumption 2 holds, and we can use Lemma 1 and Lemma 2 to obtain the following theorem:

Theorem 1 (Renormalized correlation function for the ARMA process). Let a > 0 and . For , let follow an ARMA(1,q_n) process of parameter k_n and such that , then:

From Theorem 1, we deduce results for the AR(1) and SMA(q) cases.

Corollary 1.1 (Asymptotic variance for the AR(1) process). Let be a sequence of random variables following an AR(1) process of parameter k_n such that with . Then, and:

Corollary 1.2 (Asymptotic variance for the SMA(q) process). Let be a sequence of random variables following a SMA(q_n) process with . Then, .

If

If

In particular, in Corollaries 1.1 and 1.2, cannot converge in distribution to a Gaussian. Indeed, let’s assume that . Then, as is bounded between –1 and +1, the support of the distribution of is uniformly bounded for all and so the normalized Mann-Kendall tau cannot converge to a Gaussian.

A key result regarding a specific sum of arcsin terms, which is essential for calculating the variance of the Mann-Kendall tau in the context of SMA processes, is presented in the following Proposition:

Proposition 1.

In this section, we demonstrated that the asymptotic (as ) variance of the Mann-Kendall tau is strictly positive for sequences of time series of length n generated by the following processes:

an AR(1) process with autocorrelation at lag-1 parameter k_n increasing towards 1 as in the following manner: with independent of n.
an SMA(q_n) process with parameter , which corresponds to an asymptotic relative window size .

Consequently, the normalized Mann-Kendall tau of these time series cannot converge to a Gaussian.

4 Checking for non-normality

In practical applications, one deals with finite time series produced by AR(1) or SMA(q) processes with usually constant parameter values k or q. In that case, we have seen at the end of Sect 2 that the asymptotic distribution of the Mann-Kendall tau is Gaussian. However, this asymptotic result is less relevant when it is needed to decide whether a statistic calculated on a finite time series would approximately follow a Gaussian distribution or not. It is possible to determine the parameters or α so that the time series would have been produced by processes described in Examples 1 and 2. For such processes we know that these parameters determine the asymptotic variance associated with increasing upsampling. For example, a time series of length n produced by an AR(1) of correlation at lag-1 k can be identified as a time series of length n described by Example 1, where . If is asymptotically (as ) non-zero, we have proved that the distribution of the Mann-Kendall tau of this time series cannot converge to a Gaussian distribution. Consequently, our results suggest that, for time series of finite length, assuming normality of the Mann-Kendall statistic is not appropriate when autocorrelation is strong relative to the length of the time series, and this also limits the applicability of the modified Mann-Kendall tests. Therefore, it is essential to understand how the distribution of the Mann-Kendall tau deviates from normality for specific parameter values and time series lengths.

Deriving Berry-Esseen bounds for the Mann-Kendall tau across different types of autocorrelated processes would help us understanding how the distribution of the Mann-Kendall tau deviates from normality for specific parameter values and time series lengths, as these bounds quantify the accuracy of the Gaussian approximation [17]. For instance, uniform and non-uniform bounds have been established for U-statistics with symmetric kernels in the case of independent samples [18] and weakly dependent samples [19]. However, these bounds depend on both sample size n and a constant term influenced by autocorrelation, which lacks a clear expression. It makes it impractical as a theoretical criterion for when to use the family of Mann-Kendall tests for autocorrelated processes such as the AR(1) process. Furthermore, while the Kendall tau is a U-statistic with a symmetric kernel, the Mann-Kendall tau has a non-symmetric kernel [16], limiting the applicability of many classical results. Therefore, we conducted a numerical investigation on the Mann-Kendall tau distribution for time series of length n generated by AR(1) and SMA(q) processes, with varied levels of autocorrelation (parameter k) and window size (parameter q). We investigated whether isolines of the values of parameters or α on the spaces delineate regions where the distribution of the Mann-Kendall tau is close to Gaussian or not.

4.1 Numerical investigation

For each combination of parameters and length of time series, we computed the Mann-Kendall tau of 10² different time series to find the empirical distribution of tau. To evaluate if these empirical distributions are roughly Gaussian, we used the Shapiro-Wilk test, which tests the null hypothesis that the population is an i.i.d. Gaussian sample (with unknown expectation and variance) [20]. We compared the rejection rate (computed over 10⁴ p-values) of this null hypothesis to the predetermined significance level. If the true distribution of our simulated tau-values is, indeed, Gaussian, then the proportion of rejections should converge to the significance level as sample size (i.e., the number of tau-values) increases. We used the Shapiro-Wilk test because it is more powerful than other classic normality tests [21], utilizing the stats.shapiro Python implementation from the SciPy package [22].

4.2 For the AR(1) process

First of all, we studied time series of length n produced by AR(1) processes with autocorrelation at lag-1 k, for values of n ranging from 5 to 100 and k from 0.23 to 0.99. Fig 1A shows the proportion at which the Shapiro-Wilk test rejects normality at the 5% significance level depending on n and k.

Download:

Fig 1. Deviations from the Gaussian distribution for AR(1) processes.

Empirical proportion of rejections of the null hypothesis of the Shapiro-Wilk test of normality at the 5% significance level for AR(1) processes. p-values are calculated over 10² values of the Mann-Kendall statistic, and rejection proportions are based on 10⁴ p-values. A: Rejection rate of time series of length generated by AR(1) processes of autocorrelation at lag-1 . Examples of lines for which is constant are in black, for four values of . B: Rejection rate of time series of length n generated by AR(1) processes for several values of . The dotted line is the 5% significance threshold. If the true distribution of the Mann-Kendall tau is Gaussian for a given , then the proportion of rejection should converge to the significance level as sample size (i.e., the number of tau-values) increases. Fig C-H present examples of empirical distributions of the normalized Mann-Kendall tau for n = 50 and several values of .

https://doi.org/10.1371/journal.pone.0333224.g001

It can be seen that, for a fixed time series of length n, the distribution of the Mann-Kendall tau is not Gaussian if the autocorrelation parameter k of the generating AR(1) is too close to 1. Furthermore, the closer k is to 1, the larger n needs to be for the distribution of the Mann-Kendall tau to remain (approximately) Gaussian. This suggests that, for each k, there exists a minimum n above which the distribution of Mann-Kendall tau can be considered Gaussian.

As proved in Sect 3, the distribution of the Mann-Kendall tau of time series where is asymptotically non-zero cannot converge to a Gaussian. Lines where is constant are added in black on Fig 1A for four examples of values. Furthermore, the ranges of values for discriminating empirical rejection rates have been chosen to match the proportion of rejection associated with each . Then, we see that the lines where is constant are also the lines where the proportion of rejection of the Shapiro-Wilk test is constant. Thus, is the right scaling to decide in practice whether the Gaussian approximation is justified.

Moreover, the closer is to 1 (), the further the normality test rejection rate is from the significance level. This is the expected effect: larger autocorrelation drives the Mann-Kendall tau distribution away from the Gaussian distribution. However, if is small enough, the proportion of rejection is approximately the significance level. This confirms that for small k_tot, the distribution of the Mann-Kendall tau is well approximated by a Gaussian. Fig 1B presents rejection rates of the null hypothesis of the Shapiro-Wilk test depending on n, for several values of . It is clear that the rejection rates are approximately constant for a given value of k_tot, for any n. We also see that the rejection rates are very close to their final values for small values of n (typically for n > 10), making these results useful for short time series.

This numerical study validates the theoretical scaling obtained in Sect 3, but also provides practical values of for which the Gaussian approximation is not adequate. As the Shapiro-Wilk test is slightly conservative, rejection rates converge to values which are slightly below 5% when goes to 0. Here, we see on Fig 1A and 1B that the proportion of rejection is equal to the significant threshold for . Then, we propose to chose this as a criterion to decide whether the Gaussian approximation is justified. Examples of empirical distributions of the Mann-Kendall tau for several values are shown in Fig 1C–1H. We see that for , the Gaussian approximation does not seem justified. If is close enough to 1, the empirical distribution is bimodal. We note that the distribution for intermediate values of (see Fig 1E for example) is very similar to the one found by Hamed [7], who proposed the Beta distribution as a more accurate approximation.

Anyone who wants to use a modified Mann-Kendall test for autocorrelated data on a time series from a real system, and assumes that the underlying process is an AR(1) process with autocorrelation at lag-1 k can therefore estimate the total autocorrelation parameter and know if the Gaussian approximation is justified. This allows to decide whether modified Mann-Kendall test can be applied or not on the Mann-Kendall tau of the time series. For example, let’s consider a time series of length n = 20 produced by an AR(1) process with autocorrelation at lag-1 parameter k = 0.5. Then, . According to the previous criterion, is clearly too high to consider that the distribution of the Mann-Kendall tau of the time series of interest is Gaussian at the 5% significance level. Therefore, it is not reasonable to assume that the distribution of the Mann-Kendall tau is Gaussian and therefore to apply a test from the family of modified Mann-Kendall tests for autocorrelated data as they rely on this Gaussian assumption. In this case, these tests are not suitable for reliably detecting trends.

4.2.1 Case study.

We apply the practical criterion presented in the last section - that is to compare the value of to 10⁻⁸ - to time series from the literature. In particular, we focus on the three papers that have proposed modified Mann-Kendall tests for autocorrelated data [3,4,7]. The analyzed time series are taken from the hydrological literature.

The existence of a trend in a time series alters the estimate of the autocorrelation parameter [4]. Therefore, we follow the method proposed by Yue and Wang [4] and first remove any potential trend, using the non-parametric method of Theil [23] and Sen [24]. Then, the lag-1 autocorrelation coefficient is estimated on the detrended time series [25]:

(14)

with the sample average.

As the results from Sect 3 concern AR(1) processes with positive autocorrelation parameters, we only keep time series which satisfy this condition. The length of the time series n, the estimated autocorrelation at lag-1 parameter , the upper bound of the 90% confidence interval (upper bound of the confidence interval for a one-sided test at the significance threshold of 5%) , as well as the estimated total autocorrelation of the upper bound, and the identifiers of the stations where the measurements were taken (see the cited articles for the full identification of the time series) are shown in Table 1. See the data availability section for the python implementation.

Download:

Table 1. Validity of the Gaussian approximation for example time series.

https://doi.org/10.1371/journal.pone.0333224.t001

Comparing with the 10⁻⁸ threshold, we conclude that the Gaussian approximation for the Mann-Kendall tau is suitable for seven out of the fourteen time series. The time series concerned are indicated by a tick in the last column of Table 1. For the remaining seven time series, this implies that a portion of the 95% confidence interval does not satisfy the previously established criterion.

The length of the time series n, the estimated autocorrelation at lag-1 parameter , the upper bound of the 90% confidence interval (upper bound of the confidence interval for a one-sided test at the significance threshold of 5%) , as well as the estimated total autocorrelation of the upper bound

4.3 For the SMA process

We applied the same methodology to study time series of length n produced by SMA processes of order q, for values of n ranging from 5 to 150 and q from 5 to 50. Fig 2A shows the proportion of time series for which the Shapiro-Wilk test rejects normality at the 5% significance level as a function of n and q.

Download:

Fig 2. Deviations from the Gaussian distribution for SMA processes.

A: Empirical proportion of rejections of the null hypothesis of the Shapiro-Wilk test of normality at the 5% significance level for time series of length n and initial length N (see Example 2) generated by SMA processes of order q. Examples of lines for which the relative window size is constant are in black, for four examples of α. B: Rejection rate of time series of initial length N generated by SMA processes for several values of α. The dotted line is the 5% significance threshold. If the true distribution of the Mann-Kendall tau is Gaussian for a given α, then the proportion of rejection should converge to the significance level as sample size (i.e., the number of tau-values) increases. Fig C–F present examples of empirical distributions of the normalized Mann-Kendall tau for several values of α when N = 100.

https://doi.org/10.1371/journal.pone.0333224.g002

It can be seen that, for a time series of fixed length n, the distribution of the Mann-Kendall statistic is not Gaussian if the order q of the generating SMA process is too high. Furthermore, the higher q is, and the larger n needs to be, for the distribution of the Mann-Kendall tau to remain (approximately) Gaussian. This suggests that, for each q, there exists a minimum n above which the distribution of tau can be considered Gaussian.

As proved in Sect 3, the distribution of the Mann-Kendall tau of time series where the relative window size is asymptotically non-zero cannot converge to a Gaussian (see Example 2 for notations). Lines for which α is constant are added in black on Fig 2A for four α values.

Furthermore, the ranges of rejection rates for each colour band have been chosen to minimize the distances from the values of α plotted. Then, we see that the lines where α is constant are also the lines where the proportion of rejection of the Shapiro-Wilk test is constant. Thus, α is the right scaling to decide in practice whether the Gaussian approximation is justified.

Moreover, the closer α is to 1 (), the further the normality test rejection rate is from the significance level. This is the expected effect: larger window sizes introduce more autocorrelation, thus driving the Mann-Kendall tau distribution away from the Gaussian distribution. However, if α is small enough, the proportion of rejection is approximately the significance level. This confirms that for small α, the distribution of the Mann-Kendall tau is well approximated by a Gaussian.

Fig 2B presents rejection rates of the null hypothesis of the Shapiro-Wilk test depending on N, for several values of α. It is clear that the rejection rates are approximately constant for all values of N if α is fixed. Rejection rates would similarly be constant for all values of n if α is fixed. We also see that the rejection rates are very close to their final values for small values of N (typically for N > 10), making these results useful for short time series.

Therefore, this numerical study validates the theoretical scaling obtained in Sect 3, but also provides practical values of α for which the Gaussian approximation is not adequate. As the Shapiro-Wilk test is slightly conservative, rejection rates converge to values slightly below 5% (which would be the expected rate for independent data) when α goes to 0. Here, we see on Fig 2A and 2B that the proportion of rejection is equal to the significant threshold for %. Then, we propose to use this as a criterion to decide whether the Gaussian approximation is justified. Note that, for a relative window size of 10% or less, the Shapiro-Wilk test does not reject normality, but it does not prove either that the distribution is normal (see Discussion).

Examples of empirical distributions of the Mann-Kendall tau for several α values are shown in Fig 2C–2F. We see that for , the Gaussian approximation does not seem justified visually. If α is high enough, the empirical distribution is bimodal, very far from the Gaussian distribution. Distributions of the Mann-Kendall statistic are very similar to the one found for the autoregressive process, see Fig 1.

Anyone who wants to use a modified Mann-Kendall test on data produced by the averaging on windows of size q of an initial time series of length N from a real system can therefore estimate the relative window size and know if the Gaussian approximation is justified. This allows to decide whether modified Mann-Kendall test can be applied or not on the Mann-Kendall tau of the time series. For example, if considering the averaging of a time series of length N = 50 over rolling windows of size q = 20, then, the relative window size and according to the previous criterion, is too high to consider that the distribution of the Mann-Kendall tau of the time series of interest is Gaussian. Therefore, it is not reasonable to apply a test from the family of modified Mann-Kendall tests for autocorrelated data to reliably detect trends.

From the previous section, we conclude that the distribution of the Mann-Kendall tau for time series generated by SMA processes is not approximately Gaussian for relative window sizes of more than 10%, independently of the time series length.

Discussion

In this article, we demonstrate the existence of sequences of time series generated by stationary autoregressive AR(1) and simple moving average (SMA) processes for which the normalized Mann-Kendall tau distribution cannot be asymptotically Gaussian. Instead, it converges to a bounded distribution with strictly positive variance. This result suggests that the non-Gaussian nature of the distribution should emerge noticeably in finite-length time series with sufficient autocorrelation. We found in a numerical investigation that the parameters determining the variance in our asymptotic results are indeed the ones which determine whether the distribution of the Mann Kendall tau will be close to Gaussian or not in finite-length time series. To guide practical application, we provided easy-to-implement criteria which clarify when the Gaussian approximation is appropriate for tests applied to real data. Our numerical investigations indicate that these criteria remain relevant when applied to time series of a relatively small number of points.

For time series of length n generated by an AR(1) process with lag-1 parameter k, we showed theoretically that emerges naturally as the right scaling between k and n to reject the asymptotic Gaussian approximation. Based on these theoretical foundations, we selected the Shapiro-Wilk test for normality to numerically check that is the correct scaling. We also proposed the practical threshold to decide whether the distribution of the Mann-Kendall tau of time series is Gaussian or not. It corresponds to the (k,n) contour where the null hypothesis of a Gaussian distribution is rejected in 5% of tests. The criteria on autocorrelation of a timeseries can be checked using the code shared in the data accessibility Section.

Regarding the SMA process of order q, the correct scaling is which can naturally be interpreted as the relative window size of the moving average. Numerically, using the Shapiro-Wilk test for normality, we find that if α is larger than 10%, then a Gaussian distribution of the Mann-Kendall tau is rejected in over 5% of the tests. Therefore, we propose % as a practical threshold to decide whether the distribution of the Mann-Kendall tau of time series is Gaussian or not. For real datasets, if one assumes that the data are resampled using a moving average but does not know the parameter α, one could investigate the shape of the autocorrelation function. Theoretically, its slope could be used to retrieve the value of the relative window size. However, further analysis is required to evaluate the power of this approach to detect the true value of the relative window size.

We illustrated these results with empirical distributions of the normalized Mann-Kendall tau for several values of and α. The distributions are very similar for all n, depending mainly on the value of and α, which once again underlines the fact that the proposed scalings can be used to determine whether a distribution is Gaussian or not in finite-length time series. These findings fill a gap between intermediate-length observations, as discussed by [7], and different asymptotic results, which predict a Gaussian distribution in cases where autocorrelation is present in the data but converges to zero in long time series.

Nevertheless, our results should be treated with some caution. We simulated the stochastic processes assuming error terms and initial observations sampled from independent Gaussian variables. This does not need to be the case for observed time series. We have not studied the degree to which the various Mann-Kendall tests might be robust with respect to such deviations from the normality assumptions on error terms and initial observations. Future work could investigate whether our results hold when relaxing the assumption that the underlying process is Gaussian, which was necessary for Eq (4). Additionally, deriving Berry-Esseen bounds for the Mann-Kendall tau across different types of autocorrelated processes would enhance understanding, as these bounds quantify the accuracy of the Gaussian approximation. In time series where in the case of AR(1) processes and where for SMA processes, however, we don’t immediately see how such deviations might suddenly generate statistics which follow a Gaussian distribution. In time series where and where , this might lead to additional cases where statistics are not Gaussian. Furthermore, the sample size for the Shapiro-Wilk test was arbitrarily set to 10² time series. Changing the sample size could slightly change the rejection rates of the null hypothesis; however, an exploratory analysis showed that this has minimal impact on our results and the identified thresholds.

Although Lemma 1 is broadly applicable, our focus has been on ARMA processes due to their widespread use in scientific fields such as ecology [26], hydrology [25], and finance [27]. Autoregressive models, particularly the AR (1) process, are fundamental for time series analysis, modeling, and forecasting, with frequent applications in hydrology [4]. In contrast, moving-average models are often used for noise reduction in time series, as well as for modeling purposes. Among different statistical techniques for forecasting and trend detection involving regression methods, time series methods and stochastic processes (e.g., [28–30]), the Mann-Kendall statistic can be used to test the null hypothesis of no trend without fitting a specific time series or non-linear model. Mann-Kendall statistics with Gaussian distributions were recently recommended to this end [31]. We believe that our findings could encourage preliminarily testing the normality assumption before applying Mann-Kendall tests for trend detection. Consequently, our results on the importance of autocorrelations may have broader relevance, particularly when testing for the presence of critical transitions in time series (also known as early warning signals), where methodology involves averaging, which may introduce artificial autocorrelation. We plan to explore these implications further in a forthcoming paper.

Supporting information

S1 Appendix. Proofs of the main text.

https://doi.org/10.1371/journal.pone.0333224.s001

(PDF)

Acknowledgments

The authors are very grateful to Michael Kopp and Jean-René Chazottes for helpful discussions and for providing useful comments on various versions of this manuscript.

References

1. Mann HB. Nonparametric tests against trend. Econometrica: Journal of the Econometric Society. 1945;245–59.
- View Article
- Google Scholar
2. Kendall MG. Rank correlation methods. 4th ed. Griffin; 1975.
3. Hamed KH, Ramachandra Rao A. A modified Mann-Kendall trend test for autocorrelated data. Journal of Hydrology. 1998;204(1–4):182–96.
- View Article
- Google Scholar
4. Yue S, Wang C. The Mann-Kendall test modified by effective sample size to detect trend in serially correlated hydrological series. Water Resources Management. 2004;18(3):201–18.
- View Article
- Google Scholar
5. Yoshihara K. Limiting behavior of U-statistics for stationary, absolutely regular processes. Z Wahrscheinlichkeitstheorie verw Gebiete. 1976;35(3):237–52.
- View Article
- Google Scholar
6. Sen PK. On the properties of U-statistics when the observations are not independent: part one estimation of non-serial parameters in some stationary stochastic process. Calcutta Statistical Association Bulletin. 1963;12(3):69–92.
- View Article
- Google Scholar
7. Hamed KH. Exact distribution of the Mann–Kendall trend test statistic for persistent data. Journal of Hydrology. 2009;365(1–2):86–94.
- View Article
- Google Scholar
8. Lindsey J. Some statistical heresies (with discussion). Journal of the Royal Statistical Society Series D: The Statistician. 1999;48(1):1–40.
- View Article
- Google Scholar
9. RA F. The logic of inductive inference (with discussion). Journal of the Royal Statistical Society. 1935;98:39–82.
- View Article
- Google Scholar
10. Dufour J-M, Kiviet JF. Exact inference methods for first-order autoregressive distributed lag models. Econometrica. 1998;66(1):79.
- View Article
- Google Scholar
11. Dakos V, Carpenter SR, Brock WA, Ellison AM, Guttal V, Ives AR, et al. Methods for detecting early warnings of critical transitions in time series illustrated using simulated ecological data. PLoS One. 2012;7(7):e41010. pmid:22815897
- View Article
- PubMed/NCBI
- Google Scholar
12. Kendall MG. A new measure of rank correlation. Biometrika. 1938;30(1/2):81.
- View Article
- Google Scholar
13. Greiner R. Uber das fehlersystem der kollektivmasslehre. Zeitschrift für Mathematik und Physik. 1909;57:121–58.
- View Article
- Google Scholar
14. Phillips PCB. Towards a unified asymptotic theory for autoregression. Biometrika. 1987;74(3):535–47.
- View Article
- Google Scholar
15. Hoeffding W. A class of statistics with asymptotically normal distribution. Ann Math Statist. 1948;19(3):293–325.
- View Article
- Google Scholar
16. Chen X. Kendall’s Tau as the test for trend in time series data. Acadia University; 2012.
17. Berry AC. The accuracy of the Gaussian approximation to the sum of independent variates. Trans Amer Math Soc. 1941;49(1):122–36.
- View Article
- Google Scholar
18. Chen LH, Shao QM. Normal approximation for nonlinear statistics using a concentration inequality approach. 2007.
19. Bentkus V, Götze F, Tikhomirov A. Berry-Esseen bounds for statistics of weakly dependent samples. 1997.
20. Shapiro SS, Wilk MB. An analysis of variance test for normality (complete samples). Biometrika. 1965;52(3/4):591.
- View Article
- Google Scholar
21. Razali NM, Wah YB. Power comparisons of Shapiro-Wilk, Kolmogorov-Smirnov, Lilliefors and Anderson-Darling tests. Journal of Statistical Modeling and Analytics. 2011;2(1):21–33.
- View Article
- Google Scholar
22. Royston P. Remark AS R94: a remark on algorithm AS 181: the W-test for normality. Applied Statistics. 1995;44(4):547.
- View Article
- Google Scholar
23. Theil H. A rank-invariant method of linear and polynomial regression analysis. Indagationes mathematicae. 1950;12(85).
- View Article
- Google Scholar
24. Sen PK. Estimates of the regression coefficient based on Kendall’s Tau. Journal of the American Statistical Association. 1968;63(324):1379–89.
- View Article
- Google Scholar
25. Salas JD. Applied modeling of hydrologic time series. Water Resources Publication; 1980.
26. Ives AR, Abbott KC, Ziebarth NL. Analysis of ecological time series with ARMA(p,q) models. Ecology. 2010;91(3):858–71. pmid:20426343
- View Article
- PubMed/NCBI
- Google Scholar
27. Tsay RS. Analysis of financial time series. Wiley; 2005.
28. Lehnertz K. Time-series-analysis-based detection of critical transitions in real-world non-autonomous systems. Chaos. 2024;34(7):072102. pmid:38985967
- View Article
- PubMed/NCBI
- Google Scholar
29. Taylor SJ, Letham B. Forecasting at scale. The American Statistician. 2018;72(1):37–45.
- View Article
- Google Scholar
30. Tsay RS. Testing and modeling threshold autoregressive processes. Journal of the American Statistical Association. 1989;84(405):231–40.
- View Article
- Google Scholar
31. Chen S, Ghadami A, Epureanu BI. Practical guide to using Kendall’s τ in the context of forecasting critical transitions. R Soc Open Sci. 2022;9(7):211346. pmid:35911200
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Mann HB. Nonparametric tests against trend. Econometrica: Journal of the Econometric Society. 1945;245–59.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Kendall MG. Rank correlation methods. 4th ed. Griffin; 1975.

[ref3] 3. Hamed KH, Ramachandra Rao A. A modified Mann-Kendall trend test for autocorrelated data. Journal of Hydrology. 1998;204(1–4):182–96.
View Article
Google Scholar

[6] View Article

[7] Google Scholar

[ref4] 4. Yue S, Wang C. The Mann-Kendall test modified by effective sample size to detect trend in serially correlated hydrological series. Water Resources Management. 2004;18(3):201–18.
View Article
Google Scholar

[9] View Article

[10] Google Scholar

[ref5] 5. Yoshihara K. Limiting behavior of U-statistics for stationary, absolutely regular processes. Z Wahrscheinlichkeitstheorie verw Gebiete. 1976;35(3):237–52.
View Article
Google Scholar

[12] View Article

[13] Google Scholar

[ref6] 6. Sen PK. On the properties of U-statistics when the observations are not independent: part one estimation of non-serial parameters in some stationary stochastic process. Calcutta Statistical Association Bulletin. 1963;12(3):69–92.
View Article
Google Scholar

[15] View Article

[16] Google Scholar

[ref7] 7. Hamed KH. Exact distribution of the Mann–Kendall trend test statistic for persistent data. Journal of Hydrology. 2009;365(1–2):86–94.
View Article
Google Scholar

[18] View Article

[19] Google Scholar

[ref8] 8. Lindsey J. Some statistical heresies (with discussion). Journal of the Royal Statistical Society Series D: The Statistician. 1999;48(1):1–40.
View Article
Google Scholar

[21] View Article

[22] Google Scholar

[ref9] 9. RA F. The logic of inductive inference (with discussion). Journal of the Royal Statistical Society. 1935;98:39–82.
View Article
Google Scholar

[24] View Article

[25] Google Scholar

[ref10] 10. Dufour J-M, Kiviet JF. Exact inference methods for first-order autoregressive distributed lag models. Econometrica. 1998;66(1):79.
View Article
Google Scholar

[27] View Article

[28] Google Scholar

[ref11] 11. Dakos V, Carpenter SR, Brock WA, Ellison AM, Guttal V, Ives AR, et al. Methods for detecting early warnings of critical transitions in time series illustrated using simulated ecological data. PLoS One. 2012;7(7):e41010. pmid:22815897
View Article
PubMed/NCBI
Google Scholar

[30] View Article

[31] PubMed/NCBI

[32] Google Scholar

[ref12] 12. Kendall MG. A new measure of rank correlation. Biometrika. 1938;30(1/2):81.
View Article
Google Scholar

[34] View Article

[35] Google Scholar

[ref13] 13. Greiner R. Uber das fehlersystem der kollektivmasslehre. Zeitschrift für Mathematik und Physik. 1909;57:121–58.
View Article
Google Scholar

[37] View Article

[38] Google Scholar

[ref14] 14. Phillips PCB. Towards a unified asymptotic theory for autoregression. Biometrika. 1987;74(3):535–47.
View Article
Google Scholar

[40] View Article

[41] Google Scholar

[ref15] 15. Hoeffding W. A class of statistics with asymptotically normal distribution. Ann Math Statist. 1948;19(3):293–325.
View Article
Google Scholar

[43] View Article

[44] Google Scholar

[ref16] 16. Chen X. Kendall’s Tau as the test for trend in time series data. Acadia University; 2012.

[ref17] 17. Berry AC. The accuracy of the Gaussian approximation to the sum of independent variates. Trans Amer Math Soc. 1941;49(1):122–36.
View Article
Google Scholar

[47] View Article

[48] Google Scholar

[ref18] 18. Chen LH, Shao QM. Normal approximation for nonlinear statistics using a concentration inequality approach. 2007.

[ref19] 19. Bentkus V, Götze F, Tikhomirov A. Berry-Esseen bounds for statistics of weakly dependent samples. 1997.

[ref20] 20. Shapiro SS, Wilk MB. An analysis of variance test for normality (complete samples). Biometrika. 1965;52(3/4):591.
View Article
Google Scholar

[52] View Article

[53] Google Scholar

[ref21] 21. Razali NM, Wah YB. Power comparisons of Shapiro-Wilk, Kolmogorov-Smirnov, Lilliefors and Anderson-Darling tests. Journal of Statistical Modeling and Analytics. 2011;2(1):21–33.
View Article
Google Scholar

[55] View Article

[56] Google Scholar

[ref22] 22. Royston P. Remark AS R94: a remark on algorithm AS 181: the W-test for normality. Applied Statistics. 1995;44(4):547.
View Article
Google Scholar

[58] View Article

[59] Google Scholar

[ref23] 23. Theil H. A rank-invariant method of linear and polynomial regression analysis. Indagationes mathematicae. 1950;12(85).
View Article
Google Scholar

[61] View Article

[62] Google Scholar

[ref24] 24. Sen PK. Estimates of the regression coefficient based on Kendall’s Tau. Journal of the American Statistical Association. 1968;63(324):1379–89.
View Article
Google Scholar

[64] View Article

[65] Google Scholar

[ref25] 25. Salas JD. Applied modeling of hydrologic time series. Water Resources Publication; 1980.

[ref26] 26. Ives AR, Abbott KC, Ziebarth NL. Analysis of ecological time series with ARMA(p,q) models. Ecology. 2010;91(3):858–71. pmid:20426343
View Article
PubMed/NCBI
Google Scholar

[68] View Article

[69] PubMed/NCBI

[70] Google Scholar

[ref27] 27. Tsay RS. Analysis of financial time series. Wiley; 2005.

[ref28] 28. Lehnertz K. Time-series-analysis-based detection of critical transitions in real-world non-autonomous systems. Chaos. 2024;34(7):072102. pmid:38985967
View Article
PubMed/NCBI
Google Scholar

[73] View Article

[74] PubMed/NCBI

[75] Google Scholar

[ref29] 29. Taylor SJ, Letham B. Forecasting at scale. The American Statistician. 2018;72(1):37–45.
View Article
Google Scholar

[77] View Article

[78] Google Scholar

[ref30] 30. Tsay RS. Testing and modeling threshold autoregressive processes. Journal of the American Statistical Association. 1989;84(405):231–40.
View Article
Google Scholar

[80] View Article

[81] Google Scholar

[ref31] 31. Chen S, Ghadami A, Epureanu BI. Practical guide to using Kendall’s τ in the context of forecasting critical transitions. R Soc Open Sci. 2022;9(7):211346. pmid:35911200
View Article
PubMed/NCBI
Google Scholar

[83] View Article

[84] PubMed/NCBI

[85] Google Scholar

Figures

Abstract

1 Introduction

2 Assumptions and examples

3 Asymptotic variance of the Mann-Kendall tau for renormalized ARMA process

4 Checking for non-normality

4.1 Numerical investigation

4.2 For the AR(1) process

4.2.1 Case study.

4.3 For the SMA process

Discussion

Supporting information

S1 Appendix. Proofs of the main text.

Acknowledgments

References