Figures
Abstract
The CUSUM control chart is suitable for detecting small to moderate parameter shifts for processes involving autocorrelated data. The average run length (ARL) can be used to assess the ability of a CUSUM control chart to detect changes in a long-memory seasonal autoregressive fractionally integrated moving average with exogenous variable (SARFIMAX) process with underlying exponential white noise. Herein, new ARLs via an analytical integral equation (IE) solution as an analytical IE and a numerical IE method to test a CUSUM control chart’s ability to detect a wide range of shifts in the mean of a SARFIMAX(P, D, Q, r)s process with underlying exponential white noise are presented. The analytical IE formulas were derived by using the Fredholm integral equation of the second type while the numerical IE method for the approximate ARL is based on quadrature rules. After applying Banach’s fixed-point theorem to guarantee its existence and uniqueness, the precision of the proposed analytical IE ARL was the same as the numerical IE method. The sensitivity and accuracy of the ARLs based on both methods were assessed on a CUSUM control chart running a SARFIMAX(P, D, Q, r)s process with underlying exponential white noise. The results of an extensive numerical study comprising the examination of a wide variety of out-of-control situations and computational schemes reveal that none of the methods outperformed the IE. Specifically, the computational scheme is easier and can be completed in one step. Hence, it is recommended for use in this situation. An illustrative example based on real data is also provided, the results of which were found to be in accordance with the research results.
Citation: Areepong Y, Peerajit W (2022) Integral equation solutions for the average run length for monitoring shifts in the mean of a generalized seasonal ARFIMAX(P, D, Q, r)s process running on a CUSUM control chart. PLoS ONE 17(2): e0264283. https://doi.org/10.1371/journal.pone.0264283
Editor: Maria Alessandra Ragusa, Universita degli Studi di Catania, ITALY
Received: September 30, 2021; Accepted: February 5, 2022; Published: February 25, 2022
Copyright: © 2022 Areepong, Peerajit. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting information files.
Funding: This work was supported by Grant No. KMUTNBBasicR-64-20, National Science, Research and Innovation Fund (NSRF) via King Mongkut’s University of Technology North Bangkok, to Wilasinee Peerajit. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
The discipline of statistical process control (SPC) provides tools such as control charts for monitoring processes and detecting changes in a given in-control model. The Shewhart, cumulative sum (CUSUM), and exponentially weighted moving average (EWMA) control charts are the three most commonly used ones. The CUSUM and EWMA control charts can effectively detect small shifts in the parameters of interest (usually location parameters) whereas the Shewhart control chart such as the well-known -chart is used to detect large shifts in process parameters. Details of the charts can be found in Montgomery [1]. The CUSUM control chart was used in the present study due to its superior capability of detecting small parameter shifts and to take advantage of the benefits of using an upper-sided control chart.
Small levels of autocorrelation between successive observations can have a significant effect on the statistical properties of control charts. Numerous researchers have considered the effect of autocorrelation on control chart performance. Johnson and Bagshaw [2] and Bagshaw and Johnson [3] concluded that even small levels of autocorrelation generated by using autoregressive AR(1) or moving average MA(1) time series models made it more difficult to detect out-of-control signals on a conventional CUSUM control chart. Nevertheless, a CUSUM control chart has been used for monitoring shifts in the mean of a stationary ARMA (p, q) model process, and its performance has been evaluated when applied to processes with autocorrelated data [4].
Observations from real-world stochastic processes frequently exhibit a time series component. In particular, economic observations can have AR and MA components in a time series model. The error or so-called white noise, which is defined as the difference between the actual and approximated values, should be kept to a minimum to maintain the highest possible accuracy rate. Although the white noise (also referred to as Gaussian white noise) generated by autocorrelated observations usually follows a normal distribution, this is not always the case. Indeed, non-Gaussian white noise processes are useful for modeling a wide variety of phenomena, including wind speed, and the oxygen concentration and flow rate of water. Numerous studies have been conducted on time series models with non-Gaussian white noise, and when it is exponentially distributed (referred to as exponential white noise) is particularly interesting. Jacob and Lewis [5] considered an ARMA(1,1) process with exponential white noise, while Mohamed and Hocine [6] conducted a Bayesian analysis of an AR(1) process with exponential white noise. Furthermore, Pereira and Turkman [7] conducted a Bayesian analysis of threshold AR models with exponential white noise, while Suparman [8] recently proposed parameter estimations for an AR model of unknown order with exponential white noise.
Time series models are capable of capturing process correlation. An important class of time series model is the stationary process in which it is assumed that the process remains stable around a constant mean. For monitoring autocorrelated processes, the type of model can provide a framework for establishing statistical control. The following basic time series models were used in this study. The conventional Box-Jenkins AR integrated MA (ARIMA) model can be generalized as the AR fractionally integrated MA (ARFIMA) model that allows non-integer (fractional) values for the differencing parameter. This is advantageous when modeling time series with inherent long-range dependence (i.e., long-memory) [9, 10]. Accordingly, ARFIMA models are appropriate for time series long-memory processes, and exogenous variables can be incorporated into the time series model (i.e., ARFIMAX) to improve the latter’s performance. Ebens [11] first introduced the ARFIMAX model with k exogenous variables and used it to estimate the realized volatilities in a Dow Jones Industrial Average portfolio, while Degiannakis [12] estimated and forecasted intraday realized volatility using ARFIMAX and ARFIMAX-TARCH (threshold AR conditional heteroskedasticity) models.
Seasonality (seasonal periods over a single year) is a phenomenon that affects time series. Numerous scholars have discussed ARFIMA models with seasonal components, and it has been used to explain a large number of real-world cyclical phenomena in long-memory processes. For instance, it occurs in series of revenues, inflation rates, monetary aggregates, and gross national product. In the current study, we concentrate on seasonal ARFIMAX (SARFIMAX) models. In terms of economic forecasting and other fields, cases where an exogenous variable is included in the forecasting model are usually more accurate than ones without it. In addition, time series model processes have been monitored by using control charts. For instance, Ramjee [13] discovered that the performances of Shewhart and EWMA control charts with correlated observations in an ARFIMA model were not good, and so proposed the hyperbolic weighted MA (HWMA) control chart instead. Subsequently, Ramjee et al. [14] designed an HWMA forecast-based control chart particularly designed for a non-stationary ARFIMA model with autocorrelated data. When residual control charts with ARFIMA and ARIMA models were used to monitor the air quality Taiwan [15], the one with the ARFIMA model was more appropriate than the one with the ARIMA model. Rabyk and Schmid [16] recently introduced the EWMA control chart for detecting changes in the mean of a long-memory process with the control chart’s design based on an ARFIMA(p, d, q) process.
The most widespread measure of evaluating the performance of the control chart is the average run length (ARL). The power of a control chart is defined as the probability of its out-of-control signal detection, while the average number of samples required to signal an out-of-control situation in the process (or expected value of the run length) is represented by the ARL. The in-control and out-of-control ARLs are denoted as ARL0 and ARL1, respectively: ARL0 should be as large as possible whereas ARL1 should be as small as possible when the process changes undesirably. To improve the out-of-control detection performance on a CUSUM control chart, two alternative schemes are proposed that help to minimize ARL1 when ARL0 is fixed.
Assessing the ARL of a control chart using randomness and numerical methods is possible using a variety of techniques, including Monte Carlo, Markov Chain, and integral equations (IEs). The Monte Carlo simulation technique can be used to determine the expected run length until the target ARL value approaches that of an IE under the assumption of a sufficiently long simulation time. For the Markov Chain approach, the range between the upper and lower control limits is divided into sub-intervals, while the probabilities of the runs on the control chart are presented as a transitional probability matrix [17] over the sub-interval so that the ARL can be evaluated by inverting the matrix. In the meantime, the IE method comprises two alternatives: analytical IE and numerical IE. The analytical IE method generates exact ARLs or explicit formulas. The numerical IE method is transformed into a system of linear equations that can be solved for the ARL by using certain linear algebra-based methods. Crowder [18] provided a numerical method by using an IE of the second kind and a computer program for ARL evaluation of a normally distributed process on a two-sided EWMA control chart. Moreover, Hawkins [19] solved IEs for determining the ARL of a CUSUM control chart. Later, ARLs by using analytical IEs and the Markov Chain approach were used on EWMA and CUSUM control charts for observations from an AR(1) process with additional random error [20]. To evaluate the ARL for a SARMA(1, 1)s process with exponential white noise on a CUSUM control chart, Phanyaem [21] developed explicit formulas for IEs initially based on the SARMA model. In addition, analytical IEs for the solution for the ARL for a long-memory SARFIMA model on CUSUM control chart were presented by Peerajit et al. [22]. Recently, complex data from seasonal and non-seasonal MA processes with exogenous variables were used to evaluate the ARL on a CUSUM control chart by Sunthornwat and Areepong [23]. Moreover, it is important to apply Banach’s fixed-point theorem to prove the existence and uniqueness of an ARL. In this paper, our focus was on establishing ARLs for monitoring changes in a long-memory SARFIMAX process with underlying exponential white noise running on a CUSUM control chart.
The remainder of the article is organized as follows. Section 2 contains a brief review of the ARFIMAX models taking long memory and seasonality into account and descriptions of the CUSUM control chart, as well as the characteristics of the ARL and Banach’s fixed-point theorem. Theoretical proof for the existence and uniqueness of analytical IE, along with its comparable accuracy to the numerical ARL are covered in Section 3. The next section comprises derivations of the proposed analytical and numerical IE methods for the ARL for monitoring changes in a long-memory SARFIMAX process with underlying exponential white noise on a CUSUM control chart. The results of an evaluation of the performances of the IE methods are provided in Section 5. In Section 6, application of the methods to the monthly stock price data for PTT Public Company Ltd with the EUR/THB exchange rate as the exogenous variable is reported. Conclusions and a discussion of possible future work are provided in Section 7. Finally, the appendix contains technical details of the computation of the ARL by using the proposed methods.
Preliminaries
The relevant fundamentals of the long-memory SARFIMAX process with underlying exponential white noise are described in this section. In addition, we determine whether detecting changes in the mean of a generalized SARFIMAX process running on a CUSUM control chart is suitable for this study. For simplicity, the second subsection is focused on the upper-sided CUSUM control chart only. Next, performance evaluation of a control chart and the characteristics of the ARL are covered. In the last subsection, definitions related to Banach’s fixed-point theorem are covered.
Long-memory SARFIMAX processes with underlying exponential white noise
The ARFIMA model, a generalized form of the usual Box-Jenkins ARIMA model but allowing non-integer (fractional) values for the differencing parameter, is useful for modeling time series with inherent long-range dependence [9, 10]. Furthermore, including an exogenous variable in the ARFIMA model produces the ARFIMAX model with better performance [12]. In practice, time series are often found in meteorology, economics, hydrology, and astronomy [24]. Seasonality is an autocorrelation structure where the data vary according to specific periods in a year, a phenomenon that is included in our analysis.
A SARFIMAX model of order (P, D, Q, r)s, with AR order (P), fractional integration order (D), MA order (Q), and exogenous variables order (r), and period (s) for seasonal data is an extension of the ARFIMAX(p, d, q, r) model applied to a seasonal time series. SARFIMAX(P, D, Q, r)s is stationary and invertible for D ∈ (−1, 0.5); for D > 0, where D is seasonal fractionally differentiated/ integration and D ∈ (0, 0.5) is for a long-memory process on a CUSUM control chart.
Definition 1. Let be a sequence from a SARFIMAX(P, D, Q, r)s process defined as
(1)
where μ is the constant of the process, Xit,l = 1, 2, …, r are exogenous variables at time t, The ωl,l = 1, 2, …, r are coefficients corresponding to r exogenous variables. It is assumed that the error terms (ξt) are white noise, i.e., ξt ~ Exp(β). The operator B is a backward-shift operator (i.e., BskYt = Yt−sk), (1 − Bs)D is seasonal fractional difference operator, where s refers to the number of seasonal periods per year (e.g., s = 12 for monthly data), and ΦP(Bs) and ΘQ(Bs) are the coefficient of seasonal AR and MA polynomials in B order P and
, respectively, defined by
and
where the coefficient of seasonal Φi, i = 1.2, …, P, and Θj, j = 1, 2, …, Q are real numbers.
Remark: For all real D, seasonal fractional difference operator (1 − Bs)D with seasonality is defined by employing a binomial expansion as follows:
where D becomes crucial for describing the degree of seasonal persistence.
The result of the process is based on binomial expansion. After simplification of the derivation of process, the generalized SARFIMAX(P, D, Q, r)s process is as follows:
(2)
where initial value is normally the process mean, seasonal AR, seasonal MA and exogenous variables coefficient are, respectively, ξt−s, ξt−2s, …, ξt−Qs = 1, −1 ≤ Φi ≤ 1, −1 ≤ Θj ≤ 1, and −1 ≤ ωl ≤ 1. It is assumed that the initial value of generalized SARFIMAX(P, D, Q, r)s process Yt−s, Yt−2s, …, Yt−Ps, Yt−(P+1)s, … = 1. In this paper, Yt is long memory process which is assess D ∈ (0, 0.5).
Process monitoring on a CUSUM and EWMA control charts.
One of the simplest and efficient SPC techniques for detecting small and moderate changes in the mean of a process is the CUSUM and EWMA control charts. The statistic of the upper-sided CUSUM control chart [25] can be expressed by the following recursive equation:
(3)
where CUSUM control chart parameter Yt is a sequence of long-memory SARFIMAX(P, D, Q, r)s process with underlying exponential white noise. η > 0 and C0 ≥ 0 as the reference and starting value, respectively. The starting value of the CUSUM statistic is φ (i.e., C0 = φ, φ is the initial value; φ ∈ [0, b]), and parameter η is used to balance the series (Ct) that is commonly chosen to be larger than but close to the value of the in-control mean.
Remark: If Ct control limit b > 0, an alarm is triggered that indicates that the process might be out-of-control. In view of an exact performance evaluation, we chose rational or even real number design parameter η.
For the above CUSUM sequence (Ct), the corresponding stopping time with predetermined threshold b can be deployed:
(4)
where τb is the stopping time, b is the upper control limit (UCL) of CUSUM chart.
The statistic of the upper-sided EWMA control chart [26] can be expressed by the following recursive equation:
where λ is an exponential smoothing parameter with 0 < λ ≤ 1. The starting value of the EWMA statistic is Z0 = Y0, the target mean value is μ, Yt is a processes with mean μ and variance σ2.
The upper control limit (UCL) and lower control limit (LCL) to detect the sequence is given by,
(5)
where L is the width of the control limits, where μ0 is the target mean, σ is the process standard deviation.
Characteristics of the ARL.
Throughout the whole paper, the error term (ξt) is a sequence of continuous i.i.d. random variables taken from an exponential distribution with distribution function. (F(x, β)), which is used in the next section concerning monitoring of the mean. We consider the following simplified change-point models:
(6)
where β0 and β1 are known parameters; β1 > β0. As mentioned previously, when considering the change point in Eq (6), the ARL defined with Ev(.) can be characterized as follows:
where Ev() denotes the expectation under distribution F(x, β) for a given change-point time (ν). ν = ∞ indicates no change in the statistical process and is referred to as the in-control ARL (ARL0). Herein, ν = 1 marks the first time point that a change takes place from β0 to β in the statistical process, which is called out-of-control ARL (ARL1).
Definitions and theories for evaluating the ARL.
Definitions, theories, and concepts necessary to prove the existence and uniqueness of the IEs in the proof based on the functional analysis are covered in this section.
Definition 2. Let (M, d) be a metric space and let ϒ: M → M be a mapping. A point u ∈ M is called a fixed point of ϒ if ϒ(u) = u.
Remark: Fixed point theorems guarantee the existence of a fixed point under appropriate conditions on the map ϒ and the set M.
Definition 3. A normed vector space M is a Banach space if the metric space (M, d) is complete, where d(u, v) = ‖u − v‖, for all u, v ∈ M.
Consider the Banach space is n-dimensional Euclidean space , where the norm |.| is given by the Euclidean distance. The C(M) is space of continuous functions where M is a compact interval and the norm ‖.‖ is given by ‖f‖∞ = supu∈M |f(u)| for f ∈ C(M).
Definition 4. Let (M, d) be a metric space and let ϒ: M → M be a map, and ϒ is called contraction if there exists a fixed constant ρ ∈ [0, 1) such that
Theorem 1 (Banach’s fixed-point theorem or the contraction theorem) [27].
Let (M, d) be a complete metric space, then each contraction map ϒ: M → M has a unique fixed point.
The existence and uniqueness of the analytical IE for the ARL on a CUSUM control chart
The analytical IE for the ARL of a CUSUM control chart running a long-memory SARFIMAX process corresponds to a Fredholm integral equation of the second type. In this section, Banach’s fixed-point theorem is applied to prove the existence and uniqueness of the analytical IE solution and demonstrate that it has the same accuracy as the numerical IE method.
Throughout, this paper, let ℓ(φ) be the ARL conditioned on the initial value for a CUSUM control chart running a long-memory SARFIMAX process with underlying exponential white noise [17]. The initial value for monitoring CUSUM statistic C0 = φ is determined at φ ∈ [0, b]. Let and
be the probability measure and induced expectation corresponding to φ. Moreover, ARL is defined as a function of
, where τb = inf{t > 0; Ct > b} satisfies the solution for the analytical IE as follows:
(7)
where indicator function is
The IE for the ARL derived from the Fredholm integral equation of the second kind can be written as follows:
(8)
where f(.) is the probability density function (pdf) of an exponential distribution [17].
Hence, deriving the analytical IE via Eq (8) becomes
(9)
where ℓ(.) is an arbitrary function on [0, b]. The right-hand side of Eq (9) becomes a continuous function, and so theoretically, it is clear that the ARL function ℓ(.) is a continuous solution on [0, b]. for the IE in Eq (9). After that, definitions and Banach’s fixed-point theorem are applied to guarantee the existence and uniqueness of the solution for functional ℓ(φ).
Theorem 2. The existence and uniqueness of the solution for the analytical IE are derived for a long-memory SARFIMAX(P, D, Q, r)s process with underlying exponential white noise.
Proof: (Existence). As shown in Appendix A in S1 File,
In this situation we have
where ℓ(φ) is a fixed point of ϒ. Hence, the IE in Eq (9) can be rewritten as ϒ(ℓ(φ)) = ℓ(φ), which is Banach’s fixed-point theorem. If operator ϒ is a contraction, then fixed point ϒ(ℓ(φ)) = ℓ(φ).
Proof: (Uniqueness). Let ϒ be a contraction mapping on a complete metric space. By considering Banach’s fixed-point theorem, it can be shown that ϒ: C([0, b]) → C([0, b]) is a contraction mapping on the complete metric space (C([0, b]), ‖.‖∞) where metric C([0, b]) is the space of all continuous functions ℓ(φ) on interval; [0, b] endowed with supremum norm ‖.‖∞ = supφ∈[0,b] |ℓ(φ)|, for every function ℓ(.) ∈ C([0, b]). It has a unique solution, as shown in Appendix B in S1 File.
Thus, as proved and guaranteed by applying Banach’s fixed-point theorem, the analytical IE for a long-memory SARFIMAX(P, D, Q, r)s process with underlying exponential white noise on the CUSUM control chart exists and is unique for solutions of functional ℓ(φ).
Analytical and approximate ARLs for a long-memory SARFIMAX process with underlying exponential white noise on a CUSUM control chart
In this section, we apply the proposed analytical and numerical IEs corresponding to a Fredholm integral equation of the second type representing analytical and approximate ARLs for monitoring changes in the process mean of a long-memory SARFIMAX process with underlying exponential white noise on a CUSUM control chart.
The proposed analytical IE for a SARFIMAX(P, D, Q, r)s process with underlying white noise on a CUSUM control chart
Theorem 3. Exact formula ℓ(φ) for monitoring changes in the process mean of a long-memory SARFIMAX process with underlying exponential white noise on a CUSUM control charts can be expressed as
Proof: Let ℓ(φ) be the ARL of the IE corresponding to the long-memory SARFIMAX(P, D, Q, r)s process on a CUSUM control chart. Thus, can be substituted as follows:
(10)
By replacing φ = 0 in Eq (10), we obtain
(11)
Subsequently, by substituting ℓ(0) into Eq (10), it follows that
(12)
After rearranging, we obtain
(13)
Finally, by substituting constant c from Eq (13) into Eq (12), we obtain
(14)
The proof is complete.
As shown in the previous equation, the in-control ARL (ARL0) is assigned exponential parameter (β = β0), as follows:
(15)
Similarly, the out-of-control ARL (ARL1) is assigned exponential parameter (β = β1), where β1 = β0(1 + δ), and the shift in the mean is determined by δ and β0 = 1 as follows:
(16)
These circumstances show that the computational scheme can be completed in one stage.
The numerical ARL for a SARFIMAX(P, D, Q, r)s process with underlying white noise on a CUSUM control chart
An approximate ARL using the numerical IE method via Fredholm’s integral equation of the second kind was previously formulated for a CUSUM control chart by Peerajit et al. [22]. For this method, an approximate ARL was developed by using the Gauss-Legendre quadrature rules technique.
Definition 5. The quadrature rules usually applied to integral can be approximated by the sum of the areas of a rectangle is as follows:
where the value of integral f is chosen by applying base b/m with heights at the midpoints of the intervals of length b/m beginning at zero and W(g) is a weight function. Division points υ1 ≤ … ≤ υm fall within interval [0, b] and wj = b/m ≥ 0 set of constant weights wj, j = 1, 2, …, m.
Theorem 4. The numerical IE method (ϑ(φ)) for monitoring changes in the process mean of a long-memory SARFIMAX process with underlying exponential white noise (Yt) on a CUSUM control chart can be expressed as
with
, and
.
Proof: By applying ϑ(φ) as the approximate ARL from the IE solution in Eq (9) using the Gauss-Legendre quadrature rule technique and replacing φ with υj, we obtain
(17)
With the system of m linear equations with m unknowns ϑ(υ1), ϑ(υ2), …, ϑ(υm), we can rearrange Eq (17) as follows:
This is equivalent to matrix form Jm×1 = 1m×1 + Rm×mJm×1 If the inverse (Im − Rm×m)−1 exists, then the unique solution is
where Jm×1 = [ϑ(υ1), ϑ(υ2), …, ϑ(υm)]′, Im = diag(1, 1, …, 1) is the unit matrix order m, 1m×1 = [1, 1, …, 1]′ is a column vector of ϑ(υj), and Rm×m is a matrix of (m, m)th elements:
where
Therefore, the approximation for the integral is obtained in the summation where υj is replaced by φ in ϑ(υj), which completes the proof.
Performance evaluation and comparison
A popular benchmark to measure the performance of a control chart is the ARL for detecting shifts in the process mean as a measure of its sensitivity and practicability. Thus, the performances of the analytical and numerical IE methods for determining the ARL for monitoring shifts in the process mean for a long-memory SARFIMAX(P, D, Q, r)s process on a CUSUM control chart were compared. We determined the number of division points m = 800 for the approximated ARL by using the numerical IE method. The in-control situation comprises exponential white noise with mean β0 = 1, while the out-of-control process is signaled by a shift in the process mean from β0 to β1, where β1 = β0(1 + δ). Thus, δ = 0.01, 0.03, 0.05, 0.10, 0.30, 0.50, 0.70, 1.00, 1.50, and 2.00. δ = 0, means that the process is in-control whereas δ > 0, means that the process is out-of-control, denoted as ARL0 and ARL1, respectively. Accordingly, ARL0 was fixed at 370 or 500. For the CUSUM statistic, the value of η was determined as 3.0, 3.5, or 4.0. Thus, the CUSUM control limit (b) was calculated by using Eq (15) to achieve the specified ARL, the results for which are reported in Table 1. The coefficient parameters for long-memory SARFIMAX(2, 0.15, 1, 1)12, SARFIMAX(2, 0.30, 1, 1)12 and SARFIMAX(1, 0.45, 1, 1)12 processes were set as Φ1 = ± 0.1, Φ2 = 0.2, Φ1 = 0.1, and ω1 = 0.5. For a fixed ARL0, the method achieving the smallest ARL1 has delivered the best performance [1]. The results for the sensitivity, accuracy (see Definition 6), and computational times of the two methods are provided in Tables 2–6 and Fig 1.
(A) Models [1] to [3] for ARL0 = 370, (B) models [–1] to [–3] for ARL0 = 370, (C) models [1] to [3] for ARL0 = 500, and (D) models [–1] to [–3] for ARL0 = 500.
Definition 6. The percentage accuracy (% Accuracy) indicates the relative performances of the analytical and numerical IE methods defined as follows:
where ℓ(φ) and ϑ(φ) are the ARL values obtained using the analytical and numerical IE methods, respectively.
The following algorithm was developed by using the Mathematica program to calculate the values of the ARLs for a long-memory SARFIMAX(P, D, Q, r)s process on a CUSUM control chart:
- Step 1: Determine the values of coefficients Φ1, Φ2, Θ1, ω1, and the initial values of ξt−s, ξt−2s, …, ξt−Qs, Xlt, Yt−s, Yt−2s, Yt−3s, …, Yt−Ps, Yt−(P +1)s.
- Step 2: Specify β0 for the in-control exponential white noise (ξt ~ Exp(β)).
- Step 3: Determine the value of known parameter (η) and the initial value of CUSUM statistic φ.
- Step 4: Calculate the upper control limit (b) by using Eq (15) for the specified ARL0 and η.
- Step 5: Compute ARL1 for shifts in the process mean (β1) where b is attained from Step 4.
The above algorithm was executed to obtain ARLs for out-of-control situations as measurements of the performance of the proposed analytical and numerical IE methods.
The results for computed parameter b and reference parameter η for ARL0 = 370 or 500 using the proposed ARL schemes are reported in Table 1. b is inversely proportional to η for long-memory process models [1] to [3]. For instance, for prespecified ARL0 = 370 on long-memory SARFIMAX(2, 0.15, 1, 1)12 with coefficient values Φ1 = ±0.1, Φ2 = 0.2, Θ1 = 0.1, and ω1 = 0.5, it can be seen that for η = 3, b = 4.300562 and 4.021805, respectively. Only the ARL0, b, and η values are involved in the design of the CUSUM control chart (see Eq (15)).
The performance assessment of the analytical and numerical IE methods for determining the ARL for detecting shifts in the process means requires computing their ARL1 values, as described earlier. Tables 2–5 provide the numerical results obtain from both methods in which we report their ARL1 values, percentage accuracy, and computational times. We summarize our findings as follows:
- The results of the analytical and numerical IE methods for detecting changes in the process mean tended to decrease when the shift size was increased in the order of long-memory process models [1] SARFIMAX(2, 0.15, 1, 1)12, [2] SARFIMAX(2, 0.30, 1, 1)12 and [3] SARFIMAX(1, 0.45, 1, 1)12.
- The ARL1 values for [3] SARFIMAX(1, 0.45, 1, 1)12 and [2] SARFIMAX(2, 0.30, 1, 1)12 were slightly lower than for [1] SARFIMAX(2, 0.15, 1, 1)12. With negative coefficients, the results were similar: [–3] SARFIMAX(1, 0.45, 1, 1)12 and [–2] SARFIMAX(2, 0.30, 1, 1)12 were slightly lower than those [–1] SARFIMAX(2, 0.15, 1, 1)12 (see the results for ARL0 = 370 in Tables 2 and 3 and 500 in Tables 4 and 5).
- The reference value (η) was inversely proportional to the upper control limit and directly proportional to ARL1 for both methods.
- The percentage accuracy results were ~99% in all cases, meaning that the proposed analytical IE method was very accurate.
- The values in parentheses in each column are the computational times for calculating ARL1: 31–34 minutes for ARL0 = 370 and 32–38 minutes for ARL0 = 500 using the numerical IE but less than 1 second for both using the analytical IE.
Fig 1 shows plots of δ (shift size) versus ARL1 value obtained by using the analytical IE for the various long-memory SARFIMAX(P, D, Q, r)s processes with different reference values (η) = 3.0, 3.5, 4.0 and for ARL0 = 370 and 500. These graphs demonstrate the various characteristics of long-memory processes on the CUSUM control chart with coefficients Φ1 = 0.1, Φ2 = 0.2, Θ1 = 0.1, and ω1 = 0.5 in Fig 1A and 1C and Φ1 = −0.1, Φ2 = 0.2, Θ1 = 0.1, and ω1 = 0.5 in Fig 1B and 1D. It is evident that ARL1 decreased rapidly for small shifts (0 < δ ≤ 0.05) and slightly less so for moderate shifts (0.05 < δ ≤ 0.20). In addition, the lowest reference parameter (η) = 3.0 was better for detecting small to moderate shifts in the process mean than 3.5 and 4. Moreover, when D was increased from 0.15 to 0.30, detecting changes was more sensitive. The results also show that the efficacy of both IEs for detecting changes was best for models [3] and [–3] with reference parameter (η) = 3.0 and = 370 and 500.
In summary, the results show that the analytical IE method is a good alternative to the numerical IE method for evaluating shifts in the process mean of a long-memory SARFIMAX(P, D, Q, r)s process with exponential white noise on a CUSUM control chart, especially because of the marked reduction in computational time.
Illustrative example
In this section, the results of applying the proposed analytical and numerical IE methods for determining the ARL for monitoring changes in the mean of a long-memory SARFIMAX(P, D, Q, r)s processes with exponential white noise on a CUSUM control chart involving real data are provided. The real dataset comprises monthly stock price data for the PTT Public Company Ltd. in Thailand (https://th.investing.com/equities/ptt-historical-data) with the EUR/THB exchange rate as the exogenous variable (https://th.investing.com/currencies/eur-thb-historical-data). In this example, we used monthly data from January 2003 to August 2018 comprising 188 observations. A seasonal pattern exists when a series is influenced by a seasonal factor, which is a 12-month difference term in the model (S = 12). Accordingly, SARFIMAX(P, D, Q, r)12 was selected as the model. Statistical software package Eviews 10 was utilized for SARFIMAX(P, D, Q, r)12 filtering and estimating the model parameters.
The results in Table 6 reveal that the probability of variable X was significant (0.05; p-value = 0.0355), which indicates that the exchange rate does impact the stock price data for PTT. It can also be seen that the dataset is suitable for a long-memory SARFIMAX(1, 0.496151, 1 1)12 model presenting statistically significant parameters with coefficients Φ1 = 1.00, Θ1 = 0.799986, ω1 = −0.114685 and the residual series behaving as white noise. The suitability of the model was checked by plotting the actual (Yt) and fitted () components and appears to be in close agreement, as shown in Fig 2.
After that, the distribution of the white noise was confirmed as asymptotic exponential by using the Kolmogorov-Smirnov test. The results show that the white noise significantly fitted an exponential distribution (P-value = 0.091 > 0.05) with a mean of 3.42526 when the process was in-control, as reported in Table 7.
Based on the results in Tables 6 and 7, the SARFIMAX(1, 0.496151, 1 1)12 model (Yt) comprised the following equation:
where ξt ~ Exp(β0 = 3.42526).
Next, we calculated ARL0 using the analytical and numerical IEs and chose the value of parameter (η) = 6.5 after calculating the control limits for ARL0 = 370 and 500 using Eq (16) (b = 17.37735 and 19.2241, respectively). The rationale behind the use of 370 and 500 of the in-control ARLs is that they give good results in practice. The performances of the two methods for determining the ARL for detecting shifts in the process mean of a long-memory SARFIMAX(1, 0.496151, 1 1)12 on a CUSUM control chart are presented in Table 8 and Fig 3.
CL, center line; UCL, upper control limit; LCL, lower control limit.
It can be seen that the ARL1 results using the two methods are similar to those in Tables 2–5. The numerical results obtained from both methods were similar for both short- and long-term detections (ARL0 = 370 and 500, respectively) for all shift sizes of the process mean. Once again, the computation time for the analytical IE was far less than that of the numerical IE method (less than 1 second versus approximately 33 minutes, respectively). This means that the analytical IE for determining ARL1 is a good alternative for detecting changes in the process mean on a CUSUM control chart.
A SARFIMAX process involving real data was run on both CUSUM and EWMA control charts. The performances of the control charts were compared in terms of the ARL for detecting small to moderate shifts in the process mean; the ARL constructed using the analytical IE was used on the CUSUM control chart whereas the one using the numerical IE method was used on the EWMA control chart [28]. For the performance comparison, boundary values b = 17.37735, 19.2241 for the CUSUM control chart and b = 0.2507646, 0.2725181 for the EWMA control chart were used with prespecified ARL0 = 370 or ARL0 = 500, and smoothing parameter λ for the EWMA control chart was determined as 0.1. The results of the comparison are summarized in Table 9. For ARL0 = 370, the EWMA control chart provided a smaller ARL1 than the CUSUM control chart for shift size 0 < δ ≤ 0.3 whereas the CUSUM control chart provided a smaller ARL1 than the EWMA control chart for shift size 0.5 < δ ≤ 2.0. However, for ARL0 = 500, the CUSUM control chart performed better than the EWMA control chart for shift sizes 0.3 < δ ≤ 2.0. This means that the performance of the CUSUM control chart was more powerful than that of the EWMA chart when detecting moderate shifts in the process mean for ARL0 = 370 and 500. Therefore, the CUSUM control chart was more efficacious than the EWMA control chart under these conditions, which is in accordance with the study results.
In Fig 3, the center line, the lower and upper control limits of the CUSUM control chart were CL = β0, LCL = 0, UCL = 17.37735 and 19.2241for ARL0 = 370 and 500, respectively. We can see that the signal was first given at the 12th and 32nd-47th time points for ARL0 = 370 and 500, respectively (the observations in red plotted above the upper control limit). These results confirm the sensitivity of the proposed analytical IE for the ARL on a CUSUM control chart providing 176 and 156 out-of-control signals for ARL0 = 370 and 500, respectively. This situation directly affects the stock price of PPT. Hence, tracking changes in exchange rates can be key both in terms of economy and finance. In an investor’s view, the control chart above is the point in making profitable decisions for investors.
Conclusions
Performance assessment of control chart can be measured by using the ARL determined via analytical and numerical IE methods used to analyze and approximate the ARL computation, respectively. We applied both methods to determine the ARL for monitoring changes in the mean of a long-memory SARFIMAX process with underlying exponential white noise on a CUSUM control chart. Computation of the ARL by using the numerical IE method was used to check the accuracy of the analytical IE method by using the percentage accuracy benchmark and comparing their computation times. The number of division points (m) for the numerical IE method was large enough so that its ARL converged to that determined using the explicit formulas. The results show that the analytical IE was as accurate as the numerical IE method but required less computational time because its computational scheme was easier and could be completed in one stage.
The CUSUM control chart is suitable for detecting small and moderate parameter shifts for autocorrelated processes with bounded support [29, 30]. However, finding optimal parameters of the CUSUM control chart in these situations can be somewhat difficult. Therefore, obtaining the optimal values for the parameters of a CUSUM control chart should be studied further. The scope of the study could be extended in terms of the ARL and the methods used to compute it and could be applied to other types of processes including application to real data with other distributions of white noise such as gamma and Weibull from the exponential family.
References
- 1.
Montgomery DC. Introduction to Statistical Quality Control. 6 th ed. New York: John Wiley and Sons; 2009.
- 2. Johnson RA, Bagshaw M. The Effect of Serial Correlation on the Performance of Cusum Tests. Technometrics. 1974; 16(1): 103–112.
- 3. Bagshaw M, Johnson RA. The Effect of Serial Correlation on the Performance of CUSUM Tests II. Technometrics. 1975; 17(1): 73–80.
- 4. Atienza OO, Tang LC, Ang BW. A CUSUM Scheme for Autocorrelated Observations. Journal of Quality Technology. 2002; 34(2): 187–199.
- 5. Jacob PA, Lewis PAW. A Mixed Autoregressive-Moving Average Exponential Sequence and Point Process (EARMA 1,1). Advances in Applied Probability. 1977; 9(1): 87–104.
- 6. Mohamed I, Hocine F. Bayesian Estimation of an AR(1) Process with Exponential White Noise. A Journal of Theoretical and Applied Statistics. 2003; 37(5): 365–372.
- 7. Pereira IMS, Turkrman MA. Bayesian Prediction in Threshold Autoregressive Models with Exponential White Noise. Sociedad de Estadisticae Investigacion Operativa Test. 2004; 13: 45–64.
- 8. Suparman S. A New Estimation Procedure Using a Reversible Jump MCMC Algorithm for AR Models of Exponential White Noise. International Journal of GEOMATE. 2018; 15(49): 85–91.
- 9. Granger CWJ, Joyeux R. An Introduction to Long Memory Time Series Models and Fractional Differencing. Journal of Time Series Analysis. 1980; 1(1): 15–29.
- 10. Hosking JRM. Fractional Differencing. Biometrika. 1981; 68(1): 165–176.
- 11.
Ebens H. Realized Stock Index Volatility. Department of Economics, Johns Hopkins University; 1999; Working Paper No. 420.
- 12. Degiannakis S. ARFIMAX and ARFIMAX-TARCH Realized Volatility Modeling. Journal of Applied Statistics. Taylor & Francis Journals. 2008; 35(10): 1169–1180.
- 13.
Ramjee R. Quality Control Charts and Persistent Processes. PhD [dissertation]. Stevens Institute of Technology; 2000.
- 14. Ramjee R, Crato N, Ray BK. A Note on Moving Average Forecasts of Long Memory Processes with an Application to Quality Control. International Journal of Forecasting. 2002; 18: 291–297.
- 15. Pan JN, Chen ST. Monitoring Long-Memory Air Quality Data using ARFIMA Model. Environmetrics. 2008; 19: 209–219
- 16. Rabyk L, Schmid W. EWMA Control Charts for Detecting Changes in the Mean of a Long-Memory Process. Metrika. 2016; 79(3): 267–301.
- 17. Champ CW, Rigdon SE. A Comparison of the Markov Chain and the Integral Equation Approaches for Evaluating the Run Length Distribution of Quality Control Charts. Communications in Statistics Simulation and Computation. 1991; 20: 191–204.
- 18. Crowder S. Average Run Lengths of Exponentially Weighted Moving Average Control Charts. Journal of Quality Technology. 1987; 19: 161–164.
- 19. Hawkins DM. A CUSUM for a Scale Parameter. Journal of Quality Technology. 1981; 13: 228–31.
- 20. VanBrackle LN, Reynolds MR Jr. EWMA and CUSUM Control Charts in the Presence of Correlation. Communications in Statistics-Simulation and Computation. 1997; 26(4): 979–1008.
- 21. Suvimol P. Average Run Length of Cumulative Sum Control Charts for SARMA(1, 1)L Models. Thailand Statistician. 2017; 15: 184–195.
- 22. Peerajit W, Areepong Y, Sukparungsee S. Explicit Analytical Solutions for ARL of CUSUM Chart for a Long-Memory SARFIMA Model. Communications in Statistics-Simulation and Computation. 2019; 48(4): 1176–1190.
- 23. Sunthornwat R, Areepong Y. Average Run Length on CUSUM Control Chart for Seasonal and Non-Seasonal Moving Average Processes with Exogenous Variables. Symmetry. 2020; 12(1): 1–15.
- 24. Bisognin C, Lopes SRC. Estimating and Forecasting the Long-Memory Parameter in the Presence of Periodicity. Journal of Forecasting. 2005; 26(6): 405–427.
- 25. Page ES. Continuous Inspection Schemes. Biometrika. 1954;42: 243–54.
- 26. Roberts S.W. Control Chart Tests Based on Geometric Moving Average. Technometrics 1959; 1: 239–250.
- 27.
Sofonea M, Han W, Shillor M. Analysis and Approximation of Contact Problems with Adhesion or Damage, Chapman & Hall/CRC, New York. 2005.
- 28. Sunthornwat R, Areepong Y, Sukparungsee S. Average Run Length of the Long-Memory Autoregressive Fractionally Integrated Moving Average Process of the Exponential Weighted Moving Average Control Chart. Cogent Mathematics. 2017; 4: 1358536.
- 29. Sheng-Shu C, Fong-Jung Y. A CUSUM Control Chart to Monitor Wafer Quality World Academy of Science. Engineering and Technology International Journal of Industrial and Manufacturing Engineering. 2013; 7(6): 1183–1188.
- 30. Abujiya MR, Lee MH, Riaz M. Combined Application of Shewhart and Cumulative Sum R Chart for Monitoring Process Dispersion. Quality and Reliability Engineering International. 2016; 32(1), 51–67.