Figures
Abstract
In this paper, ranked set sampling is used for developing a non-parametric location chart which is developed on the basis of Wilcoxon signed rank statistic. The average run length and some other characteristics of run length are used as the measures to assess the performance of the proposed scheme. Some selective distributions including Laplace (or double exponential), logistic, normal, contaminated normal and student’s t-distributions are considered to examine the performance of the proposed Wilcoxon signed rank control chart. It has been observed that the proposed scheme shows superior shift detection ability than some of the competing counterpart schemes covered in this study. Moreover, the proposed control chart is also implemented and illustrated with a real data set.
Citation: Abid M, Nazir HZ, Tahir M, Riaz M (2018) On designing a new cumulative sum Wilcoxon signed rank chart for monitoring process location. PLoS ONE 13(4): e0195762. https://doi.org/10.1371/journal.pone.0195762
Editor: Fabio Rapallo, Universita degli Studi del Piemonte Orientale Amedeo Avogadro, ITALY
Received: October 27, 2017; Accepted: March 28, 2018; Published: April 17, 2018
Copyright: © 2018 Abid et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: The authors received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
1. Introduction
Statistical process control (SPC) is a combination of well-known statistical methods and problem-solving methods (such as cause and effect diagram, check sheet, Pareto chart, control chart etc.) that are used to monitor the process (cf. Montgomery [1]). The non-parametric control chart is an emerging area of development in the theory of SPC. Its main advantage is that it does not require any knowledge about the underlying distribution of the variables (cf. Chakraborti et al. [2]).
Different sampling schemes have been used in control charting strategies due to the fact that these play a substantial role in the performance of control charting structures. From one of these sampling schemes, ranked set sampling (RSS) scheme is becoming an important statistical toolkit and it is highly applied in SPC. McIntyre [3] proposed RSS scheme for collecting data in those situations when measurements are damaging or expensive while the ranking of the observation is relatively easy. In RSS scheme first, we select a sample of size n2 units from the target population and then partition the sample into n sets each of size n. After that rank the units within each set and then select the smallest ranked unit from the first set, the second smallest ranked unit from the second set, and so on, until the largest ranked unit is selected from the last set as shown in the following demonstration:
These steps are repeated m times (cycles) to produce a balanced RSS of size r = nm. Now a days, RSS has become a useful statistical technique and has been used widely in statistical quality control. Initially, Salazar and Sinha [4] developed control charts by using RSS and median RSS schemes. Muttlak and Al-Sabah [5] are also developed control charts using different RSS schemes. Abujiya and Muttlak [6] developed a control chart on the basis of different double RSS (DRSS) schemes. Al-Omari and Haq [7] also proposed control charts using the DRSS scheme. Abujiya et al. [8] improved the performance of the exponentially weighted moving average (EWMA) control charts by using various forms of RSS schemes. Mehmood et al. [9] introduced the control charts using different RSS schemes to examine the process mean. Abujiya et al. [10] also enhanced cumulative sum (CUSUM) control charts for studying the process dispersion under various RSS schemes. Haq et al. [11] introduced a new synthetic control chart for evaluating process mean and process dispersion by utilizing the various forms of RSS. Haq et al. [12] has also recommended a new Maximum EWMA (Max EWMA) control chart by using the ordered DRSS (ODRSS) and ordered imperfect DRSS (OIDRSS) schemes. The procedure of DRSS and ODRSS schemes are as follows: If we again apply the RSS scheme on n ranked set samples, then we will get a double ranked set sample of size n. This sampling procedure is called DRSS. If we arrange this double ranked sample in an increasing order of magnitude, we get an ordered double ranked set sample of size n, and this scheme is named ODRSS (cf. Haq et al., [12]). If we are faced errors in the ranking procedure of ODRSS, then it is called imperfect double ranked set sampling (OIDRSS) (cf. Haq et al., [12]). Also, Haq et al. [13] developed a new synthetic EWMA and synthetic CUSUM control chart using various forms of RSS schemes for monitoring the process mean.
Recently, some researchers have paid much attention to developing a control chart, based on the RSS method by using a non-parametric approach. Abid et al. [14] developed a non-parametric CUSUM control chart, based on sign test, by using RSS technique. Tapang et al. [15] proposed three non-parametric control charts based on RSS method. A non-parametric EWMA control charts based on sign test and Wilcoxon signed-rank statistic using RSS has also been presented by Abid et al. [16, 17]. In this study, we used the RSS method to develop a non-parametric CUSUM Wilcoxon signed rank control chart to monitor the process target.
The rest of the paper is structured as follows: In Section 2, the concept of the CUSUM signed rank chart by using SRS scheme and the construction of this chart in the RSS scheme is reported. Section 3, consists of the performance evaluation and comparisons of the proposed chart. A real-life use of the suggested scheme is illustrated in Section 4. While a brief conclusion and recommendation are given in Section 5.
2. CUSUM signed rank chart using SRS
Bakir and Reynolds [18] proposed the non-parametric CUSUM signed rank chart by utilizing SRS scheme, in future, named as CUSUM-SR chart. According to Bakir and Reynolds [18] the designed structure of the Wilcoxon signed rank (SR) statistic is illustrated below: Let θ0 be the mean value of the quality characteristic Xph and it is of the known median, θ, that is being tested. Let be the within-group absolute rank of deviations from the target value θ0. Mathematically,
Then, the test statistic of SR under SRS is as follows:
(1)
where p = 1,2,3,… and h = 1,2,…,n and
According to Gibbons and Chakraborti [19] the SR test statistic is linearly related to another SR test statistic
through the relationship:
(2)
The mean and variance of
are
and
, respectively (cf. Gibbons and Chakraborti [19]). Using the relationship in Eq (2), we get the mean and variance of SRSRS as E(SRSRS) = 0 and
, respectively.
The two plotting and monitoring statistics and
of the CUSUM-SR scheme using Eq (1) are defined as:
(3)
where
is the SR statistic at time p and K is the reference value of the CUSUM-SR scheme (cf. Montgomery [1]). Initial values of
and
are set as zero i.e.,
and
After that,
and
are plotted against the control limit H. If any
or
, the process is declared to be out of control, if not, it is in control. So, the CUSUM-SR chart has two parameters K and H. These two parameters are defined as (cf. Montgomery [1]).
(4)
where
.
2.1. Proposed CUSUM signed rank chart using RSS
Let a sample of size n is obtained from X(ph)i using RSS to examine the process location, θ0. Let be the within-group absolute rank of deviations from the target value θ0. Mathematically,
Let, r = nm,m > 1 gives a RSS and m be the number of cycles to be repeat.
The test statistic of SR under RSS is as follows (cf. Kim and Kim [20])
(5)
where, p = 1,2,3,…, h = 1,2,…,n and i = 1,2,…m and
The test statistic of SR using RSS is also linearly linked to another test statistic of SR i.e.,
through the relationship:
(6)
The expected value and variance of the statistic
are as follow:
and
, respectively (cf. Kim and Kim [20]). So, the mean and variance of SRRSS, through the relationship mentioned in Eq (6), are given as: E(SRRSS) = 0 and
, respectively. The value of
can be defined as:
(7)
where the values of Fh(0), h = 1,2,…,n, are obtained byestimating the incomplete beta integral.
(8)
The values of
, from n = 2,3,…, 10, are reported in Table 1 which is a vital part to get the enhancement of RSS over SRS (cf. Hettmansperger [21]).
So, the two plotting and monitoring statistics i.e., and
of the proposed non-parametric CUSUM signed rank control chart, later on named as, RCUSUM-SR chart based on Eq (5) can be written as:
(9)
where
is the SR statistic of the proposed chart at time p and K is the reference value of the suggested chart (cf. Montgomery [1]). The initial value of the two plotting statistics, for the proposed scheme, is taken equal to zero i.e.,
and
Next, the two plotting and monitoring statistics, given in Eq (9), are plotted against the control limit H. As long as the values of
and
are plotted inside of the control limit i.e., H, then the process is assumed to be in control, if not, it is out of control. The K and H are two design parameters of the proposed chart and these parameters are selected in a way that they satisfy average run length (ARL). The two parameters of the proposed chart are defined as (cf. Montgomery [1]).
(10)
where
We evaluate that combinations of (K,H) by considering different values of n and m under which the proposed chart has its ARL0 close to 370 and 500 (i.e. ARL0 ≅ 370 and 500). The values of (K,H) of the proposed RCUSUM-SR chart are given in Table 2. From Table 2, we found that increase in the values of n and m may also cause increase in the value of K and vice versa. There is also found an upward and downward pattern in the value of H by increasing the values of n and m (cf. Table 2). Table 2 plays a vital role for implementing the proposed chart in practice.
3. Performance assessment of the proposed chart
In this section, we assessed the performance of the proposed RCUSUM-SR control chart for in control and out of control processes. The average run length (ARL), which is the mean of run length (RL) distribution, is the most significant and commonly used measure to assess the performance of control charts. Some researchers such as Barnard [22], Woodall [23] and Gan [24] have discouraged the only use of ARL due to the skewed distribution of run length. Hence, in this study, we also include some other measures of run length such as the standard deviation (SDRL) and some percentile points (which includes 5th, 25th, 50th, 75th and 95th) in order to enhance better results.
In this study, we include a variety of distributions such as: (a) the standard normal distribution, N(0,1); (b) the Student’s t-distribution, t(v), with degree of freedom v = 4 and 8, respectively; (c) the Laplace (or double exponential) distribution, ; (d) the logistic distribution,
; the contaminated normal (CN) distribution, a mixture of
and
, represented by
where
and α = 0.05. The main use of CN distribution is to evaluate the performance of proposed chart in case of outliers. The distributions considered in this study, having mean value 0 and the value of standard deviation 1, in order to compare the results of all distributions.
In this study, a Monte Carlo (MC) simulation is used to calculate the run length values of the RCUSUM-SR chart and R language is used for this purpose. The simulations steps are given below:
- We create random subgroup from all the selected distributions using the values of K,H,m,n and shift of the size (δ) for a given ARL0.
- The proposed chart is constructed by using the expression given in Eq (9).
- The number of samples, at which the plotting statistic of the proposed chart falls outside the control limit H, is observed.
- The steps (i)-(iv) are repeated 50000 to obtain the run lengths.
At the end, we found the average of the values in step (iv) to get the ARL value.The in control and out of control performances of the RCSUSM-SR chart is compared with the charts selected in this study. The performance assessment is made on the basis of ARL. The proposed chart is compared with the CUSUM-SR chart proposed by Bakir and Reynolds [18], the nonparametric EWMA signed rank chart using RSS developed by Abid et al. [17] hereafter, is named as REWMA-SR control chart, the nonparametric EWMA signed rank chart using SRS constructed by Graham et al. [25], after that, is named as EWMA-SR control chart and the noparametric EWMA sign chart proposed by Graham et al. [26] and is named as EWMA-SN control chart. As the proposed and the existing charts are non-parametric control charts, therefore, their in control run length distribution have to remain similar for all continuous distributions and these results are reported in Table 3 at ARL0 ≈ 500. It is to be noted that the ARL0 of the proposed chart and the existing charts seems almost identical for all distributions under consideration in this study (cf. Table 3). The first row of each cell in Tables 3–7 shows the ARL and SDRL values, respectively, whereas the second row shows the values of the 5th, 25th, 50th, 75th and 95th percentiles (in this order).
The ARL1 values of the RCUSUM-SR chart for all the selected distributions and δ = 0.5(0.5)2.5 (shift in the process), for n = 5, 10 and m = 1(2)3 are given in Tables 4 and 5.
The main findings from Tables 4 and 5 are as follows:
- The ARL1 values of the RCUSUM-SR scheme increase as the value of n increases and vice versa (cf. Table 4 vs Table 5).
- It is found to be increase in the values of ARL1 when the value of m decreases and vice versa (cf. Table 4).
- The out of control performance of the proposed scheme is relatively better in case of t(4) and t(8) when δ ≤ 0.1, as compared to other distributions selected in this study (cf. Tables 4 and 5).
- It is also found that the performance of the RCUSUM-SR chart is relatively better in case of CN distribution in comparison to other distributions such as N(0,1),
and
when δ ≤ 0.1 (cf. Tables 4 and 5).
- The SDRL and percentiles values also increases when the values of m and n decrease and vice versa (cf. Table 4 vs Table 5).
So, in general, we can say that the proposed scheme is more proficient in detecting the small and large shifts in the location parameter.
3.1. Comparisons
This section is related to the comparison of the RCUSUM-SR chart with the charts considered in this study. The proposed chart is compared with the CUSUM-SR, the REWMA-SR, the EWMA-SR and the EWMA-SN charts on the basis of ARL. We have also regenerated the results of Abid et al. [17] Bakir and Reynolds [18], Graham et al. [25] and Graham et al. [26] and found similar results.
3.1.1. RCUSUM-SR versus CUSUM-SR.
The ARL1 values of the CUSUM-SR scheme are reported in Table 6. From Tables 5 and 6, it is revealed that the ARL1 performance of the RCUSUM-SR chart is quite better in comparison to CUSUM-SR chart for all the values of m and n under all the selected distributions. The proposed scheme has smaller ARL1 values as compared to the CUSUM-SR scheme for all the selected shifts (cf. Table 5 versus Table 6). Hence, the proposed scheme performs more efficiently under all conditions in comparison to CUSUM-SR scheme.
3.1.2. RCUSUM-SR versus EWMA-SN.
The results of EWMA-SN scheme are given in Table 7. From Tables 5 and 7, it is observed that the RCUSUM-SR scheme has better ARL1 performance in comparison to the EWMA-SN scheme for all choices of design parameters under all the selected distributions. Thus, we can say that RCUSUM-SR has a better performance in detecting all the selected shifts than the EWMA-SN chart.
3.1.3. RCUSUM-SR versus EWMA-SR.
The ARL1 values of the EWMA-SR chart are provided in Table 7. Comparing the EWMA-SR chart with the RCUSUM-SR chart, it is noted that RCUSUM-SR chart performs more efficiently than the EWMA-SR scheme for detecting all the selected shifts under all the selected distributions (cf. Table 5 versus Table 7). Hence, the proposed RCUSUM-SR chart shows reasonably improved performance in comparison to the EWMA-SR scheme.
3.1.4. RCUSUM-SR versus REWMA-SR.
The ARL1 results of REWMA-SR scheme are given in Table 7. To compare the proposed scheme with the REWMA-SR, we observed that the RCUSUM-SR scheme shows improved shift detection ability than the REWMA-SR scheme under normal and non-normal distributions (cf.Table 5 versus Table 7). Thus, the proposed chart performs more efficiently as compared to the REWMA-SR scheme.
To prove the superiority of the proposed chart over other existing charts, we have made some graphical presentation in the form of ARL1 curve. In Figs 1 and 2, the proposed RCUSUM-SR scheme is compared with the CUSUM-SR, the REWMA-SR, the EWMA-SR and the EWMA-SN charts. From Fig 1, it can be seen that the proposed chart shows better performance as compared with the CUSUM-SR, the REWMA-SR, the EWMA-SR and the EWMA-SN charts at all shifts. We noticed that performance of the RCUSUM-SR scheme also increased as the value of m increased as compared to the other schemes under study (cf. Fig 2). At the end, it is we may conclude that the RCUSUM-SR chart is more useful in detecting all the selected shifts under all the considered distributions.
4. Application of the proposed chart
Application of the proposed chart is presented in this section. The data set is supplied by Muttlak and Al-Sabah [5] and collected in Pepsi Cola Company in Al-Khobar, Saudi Arabia.
From Muttlak and Al-Sabah [5] data set using RSS technique, 27 ranked set samples were chosen by repeating cycle 4 times and every cycle has size n = 3, so r = 12 and during the filling process it was observed that internal pressure system of machines got high that led to the increment in the quantity of soft drink bottles. Management indicated the quality control manager that the problem occurred and observed at the 19th sample point of data collection. The proposed scheme is compared with REWMA-SR scheme. We also chosen 27 samples having size r = 12 using SRS scheme for comparing the RCUSUM-SR chart to the CUSUM-SR, EWMA-SR and EWMA-SN charts. We have chosen the design constants of the proposed scheme and other schemes by setting up ARL0 ≅ 370 for the construction of the control limits and these values are given in Table 8.
The monitoring statistics of the EWMA-SN, the EWMA-SR, the REWMA-SR, the CUSUM-SR and the RCUSUM-SR schemes are thus plotted against their control limits as depicted in Figs 3, 4, 5, 6 and 7 respectively. We observed that the EWMA-SN scheme triggered two out of control warning sign at samples 26 and 27 (cf. Fig 3). On the other hand, the EWMA-SR and the CUSUM-SR charts proposed by Bakir and Reynolds [18] and Graham et al. [25], respectively, indicated out of control signals at samples 25, 26 and 27 (cf. Figs 4 and 6). The REWMA-SR chart gave out of control signs at samples 23, 24, 25, 26 and 27 (cf. Fig 5). Finally, the RCUSUM-SR chart triggered seven out of control signals at samples 21, 22, 23, 24, 25, 26 and 27 (cf. Fig 7). So, we can conclude that the proposed scheme has a quicker shift detection capability as compared to the EWMA-SN, the EWMA-SR, the REWMA-SR and the CUSUM-SR schemes.
5. Conclusions and recommendations
Control charts are extensively used to monitor stability and performance of processes with an objective of detecting abnormal variations in process parameters. In this paper, we have applied ranked set sampling scheme in order to propose a non-parametric CUSUM control chart based on Wilcoxon signed rank test statistic to monitor the process location. The proposed chart is compared with the REWMA-SR, the CUSUM-SR chart, the EWMA-SN and the EWMA-SR charts. From the results of this study, we have observed that the proposed scheme shows a quicker detection ability as compared to its competitors. Hence, it is concluded that when the units are difficult and costly to measure, the proposed non-parametric CUSUM Wilcoxon signed rank control chart, based on ranked set sampling may be preferred to its existing counterparts. It is to be mentioned that the scope of this work can also be extended to other control charting strategies and diversities of runs rules schemes.
References
- 1.
Montgomery DC. Introduction to Statistical Quality Control. 6th Edition, Wiley, New York; 2009.
- 2. Chakraborti S, Van der Lann, Van der Wiel MA. Non-parametric control charts: An overview and some results. Journal of Quality Technology. 2001; 33: 304–315.
- 3. Mclntyre GA. A method for unbiased selective sampling, using ranked sets. Australian Journal of Agricultural Research. 1952; 3: 385–390.
- 4. Salazar RD, Sinha AK. Control chart x based on ranked set sampling. Comunicacion Tecica. 1997; 1-97-09 (PE/CIMAT).
- 5. Muttlak HA, Al-Sabah W. Statistical quality control based on pair and selected ranked set sampling. Pakistan Journal of Statistics. 2003; 19: 107–128.
- 6. Abujiya MR, Muttlak HA. Quality control chart for the mean using double ranked set sampling. Journal of Applied Statistic. 2004; 31: 1185–1201.
- 7. Al-Omari AI, Haq A. () Improved quality control charts for monitoring the process mean using double ranked set sampling methods. Journal of Applied Statistic. 2012; 39: 745–763.
- 8. Abujiya MR, Riaz M, Lee MH. Improving the performance of the Exponentially Weighted Moving Average control charts. Quality and Reliability Engineering International. 2012; 30: 571–590.
- 9. Mehmood R, Riaz M, Does RJMM. Control charts for location based on different sampling schemes. Journal of Applied Statistics. 2013; 40: 483–494.
- 10. Abujiya MR, Riaz M, Lee MH. Enhanced cumulative sum charts for monitoring process dispersion. PLOS ONE. 2013; 10: e0124520.
- 11. Haq A, Brown J, Moltchanova E. A new maximum exponentially weighted moving average control chart for monitoring process mean and dispersion. Quality and Reliability Engineering International. 2015; 31: 1587–1610.
- 12. Haq A, Brown J, Moltchanova E. New synthetic control charts for monitoring process mean and process dispersion. Quality and Reliability Engineering International. 2015; 31: 1305–1325.
- 13. Haq A, Brown J, Moltchanova E. New synthetic EWMA and synthetic CUSUM control charts for monitoring the process mean. Quality and Reliability Engineering International. 2016; 32: 269–290.
- 14. Abid M, Nazir ZA, Riaz M, Lin Z. Use of ranked set sampling in non-parametric control charts. Journal of the Chinese Institute of Engineers. 2016; 39: 627–636.
- 15. Tapang W, Pongpullponsak A, Sarikavanij S. (2016) Three non-parametric control charts based on ranked set sampling. Chiang Mai Journal of Science. 2016; 43: 915–930.
- 16. Abid M, Nazir ZA, Riaz M, Lin Z. An efficient non-parametric EWMA Wilcoxon signed-rank chart for monitoring location. Quality and Reliability Engineering International Quality and Reliability Engineering International. 2017; 33: 669–685.
- 17. Abid M, Nazir ZA, Riaz M, Lin Z. Investigating the impact of ranked set sampling in non-parametric CUSUM control charts. Quality and Reliability Engineering International. 2017; 33: 203–214.
- 18. Bakir ST, Reynolds MR. A non-parametric procedure for process control based on within-group ranking. Technometrics. 1979; 21: 175–183.
- 19.
Gibbson JD, Chakraborti S. Non-parametric Statistical Inference. 4th edition. Marcel Dekker;,New York; 2003.
- 20. Kim DA, Kim YC. Wilcoxon signed rank test using ranked-set sampling. Korean Journal Computational and Applied Mathematic. 1996; 3: 235–243.
- 21. Hettmansperger TP. The ranked-set sample sign test. Journal of Non-parametric Statistics. 1995; 47: 263–270.
- 22. Barnard GA. Control charts and stochastic processes. Journal of the Royal Statistical Society (B). 1959; 21: 239–271.
- 23. Woodall WH. (1983) The distribution of the run length of one-sided CUSUM procedures for continuous random variables. Technometrics. 1983; 25: 295–301.
- 24. Gan FF. An optimal design of cumulative sum control chart based on median run length. Communications in StatisticsSimulation and Computation. 1993; 23: 485–503.
- 25. Graham MA, Chakraborti S, Human SW. A non-parametric exponentially weighted moving average signed-rank chart for monitoring location. Computational Statistic and Data Analysis. 2011; 55: 2490–2503.
- 26.
Graham MA, Chakraborti S, Human SW. A non-parametric EWMA control chart based on the sign statistic. Technical report 09/04. Department of Statistics, University of Pretoria; 2011.