Figures
Abstract
Two-stage cluster sampling is often employed in survey sampling when complete population information is not available. In this setting, the Exponentially Weighted Moving Average (EWMA) statistic offers an efficient way to estimate the population mean by incorporating both past and current data. Motivated by this, we propose a class of memory-type ratio and exponential estimators for estimating the population mean under a two-stage cluster sampling framework. Theoretical expressions for the biases and mean square errors (MSE) of the proposed estimators are derived. To evaluate their performance, a comprehensive simulation study was carried out, supplemented by an empirical application. Several special cases of the proposed estimators were also considered and compared with existing two-stage estimators. The analysis was performed under different values of the EWMA smoothing constant (). Both simulation and empirical results consistently show that the proposed memory-type two-stage ratio estimators outperform existing approaches, providing improved efficiency with minimum MSE.
Citation: Minhas KS, Semary HE, Jabeen R, Zaka A (2025) Multiscale study of memory type simple ratio estimators in two stage sampling under exponentially weighted moving averages. PLoS One 20(11): e0335586. https://doi.org/10.1371/journal.pone.0335586
Editor: Laleh Tafakori, RMIT University, AUSTRALIA
Received: April 12, 2025; Accepted: October 14, 2025; Published: November 13, 2025
Copyright: © 2025 Minhas et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The simulated facts and data statistics used to support the findings of this study are included in the article.
Funding: This work was supported and funded by the Deanship of Scientific Research at Imam Mohammad Ibn Saud Islamic University (IMSIU) (grant number IMSIU-DDRSP2502).
Competing interests: The authors have declared that no competing interests exist.
1. Introduction
In survey sampling, when population information is not fully available, the cluster sampling technique is often ideal for selecting a random sample. When this procedure is implemented in two stages, it becomes two-stage cluster sampling [1]. In this design, a random sample of nnn clusters, referred to as first-stage units (FSUs), is selected from the total of N clusters. Subsequently, from each of the nnn selected clusters of size Mi (i = 1, 2… n), a random sample of mi elements is drawn, known as the second-stage units (SSUs) [2].
[3] suggested general families of factor-type estimators for the population mean under the condition that non-response occurs either on the study variable alone or on both the study and auxiliary variables, making use of auxiliary variable information. Further [4], recommended calibration based estimation of finite population total in two stage random sampling. [5,6] proposed generalized ratio estimators using single and two auxiliary variables in two-stage sampling and exponential estimators to estimate population mean using two-phase scheme of sampling considering two auxiliary variables in two-stage sampling. The problem of estimating population mean in the presence of non-response under two stage technique of sampling was suggested by [7]. By using [8] technique, the estimator under two non-response models has been established. Moreover [9], proposed five different two-stage sampling designs based on neoteric ranked set sampling (NRSS). In another study [10], introduced a new two-stage cluster sampling design in which the first-stage sampling was carried out using probability proportional to size (PPS) and the second stage employed RSS to address variability in population unit sizes at the first stage. Similarly [11], analyzed clustered data under two-stage informative sampling within two-level models. Their investigation provided a new method to estimate cluster effects while accounting for design weights using a normal approximation. Furthermore [12], investigated a general class of exponential and chain ratio-type estimators for estimating the population mean under two-stage sampling with two auxiliary variables in the presence of measurement error. In addition [13], proposed a new generalized class of estimators based on PPS sampling with two auxiliary variables for estimating the finite population mean.
Several studies have extended memory-type estimators using the EWMA statistic in different sampling settings. For time-based surveys, memory-type ratio and product estimators were proposed in [14–16], while their application under ranked-based sampling (RBS) schemes such as extreme, median, and quartile RSS was also investigated. A hybrid EWMA approach was introduced in [17] to improve process mean estimation using secondary information. Further extensions include memory-type estimators with auxiliary variables in two-phase simple random sampling [18], logarithmic estimators using EWMA [19], and hybrid EWMA-based logarithmic estimators [20]. Variance estimation in the presence of measurement errors was addressed in [21] through memory-type ratio and product estimators with EWMA. Additional developments include the use of auxiliary variables with extended EWMA statistics [22] and optimal classes of memory-type simple ratio estimators in two-stage sampling [23].
Several extensions of the EWMA statistic have been proposed in recent studies. A homogeneously weighted moving average chart for variance was developed in [24] for rapid detection of process shifts using auxiliary information, while [25] introduced an adaptive EWMA-based control chart for the coefficient of variation under time-varying process means and linearly changing standard deviations. In healthcare applications [26], addressed patient risk adjustment by designing EWMA charts based on residuals from support vector machine regression, and [27] extended this idea to monitoring process dispersion across diverse environments through adaptive EWMA charts using support vector regression. For survey estimation [28], proposed generalized families of exponential estimators based on EWMA, while [29] introduced optimal memory-type methods for imputing missing time-based survey data. Furthermore [30], analyzed the effects of measurement and non-response errors on estimation accuracy, considering cases where non-response affected only the study variable or both study and auxiliary variables.
Several studies have examined two-stage sampling and memory-type estimators using different statistics. Building on this literature, the present study develops ratio and exponential estimators under a two-stage sampling scheme by incorporating the EWMA statistic for time-scale surveys at the second stage. The memory-type estimator integrates both past and present information through the EWMA approach introduced by [16]. Specifically, we propose generalized two-stage simple ratio estimators for three cases: (i) unequal first-stage units with weighted mean, (ii) unequal first-stage units with unweighted mean, and (iii) equal-sized first-stage units. To evaluate their performance, a simulation study is conducted across these scenarios.
The remainder of this paper is organized as follows. Section 2 reviews existing two-stage ratio and exponential estimators under different cases of first-stage unit selection. Section 3 introduces the proposed memory-type ratio estimators using the EWMA statistic and derives their bias and MSE, while Section 4 develops the corresponding exponential estimators. Section 5 presents a simulation study comparing the proposed methods with existing ones, and Section 6 illustrates their application using international trade data. Finally, Section 7 concludes with key findings and possible future directions.
2. Existing two stage estimators
The brief detail of some existing ratio and two stage sampling estimators for the specified three cases are
2.1. Sukhatme et al. (1984)—Two stage estimator
2.1.1. Case-I.
The two stage estimator [31] is given below when FSU are unequal and weighted mean is used
the Mean Square Error (MSE) as
2.2. Sukhatme et al. (1984)—Two stage ratio estimators
2.2.1. Case-I.
The two stage estimator when FSU are unequal and weighted mean is used can be written as
the MSE can be inscribed as
3. Proposed simple ratio estimators in two stage sampling using EWMA statistic
In this section, generalized memory type ratio estimators have been recommended for two stage sampling using [31].
3.1. Notations
3.1.1. Case-I: Equal FSU and weighted mean is used.
Let a population consists of N called FSU and each FSU comprises of Mi named as SSU. Assume a sample of n FSU and a sample of mi SSU from each of n FSU selected by allocating the weights to the SSU respectively. Suppose that the selection of units at each stage has been done using simple random sampling. The following notations are used to obtain the proposed estimators.
is the value of jth SSU in the ith FSU (j = 1, 2…, M & i = 1, 2…, N).
&
are the means per SSU in the ith FSU in population and sample respectively.
= the mean per SSU in population and sample.
is the average of SSU in population.
, denotes the mean per SSU in the ith FSU in the sample.
is the mean per SSU in sample.
represents the EWMA statistic per SSU in the sample for auxiliary information.
is the EWMA statistic for study variable per SSU in the sample.
Let us consider, and the respective expectations are
where,
3.1.2. Case-II: Unequal FSU and unweighted mean is used.
For case-II, assume the equal weights for all unequal FSU to obtain an unweighted and biased estimator of population mean in two-stage sampling plan. The procedure for the two stage sampling design in this situation is the same as explained previously in case-I. The notations and expectations for this case can be defined as
is the mean per SSU in population.
is the mean per SSU in sample.
Now, let and the respective expectations are
where,
3.1.3. Case-III: Equal sizes FSU.
Suppose that a population comprises of N FSU and each FSU consists of M, SSU and let a sample of n FSU is selected and a sample of m SSU from each of n FSU. Assume that the units have been selected through simple random sampling technique. To obtain the expressions of biases and MSE’s under two stage random sampling, the following symbolizations is used. Consider,
is the mean per SSU in population
is the mean per SSU in sample.
Now, let and the respective expectations are
where,
3.2. The Bias and MSE for proposed two stage ratio estimators
The suggested ratio estimator for two stage sampling design using memory type at second stage for the following three cases are
3.2.1. Case-I.
The ratio estimator may be defined as
using the notations mentioned in section (3.1.1), the estimator in Eq. (16) can be written as
now, expanding the series, , up to second order, we obtained
by simplifying the Eq. (18), we get
take expectation on both sides, we obtained the Bias () as
squaring and taking expectation on both sides in Eq. (17), ignoring square and higher terms, we have the MSE expression for the proposed estimator as
3.2.2. Case-II.
The ratio estimator in this case can be inscribed as
by utilizing the notations in section (3.1.2), the bias and MSE of estimator given in Eq. (22) may be defined as
3.2.3. Case-III.
The ratio estimator for case-III can be defined as
by utilizing the notations in section (3.1.3), the bias and MSE of estimator given in Eq. (25) may be defined as
4. Proposed exponential estimators in two stage sampling using EWMA statistic
This section introduced the proposed generalized exponential estimators separately by following [32] using single auxiliary variable for three different cases in two stage sampling as mentioned previously.
4.1. Case-I
The modified exponential ratio estimator in two stage sampling scheme using memory type estimator can be defined in the form of as
by expanding the series up to one degree and ignoring the higher terms, we get
for bias, subtract and applying expectation in the equation and expanding the exponential series including the square terms, we have
the obtained expression for bias is
the MSE of the proposed estimator using the first order approximation may be written as
5. Simulation study
A simulation study is conducted to evaluate the performances of proposed estimators in two stage sampling scheme using EWMA statistic followed by [23]. The comparisons of proposed estimators with existing two stage ratio and exponential estimators are done. The results of MSEs are presented in the Tables 1–4. The formula to calculate the MSEs of estimators for case-I, case-II & case-III, is
where, .
The population generated from bivariate normal distribution, with and ρ = 0.1, 0.3, 0.5, 0.7, 0.9, of size 1500 consists of 25 clusters with equal and unequal FSU’s in each cluster. Suppose that FSS, n, of size 4, 9 is selected then a SSS of 100 is selected from each cluster. To check the efficiency of the suggested estimator, a simulation study is designed to estimate MSE’s. We use λ = 0.15, 0.35, 0.55, 0.75, 0.95 for EWMA statistic.
The computed results are given in Tables 1–8, represents the MSE’s of memory type ratio estimators under two stage sampling scheme using EWMA statistic. It can be observed from the results that MSEs of the proposed memory type ratio estimators in two stage sampling are smaller than two stage ratio estimators proposed by [31] and [5]. The proposed estimators’ outperformed as compared to some present estimators.
6. Real life application
We have used the data presented in [33] to check the efficiency of the proposed estimators. There are 7 continents considered as FSUs and 124 countries are selected according to locations from these clusters of continents as SSUs. There is only one country in the 7th continent therefore; it is positioned in the 6th continent. The data of import and export in the year 1983 are considered as study and auxiliary variables respectively. Further, the data is divided into 6 clusters, say N, n = 2 and M0 = 124 and = 20.67. The characteristics of the population data are presented in Table 9.
On the basis of the information mentioned in Table 9, we have calculated theMSE’s of the proposed ratio estimator for unqual FSUs and the previously develpoed estimators by [5] and [31] demonstrated in Table 10. We used different values of λ = 0.3, 0.5, 0.75, 0.9 to estimate the efficiency of the proposed two stage ratio estimator using EWMA statistic. In the comparison presented in Table 10, it can be observed that the proposed estimator perform efficiently than the other estimators in terms of minimum MSEs.
7. Conclusions and prespectives
Two-stage cluster sampling is an essential design in large-scale surveys, and incorporating both current and past information can substantially improve estimation accuracy. In this study, memory-type two-stage ratio and exponential estimators were proposed using the EWMA statistic under three different cases of the two-stage sampling scheme. A simulation study demonstrated that the proposed estimators are efficient across different values of , consistently achieving minimum MSE compared to existing approaches. A real-life application further confirmed their precision and practical utility. Overall, the findings highlight that the proposed ratio estimators provide more efficient and reliable results than previously developed estimators in two-stage sampling.
Future research could extend this work by exploring memory-type estimators with alternative smoothing statistics beyond EWMA, such as hybrid or adaptive weighting schemes. Another promising direction is to investigate the performance of the proposed estimators under more complex sampling designs, including stratified two-stage cluster sampling or unequal probability selection. Applications to big data and high-dimensional survey settings may also provide valuable insights. Finally, incorporating robust estimation methods to handle outliers, measurement errors, and non-response would further enhance the practical relevance of memory-type estimators.
References
- 1.
Okafor FC. Sample Survey Theory with Application. Nsukka: Afro-Orbis Publications Limited; 2002.
- 2. Ugwu MC, Madukaife MS. Two-stage cluster sampling with unequal probability sampling in the first stage and ranked set sampling in the second stage. Statistic Trans New Series. 2022;23(3):199–214.
- 3. Chaudhary MK, Singh VK, Kadilar C. Study of general families of estimators in two-stage sampling with unequal size clusters under non-response. Hacettepe J Math Stat. 2016;46(117):1–15.
- 4. Aditya A, Sud UC, Chandra H. Calibration approach based estimation of finite population total under two stage sampling. J Ind Soc Agri Stat. 2016;70(3):219–26.
- 5. Jabeen R, Sanaullah A, Hanif M. Generalized estimator for estimating population mean under two stage sampling. Pakistan J Stat. 2014;30(4):465–86.
- 6. Jabeen R, Sanaullah A, Hanif M. Efficient class of exponential estimators for population mean in two-stage cluster sampling. Pakistan J Stat. 2015;31(6).
- 7. Devi M, Sisodia BVS. Estimation of population mean in two– stage sampling under a deterministic response mechanism in the presence of non-response. J Appl Nat Sci. 2017;9(3):1666–75.
- 8. Hansen MH, Hurwitz WN. The problem of non-response in sample surveys. J Am Stat Assoc. 1946;41(236):517–29. pmid:20279350
- 9. Taconeli CA, Cabral ADS. New two-stage sampling designs based on neoteric ranked set sampling. J Stat Comput Simulat. 2018;89(2):232–48.
- 10. Ozturk O. Two-stage cluster samples with ranked set sampling designs. Ann Inst Stat Math. 2017;71(1):63–91.
- 11. Kim JK, Rao JNK, Kwon Y. Analysis of Clustered Survey Data Based on Two-Stage Informative Sampling and Associated Two-Level Models. J Royal Stat Soc Series A. 2022;185(4):1522–40.
- 12. Nawaz H, Rabia M, Amir A, Aslam M. Two-stage sampling exponential and chain ratio type estimators in the presence of measurement error. J Stat Manag Syst. 2021;25(1):285–300.
- 13. Ahmad S, Shabbir J, Zahid E, Aamir M, Alqawba M. New generalized class of estimators for estimation of finite population mean based on probability proportional to size sampling using two auxiliary variables: A simulation study. Sci Prog. 2023;106(4):368504231208537. pmid:37885238
- 14. Noor-ul-Amin M. Memory type ratio and product estimators for population mean for time-based surveys. J Stat Comput Simulat. 2020;90(17):3080–92.
- 15. Aslam I, Noor-ul-Amin M, Yasmeen U, Hanif M, et al. Memory Type Ratio and Product Estimators in Stratified Sampling. J Reliab Stat Stud. 2020;13:1–20.
- 16. Noor-ul-Amin M. Memory type estimators of population mean using exponentially weighted moving averages for time scaled surveys. Commun Stat Theor Method. 2019;50(12):2747–58.
- 17. Singh GN, Bhattacharyya D, Bandyopadhyay A, Khalid M. Study of a Memory Type Shrinkage Estimator of Population Mean in Quality Control Process. IEEE Access. 2021;9:161555–64.
- 18. Shahzad N, Zaidi A, Zia S, et al. Memory type estimator of population mean using exponentially weighted moving averages in two-phase sampling. J Positive School Psychol. 2022;6(10):1176–92.
- 19. Bhushan S, Kumar A. Novel Log Type Class Of Estimators Under Ranked Set Sampling. Sankhya B. 2021;84(1):421–47.
- 20. Bhushan S, Kumar A, Alrumayh A, Khogeer HA, Onyango R. Evaluating the performance of memory type logarithmic estimators using simple random sampling. PLoS One. 2022;17(12):e0278264. pmid:36520778
- 21. Qureshi MN, Alamri OA, Riaz N, Iftikhar A, Tariq MU, Hanif M. Memory-type variance estimators using exponentially weighted moving average statistic in presence of measurement error for time-scaled surveys. PLoS One. 2023;18(11):e0277697. pmid:37944483
- 22. Kumar A, Anshika, Emam W, Tashkandy Y. Memory type general class of estimators for population variance under simple random sampling. Heliyon. 2024;10(16):e36090. pmid:39247371
- 23. Zahid R, Noor-Ul-Amin M, Khan I, AlQahtani SA, Pathak PK, Rahimi J. Combination of memory type ratio and product estimators under extended EWMA statistic with application to wheat production. Sci Rep. 2023;13(1):13547. pmid:37599295
- 24. Noor-ul-Amin M, Arshad A. Homogeneously Weighted Moving Average- Variance Control Chart Using Auxiliary Information. Commun Stat Simulat Comput. 2023;52(10):48901–4908.
- 25. Riaz A, Noor-Ul-Amin M, Emam W, Tashkandy Y, Yasmeen U, Rahimi J. Adaptive EWMA control chart for monitoring the coefficient of variation under ranked set sampling schemes. Sci Rep. 2023;13(1):17617. pmid:37848515
- 26. Noor-Ul-Amin M, Khan I, Alzahrani ARR, Ayari-Akkari A, Ahmad B. Risk adjusted EWMA control chart based on support vector machine with application to cardiac surgery data. Sci Rep. 2024;14(1):9633. pmid:38671182
- 27. Noor-Ul-Amin M, Kazmi MW, Alkhalaf S, Abdel-Khalek S, Nabi M. Machine learning based parameter-free adaptive EWMA control chart to monitor process dispersion. Sci Rep. 2024;14(1):31271. pmid:39732838
- 28. Singh P, Sharma P, Maurya P. Enhancing Accuracy in Population Mean Estimation with Advanced Memory Type Exponential Estimators. J Reliab Stat Stud. 2025;17(2):417–34.
- 29. Kumar A, Bhushan S, Alomair AM. Optimal class of memory type imputation methods for time-based surveys using EWMA statistics. Scientific Rep. 2024;14(25740).
- 30. Singh P, Maurya P, Sharma P. Addressing non-response and measurement errors in time-scale surveys. Discover Appl Sci. 2025;7(374).
- 31.
Sukhatme PV, Sukhatme BV, Sukhatme S, et al. Sampling Theory of Surveys with Applications. Iowa State University Press; 1984.
- 32. Bahl S, Tuteja RK. Ratio and Product Type Exponential Estimators. J Inf Optim Sci. 1991;12(1):159–64.
- 33.
Sarndal CE, Swensson B, Wretman J. Model Assisted Survey Sampling. Springer-Verlag, New York, Inc. Appendix D. 1992. p. 662.