Correction
17 Sep 2024: Qureshi MN, Alamri OA, Riaz N, Iftikhar A, Tariq MU, et al. (2024) Correction: Memory-type variance estimators using exponentially weighted moving average statistic in presence of measurement error for time-scaled surveys. PLOS ONE 19(9): e0310870. https://doi.org/10.1371/journal.pone.0310870 View correction
Figures
Abstract
The present study suggested memory-type ratio and product estimators for variance estimation in the presence of measurement errors. We applied the exponentially weighted moving averages statistic which simultaneously utilizes the current and prior information for better estimation in surveys based on the time-scale. The expressions of approximate mean square errors of memory-type estimators are derived using Taylor series up to first order. Mathematical conditions are also obtained for which the suggested memory-type ratio and product estimators perform better than the conventional ratio and product estimators. The efficiency of the proposed estimators is observed using an extensive simulation study in the presence of measurement errors. A real data application is also carried out to support the mathematical expressions. From the results, it is shown that the use of prior sample information significantly increased the efficiency of the proposed estimators.
Citation: Qureshi MN, Alamri OA, Riaz N, Iftikhar A, Tariq MU, Hanif M (2023) Memory-type variance estimators using exponentially weighted moving average statistic in presence of measurement error for time-scaled surveys. PLoS ONE 18(11): e0277697. https://doi.org/10.1371/journal.pone.0277697
Editor: Srinivasa Rao Gadde, The University of Dodoma, UNITED REPUBLIC OF TANZANIA
Received: May 18, 2022; Accepted: November 2, 2022; Published: November 9, 2023
Copyright: © 2023 Tariq et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Data are all contained within the paper.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
1. Introduction
Surveys are relatively inexpensive and useful to describe the characteristics of large populations. It plays a vital role in many disciplines to gather the information for the estimation of unknown parameters of the population. Many surveys have been designed and administered in many modes and repeated under the similar approach with equal time space all over the globe. For instance, the Australian Bureau of Statistics and the National Bureau of Statistics of China conducted health, economic and agricultural surveys annually, quarterly, and monthly. Every year, the Pakistan Bureau of Statistics conducts the Labor Force Survey (LFS) to measure the different statistics of Pakistan. Bureau of Statistics, Punjab, has been conducting Multiple Indicator Cluster Surveys (MICS) regularly since 2004.
In survey sampling, statisticians usually effort to enhance the performance of the estimators by using the auxiliary information correlated with the study variable. In practice, the researchers disposed to get information on many variables rather than the study variable only. Ratio-type estimators are considered to be good example for the situation having a positive correlation between the auxiliary variable and the study variable whereas product-type estimators are assumed to be better for the negatively correlated variables.
The estimation of variance is important when the variability control is difficult in its application such as in agricultural research, experiments in biosciences, manufacturing industries, and pharmaceutical laboratories. The estimation of population variance in the study variable also received significant interest from various authors such as [1–11].
Measurement errors (MEs) are defined as the difference between observed observations and the true observations of the study variable [12]. It has severe effects on the estimation of population parameters in terms of increased bias and variation. It is essential to study the impact of ME to develop better estimation techniques and to get more reliable and efficient estimates of the underlying population parameters. Many authors have studied the effect of ME along with the estimation of population parameters. One may refer to [13–18]. Several researchers like [19–24] contribute to the variance estimation of the concerned variable in the presence of ME.
Initially, [25] introduced the idea of EWMA statistic and recycled the current and previous information to observe the change in the process mean. The memory-type ratio and product estimators for mean estimation were suggested by [26] using EWMA statistic under simple random sampling without replacement (SRSWOR). Similarly, [27] proposed the memory type ratio and product estimators for population mean using stratified sampling.
The EWMA statistic for population variance estimation of the study variable with 0≤λ≤1 is defined as:
where
is the current sample variance and λ is the weighted parameter or smoothing constant of the given observations. As λ increases from 0 to 1, the larger weight is given to current information and simultaneously smaller weight to the past observations. For λ = 1, all the weight is received by current information and EWMA statistic performs the same as the conventional variance estimator. Here, t represents the sample numbers and Vt−1 reveals the past observations. The initial value V0 is taken as one or the average value of the prior sample.
In this study, memory-type ratio and product estimators are suggested in the presence of ME for the time-scaled surveys. After a brief discussion on surveys, auxiliary information, variance estimation, ME, and EWMA statistics, the rest of the paper is as follows. Section 2 is based on the sampling scheme, notations, and conventional estimators under ME. Proposed estimators along with the derivation of approximate mean square error (MSE) are given in Section 3. Mathematical comparisons between the proposed memory-type estimators and the corresponding conventional estimators are given in Section 4. A simulation study is conducted in Section 5 to assess the performance of the proposed estimators over the conventional estimators. Real data application is also used in Section 6 to validate the findings of the mentioned estimators obtained through simulations. Final remarks are given in Section 7.
2. Sampling procedure and conventional estimators in the presence of ME
Let Y and X be the study variable and the auxiliary variable respectively defined on N identifiable but distinct units of a population U = {U1,U2,U3…,UN}. Let n pairs of observations be obtained by using SRSWOR on two variables Y and X. Consider a situation where both Y and X variables are observed with some considerable error. Let for the ith sampling unit (i=1, 2 …n), yi and xi be the observed instead of true observations Yi and Xi. The MEs may be defined as
and
where ui and vi are the associated MEs with constant or zero mean and known variances
and
respectively. Following [28], it is assumed that the errors ui and vi are independent of each other and also independent of Yi and Xi. This implies that COV(X, Y) ≠ 0 and COV(X, U) = COV(X, V) = COV(U, Y) = COV(V, Y) = COV(U, V) = 0. It is further assumed that the finite population correction is neglected. The expected values of
and
in the presence of MEs are defined as
and
It is assumed that the error variances, ad
of the study and the auxiliary variables are known. The unbiased estimator of
and
are respectively given by
and
Let the sampling errors for the study and auxiliary variables are defined as
Such that: E(ξo) = 0, E(ξ1) = 0, and
,
where
,
where z = Y, X, U, V.
The derivations of the above error terms are given in the S1 Appendix.
The usual sample variance estimator with the equation of its variance in the presence of ME is
and
The classical ratio estimator given by [29] under ME is defined as
The expression MSE for ratio estimator under ME is
Similarly, the product estimator along with the expression of MSE in the presence of ME is given as
and
3. Proposed memory-type estimators
In this section, we processed the EWMA statistic for both study and the auxiliary variables to propose the memory-type estimators in the presence of ME. The EMWA statistic for the study variable is defined as
Similarly, the EWMA statistic for the auxiliary variable x is defined as
A pilot study must have reliable results before conducting every survey. The initial values of and
are taken as average values estimated from the pilot study or taken as one. The memory-type ratio and product estimators based on EWMA statistic using ME are defined as
where
is assumed to be the known population variance of the auxiliary variable. To derive the approximate mean square errors of both memory-type estimators, let us define the following error terms as
(1)
We may rewrite the memory-type ratio estimator in terms of sampling errors as
(2)
Simplifying and applying Taylor series up to the second-order on Eq (2), we have
(3)
Shortening and applying expectation on both sides of Eq (3), we have the MSE of as
(4)
The
in the presence of ME as
(5)
Similarly, the MSE of the memory-type product estimator is
(6)
4. Mathematical comparisons
The efficiency comparisons of the proposed memory-type estimators over the conventional estimators in terms of MSE are
- The proposed memory-type ratio estimator will perform better than the conventional ratio estimator if
- The proposed memory-type product estimator will perform better than the conventional product estimator if
As λ value ranges from 0 to 1. Therefore, under the condition, λ<1, the proposed memory-type estimator would always be more efficient than the conventional ratio estimator in presence of ME. But if we take λ = 1, the performance of both memory-type and usual estimators will be equal.
5. Simulation study
An extensive simulation study is conducted to evaluate the performance of the proposed memory-type estimators in the presence of ME. The MSE and relative efficiency (RE) of the proposed and existing estimators are computed by using the following formulas with 50,000 replications as
and
where
The results of memory-type ratio and product estimators are computed at different levels of correlation ρYX(= 0.80,0.85,0.90 and 0.95) and weight parameter λ(= 0.10,0.20,0.50,0.75) by using the following algorithm:
- A population of size N = 5000 is generated using multivariate normal distribution having
- Different values have opted for weight constant λ.
- Sample of size n (= 50, 100, 200, 300, and 500) is considered to estimate the unknown parameter.
- Fifty thousand iterations are used to compute the proposed estimator under ME from the samples obtained in Step 3.
- The MSE and RE of each estimator are calculated for each sample size and presented in Tables 1–4.
From the numerical findings summarized in Tables 1 and 2, it is observed that the MSEs of the memory-type ratio estimator are less than the MSEs of the conventional ratio estimator in the presence of ME. Equivalently, the REs presented in Tables 3 and 4 shown that the proposed ratio estimator is more efficient than the conventional ratio estimator. The effect of ρYX is observed as it increases up to 1, the MSE decreases and the efficiency of the proposed memory-type ratio estimator increases which revealed that the use of the auxiliary variable, correlated with the survey variable may enhance the efficiency of the proposed estimator. The role of sample size is significant in the efficiency of the proposed estimator. The MSEs of the proposed estimator decrease as the sample size increases. Similarly, the weight λ assigned to the current and previous values has the great impact on the efficiency of the proposed memory-type ratio estimator as shown in Tables 3 and 4. It is further noticed that, when the value of λ goes down, the larger weight is assigned to previous values and hence increases the efficiency of the proposed estimators. For λ = 1, the efficiency of proposed estimators based on EWMA statistic will be as good as conventional estimators.
To check the efficiency of the memory-type product estimator, we generated a bivariate population of size 1000 having negative correlation between the concerned variable and the auxiliary variable. The detail of the population parameter is
The results of MSE and RE of proposed and conventional ratio and product estimators are summarized in Tables 5 and 6 displayed that the product type estimators perform better than the ratio estimators in the presence of ME for the population having a negatively correlated data with the study variable. Moreover, the proposed memory-type product estimator found to be more efficient than the conventional product estimator on various sample sizes and λ. The MSEs of the product eestimator decreases as the sample size increases from 50 to 500.
6. Real-data application
A real data is taken from [30] to confirm the application of memory-type ratio estimator proposed in this paper. The data set contains the true consumption expenditure as study variable Y and true income as an auxiliary variable X whereas the variables contaminated with ME are taken y as measured consumption expenditure and x as measured income with the respective population parameters are N = 10, μy = 127, μx = 170,
and ρyx = 0.964.
By considering regular time intervals, twenty samples, each of size 4, with λ = 0.2 are chosen by using SRSWOR. Here, we only considered the conventional and proposed memory-type ratio estimators as the correlation coefficient between the study and the auxiliary variables is positive. The results of all the estimators (except the product-type estimators) in the presence and absence of ME are computed and reported in Table 7. It is clear from the numerical findings given in Table 8, that the proposed memory-type ratio estimator under ME is more efficient than the conventional ratio estimator in terms of MSE and RE.
7. Conclusion
In this paper, we have suggested memory-type ratio and product estimators based on EWMA statistics for estimating the population variance in the presence and absence of ME. The proposed estimators use the current as well as previous sample information. The reported results show that the proposed estimators estimated the population variance more efficiently than the conventional estimators. The ratio (memory-type and conventional) estimators perform well for the populations having a positive correlation between the study and the auxiliary variables, whereas the product (memory-type and conventional) estimators perform better for negatively correlated populations. Further, the REs of the proposed estimators increase, and the MSEs decrease as the sample size increases. The results of real data application confirm the findings obtained from the simulation. Consequently, based on numerical results, it is suggested that the proposed memory-type ratio and product estimators based on EWMA statistics are superior and more useful for estimating population variance for the time-scaled surveys.
Current manuscript uses only one auxiliary variable to estimate the population variance. This approach can be extended to estimate various population characteristics using two or more auxiliary variables in the presence of ME and non-response.
References
- 1. Kadilar C, Cingi H. Ratio estimators for the population variance in simple and stratified random sampling. Applied Mathematics and Computation. 2006;173(2):1047–1059. https://doi.org/10.1016/j.amc.2005.04.032.
- 2. Shabbir J, Gupta S A T. On Improvement in Variance Estimation. Communications in Statistics - Theory and Methods. 2007;36(12):2177–2185. https://doi.org/10.1080/03610920701215092.
- 3. Yadav S K, Kadilar C. Improved exponential type ratio estimator of the population variance. Revista Colombiana de Estadistica. 2013;36(1):145–152.
- 4. Adichwal N K, Sharma P, Singh R. Generalized Class of Estimators for Population Variance Using Information on Two Auxiliary Variables. International Journal of Applied and Computational Mathematics. 2017; 3(2):651–661. https://doi.org/10.1007/s40819-015-0119-6.
- 5. Singh H P, Pal S K, Yadav A. A study on the chain ratio-ratio-type exponential estimator for finite population variance. Communications in Statistics - Theory and Methods. 2017;47(6):1442–1458. https://doi.org/10.1080/03610926.2017.1321124.
- 6. Ismail M, Kanwal N, Shahbaz M Q. Generalized ratio-product-type estimator for variance using auxiliary information in simple random sampling. Kuwait Journal of Science. 2018;45(1):79–88.
- 7. Bhat M A. An Improvement in Variance Estimator for the Estimation of Population Variance, Using Known Values of Auxiliary Information. International Journal of Pure & Applied Bioscience. 2018; 6(5):135–138. https://doi.org/10.18782/2320-7051.6842.
- 8. Javed K, Jamal N, Hanif M, Ali M, Shahzad U, Victoria Garcia Luengo A. Improved Estimator of Finite Population Variance Using Coefficient of Quartile Deviation. Asian Journal of Advanced Research and Reports. 2018;1(3):1–6. https://doi.org/10.9734/ajarr/2018/v1i313065.
- 9. Yadav S K, Sharma D K, Mishra S S. Searching efficient estimator of population variance using tri-mean and third quartile of auxiliary variable. International Journal of Business and Data Analytics. 2019;1(1): 30. https://doi.org/10.1504/ijbda.2019.098830.
- 10. Qureshi M N, Riaz N Hanif M. Generalized estimator for the estimation of rare and clustered population variance in adaptive cluster sampling. Journal of Statistical Computation and Simulation. 2019;89(11):2084–2101.
- 11. Singh G N, Bhattacharyya D, Bandyopadhyay A. Calibration estimation of population variance under stratified successive sampling in presence of random non response. Communications in Statistics - Theory and Methods. 2020;0(0):1–23. https://doi.org/10.1080/03610926.2020.1719158.
- 12. Hansen M H, William N, Hurwitz E, Marks W P M. Response Errors in Survey. JASA. 1951; 46: 147 – 190.
- 13. Singh H P, Karpe N. A class of estimators using auxiliary information for estimating finite population variance in presence of measurement errors. Communications in Statistics - Theory and Methods. 2009a;38(5):734–741. https://doi.org/10.1080/03610920802290713.
- 14. Misra S, Yadav D K, ika D, Kr Shukla A. On Estimation of Population Coefficient of Variation in Presence of Measurement Errors. International Journal of Mathematics Trends and Technology. 2017;51(4):307–311. https://doi.org/10.14445/22315373/ijmtt-v51p540.
- 15. Khalil S, Noor-ul-Amin M, Hanif M. Estimation of population mean for a sensitive variable in the presence of measurement error. Journal of Statistics and Management Systems. 2018;21(1):81–91. https://doi.org/10.1080/09720510.2017.1367478.
- 16. Zahid E, Shabbir J. Estimation of finite population mean for a sensitive variable using dual auxiliary information in the presence of measurement errors. PLoS ONE. 2019;14(2):1–17. pmid:30742674
- 17. Khalil S, Gupta S, Hanif M. Estimation of finite population mean in stratified sampling using scrambled responses in the presence of measurement errors. Communications in Statistics Theory and Methods. 2019;48(6):1553–1561.
- 18. Singh R, Bouza C, Mishra M. Estimation in stratified random sampling in the presence of errors. Revista Investigacion Operacional. 2020; 41(1):123–135.
- 19. Singh H P, Karpe N. Estimation of Population Variance Using Auxiliary Information in the Presence of Measurement Errors. Statistics in Transition. 2008;9(3):443–470.
- 20. Diana G, Giordan M. Finite population variance estimation in presence of measurement errors. Communications in Statistics - Theory and Methods. 2012;41(23):4302–4314. https://doi.org/10.1080/03610926.2011.573165.
- 21. Sharma P, Singh R. A generalized class of estimators for finite population variance in presence of measurement errors. Journal of Modern Applied Statistical Methods. 2013;12(2):231–241. https://doi.org/10.22237/jmasm/1383279120.
- 22. Singh H P, Pal S K. Improved estimation of finite population variance using auxiliary information in presence of measurement errors. Investigacion Operacional. 2016;37(2):147–162.
- 23. Misra S, Kumari D, Yadav D K. Some Improved Estimators for Estimating Population Variance in the Presence of Measurement Errors. Journal of Statistics Applications & Probability. 2016;5(2):311–320. https://doi.org/10.18576/jsap/050212.
- 24. Masood S, Shabbir J. Generalized multi-phase regression-type estimators under the effect of measuemnent error to estimate the population variance. Hacettepe Journal of Mathematics and Statistics. 2016;45(4):1297–1306. https://doi.org/10.15672/HJMS.201612116352.
- 25. Roberts S W. Control Chart Tests Based on Geometric Moving Averages. Technometrics 1. 1959;3(2):39–50.
- 26. Noor-ul-amin M. Memory type estimators of population mean using exponentially weighted moving averages for time scaled surveys. Communications in Statistics - Theory and Methods. 2019;47(24):1–12. https://doi.org/10.1080/03610926.2019.1670850.
- 27. Aslam I, Noor-ul-Amin M, Yasmeen U, Hanif M. Memory type ratio and product estimators in stratified sampling. Journal of Reliability and Statistical Studies. 2020; 13(1):1–15.
- 28. Singh H P, Karpe N. A General Procedure for Estimating the General Parameter Using Auxiliary Information in Presence of Measurement Errors. Communications for Statistical Applications and Methods. 2009b;16(5):821–840. https://doi.org/10.5351/ckss.2009.16.5.821.
- 29. Isaki C T. Variance estimation using auxiliary information. Journal of the American Statistical Association. 1983; 78(381):117–123. https://doi.org/10.1080/01621459.1983.10477939.
- 30.
Gujarati D N. Basic Econometrics (Fourth Edi). The McGraw−Hill Companies. 2004.