Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Memory-type variance estimators using exponentially weighted moving average statistic in presence of measurement error for time-scaled surveys

  • Muhammad Nouman Qureshi ,

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Visualization, Writing – review & editing

    qures089@umn.edu

    Affiliation School of Statistics, University of Minnesota, Minneapolis, Minnesota, United States of America

  • Osama Abdulaziz Alamri,

    Roles Formal analysis, Funding acquisition, Investigation

    Affiliation Statistics Department, Faculty of Science, University of Tabuk, Tabuk, Saudi Arabia

  • Naureen Riaz,

    Roles Conceptualization, Funding acquisition, Methodology, Resources, Visualization

    Affiliation Department of Mathematics, Lahore Garrison University, Lahore, Pakistan

  • Ayesha Iftikhar,

    Roles Conceptualization, Formal analysis, Funding acquisition, Investigation, Resources, Validation

    Affiliation Lahore Business School, University of Lahore, Lahore, Pakistan

  • Muhammad Umair Tariq,

    Roles Methodology, Visualization

    Affiliation Department of Statistics, National College of Business Administration and Economics, Lahore, Pakistan

  • Muhammad Hanif

    Roles Conceptualization, Project administration, Supervision, Validation, Visualization

    Affiliation Department of Statistics, National College of Business Administration and Economics, Lahore, Pakistan

Correction

17 Sep 2024: Qureshi MN, Alamri OA, Riaz N, Iftikhar A, Tariq MU, et al. (2024) Correction: Memory-type variance estimators using exponentially weighted moving average statistic in presence of measurement error for time-scaled surveys. PLOS ONE 19(9): e0310870. https://doi.org/10.1371/journal.pone.0310870 View correction

Abstract

The present study suggested memory-type ratio and product estimators for variance estimation in the presence of measurement errors. We applied the exponentially weighted moving averages statistic which simultaneously utilizes the current and prior information for better estimation in surveys based on the time-scale. The expressions of approximate mean square errors of memory-type estimators are derived using Taylor series up to first order. Mathematical conditions are also obtained for which the suggested memory-type ratio and product estimators perform better than the conventional ratio and product estimators. The efficiency of the proposed estimators is observed using an extensive simulation study in the presence of measurement errors. A real data application is also carried out to support the mathematical expressions. From the results, it is shown that the use of prior sample information significantly increased the efficiency of the proposed estimators.

1. Introduction

Surveys are relatively inexpensive and useful to describe the characteristics of large populations. It plays a vital role in many disciplines to gather the information for the estimation of unknown parameters of the population. Many surveys have been designed and administered in many modes and repeated under the similar approach with equal time space all over the globe. For instance, the Australian Bureau of Statistics and the National Bureau of Statistics of China conducted health, economic and agricultural surveys annually, quarterly, and monthly. Every year, the Pakistan Bureau of Statistics conducts the Labor Force Survey (LFS) to measure the different statistics of Pakistan. Bureau of Statistics, Punjab, has been conducting Multiple Indicator Cluster Surveys (MICS) regularly since 2004.

In survey sampling, statisticians usually effort to enhance the performance of the estimators by using the auxiliary information correlated with the study variable. In practice, the researchers disposed to get information on many variables rather than the study variable only. Ratio-type estimators are considered to be good example for the situation having a positive correlation between the auxiliary variable and the study variable whereas product-type estimators are assumed to be better for the negatively correlated variables.

The estimation of variance is important when the variability control is difficult in its application such as in agricultural research, experiments in biosciences, manufacturing industries, and pharmaceutical laboratories. The estimation of population variance in the study variable also received significant interest from various authors such as [111].

Measurement errors (MEs) are defined as the difference between observed observations and the true observations of the study variable [12]. It has severe effects on the estimation of population parameters in terms of increased bias and variation. It is essential to study the impact of ME to develop better estimation techniques and to get more reliable and efficient estimates of the underlying population parameters. Many authors have studied the effect of ME along with the estimation of population parameters. One may refer to [1318]. Several researchers like [1924] contribute to the variance estimation of the concerned variable in the presence of ME.

Initially, [25] introduced the idea of EWMA statistic and recycled the current and previous information to observe the change in the process mean. The memory-type ratio and product estimators for mean estimation were suggested by [26] using EWMA statistic under simple random sampling without replacement (SRSWOR). Similarly, [27] proposed the memory type ratio and product estimators for population mean using stratified sampling.

The EWMA statistic for population variance estimation of the study variable with 0≤λ≤1 is defined as: where is the current sample variance and λ is the weighted parameter or smoothing constant of the given observations. As λ increases from 0 to 1, the larger weight is given to current information and simultaneously smaller weight to the past observations. For λ = 1, all the weight is received by current information and EWMA statistic performs the same as the conventional variance estimator. Here, t represents the sample numbers and Vt−1 reveals the past observations. The initial value V0 is taken as one or the average value of the prior sample.

In this study, memory-type ratio and product estimators are suggested in the presence of ME for the time-scaled surveys. After a brief discussion on surveys, auxiliary information, variance estimation, ME, and EWMA statistics, the rest of the paper is as follows. Section 2 is based on the sampling scheme, notations, and conventional estimators under ME. Proposed estimators along with the derivation of approximate mean square error (MSE) are given in Section 3. Mathematical comparisons between the proposed memory-type estimators and the corresponding conventional estimators are given in Section 4. A simulation study is conducted in Section 5 to assess the performance of the proposed estimators over the conventional estimators. Real data application is also used in Section 6 to validate the findings of the mentioned estimators obtained through simulations. Final remarks are given in Section 7.

2. Sampling procedure and conventional estimators in the presence of ME

Let Y and X be the study variable and the auxiliary variable respectively defined on N identifiable but distinct units of a population U = {U1,U2,U3…,UN}. Let n pairs of observations be obtained by using SRSWOR on two variables Y and X. Consider a situation where both Y and X variables are observed with some considerable error. Let for the ith sampling unit (i=1, 2 …n), yi and xi be the observed instead of true observations Yi and Xi. The MEs may be defined as and where ui and vi are the associated MEs with constant or zero mean and known variances and respectively. Following [28], it is assumed that the errors ui and vi are independent of each other and also independent of Yi and Xi. This implies that COV(X, Y) ≠ 0 and COV(X, U) = COV(X, V) = COV(U, Y) = COV(V, Y) = COV(U, V) = 0. It is further assumed that the finite population correction is neglected. The expected values of and in the presence of MEs are defined as and

It is assumed that the error variances, ad of the study and the auxiliary variables are known. The unbiased estimator of and are respectively given by and

Let the sampling errors for the study and auxiliary variables are defined as

Such that: E(ξo) = 0, E(ξ1) = 0, and , where , where z = Y, X, U, V.

The derivations of the above error terms are given in the S1 Appendix.

The usual sample variance estimator with the equation of its variance in the presence of ME is and

The classical ratio estimator given by [29] under ME is defined as

The expression MSE for ratio estimator under ME is

Similarly, the product estimator along with the expression of MSE in the presence of ME is given as and

3. Proposed memory-type estimators

In this section, we processed the EWMA statistic for both study and the auxiliary variables to propose the memory-type estimators in the presence of ME. The EMWA statistic for the study variable is defined as

Similarly, the EWMA statistic for the auxiliary variable x is defined as

A pilot study must have reliable results before conducting every survey. The initial values of and are taken as average values estimated from the pilot study or taken as one. The memory-type ratio and product estimators based on EWMA statistic using ME are defined as where is assumed to be the known population variance of the auxiliary variable. To derive the approximate mean square errors of both memory-type estimators, let us define the following error terms as (1)

We may rewrite the memory-type ratio estimator in terms of sampling errors as (2)

Simplifying and applying Taylor series up to the second-order on Eq (2), we have (3)

Shortening and applying expectation on both sides of Eq (3), we have the MSE of as (4) The in the presence of ME as (5)

Similarly, the MSE of the memory-type product estimator is (6)

The in the presence of ME as (7)

4. Mathematical comparisons

The efficiency comparisons of the proposed memory-type estimators over the conventional estimators in terms of MSE are

  1. The proposed memory-type ratio estimator will perform better than the conventional ratio estimator if
  2. The proposed memory-type product estimator will perform better than the conventional product estimator if

As λ value ranges from 0 to 1. Therefore, under the condition, λ<1, the proposed memory-type estimator would always be more efficient than the conventional ratio estimator in presence of ME. But if we take λ = 1, the performance of both memory-type and usual estimators will be equal.

5. Simulation study

An extensive simulation study is conducted to evaluate the performance of the proposed memory-type estimators in the presence of ME. The MSE and relative efficiency (RE) of the proposed and existing estimators are computed by using the following formulas with 50,000 replications as and where

The results of memory-type ratio and product estimators are computed at different levels of correlation ρYX(= 0.80,0.85,0.90 and 0.95) and weight parameter λ(= 0.10,0.20,0.50,0.75) by using the following algorithm:

  1. A population of size N = 5000 is generated using multivariate normal distribution having
  2. Different values have opted for weight constant λ.
  3. Sample of size n (= 50, 100, 200, 300, and 500) is considered to estimate the unknown parameter.
  4. Fifty thousand iterations are used to compute the proposed estimator under ME from the samples obtained in Step 3.
  5. The MSE and RE of each estimator are calculated for each sample size and presented in Tables 14.
thumbnail
Table 1. The MSEs of ratio-type estimators with and without ME for .

https://doi.org/10.1371/journal.pone.0277697.t001

thumbnail
Table 2. The MSEs of ratio-type estimators with and without ME for .

https://doi.org/10.1371/journal.pone.0277697.t002

thumbnail
Table 3. The RE of ratio-type estimators with and without ME for .

https://doi.org/10.1371/journal.pone.0277697.t003

thumbnail
Table 4. The RE of ratio-type estimators with and without ME for .

https://doi.org/10.1371/journal.pone.0277697.t004

From the numerical findings summarized in Tables 1 and 2, it is observed that the MSEs of the memory-type ratio estimator are less than the MSEs of the conventional ratio estimator in the presence of ME. Equivalently, the REs presented in Tables 3 and 4 shown that the proposed ratio estimator is more efficient than the conventional ratio estimator. The effect of ρYX is observed as it increases up to 1, the MSE decreases and the efficiency of the proposed memory-type ratio estimator increases which revealed that the use of the auxiliary variable, correlated with the survey variable may enhance the efficiency of the proposed estimator. The role of sample size is significant in the efficiency of the proposed estimator. The MSEs of the proposed estimator decrease as the sample size increases. Similarly, the weight λ assigned to the current and previous values has the great impact on the efficiency of the proposed memory-type ratio estimator as shown in Tables 3 and 4. It is further noticed that, when the value of λ goes down, the larger weight is assigned to previous values and hence increases the efficiency of the proposed estimators. For λ = 1, the efficiency of proposed estimators based on EWMA statistic will be as good as conventional estimators.

To check the efficiency of the memory-type product estimator, we generated a bivariate population of size 1000 having negative correlation between the concerned variable and the auxiliary variable. The detail of the population parameter is

The results of MSE and RE of proposed and conventional ratio and product estimators are summarized in Tables 5 and 6 displayed that the product type estimators perform better than the ratio estimators in the presence of ME for the population having a negatively correlated data with the study variable. Moreover, the proposed memory-type product estimator found to be more efficient than the conventional product estimator on various sample sizes and λ. The MSEs of the product eestimator decreases as the sample size increases from 50 to 500.

thumbnail
Table 5. The MSE of proposed and conventional ratio and product estimators.

https://doi.org/10.1371/journal.pone.0277697.t005

thumbnail
Table 6. The RE of proposed and conventional ratio and product estimators.

https://doi.org/10.1371/journal.pone.0277697.t006

6. Real-data application

A real data is taken from [30] to confirm the application of memory-type ratio estimator proposed in this paper. The data set contains the true consumption expenditure as study variable Y and true income as an auxiliary variable X whereas the variables contaminated with ME are taken y as measured consumption expenditure and x as measured income with the respective population parameters are N = 10, μy = 127, μx = 170, and ρyx = 0.964.

By considering regular time intervals, twenty samples, each of size 4, with λ = 0.2 are chosen by using SRSWOR. Here, we only considered the conventional and proposed memory-type ratio estimators as the correlation coefficient between the study and the auxiliary variables is positive. The results of all the estimators (except the product-type estimators) in the presence and absence of ME are computed and reported in Table 7. It is clear from the numerical findings given in Table 8, that the proposed memory-type ratio estimator under ME is more efficient than the conventional ratio estimator in terms of MSE and RE.

thumbnail
Table 8. The MSE and RE of EWMA statistic and ratio estimators for real population.

https://doi.org/10.1371/journal.pone.0277697.t008

7. Conclusion

In this paper, we have suggested memory-type ratio and product estimators based on EWMA statistics for estimating the population variance in the presence and absence of ME. The proposed estimators use the current as well as previous sample information. The reported results show that the proposed estimators estimated the population variance more efficiently than the conventional estimators. The ratio (memory-type and conventional) estimators perform well for the populations having a positive correlation between the study and the auxiliary variables, whereas the product (memory-type and conventional) estimators perform better for negatively correlated populations. Further, the REs of the proposed estimators increase, and the MSEs decrease as the sample size increases. The results of real data application confirm the findings obtained from the simulation. Consequently, based on numerical results, it is suggested that the proposed memory-type ratio and product estimators based on EWMA statistics are superior and more useful for estimating population variance for the time-scaled surveys.

Current manuscript uses only one auxiliary variable to estimate the population variance. This approach can be extended to estimate various population characteristics using two or more auxiliary variables in the presence of ME and non-response.

References

  1. 1. Kadilar C, Cingi H. Ratio estimators for the population variance in simple and stratified random sampling. Applied Mathematics and Computation. 2006;173(2):1047–1059. https://doi.org/10.1016/j.amc.2005.04.032.
  2. 2. Shabbir J, Gupta S A T. On Improvement in Variance Estimation. Communications in Statistics - Theory and Methods. 2007;36(12):2177–2185. https://doi.org/10.1080/03610920701215092.
  3. 3. Yadav S K, Kadilar C. Improved exponential type ratio estimator of the population variance. Revista Colombiana de Estadistica. 2013;36(1):145–152.
  4. 4. Adichwal N K, Sharma P, Singh R. Generalized Class of Estimators for Population Variance Using Information on Two Auxiliary Variables. International Journal of Applied and Computational Mathematics. 2017; 3(2):651–661. https://doi.org/10.1007/s40819-015-0119-6.
  5. 5. Singh H P, Pal S K, Yadav A. A study on the chain ratio-ratio-type exponential estimator for finite population variance. Communications in Statistics - Theory and Methods. 2017;47(6):1442–1458. https://doi.org/10.1080/03610926.2017.1321124.
  6. 6. Ismail M, Kanwal N, Shahbaz M Q. Generalized ratio-product-type estimator for variance using auxiliary information in simple random sampling. Kuwait Journal of Science. 2018;45(1):79–88.
  7. 7. Bhat M A. An Improvement in Variance Estimator for the Estimation of Population Variance, Using Known Values of Auxiliary Information. International Journal of Pure & Applied Bioscience. 2018; 6(5):135–138. https://doi.org/10.18782/2320-7051.6842.
  8. 8. Javed K, Jamal N, Hanif M, Ali M, Shahzad U, Victoria Garcia Luengo A. Improved Estimator of Finite Population Variance Using Coefficient of Quartile Deviation. Asian Journal of Advanced Research and Reports. 2018;1(3):1–6. https://doi.org/10.9734/ajarr/2018/v1i313065.
  9. 9. Yadav S K, Sharma D K, Mishra S S. Searching efficient estimator of population variance using tri-mean and third quartile of auxiliary variable. International Journal of Business and Data Analytics. 2019;1(1): 30. https://doi.org/10.1504/ijbda.2019.098830.
  10. 10. Qureshi M N, Riaz N Hanif M. Generalized estimator for the estimation of rare and clustered population variance in adaptive cluster sampling. Journal of Statistical Computation and Simulation. 2019;89(11):2084–2101.
  11. 11. Singh G N, Bhattacharyya D, Bandyopadhyay A. Calibration estimation of population variance under stratified successive sampling in presence of random non response. Communications in Statistics - Theory and Methods. 2020;0(0):1–23. https://doi.org/10.1080/03610926.2020.1719158.
  12. 12. Hansen M H, William N, Hurwitz E, Marks W P M. Response Errors in Survey. JASA. 1951; 46: 147 – 190.
  13. 13. Singh H P, Karpe N. A class of estimators using auxiliary information for estimating finite population variance in presence of measurement errors. Communications in Statistics - Theory and Methods. 2009a;38(5):734–741. https://doi.org/10.1080/03610920802290713.
  14. 14. Misra S, Yadav D K, ika D, Kr Shukla A. On Estimation of Population Coefficient of Variation in Presence of Measurement Errors. International Journal of Mathematics Trends and Technology. 2017;51(4):307–311. https://doi.org/10.14445/22315373/ijmtt-v51p540.
  15. 15. Khalil S, Noor-ul-Amin M, Hanif M. Estimation of population mean for a sensitive variable in the presence of measurement error. Journal of Statistics and Management Systems. 2018;21(1):81–91. https://doi.org/10.1080/09720510.2017.1367478.
  16. 16. Zahid E, Shabbir J. Estimation of finite population mean for a sensitive variable using dual auxiliary information in the presence of measurement errors. PLoS ONE. 2019;14(2):1–17. pmid:30742674
  17. 17. Khalil S, Gupta S, Hanif M. Estimation of finite population mean in stratified sampling using scrambled responses in the presence of measurement errors. Communications in Statistics Theory and Methods. 2019;48(6):1553–1561.
  18. 18. Singh R, Bouza C, Mishra M. Estimation in stratified random sampling in the presence of errors. Revista Investigacion Operacional. 2020; 41(1):123–135.
  19. 19. Singh H P, Karpe N. Estimation of Population Variance Using Auxiliary Information in the Presence of Measurement Errors. Statistics in Transition. 2008;9(3):443–470.
  20. 20. Diana G, Giordan M. Finite population variance estimation in presence of measurement errors. Communications in Statistics - Theory and Methods. 2012;41(23):4302–4314. https://doi.org/10.1080/03610926.2011.573165.
  21. 21. Sharma P, Singh R. A generalized class of estimators for finite population variance in presence of measurement errors. Journal of Modern Applied Statistical Methods. 2013;12(2):231–241. https://doi.org/10.22237/jmasm/1383279120.
  22. 22. Singh H P, Pal S K. Improved estimation of finite population variance using auxiliary information in presence of measurement errors. Investigacion Operacional. 2016;37(2):147–162.
  23. 23. Misra S, Kumari D, Yadav D K. Some Improved Estimators for Estimating Population Variance in the Presence of Measurement Errors. Journal of Statistics Applications & Probability. 2016;5(2):311–320. https://doi.org/10.18576/jsap/050212.
  24. 24. Masood S, Shabbir J. Generalized multi-phase regression-type estimators under the effect of measuemnent error to estimate the population variance. Hacettepe Journal of Mathematics and Statistics. 2016;45(4):1297–1306. https://doi.org/10.15672/HJMS.201612116352.
  25. 25. Roberts S W. Control Chart Tests Based on Geometric Moving Averages. Technometrics 1. 1959;3(2):39–50.
  26. 26. Noor-ul-amin M. Memory type estimators of population mean using exponentially weighted moving averages for time scaled surveys. Communications in Statistics - Theory and Methods. 2019;47(24):1–12. https://doi.org/10.1080/03610926.2019.1670850.
  27. 27. Aslam I, Noor-ul-Amin M, Yasmeen U, Hanif M. Memory type ratio and product estimators in stratified sampling. Journal of Reliability and Statistical Studies. 2020; 13(1):1–15.
  28. 28. Singh H P, Karpe N. A General Procedure for Estimating the General Parameter Using Auxiliary Information in Presence of Measurement Errors. Communications for Statistical Applications and Methods. 2009b;16(5):821–840. https://doi.org/10.5351/ckss.2009.16.5.821.
  29. 29. Isaki C T. Variance estimation using auxiliary information. Journal of the American Statistical Association. 1983; 78(381):117–123. https://doi.org/10.1080/01621459.1983.10477939.
  30. 30. Gujarati D N. Basic Econometrics (Fourth Edi). The McGraw−Hill Companies. 2004.