Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

A simulation study: Improved ratio-in-regression type variance estimator based on dual use of auxiliary variable under simple random sampling

  • Sohaib Ahmad,

    Roles Conceptualization, Data curation, Formal analysis, Writing – original draft

    Affiliation Department of Statistics, Abdul Wali Khan University, Mardan, Pakistan

  • Sardar Hussain,

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Writing – original draft

    Affiliation Department of Statistics, Quaid-i-Azam University Islamabad, Pakistan

  • Kalim Ullah,

    Roles Conceptualization, Data curation, Writing – original draft

    Affiliation Foundation University Medical College, Foundation University School of Health Sciences, DHA-I, Islamabad, Pakistan

  • Erum Zahid,

    Roles Conceptualization, Formal analysis, Writing – original draft

    Affiliation Department of Applied Mathematics and Statistics, Institute of Space Techonalogy, Islamabad, Pakistan

  • Muhammad Aamir,

    Roles Conceptualization, Software, Supervision

    Affiliation Department of Statistics, Abdul Wali Khan University, Mardan, Pakistan

  • Javid Shabbir,

    Roles Conceptualization, Supervision, Validation

    Affiliations Department of Statistics, Quaid-i-Azam University Islamabad, Pakistan, Department of Statistics, University of Wah at Wah Cantt, Wah Cantt, Pakistan

  • Zubair Ahmad ,

    Roles Conceptualization, Formal analysis, Writing – original draft, Writing – review & editing

    z.ferry21@gmail.com

    Affiliation Department of Statistics, Quaid-i-Azam University Islamabad, Pakistan

  • Huda M. Alshanbari,

    Roles Data curation, Formal analysis, Software, Writing – review & editing

    Affiliation Department of Mathematical Sciences, College of Science, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia

  • Wejdan Alajlan

    Roles Formal analysis, Software, Writing – original draft

    Affiliation Department of Mathematical Sciences, College of Science, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia

Abstract

In this article, we proposed an improved finite population variance estimator based on simple random sampling using dual auxiliary information. Mathematical expressions of the proposed and existing estimators are obtained up to the first order of approximation. Two real data sets are used to examine the performances of a new improved proposed estimator. A simulation study is also recognized to assess the robustness and generalizability of the proposed estimator. From the result of real data sets and simulation study, it is examining that the proposed estimator give minimum mean square error and percentage relative efficiency are higher than all existing counterparts, which shown the importance of new improved estimator. The theoretical and numerical result illustrated that the proposed variance estimator based on simple random sampling using dual auxiliary information has the best among all existing estimators.

1. Introduction

In survey sampling, it is well known fact that efficiency of population parameters estimates can be improved by the use of auxiliary information, providing it is highly associated with the study variable. Auxiliary information plays a vital role in parameter selection and estimation to achieve efficient estimates of the unknown population parameters. Many authors have suggested estimators based on original auxiliary information. In fact, appropriate use of auxiliary variable in survey sampling results in significant decrease in the variance of the estimator of unknown population parameter(s). Variance is one of the features that can be used to describe a statistical population. Brilliant efforts have been taken to estimate this characteristic as accurately as possible. The goal is to reduce the scattering of the calculated estimate when it is obtained from many available samples. Furthermore, the importance of using information from one or more auxiliary variables is to improve estimator efficiency. Many attempts have previously been made in this direction, including use of the ratio and transformed ratio-product-type. By utilizing the association among the study variable and auxiliary variable, the survey statisticians have shaped effective ratio and product estimators for population variance. The estimation of population variance of the study variable is required in a variety of scenarios in application. In a variety of fields, including agriculture, medicine, biology, and business, where we encounter populations that are likely to be skewed, variance estimate for the population is important.

Variations can be found in our daily lives throughout the world. It is a common belief that no two things or people are exactly the same. For example, a farmer must have a good alertness of how climate geographies change over time to time in order to determine how to plant his lot. For proper treatment, a doctor must have a detailed realization of differences in human hypertension and fever. Many authors suggested different types of estimators for variance including Das [1], Prasad and Singh [2], Swain and Mishra [3], Ahmad and Shabbir [4], Kadilar and Cingi [5], Subramani and Kumarapandiyan [6], Singh and Solanki [7], Yadav et al. [8], Singh et al. [9], Yadav and Kadilar [10], Adichwal et al. [11,12], Yasmeen et al. [13], Zaman [14,15], Zaman and Bulut [16], Zaman et al. [17,18], Bhushan et al. [19], Shahzad et al. [20], Ahmad et al. [21], Ozturk [22], Ozturk and Demirel [23], Frey and Feeman [24], Zamanzade and Vock [25], Mahdizadeh and Zamanzade [26,27], Bhushan et al. [28], Bhushan et al. [29], Zaman and Bulut [30,31], Yadav and Zaman [32], Zaman and Kadilar [33], Zaman and Dunder [34] and Singh and Khalid [35]. We requisite to estimate population parameters for numerous variables in multidisciplinary surveys, such as house lengths, income level, and the number of unemployed people, in socio-economic surveys. The goal of this study is to offer a strategy for estimating population variance using a dual auxiliary variable in simple random sampling. We found no contribution on variance estimation using dual auxiliary variables under simple random sampling in the literature.

2. Notations and symbols

Consider ψ = (ψ1, ψ2,…,ψN) be a population comprise of size N elements, let a sample of size n is chosen from ψ by using simple random sampling without replacement (SRSWOR). Let Y, X and Rx represent the study variable, auxiliary variable and rank of the auxiliary variable respectively. Consider population variance of the study variable Y, auxiliary variable X and rank of the auxiliary variable Rx is denoted by and . Let yi, xi and rxi, be the sample for the ith units correspondingly. Let , be the sample variances. , of Y, X and Rx be the corresponding variances. To derive the properties of the suggested estimator, we have the following error terms.

Where r, s and t be the positive number and υ200, υ020, υ002 are the second order moments about means of Y, X and rank of X, and ƛrst is the moment ratio.

3. Existing estimators

In this section, we discussed several well-known estimators of population variance.

  1. The usual estimator, is given by: (1)
    The variance of , is given by: (2)
  2. Isaki [36] suggested ratio estimator, is given by: (3)
    The properties of , is given as: and (4)
  3. The difference estimator, is given by: (5)
    Where
    The minimum variance at the optimal value of Ψ11, is given by: (6)
    Where
  4. Rao [37] difference type estimator, is given by: (7) where
    The properties at the optimal value of Ϣ1 and Ϣ2, are given by: and (8)
  5. Singh et al. [38] suggested the following estimators: (9)
    The properties of , are given by: (10) (11)
  6. Grover and Kaur [39], suggested the following estimator, is given by: (12)
    Where Ϣ3 and Ϣ4 are the unknown constant
    The minimum MSE at the optimum values of Ϣ3, Ϣ4, are given by: (13)
  7. Ahmad et al. [40], suggested the following estimator, given by: (14)
    After expanding the above equation, we have
    The bias and MSE of , are given by:
    The optimal values of Ϣ5, Ϣ6, and Ϣ7, are given by: and
    The minimum mean square error at the optimal values of and , are given by: (15) where

4. Proposed improved ratio-in-regression type variance estimator

The need for auxiliary variable can boost estimator efficiency during the design and estimation stage. When the association among the study variable and auxiliary variables exist, then rank of the auxiliary variable is also associated with the study variable. Therefore, the rank of the auxiliary variable (that includes rank of the auxiliary variable) can be considered as a novel auxiliary variable, and this evidence may help us in increasing the precision of an estimator. Based on these ideas, we are encouraged to investigate dual use of the auxiliary variable for population variance because it has been examined that there has been no work before. The suggested estimator includes the auxiliary information in the form of an auxiliary variable and in the form of a rank auxiliary variable. There are several estimators for estimating population variance based on original auxiliary variables. The key advantage of our proposed estimator in simple random sampling is that it is more flexible and best than existing estimators. By taking motivation from Ahmad et al. [40] we proposed the following estimator: (16)

Simplifying (16), we have (17)

Simplify (17), we have (18)

The optimum values of Ϣ8, Ϣ9 and Ϣ10 are given by:

Where,

The minimum MSE at the optimum values of Ϣ8, Ϣ9 and Ϣ10 in (18), we have: (19)

5. Efficiency comparison

In this section, the theoretical comparison of the suggested estimator with existing estimators is considered:

  1. From (2) and (19), where
  2. From (4) and (19),
  3. From (6) and (19),
  4. From (8) and (19),
  5. From (11) and (19),
  6. From (13) and (19),
  7. From (15) and (19),

6. Data description

In this part, we carry out a mathematical study to see the performances of proposed estimator in comparison of existing estimator. We considered two real data sets under SRS. The data descriptions of these populations are given in Table 1. We compare the production of our proposed estimator with existing counterparts in terms of PRE. To obtain the PRE, we used the following expression.

Where (u) = (0, 1, 2, 3, 4, 5, 6).

Population 1: [Source: Singh [41]]

Y: Estimated number of fish in year 1995.

X: Estimated number of fish in year 1994,

Rx = Rank of X variable

Population 2: [Source: Singh [41]]

Y: Estimated number of fish in year 1995.

X: Estimated number of fish in year 1993,

Rx = Rank of X variable

7. Simulation study

We have generated two populations of size 5,000 from normal distribution with different parameters by using R language. The populations have different correlations, i.e. population I is positively correlated and population-II has strong positive correlation between X and Y variables. The population details are given below:

Population-1

X = N(5,10), Y = X+N(2,8), N = 5,000, = 6.989187, = 5.146998, = 2500.5, = 156.7091, = 95.68973, = 2083750, ρyx = 0.7696238, ρyrx = 0.7529799, ρxrx = 0.975884.

Population-2

X = N(5,10), Y = X+N(1,3), N = 5,000, = 6.114148, = 5.03637, = 2500.5, = 104.8296, = 95.96678, = 2083750, ρyx = 0.9551769, ρyrx = 0.9335775, ρxrx = 0.9764096.

The percentage relative efficiency (PRE) is calculated as follows:

, where j = 0,1,…,6.

The results are given below:

8. Discussion

To show the performances of our suggested improved variance estimator, we used two real populations. Data description of these mentioned populations are given in Table 1. The bias result using real data sets are given in Table 2. The MSE and PRE are presented in Table 3. From the numerical results, the proposed estimator is found to be more effective than its existing estimators. The gain in efficiency in population 2 is more as compared to population 1. As we increase the sample size, the MSE decreases and PRE increases, yielding the expected results. We also evaluated the efficiency of our proposed improved variance estimator in simple random sampling using simulation study. The bias result using simulated data sets are given in Table 4. From Table 5, it is concluded that our suggested improved variance estimator performs better than the standard estimator and other existing estimators for a number of correlation coefficient values. The outcome of the simulation study clearly determines that for the simulated data sets 1–2, the PRE of the proposed estimator is greater than the PRE of the existing estimate. Overall the gain in efficiency of our proposed estimator is the best as compared to all existing counterparts.

thumbnail
Table 2. Bias result using Populations 1–2 using two real data sets.

https://doi.org/10.1371/journal.pone.0276540.t002

thumbnail
Table 3. MSE and Percentage relative efficiency using Populations 1–2.

https://doi.org/10.1371/journal.pone.0276540.t003

thumbnail
Table 4. Bias result using Populations 1–2 using simulated data sets.

https://doi.org/10.1371/journal.pone.0276540.t004

thumbnail
Table 5. MSE and PRE of , w.r.t the existing counterparts using two simulated data.

https://doi.org/10.1371/journal.pone.0276540.t005

9. Conclusion

In this article, we proposed an improved ratio-in-regression type exponential estimator for the finite population variance using dual auxiliary variables under simple random sampling. Expressions for mean square error of the proposed estimator are derived up to first order of approximation. The proposed estimator is compared with several existing counterparts to judge their uniqueness and superiority using two real data sets. Moreover, simulation study is also conducted to check the robustness and generalizability of the proposed variance estimator. A numerical study is carried out to support the theoretical results. Based on the numerical findings, it is observed from the result of real data sets and simulation study, showing when compared to its existing counterparts; the suggested estimator produces well results. Therefore we recommend the use of our proposed estimators for efficiently estimating the finite population varaince under simple random sampling using dual auxiliary varaible. The current work can be extended to develop an improved class of estimators under measurement error, probability proportional to size (PPS), two-stage sampling using dual auxiliary variable for estimating the population variance.

Acknowledgments

Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2022R299), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia

References

  1. 1. Das AK. Use of auxiliary information in estimating the finite population variance. Sankhya, c. 1978; 40:139–48.
  2. 2. Prasad B, Singh HP. Some improved ratio-type estimators of finite population variance in sample surveys. Communications in Statistics-Theory and Methods. 1990 Jan 1; 19(3):1127–39.
  3. 3. Swain AK, Mishra G. Estimation of finite population variance under unequal probability sampling. Sankhyā: The Indian Journal of Statistics, Series B. 1994 Dec 1:374–88.
  4. 4. Ahmad S, Shabbir J. Use of extreme values to estimate finite population mean under pps sampling scheme. Journal of Reliability and Statistical Studies. 2018 Oct 5:99–112.
  5. 5. Kadilar C, Cingi H. Improvement in variance estimation in simple random sampling. Communications in Statistics—Theory and Methods. 2007 Aug 9; 36(11):2075–81.
  6. 6. Subramani J, Kumarapandiyan G. Variance estimation using quartiles and their functions of an auxiliary variable. Ratio. 2012; 1(1).
  7. 7. Singh HP, Solanki RS. Improved estimation of finite population variance using auxiliary information. Communications in Statistics-Theory and Methods. 2013 Aug 3; 42(15):2718–30.
  8. 8. Yadav R., Upadhyaya L. N., Singh H. P., & Chatterjee S. (2013). A generalized family of transformed ratio-product estimators for variance in sample surveys. Communications in Statistics-Theory and Methods, 42(10), 1839–1850.
  9. 9. Singh GN, Priyanka P, Prasad S, Singh S, Kim JM. A class of estimators for population variance in two occasion rotation patterns. Communications for Statistical Applications and Methods. 2013; 20(4):247–57.
  10. 10. Yadav SK, Kadilar C. A two parameter variance estimator using auxiliary information. Applied Mathematics and Computation. 2014 Jan 1; 226:117–22.
  11. 11. Adichwal NK, Sharma P, Verma HK, Singh R. Generalized class of estimators for population variance using auxiliary attribute. International journal of applied and computational mathematics. 2016 Dec; 2(4):499–508.
  12. 12. Adichwal NK, Sharma P, Singh R. Generalized class of estimators for population variance using information on two auxiliary variables. International Journal of Applied and Computational Mathematics. 2017 Jun; 3(2):651–61.
  13. 13. Yasmeen U, Noor-ul-Amin M, Hanif M. Exponential estimators of finite population variance using transformed auxiliary variables. Proceedings of the National Academy of Sciences, India Section A: Physical Sciences. 2019 Mar; 89(1):185–91.
  14. 14. Zaman T. An efficient exponential estimator of the mean under stratified random sampling. Mathematical Population Studies. 2021 Apr 3; 28(2):104–21.
  15. 15. Zaman T. Generalized exponential estimators for the finite population mean. Statistics in Transition new series. 2020; 21(1):159–68.
  16. 16. Zaman T, Bulut H. An efficient family of robust-type estimators for the population variance in simple and stratified random sampling. Communications in Statistics-Theory and Methods. 2021 Jul 15:1–5.
  17. 17. Zaman T, Sagir M, Şahin M. A new exponential estimators for analysis of COVID‐19 risk. Concurrency and Computation: Practice and Experience. 2022 May 1; 34(10):e6806.
  18. 18. Zaman T, Dünder E, Audu A, Alilah DA, Shahzad U, Hanif M. Robust regression-ratio-type estimators of the mean utilizing two auxiliary variables: A simulation study. Mathematical Problems in Engineering. 2021 Sep 6; 2021.
  19. 19. Bhushan S, Kumar A, Kumar S. Efficient classes of estimators for population variance using attribute. International Journal of Mathematics in Operational Research. 2022; 22(1):74–92.
  20. 20. Shahzad U, Ahmad I, Almanjahie IM, Koyuncu N, Hanif M. Variance estimation based on L-moments and auxiliary information. Mathematical Population Studies. 2022 Jan 2; 29(1):31–46.
  21. 21. Ahmad S, Aamir M, Hussain S, Shabbir J, Zahid E, Subkrajang K, Jirawattanapanit A. A New Generalized Class of Exponential Factor-Type Estimators for Population Distribution Function Using Two Auxiliary Variables. Mathematical Problems in Engineering. 2022 Aug 8; 2022.
  22. 22. Ozturk O. (2014). Statistical inference for population quantiles and variance in judgment post-stratified samples. Computational Statistics & Data Analysis, 77, 188–205.
  23. 23. Ozturk O, Demirel N. Estimation of population variance from multi-ranker ranked set sampling designs. Communications in Statistics-Simulation and Computation. 2016 Nov 25; 45(10):3568–83.
  24. 24. Frey J., & Feeman T. G. (2013). Variance estimation using judgment post-stratification. Annals of the Institute of Statistical Mathematics, 65(3), 551–569.
  25. 25. Zamanzade E, Vock M. Variance estimation in ranked set sampling using a concomitant variable. Statistics & Probability Letters. 2015 Oct 1; 105:1–5.
  26. 26. Mahdizadeh M, Zamanzade E. Estimating asymptotic variance of M-estimators in ranked set sampling. Computational Statistics. 2020 Dec; 35(4):1785–803.
  27. 27. Mahdizadeh M, Zamanzade E. New estimator for the variances of strata in ranked set sampling. Soft Computing. 2021 Jul; 25(13):8007–13.
  28. 28. Bhushan S, Kumar A, Kumar S, Singh S. SOME MODIFIED CLASSES OF ESTIMATORS FOR POPULATION VARIANCE USING AUXILIARY ATTRIBUTE. Pakistan Journal of Statistics. 2022 Apr 1; 38(2).
  29. 29. Bhushan S, Kumar A, Singh S, Kumar S. An improved class of estimators of population mean under simple random sampling. Philippine Statist. 2021; 70(1):33–47.
  30. 30. Zaman T, Bulut H. A new class of robust ratio estimators for finite population variance. Scientia Iranica. 2022 Jan 4.
  31. 31. Zaman T, Bulut H. A simulation study: Robust ratio double sampling estimator of finite population mean in the presence of outliers. Scientia Iranica. 2021 Oct 5.
  32. 32. Yadav SK, Zaman T. Use of some conventional and non-conventional parameters for improving the efficiency of ratio-type estimators. Journal of Statistics and Management Systems. 2021 Jul 4; 24(5):1077–100.
  33. 33. Zaman T, Kadilar C. Novel family of exponential estimators using information of auxiliary attribute. Journal of Statistics and Management Systems. 2019 Nov 17; 22(8):1499–509.
  34. 34. Zaman T, Dünder E. Proposing Novel Modified Ratio Estimators by Adding an Exponential Parameter. Lobachevskii Journal of Mathematics. 2020 Mar;41(3):451–8.
  35. 35. Singh GN, Khalid M. A composite class of estimators to deal with the issue of variance estimation under the situations of random non-response in two-occasion successive sampling. Communications in Statistics-Simulation and Computation. 2022 Apr 3; 51(4):1454–73.
  36. 36. Isaki CT. Variance estimation using auxiliary information. Journal of the American Statistical Association. 1983 Mar 1;78(381):117–23.
  37. 37. Rao TJ. On certail methods of improving ration and regression estimators. Communications in Statistics-Theory and Methods. 1991 Jan 1;20(10):3325–40.
  38. 38. Singh R, Chauhan P, Sawan N, Smarandache F. Improved exponential estimator for population variance using two auxiliary variables. Infinite Study; 2009 Feb 1.
  39. 39. Grover LK, Kaur P. Ratio type exponential estimators of population mean under linear transformation of auxiliary variable: theory and methods. South African Statistical Journal. 2011 Jan 1; 45(2):205–30.
  40. 40. Ahmad S, Hussain S, Aamir M, Yasmeen U, Shabbir J, Ahmad Z. Dual use of auxiliary information for estimating the finite population mean under the stratified random sampling scheme. Journal of Mathematics. 2021 Nov 16; 2021.
  41. 41. Singh S. Advanced Sampling Theory with Applications: How Michael"" Selected"" Amy. Springer Science & Business Media; 2003.