Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Efficient class of ratio cum median estimators for estimating the population median

Abstract

In estimation theory, the use of auxiliary information significantly improves precision while estimating population parameters. In this paper, an efficient class of ratio cum median estimators of the population median is suggested using simple random sampling without replacement. The expressions for bias and mean square error of the proposed class are derived theoretically. The condition for the asymptotic optimum estimator is obtained with its bias and mean square error expressions. Under certain realistic conditions, the asymptotic optimum estimator is more proficient, based on analytical and numerical comparisons with some existing estimators that are members of the suggested class of estimators. The superiority of the proposed ratio cum median estimators is shown through real data applications. Such a new proposed estimator will be useful in the future for data analysis and making decisions.

1. Introduction

The use of auxiliary information, either at the selection or estimation stage or at both stages, significantly improves precision while estimating population parameters. Sometimes, in survey sampling, the collected data may follow the normal distribution. In that case, we use the mean and ordinary least squares methods to estimate the population parameters, as these methods give accurate and precise results in that situation. However, in most cases, data collected may not follow the normal distribution (for example, salary, consumption, etc.), but may follow some highly skewed distributions. In these situations, using the value of the mean will not provide accurate and precise results, as the mean is too sensitive to outliers. So, one can use another measure of central tendencies, such as the median since it is not sensitive to extreme values or outliers. Researchers usually find it difficult to propose such a new technique in order to obtain a valid inference in such situations. Therefore, in this present study, we tend to develop a family of estimators of the population median, which could be adopted whenever the population distribution is skewed. Hence, we suggest a family of ratio cum median estimators of population median by using a combination of scalars with the known population median of a supplementary variable. Also, the asymptotic optimum estimator (AOE) conditions are obtained under which the suggested estimator is highly efficient and consistent, and compares favourably amongst other estimators of the median.

Consider R to be the supplementary variable and S to be the response variable of the finite population under study. Let ri and si be the sample variables on each ith unit drawn from Ri and Si variables by simple random sampling without replacement (SRSWOR). Let tr(r) and ts(s) are the marginal densities of r and s respectively; tr(Hr) and ts(Hs) are the probability density functions of the variables. Moreover, let and be the sample medians; Hr and Hs, the population medians of the variables with the correlation coefficient

(1.1)

Thus, (1.2) where (1.3)

Different estimators with their properties are recommended by different authors in different situations, taking into consideration the above-mentioned population. [1] recommended the usual unbiased median estimator and is given with the expression of variance as (1.4) [2] suggested the classical ratio median estimator given as with bias, and its mean square error, (1.5)

[3, 4] suggested classical product-type median estimator as with bias, and its mean square error, (1.6)

[5] recommended the exponential ratio type median estimators

. Its bias is expressed as: , and its mean square error is (1.7)

Also, [5] recommended the exponential product type median estimator with its bias and mean square error expression as

with bias and its mean square error (1.8)

[6] Suggested the classical chain ratio type median estimator and is given with its bias and mean square error as with bias and its Mean square error (1.9)

For more works on estimators of population median, we refer the readers to [79]. [10] Considered the randomized response method in estimating the population ratio based on ranked set sampling. [11, 12] investigated the population median estimation in two-stage ranked set sampling, and multistage ranked set sampling [13]. Suggested an estimator for the population mean using stratified median ranked set sampling [14]. Considered the case of median estimation with the imputation of the missing observations [15]. Suggested ratio estimators of population mean using auxiliary information in simple random sampling and median ranked set sampling [16]. Considered the missing values using ranked set sampling for ratio estimation of the population mean.

Other parts of the paper are organized as follows. In Section 2, the proposed class of ratio estimators for the population median is provided. The bias and mean square error of the proposed class of estimators are given in Section 3. The Asymptotic optimal estimator for the family of these estimators is considered and presented in Section 4. In Section 5, some existing members of the proposed class of estimators are summarized. Efficiency comparisons are presented in Section 6 while empirical investigations are given in Section 7 for real data illustration. The paper is concluded in Section 8 with some suggestions for future works.

2. The proposed class of ratio estimators for population median

In this section, we have proposed a ratio estimator for population median as (2.1)

Then, we set the above-proposed estimator’s boundary conditions: if , and , making the estimator q(η,ψ) unbiased. This justifies the use of the number 2 in (2.1)

Theorem 1: If in q(η,ψ) different scalars are being set, such that 2→θ and is used to replace in (2.1), the new class of estimators (2.2) is the adjustment of (2.1) for suitably chosen scalars θ,u,η,ψ such that θ and u satisfy the condition (2.3)

3. The bias and mean square error of the proposed class of estimators

Let, (3.1)

Therefore, expressing (2.2) in terms of (3.1) results in an approximate expression for the bias and the MSE for the proposed class of estimators. Thus, we have

Assuming that |er|<1 and expanding (1+er)η, and (1+er/2)−1, we obtain

By neglecting the terms of ei(i = r or s) having power greater than two, we have (3.2)

Therefore, (3.3)

Hence, the bias for the proposed class of ratio estimators q(θ,u,η,ψ) to the first-degree approximation is obtained. Taking the expectation of (3.3) and using (1.2), the bias is given by (3.4)

Now, by neglecting the terms having powers greater than two in ei(i = r or s), squaring both sides, taking expectation in (3.3) and substituting (1.2), the MSE to the first-degree approximation is obtained for the recommended class of ratio cum median estimator as (3.5)

4. The optimal condition for the proposed class of estimators

To investigate the optimal conditions for the proposed class of estimators, let so that, (4.1)

Hence, substituting (4.1) into (3.5), we obtain (4.2)

Finally, substituting (3.3) in (4.2), we obtain the asymptotic MSE for the proposed class of estimators z(θ,u,η,ψ) as (4.3)

5. Some existing members of the proposed class of estimators

In this section, we will show how some existing estimators are members of the suggested class of ratio cum median estimator when certain suitable values are assumed by scalars θ,u,η,ψ, with their exact expressions of biases and mean square errors. Table 1 provides defined values of θ,u,η,ψ that will produce the existing members of the suggested class of estimators mentioned in this study.

thumbnail
Table 1. Some existing members of the proposed class of estimators.

https://doi.org/10.1371/journal.pone.0274690.t001

6. Efficiency comparisons

Here, theoretical comparisons have been made between the suggested ratio-cum-median estimator at its optimal condition with the rest of the estimators, which are members of this recommended estimator, when θ,u,η,ψ, assume certain unique values. To make a comparison, we take the final expression of the mean square error of the proposed estimator at its optimal condition and the MSE expressions of the estimators, which are members of this estimator.

  1. (a) Comparison of MSEopt[q(θ,u,η,ψ)] with sample median estimator

Comparing (1.4) and (4.3), (optimum)q(θ,u,η,ψ) will be more efficient than if

That is

, which is always true.

  1. (b) Comparison of MSEopt[q(θ,u,η,ψ)] with classical ratio median estimator

Comparing (1.5) and (4.3), (optimum)q(θ,u,η,ψ) will be more efficient than if

, which is always true.

  1. (c) Comparison of MSEopt[q(θ,u,η,ψ)] with classical product median estimator

Comparing (1.6) and (4.3), (optimum)q(θ,u,η,ψ) will be more efficient than if. That is , which is always true.

  1. (d) Comparison of MSEopt[q(θ,u,η,ψ)] with classical exponential ratio median estimator,

Comparing (1.7) and (4.3), (optimum)q(θ,u,η,ψ) is more efficient than if , which is always true.

  1. (e) Comparison of MSEopt[q(θ,u,η,ψ)] with classical exponential product median estimator .

Comparing (1.8) and (4.3), (optimum)q(θ,u,η,ψ) will be more efficient than if , which is always true.

  1. (f) Comparison of MSEopt[q(θ,u,η,ψ)] with classical chain ratio median estimator

Comparing (1.9) and (4.3), (optimum)q(θ,u,η,ψ) will be more efficient than . Hence,

, which is always true.

  1. (g) Comparison of MSEopt[q(θ,u,η,ψ)] with that of [8] estimator

For MSEopt[q(θ,u,η,ψ)]to be more efficient than MSEopt[q(θ,u,η,ψ)]≤ Since , and

Hence [8] estimator is more efficient than the proposed estimator.

  1. (h) Comparison of MSEopt[q(θ,u,η,ψ)] with that of [9] estimators

For the proposed estimator to be more efficient than Irfan et al. (2021) estimator

MSEopt[q(θ,u,η,ψ)]≤MSE(Ti(d))

MSE(q(θ,u,η,ψ))≤MSE(Ti(d))

From the works of [9] their proposed estimators are more efficient than [8] estimator. Therefore, [9] estimators are more efficient than proposed estimator.

Remarks: As it is stated that the estimators of [8, 9] are more efficient than the proposed estimator, however the practical applications of these estimators in the field become a bit cumbersome and sometimes almost infeasible because of the presence of the unknown parameters in the estimators. Even if the estimators of these unknown parameters could be determined, they may turn out to be biased. But our proposed classes of estimator do not have parameters that are unknown. The parameters found could be easily estimated from past or pilot surveys.

7. Empirical study

In this study, we have used data sets of four populations to reveal our general results and check the optimality performance of the AOE for recommended estimator q(θ,u,η,ψ) over the fitted members of estimators mentioned in this study. The objective of using these four data sets is that, it is actually the situation where our suggested estimators perform better than existing ones as these data sets are not normal but follow skewed distributions. The results are presented in Tables 25, respectively for each population. The Percent Relative Efficiency (PRE) results are presented in Table 6. The populations are stated as:

thumbnail
Table 2. Bias and MSE values of members of the proposed class of estimators for population 1.

https://doi.org/10.1371/journal.pone.0274690.t002

thumbnail
Table 3. Bias and MSE values of members of the proposed class of estimators for population 2.

https://doi.org/10.1371/journal.pone.0274690.t003

thumbnail
Table 4. Bias and MSE values of members of the proposed class of estimators for population 3.

https://doi.org/10.1371/journal.pone.0274690.t004

thumbnail
Table 5. Bias and MSE values of members of the proposed class of estimators for population 4.

https://doi.org/10.1371/journal.pone.0274690.t005

thumbnail
Table 6. PREs of the members of the proposed class of estimators.

https://doi.org/10.1371/journal.pone.0274690.t006

7.1 Population I (see [17] for more details)

M = 69, m = 17, ρc = 0.3166, Hs = 2068, CHs = 3.45399, CHr = 3.33433.

From the above, we obtain

t = 0264, ⇒(1−t) = 0.736, kH = 0.32796, , , and . Now, setting u = 2 such that (u) satisfies, the condition in (3.3) implies θ = 3. Then, set η = 2195/5000 and ψ = 12/100 so that (2η+ψ) = (2kH/u), kH = 0.32796, so that (4.1) is satisfied. Hence, putting (θ,u,η,ψ) = (3,2,2195/5000,12/100) in (3.2), for population I, the AOE for the suggested class of estimators for the population median Hs is obtained by (5.1) with its bias and MSE to the first degree of approximation, respectively, as (Table 2) (5.2) (5.3)

7.2 Population II (see [18] for more details)

M = 340, m = 150, ρc = 0.92, Hs = 178, CHs = 31.17635, CHr = 3.41315.

From the above, we obtain

t = 0.4412, ⇒(1−t) = 0.5588, kH = 8.40345, , , and . In population II, setting u = 6 such that (u) satisfies, the condition in (3.3) implies θ = 7. Then, set η = 5823/10000 and ψ = 9/11 so that (2η+ψ) = (2kH/u), kH = 8.40345, so that (5.1) is satisfied. Hence, putting (θ,u,η,ψ) = (7,6,5823/10000,9/11) in (3.2), for population II, AOE for the suggested class of estimators for the population median Hs is obtained by (5.4) with its bias and MSE to the first degree of approximation, respectively, as (Table 3) (5.5) (5.6)

7.3 Population III (see [19] for more details)

M = 396, m = 65, ρc = 0.84, Hs = 30, CHs = 2.82869, CHr = 2.73045.

From the above, we obtain t = 0.16415, ⇒(1−t) = 0.83585, kH = 0.776112, , , and . Now, in population III, setting u = −3 such that (u) satisfies, the condition in (3.3) implies θ = −2. Then, set η = 808/3125 and ψ = −29/56 so that (2η+ψ) = (2kH/u), kH = 0.776112, so that (5.1) is satisfied. Hence, let (θ,u,η,ψ) = (−2,−3,808/3125,−29/56) in (3.2), for population III, AOE for the suggested class of estimators for the population median Hs is calculated by (5.7) with its bias and MSE to the first degree of approximation, respectively, as in (Table 4) (5.8) (5.9)

7.4 Population IV (see [20] for more details)

M = 67, m = 23, ρc = 0.6624, Hs = 4.8, CHs = 2.73045, CHr = 2.71592.

For these quantities, we get t = 0.3433, ⇒(1−t) = 0.6567, kH = 0.08932, , , and . Consider population IV by setting u = 4 such that (u) satisfies, the condition in (3.3) implies θ = 5. Then, set η = 1273/2500 and ψ = 42/79 so that (2η+ψ) = (2kH/u), kH = 0.08932, and then (5.1) is satisfied. Hence, for (θ,u,η,ψ) = (5,4,1273/2500,42/79) in (3.2) for population IV, AOE for the suggested class of estimators for the population median Hs is obtained by (5.10) with its bias and MSE to the first degree of approximation, respectively, as (Table 5) (5.11) (5.12)

Next, we consider how much gain in percent relative efficiency over the fitted members of estimators of the recommended estimator. Using Formulae (5.13), the results are given in Table 6.

(5.13)

The AOE of the recommended class of estimators does have the same expression of mean square error as the unbiased linear regression estimator, which stands clearly from Eq (4.1) and is independent of scalars (θ,u,η,ψ).

While investigating whether AOE is more proficient than the fitted members of the suggested class of estimators captured in this present work, it is clearly revealed from Section 8 and Eq (4.1) that the AOE agrees with the work of [21]. The proposed ratio class of median estimator is preferred to the classical ratio median estimator. Under certain optimal conditions, the recommended class of ratio-cum-median estimator is an alternative estimator to the unbiased linear regression estimator, which confirms the results obtained by [2224].

While adopting the suggested class of ratio-cum-median estimator on the data, we have observed that AOE only depends on the value of (kH) and the fixed value of the scalar (u). Although there is no fixed or global optimum estimator that has been proved numerically, the achievement in our recommended class of ratio-cum-median estimators was noticeable. Precisely, using the data of four populations, the AOEs were , , and for the population I, II, III, and IV, respectively. Whenever the optimality conditions are satisfied, the AOE performs much better than the other members of the proposed family of estimators. The optimal results are not always obtained from the existing estimators. However, our AOE will always give optimal results since the optimality conditions are dependent on values of (kH) of the population under consideration that are revealed from Tables 25. Furthermore, from Table 6, we reveal that there is a greater gain in the AOE optimal estimator’s efficiency for population median for all the four populations under consideration in the study. Hence, some of our proposed members of the class of median estimators are preferred to the sample median estimator and some existing estimators, which are members of the same class. In addition, we have shown that, with ancillary information, our proposed ratio- cum-median method will enhance precision in sample survey, and our findings agree with the same findings of previous researchers [2528].

On the other hand, it is also observed that the [8, 9] estimators are more efficient than our proposed class of estimators, even at optimality conditions. This is indicated as the Mean squared errors of these estimators are less than the proposed class of estimators in the four populations. The increase in efficiency of these estimators is due to the presence of many unknown parameter estimates, since it is a well-known fact that MSE is a function of the number of parameters present. Hence, the proposed class of median estimators outcompete [8, 9] estimators, in terms of their simplicity convenience in application.

8. Conclusions

In this paper, we have proposed an efficient class of ratio-cum-median estimators of the population median using simple random sampling without replacement. This study comes with the conclusion that once the optimality conditions given in Eq (2.3) and Eq (4.1) are fulfilled, the AOE of the recommended class of ratio-cum-median estimators performs much better than the existing members of the suggested estimators. The real data analysis also shows the superiority of the proposed class of estimator compared to other class members. Based on our analyses and findings, we strongly recommend using the AOE of the proposed class of estimators in practical situations. For future work, the suggested class of ratio-cum-median estimators can be extended by considering it with a Stratified random sampling, systematic sampling and ranked set sampling and its modifications, see [2932].

References

  1. 1. Gross T. S. (1980): Median Estimation in sample surveys. Proc. Surv. Res. Meth. Secl. Amer. Statist. Assoc., 181–184.
  2. 2. Kuk A. Y. C and Mak T. K. (1989): Median estimation in the presence of auxiliary information. Journal of the Royal Statistical Society, B, 51, 261–269.
  3. 3. Robson D. S. (1957): Application of multivariate Polykays to the theory of unbiased ratio type estimation. Journal of American Statistical Association, 52, 511–522.
  4. 4. Murthy M. N. (1964): Product method of estimation. Sankhya: Indian Journal of statistical series A, 26, 69–74.
  5. 5. Bahl S. and Tuteja R. K. (1991): Ratio and product type exponential estimator. Journal of information and optimization sciences, 12(1), 159–164.
  6. 6. Kadilar C. and Cingi H. (2003): A Study on the chain ratio type estimator. Hacettepe Journal of Mathematics and Statistics, 32, 105–108.
  7. 7. Enang E.I., Etuk S.I., Ekpenyong E.J. and Akpan V.M. (2016). An alternative exponential estimator for population median. Int. J. Stat. Econ., 17: 85–97.
  8. 8. Shabbir J. and Gupta S. (2017): A Generalized class of difference type estimators for population median in survey sampling. Hacettepe Journal of Mathematics and Statistics. 46 (5), 1015–1028.
  9. 9. Irfan M., Javed M., Shongwe S. C., Zohaib M. and Bhatti S. H. (2021): Estimation of population median under robust measures of an auxiliary variable. Mathematical problems in Engineering, vol. 2021,
  10. 10. Santiago A., Bouza C., Sautto J.M., and Al-Omari A.I. (2016). Randomized response procedure in estimating the population ratio using ranked set sampling. Journal of Mathematics and Statistics, 12(2): 107–114.
  11. 11. Jemain A.A., Al-Omari A.I. and Ibrahim K. (2007). Multistage median ranked set sampling for estimating the population median. Journal of Mathematics and Statistics, 3(2): 58–64.
  12. 12. Jemain A.A., Al-Omari A.I., and Ibrahim K. (2008). Two-stage ranked set sampling for estimating the population median. Sains Malaysiana, 37(1): 95–99.
  13. 13. Ibrahim K., Syam M. and Al-Omari A.I. (2010). Estimating the population mean using stratified median ranked set sampling. Applied Mathematical Sciences, 4(47): 2341–2354.
  14. 14. Bouza C.N. and Al-Omari A.I. (2011). Ranked set estimation with imputation of the missing observations: The median estimator. Revista Investigacion Operacional, 32(1): 30–37.
  15. 15. Al-Omari A.I. (2012). Ratio estimation of population mean using auxiliary information in simple random sampling and median ranked set sampling. Statistics and Probability Letters, 82(11): 1883–1990.
  16. 16. Al-Omari A.I., and Bouza C.N. (2015). Ratio estimators of the population mean with missing values using ranked set sampling. Environmetrics, 26(2): 67–76.
  17. 17. Singh S. (2003): Advances Sampling Theory and Applications: How Michael ‘Selected’ Amy. Volumes I and II, Kluwer Academic Publishers, The Netherlands.
  18. 18. Aladag S. and Cingi H. (2015): Improvement in estimating the population median in simple random sampling and stratified random sampling using auxiliary information. Communication in Statistics- Theory and Methods, 44, 1013–1032.
  19. 19. Chen Z., Bia Z. and Sinha B. K. (2004): Ranked set sampling: Theory and Applications, New York, Springer-Verlag.
  20. 20. Aczel A. D. and Sounderpandian J. (2004). Complete Business Statistics. McGraw Hill, New York.
  21. 21. Scheaffer R. L., William M., Lyman O. (1990): Elementary Survey Sampling. 4th edition, Boston, PWS-Kent Publishing Company.
  22. 22. Singh H. P., and Vishwakarma G. K. (2007): Modified exponential ratio and product estimators for finite population mean in double sampling. Austrian Journal of Statistics, 36(3), 217–225.
  23. 23. Solanki R. S., Singh H. P. and Rathour A. (2012). An Alternative estimator for estimating the finite population means using auxiliary information in sample surveys. ISRN Probab. Stat. Article ID 657682.
  24. 24. Choudhary S. and Singh B. K. (2013): A class of product-cum dual to product estimators of the population mean in survey sampling using auxiliary information. Asian Journal Math. Stat., 6(1): 1–11.
  25. 25. Sukhatme P. V. and Sukhatme B. V. (1970): Sampling Theory of Surveys with Applications. Ames, USA: Lowa State University Press.
  26. 26. Cochran W. G. (1940). The estimation of the yields of the cereal experiments by sampling for the ratio of grain to total produce. Journal of Agricultural Science, 30, 262–275.
  27. 27. Perri, P. F. (2005). Combining two auxiliary variables in ratio cum product type estimators. Proceedings of Italian Statistical Society. Intermediate Meeting on Statistics and environment. Messina, 21-23 September, 2005, 193-196.
  28. 28. Udofia G. A. (2006): Sample Survey Theory and Methods (1). Calabar: University of Calabar Press.
  29. 29. Haq A., Brown J., Moltchanova E. and Al-Omari A.I. (2015). Varied L ranked set sampling scheme. Journal of Statistical Theory and Practice, 9(4): 741–767.
  30. 30. Haq A., Brown J., Moltchanova E. and Al-Omari A.I. (2016). Paired double ranked set sampling. Communications in Statistics-Theory and Methods, 45(1): 2873–2889.
  31. 31. Al-Omari A.I. (2011). Estimation of mean based on modified robust extreme ranked set sampling. Journal of Statistical Computation and Simulation, 81(8): 1055–1066.
  32. 32. Zamanzade E. and Al-Omari A.I. (2016). New ranked set sampling for estimating the population mean and variance. Hacettepe Journal of Mathematics and Statistics, 45(6): 1891–1905.