Optimum second call imputation in PPS sampling

Fariha Sohil; Muhammad Umair Sohail; Javid Shabbir

doi:10.1371/journal.pone.0261834

Abstract

The current study deals with the imputation of item non-response in probability proportional to size (PPS) sampling. A new imputation procedure is proposed that uses the known covariance between the study variable and the auxiliary variable when the study variable is quantitative and sensitive, allowing for non-response to the randomization mechanism on the second call. An empirical study is conducted at the optimum values of k_og and n_og to compare the ratio, difference, and proposed estimators with the Hansen-Hurwitz estimator.

Citation: Sohil F, Sohail MU, Shabbir J (2022) Optimum second call imputation in PPS sampling. PLoS ONE 17(1): e0261834. https://doi.org/10.1371/journal.pone.0261834

Editor: Dejan Dragan, Univerza v Mariboru, SLOVENIA

Received: June 19, 2021; Accepted: December 12, 2021; Published: January 21, 2022

Copyright: © 2022 Sohil et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: In this research, a hypothetical data set is used which can be easily regenerated at the given value of parameters with the help of available statistical software. The parameters are included in the paper and its Supporting information files.

Funding: The authors received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

1 Introduction

Survey sampling is a technique that is used in almost every field to estimate finite population parameters from a limited number of responses. There are many sample selection procedures that provide reliable data by selecting a representative sample. In equal probability sampling schemes, the probability of selection is equal for all units in the target population. If units vary in size, equal probability sampling may not give appropriate importance to the large or small units in the population. Appropriate importance is assigned by allocating unequal selection probabilities to the different units in the population. Thus, when units differ in size and the variable under study is correlated with auxiliary information such as size, the selection probabilities may be assigned in proportion to unit size. For example,

Colleges with a large number of educational departments are likely to have more students and more faculty members. For the allocation of funds, it may well be desirable to adopt a selection scheme in which colleges are selected with probabilities proportional to their numbers of students or departments.
In an industrial survey, the number of workers may be taken as the size measure of an industrial area.
In biological studies, the number of patients may be selected according to the size of the hospital.

In all of these cases, sampling units are selected with probability proportional to the size of the auxiliary information associated with each unit; this is called sampling with probability proportional to size (PPS). It is well known that the proper use of auxiliary information at the estimation stage, at the design stage, or at both stages helps to improve the performance of the resulting estimators. Ratio, product, and regression estimators are good examples in this context.

In many real-life situations, non-response or refusals may affect the reliability and accuracy of data sets. Refusals occur for many reasons, such as the time of the survey (summer or winter vacations, office hours, etc.), survey content (embarrassing questions, double-barreled questions, etc.), respondent burden (irrelevant questions, length of the questionnaire, etc.), or data collection method (telephone or mail surveys, personal interviews, etc.).

Initially, [1] proposed sub-sampling the non-respondents of the first call by dividing the population into two strata: respondents and non-respondents at the first call. This estimator is discussed in detail in Subsections 1.1 and 1.2 for simple random sampling and PPS sampling, respectively.

1.1 Sample selection in simple random sampling

Let Ω = {Ω₁, Ω₂, Ω₃, ⋯, Ω_N} be a finite population of N units. Let y_i and (x_i, z_i) be the values of the study variable (y) and the auxiliary variables (x, z), respectively, for i = 1, 2, ⋯, N. Assume that x_i has a high positive correlation and z_i a low positive correlation with the study variable y_i, so that x_i is used at the estimation stage and z_i at the sample selection stage. Let a sample ℑ = {ℑ₁, ℑ₂, ⋯, ℑ_n} of size n be selected using simple random sampling without replacement (SRSWOR). Assume that n_1s units respond at the first call and report their responses y_i(1), while n_2s units do not respond at the first call. Further, a sub-sample of size r_1s = n_2s/k, where k > 1, is drawn from the n_2s non-respondents; these units report their responses y_i(2) and belong to group G₁, whereas the remaining r_2s = r_1s(k − 1) units, who refuse to report their responses, belong to group G₂.

Thus, the sub-sampling estimate for population mean, is given by (1) where , , , and r be the respondents. The variance of is given by (2) where , , ,, , , and .
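As a rough illustration of the sub-sampling idea in this subsection, the following sketch computes the classical Hansen-Hurwitz estimate from a simple random sample. The weighting (n_1s × first-call mean + n_2s × sub-sample mean)/n is the standard textbook form of the estimator in [1]; the function name and the toy numbers are hypothetical and are not taken from the paper.

```python
from statistics import mean

def hansen_hurwitz_mean(first_call, second_call_subsample, n_nonrespondents):
    """Classical Hansen-Hurwitz estimate of the population mean.

    first_call: responses y_i(1) from the n_1s first-call respondents
    second_call_subsample: responses y_i(2) from the sub-sampled non-respondents
    n_nonrespondents: n_2s, the number of first-call non-respondents
    """
    n1 = len(first_call)
    n2 = n_nonrespondents
    n = n1 + n2
    # The sub-sample mean stands in for the mean of all n_2s non-respondents.
    return (n1 * mean(first_call) + n2 * mean(second_call_subsample)) / n

# Toy data (hypothetical): 6 first-call respondents, 4 non-respondents, k = 2.
y_first = [10.0, 12.0, 11.0, 9.0, 13.0, 11.0]
y_second = [15.0, 17.0]  # 4 / 2 = 2 sub-sampled units
estimate = hansen_hurwitz_mean(y_first, y_second, n_nonrespondents=4)
print(estimate)
```

The non-respondent stratum is weighted by n_2s rather than by the sub-sample size, which is what lets a small second-call sub-sample represent all first-call non-respondents.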

1.2 Selection of sample with PPS sampling

In the PPS sampling scheme, units are selected with probability proportional to a given measure of size, where size is measured by suitable auxiliary information. Let u_i = y_i/(Nπ_i) and v_i = x_i/(Nπ_i), where and also let and be the unbiased estimators of population means and their variances are and , respectively, where . It is also assumed that the average value of u_i is approximately equal to the average value of y_i.
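To make the transformation u_i = y_i/(Nπ_i) concrete, here is a minimal sketch of PPS selection with replacement. It assumes π_i = z_i/Σz, i.e. selection probabilities proportional to a size measure z; that form, the simulated data, and all function names are illustrative assumptions rather than the paper's own specification.

```python
import random

def pps_wr_sample(sizes, n, rng):
    """Draw n unit indices with replacement, probability proportional to size."""
    total = sum(sizes)
    probs = [z / total for z in sizes]  # assumed pi_i = z_i / sum(z)
    idx = rng.choices(range(len(sizes)), weights=probs, k=n)
    return idx, probs

def pps_mean_estimate(y, probs, idx):
    """Average of u_i = y_i / (N * pi_i) over the selected units."""
    N = len(y)
    u = [y[i] / (N * probs[i]) for i in idx]
    return sum(u) / len(u)

rng = random.Random(2022)
z = [rng.uniform(1.0, 10.0) for _ in range(200)]   # hypothetical size measure
y = [3.0 * zi + rng.gauss(0.0, 1.0) for zi in z]   # hypothetical study variable
true_mean = sum(y) / len(y)

# Average the estimate over repeated samples to see it settle near the true mean.
estimates = []
for _ in range(2000):
    idx, probs = pps_wr_sample(z, n=30, rng=rng)
    estimates.append(pps_mean_estimate(y, probs, idx))
avg_estimate = sum(estimates) / len(estimates)
print(round(true_mean, 3), round(avg_estimate, 3))
```

When y is nearly proportional to the size measure, the u_i values are nearly constant, which is why PPS selection can give a much more stable estimate than equal probability sampling.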

Let a sample s = {s₁, s₂, s₃, ⋯, s_n} of size n be selected using PPS sampling with replacement. Assume that n₁ units respond at the first call and report their responses u_i(1) = y_i(1)/(N₁ π_1i), where and n₂ units do not respond at the first call. Further, a sub-sample of size r₁ = n₂/k is drawn from the n₂ non-respondents; these units report their responses u_i(2) = y_i(2)/(N₂π_2i), where , and belong to group G₁, whereas the remaining r₂ = r₁(k − 1) units, who refuse to report their responses, belong to group G₂. Thus, the Hansen-Hurwitz estimator under the PPS sampling scheme can be modified as: (3) where n₁ and r₁ are the PPS respondent units at the first and second calls, respectively. The variance of is given by (4) where , , , and .

1.3 Statement of the problem

When variables of interest are sensitive or embarrassing in nature, respondents are reluctant to report their true responses or may refuse to respond. Several statistical models are available in the literature to protect the confidentiality and privacy of interviewees by hiding their identities, which helps to reduce non-response bias. The pioneering idea of the randomized response technique (RRT) was described by [2] to handle the high rate of refusals due to the sensitive nature of questions. Such refusals commonly occur in the analysis of demographic and economic variables. Interested readers are referred to [3–9], among many others. [10, 11] use randomized response models (RRMs) to obtain the true status of the interviewee on the second attempt. The estimators proposed by these researchers can perform better than the traditional ones.

The aim of this investigation is to study values that are missing completely at random (MCAR) at the second call, when the interviewees are reluctant to use RRMs. For the non-respondents of the first call, different additive, multiplicative, and subtractive models may be used to give respondents the feeling that their privacy is secured alongside their truthful response.

To create a feeling of privacy protection among the non-respondents of the first call, we modify the linear randomized response model proposed by [12]. From the n₂ non-respondents of the first call, the scrambled response is obtained using the model of [12].

1.3.1 Privacy protection at second call.

Let the i^th respondent draw two cards, S_1i and S_2i, from two independent decks of cards, say D₁ and D₂, respectively, which are uncorrelated with y. At the second call, the i^th respondent reports the scrambled response as follows: (5)

Let E₃ and V₃ be, respectively, the expected value and variance over the scrambled device. We assume that E₃(S_1i) = θ₁, E₃(S_2i) = θ₂, and with and . Also let be the suitable transformation of randomized response for the i^th unit whose expectation under (5) model coincides with the true response y_i, as: (6) with (7) where and .

At the second call, out of the n₂ non-respondents of the first call, only r₁ interviewees give their scrambled responses, and the remaining r₂ units give neither their true nor their scrambled responses. Let be the sample mean of the respondent class at the second attempt.
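The scrambling step can be sketched as follows. Because Eq (5) and Eq (6) are not reproduced here, the sketch assumes a linear scramble of the form z = S₁·y + S₂ together with the transformation (z − θ₂)/θ₁, whose expectation over the scrambling device equals the true value. The deck distributions, the values of θ₁ and θ₂, and the function names are all hypothetical.

```python
import random

THETA1, THETA2 = 1.0, 5.0  # assumed means of the two decks (hypothetical)

def scramble(y_true, rng):
    """Respondent side: draw S1 and S2, report only the scrambled value."""
    s1 = rng.gauss(THETA1, 0.2)  # card from deck D1 (distribution assumed)
    s2 = rng.gauss(THETA2, 1.0)  # card from deck D2 (distribution assumed)
    return s1 * y_true + s2      # assumed linear form, not the paper's Eq (5)

def unscramble(z):
    """Analyst side: transformation whose expectation equals the true value."""
    return (z - THETA2) / THETA1

rng = random.Random(7)
y_true = 40.0
recovered = [unscramble(scramble(y_true, rng)) for _ in range(20000)]
avg_recovered = sum(recovered) / len(recovered)
print(round(avg_recovered, 2))
```

Any single transformed response is noisy, so the interviewer cannot recover an individual's true value, yet the average over many responses settles near it. The price of this privacy is the extra variance contributed by the scrambling device.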

2 Modifying existing literature

In this section, we modify the existing literature in line with the statement of the problem. The most commonly used imputation procedures are discussed in Subsections 2.1, 2.2, and 2.3.

2.1 Mean estimator

In this section, our focus is to impute the missing r₂ values by using a conventional method of imputation. The missing structure is defined as follows: (8)

Hence, the whole population is divided into strata Ω₍₁₎ and Ω₍₂₎ having N₁ and N₂ units, respectively. Furthermore, Ω₍₂₎ is divided into two groups, G₁ and G₂, of R₁ and R₂ units, respectively, where N₁, N₂, R₁, and R₂ are known in advance. For scrambled responses at the second call, the Hansen-Hurwitz point estimator of the population mean can be modified as: (9)

So, we have the following Lemmas.

Lemma 2.1 The variance of , is given by (10)

Proof. Let E_j and V_j, j = 1, 2, be the expected values and variances for given n₂ and r₁, respectively. Then, by the definition of variance, we have (11)

Corollary 2.1.1. It is important to note that requires the second moment (μ_2u) of y, which is generally unknown. [13] suggested two possible ways to acquire μ_2u: (i) guess it from the prior information or pilot survey and (ii) obtain the sample estimate to derive the information about μ_2u by keeping in mind the sensitive nature of u_i.

Lemma 2.2. The variance of , is given by (12)

Proof. Let E_m and V_m, m = 4, 5, be the expected values and variances for given N₁ and N₂, respectively. By definition, we have (13)

Ignoring the correction factor for ease of computation, we have (14)

Corollary 2.2.1. From (4) and (14), we see that the variance of the modified estimator is higher than that of the Hansen-Hurwitz estimator. It means that is less efficient than .

The objective of our study is to increase trust and confidence among interviewees that their privacy is secure alongside their true answers. Moreover, non-response at the first call may occur because of non-availability or inability to provide the required information. Therefore, at the second call, those people may be willing to report their responses directly, even when sensitive characteristics are investigated. For this purpose, the randomization in stages should be re-expounded as an optional randomized response (ORR) procedure, which permits respondents to divulge the direct or true response without using the RRT and is given by (15) where

It is easy to show that the unbiased estimator for is derived by substituting (15) into (9), and that its variance has (1 − t_i)ϕ_i in place of ϕ_i in (14). Furthermore, ORR reduces both the variance and the privacy at various values of t_i for the non-respondents of the first call.

2.2 Ratio estimator

Initially, [14] takes into account the utility of auxiliary information at estimation stage by defining the ratio estimator for population. The traditional ratio estimator can be modified for the imputation of missing scrambled responses at second call, as: (16) where , , and .

The point estimator for sub-population (Ω₍₂₎), is given by (17)

The Hansen-Hurwitz ratio estimator for population mean , is given by (18)

The variance of modified ratio estimator is given by (19) where ,

2.3 Difference estimator

Now, we consider the difference estimator for explaining missing structure of scrambled responses, as: (20) where d is an unknown constant.

The point estimator for the sub-population (Ω₍₂₎) mean is given by (21)

The combined version of modified Hansen-Hurwitz estimator is given by (22)

The variance of estimator, is stated as (23) where .

When , variance of reduces to (24) where .
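The ratio and difference methods of Subsections 2.2 and 2.3 can be illustrated with their textbook forms. The sketch below fills each missing value with (ȳ_r/x̄_r)·x_j for the ratio method and with ȳ_r + d(x_j − x̄_r) for the difference method; these generic forms, the toy data, and the function names are assumptions, and the paper's Eqs (16) and (20) for scrambled PPS responses may differ in detail.

```python
from statistics import mean

def ratio_impute(y_resp, x_resp, x_missing):
    """Fill each missing value with (mean(y_resp) / mean(x_resp)) * x_j."""
    ratio = mean(y_resp) / mean(x_resp)
    return [ratio * xj for xj in x_missing]

def difference_impute(y_resp, x_resp, x_missing, d):
    """Fill each missing value with mean(y_resp) + d * (x_j - mean(x_resp))."""
    y_bar, x_bar = mean(y_resp), mean(x_resp)
    return [y_bar + d * (xj - x_bar) for xj in x_missing]

def completed_mean(y_resp, y_imputed):
    """Mean of the data set after imputation."""
    return mean(list(y_resp) + list(y_imputed))

# Toy data (hypothetical): three respondents, two units with missing y.
y_resp = [20.0, 24.0, 28.0]
x_resp = [10.0, 12.0, 14.0]
x_missing = [8.0, 16.0]

y_ratio = ratio_impute(y_resp, x_resp, x_missing)
y_diff = difference_impute(y_resp, x_resp, x_missing, d=2.0)
print(y_ratio, completed_mean(y_resp, y_ratio))
print(y_diff, completed_mean(y_resp, y_diff))
```

In this toy case y is exactly twice x, so both methods impute the same values. With real data they differ, and the difference method additionally requires a choice of the constant d.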

The problem of estimating population parameters using higher order moments of the auxiliary variable was considered by [15–17]. Later, [18–20], among others, also considered the known higher order moments of the auxiliary variable for estimating finite population parameters. In the theory of survey sampling, it is a well-established result that the use of higher order moments of the auxiliary variable plays a pivotal role in estimating the finite population mean of the study variable. This literature inspired the researchers to impute the missing values at the second call by using the known covariance between the study variable and the auxiliary variable.

3 Proposed imputation procedure

Initially, [21] improves the conventional mean estimator by using a tuning constant (α_(s)), in the case of missing values, as: (25) which leads to Searls’s type estimator for is given by (26)
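The effect of a Searls-type tuning constant can be checked numerically in the classical complete-data case. For the scaled mean α·ȳ, the mean square error is α²σ²/n + (α − 1)²μ², which is minimized at α = 1/(1 + CV²/n), where CV = σ/μ. The sketch below uses this classical setting only; the values of μ, σ, and n are hypothetical, and the paper's α_(s) for missing scrambled PPS responses depends on further quantities that are not reproduced here.

```python
def searls_alpha(cv, n):
    """Tuning constant 1 / (1 + CV^2 / n) that minimizes MSE(alpha * ybar)."""
    return 1.0 / (1.0 + cv ** 2 / n)

def mse_scaled_mean(alpha, mu, sigma, n):
    """MSE of alpha * ybar, ignoring the finite population correction."""
    variance = alpha ** 2 * sigma ** 2 / n
    bias = (alpha - 1.0) * mu
    return variance + bias ** 2

mu, sigma, n = 50.0, 25.0, 10  # hypothetical values, CV = 0.5
alpha_opt = searls_alpha(sigma / mu, n)
mse_plain = mse_scaled_mean(1.0, mu, sigma, n)       # ordinary sample mean
mse_searls = mse_scaled_mean(alpha_opt, mu, sigma, n)
print(alpha_opt, mse_plain, mse_searls)
```

The gain shrinks as n grows because α approaches 1, so a tuning constant of this kind matters most for small samples or highly variable study variables.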

Searls’s approach uses the known coefficient of variation to increase the efficiency of the estimation procedure. The optimum value of α_(s) depends on , and , which are stable quantities. The stability of these constants has been explored by numerous researchers, such as [22–24]. Therefore, the present investigation searches for an optimum imputation method that uses the covariance between the study and auxiliary variables. The imputation of item non-response is given by (27) where α₁, α₂, and α₃ are suitably chosen constants determined by minimizing the resultant mean square error. The point estimator for the population mean is defined as: (28)

The modified version of Hansen-Hurwitz difference estimator is given by (29)

The variance of , is given by (30) where

The optimum values of α_j, j = (1, 2, 3) are obtained, respectively, by minimizing (30), as follows: (31) where , and is the coefficient of multiple determination of u on v and .

Substituting (31) in (30), the variance of , is given by (32) where .

Remark 1. The second term in vanishes if k = 1. This happens when every non-respondent of the first call is interviewed at the second call.

4 Choice of sampling fractions

We shall deduce the optimum values of k and n that minimize the variance at specified cost. The cost function for the proposed model is based on following four components, as:

C₀ = overhead cost.
C₁ = per unit cost for collecting the response by mail inquiry at first call.
C₂ = the unit cost for obtaining the scrambled response from the non-respondent group of first call.
C₃ = cost per unit for editing, processing or imputing the missing r₂ values.

Thus, the cost function is given by (33)

Note that C* is the total cost, which varies from sample to sample. We therefore use the expected cost, obtained by taking the expectation of (33): (34)
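The step from the realized cost to the expected cost can be sketched numerically. The sketch assumes a cost function of the form C* = C₀ + C₁n + C₂r₁ + C₃r₂ and an expected number of first-call non-respondents E(n₂) = nW₂, with W₂ the non-response share. Both forms are assumptions made for illustration, since Eqs (33) and (34) are not reproduced here; the unit costs are hypothetical.

```python
def realized_cost(c0, c1, c2, c3, n, n2, k):
    """Cost for one sample, assuming C* = C0 + C1*n + C2*r1 + C3*r2."""
    r1 = n2 / k        # second-call respondents (scrambled responses)
    r2 = r1 * (k - 1)  # units whose values are imputed
    return c0 + c1 * n + c2 * r1 + c3 * r2

def expected_cost(c0, c1, c2, c3, n, k, w2):
    """Expected cost when E(n2) = n * W2 (assumed)."""
    return realized_cost(c0, c1, c2, c3, n, n * w2, k)

# Hypothetical unit costs; W2 = 310/1000 follows the strata sizes in Section 5.
cost = expected_cost(c0=100.0, c1=1.0, c2=4.0, c3=2.0, n=200, k=2.0, w2=0.31)
print(cost)
```

Under these assumptions, raising k shifts units from the scrambled-response group (cost C₂) to the imputed group (cost C₃), so the expected cost falls with k whenever C₃ < C₂.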

So, we have the following lemma:

Lemma 4.1. The optimum values of k and n for the minimum expected cost are, respectively, given by (35) and (36) where g = c, d, and p.

Proof. Let the variance be fixed at V₀, i.e , then the Lagrange function is given by (37) where ξ is a Lagrange multiplier. Differentiating (37) with respect to n, equating to zero i.e and ignoring δ_j, j = 1 or 2, we have which implies (38)

Note that (39)

Substituting (38) in (39), we have (40)

Substituting (40) in (38), we have (41) which is the required optimum sample size (n_ov). Now, we differentiate (37) with respect to k and equate to zero i.e . Then, we have which implies (42)

Using (38) in (42), we have (43) which is the required optimum value of k.

Corollary 4.1.1. The optimum values of n and k are proportional to the expected cost (C*). To get optimum values of k and n, that, , we simply substitute and in (35) and (36).

5 Empirical comparison

Along the lines of [25], the relative comparison of with respect to is considered by generating a hypothetical population under the following key steps:

Two independent populations, say Ω_(x) and Ω_(z), each of size 1000, are generated from the gamma distribution using the following parameter values: (44)
The study variable is generated by (45)
The population is split into two strata having N₁ = 690 and N₂ = 310 units.
Assume that, out of the N₂ units, R₁ units provide their responses by using (5) and the remaining R₂ = (N₂ − R₁) units refuse to give their true or scrambled responses.
The missing R₂ values are imputed by using and S_uv.
The process is repeated times. The variance of the given estimator is obtained by using the following expression: (46) and the relative efficiency (R.E) of is obtained by using the following expression: (47)
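The repeat-and-compare logic of these steps can be sketched as a small Monte Carlo experiment. The sketch is deliberately simplified: it omits the split into strata, the scrambling, and the imputation, and compares only a plain sample mean with a ratio estimator under simple random sampling. The gamma parameters, the model for y, the number of replications, and the definition of relative efficiency as 100 × (reference variance)/(candidate variance) are all assumptions, since Eqs (44) to (47) are not reproduced here.

```python
import random

def make_population(size, rng):
    """Hypothetical gamma population; shape and scale are placeholders."""
    x = [rng.gammavariate(2.0, 3.0) for _ in range(size)]
    y = [2.0 * xi + rng.gauss(0.0, 1.0) for xi in x]  # placeholder model for y
    return x, y

def sample_mean(x, y, idx):
    return sum(y[i] for i in idx) / len(idx)

def ratio_estimator(x, y, idx):
    x_bar_pop = sum(x) / len(x)
    x_bar_s = sum(x[i] for i in idx) / len(idx)
    return sample_mean(x, y, idx) * x_bar_pop / x_bar_s

def mc_variance(estimator, x, y, n, reps, rng):
    """Mean squared deviation from the true mean over `reps` samples,
    used here as the empirical variance of the estimator."""
    true_mean = sum(y) / len(y)
    total = 0.0
    for _ in range(reps):
        idx = rng.sample(range(len(y)), n)  # SRS without replacement
        total += (estimator(x, y, idx) - true_mean) ** 2
    return total / reps

def relative_efficiency(var_reference, var_candidate):
    """Assumed definition: values above 100 favor the candidate."""
    return 100.0 * var_reference / var_candidate

rng = random.Random(1)
x, y = make_population(1000, rng)
v_mean = mc_variance(sample_mean, x, y, n=50, reps=2000, rng=rng)
v_ratio = mc_variance(ratio_estimator, x, y, n=50, reps=2000, rng=rng)
re_ratio = relative_efficiency(v_mean, v_ratio)
print(round(re_ratio, 1))
```

Fixing the random seed makes the comparison reproducible, in the same spirit as the statement that the hypothetical data set can be regenerated from the given parameter values.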

For the numerical comparison, we consider the following values of the unknown constants: where r is the assumed response rate, which is 40% of n_og. The optimum values of the relative efficiencies (R.Es) of are given in Table 1.

Table 1. Optimum values of k, n, and R.E(j) w.r.t. the Hansen-Hurwitz estimator.

https://doi.org/10.1371/journal.pone.0261834.t001

Table 1 shows the optimum values of k, n, and R.E(j) of the estimators, i.e., the modified ratio and difference estimators. Under this hypothetical population, the modified estimators i.e perform better than the traditional Hansen-Hurwitz estimator . We also observe that the optimum value of n_og is approximately the same for all , so the optimum sample size n_op is used for the relative comparison between the existing and proposed imputation estimators.

From Table 1, we observe the following relationships between C₂, C₃, r₁, V_o, k_op, n_op, and R.E(j).

The values of V_o and n_op have an inverse relationship with C₂ and C₃.
C₂ and C₃ have a positive relationship with R.E(j). As the costs of the scrambled response and imputation increase, the relative efficiencies of improve significantly.
r₁ has a negative association with k_op and n_op. The values of k_op and n_op decrease as r₁ increases.
V_o also has an inverse relationship with k_op and n_op. As the value of V_o increases, the values of k_op and n_op decrease.
The relative efficiencies of are also inversely related to r₁ and V_o. The values of R.E(j) decrease as V_o and n_op increase.

From the numerical findings, we conclude that the proposed imputation procedure at the second call performs better than the existing estimators and the traditional Hansen-Hurwitz estimator at various values of C₂, C₃, r₁, and V_o.

6 Conclusion

The problem of non-response bias in a sensitive quantitative study variable has been reduced by sub-sampling the non-respondents, following the procedure of Hansen and Hurwitz (1946). A new imputation mechanism has been defined by using the known covariance between the study variable and the auxiliary variable. The optimum sample size is also derived for a given set of unit costs (C_q, q = 0, 1, 2, 3), r₁, and V_o. From Table 1, we can say that the proposed imputation method outperforms the ratio, difference, and Hansen-Hurwitz estimators.

When the processing, editing, or imputing cost per unit is high, the proposed imputation strategy can perform better than its counterparts. Our proposed imputation procedure is also useful when there are serious concerns, difficult to ignore, about non-response bias or refusals due to the sensitive nature of the study variable.

Supporting information

S1 Code. In this research a hypothetical data set is used which can be easily regenerated at the given value of parameters with the help of available statistical software.

https://doi.org/10.1371/journal.pone.0261834.s001

(R)

Acknowledgments

We are grateful to the reviewers and the associate editor for their in-depth comments, which improved the quality of the article.

References

1. Hansen M. H. and Hurwitz W. N. (1946). The problem of non-response in sample surveys. Journal of the American Statistical Association, 41(236):517–529 pmid:20279350
2. Abul-Ela A.-L. A., Greenberg G. G., and Horvitz D. G. (1967). A multi-proportions randomized response model. Journal of the American Statistical Association, 62(319):990–1008.
3. Warner S. L. (1965). Randomized response: A survey technique for eliminating evasive answer bias. Journal of the American Statistical Association, 60(309):63–69. pmid:12261830
4. Greenberg B. G., Abul-Ela A.-L. A., Simmons W. R., and Horvitz D. G. (1969). The unrelated question randomized response model: Theoretical framework. Journal of the American Statistical Association, 64(326):520–539.
5. Moors J. (1971). Optimization of the unrelated question randomized response model. Journal of the American Statistical Association, 66(335):627–629.
6. Folsom R. E., Greenberg B. G., Horvitz D. G., and Abernathy J. R. (1973). The two alternate questions randomized response model for human surveys. Journal of the American Statistical Association, 68(343):525–530.
7. Eichhorn B. H. and Hayre L. S. (1983). Scrambled randomized response methods for obtaining sensitive quantitative data. Journal of Statistical Planning and Inference, 7(4):307–316.
8. Mangat N. and Singh R. (1990). An alternative randomized response procedure. Biometrika, pages 439–442.
9. Gjestvang C. R. and Singh S. (2006). A new randomized response model. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 68(3):523–530.
10. Diana G., Riaz S., and Shabbir J. (2014). Hansen and Hurwitz estimator with scrambled response on the second call. Journal of Applied Statistics, 41(3):596–611.
11. Ahmed S., Shabbir J., and Gupta S. (2017). Use of scrambled response model in estimating the finite population mean in presence of non response when coefficient of variation is known. Communications in Statistics-Theory and Methods, 46(17):8435–8449.
12. Diana G. and Perri P. F. (2010a). New scrambled response models for estimating the mean of a sensitive quantitative character. Journal of Applied Statistics, 37(11):1875–1890.
13. Diana G. and Perri P. F. (2010b). New scrambled response models for estimating the mean of a sensitive quantitative character. Journal of Applied Statistics, 37(11):1875–1890.
14. Cochran W. (1940). The estimation of the yields of cereal experiments by sampling for the ratio of grain to total produce. The Journal of Agricultural Science, 30(02):262–275.
15. Srivastava S. K. and Jhajj H. S. (1981). A class of estimators of the population mean in survey sampling using auxiliary information. Biometrika, 68(1):341–343.
16. Isaki C. T. (1983). Variance estimation using auxiliary information. Journal of the American Statistical Association, 78(381):117–123.
17. Singh S. and Horn S. (1998). An alternative estimator for multi-character surveys. Metrika, 48(2):99–107.
18. Mohamed C., Sedory S. A., and Singh S. (2016). Imputation using higher order moments of an auxiliary variable. Communications in Statistics-Simulation and Computation, 46(8):6588–6617.
19. Sohail M. U., Shabbir J., and Ahmed S. (2017). Modified class of ratio and regression type estimators for imputing scrambling response. Pakistan Journal of Statistics, 33(4):277–300.
20. Bhushan S., Pratap Pandey A., and Pandey A. (2018). On optimality of imputation methods for estimation of population mean using higher order moment of an auxiliary variable. Communications in Statistics-Simulation and Computation, pages 1–15.
21. Searls D. T. (1964). The utilization of a known coefficient of variation in the estimation procedure. Journal of the American Statistical Association, 59(308):1225–1226.
22. Murthy M. N. (1967). Sampling theory and methods. Calcutta-35: Statistical Publishing Society, 204/1, Barrackpore Trunk Road, India.
23. Reddy V. (1978). A study on the use of prior knowledge on certain population parameters in estimation. Sankhya C, 40:29–37.
24. Singh S. (2009). A new method of imputation in survey sampling. Statistics, 43(5):499–511.
25. Okafor F. C. and Hyunshik L. (2000). Double sampling for ratio and regression estimation with sub-sampling the non-respondents. Survey Methodology, 26(2):183–188.
