Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Calibrated-Two Optional Randomized Response Techniques (C-TORRT) for the estimation of quantitative sensitive variable information

  • Mojeed Abiodun Yunusa,

    Roles Conceptualization, Data curation, Methodology, Software, Validation, Writing – original draft

    Affiliation Department of Statistics, Usmanu Danfodiyo University, Sokoto, Nigeria

  • Ahmed Audu ,

    Roles Conceptualization, Investigation, Methodology, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    ahmed.audu@udusok.edu.ng

    Affiliations Department of Statistics, Usmanu Danfodiyo University, Sokoto, Nigeria, Department of Statistical Sciences, Sefako Makgatho Health Sciences University, Pretoria, South Africa

  • Umar Usman,

    Roles Conceptualization, Methodology, Supervision, Writing – review & editing

    Affiliation Department of Statistics, Usmanu Danfodiyo University, Sokoto, Nigeria

  • Kazeem Olalekan Aremu,

    Roles Conceptualization, Investigation, Methodology, Software, Supervision, Validation, Writing – review & editing

    Affiliations Department of Mathematics, Usmanu Danfodiyo University, Sokoto, Nigeria, Department of Maths and Applied Maths, Sefako Makgatho Health Sciences University, Pretoria, South Africa

  • Maggie Aphane

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Software, Supervision, Validation, Writing – review & editing

    Affiliation Department of Maths and Applied Maths, Sefako Makgatho Health Sciences University, Pretoria, South Africa

Abstract

Accurate estimation of sensitive quantitative variables remains a challenge in survey research due to respondents’ reluctance to disclose truthful information. While existing randomized response techniques (RRT) offer privacy protection, many suffer from inefficiencies and limited robustness. This study addresses this critical gap by proposing new classes of Calibrated-Two Optional Randomized Response Techniques (C-TORRT), developed through calibration methods that incorporate auxiliary information to enhance estimation accuracy and respondent privacy. The theoretical framework of the proposed models demonstrates unbiasedness, reduced variance, higher privacy protection, and a superior combined metric of efficiency and privacy. Empirical studies based on real-life and simulated data showed that the proposed C-TORRT models consistently outperformed existing RRT models. For instance, under Population I, the proposed model achieved a variance of 21.76704, privacy level of 222.4369, and a percentage relative efficiency (PRE) of 608.93, compared to the Azeem et al. model with variance 142.4927 and PRE 93.02. Similarly, under Population II, the C-TORRT model reduced the variance to 4.3098 and raised the PRE to 499.60, a significant improvement over Gjestvang and Singh’s variance of 21.5316 and PRE of 100. Real-life data application using academic records confirmed these findings, where the C-TORRT estimators yielded lower variance (0.8084), higher privacy levels (817.28), and smaller combined efficiency-privacy metrics (0.000991) compared to existing models. These results underscore the superior efficiency, precision, and privacy protection of the proposed C-TORRT models, making them robust alternatives for sensitive quantitative data collection.

Introduction

Reliable data on sensitive issues such as cases of rape, use of hard drug, illegal earning, sexual harassment, cheating in examination and other sensitive issues are not easily obtainable from respondents using direct method of questioning in a sensitive survey. In a sensitive survey, respondents would rather choose to provide untrue response or refuse to answer the sensitive question, because of fear of being stigmatized or punished by the law. To obtain more reliable information from the respondents [1], came up with an ingenious approach for data collection known as the randomized response technique (RRT). This method guarantees the privacy of the respondents and also conceals their responses. The technique was specifically designed for qualitative variable but was extended by [2] to quantitative variable [3] reintroduced this technique as additive model. Randomized response techniques (RRT) are valuable techniques used in surveys or questionnaires to collect sensitive data while maintaining the privacy of participants. Social desirability bias is minimized using the randomized response technique (RRT) and it ensures participants privacy in sharing sensitive information without any fear. RRT increases the chance that respondents will answer sensitive questions honestly, the data collected is generally more accurate. This is particularly useful in cases where researchers are studying controversial or sensitive issues. By guaranteeing confidentiality, RRT can increase participation rates in surveys or studies involving sensitive topics. The primary goal of RRT is to protect individual privacy by randomizing responses and ensuring that no one can determine the true response of any specific participant. This provides an extra layer of protection for individuals participating in surveys. RRT can be applied to various types of data collection, including face-to-face interviews, self-administered questionnaires, and online surveys, making it a versatile tool for researchers. By reducing the potential discomfort associated with divulging sensitive information, RRT can lead to lower non-response rates, resulting in more comprehensive and valuable data.

RRT application in different field of endeavors like health, education, criminology, policy making etc. cannot be overemphasized. Under health, It can be used to gather accurate data about sensitive health behaviors like substance abuse, sexual practices, and mental health issues. RRTs can help estimate the prevalence of various diseases, especially those that carries a significant social stigma (such as sexually transmitted infections or mental health disorders) or those related to sensitive behaviors (such as HIV or Hepatitis C). RRT can be used in survey to investigate how patients are using their medication. RRT approach helps researchers and policymakers to have better understanding of disparities in healthcare access, quality, and outcomes, particularly among disadvantaged groups. They can also be employed to assess the accessibility and utilization of healthcare services, particularly among vulnerable populations. This can provide insights into barriers to care and inform targeted interventions. Several authors have proposed different RRT models and estimators for estimation of population parameters of sensitive qualitative/quantitative variables. These authors include [422].

Auxiliary information refers to additional data that is related to the variable of interest and can be used to improve estimators’ performance in sample surveys. Auxiliary information can help reduce the variance of estimators, leading to more precise estimates [23]. This is particularly useful when dealing with small sample sizes or when estimating small subpopulations. Auxiliary information can be used to adjust for potential biases in the sample, such as non-response bias or under-coverage bias, resulting in more accurate estimates. By incorporating auxiliary information, estimators can become more efficient, requiring smaller sample sizes to achieve the same level of precision [24]. This can help reduce the costs associated with data collection. Auxiliary information can be utilized to handle missing data in surveys, either through imputation or by informing the choice of weights in estimation methods. Auxiliary information can come from various sources, such as administrative data, census data, or previous surveys [25]. This allows researchers to leverage existing data to improve estimators and enrich their analyses. Using auxiliary information to align estimates with known population benchmarks or external data sources can facilitate better comparisons across different surveys or time periods. One of the approaches in which the auxiliary information can be incorporated into models/estimators is the use of calibration techniques [26].

Calibration of estimators in sample surveys is a method used to adjust for discrepancies between the survey sample and the target population, particularly when the sample may not be fully a representative. Calibration adjusts estimators to align with known population totals or benchmarks, reducing bias and improving the accuracy of the estimates. Calibration helps reduce the impact of sampling variability, which can arise when a sample is not entirely a representative of the target population. Calibration can help correct for non-sampling errors such as frame coverage errors, non-response bias, and measurement errors. By aligning estimates to known population benchmarks, calibration ensures that estimates from different surveys or time periods can be compared more accurately. Calibrated estimators are generally more robust, as they account for uncertainties in the sampling process and reduce sensitivity to model assumptions. Calibration can incorporate auxiliary information from other sources, such as administrative data or census data, to further refine and improve the estimates. Recently, [27] and [28] developed calibrated estimators for sensitive variables under simple and stratified random sampling respectively. In the present, we aimed at proposing randomized response calibrated estimators for estimation of mean of sensitive study variable under simple random sampling schemes.

Some existing quantitative randomized response schemes and their estimators

Let Y be a sensitive study variable which correlated with an auxiliary variable X from a population consists of N elements from which a sample of size n elements is drawn. Let S and T be scrambling variables which are uncorrelated with Y whose mean and variances are assumed to be known. Let Z be the scrambling response of Y. Then, the following notations are defined.

  1. Population mean of the sensitive study variable Y
  2. : Population variance of the sensitive study variable Y
  3. : Population mean of the scrambling variable S
  4. : Population variance of the scrambling variable S
  5. : Population mean of the scrambling variable T
  6. : Population variance of the scrambling variable T
  7. : Population mean of the auxiliary variable X
  8. : Population variance of the auxiliary variable
  9. : Sample mean of the auxiliary variable X
  10. : Sample variance of the auxiliary variable

Some relevant and related reviewed literatures are presented as follows.

[11] proposed RRT model for estimating sensitive information as in Eq. (1).

(1)

where and are constants determined by the interviewer.

The estimator of population mean and its variance using [11] model under the assumption that are given as in Eqs. (2) and (3) respectively and the combined metric of privacy level and efficiency denoted by is given as in Eq. (4).

(2)(3)(4)

[22] proposed RRT model with one scramble variable for estimating sensitive information as in Eq. (5).

(5)

where and are constants determined by the interviewer.

The estimator of population means and its variance using [22] model under the assumption that are given as in Eqs. (6) and (7) respectively and the combined metric of privacy level and efficiency denoted by is given as in Eq. (8).

(6)(7)(8)

Materials and methods

Proposed calibration estimators

Let and be the sets of respondents belonging to the first and second categories of Z respectively in all the RRT models stated in Eqs. (1) and (5) having elements and . Then, the models and their estimators can be generally written in the form defined Eqs. (9) and (10) respectively.

(9)(10)

where , ,

Motivated by [26], this study proposed two (2) calibration Schemes and Estimators to obtain two new classes of RRT models for sensitive variables,

First proposed calibration estimator.

The first proposed calibration scheme and estimator of population mean is defined as in Eqs. (11) and (12) respectively

(11)

where and are the new calibration weights to be obtained by minimizing the chi-square distance defined as in Equation (12).

(12)

To compute new calibrated weights and , Lagrange function is defined as in Eq. (13).

(13)

Partially differentiating Eq. (13) with respect to , , and , equate the results to zero, we obtained Eqs. (14)(16)

(14)(15)(16)

By substituting Eqs. (14) and (15) into Eq. (16) and solve for , Eq. (17) is obtained.

(17)

Substituting the value of into Eqs. (14) and (15), the expressions for and are obtained as in Eqs. (18) and (19) respectively.

(18)(19)

Substituting Eqs. (18) and (19) in Eq. (11) to obtain the proposed calibration estimator as in Eq. (20)

(20)

where .

Case 1: Setting in Eq. (20), member of denoted by is obtained as in Eq. (21)

(21)

where

Case 2: Setting in Eq. (20), member of denoted by is obtained as in Eq. (22)

(22)

where .

The resultant estimator obtained in Eq. (20) can be expressed as in Eq. (23)

(23)

Compared Eq. (23) with Eq. (20), the first proposed modified RRT model is obtained as in Eq. (24).

(24)

Second proposed calibration estimator.

The second proposed calibration scheme and estimator of population mean is defined as in Eqs. (25) and (26) respectively.

(25)

where and are the new calibration weights to be obtained by minimizing the chi-square distance defined as in Equation (26).

(26)

To compute new calibrated weights and , Lagrange multiplier function is defined as in Eq. (27)

(27)

Partially differentiating (27) with respect to , , and , equate the results to zero, Eqs. (28)(31) are obtained respectively

(28)(29)(30)(31)

By substituting Eqs. (28) and (29) into Eqs. (30) and (31), system of linear equation in Eq. (32) is obtained

(32)

where , , , .

By solving Eq. (32) for and , Eq. (33) is obtained.

(33)

Substituting the values of and into Eqs. (28) and (29), the expressions for and are obtained as in Eqs. (34) and (35) respectively.

(34)(35)

By substituting Eqs. (34) and (35) in Eq. (25), the proposed calibration estimator is obtained as in Eq. (36).

(36)

where , , .

Case 1: Setting in Eq. (36), member of denoted by is obtained as in Eq. (37)

(37)

where

Case 2: Setting in Eq. (36), member of denoted by is obtained as in Eq. (38).

(38)

where ,

The resultant estimator obtained in Eq. (36) can be expressed as in Eq. (39).

(39)

Compare Eq. (39) with Eq. (10), the second proposed modified RRT model (C-RRT-2) is obtained as in Eq. (40) (Table 1).

thumbnail
Table 1. Members of the proposed calibrated optional quantitative RRT model .

https://doi.org/10.1371/journal.pone.0339271.t001

(40)

Properties of the proposed calibration RRT estimators

This subsection presents the theoretical properties (Expectation, variance, privacy level) of the proposed calibrated estimators.

Theorem 1: Given that is unbiased of , then, . That is, the estimators of population mean are unbiased.

Proof:

As , , where , , , .

Since for all , . Then, can be defined as in Eq. (41).

(41)(42)(43)(44)

Hence, the proof:

Theorem 2: Given that is unbiased of , then,

(45)

Proof: The variance of

(46)(47)(48)

Simplify Eq. (48), Eq. (45) is obtained, Hence the proof.

Theorem 3: The privacy level of denoted by is given as in Eq. (49).

(49)

Proof:

(50)(51)

Take expectation of Eq. (51), Eq. (52) is obtained.

(52)

Simplify Eq. (52), Eq. (49) is obtained. Hence, the proof.

By using the results of Eqs. (45) and (49), the combined metric of privacy level and efficiency of , denoted by is obtained as in Eq. (53).

(53)

Properties of proposed model .

  1. (i) The sample mean of the model , denoted by is obtained as in Eq. (54).
(54)
  1. (ii) Let in Eq. (49), then and are obtained as in Eqs. (55) and (56) respectively.
(55)(56)
  1. (iii) Using the result of Eq. (55), the variance of denoted by is obtained as in Eq. (57).
(57)
  1. (iv) Using the results of Eqs. (55)(57), the privacy level of denoted by is obtained as in Eq. (58).
(58)
  1. (iv) Using the results of Eqs. (57) and (58), the combined metric of privacy level and efficiency of denoted by is obtained as in Eq. (59).
(59)

Members of the Proposed Calibrated Optional Quantitative RRT Model are presented in Eqs. (60)(63).

(60)(61)(62)(63)

Properties of proposed model .

  1. (i) The sample mean of the model , denoted by is obtained as in Eq. (64).
(64)
  1. (ii) Let in Eq. (49), then and are obtained as in Eqs. (65) and (66) respectively.
(65)(66)
  1. (iii) Using the result of Eq. (65), the variance of denoted by is obtained as in Eq. (67).
(67)
  1. (iv) Using the results of Eqs. (65) and (66), the privacy level of denoted by is obtained as in Eq. (68)
(68)
  1. (v) Using the results of Eqs. (67) and (68), the combined metric of privacy level and efficiency of denoted by is obtained as in Eq. (69).
(69)

Members of the Proposed Calibrated Optional Quantitative RRT Model are presented in Eqs. (70)(73).

(70)(71)(72)(73)

Theoretical efficiency comparison

This subsection presents theoretical comparisons of the proposed calibrated optional RRT models with their existing counterparts.

  1. i. The Proposed C-RRT Model is more efficient than RRT Model if . Therefore, the condition in Eq. (74) is obtained.
(74)
  1. ii. The Proposed C-RRT Model is more efficient than RRT Model if . Therefore, the condition in Eq. (75) is obtained.
(75)

Results and discussion

Simulation study

In this subsection, simulation studies were conducted using some probability distributions defined in Table 2 to assess the performance of the proposed models with respect to existing models under study. Data of size 1000 units were generated for the study population using function defined in Table 2. Sample of size 100 units were selected using simple random sampling without replacement to compute the biases, efficiency, percentage relative efficiency, privacy level and combined metric of efficiency and privacy of the estimators using Eqs. (76)(80) respectively. The processes of sampling and computation procedures were conducted 100 times and averages of the results were computed as presented in Tables 38.

thumbnail
Table 2. Distributions of non-linear populations used for empirical study.

https://doi.org/10.1371/journal.pone.0339271.t002

thumbnail
Table 3. Bias, variance, PRE, privacy level and combined metric of efficiency and privacy level when and using population I.

https://doi.org/10.1371/journal.pone.0339271.t003

thumbnail
Table 4. Bias, variance, PRE, privacy level and combined metric of efficiency and privacy level when and using population I.

https://doi.org/10.1371/journal.pone.0339271.t004

thumbnail
Table 5. Bias, variance, PRE, privacy level and combined metric of efficiency and privacy level when and using population ii.

https://doi.org/10.1371/journal.pone.0339271.t005

thumbnail
Table 6. Bias, variance, pre, privacy level and combined metric of efficiency and privacy level when and using population II.

https://doi.org/10.1371/journal.pone.0339271.t006

thumbnail
Table 7. Bias, variance, PRE, privacy level and combined metric of efficiency and privacy level when and using population II.

https://doi.org/10.1371/journal.pone.0339271.t007

thumbnail
Table 8. Bias, variance, PRE, privacy level and combined metric of efficiency and privacy level when and using population II.

https://doi.org/10.1371/journal.pone.0339271.t008

(76)(77)(78)(79)(80)
  1. i. Tables 3, 5 and 7 present the numerical results of the estimate population mean for the existing and proposed estimators as well as the variance, PRE, privacy level and Combined Metric of Efficiency and Privacy level of the estimators for population 1, 2, and 3 when and through simulation studies. Similarly, Tables 4, 6 and 8 present the numerical results of the estimate population mean for the existing and proposed estimators as well as the variance, PRE, privacy level and Combined Metric of Efficiency and Privacy level of the estimators for population 1, 2, and 3 when and through simulation studies
  2. ii. Tables 38 present the results of the estimate of the population parameter for the proposed and existing estimators and the results revealed that the estimators of proposed models , , , ,, , and produced estimates that are closer to the corresponding population parameter than the existing estimators except in few cases. For instance, under Population I, when estimating a population value of 11.4454, the C-TORRT estimators gave values like 10.20878, compared to 12.14269 from Azeem et al (2023). This indicates better accuracy and less systematic error in the proposed models. This implies that the proposed models are more robust than the existing models.
  3. iii. The results of variances of the estimators of the proposed and existing models are presented in Tables 38 and the results revealed that the estimators of proposed models , , , ,, , and have minimum variances compared to that of the existing estimators except in few cases. For instance, under Population I, the proposed estimator had variance 21.76704, compared to 142.4927 for Azeem et al. (2023). Under Population II, a variance of 4.3098 was observed for a C-TORRT estimator, compared to 21.5316 for Gjestvang & Singh. These reductions indicate substantial gains in statistical efficiency. This implies that the proposed models are more efficient than the existing models.
  4. iv. Similarly, Tables 38 present the results of the percentage relative efficiency (PRE) for the proposed and existing estimators and the results revealed that the estimators of proposed models , , , ,, , and have higher PRE values compared to the existing estimators with exception of few cases. Under Population I, C-TORRT models showed PRE > 600, while Azeem’s model had PRE = 93.02. Under Population II, C-TORRT achieved PRE = 499.6, again vastly outperforming existing techniques. These values confirm that C-TORRT models require fewer samples to achieve the same precision. These results imply that with exception of few cases, the proposed models are more precise (efficiency gains) in estimating sensitive information compared to the existing models.
  5. v. Tables 38 present the results of the privacy level of the estimators of the proposed and existing models and the results revealed that the estimators of proposed models , , , ,, , and have higher privacy level values than the existing estimators except in few cases. For instance, in Population I, a C-TORRT model had privacy level = 228.0839, while Azeem’s model had 205.1096. In real-life data, C-TORRT had privacy levels around 817.28, which was higher than corresponding models (813.4977). This reflects improved confidentiality safeguards, vital in sensitive surveys. This implies that the privacy of the respondents are more protected in relation to sensitive information under proposed models than under the existing models.
  6. vi. The results of combined metric of efficiency and privacy level of the estimators of the proposed models , , , ,, , , and that of the existing models are presented in Tables 38 and the results revealed that the estimators of proposed models have minimum values of combine metric for efficiency and privacy level compared to that of the existing models except in few cases. For instance, the proposed C-TORRT models had 0.09 or lower, while existing models ranged up to 0.80 or higher. This implies that the proposed models have better balancing of efficiency and privacy level compared to the existing models.

Real life application

In this subsection, the applicability of the proposed models was conducted using Cumulative Grade Point Average (CGPA) of 300 level students of Department of Statistics, Usmanu Danfodiyo University, Sokoto, Nigeria as study variable with auxiliary variable X (number of times students seek academic help), the data is presented in appendix (See Table A1 in S1 File). The existing and the proposed RRT models were applied to the data generated and the results of the data from the models are presented in Tables 9 and 10.

thumbnail
Table 9. Data obtained using Gjestvang and Singh and corresponding proposed models.

https://doi.org/10.1371/journal.pone.0339271.t009

thumbnail
Table 10. Data obtained using Azeem et al. and corresponding proposed models.

https://doi.org/10.1371/journal.pone.0339271.t010

Table 11 present the numerical results of Variance, Privacy level and Combined Metric of Efficiency and Privacy level using data in Tables 9 and 10. The results revealed that the proposed models , and have smaller Variance, higher privacy level and smaller combined metric of efficiency and privacy level compared to the model of Gjestvang and Singh (2009). Similarly, the proposed models , and outperformed the corresponding model of Azeem et al. (2023). For instance: with a sample size of n = 150, the C-TORRT model achieved a combined metric of only 0.000991, compared to 0.001005 from Azeem’s model. These results revealed that the proposed C-RRT models are more robust and precise in estimating sensitive information.

thumbnail
Table 11. Variance, privacy level and combined metric of efficiency and privacy level of Gjestvang and Singh and Azeem et al. and corresponding proposed models.

https://doi.org/10.1371/journal.pone.0339271.t011

Conclusions

This study proposed Calibrated-Two Optional Randomized Response Techniques (C-TORRT) for the Estimation of Quantitative Sensitive Variable Information by modifying RRT models proposed by Gjestvang and Singh (2010) and Azeem et al. (2023). The existing RRT Models were improved by incorporating non-sensitive auxiliary variable that is correlated to the sensitive variable through calibration approach. The C-TORRT models of the proposed calibration schemes were derived. The estimators of the population mean for the C-TORRT models as well as their theoretical properties like variance, privacy level, combined metric of efficiency and privacy level of the proposed calibration were derived so as to assess their efficiency, precision and robustness in estimating sensitive information. Empirical study through simulated data and real-life data as presented in section 4 was conducted numerically. Estimates, variances, privacy level and combined metric of efficiency and privacy level of the proposed and existing RRT models were computed and presented in Tables 36 and 10 respectively.

Based on simulated and real-world data results, the following conclusions are made:

  1. i. Efficiency: C-TORRT models produced significantly lower variances (e.g., 21.77 vs. 142.49; 4.30 vs. 21.53), making them 5–7 times more efficient in some scenarios.
  2. ii. Precision: The bias and PRE values confirm improved accuracy and reliability, with PRE rising above 600 in optimal cases.
  3. iii. Privacy: C-TORRT ensures better respondent protection, with privacy levels consistently higher than existing methods (e.g., 228 vs. 205).
  4. iv. Balanced performance: The combined metric is consistently lower in C-TORRT models (as low as 0.000991), reflecting an optimal trade-off between data quality and privacy.

The results revealed that the C-TORRT models outperformed the existing RRT models considered in the study except in few cases. This implies that the incorporation of auxiliary information through calibration approach enhanced the robustness and the performance of the proposed C-TORRT models. Therefore, it can be concluded that the proposed C-TORRT models have higher goodness of fit as compared to their counterparts.

This study is limited to incorporation of auxiliary variable into RRT models proposed by Gjestvang and Singh (2010) and Azeem et al. (2023) through calibration approach; however, other approaches like two-step calibration, power calibration and calibrated maximum likelihood design weight approaches can be used for further studies.

Supporting information

S1 File. Table A1. CGPA and Number of Times Students Seek Academic Help (NTSH).

https://doi.org/10.1371/journal.pone.0339271.s001

(DOCX)

References

  1. 1. Warner SL. Randomized response: a survey technique for eliminating evasive answer bias. J Am Stat Assoc. 1965;60(309):63–6. pmid:12261830
  2. 2. Warner SL. The linear randomized response model. J Am Stat Assoc. 1971;66(336):884–8.
  3. 3. Pollack KH, Bek Y. A comparison of three randomized response models for quantitative data. J Am Stat Appl. 1976;71:884–6.
  4. 4. Eichhorn BH, Hayre LS. Scrambled randomized response methods for obtaining sensitive quantitative data. J Stat Plann Inference. 1983;7(4):307–16.
  5. 5. Gupta S, Gupta B, Singh S. Estimation of sensitivity level of personal interview survey questions. J Stat Plann Inference. 2002;100(2):239–47.
  6. 6. Bar-Lev SK, Bobovitch E, Boukai B. A note on randomized response models for quantitative data. Metrika. 2004;60(3):255–60.
  7. 7. Singh HP, Mathur N. Estimation of population mean when coefficient of variation is known using scrambled response technique. J Stat Plann Inference. 2005;131(1):135–44.
  8. 8. Gupta S, Thornton B, Shabbir J, Singhal S. A comparison of multiplicative and additive optional RRT models. J Stat Theory Appl. 2006;5(3):226–39.
  9. 9. Gupta S, Shabbir J. On the estimation of population mean and sensitivity in two stage optional randomized response model. J Indian Soc Agric Stat. 2007;61:164–8.
  10. 10. Saha A. A simple randomized response technique in complex surveys. Metron. 2007;65:59–66.
  11. 11. Gjestvang CR, Singh S. An improved randomized response model: estimation of mean. J Appl Stat. 2009;36(12):1361–7.
  12. 12. Huang K-C. Unbiased estimators of mean, variance and sensitivity level for quantitative characteristics in finite population sampling. Metrika. 2009;71(3):341–52.
  13. 13. Gupta S, Shabbir J, Sehra S. Mean and sensitivity estimation in optional randomized response models. J Stat Plann Inference. 2010;140(10):2870–4.
  14. 14. Diana G, Perri PF. New scrambled response models for estimating the mean of a sensitive quantitative character. J Appl Stat. 2010;37(11):1875–90.
  15. 15. Diana G, Perri PF. A class of estimators for quantitative sensitive data. Stat Papers. 2009;52(3):633–50.
  16. 16. Hussain Z. Improvement of the Gupta and Thornton scrambling model through double use of randomization device. Int J Acad Res Bus Soc Sci. 2012;:291–7.
  17. 17. Mehta S, Dass BK, Shabbir J, Gupta S. A three-stage optional randomized response model. J Stat Theory Pract. 2012;6(3):417–27.
  18. 18. Singh HP, Tarray TA. An improved Bar Lev, Bobovitch and Boukai randomized response model using moments ratios of scrambling variable. Hacettepe J Math Stat. 2015;45(89):1–1.
  19. 19. Tarray T, Singh HP. A simple way of improving the Bar-Lev, Bobovitch and Boukai randomized response model. Kuwait J Sci. 2017;44(4):83–90.
  20. 20. Gupta S, Mehta S, Shabbir J, Khalil S. A unified measure of respondent privacy and model efficiency in quantitative RRT models. J Stat Theory Pract. 2018;12(3):506–11.
  21. 21. Azeem M, Salam A. Introducing an efficient alternative technique to optional quantitative randomized response models. Eur J Res Behav Soc Sci. 2023;19(1):24–42.
  22. 22. Azeem M, Shabbir J, Salahuddin N, Hussain S, Ijaz M. A comparative study of randomized response techniques using separate and combined metrics of efficiency and privacy. PLoS One. 2023;18(10):e0293628. pmid:37889902
  23. 23. Cochran WG. The estimation of the yields of cereal experiments by sampling for the ratio of grain to total produce. J Agric Sci. 1940;30(2):262–75.
  24. 24. Cochran WG. Sampling theory when the sampling-units are of unequal sizes. J Am Stat Assoc. 1942;37(218):199–212.
  25. 25. Singh S. Advanced Sampling Theory with Applications: How Michael ‘selected’ Amy Volume. 2003;I:71–136.
  26. 26. Audu A, Singh RVK, Ishaq OO, Khare S, Singh R, Adewara AA. On the estimation of finite population variance for a mail survey design in the presence of non-response using new conventional and calibrated estimators. Commun Stat - Theor Methods. 2022;53(3):848–64.
  27. 27. Jabeen J, Aslam MK, Sanaullah A, Zaka A. Calibrated estimator for sensitive variables under stratified random sampling. Thailand Statistician. 2024;22(2):363–73.
  28. 28. Audu A, Yunusa MA, Aphane M, Lekganyane M. Calibrated optional randomized response techniques for efficient and robust estimation of quantitative sensitive variables. Sci Rep. 2025;15(1):17138. pmid:40382462