Robust small area estimation for unit level model with density power divergence

Xijuan Niu; Zhiqiang Pang; Zhaoxu Wang

doi:10.1371/journal.pone.0288639

Abstract

The unit level model is one of the classical models in small area estimation and plays an important role when unit-level data are available. Empirical Bayes (EB) estimation, which is optimal under the normality assumption, is the most commonly used parameter estimation method for the unit level model. However, this method is sensitive to outliers, and EB estimation leads to considerable inflation of the mean squared error (MSE) when there are outliers in the responses y_ij. In this study, we propose a robust estimation method for the unit level model with outliers based on the minimum density power divergence. First, by introducing the minimum density power divergence function, we give the estimating equations for the parameters of the unit level model and obtain the asymptotic distribution of the robust parameter estimators. Because the robust estimator involves a tuning parameter, an algorithm for selecting its optimal value is proposed. Second, empirical Bayes predictors of unit values and area means in finite populations are given, and the MSE of the proposed robust estimators of small area means is estimated by a bootstrap method. Finally, we verify the performance of the proposed method on simulated and real data. The comparison shows that the proposed method handles outliers better.

Citation: Niu X, Pang Z, Wang Z (2023) Robust small area estimation for unit level model with density power divergence. PLoS ONE 18(11): e0288639. https://doi.org/10.1371/journal.pone.0288639

Editor: Angelo Moretti, Utrecht University: Universiteit Utrecht, NETHERLANDS

Received: October 27, 2022; Accepted: June 30, 2023; Published: November 16, 2023

Copyright: © 2023 Niu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: This data can be obtained from the R package "rsae", https://github.com/tobiasschoch/rsae.

Funding: This work has been supported in part by the Excellent Graduate Research Project in Gansu Province: "Analysis of Household Surveys Based on Small Area Estimation" (2021CXZX-698) and the Young Scientists Fund of Qinghai Normal University "Application Research of Minimum Density Power Divergence in Robust Small Area Estimation" (KJQN2022014). The funders supported the data collection and analysis, decision to publish, and preparation of the manuscript of this study.

Competing interests: The authors have declared that no competing interests exist.

1 Introduction

In survey sampling, small area estimation (SAE) has found wide favor among statisticians because it addresses the challenge of small or even zero sample sizes [1–4]. Compared with traditional direct estimation, small area estimation handles the small sample problem better by “borrowing strength” from auxiliary information. In practice, small area estimation is widely used in population statistics [1], medical statistics [2], agricultural statistics [5, 6], poverty rate estimation [3, 7] and other fields. The theory of small area estimation has also been developed into a relatively complete system: [8] describes the basic theory of small area estimation, and [9] introduces the theory of small area estimation for several kinds of mixed models. For comprehensive overviews of small area estimation, see [8–11].

Among SAE methods, model-based methods have received the most attention from statisticians. Although direct estimation can give an unbiased estimator of the target variable, a direct estimator that uses only the original data is unreliable when the sample size is small. The small sample size of the raw data can be overcome by statistical models that use auxiliary variables. As one of the basic models of small area estimation, the unit level model estimates the target variable for each unit in a small area and calculates the corresponding area level values from the unit data. Because of limitations in data collection, the acquisition of auxiliary variables, and model computation, the unit level model has received less attention from scholars than the area level model. If observations and auxiliary information are available at the unit level, however, the unit level model is the better choice for SAE. [5] use the nested error regression (NER) model to estimate crop area at the county level based on survey data and satellite data. [12] generalizes the NER model and discusses estimators under the generalized linear mixed model using the hierarchical Bayes (HB) method. The empirical best linear unbiased predictor (EBLUP) is the most widely used method in model-based SAE and handles estimation for linear mixed models well. When the observed variables are binary or count variables, the empirical Bayes (EB) method is more widely used; see, for example, [1, 2]. In the basic unit level model, the unit errors and the area-specific random effects are assumed to follow normal distributions. However, [13] points out that this assumption is difficult to verify in practice, which means that the traditional EB method is not reliable in the presence of outliers.
Meanwhile, in practical applications, outliers are very common because of the small sample size, and they cause large estimation bias in traditional estimation methods. In this paper, we focus on unit level models whose random effects follow different skewed distributions or whose observations contain outliers, and propose robust estimation methods for such models.

The existence and influence of outliers in survey estimation have been studied for a long time. [14] noted that outliers are a fact of life for any survey. [15] discussed how outliers can affect shrinkage estimators: even a single outlier may cause all the small area estimates to collapse to their corresponding direct estimates. At present, there are several common approaches to dealing with outlying observations [14, 16–18]. In the first approach, the outliers are deleted and the remaining observations are used for estimation and prediction. Deleting a few outlying observations may be feasible when the sample size is very large. When the sample size is small, however, this approach not only loses information but also biases the estimates away from the true value. In the second approach, non-outlying observations are used in place of the outliers for estimation, and robust projection is used for robust prediction in the unsampled population. However, [16] states that an observation cannot be considered unique if it is accurately captured, and there is no reason to think that outlying observations cannot occur in the unsampled population. [14] used the robust projection method to construct a robust small area estimator and obtained an MSE estimator for the robust predictors. [3] proposed using global-local shrinkage priors for modeling random effects, which allow for potential outliers in the area effects. In the third approach, the influence of outliers on the estimation results is reduced by constructing robust estimators that are insensitive to outlying observations; this is the approach that has received the most attention from scholars.

Early research on robust SAE can be traced back to the robust estimators of strata means proposed by [19], with small area means being a special case. [15] used a hierarchical Bayes (HB) framework to study the effect of outliers in v_i. HB estimators based on long-tailed distributions (such as the t and Cauchy distributions) are more robust to outlying area-specific errors v_i than estimators based on the normal distribution. The results of [20] indicate that using a t-distribution with small k can diminish the effect of outliers, in the sense that more weight is given to the direct estimator than under normal area-specific errors. However, these two methods only consider robust estimation in special cases and do not discuss the properties of the estimated parameters. [17] introduced robust SAE based on quantiles, using M-quantiles as an alternative way to model between-area variation. [21] proposed a robust Bayes predictor for the Fay-Herriot (FH) model that can overcome the over-shrinkage caused by outlying observations, but the influence of outliers on the model coefficients is not taken into account. [18] developed the robust EBLUP (REBLUP) for general linear mixed models, using Huber’s ϕ-function to modify certain “residual” terms by down-weighting the contributions of outliers. Using a parametric bootstrap procedure, they also developed estimators of the MSEs of the REBLUPs. This method has since been widely used in robust small area estimation, and later research on robust SAE builds on the work above.
For example, robust SAE in business surveys is discussed using robust projection and the M-quantile method in [22]; [23] studied robust estimation for the nested error linear regression model using Huber’s ϕ-function and M-quantiles, based on hierarchical Bayes theory and given prior information; the robust SAE of generalized linear models is discussed in [24]; [25] reviewed robust small area estimation with outliers and proposed a bootstrap MSE estimator based on M-quantile estimators; and [26] provide an overview of robust small area estimation. Readers who are interested in this research can refer to [22–27]. In this paper, we propose a new robust Bayes estimator using the density power divergence, and investigate the proposed estimator’s MSE and parameter estimation.

In statistical inference, density-based minimum distance methods are an effective way to deal with outliers in the data. The density power divergence (DPD) [4], which measures the discrepancy between two density functions, has been successfully used to build robust estimators for independent and identically distributed observations. Since then, minimizing the DPD (MDPD) has become one of the most powerful tools in robust estimation. [28] extended the construction of the DPD and the corresponding minimum DPD estimator (MDPDE) to the case of independent but non-identically distributed data. The main idea of the DPD is to give small weight to the terms related to outliers, so that the parameter estimates become robust against outliers. The tuning parameter α controls the trade-off between robustness and efficiency: the smaller the value of α, the more efficient the estimator; the larger the value of α, the more robust the estimator is to outliers. The minimum divergence estimator corresponding to α = 0 is the maximum likelihood estimator (MLE). [29] used the density power divergence to study robust Bayesian estimators and discussed the asymptotic properties of the estimators and parameters. [30] discussed the theory of the MDPD method in robust regression using S-estimation, and showed that robust estimation based on the MDPD method and on Huber’s ϕ-function have similar effects. [31] employed the γ-divergence (similar to the density power divergence) for the Fay-Herriot model and discussed empirical Bayes confidence intervals rather than the MSE. Therefore, in this paper, we apply the MDPD method to the unit level model and compare it with the robust estimation proposed by [18].

The main contributions of this paper are as follows. First, the MDPD method is applied to the basic unit level model with outliers, robust estimates of the unknown model parameters are obtained, and the asymptotic properties of the estimated parameters are derived. Second, an algorithm is proposed for selecting the optimal tuning parameter of the estimators. Third, combining the parameter estimators, a general expression for the robust small area estimator under the unit level model is proposed. Fourth, the bootstrap method is used to estimate the MSE of the robust estimators, and the algorithm is given. Finally, the maximum likelihood estimator, the robust estimator of [18] and the proposed robust estimator are compared on simulated and real data to illustrate the efficiency and robustness of the estimators.

The rest of this paper is organized as follows. In Section 2, we introduce the basic unit level model and the notation used. In Section 3, we review the background and definition of the MDPD method, obtain the robust estimating equations for the model parameters by applying the MDPD method to the unit level model, and give an algorithm for selecting the optimal tuning parameter. The asymptotic properties of the robust parameter estimators obtained by the MDPD method are presented in Section 4. In Section 5, we first propose the robust empirical Bayes predictor based on the MDPDE and then give the procedure for estimating the MSE of the robust EBP using the bootstrap method. In Section 6, we investigate the performance of the proposed estimator on simulated and real data. The proposed method is compared with the robust method of [18] under different outlier-generating mechanisms.

2 Basic unit level model

The NER model is a popular basic unit level model proposed by [5]. Suppose a finite survey population is partitioned into m small areas, with the i-th area having N_i units such that $\sum_{i=1}^{m} N_i = N$. We assume that y_ij is the value of a response variable Y for the j-th unit in the i-th small area. The unit-specific auxiliary data x_ij = (x_ij1, …, x_ijp)^T are available for each population element j (j = 1, …, N_i) in each small area i. Then the NER model is described as $y_{ij} = x_{ij}^T\beta + v_i + e_{ij}$, j = 1, …, N_i, i = 1, …, m, (1) where β is a p-variate vector of unknown regression coefficients and the area-specific random effects v_i are assumed to be independent $N(0, \sigma_v^2)$. The unit errors are $e_{ij} = k_{ij}\tilde{e}_{ij}$ for known constants k_ij, and the $\tilde{e}_{ij}$’s are iid random variables independent of the v_i’s, with $\tilde{e}_{ij} \sim N(0, \sigma_e^2)$.

We assume that a sample s_i of size n_i is taken from the N_i units in the i-th area (i = 1, …, m), and that the sample values also obey the assumed model (1). The latter assumption is satisfied under simple random sampling from each area or, more generally, for sampling designs that use the auxiliary information x_ij in the selection of the samples s_i. To see this, we write (1) in matrix form as $y_i = x_i\beta + v_i 1_i + e_i$, i = 1, …, m, (2) where x_i is an n_i × p matrix, y_i and e_i are n_i × 1 vectors, and 1_i is the n_i × 1 vector of ones.

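Let $\theta = (\beta^T, \sigma_v^2, \sigma_e^2)^T$ denote the unknown parameter of the model given in Eq (1). Under the normality assumption of the model, y_i∣x_i is normally distributed. The conditional probability density is $f_\theta(y_i \mid x_i) = (2\pi)^{-n_i/2}\,|V_i|^{-1/2}\exp\{-\tfrac{1}{2}(y_i - x_i\beta)^T V_i^{-1}(y_i - x_i\beta)\}$ (3) where $V_i = \sigma_v^2 1_i 1_i^T + \sigma_e^2\,\mathrm{diag}(k_{i1}^2, \dots, k_{in_i}^2)$.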

The matrix V_i can be inverted explicitly. Using the Sherman-Morrison formula: and denoting we get (4) where
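As a numerical sanity check of the explicit inverse, the following minimal sketch assumes k_ij = 1, so that V_i = σ_e² I + σ_v² 1 1^T. Under that assumption the Sherman-Morrison formula gives V_i⁻¹ = σ_e⁻² (I − (g_i / n_i) 1 1^T) with g_i = σ_v² / (σ_v² + σ_e² / n_i). The function names and the symbol g_i (a shrinkage factor, not the DPD tuning parameter) are illustrative choices and are not taken from Eq (4).

```python
def v_matrix(n, sigma2_v, sigma2_e):
    """V_i = sigma2_e * I + sigma2_v * 1 1^T (assumes k_ij = 1)."""
    return [[sigma2_v + (sigma2_e if r == c else 0.0) for c in range(n)]
            for r in range(n)]


def v_inverse(n, sigma2_v, sigma2_e):
    """Closed-form inverse obtained from the Sherman-Morrison formula."""
    shrink = sigma2_v / (sigma2_v + sigma2_e / n)  # g_i in the lead-in
    return [[((1.0 if r == c else 0.0) - shrink / n) / sigma2_e
             for c in range(n)] for r in range(n)]


def matmul(a, b):
    """Plain-Python matrix product for small square matrices."""
    n = len(a)
    return [[sum(a[r][k] * b[k][c] for k in range(n)) for c in range(n)]
            for r in range(n)]


# Illustrative values only: area sample size 4, sigma_v^2 = 1, sigma_e^2 = 2.
n_i, s2v, s2e = 4, 1.0, 2.0
product = matmul(v_matrix(n_i, s2v, s2e), v_inverse(n_i, s2v, s2e))
max_error = max(abs(product[r][c] - (1.0 if r == c else 0.0))
                for r in range(n_i) for c in range(n_i))
print(max_error)  # deviation of V_i V_i^{-1} from the identity matrix
```

The closed form avoids a general n_i × n_i inversion for every area, which matters when the estimating equations are iterated.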

3 Density power divergence

3.1 Minimum density power divergence estimator

The density power divergence (DPD) measure was developed by [4] in terms of a tuning parameter γ. The DPD measure between the model density f_θ and the true density g is defined as $d_\gamma(g, f_\theta) = \int f_\theta^{1+\gamma}(y)\,dy - \left(1 + \frac{1}{\gamma}\right)\int f_\theta^{\gamma}(y)\,g(y)\,dy + \frac{1}{\gamma}\int g^{1+\gamma}(y)\,dy$, where γ > 0 is a tuning parameter. Note that g is not necessarily a member of the model family F_θ. For γ = 0, the DPD measure is defined as the limiting case γ → 0⁺, which is the Kullback-Leibler (KL) divergence. Generally, given a parametric model, we estimate θ by minimizing the DPD measure with respect to θ over its parameter space Θ. We call this estimator the minimum density power divergence estimator (MDPDE). It is well known that, for γ = 0, minimizing the KL divergence is equivalent to maximizing the log-likelihood function. Thus, the MLE can be considered a special case of the MDPDE with γ = 0.
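The limiting behavior as γ → 0⁺ can be checked numerically. The sketch below is illustrative only: it uses two univariate normal densities, N(0, 1) as the "true" density and N(1, 1) as the model, and a simple trapezoidal rule. These choices and all function names are assumptions made for the example, not part of the paper.

```python
import math


def normal_pdf(y, mean, sd):
    return math.exp(-0.5 * ((y - mean) / sd) ** 2) / (sd * math.sqrt(2 * math.pi))


def integrate(func, lo=-15.0, hi=15.0, steps=6000):
    """Trapezoidal rule on a fixed grid (adequate for Gaussian tails)."""
    h = (hi - lo) / steps
    total = 0.5 * (func(lo) + func(hi))
    total += sum(func(lo + k * h) for k in range(1, steps))
    return total * h


def dpd(g, f, gamma):
    """Density power divergence d_gamma(g, f) for gamma > 0."""
    return integrate(lambda y: f(y) ** (1 + gamma)
                     - (1 + 1 / gamma) * g(y) * f(y) ** gamma
                     + (1 / gamma) * g(y) ** (1 + gamma))


def kl(g, f):
    """Kullback-Leibler divergence between the true density g and the model f."""
    return integrate(lambda y: g(y) * math.log(g(y) / f(y)))


def g_true(y):
    return normal_pdf(y, 0.0, 1.0)


def f_model(y):
    return normal_pdf(y, 1.0, 1.0)


kl_value = kl(g_true, f_model)          # analytically equal to 0.5 here
dpd_small = dpd(g_true, f_model, 1e-3)  # should be close to the KL value
dpd_half = dpd(g_true, f_model, 0.5)    # positive for distinct densities
dpd_same = dpd(g_true, g_true, 0.5)     # zero when model and truth coincide
print(kl_value, dpd_small, dpd_half, dpd_same)
```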

We substitute the conditional density f_θ(y_i ∣ x_i) in (3) as the model density into the definition of the DPD, and define the DPD measure based on the SAE model as where h(x) is the marginal probability density function of X and g(y∣x) is the true conditional density of Y given X. For γ > 0, after approximating the true distribution with the empirical distribution, the DPD measure turns out to be (5) where the last part of the expression on the right-hand side of Eq (5) is independent of the unknown parameter θ. Hence, Eq (5) simplifies to (6) where . Using formula (4), we get

The MDPDE of θ is obtained by minimizing over θ ∈ Θ, where Θ is the parameter space consisting of all possible parameters θ. If the i-th observation is an outlier, then the value of the conditional density f_θ(y_i ∣ x_i) is small compared with that of the other observations. Its contribution to the second term of Eq (5) is then negligible when γ > 0, so the corresponding MDPDE is robust against outliers. The tuning parameter γ controls the trade-off between the efficiency and robustness of the MDPDE: as γ increases, robustness increases and efficiency decreases, and vice versa. In addition, when γ = 0, the DPD becomes the KL divergence and the MDPDE becomes the MLE. In that case, for an outlying observation, the KL divergence measure diverges as f_θ(y_i ∣ x_i) → 0, and the MLE breaks down.
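The down-weighting mechanism can be made concrete with a small sketch. It assumes a univariate N(μ, σ²) model rather than the multivariate area-level density of Eq (3), and reports the weight f_θ(y)^γ relative to its maximum, which equals exp(−γ r² / (2σ²)) for a residual r = y − μ. The function name and the residual values are hypothetical.

```python
import math


def dpd_weight(residual, sigma, gamma):
    """Relative weight f_theta(y)^gamma / max f_theta^gamma under N(mu, sigma^2)."""
    return math.exp(-gamma * residual ** 2 / (2 * sigma ** 2))


# Residuals from typical (0 to 2) up to clearly outlying (5, 10), in units of sigma.
residuals = [0.0, 1.0, 2.0, 5.0, 10.0]
for gamma in (0.0, 0.1, 0.3):
    weights = [round(dpd_weight(r, 1.0, gamma), 4) for r in residuals]
    print(gamma, weights)
```

With γ = 0 every observation receives weight 1, which is the MLE case; for γ > 0 the weight of a large residual decays exponentially in γ r².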

Taking the partial derivatives of with respect to in Eq (5) gives the following estimating equations: (7) (8) (9)

3.2 Choice of the optimal tuning parameter γ

The results above show that the MDPD estimate of θ, which is obtained iteratively, depends on the unknown tuning parameter γ. The choice of γ determines the trade-off between robustness and statistical efficiency [4]: the closer γ is to 1, the more robust the estimator; the closer γ is to 0, the less robust but the more efficient it is. Choosing an appropriate tuning parameter γ is therefore the key factor in robust estimation, and we would like to choose γ in a data-driven way that balances robustness and efficiency. At present, there are two main approaches to selecting tuning parameters. The first is based on a prescribed ratio between efficiency and robustness: researchers fix this ratio according to their own needs and then select the corresponding tuning parameter. For example, [29] selects the tuning parameter in this way when using the DPD method for area level estimation. The second approach is data-driven: [32] minimizes the MSE of the estimated parameter to obtain the optimal tuning parameter, but this method depends on the choice of an initial value, and different pilot values of the parameter may result in different tuning parameters. Building on [32], [33] proposed a tuning parameter selection method that does not depend on the initial value. In this paper, we use the selection method of [33] to choose the optimal tuning parameter for constructing robust small area estimators.

For the true value θ* of the unknown parameter θ, the optimal tuning parameter γ is obtained by minimizing the summed MSE of the MDPD estimator, i.e., (10)

Because the unknown parameter θ* appears in formula (10), there are usually two ways to select the optimal tuning parameter γ. One is to treat the estimate $\hat\theta$ as the true value θ*, so that the optimal tuning parameter is obtained by minimizing only the first term of the above equation. This method is easy to use, but $\hat\theta$ converges to θ only asymptotically, so direct substitution can produce a large error. The other is to set an initial (pilot) value θ^p of θ* and then minimize (10) to find the optimal parameter. This method is used in [32], where the Warwick-Jones (WJ) algorithm is given for selecting the optimal tuning parameter. However, the WJ algorithm relies heavily on the choice of the pilot θ^p, which directly determines the selected tuning parameter. To overcome the shortcomings of these two approaches, [33] proposed the iterative WJ (IWJ) algorithm for computing the optimal tuning parameter. We also use this method in this paper. The steps of the algorithm are as follows:

Algorithm 1 IWJ algorithm

Input: the initial tuning parameter γ⁽⁰⁾ and the initial pilot value θ*⁽⁰⁾.

Repeat:

1: Use the WJ algorithm to minimize Formula (10) with the current pilot θ*⁽ⁱ⁾ and update γ⁽ⁱ⁺¹⁾ within the interval I_γ.

2: Fix γ⁽ⁱ⁺¹⁾, run the MDPD iteration to obtain the estimate of θ, and update the pilot: θ*⁽ⁱ⁺¹⁾ ← MDPD(γ⁽ⁱ⁺¹⁾).

3: Repeat steps 1 and 2 until |γ⁽ⁱ⁺¹⁾ − γ⁽ⁱ⁾| < ϵ or |θ*⁽ⁱ⁺¹⁾ − θ*⁽ⁱ⁾| < ϵ*, where ϵ and ϵ* are the tolerances for the tuning parameter and the parameter estimate.

Output: the final iterates γ⁽ⁱ⁺¹⁾ and θ*⁽ⁱ⁺¹⁾.
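The loop in Algorithm 1 can be sketched on a deliberately simplified problem. The example below is not the unit level model: it assumes a univariate normal location model with known σ = 1, for which the MDPDE of the mean solves a weighted-mean fixed point and its asymptotic variance is σ²(1 + γ²/(1 + 2γ))^{3/2}/n. The grid search over γ, the median pilot, the toy data and all function names are assumptions made for illustration.

```python
import math
import statistics


def mdpde_location(y, gamma, sigma, start, iters=200):
    """MDPDE of a normal mean with known sigma, via a weighted-mean fixed point."""
    mu = start
    for _ in range(iters):
        w = [math.exp(-gamma * (v - mu) ** 2 / (2 * sigma ** 2)) for v in y]
        new_mu = sum(wi * v for wi, v in zip(w, y)) / sum(w)
        if abs(new_mu - mu) < 1e-10:
            return new_mu
        mu = new_mu
    return mu


def amse(mu_gamma, gamma, pilot, sigma, n):
    """Squared bias against the pilot plus the asymptotic variance of the MDPDE."""
    variance = sigma ** 2 * (1 + gamma ** 2 / (1 + 2 * gamma)) ** 1.5 / n
    return (mu_gamma - pilot) ** 2 + variance


def iwj(y, sigma, grid, gamma0, pilot0, tol=1e-6, max_rounds=50):
    """Iterated Warwick-Jones selection of gamma over a finite grid."""
    n, gamma, pilot = len(y), gamma0, pilot0
    for _ in range(max_rounds):
        # WJ step: choose the gamma that minimises the estimated AMSE for this pilot.
        scored = [(amse(mdpde_location(y, g, sigma, pilot), g, pilot, sigma, n), g)
                  for g in grid]
        new_gamma = min(scored)[1]
        # Refit with the selected gamma and use the fit as the next pilot.
        new_pilot = mdpde_location(y, new_gamma, sigma, pilot)
        done = abs(new_gamma - gamma) < tol or abs(new_pilot - pilot) < tol
        gamma, pilot = new_gamma, new_pilot
        if done:
            break
    return gamma, pilot


# Toy data: ten clean values symmetric about 0 plus two gross outliers.
y = [-1.5, -1.0, -0.6, -0.3, -0.1, 0.1, 0.3, 0.6, 1.0, 1.5, 8.0, 9.0]
grid = [0.05 * k for k in range(1, 21)]
gamma_hat, mu_hat = iwj(y, sigma=1.0, grid=grid, gamma0=1.0,
                        pilot0=statistics.median(y))
print(gamma_hat, mu_hat, sum(y) / len(y))
```

The stopping rule mirrors step 3 of the algorithm (either tolerance being met ends the loop). A continuous optimizer over I_γ could replace the grid search; the grid keeps the sketch dependency-free.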

4 Asymptotic distribution of the robust estimator

In this section, we investigate the asymptotic distribution of the robust estimator of the model parameters when the data-generating distribution G(y ∣ x) is not necessarily in the model family. We define the score function as (11)

According to the definition of score function and (3), we can get (12)

For i = 1, 2, ⋯, N, we define

We further define . For the asymptotic distribution of the MDPDE, we need the following assumptions:

(1) The true density g(y ∣ x) is supported over the entire real line ℝ;
(2) There is an open subset ω of Θ₀ containing the best fitting parameter θ such that J is positive definite for all θ ∈ ω;
(3) There exist functions M_jkl(x, y) such that ≤ M_jkl(x, y) for all θ ∈ ω, where ∫_x∫_y|M_jkl(x, y)|g(y ∣ x)h(x)dydx < ∞ for all j, k and l.

Theorem 4.1 Under the regularity conditions (1)-(3), with probability tending to 1 as m → ∞, there exists , such that

is consistent for θ;
the asymptotic distribution of is given by

Proof: The proof of the theorem is given in S1 File.

Note that, if the true distribution g(y ∣ x) is a member of the model family f_θ(y∣x) for some θ ∈ Θ, then (13)

In this case, the symmetric matrix J⁽ⁱ⁾ can be partitioned as

Combining Eqs (12) and (13), the elements in J⁽ⁱ⁾ can be deduced. Detailed calculation and results can be found in S1 File.

Similarly, ξ⁽ⁱ⁾ can be partitioned as . The derivation of the components of ξ⁽ⁱ⁾ is given in S1 File.

Note that if we write the matrix J⁽ⁱ⁾ as a function of γ, i.e. J⁽ⁱ⁾ ≡ J⁽ⁱ⁾(γ), then, based on the representation of K⁽ⁱ⁾ and J⁽ⁱ⁾ in (13), we have

Therefore, K can be written as

The covariance matrix above shows that the variance of the parameter estimators increases with γ, which indicates that the efficiency of the MDPDE decreases as γ increases. This confirms that the tuning parameter γ controls the trade-off between the efficiency and robustness of the MDPDE: robustness increases and efficiency decreases as γ increases. However, our subsequent simulations show that this loss of efficiency is not significant.
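To give a sense of the size of this efficiency loss, the sketch below tabulates a known special case rather than the unit level model itself: for the MDPDE of the mean of a univariate normal distribution, the asymptotic variance is inflated by the factor (1 + γ²/(1 + 2γ))^{3/2} relative to the MLE. Using this simpler model as a proxy is an assumption of the example.

```python
def relative_efficiency(gamma):
    """Asymptotic efficiency of the MDPDE of a normal mean relative to the MLE."""
    inflation = (1 + gamma ** 2 / (1 + 2 * gamma)) ** 1.5
    return 1.0 / inflation


for gamma in (0.0, 0.1, 0.2, 0.3, 0.5, 1.0):
    print(f"gamma={gamma:.1f}  efficiency={relative_efficiency(gamma):.3f}")
```

In this special case the efficiency stays close to 1 for small γ and drops noticeably only as γ approaches 1, which is in line with the qualitative statement above.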

5 Robust empirical Bayes predictor and MSE

5.1 Robust EB predictor under a finite population

In this section, we discuss EB estimators of parameters of a finite population. A finite population P contains N units, and a sample s of size n is drawn from P. We denote by y^P the vector of unit values of the target variable in the population, which is assumed to be random with a given joint probability distribution. We write y_s for the subvector of y^P corresponding to the sampled units and y_r for the subvector corresponding to the unsampled units, and assume without loss of generality that the first n units of y^P are the sample elements, that is, $y^P = (y_s^T, y_r^T)^T$.

We assume that the value y_ij of the target variable for the j-th unit in the i-th area follows the basic unit level model (1). We also assume that y_i is normally distributed conditional on the auxiliary information X_i, i.e., y_i ∼ N(X_iβ, V_i). We next partition (2) into sampled and nonsampled parts: where the subscript r denotes the nonsampled units. The covariance matrix can be decomposed as: where

The non-sampled sub-vectors y_ir follow the marginal models derived from the population model (1), i.e. (14)

The vectors e_ir are independent with , where . The vectors y_ir are independent and normally distributed with where μ_ir = X_irβ.

The distribution of y_ir, given the sample data y_is, is (15)

The conditional mean vector is (16) and the conditional covariance matrix is (17) where

If n_i ≠ 0 and j ∈ U_i − s_i, the conditional mean is where and For any j ∈ U_i− s_i, it thus holds that

For any j ∈ U_i − s_i, the conditional variance is

In general, our goal is to use the available sample data y_s to estimate the value of a real measurable function τ = h(y^P) of the population vector y^P. Therefore, the conditional distribution of y_r given y_s plays an important role in the calculation of the best predictors (BPs) of population parameters τ = h(y^P). Assume that the model parameters are known. Under the unit level model, the BP is the unbiased predictor of τ that minimizes the MSE. According to [8], the BP of τ is the conditional expectation $E_{y_r}(\tau \mid y_s)$.

Therefore, the EB estimator of τ can be obtained by using formula (15)–(17), , where , and . In practice, the model parameters are replaced by consistent estimates , and then the variables are generated from (15), thus, the EBLUP of can be obtained.

5.2 EBLUP of area means

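In this section, we derive the EBLUPs of the area means $\bar{Y}_i$, where $\bar{Y}_i = N_i^{-1}\sum_{j=1}^{N_i} y_{ij}$. Let $\hat\beta$, $\hat\sigma_v^2$ and $\hat\sigma_e^2$ be consistent estimators of the model parameters β, σ_v² and σ_e², respectively. Under the conditional distribution (15), the predicted values are (18) or equivalently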

The EBLUP of is (19) where is the domain sample fraction, , and .
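A minimal sketch of an area-mean predictor of this type is given below. It uses the textbook form for the NER model with k_ij = 1 and a single covariate plus intercept, f_i ȳ_is + (1 − f_i)[x̄_ir β̂ + g_i (ȳ_is − x̄_is β̂)], where f_i = n_i / N_i and g_i = σ̂_v² / (σ̂_v² + σ̂_e² / n_i). The notation of Eq (19) may differ; the function name, argument names and numbers are hypothetical, and the parameter estimates are simply passed in rather than obtained by the MDPD method.

```python
def eblup_area_mean(y_s, x_s, xbar_r, N_i, beta, sigma2_v, sigma2_e):
    """Area-mean predictor for one area under a NER model with k_ij = 1.

    y_s, x_s : sampled responses and covariate values in the area
    xbar_r   : mean of the covariate over the non-sampled units
    beta     : (intercept, slope) estimates supplied by the caller
    """
    n_i = len(y_s)
    f_i = n_i / N_i                                   # domain sampling fraction
    ybar_s = sum(y_s) / n_i
    xbar_s = sum(x_s) / n_i
    shrink = sigma2_v / (sigma2_v + sigma2_e / n_i)   # weight on the area residual
    b0, b1 = beta
    v_hat = shrink * (ybar_s - (b0 + b1 * xbar_s))    # predicted area effect
    pred_r = b0 + b1 * xbar_r + v_hat                 # predicted non-sampled mean
    return f_i * ybar_s + (1 - f_i) * pred_r


estimate = eblup_area_mean(y_s=[3.0, 5.0, 4.0, 4.0], x_s=[1.0, 2.0, 1.0, 2.0],
                           xbar_r=1.0, N_i=40, beta=(1.0, 1.0),
                           sigma2_v=1.0, sigma2_e=1.0)
print(estimate)
```

A robust version would plug in the MDPD estimates of β, σ_v² and σ_e² in place of the values passed here.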

5.3 MSE of the EBP

It is usually difficult to estimate the mean squared prediction error (MSPE) under the unit level model, for two reasons. First, the true distribution of the error terms and of the non-sampled units in the unit level model is unknown, so the density function needed for the MSPE calculation cannot be obtained. Second, even when the distribution of the unsampled units is known, the MSPE calculation can involve multiple integrals in the expectations, because the model involves unit level data. In this paper, we use the parametric bootstrap method of [18, 34] to estimate the MSPE.

The parametric bootstrap method can be used to estimate the MSE of the EBP for finite populations. The method proceeds as follows:

(1) Obtain the robust estimates of the parameters and using the DPD method described in Section 3;
(2) Generate bootstrap domain effects as ;
(3) Generate, independently of , unit errors as ;
(4) Generate a bootstrap population of response variables from the model ;
(5) Let denote the vector of generated bootstrap response variables for area i. Calculate target quantities for the bootstrap population as ;
(6) Fit the model to the bootstrap data and obtain bootstrap model parameter estimators, denoted , and ;
(7) Obtain the bootstrap EB estimator of τ_i using (18), denoted ;
(8) Repeat steps (2)-(7) a large number of times B. Let be the true value and the EB estimator obtained in the b-th replicate of the bootstrap procedure, b = 1, …, B;
(9) The bootstrap MSE estimator of is given by
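The bootstrap procedure above can be sketched in a few lines under strong simplifications, all of which are assumptions of this example: an intercept-only model y_ij = μ + v_i + e_ij, balanced samples of size n from areas of size N, and simple ANOVA moment estimators standing in for the MDPD fit (the robust fit itself is not reproduced here). The target is the area mean, and all function names are hypothetical.

```python
import random


def fit(samples):
    """Moment (ANOVA) fit of y_ij = mu + v_i + e_ij for balanced samples.
    This is a stand-in for the MDPD fit, not the robust estimator itself."""
    m, n = len(samples), len(samples[0])
    means = [sum(s) / n for s in samples]
    mu = sum(means) / m
    s2e = sum((y - mb) ** 2 for s, mb in zip(samples, means) for y in s) / (m * (n - 1))
    between = sum((mb - mu) ** 2 for mb in means) / (m - 1)
    s2v = max(between - s2e / n, 0.0)
    return mu, s2v, s2e


def predict(samples, N, params):
    """EB-type predictor of each area mean under the fitted model."""
    mu, s2v, s2e = params
    n = len(samples[0])
    shrink = s2v / (s2v + s2e / n) if s2v > 0 else 0.0
    f = n / N
    return [f * (sum(s) / n) + (1 - f) * (mu + shrink * (sum(s) / n - mu))
            for s in samples]


def bootstrap_mse(samples, N, B=100, seed=1):
    """Parametric bootstrap estimate of the MSE of each area-mean predictor."""
    rng = random.Random(seed)
    m, n = len(samples), len(samples[0])
    mu, s2v, s2e = fit(samples)                 # parameter estimates from the data
    sq_err = [0.0] * m
    for _ in range(B):
        pops = []
        for _ in range(m):
            v = rng.gauss(0.0, s2v ** 0.5)      # bootstrap area effect
            pops.append([mu + v + rng.gauss(0.0, s2e ** 0.5) for _ in range(N)])
        truth = [sum(p) / N for p in pops]      # bootstrap target quantities
        boot_samples = [p[:n] for p in pops]    # first n units play the sample
        est = predict(boot_samples, N, fit(boot_samples))
        for i in range(m):
            sq_err[i] += (est[i] - truth[i]) ** 2
    return [s / B for s in sq_err]


# Hypothetical data: 8 areas, 4 sampled units each, area population size 40.
data_rng = random.Random(0)
effects = [data_rng.gauss(0.0, 1.0) for _ in range(8)]
samples = [[1.0 + v + data_rng.gauss(0.0, 1.0) for _ in range(4)] for v in effects]
mse = bootstrap_mse(samples, N=40, B=100)
print(mse)
```

Swapping `fit` for a robust estimator leaves the rest of the loop unchanged, which is the main design point of the parametric bootstrap.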

6 Application

In this section, we compare the effects of several robust Bayes estimators with the estimator proposed in this paper based on simulated and real data.

6.1 Simulation

6.1.1 Contaminated distribution.

In this paper, we use the same simulation setup as [18], that is, a unit level model with a single auxiliary variable x: $y_{ij} = \beta_0 + \beta_1 x_{ij} + v_i + e_{ij}$, j = 1, …, n, i = 1, …, k, with k = 40 and n = 4. The auxiliary variable is generated as x_ij ∼ N(1, 1), and the area-specific random effects v_i and the random errors e_ij are generated from the contaminated distribution $(1-\eta)N(0, \sigma^2) + \eta N(0, \sigma_1^2)$. This means that a (1 − η) proportion of the errors were generated from the underlying “true” distribution N(0, σ²) and the remaining η proportion of the errors were generated from the “arbitrary” contaminated distribution N(0, σ₁²). The choice η = 0 indicates no contamination of the distribution. For the underlying distributions, we set and for the contaminated distributions, we set and the proportion of contamination η₁ = η₂ = 0.10. We considered four possible combinations {(0, 0), (0, v), (0, e), (v, e)} of contamination, where (0, 0) indicates no contamination of the distributions, (0, v) indicates contamination only in the distribution of the area-specific random effects v_i, and so on.
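A data-generating sketch for this design is given below. The number of areas, the sample size, the covariate distribution and the contamination proportion follow the text; the underlying variance of 1 is an assumption, since the values are not recoverable here, and the contaminating variance of 25 is borrowed from the later sensitivity study. Function names are hypothetical, and (1.0, 1.0) are the regression coefficients used in the simulation.

```python
import random


def contaminated_normal(rng, sigma2, sigma2_out, eta):
    """Draw from the mixture (1 - eta) N(0, sigma2) + eta N(0, sigma2_out)."""
    variance = sigma2_out if rng.random() < eta else sigma2
    return rng.gauss(0.0, variance ** 0.5)


def simulate(m=40, n=4, beta=(1.0, 1.0), eta_v=0.1, eta_e=0.1,
             sigma2=1.0, sigma2_out=25.0, seed=0):
    """Generate (area, x, y) triples from the contaminated unit level model.

    sigma2 = 1 is an assumed value; sigma2_out = 25 is taken from the
    sensitivity study described later in the section.
    """
    rng = random.Random(seed)
    data = []
    for i in range(m):
        v_i = contaminated_normal(rng, sigma2, sigma2_out, eta_v)
        for _ in range(n):
            x = rng.gauss(1.0, 1.0)
            e = contaminated_normal(rng, sigma2, sigma2_out, eta_e)
            data.append((i, x, beta[0] + beta[1] * x + v_i + e))
    return data


both = simulate()                       # scenario (v, e)
clean = simulate(eta_v=0.0, eta_e=0.0)  # scenario (0, 0)
print(len(both), len(clean))
```

Setting only `eta_v` or only `eta_e` to zero gives the two one-sided contamination scenarios.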

For each simulation configuration, the regression coefficients were fixed at (β₀, β₁) = (1, 1). We ran four sets of simulations, each of size 500. Given our focus on bias robustness, the main performance indicator for an MSE estimator in the four studies is the relative bias, defined by where the subscript i indexes the small areas and the subscript j indexes the S Monte Carlo simulations, with denoting the value of the estimator in area i in simulation j, and Y_i denoting the actual value in area i. We also measured the stability of an MSE estimator by its relative MSE,

In the simulation experiment, we compare the traditional maximum likelihood (ML) method, the robust estimation method (RML) of [18], and our proposed robust minimum density power divergence method (RMD) with different tuning parameters (γ = 0.1, 0.2, 0.3 and the optimal γ chosen by the IWJ algorithm).

First we compare the estimates of model parameters under four contamination scenarios. Table 1 shows the RABiases and RAMSEs of estimators obtained from the robust estimation methods under different conditions, where the first row corresponding to each parameter represents the RABias and the second row represents the RAMSE.


Table 1. Simulated RABias and RAMSEs of robust and classical estimators of fixed effects and variance components.

https://doi.org/10.1371/journal.pone.0288639.t001

The following conclusions can be drawn from Table 1. When there is no contamination in the data, the ML method performs best in parameter estimation; however, the RML method and the RMD method with smaller tuning parameters give results very similar to ML. This indicates that, in this case, the RMD method with smaller tuning parameters and the RML method are almost as efficient as the ML method, with little difference in RABias and RAMSE, while the RMD method with larger tuning parameters performs poorly. The selection of the tuning parameter is therefore very important, and the optimal tuning parameter can be obtained with the algorithm provided in Section 3.2. When the random effects v_i are contaminated, the variance σ estimated by the ML method has a large RABias and RAMSE, while those of the RML estimate are smaller. The proposed RMD method is clearly better than the RML method, with smaller bias and MSE in the estimation of all parameters. According to Table 1, when the tuning parameter γ is obtained by the IWJ algorithm, the proposed RMD method provides better results for the estimation of all parameters. Similarly, in the case of outliers in the random errors e_ij, estimated by the ML method has a large bias and MSE. The RML method controls the influence of outliers well, and the estimated bias and MSE of each parameter are relatively small. For the proposed RMD method with γ = 0.1, the bias and MSE of the estimated model variance are reduced, but not as much as with the RML method. However, when γ ≥ 0.2, the RMD method performs better than the RML method. When both the area effects v_i and the model errors e_ij are contaminated, the ML estimator of the variance components is heavily influenced by the outliers and produces much larger biases and MSEs; the RML method reduces the effect of the outliers, but its performance is not the best. The proposed RMD method is significantly better than the RML method, provided that an appropriate tuning parameter is selected.

Next, using the same data sets as in the above simulation, we consider the estimation of small area means. The means of the known auxiliary variable in the i-th area are . Table 2 presents the average simulated RABias and RAMSE (averaged over the areas) of the estimators of small area means for the proposed and classical methods. The M-quantile (MQ) regression method is proposed by [17], and the area mean is where, in this case, with estimating in . The bias-corrected version of the REBLUP (BC-RML) is also compared with the other methods, and the simulation results are shown in Table 2.

Table 2. Simulated RABiases and RAMSE of robust and classical estimators of small area means (averaged over areas).

https://doi.org/10.1371/journal.pone.0288639.t002

In the case of uncontaminated data, the EBLUP obtained using the ML method appears to be the most efficient, as expected. The REBLUP using the RML method and the proposed robust MDPDE (RMDPDE) are almost as efficient as the EBLUP. In the other three cases, with outliers, Table 2 leads to a conclusion consistent with the parameter-estimation simulation above: the small area mean obtained by the ML method has a large bias and MSPE, the one obtained by the RML method performs slightly better, and the proposed RMD method performs better than the RML method.

To examine how the existing robust methods respond to the contaminated proportion and to the variance of the contaminated distribution, we ran further simulations tracking the MSE of the estimated parameters as each of these two quantities varies. As in the simulation steps above, we consider three cases: the distribution of the area-specific random error is contaminated, the distribution of the unit random error is contaminated, and both distributions are contaminated. We use the model in Section 6.1.1 to generate the contaminated data, and then estimate the parameters with the methods mentioned above. Each simulation was repeated 500 times, and the average MSE was used to plot the curves. In the first experiment, the variance of the contaminated distribution was fixed at 25, and the contaminated proportion varied between 0 and 0.5 in steps of 0.02. In the second experiment, the contaminated proportion was fixed at 0.1, and the variance of the contaminated distribution increased from 5 to 100 in steps of 5. In both experiments, the MSE of the estimated parameters was recorded under each of the three contamination scenarios.
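The contamination design described above can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than the simulation code of the paper: contaminated draws are assumed to come from a zero-mean normal mixture, the covariate is drawn from N(1, 1) as in Section 6.1.2, and the regression coefficients and clean variances (all set to 1 here) are placeholder values. The function names are hypothetical.

```python
import random

def draw_contaminated(rng, clean_var, contam_prob, contam_var):
    """Draw from (1 - p) N(0, clean_var) + p N(0, contam_var) (assumed mixture form)."""
    var = contam_var if rng.random() < contam_prob else clean_var
    return rng.gauss(0.0, var ** 0.5)

def simulate_sample(rng, m, n_i, beta0, beta1, var_v, var_e,
                    p_v=0.0, p_e=0.0, contam_var=25.0):
    """Return (area, x, y) triples from a nested error model with optional contamination."""
    data = []
    for i in range(m):
        # One area effect per area, possibly drawn from the contaminating component
        v_i = draw_contaminated(rng, var_v, p_v, contam_var)
        for _ in range(n_i):
            x = rng.gauss(1.0, 1.0)
            e_ij = draw_contaminated(rng, var_e, p_e, contam_var)
            data.append((i, x, beta0 + beta1 * x + v_i + e_ij))
    return data

# Grids described in the text
proportions = [round(0.02 * k, 2) for k in range(26)]  # 0, 0.02, ..., 0.50
variances = list(range(5, 101, 5))                     # 5, 10, ..., 100

rng = random.Random(2023)
# Placeholder parameter values; only e_ij is contaminated in this call
sample = simulate_sample(rng, m=40, n_i=4, beta0=1.0, beta1=1.0,
                         var_v=1.0, var_e=1.0, p_e=0.1)
```

Looping over `proportions` (with `contam_var=25.0`) or over `variances` (with the proportion fixed at 0.1), and repeating each setting 500 times, would give the curves of average MSE against each quantity.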

As can be seen from Figs 1 and 2, when the contaminated proportion of the random error e_ij increases from 0 to 0.5, the MSE of all four parameters in the small area model increases. Among the four parameters, except for , the MSE of the other three parameters is not very large. As expected, is easily affected by the contaminated proportion of e_ij, and its estimated MSE is relatively large. Among the methods compared, ML performed worst. When the tuning parameter γ was small (0–0.2), MDPD performed almost as well as RML, and in some cases better; when γ was large, RML performed better.

Fig 1. The MSE of robust estimated parameters versus the contaminated proportion of e_ij.

Left-hand panel, β₀, right-hand panel, β₁. RML, the robust estimation method presented in [24]; ML, maximum likelihood estimation; Mdpd1, Mdpd2 and Mdpd3, the minimum density power divergence method with tuning parameter γ = 0.1, 0.2 and 0.3, respectively.

https://doi.org/10.1371/journal.pone.0288639.g001

Fig 2. The MSE of robust estimated parameters versus the contaminated proportion of e_ij.

Left-hand panel, , right-hand panel, .

https://doi.org/10.1371/journal.pone.0288639.g002

Combining Figs 3 and 4, we can draw the following conclusions. When the area-specific random effect v_i is contaminated, the MSE of increases with the contaminated proportion, while the MSE of is independent of the contaminated proportion and is small. Comparing the performance of the methods in terms of the MSE of the four parameters, the MSE of is small, and there is little difference among the methods. For the MSE of , the ML method performs worst, the RML method performs slightly better than ML, and the MDPD methods perform clearly better than both. This further shows that the MDPD method is effective.

Fig 3. The MSE of robust estimated parameters versus the contaminated proportion of v_i.

Left-hand panel, β₀, right-hand panel, β₁.

https://doi.org/10.1371/journal.pone.0288639.g003

Fig 4. Plot of the MSE of robust estimated parameters versus the contaminated proportion of v_i.

Left-hand panel, , right-hand panel, .

https://doi.org/10.1371/journal.pone.0288639.g004

When both the random error e_ij and the area-specific random error v_i are contaminated, the change in the MSE of the estimated parameters with the contaminated proportion is shown in Figs 5 and 6. As can be seen from the figures, the MSE of the parameters increases with the contaminated proportion. Among the estimation methods compared, the proposed MDPD method is clearly superior to the RML and ML methods, and the ML method performs worst.

Fig 5. The MSE of robust estimated parameters versus the contaminated proportion of (v_i, e_ij).

Left-hand panel, β₀, right-hand panel, β₁.

https://doi.org/10.1371/journal.pone.0288639.g005

Fig 6. The MSE of robust estimated parameters versus the contaminated proportion of (v_i, e_ij).

Left-hand panel, , right-hand panel, .

https://doi.org/10.1371/journal.pone.0288639.g006

When the individual error e_ij is contaminated and the variance of the contamination distribution increases, the MSE of the four parameters is as presented in Figs 7 and 8. As can be seen from the figures, only the MSE of the parameters obtained by the ML method shows a marked increasing trend as the variance of the contamination distribution increases. The MSE of the parameters obtained by the robust estimation methods does not increase noticeably, which indicates that the robust methods remain effective throughout this range. Among the robust methods, the proposed MDPD method is superior to the RML method, especially in terms of the MSE of .

Fig 7. The MSE of robust estimated parameters versus the contamination variance of e_ij.

Left-hand panel, β₀, right-hand panel, β₁.

https://doi.org/10.1371/journal.pone.0288639.g007

Fig 8. The MSE of robust estimated parameters versus the contamination variance of e_ij.

Left-hand panel, , right-hand panel, .

https://doi.org/10.1371/journal.pone.0288639.g008

When the area-specific random error is contaminated and the variance of the contamination distribution varies from 0 to 100, the MSE of the parameters is shown in Figs 9 and 10. It can be seen from the figures that the MSE of is independent of the variance of the contamination distribution. For the MSE of parameters , the ML method is seriously affected by the variance of the contamination distribution. The RML method improves on ML, but is not as good as the MDPD method.

Fig 9. The MSE of robust estimated parameters versus the contamination variance of v_i.

Left-hand panel, β₀, right-hand panel, β₁.

https://doi.org/10.1371/journal.pone.0288639.g009

Fig 10. The MSE of robust estimated parameters versus the contamination variance of v_i.

Left-hand panel, , right-hand panel, .

https://doi.org/10.1371/journal.pone.0288639.g010

When both the individual error e_ij and the area-specific random error v_i are contaminated, with the contaminated proportion fixed at 0.1 and the variance of the contamination distribution varying from 0 to 100, the MSE of the parameters is shown in Figs 11 and 12. It can be seen from the figures that the ML method has the worst performance; the RML method improves the estimation, but does not perform well for the parameters . In summary, the MDPD method shows good robustness in the estimation of the parameters considered.

Fig 11. The MSE of robust estimated parameters versus the contamination variance of (v_i, e_ij).

Left-hand panel, β₀, right-hand panel, β₁.

https://doi.org/10.1371/journal.pone.0288639.g011

Fig 12. The MSE of robust estimated parameters versus the contamination variance of (v_i, e_ij).

Left-hand panel, , right-hand panel, .

https://doi.org/10.1371/journal.pone.0288639.g012

6.1.2 Finite population area means.

In this section, we focus on the small area means for a finite population containing m areas, with the ith area of size N_i. Following the approach of Section 6.1.1, we simulate the performance of the estimators of the area means when m = 40 and the areas are of equal size, N_i = 40, 80 or 200. We generated a finite population using the unit-level model (2) with x = (1, x)^t, where x ∼ N(1, 1). For each of the four contamination schemes used in Section 6.1.1, we then generated a series of 500 population data sets. From each population data set, a random sample of n_i = 4 units was selected from the N_i units in the ith area. For each simulated data set, we use the robust methods from the above simulation to obtain the small area mean for the ith area. Finally, we compare the average estimates of the area means over the 500 simulations.
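The finite-population setup can be illustrated with the following Python sketch. It is a sketch under assumptions, not the authors' code: the coefficients and standard deviations are placeholder values, no contamination is applied, and the area mean is predicted in the usual finite-population way, by adding the observed y of the sampled units to predicted y for the non-sampled units and dividing by N_i. The `predict` argument stands in for whichever fitted predictor is being evaluated (for example the robust EB predictor of Section 5, which is not reproduced here). All function names are hypothetical.

```python
import random

def make_population(rng, m, N_i, beta0=1.0, beta1=1.0, sd_v=1.0, sd_e=1.0):
    """Return m areas, each a list of N_i (x, y) pairs from a nested error model."""
    population = []
    for _ in range(m):
        v_i = rng.gauss(0.0, sd_v)  # area effect shared by all units in the area
        area = []
        for _ in range(N_i):
            x = rng.gauss(1.0, 1.0)
            area.append((x, beta0 + beta1 * x + v_i + rng.gauss(0.0, sd_e)))
        population.append(area)
    return population

def split_area(rng, area, n_i):
    """Simple random sample without replacement: returns (sampled, non_sampled)."""
    chosen = set(rng.sample(range(len(area)), n_i))
    sampled = [unit for k, unit in enumerate(area) if k in chosen]
    rest = [unit for k, unit in enumerate(area) if k not in chosen]
    return sampled, rest

def predicted_area_mean(sampled, rest, predict):
    """Observed y for sampled units plus predicted y for the rest, divided by N_i."""
    total = sum(y for _, y in sampled) + sum(predict(x) for x, _ in rest)
    return total / (len(sampled) + len(rest))

rng = random.Random(1)
population = make_population(rng, m=40, N_i=80)
sampled, rest = split_area(rng, population[0], n_i=4)
true_mean = sum(y for _, y in population[0]) / len(population[0])
# Placeholder predictor: uses the generating coefficients and ignores the area effect
estimate = predicted_area_mean(sampled, rest, predict=lambda x: 1.0 + 1.0 * x)
```

Comparing `estimate` with `true_mean` across areas and across the 500 simulated populations gives the bias and MSE summaries of the kind reported for this section.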

Table 3 presents the average simulated RABiases (first line) and the average simulated RAMSEs (second line, averaged over the areas) of the estimators of small area means . In general, the proposed RMD method has a smaller MSE in most cases. In some cases it does not perform as well as the RML method, but the difference is small. In addition, when the area size increases, for example to N_i = 200, the proposed method yields a smaller MSE and performs better than the traditional estimation methods.
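Summaries of this kind, averaged over areas, can be computed from the simulation output along the following lines. The sketch computes plain absolute bias and MSE per area and then averages them; the relative versions reported in the tables use the normalisation defined earlier in the paper, which is not reproduced here. The function name and the data layout are assumptions.

```python
def average_bias_and_mse(estimates, truths):
    """estimates[r][i] and truths[r][i] refer to replication r and area i.

    Returns (mean over areas of |bias_i|, mean over areas of mse_i).
    """
    n_rep = len(estimates)
    n_areas = len(estimates[0])
    abs_bias = []
    mse = []
    for i in range(n_areas):
        errors = [estimates[r][i] - truths[r][i] for r in range(n_rep)]
        abs_bias.append(abs(sum(errors) / n_rep))
        mse.append(sum(e * e for e in errors) / n_rep)
    return sum(abs_bias) / n_areas, sum(mse) / n_areas

# Two replications, two areas: area 0 has errors -1 and +1, area 1 has no error
bias, mse = average_bias_and_mse([[1.0, 2.0], [3.0, 2.0]],
                                 [[2.0, 2.0], [2.0, 2.0]])
```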

Table 3. Simulated biases and mean squared errors of robust and classical estimators of fixed effects and variance components.

https://doi.org/10.1371/journal.pone.0288639.t003

An interesting finding is that, when the area-specific random effect v_i is contaminated, the area mean obtained by the ML method has a smaller MSE, even though the ML method has a large relative bias and MSE in the estimation of the model parameters. This suggests that the ML method is not sensitive to contamination of the area-specific random effect, but is very sensitive to contamination of the model random error e_ij. The method proposed in this paper is robust to contamination of both random components.

6.2 Real data

In this section, we use the data used by [5] to estimate the area under corn and soybeans for each of m = 12 counties in North-Central Iowa. These data can be obtained from the R package “sae”, and contain 37 samples of areas of corn and soybeans from the 12 counties, as well as the number of pixels classified by the LANDSAT satellite as corn and soybeans for each sample segment. The unit-level model was established with the data collected from farm interviews as the dependent variable and the LANDSAT satellite data as the auxiliary variables:

y_ij = β₀ + β₁x_ij1 + β₂x_ij2 + v_i + e_ij, (20)

which is a special case of model (1) with k_ij = 1, x_ij = (1, x_ij1, x_ij2)^T, and β = (β₀, β₁, β₂)^T. Here, y_ij is the number of hectares of corn (or soybeans), x_ij1 is the number of pixels classified as corn, and x_ij2 is the number of pixels classified as soybeans in the jth area segment of the ith county.

[5] identified an observation in Hardin county as an outlier and simply deleted it when predicting the areas of corn and soybeans. In [18], a robust estimation method is used to analyze these data, and the corresponding predicted values in the presence of the outlier are given. Here, we use our proposed robust estimation method to model the data and analyze the influence of the outlier on traditional estimators.

Given the outlier in the data, we use the proposed robust estimation method to estimate the parameters and predict the area of corn in each segment. Since there is only one outlying observation in these data, we select the tuning parameters γ = 0.01 and 0.05 for the proposed method. Table 4 shows the regression coefficients and the variance of the random errors estimated by the ML method, the robust method proposed by [18] and the proposed MDPD method. The standard error of each parameter, obtained from the asymptotic distribution in Section 4, is shown in parentheses. When the tuning parameter γ = 0.01, the parameters estimated by MDPD lie between those estimated by the ML method and the RML method. When the tuning parameter γ is increased to 0.05, the estimated coefficients change considerably. Comparing the standard errors shown in the table, the parameters estimated by the proposed method have smaller standard errors.
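One way to see why the choice of γ matters is to look at the weight that a density power divergence fit gives to each observation. The Python sketch below is a simplified illustration, not the estimating equations of Section 3: it assumes independent normal observations with known scale, for which the minimum density power divergence estimate of the location μ solves Σ w_i (y_i − μ) = 0 with w_i = exp(−γ r_i² / 2), where r_i is the standardized residual. The function names and the toy data are hypothetical.

```python
import math

def dpd_weight(std_residual, gamma):
    """Weight exp(-gamma * r^2 / 2) given to an observation with standardized residual r."""
    return math.exp(-0.5 * gamma * std_residual ** 2)

def mdpd_location(y, gamma, sigma=1.0, n_iter=200):
    """Fixed-point solution of sum_i w_i (y_i - mu) = 0 for a normal model with known sigma."""
    ys = sorted(y)
    mu = ys[len(ys) // 2]  # start from a median-like value
    for _ in range(n_iter):
        w = [dpd_weight((v - mu) / sigma, gamma) for v in y]
        mu = sum(wi * v for wi, v in zip(w, y)) / sum(w)
    return mu

# Toy data: five well-behaved points and one outlier
data = [-1.0, -0.5, 0.0, 0.5, 1.0, 10.0]
sample_mean = sum(data) / len(data)         # pulled towards the outlier
robust_mu = mdpd_location(data, gamma=0.5)  # outlier is effectively ignored
```

Under these assumptions, an observation five standard deviations from the centre receives a weight of about 0.88 when γ = 0.01 and about 0.54 when γ = 0.05, and γ = 0 gives every observation weight 1, which recovers the sample mean. This is consistent with the estimates moving away from ML as γ increases.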

Table 4. Estimates of the model parameters from several methods. Standard errors are shown in parentheses.

https://doi.org/10.1371/journal.pone.0288639.t004

To compare the estimation results, we show the predicted values of the mean hectares of corn per segment using model (20). The EBLUP values obtained using the above estimation methods are presented in Table 5, where the bootstrap estimates of the MSPE from 500 bootstrap samples are shown in parentheses. First, for the counties without outliers, the estimates from the proposed method are closer to the ML estimates, and the prediction for Hardin county is improved to some extent. Second, the bootstrap MSPE values obtained by our proposed method are smaller, which indicates that the proposed method is effective.
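The structure of a parametric bootstrap estimate of the MSPE can be sketched as follows. This is a generic outline under assumptions, not the procedure of Section 5.3: a single covariate is used, the bootstrap "true" area mean is taken to be β₀ + β₁x̄_i + v_i* (ignoring the average of the unit errors), and `predictor` is a placeholder for the full fitting and prediction step, which in practice would re-estimate the parameters on each bootstrap sample. All names are hypothetical.

```python
import random

def bootstrap_mspe(rng, x_by_area, xbar_by_area, beta0, beta1, sd_v, sd_e,
                   predictor, B=500):
    """Parametric bootstrap MSPE of the predicted mean for each area.

    predictor(sample) maps a list of areas, each a list of (x, y) pairs,
    to a list of predicted area means.
    """
    m = len(x_by_area)
    sq_err = [0.0] * m
    for _ in range(B):
        # Draw bootstrap area effects from the fitted model
        v_star = [rng.gauss(0.0, sd_v) for _ in range(m)]
        theta_star = [beta0 + beta1 * xbar_by_area[i] + v_star[i] for i in range(m)]
        # Regenerate the sampled responses under the fitted model
        sample_star = [[(x, beta0 + beta1 * x + v_star[i] + rng.gauss(0.0, sd_e))
                        for x in x_by_area[i]] for i in range(m)]
        pred = predictor(sample_star)
        for i in range(m):
            sq_err[i] += (pred[i] - theta_star[i]) ** 2
    return [s / B for s in sq_err]

def sample_mean_predictor(sample):
    """Placeholder predictor: the sample mean of y in each area."""
    return [sum(y for _, y in area) / len(area) for area in sample]

rng = random.Random(7)
x_by_area = [[1.0] * 4 for _ in range(3)]  # three areas, four sampled units each
mspe = bootstrap_mspe(rng, x_by_area, [1.0] * 3, 1.0, 1.0, 1.0, 1.0,
                      sample_mean_predictor, B=2000)
```

In this toy setting the sample mean differs from the bootstrap area mean only by the average of four unit errors, so each entry of `mspe` should be close to sd_e² / 4 = 0.25; taking square roots gives root MSPE values of the kind reported alongside predictions.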

Table 5. Predicted mean hectares of corn per segment(bootstrap root MSPE in parentheses).

https://doi.org/10.1371/journal.pone.0288639.t005

7 Discussion

In this paper, we propose a robust small area estimation method for unit level models with outlying observations. By introducing the MDPD method, we obtain an estimation method that is robust to outliers and to non-normally distributed errors. First, we proposed estimating equations for the parameters of the unit level model based on the MDPD method and obtained the asymptotic properties of the parameter estimators. Second, using the asymptotic distribution of the estimators, we gave a procedure for selecting the optimal tuning parameter. Third, we gave the EBLUP of the unit values and the area means in a finite population. Finally, we verified the performance of the proposed method on simulated and real data. In the simulation, we studied robust estimation when the distribution is contaminated and compared several robust estimation methods under three contamination scenarios. In particular, we examined how the MSE of the estimation methods varies with the contaminated proportion and with the variance of the contamination distribution. The simulation results show that the proposed method handles outliers better. For the real data, we used a classical small area estimation data set to illustrate the effectiveness of the proposed method. The comparison shows that the proposed method deals well with the outlying observation.

Furthermore, it can be verified that the proposed method is also effective for random effects subject to other biased distributions. We note that when the distribution of the random errors is contaminated and the probability of contamination is greater than 0.3, the proposed estimation method generally performs worse than the robust estimation method in [18]; in this case, however, the MSE obtained by all the methods is very large, and the robust estimation results are of limited value. As a next step, the proposed estimation method could also be applied to small area estimation problems under the exponential distribution. Of course, this work needs further study and proof.

In this paper, we use the IWJ algorithm to select the optimal tuning parameter. In related work, a parameter selection algorithm based on the Hyvarinen score is given in [35]. In further research, this algorithm could be applied to the unit level model considered in this paper to select the optimal tuning parameter. In addition, we compared the proposed method with only two classical robust estimation methods. To further illustrate its effectiveness, it could also be compared with the methods of [3, 17] and others.

Supporting information

S1 File. The proof of the theorem.

https://doi.org/10.1371/journal.pone.0288639.s001

(PDF)

Acknowledgments

We are grateful to Lanzhou University of Finance and Economics for providing us with a learning and research platform.

References

1. Marshall RJ. Mapping disease and mortality rates using empirical Bayes estimators. Journal of the Royal Statistical Society: Series C (Applied Statistics). 1991; 40(2):283–294. pmid:12157989
2. Clayton D, Kaldor J. Empirical Bayes estimates of age-standardized relative risks for use in disease mapping. Biometrics. 1987; p.671–681. pmid:3663823
3. Tang X, Ghosh M, Ha N S, et al. Modeling random effects using global–local shrinkage priors in small area estimation. Journal of the American Statistical Association. 2018; 113(524): 1476–1489.
4. Basu A, Harris IR, Hjort NL, Jones MC. Robust and efficient estimation by minimising a density power divergence. Biometrika. 1998; 85(3):549–559.
5. Battese GE, Harter RM, Fuller WA. An error-components model for prediction of county crop areas using survey and satellite data. Journal of the American Statistical Association. 1988; 83(401):28–36.
6. Cruze NB, Erciulescu AL, Nandram B, et al. Producing official county-level agricultural estimates in the United States: Needs and challenges. Statistical science. 2019; 34(2):301–316.
7. Janicki R. Properties of the beta regression model for small area estimation of proportions and application to estimation of poverty rates. Communications in Statistics-Theory and Methods. 2020; 49(9):2264–2284.
8. Rao JNK, Molina I. Small area estimation. John Wiley and Sons, 2015.
9. Morales D, Esteban MD, Pérez A, et al. A course on small area estimation and mixed models. Methods, theory and applications in R. 2021.
10. Pfeffermann D. New important developments in small area estimation. Statistical Science. 2013; 28(1):40–68.
11. Sugasawa S, Kubokawa T. Small area estimation with mixed models: a review. Japanese Journal of Statistics and Data Science. 2020; 3(2):693–720.
12. Datta GS, Ghosh M. Bayesian prediction in linear models: Applications to small area estimation. The Annals of Statistics. 1991; p.1748–1770.
13. Sinharay S, Stern HS. Posterior predictive model checking in hierarchical models. Journal of Statistical Planning and Inference. 2003; 111(1–2):209–221.
14. Chambers R, Chandra H, Salvati N, et al. Outlier robust small area estimation. Journal of the Royal Statistical Society: Series B (Statistical Methodology). 2014; 76(1):47–69.
15. Datta GS, Lahiri P. Robust hierarchical Bayes estimation of small area characteristics in the presence of covariates and outliers. Journal of Multivariate Analysis. 1995; 54(2):310–328.
16. Chambers RL. Outlier robust finite population estimation. Journal of the American Statistical Association. 1986; 81(396):1063–1069.
17. Chambers R, Tzavidis N. M-quantile models for small area estimation. Biometrika. 2006; 93(2):255–268.
18. Sinha SK, Rao JNK. Robust small area estimation. Canadian Journal of Statistics. 2009; 37(3):381–399.
19. Ghosh M, Lahiri P. Robust empirical Bayes estimation of means from stratified samples. Journal of the American Statistical Association. 1987; 82(400): 1153–1162.
20. Bell, William R, and Elizabeth T. Huang. “Using the t-distribution to deal with outliers in small area estimation.” Proceedings of Statistics Canada Symposium. 2006.
21. Ghosh M, Maiti T, Roy A. Influence functions and robust Bayes and empirical Bayes small area estimation. Biometrika. 2008; 95(3):573–585.
22. Smith PA, Bocci C, Tzavidis N, et al. Robust estimation for small domains in business surveys. arXiv preprint arXiv:2006.01864, 2020.
23. Chakraborty A, Datta GS, Mandal A. Robust hierarchical Bayes small area estimation for the nested error linear regression model. International Statistical Review. 2019; 87:S158–S176.
24. Sinha SK. Robust small area estimation in generalized linear mixed models. Metron. 2019; 77(3):201–225.
25. Bertarelli G, Chambers R, Salvati N. Outlier robust small domain estimation via bias correction and robust bootstrapping. Statistical Methods and Applications. 2021; 30(1):331–357.
26. Jiang J, Rao JS. Robust small area estimation: An overview. Annual review of statistics and its application. 2020; 7:337–360.
27. Fujisawa H, Eguchi S. Robust parameter estimation with a small bias against heavy contamination. Journal of Multivariate Analysis. 2008; 99(9):2053–2081.
28. Ghosh A, Basu A. Robust estimation for independent non-homogeneous observations using density power divergence with applications to linear regression. Electronic Journal of statistics. 2013; 7:2420–2456.
29. Sugasawa S. Robust empirical Bayes small area estimation with density power divergence. Biometrika. 2020; 107(2):467–480.
30. Riani M, Atkinson A C, Corbellini A, Perrotta D. Robust regression with density power divergence: theory, comparisons, and data analysis. Entropy.2020; 22(4):399. pmid:33286173
31. Kurisu D, Ishihara T,Sugasawa S. Adaptively robust small area estimation: Balancing robustness and efficiency of empirical Bayes confidence intervals. 2021. arXiv preprint arXiv:2108.11551.
32. Warwick J, Jones MC. Choosing a robustness tuning parameter. Journal of Statistical Computation and Simulation. 2005; 75(7):581–588.
33. Basak S, Basu A, Jones MC. On the ‘optimal’density power divergence tuning parameter. Journal of Applied Statistics. 2021; 48(3):536–556. pmid:35706540
34. Hall P, Maiti T. On parametric bootstrap methods for small area prediction. Journal of the Royal Statistical Society: Series B (Statistical Methodology). 2006; 68(2):221–238.
35. Sugasawa S, Yonekura S. On selection criteria for the tuning parameter in robust divergence. Entropy.2021; 23(9), 1147. pmid:34573772
