Figures
Abstract
This paper discusses the first moment, i.e., the mean income point, of income density functions and the estimation of single-parametric Lorenz curves. The mean income point is implied by an income density function and associated with a single-parametric Lorenz function. The boundary of the mean income point can show the flexibility of a parametric Lorenz function. I minimize the sum of squared errors in fitting both grouped income data and the mean income point and identify the best parametric Lorenz function using a large panel dataset. I find that each parametric Lorenz function may do a better job than others in fitting particular grouped data; however, a zero- and unit-modal single-parametric Lorenz function is identified to be the best of eight typical optional functions in fitting most (666 out of 969) observations of a large panel dataset. I perform a Monte Carlo simulation as a robustness check of the empirical estimation.
Citation: Shao LF (2022) The first moment of income density functions and estimation of single-parametric Lorenz curves. PLoS ONE 17(6): e0267828. https://doi.org/10.1371/journal.pone.0267828
Editor: Baogui Xin, Shandong University of Science and Technology, CHINA
Received: July 17, 2021; Accepted: April 17, 2022; Published: June 24, 2022
Copyright: © 2022 Liang Frank Shao. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
1. Introduction
Both the parametric Lorenz curve (LC) function and grouped income data are individually associated with an income distribution probability density function. Krause [1] shows that the modality of the income density function must be considered when fitting grouped income data using the associated Lorenz function. The first moment of an income density function, the mean income point, is often different from the mode. Whether a parametric density function or income data is zero- or unit-modal, the mean income point must be applied as well to evaluate the estimation, which has not been discussed in the related literature. This paper shows that the mean income point of the income probability density functions is another key consideration in estimation. I discuss the first moments of grouped income data and ten typical single-parametric LCs and identify the best parametric LC by minimizing the sum of squared errors (SSE) on both grouped income data and the first moments.
The position of the mean income point of an income density function on the associated LC has been discussed in Shao [2]. It is located at the tangential point of the unit-slope line of the associated LC. In Shao [2], the mean income point is called the mean division point (MDP), and its coordinates are called the mean-income division shares (MDSs), including the mean-income population share (MPS) and the mean income share (MIS). I use these abbreviations in this study. Fig 1 below shows that point M is the MDP for the LC . The diagonal line
has a unit intercept and a slope of negative one, and the dashed line going through point M is parallel to line
.
The position of the MDP, point M, can be either on line AB or on one of the two sides of the line when we change the LC by moving point M on the 45° line that goes through M. The sum of the MDSs (MPS + MIS), denoted by MDC, is the intercept of the dashed line going through point M. Because the slope of the dashed line is negative one so that the sum of the coordinates of any point on the line is equal to its intercept. Clearly, we have MDC ∈ (0, 2). Now, if we change the shape of the LC to let its MDP (point M) be at, below or above line AB, then the MDC will be equal to, less than, or larger than one, respectively. Fig 16 in S1 Appendix illustrates the other two cases, MDC ≤ 1, using the Chotikapanich and RGKO LCs (refer to the next subsection).
The boundary of the MDC helps to measure the flexibility of a parametric LC. For a perfectly flexible parametric Lorenz function, the MDP should be able to be located at any places in the triangular area ΔOBC in Fig 1. For instance, point M can move along two directions, either the 45° tangential line or the dashed perpendicular line going through M. That is, a flexible parametric function can let its MDC take any values in the interval (0, 2) in one direction and can also keep it constant in another direction when we change the shape of the LC by properly changing its parameter values.
Note that the Pietra ratio (Hoover, Robin Hood or Schutz index) can be defined by the difference MPS − MIS. Therefore, the correlation between MDC and inequality is bridged by the MDSs, which can be described by the following two equations: MDC + Pietra = 2MPS, MDC − Pietra = 2MIS. That is, MDC and inequality are either positively correlated for a given MIS or negatively correlated for a given MPS.
As noted in Sarabia, Castillo and Slottje (page 752, [3]), the estimator of least squares with parametric functions on grouped data does not guarantee the estimated Gini to satisfy Gastwirth’s [4] Gini bounds, because the estimator may not lead to the best parametric function that captures the data’s moments; however, once the data’s MIS and MPS have been well captured in the estimation then the estimated Gini will meet the bounds. Therefore, including the MPS and MIS in the estimation of least squares is a stricter constraint than Gastwirth’s Gini bounds.
I determine the best parametric LC for grouped income data to be the one that performs the best of all optional functions in fitting the data and capturing the first moment (MDP, point M). The least SSE is used to measure the goodness of fit of the estimation.
I choose the best parametric LC by the following three steps. First, I estimate each parametric LC function by minimizing the SSE (SSELC) of grouped income data and select the parametric LC with the smallest SSELC. Second, I calculate the estimated first moments for each LC chosen in the first step and obtain the distance (SSEMDP) between the estimated first moments and the data’s first moments calculated by the polynomial LC estimator in Shao [2]. Then, I select the parametric LC with the least SSEMDP. Last, if the previous two steps lead to the same parametric LC, then that is the best one I choose for the grouped income data; otherwise, the choice changes to focus on either fitting the data or capturing the first moment, or even looking for new optional functions.
I discuss the first moment for ten typical single-parametric LCs in which I propose two new single-parametric functional forms from current multiparametric LC functions, which are associated with zero-modal densities. The seven single-parametric LC functions (Chotikapanich [5]; Gupta [6]; Rohde [7]; the Weibull, the Fisk, log-normal, and Pareto functions) chosen from the literature perform better than the other optional functions (e.g., Krause [1], Ogwang and Tao [8], Hossain and Saeki [9], Wang and Smyth [10], among others) that I have tried and dropped in my estimation. I show that the first moment is a simple and effective factor to evaluate the parametric estimation of grouped income data. Each parametric LC may do a better job than all others in fitting the observations of a particular subset of a large panel of grouped income data; however, a zero-and unit-modal single-parametric LC is identified to perform the best for most (666 out of 969) observations in the panel dataset.
The paper consists of six sections. Section 2 discusses the MDSs, MDC and modality of ten single-parametric LCs, Section 3 describes the data, Section 4 contains the estimation results, Section 5 performs Monte Carlo simulation and Section 6 concludes.
2. Single-parametric Lorenz function
After considering as many meaningful single-parametric versions as I can find for all current multiparametric LCs (e.g., Ogwang and Tao [8], Wang and Smyth [10], Hossain and Saeki [9], Abdalla and Hassan [11], Wang, Smyth and NG [12], among others) and the widely cited single-parametric functions, I come up with ten single-parametric LCs which are discussed in this subsection.
Employing the analysis of Shao [2] and theorems 1 and 2 from Krause [1], I present the following two propositions (Proposition 2 is also described in Sarabia, Jorda and Trueba [13]), which are used later in the context to calculate the mean and mode income points on the Lorenz curve.
Proposition 1 Let p denote the cumulative population share, y be the cumulative income share, y = L(p) be a differentiable LC, F(x) be the cumulative income probability function, and x be an income variable with a mean of μ. I define the cumulative population share at the mean income μ as MPS = F(μ). Then, the MDSs can be found by the following equations:
Proposition 2 For a C3(third differentiable and continuous) parametric LC, y = L(p), it is associated with a cumulative probability function F(x) and a continuous density function f(x). The density function has a mode xm if the following conditions are satisfied:
Brief proofs for the two propositions are provided in S4 Appendix. For convenience of expression, the following definition is presented for the first moments of the Lorenz curve.
Definition: The first moments of (an income density function on its associated) Lorenz curve are the mean division shares (MPS, MIS), MDC and mode (pm, ym).
I discuss the first moments (MDSs, MDC, pm, ym) for the following ten single-parametric LCs using the above two propositions.
1. The RGKO Lorenz function
I first discuss the following single-parametric version of the biparametric LC by Rasche, Gaffney, Koo, and Obst [14], which is one of the Lamé LCs introduced by Henle, Horton and Jakus [15] and studied by Sarabia, Jordá and Trueba [13]. I denote it as RGKO for convenience of exposition.
Let L′ (p) = 1 and solve the equation for MPS. Then, the following results are obtained for the MDSs of the RGKO function:
(2)
Employing proposition 2, the RGKO LC’s third derivative, the mode population share (pm) and income share (ym) are as follows:
The third derivative of the RGKO function shows that it is associated with a zero- and unit-modal density. The mode equation of the RGKO LC shows that to hold pm > ym, which is a necessary condition for the mode to exist, we need α < .369, which leads to the MDSs restrictions MPS > 0.75 and MIS < .25. Therefore, the RGKO LC could be a good option to fit the grouped income data that imply a zero- or unit-modal density function and satisfy the first moments conditions 0.75 < MPS < 1, MIS < 0.25, and MDC = 1, which are generally for those grouped income data with a large income inequality because the Pietra ratio (MPS − MIS) would be larger than 0.5 under the MDSs constraints. Shao (2017) notes that the Gini coefficient is positively correlated with MPS and negatively correlated with MIS in the data.
Fig 2 below shows the MDSs and the mode of the RGKO LC with its parameter α. It is easy to see that the RGKO function’s first moments have the following restrictions: MPS > 0.5, MIS < 0.5 and pm > 0.333 > ym. For the Weibull function, its first moments have the restrictions: pm < 0.634, and MPS ∈ (0.43, 0.64). The mean of the MPS of the data is 0.6367. Therefore, the RGKO function may perform a better estimation than the Weibull function for the observations in which the MPS is larger than 0.64 due to their different restrictions on the first moments. The observations of the MPS are often larger than 0.64 in a large proportion of the dataset.
2. The ASR Lorenz function
Nicholas Rohde [7] proposes a single-parametric LC as follows:
Sarabia, Prieto, and Sarabia [16] find that it is a reparameterization of the function proposed by Aggarwal and Singh [17]. I thus denote the function as ASR. Krause [1] notes that the ASR LC is associated with a zero-modal density function.
The ASR function’s MDSs are as follows:
(3)
The MDCs of the ASR and RGKO functions are one, and the two functions also have the same restrictions for the MDSs. However, as mentioned, their associated density functions have different modalities. The RGKO function can perform much better estimations than the ASR for the data implying unit-modal densities. Fig 3 below shows the MDSs of the ASR LC with its parameter β.
3. The Kakwani Lorenz function
Kakwani [18] proposes a three-parameter LC. I propose a single-parametric version of the function as follows: .
Let y′(p) = 1. Then, the function’s MDSs and MDC are as follows:
(4)
The third derivative of the function shows that its density function is zero-modal. Fig 4 below shows the MDSs and MDC with parameter δ. Clearly, MDC ∈ (.948, 1.333) for this LC function.
Thus, the MDC of the Kakwani LC can be either greater or less than one. A limitation of the function is that the MPS is constant, which could be a bad restriction to its capacity to fit the data. However, the mean and standard deviation of the MPS in the disposable income data are 0.64 and 0.046, respectively. Thus, the MPS of the grouped income data is quite stable. It will be shown that the Kakwani LC is still a good option to fit many observations in the data.
4. The OMFLG Lorenz function
Ortega, et al. [19] proposed a three-parameter LC. I propose a single-parametric version, denoted as OMFLG, for the function as follows:
The function’s third derivative is
It is easy to see that the associated density of the OMFLG function is zero-modal since the solution to y‴ (p) = 0 is . Let y′ (MPS) = 1. Then, its MDSs and MDC are solved as follows:
(5)
Fig 5 below shows the MDSs and MDC of the OMFLG LC with its parameter θ.
5. The Chotikapanich Lorenz function
A single-parametric function proposed by Chotikapanich [5] is as follows:
Krause (2014) shows that the Chotikapanich function is associated with a zero-modal density function. The following results are obtained for the MDSs and MDC:
(6)
Fig 6 below shows that its MDC is located in the interval (.8638, 1).
For the hierarchy of exponential Lorenz functions (Sarabia, Castillo and Slottje [3]), L1(p; k, α), the maximum of its MDC is less than that of the Chotikapanichi function; hence it does not outperform RGKO, Pareto and Kakwani functions in fitting the Lorenz curves that have an MDC equal or larger than one. The exponential Lorenz function, L1(p; k, α), does not increase the MPS, MIS and MDC of the initial exponential function (), because both the exponential function and Chotikapanich Lorenz function are convex, and their product shifts toward the left of the two functions. However, in the paper [3] the other two hierarchies of exponential LCs, L2(p; k, γ) and L3(p; k, α, γ), could be more flexible than L1(p; k, α) in fitting particular data, which are beyond the task of this study.
6. The Gupta Lorenz function
Gupta [6] proposes the following single-parametric LC function:
The third derivative shows that it is zero-modal. The function does not have closed solutions for the MDSs, and it shows similar properties to the Chotikapanich LC with regard to the MDSs. That is, MDC < 1, MPS > .5, and MIS < .5. Table 1 shows the MDSs and MDC for a few values of the parameter τ.
7. The Pareto Lorenz function
The Pareto Lorenz curve has the following cumulative distribution function (CDF) with a positive scale parameter xm and a positive shape parameter γ:
The Pareto Lorenz function is as follows:
Krause [1] shows that the Pareto function is associated with a zero-modal density function. The following results are obtained for the MDSs and MDC:
(7)
Fig 7 below shows the MDSs and MDC of the Pareto LC with parameter γ.
8. The Weibull Lorenz function
Krause [1] discusses the mode of the Weibull distribution, which has the following probability density function with a scale parameter a and a shape parameter b:
The Weibull LC function is as follows:
The MDSs and MDC for the Weibull LC are as follows:
(8)
The above equations show that the mode population share pm of the Weibull density function is less than 0.634, and the corresponding MDC is restricted in the interval (0.86, 0.897), which is not as flexible as the RGKO LC mode and MDC. The MDSs of the Weibull LC take larger values for the MIS and smaller values for the MPS than those of the RGKO LC. Shao [2] notes that a larger Gini coefficient is significantly associated with a larger MPS and a smaller MIS. Therefore, the Weibull function might not be a good option to estimate the data with a large Gini coefficient, but the RGKO function might be the better one to fit those data.
Fig 8 below shows the mode equation, and it is easy to see that pm < .634.
Fig 9 below graphs the MDSs and MDC, and it is easy to see that MPS > .43 > MIS and MDC ∈ (.86,1).
9. The Fisk Lorenz function
The Fisk distribution has the following CDF, which is unit modal as the shape parameter β > 1.
Mean income μ and mode m of the distribution are as follows:
Its MPS and mode population share are as follows:
The Fisk Lorenz function is as follows:
Plugging MPS and pm into the Fisk Lorenz function yields its MIS and ym, respectively. Fig 10 below shows the Fisk MDSs and MDC. Its MDC takes values in the interval (1, 1.09), which is a subset of the interval of the Pareto’s MDC. Thus, the Fisk function is not as flexible as the Pareto function regarding the MDC.
Fig 11 below shows the mode population and income shares of the Fisk function. The population share at the mode is less than 0.5, which dramatically restricts the function from capturing the mode of the empirical and simulation datasets because the mode population share is often larger than 0.5 for many observations in the datasets.
10. Lognormal Lorenz function
The lognormal distribution has the following CDF:
The distribution is unit modal, and its mean μ and mode m are as follows:
Its MDSs and Lorenz function are as follows:
Its mode population share and income share are as follows:
It is easy to see that its MDC is constant at 1: MDC = 1. Therefore, three functions (RGKO, ASR, Lognormal) have a constant MDC of one, among which empirical and simulation estimations will show that the RGKO function outperforms the other two; meanwhile, the mode population share is less than 0.5, which restricts the function from capturing the mode of the empirical and simulation datasets because the mode population share is often larger than 0.5 for many observations in the datasets. Fig 12 below shows the first moments of the lognormal LC:
Table 2 below summarizes the restrictions of the first moments of the ten single-parametric functions. The table shows that each function has different restrictions on its first moments from what other functions have, which implies the differences in the flexibility of the functions in fitting grouped income data and capturing the first moments.
The Weibull, RGKO, Fisk and Log-normal functions are zero- and unit-modal, and the other six functions are zero-modal. The Weibull function could do a good job in fitting those unit-modal density data that satisfy the conditions MDC ∈ (0.86, 0.897) and MPS < 0.643, which also restricts its capacity to fit the data because, as the following data subsection shows, the sample mean of the MDC, 1.02, is relatively far away out of the interval, and the mean of the MPS is 0.6367 in the data, which means that approximately half of the observations in the data do not satisfy the MPS restriction of the Weibull function.
There are two related reasons why I focus on these ten functions. One is that I include the latest and widely used single-parametric functions (RGKO, ASR, Pareto, Chotikapanich, Gupta, Weibull, Fisk and lognormal), from which I find that all the MDCs of these functions are either greater or less than one (or equal to one), and none can reach all of the three cases by changing its parameter values. That is, they are not perfectly flexible in the position of MDP; therefore, I am motivated to explore more flexible parametric functions that allow MDC to be both greater and less than one. The other is that I have tried many single-parametric versions of the current multiparametric LCs and ended up with the two functions of OMFLG and Kakwani, which have a more flexible MDP.
Actually, the RGKO LC outperforms all other optional functions in the sense that it can do a better job of both fitting more observations and capturing the first moments, and there are also two reasons for its excellent performance. One is that its MDC is constant at one and that it can be both zero-modal and unit-modal; although the ASR and log-normal functions also have the constant unit MDC, the ASR function is not unit modal, and the lognormal function’s mode position is more limited than that of the RGKO. Moreover, all other functions can only allow their MDC to be one at one value of their parameters at most (e.g. refer to the MDC curves of the Kakwani and OMFLG functions in Figs 4 and 5). The other is that the data’s mean (1.018) of the MDC is very close to one, that is, a large number of observations have unit MDC. Therefore, the RGKO, ASR and lognormal functions match the data best regarding the position of the MDP, but the first moments of the RGKO are the most flexible among the three functions. Similar to the Kakwani LC, its MPS is 2/3, which is very close to the mean (0.636) of the MPS in the dataset, and it is relatively more flexible than the other eight single-parametric functions because its MDC can be both larger and less than one.
In summary, each of the above ten parametric LCs has particular restrictions on its first moments (MDSs, MDC, mode). A prerequisite for a parametric LC to be a good option to fit grouped income data is that their first moments must match each other. That is, the first moments of the density function implied by the data must match the first moments of the parametric LC function. It will be shown in the empirical estimation and simulation subsections that this condition has been almost perfectly satisfied between each function and the subset data of the function.
3. Data
I use the data of the MDSs calculated by Shao [2] from a large panel dataset retrieved from WIID4.3 (https://www.wider.unu.edu/project/wiid-world-income-inequality-database). The income variable definition is disposable income per capita. I use the decile data because the goodness of fit may vary for the same observation between the quintile and decile data, and the decile data often provide better accuracy than the quintile data for parametric functions even though there are cases that quintile data occasionally can be better fitted than the decile data by the GMM method (Jordá, Sarabia and Jäntti [20]). The MDS data are estimated using the polynomial estimator in Shao [2]. That is, a polynomial LC function is estimated for each observation using Newton’s method; then, the MDSs and MDC are calculated from the estimated polynomial LC.
The polynomial function is supposed to be the most flexible function in function approximation; hence, I assume that the polynomial estimation could capture the empirical first moments of the grouped income data, which might lead to biased estimation and there will be no such concern if the micro data (income data of the density distribution) are available. Fortunately, later simulation shows that the summary statistics of the first moments in both the empirical estimation and simulation are consistent (please refer to Table 8), and the conclusions of the empirical estimation and simulation do not differ. Therefore, the potential bias of the polynomial estimation does not mislead my conclusion.
The decile dataset has 969 observations for 82 countries from 1953 to 2015. The minimum MDC is 0.8443, which is for Norway in 1973, and the maximum MDC is 1.2493, which is for Nepal in 1977. Table 3 summarizes the MDSs and MDC of the dataset. It shows that the first moments seem to be pretty stable because their standard deviations are relatively very small with respect to their mean values, and there are the following inequality relationships, MIS < MPS < MDC. Fig 17 in S2 Appendix graphs the observations of the first moments by country.
4. Estimation
I apply three steps to identify the best parametric LC in fitting grouped income data and capturing the first moments. First, I use the estimator of nonlinear least squares to estimate the parameter of a single-parametric LC. That is, I solve the following minimization problem for each parametric LC:
where L(pi, x) is a single-parametric LC; x is the parameter to be estimated; and {(pi,yi):i ∈ N, 0 ≤ i ≤ 10} is an observation, which denotes an LC implied by the decile data of an income distribution. I compare the eight values of SSELC and choose the function with the least SSELC for each observation. Then, I am able to find one parametric function to best fit every observation. That is, the 969 observations could be divided into eight subsets, each of which can be best fitted by one parametric function in the sense of the least SSELC.
Second, I assume that the estimated MDSs in Shao [2] are the real values of the first moments of the income density implied by grouped income data, and are the estimated first moments of a parametric LC from the first step. I calculate the standard deviation between
and MDSs and take it as the goodness of fit for a function’s performance in capturing the data’s first moments.
I compare the functions’ values of SSEMDP for each observation (LC) and choose the function with the least SSEMDP for the observation. Then, the 969 observations can be partitioned into eight subsets again, each of which can be best fitted by one function in the sense of the least SSEMDP.
Last, I choose the parametric function to be the best option for an observation when the two measures (SSELC, SSEMDP) of the goodness of fit lead to the same function. Of course, it is possible that a parametric LC might not be favored using the two measures for any of the observations, and one function might be more favored than the others for some observations.
I proceed with the estimation using the 8 functions in Table 2 excluding the Fisk and lognormal functions because previous discussion shows that the two functions are not as flexible as other functions and inclusion of the two functions does not change our results. Table 4 summarizes the estimation results for the first step of the procedure. The Gupta and Chotikapanich functions are dropped because there are no observations that can be better fitted by the two functions.
The second column of Table 4 is the number of successes, which means the number of observations in which a parametric function always presents a smaller SSELC than others. The third column shows the restrictions of the MDSs and MDC for each parametric function. The last four columns are the data summary for each subset of the data that are best fitted by the function in each row.
The data statistics in Table 4 show that the first moment restrictions of a parametric LC are almost perfectly satisfied by each subset of the data. The table also shows that the RGKO LC has the best fit in 801 out of 969 observations of the data, and the Kakwani LC follows with 134 observations. There are no observations that can be best fitted by the Gupta and Chotikapanich LCs regarding the goodness of fit SSELC.
The RGKO and ASR LCs have the same restrictions on the first moments, but the RGKO LC performs much better than the ASR LC. The cause of the performance difference might be their modality. That is, if a priori the modality of an income data is unknown a parametric LC with zero- and unit-modal density offers more flexibility and may fit the data much better than those with zero-modal density.
Table 5 summarizes the results for the second step of the procedure. It shows that each parametric function can capture the first moments of some observations better than all others; the RGKO and Kakwani functions still outperform other functions for greater numbers of observations.
Finally, Table 6 below shows the estimation results of the three-step procedure for the eight parametric LCs. The second column, success, represents the number of observations for which a single-parametric LC shows a minimum value in both SSELC and SSEMDP among the eight functions in fitting the large panel of grouped income data and capturing the first moments. The Gupta and Chotikapanich functions are dropped because they are outperformed by other functions for all observations. It shows that the RGKO and the Kakwani functions are more powerful options than others and that the RGKO LC fits the data the best in the sense of both the grouped income data and the first moments for most (666) of the 969 observations in the data.
The three-step estimation procedure shows that the least SSEs estimated on the income data and first moments may not lead to the same function; however, when a function’s first moments are more flexible than others at describing the data’s first moments, it could be a better option. The RGKO function is found to fit 801 income observations better than other functions, in which 135 (801 − 666) observations’ first moments are not well captured by the function with the least SSELC; it can also capture the first moments better than others for 690 observations, in which 34 (690 − 666) income observations are not well fitted with the least SSEMDP. However, the RGKO function still performs the best in the estimations because most of the observations (666 out of 969) can be perfectly fitted by this function in the sense of the smallest values for both SSELC and SSEMDP.
Table 6 shows the summary statistics of the six income datasets corresponding to the six functions that each has the smallest SSELC and SSEMDP on the data. Two points can be drawn from the table. First, a parametric LC function’s flexibility in fitting grouped income data could be described by or at least related to the constraints of its first moments. Second, the summary statistics of the data’s first moments are located within the constraint intervals of the function’s first moments, and they are distinct among the six subsample datasets. Therefore, the first moments of the income data, either estimated from grouped income data using a polynomial estimator or calculated from micro data, can be used to choose the right functions in fitting the data so that both the income data and its first moments are well fitted.
The MPS, MIS and MDC must be considered together when choosing functions because only after having spotted both the MDP and inequality can we know what the LC looks like. The MDC tells the location (>, =, <1) of MDP, and MPS and MIS jointly determine inequality size, which can be measured by their difference (Pietra ratio = MPS − MIS). It is not enough to look at only the match of MDSs between the data and parametric functions because the matched MDSs do not mean that the MDCs could match. Therefore, Table 6 shows that the summary statistics of the three first-moment metrics from each subsample data always best match the constraints of the chosen function.
Figs 13 and 14 below draw graphs between the SSE and the first moments (MDC and MPS), which show that the MDC and MPS are the key factors for the RGKO and Kakwani functions to be respectively mirrored by the SSEMDP and SSELC. The standardized (square root) SSE is smaller when the MDC data are closer to one for the RGKO LC and so is when the MPS data are closer to 2/3 for the Kakwani LC. The shapes of SSEMDP and SSELC are similar for the two functions in the figures because the two functions fit both the income data and the first moments well.
The RGKO LC is more powerful than the others in fitting grouped income data and hitting the first moments because its first moments match the data better than the other functions.
In summary, a single-parametric Lorenz function outperforms other options in the least squares estimation if the function’s first moments match the data’s first moments better than the others do.
5. Simulation
I perform a Monte Carlo simulation to generate a set of random LCs for each function and apply the estimation procedure in Section 4 on the generated grouped-income data. The random LCs (grouped income data and the first moments) are calculated from a set of values of the function’s parameter that is uniformly distributed in its restricted interval (see Table 2). The estimation is exactly the same as what I do in Section 4; that is, I fit the income and MDSs data using the 8 functions and compare the residuals of one function with the other 7 functions (excluding the Fisk and log-normal functions because inclusion of the two functions does not change our results). Note that the data generated from either LC (grouped income data) or its associated density function (micro data) are equivalent because a Lorenz function and its associated density function express the same income distribution.
Taking the RGKO function as an example, the simulation procedure is as follows. First, we generate a set (e.g., 1000) of random values for the parameter, which is uniformly distributed in (0, 1), and set up the cumulative population shares to be a vector of 11 points including all decile points, 0 and 1. Second, I generate a set of 1000 random RGKO LCs (grouped income data) by the functions (1) and (2). Finally, I perform the same estimation procedure as Section 4 with the simulated data. I repeat the above procedure for the other 7 functions.
For example, for a set of random RGKO LCs including extreme shapes when the parameter α is less than 0.25, some of the income data and their first moments can be better fitted by the other functions (ASR, OMFLG, Pareto and Weibull functions). For the RGKO LCs generated when the parameter α is greater than 0.25, none of the other 7 functions can outperform the RGKO function in fitting both the income data and capturing its first moments; hence, we call the interval (0.25, 1) the critical boundary of the RGKO function in the simulation. Fig 15 shows the two sets of random RGKO LCs: one is for α to be within the critical boundary, and the other is for α to be out of the boundary. Each set has 1000 LCs.
Therefore, there could be extreme cases of a parametric function; the function might not be able to beat out the other functions in fitting these extreme cases of the function. Table 7 summarizes the critical boundaries described by intervals for the 7 functions’ parameters. Each function outperforms the other 7 functions in both fitting the data and hitting the first moments that are randomly generated by the parameter to be within the critical boundary. Note that the critical boundaries might be shrunk if more flexible functions are included in the estimation.
Table 8 below summarizes the data statistics for each function in both the simulation estimation and the empirical estimation (see Table 6). Each function outperforms all other functions in both fitting the grouped income data and hitting their first moments in the estimations using the sub data sets associated with the statistics. Table 8 shows that the empirical statistics are always within the limit of simulation statistics (called critical values of the first moments) for each function.
Two points are concluded from Table 8. One is that a parametric function can fit both the data and its first moments very well when the statistics of the data’s first moments match the statistics (critical values) of the simulation of the chosen function; therefore, the critical values of the first moments may serve as standard rules to evaluate empirical estimation; the other is that the flexibility of parametric LC functions in fitting a dataset could be explained by the size of standard deviation of the first moments in the Monte Carlo simulation of the functions. For instance, RGKO has the best flexibility in its simulated MDSs because the standard deviation (0.13 for both MPS and MIS) of its simulated MDSs is the largest among the 7 functions.
In summary, the flexibility of parametric LC functions in fitting a dataset could be explained by the size of the standard deviation of their first moments in the simulation.
6. Concluding remarks
Grouped income data are associated with an income probability density function. A parametric Lorenz function also implies a probability density function. Therefore, the first moments of the two density functions must be well matched when using a parametric Lorenz function to fit the data; that is, the first moments of parametric Lorenz functions must be considered to evaluate the estimation. The least SSE of the estimated first moments can be used to measure the goodness of fit when comparing different parametric functions.
The single-parametric RGKO function outperforms the other optional single-parametric functions in the empirical estimation, which is confirmed by it having the largest standard deviation of its MDSs in Monte Carlo simulation. The flexibility of a parametric LC can be described by the standard deviation of its first moments in the simulation, which is associated with a critical boundary of its parameter, and the critical boundary also determines the function’s capacity in both fitting the data and capturing the data’s first moments.
Nonetheless, it is possible to find new single-parametric LCs to outperform the RGKO LC in fitting the data with unit MDC because the first moments are necessary but not sufficient factors to describe the income density function and grouped income data perfectly, and higher-degree moments could also be considered in the estimation.
It is complex to choose the best one from multiple different single-parametric functions in fitting grouped income data. Multiparametric LC functions are often more flexible than single-parametric functions, and it is likely to simplify the work to find one at the cost of more complicated functional forms, which warrant future work.
Acknowledgments
The author would like to thank two anonymous referees, Markus Jäntti, Melanie Krause, Tsunfeng Chiang, and participants at the Econometrica Society 2018 conferences at Xiamen University, the China Economics Society 2019 conference at Dongbei University of Economics and Finance, and the CEC 2018 Summer Conference at the Jiangxi University of Economics and Finance for their informative comments.
References
- 1. Krause Melanie. Parametric Lorenz Curves and the Modality of the Income Density Function. Review of Income and Wealth, 2014, Vol. 60, Issue 4, pp. 905–929. https://doi.org/10.1111/roiw.12047
- 2. Shao Liang F. How Do Mean Division Shares Affect Growth and Development? Panoeconomicus, 2017, 64 (5), pp. 525–545. https://doi.org/10.2298/PAN150520033S
- 3. Sarabia José-María, Castillo Enrique and Slottje Daniel J.. An Exponential Family of Lorenz Curves. Southern Economic Journal, 2001, Vol. 67, No. 3. pp. 748–756.
- 4. Gastwirth Joseph L. The Estimation of the Lorenz Curve and Gini Index. Review of Economics and Statistics, 1972, 54, 306–16.
- 5. Chotikapanich D. A Comparison of Alternative Functional Forms for the Lorenz Curve. Economics Letters, 1993, 41, 21–29.
- 6. Gupta M. R. Functional Form for Estimating the Lorenz Curve. Econometrica, 1984, 52, pp. 1313–1314.
- 7. Rohde Nicholas. An Alternative Functional Form for Estimating the Lorenz Curve. Economics Letters, 2009, 105 (1), pp. 61–63. https://doi.org/10.1016/j.econlet.2009.05.015
- 8. Ogwang Tomson and Tao U. L. G.. A New Functional Form for Approximating the Lorenz Curve. Economics Letters, 1996, 52 (1), pp. 21–29. https://doi.org/10.1016/0165-1765(96)00833-6
- 9. Hossain Sharif, and Saeki Chikayoshi. A New Functional Form for Estimating Lorenz Curves. Journal of Business and Economics Research, 2003, 1 (1). https://doi.org/10.19030/jber.v1i1.2959
- 10. Wang ZuXiang and Smyth Russell. A Piecewise Method for Estimating the Lorenz Curve. Economics Letters, 2015, 129(C), pages 45–48. https://doi.org/10.1016/j.econlet.2015.02.008
- 11. Abdalla Ibrahim M. and Hassan Mohamed Y.. Maximum Likelihood Estimation of Lorenz Curves Using Alternative Parametric Model. Metodoloski zvezki, 2004, 1(1), pp. 109–118. http://mrvar.fdv.uni-lj.si/pub/mz/mz1.1/abdalla.pdf.
- 12. Wang ZuXiang, Smyth Russell and Yew-Kwang NG. A New Ordered Family of Lorenz Curves with an Application to Measuring Income Inequality and Poverty in Rural China. China Economic Review, 2009, 20, pp.218–239.
- 13. Sarabia J. M., Jordá V. and Trueba C.. The Lamé Class of Lorenz Curves. Communications in Statistics Theory and Methods, 2017, 46(11), 5311–5326.
- 14. Rasche R. H., Gaffney J., Koo A. Y. C. and Obst N.. Functional Forms for Estimating the Lorenz Curve. Econometrica, 1980, 48, pp.1061–1062.
- 15.
Henle J. M., Horton N. J. and Jakus S. J.. Modelling Inequality with a Single Parameter. In: Modeling Income Distributions and Lorenz Curves, Chotikapanich D. Edit., Economic Studies in Inequality, Social Exclusion and Well-Being, 2008, Vol. 5:255–269, Springer Verlag, New York.
- 16. Kakwani N.C. On a Class of Poverty Measures. Econometrica, 1980, 48, 437–446.
- 17.
Jorda, V., J. M. Sarabia and M. Jäntti. Estimation of Income Inequality from Grouped Data. 2018, arXiv preprint arXiv:1808.09831.
- 18. Aggarwal V. and Singh R.. On Optimum Stratification with Proportional Allocation for a Class of Pareto Distributions. Communications in Statistics, Theory and Methods, 1984, 13, 3017–3116.
- 19. Ortega , Martín P., Fernández A., Ladoux M. and García A.. A New Functional Form for Estimating Lorenz Curves. Review of Income and Wealth, 1991, 37, pp. 447–452.
- 20. Sarabia J.M., Prieto F. and Sarabia M.. Revisiting a Functional Form for the Lorenz Curve. Economics Letters, 2010, 107, 249–252. https://doi.org/10.1016/j.econlet.2010.01.038