Figures
Abstract
In Statistical theory, inclusion of an additional parameter to standard distributions is a usual practice. In this study, a new distribution referred to as Alpha-Power Pareto distribution is introduced by including an extra parameter. Several properties of the proposed distribution, including moment generating function, mode, quantiles, entropies, mean residual life function, stochastic orders and order statistics are obtained. Parameters of the proposed distribution have been estimated using maximum likelihood estimation technique. Two real datasets have been considered to examine the usefulness of the proposed distribution. It has been observed that the proposed distribution outperforms different variants of Pareto distribution on the basis of model selection criteria.
Citation: Ihtisham S, Khalil A, Manzoor S, Khan SA, Ali A (2019) Alpha-Power Pareto distribution: Its properties and applications. PLoS ONE 14(6): e0218027. https://doi.org/10.1371/journal.pone.0218027
Editor: Haroldo V. Ribeiro, Universidade Estadual de Maringa, BRAZIL
Received: March 1, 2019; Accepted: May 23, 2019; Published: June 12, 2019
Copyright: © 2019 Ihtisham et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: The authors received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Introduction
For the last few decades, improvement over standard distributions has become a common practice in statistical theory. Usually, an additional parameter is added by using generators or existing distributions are combined to obtain new distributions [1]. The purpose of such modification is to bring more tractability to the classical distributions for useful analysis of complex data structures. [2] and [3] developed a methodology of adding a new parameter in existing distributions. [4] presented an idea of beta generated distributions in which parent distribution is beta while baseline distribution can be the cumulative distribution function (cdf) of any continuous random variable. [5] modified the idea of [4] and replaced beta distribution by Kumaraswamy distribution. Further, [6] proposed the idea of T-X family of continuous distributions in which probability density function (pdf) of beta distribution was replaced by the pdf of any continuous random variable and instead of cdf, a function of cdf satisfying certain conditions was used. [7] provided a detail review on methods of generating univariate continuous distributions.
More recently, [8] presented a new method, called alpha power transformation (APT), for including an extra parameter in continuous distribution. Basically, the idea was introduced to incorporate skewness to the baseline distribution. The alpha power transformation is defined as follows:
Let F(x) be the cdf of any continuous random variable X, then cdf of APT family is given as (1)
The corresponding probability density function is (2)
Particularly, the generator was used to transform one parameter exponential distribution into two parameter alpha power exponential distribution. Several properties of the proposed distribution were studied including explicit expressions for survival function, hazard function, quantiles, median, moments, moments generating functions, order statistics, mean residual life function and entropies. Also, the shape behavior of pdf, hazard rate function and survival function were examined. [9] and [1] have successfully used the above generator for transforming two parameters Weibull distribution into three parameters alpha power Weibull distribution. The transformation has been applied by different researchers to obtain alpha power transformed distributions including alpha power transformed generalized exponential distribution [10], alpha power transformed Lindly distribution [11], alpha power transformed extended exponential distribution [12], alpha power transformed inverse Lindly distribution [13] etc.
Pareto distribution is a well-known distribution used to model heavy tailed phenomena [14]. It has many applications in actuarial science, survival analysis, economics, life testing, hydrology, finance, telecommunication, reliability analysis, physics and engineering [15–17]. Pareto distribution is successfully used by [18] for projection of losses in an insurance company, real state and liability experience of hospitals. [16] applied Pareto distribution to model sea clutter intensity returns. [19] used Pareto distribution for investigation of wealth in society. [20] considered generalized form of Pareto distribution to model exceedances over a margin in flood control. Many types of Pareto distribution and its generalization are available in literature. The Pareto distribution of first kind as described by [21] has the cdf as follows: (3)
It has two parameters α and k, where k is the lower bound of the data. [18] normalized the data by dividing each observation by the pre-selected lower bound that gives k = 1. Eventually, the cdf and pdf of Pareto distribution can be written as (4) (5) where β is the scale parameter. As the hazard rate function of Pareto distribution is decreasing and has reversed J shaped pdf, it may occasionally be inadequate to fit the data well. Practically, there can be various options for projection of risks and losses, for example, machine life cycle and human mortality has more flexible behavior. That is why researchers proposed various amendment and extensions of the Pareto distribution with different number of parameters [17]. For example, Generalized P [22], Exponentiated P [23,24], Beta P [25], Beta Generalized P [26], Weibull P [27,28], Kumaraswamy P [29], Kumaraswamy Generalized P [30], Exponentiated Weibull P [31], The Burr X-P [17], Exponentiated Generalized P [14].
The aim of this study is to propose a new and more flexible distribution, which, we call Alpha Power Pareto (APP) distribution, by introducing an additional parameter to Basic Pareto distribution, to obtain an adequate fit. Numerous properties of the APP distribution are studied in the following section along with more attractive expressions for quantile function, median, mode, moments, order statistics, mean residual life function and stress strength parameter. Lemma 1 and 2 contains expressions for stochastic ordering, Shannon and Renyi entropies respectively. The next section provides method of maximum likelihood estimation of parameters in addition to simulation studies. Two real data applications are used to check the effectiveness of the proposed model. Conclusions are provided in the last section.
Alpha Power Pareto (APP) distribution
Random variable X is said to have an APP distribution if its pdf is of the form (6) and 0 otherwise. By setting x-β = z in Eq (6), it can be easily verified that
The corresponding cdf of APP distribution is (7)
The survival (reliability) function and hazard rate function are obtained, respectively, as follows: (8) (9)
Henceforth, a random variable X that follows the distribution in (6) is symbolized by X~APP(α, β).
Figs 1 and 2 demonstrate the graphs of pdf and hazard function of APP distribution for different values of α when β is fixed. Clearly, the pdf of APP distribution is decreasing function for α < 1 and uni-modal and positively skewed for α < 1.
Quantile function
Quantile function is defined as an inverse of the distribution function. Consider the identity where U follows standard Uniform distribution. The pth quantile of APP distribution is given by (10)
Median of APP distribution can be obtained by putting p = 1/2, that is, (11)
Mode
The mode of the distribution can be found by solving the following equation
By taking the derivative of Eq (6) and equating it to zero and solving for x, mode becomes (12)
In Table 1 mode of the APP distribution is calculated for different choices of α and β. These results can be verified through Fig 1.
Moments
The moment generating function of APP distribution is given by (13) by substituting x-β = z and the following series representation (14) it can be easily verified that (15) by taking derivative of Eq (15) and putting t = 0,1,2…r, E(X), E(X2 ),…E(Xr) of APP distribution are obtained as (16) (17) (18)
Mean residual life function
Assuming that X is a continuous random variable with survival function given in Eq (8), the mean residual life function is defined as the expected additional lifetime that a component has survived until time t. The mean residual life function, say, μ(t) is given by (19) where (20)
Substituting Eqs (8), (16) and (20) in Eq (19), can be written as (21)
Stochastic ordering
Stochastic ordering plays a significant role for assessing the comparative behavior of continuous random variable. It is known that if a distribution has likelihood ratio (lr) ordering, then it possesses the same ordering in hazard rate (hr) and distribution (st). It is also known that if a family of distribution has likelihood ratio ordering, then there exists a uniformly most powerful test [32].
Lemma 1: Let X1~APP(α1, β) and X2~APP(α2, β) be two independent random variables. If α1 < α2 then
Proof: Likelihood ratio is given by
Order statistics
Let X1, X2, X3, …, Xn be a random sample of size n from APP distribution and let Yi:n denote the ith order statistics, then the pdf of Yi:n is given by (22) substituting the pdf and cdf of APP distribution in (22), we get the pdf of ith order statistics for y>1 as (23) by putting i = 1, we get first order statistics as (24) by putting i = n we get nth order statistics as (25)
Stress-strength parameter
Suppose X1 and X2 be two continuous and independent random variables, where X1~APP(α1, β) and X2~APP(α2, β) then the stress strength parameter, say S, is defined as using the pdf and cdf of APP distribution, stress strength parameter S, can be obtained as (26)
The use of (14) in Eq (26) yields (27)
Lemma 2: Shannon and Renyi entropy for random variable X that follows Alpha Power Pareto distribution is as follows (28) (29)
Proof:
For APP distribution, the Shannon and Renyi entropies are given respectively as the results can be obtained easily by using Eq (14).
Parameters estimation
Maximum likelihood estimation
Let X1, X2, X3, …, Xn be a random sample from APP (α1, β) then the likelihood function is given by (30) taking logarithm, Eq (32) becomes taking derivative of the above equation with respect to α and β and equating to zero, the following two normal equations are obtained (31) (32) by solving (31) and (32) simultaneously, MLE of α and β can be obtained. Standard algorithm like Newton Raphson method or Bisection method can be used to solve these nonlinear equations. It is well known that MLEs are asymptotically normally distributed i.e, where Σ is variance covariance matrix and can be obtained by inverting observed Fisher information matrix F as given below taking second derivative of Eqs (31) and (32) w.r.t α and β (33) (34) (35)
Asymptotic (1 − ζ)100% confidence intervals for parameters can be obtained as where Zζ is the upper ζth percentile of the standard normal distribution.
Simulations study
Simulation study has been performed for average MLEs, Mean Square Error (MSE) and bias. W = 1000 samples of size n = 50, 80, 100 and 120 were produced form APP distribution. Random numbers were generated by the following expression where U is uniform random numbers with parameter [0, 1]. Bias and MSE are calculated by where b = (α, β). Simulations results were obtained for different combinations of α and β. The average values of MSEs and Bias are displayed in Table 2. It can be illustrated clearly that these estimates are reasonably consistent and approaches to the true values of parameters as sample size increases. Furthermore, with increasing sample size the MSEs and Bias decrease for all parameter combinations. Therefore, it has been concluded that MLE process performs well in estimating the parameters of APP distribution.
Applications
Two data sets have been analyzed to demonstrate the performance of the proposed model. The first data set consists of 40 wind related catastrophes used by [33]. It includes claims of $2,000,000. The sorted values, observed in millions are as follows.
The second data set consists of survival time (in weeks) of 33 acute myelogenous leukaemia patients. The data has been analysed by [17, 34]. The data values are as follows.
The fit of the proposed APP distribution is compared with several other competitive models namely Basic Pareto, Pareto distribution by [35], Genaralized Pareto distibution by [22], Kumaraswamy Pareto distribution by [29], Exponentiated Generalized Pareto Distribution by [14] and Inverse Pareto distribution [36] with the following pdfs.
- Basic Pareto Distribution (BP)
- Pareto Distribution (PD)
- Generalized Pareto Distribution (GPD)
- Kumaraswamy Pareto Distibution (KPD)
- Exponentiated Generalized Pareto Distribution (ExGPD)
- Inverse Pareto Distribution (IPD)
The goodness of fit test is applied, using AdequacyModel package of R software, to check the performance of APP distribution and several other versions of Pareto distribution discussed above. Goodness of fit criteria include the result of Akaike’s Information Criteria (AIC), Consistent Akaike’s Information Criteria (CAIC), Bayesian Information Criterion (BIC), Hannan-Quinn Information Criteria (HQIC), -ln along with the result of Kulmogrov-Smirnov test (KS) and its p value as shown in Tables 3 and 4. In general, if the values of all the above criteria are smaller and p value is greater, the model is considered as good fit.
From the results provided in Tables 3 and 4 it is evident that AIC, CAIC, BIC, HQIC and -log-likelihood are lower for APP distribution as compared to the other fitted distributions. Promising performance of the proposed distribution is visible from Figs 3 and 4. Figs 5 and 6, QQ-plot and PP-plot is provided. Apparently, some of the values of QQ-plot depart from the fitted line, but actually, it is an expected behavior of a heavy tailed distributions [37].
Conclusion
The new distribution, termed as APP distribution, is introduced using alpha power transformation. Mainly, the transformation is applied for adding skewness to a family of distribution functions. Different properties of the distribution have been derived including moment generating function, order statistics, stress strength parameter, mean residual life function, mode, stochastic ordering and expressions for entropies. Maximum likelihood estimation procedure has been used to provide parameter estimates of the unknown parameters. The proposed distribution has been applied to two real datasets, which indicates its better performance as compared to other variants of Pareto distributions.
References
- 1. Dey S, Sharma VK, Mesfioui M. A new extension of Weibull distribution with application to lifetime data. Annals of Data Science. 2017; 4(1):31–61.
- 2. Mudholkar G S, Srivastava DK. Exponentiated Weibull family for analyzing bathtub failure-rate data. IEEE transactions on reliability. 1993; 42(2):299–302.
- 3. Marshall AW, Olkin I. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika. 1997; 84(3):641–52.
- 4. Eugene N, Lee C, Famoye F. Beta-normal distribution and its applications. Communications in Statistics-Theory and methods. 2002; 31(4):497–512.
- 5. Jones M. C. Kumaraswamy’s distribution: A beta-type distribution with some tractability advantages. Statistical Methodology. 2009; 6(1):70–81.
- 6. Alzaatreh A, Lee C, Famoye F. A new method for generating families of continuous distributions. Metron. 2013; 71(1):63–79.
- 7. Lee C, Famoye F, Alzaatreh AY. Methods for generating families of univariate continuous distributions in the recent decades. Wiley Interdisciplinary Reviews: Computational Statistics. 2013; 5(3):219–38.
- 8. Mahdavi A, Kundu D. A new method for generating distributions with an application to exponential distribution. Communications in Statistics-Theory and Methods. 2017; 46(13):6543–57.
- 9. Nassar M, Alzaatreh A, Mead M, Abo-Kasem O. Alpha power Weibull distribution: Properties and applications. Communications in Statistics-Theory and Methods. 2017 18;46(20):10236–52.
- 10. Dey S, Alzaatreh A, Zhang C, Kumar D. A new extension of generalized exponential distribution with application to Ozone data. Ozone: Science & Engineering. 2017; 39(4):273–85.
- 11. Dey S, Ghosh I, Kumar D. Alpha-Power Transformed Lindley Distribution: Properties and Associated Inference with Application to Earthquake Data. Annals of Data Science. 2018:1–28.
- 12. Hassan AS, Mohamd RE, Elgarhy M, Fayomi A. Alpha power transformed extended exponential distribution: properties and applications. Journal of Nonlinear Sciences and Applications. 2018; 12(4), 62–67.
- 13. Dey S, Nassar M, Kumar D. Alpha power transformed inverse Lindley distribution: A distribution with an upside-down bathtub-shaped hazard function. Journal of Computational and Applied Mathematics. 2019; 348:130–45.
- 14. Lee S, Kim J H. Exponentiated generalized Pareto distribution: Properties and applications towards extreme value theory. Communications in Statistics-Theory and Methods. 2018; 1–25.
- 15. Brazauskas V, Serfling R. Favorable estimators for fitting Pareto models: A study using goodness-of-fit measures with actual data. ASTIN Bulletin: The Journal of the IAA. 2003; 33(2):365–81.
- 16. Farshchian M, Posner F L. The Pareto distribution for low grazing angle and high resolution X-band sea clutter. Naval Research Lab Washington DC; 2010.
- 17. Korkmaz M, Altun E, Yousof H, Afify A, Nadarajah S. The Burr X Pareto Distribution: Properties, Applications and VaR Estimation. Journal of Risk and Financial Management. 2018; 11(1):1.
- 18.
Philbrick S W. A practical guide to the single parameter Pareto distribution. PCAS LXXII. 1985; 44–85.
- 19. Levy M, Levy H. Investment talent and the Pareto wealth distribution: Theoretical and experimental analysis. Review of Economics and Statistics. 2003; 85(3):709–25.
- 20. Castillo E, Hadi AS. Fitting the generalized Pareto distribution to data. Journal of the American Statistical Association. 1997; 92(440):1609–20.
- 21.
Johnson N. L., and Kotz S., Balakrishnan N. Continuous Univariate Distributions-I. New York: John Wiley; 1994.
- 22. Pickands J III. Statistical inference using extreme order statistics. The Annals of Statistics. 1975; 3(1):119–31.
- 23. Gupta RC, Gupta PL, Gupta RD. Modeling failure time data by Lehman alternatives. Communications in Statistics-Theory and methods. 1998; 27(4):887–904.
- 24. Nadarajah S. Exponentiated Pareto distributions. Statistics. 2005; 39(3):255–60.
- 25. Akinsete A, Famoye F, Lee C. The beta-Pareto distribution. Statistics. 2008; 42(6):547–63.
- 26. Mahmoudi E. The beta generalized Pareto distribution with application to lifetime data. Mathematics and computers in Simulation. 2011; 81(11):2414–30.
- 27. Alzaatreh A, Famoye F, Lee C. Weibull-Pareto distribution and its applications. Communications in Statistics-Theory and Methods. 2013; 42(9):1673–91.
- 28. Tahir M H, Cordeiro GM, Alzaatreh A, Mansoor M, Zubair M. A new Weibull–Pareto distribution: properties and applications. Communications in Statistics-Simulation and Computation. 2016; 45(10):3548–67.
- 29.
Pereira MB, Silva RB, Zea LM, Cordeiro GM. The kumaraswamy Pareto distribution. arXiv preprint arXiv:1204.1389. 2012.
- 30. Nadarajah S, Eljabri S. The kumaraswamy gp distribution. Journal of Data Science. 2013; 11(4):739–66.
- 31. Afify AZ, Yousof HM, Hamedani GG, Aryal G. The exponentiated Weibull-Pareto distribution with application. J. Stat. Theory Appl. 2016; 15:328–46.
- 32.
Shaked M, Shanthikumar Jeyaveerasingam G. Stochastic Orders. Series: Springer Series in Statistics. New York: Springer; 2007.
- 33.
Hogg R. and Klugman S.A. Loss Distributions. New York: Wiley; 1984.
- 34. Mead ME, Afify AZ, Hamedani GG, Ghosh I. The beta exponential Fréchet distribution with applications. Austrian Journal of Statistics. 2017; 46(1):41–63.
- 35.
Johnson N. L., and Kotz S. Continuous Univariate Distributions-2. Boston: Houghton Mifflin; 1970.
- 36. Guo L, Gui W. Bayesian and Classical Estimation of the Inverse Pareto Distribution and Its Application to Strength-Stress Models. American Journal of Mathematical and Management Sciences. 2018; 37(1):80–92.
- 37. Brodin E, Rootzén H. Univariate and bivariate GPD methods for predicting extreme wind storm losses. Insurance: Mathematics and Economics. 2009; 44(3):345–56.