Lomax exponential distribution with an application to real-life data

In this paper, a new modification of the Lomax distribution is considered named as Lomax exponential distribution (LE). The proposed distribution is quite flexible in modeling the lifetime data with both decreasing and increasing shapes (non-monotonic). We derive the explicit expressions for the incomplete moments, quantile function, the density function for the order statistics etc. The Renyi entropy for the proposed distribution is also obtained. Moreover, the paper discusses the estimates of the parameters by the usual maximum likelihood estimation method along with determining the information matrix. In addition, the potentiality of the proposed distribution is illustrated using two real data sets. To judge the performance of the model, the goodness of fit measures, AIC, CAIC, BIC, and HQIC are used. Form the results it is concluded that the proposed model performs better than the Lomax distribution, Weibull Lomax distribution, and exponential Lomax distribution.


Introduction
In probability theory, it has been a usual practice for the last few years to modify the existing probability distributions so as to improve the flexibility of the existing models. These modifications are based on different methods such as increasing the number of parameters, making some transformation in the original distribution, proper mixing of two distributions etc. The main goal of such modifications is to improve the flexibility of the classical models. Motivating from the above methods, Ghitany and Al-Awadhi [1] proposed a compound form of the Lomax distribution with exponential distribution. Cordeiro et al. [2] modified the gamma-G family of distributions. Zografos et al. [3] employed the cumulative distribution function (Cdf) of the Lomax distribution as a baseline distribution. Lemonte et al. [4] and Lai et al. [5] used the idea of combining two distributions. Lemonte et.al [6] demonstrated the idea of Mcdonald-G family of distribution with a Lomax baseline function. Ibrahim et al. [7] modified the Lomax distribution by producing the real number to the power of the cumulative distribution function (Cdf) of Lomax distribution. Ashour and Eltehiwy [8], Merovci and Puka [9] and Khan et al. [10] utilized the well-known method that is the transmutation technique to generate new probability distributions.
In this paper, we propose a modification to the Lomax distribution. The Lomax distribution is defined as: let a positive random variable Y has the Lomax distribution with parameters a and b, then the cumulative distribution function (Cdf) takes the form where a and b are the shape and scale parameters respectively. The probability density function related to (1) is given by The above probability distribution has been modified by many researchers. For example, Cordeiro et al. [2] explored the gamma-Lomax distribution and discussed its applications to real data sets. Lemonte et al. [6] presented an extended Lomax distribution. Ibrahim et al. [7] produced a new three parameters probability distribution and referred to it as exponentiated Lomax distribution. Ashour and Eltehiwy [8] discussed the new modification to the Lomax distribution and termed it as a transmuted exponentiated Lomax distribution. Tahir et al. [11] discussed the Weibull Lomax distribution with applications to applied data.
The Lomax distribution is a heavily skewed probability distribution that plays a vital role in modeling the lifetime data sets produced in business, computer science, medical and biological sciences, engineering, economics, income and wealth inequality, Internet traffic and reliability modeling. The Lomax or Pareto II distribution have been applied to model the data related to income and wealth [12,13], the distribution of computer files on server [14], reliability and life testing [15] etc. The Lomax distribution is an alternative to the exponential distribution when the data are heavily tailed [16]. The Lomax distribution has also been applied to record values by Ahsanullah [17]. El-Bassiouny et al. [18] investigated the exponential Lomax distribution. Afify et al. [19] defined the transmuted Weibull-Lomax distribution with real-world applications. For other probability distributions and their applications to different fields, we refer to see [20][21][22][23][24][25][26][27][28][29][30][31], and [32-36] respectively.
In reliability theory where one deals with life testing experiments, most of the data sets result in non-monotonic hazard rate shapes. In such situations, the existing distributions fail to provide an adequate fit to the data. The main goal of this paper is to provide a new probability model that would be more flexible which adequately represents the data sets and have tractable statistical properties. The proposed model shall refer to as Lomax exponential distribution. The proposed model is produced using the transformation X ¼ Ye Y in the Lomax distribution. In the following section we have derived different statistical properties including hazard rate function, survival function, quantile function, moments, order statistics, parameter estimation, Renyi entropy, and asymptotic confidence bounds of the proposed model. We have further explored applications of the proposed model with two real data sets in addition to a simulation study.

Lomax exponential (LE) distribution
Let a random variable Y has the Lomax exponential distribution with parameters a and b. The parameters a and b are the shape and scale parameters respectively. The cumulative distribution function of the Lomax exponential distribution is given by The corresponding probability function to (3) is given as The hazard rate function and survival functions respectively are defined as The behavior of the hazard rate function Theorem 1. The behavior of the hazard rate function of Lomax exponential (a,b) distribution h(y) is studied by taking the derivative of the hazard rate function in Eq (5) and is given by Simplifying we get h 0 ðyÞ ¼ À ae y ðe y À by À 2bÞ ðye y þ bÞ The mode of the above expression is the roots of h 0 (x) = 0. If b>1, then h 0 (x) = 0 implies that the h(x) has a maximum at

Quantile function and median
The quantile function Q (FL) (y) of the LE(a,b) is the real solution of the following equation where u~Uniform (0,1). Solving (9) for y, we have where W (.) is the product log function. For calculating the median we have to put u = 0.5 in Eq (10) to have Rth moment Theorem 2. If Y has a Lomax exponential distribution with parameters a and b then the r th moments (about the origin) of X, say u 0 r , does not exist.
n ! y nþk in the above expression to have By solving (12) the integral in (12), we get expression for u 0 r as follows Hence the skewness and kurtosis can be defined by using the relation, where, var(y) = E(y 2 )−E 2 (y).

Order statistics
Let Y 1 ,Y 2 ,. . .,Y n be ordered random variables, then the probability density function (Pdf) of the i th order statistics is given by, The 1 st and n th order probability density function (pdf) of the LE can be obtained using (3) and (4) in (16) to have

Parameter estimation
In this section, the usual method, that, the maximum likelihood estimation is used to find out the estimates of the unknown parameters of LE(a,b) based on complete information. Let us assume that we have a sample Y 1 ,Y 2 ,. . .,Y n from LE(a,b). The Likelihood function is given by Substituting (4) in (19), we get By applying the natural logarithm to (20), the log-likelihood function is Now computing the first partial derivatives of (21) and setting the results equal zeros, we have The above Eqs from (22) to (25) are not in closed form. For the solution of these explicit equations, we refer to using some iterative procedure such as Newton Raphson, Bisection methods, or some other to get the approximate maximum likelihood estimates (MLE) of these parameters.

Asymptotic confidence bounds
Since the MLE of the unknown parameters a,b are not in closed forms, therefore, it is not possible to derive the exact distribution of the MLE. We have derived the asymptotic confidence bounds for the unknown parameters of LE(a,b) based on the asymptotic distribution of the MLE. For the information matrix, we find the second time partial derivatives of the Eqs from (22) to (25) and are given as So that the observed information matrix is given by Hence the variance-covariance matrix is approximated as To obtain the estimate of V, we replace the parameters by the corresponding MLE ' s to get Using the above variance-covariance matrix, one can derive the (1β) 100% confidence intervals for the parameters a and b as followinĝ a � Zb

Renyi entropy
Theorem 3. If a random variable X has a LE(a,b), then the Renyi entropy R H (x) is defined by By employing the result of the above expression in (30) we have Solving the function under the integral sign, finally we get

Applications
In this section, we provide an application of the LE distribution to two real data sets to illustrate its usefulness and compare its goodness-of-fit with other invariant forms of the Lomax distribution including the Weibull Lomax (WL) [11], Exponential Lomax (EL) [18], and the Lomax (L) [37], by using Kolmogorov-Smirnov (K-S) statistic, Akaike Information Criterion (AIC), Consistent Akaike Information Criterion (CAIC), Bayesian Information Criterion (BIC), and Hannan Quinn information Criterion (HQIC). Formulae of these criteria are given by where L is the maximized likelihood function and y i is the given random sample,ĉ is the maximum likelihood estimator and p is the number of parameters in the model.

Data set 2: Breaking stress of carbon fibers
The second real data set represent the failure times of 84 aircraft windshield. This data is taken from an article published by [18].  Table 2 represent the goodness of fit measures AIC, CAIC, BIC, and HQIC of the Lomax exponential distribution for the wind catastrophes data. Table 3 represent the maximum likelihood estimates and Table 4 represent the goodness of fit measures AIC, CAIC, BIC, and HQIC using breaking stress of carbon fibers data. In general, the model is to be considered the best one among others for which these (AIC, CAIC, BIC, and HQIC) statistics values are small. From Table 2 and Table 4, it is evident that the LE model leads to the preferable fit over the Lomax, Weibull Lomax, and Exponential Lomax distribution. Fig 3 show the theoretical and empirical probability density function (Pdf) and cumulative distribution function (Cdf) and Fig 4 provides the Q-Q plot and P-P plot of the Lomax exponential for data set 1. Fig 5 shows the theoretical and empirical probability density function (Pdf) and cumulative distribution function (Cdf) and Fig 6 provides the Q-Q plot and P-P plot of the Lomax exponential for data set 2. It is evident that the LE distribution fitted the line very well as compared to others Simulations Expression (11) can be easily used to draw random data from LE(a,b) distribution. The experiment is repeated for 100 times with a sample of size n = 30, 60, and 90 for different values of the parameter. The average bias and Mean square error (MSE) are given in Table 5. The results reveal that increase in the sample size results in a decrease in both the bias and MSE. The mathematical form of the mean square error and bias are as follows: ðâ i À aÞ

Total time on the test (TTT)
The TTT plot plays an important role in identifying the appropriate model to fit the given data in respect of the failure rates. This plot tells us the different forms of the failure rate. If the TTT plot has a straight line (diagonal), this indicates that the given data has a constant failure rate.
The failure rates will be increase if this plot is concave and decreases if it is convex. For the bath-tub shape, this plot first decreases and then increases. Similarly, if the failure rates follow some inverted bath-tub shape, then it will be first concave and then convex. The TTT plot is The TTT plots for the data (losses due to wind catastrophes and breaking stress of carbon fibers) are given in Fig 7. The graph clearly shows that the proposed distribution plays an important role both in monotonic and non-monotonic hazard rate shapes.

Conclusion
In this paper, we presented a new modification of the Lomax distribution consisting of two parameters called Lomax exponential Distribution (LE). The statistical properties of the LE distribution are obtained including moments, entropy measures, hazard function, Survival function, median, mode, order statistics, etc. Furthermore, the parameters of the model are estimated using the maximum likelihood estimation method. Asymptotic confidence intervals of the parameters, based on MLE, have been constructed. In future, a study may be conducted to estimate the parameter of the proposed model using Bayesian approach. The behavior of the hazard rate function has been investigated. It is concluded that the Lomax exponential distribution can model data sets having both monotonically and non-monotonically hazard rate shapes. The paper also presents an application of the LE distribution by using two real data sets. The results based on the real-life data sets reveal that the proposed distribution is more flexible for the lifetime data sets and provide a better fit to the data sets as compared to other competing probability models including the Lomax distribution, Weibull Lomax distribution, and exponential Lomax distribution.
Supporting information S1 Data. Losses due to wind catastrophes.