Figures
Abstract
The paper investigates a new scheme for generating lifetime probability distributions. The scheme is called Exponential- H family of distribution. The paper presents an application of this family by using the Weibull distribution, the new distribution is then called New Flexible Exponential distribution or in short NFE. Various statistical properties are derived, such as quantile function, order statistics, moments, etc. Two real-life data sets and a simulation study have been performed so that to assure the flexibility of the proposed model. It has been declared that the proposed distribution offers nice results than Exponential, Weibull Exponential, and Exponentiated Exponential distribution.
Citation: Ijaz M, Mashwani WK, Belhaouari SB (2020) A novel family of lifetime distribution with applications to real and simulated data. PLoS ONE 15(10): e0238746. https://doi.org/10.1371/journal.pone.0238746
Editor: Feng Chen, Tongii University, CHINA
Received: June 26, 2020; Accepted: August 21, 2020; Published: October 1, 2020
Copyright: © 2020 Ijaz et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The data sets are taken from the Literature and the references are given at the end of the manuscript. The data set 1 is taken from [21] while the second data set is taken from [22]. The Links are respectively given by https://www.researchgate.net/profile/Pedro_Rafael_Marinho/publication/267467380_The_exponentiated_Lomax_Poisson_distribution_with_an_application_to_lifetime_data/links/5498ab5b0cf2eeefc30f9bed/The-exponentiated-Lomax-Poisson-distribution-with-an-application-to-lifetime-data.pdf https://rss.onlinelibrary.wiley.com/doi/abs/10.2307/2347795.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Probability distribution plays a vital role in modeling lifetime data that arise in different fields of science such as in Survival analysis, Economics, Biology, Engineering, and in some other applied field of sciences. There are many lifetime probability distributions that can be used to model the data, for example, Exponential, Weibull, and Weibull Exponential distribution are among others. All these distributions have desirable properties and real applications. However, these distributions fail to model the data following a non-monotonic hazard rate function, for example, Exponential distribution can only model the constant hazard rate and the Weibull distribution can only model a monotonic hazard rate function. In this paper, we have present a new distribution that can model both the monotonically and non-monotonically hazard rate functions. But in practice, we have real data sets which follow a non-monotonic hazard rate function, for example, the infant mortality rate, or the lifetime of an electronic device follows a non-monotonic hazard rate functions.
To overcome the above limitations found in the existing probability distributions, researchers are working to modify these distributions. It is usual practice to modify the current distributions by generating a generator and then applied to the existing distributions so as to derive a new probability model. For example, Aldeni et. al [1] produced a new family of distributions arising from the quantile of generalized lambda distribution, Cordeiro et. al [2] worked on the generalized odd half-Cauchy family of distributions, Alzaatreh et. al [3] presented a generalized Cauchy family of distributions, Alzaatreh et. al [4] introduced T-normal family of distributions, Nasir et. al [5] investigated the generalized Burr family of distributions based on quantile function, Mudholkar et. al [6] worked on the Exponentiated Weibull family of distribution.
This paper contributes a new scheme for generating probability distributions to the existing literature of probability theory. In this paper, a new scheme is investigated and applied to the existing probability distributions so as to derive a new probability distribution. The main objective of the paper is to achieve maximum flexibility while modeling the lifetime data with both the monotonically and non-monotonically hazard rate functions.
Let X be a continuous random variable follows the Exponential distribution then the cumulative distribution function (Cdf) is given by
(1.1)
The Exponential distribution is modified by many researchers, for example, Gupta and Kundu [7] presented the generalized distribution which is also known as Exponentiated Exponential distribution. The Cdf is given by
(1.2)
Barreto, and Cribari [8] introduced a generalization of the Exponential-Poisson distribution with the following Cdf
(1.3)
Barreto et. al [9] introduced the Beta Generalized Exponential distribution. El-Bassiouny [10] introduced the Exponential Lomax distribution. Mudholkar and Srivastava, (1993) defined the Exponentiated Weibull family of distribution [6]. Nadarajah and Kotz [11] present the Beta Exponential distribution.
In this paper, a novel family is produced called Exponential- H family of distribution. We discussed one special case of this family and call it New Flexible Exponential distribution (NFE) by employing the Weibull distribution as a baseline. The detailed discussion is as follows
Exponential- H family (Ex-H) of distributions
The Exponential- H family (Ex-H) is mostly related to the Weibull-G family of distributions investigated by Marcelo et. al [12]. The cumulative distribution function (CDF) of the Exponential- H family (Ex-H) takes the following form
(2.1)
where L(x;ζ) = H(x;ζ)exp(x), and H(x;ζ) is the non-decreasing function hazard rate function depending on the parameter vector ζ. The corresponding probability density function (PDF) is given by
(2.2)
New Flexible Exponential distribution (NFE)
This section illustrates the special case of the Ex-H family by considering the hazard function of the Weibull distribution. The hazard function of the Weibull distribution is defined by
By employing the above result in Eq (2.1) and (2.2), we obtained the CDF and PDF of the NFE distribution respectively
(3.1)
(3.2)
The survival and hazard rate function of NFE is defined by
(3.3)
(3.4)
Fig 1 shows the graphical representation of the probability density function and cumulative distribution function, with different parameter values.
Theorem 1. The behavior of the hazard rate h(x) function of NFE (a,b) is defined by
- Increasing when a>0,b>1,
- Decreasing when a>0,b≤1.
Proof. The derivative of Eq (3.4) is given by
For a>0,b≤1,h′(x). Then the function h(x) is decreasing and for a>0,b>1. For a>0,b>1, h′(x) = 0 implies that the h(x) has a maximum at
and, the function h(x) is increasing for a>0,b>1,
Hence, the hazard rate function has the ability to model both monotonically and non-monotonically hazard rate functions.
Fig 2 shows the plot for the hazard function of the New Flexible Exponential distribution with different values of a parameter.
Quantile function and median
The quantile function Q(NFE)(x) of the NFE(a,b) is the real solution of the following equation
(4.1)
where u~Uniform (0,1).
Solving (4.1) for x, we have
(4.2)
where W (Z) is the Lambert W function and is defined as
For the median, put u = 0.5 in Eq (4.2).
Rth moments
Theorem 2: If a random variable X has NFE distribution with parameters a,b then the rth moments (about the origin) of X is defined by
Putting (3.2) in the above expression, we obtained the following form
(5.1)
Solving the first part in the above expression (5.1), we have
(5.2)
Now solving the second part of (5.1), we have
(5.3)
Order statistics
Let X1,X2,X3,…Xn be ordered random variables, then the Pdf of the ith order statistics is given by,
(6.1)
The 1st and nth order probability density function of NFE can be obtained by putting (3.1) and (3.2) in (6.1), and is given by respectively
(6.2)
(6.3)
Parameter estimation
In this section, the maximum likelihood method is used to find out the estimates of the unknown parameters of NEF (a, b) based on a complete data set information. Let us assume that we have a sample X1,X2,X3…Xn from NEF (a,b). The Likelihood function is given by
(7.1)
Substituting (3.2) in (7.1), we get
(7.2)
By applying the natural log to (7.2), the log-likelihood function is defined by
(7.3)
To find the estimates of the unknown parameters, we have to compute the partial derivatives of (7.3) with respect to parameters and equate the results to zero
(7.4)
(7.5)
The above two Eq (7.4) and (7.5) are not in closed form. Thus, it is difficult to estimate the unknown parameters and hence we refer to use the numerical technique that is the Newton Raphson or Bisection method to get the MLE.
Asymptotic confidence bounds
Since, the MLE of the unknown parameters is not closed in form and thus the exact distribution of MLE cannot be derived. However, one can find the asymptotic confidence bounds for the unknown parameters of NEF(a,b) based on the asymptotic distribution of MLE which is as follows
The second time partial derivatives of Eq from (7.4) and (7.5) is respectively given by
(8.1)
(8.2)
(8.3)
The observed information matrix is defined by
Hence, the variance-covariance matrix is approximated as
To obtain the estimate of V, we have to replace the parameters by the corresponding MLE, which is defined as
(8.4)
By using the above variance-covariance matrix, we can derive the (1 - β) 100% confidence intervals for the parameters a and b in the following form
where
is the upper
percentile of the standard normal distribution.
Renyi entropy
Theorem 3: If a random variable X has NFE(a,b) then the Renyi entropy RH(x) is defined by
Proof. The general form of the Renyi entropy is given by
By employing (3.2) in the above expression, we have
(9.1)
using the following Binomial and exponential expansion
and
A solution to the integral form in the above expression leads to the final result
(9.2)
Applications
This section illustrates the usefulness of NFE(a,b) distribution by using two real data sets. The comparison with other distributions (Exponential, Weibull Exponential and Exponentiated Exponential distributions) have been studied by using different criteria including Akaike information criterion (AIC), Consistent Akaike Information Criterion (CAIC), Bayesian information criterion (BIC), and Hannan Quinn information criterion (HQIC). For a more detailed discussion on these criteria and their applications to various fields, we refer to see [13–20]. The mathematical form of these criteria are given by
where,
is the maximized likelihood function and yi is the given random sample,
is the maximum likelihood estimator and p is the number of parameters in the model.
As a general rule, a probability model with fewer values of these criteria should be considered the best-fitted model among other probability distributions.
Data set 1: Failure times of Aircraft windshield
The first data set represents the failure times of 84 Aircraft windshields recently studied by Ramos et. al [21]. The data set values are 0.040, 1.866, 2.385, 3.443, 0.301, 1.876, 2.481, 3.467, 0.309,1.899, 2.610, 3.478, 0.557, 1.911, 2.625, 3.578, 0.943, 1.912,2.632, 3.595, 1.070, 1.914, 2.646, 3.699, 1.124, 1.981, 2.661,3.779,1.248, 2.010, 2.688, 3.924, 1.281, 2.038, 2.82,3, 4.035, 1.281, 2.085, 2.890, 4.121, 1.303, 2.089, 2.902, 4.167, 1.432,2.097, 2.934, 4.240, 1.480, 2.135, 2.962, 4.255, 1.505, 2.154,2.964, 4.278, 1.506, 2.190, 3.000, 4.305, 1.568, 2.194, 3.103,4.376, 1.615, 2.223, 3.114, 4.449, 1.619, 2.224, 3.117, 4.485,1.652, 2.229, 3.166, 4.570, 1.652, 2.300, 3.344, 4.602, 1.757,2.324, 3.376, 4.663.
Fig 3 shows the empirical and theoretical Cdf and Pdf of NFE distribution while Fig 4 represents the QQ and PP plots. Table 1 represents the maximum likelihood estimates of the New Flexible Exponential distribution for aircraft data. Table 2 represents the goodness of fit criteria including AIC, CAIC, BIC, and HQIC. The numerical values in Table 2 are less for the New Flexible Exponential distribution than others and hence we conclude that the New Flexible Exponential distribution perform better as compared to Exponential, Weibull Exponential and the Exponentiated Exponential distribution.
Data set 2: Strengths of 1.5 cm glass bares
The second real data set represents the Strengths of 1.5 cm glass bares, measured at the National Physical Laboratory, England. The data set is taken from the Smith and Naylor [22] with the following values 0.55,0.93,1.25,1.36,1.49,1.52,1.58,1.61,1.64,1.68,1.73,1.81,2,0.74,1.04,1.27,1.39,1.49,1.53,1.59,1.61,1.66,1.68,1.76,1.82,2.01,0.77,1.11,1.28,1.42,1.5,1.54,1.6,1.62,1.66,1.69,1.76,1.84,2.24,0.81,1.13,1.29,1.48,1.5,1.55,1.61,1.62,1.66,1.70,1.77,1.84,0.84,1.24,1.3,1.48,1.51,1.55,1.61,1.63,1.67,1.7,1.78,1.89.
Fig 5 shows the empirical and theoretical Cdf and Pdf of NFE distribution while Fig 6 represents the QQ and PP plots. Table 3 represents the maximum likelihood estimates of the New Flexible Exponential distribution for aircraft data. Table 4 represents the goodness of fit criteria including AIC, CAIC, BIC, and HQIC. The numerical values in Table 4 are less for the New Flexible Exponential distribution than others and hence we conclude that the New Flexible Exponential distribution perform better as compared to Exponential, Weibull Exponential and the Exponentiated Exponential distribution.
Simulation
To conduct a simulation study, Eq (4.2) is used to generate random data from the New Flexible Exponential distribution. The simulation experiment is repeated for 100 times each with sample of size n = 120, 150 and 180. First, we fixed the parameter b = 0.3 and vary a = 0.008,0.009,0.01,0.02 0.3. Secondly, we fixed the variable a = 0.01 and vary b = 0.4,0.44,0.5,0.51. Table 5 demonstrates the mean Bias and Mean square error (MSE). The result given in Table 5 has shown that both the Bias and MSE are decreasing as the sample size n increase.
Conclusion
In this paper, a new family of distribution called Exponential-H (Ex-H) family of distribution is presented. The special case is derived by employing the Weibull distribution as a baseline distribution and we called it New Flexible Exponential distribution (NFE). Different statistical properties of the NFE distribution are obtained such as hazard function, Survival function, order statistics, moments, and Renyi entropy. The parameters of the model are estimated using the maximum likelihood method. Moreover, the simulation study is also carried out. Two data sets were used to support the usefulness of the NFE distribution. The numerical values conclude that the NFE distribution performed better than Exponential, Weibull Exponential, and Exponentiated Exponential distribution.
Supporting information
S1 Data. Failure times of Aircraft windshield [21].
https://doi.org/10.1371/journal.pone.0238746.s001
(TIF)
S2 Data. Strengths of 1.5 cm glass bares [22].
https://doi.org/10.1371/journal.pone.0238746.s002
(TIF)
Acknowledgments
I would appreciate and thank the referees for their comments and further suggestions so that to improve the paper further.
References
- 1. Aldeni M.; Lee C.; Famoye F. Families of distributions arising from the quantile of generalized lambda distribution. Journal of Statistical Distributions and Applications, 4(1), 2017, 25.
- 2. Cordeiro G. M.; Alizadeh M.; Ramires T. G.; Ortega E. M. The generalized odd half-Cauchy family of distributions properties and applications. Communications in Statistics-Theory and Methods, 46(11), 2017, 5685–5705.
- 3. Alzaatreh A.; Lee C.; Famoye F.; Ghosh I. The generalized Cauchy family of distributions with applications. Journal of Statistical Distributions and Applications, 3(1), 2016, 12.
- 4. Alzaatreh A.; Lee C.; Famoye F. T-normal family of distributions: a new approach to generalize the normal distribution. Journal of Statistical Distributions and Applications, 1(1), 2014, 16.
- 5. Nasir M. A.; Aljarrah M.; Jamal F.; Tahir M. H. A new generalized Burr family of distributions based on quantile function. Journal of Statistics Applications and Probability, 6(3), 2017, 1–14.
- 6. Mudholkar G. S., Srivastava D. K., & Freimer M. The exponentiated Weibull family: A reanalysis of the bus-motor-failure data. Technometrics, 37(4), 1995, 436–445.
- 7. Gupta R. D., Kundu D.. Generalized exponential distributions, Australian and New Zealand jounrnal of Statistics,41,(1999), 173–188.
- 8. Barreto-Souza W., & Cribari-Neto F. A generalization of the exponential-Poisson distribution. Statistics & Probability Letters, 79(24), 2009, 2493–2500.
- 9. Barreto-Souza W., Santos A. H., & Cordeiro G. M. The beta generalized exponential distribution. Journal of Statistical Computation and Simulation, 80(2), 2010, 159–172.
- 10. El-Bassiouny A. H.; Abdo N. F.; Shahen H. S. Exponential lomax distribution. International Journal of Computer Applications, 121(13), 2015.
- 11. Nadarajah S., & Kotz S. The beta exponential distribution. Reliability engineering & system safety, 91(6), 2006, 689–697.
- 12. Bourguignon M., Silva R. B., & Cordeiro G. M. The Weibull-G family of probability distributions. Journal of Data Science, 12(1), 2014, 53–68.
- 13. Huang H., Song B., Xu P., Zeng Q., Lee J., & Abdel-Aty M. (2016). Macro and micro models for zonal crash prediction with application in hot zones identification. Journal of transport geography, 54, 248–256.
- 14. Zeng Q., Wen H., Huang H., Pei X., & Wong S. C. (2017). A multivariate random-parameters Tobit model for analyzing highway crash rates by injury severity. Accident Analysis & Prevention, 99, 184–191. pmid:27914307
- 15. Zeng Q., Guo Q., Wong S. C., Wen H., Huang H., & Pei X. (2019). Jointly modeling area-level crash rates by severity: a Bayesian multivariate random-parameters spatio-temporal Tobit regression. Transportmetrica A: Transport Science, 15(2), 1867–1884.
- 16. Zeng Q., Hao W., Lee J., & Chen F. (2020). Investigating the impacts of real-time weather conditions on freeway crash severity: a Bayesian spatial analysis. International journal of environmental research and public health, 17(8), 2768. pmid:32316427
- 17. Dong B., Ma X., Chen F., & Chen S. (2018). Investigating the differences of single-vehicle and multivehicle accident probability using mixed logit model. Journal of Advanced Transportation, 2018. pmid:32908973
- 18. Chen F., Chen S., & Ma X. (2018). Analysis of hourly crash likelihood using unbalanced panel data mixed logit model and real-time driving environmental big data. Journal of safety research, 65, 153–159. pmid:29776524
- 19. Chen F., Chen S., & Ma X. (2016). Crash frequency modeling using real-time environmental and traffic data and unbalanced panel data models. International journal of environmental research and public health, 13(6), 609.
- 20. Chen F., & Chen S. (2011). Injury severities of truck drivers in single-and multi-vehicle accidents on rural highways. Accident Analysis & Prevention, 43(5), 1677–1688. pmid:21658494
- 21. Ramos M. W. A.; Marinho P. R. D.; Silva R. V.; Cordeiro G. M. The exponentiated Lomax Poisson distribution with an application to lifetime data, Advances and Applications in Statistics. 34, 2013, 107–135.
- 22. Smith R. L. and Naylor J. C. A comparison of maximum likelihood and Bayesian estimators for the three-parameter Weibull distribution. Applied Statistics 36, 1987,358–369.