Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

A new extended gumbel distribution: Properties and application

  • Aisha Fayomi ,

    Contributed equally to this work with: Aisha Fayomi, Sadaf Khan, Muhammad Hussain Tahir, Ali Algarni, Farrukh Jamal, Reman Abu-Shanab

    Roles Conceptualization, Methodology

    Affiliation Faculty of Science, Department of Statistics, King Abdulaziz University, Jeddah, Saudi Arabia

  • Sadaf Khan ,

    Contributed equally to this work with: Aisha Fayomi, Sadaf Khan, Muhammad Hussain Tahir, Ali Algarni, Farrukh Jamal, Reman Abu-Shanab

    Roles Data curation, Formal analysis, Investigation, Visualization, Writing – original draft

    smkhan6022@gmail.com

    Affiliation Department of Statistics, The Islamia University of Bahawalpur, Bahawalpur, Pakistan

  • Muhammad Hussain Tahir ,

    Contributed equally to this work with: Aisha Fayomi, Sadaf Khan, Muhammad Hussain Tahir, Ali Algarni, Farrukh Jamal, Reman Abu-Shanab

    Roles Resources, Supervision

    Affiliation Department of Statistics, The Islamia University of Bahawalpur, Bahawalpur, Pakistan

  • Ali Algarni ,

    Contributed equally to this work with: Aisha Fayomi, Sadaf Khan, Muhammad Hussain Tahir, Ali Algarni, Farrukh Jamal, Reman Abu-Shanab

    Roles Investigation, Validation

    Affiliation Faculty of Science, Department of Statistics, King Abdulaziz University, Jeddah, Saudi Arabia

  • Farrukh Jamal ,

    Contributed equally to this work with: Aisha Fayomi, Sadaf Khan, Muhammad Hussain Tahir, Ali Algarni, Farrukh Jamal, Reman Abu-Shanab

    Roles Project administration, Writing – review & editing

    Affiliation Department of Statistics, The Islamia University of Bahawalpur, Bahawalpur, Pakistan

  • Reman Abu-Shanab

    Contributed equally to this work with: Aisha Fayomi, Sadaf Khan, Muhammad Hussain Tahir, Ali Algarni, Farrukh Jamal, Reman Abu-Shanab

    Roles Writing – review & editing

    Affiliation Department of Mathematics, College of Science, University of Bahrain, Zallaq, Bahrain

Correction

28 Aug 2024: Fayomi A, Khan S, Tahir MH, Algarni A, Jamal F, et al. (2024) Correction: A new extended gumbel distribution: Properties and application. PLOS ONE 19(8): e0309768. https://doi.org/10.1371/journal.pone.0309768 View correction

Abstract

A robust generalisation of the Gumbel distribution is proposed in this article. This family of distributions is based on the T-X paradigm. From a list of special distributions that have evolved as a result of this family, three separate models are also mentioned in this article. A linear combination of generalised exponential distributions can be used to characterise the density of a new family, which is critical in assessing some of the family’s properties. The statistical features of this family are determined, including exact formulations for the quantile function, ordinary and incomplete moments, generating function, and order statistics. The model parameters are estimated using the maximum likelihood method. Further, one of the unique models has been systematically studied. Along with conventional skewness measures, MacGillivray skewness is also used to quantify the skewness measure. The new probability distribution also enables us to determine certain critical risk indicators, both numerically and graphically. We use a simulated assessment of the suggested distribution, as well as apply three real-world data sets in modelling the proposed model, in order to ensure its authenticity and superiority.

1 Introduction

The employment of traditional probability models to anticipate real-life occurrences is causing increasing dissatisfaction among applied practitioners. Tail characteristics and goodness of fit metrics may have a constraining tendency, which could be one of the reasons. As a response, in recent years, there has been a substantial rise in the generalisation of well-known probability distributions. The challenge is to find such versatile families that can fit both skew and symmetric data. It’s important to realize that the majority of generalised distributions described in the literature are constructed using the generalised classes approach (G-class) and the compounding principle. In [1], the authors provided a concise review of literature regarding generalization of distributions and transformation through versatile parameter induction techniques. We offer to the readers a few, but not exhaustive, lists in chronological sequence: [214].

According to [15], Emil J. Gumbel originated the use of the Gumbel distribution (GuD) on data bearing extreme values. By “extreme data” we mean the behaviour of a random variable that occurs at the sample threshold level and is seen using intense data and insights. In the reference [16], the authors defined the GuD, alternatively known as extreme value distribution-type I (γ = 0), as the predominant model for quantifying extreme occurrences such as flood frequency analysis, droughts, earthquakes, sea currents and wind speed in order to understand the trajectory, magnitude, and pattern of complex phenomena. Environmental sciences, geology, accelerated life testing, meteorology, risk assessment and epidemiology are just a few of the a set of diverse fields where it might well be utilized. The authors in [17] showed that the score statistics of global sequence alignment follows a Gumbel distribution. In the reference [18], a comprehensive list of real life scenarios to which GuD can be applicable is provided by the authors. To learn more, see references [1924].

Earliest generalizations of GuD was reported by [25] by introducing a shape parameter to Gumbel distribution. [26] provided a generalization of GuD based on the asymptotic distribution of the mth extreme, tracing back to [27]. [28] emphasized on a trivial choice of distribution since the GuD with only location and scale parameter yields narrower confidence intervals and has also the danger of underestimating the level of return. In the reference [29], a unique modification of GuD was proposed. It is based on the logarithmic transformation of an odd Weibull variable and is defined as (1) where Δ = e(xμ)/σ, −∞ ≤ x ≤ + ∞, −∞ ≤ μ ≤ + ∞ and 0 ≤ βσ ≤ + ∞.

Since then, researchers adopted a more formalistic approach to generalize GuD. Some notable generalizations include [30] to define Beta-Gumbel (BGu), [31] to propose Kumaraswamy-gumbel (KumGu), [32] to define exponentiated-Gumbel (EGu), [33] to define exponentiated-generalized Gumbel (EGGu), just to mention some.

[14] proposed a simplified approach to generalize any continuous distribution viz. a viz. the transformed-transformer (T-X) family, which has become an indispensable part of modern distribution theory. Let z(t) be the probability density function (pdf) and Z(t) be the cumulative distribution function (cdf) of a random variable (av) T such that (a1, a2) with support −∞ < a1 < a2 < ∞. Let W[Z(x)] act as generator function of the cdf Z(x) of any baseline av such that K[Z(x)] is differentiable and increasing, lies in the defined range, i.e. a1K[Z(x)] ≤ a2 such that when x → −∞ as K[Z(x)] → a1 and x → + ∞ as K[Z(x)] → a2. (2)

The pdf corresponding to Eq (2) is (3)

To generalize any continuous distribution, the methodology defined by the cdf in Eq (2) has become indispensable part of modern distribution theory. In the same vein, Al-Aqtash et al. [34] introduced the Gumbel-X family of distributions. Let z(t) be the density function (pdf) and Z(t) be the distribution function (cdf) of an arbitrary variable (av) T such that (a1, a2) with support −∞ < a1 < a2 < ∞. Let K[Z(x; φ)] = log [(Z(x; φ))/1−(Z(x; φ))] act as the generator function of the cdf of any baseline av such that K[Z(x; φ)] fulfills the defined criterion. For μ = 0, the cdf of Gumbel-X family is given as (4) where Δ = Z(x)/(1 − Z(x)).

This study introduces a new class of distributions following the T-X methodology, viz. a viz. the exponentiated Gumbel-G (EGuG) family of distributions. This is achieved by replacing the link function K[Z(x; φ)] = log[−log(1 − Z(x; φ))] in T-X family. It is worthy to remember that the link function log[−log(1 − Z(x; φ))] = log[−log(Z(x; φ))] and either of the mentioned form can be employ. EGuG family has thus far not been reported in the literature. We choose EGu distribution to define new family due to its superiority over the ordinary GuD because of presence of shape parameter θ that entails the improvements in tail of the distribution. Moreover, to the best of our knowledge, majority of the extreme value theory literature is supported by data from meteorology, geology, seismology, and hydrology (see references [15, 1822, 24, 3034]). The health implications of climate-related shifts in extreme event exposure, on the other hand, have not been explored. This study’s theoretical investigation will presumably fill this void in existing literature. Additionally, log[−log(1 − Z(x; φ))] function involves double log transformation and cannot be employed on GuD, which somehow makes the link function redundant. Following the success of the proposed generator to generalize Logistic and Normal distributions, we use this generator to define EGuG distribution. We study some of its mathematical properties and provide general properties and application of one of its specific model.

This article is outlined as follows: In Section 2, we define the EGuG family and present some of its special models. In order to optimise the structure of the generalisation being proposed, we provide the linear representation of EGuG density along with some of the mathematical properties of the family such as shapes of density and hazard rate function, moments and generating function, order statistics and estimation of model parameters. In Section 3, we choose Nadarajah-Haghighi (NH) distribution as baseline model to form EGuNH distribution whose mathematical properties as well as some risk measures are established. A simulation study is also conducted for some parametric combinations. Section 4 comprises of the numerical illustrations based on three life data sets. In Section 5, the article’s concluding thoughts are summed up.

2 The EGuG family

Let T follows the EGu av with μ = 0 and shape parameters θ ≥ 0 and σ ≥ 0, say EGu (θ, σ), then its cdf is given by (5) The corresponding pdf to Eq (5) is given as (6) For any baseline distribution with cdf Z(x; Φ) and pdf z(x; Φ) = dZ(x; Φ)/dx, the cdf of EGuG family is given as (7) where θ, σ are shape parameters and φ is the vector of baseline parameter.

The pdf corresponding to Eq (7) is given by (8) where Z(x; φ) is the baseline cdf and z(x; φ) is the baseline pdf. Furthermore, the dependence on the vector φ of the parameters might be omitted at times and simply write Z(x) = Z(x; φ) and z(x) = z(x; φ). Henceforth, X ∼ EGu−G(θ, σ; φ) denotes an av having density Eq (8).

The survival function (sf), hazard rate function (hrf) and cumulative hazard rate function (chrf) of this new family are, respectively, given by and

Simulating the EGuG family is simply done by inverting Eq (7) as follows: If U has a uniform U(0, 1) distribution, then (9) has the density function Eq (8), where QZ(.) = Z−1(.) is the baseline quantile function (qf).

2.1 Special models

Eq (7) can be useful in modelling real life survival data with different shapes of hrf. Table (1) lists −log[Z(x; φ)] and the associated parameters for some special distributions.

thumbnail
Table 1. Distributions and corresponding −log[Z(x; φ)] functions.

https://doi.org/10.1371/journal.pone.0267142.t001

Here three special models of EGuG family of distribution are defined.

2.1.1 EGu-Weibull(EGuW).

The EGu-W model is defined from Eq (7) by taking Z(x; φ) = 1 − exp{−axb}, z(x; φ) = abxb−1 exp{−axb}, as cdf and pdf of the baseline Weibull distribution with a, b > 0, respectively.

The cdf and pdf of EGu-W distribution are, respectively, given by and where θ, σ and b are shape parameters while a is scale parameter.

2.1.2 EGu-BurrXII(EGuBXII).

Let us consider the parent distribution as BXII with power parameters a, b > 0 by taking Z(x; φ) = 1−(1 + xa)b, z(x; φ) = abxa−1(1 + xa)b−1 be the cdf and pdf of the distribution.

The cdf and pdf of EGu-BXII distribution are, respectively, given by and (10) where θ, σ, a and b are shape parameters.

2.1.3 EGu-Nadarajah Haghighi(EGuNH).

Consider to take Nadaraah Haghighi (NH) as baseline distribution with cdf as Z(x; φ) = [1 − exp{1−(1 + λx)β}] and pdf as z(x; φ) = λβ(1 + λx)β−1exp{1 − (1 + λx)β}. Then, the cdf and pdf of EGuNH reduces to (11) and (12) where θ, σ and α are shape parameters while λ is scale parameter.

2.2 Useful expansion for the EGuG cdf

We provide a useful expansion for Eq (7) in terms of linear combinations of exp-G distribution. For a random baseline cdf Z(x), an av is said to have the exp-G distribution having parameter such that > 0, say Y ∼ exp-G (), if its pdf and cdf are given as respectively. Thus, several properties of the proposed model can be derived from those properties of the exp-G distribution studied by the authors in [38], to mention few.

By expanding Eq (7) using binomial and power series expansion, the resultant expression is given (13)

Using Mathematica software, it can be verified that we can start the limit of integers (i, j) from 1 instead of 0 in above equation. Further, we can write [−log Z(x; φ)]i/σ as since .

Now consider, for any real parameter c and (0, 1), the following formula holds: (14) where P0(c) = 1/2; P1(c) = c(3c + 5)/24; P2(c) = c(c2 + 5c + 6)/48 etc is the stirling’s polynomial. Then, the cdf F(x) in Eq (13) can be expressed (using Eq (14) and generalized binomial expansion) as (15) where .

By differentiating Eq (15), we obtain (16) where hm(x) = mZm−1(x; Φ) z(x; Φ) is the exp-G density function with power parameter m.

2.3 Shapes of density and hazard function

Analytical descriptions of density and hrf forms are conceivable. The roots of the equation represent the EGuG density’s critical points: (17) The equation is used to find the EGuG hrf’s crucial points. (18) Any numerical software can be used to examine Eqs (17) and (18) to determine the local maximum and minimum and inflexion points.

2.4 Moments

The first formula for the sth moment of X follows from Eq (16) as (19) where Setting s = 1 in Eq (refrthmoment1pdfmix1) can provide explicit expression for the mean of several parent distributions.

A second alternative formula for is obtained from Eq (19) in terms of the baseline qf as (20) where

The central moments (μt) and cumulants (κt) of X can follow from Eq (19) as and , respectively, where . The skewness and kurtosis can be calculated from the third and fourth standardized cumulants.

The sth incomplete moment of X can be determined from Eq (16) as (21) The last integral can be computed for most G distributions.

A crucial applicability of the first incomplete moment m1(⋅) has to do with the Bonferroni and Lorenz curves, which are extremely beneficial in a variety of fields. For a given probability π, they are given by and , respectively, where m1(y) comes from Eq (refincompletepdfmix1) with s = 1 and q = Q(π) follows from Eq (9). The Lorenz and Bonferroni curve for EGuNH are displayed graphically (Figs 13 and 14, subsequently).

The totality of excursions from the mean and median is used to estimate the degree of scatter in a population and is defined by and , respectively, where is the mean and M = Q(0.5) is the median. These measures can be expressed as and , where is given by Eq (refcdfEGuG).

The moment generating function (mgf) of X can be expressed as where Mm(t) is the mgf of Ym. Hence, M(t) can be determined from the exp-G generating functions.

2.5 Order statistics

Order statistics are used in a wide range of statistical theory and practise. Let X1, …, Xn is a random sample from the EGu-G distribution and Xi: n denote the ith order statistic. Then, pdf of Xi:n can be written as Inserting Eq (refcdfEGuG) and Eq (refpdfEGuG) in the last equation, and expanding it as in section (3.1), we get (22) where and

2.6 Estimation

The three alternate approaches for inference are point estimation, interval estimation, and hypothesis tests. Several approaches for parameter point estimation have been published in the literature, the most extensively utilised of which is the maximum likelihood method. MLEs (maximum likelihood estimates) have properties that can be used to construct confidence ranges for model parameters. Large sample theory provides simple approximations that work well in repeated sampling for these estimations. The normal approximation for MLEs can be tackled analytically or computationally in distribution theory.

We use the optimum likelihood method to estimate the unknown parameters of the new distribution. Let x1, ⋯, xn be n observations from the EGu-G family given by Eq (8) with parameter vector Θ = (θ, β; Φ). The log-likelihood ℑ = ℑ(Θ) for Θ is given by (23) Eq (refmleegug) can be maximized either directly by using the R (optim function), SAS (NLMixed procedure) or Ox (MaxBFGS function), or then by solving the nonlinear likelihood equations by differentiating it. The components of the score vector U(Θ) are Setting these equations to zero and solving them simultaneously yields the MLEs of the family parameters.

The observed information matrix for the parameter vector Θ = (θ, σ, Φk) is given by whose elements can be determined by using any mathematical software. Under normal conditions of regularity, the multivariate normal distribution, where is the observed information analysed at , can be used to estimate confidence ranges for model parameters. Furthermore, we may use likelihood ratio (LR) statistics to assess the EGuG model to any of its specific models.

3 Properties of EGuNH

In comparison to Gamma, Weibull, and exponentiated exponential distributions, NH distribution (also known as extended exponential distribution) is the preferred option for zero inflated data. The cdf and pdf of NH distribution has already been defined in Section (3.1.3). For λ = 1, we define the cdf and pdf of EGuNH distribution as (24) and (25) where C = 1 − (1 + x)α and θ, σ and α are shape parameters.

Henceforth, we denote by X a av having density (25). The sf and hrf of X has the form and

3.1 Shapes of density and hazard rate function of EGuNH distribution

The crucial points of the pdf of X are obtained from the equation: Similarly, the critical points of the hrf of X are obtained from the equation:

Some plots of the density of EGuNH for selected parameter values are presented in Figs 14 while plots of the hrf of EGuNH for random parameter values are presented (Figs 58). It is apparent that the density of EGuNH can be reversed-J, unimodal, and symmetrical. Similarly, EGuNH hazard rate shapes may tend to be increasing, decreasing, bathtub, or upside-down bathtub. The new model is much superior at fitting data sets in a variety of risk evaluation scenarios.

thumbnail
Fig 1. Plots of EGuNH density for a variety of parameter combinations.

https://doi.org/10.1371/journal.pone.0267142.g001

thumbnail
Fig 2. Plots of EGuNH density for a variety of parameter combinations.

https://doi.org/10.1371/journal.pone.0267142.g002

thumbnail
Fig 3. Plots of EGuNH density for a variety of parameter combinations.

https://doi.org/10.1371/journal.pone.0267142.g003

thumbnail
Fig 4. Plots of EGuNH density for a variety of parameter combinations.

https://doi.org/10.1371/journal.pone.0267142.g004

thumbnail
Fig 5. Plots of EGuNH hazard rate for a variety of parameter combinations.

https://doi.org/10.1371/journal.pone.0267142.g005

thumbnail
Fig 6. Plots of EGuNH hazard rate for a variety of parameter combinations.

https://doi.org/10.1371/journal.pone.0267142.g006

thumbnail
Fig 7. Plots of EGuNH hazard rate for a variety of parameter combinations.

https://doi.org/10.1371/journal.pone.0267142.g007

thumbnail
Fig 8. Plots of EGuNH hazard rate for a variety of parameter combinations.

https://doi.org/10.1371/journal.pone.0267142.g008

3.2 Central properties of EGuNH distribution

In this section, some useful expressions for the linear expansion, moments and incomplete moments of EGuNH distribution have been deduced using the Eq (16).

Proposition 1. (26) where

. Recalling the result defined in Eq (15) as A straightforward differentiation of the above result yields density by (27) The result in (27) is the linear expansion of NH densities. Hence, we shall derive several core properties of EGuNH using the major result of Eq 27.

Proposition 2.

Let W be a av with density ψ(x;m, α). Then, several properties of W can follow from those of X. The sth ordinary moment of X can be written as (28) where and Utilizing the results derived in Eq (19), the sth moments are defined in (28).

Proposition 3.

The sth incomplete moment expression can be written as (29) where Following the results defined in Eq (21), the sth incomplete moments are defined in (29). The skewness and kurtosis of X can be calculated from the third and fourth standardized cumulants. The classical skewness (Fig 9) and kurtosis plots (Fig 10) of the EGuNH distribution are displayed. Additionally, we provide the graphical illustration of MacGvillary skewness (MGs), which is based on quantile approach, in Figs 11 and 12. These plots reveal that the parameters θ and σ play a decisive role in modeling the skewness and kurtosis behaviors of X.

thumbnail
Fig 11. MacGillivray skewness of EGuNH for a variety of parameter combinations.

https://doi.org/10.1371/journal.pone.0267142.g011

thumbnail
Fig 12. MacGillivray skewness of EGuNH for a variety of parameter combinations.

https://doi.org/10.1371/journal.pone.0267142.g012

MacGillivray [35] proposed another method to evaluate the skewness measure based on the qf and is defined as where (0, 1), Q(.) is the qf defined in (32).

The MG skewness plots are very sensitive for extremely small values of parameter θ and σ which certainly signifies longer tails of EGuNH. Likewise, plots of the Lorenz (Fig 13) and Bonferroni (Fig 14) curves of EGuNH distribution for some random values are displayed. These plots reveal how the distribution parameters affect inequality measures which can be used to establish some orderings, an essential feature for applied statisticians. Some descriptive statistics related to EGuNH are presented in Tables 2 and 3, respectively.

thumbnail
Fig 13. Plots of the Lorenz curves of EGuNH distribution.

https://doi.org/10.1371/journal.pone.0267142.g013

thumbnail
Fig 14. Plots of the Bonferroni curves of EGuNH distribution.

https://doi.org/10.1371/journal.pone.0267142.g014

thumbnail
Table 2. Descriptive measures of EGuNH for some parameter values.

https://doi.org/10.1371/journal.pone.0267142.t002

thumbnail
Table 3. Moments and moment ratios of EGuNH for some parameter combinations.

https://doi.org/10.1371/journal.pone.0267142.t003

3.3 Acturial measures EGuNH: Value at risk

The theory of finance is based upon risk evaluation. Investors are particularly interested to invest in entities in which there is minimum risk (specified with high degree of confidence) of losing money. In finance, value at risk (VaR) is the most extensively used metric for assessing liability. It is also known as quantile risk measure or quantile premium principle of the distribution of aggregate losses. It is characterised by a level of assurance “q” (usually at 95% and 99%). To a layman, VaR answers a simple question that “What is the worst case scenario that can happen in a particular investment?”

If X has pdf (25), then VaR is the qth quantile of its cdf (24), defined as (30)

3.4 Acturial measures EGuNH: Expected shortfall

Despite of the popularity of VaR measures, there are many shortcomings (see [36]). To counter inherent problems in VaR, Artzner et al. [37, 38] proposed the use of expected shortfall (ES). Expected shortfall quantifies the average loss in states beyond the VaR level. ES has a number of aliases such as “conditional VaR”, “mean excess loss” or “tail VaR”. We define the ES as follows (31) For a combination of various parameter values, plots of VaRs (Fig 15) and ESs (Fig 16) are displayed respectively.

thumbnail
Fig 15. Plots of the VaR of EGuNH distribution for some random parameter values.

https://doi.org/10.1371/journal.pone.0267142.g015

thumbnail
Fig 16. Plots of the ES of EGuNH distribution for some random parameter values.

https://doi.org/10.1371/journal.pone.0267142.g016

3.5 Parameter estimation of EGuNH

The log-likelihood function ℑ for the vector of parameters Θ = (θ, σ, α) for the model defined in (25) is given by The components of the score vector U(Θ) are The MLE of Θ can also be obtained by solving the nonlinear equations Uθ = 0, Uσ = 0 and Uα = 0. Because these equations cannot be solved analytically, the estimates can be calculated numerically using statistical software.

3.6 Simulation study of EGuNH distribution

The qf of the EGuNH distribution has an explicit form as follows (32) Here, we use Monte Carlo simulations to demonstrate the performance and correctness of maximum likelihood estimations of the EGuNH parameters by inverting Eq (24) to generate a sample data from the model. The simulation study is perform for sample sizes n = 50, 100, 200, 500, and parameter combinations: I: θ = 0.2, σ = 0.75 and α = 0.5, II: θ = 2.2, σ = 0.45 and α = 0.5, III: θ = 3.4, σ = 0.75 and α = 1.5 and IV: θ = 3.4, σ = 1.35 and α = 1.5. This study is carried out for N = 2000 times, each with given n and computed the average estimates (AEs) as well as their average biases (Bias), mean squared errors (MSEs) and coverage probabilities (CPs) of the MLEs. The AEs, Bias, MSEs and CPs for the parameters θ, σ and α are given in Tables (4)–(7). The empirical findings suggests that the bias and MSEs decreases as sample size increases. Further, the empirical CPs are quite close to the nominal level of 95%. As a result, MLEs and their approximate findings can be used to evaluate and build approximated confidence intervals of the EGuNH distribution parameters θ, σ and α.

4 Applications of the EGuNH distribution

Statistical methods that fail to account for all of the uncertainties in the model are prone to produce an overly optimistic assessment of future extremes, are frequently contradicted by observed extreme events in a variety of scientific fields. The current literature regarding extreme value theory is full of such models in which data sets are meteorology data such as earthquakes, floods, rains, droughts, hurricanes etc. (see [1532]). On the contrary, health hazards is an area of extreme value theory which should be explored. Death, damage, or disease; exacerbation of underlying medical disorders; and negative effects on mental health are some of the health hazards of climate-related increases in exposure to extreme occurrences.

In this section, we provide some applications of the EGuNH model on three real life phenomenons, two of which related to health hazards in extreme value theory. We estimate the unknown parameters of the distributions by the principal of maximum likelihood. We compute the log-likelihood function evaluated at the MLEs () using the method of a limited-memory quasi-Newton code for bound-constrained maximization (L-BFGS-B). In order to select the best probability model, a variety of criteria for evaluating information (ICs) can be considered. We considered the following well-known ICs: the maximized log-likelihood (), Akaike Information criterion (AIC), Anderson-Darling (A), Cramér-von Mises (W) and Kolmogorov-Smirnov measures (D; P-value (p)), where lower values of all these statistics except higher p values of K-S, indicate good fits. The required computations are carried out using the R script AdequacyModel which is freely accessible from http://cran.r-project.org/web/packages/AdequacyModel/AdequacyModel.pdf.

The fits of the EGuNH distribution is compared with other competitive models which are given in Table 8. The parameters are all positive real numbers of these densities.

4.1 Meteorology data

Meteorological phenomena are weather events that most individuals are affected by, due to changes in extreme weather and climatic events, such as earthquakes, heat waves, floods, hurricanes, droughts etc. The present data is taken from [44], denoted by D1, gives the time in days between successive serious earthquakes world-wide. An earthquake is included if its magnitude was at least 7.5 on the Richter scale, or if over 1000 people were killed. There were 63 earthquakes recorded altogether, and so 62 recorded waiting times. The data are: 840, 157, 145, 44, 33, 121, 150, 280, 434, 736, 584, 887, 263, 1901, 695, 294, 562, 721,40, 1336, 335, 1354, 454,139, 780, 203, 436, 30, 246, 1617, 638, 937, 735,76, 710, 36, 667,384, 129, 46, 402, 194, 40, 556, 99, 9, 209, 599, 38, 365, 92, 82, 220, 759, 304, 83, 319, 375, 832, 460, 567, 328.

4.2 Cancer data

According to [45], extreme events have the potential to disrupt the delivery of cancer care. For example, some deadly carcinogens may be released into communities as a result of hurricanes and wild fires; industry shutdowns may result in a shortage of life-saving medical equipment in hospitals, causing shortages in cancer facilities across the country; and infrastructure collapse may limit access to patients undergoing cancer therapies. The following two data, denoted by cancer 1 (D2) and cancer 2 (D3) are related to cancer patients.

Cancer data 1. The survival times, in weeks, of 33 patients who succumbed to Acute Myelogenous Leukemia are the subject of D2. This data was recently studied by the authors in [46] The data are: 65, 156, 100, 134, 16, 108, 121, 4,39, 143, 56, 26, 22, 1, 1, 5, 65, 56, 65, 17, 7, 16, 22, 3, 4, 2, 8, 4, 3, 30, 4,43.

Cancer data 2. D3 signifies the number of patients suffering from blood cancer. The Saudi Cancer Registry (SCR) provides such information, covering the time period from 1994 to the present day. The data is extracted from a report [47] which concerns an overview of cancer incidence statistics for Saudi Arabia in 2012. The data are: 1277, 1290, 1357, 1369, 1408, 1455, 1478, 1549, 115, 181, 255, 418, 441, 461, 516, 739, 743, 789, 807, 865, 924, 983, 1024, 1062, 1063, 1165, 1191, 1222, 1222, 1251, 1578, 1578, 1599, 1603, 1605, 1696, 1735, 1799, 1815, 1852. The descriptive statistics for each of the three data sets are given in Table 9.

thumbnail
Table 9. The descriptive statistics related to D1, D2 and D3.

https://doi.org/10.1371/journal.pone.0267142.t009

The empirical findings of all the three data are suggestive of the heavy tailed data. The TTT plots Figs (17)–(19) for the data sets are given. In particular, the TTT plots show bathtub, increasing and decreasing hrf, allowing us to fit EGuNH model on these data sets. The approximated hrf Figs (20)–(22) for each data point correlates to the TTT graphs. Table 10 summarizes the results of the MLEs and their related standard errors (in parentheses) of the model parameters for the proposed model while the ICs are listed in Table 11 for the D1, D2 and D3, respectively. It is customary to supplement the analytical result defined in Tables 9 and 10, by displaying it graphically. Hence, the estimated pdfs Figs (23)–(25), PP–plots Figs (26)–(28), estimated cdfs Figs (29)–(31) and estimated sfs Figs (32)–(34) for the three data sets are given. On the given data sets, the numerical values authenticates that the EGuNH model provides the best fit as compared to the other models.

thumbnail
Table 10. MLEs with their respective SEs (in parenthesis) for D1, D2 and D3.

https://doi.org/10.1371/journal.pone.0267142.t010

thumbnail
Table 11. The statistics , AIC, BIC, A, W, D and p for D1, D2 and D3.

https://doi.org/10.1371/journal.pone.0267142.t011

The variance-covariance matrices of the MLEs of the EGuNH distribution for D1 is The variance-covariance matrices of the MLEs of the EGuNH distribution for D2 is The variance-covariance matrices of the MLEs of the EGuNH distribution for D3 is

4.2.1 Numerical calculations of VaRs and ESs.

We were able to further investigate EGuNH’s application to these risk measures thanks to the results reported in Section 4. To quantify the volatility associated with these measures, we take the values of MLEs of D1, D2 and D3., respectively, from Table 11. Higher risk measures indicate heavier tails, while lower risk measures indicate a model with a much lighter tail tendency. It’s pertinent to mention that the EGuNH model yielded significantly more impressive results than others, implying that the model has a longer tail. The numerical findings of VaRs and ESs for data 1, data 2, and data 3 of the proposed model at respective level of significance (LoS) are shown in Table 12. The summarized output of these risk measures (VaRs in Figs 3537 and ESs in Figs 3840), graphically, for the reader’s expedience.

thumbnail
Table 12. Numerical measures of VaRs and ESs of EGuNH for D1, D2 and D3.

https://doi.org/10.1371/journal.pone.0267142.t012

5 Concluding remarks

We propose and study the EGuG model and obtain a wide range of mathematical and statistical modelling methods to characterise the model’s structural and dynamic aspects including properties such as quantile function, ordinary and incomplete moments, mean deviations, bonferroni and lorenz curves, generating function and order statistics. The parameters of the family are estimated by the method of maximum likelihood. An extended exponential distribution is taken as baseline model to propose EGuNH distribution. Some simulations are performed to check the asymptotic properties of the estimates. Three applications to real data set are presented to illustrate the potentiality of the proposed models. For future research, the proposed model can further be extended using compounding. We expect that the modification may facilitate in estimating analytically tractable Bayesian estimates of the reliability function under different priors.

Acknowledgments

The authors would seek this opportunity to thank the respected comments made by the reviewers which greatly help in the overall presentation of the manuscript.

References

  1. 1. Tahir MH, Nadarajah S, Parameter induction in continuous univariate distributions: Well-established G families. An. Acad. Bras. Ciênc. (2015) 87: 539–568. pmid:26131628
  2. 2. Azzalini A, A class of distributions which includes the normal ones. Scandanavian Journal of Statistics (1985)12: 171–178.
  3. 3. Mudholker GS, Srivastava DK, Exponentiated Weibull family for analyzing bathtub failure data. IEEE Transactions of Reliability (1993)42: 299–302
  4. 4. Marshall AW, Olkin I, A new method for adding parameters to a family of distributions with application to the exponential and Weibull families Biometrika 84 (1997), pp. 641–652.
  5. 5. Gupta RC, Gupta PL, Gupta RD, Modeling failure time data by Lehman alternatives Commun. Stat. Theory Methods 27 (1998), pp. 887–904.
  6. 6. Gupta RD, Kundu D, Exponentiated exponential family: An alternative to gamma and Weibull distributions Biometrical Journal 42 (2001), pp. 117–130
  7. 7. Eugene N, Lee C, Famoye F, Beta-normal distribution and its applications Commun. Stat. Theory Methods 31 (2002), pp. 497–512.
  8. 8. Nadarajah S, Kotz S, The exponentiated type distributions Acta Applicandae Mathematica, 92 (2006a), pp.97–111.
  9. 9. Gleaton JU, Lynch JD, Properties of generalized log-logistic families of lifetime distributions J. Probab. Stat. Sci. 4 (2006), pp. 51–64.
  10. 10. Gupta RD, Kundu D, Generalized exponential distribution Aust. N. Z. J. Stat. 41 (1999), 173–188.
  11. 11. Zografos K, Balakrishnan N, On families of beta- and generalized gamma generated distributions and associated inference Stat. Methodol. 6 (2009), pp. 344–362.
  12. 12. Cordeiro GM, DeCastro M, A new family of generalized distributions J. Stat. Comput. Simul. 81 (2011), 883–898.
  13. 13. Cordeiro GM, Ortega EMM, Cunha DCC, The exponentiated generalized class of distributions J. Data. Sci. 11 (2013), 1–27.
  14. 14. Alzaatreh A, Famoye F, Lee C, A new method for generating families of continuous distributions Metron 71 (2013), pp. 63–79.
  15. 15. Coles S, An Introduction to Statistical Modeling of Extreme Values Springer, NewYork, 2001.
  16. 16. Fisher RA, Tippet LHC, Limiting form of the frequency distribution of the largest or smallest number of the sample Proceedings of the Cambridge Philosophical society, (1928) 24:190–190.
  17. 17. Sardiu M, Alves G, Yu YK, Score statistics of global sequence alignment from the energy distribution of a modified directed polymer and directed percolation problem Physical Review E (2005), 72(6). pmid:16485984
  18. 18. Kotz S, Nadarajah S, Extreme Value Distribution: Theory and Applications Imperial college Press (2000), London.
  19. 19. Nadarajah S, Choi D, Maximum daily rainfall in South Korea J. Earth Syst. Sci., 116(4), August 2007, 311–320.
  20. 20. De M, A new unbiased plotting position formula for the gumbel distribution Stochastic environmental research and risk assessment (2000), 14:1–7.
  21. 21. Solomon O, Prince O, Flood frequency analysis of Osse river using Gumbel’s distribution Civil and environmental research (2013), 3(10).
  22. 22. Clarke RT, Fitting and testing the significance of linear trends in Gumbel distributed data Hydrology and Earth system sciences (2002), 6(1):17–24.
  23. 23. Salinas V, Perez P, Gonzalez E, Vaquera H, Goodness of fit tests for the Gumbel distribution with type-II right censor data Revista Colombiana de Estadistica (2012), 35(3): 409–424.
  24. 24. Miladinovic B, Tsokos CP, Senstivity of the Bayesian reliability estimates for the modified Gumbel failure model International journal of Reliability, quality and safety engineering (2009), 16(4):331–341.
  25. 25. Ojo MO, On a generalization of the Gumbel distribution Statistics on the Internet (2000).
  26. 26. Adeymei S, Ojo MO, A generalization of the Gumbel distribution Kragujevac Journal of Mathematics (2003), 25: 19–29.
  27. 27. Gumbel EJ, Les valeurs extremes des distributions statistiques Ann. Inst. H Poincare, 5, (1934), 115.
  28. 28. Persson K, Ryden J, Exponentiated Gumbel distribution for the estimation of return levels of significant wave height Journal of Environmental Statistics (2010), 1(3): 1–12.
  29. 29. Cooray K, Generalized Gumbel distribution Journal of Applied Statistics (2010), 37(1): 171–179.
  30. 30. Nadarajah S, Kotz S, The Beta-Gumbel distribution Mathematical Problems in Engineering (2004),4:323–332.
  31. 31. Cordiero GM, Nadarajah S, Ortega EMM, The Kumaraswamy Gumbel distribution Statistical Methods Application (2012), 21: 139–168.
  32. 32. Nadarajah S, The exponentiated Gumbel distribution with climate application Environmetrics (2006), 17: 13–23.
  33. 33. Andrade T, Rodriguez H, Bourguignon M, Cordeiro GM, The exponentiated generalized Gumbel distribution Revista Colombiana de Estadistica (2015), 38(1): 123–143.
  34. 34. Al-Aqtash R, Lee C, Famoye F, Gumbel-Weibull distribution: Properties and applications J. Mod. Appl. Stat. Methods (2014), 13:201–225.
  35. 35. Macgillivray HL, Skewness and Asymmetry: Measures and Orderings The Annals of Statistics, September 1986. https://doi.org/10.1214/aos/1176350046
  36. 36. McNeil AJ, Frey R, Embrechts P, Quantitative risk management: concepts, techniques and tools Princeton University Press (2005).
  37. 37. Artzner P, Delbaen F, Eber JM, Heath D, Thinking Coherently Risk (1997) November, 10(11):68–71.
  38. 38. Artzner P, Application of coherent risk measures to capital requirements in insurance North American Actuarial Journal (1999), 3(2):11–25.
  39. 39. Ramirez FAP, Guerraa RR, Canterleb DR, Cordeiro GM, The Logistic Nadarajah Haghighi distribution and its associated regression model for reliability applications Reliability Engineering and system safety (2020).
  40. 40. Sangsanit Y, Bodhisuwan W, The Topp-Leone generator of distributions: Properties and inferences Songklanakarin Journal of Science and Technology (2016), 38(5): 537–548.
  41. 41. Lemonte AJ, A new exponential-type distribution with constant, decreasing, increasing, upside-down bathtub and bathtub-shaped failure rate function Computational Statistical and Data Analysis (2013), 62:149–170.
  42. 42. Lemonte AJ, Cordeiro GM, Arenas GM, A new useful three-parameter extension of the exponential distribution Statistics (2016) 50:312–337.
  43. 43. Nadarajah S, Haghighi F, An extension of the exponential distribution Statistics: A Journal of Theoretical and Applied Statistics 45(6):543–558.
  44. 44. Castillo E, Hadi AS, Balakrishnan N, Sarabia M, ExtremeValue and related models withApplications in engineering and science Wiley Science: A John Wiley Sons Publication
  45. 45. Nogueira LM, Sahar LE, Jason A, Jemal A, Yabroff KR, Association Between Declared Hurricane Disasters and Survival of Patients With Lung Cancer Undergoing Radiation Treatment JAMA (2019), 322(3):269–271. pmid:31310288
  46. 46. Jamal F, Abuzaid AH, Tahir MH, Nasir MA, Khan S, Mashwani WK, New Modified Burr III Distribution, Properties and Applications Math. Comput. Appl. 2021 26(82). https://doi.org/10.3390/mca26040082
  47. 47. Bazarbashi S, Eid HA, Minguet J, Cancer Incidence in Saudi Arabia: 2012 Data from the Saudi Cancer Registry Asian Pacific Journal of Cancer Prevention (2017). pmid:28952273