Modeling lifetime and count data using a unified flexible family: Its discrete counterpart, properties, and inference

Ahmed Z. Afify; Maha M. Helmi; Hassan M. Aljohani; Sara M. A. Alsheikh; Hisham A. Mahran

doi:10.1371/journal.pone.0319091

Abstract

In this article, two flexible classes called the modified Kavya–Manoharan-G (MKM-G) and discrete modified Kavya–Manoharan-G (DMKM-G) families are investigated. The two proposed families provide more flexibility for modeling real-lifetime and count data from environmental, medical, engineering, and educational fields. Due to the new extra shape parameter of the two proposed families, their special sub-models are capable of modeling monotonic and non-monotonic hazard rates. The basic properties of the MKM-G family are studied. Eight classical approaches of estimation are used for estimating the MKM-exponential (MKME) parameters. The performances of the estimators are explored using simulation results. Additionally, the DMKM-exponential (DMKME) distribution is defined. Finally, the importance and flexibility of the MKME and DMKME distributions are addressed by fitting seven real-lifetime and count data from aforementioned applied fields. The real data analysis shows that the special models of the two classes are good candidates and can provide close fit as compared to well-known competing continuous and discrete distributions.

Citation: Afify AZ, Helmi MM, Aljohani HM, Alsheikh SMA, Mahran HA (2025) Modeling lifetime and count data using a unified flexible family: Its discrete counterpart, properties, and inference. PLoS ONE 20(4): e0319091. https://doi.org/10.1371/journal.pone.0319091

Editor: Shaiful Anuar Abu Bakar,, University of Malaya, MALAYSIA

Received: July 22, 2024; Accepted: January 27, 2025; Published: April 17, 2025

Copyright: © 2025 Afify et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All data used in this study are included in the article.

Funding: The authors extend their appreciation to Taif University, Saudi Arabia, for supporting this work through project number (TU-DSPP-2024-162).

Competing interests: The authors of this paper declare no conflict of interest.

1 Introduction

Recently, several attempts to develop generalized distributions have been made to model lifetime and count data in different applied fields. These generalized models have several applications including medicine, economics, biological studies, engineering, finance, and environmental sciences, among others. However, there is a clear need for more flexible distributions, which are capable of modeling different shapes of aging and failure criteria.

Some notable families are the Marshall–Olkin-G [1], Kumaraswamy-G [2], Weibull-G [3], Kumaraswamy transmuted-G [4], odd Dagum-G [5], log–logistic tan-G [6], modified generalized-G [7], logarithmic-U [8], Lambert-G [9], new exponential-H [10], and alpha beta-power-F [11], among many others.

One of the recent classes is called the Kavya–Manoharan-G (KM-G) family [12], which is used to introduce some generalized distributions include the KM Kumaraswamy distribution [13], KM log-logistic distribution [14], KM Kumaraswamy exponential distribution [15], KM Burr X distribution [16], KM power Lomax distribution [17].

In this article, we propose a flexible extension of the KM-G class by adding an extra shape parameter. The newly constructed class is called the modified Kavya–Manoharan-G (MKM-G) family, which increases the flexibility of the generated models. Furthermore, the discrete counterpart of the new MKM-G family is proposed. The discrete counterpart class is called the discrete modified Kavya–Manoharan-G (DMKM-G) family. The DMKM-G family is derived using the survival discretization (SD) approach.

The objectives of the current article are five-fold: (i) to propose two new flexible continuous and discrete families; (ii) to present four special lifetime models and a discrete model as special cases of the two proposed classes, with a more detailed analysis of the MKM-exponential (MKME) and DMKM-exponential (DMKME) sub-models; (iii) to address the mathematical characteristics of the MKM-G family; (iv) to discuss the estimation of MKME parameters using eight classical estimation approaches; and (v) to explore the empirical importance of the MKME and DMKME models through analysis of seven real-life datasets, including two count datasets.

We are motivated to introduce the MKM-G and DMKM-G families for several reasons: (i) The MKM-G special models can represent reversed-J shaped, right-skewed, and unimodal densities, as well as bathtub, increasing, modified bathtub, decreasing, and unimodal hazard rate (HR) shapes; (ii) These sub-models generalize several published lifetime models such as the KM Burr X and KM exponential models; (iii) The DMKME distribution exhibits unimodal, reversed-J, increasing, bathtub, and decreasing discrete HR shapes, making it more versatile than other count distributions, which typically only exhibit increasing or decreasing shapes; (iv) The special sub-models of both proposed classes are well-suited for modeling asymmetric lifetime and count data across various applied fields, including insurance, biology, medicine, engineering, and life testing; and (v) The empirical importance of the MKME and DMKME models is explored using seven real-lifetime and count datasets, showing that the two proposed distributions outperform several well-known lifetime and discrete distributions in modeling real-world data; and (vi) Finally, the new families offer simple analytical forms and exceptional flexibility, which may lead to wider applications in engineering, reliability, environmental sciences, insurance, medicine, and economics.

The rest of the paper is outlined in the following eight sections. In Section 2, the MKM-G and DMKM-G families are defined. Five special sub-models of the MKM-G and DMKM-G classes are presented in Section 3. In Section 4, some mathematical properties of the MKM-G class are derived. Estimation methods of the MKME parameters are discussed in Section 5. Detailed simulation results are presented in Section 6. Section 7 provides seven applications for real-lifetime and count data to show the flexibility of the MKME and DMKME distributions. Finally, some concluding remarks are explored in Section 8.

2 Synthesis of the MKM-G family and its discrete counterpart

If X is a random variable (RV) following the KM-G family with parameters α and θ (see [12]), then it has the following cumulative distribution function (CDF)

(1)

where and is the baseline CDF with a vector of parameters ϑ.

The corresponding probability density function (PDF) of (1) reduces to

(2)

Definition 1. A RV X is said to follow the MKM-G family, denoted by MKM-G (), if its CDF has the form

(3)

where θ is a shape parameter.

The corresponding PDF of Equation (3) has the form

(4)

The HR function (HRF) of the MKM-G family becomes

The added shape parameter θ allows us to explore the tail behavior of the density (4) and provides more flexibility as we can see in Section 3. Additionally, the importance of the MKM-G family follows from its ability to generate new flexible distributions which exhibit monotone and non-monotone failure rates.

Remark: The KM-G class follows as a special case by setting in Equation (3).

The SD approach is used to discretize the MKM-G family of distributions. The SD technique depends on the survival function (SF), say, and , which can be adopted to define the probability mass function (PMF) as follows

(5)

The SF of the MKM-G family has the form

Applying the SD approach in (5), the PMF of the DMKM-G family is given by

(6)

where

The CDF and SF of the DMKM-G family is given by

and

3 Five special lifetime and discrete models

This section presents four special sub-models of the MKM-G family. These sub-models provide flexible forms of some baseline distributions namely the exponential (E), Burr X (Bx), Burr XII (BXII), and log-logistic (LL) distributions. The special sub-models of the MKM-G class are called the MKME, MKM-Burr X (MKMBX), MKM-Burr XII (MKMBXII), and MKM-log logistic (MKMLL) distributions. The four special distributions provide decreasing, bathtub, reversed J shaped, increasing, unimodal, and modified bathtub shapes. Additionally, their densities provide right-skewed, symmetrical, and reversed-J shapes as displayed in Figs 1-4. Furthermore, the DMKME model is defined in Section 3.5. Figs 5 and 6 display the PMF and HRF plots of the DMKME model for different values of its parameters θ and λ. The HRF of the DMKME distribution provides unimodal, bathtub, increasing, reversed-J, and decreasing discrete failure rates.

Download:

Fig 1. Different shapes of the PDF and HRF of the MKME distribution.

https://doi.org/10.1371/journal.pone.0319091.g001

Download:

Fig 2. Different shapes of the PDF and HRF of the MKMBXII distribution.

https://doi.org/10.1371/journal.pone.0319091.g002

Download:

Fig 3. Different shapes of the PDF and HRF of the MKMBx distribution.

https://doi.org/10.1371/journal.pone.0319091.g003

Download:

Fig 4. Different shapes of the PDF and HRF of the MKMLL distribution.

https://doi.org/10.1371/journal.pone.0319091.g004

3.1 The MKME distribution

The CDF of the MKME distribution follows by setting the E CDF, , in Equation (3). Then, the CDF of the MKME distribution reduces to

The corresponding PDF and HRF of the MKME distribution have the forms

(7)

and

The RV with PDF (7) is denoted by . For , the MKME distribution reduces to the KME distribution [12]. Fig 1 gives some possible shapes of the density and HR functions of the MKME distribution.

3.2 The MKMBXII Distribution

By taking the CDF of the BXII distribution (for and ), say, , as a baseline CDF in (3), the MKMBXII CDF follows as

The PDF of the MKMBXII distribution reduces to

For , the MKMBXII distribution reduces to the KMBXII distribution. Fig 2 gives some shapes of the PDF and HRF of the MKMBXII distribution for different parametric values and λ.

3.3 The MKMBx distribution

Consider the CDF of the Bx distribution (for and ), say, . By inserting the Bx CDF in Equation (3), the CDF of the MKMBx distribution follows as

where .

The PDF of the MKMBx model reduces to

By setting in the above equation, the KMBx distribution [16] follows as a special case. The MKM-Rayleigh distribution is obtained when . The MKMBx model reduces to the KM-Rayleigh distribution for Some shapes of the density and failure rate functions of the MKMBx distribution are given in Fig 3, for different values of η and λ.

3.4 The MKMLL distribution

Consider the LL CDF (for and ), say, . By inserting the LL CDF in Equation (3), we obtain the CDF of the MKMLL distribution (for )

The MKMLL density takes the form

The MKMLL distribution reduces to the KMLL model for . Fig 4 provides the shapes of the density and hazard functions of the MKMLL distribution.

3.5 The DMKME distribution

The DMKME distribution is derived here by substituting the CDF of the E distribution in Equation (6), then the PMF of the DMKME distribution follows as

(8)

where . Then, the corresponding SF and CDF of (8) reduce to

and

The HRF of the DMKME takes the form

The plots of the PMF and HRF of the DMKME model are presented in Figs 5 and 6, for some choices of the parameters θ and λ. The DMKME HRF can be unimodal, bathtub, increasing and decreasing discrete HRF.

4 Properties of the MKM-G family

Some general mathematical properties of the MKM-G family are provided in this section.

4.1 Linear representation

Important and simple mixture representations for the CDF and PDF of the MKM-G family in terms of exponentiated-G (Exp-G) density are provided in this section.

Consider the exponential series, which is given by

(9)

Applying (9) to Equation (3), we obtain

Hence, the MKM-G CDF is expressed as

(10)

where And is the Exp-G CDF with power parameter . By differentiating Equation (10), the PDF of the MKM-G family reduces to

(11)

where and is the Exp-G PDF with . Thus, many mathematical characteristics of the MKM-G family follow simply from those of the Exp-G family.

4.2 Quantile function and moments

The quantile function (QF) of the MKM-G family, say, , follows by inverting (3). Then, the MKM-G QF takes the form

where ais the baseline QF and .

Hereafter, let denotes the Exp-G RV with positive power parameter k. Hence, based on Equation (11), the rth moment of X has the form

(12)

Setting in Equation (12) gives the mean of X ().

The moment generating function (MGF) of the RV X is defined by . Hence, the MGF of the MKM-G family can be derived from (11) in two different forms. The first one follows as

where is the MGF of . Then, of the MKM-G family is calculated based on the MGF of the Exp-G class.

The second formula for takes the form

where .

The qth incomplete moment of X is expressed based on (11) as

(13)

The first incomplete moment (FIM), say, , follows from (13) when . It also can be calculated by two formulae. The first formula for follows from (13) as

(14)

where is the FIM of the Exp-G class. The second formula for reduces to

where , where is computed numerically and is the baseline QF.

Bonferroni () and Lorenz curves are useful applications for because they can be calculated, for a given probability τ, by and , where is the mean of X and is the QF of X at τ. The two curves have important applications in demography, economics, insurance, reliability, and medicine. Additionally, the mean deviations about the mean and about the median of X, where is the median and is simply evaluated from (3).

4.3 Entropies

The variation of the uncertainty can be measured by the Rényi entropy, say , which is given by

Using the MKM-G density (4), we obtain

By applying the power series in Equation (9), we have

Then, reduces to

Hence, follows as

where

Then, the Rényi entropy of the MKM-G family reduces to

The Shannon entropy follows from the Rényi entropy when ϕ tends to 1.

4.4 Mean residual life and mean inactivity time

The expected additional life length for a unit, which is alive at age t can be expressed by the mean residual life (MRL). The MRL of a RV X is defined (for ) by . The MRL of X reduces to

(15)

where is the CDF of the MKM-G family. Inserting (14) in Equation (15) gives

The waiting time elapsed since the failure of an item on condition that this failure had occurred in is expressed as the mean inactivity time (MIT). The MIT of X, say follows as

Combining the last equation and Equation (14), the MIT of X takes the form

4.5 Probability weighted moments

The expectation of a certain function of a RV whose mean exists is known as the probability weighted moments (PWMs). The th PWM of X has the form

Using the CDF and PDF of the MKM-G family, we can write

Applying the exponential and binomial series, the above equation reduces to

(16)

Equivalently, we can write

where

Then, the PWM of the MKM-G family follows as

4.6 Order statistics

Let be a random sample from the MKM-G family. The density of the ith order statistic, say , has the form

(17)

where is the beta function.

Using Equations (3) and (4) of the MKM-G family, we obtain

Based on Equation (16), the last equation has the form

(18)

where is the Exp-G PDF with parameter and

Then, the PDF of is expressed by

(19)

Hence, the PDF of the MKM-G order statistic is a linear combination of Exp-G densities. Equation (19) illustrates that the properties of can be derived from those properties of .

The rth moment of follows as

4.7 Moments of the MKME distribution

This subsection provides a simple expression for the rth moment of the MKME model.

Based on Equation (11), the PDF of the MKME distribution reduces to

Applying the binomial expansion to the last term, the above equation becomes

(20)

where

and is the PDF of the E model with scale parameter . Then, the MKME PDF is expressed as a single linear combination of E PDFs.

The rth moment of the MKME distribution is obtained from Equation (20) as follows

(21)

The mean of say, , follows from (21) with Furthermore, the R software is used to obtain some numerical values for the , variance , skewness , and kurtosis measures of the MKME distribution. The values of the given measures are reported in Table 1 It is noted that that the spread for its is much larger ranging from 6.6978 to 18.6635, whereas the of the MKME distribution can range in the interval (1.4482, 3.2662).

Download:

Table 1. The values of some measures of the MKME distribution for several values of θ and λ.

https://doi.org/10.1371/journal.pone.0319091.t001

Additionally, Table 2 provides the numerical values of for the MKME model based on the numerical integration (NUI) and summation (SUM) formula for several values of λ and θ at truncated M terms, where L is the truncated terms from this summation. Table 2 shows that the summation in (21) converges to the NUI of for all values of λ and θ when L reaches to 50.

Download:

Table 2. The mean of the MKME model based on the SUM and NUI formulae for some values of λ and θ at truncated M terms.

https://doi.org/10.1371/journal.pone.0319091.t002

The QF of the MKME distribution is

5 Estimation approaches

This section discusses the estimation of the MKME parameters using some classical methods called the maximum likelihood (ML), least-squares (LS), weighted least-squares (WLS), maximum product of spacing (MPS), percentiles (PC), Cramér–von Mises (CM), Anderson–Darling (AD), and right-tail AD (RTAD) estimators.

Let be a random sample from the MKME distribution and be their order statistics. The log-likelihood function, say, l, reduces to

where

The ML estimators (MLEs) of θ and λ are determined by maximizing the above equation with respect to the parameters θ and λ, or by solving the following two equations

and

The MLEs are also calculated by using different statistical programs such as Mathematica, Mathcad, and R (optim function), among others.

Swain et al. [18] estimated the parameters of the beta distribution using the LS estimators (LSEs) and WLS estimators (WLSEs). The LSEs and WLSEs of the MKME parameters θ and λ are determined by minimizing

where for the LS approach, for the WLS approach and

Furthermore, the LSEs and WLSEs can be obtained by solving the following nonlinear equations

where

(22)

and

(23)

Cheng and Amin [19, 20] pioneered the MPS approach as an alternative method to estimate the parameters of different continuous univariate models. The uniform spacings, say , of a random sample of size n from the MKME distribution are defined by

where and

The MPS estimators (MPSEs) of the MKME parameters can be determined by maximizing

with respect to θ and λ.

Additionally, the MPSEs of the MKME parameters are also calculated by solving

where are defined in Equations (22) and (23) for .

The PC estimators (PCEs) are proposed by [21] to estimate the model parameters by equating the sample PC points with the population PC points. If is an unbiased estimator of , then the PCEs of the parameters of the MKME distribution follow by minimizing

with respect to θ and λ.

Cramér [22] and Von Mises [23] introduced the CVM estimators (CVMEs) which are obtained as the difference between the estimated CDF and empirical CDF.

The CVMEs of the MKME parameters can be determined by minimizing

Furthermore, the CVMEs are also obtained by solving the following nonlinear equations

The AD estimators (ADEs) are an important type of minimum distance estimators.

The ADEs of the MKME parameters can be determined by minimizing

with respect to θ and λ. The ADEs are also calculated by solving the following equations

The RTAD estimators (RTADEs) of θ and λ are given by minimizing

6 Simulation results

This section explores the performance of several estimators of the MKME parameters by using detailed simulation studies. We generate 5000 samples from the MKME distribution based on some sample sizes and some parametric values of the two parameters and . We calculate the average estimates (AEs) of the parameters and mean square errors (MSEs) for each sample size using the eight estimators.

The performance of the studied estimators is evaluated using MSEs. The AEs and MSEs (in parentheses) for different estimators are presented in Tables 3–6. It is observed that as the sample size n increases, the AEs converge to the true parameter values, and the MSEs decrease towards zero, demonstrating that the estimators are asymptotically unbiased. Overall, the numerical simulations show that all eight estimation methods perform excellently with respect to the MSEs.

Download:

Table 3. The AEs and MSEs of the MKME parameters for

.

https://doi.org/10.1371/journal.pone.0319091.t003

Download:

Table 4. The AEs and MSEs of the MKME parameters for

.

https://doi.org/10.1371/journal.pone.0319091.t004

Download:

Table 5. The AEs and MSEs of the MKME parameters for

.

https://doi.org/10.1371/journal.pone.0319091.t005

Download:

Table 6. The AEs and MSEs of the MKME parameters for

.

https://doi.org/10.1371/journal.pone.0319091.t006

7 Lifetime and count data analysis in applied sciences

This section is devoted to showing the empirical flexibility of the special models of the two proposed families using some real-life data, including both continuous and discrete data.

7.1 Applications for the MKME model

In this subsection, we analyze five real-life datasets from applied fields including environmental sciences, medicine, and engineering to explore the flexibility of the MKME model. The first dataset refers to waiting times (in minutes) before the service of 100 bank customers. This dataset is studied by [24]. The data observations are: 3.2, 0.8, 4.6, 1.9, 0.8, 3.3, 6.2, 4.7, 6.2, 8, 7.7, 9.7, 12.5, 9.8, 12.9, 18.1, 17.3, 27, 1.3, 31.6, 3.5, 6.2, 4.7, 8.2, 13, 10.7, 18.2, 13, 1.5, 33.1, 1.8, 3.6, 1.9, 4, 2.6, 4.8, 4.1, 4.9, 6.3, 4.9, 6.7, 4.2, 8.6, 11, 6.9, 8.6, 11.2, 10.9, 8.6, 11, 13.7, 11.2, 13.6, 13.3, 18.4, 19, 18.9, 38.5, 2.1, 4.2, 5, 4.3, 5.3, 7.1, 5.5, 7.1, 8.8, 7.1, 8.9, 11.1, 8.8, 13.9, 19.9, 14.1, 20.6, 2.7, 21.3, 4.4, 2.9, 4.3, 3.1, 21.9, 4.4, 5.7, 9.6, 7.1, 6.1, 7.4, 8.9, 7.6, 5.7, 11.5, 9.5, 11.9, 15.4, 12.4, 17.3, 15.4, 23, 21.4.

The second dataset is discussed [25], and it contains annual maximum flood levels over a 20-year period of the Susquehanna River at Harrisburg, Pennsylvania. These flood levels are measured in millions cubic of feet per second. The data observations are: 0.494, 0.654, 0.315, 0.297, 0.449, 0.379, 0.402, 0.379, 0.423, 0.324, 0.740, 0.269, 0.418, 0.416, 0.412, 0.338, 0.484, 0.392, 0.265, 0.613.

The third dataset is analyzed by [26], and it refers to remission times (in months) for 128 bladder cancer patients. This dataset contains the following observations: 3.48, 0.08, 2.09, 4.87, 8.66, 6.94, 13.11, 0.20, 23.63, 2.23, 4.98, 3.52, 6.97, 13.29, 9.02, 0.40, 3.57, 2.26, 5.06, 9.22, 7.09, 13.80, 0.50, 25.74, 2.46, 5.09, 3.64, 7.26, 14.24, 9.47, 25.82, 2.54, 0.51, 3.70, 7.28, 5.17, 9.74, 26.31, 14.76, 0.81, 3.82, 2.62, 5.32, 10.06, 7.32, 14.77, 2.64, 32.15, 3.88, 7.39, 5.32, 10.34, 34.26, 14.83, 0.90, 4.18, 2.69, 5.34, 10.66, 7.59, 15.96, 1.05, 36.66, 2.69, 5.41, 4.23, 7.62, 16.62, 10.75, 43.01, 2.75, 1.19, 4.26, 7.63, 5.41, 17.12, 12.63, 1.26, 46.12, 2.83, 5.49, 4.33, 7.66, 17.14, 11.25, 79.05, 2.87, 1.35, 5.62, 11.64, 7.87, 17.36, 3.02, 1.40, 4.34, 7.93, 5.71, 11.79, 1.46, 18.10, 4.40, 8.26, 5.85, 11.98, 1.76, 19.13, 3.25, 6.25, 4.50, 8.37, 2.02, 22.69, 12.02, 3.31, 6.54, 4.51, 8.53, 20.28, 12.03, 2.02, 6.76, 3.36, 12.07, 2.07, 21.73, 3.36, 8.65, 6.93.

The fourth dataset is studied by [27], and it refers to the number of vehicle fatalities for 39 counties in South Carolina in the year 2012. The data observations are: 50, 22, 13, 17, 26, 4, 9, 27, 9, 48, 31, 20, 6, 12, 5, 9, 14, 16, 33, 9, 68, 3, 20, 51, 4, 13, 17, 2, 16, 52, 6, 48, 12, 23, 10, 8, 13, 1, 15.

The fifth dataset is considered by [28]. This dataset refers to the time between failures for 30 repairable items, and its observations are: 0.70, 0.11, 1.43, 0.71, 2.63, 0.77, 1.49, 2.46, 3.46, 0.59, 1.23, 0.74, 1.17, 0.94, 0.40, 4.36, 1.74, 2.23, 4.73, 0.45, 1.46, 1.06, 0.30, 2.37, 1.82, 0.63, 1.24, 1.23, 1.86, 1.97.

The ML method is used to estimate the model parameters from each dataset, and the R program is used to obtain different computations. The fitting performance of the MKME distribution is compared to other competing E models including the generalized E (GE) [29], Marshall–Olkin E (MOE), alpha-power E (APE) [30], generalized DUS E (GDUSE) [31], generalized inverted E (GIE) [32], KME, and E distributions.

The fitting performance of the proposed MKME model and other competing distributions is explored using some goodness-of-fit measures including the Anderson–Darling Cramér–von Mises and Kolmogorov–Smirnov (KS) statistics with its associated p-value.

The values of the four measures, , , and KS as well as the p-value, of the fitted models are given in Tables Tables 7–11 for the five datasets, respectively. These tables also display the ML estimates and standard errors (SEs) of the parameters of the MKME distribution and other rival models. It is shown that, the MKME distribution has the lowest values of , , and KS statistics and largest p-value, showing its close fit to the five analyzed datasets. Furthermore, the fitting performance of the MKME model is explored visually through the plots of the PDF, CDF, SF, and probability-probability (PP) for all datasets. The plots are presented in Figs Fig 8, Fig 9, Fig 10. The plots indicate that the MKME provides the best fit to all datasets.

Download:

Table 7. The findings from waiting times dataset for the fitted distributions.

https://doi.org/10.1371/journal.pone.0319091.t007

Download:

Table 8. The findings from flood levels dataset for the fitted distributions.

https://doi.org/10.1371/journal.pone.0319091.t008

Download:

Table 9. The findings from cancer dataset for the fitted distributions.

https://doi.org/10.1371/journal.pone.0319091.t009

Download:

Table 10. The findings from vehicle fatalities dataset for the fitted distributions.

https://doi.org/10.1371/journal.pone.0319091.t010

Download:

Table 11. The findings from failures times dataset for the fitted distributions.

https://doi.org/10.1371/journal.pone.0319091.t011

Download:

Fig 5. Plots of the PMF of the DMKME distribution for selected values of θ and λ.

https://doi.org/10.1371/journal.pone.0319091.g005

Download:

Fig 6. Plots of the HRF of the DMKME distribution for selected values of θ and λ.

https://doi.org/10.1371/journal.pone.0319091.g006

Download:

Fig 7. Plots of the GS and MK for the MKME distribution.

https://doi.org/10.1371/journal.pone.0319091.g007

Download:

Fig 8. The fitted functions plots of the MKME model for waiting times dataset (left panel) and flood levels dataset (right panel).

https://doi.org/10.1371/journal.pone.0319091.g008

Download:

Fig 9. The fitted functions plots of the MKME model for cancer dataset (left panel) and vehicle fatalities dataset (right panel).

https://doi.org/10.1371/journal.pone.0319091.g009

Download:

Fig 10. The fitted functions plot the MKME model for failures times dataset.

https://doi.org/10.1371/journal.pone.0319091.g010

7.2 Applications of the DMKME Model

In this subsection, we use two real count datasets to illustrate the flexibility of the DMKME distribution as compared to the existing discrete models including the transmuted record type geometric (TRTG) [33], exponentiated discrete Lindley (EDL) [34], natural discrete Lindley (NDL) [35], discrete Lindley (DL) [36], geometric (Gc), discrete Ramos-Louzada (DRL) [37], and discrete Poisson-Lindley (DPL) [38] distributions.

The first dataset represents the final marks of mathematics examination for 48 slow space students. This exam was done in the Indian Institute of Technology at Kanpur as discussed in [39]. The data are: 60, 29, 25, 50, 15, 13, 39, 27, 14, 15, 40, 18, 7, 7, 8, 19, 37, 12, 18, 5, 21, 15, 44, 86, 15, 14, 15, 70, 6, 50, 23, 58, 19, 23, 11, 6, 34, 18, 34, 12, 4, 28, 20, 23, 65, 19, 31, 21.

The second dataset represents the infant mortality rate per 1000 live births for some nations in 2021. The dataset is reported at https://data.worldbank.org/ indicator/SP.DYN.IMRT.IN. The data are: 12, 56, 10, 3, 22, 69, 7, 6, 11, 19, 44, 27, 13, 7, 12, 4, 3, 11, 27, 84, 25, 35, 6, 14, 6, 11.

Table 12 shows the ML estimates of the parameters of the discrete models and their corresponding SEs (in parenthesis) for the two count datasets. Furthermore, Table 12 provides the goodness-of-fit measures for all discrete models.

Download:

Table 12. The ML estimates and goodness-of-fit measures for the DMKME and other discrete models.

https://doi.org/10.1371/journal.pone.0319091.t012

The values in Table 12 indicate that the DMKME distribution outperforms all competing discrete models, with the lowest values for , f and KS measures and the highest p-value, for the two count datasets. The PP plots for the two count datasets are displayed in Figs 11 and 12, respectively. These plots support the superior fit of the DMKME model, which provides a closer fit for both datasets as compared to other discrete distributions.

Download:

Fig 11. The PP plots of the fitted discrete models for dataset I.

https://doi.org/10.1371/journal.pone.0319091.g011

Download:

Fig 12. The PP plots of the fitted discrete models for dataset II.

https://doi.org/10.1371/journal.pone.0319091.g012

8 Some conclusions

We proposed two new classes of distributions called the modified Kavya-Manoharan-G (MKM-G) and discrete modified Kavya-Manoharan-G (DMKM-G) families. The MKM-G class extends and improves the flexibility of the Kavya-Manoharan (KM) family. Five special models of the MKM-G and DMKM-G families are provided. These special models have the advantage of being capable of modeling different shapes of aging and failure criteria. Basic mathematical properties of the MKM-G family are explored. We discuss eight approaches for estimating the parameters of the MKM-exponential (MKME) distribution, evaluating their performance through numerical simulations. We demonstrate that the MKME distribution provides a superior fit to five real-life datasets from the fields of reliability, medicine, engineering, and environmental science, outperforming several existing exponential models. Additionally, two count datasets are analyzed to illustrate the flexibility of the DMKM-exponential model. The newly discrete model provides better fits as compared to other important discrete distributions. Overall, the proposed models provide flexible and effective alternatives to existing distributions for modeling both lifetime and count data across diverse applied fields.

Future work on the MKM-G and DMKM-G families could include extending these models to bivariate and multivariate distributions for modeling dependent data in complex applications, as well as further generalizing them to incorporate more flexible hazard rate functions. Advanced estimation techniques, such as Bayesian or machine learning approaches, could be explored to improve accuracy in large datasets. The families could also be tested on big data from industries like finance, healthcare, and telecommunications. A comparative study with other flexible distribution families, the development of user-friendly software for implementation, and the extension to handle censored or truncated data are additional areas for future research.

References

1. Marshall A. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika. 1997;84(3):641–52.
- View Article
- Google Scholar
2. Cordeiro GM, de Castro M. A new family of generalized distributions. Journal of Statistical Computation and Simulation. 2011;81(7):883–98.
- View Article
- Google Scholar
3. Bourguignon M, Silva RB, Cordeiro GM. The Weibull-G Family of Probability Distributions. Journal of Data Science. 2021;12(1):53–68.
- View Article
- Google Scholar
4. Afify AZ, Cordeiro GM, Yousof HM, Alzaatreh A, Nofal ZM. The Kumaraswamy Transmuted-G Family of Distributions: Properties and Applications. Journal of Data Science. 2021;14(2):245–70.
- View Article
- Google Scholar
5. Afify AZ, Alizadeh M. The odd Dagum family of distributions: Properties and applications. Journal of Applied Probability and Statistics. 2020;15(1):45–72.
- View Article
- Google Scholar
6. Zaidi SM, Sobhi MMA, El-Morshedy M, Afify AZ. A new generalized family of distributions: Properties and applications. MATH. 2021;6(1):456–76.
- View Article
- Google Scholar
7. Shama MS, El Ktaibi F, Al Abbasi JN, Chesneau C, Afify AZ. Complete Study of an Original Power-Exponential Transformation Approach for Generalizing Probability Distributions. Axioms. 2023;12(1):67.
- View Article
- Google Scholar
8. Zhao Y, Ahmad Z, Alrumayh A, Yusuf M, Aldallal R, Elshenawy A, et al. A novel logarithmic approach to generate new probability distributions for data modeling in the engineering sector. Alexandria Engineering Journal. 2023;62:313–25.
- View Article
- Google Scholar
9. Abbasi JNA, Afify AZ, Alnssyan B, Shama MS. The Lambert-G Family: Properties, Inference, and Applications. CMES. 2024;140(1):513–36.
- View Article
- Google Scholar
10. Jamal F, Alqawba M, Altayab Y, Iqbal T, Afify AZ. A unified exponential-H family for modeling real-life data: Properties and inference. Heliyon. 2024;10(6):e27661. pmid:38509929
- View Article
- PubMed/NCBI
- Google Scholar
11. Semary HE, Hussain Z, Hamdi WA, Aldahlan MA, Elbatal I, Nagarjuna VBV. Alpha–beta-power family of distributions with applications to exponential distribution. Alexandria Engineering Journal. 2024;100:15–31.
- View Article
- Google Scholar
12. Kavya P, Manoharan M. Some parsimonious models for lifetimes and applications. Journal of Statistical Computation and Simulation. 2021;91(18):3693–708.
- View Article
- Google Scholar
13. Alotaibi N, Elbatal I, Shrahili M, Al-Moisheer AS, Elgarhy M, Almetwally EM. Statistical Inference for the Kavya–Manoharan Kumaraswamy Model under Ranked Set Sampling with Applications. Symmetry. 2023;15(3):587.
- View Article
- Google Scholar
14. Al-Nefaie AH. Applications to Bio-Medical data and statistical inference for a Kavya-Manoharan log-logistic model. Journal of Radiation Research and Applied Sciences. 2023;16(1):100523.
- View Article
- Google Scholar
15. Eldessouky EA, Hassan OHM, Elgarhy M, Hassan EAA, Elbatal I, Almetwally EM. A New Extension of the Kumaraswamy Exponential Model with Modeling of Food Chain Data. Axioms. 2023;12(4):379.
- View Article
- Google Scholar
16. Hassan OHM, Elbatal I, Al-Nefaie AH, Elgarhy M. On the Kavya–Manoharan–Burr X Model: Estimations under Ranked Set Sampling and Applications. JRFM. 2022;16(1):19.
- View Article
- Google Scholar
17. Riad FH, Radwan A, Almetwally EM, Elgarhy M. A new heavy tailed distribution with actuarial measures. Journal of Radiation Research and Applied Sciences. 2023;16(2):100562.
- View Article
- Google Scholar
18. Swain JJ, Venkatraman S, Wilson JR. Least-squares estimation of distribution functions in johnson’s translation system. Journal of Statistical Computation and Simulation. 1988;29(4):271–97.
- View Article
- Google Scholar
19. Cheng R, Amin N. Maximum product-of-spacings estimation with applications to the lognormal distribution. Mathematical Report. 1979;791.
- View Article
- Google Scholar
20. Cheng RCH, Amin NAK. Estimating Parameters in Continuous Univariate Distributions with a Shifted Origin. Journal of the Royal Statistical Society Series B: Statistical Methodology. 1983;45(3):394–403.
- View Article
- Google Scholar
21. Kao JHK. Computer Methods for Estimating Weibull Parameters in Reliability Studies. IRE Trans Reliab Qual Control. 1958;13:15–22.
- View Article
- Google Scholar
22. Cramér H. On the composition of elementary errors. Scandinavian Actuarial Journal. 1928;1928(1):13–74.
- View Article
- Google Scholar
23. Von Mises RE. Wahrscheinlichkeit Statistik und Wahrheit. Switzerland: Springer; 1928.
24. Ghitany ME, Atieh B, Nadarajah S. Lindley distribution and its application. Mathematics and Computers in Simulation. 2008;78(4):493–506.
- View Article
- Google Scholar
25. Dumonceaux R, Antle CE. Discrimination Between the Log-Normal and the Weibull Distributions. Technometrics. 1973;15(4):923–6.
- View Article
- Google Scholar
26. Lee ET, Wang JW. Statistical Methods for Survival Data Analysis. USA: John Wiley and Sons; 2003.
27. Mann PS. Introductory Statistics. New York (USA): Wiley Publisher; 2016.
28. Murthy DP, Xie M and Jiang R. Weibull Models. USA: John Wiley & Sons; 2004.
29. Gupta RD, Kundu D. Theory & Methods: Generalized exponential distributions. Aus NZ J of Statistics. 1999;41(2):173–88.
- View Article
- Google Scholar
30. Mahdavi A, Kundu D. A new method for generating distributions with an application to exponential distribution. Communications in Statistics - Theory and Methods. 2016;46(13):6543–57.
- View Article
- Google Scholar
31. Maurya SK, Kaushik A, Singh SK, Singh U. A new class of distribution having decreasing, increasing, and bathtub-shaped failure rate. Communications in Statistics - Theory and Methods. 2016;46(20):10359–72.
- View Article
- Google Scholar
32. Abouammoh AM, Alshingiti AM. Reliability estimation of generalized inverted exponential distribution. Journal of Statistical Computation and Simulation. 2009;79(11):1301–15.
- View Article
- Google Scholar
33. Almazah MMA, Erbayram T, Akdoğan Y, AL Sobhi MM, Afify AZ. A New Extended Geometric Distribution: Properties, Regression Model, and Actuarial Applications. Mathematics. 2021;9(12):1336.
- View Article
- Google Scholar
34. El-Morshedy M, Eliwa MS, Nagy H. A new two-parameter exponentiated discrete Lindley distribution: properties, estimation and applications. J Appl Stat. 2019;47(2):354–75. pmid:35706520
- View Article
- PubMed/NCBI
- Google Scholar
35. Al-Babtain AA, Ahmed AHN, Afify AZ. A New Discrete Analog of the Continuous Lindley Distribution, with Reliability Applications. Entropy (Basel). 2020;22(6):603. pmid:33286375
- View Article
- PubMed/NCBI
- Google Scholar
36. Bakouch HS, Jazi MA, Nadarajah S. A new discrete distribution. Statistics. 2012;48(1):200–40.
- View Article
- Google Scholar
37. Afify AZ, Elmorshedy M, Eliwa MS. A New Skewed Discrete Model: Properties, Inference, and Applications. Pak J Stat Oper Res. 2021;17:799–816.
- View Article
- Google Scholar
38. Sankaran M. 275. Note: The Discrete Poisson-Lindley Distribution. Biometrics. 1970;26(1):145.
- View Article
- Google Scholar
39. Gupta RD, Kundu D. A new class of weighted exponential distributions. Statistics. 2009;43(6):621–34.
- View Article
- Google Scholar

[ref1] 1. Marshall A. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika. 1997;84(3):641–52.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Cordeiro GM, de Castro M. A new family of generalized distributions. Journal of Statistical Computation and Simulation. 2011;81(7):883–98.
View Article
Google Scholar

[5] View Article

[6] Google Scholar

[ref3] 3. Bourguignon M, Silva RB, Cordeiro GM. The Weibull-G Family of Probability Distributions. Journal of Data Science. 2021;12(1):53–68.
View Article
Google Scholar

[8] View Article

[9] Google Scholar

[ref4] 4. Afify AZ, Cordeiro GM, Yousof HM, Alzaatreh A, Nofal ZM. The Kumaraswamy Transmuted-G Family of Distributions: Properties and Applications. Journal of Data Science. 2021;14(2):245–70.
View Article
Google Scholar

[11] View Article

[12] Google Scholar

[ref5] 5. Afify AZ, Alizadeh M. The odd Dagum family of distributions: Properties and applications. Journal of Applied Probability and Statistics. 2020;15(1):45–72.
View Article
Google Scholar

[14] View Article

[15] Google Scholar

[ref6] 6. Zaidi SM, Sobhi MMA, El-Morshedy M, Afify AZ. A new generalized family of distributions: Properties and applications. MATH. 2021;6(1):456–76.
View Article
Google Scholar

[17] View Article

[18] Google Scholar

[ref7] 7. Shama MS, El Ktaibi F, Al Abbasi JN, Chesneau C, Afify AZ. Complete Study of an Original Power-Exponential Transformation Approach for Generalizing Probability Distributions. Axioms. 2023;12(1):67.
View Article
Google Scholar

[20] View Article

[21] Google Scholar

[ref8] 8. Zhao Y, Ahmad Z, Alrumayh A, Yusuf M, Aldallal R, Elshenawy A, et al. A novel logarithmic approach to generate new probability distributions for data modeling in the engineering sector. Alexandria Engineering Journal. 2023;62:313–25.
View Article
Google Scholar

[23] View Article

[24] Google Scholar

[ref9] 9. Abbasi JNA, Afify AZ, Alnssyan B, Shama MS. The Lambert-G Family: Properties, Inference, and Applications. CMES. 2024;140(1):513–36.
View Article
Google Scholar

[26] View Article

[27] Google Scholar

[ref10] 10. Jamal F, Alqawba M, Altayab Y, Iqbal T, Afify AZ. A unified exponential-H family for modeling real-life data: Properties and inference. Heliyon. 2024;10(6):e27661. pmid:38509929
View Article
PubMed/NCBI
Google Scholar

[29] View Article

[30] PubMed/NCBI

[31] Google Scholar

[ref11] 11. Semary HE, Hussain Z, Hamdi WA, Aldahlan MA, Elbatal I, Nagarjuna VBV. Alpha–beta-power family of distributions with applications to exponential distribution. Alexandria Engineering Journal. 2024;100:15–31.
View Article
Google Scholar

[33] View Article

[34] Google Scholar

[ref12] 12. Kavya P, Manoharan M. Some parsimonious models for lifetimes and applications. Journal of Statistical Computation and Simulation. 2021;91(18):3693–708.
View Article
Google Scholar

[36] View Article

[37] Google Scholar

[ref13] 13. Alotaibi N, Elbatal I, Shrahili M, Al-Moisheer AS, Elgarhy M, Almetwally EM. Statistical Inference for the Kavya–Manoharan Kumaraswamy Model under Ranked Set Sampling with Applications. Symmetry. 2023;15(3):587.
View Article
Google Scholar

[39] View Article

[40] Google Scholar

[ref14] 14. Al-Nefaie AH. Applications to Bio-Medical data and statistical inference for a Kavya-Manoharan log-logistic model. Journal of Radiation Research and Applied Sciences. 2023;16(1):100523.
View Article
Google Scholar

[42] View Article

[43] Google Scholar

[ref15] 15. Eldessouky EA, Hassan OHM, Elgarhy M, Hassan EAA, Elbatal I, Almetwally EM. A New Extension of the Kumaraswamy Exponential Model with Modeling of Food Chain Data. Axioms. 2023;12(4):379.
View Article
Google Scholar

[45] View Article

[46] Google Scholar

[ref16] 16. Hassan OHM, Elbatal I, Al-Nefaie AH, Elgarhy M. On the Kavya–Manoharan–Burr X Model: Estimations under Ranked Set Sampling and Applications. JRFM. 2022;16(1):19.
View Article
Google Scholar

[48] View Article

[49] Google Scholar

[ref17] 17. Riad FH, Radwan A, Almetwally EM, Elgarhy M. A new heavy tailed distribution with actuarial measures. Journal of Radiation Research and Applied Sciences. 2023;16(2):100562.
View Article
Google Scholar

[51] View Article

[52] Google Scholar

[ref18] 18. Swain JJ, Venkatraman S, Wilson JR. Least-squares estimation of distribution functions in johnson’s translation system. Journal of Statistical Computation and Simulation. 1988;29(4):271–97.
View Article
Google Scholar

[54] View Article

[55] Google Scholar

[ref19] 19. Cheng R, Amin N. Maximum product-of-spacings estimation with applications to the lognormal distribution. Mathematical Report. 1979;791.
View Article
Google Scholar

[57] View Article

[58] Google Scholar

[ref20] 20. Cheng RCH, Amin NAK. Estimating Parameters in Continuous Univariate Distributions with a Shifted Origin. Journal of the Royal Statistical Society Series B: Statistical Methodology. 1983;45(3):394–403.
View Article
Google Scholar

[60] View Article

[61] Google Scholar

[ref21] 21. Kao JHK. Computer Methods for Estimating Weibull Parameters in Reliability Studies. IRE Trans Reliab Qual Control. 1958;13:15–22.
View Article
Google Scholar

[63] View Article

[64] Google Scholar

[ref22] 22. Cramér H. On the composition of elementary errors. Scandinavian Actuarial Journal. 1928;1928(1):13–74.
View Article
Google Scholar

[66] View Article

[67] Google Scholar

[ref23] 23. Von Mises RE. Wahrscheinlichkeit Statistik und Wahrheit. Switzerland: Springer; 1928.

[ref24] 24. Ghitany ME, Atieh B, Nadarajah S. Lindley distribution and its application. Mathematics and Computers in Simulation. 2008;78(4):493–506.
View Article
Google Scholar

[70] View Article

[71] Google Scholar

[ref25] 25. Dumonceaux R, Antle CE. Discrimination Between the Log-Normal and the Weibull Distributions. Technometrics. 1973;15(4):923–6.
View Article
Google Scholar

[73] View Article

[74] Google Scholar

[ref26] 26. Lee ET, Wang JW. Statistical Methods for Survival Data Analysis. USA: John Wiley and Sons; 2003.

[ref27] 27. Mann PS. Introductory Statistics. New York (USA): Wiley Publisher; 2016.

[ref28] 28. Murthy DP, Xie M and Jiang R. Weibull Models. USA: John Wiley & Sons; 2004.

[ref29] 29. Gupta RD, Kundu D. Theory & Methods: Generalized exponential distributions. Aus NZ J of Statistics. 1999;41(2):173–88.
View Article
Google Scholar

[79] View Article

[80] Google Scholar

[ref30] 30. Mahdavi A, Kundu D. A new method for generating distributions with an application to exponential distribution. Communications in Statistics - Theory and Methods. 2016;46(13):6543–57.
View Article
Google Scholar

[82] View Article

[83] Google Scholar

[ref31] 31. Maurya SK, Kaushik A, Singh SK, Singh U. A new class of distribution having decreasing, increasing, and bathtub-shaped failure rate. Communications in Statistics - Theory and Methods. 2016;46(20):10359–72.
View Article
Google Scholar

[85] View Article

[86] Google Scholar

[ref32] 32. Abouammoh AM, Alshingiti AM. Reliability estimation of generalized inverted exponential distribution. Journal of Statistical Computation and Simulation. 2009;79(11):1301–15.
View Article
Google Scholar

[88] View Article

[89] Google Scholar

[ref33] 33. Almazah MMA, Erbayram T, Akdoğan Y, AL Sobhi MM, Afify AZ. A New Extended Geometric Distribution: Properties, Regression Model, and Actuarial Applications. Mathematics. 2021;9(12):1336.
View Article
Google Scholar

[91] View Article

[92] Google Scholar

[ref34] 34. El-Morshedy M, Eliwa MS, Nagy H. A new two-parameter exponentiated discrete Lindley distribution: properties, estimation and applications. J Appl Stat. 2019;47(2):354–75. pmid:35706520
View Article
PubMed/NCBI
Google Scholar

[94] View Article

[95] PubMed/NCBI

[96] Google Scholar

[ref35] 35. Al-Babtain AA, Ahmed AHN, Afify AZ. A New Discrete Analog of the Continuous Lindley Distribution, with Reliability Applications. Entropy (Basel). 2020;22(6):603. pmid:33286375
View Article
PubMed/NCBI
Google Scholar

[98] View Article

[99] PubMed/NCBI

[100] Google Scholar

[ref36] 36. Bakouch HS, Jazi MA, Nadarajah S. A new discrete distribution. Statistics. 2012;48(1):200–40.
View Article
Google Scholar

[102] View Article

[103] Google Scholar

[ref37] 37. Afify AZ, Elmorshedy M, Eliwa MS. A New Skewed Discrete Model: Properties, Inference, and Applications. Pak J Stat Oper Res. 2021;17:799–816.
View Article
Google Scholar

[105] View Article

[106] Google Scholar

[ref38] 38. Sankaran M. 275. Note: The Discrete Poisson-Lindley Distribution. Biometrics. 1970;26(1):145.
View Article
Google Scholar

[108] View Article

[109] Google Scholar

[ref39] 39. Gupta RD, Kundu D. A new class of weighted exponential distributions. Statistics. 2009;43(6):621–34.
View Article
Google Scholar

[111] View Article

[112] Google Scholar

Figures

Abstract

1 Introduction

2 Synthesis of the MKM-G family and its discrete counterpart

3 Five special lifetime and discrete models

3.1 The MKME distribution

3.2 The MKMBXII Distribution

3.3 The MKMBx distribution

3.4 The MKMLL distribution

3.5 The DMKME distribution

4 Properties of the MKM-G family

4.1 Linear representation

4.2 Quantile function and moments

4.3 Entropies

4.4 Mean residual life and mean inactivity time

4.5 Probability weighted moments

4.6 Order statistics

4.7 Moments of the MKME distribution

5 Estimation approaches

6 Simulation results

7 Lifetime and count data analysis in applied sciences

7.1 Applications for the MKME model

7.2 Applications of the DMKME Model

8 Some conclusions

References