The unit ratio-extended Weibull family and the dropout rate in Brazilian undergraduate courses

Fernando A. Peña-Ramírez; Renata R. Guerra; Charles Peixoto Mafalda

doi:10.1371/journal.pone.0290885

Abstract

We propose a new family of distributions, so-called the unit ratio-extended Weibull family (). It is derived from ratio transformation in an extended Weibull random variable. The use of this transformation is a novelty of the work since it has been less explored than the exponential and has not yet been studied within the extended Weibull class. Moreover, we offer a valuable alternative to model double-bounded variables on the unit interval. Five special models are studied in detail, namely the: i) unit ratio-Gompertz; ii) unit ratio-Burr XII; iii) unit ratio-Lomax; v) unit ratio-Rayleigh, and vi) unit ratio-Weibull distributions. We propose a quantile-parameterization for the new family. The maximum likelihood estimators (MLEs) are presented. A Monte Carlo study is performed to evaluate the behavior of the MLEs of unit ratio-Gompertz and unit ratio-Rayleigh distributions. This last model has closed-form and approximately unbiased MLE for small sample sizes. Further, the submodels are adjusted to the dropout rate in Brazilian undergraduate courses. We focus on the areas of civil engineering, economics, computer sciences, and control engineering. The applications show that the new family is suitable for modeling educational data and may provide effective alternatives compared to other usual unit models, such as the Beta, Kumaraswamy, and unit gamma distributions. They can also outperform some recent contributions in the unit distribution literature. Thus, the family can provide competitive alternatives when those models are unsuitable.

Citation: Peña-Ramírez FA, Guerra RR, Mafalda CP (2023) The unit ratio-extended Weibull family and the dropout rate in Brazilian undergraduate courses. PLoS ONE 18(11): e0290885. https://doi.org/10.1371/journal.pone.0290885

Editor: Mohamed R. Abonazel, Cairo University, EGYPT

Received: April 19, 2023; Accepted: August 11, 2023; Published: November 16, 2023

Copyright: © 2023 Peña-Ramírez et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the paper and its Supporting information files.

Funding: This research was partially funded by Fundação de Amparo à Pesquisa do Estado do Rio Grande do Sul (FAPERGS), Brazil, grant number 23/2551-0000851-3 awarded by RRG.

Competing interests: The authors have declared that no competing interests exist.

1 Introduction

The formulation of new generalized classes of probability distributions is a topic that has received a great deal of attention in recent years, particularly when it comes to positive data [1]. To mention a few, we refer the reader to [2–4] as extensions of the Weibull distribution and [5, 6] for Nadarajah-Haghighi generalizations. Most of these works are introduced aiming to furnish more flexible distributions regarding shape densities and hazard rates. However, there is much to be done when considering random variables supported in the unit interval. We can cite the beta and Kumaraswamy [7] () distributions as classical unit models In this respect.

Motivated by the increasing interest in modeling bounded data, other unit distributions have been introduced and are available in the literature. Some of these advances are, for instance, the unit gamma [8] (), simplex [9], CDF-quantile [10], unit Birnbaum-Saunders [11, 12] (), unit Weibull [13] (), unit extended Weibull [14], complementary unit extended Weibull [14], unit Gompertz [15], unit Burr XII [16, 17], reflected unit Burr XII [18], unit generalized half normal [19], bounded odd inverse Pareto exponential [20], Modified Kumaraswamy [21], unit-sinh-normal [22], log-Bilai [23] and log-weighted exponential [24] distributions. This interest is due to several natural and anthropogenic phenomena which are bounded in a certain interval [12]. The list of double-bounded random variables may include the proportion of chemical components in different substances [25], vote proportions [12, 26], relative air humidity [27], well-being indicators [14], mortality rates [18, 28], loss given default [29] among several other indexes, indicators, ratios, and rates. Nevertheless, some situations may require other alternatives to model heavy tails and asymmetric proportion data where current models have limitations.

In this context, we introduce the so-called unit ratio-extended Weibull () family of distributions, which is built upon the ratio transformation in the extended Weibull [30] () class. The most common method to derive those unit distributions is applying the exponential transformation in positive random variables. The use of ratio transformation is a novelty of this work since it has been less explored and has not yet been studied within the class. One advantage of introducing the is that some special models can produce N-shaped, U-shaped, and unimodal density shapes. These features make the proposed family quite attractive for educational modeling and addressing real-life problems involving asymmetric and heavy-tailed double-bounded indicators. The N-shaped behavior, for example, is not assumed by the classical beta and distributions but can be accommodated by some special cases. We conduct shape analysis and provide density plots on the proposed models to illustrate these characteristics.

Our main contribution lies in offering a valuable alternative to model double-bounded variables in the unit interval. Moreover, we present at least four contributions achieved by pioneering the class. First, the new family has more than twenty special models that may provide a source of alternatives to deal with rates and proportions, among other random variables in the unit domain. The second contribution is to provide a quantile parametrization for the new family. This framework is useful since the quantiles are outlier-resistant location measures and have a more intuitive interpretation than the original parametrization. The third contribution is related to parameter estimation under the maximum likelihood approach. As illustrated in Section 4, some one-parameter special cases present its maximum likelihood estimator (MLE) in closed form.

Finally, the fourth contribution to formulating the family is its applicability for modeling educational indicators. This type of data has motivated the proposal of several unit distributions. It is the case of [31–33], which examines indicators related to educational attainment percentage and school living conditions across various countries. We can also cite [14] for analyzing literacy rates in Brazilian and Colombian municipalities and [17] for modeling the dropout rate of Brazilian undergraduate animal science courses. However, it should be noted that, to the author’s knowledge, there remains a significant gap in the available information regarding the phenomenon of first-year, or freshman, student dropout. This paper’s motivating data sets concern the first-year dropout rate in Brazilian undergraduate courses. We analyze this outcome for civil engineering, economics, computer sciences, and control engineering courses. The four data sets are positive-skewed, and we observe a short amount of courses with a dropout rate smaller than 17%. This feature should be common for these kinds of data. When analyzing higher education institutions, academic programs with low dropout rates tend to receive higher quality ratings [34]. In addition, this measure is seen as an indicator of institutional excellence and performance [35]. Therefore, our proposals have the advantage of providing consistently better fits than classical beta and distributions when modeling the dropout rate in Brazilian undergraduate courses (see Section 6). As illustrated in the applications, they can also outperform some recent contributions in the distribution literature, such as the , , and distributions. All analysis in this paper is carried out using R programming language. The computational codes and data sets used to obtain the plots, simulations, and application results are made available on a GitHub repository (Computer codes available at https://github.com/penaramirez/UREW).

The rest of the paper is organized as follows. Section 2 presents the theoretical background and defines the new family of unit distributions. Some special cases are presented in Section 3. Section 4 focuses on inferential procedures based on the maximum likelihood method. We present results for all family members and derive expressions for the MLEs of some special models. Section 5 discusses simulation studies’ results to assess the performance of the point and asymptotic interval estimators. Section 6 illustrates our proposed family’s relevance in educational data, specifically about the first-year dropout rate in some Brazilian undergraduate courses. The final remarks are presented in Section 7.

2 The unit ratio-extended Weibull family of distributions

This section presents the theoretical background and defines the proposed family from a ratio transformation in the class of distributions. Let X be a random variable on the class, and denote . The probability density function (pdf) of X is given by (1) where x > 0, α > 0, H(x; ξ) is a non-negative monotonically increasing function which depends on the parameter vector ξ, and h(x; ξ) is the derivative of H(x; ξ) with respect to x. For each formulation of H(x; ξ), different special models result. Thus, several well-known distributions can be obtained depending on the choice of this function. Table 1 presents twenty alternatives for H(x; ξ), their corresponding derivatives, and inverse functions. Further details on this family and some generalizations to examine non-negative data are given by [36–38].

Download:

Table 1. Some

special models and their corresponding H(x; ξ), H⁻¹(x; ξ) and h(x; ξ) functions.

https://doi.org/10.1371/journal.pone.0290885.t001

The cumulative distribution function (cdf) and quantile function (qf) are given by and respectively, where H⁻¹(⋅; ξ) is the inverse function of H(⋅; ξ).

We define the class of distribution by considering the ratio transformation Y = X/(1 + X), where Hereafter, we denote Y as a random variable, which has cdf (2) where 0 < y < 1, α > 0, and ξ is the parameter vector associated to the H(⋅; ξ) function.

Thus, the pdf and qf of the proposed family are and respectively. The proposition below refers to a quantile-based parametrization for the family. Analogous frameworks can be found in other unit models recently introduced. See [14, 39, 40] for median-based parametrizations and [41] for a quantile-based example.

Proposition 1. Let Y be a random variable, then its cdf can be rewritten as (3) where q(τ) ∈ (0, 1) is a location parameter which corresponds to the τth quantile of Y, ξ is the parameter vector associated with H(⋅; ξ), and τ is assumed as known.

Proof. The result in Eq (3) holds by replacing in (2). Hence, the qf Y can be rewritten as Setting u = τ in the above equation, we obtaing that Q_Y(τ) = q(τ), which concludes the proof.

Under the quantile parametrization, the pdf can be written as (4)

3 Some special cases

Several well-established statistical models are special cases in the EW family. They can be considered baseline models in the family by replacing their corresponding H(⋅; ξ) functions in the cdf (2). Here, we give further details on five of those models, namely: the unit ratio-Gompertz (), unit ratio-Burr XII (), unit ratio-Lomax (), unit ratio-Weibull (), and unit ratio-Rayleigh () distributions. These models are introduced using the quantile-parametrization given in Proposition 1. The H(⋅; ξ) functions of these and several other models members of the family can be consulted in Table 1.

3.1 The unit ratio-Gompertz distribution

The distribution is obtained considering the Gompertz as a baseline model in the family. Thus, by taking H(x; ξ) = exp{βx} − 1 in (2), the cdf can be written as where y ∈ (0, 1), β > 0 is a shape parameter, and μ ∈ (0, 1) is the τth quantile parameter. The corresponding pdf, qf, and hazard rate function (hrf) are (5) (6) and respectively. Fig 1(a) illustrates the pdf shapes for several combinations of μ and β, with τ = 0.5.

Download:

Fig 1. Density plots for some

special models.

(a) , (b) , (c) , (d) .

https://doi.org/10.1371/journal.pone.0290885.g001

3.2 The unit ratio-Burr XII distribution

The distribution is obtained considering the Burr XII as a baseline model in the family. Thus, by taking H(x; ξ) = log[1 + x^β] in (2), and after simplification, the cdf reduces to where y ∈ (0, 1), and μ ∈ (0, 1) is the τth quantile parameter. The corresponding pdf, qf, and hrf are (7) (8) and respectively. Fig 1(b) illustrates the pdf shapes for several combinations of μ and τ = 0.5. This plot illustrates the flexibility of the distribution. It can have N-shaped, U-shaped, and unimodal density shapes, being able to fit asymmetric and heavy-tailed double-bounded data.

3.3 The unit ratio-Lomax distribution

The distribution is obtained considering the Lomax as a baseline model in the family. Thus, by taking H(x; ξ) = log[1 + x^β] in (2), and after simplificaion, the cdf reduces to where y ∈ (0, 1), β > 0 is a shape parameter, and μ ∈ (0, 1) is the τth quantile parameter. The corresponding pdf, qf, and hrf are (9) and respectively. Fig 1(c) illustrates the pdf shapes for several combinations of μ and β, with τ = 0.5.

3.4 The unit ratio-Weibull and unit ratio-Rayleigh distributions

The distribution is obtained considering the Weibull as baseline model in the family. By taking H(x; ξ) = x^β in (2), the can be written as where y ∈ (0, 1), β > 0 is a shape parameter, and μ ∈ (0, 1) is the τth quantile parameter. The corresponding pdf, qf, and hrf are (10) and respectively. For β = 2, the reduces to the distribution, which is also new. The is a one-parameter model obtained considering the Rayleigh as a baseline model in the family. Its pdf is given by (11) Fig 1(d) illustrates the pdf shapes for several combinations of μ and τ = 0.5. It shows that the distribution presents unimodal density shape, accomodating left and right-skewed data in the unit interval.

4 Maximum likelihood estimation

Here, we consider estimation of the parameters of the family by the maximum likelihood (ML) method. The log-likelihood for a random sample y₁, … y_n from (4), based on parameter vetor θ = (μ, ξ^⊤)^⊤, is (12)

The components of the score vector U(θ) = [U_μ, U_ξ]^⊤, are e For fixed values of ξ, it is possible to obtain a closed-form for the MLE of the μ. By setting U_μ = 0 and solving for μ, we have Therefore, obtaing the EMV of μ in closed-form is possible when ξ = ∅. Otherwise, to get the MLEs of the parameters μ and ξ, it is necessary to use some iterative procedures such as Newton-Raphson type algorithms to maximize (12).

We can construct approximate confidence intervals for θ based on the asymptotic normality property. Under standard regularity conditions, the asymptotic distribution of can be approximated by the multivariate normal distribution, where is the observed information matrix. Thus, the asymptotic 100(1 − η)% confidence intervals of θ are given by where z_η/2 is the quantile η/2 of the standard normal distribution, and . In what follows, we present the likelihood estimation of some special cases of the family.

4.1 MLE for the distribution

Let y₁, …, y_n be a random sample of size n from the distribution. The log-likelihood function is The escore function U_μ is (13) By setting U_μ = 0 and solving for μ, we have the EMV of μ as (14) and the Fisher’s observed information is computed as (15) The conditions for the maximum value of the function ℓ(μ|y₁, …, y_n) require that . This is easily observed by substituting (14) into (15), where it is verified that

4.2 MLE for the distribution

Let y₁, …, y_n be a random sample of size n from the distribution with parameter vector θ = (β, μ)^⊤. The log-likelihood function is (16) The components of the score vector U_θ = (U_β, U_μ)^⊤ are and Note that the system of equations U_θ = 0 cannot be solved in closed form; therefore, the maximization of (16) to obtain the EMV of θ can be carried out using the quasi-Newton BFGS nonlinear optimization algorithm implemented in the optim function available in R.

5 Simulation study

In this Section, a Monte Carlo study is carried on to evaluate the performance of the MLEs of the family in finite samples. For that, the and distributions are considered. This study conducted 10,000 Monte Carlo replications with sample sizes n ∈ {10, 25, 50, 75, 100}. Aiming to evaluate the point estimators, we use the set of estimates of parameters obtained in each replication to calculate its mean, variance, relative biases (RBs), standard deviations (SDs), and root mean squared errors (RMSEs). Regarding the initial values selected for simulation, we highlight that the distribution has a closed form for its MLE (see Eq (14)). Therefore, one advantage of using this model is that it does not require defining initial values in the ML method. For the two-parameter special cases, we use the Broyden-Fletcher-Goldfarb-Shanno (BFGS) algorithm and compute the observed information matrix numerically from the optim function in the R programming language. Therefore, we set the sample quantile as the initial value for μ and one for the shape parameter. These values are used either for the simulated or actual data experiments performed in the paper. We calculate the coverage probability of the 95% pointwise confidence interval (CP_95%) to evaluate the interval estimation. Next, we provide the numerical results for both considered distributions. Next, the numerical results for both distributions considered are presented.

5.1 Numerical Analysis for the distribution

We generate occurrences of the variable Y following a law with five different values of μ (scenarios). For that, we use the inversion method replacing u ∼ U(0, 1) in the qf. The simulation results are shown in Table 2. It reveals low RB values in all the scenarios and sample sizes considered. We highlight that all its observed values are less than 0.7%. We also observe low SD values, all less than 0.5. For all the sample sizes, it is common to observe RMSE’s lower values for the central values of μ (μ = 0.4, for example) than for the close values of the extremes (μ = 0.15 or μ = 0.9, for example). In its last column, it can verify that the coverage probabilities of the 95% pointwise confidence intervals of the parameter are quite close to the nominal level.

Download:

Table 2. Results of the Monte Carlo simulation from the

distribution.

https://doi.org/10.1371/journal.pone.0290885.t002

Fig 2 indicates that the RB and RMSE of decrease as the sample size increases, corroborating the asymptotic properties of the MLEs.

Download:

Fig 2. Percentual RB and RMSE of the

estimator in several scenarios.

https://doi.org/10.1371/journal.pone.0290885.g002

5.2 Numerical analysis for the distribution

Analogous to the previous experiment, occurrences of the variable Y are initially generated, which follows a distribution with different configurations in its parameters μ and β. The data are generated using the inversion method in the qf. In Table 3‘, we present the simulation results. It shows that μ’s estimates are more accurate than β’s. We can also observe that the RB of μ is always less than 0.4% in absolute value. For sample sizes greater than 75, the RB of is always less than 10%. In the last column of Table 2, we can be observed that the coverage probabilities of the 95% pointwise confidence intervals of both parameters are quite close to the nominal level.

Download:

Table 3. Results of the Monte Carlo simulation from the

distribution.

https://doi.org/10.1371/journal.pone.0290885.t003

Fig 3(a) presents a plot with the sum of the RB of and that we call the total RB. Fig 3(b) presents a similar plot with the sum of the RMSE of and , that we call the total RMSE. They show that the total RB and total RMSE of and decrease as the sample size increases, corroborating the asymptotic properties of the MLEs.

Download:

Fig 3. Total percentual RB and total RMSE of the

estimators in several scenarios.

(a) VR(%) total, (b) EQM total.

https://doi.org/10.1371/journal.pone.0290885.g003

6 Applications

This section illustrates the usefulness of the family through applications in educational data related to student dropout, also known as student attrition. This outcome has some complexity in data collection [42], and a diversity of definitions has been considered in the specialized literature. In this paper, we are interested in analyzing the first-year dropout rate in undergraduate courses, defined as the proportion of students who withdraw from the course before completing the first year. Thus, from a sample with n undergraduate courses, the ith observation is obtained as where i ∈ {1, …, n}. The decision to focus the study on freshmen students lies in the evidence that the risk of dropping out is higher during the first year of college, also called the freshmen year [42, 43]. This period is seen as the most critical time for the connection between academic programs and students [44]. Therefore, understanding the behavior of this variable may be helpful in developing practices aimed at reducing the early dropout from undergraduate courses from different areas.

The data used in this case study were collected from the Brazilian higher education census microdata, conducted in 2018 [45] and were calculated from the entering students in 2018. We select the presential courses with more than 29 new students and first-year dropout rate in the (0, 1) interval in the census academic year. The applications refer to four data sets about civil engineering, economics, computer sciences, and control engineering courses. We fit special models and compare their performance with other existing double-bounded distributions, which are not special cases of the proposed family.

Table 4 gives a descriptive summary of the dropout rates of each dataset considered. The Economics course exhibits smaller values for all central tendency measures and higher for the skewness, kurtosis, and amplitude measures. The other courses present those measures quite close when compared with each other. Their mean and median are around 17% and 16%, respectively. The descriptive measures indicate that, for all data sets, the mass of observations concentrates on the left. This configuration is adequate since the dropout rate is negatively related to institutional quality and effectiveness. Academic programs with low dropout rates are often considered to be more efficient [34]. Nevertheless, the dropout rates in higher education are social and institutional concerns [42], and there is a broad consensus on the need for universities to promote students’ success [46]. The fact that many students do not achieve their goals during university experience is a waste of talent and human potential [42, 46].

Download:

Table 4. Descriptive statistics for the dropout rates in the four course types considered.

https://doi.org/10.1371/journal.pone.0290885.t004

For modeling these data, we fit five special models studied in the current paper, i.e., the and distributions. Their densities are given by equations (5), (7), (9), (11), and (10), respectively. We fix τ at 0.5 in those equations. We also considered six well-known alternative distributions to describe random variables supported in the unit interval for comparison purposes. We fit the Beta, , , , , and complementary unit Weibull () [14] distributions. They do not represent special cases and are selected as competitive distributions due to their relevance in the literature. The beta and are classical models for double-bounded outcomes. The is chosen due to its relevance to various problems. It has received a great deal of attention from statisticians for developing methodological advances [47]. The and are two of the most relevant models regarding recent advances in distribution theory. The arises as an alternative model due to its usefulness regarding educational modeling. This distribution has proved helpful in analyzing literacy rates [14]. The densities of all these competitor models are presented in Appendix A.

Parameter estimation is performed by the maximum likelihood method for all fitted models, and the Cramér-von Misses corrected statistic [48] (W*) is considered as the goodness-of-fit measure. Those estimates are computed using the goodness.fit function from the AdequacyModel package [49]. The goodness.fit function allows computing the MLEs of probability distributions and their goodness-of-fit statistical measures. It uses the optim function in the implementation and includes several optimization techniques. For the paper results, we use the BFGS algorithm and compute the observed information matrix numerically. Thus, the standard errors and confidence intervals were obtained from the asymptotic normality property of the MLEs. We set the initial values at 1 for the shape (or precision) parameter, the sample mean for the distributions indexed in the mean, and the sample quantile for those with quantile parametrization.

The estimation results for all data sets are reported in Table 5. We observe that the distributions on the family have the lowest W* for the course types considered. The proposed models occupy the first three positions in the ranking for civil engineering and computer sciences. Analyzing the control engineering course, we note that the outperforms the others and is followed by the distribution, which also belongs to the family. For the economics course, the distribution has superior goodness-of-fit. Fig 4 displays the boxplots and the histograms with fitted density functions for the three best models according to W*. Those plots corroborate that the fits are adequate to the dropout rates of all course types considered and provides real improvement over existing distributions. Therefore, the proposed family is shown competitive with classical unit models such as the beta and distributions.

Download:

Fig 4. Histogram and estimated densities for the applications.

https://doi.org/10.1371/journal.pone.0290885.g004

Download:

Table 5. MLEs estimates, the corresponding standard errors (given in parentheses) and goodness-of-fit measure for all fitted models and course types considered.

https://doi.org/10.1371/journal.pone.0290885.t005

The special cases also exhibit superior performance when compared to recent alternatives, including the , , and distributions. It is worth noting that the distribution has been commonly used in educational modeling. In [14], it was verified that this model can properly fit literacy rates. However, it is important to highlight that while higher literacy rates are desirable [14], lower values are considered more favorable in the case of dropout rates [17]. In this case, it is expected that left-skewed distributions to fit better the former and right-skewed distributions to be more suitable for the latter. This feature may explain why the is not among the best models for the analyzed datasets while evincing the capacity of the family to model the first-year dropout rate effectively.

Our results may represent useful tools for universities to evaluate and improve their programs. It is a relevant application as it allows us to deal with the academic, social, and economic implications of university dropout [17]. Nevertheless, other potential applications can be explored in the context of educational modeling. The new family can be competitive to model literacy rates [14], educational attainment percentages [31], proportions of adolescents who want top grades at school [32], and proportions of the novice teachers with a mentor at the school [33]. These variables have been explored through other commonly used distributions in educational modeling. We can also cite the graduation and persistence rates as further applications, which are related to student progression and academic success patterns [34].

7 Final remarks

This paper defines the unit ratio-extended Weibull () family of distributions. It is obtained on a ratio transformation in the extended Weibull family and can be used to model continuous random variables in the unit interval. The new family has a closed-form for quantile measures; thus, we provide a quantile parametrization for the family. Several special cases are derived, and parameter estimation is explored using the maximum likelihood theory. We show that some one-parameter special cases may present closed-form for the maximum likelihood estimator (MLE). We perform Monte Carlo experiments to assess the performance of those estimators. For example, the unit ratio-Rayleigh MLE is approximately unbiased for small sample sizes. We also note an appropriate performance for the unit ratio-Gompertz MLEs. The utility of the proposed family is illustrated with applications to the first-year dropout rate of undergraduate courses in Brazilian universities. We select four course types and note that, for those data, the special models fit properly and outperform other classical and recent unit distributions. Thus, the new family can be competitive alternative when those models are unsuitable. We emphasize that a long list of possibilities can be addressed in future works. For example, our approach can be investigated in the presence of zeros and ones, and quantile regression models are also a natural path. The can also be generalized to accommodate time-dependent double-bounded indicators by using the autoregressive moving average models. This kind of structure is in the state-of-art literature on the analysis of double-bounded time series. The can also attract applications to other double-bounded variables, being a competitive option to other unit distributions commonly used in educational modeling. For instance, literacy rates, educational attainment percentages, graduation, and persistence rates are educational measurements that represent potential applications for the proposed family.

Appendix

A—Alternative distributions fitted in the applications

In this appendix, we present the unit distributions fitted in Section 6 as alternative models to the family. These model and their corresponding densities are listed bellow (for 0 < y < 1):

The beta density is given by where μ ∈ (0, 1) is the mean of Y and ϕ > 0 is a precision parameter. The above parametrization is pioneered by [50].
The density is given by where μ ∈ (0, 1) is the qth quantile parameter, and ϕ > 0 is a precision parameter. The above parametrization is pioneered by [51]. In Section 6 we fix q at 0.5 thus the parameter μ refers to the median of Y.
The density is given by where μ ∈ (0, 1) is the mean of Y and ϕ > 0 is a precision parameter. The above parametrization is pioneered by [52].
The density is given by where α > 0 and β > 0 are shape parameters. The is pioneered by [11].
The density is given by where μ ∈ (0, 1) is the τth quantile parameter and β > 0 are shape parameters. The above parametrization is pioneered by [41]. In Section 6 we fix τ at 0.5 thus the parameter μ refers to the median of Y.
The density is given by where μ ∈ (0, 1) is the median of Y and β > 0 is a shape parameter. The above distribution is pioneered by [14].

Supporting information

S1 Data.

https://doi.org/10.1371/journal.pone.0290885.s001

(ZIP)

References

1. Tahir MH, Nadarajah S. Parameter induction in continuous univariate distributions: Well-established G families. Annals of the Brazilian Academy of Sciences. 2015;87:539–568. pmid:26131628
- View Article
- PubMed/NCBI
- Google Scholar
2. Peña-Ramírez FA, Guerra RR, Cordeiro GM, Marinho PR. The exponentiated power generalized Weibull: Properties and applications. Anais da Academia Brasileira de Ciências. 2018;90:2553–2577. pmid:30304207
- View Article
- PubMed/NCBI
- Google Scholar
3. Zichuan M, Hussain S, Iftikhar A, Ilyas M, Ahmad Z, Khan DM, et al. A new extended-family of distributions: properties and applications. Computational and Mathematical Methods in Medicine. 2020;2020. pmid:32549906
- View Article
- PubMed/NCBI
- Google Scholar
4. Arif M, Khan DM, Khosa SK, Aamir M, Aslam A, Ahmad Z, et al. Modelling insurance losses with a new family of heavy-tailed distributions. Computers, Materials & Continua. 2021;66:537–550.
- View Article
- Google Scholar
5. Peña-Ramírez FA, Guerra RR, Cordeiro GM. The Nadarajah-Haghighi Lindley distribution. Anais da Academia Brasileira de Ciências. 2019;91:e20170856. pmid:30994747
- View Article
- PubMed/NCBI
- Google Scholar
6. Peña-Ramírez FA, Guerra RR, Canterle DR, Cordeiro GM. The logistic Nadarajah–Haghighi distribution and its associated regression model for reliability applications. Reliability Engineering & System Safety. 2020;204:107196.
- View Article
- Google Scholar
7. Kumaraswamy P. A generalized probability density function for double-bounded random processes. Journal of Hydrology. 1980;46:79–88.
- View Article
- Google Scholar
8. Grassia A. On a family of distributions with argument between 0 and 1 obtained by transformation of the gamma distribution and derived compound distributions. Australian Journal of Statistics. 1977;19:108–114.
- View Article
- Google Scholar
9. Barndorff-Nielsen OE, Jorgensen B. Some parametric models on the simplex. Journal of Multivariate Analysis. 1991;39:106–116.
- View Article
- Google Scholar
10. Smithson M, Shou Y. CDF-quantile distributions for modelling random variables on the unit interval. British Journal of Mathematical and Statistical Psychology. 2017;70:412–438. pmid:28306155
- View Article
- PubMed/NCBI
- Google Scholar
11. Mazucheli J, Menezes AFB, Dey S. The unit-Birnbaum-Saunders distribution with applications. Chilean Journal of Statistics. 2018;9:47–57.
- View Article
- Google Scholar
12. Mazucheli J, Leiva V, Alves B, Menezes AF. A new quantile Regression for modeling bounded data under a unit Birnbaum–Saunders distribution with applications in medicine and politics. Symmetry. 2021;13:682.
- View Article
- Google Scholar
13. Mazucheli J, Menezes A, Ghitany M. The unit-Weibull distribution and associated inference. Journal of Applied Probability and Statistics. 2018;13:1–22.
- View Article
- Google Scholar
14. Guerra RR, Peña-Ramírez FA, Bourguignon M. The unit extended Weibull families of distributions and its applications. Journal of Applied Statistics. 2021;48:3174–3192. pmid:35707261
- View Article
- PubMed/NCBI
- Google Scholar
15. Mazucheli J, Menezes AF, Dey S. Unit-Gompertz distribution with applications. Statistica. 2019;79:25–43.
- View Article
- Google Scholar
16. Korkmaz MÇ, Chesneau C. On the unit Burr-XII distribution with the quantile regression modeling and applications. Computational and Applied Mathematics. 2021;40:1–26.
- View Article
- Google Scholar
17. Ribeiro TF, Peña-Ramírez FA, Guerra RR, Cordeiro GM. Another unit Burr XII quantile regression model based on the different reparameterization applied to dropout in Brazilian undergraduate courses. Plos one. 2022;17:e0276695. pmid:36327245
- View Article
- PubMed/NCBI
- Google Scholar
18. Ribeiro TF, Cordeiro GM, Pena-Ramirez FA, Guerra RR. A new quantile regression for the COVID-19 mortality rates in the United States. Computational and Applied Mathematics. 2021;40:1–16.
- View Article
- Google Scholar
19. Korkmaz MÇ. The unit generalized half normal distribution: A new bounded distribution with inference and application. UPB Scientific Bulletin, Series A: Applied Mathematics and Physics. 2020;82:133–140.
- View Article
- Google Scholar
20. Nasiru S, Abubakari AG, Angbing ID. Bounded odd inverse pareto exponential distribution: Properties, estimation, and regression. International Journal of Mathematics and Mathematical Sciences. 2021;2021:1–18.
- View Article
- Google Scholar
21. Sagrillo M, Guerra RR, Bayer FM. Modified Kumaraswamy distributions for double bounded hydro-environmental data. Journal of Hydrology. 2021;603:127021.
- View Article
- Google Scholar
22. Martínez-Flórez G, Tovar-Falón R. New regression models based on the unit-sinh-normal distribution: Properties, inference, and applications. Mathematics. 2021;9:1231.
- View Article
- Google Scholar
23. Altun E, El-Morshedy M, Eliwa M. A new regression model for bounded response variable: An alternative to the beta and unit-Lindley regression models. Plos one. 2021;16:e0245627. pmid:33481884
- View Article
- PubMed/NCBI
- Google Scholar
24. Altun E. The log-weighted exponential regression model: alternative to the beta regression model. Communications in Statistics—Theory and Methods. 2021;50:2306–2321.
- View Article
- Google Scholar
25. Espinheira PL, Silva LCM, Cribari-Neto F. Bias and variance residuals for machine learning nonlinear simplex regressions. Expert Systems with Applications. 2021;185:115656.
- View Article
- Google Scholar
26. Yero EJH, Sacco NC, do Carmo Nicoletti M. Effect of the municipal Human Development index on the results of the 2018 Brazilian presidential Elections. Expert Systems with Applications. 2020;168:114305.
- View Article
- Google Scholar
27. Bayer FM, Bayer DM, Pumi G. Kumaraswamy autoregressive moving average models for double bounded environmental data. Journal of Hydrology. 2017;555:385–396.
- View Article
- Google Scholar
28. Melchior C, Zanini RR, Guerra RR, Rockenbach DA. Forecasting Brazilian mortality rates due to occupational accidents using autoregressive moving average approaches. International Journal of Forecasting. 2021;37:825–837.
- View Article
- Google Scholar
29. Calabrese R, Zanin L. Modelling spatial dependence for loss given default in peer-to-peer lending. Expert Systems with Applications. 2022;192:116295.
- View Article
- Google Scholar
30. Gurvich M, DiBenedetto A, Ranade S. A new statistical distribution for characterizing the random strength of brittle materials. Journal of Materials Science. 1997;32:2559–2564.
- View Article
- Google Scholar
31. Korkmaz M, Chesneau C, Korkmaz ZS. Transmuted unit Rayleigh quantile regression model: Alternative to beta and Kumaraswamy quantile regression models. Univ Politeh Buchar Sci Bull Ser Appl Math Phys. 2021;83:149–158.
- View Article
- Google Scholar
32. Korkmaz MÇ, Chesneau C, Korkmaz ZS. A new alternative quantile regression model for the bounded response with educational measurements applications of OECD countries. Journal of Applied Statistics. 2023;50:131–154. pmid:36530782
- View Article
- PubMed/NCBI
- Google Scholar
33. Korkmaz MÇ, Korkmaz ZS. The unit log–log distribution: a new unit distribution with alternative quantile regression modeling and educational measurements applications. Journal of Applied Statistics. 2021; p. 1–20.
- View Article
- Google Scholar
34. Sneyers E, De Witte K. The interaction between dropout, graduation rates and quality ratings in universities. Journal of the Operational Research Society. 2017;68:416–430.
- View Article
- Google Scholar
35. Cave M. The use of performance indicators in higher education: A critical analysis of developing practice. Higher Education Policy Series, 2. ERIC; 1991.
36. Marinho PRD, Cordeiro GM, Ramírez FP, Alizadeh M, Bourguignon M. The exponentiated logarithmic generated family of distributions and the evaluation of the confidence intervals by percentile bootstrap. Brazilian Journal of Probability and Statistics. 2018;32:281–308.
- View Article
- Google Scholar
37. Abbas K, Hussain Z, Rashid N, Ali A, Taj M, Khan SA, et al. Bayesian estimation of Gumbel type-II distribution under type-II censoring with medical applicatioNs. Computational and Mathematical Methods in Medicine. 2020;2020:1–11.
- View Article
- Google Scholar
38. Peña-Ramírez FA, Guerra RR, Cordeiro GM. A new Nadarajah-Haghighi generalization with five different shapes for the hazard function. Revista Colombiana de Estadística. 2023;46:1–29.
- View Article
- Google Scholar
39. Mitnik PA, Baek S. The Kumaraswamy distribution: median-dispersion reparameterizations for regression modeling and simulation-based estimation. Statistical Papers. 2013;54:177–192.
- View Article
- Google Scholar
40. Lemonte AJ, Bazán JL. New class of Johnson distributions and its associated regression model for rates and proportions. Biometrical Journal. 2016;58:727–746. pmid:26659998
- View Article
- PubMed/NCBI
- Google Scholar
41. Mazucheli J, Menezes A, Fernandes L, De Oliveira R, Ghitany M. The unit-Weibull distribution as an alternative to the Kumaraswamy distribution for the modeling of quantiles conditional on covariates. Journal of Applied Statistics. 2020;47:954–974. pmid:35706917
- View Article
- PubMed/NCBI
- Google Scholar
42. Ferrão ME, Almeida LS. Multilevel modeling of persistence in higher education. Ensaio: Avaliação e Políticas Públicas em Educação. 2018;26:664–683.
- View Article
- Google Scholar
43. Thammasiri D, Delen D, Meesad P, Kasap N. A critical assessment of imbalanced class distribution problem: The case of predicting freshmen student attrition. Expert Systems with Applications. 2014;41:321–330.
- View Article
- Google Scholar
44. Sneyers E, De Witte K. Interventions in higher education and their effect on student success: a meta-analysis. Educational Review. 2018;70:208–228.
- View Article
- Google Scholar
45. INEP. Instituto Nacional de Estudos e Pesquisas Educacionais Anísio Teixeira: Censo da Educação Superior; 2018. Brasília: Ministério da Educação. Available from: http://portal.inep.gov.br/basica-levantamentos-acessar.
46. Ferrão M, Almeida L. Differential effect of university entrance score on first-year students’ academic performance in Portugal. Assessment & Evaluation in Higher Education. 2019;44:610–622.
- View Article
- Google Scholar
47. Guedes AC, Cribari-Neto F, Espinheira PL. Modified likelihood ratio tests for unit gamma regressions. Journal of Applied Statistics. 2020;47:1562–1586. pmid:35707584
- View Article
- PubMed/NCBI
- Google Scholar
48. Chen G, Balakrishnan N. A general purpose approximate goodness-of-fit test. Journal of Quality Technology. 1995;27:154–161.
- View Article
- Google Scholar
49. Marinho PRD, Silva RB, Bourguignon M, Cordeiro GM, Nadarajah S. AdequacyModel: An R package for probability distributions and general purpose optimization. PloS one. 2019;14:e0221487. pmid:31450236
- View Article
- PubMed/NCBI
- Google Scholar
50. Ferrari SLP, Cribari-Neto F. Beta regression for modelling rates and proportions. Journal of Applied Statistics. 2004;7:799–815.
- View Article
- Google Scholar
51. Bayes CL, Bazán JL, De Castro M. A quantile parametric mixed regression model for bounded response variables. Statistics and its interface. 2017;10:483–493.
- View Article
- Google Scholar
52. Mousa AM, El-Sheikh AA, Abdel-Fattah MA. A gamma regression for bounded continuous variables. Advances and Applications in Statistics. 2016;49:305–326.
- View Article
- Google Scholar

[ref1] 1. Tahir MH, Nadarajah S. Parameter induction in continuous univariate distributions: Well-established G families. Annals of the Brazilian Academy of Sciences. 2015;87:539–568. pmid:26131628
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Peña-Ramírez FA, Guerra RR, Cordeiro GM, Marinho PR. The exponentiated power generalized Weibull: Properties and applications. Anais da Academia Brasileira de Ciências. 2018;90:2553–2577. pmid:30304207
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Zichuan M, Hussain S, Iftikhar A, Ilyas M, Ahmad Z, Khan DM, et al. A new extended-family of distributions: properties and applications. Computational and Mathematical Methods in Medicine. 2020;2020. pmid:32549906
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Arif M, Khan DM, Khosa SK, Aamir M, Aslam A, Ahmad Z, et al. Modelling insurance losses with a new family of heavy-tailed distributions. Computers, Materials & Continua. 2021;66:537–550.
View Article
Google Scholar

[14] View Article

[15] Google Scholar

[ref5] 5. Peña-Ramírez FA, Guerra RR, Cordeiro GM. The Nadarajah-Haghighi Lindley distribution. Anais da Academia Brasileira de Ciências. 2019;91:e20170856. pmid:30994747
View Article
PubMed/NCBI
Google Scholar

[17] View Article

[18] PubMed/NCBI

[19] Google Scholar

[ref6] 6. Peña-Ramírez FA, Guerra RR, Canterle DR, Cordeiro GM. The logistic Nadarajah–Haghighi distribution and its associated regression model for reliability applications. Reliability Engineering & System Safety. 2020;204:107196.
View Article
Google Scholar

[21] View Article

[22] Google Scholar

[ref7] 7. Kumaraswamy P. A generalized probability density function for double-bounded random processes. Journal of Hydrology. 1980;46:79–88.
View Article
Google Scholar

[24] View Article

[25] Google Scholar

[ref8] 8. Grassia A. On a family of distributions with argument between 0 and 1 obtained by transformation of the gamma distribution and derived compound distributions. Australian Journal of Statistics. 1977;19:108–114.
View Article
Google Scholar

[27] View Article

[28] Google Scholar

[ref9] 9. Barndorff-Nielsen OE, Jorgensen B. Some parametric models on the simplex. Journal of Multivariate Analysis. 1991;39:106–116.
View Article
Google Scholar

[30] View Article

[31] Google Scholar

[ref10] 10. Smithson M, Shou Y. CDF-quantile distributions for modelling random variables on the unit interval. British Journal of Mathematical and Statistical Psychology. 2017;70:412–438. pmid:28306155
View Article
PubMed/NCBI
Google Scholar

[33] View Article

[34] PubMed/NCBI

[35] Google Scholar

[ref11] 11. Mazucheli J, Menezes AFB, Dey S. The unit-Birnbaum-Saunders distribution with applications. Chilean Journal of Statistics. 2018;9:47–57.
View Article
Google Scholar

[37] View Article

[38] Google Scholar

[ref12] 12. Mazucheli J, Leiva V, Alves B, Menezes AF. A new quantile Regression for modeling bounded data under a unit Birnbaum–Saunders distribution with applications in medicine and politics. Symmetry. 2021;13:682.
View Article
Google Scholar

[40] View Article

[41] Google Scholar

[ref13] 13. Mazucheli J, Menezes A, Ghitany M. The unit-Weibull distribution and associated inference. Journal of Applied Probability and Statistics. 2018;13:1–22.
View Article
Google Scholar

[43] View Article

[44] Google Scholar

[ref14] 14. Guerra RR, Peña-Ramírez FA, Bourguignon M. The unit extended Weibull families of distributions and its applications. Journal of Applied Statistics. 2021;48:3174–3192. pmid:35707261
View Article
PubMed/NCBI
Google Scholar

[46] View Article

[47] PubMed/NCBI

[48] Google Scholar

[ref15] 15. Mazucheli J, Menezes AF, Dey S. Unit-Gompertz distribution with applications. Statistica. 2019;79:25–43.
View Article
Google Scholar

[50] View Article

[51] Google Scholar

[ref16] 16. Korkmaz MÇ, Chesneau C. On the unit Burr-XII distribution with the quantile regression modeling and applications. Computational and Applied Mathematics. 2021;40:1–26.
View Article
Google Scholar

[53] View Article

[54] Google Scholar

[ref17] 17. Ribeiro TF, Peña-Ramírez FA, Guerra RR, Cordeiro GM. Another unit Burr XII quantile regression model based on the different reparameterization applied to dropout in Brazilian undergraduate courses. Plos one. 2022;17:e0276695. pmid:36327245
View Article
PubMed/NCBI
Google Scholar

[56] View Article

[57] PubMed/NCBI

[58] Google Scholar

[ref18] 18. Ribeiro TF, Cordeiro GM, Pena-Ramirez FA, Guerra RR. A new quantile regression for the COVID-19 mortality rates in the United States. Computational and Applied Mathematics. 2021;40:1–16.
View Article
Google Scholar

[60] View Article

[61] Google Scholar

[ref19] 19. Korkmaz MÇ. The unit generalized half normal distribution: A new bounded distribution with inference and application. UPB Scientific Bulletin, Series A: Applied Mathematics and Physics. 2020;82:133–140.
View Article
Google Scholar

[63] View Article

[64] Google Scholar

[ref20] 20. Nasiru S, Abubakari AG, Angbing ID. Bounded odd inverse pareto exponential distribution: Properties, estimation, and regression. International Journal of Mathematics and Mathematical Sciences. 2021;2021:1–18.
View Article
Google Scholar

[66] View Article

[67] Google Scholar

[ref21] 21. Sagrillo M, Guerra RR, Bayer FM. Modified Kumaraswamy distributions for double bounded hydro-environmental data. Journal of Hydrology. 2021;603:127021.
View Article
Google Scholar

[69] View Article

[70] Google Scholar

[ref22] 22. Martínez-Flórez G, Tovar-Falón R. New regression models based on the unit-sinh-normal distribution: Properties, inference, and applications. Mathematics. 2021;9:1231.
View Article
Google Scholar

[72] View Article

[73] Google Scholar

[ref23] 23. Altun E, El-Morshedy M, Eliwa M. A new regression model for bounded response variable: An alternative to the beta and unit-Lindley regression models. Plos one. 2021;16:e0245627. pmid:33481884
View Article
PubMed/NCBI
Google Scholar

[75] View Article

[76] PubMed/NCBI

[77] Google Scholar

[ref24] 24. Altun E. The log-weighted exponential regression model: alternative to the beta regression model. Communications in Statistics—Theory and Methods. 2021;50:2306–2321.
View Article
Google Scholar

[79] View Article

[80] Google Scholar

[ref25] 25. Espinheira PL, Silva LCM, Cribari-Neto F. Bias and variance residuals for machine learning nonlinear simplex regressions. Expert Systems with Applications. 2021;185:115656.
View Article
Google Scholar

[82] View Article

[83] Google Scholar

[ref26] 26. Yero EJH, Sacco NC, do Carmo Nicoletti M. Effect of the municipal Human Development index on the results of the 2018 Brazilian presidential Elections. Expert Systems with Applications. 2020;168:114305.
View Article
Google Scholar

[85] View Article

[86] Google Scholar

[ref27] 27. Bayer FM, Bayer DM, Pumi G. Kumaraswamy autoregressive moving average models for double bounded environmental data. Journal of Hydrology. 2017;555:385–396.
View Article
Google Scholar

[88] View Article

[89] Google Scholar

[ref28] 28. Melchior C, Zanini RR, Guerra RR, Rockenbach DA. Forecasting Brazilian mortality rates due to occupational accidents using autoregressive moving average approaches. International Journal of Forecasting. 2021;37:825–837.
View Article
Google Scholar

[91] View Article

[92] Google Scholar

[ref29] 29. Calabrese R, Zanin L. Modelling spatial dependence for loss given default in peer-to-peer lending. Expert Systems with Applications. 2022;192:116295.
View Article
Google Scholar

[94] View Article

[95] Google Scholar

[ref30] 30. Gurvich M, DiBenedetto A, Ranade S. A new statistical distribution for characterizing the random strength of brittle materials. Journal of Materials Science. 1997;32:2559–2564.
View Article
Google Scholar

[97] View Article

[98] Google Scholar

[ref31] 31. Korkmaz M, Chesneau C, Korkmaz ZS. Transmuted unit Rayleigh quantile regression model: Alternative to beta and Kumaraswamy quantile regression models. Univ Politeh Buchar Sci Bull Ser Appl Math Phys. 2021;83:149–158.
View Article
Google Scholar

[100] View Article

[101] Google Scholar

[ref32] 32. Korkmaz MÇ, Chesneau C, Korkmaz ZS. A new alternative quantile regression model for the bounded response with educational measurements applications of OECD countries. Journal of Applied Statistics. 2023;50:131–154. pmid:36530782
View Article
PubMed/NCBI
Google Scholar

[103] View Article

[104] PubMed/NCBI

[105] Google Scholar

[ref33] 33. Korkmaz MÇ, Korkmaz ZS. The unit log–log distribution: a new unit distribution with alternative quantile regression modeling and educational measurements applications. Journal of Applied Statistics. 2021; p. 1–20.
View Article
Google Scholar

[107] View Article

[108] Google Scholar

[ref34] 34. Sneyers E, De Witte K. The interaction between dropout, graduation rates and quality ratings in universities. Journal of the Operational Research Society. 2017;68:416–430.
View Article
Google Scholar

[110] View Article

[111] Google Scholar

[ref35] 35. Cave M. The use of performance indicators in higher education: A critical analysis of developing practice. Higher Education Policy Series, 2. ERIC; 1991.

[ref36] 36. Marinho PRD, Cordeiro GM, Ramírez FP, Alizadeh M, Bourguignon M. The exponentiated logarithmic generated family of distributions and the evaluation of the confidence intervals by percentile bootstrap. Brazilian Journal of Probability and Statistics. 2018;32:281–308.
View Article
Google Scholar

[114] View Article

[115] Google Scholar

[ref37] 37. Abbas K, Hussain Z, Rashid N, Ali A, Taj M, Khan SA, et al. Bayesian estimation of Gumbel type-II distribution under type-II censoring with medical applicatioNs. Computational and Mathematical Methods in Medicine. 2020;2020:1–11.
View Article
Google Scholar

[117] View Article

[118] Google Scholar

[ref38] 38. Peña-Ramírez FA, Guerra RR, Cordeiro GM. A new Nadarajah-Haghighi generalization with five different shapes for the hazard function. Revista Colombiana de Estadística. 2023;46:1–29.
View Article
Google Scholar

[120] View Article

[121] Google Scholar

[ref39] 39. Mitnik PA, Baek S. The Kumaraswamy distribution: median-dispersion reparameterizations for regression modeling and simulation-based estimation. Statistical Papers. 2013;54:177–192.
View Article
Google Scholar

[123] View Article

[124] Google Scholar

[ref40] 40. Lemonte AJ, Bazán JL. New class of Johnson distributions and its associated regression model for rates and proportions. Biometrical Journal. 2016;58:727–746. pmid:26659998
View Article
PubMed/NCBI
Google Scholar

[126] View Article

[127] PubMed/NCBI

[128] Google Scholar

[ref41] 41. Mazucheli J, Menezes A, Fernandes L, De Oliveira R, Ghitany M. The unit-Weibull distribution as an alternative to the Kumaraswamy distribution for the modeling of quantiles conditional on covariates. Journal of Applied Statistics. 2020;47:954–974. pmid:35706917
View Article
PubMed/NCBI
Google Scholar

[130] View Article

[131] PubMed/NCBI

[132] Google Scholar

[ref42] 42. Ferrão ME, Almeida LS. Multilevel modeling of persistence in higher education. Ensaio: Avaliação e Políticas Públicas em Educação. 2018;26:664–683.
View Article
Google Scholar

[134] View Article

[135] Google Scholar

[ref43] 43. Thammasiri D, Delen D, Meesad P, Kasap N. A critical assessment of imbalanced class distribution problem: The case of predicting freshmen student attrition. Expert Systems with Applications. 2014;41:321–330.
View Article
Google Scholar

[137] View Article

[138] Google Scholar

[ref44] 44. Sneyers E, De Witte K. Interventions in higher education and their effect on student success: a meta-analysis. Educational Review. 2018;70:208–228.
View Article
Google Scholar

[140] View Article

[141] Google Scholar

[ref45] 45. INEP. Instituto Nacional de Estudos e Pesquisas Educacionais Anísio Teixeira: Censo da Educação Superior; 2018. Brasília: Ministério da Educação. Available from: http://portal.inep.gov.br/basica-levantamentos-acessar.

[ref46] 46. Ferrão M, Almeida L. Differential effect of university entrance score on first-year students’ academic performance in Portugal. Assessment & Evaluation in Higher Education. 2019;44:610–622.
View Article
Google Scholar

[144] View Article

[145] Google Scholar

[ref47] 47. Guedes AC, Cribari-Neto F, Espinheira PL. Modified likelihood ratio tests for unit gamma regressions. Journal of Applied Statistics. 2020;47:1562–1586. pmid:35707584
View Article
PubMed/NCBI
Google Scholar

[147] View Article

[148] PubMed/NCBI

[149] Google Scholar

[ref48] 48. Chen G, Balakrishnan N. A general purpose approximate goodness-of-fit test. Journal of Quality Technology. 1995;27:154–161.
View Article
Google Scholar

[151] View Article

[152] Google Scholar

[ref49] 49. Marinho PRD, Silva RB, Bourguignon M, Cordeiro GM, Nadarajah S. AdequacyModel: An R package for probability distributions and general purpose optimization. PloS one. 2019;14:e0221487. pmid:31450236
View Article
PubMed/NCBI
Google Scholar

[154] View Article

[155] PubMed/NCBI

[156] Google Scholar

[ref50] 50. Ferrari SLP, Cribari-Neto F. Beta regression for modelling rates and proportions. Journal of Applied Statistics. 2004;7:799–815.
View Article
Google Scholar

[158] View Article

[159] Google Scholar

[ref51] 51. Bayes CL, Bazán JL, De Castro M. A quantile parametric mixed regression model for bounded response variables. Statistics and its interface. 2017;10:483–493.
View Article
Google Scholar

[161] View Article

[162] Google Scholar

[ref52] 52. Mousa AM, El-Sheikh AA, Abdel-Fattah MA. A gamma regression for bounded continuous variables. Advances and Applications in Statistics. 2016;49:305–326.
View Article
Google Scholar

[164] View Article

[165] Google Scholar

Figures

Abstract

1 Introduction

2 The unit ratio-extended Weibull family of distributions

3 Some special cases

3.1 The unit ratio-Gompertz distribution

3.2 The unit ratio-Burr XII distribution

3.3 The unit ratio-Lomax distribution

3.4 The unit ratio-Weibull and unit ratio-Rayleigh distributions

4 Maximum likelihood estimation

4.1 MLE for the distribution

4.2 MLE for the distribution

5 Simulation study

5.1 Numerical Analysis for the distribution

5.2 Numerical analysis for the distribution

6 Applications

7 Final remarks

Appendix

A—Alternative distributions fitted in the applications

Supporting information

S1 Data.

References