Figures
Abstract
Jet fuel plays a crucial role as an essential energy source in aerospace and aviation operations. The recent increase in fuel prices has presented airlines with the new challenge of managing jet fuel costs to ensure consistent cash flow and minimize operational uncertainties. The conventional risk prediction models used by airlines often assume that risks are normally distributed according to the classical Central Limit Theorem, which can lead to under-hedging. This paper proposes an innovative approach using the stable Paretian model to analyze the price return of jet fuel in large samples. It comprehensively compares the fitting effect of the stable Paretian distribution with that of the normal distribution based on specific criteria and non-parametric significance tests. Furthermore, it investigates the accuracy of risk measures such as Value at Risk (VaR) and Conditional Value at Risk (CVaR) predicted by both models. In addition to comparing differences in VaR between predicted values and actual values, this paper provides a more comprehensive comparison of risk measures under rolling window forecast situation. Results suggest that despite indistinguishable results in VaR backtest, the stable Paretian distribution has a overall better fitting effect as well as a less biased predicted CVaR based on the AIC of -14099.46, BIC of -14110.98, p = 0.58 in Kolmogorov-Smirnov test and p = 0.46(0.92) in the 0.01(0.05) significance level of Expected Shortfall Regression Test. This might be explained by its ability to capture asset return dynamics while maintaining shape stability with few parameters. This research can provide valuable insights for guiding airlines’ risk management decisions. its ability to capture asset return dynamics while maintaining shape stability with few parameters. This research can provide valuable insights for guiding airlines’ risk management decisions.
Citation: Lin S, Zhang S, Wang C, He F, Xu Z, Zhang Y (2024) Modeling the distribution of jet fuel price returns based on fat-tail stable Paretian distribution. PLoS ONE 19(10): e0309975. https://doi.org/10.1371/journal.pone.0309975
Editor: Zhengmao Li, Aalto University, FINLAND
Received: April 28, 2024; Accepted: August 21, 2024; Published: October 30, 2024
Copyright: © 2024 Lin et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Observational data are available from U.S energy information administration, https://www.eia.gov/dnav/pet/hist/LeafHandler.ashx?n=PET&s=EER_EPJK_PF4_RGC_DPG&f=D.
Funding: This research is supported by the Fundamental Research Funds for the Central Universities (Grant Number: PHD2023-033 & 24CAFUC04013) The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
1. Introduction
Fuel procurement costs account for a large part of the major costs of airlines [1]. Keeping jet fuel costs less volatile or more predictable reduces cash flow uncertainty, which is good for financing. Therefore, large airlines usually take corresponding measures to hedge the price risk of jet fuel [2]. Specifically, for a given transportation demand, if the purchase volume is treated as an exogenous variable, then the annual jet fuel cost is determined by the purchase volume multiplied by the purchase price per gallon, so stabilizing the purchase price is key to reducing fuel cost volatility. Indeed, the profits of large airlines are typically very sensitive to changes in fuel prices. In the case of China Southern, Asia’s largest airline by passenger traffic for more than four decades, every 1% increase in fuel prices last year reduced profits by more than 2%. The reason is not only that jet fuel, as the primary energy source, is irreplaceable under current technological conditions, but also that the means to reduce consumption are very limited [3]. The jet fuel consumption efficiency of Chinese airlines in the past few years has remained relatively stable at about 0.3 kg per ton-km, as shown in Fig 1. Therefore, the implementation of risk management to reduce price volatility seems to be the most sustainable mechanism at present. Scholars have proposed various hedging strategies to stabilize the fuel cost [4–6]. However, research focusing on the return distribution of jet fuel is rare. Despite the price movement of some types of energy tends to correlate with each other such as jet fuel with crude oil, price variation of jet fuel can be special when the price of crude oil surges or intensive military operations emerge. Therefore, studying the price movement of jet fuel instead of another type of energy is necessary [7]. Compared with crude oil returns, the distribution of return of jet fuel prices is more likely to show fat-tail as well as asymmetry features, which might be explained by characteristics of concentrated buyer and seller as well as way of dealing, i.e. transaction of jet fuel usually happens over the counter [3]. Since airlines are so sensitive to the tail risk of return of jet fuel, the application of a proper probability model that captures statistical features including fat tail, asymmetry as well as maintaining shape stability [8] is necessary based on the logic of Generalized Central Limit Theorem and efficient market hypothesis. Thus part of the challenges to reach this goal is to observe the effectiveness of a stable paretian distribution model and conduct a comprehensive test for both fitting effect and forecast precision. By employing different goodness-of-fit tools along with risk-measure testing methods, we hope to overcome these challenges.
The initial step in effectively hedging the risk of jet fuel is to identify and calculate the risk that needs to be hedged. However, when airlines calculate price risk, most of them utilize traditional risk models which heavily rely on the assumption that the random variables, such as asset returns (interpreted as percentage changes in price), adhere to a normal distribution. Nevertheless, numerous empirical studies have concluded that asset returns frequently deviate from a normal distribution due to their leptokurtic, fatty-tailed, and asymmetric marginal distributions. In such cases, using usually distributed returns may result in underestimating the risk. Researchers have identified various probability distribution models—such as generalized hyperbolic, generalized asymmetric t (gat), extreme value or Laplace distribution—to describe non-normal patterns of return rates. However, as revealed by Calzolari et al., they either employ a greater number of parameters or lack shape stability when subjected to summation [8]. Furthermore, their grounded theorem lacks the strength of the Generalized Central Limit Theorem. Therefore, this study exclusively focuses on the stable Paretian probability model and utilizes it to characterize the stochastic process of return rates of jet fuel in comparison with the common Gaussian probability distribution [2, 3]. The utilization of a stable probability distribution family is prevalent in academic research due to its robust theoretical support, favorable mathematical properties with minimal parameters, and substantial empirical evidence [9–12]. The stable Paretian distribution, as a unique type of stable probability distribution family, has garnered significant interest from both theoretical and empirical researchers [13–15]. We do not assert that the stable Paretian distribution correctly captures patterns of fuel returns, as we believe no model can. However, despite its drawback of infinite variance, the stable Paretian distribution is supported by solid theorems, has few parameters, and possesses proper statistical properties that capture features such as asymmetry and fatty tails. This justifies its rationality for computing price risk and ultimately benefits corresponding risk management decisions.
Previous researchers have extensively explored the theoretical aspects of stable probability models. However, empirical studies on calculating price risk using stable distribution have been relatively rare until recent decades. This is mainly due to the lack of explicit expressions in most stable distributions and their infinite variance statistical properties, which require extensive calculations for large sample data and heavy reliance on computer technology. With recent advancements in theoretical research on stable probability models and the development of computer technology, empirical research based on stable distribution has started to emerge. The concept of stable distribution (also known as alpha-stable distribution) was first introduced by Levy in 1925 while he was investigating the properties of the sum of independent and identically distributed random variables. Nolan furthered the research based on his predecessors by establishing the characteristic equation for the stable distribution and providing a method for parameter estimation via asymptotic expression. This work lays a solid foundation for subsequent scholars to utilize the stable probability model [16]. Weidong Xu et al. have shown the superior descriptive capability of stable distribution in characterizing Chinese stock returns, as compared to the renowned Black-Scholes model [17]. Nolan advocated for the utilization of stable probability models in price dynamics due to their superior fit to data across a wide range. In contrast, normal models inadequately capture the probability density of empirical data, particularly by underestimating the likelihood of extreme values or outliers. The adoption of stable models holds significant potential for enhancing price risk management and insurance actuarial practices [18]. Royuela-del-Val innovatively programmed a superior algorithm which dramatically benefit the application of it in cognitive computing [19]. Masaru Shintani and Ken Umeno gave intuitive proof that, as long as we regard logarithmic returns of stock as independent random variables with the same power-law index α, summation of returns would converge to stable distribution even those random variables come from non-identical distribution [20]. Liu applied stable model to predict factors affecting roadway surrounding rock stability [21]. Samet et al. conducted their empirical research to model volatility of three types of energy futures through two time series models based on three different distribution assumptions. They concluded that APARCH model fits sample time series better in most cases under general extreme value distribution (GED) while GARCH model slightly outperformed under stable distribution. At the same time, risk measurements in the energy market tend to be significantly underestimated based on conventional models instead of that based on APARCH model and general extreme value distribution [22]. Liu Kwangil Bae found out that stable distribution is the possible reason for short-term momentum and long-term reversal [23]. Nolan illustrated the new numerical integration method to compute cumulative distribution functions(CDF) of the stable probability model thus further benefit the application of stable distribution in practice [24]. Tong Liu et al. combined the temper stable distribution and the CGARCH model to model volatility of asset returns via simulations and empirical data from Shanghai Stock Exchange. Both results indicate that CGARCH model with assumption that random variables follow a tempered stable distribution work better compared to CGARCH based on other distributional assumptions [25]. Andrei et al. employed the stable distribution for optimal portfolio construction by buying S&P500 sector ETFs, as a result they successfully developed an investment strategy which significantly outperform S&P500 index [26]. Meintanis validated data fitting effect of multivariate stable Paretian law through various goodness-of-fit tests [27].
As for work related to computation of energy risk, plenty of researches have been done recently. Muteba et al. found evidence of a structure break in the volatility process as well as leverage effects in data-driven research covering BRICS stocks and world crude oil prices [28]. Zhou et al. identified the volatility spillover effect in renewable energy market based on BEKK-GARCH and multidimensional analysis [29]. Samunderu and Murahwa found a more accurate computation for risk of bond index via combining EGRCH and a non-normality presumption [30]. Lin et al. attempts CVaR and MS-GARCH to model price risk of aviation fuel, which as a result effectively stabilized the volatility of fuel cost [6]. Jing adopted Asymmetric Laplace Distribution to effectively compute VaR and CVaR [31]. Yanto introduced Cluster-Based model to improve Efficiency of Fuel Budgeting [32]. Cao suggested a Multivariate GARCH to predict risk of jet fuel and implement composite associated hedging strategy [5]. Tang employed the Bayesian DCC-MGARCH and frequency connectedness methods to model risk among the green bond, clean energy, and fossil fuel markets [33]. Samunderu proposed a two-tier model to compute Jet Fuel Price Risk and proved it a valuable method to guide airlines’ risk management strategies [34].
Since the application of stable Paretian distribution to calculate potential future losses in jet fuel positions and the investigation of the accuracy of these calculations for sustainable airline management have not yet been thoroughly studied, this research aims to address this gap by conducting empirical research rather than solely making theoretical contributions. We aim to contribute to risk management by following design. First, we compare the goodness-of-fit result from the stable Paretion model and Gaussian model using information ratio, KS test, and stabilized probability plot, all results show an outperformance from Paretian model. Then we evaluate the accuracy of predicted risk measures using rolling window forecasts to obtain a series of out-of-sample prediction results, the discrepancies between predictions and actual losses are carefully observed and tested via both interval forecast coverage testing and regression test. While the prediction model shows no superiority in VaR testing, it gains a positive result from CVaR regression testing under 1% alpha level, which suggests a better effectiveness of the prediction model especially under extreme conditions.
2. Methodology
In this section, we offer a comprehensive explanation of the probability distribution used in our analysis to model logarithm returns of jet fuel and subsequently evaluate price risk. Additionally, we delve into the definition of probability distribution, conduct a goodness-of-fit test, and present various risk measures as part of our methodology.
2.1 Stable Paretian distribution
2.1.1 Definition.
Stable distributions, also known as alpha stable or Lévy stable distributions, constitute a class of probability density distributions with valuable mathematical properties that can effectively capture asymmetry, leptokurtic, and fat-tail regularities using only 4 parameters in addition to shape stability. In the context of this paper, the term "stable Paretian" specifically denotes non-Gaussian stable distributions, excluding the normal distribution which can be regarded as a special case of alpha stable distribution. The term "stable" signifies that the shape of X remains unchanged (only scaled and shifted) under addition. Similarly, the term "Paretian" is employed because tails of non-Gaussian stable distributions demonstrate a power-law behavior or asymptotically adhere to Paretian law. Various definitions for stable distribution exist due to prior research; therefore, we adopt the definition based on shape stability and characteristic function provided by John P. Nolan (1997) [16].
In shape stability definition, we define non-degenerate random variable(r.v.)X as stable there are real numbers cn > 0 in addition to dn ∈ R, if for all n > 1 such that
(1)
Where X1, X2,…,Xn are i.i.d copies of X. When dn = 0 for all n we define X as strictly stable, otherwise as quasi-stable, which is often the case including this work. As long as r.v. is quasi-stable, linear transformation would not change shape of its probability density distribution.
In characteristic function definition, the distribution of a random variable X is defined as stable and notates X ~ S(α, s defiδ1; 1) if it has characteristic function—the Fourier transform of probability density function below:
(2)
Where 0 < α ≤ 2, −1 ≤ β ≤ 1, γ ≥ 0, δ ∈ R, u ∈ R, , notice that first 4 Greeks are shape parameters. The first shape parameter α describes tail behavior, the second shape parameter β shows the skewness of the distribution. The third and fourth shape parameter γ and δ1 describe scale and location respectivelywhere δ1 is the expectation of population probability density distribution. Notice that when α = 2 the stable distribution becomes normal distribution, so the term “stable distribution” in following sections refers in particular to stable Paretian distribution(sometimes called non-Gaussian distribution)with the tail parameter α < 2.
The last integer number 1 denotes a certain way of parameterization considering there are over 10 ways of parameterization for stable probability family favored by different researchers at the mean time δ is defined differently under different parameterization. In fact, these diverse parameterizations had caused tremendous confusion which possibly tempered the application of it in empirical analysis. In some literature, researchers employed the notation S(σ, β, μ) for the class of stable laws, others adopted the notation S(α, β, 1). Considering those parameterizations are related to combination of historical evolution as well as mathematical analyze methods, here we decide to take the first modified parameterization notation brought up by Nolan as X ~ S(α, β, γ, δ1; 1) for following reasons. Usually, Greek α in distribution parameter is treated as different and fixed. Nonetheless all four parameters α, β, γ, δ are unknown and need to be estimated in empirical study of statistical reference, so the aforementioned notation expressed this idea clearly. Additionally, the scale parameter γ andlocation parameter δ iscommonly mistaken as standard deviation σ and expectation μ, which are not always true even in the normal probability distribution, so instead of adopting symbols σ for the scale and μ for the location, we take γ and δ. Lastly, the integer1 is used to distinguish among various parameterizations in which δ possibly means different. We believe everyone who adopt stable probability density distribution model should clearly state which parameterization is taken so that some misunderstanding could be avoid.
The advantage of this form X ~ S(α, β, γ, δ1; 1) includes the nice algebraic features along with simplicity of characteristic function, both of which leads to preferability. However, one might choose to employ other parameterizations if other needs are required like continuity everywhere.
Please note that for most stable distributions, there is no explicit closed formula available, with the exception of a few special forms such as the normal, Cauchy, and Lévy distributions. It is challenging to directly derive Value at Risk (VaR) and Conditional Value at Risk (CVaR) by integrating the density function; therefore, in the following experiment, we utilize random variable simulation to obtain predicted risk measures.
2.1.2 Parameter estimation.
There are several ways to estimate 4 parameters of certain stable distribution, In this paper we employ maximum likelihood (ML) estimation algorithm in Matlab software given by Nolan [16], however, the basic idea is the not much different as ML estimation in other model.
For a vector of observations X = (X1 + X2 + ⋯ + Xn), the ML estimation of the parameter vector θ = (the ML esti is obtained by maximizing the log-likelihood function, define the log-likelihood function:
(3)
The tilde denotes that no explicit density function hence we have to approximate it via judiciously chosen methods involving embedding characteristic function in the complex plane and employing contour integration. Since our work focus on empirical application instead of making contribution to theoretical field, elaboration of mathematics derivation is beyond our scope here.
2.1.3 Generalized Central Limit Theorem (GCLT).
Generalized Central Limit Theorem (GCLT) claims that the appropriate normalized sum of i.i.d random variables will converge to a stable distribution even for variables with infinite variance. Formal definition of GCLT could be found in work of Chen [35]. Recently Masaru Shintani and Ken Umeno further gave some proof that summation of asset logarithmic returns would converge to stable Paretian distribution even those random variables come from non-identical distributions as long as those distributions have same first shape parameter α [8].
In order to comprehend the rationale behind applying GCLT to model return rates of price data, it is imperative to assume the efficiency of the jet fuel market. According to the efficient market hypothesis, despite potential irrational behavior exhibited by individuals, the market as a whole accurately prices assets based on all available information. Consequently, changes in asset prices are indicative of new information rather than irrational behavior from any individual investor. Considering the diverse array of market participants and complex composition, the arrival of new information can be viewed as continuous over time, akin to fluctuations in price. Therefore, the distribution of fuel returns essentially represents the dissemination of novel information. If we consider a piece of new information randomly extracted from the information set within a small-time interval as an independent identically distributed (i.i.d) random variable, then the daily price return is the cumulative impact of all the new information on a given day; ultimately, the observed daily asset return is the aggregate result of numerous small components. According to GCLT, this aggregation converges to a stable distribution. Additionally, in a stochastic process based on a stable model, there are infinite numbers of jumps over any time interval, with most being small and approximating transitions in asset prices between neighboring decimalized prices. Those logics explain the rationality to model return rates of public traded asset with stable distribution, meanwhile as long as the first shape parameter α is not 2, which is true in most cases, this stable distribution can be called stable Paretian distribution.
While the assumption of complete independence of data is stringent, the efficient market hypothesis posits that in an efficient market for publicly traded assets, the current price incorporates all past information and expectations. Therefore, it becomes impossible to predict future price movements based on past data or intercorrelations among data. This is because if such predictions were feasible, individuals would act upon this information by buying when correlations indicate a future rise or selling when they do not. Consequently, asset prices would adjust towards a new equilibrium that reflects the correlations discovered by market participants. Furthermore, it is common to make assumptions about independent random variables in academic studies; for example, in transportation management, one often assumes that vehicle volume passing through a road or space during a certain time period follows specific models such as Poisson probability distribution for ease of computing sample statistics. However, a lower-than-anticipated number of vehicles during non-peak hours may be associated with a higher-than-expected number of vehicles during rush hour upon closer examination. In the realm of data mining, correlations can often be uncovered by delving deeply into the data. Therefore, for the sake of convenience and mathematical tractability, we make use of this assumption despite acknowledging its imperfections.
2.2 Goodness of fit test
2.2.1 Kolmogorov-Smirnov test.
In order to gather additional evidence supporting the superior performance of the stable Paretian probability model without relying on density formulas, we intend to conduct null hypothesis significance tests (NHST) for both sets of fitting results. Unfortunately, there is currently no specific NHST designed for the stable model. However, given that we are not making any underlying distributional assumptions about the data, nonparametric techniques are suitable choices for an unknown distribution. Considering that the application of a Chi-square test requires subjective partitioning to obtain different bins, which may result in varying outcomes based on how the data is grouped, we have opted for the versatile Kolmogorov-Smirnov (KS) test. This test has been proven to be applicable for all continuous distributions within our limited options.
KS test is a popular nonparametric NHST to test how well the empirical distribution function from actual data fit the theoretical distribution function based on the difference between their CDF, or to compare whether the distributions of two groups of samples are the same. The advantage of KS test is that it applies to all continuous distribution, so we employ this method to test whether the out of sample historical data fits the stable Paretian distribution or normal distribution. The null hypothesis and test statistic are:
(4)
(5)
(6)
(7)
S(x) = the empirical cumulative distribution function calculated from sample data, F0 (x) = the theoretical distribution. The test statistic Dn measures the maximum distance between the empirical and theoretical distributions, with Dn following the Kolmogorov distribution as n approaches infinity under the assumption that H0 is true. By comparing the test statistic Dn to the critical value Dnα, we can draw conclusions regarding whether the empirical distribution significantly differs from the theoretical distribution given a large sample size.
2.2.2 Stabilized probability plot.
The stabilized probability plot represents an advancement from the traditional percent-percent (PP) plot and serves as a graphical goodness-of-fit test. The PP plot is commonly utilized to illustrate the distribution pattern of data, with the cumulative proportion of the sample on one axis. In contrast to QQ plots, the abscissa of data points in PP plots depend solely on sample size, preventing clustering of points. The stabilized probability plot employs a sine transformation to stabilize variances of plotted points, thereby enhancing interpretability by maintaining approximately equal variances. Additionally, acceptance regions can be conveniently identified through the use of two straight lines. Therefore, apart from the KS test, we also apply stabilized probability plot to visually show stable Paretian distribution behave more aptly in this work. The sample statistic DSP which, analogous to the standard KS statistic Dn, is also defined to be themaximum deviation of the plotted points from theoretical values. In fact, Michael employed distribution function of two-sided KS test to determine the critical value of DSP. The abscissa and ordinate of PP plot and stabilized probability plot arecomputed in Table 1.
2.3 Risk measures
2.3.1 Value-at-risk(VaR).
Wall street institutions generally use value-at-risk VaR to measure the maximum possible loss value of assets during a specific time in the Future. In this work we only talk about the VaR expressing in rate of return, however it could be expressed in currency unit with more steps to compute. Supposing stands for assets,
stands for uncertainties with the probability density function p(S), x and s are decision vectors, f(x, s) stands forloss of assetsfor corresponding assets and risk factors. Given any probability level β(0<β<1) and time period t, the cumulative probability distribution function φ(x, α) such thatupper limit of possible losses of asset is smaller or equal to α is given by:
(8)
For simplicity, assuming p(S) and φ(x, α) are continuous everywhere. Then φ(x, α) full determines the dynamic of thisr.v., we can express the corresponding VaR as:
(9)
αβ is the = VaR under significance level β.
2.3.2 Conditional Value-at-Risk (CVaR).
In practical applications, VaR exposes its limitations, including(a) a lack of subadditivity and convexity; (b) there may be multiple local minimum values when minimizing the problem, which does not meet the requirements of risk ‘consistency’; (c)gives no information for the left part of distribution below α quantile. Then comes another method of risk measurement: Conditional Value-at-Risk CVaR(sometimes called Expecting Shortfall). Similarly, given the confidence level 1eβ and time period t, CVaR is the conditional expectation of the asset in condition that loss X is greater(in absolute value)than the corresponding VaR value αβ(x), the formula can be denoted as follows:
(10)
φβ(x) = defined as CVaR of assets under significance level β.
2.4 Evaluation of forecast risk measures
In order to better evaluate the effectiveness of predicted risk measures based on stable Paretian distribution, we utilize rolling window forecasts to generate a series of out-of-sample predictions for VaR and CVaR. Subsequently, we compare these risk predictions with actual losses to identify any discrepancies. Through these methods, we can assess whether forecasts derived from the new model are able to more accurately capture real losses or at least mitigate bias when unexpected severe risk events occur.
2.4.1 Coverage test for VaR.
For the VaR test, we choose the log-likelihood ratio of the unconditional coverage test given by Kupiec [36] and the log-likelihood ratio of the conditional coverage test proposed by Christoffersen [37]. Both unconditional and conditional coverage tests rely on the indicator sequence constructed from a given interval prediction used in dynamic situations. While the former focuses on average(unconditional) coverage, the latter is improved to detect whether the interval prediction considers clustered outliers because the prediction range should dynamically vary through time. If risk measures are correctly predicted, the test statistic log-likelihood ratio should follow a chi-square distribution. Actual loss that exceeds our prediction is also presented in the test to compare the performance of two models.
2.4.2 Dynamic quantile test for VaR.
We also utilize dynamic quantile(DQ) test designed by Eagle and Manganelli to check whether the probability of loss in every period exceeding the VaR is independent of past information [38]. This method models the quantile directly rather than modeling the whole distribution as VaR is tightly linked to the standard deviation of return distribution which should be autocorrelated during periods of volatility cluster. Therefore, DQ test constructed the conditional autoregressive value at risk (CAViaR) model which specifies the evolution of the quantile over time using an autoregressive process and estimates the parameters with regression quantile. A generic CAViaR specification is presented below:
(11)
where ft(β) denotes the θ quantile of the distribution of asset returns formed at time t, β is a vector of unknown parameters while l is a function of a finite number of lagged values of observables. The parameters are estimated by minimizing the regression quantile loss function, then a constructed test statistic following chi-square distribution is then computed to run significance test.
2.4.3 Regression test for CVaR.
For the CVaR test, we adopt the Expected Shortfall Regression Backtest (ESR) shed light on by Bayer and Dimitriadis [39]. This method can assess the correctness of forecasted CVaR risk measures. The advantage of this state-of-art backtest is that we can test with only forecasted CVaR one explanatory variable, which is quite convenient in contrast to other current backtest methods. The preliminary setup of ESR tests, also called auxiliary ESR test, is designed as a joint regression model for the VaR and CVaR. Regression models are presented as follows:
(12)
(13)
Where {yt} is the response data, and
is predicted VaR and CVaR respectively. Under the asymptotic theory they proposed, these two variables are collinear such that Eq 11 is simplified as
(14)
Eq 12 is also called the strict ESR test. Therefore, we only need return data as well as forecast CVaR to test whether there is any significant difference between predicted risk measures from two probability distributions. The test statistic is constructed as the Wald statistic and we implement the R package esback to complete corresponding procedures [39]. For properly forecasted CVaR, the intercept term should be zero, and the slope should be one, which means a failure to reject the null hypothesis.
3. Results and discussion
In this section, we begin by modeling the return rates of jet fuel using both stable Paretian distribution and normal distribution. The empirical analysis utilizes the daily logarithmic returns of Coast Kerosene-Type Jet Fuel Spot Price, which is available at the US Energy Information Administration from January 1, 2010 to June 30, 2023, encompassing over 3000 data points. The selection of Coast Kerosene-Type Jet Fuel as the representative jet fuel price is based on two primary reasons. Firstly, it is the predominant aviation fuel used for turbojet and turboprop aircraft. Secondly, the spot commodity price provides a sufficient sample capacity compared to futures prices used in other research due to derivatives eventually expiring.
To ensure the robustness of our findings for out-of-sample data, we partition our dataset into two segments. The first segment covers observations from January 1, 2010 to December 31, 2021 (3014 observations) and is utilized for in-sample distribution fitting including parameter estimation and statistical NHST. The second segment spans from January 1, 2022 to June 30, 2023 (373 observations) and serves as an out-of-sample test specifically designed to compare computational results of risk measures using Matlab [19–23].
3.1 Summary statistics of sample indicates a obvious non-normal pattern
Due to the inherent tendency for asset prices to exhibit inflation or non-stationarity, our focus is directed towards the return rates of jet fuel spot price, which can be interpreted as percentage change in price. Both ADF and KPSS test statistics confirm that the return data displays weak stationarity, while the price data does not. The in-sample summary statistics in Table 2 and Fig 2 presented below indicate a non-Gaussian pattern with high kurtosis in the sample statistics of price returns, along with extreme outliers exceeding 6 standard deviations—a phenomenon highly unlikely under normal distribution. Consequently, it is justified to employ a well-founded fat-tail probability model with shape stability to accurately capture the regularity of jet fuel return data.
3.2 Outperformance in visual distribution fitting and information criterion
With the theoretical support of GCLT and the intuitive proof provided by Masaru et al., we performed distribution fitting and parameter estimation based on the 1st sample. As a result, we obtained both probability model parameters in Fig 3.
The histogram displays fitted distributions for stable and normal in blue and red lines, respectively. It is evident that the blue line provides a superior fit. The point estimate of the alpha parameter is 1.63, indicating that the fitted distribution resembles a stable Paretian distribution, as commonly observed in empirical studies due to its heavier tail values and central values with fewer midrange values compared to the normal distribution. Given that stable Paretian models involve more parameters and thus fewer degrees of freedom, it is appropriate to utilize information criteria such as Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) to compare the performance of two models while considering penalties for additional parameters. Results from different criteria are provided as Table 3.
Based on log-likelihood and both Information Criterion, it is appropriate to conclude that the stable Paretian distribution provides a good fit. To enhance the credibility of this conclusion, we will conduct goodness of fit tests in subsequent sections to further demonstrate that the dynamics of jet fuel returns are more accurately represented by the stable Paretian probability distribution.
3.3 Further goodness of fit tests confirm significance in outperformance
As the distribution of data is assumed to be unknown, it is appropriate to employ a non-parametric approach such as the Kolmogorov-Smirnov test to further examine the significance of the outperformance. The findings are detailed in Table 4 for reference.
The p of the KS statistics for the in-sample returns, as indicated in the 2nd column of Table 3, is sufficiently small to reject the null hypothesis at both 1% and 5% alpha levels, indicating that the first sample data does not adhere to a Gaussian distribution. However, we are unable to reject the null hypothesis that the data follows a stable distribution given a p of 0.57. Additionally, we have conducted a graphical goodness-of-fit analysis to visually and directly demonstrate these differences as shown in Fig 3.
Fig 4(a) illustrates the disparity among the empirical CDF determined by actual data, the theoretical stable Paretian CDF, and the normal CDF. It is challenging to visually distinguish these CDFs from each other. Therefore, a logarithm transformation of data for x > 0 is performed and presented in Fig 4(b), where the blue line closely aligns with the red line. Fig 4(c) and 4(d) display stabilized probability plots for two probability models respectively. It is evident that all data points fall within the acceptance region under the stable CDF presumption on the right side, while a few points fall outside this region in the left picture. Consequently, it is reasonable to conclude from these tests that a stable distribution indeed provides a better description of the data, justifying further investigation into whether forecasted risk measures demonstrate greater accuracy in computing risk.
3.4 Investigate accuracy of risk measures from stable Paretian distribution
We obtain a series of out-of-sample VaR and CVaR values for jet fuel returns from 1 Jan 2021 to 30 Jun 2023using 250-day rolling window setup to visually indicate the differences in model outcomes, The conditional variance is inferred by the GARCH model step-by-step, so risk measures behave differently as the forecast interval becomes wide during volatility clustering. Through cutoff length function mentioned in the work of Vinogradov [40], a symmetric tail truncation is adopted to ensure stable Paretian model has a finite conditional variance that equals to forecast result from GARCH model. Furthermore, discrepancies between these risk predictions and actual losses are tested based on the methodology mentioned in section 2.4.
Three different methods, including the utilization of actual historical data, integration of density functions, and simulation due to the absence of a closed-form expression, are employed for computing risk measures in dynamic situations. Obtaining historical VaR and CVaR from out-of-sample data is a straightforward process after appropriate sorting. Additionally, measurements under the normal model can be directly integrated using the normal density function. However, generating and calculating VaR and CVaR from 1,000,000 simulated random variables following the stable Paretian probability distribution may be time-consuming. Furthermore, discrepancies between these risk predictions and actual losses are tested based on the methodology mentioned in 2.4. We draw the dynamic forecast risk measures in Fig 5 as well as the corresponding test results in Table 5 below.
To show when the loss exceeds thresholds, risk measures here are original negative returns instead of the absolute value. In those figures, the black line represents the actual historical returns from the 2nd sample, while the blue and red lines represent computed risk measures by stable and normal models respectively. Additionally, we tabulate the four types of backtesting results on discrepancies between predicted risk measures and actual losses to facilitate a comparison between the two models. A cursory examination of these results may be frustrating as stable Paretian distribution is less volatile and correlated with actual returns due to evident errors. However, the Gaussian distributed return model gives a more volatile forecast possibly because it is more adaptive to the conditional variance model which was initially constructed based on it. Despite less variation, the stable Paretian model, the blue line, starts with a more conservative prediction which might be capable of better covering loss when dramatic events happen. Meanwhile, there are some clear failures when actual returns move beyond the predicted risk VaR, which is reasonable given our sample size is 373 and confidence levels are 0.01 and 0.05.
In all three VaR test, we can see differences between the two models are so indistinguishable [36, 37]. As Table 4 shows, both models pass the interval coverage test as well as the DQ test though the result of from the stable model under 5% significance levels showing 0.09 p-value in unconditional coverage test, which means a rejection of the null hypothesis under 10% level. However, in the CVaR regression test, the stable model turns out to be superior to that from the Gaussian model under 1% alpha as both regression tests lead to rejection for the latter, which indicates that the regression coefficients are not 0 and 1, meaning a greater bias in risk prediction [39, 41]. The practical significance of risk estimation coincides with the conclusion in static situations that stable Paretian distribution gives a less biased CVaR forecast hence better models of fat-tail risk of jet fuel despite the insignificant underperformance in VaR test. Given the sensitivity of airline corporations to tail risk and the consequences of predictions not covering real losses can be severe, using stable Paretian probability distribution to model the return of jet fuel could be particularly beneficial especially when a hedging strategy to minimize CVaR is considered.
4. Conclusions
The majority of expenses in the airline industry are attributed to fuel costs. The increased volatility of jet fuel prices can result in greater uncertainty in cash flow and subsequently higher costs for capital acquisition. To mitigate price fluctuations, many airlines utilize a normal probability distribution assumption to model the stochastic process governing changes in jet fuel prices and guide corresponding hedging operations in accordance with the classical Central Limit Theorem (CLT). In this study, we have innovatively applied a stable Paretian model to fit the return rates of jet fuel data obtained from the U.S. Energy Information Administration, followed by contrasting results using non-parametric goodness-of-fit tests. The superior performance of our model is clearly evident both visually and statistically in the fitting of our distribution, as well as in pairwise comparisons using in-sample data. Additionally, we have assessed the accuracy of computed risk measurements in out-of-sample back tests, and our model demonstrates less bias towards actual numbers. All the aforementioned evidence suggests that computing price risk for jet fuel with a stable Paretian probability distribution, while not perfect, is superior to the common Gaussian probability distribution especially in extreme events. This could be valuable for making risk hedging decisions due to solid theoretical support from GCLT, shape stability, and proper statistical properties with parsimonious parameter numbers.
While this research is limited in its comparison of modeling effects to only two probability models, we believe that the choice to consider normal and stable distributions is justified due to their strong theoretical foundations, favorable statistical properties, and stability in summation. These factors elevate their suitability for application in research to a higher level compared to other probability models. In fact, these two patterns are so commonly observed in empirical data that some scholars have referred to them as "normal" or "stable" laws rather than theorems or models. In our upcoming research, we aim to broaden the scope of data analysis from commodities to publicly traded assets. A promising advancement in cognitive computation of stable models involves adjusting the slope rate at the tails to mitigate the impact of Paretian law. By accelerating the decay rate of the tails, higher moments can be rendered finite, thereby endowing our model with adjustable volatility through selection of different temper parameters. The current trend in research involves combining tempered stable probability distribution with heteroskedasticity processes such as GARCH for dynamic computation of marginal probability distribution with varying conditional mean and variance. These methods enable us to model and compute stochastic processes in a more dynamic and flexible manner, ultimately enhancing decision-making capabilities.
References
- 1. Swidan H, Merkert R. The relative effect of operational hedging on airline operating costs. Transport Policy. 2019;80: 70–77.
- 2. Hu R, Xiao Y, Jiang C. Jet fuel hedging, operational fuel efficiency improvement and carbon tax. Transportation Research Part B: Methodological. 2018;116: 103–123.
- 3.
Morrell Peter S. Airline Finance. 5th Edition. London: Routledge; 2021.
- 4. Swidan H, Merkert R, Kwon OK. Designing optimal jet fuel hedging strategies for airlines–Why hedging will not always reduce risk exposure. Transportation Research Part A: Policy and Practice. 2019;130: 20–36.
- 5. Cao M, Conlon T. Composite jet fuel cross-hedging. Journal of Commodity Markets. 2022; 100271.
- 6. Lin S, Wang M, Cheng Z, He F, Chen J, Liao C, et al. Risk Management of Fuel Hedging Strategy Based on CVaR and Markov Switching GARCH in Airline Company. Sustainability. 2022;14: 15264.
- 7. Naumann M, Suhl L. How does fuel price uncertainty affect strategic airline planning? Oper Res Int J. 2013;13: 343–362.
- 8. Calzolari G, Halbleib R, Parrini A. Estimating GARCH-type models with symmetric stable innovations: Indirect inference versus maximum likelihood. Computational Statistics & Data Analysis. 2014;76: 158–171.
- 9. Liu X, Xu S, Tang S. CFAR Strategy Formulation and Evaluation Based on Fox’s H-function in Positive Alpha-Stable Sea Clutter. Remote Sensing. 2020;12: 1273.
- 10.
Nouri AM, Zeitouny O, Alabdallah S. Neutrosophic Stable Random Variables. 2022;50.
- 11. Xu D, Yu W, Deng C, He ZS. Non-Contact Detection of Vital Signs Based on Improved Adaptive EEMD Algorithm (July 2022). Sensors. 2022;22: 6423. pmid:36080881
- 12. Zanette DH, Samengo I. The Emergence of the Normal Distribution in Deterministic Chaotic Maps. Entropy. 2024;26: 51. pmid:38248177
- 13. Francq C, Meintanis SG. Fourier-type estimation of the power GARCH model with stable-Paretian innovations. Metrika. 2016;79: 389–424.
- 14. Ortobelli S, Lando T, Petronio F, Tichý T. Asymptotic stochastic dominance rules for sums of i.i.d. random variables. Journal of Computational and Applied Mathematics. 2016;300: 432–448.
- 15. Pernagallo G, Torrisi B. An empirical analysis on the degree of Gaussianity and long memory of financial returns in emerging economies. Physica A: Statistical Mechanics and its Applications. 2019;527: 121296.
- 16. Nolan JP. Numerical calculation of stable densities and distribution functions. Communications in Statistics Stochastic Models. 1997;13: 759–774.
- 17. Xu W, Wu C, Dong Y, Xiao W. Modeling Chinese stock returns with stable distribution. Mathematical and Computer Modelling. 2011;54: 610–617.
- 18. Nolan JP. Financial modeling with heavy-tailed stable distributions: Financial modeling with heavy-tailed stable distributions. WIREs Comput Stat. 2014;6: 45–55.
- 19. Royuela-del-Val J, Simmross-Wattenberg F, Alberola-López C. libstable: Fast, Parallel, and High-Precision Computation of α-Stable Distributions in R, C/C++, and MATLAB. J Stat Soft. 2017;78.
- 20. Shintani M, Umeno K. Super Generalized Central Limit Theorem—Limit Distributions for Sums of Non-identical Random Variables with Power Laws. J Phys Soc Jpn. 2018;87: 043003.
- 21. Liu Y, Ye Y, Wang Q, Liu X. Stability Prediction Model of Roadway Surrounding Rock Based on Concept Lattice Reduction and a Symmetric Alpha Stable Distribution Probability Neural Network. Applied Sciences. 2018;8: 2164.
- 22. Gunay S, Khaki A. Best Fitting Fat Tail Distribution for the Volatilities of Energy Futures: Gev, Gat and Stable Distributions in GARCH and APARCH Models. JRFM. 2018;11: 30.
- 23. Bae K, Kang H, Kang J. Can fat-tail create the momentum and reversal? Applied Economics. 2020;52: 4850–4863.
- 24. Nolan JP. Computational aspects of stable distributions. WIREs Computational Stats. 2022;14.
- 25. Liu T, Shi Y. Innovation of the Component GARCH Model: Simulation Evidence and Application on the Chinese Stock Market. Mathematics. 2022;10: 1903.
- 26. Vasiukevich A, Pinsky E. Constructing portfolios using stable distributions: The case of S&P 500 sectors exchange-traded funds. Machine Learning with Applications. 2022;10: 100434.
- 27.
Meintanis SG, Nolan JP, Pretorius C. Specification procedures for multivariate stable-Paretian laws for independent and for conditionally heteroskedastic data. TEST. 2023 [cited 5 Apr 2024].
- 28. Muteba Mwamba JW, Mwambi SM. Assessing Market Risk in BRICS and Oil Markets: An Application of Markov Switching and Vine Copula. IJFS. 2021;9: 30.
- 29. Zhou W, Gu Q, Chen J. From volatility spillover to risk spread: An empirical study focuses on renewable energy markets. Renewable Energy. 2021;180: 329–342.
- 30. Samunderu E, Murahwa YT. Return Based Risk Measures for Non-Normally Distributed Returns: An Alternative Modelling Approach. JRFM. 2021;14: 540.
- 31. Jing H, Liu Y, Zhao J. Asymmetric Laplace Distribution Models for Financial Data: VaR and CVaR. Symmetry. 2022;14: 807.
- 32. Yanto J, Liem RP. Cluster-Based Aircraft Fuel Estimation Model for Effective and Efficient Fuel Budgeting on New Routes. Aerospace. 2022;9: 624.
- 33. Tang C, Aruga K, Hu Y. The Dynamic Correlation and Volatility Spillover among Green Bonds, Clean Energy Stock, and Fossil Fuel Market. Sustainability. 2023;15: 6586.
- 34. Samunderu E. Jet Fuel Price Risk and Proxy Hedging in Spot Markets: A Two-Tier Model Analysis. Commodities. 2023;2: 280–311.
- 35.
Chen LHY, Goldstein L, Shao Q-M. Normal Approximation by Stein’s Method. Berlin, Heidelberg: Springer Berlin Heidelberg; 2011.
- 36. Kupiec PH. Techniques for Verifying the Accuracy of Risk Measurement Models. JOD. 1995;3: 73–84.
- 37. Christoffersen PF. Evaluating Interval Forecasts. International Economic Review. 1998;39: 841.
- 38. Bayer S, Dimitriadis T. Regression-Based Expected Shortfall Backtesting. Journal of Financial Econometrics. 2022;20: 437–471.
- 39. Dimitriadis T, Bayer S. A joint quantile and expected shortfall regression framework. Electron J Statist. 2019;13.
- 40. Vinogradov DV. Arbitrary truncated Levy flight: Asymmetrical truncation and high-order correlations. Physica A: Statistical Mechanics and its Applications. 2012;391: 5584–5597.
- 41. Ambre D, Sheyyab M, Lynch P, Mayhew EK, Brezinsky K. A Raman spectroscopy based chemometric approach to predict the derived cetane number of hydrocarbon jet fuels and their mixtures. Talanta. 2024;271: 125635. pmid:38219321