Abstract
We apply seven alternative tdistributions to estimate the market risk measures Value at Risk (VaR) and its extension Expected Shortfall (ES). Of these seven, the twin tdistribution (TT) of Baker and Jackson (in Twin t distribution, University of Salford Manchester. https://arxiv.org/abs/1408.3237, 2014) and generalized asymmetric distribution (GAT) of Baker (in A new asymmetric generalization of the tdistribution, University of Salford Manchester. https://arxiv.org/abs/1606.05203, 2016) are applied for the first time to estimate market risk. We analytically estimate VaR and ES over 1day horizon and extend this to multiday horizon using Monte Carlo simulation. We find that taken together TT and GAT distributions provide the best backtesting results across individual confidence levels and horizons for majority of scenarios. Moreover, we find that with the lengthening of time horizon, TT and GAT models performs well, such that at the 10day horizon, GAT provides the best backtesting results for all of the five indices and the TT model provides the second best results, irrespective period of study and confidence level.
Introduction
From its very beginnings in the 1980s ValueatRisk (VaR) as a measure of market risk has received widespread acceptance both amongst industry and regulators on account of its ease of calculation and intuitive interpretation. In its most basic form, VaR provides the worst possible loss at a given confidence level over a specific horizon. The main drawback of VaR, other than, that it is a single number is that there is no one accepted way of calculating it. It is possible that the use of different models will lead to different VaRs and that this could be very costly to financial institutions. In that, if VaR is over estimated, then the institution is tying of capital which it could use elsewhere for a higher return; or if it under estimates, then the firm is severely exposed to market down turns as it has not set aside correct amount of capital. The financial crisis of 2007–2008 has illustrated the drawbacks in stark terms of the VaR methodology and this has resulted in debate amongst academics, regulators and market practitioners. As part of this debate, the related measure to VaR, the expected shortfall (ES) is now given more prominence under Basel III.
Given the underlying nature of equity returns, forecasting of volatility is critical to the success of VaR models Siu (2018) and Chiou et al. (2009) amongst others. The volatility clustering resulting from infrequent large jump has been modelled using GARCH type process of Bollerslev (1986). This basic GARCH model leads to the development of more advanced models such as EGARCH, NGARCH, which are explicitly able to incorporate the skewness and excess kurtosis that are observed in equity returns.
To calculate VaR and ES, GARCH models need to be enhanced with more complex distributions. One such approach has been the use of the family of tdistributions. The student tdistributions have played particularly significant role in financial research as models for the distribution of heavytailed phenomena such as financial markets data. However, student tdistribution that allows for heavy tails than the normal, but assumes that the distribution is symmetric around zero. Huang and Lin (2014) compare the forecasting performance of several VaR models. Lin et al (2006) use historical simulation to estimate portfolio VaR. Baixaali and Alvarez (2006) consider the impact of excess kurtosis on VaR. Angelidis et al. (2007) examine different weighing schemes for robust VaR estimation. Wong et al (2012) model tail risk beyond VaR. The comparison focuses on the difference between normal distribution and student tdistribution. Mogel and Auer (2018) imply student t and extreme value theory to compute Value at Risk and compare them with historical simulation other approaches. Their results suggest that historical simulation outperforms EVTbased approach.
The student’s tdistribution can permit for kurtosis in the conditional distribution but not for skewness. Hansen (1994) was the first to propose a generalization of student’s tdistribution that allowed modelling skewness in conditional distributions of financial returns.
In this study we compare the performance of seven different tdistributions. The first is the standardized tdistribution (ST) used by Bollerslev (1987). The second is the Twin tdistribution (TT) of Baker and Jackson (2014). This distribution is heavytailed like a ST distribution but closer to the normality at the central part of the curve. The third distribution is the Generalized tdistribution (GAT) of Baker (2016). This distribution generalizes the tdistribution through two types of skewness. Fourth and fifth distributions are the Asymmetric exponential power distribution (AEP) and its special case (SEP) of Zhu and ZindeWalsh (2009). The sixth and seventh distributions are the Asymmetric student tdistribution (AST) and Skewed student tdistribution (SST) respectively of Zhu and Galbraith (2010).
Our analysis focuses on datasets of five major stock indices covering S&P500, FTSE100, NASDAQ100, NIKKEI225 and DAX30 for the period 1995–2014. Calculation of 1day ahead ES follows a twostage procedure. In the first step, an asymmetric GARCHtype volatility model is fitted to the historical data by maximum likelihood estimation. From this model, the socalled standardized residuals are extracted. The asymmetric GARCHtype model is used to calculate 1step predictions of conditional mean and conditional standard deviation. In the second step, various long tail and asymmetric distributions are applied to the standardized residuals and calculate with estimated parameters of distributions. Finally, 1 day ahead conditional expected shortfall \(ES_{t + 1}\) is calculated.
For the situation where the variance is time varying, going from 1dayahead to hdaysahead expected shortfall is not so straightforward. As in the case of GARCH, scaling by the horizon h is not attainable as variance mean revert. Additionally, the returns over the next h days are not normally distributed. To overcome this difficulty in calculating VaR and ES we use Monte Carlo simulation to generate the returns h ahead.
We find overall EGARCH (1,1) provides the best fit for volatility for the indices considered in this study. We find substantial evidence in the improvement of our results with the use of EGARCH(1,1) combined with GAT and EGARCH(1,1) combined with TTD. When we compare the GAT distribution proposed by Baker (2016) with AST distribution proposed by Zhu and Galbraith (2010) we find GAT outperforms AST by providing better fit to financial returns and more accurate forecast of the ES. As the empirical distribution of the financial returns has been reported to be asymmetric and shows a significant excess of kurtosis (Abad et al. 2014). The longer period ES forecasts is estimated using Monte Carlo Simulation with GAT, AEPD, SEPD, AST, SST, ST and TT as standardized distributions of returns for world’s major five stock indices (S&P500, FTSE100, NASDAQ100, NIKKEI225 and DAX30).
The contribution of this paper is as follows. First, our study provides further support for the usefulness and superiority of fat tailed distributions especially asymmetric distributions in the major stock markets. Second, it proposes the use of fat tailed distribution to measure financial risk for a longer horizon. In contrast to the current literature that mainly focuses on the 1 day ahead ES, our approach considers the usefulness of fat tail distribution for calculation of ES beyond 1day. To the best of our knowledge, our research is the first to consider two new distributions and compare them with other previous distributions for ES calculation.
The remainder of this paper is organized as follow: Sect. 2 addresses the methodological framework. Results are discussed in Sect. 3. Section 4 concludes the findings.
Methodological framework
Since its inception in the 1980s, VaR and its extension the ES have been the market risk measure of choice both for industry and regulators. To calculate market risk, we follow the risk measure of Dowd et al. (2008) and define \(M_{\varphi }\) as follows:
where \(q_{p}\) is the p loss quintile, \(\varphi \left( p \right)\) is a weighting function defined over the full range of cumulative probabilities \(p \in \left[ {0,1} \right]\) and \(M_{\varphi }\) is the class of quantilebased risk measures.
As noted by Dowd et al. (2008) VaR and ES constitute two wellknown members of this class. The VaR at confidence level \(\alpha\) with \(R_{t}\) as the index return in period t and \({{\Psi }}_{t  1}\) represents the information available at time \(t  1\) is defined as follows:
Moreover, each individual risk measure is characterised by its individual weighting function \(\varphi \left( p \right)\). The weighting function for VaR is a Dirac delta function that gives the outcome \(\left( {p = \alpha } \right)\) an infinite weight and zero weight for every other outcome.
The ES at confidence level \(\alpha\) is the average of the worst \(1  \alpha\) losses, which is defined as follows:
The weighting function for ES gives all tail quantiles the same weight of \(1/1  a\) and the nontail quantiles zero weight.
We define an asset’s return process at time t as follows:
where \({{\upsigma }}_{t}\) is the conditional volatility, \({{\upmu }}_{t}\) is the conditional mean of returns and \(z_{t}\) is an independent and identically distributed random variable that follows alternative tdistributions.
The key challenge in calculating VaR and other market risk measures is the modelling and estimation of the conditional volatility that incorporates the observed characteristics of share price and index returns such as volatility clustering, asymmetry and long memory. Since its introduction by Bollerslev (1986), the GARCH approach to modelling volatility has become popular, resulting in a wide range of alternative GARCH specifications being proposed.
VaR and ES calculation over single period
Following Christoffersen (2012) the calculation of VaR and ES follows a twostage procedure:

1.
A GARCHtype volatility model is fitted to the historical data by maximum likelihood estimation (ML). From this model, the socalled standardized residuals are extracted. The GARCHtype model is used to calculate 1step predictions of conditional mean \(\left( {\mu_{t + 1} } \right)\) and conditional standard deviation \(\left( {\sigma_{t + 1} } \right)\).

2.
Various long tail and asymmetric distributions are applied to the standardized residuals to calculate \(F^{  1} \left( p \right)\) with estimated parameters of the distributions. Finally, the 1day ahead conditional \(VaR_{t + 1}^{p}\) and conditional \(ES_{t + 1}^{p}\) are calculated based on the following formulae:
$$VaR_{t + 1}^{p} =  \mu_{t + 1}  \sigma_{t + 1} F^{  1} \left( p \right)$$(5)$$ES_{t + 1}^{p} =  E_{t} \left[ {R_{t + 1} R_{t + 1} <  VaR_{t + 1}^{p} } \right]$$(6)
Standardized tdistribution
Bollerslev (1987) used the standardized tdistribution with \(v\) > 2. The standardized tdistribution density with \(v\) > 2 is then:
where \(\varGamma \left( v \right) = \int_{0}^{\infty } {e^{  x} } x^{v  1} dx\) is the gamma function. \(v\) is the parameter that describe the thickness of tails. Corresponding conditional \(VaR_{t + 1}^{p}\) with \(t_{p}^{  1}\) as the pth quantile of student tdistribution and conditional \(ES_{t + 1}^{p}\) are:
where
The main drawback of the student tdistribution is that it is symmetrical while financial time series can be skewed.
Twin tdistribution (TT)
Baker and Jackson (2014) applied Johnson’s transformation to statistical modelling and construct a new long tailed distribution that is like the tdistribution. The t like distribution is useful for fitting data, as it is more normal in the body of the distribution but has the same power law tail behavior.
The probability density function is:
As \(v \to \infty\) the distribution becomes standard normal. The distribution function for \(x > 0\) is:
where \(S = \frac{{x^{2} }}{v}, C = \sqrt {1 + S^{2} }\), B is the beta function and I the regularized incomplete beta function.
Conditional \(VaR_{t + 1}^{p}\) and \(ES_{t + 1}^{p}\) of TT are:
where
\(F_{TT }^{  1}\) is the inverse of cdf \(F_{TT}\).
Generalized asymmetric tdistribution (GAT)
A 6parameter asymmetric fattailed distribution (GAT) is proposed by Baker (2016). The pdf of the GAT is:
where B is the beta function, ν > 0 controls tail power, µ is a centre of location (not necessarily the mean), ϕ > 0 is a measure of scale (but not the variance, which may not exist), r > 0 controls tail power asymmetry, c > 0 controls the scale asymmetry, and α > 0 controls how early ‘tail behaviour’ is apparent.
The cdf of the GAT distribution is:
where
Conditional \(VaR_{t + 1}^{p}\) and \(ES_{t + 1}^{p}\) of GAT are:
where
and \(F_{GAT}^{  1}\) is the inverse of cdf \(F_{GAT}\).
The asymmetric exponential power distribution (AEP)
The asymmetric exponential power distribution is proposed by Zhu and ZindeWalsh (2009).
where \(\beta = \left( {\alpha ,d_{1} ,d_{2} ,\mu ,\sigma } \right)\) is parameter vector, \(\mu \in R \,{\text{and}} \,\sigma > 0\) is still location and scale parameters respectively,\(\alpha \in \left( {0,1} \right)\) is skewness parameter.\(d_{1} > 0\) and \(d_{2} > 0\) are left and right tail parameters respectively, \(K_{EP} \left( d \right)\) is the normalizing constant is:
and \(\alpha^{*}\) is:
Note that:
The AEP density function is still continuous at every point and unimodal with mode at \(\mu\). The parameter \(\alpha^{*}\) in the AEP density provides scale adjustments respectively to the left and right parts of the density to ensure continuity of the density under changes of shape parameters \(\left( {\alpha ,d,d_{2} } \right)\).
The VaR and ES is computed analytically for the AEP distribution in Zhu and Galbraith (2011).
Conditional \(VaR_{t + 1}^{p}\) conditional \(ES_{t + 1}^{p}\) of AEP are:
where
\(Q\left( {\alpha ,x} \right)\) denotes the regularized complementary incomplete gamma function:
\(Q^{  1}\) denotes the inverse of \(Q\left( {\alpha ,x} \right)\) and \(\varGamma\) is gamma function:
Skewed exponential power distribution (SEP)
Skewed is the special case of AEP proposed by Zhu and ZindeWalsh (2009), if \(d_{2} = d_{1} = d\) implying \(\alpha = \alpha^{*}\) The AEP reduced to SEP:
The SEP density is skewed to the right for \(\alpha < 1/2\) and to the left for \(\alpha < 1/2\).
Conditional \(VaR_{t + 1}^{p}\) and \(ES_{t + 1}^{p}\) of SEP are:
where
Asymmetric student tdistribution (AST)
AST proposed by Zhu and Galbraith (2010) and density function is defined as:
\(\alpha \in \left( {0,1} \right)\) is skewness parameter.\(v_{1} > 0\) and \(v_{2} > 0\) are left and right tail parameters respectively.
where Γ \(\left( . \right)\) is gamma function and \(\alpha^{*}\) is:
Denoting by µ and σ the location (centre) and scale parameters, respectively, the general form of the AST density is expressed as \(\frac{1}{\sigma }f_{AST} \left( {\frac{x  \mu }{\sigma };\alpha ,v_{1} ,v_{2} } \right)\).
Note that
Conditional \(VaR_{t + 1}^{p}\) and \(ES_{t + 1}^{p}\) of AST are:
where
where \(S_{v} \left( . \right)\) is the cumulative distribution function of the standard student tdistribution with ν degrees of freedom and \(S_{v}^{  1}\) is its inverse.
where \(q = VaR_{AST} \equiv F_{AST}^{  1}\).
Skewed student tdistribution (SST)
By letting \(v_{2} = v_{1} = v\) and \(\alpha^{*} = \alpha\) in AST by Zhu and Galbraith (2010), we obtain new parameterization of skewed student tdistribution (SST):
Conditional \(VaR_{t + 1}^{p}\) and \(ES_{t + 1}^{p}\) of SST are:
where
where \(S_{v} \left( . \right)\) is the cumulative distribution function of the standard student tdistribution with ν degrees of freedom and \(S_{v}^{  1}\) is its inverse.
where \(q = VaR_{SST} \equiv F_{SST}^{  1}\).
Term structure of risk: VaR and ES calculation over multiperiod
To date majority of studies have focused on single day market risk estimation. Currently the most popular method is the squareroot rule that is applied over short time horizons. If we consider a simple case of normal distribution with a constant variance \(\sigma_{PF}^{2}\), per square–root rule, the VaR and ES for returns over the next \(h\) days calculated on day \(t,\) as:
However, given the dynamic nature of variance, moving from one period ahead to multi period hdays ahead is not straightforward because scaling variance as modelled by GARCH processes is not mean reverting with the returns over the next h days are not normally distributed. This drawback means that Monte Carlo simulation needs to be used to calculate VaR and ES over multiperiod horizon. We follow Christoffersen (2012) in simulating the index returns having first estimated the underlying GARCH model parameters. Further details on the simulation methodology can be found in Christoffersen (2012). Based on simulated returns over hdays \(\left\{ \check{R}_{i,t + 1:t + h} \right\}_{i = 1}^{MC}\), the VaR and ES over period h is:
where \(1\left( \cdot \right)\) takes the value 1 if the argument is true and zero otherwise and MC denotes the number of draws.
Backtesting risk models
Bootstrap test for the expected shortfall
To evaluate ES we first use McNeil and Frey (2000) test for zero unconditional mean. The test focuses on the discrepancy between the observed return and the ES forecast for the periods in which the return exceeds the VaR forecast, the assessment of ES forecasts is not independent of the VaR forecasts. McNeil and Frey (2000) defined residuals as:
According to McNeil and Frey (2000) these residuals are iid and conditional on \(X_{t + 1} > x_{q}\) or equivalently \(Z_{t + 1} > z_{q}\) being the qquantile of Z. Based on our stock price data and our estimates of expected shortfall, we can construct the corresponding residuals on days when violation occurs. McNeil and Frey (2000) call these residuals exceedance residuals and denote them by:
where
Under the null hypothesis that we estimate \(\mu_{t + 1} ,\)\(\sigma_{t + 1}\) and the expected shortfall correctly, these residuals should behave like an iid sample from a random variable with mean zero and the alternative hypothesis is that the residuals have a mean greater than zero (McNeil and Frey, 2000).
MAE for backtesting ES
We evaluated the expected shortfall as measure of downside risk based on the mean absolute error defined as
where \(ES_{t + i} \left( q \right)\) is the expected shortfall as measure of downside risk and \(R_{t + 1}\) are observed returns and N is the number of observations. The model with minimum MAE value is preferred to the other models.
Empirical results
Data and preliminary analysis
The data for this study comprises of five global stock indices, including S&P500, FTSE100, NASDAQ100 – comprising of nonAmerican and nonfinancial top 100 companies on the NASDAQ exchange, NIKKEI225 and DAX30. All data is obtained from Datastream. For all the indices, the sample comprises of 18 years of daily observation from 1995 to 2013 with a total of 4698 daily observations. The continuously compounded returns are calculated as the logarithmic difference of daily closing price multiplied by 100.
The summary statistics are presented in Table 1. The value of skewness is negative for all return series, indicating an asymmetry in the distribution of return. A negatively skewed distribution or skewed to the left has a longleft tail. All our data series are characterized by many small gains and a few extreme losses. The kurtosis of our data set is greater than 3 and reflects fat tails. We reject the null hypothesis of the normal distribution as the p value for Jarque–Bera (1980) test is less than 0.05. Jarque–Bera test confirms that all return series have nonnormal distributions. The Ljung–Box (1978) Qstatistics reported in Table 1 for both returns and squared returns for all data series also reject the null hypothesis of no autocorrelation through 20lags at a 5% significance level.
Parameter estimation of distributions of return
Specifically we estimate the parameters of the following seven models: Standardized tdistribution (ST), Twin tdistribution (TT) of Baker and Jackson (2014), Generalized asymmetric distribution (GAT) of Baker (2016), Asymmetric exponential power distribution (AEP) of Zhu and ZindeWalsh (2009), Skewed exponential power distribution (SEP) and the special case of AST, the Skewed Student tdistribution, Asymmetric Student t distribution (AST) of Zhu and Galbraith (2010). The estimation procedure is as follows:

Given the specific ith t distribution with parameter \({{\Theta }}^{\left( i \right)} {\text{for }}1 \le i \le 7\), we identify the underlying GARCH process using the likelihood method. Then use the estimated GARCH process to generate the conditional volatility (\(\hat{\sigma }_{t}\)) and the conditional mean (\(\hat{\mu }\)).

We obtain standardized residuals \(\hat{z}_{t}\) as the function of \(\hat{\mu }_{t}\) and \(\hat{\sigma }_{t}\).

We define F_{(i)}(.) as the CDF of ith candidate t distribution, and Q(.) as the quantile function of z_{t}. The desired \({{\Theta }}_{\left( i \right)}^{:*}\) should satisfy
$${{\Theta }}_{\left( i \right)}^{*} = \begin{array}{*{20}c} {\arg { \inf }} \\ {{{\Theta }}_{\left( i \right)}^{*} } \\ \end{array} \left {Q\left( {\hat{z}_{t} } \right)  F_{\left( i \right)}^{  1} \left( {\hat{z}_{t} } \right);{{\Theta }}_{\left( i \right)} } \right, 1 \le i \le 7$$(30)
The parameter estimates are reported in Tables 2, 3, 4, 5 and 6 with the loglikelihood values, AIC and BIC values. The bold values of AIC and BIC criteria in all tables represent top three best models for the specific data set. Per AIC and BIC values in Table 2, the bestfitting models for S&P 500 data are the Generalized asymmetric t distribution (GAT), Student tdistribution (ST) and Twin tdistribution (TT). All above models have lowest AIC and BIC while, asymmetric tdistribution (AST) and skewed exponential power distribution (SEP) have highest AIC and BIC value respectively.
Examining the values of AIC and BIC in Tables 3, 4, 5 and 6 for the indices FTSE100, NASDAQ100, NIKKEI225 and DAX30, we observe that the generalized asymmetric tdistribution (GAT),^{Footnote 1}^{,}^{Footnote 2} Student tdistribution (ST) and Twin tdistribution (TT) are the top three models respectively except for FTSE100 where AEP is the second best model. When we compare GAT and AST models, we find that GAT significantly outperforms AST. AEP model as an alternative to AST and GAT performs better that AST but under performs GAT. We see that the best fitting models for all our indices data sets are our two new distributions GAT and TT and Standardized Student t distribution. Overall the GAT distribution is the best model, as it has many advantages over Standardized Student tdistribution. Standardized tdistribution does not support asymmetry. Neither of the Zhu–ZindeWalsh (2009) Asymmetric exponential power distribution and Zhu and Galbraith (2010) asymmetric tdistribution provide the best fits to the models. Per AIC and BIC, the new Twin tdistribution also performs better than Asymmetric t distribution and Exponential power distribution for all data sets. To summarize based on the AIC and BIC criteria GAT, TT and ST provide a better fit than AST and AEP and their skewed versions.
Oneday ahead expected shortfall backtesting
To test the validity of the different tdistributional assumptions on the five different indices, the VaR and ES at different confidence levels ranging from 5% to 0.5% are calculated. As the existing literature has demonstrated VaR has serious drawbacks, we therefore only evaluate different ES models to assess market risk. As stated earlier the competing risk models are ST, TT, GAT, AEP, SEP, AST and SST, with the TT and GAT used for the first time in the literature to calculate market risk as calculated by VaR or ES.
To evaluate the ES forecasts, we first use McNeil and Frey’s (2000) bootstrap test. The high p values given by this test speak in favour of a model, while low p values speak against a model. The results indicate that AST, AEP and GAT have highest p values which is significantly higher then of 0.01. As indicated by McNeil and Frey’s (2000) that an assumption of normality always fails the test with p values in all cases much less than 0.01. In our case p values for all distribution is much higher than of 0.01 in almost all the cases. The results, which are shown in Table 7, and clearly provide insight for the use of asymmetric distributions for the risk analysis purpose. However, this test provides little information into the relative performance of the methods. This motivates the use of an additional approach to evaluating ES forecast accuracy.
We compare ES by using MAE that calculates the difference between the actual and the expected losses when a violation occurs. The small value of calculated mean absolute error and the mean squared errors appear small enough to suggest that the best fitting models are reasonable.
Table 7 contains the performance results for all the models and indices, with each of the panels containing the results for each of the indices across the seven models. From Panel A, we see that for the S&P500, at the 5% level GAT, TT and SST provide the best fit. At the 2.5% level, again results indicate that GAT, TT and SST models providing the best fit. However, at the 1% or 0.5% level SST model provides the best fit by outperforming the other models. At 1% and 0.5% SEP is the second best model.
Panel B contains the results for the FTSE100 index, up to 2.5% confidence level GAT, ST and TT model outperforms all other models. However, at the 1% and 0.5% level GAT, TT are the best performers. Panel C comprises of the NASDAQ backtesting results where we find that at 5% GAT provides the best fit, while SEP and AEP are second and third best models. At 2.5%, 1% and 0.5% again GAT, SEP and AEP are the outperformers, however SEP outperforms GAT and AEP marginally. Moreover, AST performs better then TT, SST and ST. In Panel D we present the results for NIKKIE225 and note that our proposed GAT model outperforms all other models at 5%. At 2.5% only SEP outperforms GAT. However, at 1% and 2.5% both SEP and AEP perform better than of GAT. Panel E indicate the results for DAX30 GAT, TT and SEP are the best performing models at 5% and 2.5%. At 1% TT, SEP and SST are the best performers. AT 0.5% SEP, SST and AEP perform better than GAT and TT.
To summarize our key results:

(i)
GAT model and TT models are in the top three models at 5% and 2.5% confidence level in almost all cases.

(ii)
AST model have highest values of MAE for almost all datasets and significance levels except NASDAQ in panel C.

(iii)
The skewed version of AST model (SST) is amongst the models with the highest MAE values except S&P500 in panel A, where it is third best model after GAT and TT.

(iv)
AEP model as alternative to asymmetric distributions performs better than the AST, but GAT model clearly outperforms AEP in most of the cases.

(v)
The skewed version of AEP model (SEP) performs better than of the skewed version of AST model (SST) in most of the cases. For NASDAQ, NIKKIE300 and DAX225 it is among the top three models.

(vi)
The results of MAE indicate different model ranking for the same confidence level. However, for most of the cases GAT remain in the top three models.

(vii)
These results give us a strong indication that new parameterization of generalized asymmetric distribution provides valuable improvement in the results. When we compare ES backtesting for two asymmetric tdistributions, MAE of GAT are significantly lower than that of AST. These results indicate strong implication for further research for use of asymmetric tdistribution as ES measure.
Based on the ES backtests conducted, we conclude that the GAT model by Baker (2016) outperforms the competing AST by Zhu and Galbraith (2010) model by a significant margin. As an alternative to asymmetric tdistribution AEP model also underperforms GAT model.^{Footnote 3}
Multiperiod horizon ES backtesting
Table 8 contains the backtesting results across 4 days and 10 days for each of the indices across all of the models. Regarding the results for the 5day horizon, we find that for the S&P500, SST model outperforms all other models up to 1% level. However, at the 0.5% level GAT model outperforms all other models. SST and GAT are among the top three outperformers for all the significance level while TT is among top two best model for 2.5% and 1% significance level.
From Panel B, we see that for the FTSE100 index, up to 2.5% level, SST is the best performer, followed by GAT at 1% and 0.5%. Looking at Panel C and D, we see that for the NASDAQ100 and NIKKIE225 at all significance level GAT, SST and TT are the best performers. At 5% and 2.5% SST outperforms GAT and TT. However, at 1% and 0.5% GAT has smallest MAE than of SST and TT. In panel C the results for DAX30 indicates that GAT, TT and AEP are the top three models for all significance levels.
From Table 8, with regards to the 10day horizon, results are straight cut, for the S&P500, FTSE100, NIKKEI225 and DAX30, GAT provides the best result across all confidence levels with the TT providing the best fit for the remaining index NASDAQ100. Our results for the predicted ES for 5days and 10days can be summarized as follows:

(i)
At the 5day horizon results are mixed with the SST being the best performer up to 1% level in majority of the cases. However, at 0.5% confidence level GAT is the best performer. Overall, GAT remains in the top three models based on the lowest MAE value.

(ii)
When we increase the number of horizons to 10days, MAE values clearly suggest GAT as the best model for almost all data sets.

(iii)
Both AEP and SEP perform very poorly to forecast ES for both 5days and 10days horizon at various significance levels.

(iv)
AST model has highest MAE value in most of the case for both 5days and 10days horizon.

(v)
We can infer that results of ES models are not similar across different time horizons. However, the satisfactory predictions of the GAT are in accordance with the findings of 1day ahead ES evaluation. Again, like 1day ahead GAT model out performs AST model and give clear implication for the use of GAT distribution for risk forecasting.
Concluding remarks
The recent crisis has highlighted the weaknesses of VaR as a market measure of risk. This has resulted in the related superior measure ES being given more prominence under Basel III (Basel Committee on Banking and Supervision 2013, 2017). Previous studies have focused on VaR and more specifically on a single day VaR. This study has sought to complement earlier studies by expanding market risk measures to ES over multiday horizon using seven different models that incorporate the observed empirical characteristics of equity returns as noted by Kellner and Rosch (2016) who recommends that only models which allow for heavy tailed and/or skewness can accurately estimate both VaR and ES.
In this study we make a number of contributions. First, we found that when seven different models based on alternative tdistributions were fitted to the standardized residuals, we found that our two new proposed models Generalized asymmetric tdistribution (GAT) of Baker (2016) and Double tdistribution (TT) of Baker and Jackson (2014) provided the best fit, with GAT model being overall the best model. Moreover, surprisingly the Standard tdistribution outperformed many of the more complex tdistributions.
Second, the performance of the ES models are dependent on the market and the confidence level, particularly so at the 1day and 5day horizons. This result would indicate that for short horizons, risk managers and regulators should use a variety of models and check the accuracy of each model specific to each index and constantly reassess the validity of each model. For longer horizons we find that our new proposed models GAT outperformed all the models considered in this study. This would indicate that for longer horizons, risk managers should focus on a single model, rather than a number of alternative models.
Third, complex models do not always lead to best fits or backtesting results. For example, in many cases the Standardized tdistribution outperforms the more complex Asymmetric exponential power distribution (AEP) of Zhu and ZindeWalsh (2009). These findings are further reinforced by the outperformance of by our simpler GAT and TT distributions across different horizons, confidence levels and markets.
Finally the backtesting results indicates a wide variation of ES values across different models and indices. Given that the VaR and ES values form the basis of regulatory capital allocation, it is imperative that the most accurate model with the lowest estimated VaR and ES are used by both regulators and managers as the wrong model may mean either capital is not efficiently used or insufficient capital is set aside. In this regard, our GAT model provides a reliable alternative to many of the existing models in that it is overall the best performing model across different confidence levels, different horizons and different indices.
Notes
 1.
The flexibility of GAT distribution allows us to set \(\alpha = 1\), leading to a 5parameter distribution that turns out to fit returns data almost identically well as the AST distribution. On the other hand, by allowing \({{\upalpha }}\) to deviate from the unity, we have a more general distribution that fits the data better. As α increases, the fatness of the tails decreases, while the powerlaw behaviour remains the same. In this study we allow \({{\upalpha }}\) deviate from the unity to fit the data with fatter tails. AST and GAT distribution behaviour are compared by fixing \({{\upalpha }} = 1\).
 2.
According to Baker (2016) the AST distribution by Zhu and Galbraith (2010) has discontinuity in the second derivative of the loglikelihood function, as a result the usual regularity conditions for maximum likelihood estimation are not satisfied and makes inference for parameter values difficult. This is a real problematic issue for estimation of standard error because it relies on the second derivative of the loglikelihood. On the other hand, GAT does not have the same inferential problems, as the loglikelihood function has no discontinuities in derivatives. When we compare GAT with Azzalini (2015) skewnormal distribution it reveals that the derivative of the loglikelihood with respect to the skewness parameter is zero when the parameter is zero (the skewnormal reduces to a normal distribution). GAT distribution does not have this problem (Baker 2016). The parameter r controls the asymmetry, with \({\text{r}} = 1\) for a symmetric distribution. We can also fit GAT distribution by setting \({{\upalpha }} = 1\) and \({\text{r}} = 1\) with only four parameters floated as many skew distributions require only \({{\upmu }},\phi , {\text{v }}\) and c parameter so that skewness is modelled purely by having different probability mass in the two tails.
 3.
To further test the robustness of our results, we created subsample for the whole period excluding the three financial crisis period and subsample for each of the financial crisis periods. We found that the performance of the models were independent of the sample period, i.e. GAT distribution was overall the best performer regardless of the sample period. However, magnitude of the risk measures VaR and ES decreased when we excluded the crisis periods from our sample and correspondingly they increased during each of the financial crisis periods.
References
Abad P, Benito S, Lopez C (2014) A comprehensive review of value at risk methodologies. Span Rev Financ Econ 12:15–32
Angelidis T, Benos A, Degiannakis S (2007) A robust VaR model under different time periods and weighting schemes. Rev Quant Finance Account 28:187–201
Azzalini A (2015) Skewnormal family of distributions. Wiley StatsRef: Statistics Reference Online. Wiley, Hoboken
Baixaali JS, Alvarez S (2006) Evaluating effects of excess kurtosis on VaR estimates: evidence for international stock indices. Rev Quant Finance Account 27:27–46
Baker RD (2016) A new asymmetric generalization of the tdistribution. University of Salford Manchester. https://arxiv.org/abs/1606.05203
Baker RD, Jackson D (2014) Twin t distribution. University of Salford Manchester. https://arxiv.org/abs/1408.3237
Basel Committee on Banking Supervision (2013) Basel III: a global regulatory framework for more resilient banks and banking systems. www.bis.org. Accessed 21 Aug 2014
Basel Committee on Banking Supervision (2017) Revisions to the Basel III market risk framework Basel, Switzerland: BIS. www.bis.org. Accessed 10 Feb 2018
Bollerslev T (1986) Generalized autoregressive conditional heteroskedasticity. J Econom 31:307–327
Bollerslev T (1987) A conditional heteroskedastic time series model for speculative prices and rates of return. Rev Econ Stat 69:542–547
Chiou WJP, Lee CL, Lee CF (2009) Variation in stock returns risks: an international comparison. Revi Pac Basin Financ Mark Polic 12(2):245–266
Christoffersen P (2012) Elements of financial risk management, 2nd edn. Academic Press, San Diego, CA
Dowd K, Cotter J, Sorwar G (2008) Spectral risk measures: properties and limitations. J Financ Serv Res 34:61–75
Hansen BE (1994) Autoregressive conditional density estimation. Int Econ Rev 35:705–730
Huang YC, Lin B (2014) Valueatrisk analysis for Taiwan stock index futures: fat tails and conditional asymmetries in return innovations. Rev Quant Finance Account 22:79–95
Jarque CM, Bera AK (1980) Efficient tests for normality, homoscedasticity and serial independence of regression residuals. Econ Lett 6(3):255–259
Kellner R, Rosch D (2016) Quantifying market risk with valueatrisk or expected shortfall?—consequences for capital requirement and model risk. J Econ Dyn Control 68:45–63
Lin CH, Chien CCC, Chen SW (2006) Incorporating the timevarying tail fatness into the historical simulation method for portfolio valueatrisk. Revi Pac Basin Financ Mark Polic 9(2):257–274
Ljung GM, Box GEP (1978) On a measure of a lack of fit in time series models. Biometrika 65(2):297–303
McNeil AJ, Frey R (2000) Estimation of tailrelated risk measures for heteroscedastic financial time series: an extreme value approach. J Empir Finance 7:271–300
Mogel B, Auer BR (2018) How accurate are modern valueatrisk estimators derived from extreme value theory? Rev Quant Finance Account 50:79–103
Siu YW (2018) Volatility forecast by volatility index and its use as a risk management tool under ValueatRisk approach. Revi Pac Basin Financ Mark Polic 21(2):p1850010
Wong WK, Fan C, Zeng Y (2012) Capturing tail risk beyond VaR. Revi Pac Basin Financ Mark Polic 15(3):1–25
Zhu D, Galbraith JW (2010) A generalized asymmetric student tdistribution with application to financial econometrics. J Econom 157:297–305
Zhu D, Galbraith JW (2011) Modelling and forecasting expected shortfall with a generalized asymmetric student t and asymmetric exponential power distribution. J Empir Finance 18:765–778
Zhu D, ZindeWalsh V (2009) Properties and estimation of asymmetric exponential power distribution. J Econom 148:86–99
Author information
Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Iqbal, R., Sorwar, G., Baker, R. et al. Multiday expected shortfall under generalized t distributions: evidence from global stock market. Rev Quant Finan Acc 55, 803–825 (2020). https://doi.org/10.1007/s11156019008601
Published:
Issue Date:
Keywords
 Generalize t distribution
 Asymmetric t distribution
 Expected shortfall
 EGARCH models
 Multidays ahead expected shortfall
JEL Classification
 C13
 C15
 C51
 C52
 C53
 C58
 G17