1 Introduction

The severe drought in the US during the summer of 2012 coincided with price increases of corn, soybeans and other field crops by more than 50%. Figure 1 provides an illustration of this period by showing the contract price for the purchase of one bushel of corn and soybeans to be delivered at the end of 2012. Prices remained relatively stable during the first few months of 2012, while the level of precipitation in the Primary Corn and Soybean Belt was not much different from its level in previous years. June and July were however exceptionally dry months (bottom panel of Fig. 1).Footnote 1 As this drought was prolonged and growing in severity, prices increased rapidly. Price levels flattened only once the level of rainfall returned to its historical average in the late summer.

Fig. 1
figure 1

Futures prices and precipitation in 2012. Note: The upper panel shows prices of futures contracts for delivery at the end of 2012 from Thomson Reuters Datastream. The vertical grey area indicates a period of severe drought. The lower panel shows monthly area-weighted precipitation data for the Primary Corn and Soybean Belt. The bandwidth around the average corresponds to the historical lower and upper quartile. Precipitation data come from the U.S. Climate Divisional Database and are obtained from the National Climatic Data Center of the National Oceanic and Atmospheric Administration

Such extreme movements in agricultural commodity prices are anything but uncommon. For instance, between August 2007 and March 2008, the price of wheat almost doubled, but before the end of 2008, the wheat price returned to its original level. Another example is the price of corn, which fell a massive 55% in the second half of 2008 alone. These examples illustrate the highly volatile behavior of agricultural commodity prices.

The extreme price movements raise questions regarding the shape of the tail distribution of changes in agricultural commodity prices. Mandelbrot (1963a, 1963b), using Houthakker’s cotton price series, was the first to suggest that the tails of the distribution of commodity price changes diminish by a power instead of an exponent, as is the case under the more common (log)normal assumption.Footnote 2 If the tail of a distribution diminishes by a power, then the probability of variable \(\tilde {x}\) exceeding threshold u, if u is large, is distributed as

$$ \Pr (\tilde{x}>u)\sim Cu^{-\alpha }, $$
(1)

where C > 0 and α > 0 are referred to as, respectively, the scale and the shape parameter.Footnote 3 The distribution is named after Pareto who discovered that the upper tail of the income distribution follows a power law. Distributions with tails that obey the functional form in Eq. 1 are classified as heavy-tailed. Tails which follow a power law are in the end always fatter than thin-tailed distributions that decrease at an exponential rate.

Mandelbrot explained the origin of the heavy-tailedness of agricultural commodity prices by advancing that the physical world is full of heavy-tailed phenomena.Footnote 4 Such exogenous heavy-tailed shocks could trigger the heavy-tailedness of commodity price changes. But how a power law tail may arise endogenously in the market for agricultural commodities has not been formally investigated.

The main contribution of this article is twofold.

First, we show how fat tails in agricultural commodity prices may arise endogenously in a standard macroeconomic model. In the model, commodity price spikes occur as a result of adverse productivity shocks, due to, e.g., hurricanes, diseases and droughts.Footnote 5 In themselves these productivity shocks need not be (although they may be) heavy-tailed. This is the extra kick that our economic analysis provides. We show how the power law spikes observed in agricultural commodity prices can arise endogenously in the economy as productivity shocks feed through the system, even if the productivity shocks come from a thin-tailed distribution or a distribution with a bounded support.

Second, we provide new empirical evidence on how to use extreme value theory (EVT) to measure the extreme price risk of agricultural commodities using nearly ninety years of data. Differently from previous studies, we employ formal statistical tests to confirm the heavy-tailedness of price spikes for agricultural commodities. We use back-testing procedures to provide empirical evidence on the accuracy of the risk measures. We show that the non-normality of the return distribution strongly influences the level of the risk measures. Our empirical estimates provide a good indication of the size of the risks as measured by widely used and easily understood risk measures.

What lays at the origin of heavy-tailed phenomena in the economy is an intriguing question. The literature has proposed several mechanisms that can explain some of the heavy-tailed phenomena in the economy (for a literature review, see Gabaix, 2009). To provide a few examples: Gabaix et al. (2003) and Gabaix et al. (2006) suggest a mechanism where trading by large institutional investors – whose size distribution follows a power law tail – could be responsible for heavy-tailed stock market returns. Reed (2001) suggests that the power law in the cross-sectional distribution of the size of agglomerations, i.e., “Zipf’s Law”, could be the result of a process where each agglomeration grows with geometric Brownian motion and where the time of observation itself is a random variable that follows an exponential distribution. Toda (2014) and Toda and Walsh (2015) show in a general equilibrium model that this principle could also be responsible for the power law in the cross-sectional distributions for wealth and consumption. Similarly, our paper suggests a mechanism that can explain the origin of heavy tails in agricultural commodity prices.

A good understanding of the most extreme agricultural commodity returns is instrumental in any commodity risk management application. Strong agricultural commodity price movements hurt many of the industry’s stakeholders, including producers, processing firms, traders, consumers and institutional investors. Commodities are often included in investors’ portfolios because of their diversification benefits and inflation-hedging potential; see, e.g., Erb and Harvey (2006) and Gorton et al. (2012). The economic prosperity of both importing and exporting developing countries often depends on the price development of raw material commodities. More than fifty countries depend on only three or fewer commodities for more than half their total exports.Footnote 6 As a result these countries are very vulnerable to price volatility, see, e.g., Deaton (1999) or Balagtas and Holt (2009). The producers, processing industry and traders can be hurt when prices spike, while farmers can be highly vulnerable to commodity price decreases, which translate directly into a loss of income. Extreme commodity price swings may also hurt large institutional investors. Pension funds, hedge funds like CTAs, and fund-of-funds are known to invest heavily in commodity futures contracts. The risk management findings in this study are relevant for these affected parties.

Market participants need a risk quantification methodology to decide upon the optimal strategy. They should be able to answer questions like: How likely is a 10% drop in the corn price over the next week? Suppose my risk tolerance allows me to accept a 25% price increase, how likely is it that the wheat price will exceed this threshold next month? Is either the corn or the wheat price more likely to experience extreme price movements? What is the expected size of the maximum loss due to price risk during the next decade?

Such questions may be addressed by the use of EVT. The popularity of EVT to assess the risk of an extreme event has increased considerably. For example, EVT has been used to examine the severity of stock market crashes, the pricing of catastrophic loss risk in reinsurance or the extent of operational risk in banks (ECB 2006). EVT is particularly suitable for analyzing rare events when sample sizes are too small for determining the probability, extent or cause of the extreme returns using conventional statistical techniques. The semi-parametric EVT approach exploits the functional regularities that probability distributions necessarily display far from the center.

Interestingly, in spite of its growing recognition, application of EVT in agricultural price risk management has so far been sparse in the academic literature.Footnote 7Kofman and De Vries (1990) estimate the tail parameters for potato futures. Matia et al. (2002) estimate the tail parameters of a large number of general commodities. Their article provides no risk management applications, however. Krehbiel and Adkins (2005) apply EVT to four complex NYSE energy futures contracts to estimate various risk measures. Even so, their analysis is limited to oil and gas contracts, whose return distribution may be very different from those of renewable agricultural commodities. More recently, Morgan et al. (2012) use EVT on weekly data to estimate three different tail risk measures for corn and soybeans. Their thorough study is evidence of the growing interest in this topic.

2 Theory

We use a standard off-the-shelf two-sector macro model to describe how fat tails in agricultural commodity prices may arise endogenously. The economy in this model consists of households and firms. The households maximize their utility subject to their budget constraint by purchasing goods from firms while choosing their labor supply to firms. The firms need labor in order to produce goods. Firms operate in the agricultural or the differentiated goods sector. The agricultural sector is modeled as a competitive sector (Ardeni and Freebairn 2002). The differentiated goods sector is an environment with monopolistic competition in the spirit of Dixit and Stiglitz (1977). Exogenous shocks affect the firms’ productivity in both sectors, where productivity is defined as the number of goods that a firm can produce per unit of labor. In the agricultural sector, these shocks can be best thought of as changes in weather and other natural hazards. For the differentiated goods sector, which also captures the services industry, the shocks mostly represent changes in productivity due to innovations.

The macro literature has focused almost exclusively on the Dixit and Stiglitz (1977) specification for the differentiated goods demand, see, e.g., Walsh (2017). The familiar Dixit and Stiglitz (1977) specification with endogenous labor supply derives from the following utility function for households

$$ U=Z^{1-\theta }\left[ \frac{1}{n}\sum\limits_{i=1}^{n}Q_{i}^{\rho }\right]^{\theta /\rho }-\frac{1}{1+\delta }L^{1+\delta }, $$
(2)

where Z is the consumed quantity of the competitive good, the Qis are the consumed quantities of the differentiated goods and L is quantity of labor. Essentially, all goods are perishable in this one-period model.Footnote 8 The parameter ρ, which is constrained to ρ ∈ (0,1), determines the “love of variety” of households, which is the extent to which households value diversity of the products they purchase from firms in the differentiated goods sector. Values of ρ closer to zero are associated with a stronger love of variety. Parameter 𝜃 ∈ (0,1) determines the importance of the other goods and services relative to the agricultural produce in the consumer’s consumption bundle. The higher the level of 𝜃, the smaller the share of income the consumer is willing to spend on agricultural goods. The parameter δ is the inverse of the Frisch (1959) elasticity of labor supply, δ ∈ (0,1). The higher the level of δ, the less responsive labor supply will be to changes in wages.

Households maximize their utility subject to their budget constraint. The budget constraint of the households reads

$$ wL+{\Pi} (Q)=qZ+\frac{1}{n}\sum\limits_{i=1}^{n}p_{i}Q_{i}, $$
(3)

where w is the wage rate, where q and pi are the goods prices, and where π(Q) are the profits of firms in the differentiated goods sector which are distributed as dividends to the households.Footnote 9

The production of firms in both sectors depends on their productivity and labor inputs. For their production functions we assume classical Ricardian technologies (e.g., Turnovsky 1974), where

$$ Z=BN, $$
(4)

and

$$ Q_{i}=AN_{i}. $$
(5)

The N and Ni are the labor inputs. The A and B are the respective productivity coefficients that determine how many goods the firms in both sectors can produce per unit of labor. Both A and B are random variables. Shocks to A and B are productivity shocks in the sense that changes in A and B translate into identical changes in the amount of goods a firm can produce per unit of labor. In the case of variable A, which measures productivity in the differentiated goods sector, such shocks are the in macroeconomics familiar supply side total factor productivity shocks. The shocks to variable B in the agricultural sector capture the shocks to agricultural productivity that are inherent to nature.

Firms in the economy maximize profits. The market for the agricultural product is assumed to be perfectly competitive, and, hence, the aggregate profit of all firms in the agricultural sector,

$$ {\Pi}_{\theta}(Z)=qZ-wN, $$
(6)

equals zero in equilibrium. The producers of the differentiated products aim to maximize their profit functions

$$ {\Pi}_{i}(Q_{i})=p_{i}Q_{i}-wN_{i}. $$
(7)

They do so by exploiting their pricing power while ignoring their pricing effect on consumer income wL + π(Q) and on the price index of the differentiated goods,

$$ P=\left( \frac{1}{n}\sum\limits_{i=1}^{n}p_{i}^{\rho /\left( \rho -1\right) }\right)^{\frac{\rho -1}{\rho }}. $$
(8)

Finally, to determine the general price level we assume a simple quantity-type relationship (Fisher 1911),

$$ M=wL, $$
(9)

where M is defined as the total quantity of money.

2.1 Equilibrium price distribution

With the above preparations, we can now obtain the implications for the equilibrium prices.

Proposition 1

Equilibrium prices result when consumers maximize utility (2) and firms in both sectors maximize profits. The equilibrium prices of the differentiated goods are,

$$ p_{i}=M\frac{1/\rho^{\theta /\delta +1}}{A\left( \theta^{\theta }\left( 1-\theta \right)^{1-\theta }A^{\theta }B^{1-\theta }\right)^{1/\delta }}\text{ for all }i. $$
(10)

For the agricultural good the equilibrium price is

$$ q=M\frac{1/\rho^{\theta /\delta }}{B\left( \theta^{\theta }\left( 1-\theta \right)^{1-\theta }A^{\theta }B^{1-\theta }\right)^{1/\delta }}. $$
(11)

Proof

See Appendix A. □

Most macro models consider shocks to M, A and B. Let us focus on the agricultural productivity shocks due to randomness in B.Footnote 10 Assuming M and A to be constant, we can write the price of the agricultural good as

$$ q(B)={\Theta} B^{-\frac{1+\delta -\theta }{\delta }}, $$
(12)

where

$${\Theta} =M\frac{1/\rho^{\theta /\delta }}{\left( \theta^{\theta }\left( 1-\theta \right)^{1-\theta }A^{\theta }\right)^{1/\delta }}. $$

Note that the power in Eq. 12 is negative. From Eqs. 10 and 11 it transpires that a positive productivity shock to A or B lowers the equilibrium prices pi and q. This is because when more produce comes to the market, this extra supply can only be absorbed if prices are lowered. This reflects what happens in both well-known real business cycle models and the now standard New Keynesian Model that is the backbone to most current day macro models.Footnote 11

For illustrative purposes, consider the situation where B follows a beta distribution (we relax this assumption later):

$$ \Pr \left\{ B<t\right\} =t^{\upbeta } $$
(13)

on [0,1] and β > 0. The likelihood of extremely adverse productivity shocks under this distribution is lower for higher values of β. Under the assumption of the beta distribution, we can derive the price distribution of the agricultural product as follows. Denote the randomness in q by \(\tilde {q}\). Then

$$ \begin{array}{@{}rcl@{}} \Pr \left\{ \tilde{q}>u\right\} & =\Pr \left\{ {\Theta} B^{-\frac{1+\delta -\theta }{\delta }}>u\right\} \\ & =\Pr \left\{ B<{\Theta}^{\frac{\delta }{1+\delta -\theta }}u^{-\frac{\delta }{1+\delta -\theta }}\right\} \\ & ={\Theta}^{\frac{\upbeta \delta }{1+\delta -\theta }}u^{-\frac{\upbeta \delta }{ 1+\delta -\theta }}, \end{array} $$
(14)

with support on \([{\Theta } ,\infty )\). The distribution of equilibrium prices in Eq. 14 has the same functional form as the heavy-tailed distribution in Eq. 1, where αδ/(1 + δ𝜃).

We proceed by relaxing the assumption of the beta distribution. More specific, we derive a general condition for the density function of the productivity shocks fB(t). Any density function of the productivity shocks that satisfies this condition will imply a equilibrium price distribution that is heavy-tailed. Let \({\mathscr{L}}(u)\) denote a slowly varying function, in the sense that

$$ \lim_{u \rightarrow \infty} \frac{\mathcal{L}(tu)}{\mathcal{L}(u)}=1 \text{ \ for any \ } t>0.$$

Then the following proposition provides a general condition that results in the heavy-tailedness of the commodity price distributions. The essential requirement is that zero is in the support of B.

Proposition 2

Suppose that the distribution and density of the agricultural productivity coefficient B are continuous. Given the price-productivity relationship for agricultural produce in Eq. 12, we have that

$$ \Pr (\tilde{q}>u)\sim \mathcal{L} (u)u^{-\alpha }\text{ as }u\rightarrow \infty , $$
(15)

with

$$ \alpha =\xi \frac{\delta }{1+\delta -\theta } , $$
(16)

if

$$ \lim\limits_{s\downarrow 0 }\frac{wf_{B}(sw)}{f_{B}(s)} =w^{\xi }\text{ with }\xi \in \mathbb{R}^{+}. $$
(17)

Proof

See Appendix B. □

Whether the condition on the distribution of B in Eq. 17 is satisfied depends on the shape of the density of the productivity shocks, fB(B), for values close to zero. The reason is that high price levels occur in periods of low agricultural productivity, such as severe droughts. The tail distribution for those low levels of productivity determine the shape of the distribution function for extremely high prices.

Corollary 1

The shape of the distribution of the agricultural productivity coefficient B close to zero determines whether the equilibrium price distribution is heavy-tailed. The more slowly the density of B converges to zero for extremely low productivity levels, i.e., the lower ξ, the heavier is the tail of the agricultural price distribution, i.e., the lower is α.

The condition in Proposition 2 holds true with different values for ξ for a broad range of distribution functions with positive support. For the beta distribution, which we considered for illustrative purposes, the condition in Eq. 17 is satisfied with ξ =β. The condition also holds true for distribution functions that follow very different functional forms. For instance, the standard uniform distribution and standard exponential distribution satisfy the condition in Eq. 17 with ξ = 1. More general, the distribution function \(\Pr (B<u)=1-\exp (-\lambda x^{\upbeta })\) with λ,β> 0 satisfies the condition with ξ =β. Other popular distribution functions that satisfy the condition in Eq. 17 are the Chi-squared distribution with l degrees of freedom, which satisfies the condition with ξ = l/2; the Gamma distribution with shape parameter l, which satisfies the condition with ξ = l; and the (heavy-tailed) Burr (Type XII) distribution with parameters (c,l), which satisfies the condition with ξ = c. All the aforementioned distributions would result in a heavy-tailed equilibrium price distribution.

The condition also shows that not every possible distribution does lead to heavy-tailed prices in our macro-economic framework. An example of a popular exception is the lognormal distribution: Its limit in Eq. 17 converges to 0.Footnote 12 Seriously low levels of agricultural productivity are too rare under the lognormal distribution to generate a heavy tail among the occurrences of high agricultural prices in the model.

As follows from Proposition 2, the shape parameter of the tail of the distribution of agricultural prices not only depends on the distribution of productivity shocks, but also on the preference parameters 𝜃 and δ.

Corollary 2

The greater the share of agricultural produce in consumption, i.e., the higher 1 − 𝜃, and the higher the elasticity of labor supply, i.e., the higher 1/δ, the heavier is the tail of the distribution of agricultural prices, i.e., the lower is α.

Given the distribution function of productivity shocks, it follows from Eq. 16 that a high value of 1 − 𝜃 results in a low shape parameter of the equilibrium price distribution of agricultural goods, α, and hence in a fatter tail. This finding has the following intuition. The importance of the share of the agricultural good in the consumption bundle of the agents is represented by 1 − 𝜃, see the utility function in Eq. 2. The larger the role of the agricultural good for the agents’ utility, the more extreme price reactions one may expect if supply falls. This is reflected in a fatter tail of the equilibrium price distribution, i.e., a lower α.

It also follows from Eq. 16 that a high value of parameter δ results in a high value of shape parameter α. Adverse productivity shocks (low realizations of B) have a dual effect on the output of the competitive sector. First, given the amount of labor used, an adverse technology shock in the competitive sector directly reduces output, since Z = BN. Second, low productivity decreases the equilibrium amount of labor used in the competitive sector, which further reduces output. In particular, the equilibrium labor supply L = cB(1−𝜃)/δ, where c > 0 is a constant (see Eq. 31 in Appendix A). Therefore, in equilibrium, the change in the production of the competitive good in response to a shock to B is smaller for higher values of parameter δ (lower elasticity of the labor supply), which results in thinner tails of the equilibrium price distribution, i.e., a higher α.

The property of the model that allows the equilibrium price distribution to be heavy-tailed, even if the productivity shocks are not heavy-tailed themselves, is that the equilibrium prices respond in a nonlinear fashion to shortages due to adverse productivity shocks. More precisely, the price of the agricultural product in our model is inversely proportional to the productivity. So, increasingly larger shortages of the agricultural product have an increasingly larger impact on its price in absolute terms. The property that product prices are inversely proportional to productivity is not unique to the environment we study above, and, in fact, a property that is very common to other economic models. Heavy-tailed price distributions can be obtained in those other models in a very similar manner as in the stylized model above. For various reasons, however, price distributions may not be heavy-tailed in a macroeconomic model. For example, one reason in our model would be that productivity shocks are bounded away from zero. In some macroeconomic models, prices are a linear though negative function of productivity, in which case prices would only be heavy-tailed if productivity shocks are heavy-tailed distributed. Finally, the relationship between α and ξ in Eq. 16 suggests a way to calibrate the value for the tail index ξ of agricultural productivity.Footnote 13

3 Empirical methodology

The previous section discussed the plausibility of fitting a power law to the right tail distribution of changes in food prices. Section 3.1 discusses how we test whether the right tail of the distribution of agricultural commodity returns follows a power law. Section 3.2 explains how we fit a power law and estimate risk measures under the assumption that returns are heavy-tailed, while Section 3.3 discusses how we estimate the risk without assuming heavy tails.

Next, we apply EVT to determine the parameters of the power law.

3.1 Testing the power law

We start by testing whether the right tail of the distribution of agricultural commodity returns follows the power law in Eq. 1 against thin-tailed alternatives. We do so by using the generalized extreme value distribution. This distribution describes the behaviour of the largest observation in a sufficiently large number of randomly observed draws. Let \(\tilde {y}_{m}\) denote the maximum of m i.i.d. observations of \(\tilde {x}\). The distribution of \(\tilde {x}\) is said to be in the domain of attraction of the generalized extreme value distribution, if, for some normalizing sequences of constants \(\mu _{m}\in \mathbb {R}\) and σm > 0, and \(\gamma \in \mathbb {R}\), the limit probability

$$ \lim_{m \rightarrow \infty} \Pr \left( \frac{\tilde{y}_{m}-\mu_{m}}{\sigma_{m}}\geq s \right)=G_{\gamma}(s), $$
(18)

where

$$ G_{\gamma}(s)=\exp (- (1+\gamma s)^{-1/ \gamma}). $$
(19)

One distinguishes three types of extreme value distributions depending on whether γ < 0, γ > 0 or γ = 0 (where (1 + γs)− 1/γ is read as es for γ = 0). If γ < 0, then the maximum possible value of \(\tilde {y}_{m}\) is finite.

By estimating parameter γ in Eq. 19, we can test whether commodity price returns are heavy-tailed (i.e., γ > 0) versus thin-tailed and other alternatives (i.e., γ ≤ 0). The density of the generalized extreme value distribution resembles a power law when γ > 0

$$ g_{\gamma}(s)= \begin{cases} \left( 1+\gamma s\right)^{-1/\gamma - 1} G_{\gamma}(s) &\text{ if } \gamma \neq 0, \\ e^{-s} G_{\gamma}(s) & \text{ if } \gamma = 0. \\ \end{cases} $$
(20)

Balkema and De Haan (1974) establish the one-to-one relationship between the parameters in the generalized extreme value distribution and the heavy-tailedness of a distribution. All distributions for which the distribution of exceedances beyond a sufficiently large threshold converges weakly in the limit to the power law in Eq. 1 with shape parameter α > 0, must be in the domain of attraction of the generalized extreme value distribution with γ = 1/α, and vice versa (Balkema and De Haan 1974, Theorem 4). By contrast, all distributions for which the tail converges to a thin-tailed exponential distribution must be in the domain of attraction of the generalized extreme value distribution with γ = 0 (Balkema and De Haan 1974, Theorem 3).

The literature has developed estimators for γ that allow for testing whether γ > 0. Since the theoretical model is predicated on the heavy-tailedness of the upper tail of the commodity price return distribution, we first test whether the data do not reject this presumption. To this end, we first use three estimators that allow for γ to fall into all three possible classes. Once we have confidence that γ > 0, we employ the Hill estimator which only applies if γ > 0, but is more efficient. The general estimators can be somewhat ambiguous depending on how many observations are taken into account. We will therefore test for the heavy-tailedness using three different estimators: the generalized Hill estimator of Beirlant et al. (1996) (\(\hat {\gamma }_{BVT}\)), the moment estimator of Dekkers et al. (1989) (\(\hat {\gamma }_{DEH}\)) and the mixed moment estimator of Fraga Alves et al. (2009) (\(\hat {\gamma }_{FGHN}\)).

3.2 Estimating risk with heavy tails

After establishing the heavy-tailedness of the right tail of agricultural commodity returns, we proceed with estimating the parameters in the power law. Let a series with n discrete returns be ordered from high to low: \({X}_{1}\geq {\dots } \geq {X}_{n}\). Let k denote the number of returns considered in the tail of the distribution. Then, following Hill (1975), one can estimate the shape and the scale parameter in Eq. 1 as

$$ \frac{1}{\hat{\alpha}}=\hat\gamma_{H}=\frac{1}{k}\sum\limits_{j=1}^{k}\ln \frac{X_{j}}{X_{k+1}} $$
(21)

and

$$ \hat{C}=\frac{k}{n}X_{k+1}^{\hat{\alpha}}. $$
(22)

One can easily convert the estimated parameters into economically meaningful risk measures, such as Value-at-Risk (VaR). The VaR is defined as the loss level that is exceeded with probability p, that is, for return Rt, \(VaR(p)=-\sup \{c:\Pr (R_{t}\leq c) \leq p) \}\). Using the Hill-estimators in Eqs. 2122 in the Pareto distribution in Eq. 1 gives the VaR-estimator for the right tail as

$$ \widehat{VaR}_{H}(p)=X_{k+1} \left( \frac{k}{np} \right)^{1/\hat{\alpha}} . $$
(23)

The VaR plays an important role in the safety-first framework developed by Roy (1952) and Telser (1955). Agents with the safety-first principle of Telser (1955) in their utility functions maximize their expected return, while limiting the probability that a loss larger than some disaster level occurs at some admissible level p.

3.3 Estimating risk without assuming heavy tails

The theory in Section 2 suggests why the right tail of the distribution for commodity prices follows a power law. Different from the right tail of the distribution, the left tail of the distribution has a theoretical endpoint, provided that the price of an agricultural commodity cannot fall below zero. One could therefore object to using a power law, which has an unbounded support, for modelling the tail risk in the left tail. Since the tail risk in the left tail is important for several market participants (e.g., farmers, exporters), we therefore consider two different methods to estimate the downside tail risk.

Our first method is to simply estimate the downside tail risk as if the left tail follows a power law. The basis for following this approach is a Taylor approximation, by a power function; which may work well at medium to higher loss levels. Theoretically, one may expect the risk estimates under the assumption of a power law to be increasingly upward biased in situations where changes in prices get closer to the maximum possible price decline. However, the method could still be useful from a pragmatic point of view as long as it is supported by empirical evidence that the method provides accurate risk estimates.

Our second method is to estimate the VaR under a more general distribution that allows tails with finite endpoints. In particular, we estimate parameter γ in Eq. 19 with an estimator that allows for all three cases γ > 0, γ = 0 and γ < 0. Let \(\hat {\gamma }_{G}\) denote a consistent general estimator of γ. Then the estimate \(\hat \gamma _{G}\) can be used in the general quantile estimator developed by Dekkers et al. (1989) to estimate the VaR as

$$ \widehat{VaR}_{G}(p)=\left[ X_{k+1}\left( \frac{k}{np} \right)^{\hat{\gamma}_{G}}-X_{k+1}\right]\times \frac{\hat{\gamma}_{H}}{\hat{\gamma}_{G} \min\{1,1/\lvert 1-\hat{\gamma}_{G} \rvert \} }+X_{k+1}. $$
(24)

Note that this general estimator for the VaR condenses to the specific VaR estimator that assumes heavy tails in Eq. 23 whenever \(\hat {\gamma }_{G}=\hat {\gamma }_{H}\). In our application using the general quantile estimator in Eq. 24, we report VaR estimates based on the general estimator of Fraga Alves et al. (2009), but the estimators of Dekkers et al. (1989) and Beirlant et al. (1996) generate qualitatively similar results.

We compare the performance of the VaR estimator assuming heavy tails and the general VaR estimator in the back-testing procedure in Section 8.

3.4 Other estimation details

We obtain standard errors of the shape parameters and risk measures from a bootstrap procedure. The asymptotic normality of the Hill estimator also holds in the presence of serial dependence; see, e.g., Drees (2008). Following Hartmann et al. (2006), we refrain from assumptions on the specific dependence structure and apply a bootstrap procedure with fixed block length and 10,000 replications. Following Hall et al. (1995), we set the optimal block length equal to n1/3. We aim to choose a threshold parameter k that balances the bias and variance of the estimator as usual (Drees et al. 2000).

4 Data

For our purposes, we rely on two different categories of price series. The first series concerns monthly prices of agricultural commodities in the spot market going back to 1928. The second series concerns weekly returns on contracts traded on the futures market from 1979 onward. Our estimates for weekly returns on futures contracts are in particular interesting from a risk management perspective. The lower frequency of the series with spot prices is better aligned with the macroeconomic model in which power law spikes observed in agricultural commodity prices may arise endogenously in the economy. For each price series, we calculate discrete returns from the observed prices as Rt = (PtPt− 1)/Pt− 1.

The monthly spot prices from 1928 to 2014 are obtained from Global Financial Data. These series are likely to be impacted by price controls, resulting in a lower level of volatility during the World War II period. Price controls were effective since May 1942, and, although most were removed in 1946, they may have had some impact on prices until 1948Q2; see Evans (1982). To avoid these price controls from impacting our estimates, we removed all observations from 1942M1 until 1948M6 from our dataset. To account for potential illiquidity in spot markets impacting our estimates, we removed from each spot price series the observations in months without price changes and those in months following a month without price change. This resulted in the removal of on average 47 observations in each series.

Futures contracts for delivery at a particular date are usually traded for a relatively short period, ranging from several months to several years. To obtain long-term futures returns series, or so-called continuous series, we combine consecutive data from several futures contracts, see, e.g., De Roon et al. (2000). We take considerable effort to construct high-quality continuous futures return series. Our procedure is as follows. First, we download daily open interest and price series of all available futures contracts from Thomson Reuters Datastream for each commodity. Those time series are available over a period of 34 years: from January 1979 until December 2012. Subsequently, daily returns are calculated for all futures price series. Finally, we construct the continuous futures returns series from the individual return series. In January 1979 we start with the futures contract that has the largest open interest. For each day we include its returns in the new continuous series until six weeks before the contract’s last trading day. At this date we switch to the futures contract with the largest open interest and a later last trading day. Again we include the returns until six weeks before the last trading day and repeat the last step. This procedure results for each commodity in a daily continuous futures returns series, which is then converted into a weekly returns series of 1,773 observations from, on average, 164 different futures contracts.

Our method has an important advantage compared to Datastream’s procedure to construct continuous futures series. By calculating returns prior to constructing the continuous series, no returns are calculated over price observations from two different futures series. Therefore, our series represents the return that investors could achieve by rolling over futures contracts as opposed to the continuous Datastream series which includes price jumps due to changes in the underlying futures series. The extreme returns in our series thus represent genuine financial risks to market participants.Footnote 14

From all agricultural commodities traded in the United States, the following commodities are investigated: Corn, cotton, oats, soybeans, wheat, live cattle, orange juice (futures market) or oranges (spot market), and sugar.Footnote 15

5 Testing for heavy-tailedness

Table 1 reports the descriptive statistics of the monthly returns in the spot market and the weekly returns on futures contracts. A quick overview of the data confirms the non-normality of the returns. Six out of eight series with monthly returns in the spot market contain at least one observation with a distance of more than 6 standard deviations from the mean, an observation that would occur roughly once every 42 million years under the assumption of a normal distribution.Footnote 16 For the weekly returns, six out of eight series contain at least one observation with a distance of more than five standard deviations from the mean. The probability of such a return occurring under the assumption of the normal distribution is about once every 33,500 years. Figure 2 reports QQ-plots of three arbitrarily chosen weekly return series (corn, cotton and live cattle) as an illustration. The non-normality is strongly confirmed by QQ-plots of the return series against the normal distribution. Only the right tail of the return series for live cattle seems to be quite close to the normal distribution.

Table 1 Descriptive Statistics
Fig. 2
figure 2

QQ-plots of agricultural commodity returns. Note: QQ-plots of the weekly returns of futures contracts for corn, cotton and live cattle against a normal distribution with the same mean and standard deviation

We proceed with a formal test of whether the upper tail distribution is heavy-tailed by estimating parameter γ in Eq. 19 for each of the returns series. The theory shows how heavy-tailedness can arise in the right tail of agricultural commodity returns as a consequence of price spikes from adverse productivity shocks that are not heavy-tailed themselves. Values of \(\hat {\gamma }\) larger than 0 suggest that the tail of a distribution is heavy-tailed. Table 2, panels (a) and (b) report the estimates for the upper tail of the monthly returns in the spot market and the weekly returns in the futures market, respectively. Heavy-tailedness is rejected at a 5 percent significance level for t-statistics smaller than − 1.6. Rejection of the thin-tailed distribution at a five percent significance level requires a t-statistic larger than 1.6.

Table 2 Testing for Heavy-tailedness

Table 2, panel (a) shows that all estimates for γ, except one, are positive for the upper tail distribution of monthly returns in the spot market. This suggests that the right tails of these distributions are heavy-tailed. The exception to the rule is live cattle, which did also seem relatively close to the thin-tailed alternative in the QQ-plot. It is possible to reject the thin-tailed alternative for 5 out of 8 series based on the estimators of Beirlant et al. (1996) and Fraga Alves et al. (2009), with the strongest test results for corn, cotton, oats and sugar. The Dekkers et al. (1989) estimator, which exhibits somewhat larger standard errors, also rejects the thin-tailed alternative for 4 out of 8 series. None of the estimates suggest the rejection of heavy-tailedness for the upper tail of monthly returns in the spot market. The pattern of heavy-tailedness is very similar when turning to the right tail distribution of weekly futures returns reported in Table 2, panel (b), although the results for weekly futures returns are somewhat less strong than for the monthly returns in the spot market. The thin-tailed alternative can be rejected at a 5 percent significance level for 2 out of 8 futures returns series based on the Dekkers et al. (1989) estimator, and for 3 out of 8 futures returns series based on the Beirlant et al. (1996) and Fraga Alves et al. (2009) estimators. The point estimates for live cattle and wheat are slightly negative. None of the estimates suggest the rejection of heavy-tailedness for the upper tail of weekly future returns.

6 Empirical tail estimates

Table 3 documents the estimated tail parameters under the assumption of heavy-tailed distributions. Unreported Hill plots show that the Hill estimates are relatively stable when a total of k = 50 tail observations are used for the monthly returns series and a total of k = 75 tail observations for the weekly returns series. This corresponds to 5.4% of all monthly observations and 4.2% of all weekly observations.

Table 3 Tail parameter estimates

Table 3, panel (a) documents the estimated tail parameters using the Hill estimator for the monthly returns in the spot market. The value of most shape parameters is estimated to be in the range from 2 to 3. The most risky commodities with respect to the shape parameter describing price spikes are oranges, with an estimated tail index of around 1.95, and sugar, with an estimated tail index of around 2.36. The table also reports the Hill estimator for the left tail of the distribution. These estimates can be used to approximate the distribution using a Taylor approximation in the form of a power law function even though it is known that the returns in the left tail are bounded.

Table 3, panel (b) documents the estimated tail parameters for the weekly futures returns. Theoretically, the estimates of the shape parameters should be robust for changes in the data frequency in case of independent and identically distributed returns.Footnote 17 Nevertheless, time series characteristics such as volatility clustering could result in differences in tail behavior at different data frequencies. Moreover, there are differences between the futures market and the spot market for commodities, where the latter is, in general, characterized by lower liquidity and higher trading costs in terms of organizing the logistics of storage and transportation. The tail parameter estimates for the weekly returns in the futures market are overall somewhat higher than those of the monthly returns in the spot market with estimates that are more often in the range from 3 to 4. Hence, our estimates suggest the presence of a higher risk of extreme price spikes in the spot market than is documented for the futures market. Finally, live cattle is the commodity with the highest estimates for the shape parameters, implying less heavy tails, which is consistent with our previous results in Table 2.

While the shape parameter α is relatively similar across commodities within each market, there is a larger variation in the estimates of the scale parameter C. The variation in the scale at least partly explains the variation in the VaR estimates within each market; see Tables 4 and 5 discussed in the next section. Suppose that the shape parameter α is identical across commodities within a particular market. Then, assuming independence of returns across commodities, an optimal portfolio could be constructed on the basis of the mean and the scale (Hyung and De Vries 2005). Here, the scale substitutes for the variance as in more traditional mean-variance portfolio optimization.Footnote 18

Table 4 Risk estimates for monthly returns in the spot market, 1928-2014

7 Risk estimates

This section discusses the empirical results of the risk estimates. Table 4 reports risk estimates for monthly returns in the spot market. The risk estimates for the right tail are based on the estimator in Eq. 23, which assumes heavy tails in line with the theory and results shown above. The risk estimates for the left tails are based on the generalized estimator in Eq. 24. As an illustration take the VaR levels for increases in the price of oranges of 84.6% and 172.6% at a 1.00% and 0.25% probability level, respectively.Footnote 19 The first estimate means that once every 100 months (around 8 years), one may expect the spot price of oranges to increase by more than 80% within a single month. The price is likely to more than double once every 400 months, or about once every 33 years. The risk estimates also reflect the pattern that the largest price spikes tend to exceed the largest price declines in the data (Table 1), with larger risk estimates for the right tail than the left tail.

Table 5 Risk estimates for weekly returns in the futures market, 1979-2012

The highly volatile behaviour of the agricultural commodity prices is again evidenced in the table with the risk estimates for weekly futures returns (Table 5). In the same vein, the table reports the 0.25% and 0.05% VaR for the weekly futures returns.Footnote 20 The 0.25% VaR is expected to be exceeded about once every eight years, and the 0.05% VaR about once every 40 years, or about once during the career of an investment professional.

The relevant risk management information for an investor with a long position is contained in the left tail of the distribution, shown in Table 5, panel (a). We find that sugar has the highest risk estimates for price declines in the futures market of all commodities studied. Once every forty years the price of sugar is expected to fall by more than 17.6% within a week. The safest commodity, in terms of price development, appears to be live cattle. Once every forty years the price of live cattle is expected to fall by more than 9.9% within a week. Orange juice tends to have the highest upside price risk in the futures market, with a price spike that exceeds 34.3% within a week once every forty year.

In the introduction, a question was posed regarding the likelihood of a price change occurring in excess of an market participant’s tolerance level for price increases of 25% (e.g., a food processor or importing country). We apply Eq. 1 using the estimated parameters \(\hat {C}\) and \(\hat {\alpha }\) for the right tail of the weekly return distribution for wheat in Table 3. This gives the probability of exceeding the threshold as \(\Pr (\tilde {x}>u)\approx 14.63\times 25^{-3.20}\approx 4.84\times 10^{-4}\). The inverse of this number yields the number of weeks in which at least one price spike of more than 25% is expected to occur. The outcome is around 2,065 weeks. Hence, with 52 weeks per year, we expect to see such a large price jump for wheat once every 40 years.

8 Back-testing

To examine the accuracy of the V aR(p) estimates, we employ an out-of-sample back-testing procedure. In this method, the V aR(p) estimates based on historical price changes are compared to the realized price changes. Thus, first V aRtn+m+ 1 (p) is estimated using a horizon of m past returns: {Rtn,...,Rtn+m}. If the realized return Rtn+m+ 1 exceeds the estimated V aRtn+m+ 1(p), then a “VaR-exceedance” is registered. The above procedure is repeated with a rolling window at time tn + m + 2 et cetera. For a series containing n returns, the procedure yields a total of nm observations of whether a VaR-exceedance occurred. According to the VaR definition, if the methodology provides accurate predictions of V aR(p), then the proportion of VaR-exceedances should have a value close to p. Thus, \({\sum }_{j=m+1}^{n}\mathbf {1}(R_{t-n+j}>VaR_{t-n+j}(p))/(n-m)=p\) should match approximately for accurate VaR estimates, where 1(⋅) denotes the indicator function. An unconditional coverage test is used to test whether the number of observed VaR-exceedances deviates statistically significantly from the expected number (Kupiec 1995).

The procedure is implemented as follows. The back-testing procedure is executed under the assumption of the normal distribution, the generalized EVT distribution and the power law. For the monthly returns in the spot market, the length of the rolling window for the estimation is set at m = 360 observations (30 years) and each tail is assumed to contain the most extreme 8.3% (1/12th) of the observations in the estimation horizon. For the weekly returns in the futures market, the length of the rolling window is set at m = 520 weekly observations (10 years) and each tail is assumed to contain the most extreme 5.0% of the observations in the estimation horizon. A visual representation of the procedure’s results is given in Fig. 3. The spikes show weekly wheat returns. The bold lines show the 0.25% VaR estimates from the normal distribution (in gray), the generalized EVT distribution (in blue) and the power law (in red). The first period without the VaR bands reflects the window needed for estimation.

Fig. 3
figure 3

Backtesting agricultural prices Value-at-Risk Note: The spikes show the weekly wheat returns. The lines report the VaR estimated using a rolling window of 520 weeks. The estimates are based on the normal distribution (in gray), generalized EVT distribution using the estimator of Fraga Alves et al. (2009) (in blue) and a power law tail distribution using the estimator of Hill (1975) (in red). The VaR estimate from the normal distribution and generalized EVT distribution are exceeded at a frequency higher than once every eight years

The results of the back-testing procedure are reported in Tables 6 and 7 for monthly and weekly data, respectively. The columns report the number of VaR-exceedances at three different VaR probabilities for each commodity and each distribution. The line labelled “average” reports the average number of observed VaR-exceedances in each column. The bottom line indicates the expected number of VaR-exceedances for accurate VaR estimates. A risk assessment methodology that underestimates the risk would lead to an average of observed VaR-exceedances that is higher than the expected number, while risk estimates that are too conservative would lead to a number of VaR-exceedances that is too low. Finally, the bottom line reports the number of unconditional coverage tests that are rejected at a 5 percent significance level for each column.

Table 6 Back-testing Monthly Returns in the Spot Market, 1928-2014
Table 7 Back-testing Weekly Returns in the Futures Market, 1979-2012

Back-testing risk estimates for the right tail of the monthly returns in the spot market for agricultural commodities exemplify the benefits of assuming a power law tail. Table 6, panel (b) shows that assuming a normal distribution leads to far too many VaR-exceedances in the right tail. The predictions from the normal distribution increasingly underestimate the risk when one moves further into the tail. The third column in the table shows that assuming a normal distribution leads to a number of 0.25% VaR-exceedances that is more than 5 times too high. Moving to the generalized EVT distribution partly alleviates this problem, since the actual number of VaR-exceedances is closer to the expected number of VaR-exceedances, but still exceeds the expected number of VaR-exceedances by more than a factor two for the probability levels 0.50% and 0.25%. In contrast, assuming heavy-tailedness leads to a number of VaR-exceedances that is close to the expected number. For the probability levels of 0.50% and 0.25%, the average number of exceedances is, respectively, 3.2 and 2.0, which is close to their expected number of 3.3 and 1.7, respectively. These estimates concern the most extreme events, or about one observation in 16.7 and 33.3 year, respectively. The generally better performance of assuming a heavy-tailed distribution is also reflected in the lower number of rejected unconditional coverage tests in the bottom line of the table, in particular for the smaller probability levels.

As discussed, one may expect for the left tail that the heavy-tailed approximation leads to risk estimates that are somewhat too conservative. This is indeed confirmed in Table 6, panel (a), where the heavy-tailed approximation leads to fewer VaR-exceedances than expected. For the probability levels of 0.50% and 0.25%, the average number of exceedances is, respectively, 1.8 and 0.4, which is less than the expected number of 3.3 and 1.7, respectively. However, depending on the context, it may still be better to use the power law approximation than the generalized EVT distribution, which turns out to lead to more VaR-exceedances than expected, and to a rejection of the unconditional coverage test for some of the series.

Assuming heavy-tailedness also works well when modeling the risk for weekly returns in the spot market for observations both far in the right and far in the left tail (Table 7). The average number of observed exceedances in the left and right tail at the 0.25% probability is, respectively, 3.5 and 2.6, which is close to the expected number of 3.1. These estimates concern events that are expected to occur only once every 8 years. The accuracy of risk estimates for weekly returns based on the power law is also substantially better when compared to the normal and the generalized EVT distribution, which both tend to underestimate the risk and lead to too many VaR-exceedances. The results for the left tail provide support for using a Taylor approximation in the form of a power law function to estimate extreme price risks at medium to higher loss levels in the context of weekly futures returns, even in situations where the theoretical return distribution is bounded.

9 Conclusion

A proper understanding of extreme commodity returns is instrumental to any market participant in agricultural commodity markets. Especially knowledge regarding the largest price swings is important for risk management purposes. We construct a two-sector general equilibrium model which describes how productivity shocks affect agricultural commodity prices. In our model, extreme price spikes arise as a result of productivity shocks in the agricultural sector, which are translated into upward price changes in a nonlinear manner, resulting in a heavy-tailed equilibrium return distribution in the commodity market. Our study shows how this mechanism may result in heavy-tailed price distributions, even if the productivity shocks are not heavy-tailed themselves.

We provide further empirical evidence on the heavy-tailedness of agricultural commodity price returns in both the futures market and the spot market. Extreme Value Theory is used to estimate the parameters of the power law in the tail of their distribution. These estimates are used to measure an agricultural commodity investor’s exposure to extreme price risks. We calculate Value-at-Risk measures to provide estimates of the likelihood and size of the largest losses an investor may encounter. Back-testing shows that this methodology provides relatively accurate estimates of risk measures for agricultural commodities when compared to estimates based on the normal distribution or the generalized EVT distribution, in particular for the right tail.

Our results have two implications. First, we document that market participants in this asset class are likely to encounter large future price swings. Our model supports the prevalence of fat-tailed commodity price distributions and thus a highly volatile return behaviour. Secondly, the risk measurement framework in this paper informs market participants on a well-suited method to quantitatively assess the largest price risks, with our empirical results providing essential knowledge on when the methodology performs particularly well.

An interesting observation is that the price spikes of live cattle tend to be less heavy-tailed (if at all) than the price spikes of the crop commodities. One potential explanation that is consistent with our theoretical framework is that the production of crop commodities in our dataset are strongly affected by adverse productivity shocks arising from short-term weather conditions, while the supply of live cattle is less impacted by such adverse productivity shocks. Arguably, a single commodity is insufficient to validate our framework, but exploring the factors that can explain the differences in heavy-tailedness among a broader set of commodities is an interesting direction for future research.