1 Introduction

Started at the beginning of the 1990s, the liberalization process of the electricity sector engaged several countries worldwide with the aim of transforming existing monopolistic markets into competitive markets. Such competitive markets were properly designed for allowing trades of electricity as a new commodity and were organized to discover equilibrium prices through demand and supply balancing. Electricity has, in fact, very peculiar characteristics [13]. It cannot be stored in an economically convenient way. Its transmission requires a constant balancing between injections to and withdrawals from the power grid. Moreover, electricity has a highly inelastic demand curve, strongly dependent on weather conditions (temperature, wind speed, precipitation, etc.). Supply is, in general, provided by low marginal costs generators but, in many cases, the mismatch between supply and demand, as for instance peaks in electricity demand, can be satisfied at very high costs [13, 50]. Given all this, it is not hard to understand how the liberalized marked interaction between demand and supply has dramatically increased the short-term volatility of power prices: shortages in electricity generation due to forced outages and/or grid congestions, peaks in electricity demand, fluctuations in hydroelectricity production, may result in unanticipated jumps in power prices and spikes of very high amplitude. These peculiarities have led to very erratic price dynamics not observed in any other commodity or financial market [48].

In the face of this, accurately modeling and forecasting electricity price dynamics becomes a crucial task for designing effective short-term trading strategies and long-term investments in power generating technologies. Namely, at the corporate level short- and long-term electricity price forecasts are very important from the producer’s perspective [13, 53]. On one hand short-term price forecasts are of particular interest for defining bidding strategies [5] and scheduling production in order to maximize trading profits or hedge financial risk [4, 49]. On the other hand price forecasts on longer time horizons, ranging from a few years to decades, are of strategic importance for valuing investments in new generating technologies and for power planning decision making of energy companies and policy makers. Providing a link between accurate short-term modeling and long-term behavior of electricity prices is thus an important and necessary task.

Short-term price forecasting techniques are well developed in the literature, both for point and for probabilistic forecasting. A standard reference for price point forecasting is the in-depth review proposed by Weron [49]. Probabilistic forecasting consists of forecasting the whole price distribution or some related parts, as for example quantiles, at a time not too far in the future. Short-term probabilistic forecasting was recently reviewed by Nowotarski and Weron [40]. On the other side, long-run forecasting of electricity prices has not yet been investigated as much. This fact might be due to a limited understanding of the main drivers of the most important variables which affect electricity prices over long time horizons, as fossil fuel prices, environmental policies regarding \(\hbox {CO}_{\text{2 }}\) emissions, technological changes, smart grid evolution, etc. [47]. Although some methods for long-term point forecasting electricity prices and their volatilities are proposed in the literature (see, e.g., [1, 15]), papers on probabilistic forecasting methods, in which the probability density function of electricity prices is forecast, are few and they are mainly devoted to mid-term forecasting ranging from 1 month to 1 year [2, 3]. A review on probabilistic mid- and long-term electricity price forecasting was discussed by Ziel and Steinert [54]. In the same paper, the Authors proposed also a probabilistic approach to forecast electricity prices for several months up to 3 years. However, valuing investments in new generating technologies requires to take into account forecasts of revenues from selling electricity at market prices and their volatilities for decades, i.e., over the whole lifetime of the plant. The present paper aims to fill this gap in the literature by introducing a new probabilistic approach to long-term forecasting in order to simulate revenues distributions over very long time horizons.

We propose a long-term forecasting methodology in which the long-run behavior of power prices is derived from the short-term dynamics. To this end, we will start from accurately modeling short-term random movements of electricity prices and we will end up to provide probability distributions in the long-run. In particular, we discuss three short-term stochastic models. In the first model, which we will name ‘Model 1’, the dynamics of electricity prices is described by a mean-reverting diffusion process. In the second model, i.e., ‘Model 2’, a mean-reverting jump-diffusion process is used to describe the dynamics of prices. Finally, in the third model, i.e., ‘Model 3’, the dynamics of electricity prices is described by a mean-reverting two regime-switching Markov process. Mean-reversion is a very relevant feature of the electricity price behavior observed in power markets. First, it is responsible for reducing prices after a spike has occurred; second, it forces the stochastic component of prices to fluctuate around some long-run mean, driving probability densities toward stationary long-run distributions. By the use of mean-reversion, the short-term dynamics can be connected to the long-term behavior of power prices. Moreover, these models can be calibrated on historical data and can include a structural component in terms of forward looking information based on macroeconomic views about the future long-term evolution of electricity prices. In this way, the proposed approach develops a robust link between accurate short-term modeling and long-term behavior of electricity prices. This is one of the novelty aspects of the present paper and the first main contribution to the literature.

Long-run forecasting is an important topic of research. When an electricity company plans to build new power plants, it needs long-term revenues and generation cost forecasts over the whole lifetime of the plants, basing its decision-making on some long-term metrics as, for example, the Net Present Value (NPV) of the investment [24]. The NPV criterion is a widespread method suitable for long-term evaluation [31, 45] that takes into account revenues from selling electricity in power markets and costs incurred during the whole lifetime of the plants. In addition, the stochastic NPV theory, which attributes to the NPV a probability distribution, provides a powerful tool to perform risk analysis of investments [45].

As a second contribution to the existing literature, we discuss an evaluation scheme for risky investments in new baseload generating technologies, namely fossil fuel (gas and coal) power plants and nuclear power plants, based on the stochastic NPV as a long-term metric. The stochastic NPV is computed under accurate modeling of the stochastic dynamics of the main factors affecting the profitability of the investment. In this regard, five sources of uncertainty are taken into account, namely electricity market prices, fossil fuel prices (natural gas and coal prices), nuclear fuel prices and \(\hbox {CO}_{\text{2 }}\) prices. Market based \(\hbox {CO}_{\text{2 }}\) pricing schemes (like the European Union Emissions Trading Scheme, EU-ETS) generate volatility in \(\hbox {CO}_{\text{2 }}\) prices [14, 38] through the interaction between demand and supply, thus introducing a new source of uncertainty which must be taken into account for valuing investments in power generation technologies [20]. These factors are the main financial risk sources in the electricity sector [15, 26]. Since the analysis is limited to baseload technologies, quantity uncertainty has a minor effect and it is not taken into account. Regarding the nuclear source, we do not consider here the financial risk due to the social acceptance of this technology. The reason is that we assume that the investment evaluation is performed in a case in which the nuclear power generation is a well accepted technology. Although in principle construction costs could be stochastic, especially for nuclear power plants [27], we do not consider here this possibility. However, the model can be extended to account for uncertainty in all types of costs.

The novelty of this approach is to provide a model to value investments in new baseload generating technologies in a stochastic framework in which random movements of electricity and fuel prices are accurately modeled both in the short- and in the long-term. The starting point of our analysis is modeling the short-term behavior and then investigate the long-term limit in order to compute stochastic revenues and costs during the whole lifetime of the plants. Regarding the electricity price dynamics, we use the regime-switching Model 3 that better describes the random movements of electricity prices observed in real markets with respect to Model 1 and Model 2 (as it is will shown in the following). From the costs side, fuel prices too are modeled according to well defined stochastic processes. In particular, since gas market prices exhibit mean-reversion and jumps, we use a mean-reverting jump-diffusion model to capture the features of the short-term dynamics of gas prices. Coal and nuclear fuel prices do not show mean-reversion and we assume that their time evolution are both described by a Geometric Brownian Motion (GBM). As a further source of uncertainty, we will consider the possibility to include \(\hbox {CO}_{\text{2 }}\) costs into the analysis. \(\hbox {CO}_{\text{2 }}\) prices will be modeled according to a GBM. As suggested in the literature, the long-run analysis of investments in the power sector must integrate historical data and future trends in market prices, including expert evaluations of future regulations and forthcoming technologies [19]. The evaluation model we propose can be calibrated on historical data on market prices and can incorporate a structural component in terms of macroeconomic views about future trends of electricity and fuel prices. This approach allows us to determine the probability density of stochastic NPVs of new generating technologies and to perform risk analysis of investments in capacity expansion. The empirical analysis, based on cost data of new generating technologies collected from the ‘Annual Energy Outlook 2019’ [9] reveals that, with the exception of the gas generation, both coal and nuclear power generation show a negative expected NPV. This is an important result which can be useful for both investors and policy makers in their efforts to plan capacity expansion and future power system configurations. This is the third main contribution to the literature.

It could be interesting to relate our approach to more structural approaches to long-term price setting and forecasting. For example, in the model proposed by Oliveira and Costa [41] a fictional microeconomic dynamics drives the market to an equilibrium, which is reached by groping and mutual learning by part of the market agents. Since the dynamics is fictional, no information can be given about the real dynamics which will go on inside the period studied. In the model proposed by De Vries and Heijnen [8], the specific aim is understanding how the dynamics of electricity generation capacity growth can be stimulated and controlled in time, within a long-term horizon. In this case, demand and supply changes are matched in time under different regulatory and legislation schemes, and under demand growth uncertainty. Together, these two approaches can be taken as examples of possible ways to tackle with one of the most important facts behind long-term price valuation. Price forecasting on long time horizons means taking into account a capacity expansion problem and its impact on prices [6], especially in the presence of variable renewable energy (VRE) sources [52]. It is certainly not easy to include this feature into long-term forecasting models. In any case, making an econometrics of prices emerge from matching demand with supply is in the end deeply linked to our approach. In a different but twin way, the stochastic NPV assessment tries to model a decision process under uncertainty, yet without directly coping with the microstructure of the power market. In our approach, the effects of the microstructure are included in the phenomenology of historical power and fuel prices, and we indirectly include it by calibrating the model on empirical data related to historical market prices over long time horizons and on macroeconomic forward looking views about future trends of electricity, fuel and \(\hbox {CO}_{\text{2 }}\) market prices.

The paper is organized as follows. Section 2 discusses the short-term dynamics of electricity prices. Some models are introduced and estimated on historical data from Palo Verde and PJM markets. In Sect. 3, the long-term behavior of electricity prices is investigated, and stationary probability distributions are derived and discussed. Section 4 introduces the stochastic NPV metric and its link with the stochastic Levelized Cost Of Electricity (LCOE). Sections 5 and 6 concern the stochastic modeling of revenues and costs respectively. In Sect. 7, a stochastic NPV based analysis of investments in new generating technologies is provided. Section 8 concludes. Finally, “Appendix ” provides some technical results about the time evolution of central moments in the jump-diffusion model.

2 Modeling electricity price dynamics

Figure 1 depicts the time series of daily electricity prices observed between January, 2009 and December, 2018 at the two US power markets of Palo Verde (US Southwest region) and PJM (US Northeast region). Daily prices are obtained as weighted averages of the 24 hourly market prices and are expressed in nominal dollars per megawatthour (\({\$}\)/MWh). These electricity price time series are available at www.eia.gov/electricity/wholesale and are freely downloadable.

Fig. 1
figure 1

Historical behavior of power prices at Palo Verde market (left) and at PJM market (right) since January, 2009–December, 2018

Looking at Fig. 1, we note that electricity prices follow a very erratic dynamics characterized by high volatility, jumps and pronounced spikes. Moreover, multi-regime dynamics can be observed. In normal stable periods, prices fluctuate around some long-run mean; in turbulent periods prices experience jumps and short-lived spikes. After a jump or a spike has occurred, a mean-reversion mechanism forces back prices to fluctuate around some long-run mean. Accurate modeling power price dynamics means to take into account all these features.

Several continuous-time models for electricity prices were proposed in the literature. Since the seminal paper by Lucia and Schwartz [30], in which a mean-reverting diffusion process was proposed to model the power price dynamics at the Nord Pool market, the literature on this topics has grown exponentially. Mean-reverting jump-diffusion processes and mean-reverting regime-switching models were extensively used to accurately describe the jumpy and the spiky behavior of electricity prices observed in power markets. In this section we focus on three models of these types, namely a mean-reverting diffusion model, a mean-reverting jump-diffusion model, and mean-reverting two regime-switching model.

Let us denote by P(t) the daily price at time t of one MWh of electricity, and by \(s(t)=\ln P(t)\) its natural logarithm. We assume that s(t) is a linear superposition of a deterministic component, f(t), possibly accounting for trend and seasonality, and a random component, x(t), namely

$$\begin{aligned} s(t)=f(t)+x(t). \end{aligned}$$
(1)

Since electricity prices may be higher in winter time and in summer time, we express the deterministic component as

$$\begin{aligned} f(t)=b_0+b_1t+b_2 \cos \big (b_3+\frac{2\pi t}{\tau }\big )+b_4 \cos \big (b_5+\frac{4\pi t}{\tau }\big ), \end{aligned}$$
(2)

to describe the semiannual periodicity. A linear trend is included to account for expected inflation and possibly for a real escalation rate of power prices (positive or negative). The parameter \(\tau\) denotes the average number of observations per year. One can estimate the seasonal component parameters \(b_j\) (\(j=0,2,\cdots ,5\)) by fitting f(t) to market data using Ordinary Least Squares (OLS) techniques. Table 1 depicts the parameters estimates obtained in the Palo Verde and PJM power markets.

Table 1 Parameter estimates of the deterministic component

Figure 2 shows the time series of stochastic log-returns (hereinafter, log-returns) obtained as daily changes of the random component x(t), at Palo Verde market (left panel) and at PJM market (right panel). Descriptive statistics of log-returns are displayed in Table 2.

Fig. 2
figure 2

Historical behavior of log-returns at Palo Verde market (left) and at PJM market (right) since January, 2009–December, 2018

Table 2 Descriptive statistics of log-returns

As mentioned in Sect. 1, and as it can be seen also in the figure, interplay between demand and supply generates a lot of volatility in nowadays power markets. Log-returns show large fluctuations with jumps and spikes, and non-normal, leptokurtic empirical distributions. With the aim of capturing the features of log-returns observed in these markets, we discuss now three continuous-time stochastic models for the dynamics of x(t), called, respectively, Model 1, Model 2 and Model 3. The main features of these models are described below.

2.1 Model 1

In Model 1, the dynamics of x(t) is described by the following mean-reverting diffusion process,

$$\begin{aligned} dx(t)=-\alpha _0 x(t)dt+\sigma _0 dw_0(t), \end{aligned}$$
(3)

where \(\alpha _0\) is the mean-reversion parameter, \(\sigma _0\) is the volatility, and \(w_0(t)\) is a Wiener process. Although this model captures the mean-reverting behavior of power prices, it is not able to account for jumps and spikes. This limitation can be overcome by modeling power price dynamics with a jump-diffusion process.

2.2 Model 2

In Model 2 the dynamics of x(t) is described by a mean-reverting jump-diffusion process of the form

$$\begin{aligned} dx(t)=-\alpha _0 x(t)dt+\sigma _0 dw_0(t)+Jdq(t), \end{aligned}$$
(4)

where q(t) is a Poisson process with constant intensity \(\lambda\). In Eq. (4) the random jump amplitude J is distributed as a Gaussian random variable with zero mean and standard deviation \(\sigma _J\), i.e. \(J \sim N(0,\sigma _J^2)\). We assume that the Wiener process, the Poisson process, and the jump amplitude are mutually independent processes. We remark that the zero mean jump amplitude was chosen according to the observed low values of the skewness present in the data (see Table 2). Nevertheless, the proposed analysis is general and can be extended in a straightforward way to include jumps with arbitrary probability distributions (see “Appendix ”).

2.3 Model 3

Model 3 consists of a regime-switching process with two regimes. Regime-switching processes add a further degree of freedom to the description of the dynamics of electricity prices. In our specific, they allow us to combine in one model periods of steady dynamics and of jumpy dynamics, depending on the realization of a stochastic two-valued latent state variable of the system. We can thus make use of two different state-dependent mean-reversion rates and include stochastic volatility. Model 3 is characterized by the following process,

$$\begin{aligned} dx(t)=\left\{ \begin{array}{l} -\alpha _0 x(t) dt + \sigma _0 dw_0(t),\\ -\alpha _1 x(t) dt + \sigma _1 dw_1(t)+Jdq(t). \\ \end{array} \right. \end{aligned}$$
(5)

The dynamics of the base regime (first line of Eq. 5) is described by a mean-reverting diffusion process in order to account for the motion during stable periods. In contrast, the dynamics of the jumpy regime (second line of Eq. 5) is described by a mean-reverting jump-diffusion process. As in the two previous models, \(w_0(t)\) and \(w_1(t)\) are Wiener processes and q(t) is a Poisson process with constant intensity \(\lambda\). In Eq. (5) the random jump amplitude J is distributed as a Gaussian random variable with zero mean and standard deviation \(\sigma _J\), i.e., \(J \sim N(0,\sigma _J^2)\). We assume that the Wiener processes, the Poisson process, and the jump amplitude, are mutually independent processes. The switching between regimes is driven by a hidden Markov process characterized by the following transition probability matrix,

$$\begin{aligned} {\pi }=\left( \begin{array}{cc} 1-\gamma dt &{} \eta dt \\ \gamma dt &{} 1-\eta dt \end{array} \right) , \end{aligned}$$
(6)

where \(\gamma dt\) denotes the transition probability of a switch from the base regime to the turbulent regime in the infinitesimal time interval \([t, t+dt]\), and \(\eta dt\) is the probability of the opposite transition.

All these models were estimated on market data by maximum likelihood using the Euler discretization with time step \(\varDelta t\) equal to 1 day. In the case of Model 3, the Hamilton filtering technique [17, 18] was used. Estimation results are depicted in Table 3 for the Palo Verde market, and in Table 4 for the PJM. For each market, the parameters estimates, the log-likelihood (LL), and the value of the Schwartz criterion (SC) are reported.

Table 3 Palo Verde estimation results
Table 4 PJM estimation results

As it can be read from the last line of these tables the empirical analysis reveals that the regime-switching model describes the dynamics of electricity prices better than diffusion and jump-diffusion models. The Schwartz criterion indicates that the presence of jumps in the dynamics enhances the fit. Moreover, the added degrees of freedom coming from multiple mean-reversion rates and volatilities (i.e., for the base and the turbulent regime), makes regime-switching models more suitable to describe the dynamics of electricity prices observed in real markets [32, 37]. As expected, the mean-reversion parameter as well as the volatility parameter are lower in the base regime with respect to the turbulent one. The statistical analysis of simulated trajectories confirms that regime-switching models offer an interesting agreement with market data. Tables 5 and 6 display some parameters computed from simulated log-returns time series. Such values are computed averaging over ten thousands randomly generated paths using, respectively, Palo Verde and PJM estimates to be compared with the statistics from original data shown in Table 2. We can observe that the statistical analysis of simulated trajectories shows an interesting agreement with market data in the case of Model 3.

Table 5 Statistics of simulated path log-returns obtained using Palo Verde estimated parameters
Table 6 Statistics of simulated path log-returns obtained using PJM estimated parameters

We conclude this section by noticing that Palo Verde and PJM markets are characterized by very different values of the mean-reversion rate. In all the models which we discussed, mean-reversion parameters estimated on Palo Verde market data are lower than those estimated on PJM market data. As we will see in the following sections, this fact has important consequences on the long-run behavior of electricity prices.

3 Long-term evolution of electricity prices

The long-term behavior of electricity prices can be investigated by studying the time evolution of the central moments of log-price distributions. Central moments are defined by

$$\begin{aligned} M_n(t)=E\big [\big (x(t)-\mu (t)\big )^n\big ], \end{aligned}$$
(7)

where \(\mu (t)=E[x(t)]\) denotes the expected value of the random variable x(t). In both Model 1 and Model 2 if \(x(0)=x_0\) is the initial condition of the process, \(\mu (t)\) is given by

$$\begin{aligned} \mu (t)=x_0e^{-\alpha t}. \end{aligned}$$
(8)

In these two models, closed form solutions for central moments can be found. In particular, odd central moments are zero and the first two even moments are given by

$$\begin{aligned} M_2(t)=\frac{\sigma ^2}{2\alpha }\bigg (1-e^{-2\alpha t}\bigg ), \end{aligned}$$
(9)

and

$$\begin{aligned} M_4(t)=3\bigg [\frac{\sigma ^2}{2\alpha }\bigg (1-e^{-2\alpha t}\bigg )\bigg ]^2=3M_2(t)^2, \end{aligned}$$
(10)

in Model 1, and by

$$\begin{aligned} M_2(t)=\frac{\sigma ^2+\lambda \sigma _J^2}{2\alpha }\bigg (1-e^{-2\alpha t}\bigg ), \end{aligned}$$
(11)

and

$$\begin{aligned} M_4(t)=\frac{3 \lambda \sigma _J^4}{4 \alpha }\bigg (1-e^{-4\alpha t}\bigg )+3M_2(t)^2, \end{aligned}$$
(12)

in Model 2. The proofs of the above equations as well as the proofs of the formulas presented in this section are provided in A.

It is well known that x(t) are normal variables for \(t>0\) in Model 1 [16], but they are not normal in Model 2, as it can be verified by the value of the kurtosis which is different from 3. In the limit \(t \rightarrow \infty\), central moments converge toward stationary values that can be computed by using the recursive relationship

$$\begin{aligned} M_{2n}=\frac{2n-1}{2\alpha }\sigma ^2 M_{2(n-1)} \end{aligned}$$
(13)

in Model 1, and by

$$\begin{aligned} M_{2n}=\frac{2n-1}{2\alpha }\sigma ^2 M_{2(n-1)}+\frac{\lambda }{2n\alpha }\sum _{k=1}^n\frac{(2n)!}{2^k k! (2n-2k)!}\sigma _J^{2k}M_{2(n-k)} \end{aligned}$$
(14)

in Model 2. The stationary value of the kurtosis is \(K=3\) in Model 1, and

$$\begin{aligned} K=3+3 \frac{\lambda \sigma _J^4}{\alpha \big (\sigma ^2+\sigma _J^2\big )^2}, \end{aligned}$$
(15)

in Model 2. Figure 3 depicts the time behavior of standard deviation and kurtosis, computed with estimated parameters at Palo Verde and PJM markets for both Model 1 and Model 2.

Fig. 3
figure 3

Time behavior of standard deviation and kurtosis simulated using Palo Verde (left panels) and PJM (right panels) estimated parameters

In market data, probability distributions tend to be stationary in a few tens of days. After this time interval, the process random variables x(t) and x(s) become identically distributed for any time t and time s. The speed of convergence to the stationary distribution depends, of course, on the mean-reversion parameter. The convergence is, therefore, faster for PJM market prices. Moreover, random variables x(s) and x(t) (\(t\ge s)\) become uncorrelated in a few tens of days. To show this, we notice that the autocorrelation function can be computed in a closed form in both models and reads,

$$\begin{aligned} \text {Cov}\big (x(s),x(t)\big )=\frac{\sigma ^2}{2\alpha }\bigg [e^{-\alpha (t-s)}-e^{-\alpha (t+s)}\bigg ], \end{aligned}$$
(16)

in Model 1, and

$$\begin{aligned} \text {Cov}\big (x(s),x(t)\big )=\frac{\sigma ^2+\lambda \sigma _J^2}{2\alpha }\bigg [e^{-\alpha (t-s)}-e^{-\alpha (t+s)}\bigg ], \end{aligned}$$
(17)

in Model 2. For large s, i.e., after some tens of days, the autocorrelation function becomes stationary in both Model 1 and Model 2. It depends on the time difference, \(s-t\), with a correlation coefficient which decreases exponentially,

$$\begin{aligned} \rho (s,t)=e^{-\alpha (t-s)}, \end{aligned}$$
(18)

in both Model 1 and Model 2. A more complicated structure arises when Model 3 is considered and no closed form solutions for central moments can be found. Moreover, differently from Model 1 and Model 2, the time evolution of the central moments depends on the initial condition of the stochastic dynamics.

Fig. 4
figure 4

Time behavior of the first four moments simulated using estimated parameters at Palo Verde market (left panels) and at PJM (right panels). The initial condition is set at \(x(0)=0\) (blue line) and \(x(0)=3\) (red line)

Figure 4 shows the time behavior of the mean, standard deviation, skewness and kurtosis (hereinafter, the first four moments) obtained by Monte Carlo simulations using estimated parameters at Palo Verde and PJM markets respectively. Simulations were performed under two very different initial conditions, namely \(x(0)=0\) and \(x(0)=3\). The first value was chosen equal to the long-run mean value of x(t), the second value was chosen very far from the long-run mean value (it corresponds to an electricity price greater than one thousand dollars). In both cases, standard deviation and kurtosis converge rapidly (in some tens of days) to their stationary values. The mean and the skewness converge rapidly to zero. The convergence is faster in the PJM market because of larger values of the mean reversion parameters. Moreover, random variables x(t) and x(s) (\(s\ge t)\) become uncorrelated in some tens of days. Figure 5 depicts the time behavior of the correlation coefficient in the time interval of [0, 150] days simulated by using Palo Verde and PJM estimated parameters. In each panel, the blue line refers to the initial condition \(x(0)=0\), and the red line to the initial condition \(x(0)=3\). The third (yellow) line shows the time behavior of the stationary correlation coefficient. Of course, the stationary autocorrelation function does not depend on the initial condition of the dynamics. Due to higher mean-reversion values, both in the base regime and in the turbulent regime, the convergence toward zero correlation is faster for PJM market prices.

Fig. 5
figure 5

Time behavior of the correlation coefficient simulated by using Palo Verde (left panel) and PJM (right panel) estimated parameters

The discussed results are very useful to investigate the long-term behavior of electricity prices. Valuing investments in the power sector for new generating capacity requires to take into account revenues from selling electricity on power market for decades. We will see that such results can be used to conciliate accurate descriptions of short-term behavior with long-term macroeconomic views about future trends of electricity prices.

4 The stochastic net present value of power generation and the stochastic levelized cost of electricity

The stochastic Net Present Value (NPV) and the stochastic Levelized Cost Of Electricity (LCOE) are mathematical constructs that allow us to introduce risk into the evaluation process of investments in new generating technologies [28, 45]. As specified in the Introduction, we will consider in this paper five sources of risk, namely the risk associated to random movements of (1) electricity prices, (2) gas, (3) coal prices, (4) nuclear fuel and (5) \(\hbox {CO}_{\text{2 }}\) prices.

To briefly introduce some basic concepts about stochastic NPV and LCOE, let us consider an investment project in a generating plant, financially seen as a cash-flow stream defined on a yearly timetable (as depicted in Fig. 6). We denote by \(n=-N<0\) the construction starting time, and by \(n=0\) the end of construction time. The time \(n=0\) is also the starting time of the operations, and \(n=M\ge 1\) is the end of operations time. The cash-flow evaluation time is \(n=0\).

Fig. 6
figure 6

Project timeline

We suppose that there are k risk sources and we denote by \(\xi\) the risky sources stochastic path. The unlevered cash-flow generated by the project at time n, \(F_n^z\), can be expressed as follows,

$$\begin{aligned} F_n^z(\xi )=R_n^z(\xi )-C_n^z(\xi )-T_n^z(\xi ), \quad n=1,2,\ldots ,M, \end{aligned}$$
(19)

where \(R_n^z(\xi )\) accounts for stochastic revenues from selling electricity, \(C_n^z(\xi )\) for stochastic costs, and \(T_n(\xi )\) for income taxes in the year n. Revenues, costs, and taxes incurring in the time interval \([n-1,n]\) are computed as lump sums and valued at time n. All these quantities are expressed in nominal terms. The cost term in Eq. (19) includes all the costs incurred during the whole the operational lifetime of the plant, namely fixed and variable operation and maintenance (O&M) costs, fuel costs (not included in the variable O&M costs), waste management (for nuclear generation) and decommissioning costs. With regard to fossil fuels, costs can include externalities, i.e., environmental costs like \(\hbox {CO}_{\text{2 }}\) market costs.

In the classic definition, the NPV of an investment in the generating technology z is determined by the difference between the present value of expected revenues and the present values of expected costs and taxes including pre-operations investment costs. NPV can be, then, determined by subtracting investment costs from the present value of the unlevered cash-flow. To compute the present value, the unlevered cash-flow must be discounted at the Weighted Average Cost of Capital (WACC) nominal rate [46]. If we denote by r the nominal WACC rate on an annual basis, the NPV formula reads

$$\begin{aligned} NPV^z=\sum _{n=1}^M\frac{E\big [F_n^z(\xi )\big ]}{(1+r)^n}-I_0^z. \end{aligned}$$
(20)

In Eq. (20) \(I_0^z\) is the pre-operations nominal investment, starting at \(n=-N\) and ending at \(n=0\), computed as a lump sum, namely as

$$\begin{aligned} I_0^z= {\hat{I}}_{-N}^z(1+r)^N+\cdots + {\hat{I}}_{-1}^z(1+r)+ {\hat{I}}_0^z, \end{aligned}$$
(21)

where \({\hat{I}}_n^z\) is the nominal amount of the construction cost allocated to year n. Following the MIT [34, 35] analytical approach, yearly tax liabilities are computed by subtracting costs and asset depreciation from sales revenues, thus getting

$$\begin{aligned} T_n^z(\xi )=T_c\big [R_n^z(\xi )-C_n^z(\xi )-D_n^z\big ], \end{aligned}$$
(22)

where \(D^z_n\) is the fiscal depreciation. Finally, revenues from electricity sales can be cast as follows,

$$\begin{aligned} R_n^z(\xi )=Q^zP_n^z(\xi ), \end{aligned}$$
(23)

where \(Q^z\) is the amount of electricity generated by the technology z in 1 yearFootnote 1, and \(P_n^z(\xi )\) is the (yearly averaged) unitary selling price of the electricity produced in the year n by the technology z. By substituting Eq. (19) into Eq. (20) and accounting for Eqs. (22) and (23), after some algebraic manipulations, we get

$$\begin{aligned} \begin{aligned} NPV^z&=(1-T_c)Q^z\sum _{n=1}^ME\big [P_n^z(\xi )\big ]F_{0,n}+\\&\quad -(1-T_c)\sum _{n=1}^ME\big [C_n^z(\xi )\big ]F_{0,n}+T_c\sum _{n=1}^MD_n^zF_{0,n}-I_0^z, \end{aligned} \end{aligned}$$
(24)

where we posed

$$\begin{aligned} F_{0,n}=\frac{1}{(1+r)^n}. \end{aligned}$$
(25)

Equation (24) can be also used as a starting point to determine the Levelized Cost Of Electricity (LCOE) of a given technology.

By definition, the LCOE of the technology z, henceforth denoted by \(P^{\text {LC},z}\), is that deterministic nonnegative real price of the electricity produced by the specific generation technology z, assumed constant over time, that makes the NPV equal to zero. The LCOE is hence a break-even reference unitary cost, i.e., the break-even cost per MWh of produced electricityFootnote 2. To get the analytical form of \(P^{\text {LC},z}\), let us pose

$$\begin{aligned} E\big [P_n^z(\xi )\big ]=(1+i)^{n-n_b}P^{\text {LC},z}, \end{aligned}$$
(26)

where i is the expected yearly inflation rate and \(n_b\) labels the base year (which is used to transform real prices into nominal prices). By substituting Eq. (26) into Eq. (24) and equating \(NPV^z\) to zero, we obtain

$$\begin{aligned} P^{\text {LC},z}= \frac{\sum _{n=1}^M E\big [C_n^z(\xi )\big ] F_{0,n}}{Q^z\sum _{n=1}^M(1+i)^{n-n_b}F_{0,n}}+\frac{I_0^z-T_c\sum _{n=1}^M{ D_n^z}F_{0,n}}{(1-T_c)Q^z\sum _{n=1}^M(1+i)^{n-n_b}F_{0,n}}. \end{aligned}$$
(27)

In order to include into the analysis the risk due to random movements of revenues and costs, we must extend the classic NPV metrics. Now we are going to define the stochastic NPV. Such a definition must satisfy the constraint that the classic NPV must coincide with the mean of the stochastic NPV, i.e.,

$$\begin{aligned} NPV^z=E\big [NPV^z(\xi )\big ]. \end{aligned}$$
(28)

We hence define the stochastic NPV of the technology z as the random variable

$$\begin{aligned} \begin{aligned} NPV^z(\xi )&=(1-T_c)Q^z\sum _{n=1}^MP_n^z(\xi )F_{0,n} \\&\quad -(1-T_c)\sum _{n=1}^MC_n^z(\xi )F_{0,n}+T_c\sum _{n=1}^MD_n^zF_{0,n}-I_0^z. \end{aligned} \end{aligned}$$
(29)

Starting from the definition of the stochastic NPV of the technology z, it is straightforward to introduce the stochastic LCOE of the technology z. Let us denote by \(P^{\text {LC},z}(\xi )\) the stochastic LCOE of the technology z. \(P^{\text {LC},z}(\xi )\) is defined path by path as that nonnegative real price of the electricity produced by the specific generation technology z, assumed constant over time, that makes the stochastic \(NPV^z(\xi )\) equal to zero. To get the analytical form of the stochastic LCOE, let us pose

$$\begin{aligned} P_n^z(\xi )=(1+i)^{n-n_b}P^{\text {LC},z}(\xi ), \end{aligned}$$
(30)

thus obtaining from Eq. (29),

$$\begin{aligned} \begin{aligned} P^{\text {LC},z}(\xi )&=\frac{\sum _{n=1}^M C_n^z(\xi ) F_{0,n}}{Q^z\sum _{n=1}^M(1+i)^{n-n_b}F_{0,n}}\\& \quad+\frac{I_0^z-T_c\sum _{n=1}^M{ D_n^z}F_{0,n}}{(1-T_c)Q^z\sum _{n=1}^M(1+i)^{n-n_b}F_{0,n}}. \end{aligned} \end{aligned}$$
(31)

As in the case of the stochastic NPV, Eq. (31) shows that the mean of the stochastic LCOE coincides with the classic, deterministic LCOE.

By substituting Eq. (31) into Eq. (29) we can obtain a very useful relationship between the stochastic NPV and the stochastic LCOE of the generation technology z, namely

$$\begin{aligned} NPV^z(\xi )=(1-T_c)Q^z\sum _{n=1}^M\big [P_n^z(\xi )-P^{\text {LC},z}(\xi ) (1+i)^{n-n_b}\big ]F_{0,n}. \end{aligned}$$
(32)

We remark that the relevant quantity for evaluating an investment in a generating technology is not the NPV itself (for which doubling the size of a plant would double the NPV), but the unitary NPV, i.e., the NPV per unit of generated electricity. We can, therefore, introduce a ‘reduced stochastic NPV’ in the following form,

$$\begin{aligned} NPV^z_\text {red}(\xi )=\frac{NPV^z(\xi )}{(1-T_c)Q^z\sum _{n=1}^M(1+i)^{n-n_b}F_{0,n}}. \end{aligned}$$
(33)

The reduced stochastic NPV can be cast in a much more expressive form. From Eq. (32) we namely get

$$\begin{aligned} NPV^z_\text {red}(\xi )={\hat{P}}^z(\xi )-P^{\text {LC},z}(\xi ), \end{aligned}$$
(34)

where

$$\begin{aligned} {\hat{P}}^z(\xi )=\frac{\sum _{n=1}^MP_n^z(\xi )F_{0,n}}{\sum _{n=1}^M(1+i)^{n-n_b} F_{0,n}}. \end{aligned}$$
(35)

Equation (34) clearly shows that break-even or profitability can be reached if and only if \(E\big [{\hat{P}}^z(\xi )\big ] \ge P^{\text {LC},z}\) (it means that the right quantity to be compared with the deterministic LCOE is \(E\big [{\hat{P}}^z(\xi )\big ]\)). For dispatchable baseload technologies, such as nuclear, coal or combined-cycle gas turbines (CCGTs), which can have the same electricity output profile, \(P_n^z\) is a technology independent quantity, i.e.,

$$\begin{aligned} P_n^z(\xi )=P_n^b(\xi ), \end{aligned}$$
(36)

where \(P_n^\text {b}\) denotes the (yearly averaged) unitary selling price of the baseload generation in the year n. In such a case, Eq. (34) becomes

$$\begin{aligned} NPV^z_\text {red}(\xi )={\hat{P}}^\text {b}(\xi )-P^{\text {LC},z}(\xi ), \end{aligned}$$
(37)

where

$$\begin{aligned} {\hat{P}}^\text {b}(\xi )=\frac{\sum _{n=1}^MP_n^b(\xi )F_{0,n}}{\sum _{n=1}^M(1+i)^{n-n_b}F_{0,n}}. \end{aligned}$$
(38)

Very often in the literature, the classic LCOE is used as a metric to compare generation costs of different technologies [9, 22]. That is, the classic LCOE metric has been used as an alternative to the classic NPV metric. For baseload generation, an economic comparison through LCOE makes sense because, as shown by Eq. (37), the technology that maximizes the expected reduced NPV is the technology that minimizes LCOE. This close link between the LCOE methodology and the financial notion of NPV ‘has always heightened its appeal’ [22]. This simple metric allows for a straightforward comparison of technologies that have different sizes, different lifetimes and different cost profiles in both regulated and liberalized electricity markets. However, in the case of Variable Renewable Energy (VRE) sources, such as wind or solar sources, \({\hat{P}}_n^z(\xi )\) can be a technology dependent parameter. Namely, in liberalized markets the hourly electricity output profile of non-dispatchable technologies can significantly differ between each other, and the LCOE evaluation approach should be carefully used for comparative purposes. Well aware of this, some Authors tried to investigate timing impact of electricity generation from VRE sources [23, 43]. In any case, being the LCOE a break-even reference unitary cost, it is a useful reference cost metric which can be compared with expected electricity market prices for checking, on the basis of Eq. (34), if break-even can be reached.

The next sections will be devoted to compute stochastic revenues and costs. Our aim ere his to provide a model to evaluate long-run investments in new generation technologies under uncertainty through the stochastic NPV metric. Without loss of generality, in the following we assume that the evaluation time, \(n=0\), coincides with the base year, \(n_b\).

5 Modeling revenues

When modeling revenues, we assumed that the dynamics of electricity prices is described by Model 3, the more realistic model that well reproduces most of the observed features of market prices. This model was calibrated on past data, in a backward looking way, separating the deterministic component of Eq. (2) from the stochastic component. Since we now want to include macroeconomic forward looking information, we can replace in Eq. (2) the values of \(b_0\) and \(b_1\) related to the linear (affine) trend with new values that can account for macroeconomic views on the future long-term evolution of electricity prices. The \(\beta _0\) and \(\beta _1\) parameters will be determined in a such a way to match the expected long-run mean, the expected inflation rate, and the expected real escalation rate of power prices. Such macroeconomic estimates can be found in the Annual Energy Outlook 2019 [9] provided by the US Energy Information Administration.

According to both classic and stochastic NPV approaches the revenues from selling electricity during the annual time interval \([n-1,n]\) must be computed as a lump sum valued at time n. Limiting our analysis to baseload generation, the revenues term of Eq. (37) can be computed assuming that \(P_n^\text {b}\) is the annual averages of daily market prices, i.e.,

$$\begin{aligned} P_n^\text {b}=e^{\beta _0+\beta _1 n}\bigg [\frac{1}{365}\sum _{k=1}^{365}P_0(t_{kn})\bigg ], \end{aligned}$$
(39)

where \(\beta _0\) and \(\beta _1\) account for a forward looking linear (affine) trend. Such parameters go to replace \(b_0\) and \(b_1\). \(P_0(t_{kn})\) denotes the electricity price at the \(k\text {th}\) day of the \(n\text {th}\) year, computed with \(b_0=b_1=0\). The dependence on the path \(\xi\) will be henceforth omitted. To investigate the statistical properties of \(P_n^\text {b}\), let us consider the detrended log-revenues process, \(h^\text {b}_n\), given by

$$\begin{aligned} h^\text {b}_n=\log P_n^\text {b}-\beta _0-\beta _1 n. \end{aligned}$$
(40)

Figure 7 shows the time behavior of the 1-year autocorrelation of \(h^\text {b}_n\) as n varies on a 30-year time horizon, computed using estimated parameters at Palo Verde and PJM markets. Figure 8 depicts the time behavior of the first four moments of the detrended log-revenues process as as n varies on a 30-year time horizon.

Fig. 7
figure 7

One-year autocorrelation of the detrended log-revenues process

Fig. 8
figure 8

Time behavior of the first four moments of the detrended log-revenues process

We notice that the random variables \(h^\text {b}_n\) (\(n\ge 1\)) are uncorrelated random variables with same mean, \({\bar{h}}^\text {b}\) (about 0.06 at Palo Verde, and 0.05 at PJM), same standard deviation, \(\varSigma _{\text {b}}\) (about \(9.46\%\) at Palo Verde, and \(5.37\%\) at PJM), zero skewness and kurtosis equal to three. Moreover, we performed some tests of normality, such as the Jarque-Bera test, the Kolmogorov-Smirnov test, the Anderson-Darling test and the chi-square test on Monte Carlo generated samples with sample size of five thousands trials. All these tests reveal that the hypothesis of normality for \(h^\text {b}_n\) (\(n\ge 1\)) cannot be rejected. As a consequence, we assume that \(h^\text {b}_n\) (\(n\ge 1\)) are i.i.d. normal random variable with mean \({\bar{h}}^\text {b}\) and standard deviation \(\varSigma _{\text {b}}\), i.e., \(h_n^\text {b} \sim N({{\bar{h}}^\text {b}},\varSigma _{\text {b}}^2)\). This is an important result which is very useful to compute revenues from selling electricity on a long-term timescale.

Now we need to determine \(\beta _0\) and \(\beta _1\). This can be done using macroeconomic views about the long-run behavior of power prices. We thus require that

$$\begin{aligned} \exp \big (\beta _0+{{\bar{h}}^\text {b}} +\frac{1}{2}\varSigma _{\text {b}}^2\big )=A^\text {b}, \end{aligned}$$
(41)

where \(A^\text {b}\) is the current annual average of the electricity generation price, and

$$\begin{aligned} \beta _1=\pi +\pi ^\text {b}, \end{aligned}$$
(42)

where \(\pi =\ln (1+i)\) and \(\pi ^\text {b}\) accounts for the view on the real escalation rate of power prices, i.e., \(\pi ^\text {b}=\ln (1+k^\text {b})\) with \(k^\text {b}\) the expected real escalation rate of power prices. The Annual Energy Outlook 2019 [9] provides the following values, \(A^\text {b}=64\) \(\$_\text {2018}\) per MWh, \(i=2.3\%\) per annum, and \(k^\text {b}=-0.5\%\) per annum.

Finally, we point out that the above analysis could also have been carried out using Model 1 or Model 2, which are special cases of Model 3. However, the estimate of the log-revenues standard deviation parameter, \(\varSigma _{\text {b}}\), could be affected by errors due to a naive choice of the short-term model. In fact, Model 1 underestimates the log-revenues standard deviation (about \(6.16\%\) at Palo Verde, and \(4.79\%\) at PJM) and, on the contrary, Model 2 overestimates such a parameter (about \(10.93\%\) at Palo Verde, and \(5.94\%\) at PJM). Although these values differ slightly in the case of PJM, such differences become large in the case of Palo Verde, especially for Model 1. This fact has important consequences on the shape of the probabilistic distribution of the stochastic NPV leading to an incorrect assessment of the risk associated with the investment [28]. This would lead energy companies and policy makers to misjudge the risk associated with new investments in power generation. For these reasons, in the empirical analysis discussed in Sect. 7 we will use the standard deviation value estimated on Palo Verde market data using Model 3.

6 Modeling fuel price dynamics

To compute stochastic NPVs of new generating technologies we need to determine the stochastic behavior of fuel costs. In this section, we accurately model the stochastic dynamics of fuels and \(\hbox {CO}_{\text{2 }}\) prices which are the main sources of risk when power generating costs are considered.

6.1 Modeling fossil fuel prices

Figure 9 shows the historical behavior of natural gas and coal prices. Our data set consists of time series of gas and coal prices at a monthly frequency since January, 1999 until November, 2018. Prices are expressed in nominal dollars per mmBtu, i.e., nominal dollars per million Btus. Data refer to the cost of natural gas and coal receipts at electric generating plants, and they were downloaded from the US Energy Information Administration at site www.eia.doe.gov/totalenergy/data.

Fig. 9
figure 9

Historical behavior of fossil fuel market prices since January, 1999–November, 2018. Left panel: natural gas. Right panel: coal

Fig. 10
figure 10

Historical behavior of fossil fuel log-returns since January, 1999–November, 2018. Left panel: natural gas. Right panel: coal

Figure 10 shows the historical behavior of fossil fuel log-returns, calculated as monthly changes in the natural logarithm of monthly prices. Table 7 displays the descriptive statistics of fossil fuel log-returns.

Table 7 Descriptive statistics of fossil fuel log-returns

The empirical analysis reveals that the time series of gas and coal log-returns do not show infra-annual seasonality and are almost uncorrelated (the correlation coefficient in the period under investigation is about − 0.0052).

Geometric Brownian Motion (GBM) is often used in the literature to model fossil fuel price dynamics [21, 33]. However, looking at Fig. 10, it seems that such process cannot be able to fully capture the observed dynamics of gas log-returns. Evidence exists for a more complicated behavior of gas prices showing mean-reversion, jumps and stochastic volatility [15]. We propose, therefore, a stochastic model, in which the time evolution of of gas prices is described by a mean-reverting jump-diffusion model. In contrast, to describe the dynamics of coal prices we use a GBM stochastic process. Observed coal prices, in fact, do not show mean-reversion patterns in the period under investigation, and the kurtosis value (3.3687) is consistent with normally distributed log-returns.

6.1.1 Modeling natural gas price dynamics

To model the gas price dynamics, let us denote by \(P^\text {ga}(t)\) the gas market price at time t (the suffix ‘ga’ stands for ‘gas’), expressed in nominal dollars per mmBtu, and by \(s^\text {ga}(t)=\ln P^\text {ga}(t)\) its natural logarithm. We assume that \(s^\text {ga}(t)\) can be decomposed as follows,

$$\begin{aligned} s^\text {ga}(t)=b_0^\text {ga}+x^\text {ga}(t), \end{aligned}$$
(43)

where \(b_0^\text {ga}\) accounts for a constant trend and \(x^\text {ga}(t)\) for the stochastic component of the dynamics. In the period under investigation, the gas price dynamics does not show any linear trend and the empirical analysis confirms this feature. We estimated \(b_0^\text {ga}\) on market data using Ordinary Least Squares (OLS) techniques, thus finding \(b_0^\text {ga}=1.5224\). Looking at the left panel of Fig. 9, it is worth to observe that US natural gas market went through important transitions in the sample period. In the early 2000s, prices were trending upward as conventional US natural gas production was declining over time, then the boom in unconventional gas production coincided with a persistent decrease in natural gas prices in the 2010s. Moreover, gas prices were trending upward again at the end of 2010s and macroeconomic projections to 2050 show a persistent upward trend in prices [9]. Over very long time horizons such trend changes can be considered as random events and we account for them in the stochastic component of the dynamics through a mean-reversion mechanism in a jump-diffusion process. We assume, therefore, that the time evolution of the stochastic component of the gas price dynamics, \(x^\text {ga}(t)\), is described by the following stochastic differential equation,

$$\begin{aligned} dx^\text {ga}(t)=-\alpha ^\text {ga} x^\text {ga}(t)dt+\sigma ^\text {ga} dw^\text {ga}(t)+J^\text {ga}dq^\text {ga}(t), \end{aligned}$$
(44)

where \(w^\text {ga}(t)\) is a Wiener process and \(q^\text {ga}(t)\) is a Poisson process with constant intensity \(\lambda ^\text {ga}\). The random jump amplitude \(J^\text {ga}\) is distributed according to a normal random variable with zero mean and standard deviation \(\sigma _J^\text {ga}\), i.e., \(J \sim N\big (0,(\sigma _J^\text {ga})^2\big )\). We assume that the Wiener process \(w^\text {ga}\), the Poisson process \(q^\text {ga}\), and the jump amplitude \(J^\text {ga}\) are mutually independent processes. The stochastic component of the dynamics was estimated on market data by maximum likelihood using the Euler discretization with time step \(\varDelta t\) equal to 1 month. Estimation results are depicted in Table 8.

Table 8 Estimation results of the stochastic component of the gas price dynamics

The statistical analysis of simulated trajectories confirms that the jump-diffusion model offers an interesting agreement with market data. The first four moments of paths log-returns, obtained averaging over ten thousands paths randomly generated using estimated parameters, are displayed in Table 9.

Table 9 Statistics of simulated paths log-returns using estimated parameters

In computing the stochastic NPV of gas generation, gas costs incurred in the annual time interval \([n-1,n]\) must be computed as a lump sum and valued at time n. We assume that such costs (per mmBtu) in the year n are determined as a monthly market price average, \(P_n^\text {ga}\), given by

$$\begin{aligned} P_n^\text {ga}=e^{\beta ^\text {ga}_0+\beta ^\text {ga}_1 n}\bigg [\frac{1}{12}\sum _{k=1}^{12}P^\text {ga}(t_{kn})\bigg ], \end{aligned}$$
(45)

where \(\beta ^\text {ga}_0\) and \(\beta ^\text {ga}_1\) account for a forward looking linear (affine) trend, and \(t_{kn}\) denotes the \(k\text {th}\) month of the \(n\text {th}\) year. The dependence on the path \(\xi\) has been omitted. The parameters \(\beta ^\text {ga}_0\) and \(\beta ^\text {ga}_1\) allow us to calibrate the model on macroeconomic views about the future trend of gas market prices.

To investigate the statistical properties of \(P_n^\text {ga}\), let us consider the detrended log-cost process given by

$$\begin{aligned} h_n^\text {ga}=\log P_n^\text {ga}-\beta ^\text {ga}_0-\beta ^\text {ga}_1 n. \end{aligned}$$
(46)

Figure 11 shows the time behavior of the autocorrelation function (left panel) and the 1-year autocorrelation (right panel) of \(h_n^\text {ga}\) (\(n\ge 1\)) as n varies on a 30-year time horizon, computed with estimated parameters. Figure 12 depicts the time behavior of the first four moments of \(h_n^\text {ga}\) as n varies on a 30-year time horizon, computed with estimated parameters. We noticed that the random variables \(h_n^\text {ga}\) (\(n\ge 1\)) are correlated random variables with a constant correlation coefficient, which is approximately equal to 0.7. After a transient period of about 3 year, such random variables are characterized by the same values of mean, \({\bar{h}}^\text {ga}\) (about 0.02), and the same value of the standard deviation, \(\varSigma _{\text {ga}}\) (about 0.35), zero skewness and kurtosis equal to three. Moreover, we performed some tests of normality such as the Jarque–Bera test, the Kolmogorov–Smirnov test, the Anderson–Darling test and the chi-square test on Monte Carlo generated samples with sample size of five thousands trials. All these tests reveal that the hypothesis of normality for \(h_n^\text {b}\) (\(n\ge 1\)) cannot be rejected. We assume, therefore, that \(h^\text {ga}_n\) (\(n\ge 1\)) are correlated i.d. normal random variable with mean \({\bar{h}}^\text {ga}\) and standard deviation \(\varSigma _{\text {ga}}\), i.e., \(h^\text {ga}_n \sim N({{\bar{h}}^\text {ga}},\varSigma _{\text {ga}}^2)\). This is a further important result, useful to compute gas costs on a long-term horizon.

Fig. 11
figure 11

The autocorrelation function of the detrended log-cost process (left panel), and the 1-year autocorrelation function (right panel)

Fig. 12
figure 12

Time behavior of the first four moments of the detrended log-cost process

The calibration of the model on macroeconomic views about long-term behavior of gas prices can proceed as in the electricity case. Namely, we require that

$$\begin{aligned} \exp \big (\beta _0^\text {ga}+{{\bar{h}}^\text {ga}}+\frac{1}{2} \varSigma _{\text {ga}}^2\big )=A^\text {ga}, \end{aligned}$$
(47)

where \(A^\text {ga}\) is the current annual average of the gas price, and

$$\begin{aligned} \beta _1^\text {ga}=\pi +\pi ^\text {ga}, \end{aligned}$$
(48)

where \(\pi ^\text {ga}\) accounts for the view on the real escalation rate of gas prices, i.e., \(\pi ^\text {ga}=\ln (1+k^\text {ga})\) and \(k^\text {ga}\) is the expected real escalation rate of gas prices. The Annual Energy Outlook 2019 [9] provides the following values, \(A^\text {ga}=3.54\) \(\$_\text {2018}\) per mmBtu, and \(k^\text {ga}=1.4\%\) per annum. Such values are also reported in Table 10.

6.1.2 Modeling coal price dynamics

To model the coal price dynamics, let us denote by \(P^\text {co}(t)\) the coal market price at time t (the suffix ‘co’ stands for ‘coal’), expressed in nominal dollars per mmBtu. We assume that the dynamics of \(P^\text {co}(t)\) is described by a GBM process of the type,

$$\begin{aligned} \frac{dP^\text {co}(t)}{P^\text {co}(t)}=(\pi ^\text {co}+\pi )dt+\sigma ^\text {co} dw^\text {co}(t), \end{aligned}$$
(49)

where \(w^\text {co}(t)\) is a Wiener process. The volatility parameter is set at the value 0.0143 on a monthly basis (see Table 7), i.e., \(\sigma ^\text {co}=0.05\) on an annual basis. The drift coefficient accounts for the expected inflation, \(\pi\), and the expected real escalation rate of coal prices, \(\pi ^\text {co}=\ln (1+k^\text {co})\), where \(k^\text {co}\) is the expected real escalation rate of coal prices. The Annual Energy Outlook 2019 [9] provides the value \(k^\text {co}=0.2\%\) per annum. Such value is also reported in Table 10. Since the correlation between coal market prices and power market prices is negligible [21], the Wiener process, \(w^\text {co}(t)\), is assumed to be independent from the stochastic processes driving the time evolution of power prices and gas prices. Differently from mean-reverting processes, Geometric Brownian Motion has not stationary solutions for the probability density function. Then, the coal cost evaluation in computing the stochastic NPV of coal plants is performed pointwise at the end of each year using Equation (49) with an initial condition given by the coal price value reported in Table 10.

6.2 Modeling nuclear fuel prices

Our data set consists of a time series of nuclear fuel prices at an annual frequency since 1973 until 2017. Data can be downloaded from the US Energy Information Administration at site www.eia.doe.gov/opendata. Figure 13 shows the historical behavior of nuclear fuel prices (left panel) and their annual log-returns, calculated as annual changes in the natural logarithm of prices (right panel). Prices are expressed in nominal dollars per mmBtu.

Fig. 13
figure 13

Historical behavior of nuclear fuel prices (left panel) and log-returns (right panel) since 1973–2017

Let us denote by \(P^\text {nu}(t)\) the nuclear fuel price at time t (the suffix ‘nu’ stands for ‘nuclear’). In the period under investigation log-returns do not show mean-reversion, and we assume that the dynamics of nuclear fuel prices is described by a GBM process of the type

$$\begin{aligned} \frac{dP^\text {nu}(t)}{P^\text {nu}(t)}=(\pi ^\text {nu}+\pi )dt+ \sigma ^\text {nu}dw^\text {nu}(t), \end{aligned}$$
(50)

where \(w^\text {nu}(t)\) is a Wiener process and \(\pi ^\text {nu}=\ln (1+k^\text {nu})\), being \(k^\text {nu}\) the expected real escalation rate of nuclear fuel prices. The Annual Energy Outlook 2019 [9] provides the following value \(k^\text {nu}=0.2\%\) per annum, as reported in Table 10. The Wiener process, \(w^\text {nu}(t)\), is assumed to be independent from the stochastic processes driving the time evolution of power prices and fossil fuel prices.

The volatility parameter was estimated by maximum likelihood on the nuclear fuel dataset. We obtained the value \(\sigma ^\text {nu}=0.07\) on an annual basis. As in the coal case, the nuclear fuel cost evaluation for computing the stochastic NPV of nuclear power plants is performed pointwise at the end of each year using Eq. (50) with an initial condition given by the nuclear fuel price value reported in Table 10.

6.3 Modeling \(\hbox {CO}_{\text{2 }}\) prices

Let us denote by \(P^\text {ca}(t)\) the \(\hbox {CO}_{\text{2 }}\) price at time t (the suffix ‘ca’ stands for ‘carbon’), expressed in nominal dollars per ton of \(\hbox {CO}_{\text{2 }}\). The dynamics of \(\hbox {CO}_{\text{2 }}\) prices, is modeled according to a GBM process of the type

$$\begin{aligned} \frac{dP^\text {ca}(t)}{P^\text {ca}(t)}=\pi dt+\sigma ^\text {ca} dw^\text {ca}(t), \end{aligned}$$
(51)

where \(\sigma ^\text {ca}\) is the carbon volatility and \(w^\text {ca}(t)\) is a Wiener process which is assumed to be independent from the stochastic processes driving the time evolution of power prices, fossil fuel prices, and nuclear fuel prices. However, the model can be extended to account for a possible correlation between \(\hbox {CO}_{\text{2 }}\) prices and power prices. A \(\hbox {CO}_{\text{2 }}\) price equal to 30 \(\$_\text {2018}\) per ton of \(\hbox {CO}_{\text{2 }}\) [7, 22] is assumed as initial condition of the \(\hbox {CO}_{\text{2 }}\) price process. Regarding the volatility, we consider a \(\hbox {CO}_{\text{2 }}\) price volatility \(\sigma ^\text {ca}=20\%\). These assumptions try to depict a realistic scenario in order to investigate the impact of \(\hbox {CO}_{\text{2 }}\) volatility on the risk of investments in fossil fuel power plants [14]. The contribution of \(\hbox {CO}_{\text{2 }}\) costs to the stochastic NPV is computed pointwise at the end of each year.

7 The stochastic NPV of generating electricity

Table 10 details technical data and costs used in the empirical analysis. The main data source is the ‘Annual Energy Outlook 2019’ [9]. In particular, the cost of new generating technologies are taken from ‘Capital Cost Estimates for Utility Scale Electricity Generating Plants’ [12] and ‘Cost and Performance Characteristics of New Generating Technologies, Annual Energy Outlook 2019-January 2019’ [10], both provided by the US Energy Information Administration. Data refer to a Conventional Natural Gas Combined Cycle (NGCC) facility for the gas technology, to an Ultra Supercritical Coal (USC) facility for the coal technology, and to an advanced PWR (Pressurized Water Reactor) nuclear power facility. All costs are denominated in year 2018 US dollars (\(\$_\text {2018}\)). With the exception of the nuclear technology, decommissioning costs were set at about \(5\%\) of the overnight cost [22, 42]. For nuclear power plants we assumed a larger percentage of the overnight cost to account for decommissioning and radioactive waste disposal [39, 51]. Overnight costs are uniformly distributed on the construction period. In accordance to the Annual Energy Outlook 2019 (AEO 2019), we assumed an expected inflation rate \(i=2.3\%\) per annum, and a corporate tax rate \(T_c=21\%\) as specified in the Tax Cuts and Jobs Act of 2017. As in AEO 2019, all LCOE calculations are performed using a nominal after-tax WACC rate of \(7.0\%\) per annum [9, 11].

Table 10 Technical assumptions

Probabilistic distributions of stochastic LCOE and reduced NPV [equation (hereinafter NPV)] of new generating technologies can be obtained by using Monte Carlo techniques. Random paths for electricity prices, fuel prices, and \(\hbox {CO}_{\text{2 }}\) prices are generated according to dynamic models illustrated in the previous section. Along such paths, LCOE and NPV values are computed by using LCOE and NPV formulas reported in Sect. 4. Numerical computations are performed without and with \(\hbox {CO}_{\text{2 }}\) stochastic costs. In this last case, we assumed that macroeconomic views on the future evolution of power prices as well as their volatility are not influenced by the inclusion of carbon costs. Computations are performed under the hypothesis of zero correlation between (unitary) revenues and costs. This is not a strong assumption, even in the case of gas fired plants. The empirical analysis developed in the previous sections revealed that annual revenues are not correlated over time. On the other side, gas generation annual costs show a high positive correlation over time. This implies that, although gas and power prices could show some cointegration relationship [36], nevertheless the correlation between annual revenues and costs, if there is any, must be negligible. However, the model can be extended in a straightforward way to include such a correlation.

Figure 14 depicts simulated distributions of the stochastic LCOE (without and with \(\hbox {CO}_{\text{2 }}\) costs) of baseload sources computed on a 30-year plant lifetime. Figure 15 depicts simulated distributions of the stochastic NPV of baseload sources (without and with \(\hbox {CO}_{\text{2 }}\) costs) computed on a 30-year plant lifetime.

Fig. 14
figure 14

Stochastic LCOE distributions without \(\hbox {CO}_{\text{2 }}\) costs (left panel) and with \(\hbox {CO}_{\text{2 }}\) costs (right panel) computed over a 30-year plant lifetime

Fig. 15
figure 15

Stochastic NPV distributions without \(\hbox {CO}_{\text{2 }}\) costs (left panel) and with \(\hbox {CO}_{\text{2 }}\) costs (right panel) computed over a 30-year plant lifetime

Table 11 reports LCOE values for power generation from natural gas, coal, and nuclear sources, computed (without \(\hbox {CO}_{\text{2 }}\) costs) over different plant lifetimes. Table 12 reports expected NPV values for power generation from natural gas, coal, and nuclear sources, computed (without \(\hbox {CO}_{\text{2 }}\) costs) over different plant lifetimes. All these distributions include forward looking views. With the exception of the gas technology, both coal and nuclear power generation show a negative expected NPV. Lengthening the plant lifetime does not modify this picture.

Table 11 Expected LCOE values
Table 12 Expected NPV values

The inclusion of \(\hbox {CO}_{\text{2 }}\) costs into the analysis worses the picture. However, in the case of a carbon cost of 30 \(\$_\text {2018}\) per ton of \(\hbox {CO}_{\text{2 }}\), and a volatility \(\sigma ^\text {ca}=20\%\), the NPV of gas generation is still positive (about 6.8) even if the probability to get negative NPV is sensibly increased. Table 13 displays some statistics of stochastic NPV distributions for gas generation. Stochastic \(\hbox {CO}_{\text{2 }}\) costs reduce the expected NPV and increase the standard deviation of the stochastic NPV distribution. As a consequence, the probability to get negative NPV strongly increases (from 0.1 to \(15\%\) when stochastic \(\hbox {CO}_{\text{2 }}\) costs are included). This can be an alert ring for the profitability of the investment.

Table 13 Statistics of stochastic NPV distributions of gas generation without \(\hbox {CO}_{\text{2 }}\) costs (first row) and with \(\hbox {CO}_{\text{2 }}\) costs (second row)

8 Concluding remarks

In this paper we provided a general methodology for valuing long-term investments in the power sector. In particular, investments in new generating technologies (fossil fuel and nuclear technologies) were analyzed and discussed. Stationary distributions of electricity and natural gas prices were used to determine long-term probabilistic distributions of revenues and costs on the whole lifetime of the plants. This approach can be useful for power planning decision making of energy companies and policy makers. By providing the probability density of stochastic NPVs, the proposed methodology allows the management to perform risk analysis of investments in capacity expansion by computing, for example, the Value at Risk (VaR) of a given investment, or the probability to obtain negative NPVs, which is in our opinion an important planning tool. Although our analysis was limited to baseload generation, it can be extended to peak-load generation and to VRE generating technologies in a straightforward way. In such cases, the proposed methodology allows us to include also uncertainty over capacity utilization. This will be the first direction of our future research.

We showed in the paper that, although the gas power generation seems to be the only viable option, nevertheless it is a very risky one [26]. The gas prices volatility is in fact very high and may cause large fluctuations around the expected NPV. In the presence of \(\hbox {CO}_{\text{2 }}\) costs, the probability to get a negative value of the stochastic NPV of a gas fired plant is about \(15\%\). Risk reducing strategies through diversification of generating assets can be a very important solution. This was in fact the main reason for introducing stochastic NPV and stochastic LCOE. Electricity companies often own not just one among many dispatchable and non-dispatchable generation technologies, but mixtures of them, i.e., power generating portfolios. The approach proposed in this paper can be used to investigate the selection problem for power portfolios under uncertainty. In this regard, the stochastic NPV metrics as well as the stochastic LCOE metrics offer the possibility to perform portfolio risk analysis under general deviation measures, such as e.g., standard deviation and Conditional VaR Deviation (CVaRD) [25, 44]. Strategies that aim at reducing risk through diversification, in fact, allow one to investigate the trade-off between the expected NPV and risk, as measured by the standard deviation or by the CVaRD of the stochastic NPV. In this way, the management becomes able to define optimal generating portfolios. Although portfolio selection under uncertainty based on stochastic LCOE was developed in the literature [27, 28] also in the presence of VRE sources [29], the portfolio selection problem based on stochastic NPV with accurate modeling of electricity and fuel price dynamics has not yet been developed. This will be the second direction of our future research.