Irreversible Capital Accumulation with Economic Impact

Al Motairi, Hessah; Zervos, Mihail

doi:10.1007/s00245-016-9341-9

Irreversible Capital Accumulation with Economic Impact

Open access
Published: 21 March 2016

Volume 75, pages 525–551, (2017)
Cite this article

Download PDF

You have full access to this open access article

Applied Mathematics & Optimization Submit manuscript

Irreversible Capital Accumulation with Economic Impact

Download PDF

Hessah Al Motairi¹ &
Mihail Zervos²

3071 Accesses
12 Citations
Explore all metrics

Abstract

We consider an irreversible capacity expansion model in which additional investment has a strictly negative effect on the value of an underlying stochastic economic indicator. The associated optimisation problem takes the form of a singular stochastic control problem that admits an explicit solution. A special characteristic of this stochastic control problem is that changes of the state process due to control action depend on the state process itself in a proportional way.

Irreversible investment with fixed adjustment costs: a stochastic impulse control approach

Article 28 February 2019

Irreversible Investment

Mathematical Modeling of Investments in an Imperfect Capital Market

Article 26 July 2021

1 Introduction

A standard capacity expansion model, which is a special case of the model studied by Kobila [34], can be described as follows. We model market uncertainty by means of the geometric Brownian motion given by

$$\begin{aligned} dX_t^0 = bX_t^0 \, dt + \sqrt{2} \sigma X_t^0 \, dW_t , \quad X_0^0 = x >0 , \end{aligned}$$

(1)

for some constants b and $\sigma \ne 0$, where W is a standard one-dimensional Brownian motion. The random variable $X_t^0$ can represent an economic indicator such as the price of or the demand for one unit of a given investment project’s output at time t. The firm behind the project can invest additional capital at proportional costs at any time, but cannot disinvest from the project. We denote by y the project’s initial capital at time 0 and by $\zeta _t$ the total additional capital invested by time t. We assume that there is no capital depreciation, so the total capital invested at time t is

$$\begin{aligned} Y_t = y + \zeta _t , \quad Y_0 = y \ge 0 . \end{aligned}$$

(2)

The investor’s objective is to maximise the total expected discounted payoff resulting from the project’s management, which is given by the performance index

$$\begin{aligned} J_{x,y}^0 (\zeta ) = {\mathbb {E}}\left[ \int _0^\infty e^{-rt} h(X_t^0, Y_t) \, dt - K \int _{[0, \infty [} e^{-rt} \, d\zeta _t \right] , \end{aligned}$$

(3)

over all capacity expansion strategies $\zeta $. The discounting rate $r>0$ and the cost of each additional unit of capital $K>0$ are constants, while h is an appropriate running payoff function.

Under suitable assumptions on the problem data, the solution to this stochastic control problem is characterised by a threshold given by a strictly increasing free-boundary function $G^0: {\mathbb R}_+ \rightarrow {\mathbb R}_+$. In the special case that arises when $h(x,y) = x^\alpha y^\beta $, for some $\alpha > 0$ and $\beta \in ]0,1[$, namely, when h is a so-called Cobb-Douglas production function,

$$\begin{aligned} G^0 (y) = \left( \frac{rK (\alpha - m)}{-m\beta } \right) ^\frac{1}{\alpha } y^\frac{1-\beta }{\alpha } \quad \text {for } y \ge 0 , \end{aligned}$$

where $m < 0$ is an appropriate constant. If the initial condition (x, y) is strictly below the graph of the function $G^0$ in the x-y plane, then it is optimal to invest so that the joint process $(X^0,Y)$ has a jump at time 0 that positions it in the graph of $G^0$. Otherwise, it is optimal to take minimal action so that the state process $(X^0,Y)$ does not fall below the graph of $G^0$, which amounts to reflecting it in $G^0$ in the positive y-direction.

Irreversible capacity expansion models have attracted considerable interest and can be traced back to Manne [38] (see Van Mieghem [47] for a survey). More relevant to this paper models have been studied by several authors in the economics literature: see Dixit and Pindyck [17, Chapter 11] and references therein. Related models that have been studied in the mathematics literature include Davis, Dempster, Sethi and Vermes [13], Arntzen [4], Øksendal [42], Wang [48], Chiarolla and Haussmann [11], Bank [6], Alvarez [2, 3], Løkka and Zervos [35], Steg [45], Chiarolla and Ferrari [9], De Angelis, Federico and Ferrari [15], and references therein. Furthermore, capacity expansion models with costly reversibility were introduced by Abel and Eberly [1], and were further studied by Guo and Pham [22], Merhi and Zervos [40], Guo and Tomecek [23, 24], Guo, Kaminsky, Tomecek and Yuen [21], Løkka and Zervos [36], De Angelis and Ferrari [16], and Federico and Pham [19].

In the model that we have briefly discussed above, additional investment does not influence the underlying economic indicator, which is unrealistic if one considers supply and demand issues. The nature of the optimal strategy is such that, if $b < {\frac{1}{2}}\sigma ^2$, then $\lim _{t \rightarrow \infty } X_t^0 = 0$ and the investment’s maximal optimal capacity level remains finite for realistic choices of the problem data. On the other hand, if $b \ge {\frac{1}{2}}\sigma ^2$, then $\limsup _{t \rightarrow \infty } X_t^0 = \infty $ and the optimal capacity level typically converges to $\infty $ as $t \rightarrow \infty $.

The model that we study here assumes that additional investment has a strictly negative effect on the value of the underlying economic indicator process X. We assume that increasing the project’s capacity by a very small amount $\Delta \zeta _t = \varepsilon $ at time t affects the process X linearly, namely,

$$\begin{aligned} \Delta X_t \equiv X_{t+} - X_t = - c \varepsilon X_t \quad \Rightarrow \quad X_{t+} = (1 - c \varepsilon ) X_t \simeq e^{- c \varepsilon } X_t , \end{aligned}$$

for some constant $c>0$, where we have taken X to be càglàd. Furthermore, we assume that increasing the project’s capacity by an amount $\Delta \zeta _t > 0$ at time t has the same effect on the process X as increasing the project’s capacity N times infinitesimally close to each other by an amount $\Delta \zeta _t / N$ for every choice of N, which gives rise to the identities

$$\begin{aligned} X_{t+} = e^{- c (\Delta \zeta _t / N) N} X_t = e^{- c \Delta \zeta _t} X_t . \end{aligned}$$

These considerations suggest the modelling of market uncertainty by the solution to the SDE

$$\begin{aligned} dX_t = bX_t \, dt - X_t \circ d\zeta _t + \sqrt{2} \sigma X_t \, dW_t , \quad X_0 = x >0 , \end{aligned}$$

(4)

where

$$\begin{aligned} \int _0^t X_s \circ d\zeta _s = c \int _0^t X_s \, d\zeta _s^\mathrm {c}+ \sum _{0 \le s < t} X_t \bigl ( 1 - e^{-c \Delta \zeta _t} \bigr ) , \end{aligned}$$

(5)

in which expression, $\zeta ^\mathrm {c}$ denotes the continuous part of the increasing process $\zeta $. At this point, it is worth noting that Guo and Zervos [25] have considered the same state dynamics in the optimal execution problem that they study. The objective is to maximise over all admissible capacity expansion strategies $\zeta $ the performance criterion

$$\begin{aligned} J_{x,y} (\zeta ) = {\mathbb {E}}\left[ \int _0^\infty e^{-rt} h(X_t,Y_t) \, dt - K \int _{[0, \infty [} e^{-rt} \, d\zeta _t \right] , \end{aligned}$$

(6)

where $r, K > 0$ are constants and the running payoff function h satisfies Assumption 1 in the next section.

The solution to this problem is again characterised by a threshold defined by a strictly increasing free-boundary function G. Informally, the optimal strategy can be described as the one in the problem defined by (1)–(3). However, reflection in the free-boundary G is oblong rather than in the positive y-direction (see Figs. 1, 2, 3). Furthermore, the negative effect that additional investment has on the underlying economic indicator X results in a maximal optimal capacity level that is bounded in cases of special interest, such as the ones arising, e.g., when the running payoff function h is a Cobb-Douglas production function (see Example 2).

From a stochastic control theoretic perspective, the problem that we solve has the features of singular stochastic control, which was introduced by Bather and Chernoff [7] who considered a simplified model of spaceship control. In their seminal paper, Beneš, Shepp and Witsenhausen [8] were the first to solve rigorously an example of a finite-fuel singular control problem. Since then, the area has attracted considerable interest in the literature. Apart from references that we have discussed in the context of capacity expansion models, Bahlali et al. [5] Chiarolla and Haussmann [10], Chow, Menaldi and Robin [12], Davis and Zervos [14], Fleming and Soner [20, Chapter VIII], Haussmann and Suo [27, 28], Harrison and Taksar [26], Jack, Johnson and Zervos [29], Jacka [30, 31], Karatzas [32], Ma [37], Menaldi and Robin [39], Øksendal [42], Shreve et al. [43], Soner and Shreve [44], Sun [46] and Zhu [49], provide an alphabetically ordered list of further contributions.

In the references discussed above, the controlled process affects the state dynamics in a purely additive way: the change of the state process due to control action does not depend on the state process itself. Singular stochastic control models in which changes of the state process due to control action may depend on the state process were introduced and studied by Dufour and Miller [18] and Motta and Sartori [41]. To the best of our knowledge, problems with state dynamics such as the ones given by (4)–(5) have not been considered in the literature before. Furthermore, the problem that we solve is the very first one in the singular stochastic control literature that involves control action that does not affect the state dynamics in a purely additive way and admits an explicit solution (see also Remark 1 in the next section).

2 Problem Formulation and Assumptions

We fix a probability space $(\Omega , {\mathcal F},{\mathbb {P}})$ equipped with a filtration $({\mathcal F}_t)$ satisfying the usual conditions of right continuity and augmentation by ${\mathbb {P}}$-negligible sets, and carrying a standard one-dimensional $({\mathcal F}_t)$-Brownian motion W. We denote by $\mathcal Z$ the family of all càglàd $({\mathcal F}_t)$-adapted increasing process $\zeta $ such that $\zeta _0 = 0$.

The state space of the control problem that we study is defined by

$$\begin{aligned} {\mathcal S} = \bigl \{ (x,y) \in {\mathbb R}^2 \mid \ x>0 \text { and } 0 \le y \le \bar{y} \bigr \} , \end{aligned}$$

where $\bar{y} \in ]0, \infty ]$ is the maximal capital that can be invested in the project, namely, the maximum capacity level that can be achieved. Given a capacity expansion processes $\zeta \in {\mathcal Z}$, we consider the capacity process Y defined by (2) and the economic indicator process X given by (4)–(5). Using Itô’s formula, we can verify that

$$\begin{aligned} X_t = X_t^0 e^{-c \zeta _t^\mathrm {c}} \prod _{0 \le s < t} \bigl ( 1 - e^{-c \Delta \zeta _t} \bigr ) = X_t^0 e^{-c\zeta _t} , \end{aligned}$$

(7)

where $X^0$ is the geometric Brownian motion defined by (1).

Definition 1

The set $\mathcal A$ of all admissible capacity expansion strategies is the family of all processes $\zeta \in {\mathcal Z}$ such that

$$\begin{aligned} {\mathbb {E}}\left[ \int _{[0, \infty [} e^{-rt} \, d\zeta _t \right] < \infty . \end{aligned}$$

(8)

$ \Box $

The objective of the control problem is to maximise the performance index $J_{x,y}$ defined by (6) over all admissible strategies $\zeta \in {\mathcal A}$, for each initial condition $(x,y) \in {\mathcal S}$. Accordingly, we define the problem’s value function v by

$$\begin{aligned} v(x,y) = \sup _{\zeta \in {\mathcal A}} J_{x,y} (\zeta ) , \quad \text {for } (x,y) \in {\mathcal S} . \end{aligned}$$

(9)

Remark 1

In view of (7), we can see that the stochastic optimisation problem we solve is equivalent to maximising

$$\begin{aligned} J_{x,y} (\zeta ) = {\mathbb {E}}\left[ \int _0^\infty e^{-rt} h(e^{cy} X_t^0 e^{-cY_t} ,Y_t) \, dt - K \int _{[0, \infty [} e^{-rt} \, d\zeta _t \right] \end{aligned}$$

over all admissible strategies $\zeta \in {\mathcal A}$, where the dynamics of the state process $(X^0, Y)$ are given by (1)–(2). At first glance, this observation puts us in the context of the standard singular stochastic control theory because control action affects the dynamics of $(X^0, Y)$ in a purely additive way. However, such a reformulation is of limited theoretical value because the problem’s initial condition y enters non-trivially in the description of the performance criterion, which is a situation that is typically associated with time-inconsistent control problems. $\Box $

Our analysis involves the general solution to the second order Euler’s ODE

$$\begin{aligned} \sigma ^2 x^2 u''(x) + bx u' (x) - ru(x) = 0 , \end{aligned}$$

(10)

which is given by

$$\begin{aligned} u(x) = A x^n + B x^m , \end{aligned}$$

for some $A, B \in {\mathbb R}$, where the constants $m<0<n$ are the solutions to the quadratic equation

$$\begin{aligned} \sigma ^2 \lambda ^2 + ( b-\sigma ^2 ) \lambda - r =0 , \end{aligned}$$

(11)

given by

$$\begin{aligned} m,n = \frac{-(b-\sigma ^2) \pm \sqrt{(b-\sigma ^2 )^2 + 4 \sigma ^2 r}}{2\sigma ^2} . \end{aligned}$$

(12)

Our analysis also involves the function H defined by

$$\begin{aligned} H(x,y) = h_y (x,y) - cx h_x (x,y) - rK , \quad \text {for } x>0 \text { and } y \in ]0,\bar{y}[ . \end{aligned}$$

(13)

This function has a natural economic interpretation. Indeed, increasing capacity by a small amount $\varepsilon > 0$ causes the joint process (X, Y) to jump from a value (x, y) to the value $(x - cx \varepsilon , y + \varepsilon )$. Noting that

$$\begin{aligned} h(x - cx \varepsilon , y + \varepsilon ) - h(x,y) \simeq \bigl [ h_y (x,y) - cx h_x (x,y) \bigr ] \varepsilon \quad \text {and} \quad K = \int _0^\infty e^{-rt} rK \, dt , \end{aligned}$$

we can see that H(x, y) represents the project’s marginal running payoff rate in excess of the marginal cost of capital rate. In view of standard economics theory, this interpretation suggests that (a) the function $H(\cdot , y)$ should be increasing for all $y \ge 0$ because higher values of the underlying economic indicator X, which models the price of or the demand for one unit of the project’s output, should reflect higher values of marginal running payoff, and (b) the function $H(x, \cdot )$ should be decreasing for all $x>0$ because the project’s payoff rate should be concave in the volume of its output due to the balancing of supply and demand. These observations suggest requirements (17)–(19) in the following assumption. In fact, the conditions reflected by (17)–(19) are much weaker than the ones suggested by the above considerations. However, the relaxations involved present no added complications in our analysis whatsoever. The underlying economics theory also suggests that the running payoff function h should be increasing in the value of the underlying economic indicator X for each fixed value of the project’s capacity, which is captured by condition (14). The rest of the conditions appearing in the following assumption, which is admittedly rather long to state, are of a purely technical nature. It is worth noting that (15) is equivalent to the probabilistic condition

$$\begin{aligned} {\mathbb {E}}\left[ \int _0^\infty e^{-rt} \left| h(X_t^0, y) \right| dt \right] < \infty \quad \text {for all } x > 0 \text { and } y \in [0, \bar{y}] \cap {\mathbb R}\end{aligned}$$

(see (77)–(78) in Appendix 2).

Assumption 1

The constants r, K are strictly positive, the function h is $C^3$,

$$\begin{aligned}&h(\cdot , y) \text { is increasing for all } y \in [0, \bar{y}] \cap {\mathbb R}, \end{aligned}$$

(14)

$$\begin{aligned}&\int _0^x s^{-m-1} \left| h(s,y) \right| ds + \int _x^\infty s^{-n-1} \left| h(s,y) \right| ds < \infty \quad \text {for all } x > 0 \text { and } y \in [0, \bar{y}] \cap {\mathbb R}. \nonumber \\ \end{aligned}$$

(15)

There exists a point $x_0 \ge 0$ and a continuous strictly increasing function $y^\dagger : ]x_0, \infty [ \rightarrow {\mathbb R}_+$ such that

$$\begin{aligned} 0 \le y_0 := \lim _{x \downarrow x_0} y^\dagger (x) < \lim _{x \rightarrow \infty } y^\dagger (x) =: y_\infty \le \bar{y} , \quad y_0 = 0 \text { if } x_0 > 0 , \end{aligned}$$

(16)

$$\begin{aligned} H(x,y) {\left\{ \begin{array}{ll} < 0 , &{} \text {if } (x,y) \in {\mathcal H}_- ,\\ = 0 , &{} \text {if } (x,y) \in {\mathcal S} \setminus ({\mathcal H}_- \cup {\mathcal H}_+) , \\ > 0 , &{} \text {if } (x,y) \in {\mathcal H}_+ , \end{array}\right. } \end{aligned}$$

(17)

$$\begin{aligned} \liminf _{x \rightarrow \infty } H(x,y) > 0 \quad \text {for all } y \in ]y_0, y_\infty [ , \end{aligned}$$

(18)

$$\begin{aligned} \text{ the } \text{ function } H(x, \cdot ) \text { is strictly decreasing for all } y \in ]y_0, y_\infty [ , \end{aligned}$$

(19)

where

$$\begin{aligned} {\mathcal H}_- = \bigl \{ (x,y) \in {\mathcal S} \mid \ x \le x_0 \text { or } x > x_0 \text { and } y > y^\dagger (x) \bigr \} , \\ {\mathcal H}_+ = \bigl \{ (x,y) \in {\mathcal S} \mid \ x > x_0 \text { and } y < y^\dagger (x) \bigr \} . \end{aligned}$$

Also, there exist a decreasing function $\Psi : ]y_0, y_\infty [ \rightarrow ]0,\infty [$ such that $\lim _{y \downarrow 0} \Psi (y) < \infty $ if $x_0 > 0$ as well as constants $C_0>0$ and $\vartheta \in ]0,n[$ such that

$$\begin{aligned} - C_0 (1+y) \le h(x,y) \le C_0 (1+y) \bigl ( 1+x^{n-\vartheta } \bigr ) \quad \text {for all } (x,y) \in \mathcal{S} , \end{aligned}$$

(20)

$$\begin{aligned} H(x,y) \le \Psi (y) \bigl ( 1+x^{n-\vartheta } \bigr ) \quad \text {for all } x>0 \text { and } y \in ]0,\bar{y}[ . \end{aligned}$$

(21)

$\Box $

We denote by $x^\dagger $ the inverse of the function $y^\dagger $ that is defined by

$$\begin{aligned} x^\dagger (y) = {\left\{ \begin{array}{ll} 0 , &{} \text {if } 0 \le y < y_0 , \\ (y^\dagger )^{-1} (x) , &{} \text {if } y_0 \le y < y_\infty , \\ \infty , &{} \text {if } y_\infty \le y < \bar{y} . \end{array}\right. } \end{aligned}$$

(22)

Example 1

Suppose that $\bar{y} = \infty $ and h is a so-called Cobb-Douglas function, given by

$$\begin{aligned} h(x,y) = x^\alpha y^\beta , \quad \text {for } (x,y) \in {\mathcal S} , \end{aligned}$$

(23)

where $\alpha \in ]0,n[$ and $\beta \in ]0,1]$ are constants. In this case, we can check that

$$\begin{aligned} H(x,y) = \bigl ( \beta y^{-1} - c\alpha \bigr ) x^\alpha y^\beta - rK . \end{aligned}$$

If we define

$$\begin{aligned} y_0 = 0 , \quad y_\infty = \frac{\beta }{c\alpha } \quad \text {and} \quad x_0 = {\left\{ \begin{array}{ll} (rK)^{1/\alpha } , &{} \text {if } \beta =1 ,\\ 0 , &{} \text {if } \beta \in ]0,1[ , \end{array}\right. } \end{aligned}$$

then we can see that the calculations

$$\begin{aligned} \frac{\partial H (x,y)}{\partial x}= & {} \alpha \bigl ( \beta y^{-1} - c\alpha \bigr ) x^{\alpha - 1} y^\beta {\left\{ \begin{array}{ll} > 0 &{} \text {for all } y \in ]y_0, y_\infty [ , \\ < 0 &{} \text {for all } y \ge y_\infty , \end{array}\right. } \nonumber \\ \lim _{x \downarrow 0} H(x,y)= & {} -rK < 0 \text { for all } y > 0 \quad \text {and} \quad \lim _{x \rightarrow \infty } H(x,y) \nonumber \\= & {} {\left\{ \begin{array}{ll} \infty &{} \text {for all } y \in ]y_0, y_\infty [ \\ -\infty , &{} \text {for all } y \ge y_\infty , \end{array}\right. } \end{aligned}$$

imply that there exists a unique function $y^\dagger : ]x_0, \infty [ \rightarrow {\mathbb R}_+$ such that (16)–(17) hold true. Furthermore, differentiating the identity $H \bigl ( x, y^\dagger (x) \bigr ) = 0$ with respect to x, we can see that

$$\begin{aligned} \dot{y}^\dagger (x) = \frac{\alpha y (\beta - c\alpha y)}{\beta x \bigl [ (1-\beta ) + c\alpha y \bigr ]} > 0 \quad \text {for all } y \in ]y_0, y_\infty [ , \end{aligned}$$

so $y^\dagger $ is indeed strictly increasing. Also, it is straightforward to check that (19)–(18) and (20)–(21) are all satisfied for $\vartheta = n - \alpha $ and

$$\begin{aligned} \Psi (y) = {\left\{ \begin{array}{ll} 1, &{} \text {if } \beta = 1 , \\ y^{-(1-\beta )} , &{} \text {if } \beta \in ]0,1[ . \end{array}\right. } \end{aligned}$$

$\square $

3 The Solution to the Control Problem

We solve the stochastic control problem that we consider by constructing an appropriate classical solution $w: {\mathcal S} \rightarrow {\mathbb R}$ to the Hamilton-Jacobi-Bellman (HJB) equation

$$\begin{aligned} \max \Bigl \{ \sigma ^2 x^2 w_{xx} (x,y) + bx w_x (x,y)&- r w(x,y) + h(x,y) , \nonumber \\&w_y (x,y) - c x w_x (x,y) - K \Bigr \} = 0 , \quad (x,y) \in {\mathcal S} , \end{aligned}$$

(24)

where $w_y (x,0) = \lim _{y \downarrow 0} w_y (x,y)$. To obtain qualitative understanding of this equation, we consider the following heuristic arguments. At time 0, the project’s management has two options. The first one is to wait for a short time $\Delta t$ and then continue optimally. Bellman’s principle of optimality implies that this option, which is not necessarily optimal, is associated with the inequality

$$\begin{aligned} v(x,y) \ge E \left[ \int _0^{\Delta t} e^{-rt} h(X_t^0,y) \, dt + e^{-r\Delta t} v \bigl ( X_{\Delta t}^0,y \bigr ) \right] . \end{aligned}$$

Applying Itô’s formula to the second term in the expectation, and dividing by $\Delta t$ before letting $\Delta t \downarrow 0$, we obtain

$$\begin{aligned} \sigma ^2 x^2 v_{xx} (x,y) + bx v_x (x,y) - rv(x,y) + h(x,y) \le 0 . \end{aligned}$$

(25)

The second option is to increase capacity by $\varepsilon > 0$, and then continue optimally. This action is associated with the inequality

$$\begin{aligned} v(x,y) \ge v(x-cx\varepsilon , y+\varepsilon ) - K \varepsilon . \end{aligned}$$

Rearranging terms and letting $\varepsilon \downarrow 0$, we obtain

$$\begin{aligned} v_y (x, y) -c x v_x (x, y) - K \le 0 . \end{aligned}$$

(26)

Furthermore, the Markovian character of the problem implies that one of these options should be optimal and one of (25), (26) should hold with equality at any point in the state space $\mathcal S$. It follows that the problem’s value function v should identify with an appropriate solution w to the HJB equation (24).

To construct the solution w to (24) that identifies with the value function v, we first consider the existence of a strictly increasing function $G: ]y_0, y_\infty [ \rightarrow ]0, \infty [$ that partitions the state space $\mathcal S$ into two regions, the “waiting” region $\mathcal W$ and the “investment” region $\mathcal I$ defined by

$$\begin{aligned} {\mathcal W}&=\bigl \{ (x,0) \mid \ 0 < x \le x_0 \text { if } x_0 > 0 \bigr \} \\&\quad \; \cup \bigl \{ (x,y) \mid \ y \in ]y_0, y_\infty [ \text { and } 0 < x \le G(y) \bigr \} \nonumber \\&\quad \; \cup \bigl \{ (x,y) \mid \ x > 0 \text { and } y \in [y_\infty , \bar{y}] \cap {\mathbb R}\bigr \} , \nonumber \\ {\mathcal I}&= \bigl \{ (x,0) \mid \ x > x_0 \text { if } x_0 > 0 \bigr \} \\&\quad \; \cup \bigl \{ (x,y) \mid \ x > 0 \text { and } y \in [0, y_0] \text { if } y_0 > 0\bigr \} \nonumber \\&\quad \; \cup \bigl \{ (x,y) \mid y \in ]y_0, y_\infty [ \text { and } x > G(y) \bigr \} . \end{aligned}$$

In view of the interpretation of the function H defined by (13) as the project’s marginal running payoff rate in excess of the marginal cost of capital rate, which we have discussed in the previous section, we can see that increasing capacity cannot be optimal whenever the state process takes values $(x,y) \in {\mathcal S}$ such that $H(x,y) < 0$. This observation, (17) in Assumption 1 and (22) suggest that the inequality

$$\begin{aligned} G(y) < x^\dagger (y) \quad \text {for all } y \in ]y_0, y_\infty [ \end{aligned}$$

should hold true. Figures 1, 2, and 3 depict possible configurations of the waiting and the investment regions.

Inside the region $\mathcal {W}$, the heuristic arguments that we have briefly discussed above suggest that w should satisfy the differential equation

$$\begin{aligned} \sigma ^2 x^2 w_{xx}(x,y) + bx w_x (x,y) - rw(x,y) + h(x,y) = 0 . \end{aligned}$$

(27)

In light of the theory that we review in Appendix 2 and the intuitive idea that the value function should remain bounded as $x \downarrow 0$, every relevant solution to this ODE is given by

$$\begin{aligned} w(x,y) = A(y) x^n + R(x,y) , \end{aligned}$$

(28)

for some function A, where n is given by (12) and $R(\cdot , y)$ is defined by (79) for $k = h(\cdot , y)$, namely,

$$\begin{aligned} R(x,y) = \frac{1}{\sigma ^2 (n-m)} \left[ x^{m} \int _0^x s^{-m-1} h(s,y) \, ds + x^n \int _x^\infty s^{-n-1} h(s,y) \, ds \right] .\nonumber \\ \end{aligned}$$

(29)

On the other hand, w should satisfy

$$\begin{aligned} w_y (x,y) - cx w_x (x,y) = K , \quad \text {for } (x,y) \in {\mathcal I} , \end{aligned}$$

(30)

which implies that

$$\begin{aligned} w_{yx} (x,y) - c x w_{xx} (x,y) - c w_x (x,y) = 0 , \quad \text {for } (x,y) \in {\mathcal I} . \end{aligned}$$

(31)

To determine A and G, we postulate that w is $C^{2,1}$, in particular, along the free-boundary G. Such a requirement and (28)–(31) yield the system of equations

$$\begin{aligned} \bigl [ \dot{A} (y) - nc A(y) \bigr ] G^n (y)= & {} - \Bigl [ R_y \bigl ( G(y),y \bigr ) - c G(y) R_x \bigl ( G(y),y \bigr ) - K \Bigr ] , \end{aligned}$$

(32)

$$\begin{aligned} \bigl [ \dot{A} (y) - nc A(y) \bigr ] G^n (y)= & {} - \frac{G(y)}{n} \Bigl [ R_{yx} \bigl ( G(y),y \bigr ) \nonumber \\&- \,c G(y) R_{xx} \bigl ( G(y),y \bigr ) - c R_x \bigl ( G(y),y \bigr ) \Bigr ] . \end{aligned}$$

(33)

In view of the definition (29) of R, the associated expression (84) for the function $x \mapsto xR_x (x,y)$ and (83), we can see that this system is equivalent to

$$\begin{aligned} q \bigl ( G(y),y \bigr ) = 0, \end{aligned}$$

(34)

$$\begin{aligned} \dot{A} (y) = nc A(y) - \frac{1}{\sigma ^2 (n-m)} \int _{G(y)}^{\infty } s^{-n-1} H(s,y) \, ds , \end{aligned}$$

(35)

where H is defined by (13) and

$$\begin{aligned} q(x,y) = \int _0^x s^{-m-1} H(s,y) \, ds . \end{aligned}$$

(36)

We can also check that the solution to (35) is given by

$$\begin{aligned} A(y) = \frac{e^{c n y}}{\sigma ^2 (n-m)} \int _y^{y_\infty } e^{-c n u} \int _{G(u)}^\infty s^{-n-1} H(s,u) \, ds \, du , \quad \text {for } y_0 < y < y_\infty , \end{aligned}$$

(37)

if the integrals converge.

The following result, the proof of which we develop in Appendix 1, is concerned with the solution to the system of equations (34)–(35).

Lemma 1

Suppose that Assumption 1 holds true. The equation $q(x,y)=0$ for $x>0$ defines uniquely a strictly increasing $C^1$ function $G: ]y_0, y_\infty [ \rightarrow ]0,\infty [$, which satisfies

$$\begin{aligned}&x^\dagger (y) < G(y) \text { for all } y \in ]y_0, y_\infty [ , \quad \lim _{y \downarrow y_0} G(y) = 0 , \text { if } y_0 > 0 , \quad \text {and} \quad \nonumber \\&\quad \lim _{y \uparrow y_\infty } G(y) = \infty , \end{aligned}$$

(38)

where $x^\dagger $ is defined by (22). Furthermore, the function A given by (37) is well-defined and real-valued, and there exists a constant $C_1 > 0$ such that

$$\begin{aligned} 0 < A(y) G^n (y) \le C_1 \Psi (y) \left[ 1 + G^{n-\vartheta } (y) \right] \quad \text {for all } y \in ]y_0, y_\infty [ , \end{aligned}$$

(39)

where the decreasing function $\Psi $ and the constant $\vartheta > 0$ are as in (21), and

$$\begin{aligned} g^{-1} (x) + \left[ 1 + g^{-1} (x) \right] G^{n-\vartheta } \bigl ( g^{-1} (x) \bigr ) \le C_1 \bigl [ 1+ x^{n-\vartheta } \bigr ] \quad \text {for all } x > x_0 , \end{aligned}$$

(40)

where $g^{-1}$ is the inverse of the strictly increasing function g that is defined by

$$\begin{aligned} g(y) = e^{cy} G(y) , \quad \text {for } y \in ]y_0, y_\infty [ . \end{aligned}$$

(41)

Remark 2

The last limit in (38) implies that, under the optimal strategy, if $\bar{y} < \infty $, then the maximal capacity level $\bar{y}$ is never reached. This result is due to the assumption that the function $y^\dagger $ appearing in Assumption 1 is such that $y^\dagger (\chi ) < \lim _{x \rightarrow \infty } y^\dagger (x) \equiv y_\infty \le \bar{y}$ for all $\chi \in ]x_0, \infty [$. Our analysis could be trivially modified to allow for the possibility that $\bar{y} < \infty $ and $\lim _{y \uparrow \bar{y}} G(y) < \infty $, which would give rise to the situation where the maximal capacity level $\bar{y}$ is reached in finite time with strictly positive probability. Such a relaxation would simply involve allowing for the strictly increasing function $y^\dagger $ to be such that $\lim _{x \rightarrow \infty } y^\dagger (x) \equiv y_\infty > \bar{y}$. However, we have opted against such a relaxation because this would complicate the notation and the proof of Lemma 1 substantially. $\square $

Example 2

Suppose that h is a Cobb-Douglas function given by (23) in Example 1. In this case, we can check that

$$\begin{aligned} G(y) = \left[ \frac{rK (\alpha -m)}{-m} \frac{y^{1-\beta }}{\beta -\alpha c y} \right] ^{1 / \alpha } ,\quad \text {for } y \in ]y_0, y_\infty [ \equiv ]0, \beta /c\alpha [ . \end{aligned}$$

(42)

Figures 2 and 3 illustrate this example. $\square $

To complete the construction of the solution w to the HJB equation (24) that identifies with the problem’s value function v, we note that there exists a mapping $z : {\mathcal I} \rightarrow {\mathbb R}_+$ such that

$$\begin{aligned} z(x,y) \in ](y_0 -y)^+, y_\infty -y[ \quad \text {and} \quad xe^{-cz(x,y)} = G \bigl ( y + z(x,y) \bigr ) \quad \text {for all} (x,y) \in {\mathcal I} . \end{aligned}$$

(43)

Indeed, this claim follows immediately from the calculations

$$\begin{aligned}&\lim _{z \uparrow y_\infty - y} \Bigl [ xe^{-cz} - G(y+z) \Bigr ] = - \infty , \\&\frac{\partial }{\partial z} \Bigl [ xe^{-cz} - G(y+z) \Bigr ] = - cx e^{-cz} - G' (y+z) < 0 , \quad \text {for } z \in ] (y_0-y)^+, y_\infty -y[ , \\&\lim _{z \downarrow (y_0 -y)^+} \Bigl [ xe^{-cz} - G(y+z) \Bigr ] = \left. {\left\{ \begin{array}{ll} xe^{-c (y_0-y)} - \lim _{u \downarrow y_0} G(u) , &{} \text {if } y \le y_0 , \\ x - G(y) , &{} \text {if } y > y_0 \end{array}\right. } \right\} > 0 , \end{aligned}$$

in which, we have used (38) and the fact that G is increasing. We prove the following result in Appendix 1.

Lemma 2

Suppose that Assumption 1 holds true. The function w defined by

$$\begin{aligned} w(x,y) = {\left\{ \begin{array}{ll} R(x,y) , &{} \text {if } (x,y) \in {\mathcal W} \cap \bigl ( {\mathbb R}_+ \times [y_\infty , \bar{y}] \bigr ) , \\ A(y) x^n + R(x,y) , &{} \text {if } (x,y) \in {\mathcal W} \cap \bigl ( {\mathbb R}_+ \times [y_0, y_\infty [ \bigr ), \\ w \bigl ( xe^{-cz(x,y)}, y+z(x,y) \bigr ) - K z(x,y) , &{} \text {if } (x,y) \in {\mathcal I} , \end{array}\right. } \end{aligned}$$

(44)

where A is defined by (37) and z is given by (43), is a $C^{2,1}$ solution to the HJB equation (24). Furthermore, the function $w(\cdot ,y)$ is increasing and there exists a constant $C_2 > 0$ such that

$$\begin{aligned} - C_2 (1+y) \le w(x,y) \quad \text {for all } (x,y) \in {\mathcal S} , \end{aligned}$$

(45)

$$\begin{aligned} w \bigl ( G(y),y \bigr ) \le C_2 [\Psi (y) + y] \bigl [ 1 + G^{n-\vartheta } (y) \bigr ] \quad \text {for all } y \in ]y_0, y_\infty [ , \end{aligned}$$

(46)

where the decreasing function $\Psi $ is as in (20)–(21).

We can now establish the main result of the paper.

Theorem 1

Suppose that Assumption 1 holds true. The value function v of the control problem formulated in Sect. 2 identifies with the solution w to the HJB equation (24) given by (44) in Lemma 2 and the optimal capacity expansion strategy $\zeta ^\star $ is given by

$$\begin{aligned} \zeta _t^\star = {\left\{ \begin{array}{ll} 0 , &{} \text {if } y > y_0 \text { and } e^{cy} \sup _{0 \le s \le t} X_s^0 \le \overline{g} (y) , \\ g^{-1} \left( e^{cy} \sup _{0 \le s \le t} X_s^0 \right) , &{} \text {if } y < y_\infty \text { and } e^{cy} \sup _{0 \le s \le t} X_s^0 > \overline{g} (y) , \end{array}\right. } \quad \text {for } t>0 , \end{aligned}$$

(47)

where

$$\begin{aligned} \overline{g} (y) = {\left\{ \begin{array}{ll} 0 , &{} \text {if } y_0 > 0 \text { and } y \le y_0 , \\ g(y) , &{} \text {if } y \in ]y_0, y_\infty [ , \\ \infty , &{} \text {if } y \in [y_\infty , \bar{y}] \cap {\mathbb R}_+ , \end{array}\right. } \end{aligned}$$

(48)

g is defined by (41), and $X^0$ is the geometric Brownian motion given by (1).

Proof

Fix any initial condition $(x,y) \in {\mathcal S}$ and any admissible strategy $\zeta \in {\mathcal A}$. In view of Itô-Tanaka-Meyer’s formula and the left-continuity of the processes X, Y, we can see that

$$\begin{aligned}&e^{-rT} w(X_{T+} , Y_{T+}) \\&\quad = w(x,y) + \int _0^T e^{-rt} \bigl [ \sigma ^2 X_t^2 w_{xx} (X_t,Y_t) + bX_t w_x (X_t,Y_t) - rw(X_t,Y_t) \bigr ] \, dt \\&\qquad + \int _{[0,T]} \bigl [ w_y (X_t, Y_t) - cX_t w_x (X_t, Y_t) \bigr ] \, d\zeta _t^\mathrm {c}+ M_T \\&\qquad + \sum _{0 \le t \le T} e^{-rt} \bigl [ w( X_{t+}, Y_{t+} ) - w(X_t, Y_t) \bigr ] , \end{aligned}$$

where

$$\begin{aligned} M_T = \sqrt{2} \sigma \int _0^T e^{-rt} X_t w_x(X_t, Y_t) \, dW_t . \end{aligned}$$

(49)

Combining this calculation with the observation that

$$\begin{aligned}&w( X_{t+}, Y_{t+} ) - w(X_t, Y_t)\\&\quad \mathop {=}\limits ^{(7)} \int _0^{\Delta \zeta _t} \frac{dw \bigl ( X_t e^{-cs}, Y_t +s \bigr )}{ds} \, ds , \\&\quad = \int _0^{\Delta \zeta _t} \bigl [ w_y \bigl ( X_t e^{-cs}, Y_t + s \bigr ) - cX_t e^{-cs} w_x \bigl ( X_t e^{-cs}, Y_t + s \bigr ) \bigr ] \, ds , \end{aligned}$$

we obtain

$$\begin{aligned}&\int _0^T e^{-rt} h(X_t, Y_t) \, dt - K \int _{[0,T]} e^{-rt} \, d\zeta _t + e^{-rT} w(X_{T+}, Y_{T+}) \nonumber \\&\quad = w(x,y) + \int _0^T e^{-rt} \bigl [ \sigma ^2 X_t^2 w_{xx} (X_t,Y_t) + bX_t w_x (X_t,Y_t) - rw(X_t,Y_t) \nonumber \\&\qquad + h(X_t, Y_t) \bigr ] \, dt + \int _{[0,T]} \bigl [ w_y (X_t, Y_t) - cX_t w_x (X_t, Y_t) - K \bigr ] \, d\zeta _t^\mathrm {c}+ M_T \nonumber \\&\qquad + \sum _{0 \le t \le T} e^{-rt} \int _0^{\Delta \zeta _t} \bigl [ w_y \bigl ( X_t e^{-cs}, Y_t +s \bigr ) - cX_t e^{-cs} w_x \bigl ( X_t e^{-cs}, Y_t +s \bigr ) - K \bigr ] \, ds . \end{aligned}$$

(50)

Since w satisfies the HJB equation (24), it follows that

$$\begin{aligned} \int _0^T e^{-rt} h(X_t, Y_t) \, dt - K \int _{[0,T]} e^{-rt} \, d\zeta _t + e^{-rT} w(X_{T+}, Y_{T+}) \le w(x,y) + M_T . \end{aligned}$$

(51)

In view of the integration by parts formula and (2), we can see that

$$\begin{aligned} e^{-rT} Y_{T_+} - y = -r \int _0^T e^{-rt} Y_t \, dt + \int _{[0,T]} e^{-rt} \, d\zeta _t . \end{aligned}$$

(52)

This identity, the admissibility condition (8) in Definition 1 and the monotone convergence theorem imply that

$$\begin{aligned} {\mathbb {E}}\left[ \int _0^\infty e^{-rt} Y_t \, dt \right]&= \lim _{T \rightarrow \infty } {\mathbb {E}}\left[ \int _0^T e^{-rt} Y_t \, dt \right] \nonumber \\&\le \lim _{T \rightarrow \infty } \left( \frac{y}{r} + \frac{1}{r} {\mathbb {E}}\left[ \int _{[0,T]} e^{-rt} \, d\zeta _t \right] \right) \nonumber \\&= \frac{y}{r} + \frac{1}{r} {\mathbb {E}}\left[ \int _{[0,\infty [} e^{-rt} \, d\zeta _t \right] \nonumber \\&< \infty , \end{aligned}$$

(53)

which implies that

$$\begin{aligned} \liminf _{T \rightarrow \infty } {\mathbb {E}}\left[ e^{-rT} Y_{T+} \right] = 0 . \end{aligned}$$

(54)

The lower bound in (20), the estimate (45) and (52) imply that

$$\begin{aligned}&\int _0^T e^{-rt} h(X_t, Y_t) \, dt - K \int _{[0,T]} e^{-rt} \, d\zeta _t + e^{-rT} w(X_{T+}, Y_{T+}) \\&\quad \ge - C_0 \int _0^T e^{-rt} (1 + Y_t) \, dt - K \int _{[0,T]} e^{-rt} \, d\zeta _t - C_2 e^{-rT} (1 + Y_{T+}) \\&\quad \ge - C_0 \int _0^T e^{-rt} (1 + Y_t) \, dt - (K+C_2) \int _{[0,T]} e^{-rt} \, d\zeta _t - C_2 (1+y) \\&\quad \ge - \left( \frac{C_0}{r} + C_2 + C_2 y \right) - C_0 \int _0^\infty e^{-rt} Y_t \, dt - (K+C_2) \int _{[0,\infty [} e^{-rt} \, d\zeta _t . \end{aligned}$$

The admissibility condition (8) and (53) imply that the random variable on the right-hand side of these inequalities has finite expectation. Combining this observation with (51), we can see that ${\mathbb {E}}\left[ \inf _{T \ge 0} M_T \right] > - \infty $. Therefore, the stochastic integral M is a supermartingale and ${\mathbb {E}}\left[ M_T \right] \le 0$ for all $T>0$. Furthermore, Fatou’s lemma implies that

$$\begin{aligned} J_{x,y} (\zeta ) \le \liminf _{T \rightarrow \infty } {\mathbb {E}}\left[ \int _0^T e^{-rt} h(X_t, Y_t) \, dt - K \int _{[0,T]} e^{-rt} \, d\zeta _t \right] . \end{aligned}$$

Taking expectations in (51) and passing to the limit, we obtain

$$\begin{aligned} J_{x,y} (\zeta ) \le w(x,y) + \liminf _{T \rightarrow \infty } e^{-rT} {\mathbb {E}}\left[ - w(X_{T+} , Y_{T+}) \right] . \end{aligned}$$

The inequality $J_{x,y} (\zeta ) \le w(x,y)$ now follows because the estimate (45) implies that

$$\begin{aligned} \liminf _{T \rightarrow \infty } e^{-rT} E \bigl [ - w(X_{T+} , Y_{T+}) \bigr ]&\le \lim _{T \rightarrow \infty } C_2 e^{-rT} + C_2 \liminf _{T \rightarrow \infty } e^{-rT} E \left[ Y_{T+} \right] \mathop {=}\limits ^{(54)} 0 . \end{aligned}$$

Thus, we have proved that $v(x,y) \le w(x,y)$.

To prove the reverse inequality and establish the optimality of the process $\zeta ^\star $ given by (47), we first consider the possibility that $[y_\infty , \bar{y}] \cap {\mathbb R}_+ \ne \emptyset $ and $y \in [y_\infty , \bar{y}]$. In this case, $\zeta _t^\star = 0$ for all $t \ge 0$, and

$$\begin{aligned} J_{x,y} (\zeta ^\star ) = {\mathbb {E}}\left[ \int _0^\infty e^{-rt} h(X_t^0, y) \, dt \right] \mathop {=}\limits ^{(29), (81)} R(x,y) \mathop {=}\limits ^{(44)} w(x,y) , \end{aligned}$$

which establish the required claims.

In the rest of the proof, we assume that $y < y_\infty $. In this case,

$$\begin{aligned} Y_t^\star = {\left\{ \begin{array}{ll} y , &{} \text {if } y \in ]y_0, y_\infty [ \text { and } e^{cy} \sup _{0 \le s \le t} X_s^0 \le \overline{g}(y) , \\ g^{-1} \left( e^{cy} \sup _{0 \le s \le t} X_s^0 \right) , &{} \text {if } e^{cy} \sup _{0 \le s \le t} X_s^0 > \overline{g}(y) , \end{array}\right. } \end{aligned}$$

(55)

for all $t>0$, and, apart from a possible initial jump of size $(g^{-1} (e^{cy}x) - y)^+$ at time 0, the process $(e^{cy} X^0 , Y^\star )$ is reflecting in the free-boundary g in the positive direction. In particular,

$$\begin{aligned} Y_t^\star \in [y_0, y_\infty [ , \quad e^{cy} X_t^0 \le g (Y_t^\star ) \quad \text {and} \quad \zeta _t^\star - \zeta _0^\star = \int _{]0,t[} \mathbf{1} _{\{ e^{cy} X_s^0 = g(Y_s^\star ) \}} \, d\zeta _s^\star \quad \text {for all } t > 0. \end{aligned}$$

In view of (7) and the definition (41) of g, we can see that

$$\begin{aligned} e^{cy} X_t^0 \le g (Y_t^\star ) \ \Leftrightarrow \ X_t^\star \le G (Y_t^\star ) \quad \text {and} \quad \{ e^{cy} X_t^0 = g(Y_t^\star ) \} = \{ X_t^\star = G(Y_t^\star ) \} , \end{aligned}$$

where $X^\star $ is the solution to (4) given by (7). It follows that the process $(X^\star , Y^\star )$ satisfies

$$\begin{aligned} Y_t^\star \in [y_0, y_\infty [ , \quad X_t^\star \le G (Y_t^\star ) \quad \text {and} \quad \zeta _t^\star - \zeta _0^\star = \int _{]0,t[} \mathbf{1} _{\{ X_s^\star = G(Y_s^\star ) \}} \, d\zeta _s^\star \quad \text {for all } t > 0 . \end{aligned}$$

(56)

Since the function g is strictly increasing, $\zeta _0^\star > 0$ if and only if $xe^{cy} > g(y) \mathop {=}\limits ^{(41)} e^{cy} G(y)$. Therefore,

$$\begin{aligned} \zeta _0^\star = \bigl ( g^{-1} (e^{cy}x) - y \bigr )^+ > 0 \text { if and only if } (x,y) \in {\mathcal I} . \end{aligned}$$

(57)

Furthermore, given any $(x,y) \in {\mathcal I}$, we note that

$$\begin{aligned} z = g^{-1} (xe^{cy}) - y \ \Leftrightarrow \ xe^{cy} = e^{c(y+z)} G(y+z) \ \Leftrightarrow \ xe^{-cz} = G(y+z) , \end{aligned}$$

which implies that $\zeta _0^\star = z(x,y)$, where the function z is given by (43). It follows that

$$\begin{aligned} w(X_{0+}^\star , Y_{0+}^\star ) - w(x,y) = w \bigl ( xe^{-cz(x,y)}, y+z(x,y) \bigr ) - w(x,y) \mathop {=}\limits ^{(44)} K z(x,y) . \end{aligned}$$

(58)

In light of (56)–(58) and the construction of the solution w to the HJB equation (24), we can see that (50) implies that

$$\begin{aligned} \int _0^T e^{-rt} h \bigl (X_t^\star , Y_t^\star \bigr ) \, dt - K \int _{[0,T]} e^{-rt} \, d\zeta _t^\star + e^{-rT} w \bigl ( X_T^\star , Y_T^\star \bigr ) = w(x,y) + M_T^\star \end{aligned}$$

(59)

for all $T>0$, where the local martingale $M^\star $ is defined as in (49).

To show that $\zeta ^\star $ is indeed admissible, we use (40) and (55) to calculate

$$\begin{aligned} Y_t^\star = y \mathbf{1} _{\{ Y_t^\star = y \}} + g^{-1} \left( e^{cy} \sup _{0 \le s \le t} X_s^0 \right) \mathbf{1} _{\{ Y_t^\star > y \}} \le y + C_1 + C_1 e^{c(n-\vartheta )y} \left( \sup _{0 \le s \le t} X_s^0 \right) ^{n-\vartheta }. \end{aligned}$$

Combining these inequalities with the first estimate in (76), we can see that

$$\begin{aligned} \lim _{T \rightarrow \infty } {\mathbb {E}}\left[ e^{-rT} Y_T^\star \right] = 0 \quad \text {and} \quad {\mathbb {E}}\left[ \int _0^\infty e^{-rt} Y_t^\star \, dt \right] < \infty . \end{aligned}$$

It follows that

$$\begin{aligned} {\mathbb {E}}\left[ \int _{[0,\infty [} e^{-rt} \, d\zeta _t^\star \right]&= \lim _{T \rightarrow \infty } {\mathbb {E}}\left[ \int _{[0,T]} e^{-rt} \, d\zeta _t^\star \right] \nonumber \\&\mathop {=}\limits ^{(52)} \lim _{T \rightarrow \infty } \left( {\mathbb {E}}\left[ e^{-rT} Y_T^\star \right] + r {\mathbb {E}}\left[ \int _0^T e^{-rt} Y_t^\star \, dt \right] - y \right) \nonumber \\&< \infty , \end{aligned}$$

(60)

which proves that $\zeta ^\star \in {\mathcal A}$.

To proceed further, we note that the inequality in (56), the fact that $w(\cdot , y)$ is increasing and the bound given by (46) imply that, given any $t>0$,

$$\begin{aligned} w(X_t^\star , Y_t^\star )&\le w \bigl ( G(Y_t^\star ), Y_t^\star \bigr ) \\&\le C_2 \bigl [ \Psi (Y_t^\star ) + Y_t^\star \bigr ] \bigl [ 1 + G^{n-\vartheta } (Y_t^\star ) \bigr ] \le C_2 \bigl [ \Psi (Y_{0+}) + Y_t^\star \bigr ] \bigl [ 1 + G^{n-\vartheta } (Y_t^\star ) \bigr ] , \end{aligned}$$

the last inequality following because $\Psi $ is decreasing. Also, (20) and (56) imply that

$$\begin{aligned} h(X_t^\star , Y_t^\star ) \le C_0 (1 + Y_t^\star ) (1 + {X_t^\star } ^{n-\vartheta }) \le C_0 (1 + Y_t^\star ) \bigl [ 1 + G^{n-\vartheta } (Y_t^\star ) \bigr ] . \end{aligned}$$

The estimate (40) and (55) imply that

$$\begin{aligned}&(1 + Y_t^\star ) G^{n-\vartheta } (Y_t^\star )\\&\quad = (1+y) G^{n-\vartheta } (y) \mathbf{1} _{\{ Y_t^\star = y \}} \\&\qquad + \left[ 1 + g^{-1} \left( e^{cy} \sup _{0 \le s \le t} X_s^0 \right) \right] G^{n-\vartheta } \left( g^{-1} \left( e^{cy} \sup _{0 \le s \le t} X_s^0 \right) \right) \mathbf{1} _{\{ Y_t^\star > y \}} \\&\quad \le (1+y) G^{n-\vartheta } (y) \mathbf{1} _{\{ y>y_0 \}} + C_1 + C_1 e^{c(n-\vartheta )y} \left( \sup _{0 \le s \le t} X_s^0 \right) ^{n-\vartheta }. \end{aligned}$$

It follows that there exists a constant $C_3 = C_3 (y)$ such that

$$\begin{aligned}&w(X_t^\star , Y_t^\star ) \le C_3 \left[ 1 + \left( \sup _{0 \le s \le t} X_s^0 \right) ^{n-\vartheta } \right] \quad \text {and} \quad \\&\quad h(X_t^\star , Y_t^\star ) \le C_3 \left[ 1 + \left( \sup _{0 \le s \le t} X_s^0 \right) ^{n-\vartheta } \right] \end{aligned}$$

for all $t>0$. These inequalities and the estimates (76) imply that

$$\begin{aligned}&{\mathbb {E}}\left[ \sup _{T > 0} \left( \int _0^T e^{-rt} h \bigl ( X_t^\star , Y_t^\star \bigr ) \, dt + e^{-rT} w \bigl ( X_T^\star , Y_T^\star \bigr ) \right) \right] \nonumber \\&\quad \le C_3 \left( \frac{(1+r)}{r} + \int _0^\infty {\mathbb {E}}\left[ e^{-rt} \left( \sup _{0 \le s \le t} X_s^0 \right) ^{n-\vartheta } \right] dt\right. \nonumber \\&\left. \qquad +\, {\mathbb {E}}\left[ \sup _{T > 0} e^{-rT} \left( \sup _{0 \le s \le T} X_s^0 \right) ^{n-\vartheta } \right] \right) \nonumber \\&\quad < \infty , \end{aligned}$$

(61)

and

$$\begin{aligned} \liminf _{T \rightarrow \infty } e^{-rT} {\mathbb {E}}\bigl [ - w(X_T^\star , Y_T^\star ) \bigr ] \ge - C_3 \lim _{T \rightarrow \infty } e^{-rT} \left( 1 + {\mathbb {E}}\left[ \left( \sup _{0 \le s \le T} X_s^0 \right) ^{n-\vartheta } \right] \right) = 0 . \end{aligned}$$

(62)

In view of (59) and (61), we can see that ${\mathbb {E}}\left[ \sup _{T>0} M_T^\star \right] < \infty $. Therefore, the stochastic integral $M^\star $ is a submartingale and ${\mathbb {E}}\left[ M_T^\star \right] \ge 0$ for all $T>0$. Furthermore, Fatou’s lemma implies that

$$\begin{aligned} J_{x,y} (\zeta ^\star ) \ge \limsup _{T \rightarrow \infty } {\mathbb {E}}\left[ \int _0^T e^{-rt} h(X_t^\star , Y_t^\star ) \, dt - K \int _{[0,T]} e^{-rt} \, d\zeta _t^\star \right] . \end{aligned}$$

In view of these observations and (62), we can take expectations in (59) and pass to the limit to obtain

$$\begin{aligned} J_{x,y} (\zeta ^\star ) \ge w(x,y) + \limsup _{T \rightarrow \infty } e^{-rT} {\mathbb {E}}\left[ - w(X_T^\star , Y_T^\star ) \right] \ge w(x,y). \end{aligned}$$

This result and the inequality $v(x,y) \le w(x,y)$ that we have proved above, imply that $v(x,y) = w(x,y)$ and that $\zeta ^\star $ is optimal. $\square $

References

Abel, B.A., Eberly, J.C.: Optimal investment with costly reversibility. Rev. Econ. Stud. 63, 581–593 (1996)
Article MATH Google Scholar
Alvarez, L. H. R. (2006), A general theory of optimal capacity accumulation under price uncertainty and costly reversibility, Helsinki Center of Economic Research, Working Paper
Alvarez, L.H.R.: Irreversible capital accumulation under interest rate uncertainty. Math. Meth. Oper. Res. 72, 249–271 (2010)
Article MathSciNet MATH Google Scholar
Arntzen, H.: Optimal choice of production capacity in a random market. Stochast. Stochast. Rep. 55, 87–120 (1995)
Article MathSciNet MATH Google Scholar
Bahlali, K., Chighoub, F., Djehiche, B., Mezerdi, B.: Optimality necessary conditions in singular stochastic control problems with nonsmooth data. J. Math. Anal. Appl. 355, 479–494 (2009)
Article MathSciNet MATH Google Scholar
Bank, P.: Optimal control under a dynamic fuel constraint. SIAM J. Control Optim. 44, 1529–1541 (2005)
Article MathSciNet MATH Google Scholar
Bather, J.A., Chernoff, H.: Sequential decisions in the control of a spaceship. Fifth Berkeley Symposium on Mathematical Statistics and Probability 3, 181–207 (1967)
Beneš, V.E., Shepp, L.A., Witsenhausen, H.S.: Some solvable stochastic control problems. Stochast. Stochast. Rep. 4, 39–83 (1980)
MathSciNet MATH Google Scholar
Chiarolla, M.B., Ferrari, G.: Identifying the free boundary of a stochastic, irreversible investment problem via the Bank—El Karoui representation theorem. SIAM J. Control Optim. 52, 1048–1070 (2014)
Article MathSciNet MATH Google Scholar
Chiarolla, M.B., Haussmann, U.G.: The optimal control of the cheap monotone follower. Stochast. Stochast. Rep. 49, 99–128 (1994)
Article MathSciNet MATH Google Scholar
Chiarolla, M.B., Haussmann, U.G.: Explicit solution of a stochastic irreversible investment problem and its moving threshold. Math. Oper. Res. 30, 91–108 (2005)
Article MathSciNet MATH Google Scholar
Chow, P.L., Menaldi, J.L., Robin, M.: Additive control of stochastic linear systems with finite horizon. SIAM J. Control. Optim. 23, 858–899 (1985)
Article MathSciNet MATH Google Scholar
Davis, M.H.A., Dempster, M.A.H., Sethi, S.P., Vermes, D.: Optimal capacity expansion under uncertainty. Adv. Appl. Prob. 19, 156–176 (1987)
Article MathSciNet MATH Google Scholar
Davis, M.H.A., Zervos, M.: A pair of explicitly solvable singular stochastic control problems. Appl. Math. Optim. 38, 327–352 (1998)
Article MathSciNet MATH Google Scholar
De Angelis, T., Federico, S., Ferrari, G.: Optimal boundary surface for irreversible investment with stochastic costs, preprint (2016)
De Angelis, T., Ferrari, G.: A stochastic partially reversible investment problem on a finite time-horizon: free-boundary analysis. Stochast. Process. Appl. 124, 4080–4119 (2014)
Article MathSciNet MATH Google Scholar
Dixit, A.K., Pindyck, R.S.: Investment Under Uncertainty. Princeton University Press, Princeton (1993)
Google Scholar
Dufour, F., Miller, B.M.: Generalized solutions in nonlinear stochastic control problems. SIAM J. Control Optim. 40, 1724–1745 (2002)
Article MathSciNet MATH Google Scholar
Federico, S., Pham, H.: Characterization of the optimal boundaries in reversible investment problems. SIAM J. Control Optim. 52, 2180–2223 (2014)
Article MathSciNet MATH Google Scholar
Fleming, W.H., Soner, H.M.: Controlled Markov Processes and Viscosity Solutions. Springer, New York (1993)
MATH Google Scholar
Guo, X., Kaminsky, P., Tomecek, P., Yuen, M.: Optimal spot market inventory strategies in the presence of cost and price risk. Math. Meth. Oper. Res. 73, 109–137 (2011)
Article MathSciNet MATH Google Scholar
Guo, X., Pham, H.: Optimal partially reversible investments with entry decisions and general production function. Stochast. Process. Appl. 115, 705–736 (2005)
Article MathSciNet MATH Google Scholar
Guo, X., Tomecek, P.: Connections between singular control and optimal switching. SIAM J. Control. Optim. 47, 421–443 (2008)
Article MathSciNet MATH Google Scholar
Guo, X., Tomecek, P.: A class of singular control problems and the smooth fit principle. SIAM J. Control Optim. 47, 3076–3099 (2008)
Article MathSciNet MATH Google Scholar
Guo, X., Zervos, M.: Optimal execution with multiplicative price impact. SIAM J. Financ. Math. 6, 281–306 (2015)
Article MathSciNet MATH Google Scholar
Harrison, J.M., Taksar, M.I.: Instantaneous control of Brownian motion. Math. Oper. Res. 8, 439–453 (1983)
Article MathSciNet MATH Google Scholar
Haussmann, U.G., Suo, W.: Singular optimal stochastic controls. I. Existence. SIAM J. Control Optim. 33, 916–936 (1995)
Article MathSciNet MATH Google Scholar
Haussmann, U.G., Suo, W.: Singular optimal stochastic controls. II. Dynamic programming. SIAM J. Control Optim. 33, 937–959 (1995)
Article MathSciNet MATH Google Scholar
Jack, A., Johnson, T.C., Zervos, M.: A singular control problem with application to the goodwill problem. Stochast. Process. Appl. 118, 2098–2124 (2008)
Article MATH Google Scholar
Jacka, S.: A finite fuel stochastic control problem. Stochastics 10, 103–113 (1983)
Article MathSciNet MATH Google Scholar
Jacka, S.: Avoiding the origin: a finite-fuel stochastic control problem. Ann. Appl. Prob. 12, 1378–1389 (2002)
Article MathSciNet MATH Google Scholar
Karatzas, I.: A class of singular stochastic control problems. Adv. Appl. Prob. 15, 225–254 (1983)
Article MathSciNet MATH Google Scholar
Knudsen, T.S., Meister, B., Zervos, M.: Valuation of investments in real assets with implications for the stock prices. SIAM J. Control Optim. 36, 2082–2102 (1998)
Article MathSciNet MATH Google Scholar
Kobila, T.Ø.: A class of solvable stochastic investment problems involving singular controls. Stochast. Stochast. Rep. 43, 29–63 (1993)
Article MathSciNet MATH Google Scholar
Løkka, A., Zervos, M.: A model for the long-term optimal capacity level of an investment project. Int. J. Theor. Appl. Financ. 14, 187–196 (2011)
Article MathSciNet MATH Google Scholar
Løkka, A., Zervos, M.: Long-term optimal investment strategies in the presence of adjustment costs. SIAM J. Control Optim. 51, 996–1034 (2013)
Article MathSciNet MATH Google Scholar
Ma, J.: On the principle of smooth fit for a class of singular stochastic control problems for diffusions. SIAM J. Control Optim. 30, 975–999 (1992)
Article MathSciNet MATH Google Scholar
Manne, A.S.: Capacity expansion and probabilistic growth. Econometrica 29, 632–649 (1961)
Article MathSciNet MATH Google Scholar
Menaldi, J.-L., Robin, M.: On some cheap control problems for diffusion processes. Trans. Am. Math. Soc. 278, 771–802 (1983)
Article MathSciNet Google Scholar
Merhi, A., Zervos, M.: A model for reversible investment capacity expansion. SIAM J. Control Optim. 46, 839–876 (2007)
Article MathSciNet MATH Google Scholar
Motta, M., Sartori, C.: Finite fuel problem in nonlinear singular stochastic control. SIAM J. Control Optim. 46, 1180–1210 (2007)
Article MathSciNet MATH Google Scholar
Øksendal, A.: Irreversible investment problems. Financ. Stochast. 4, 223–250 (2000)
Article MathSciNet MATH Google Scholar
Shreve, S.E., Lehoczky, J.P., Gavers, D.P.: Optimal consumption for general diffusions with absorbing and reflecting barriers. SIAM J. Control Optim. 22, 55–75 (1984)
Article MathSciNet MATH Google Scholar
Soner, H.M., Shreve, S.E.: Regularity of the value function for a two-dimensional singular stochastic control problem. SIAM J. Control Optim. 27, 876–907 (1989)
Article MathSciNet MATH Google Scholar
Steg, J.-H.: Irreversible investment in oligopoly. Financ. Stochast. 16, 207–224 (2012)
Article MathSciNet MATH Google Scholar
Sun, M.: Singular control problems in bounded intervals. Stochast. Stochast. Rep. 21, 303–344 (1987)
MathSciNet MATH Google Scholar
Van Mieghem, J.A.: Commissioned paper: capacity management, investment, and hedging: review and recent developments. Manufact. Serv. Oper. Manage. 5, 269–302 (2003)
Article Google Scholar
Wang, H.: Capacity expansion with exponential jump diffusion processes. Stochast. Stochast. Rep. 75, 259–274 (2003)
Article MathSciNet MATH Google Scholar
Zhu, H.: Generalized solution in singular stochastic control: the nondegenerate problem. Appl. Mathe. Optim. 25, 225–245 (1992)
Article MathSciNet MATH Google Scholar

Download references

Acknowledgments

We thank an anonymous referee for constructive suggestions that improved the paper.

Author information

Authors and Affiliations

Department of Mathematics, College of Science, Kuwait University, Kuwait City, Kuwait
Hessah Al Motairi
Department of Mathematics, London School of Economics, Houghton Street, London, WC2A 2AE, UK
Mihail Zervos

Authors

Hessah Al Motairi
View author publications
You can also search for this author in PubMed Google Scholar
Mihail Zervos
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mihail Zervos.

Appendices

Appendix 1: Proof of Lemmas 1 and 2

Proof of Lemma 1

Given any $y \in ]y_0, y_\infty [$, we observe that

$$\begin{aligned} \frac{\partial }{\partial x} q(x,y) = x^{-m-1} H(x,y) {\left\{ \begin{array}{ll} < 0 , &{} \text {for all } x \in ]0,x^\dagger (y) [ , \\ = 0 , &{} \text {for all } x = x^\dagger (y) , \\ > 0 , &{} \text {for all } x > x^\dagger (y) , \end{array}\right. } \end{aligned}$$

where $x^\dagger $ is defined by (22) in Assumption 1. Also, we note that (17) and (18) in Assumption 1 imply that there exist constants $\varepsilon _1 = \varepsilon _1 (y)$ and $x_1 = x_1 (y) > x^\dagger (y)$ such that $H(x,y) \ge \varepsilon _1$ for all $x \ge x_1$. Given such a choice of constants, we calculate

$$\begin{aligned} \lim _{x \rightarrow \infty } q(x,y)&= \lim _{x \rightarrow \infty } \left[ q(x_1 ,y) + \int _{x_1}^x s^{-m-1} H(s,y) \, ds \right] \\&\ge \lim _{x \rightarrow \infty } \left[ q(x_1 ,y) + \frac{\varepsilon _1}{m} x_1^{-m} - \frac{\varepsilon _1}{m} x^{-m} \right] \\&= \infty , \end{aligned}$$

because $m<0$. Combining these observations with the fact that $q(0,y) = 0$, we can see that the equation $q(x,y) = 0$ for $x>0$ has a unique solution $G(y) > x^\dagger (y)$ for all $y \in ]y_0, y_\infty [$, and that G satisfies (38).

To see that the function $G: ]y_0, y_\infty [ \rightarrow ]0,\infty [$ is $C^1$ and strictly increasing, we differentiate the identity $q \bigl ( G(y),y \bigr ) = 0$ with respect to y to obtain

$$\begin{aligned} \dot{G} (y) = - G^{m+1}(y) H^{-1} \bigl ( G(y),y \bigr ) \int _0^{G(y)} s^{-m-1} H_y (s,y) \, ds > 0, \end{aligned}$$

(63)

the inequality following from (19) in Assumption 1.

To establish (40), we note that

$$\begin{aligned} \lim _{y \downarrow y_0} G^{n-\vartheta } (y) = e^{-c(n-\vartheta )y_0} \lim _{y \downarrow y_0} g^{n-\vartheta } (y) \le \lim _{y \downarrow y_0} g^{n-\vartheta } (y) \\ \end{aligned}$$

and

$$\begin{aligned} 0\le & {} \lim _{y \uparrow y_\infty } (1+y) g^{-n+\vartheta } (y) \le \lim _{y \uparrow y_\infty } (1+y) G^{n-\vartheta } (y) g^{-n+\vartheta } (y)\\= & {} \lim _{y \uparrow y_\infty } (1+y) e^{-c(n-\vartheta )y} < \infty , \end{aligned}$$

where we have used (38) and the facts that G is increasing and $n-\vartheta > 0$. Combining these inequalities with the fact that G and g are continuous increasing functions with the same domain $]y_0, y_\infty [$, we can see that there exists a constant $C_1 > 0$ such that

$$\begin{aligned} 1 + y + (1+y) G^{n-\vartheta } (y) \le C_1 \bigl [ 1 + g^{n-\vartheta } (y) \bigr ] \quad \text {for all } y \in ]y_0, y_\infty [ . \end{aligned}$$

For $x>x_0$ and $y = g^{-1} (x)$, this inequality implies the estimate in (40).

In view of (21) and the fact that the functions G, $-\Psi $ are increasing, we can see that, given any $y \in ]y_0, y_\infty [$,

$$\begin{aligned} A(y) G^n (y)&\le \frac{e^{c n y}}{\sigma ^2 (n-m)} G^n (y) \int _y^{y_\infty } e^{-c n u} \Psi (u) \left[ \frac{1}{n} G^{-n} (u) + \frac{1}{\vartheta } G^{-\vartheta } (u) \right] du \\&\le \frac{e^{c n y}}{\sigma ^2 (n-m)} \left[ \frac{1}{n} \int _y^{y_\infty } e^{-c n u} \Psi (u) \, du + \frac{1}{\vartheta } G^{n-\vartheta } (y) \int _y^{y_\infty } e^{-c n u} \Psi (u) \, du \right] \\&\le \frac{1}{\sigma ^2 (n-m)} \Psi (y)\left[ \frac{1}{cn^2} + \frac{1}{cn\vartheta } G^{n-\vartheta } (y) \right] , \end{aligned}$$

which implies (39). Finally, the strict positivity of A follows from (17) and the inequality in (38). $\square $

Proof of Lemma 2

In view of its construction, we will prove that w is $C^{2,1}$ if we show that $w_y$, $w_x$ and $w_{xx}$ are continuous along the free-boundary G. To this end, we consider any $(x,y) \in {\mathcal I}$, we recall the definition (44) of w and the definition (43) of z, and we use (30)–(31) to calculate

$$\begin{aligned} w_y (x,y) Z&= \frac{\partial }{\partial y} \Bigl [ w \bigl ( xe^{-cz(x,y)}, y + z(x,y) \bigr ) - K z(x,y) \Bigr ] \nonumber \\&= w_y \bigl ( xe^{-cz(x,y)}, y + z(x,y) \bigr ) + \Bigl [ w_y \bigl ( xe^{-cz(x,y)}, y + z(x,y) \bigr ) \nonumber \\&\quad \,- cx e^{-cz(x,y)} w_x \bigl ( xe^{-cz(x,y)}, y + z(x,y) \bigr ) - K \Bigr ] z_y (x,y) \nonumber \\&= w_y \bigl ( xe^{-cz(x,y)}, y + z(x,y) \bigr ) , \end{aligned}$$

(64)

$$\begin{aligned} w_x (x,y)&= \frac{\partial }{\partial x} \Bigl [ w \bigl ( xe^{-cz(x,y)}, y + z(x,y) \bigr ) - K z(x,y) \Bigr ] \nonumber \\&= w_x \bigl ( xe^{-cz(x,y)}, y+z(x,y) \bigr ) e^{-cz(x,y)} \nonumber \\&\quad + \Bigl [ w_y \bigl ( xe^{-cz(x,y)}, y+z(x,y) \bigr ) \nonumber \\&\quad \,-cx e^{-cz(x,y)} w_x \bigl ( xe^{-cz(x,y)}, y+z(x,y) \bigr ) - K \Bigr ] z_x(x,y) \nonumber \\&= w_x \bigl ( xe^{-cz(x,y)}, y+z(x,y) \bigr ) e^{-cz(x,y)} \end{aligned}$$

(65)

and

$$\begin{aligned} w_{xx}(x,y)&= \frac{\partial }{\partial x} \Bigl [ w_x \bigl ( xe^{-cz(x,y)}, y+z(x,y) \bigr ) e^{-cz(x,y)} \Bigr ] \nonumber \\&= w_{xx} \bigl ( xe^{-cz(x,y)}, y+z(x,y) \bigr ) e^{-2cz(x,y)} \nonumber \\&\quad \, + \Bigl [ w_{xy} \bigl ( xe^{-cz(x,y)}, y+z(x,y) \bigr ) \nonumber \\&\quad \,- cx e^{-cz(x,y)} w_{xx} \bigl ( xe^{-cz(x,y)}, y+z(x,y) \bigr ) \nonumber \\&\quad \, - c w_x \bigl ( xe^{-cz(x,y)}, y+z(x,y) \bigr ) \Bigr ] e^{-cz(x,y)} z_x(x,y) \nonumber \\&= w_{xx} \bigl ( xe^{-cz(x,y)}, y+z(x,y) \bigr ) e^{-2cz(x,y)} \end{aligned}$$

(66)

These calculations imply the required continuity results because $\lim _{n \rightarrow \infty } z(x_n,y_n) = 0$ for every convergent sequence $(x_n,y_n)$ in $\mathcal I$ such that $\lim _{n \rightarrow \infty } x_n = \lim _{n \rightarrow \infty } G(y_n)$.

To prove (45)–(46), we note that the bounds of h in (20), the definition (29) of R and the identity $\sigma ^2 mn = -r$ imply that

$$\begin{aligned} - \frac{C_0}{r} (1+y) \le R(x,y) \le C_0 (1+y) \left[ \frac{1}{r} + \frac{1}{\sigma ^2 (n-m-\vartheta ) \vartheta } x^{n-\vartheta } \right] . \end{aligned}$$

(67)

The lower of these bounds and the positivity of A (see (39)) imply that

$$\begin{aligned} - \frac{C_0}{r} (1+y) \le A(y) x^n + R(x,y) = w(x,y) \quad \text {for all } (x,y) \in {\mathcal W} . \end{aligned}$$

(68)

In light of (14) and (82) in Appendix 2, we can see that $R(\cdot , y)$ is increasing for all $y \in [0,\bar{y}] \cap {\mathbb R}$. Combining this observation with the inequalities $A>0$ and $n>0$, we deduce that $w_x (x, y) \ge 0$ for all $(x,y) \in {\mathcal W}$. This result, (43) and (65) imply that $w(\cdot , y)$ is increasing for all $y \in [0,\bar{y}] \cap {\mathbb R}$, which, combined with (68), implies (45). Also, (46) follows immediately from (39) and the upper bound in (67).

It remains to show that w satisfies the HJB equation (24). By the construction and the $C^{2,1}$ continuity of w, we will achieve this if we show that

$$\begin{aligned}&\sigma ^2 x^2 w_{xx} (x,y) + bx w_x (x, y) - rw (x,y) + h(x,y) \le 0 \quad \text {for all } (x,y) \in {\mathcal I} , \nonumber \\\end{aligned}$$

(69)

$$\begin{aligned}&\quad w_y (x,y) - c x w_x (x,y) - K \le 0 \quad \text {for all } (x,y) \in {\mathcal W} \cap \bigl ( {\mathbb R}_+ \times ]y_0, \bar{y}[ \bigr ).\qquad \end{aligned}$$

(70)

To see (69), we consider any $(x,y) \in {\mathcal I}$ and we use (44), (65)–(66) and the fact that w satisfies the ODE (27) inside $\mathcal W$ to calculate

$$\begin{aligned}&\sigma ^2 x^2 w_{xx} (x,y) + bx w_x (x, y) - rw (x,y) + h(x,y) \\&\quad = \sigma ^2 \bigl [ xe^{-cz(x,y)} \bigr ]^2 w_{xx} \bigl ( xe^{-cz(x,y)}, y+z(x,y) \bigr ) + b \bigl [ xe^{-cz(x,y)} \bigr ] \\&\qquad \times w_x \bigl ( xe^{-cz(x,y)}, y+z(x,y) \bigr ) \\&\qquad - r w \bigl ( xe^{-cz(x,y)}, y+z(x,y) \bigr ) + rK z(x,y) + h(x,y) \\&\quad = - h \bigl ( xe^{-cz(x,y)}, y+z(x,y) \bigr ) + h(x,y) + rK z(x,y) \\&\quad = - \int _0^{z(x,y)} \left[ \frac{\partial h \bigl ( xe^{-cu}, y+u \bigr )}{\partial u} - rK \right] du \\&\quad \mathop {=}\limits ^{(13)} - \int _0^{z(x,y)} H \bigl ( xe^{-cu}, y+u \bigr ) \, du . \end{aligned}$$

These calculations, (17), (38), (43) and the continuity of z imply (69).

To prove (70), we first consider the possibility that $y_\infty < \bar{y}$. In this case, we use the fact that $w=R$ inside ${\mathcal W} \cap \bigl ( {\mathbb R}_+ \times [y_\infty , \bar{y}] \bigr )$, the definition (29) of R, the associated expression (84) for the function $x \mapsto xR_x (x,y)$ and (83) to calculate

$$\begin{aligned}&w_y (x,y) - c x w_x (x,y) - K = R_y (x,y) - c x R_x (x,y) - K \nonumber \\&\quad = \frac{1}{\sigma ^2 (n-m)} \left[ x^{m} \int _0^x s^{-m-1} H(s,y) \, ds + x^n \int _x^\infty s^{-n-1} H(s,y) \, ds \right] \nonumber \\&\quad \le 0 \qquad \text {for all } (x,y) \in {\mathcal W} \cap \bigl ( {\mathbb R}_+ \times [y_\infty , \bar{y}[ \bigr ) , \end{aligned}$$

(71)

the inequality following thanks to (17) in Assumption 1.

To proceed further, we note that, inside ${\mathcal W} \cap \bigl ( {\mathbb R}_+ \times ]y_0, y_\infty [ \bigr )$, the definition (44) of w, (32), (34), calculations similar to the ones in (71) and the definition (13) of H imply that

$$\begin{aligned} \varrho (x,y)&:= w_y (x,y) - cx w_x (x,y) - K \nonumber \\&= \frac{1}{\sigma ^2(n-m)} \left[ - x^m \int _x^{G(y)} s^{-m-1} H(s,y) \, ds + x^n \int _x^{G(y)} s^{-n-1} H(s,y) \, ds \right] . \end{aligned}$$

(72)

In light of (17), (38) and the fact that $m < 0 < n$, we can see that

$$\begin{aligned} \varrho _x (x,y)&=\frac{1}{\sigma ^2(n-m)} \left[ -m x^{m-1} \int _x^{G(y)} s^{-m-1} H(s,y) \, ds \right. \\&\quad \left. +\, n x^{n-1} \int _x^{G(y)} s^{-n-1} H(s,y) \, ds \right] \\&\ge 0 \qquad \text {for all } x \in [x^\dagger (y), G(y)] , \end{aligned}$$

which, combined with the identity $\varrho \bigl ( G(y), y \bigr ) = 0$, implies that

$$\begin{aligned} \varrho (x,y) \le 0 \quad \text {for all } x \in [x^\dagger (y), G(y)] . \end{aligned}$$

(73)

Also, we can use the inequality

$$\begin{aligned} \int _x^{G(y)} s^{-m-1} H(s,y) \, ds > 0 \quad \text {for all } x \in ]0,G(y)[ , \end{aligned}$$

which follows from (17) in Assumption 1 and (34), to calculate

$$\begin{aligned} \lim _{x \downarrow 0} \varrho (x,y)&\le \frac{1}{\sigma ^2(n-m)} \lim _{x \downarrow 0} x^n \int _x^{G(y)} s^{-n-1} H(s,y) \, ds \nonumber \\&= \frac{1}{\sigma ^2(n-m)} \lim _{x \downarrow 0} x^n \int _x^{x^\dagger (y)} s^{-n-1} H(s,y) \, ds \nonumber \\&\le 0 , \end{aligned}$$

(74)

the inequality following from (17) and the fact that $n>0$.

Finally, we can use the fact that m, n are the solutions to the quadratic equation (11) and straightforward calculations to obtain

$$\begin{aligned} \sigma ^2 x^2 \varrho _{xx} (x,y) + bx \varrho _x (x, y) - r \varrho (x,y) = - H(x,y) > 0 \quad \text {for all } x \in ]0, x^\dagger (y)[ . \end{aligned}$$

This inequality and the maximum principle imply that the function $\varrho $ has no positive maximum inside $]0, x^\dagger (y)[$, which, combined with (73)–(74), implies that $\varrho (x,y) \le 0$ for all $y \in ]y_0, y_\infty [$ and $x \in ]0, G(y)]$, and (70) follows. $\square $

Appendix 2: A Second Order Linear ODE

In this section, we review certain results regarding the solvability of a second order linear ODE on which our analysis has been based. All of the claims that we do not prove here are standard and can be found in several references (e.g., with the exception of (76), which is proved in Merhi and Zervos [40, Lemma 1], all results can be found in Knudsen et al. [33]).

Given a constant $\lambda $,

$$\begin{aligned} {\mathbb {E}}\left[ \int _0^\infty e^{-rt} \left( X^0_t \right) ^\lambda \, dt \right]&= x^\lambda \int _0^\infty e^{\left[ \sigma ^2 \lambda ^2 + (b-\sigma ^2) \lambda -r \right] t} {\mathbb {E}}\left[ e^{-\sigma ^2 \lambda ^2 t + \sqrt{2} \sigma \lambda W_t} \right] dt \nonumber \\&= {\left\{ \begin{array}{ll} \infty , &{} \text {if } \lambda \le m \text { or } \lambda \ge n , \\ -x^\lambda / \left[ \sigma ^2 \lambda ^2 + (b-\sigma ^2) \lambda -r \right] , &{} \text {if } \lambda \in ]m,n[ , \end{array}\right. } \end{aligned}$$

(75)

where $X^0$ is the geometric Brownian motion given by (1) and $m < 0 < n$ are the constants defined by (12). Furthermore, for all $\lambda \in ]0,n[$, there exist constants $\varepsilon , C > 0$ such that

$$\begin{aligned} e^{-rT} {\mathbb {E}}\left[ \left( \sup _{0 \le t \le T} X_t^0 \right) ^\lambda \right] \le C x^\lambda e^{-\varepsilon T} \quad \text {and} \quad {\mathbb {E}}\left[ \sup _{T \ge 0} e^{-rT} \left( \sup _{0 \le t \le T} X_t^0 \right) ^\lambda \right] \le C x^\lambda \end{aligned}$$

(76)

for all $x>0$.

A Borel measurable function $k : ]0,\infty [ \rightarrow {\mathbb R}$ satisfies

$$\begin{aligned} {\mathbb {E}}\left[ \int _0^\infty e^{-rt} \left| k(X_t^0) \right| dt \right] < \infty \quad \text {for all } x > 0 , \end{aligned}$$

(77)

if and only if

$$\begin{aligned} \int _0^x s^{-m-1} \left| k(s) \right| ds + \int _x^\infty s^{-n-1} \left| k(s) \right| ds < \infty \quad \text {for all } x > 0 . \end{aligned}$$

(78)

In the presence of these equivalent integrability conditions, the function R defined by

$$\begin{aligned} R(x) = \frac{1}{\sigma ^2 (n-m)} \left[ x^m \int _0^x s^{-m-1} k(s) \, ds + x^n \int _x^\infty s^{-n-1} k(s) \, ds \right] , \quad \text {for } x>0 , \end{aligned}$$

(79)

is a special solution to the non-homogeneous ODE

$$\begin{aligned} \sigma ^2 x^2 u''(x) + bx u' (x) - ru(x) + k(x) = 0 \end{aligned}$$

(80)

that admits the probabilistic expression

$$\begin{aligned} R(x)= {\mathbb {E}}\left[ \int _0^\infty e^{-rt} k(X_t^0) \, dt \right] . \end{aligned}$$

(81)

Furthermore,

$$\begin{aligned} \text {if } k \text { is increasing, then } R \text { is increasing} , \end{aligned}$$

(82)

$$\begin{aligned} \text {and, if } k \text { is constant, then } r R(x) = k \text { for all } x>0 . \end{aligned}$$

(83)

In our analysis we have used the following result.

Lemma 3

Consider any $C^1$ function $k: ]0,\infty [ \rightarrow {\mathbb R}$ satisfying the equivalent integrability conditions (77)–(78) and suppose that there exists $\varepsilon > 0$ such that

$$\begin{aligned} \forall x < \varepsilon , \text { either } k'(x) \ge 0 \text { or } k'(x) \le 0 \quad \text {and} \quad \forall x > \varepsilon ^{-1} , \text { either } k'(x) \ge 0 \text { or } k'(x) \le 0 . \end{aligned}$$

Then

$$\begin{aligned} xR'(x) = \frac{1}{\sigma ^2 (n-m)} \left[ x^m \int _0^x s^{-m} k'(s) \, ds + x^n \int _x^\infty s^{-n} k'(s) \, ds \right] \quad \text {for all } x>0 , \end{aligned}$$

(84)

in which expression, both integrals are well-defined and real-valued.

Proof

We first note that the integrability condition (78) implies that the limits

$$\begin{aligned} \lim _{z \downarrow 0} \int _z^x s^{-m-1} k(s) \, ds \quad \text {and} \quad \lim _{z \rightarrow \infty } \int _x^z s^{-n-1} k(s) \, ds \end{aligned}$$

exist in ${\mathbb R}$ and that

$$\begin{aligned} \liminf _{z \downarrow 0} z^{-m} |k(z)| = 0 \quad \text {and} \quad \liminf _{z \rightarrow \infty } z^{-n} |k(z)| = 0 . \end{aligned}$$

(85)

To see the latter claim, suppose that $\liminf _{z \downarrow 0} z^{-m} |k(z)| > 0$. In such a case, there exist constants $\varepsilon , z_1 > 0$ such that $z^{-m} |k(z)| \ge \varepsilon $ for all $z \le z_1$. Therefore,

$$\begin{aligned} \int _0^{z_1} s^{-m-1} |k(s)| \, ds \ge \varepsilon \int _0^{z_1} s^{-1} \, ds = \infty , \end{aligned}$$

which contradicts (78). We can argue similarly by contradiction to prove the second limit in (85).

Using the integration by parts formula, we calculate

$$\begin{aligned} x^{-m} k(x) - z^{-m}k(z) = -m \int _z^x s^{-m-1} k(s) \, ds + \int _z^x s^{-m} k'(s) \, ds \quad \text {for all } 0 < z < x . \end{aligned}$$

(86)

The assumptions that we have made on $k'$ and the monotone convergence theorem imply that the limit $\lim _{z \downarrow 0} \int _z^x s^{-m} k'(s) \, ds$ exists. Therefore, we can pass to the limit as $z \downarrow 0$ in (86) to obtain

$$\begin{aligned} x^{-m} k(x) = -m \int _0^x s^{-m-1} k(s) \, ds + \int _0^x s^{-m} k'(s) \, ds \quad \text {for all } x>0 . \end{aligned}$$

Similarly, we can see that

$$\begin{aligned} - x^{-n} k(x) = -n \int _x^\infty s^{-n-1} k(s) \, ds + \int _x^\infty s^{-n} k'(s) \, ds \quad \text {for all } x>0 . \end{aligned}$$

The required result follows immediately from these calculations and the expression

$$\begin{aligned} xR'(x) = \frac{1}{\sigma ^2 (n-m)} \left[ m x^m \int _0^x s^{-m-1} k(s) \, ds + n x^n \int _x^\infty s^{-n-1} k(s) \, ds \right] . \end{aligned}$$

$\square $

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Al Motairi, H., Zervos, M. Irreversible Capital Accumulation with Economic Impact. Appl Math Optim 75, 525–551 (2017). https://doi.org/10.1007/s00245-016-9341-9

Download citation

Published: 21 March 2016
Issue Date: June 2017
DOI: https://doi.org/10.1007/s00245-016-9341-9

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Irreversible Capital Accumulation with Economic Impact

Abstract

Similar content being viewed by others

Irreversible investment with fixed adjustment costs: a stochastic impulse control approach

Irreversible Investment

Mathematical Modeling of Investments in an Imperfect Capital Market

1 Introduction

2 Problem Formulation and Assumptions

Definition 1

Remark 1

Assumption 1

Example 1

3 The Solution to the Control Problem

Lemma 1

Remark 2

Example 2

Lemma 2

Theorem 1

Proof

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix 1: Proof of Lemmas 1 and 2

Proof of Lemma 1

Proof of Lemma 2

Appendix 2: A Second Order Linear ODE

Lemma 3

Proof

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Irreversible Capital Accumulation with Economic Impact

Abstract

Similar content being viewed by others

Irreversible investment with fixed adjustment costs: a stochastic impulse control approach

Irreversible Investment

Mathematical Modeling of Investments in an Imperfect Capital Market

1 Introduction

2 Problem Formulation and Assumptions

Definition 1

Remark 1

Assumption 1

Example 1

3 The Solution to the Control Problem

Lemma 1

Remark 2

Example 2

Lemma 2

Theorem 1

Proof

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix 1: Proof of Lemmas 1 and 2

Proof of Lemma 1

Proof of Lemma 2

Appendix 2: A Second Order Linear ODE

Lemma 3

Proof

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation