1 Introduction

The theory of optimal portfolio choice started with the pioneering contribution of [29]. Markowitz used the variance as a measure of the risk of a portfolio return. He recommended choosing the portfolio weights in such a way that the portfolio variance is minimal for a given level of the expected portfolio return. All of these so-called efficient portfolios lie on the efficient frontier which is a parabola in the mean-variance space.

In the meantime, many further proposals for a portfolio selection have been made. A widely made approach is based on the maximization of an investor’s utility function, where the investor chooses a portfolio for which its utility reaches a maximum possible value [16, 34]. The mean-variance approach of [32] turns out to be fully consistent with the expected utility maximization (see [14]). This result is valid without any distributional assumptions imposed on the returns. Similarly, a quadratic utility provides a closed-form solution under very general conditions [5]. However, there are many other ways to choose the utility function like e.g., the power and the exponential utility function. In these cases, no closed-form solutions can be derived without information on the distribution of the return process [6].

The focus of this paper lies on the power and the logarithmic utility functions. If W denotes the wealth of the investor, then the power utility is given by \(U\left( W\right) =\frac{1}{1-\gamma }W^{1-\gamma },\ \gamma >0, \gamma \ne 1\). The logarithmic utility \(U\left( W\right) =\log W\) is obtained as a limit of the power utility if \(\gamma \) converges to one. The quantity \(\gamma \) is equal to the relative risk aversion which is constant for the power utility (CRRA). Following the classical interpretation by John W. Pratt, this means that an investor would pay the same amount of money relatively to his wealth in order to avoid the risk [8, 11]. This is a very important property and it makes this utility function quite attractive from an economic point of view.

To determine the optimal portfolio for the power utility several numerical procedures have been proposed in the literature Grauer and Hakansson [20], Cover [13]. The first-order conditions for the maximum of the expected utility are derived and a Taylor series expansion is used to transform the expected utility into a linear combination of moments [9]. However, no analytical solution is available in the literature and mostly a numerical approach is applied [8]. Yet another possibility is to make use of distributional assumptions on the predictability of the returns [3] and to combine it with a numerical procedure [10, 28].

The use of the log-normal distribution for modelling asset or portfolio returns has become popular many years ago. In Elton and Gruber [15] an efficient set theorem is derived assuming log-normally distributed portfolio returns. They present an algorithm for determining the efficient portfolio. In the paper on multi-period mean-variance analysis, Hakansson [21] describes a limiting approach for the optimal portfolio choice. Merton and Samuelson [32] discuss the log-normal approach in detail and conclude that it should be very carefully applied.

In this paper, we consider the problem of finding the optimal portfolios for power and logarithmic utilities under the assumption that the portfolio gross returns are approximately log-normally distributed. The analytical expressions of the optimal portfolio weights are obtained for both utility functions. Moreover, we state the conditions under which the optimal portfolio in the sense of maximizing the expected power utility and the optimal portfolio that maximizes the expected logarithmic utility are both mean-variance efficient.

The log-normal distribution used as a model for portfolio returns in this paper has several nice properties. If the portfolio gross return follows a log-normal distribution, then it is possible to determine its higher-order moments straightforwardly. Moreover, if the ratio of mean and standard deviation is large (analogue of Sharpe ratio), then the log-normal distribution is close to the normal one (analytically shown in the paper). This is, for instance, the case when the mean of the portfolio is bounded and its variance is small. Furthermore, the assumption of log-normality introduces the skewness into the model of portfolio return, a well-known feature observed in empirical data. We further extend the log-normality to a general class of log-skew-normal distributions (cf., Lin and Stoyanov [26]), which includes the log-normal model as a special case. The main advantage of the considered general model is that it allows to capture both negative and positive skewness in asset return data, while the log-normal distribution possess a positive skewness only.

In applications, the asset returns \(r_j\) vary in most cases in a range up to \(\pm 15\%\). Thus the corresponding gross returns \(R_j = 1 + r_j\) take values close to 1. Consequently, the overall portfolio mean is close to one while the portfolio variance is usually small. This means that in such a case it behaves like a normal distribution. Moreover, assuming a log-normally distributed portfolio return, its continuously compounded rate of return appears to be normally distributed and vice versa. It is also notable that the normality assumption is often used in a literature, i.e. to model the asset return path, VaR process with normally distributed error terms is taken [5, 6, 8,9,10].

The rest of the paper is structured as follows. In Sects. 2 and 3 the optimization problem is described and the assumptions used in the paper are presented and discussed. Our main findings are given in Sect. 4. In Theorem 1 we derive an explicit formula for the weights of the optimal portfolio that maximizes the expected power utility and discuss its properties. In particular, we derived the conditions under which this portfolio is mean-variance efficient in Theorem 2 and show that it converges to the portfolio with the maximum Sharpe ratio when the relative risk aversion converges to infinity. In Sect. 4.2, the analytical results for the optimal portfolio in the sense of maximizing the expected logarithmic utility are present: the closed-form expression of its weights is provided in Theorem 3, while its mean-variance efficiency is proved in Theorem 4. In Sect. 4.3 we extend the theoretical results to the general family of the log-skew-normal distribution. In Sect. 5 the results of an empirical study are shown. Here we study portfolios based on stocks contained in the German stock market index. We check whether the log-normality assumption on the wealth is fulfilled and whether the conditions imposed in Theorem 1 are fulfilled. It turns out that these assumptions are reasonable and that they are mostly satisfied for the portfolios in our study. Moreover, we compare the portfolio with optimal weights with other portfolio strategies and show the dominance of the new approach.

2 Motivation of the log-normal approximation

Let us define two random variables \(X\sim N(\mu ,\sigma ^2)\) and \(Y\sim \ln N(\ln \mu ,\frac{\sigma ^2}{\mu ^2})\). It is easy to see that the first two moments of the discussed distributions coincide if \(\sigma /\mu \) is small, i.e., \(E(Y)=e^{\ln \mu +\frac{\sigma ^2}{2\mu ^2}}\approx \mu \) and \(Var(Y)=e^{2\ln \mu +\frac{\sigma ^2}{\mu ^2}}(e^{\frac{\sigma ^2}{\mu ^2}}-1)\approx \sigma ^2\) as \(\sigma /\mu \rightarrow 0\). Indeed, next we show that if \(\sigma /\mu \rightarrow 0\) and \(\mu >0\) then the distribution of Y approaches to the distribution of X.

In order to find the accuracy of the approximation we consider the difference of both distribution functions, namely

$$\begin{aligned} \begin{aligned} \Psi (x)&=\Phi \left( \frac{x-\mu }{\sigma }\right) -\Phi \left( \frac{\ln x-\ln \mu }{\sigma /\mu }\right) \\ \end{aligned}. \end{aligned}$$
(1)

Lemma 1

(Log-normal approximation) Let \(\mu >0\) then for \(\Psi (x)\) defined in (1) holds

$$\begin{aligned} \sup _{x\in \mathbb {R}}|\Psi (x)|=O\left( \frac{\sigma }{\mu } \right) ~~\text {as }\sigma /\mu \rightarrow 0. \end{aligned}$$
(2)

Thus, difference between normal and log-normal distribution functions approaches zero quite fast as \(\sigma /\mu \rightarrow 0\). So, the smaller \(\sigma /\mu \) the better approximation we receive.

Taking into account that the maximization problem deals with a portfolio gross return, it is well-known that in practice asset returns are close to zero, so one expects to have a portfolio gross return fluctuating around one. Consequently, considering the asset returns to be normally distributed with a mean nearby zero and a small variance we obtain normally distributed gross portfolio returns with a mean around one and a small variance. Meanwhile, the power utility is defined on a nonnegative domain, thus, it is natural to assume that the expected portfolio gross return is positive and the portfolio standard deviation is small. That is why, as it was shown, log-normal distribution can be a good proxy for the normally distributed portfolio returns.

As an illustrative example Fig. 1 describes an identical behaviour of density functions of normal and log-normal distributions for several combinations of \(\mu \) and \(\sigma \).

Fig. 1
figure 1

Similarity of normal and log-normal densities

3 Portfolio selection using a utility function

In this section, we give some notations and basic assumptions used in the paper and discuss the portfolio selection problem for a power utility function.

Suppose that a portfolio consists of k risky assets. Let \(r_i\) denote the relative price change of the ith asset within one period, also known in the literature as the simple one-period return on the ith asset (see e.g., [35]). Let \(\varvec{r} = (r_1,\dots ,r_k)^\prime \) be the vector of (simple) returns. The quantity \(\omega _i\) stands for the relative amount of money invested into the ith asset. Then it holds for the vector of the portfolio weights \(\varvec{\omega }=\left( {\omega }_{1},\omega _{2},\dots ,{\omega }_{k}\right) ^\prime \) that \(\varvec{\omega }^\prime \mathbf {1}=1\), where \(\mathbf {1}\) is a vector of ones. The portfolio return at the end of the period is equal to \(\varvec{\omega }^\prime \varvec{r}\). If \(W_0\) denotes the wealth of the investor at the beginning of the period, then the wealth at the end of the period is given by

$$\begin{aligned} W=W_{0}\left( \varvec{\omega }^\prime \varvec{r}+1\right) =W_0\varvec{\omega }^\prime \varvec{R}, \end{aligned}$$

where \(\varvec{R}=\varvec{r}+\mathbf {1}\). The mean vector of \(\varvec{R}\) and its covariance matrix are denoted by \(E(\varvec{R})=\varvec{\varvec{\mu }}\) and \(Var(\varvec{R})=\varvec{\varvec{\Sigma }}\) where \(\varvec{\varvec{\Sigma }}\) is assumed to be positive definite.

Many procedures have been proposed in literature how to construct an optimal portfolio, i.e., how to choose the optimal portfolio weights. [29] developed the mean-variance principle for portfolio selection. The idea is to choose the portfolio weights such that the portfolio variance is minimal for a given value of the portfolio return. Another attempt is based on utility functions. It is described in detail by, e.g., [14]. The optimal weights are obtained by maximizing the expected utility \(U\left( \cdot \right) \) of the wealth W at the end of the investment period and it is given by

$$\begin{aligned} \varvec{\omega }^*=\underset{{\{ \varvec{\omega }:\ \varvec{\omega }^\prime \mathbf {1}=1 \}}}{\arg \max } E\left[ U\left( W\right) \right] . \end{aligned}$$
(3)

There are many possibilities for the choice of a utility function \(U(\cdot )\) (see [34]). In the case of the quadratic utility the solution of (3) is mean-variance efficient, i.e., it coincides with Markowitz’s optimization problem under some conditions (see [4]). Other utility functions considered in the literature are the HARA, the CARA, and the SAHARA utilities (see e.g., [12, 34]).

In this paper we deal with the optimal portfolio choice problem under the power utility function expressed as

$$\begin{aligned} U(W)=\frac{W^{1-\gamma }}{1-\gamma },\quad \gamma >0, \gamma \ne 1 . \end{aligned}$$
(4)

It is the only utility function with constant relative risk aversion (cf. [34]) equal to \(\gamma \). The logarithmic utility is a limiting case of the power utility when \(\gamma \) converges to 1 and it is given by

$$\begin{aligned} U(W)=\log W. \end{aligned}$$
(5)

Both the power utility function and the logarithmic utility function are widely applied in practice.

In order to calculate the expected utility, it is necessary to have a statement on the distribution of the portfolio return. There have been many proposals in the literature on how to model the return of stocks. [1] proposed to use the normal distribution for asset returns while [17] and [33] suggested the use of stable distributions. An overview on various proposals can be found in, e.g., Jondeau et al. [23].

Here we choose a slightly different way. In light of the discussion in Sect. 2 we assume that the portfolio return of the investor at the end of the investment period can be well approximated by a log-normal distribution, which is characterized by two parameters \(\alpha \in I\!\!R\) and \(\beta > 0\). We briefly write \(Z \sim \ln \mathcal {N}\left( \alpha ,\beta ^2\right) \) to denote that a random variable Z is log-normally distributed. It holds that \(Z \sim \ln \mathcal {N}\left( \alpha ,\beta ^2\right) \) if and only if \(\ln \, Z \sim \mathcal {N}\left( \alpha ,\beta ^2\right) \) with the density given by

$$\begin{aligned} f(z; \alpha , \beta ) = \frac{1}{z} \; \frac{1}{\sqrt{2\pi } \beta } \; exp( - \frac{(ln \, z - \alpha )^2}{2 \beta ^2} ) \; , \quad z > 0, \end{aligned}$$

and 0 otherwise. This distribution has a nice property that its moments can be easily derived by

$$\begin{aligned} E\left( Z^\tau \right) =\exp \left( \alpha \tau +\frac{1}{2}\beta ^2\tau ^2\right) , \quad \tau \in \mathbb {R} . \end{aligned}$$
(6)

The log-normal distribution has been applied in finance by several authors (see e.g., [25, 31]). If the portfolio return is assumed to be log-normally distributed, then its continuously compounded rate of return is normally distributed and vice versa. Moreover, doing so it is implicitly assumed that the investor’s wealth is positive.

4 Closed-form solutions of optimal portfolio choice problems

In this section, we present the analytical expressions of the optimal portfolio weights obtained for both the power utility and the logarithmic utility. Under the assumption that the portfolio gross return \(\varvec{\omega }^\prime \varvec{R}\) at the end of the investment period is approximately log-normally distributed, namely \(\varvec{\omega }^\prime \varvec{R}\sim \ln \mathcal {N}\left( \alpha ,\beta ^2\right) \), we get (see Lemma 2 in the Appendix) that

$$\begin{aligned} \alpha = \ln \, X^2 - \frac{1}{2} \ln \, (V + X^2 ) ~~ \text {and} ~~ \beta ^2 = \ln \, \left( 1 + \frac{V}{X^2}\right) , \end{aligned}$$

where

$$\begin{aligned} X=E(\varvec{\omega }^\prime \varvec{R})=\varvec{\omega }^\prime \varvec{\varvec{\mu }}~~ \text {and} ~~ V=Var(\varvec{\omega }^\prime \varvec{R})=\varvec{\omega }^\prime \varvec{\varvec{\Sigma }}\varvec{\omega }\end{aligned}$$
(7)

are the expected return and the variance of the portfolio with the weights \(\varvec{\omega }\). Now we are ready to formulate our main assumption, which justifies a good log-normal approximation.

Assumption 1

\(\mathbf {(A1)}\)   For any \(\varvec{\omega }\) it holds \(\lim \limits _{k\rightarrow \infty }\frac{\sqrt{V}}{X}\rightarrow 0\).

For example, if the portfolio gross return is bounded and the asset universe is large then it is easy to check that many classical portfolios, like global minimum variance portfolio and equally weighted portfolio satisfy Assumption \(\mathbf {(A1)}\) if the covariance matrix of the asset returns \(\varvec{\Sigma }\) is well behaved for increasing dimension k (its largest eigenvalue is uniformly bounded in k). Thus, one can expect that the log-normal approximation works better in case of large dimensional portfolios.

In the following we also use of the set of efficient constants, i.e. of three quantities which uniquely determine the location of the efficient frontier, the set of optimal portfolios obtained as solutions of Markowitz’s optimization problem, in the mean-variance space. They are given by [7]

$$\begin{aligned} R_{GMV}= \frac{\mathbf {1}^\prime \varvec{\Sigma }^{-1}\varvec{\mu }}{\mathbf {1}^\prime \varvec{\Sigma }^{-1}\mathbf {1}}, \quad V_{GMV}=\frac{1}{\mathbf {1}^\prime \varvec{\Sigma }^{-1}\mathbf {1}} , \quad \text {and} \quad s = \varvec{\mu }^\prime \varvec{Q} \varvec{\mu }\end{aligned}$$
(8)

with

$$\begin{aligned} \varvec{Q} = \varvec{\Sigma }^{-1}- \frac{\varvec{\Sigma }^{-1}\varvec{1} \varvec{1}^\prime \varvec{\Sigma }^{-1}}{\varvec{1}^\prime \varvec{\Sigma }^{-1}\varvec{1} }. \end{aligned}$$
(9)

Here, \(R_{GMV}\) denotes the expected portfolio return of a global minimum variance (GMV) portfolio (the portfolio with the smallest variane), \(V_{GMV}\) stands for its variance, and s represents the slope parameter of the efficient frontier which is the upper part of the parabola given by

$$\begin{aligned} (X-R_{GMV})^2=s(V-V_{GMV})\,. \end{aligned}$$
(10)

The whole parabola is known in the literature as a feasible set of optimal portfolios.

4.1 Analytical solution for the power utility

In Theorem 1 we present the closed-form solution of the optimal portfolio choice problem for the power utility function, i.e. the solution of (3) with \(U(\cdot )\) given in (4).

Theorem 1

Assume that \(\mathbf {(A1)}\) holds. If

$$\begin{aligned} \gamma \ge \gamma _{min}=2s+2\left( s(1+s)\frac{V_{GMV}}{R^2_{GMV}} +\sqrt{s(1+s)\left( 1+\frac{sV_{GMV}}{R^2_{GMV}}\right) \left( 1+(1+s)\frac{V_{GMV}}{R^2_{GMV}}\right) }\right) ,\nonumber \\ \end{aligned}$$
(11)

then the solution of the optimization problem (3) with the power utility function \(U(\cdot )\) from (4) is given by

$$\begin{aligned} \varvec{\omega }^*=\varvec{\Sigma }^{-1}\left[ \left( -\mathbf {1}+(\gamma +1)\frac{\varvec{\mu }}{X}\right) \frac{Y}{\gamma }-X\varvec{\mu }\right] \end{aligned}$$
(12)

with

$$\begin{aligned} X= & {} \frac{(\gamma +2)R_{GMV}- \sqrt{\mathcal {D}}}{2(1+s)}, \end{aligned}$$
(13)
$$\begin{aligned} Y= & {} \frac{\gamma }{s}(X R_{GMV}-R_{GMV}^2-sV_{GMV}), \end{aligned}$$
(14)
$$\begin{aligned} \mathcal {D}= & {} (\gamma +2)^2R_{GMV}^2-4(\gamma +1)(1+s)(R_{GMV}^2+sV_{GMV}) \,. \end{aligned}$$
(15)

Moreover, if \(\gamma =\gamma _{min}\) then \(\mathcal {D}=0\) and (13) simplifies to

$$\begin{aligned} X = \frac{(\gamma _{min}+2)R_{GMV}}{2(1+s)}. \end{aligned}$$

The proof of Theorem 1 is given in the appendix.

Remark 1

Theorem 1 reveals one very important fact: the solution for the power utility function not always exists. Indeed, to guarantee the existence we need to have \(\gamma \ge \gamma _{min}\), which implies that there exists a minimum level of risk aversion \(\gamma _{min}>0\) so that for all \(\gamma \ge \gamma _{min}\) the solution of the optimization problem (3) with the power utility function exists and is unique. Going through the proof of Theorem 1 one can justify that this condition is equivalent to a more technical one, i.e., \(\mathcal {D}\ge 0\) for all \(\gamma >0\).

The expected return of the optimal portfolio that maximizes the expected power utility is X, while its variance is equal to \(V=Y-X^2\). Moreover, the maximum value of the optimization problem (3) is given by

$$\begin{aligned} \max _{\{ \varvec{\omega }:\ \varvec{\omega }^\prime \mathbf {1}=1 \}} E\left[ U(W )\right] =\frac{W^{1-\gamma }_{0}}{1-\gamma }\exp \left[ (1-\gamma ^2)\ln X +\frac{1}{2}(\gamma ^2-\gamma )\ln Y\right] . \end{aligned}$$
(16)

It is remarkable that the optimal portfolio in the sense of maximizing the expected power utility function is located for any relative risk aversion coefficient \(\gamma \) on the set of feasible portfolios, i.e., on the parabola (10). This result is formulated as Corollary 1 with the proof provided in the Appendix.

Corollary 1

Under the assumptions of Theorem 1, it holds that

$$\begin{aligned} \varvec{\omega }^*=\varvec{\omega }_{GMV}+\frac{X-R_{GMV}}{s}\varvec{Q}\varvec{\mu }\end{aligned}$$
(17)

where

$$\begin{aligned} \varvec{\omega }_{GMV}=\frac{\varvec{\Sigma }^{-1}\mathbf {1}}{\mathbf {1}^\prime \varvec{\Sigma }^{-1}\mathbf {1}}, \end{aligned}$$
(18)

are the weights of the global minimum variance portfolio; \(R_{GMV}\) and s are given in (8) and (9); X is the expected return of the optimal portfolio that maximizes the expected power utility as provided in (13).

The expression of optimal portfolio weights (17) coincides with the formula for weights obtained as a solution of Markowitz’s portfolio selection problem. Moreover, the results of Corollary 1 are used to derive conditions on the relative risk aversion coefficient \(\gamma \) which ensure that the obtained portfolio (17) lies in the upper part of the parabola (10) and, hence, the optimal portfolio that maximizes the expected power utility function is mean-variance efficient. This finding is summarized in Theorem 2 with the proof moved to the Appendix.

Theorem 2

Under the conditions of Theorem 1, the optimal portfolio in the sense of maximizing the expected power utility function is mean-variance efficient if and only if

$$\begin{aligned}&\gamma \ge \gamma _{min} \quad \text {and} \quad R_{GMV}>0\,. \end{aligned}$$
(19)

The result of Theorem 2 provides the rigorous mathematical proof of Markowitz’s conjecture that the mean-variance analysis provides a very good proxy to the utility optimization problem with the power utility in the sense that it sophistically approximates its solution (see e.g., [24, 30]). This result is also in-line with the finding of [19]. In the case of the power utility function, Corollary 1 shows that both approaches lead to the same set of optimal portfolios, while Theorem 2 makes one step further and presents the conditions under which the mean-variance efficiency holds for the optimal portfolios obtained by maximizing the expected power utility.

It is remarkable that the optimal portfolio in the sense of maximizing the expected power utility will never coincide with the global minimum variance portfolio (GMV). This observation follows from the fact that if \(X=R_{GMV}\), then from Theorem 1 we get

$$\begin{aligned} Y=\frac{\gamma }{s}(X R_{GMV}-R_{GMV}^2-sV_{GMV})={-\gamma V_{GMV}}<0, \end{aligned}$$

what is not possible since \(Y=E\left[ (\varvec{\omega }^\prime \varvec{R})^2\right] \ge 0\).

The further interesting property of the optimal portfolio in the sense of maximizing the expected power utility is formulated in Corollary 2. Here, we prove that it converges to the optimal portfolio that maximizes the Sharpe ratio, another popular portfolio in the financial literature when \(\gamma \rightarrow \infty \).

Corollary 2

Suppose that the assumptions of Theorem 2 are satisfied.

If \(\gamma \) tends to infinity, then the optimal portfolio that maximizes the expected power utility converges to the Sharpe ratio portfolio with the weights:

$$\begin{aligned} \varvec{\omega }^*=\frac{\varvec{\Sigma }^{-1}\varvec{\mu }}{\mathbf {1}^\prime \varvec{\Sigma }^{-1}\varvec{\mu }}. \end{aligned}$$
(20)

Remark 2

Note that the Sharpe ratio portfolio presented in (20) is slightly different from a classical one because the mean vector \(\varvec{\mu }\) is not equal to expected value of the asset returns \(\text {E}(\mathbf {r})\) but to \(\text {E}(\mathbf {R})=\text {E}(\mathbf {r})+\mathbf {1}\). This implies a very interesting fact that although the expected power utility portfolio can not be exactly equal to the GMV portfolio they are very close to each other in some situations. Indeed, if \(\text {E}(\mathbf {r})\) is close to zero then the weights presented in (20) are close to the GMV portfolio from (18). This is natural because the GMV portfolio plays the role of the least risky portfolio in the mean-variance framework.

Next, Corollary 3 proves that the expected return of the optimal portfolio in the sense of maximizing the expected power utility function is not smaller than the expected return of the Sharpe ratio portfolio.

Corollary 3

Suppose that the assumptions of Theorem 2 are satisfied and \(R_{GMV}> 0\). Then the expected return of the optimal portfolio that maximizes the expected power utility is larger than or equal to the expected return of the Sharpe ratio portfolio, i.e.

$$\begin{aligned} X \ge \frac{\varvec{\mu }^\prime \varvec{\Sigma }^{-1}\varvec{\mu }}{\mathbf {1}^\prime \varvec{\Sigma }^{-1}\varvec{\mu }}\,. \end{aligned}$$
(21)

The other interesting statement is related to the coefficient of relative risk aversion. It is natural to conclude that if \(\gamma \) increases (what means the less risky investor) we can expect the decrease of the variance of the portfolio return. So as the result we present described fact in a Corollary 4. Besides, we have also received that the expected portfolio return decreases as well.

Corollary 4

Suppose that the assumptions of Theorem 2 are satisfied and \(R_{GMV}> 0\). Then the expected return and the variance of the optimal portfolio that maximizes the expected power utility are decreasing functions of \(\gamma \).

4.2 Analytical solution for the logarithmic utility

In Theorem 3 we present the closed-form solution of the optimization problem (3) for the logarithmic utility given in (5). The proof of the theorem is given in the Appendix.

Theorem 3

Assume that \(\mathbf {(A1)}\) holds. If

$$\begin{aligned} \gamma _{min}\le 1\,, \end{aligned}$$
(22)

where \(\gamma _{min}\) is defined in (11), then the solution of the optimization problem (3) with the logarithmic utility function \(U(\cdot )\) as in (5) is given by

$$\begin{aligned} \varvec{\omega }^*=\varvec{\Sigma }^{-1}\left[ \left( -\mathbf {1}+2\frac{{\varvec{\mu }} }{X}\right) Y-X{\varvec{\mu }} \right] \end{aligned}$$
(23)

with

$$\begin{aligned} X= & {} \frac{3R_{GMV}- \sqrt{\mathcal {D}}}{2(1+s)} \quad \text {and} \quad Y=\frac{X R_{GMV}-R_{GMV}^2}{s} -V_{GMV},\nonumber \\ \mathcal {D}= & {} 9R_{GMV}^2-8(1+s)(R_{GMV}^2+sV_{GMV})\,. \end{aligned}$$
(24)

The expected return of the optimal portfolio in the sense of maximizing the expected logarithmic utility is X and its variance is given by \(V=Y-X^2\). Using the results of Theorem 1, we also obtain the maximum value of the optimization problem (3) for the logarithmic utility given by

$$\begin{aligned} \max _{\varvec{\omega }^*}E\left[ U(W )\right] =\ln W_0+2\ln X -\frac{1}{2}\ln Y. \end{aligned}$$
(25)

The expression of the weights obtained for the logarithmic utility function is a special case of the optimal portfolio weights derived for the power utility function corresponding to \(\gamma =1\). This finding is not surprising since the logarithmic utility function is a limiting case of the power utility function when \(\gamma \rightarrow 1\). Although the power utility cannot be defined for \(\gamma =1\) and only its limiting behaviour is considered, that is no longer the case with the formula for the weights presented in Theorem 1 which can also be computed for \(\gamma =1\) as soon as the condition (11) is fulfilled which coincides with (22) of Theorem 3.

Another important application of the results given in Theorems 1 and 3 is that the weights of the optimal portfolio in the sense of maximizing the expected logarithmic utility function can also be presented in the form of Markowitz’s portfolio, similarly to the expression for the weights obtained for the power utility function and they are given by

$$\begin{aligned} \varvec{\omega }^*=\varvec{\omega }_{GMV}+\frac{X-R_{GMV}}{s}\varvec{Q}\varvec{\mu }\end{aligned}$$
(26)

with X as in (24), i.e. the optimal portfolio that maximizes the expected logarithmic utility function also lies on parabola (10). Finally, Theorem 4 presents conditions under which this portfolio is mean-variance efficient, i.e. it lies on the upper part of the parabola.

Theorem 4

Under the conditions of Theorem 1, the optimal portfolio in the sense of maximizing the expected logarithmic utility is mean-variance efficient if and only if

$$\begin{aligned} \gamma _{min}\le 1 \quad \text {and} \quad R_{GMV}>0\,. \end{aligned}$$
(27)

4.3 Extension to the log-skew-normal distribution

In this section, we extend the previous results to a larger class of distributions, the so-called log-skew-normal distributions. The main feature of this family of distribution is the ability to model negative skewness that is usually observed in the behaviour of asset returns as well as of portfolio returns.

A random variable Z is said to have a log-skew-normal distribution denoted by \(Z \sim \ln \mathcal {SN}(\alpha ,\beta ^2,\nu ,\omega ^2)\) if and only if \(\ln Z\) has a skew-normal distribution denoted by \(\tilde{Z}=\ln Z \sim \mathcal {SN}(\alpha ,\beta ^2,\nu ,\omega ^2)\) and defined by its moment generating function given by (see González-Farías et al. [18], Lemma 1)

$$\begin{aligned} M_{\tilde{Z}} (t)=2\Phi (\tilde{\nu } t)\exp \left( \alpha t +\frac{1}{2}\beta ^2t^2 \right) , \end{aligned}$$
(28)

where \(\tilde{\nu }=\frac{\beta ^2\nu }{\sqrt{\omega ^2+\nu ^2\beta ^2}}\) and \(\Phi (.)\) denotes the cumulative distribution function of the standard normal distribution.

This class of distribution possesses several nice properties, which appear to be very useful in modeling the stochastic behaviour of the asset returns and the portfolio returns. First, the model parameter \(\nu \) is used to model the skewness in the returns. In contrast to the log-normal distribution, the negative values of skewness, which are usually present in the return series, can be well captured by the new family of distributions by assuming \(\nu <0\). If \(\nu =0\), then the log-skew-normal distribution becomes the log-normal distribution. Furthermore, the considered class of skew-normal distributions is closed with respect to linear transformations (see González-Farías et al. [18], Theorem 1).

The application of (28) leads to

$$\begin{aligned} E\left( Z^\tau \right)= & {} 2\Phi (\tilde{\nu } \tau )\exp \left( \alpha \tau +\frac{1}{2}\beta ^2\tau ^2\right) , \nonumber \\\approx & {} \exp \left( (\alpha +0.717\tilde{\nu })\tau +(0.5\beta ^2-0.416\tilde{\nu }^2)\tau ^2\right) , \quad \tau \in \mathbb {R} \end{aligned}$$
(29)

where the last equality is obtained using that \(\nu <0\) corresponding to the negative skewness and by the application of the following approximation (see [2, 22, 27])

$$\begin{aligned} 1-\Phi (x)\approx 0.5\exp \left( -0.717x-0.416x^2\right) , \quad \text{ for } \quad x>0. \end{aligned}$$

In using that \(0.5\beta ^2-0.416\tilde{\nu }^2>0\), we get that (29) can be rewritten by

$$\begin{aligned} E\left( Z^\tau \right) \approx \exp \left( \tilde{\alpha }\tau +0.5 \tilde{\beta }^2\tau ^2\right) , \quad \tau \in \mathbb {R} \end{aligned}$$
(30)

with \(\tilde{\alpha }=\alpha +0.717\tilde{\nu }\) and \(\tilde{\beta }^2=\beta ^2-2\cdot 0.416\tilde{\nu }^2\), which is the same expression as (6) obtained for the log-normal distribution.

Following the proof of Theorem 1 we get the following expression for the optimal portfolio weights that approximately optimize the power utility function under the assumption that the portfolio return follows a log-skew-normal distribution:

$$\begin{aligned} \varvec{\omega }^*(\nu )=\varvec{\Sigma }(\nu ) ^{-1}\left[ \left( -\mathbf {1}+(\gamma +1)\frac{\varvec{\mu }(\nu )}{X(\nu )}\right) \frac{Y(\nu )}{\gamma }-X(\nu )\varvec{\mu }(\nu )\right] \end{aligned}$$
(31)

where \(X(\nu )\) and \(Y(\nu )\) are given in (13) and (14) with \(\varvec{\mu }(\nu )\) and \(\varvec{\Sigma }(\nu )\) instead of \(\varvec{\mu }\) and \(\varvec{\Sigma }\). Although the expression of the optimal portfolio weights obtained by maximizing the power utility function under the assumption of a log-skew-normal distribution is the same as one derived in (12) assuming a log-normal distribution, it has to be noted that the mean vector \(\varvec{\mu }(\nu )\) and the covariance matrix \(\varvec{\Sigma }(\nu )\) are model specific. Only in the special case of \(\nu =0\), the solution obtained under the log-skew-normal model will coincide with the one obtained under the log-normal distribution.

5 Empirical study

This section aims to show how the above results can be applied in practice. We will analyze in detail as well whether the demanded assumptions are fulfilled in practice or not.

Our empirical study is based on stocks listed in the German stock index (DAX). We select 17 stocks from the DAX (ADS, ALV, BAYN, BMW, CBK, DAI, DBK, DPW, DTE, HEI, IFX, LHA, LIN, SAP, SIE, TKA, VOW3) and study their behaviour during the time period from September 21, 2014 up to September 17, 2017. Our analysis is based on weekly returns. Consequently, we have 157 observations for each stock.

Based on these 17 stocks we construct portfolios by randomly sampling \(k \in \{4,...,14\}\) assets. Thus, for each k the number of different sets of assets is equal to \(\left( {\begin{array}{c}17\\ k\end{array}}\right) \). From every such set, we determine the optimal portfolios in the sense of maximizing the expected power (logarithmic) utility. The parameters \(\varvec{\mu }\) and \(\varvec{\Sigma }\) are estimated for each portfolio constellation by the sample mean vector and the sample covariance matrix. Moreover, without loss of generality, we set \(W_0 = 1\).

5.1 Validating of model assumption and conditions of theorems

The findings of Theorems 1 and 3 are obtained under certain assumptions. In the following, we analyze whether these conditions are reasonable for the underlying data set.

Table 1 \(25\%\) quantile of the p-values of the log-normality test for several combinations of k and \(\gamma \)

First, we validate the assumption of log-normally distributed portfolio returns. To test the hypothesis of log-normality, we apply the Shapiro-Wilk test to the logarithmized returns of optimal portfolios that maximize the expected power utility functions for the chosen set of assets. In Table 1 the 25% quantiles of the p-values are provided for several numbers of assets k within the portfolio and several values of the relative risk aversion coefficient \(\gamma \). We remind again that for each fixed k the number of optimal portfolios under consideration is \(\left( {\begin{array}{c}17\\ k\end{array}}\right) \). Table 1 shows that more than 75% of the p-values are always considerably larger than the possible nominal levels of the test, like 5% in our case. Consequently, in at least of 75% of all considered cases, the null hypothesis that the returns of optimal portfolios in the sense of maximizing the expected power utility are log-normally distributed cannot be rejected. For example, choosing \(k=12\) and \(\gamma = 5\) we receive that 75% of p-values are greater than 0.26. Moreover, with an increasing number of assets, the p-values are monotonically increasing as well. Hence, this gives rise to the conclusion that the log-normality assumption seems to be reasonable in many practical situations. Such a result is in line with the portfolio diversification theory, i.e. adding assets into the portfolio will reduce the portfolio variance and thus, the ratio between the portfolio mean and the portfolio variance might increases indicating a better approximation of the log-normal distribution by the normal one. This finding can also be used as a motivation for the validity of Assumption \(\mathbf {(A1)}\) in Sect. 4, namely a better approximation is present for larger portfolio size.

Fig. 2
figure 2

Relative number of cases when the condition \(\gamma \ge \gamma _{min}\) of Theorem 1 (above) and the condition (19) of Theorem 2 (below) are not fulfilled plotted as a function of \(\gamma \) for several number of assets \(k \in \{4,5,...,14\}\)

Fig. 3
figure 3

Optimal portfolios for \(\gamma \in \{1, 2, 5\}\) and the Sharpe ratio portfolio located on the efficient frontier for a number of assets \(k\in \{4, 7, 10, 14\}\)

Fig. 4
figure 4

Empirical distribution functions of the power utility for several portfolio strategies (naive portfolio, Sharpe portfolio, optimal portfolio in the sense of maximizing the expected power utility) for \(k=9\) assets

Next, we analyze whether the condition (11) of Theorem 1 is fulfilled in the considered application. Our results are summarized in Fig.  2 (above). Here the percentage of all considered cases are present for several values of the number of assets k in the portfolio and of the relative risk aversion \(\gamma \) when the condition \(\{ \gamma \ge \gamma _{min}\}\) is not fulfilled. The dashed vertical lines represent the values of \(\gamma _{min}(k)\) for every number of assets k. We observe that with decreasing k and increasing \(\gamma \) this condition is almost always satisfied and the number of inappropriate cases goes to zero. For example, the probability of having \(\gamma <\gamma _{min}\) for k assets is virtually zero if \(\gamma >\gamma _{min}(k)\) for all considered values of \(k\in \{4, 5, \ldots , 14\}\). This is in line with the theoretical findings of Theorem 1. Moreover, because \(\gamma _{min}(k)\) is always smaller than one, we can conclude due to Theorem 3 that the solution for the logarithmic portfolios exists as well. Hence, the condition (11) is realistic for both small or large portfolios and the relatively large values of the risk aversion coefficient, at the same time the optimal portfolio that maximizes the expected power utility exists with a small probability if \(\gamma \) is close to zero.

Finally, we analyze the condition for the mean-variance efficiency of the optimal portfolio that maximizes the expected power utility which is stated in Theorem 2. The results are presented in Fig. 2 (below). Here we proceed in the same way as above for Theorem 1. The second expression in (19) supplies the classical mean-variance efficiency in sense that the accumulated expected portfolio return of GMV portfolio, i.e., \(R_{GMV}=\frac{\mathbf {1}^\prime \varvec{\Sigma }^{-1}\varvec{\mu }}{\mathbf {1}^\prime \varvec{\Sigma }^{-1}\mathbf {1}}=\frac{\mathbf {1}^\prime \varvec{\Sigma }^{-1}\text {E}(\mathbf {r})}{\mathbf {1}^\prime \varvec{\Sigma }^{-1}\mathbf {1}}+1\) must be always positive. As we can observe, the related results look the same as the upper part of Fig. 2, which means that the first expression of (19) seems to be a stronger restriction than the second one for the considered data set. Finally, Pennacchi [34], p.21 points out that the empirical evidence on the size of individuals relative risk aversions indicates that this coefficient is usually larger than 1 in practical applications and, as such, the values of \(\gamma _{min}\) smaller than one in Fig. 2 show that the derived optimal portfolio strategy can be applied for all considered portfolio sizes in practice.

5.2 Location of the optimal portfolio that maximizes the expected power utility function on the efficient frontier

The other part of the empirical study presented by Fig. 3 is related to the location of the optimal portfolios. As it was shown in (13) and (14) the mean and the variance of the optimal portfolio depends on three parameters: \(\varvec{\mu }, \varvec{\Sigma }, \gamma \). The mean vector and the covariance matrix can be estimated and in our example, we consider four possible number of assets \(k\in \{4, 7, 10, 14\}\). We choose the first k assets in the alphabetic order. Then we build efficient frontier for each combination of assets where we add the location of the optimal portfolios for \(\gamma \in \{1, 2, 5\}\) and the Sharpe ratio portfolio. Like it was proved in Corollary 4 we receive the decrease of the variance and the mean with the increase of a risk aversion coefficient and optimal portfolio tends to the Sharpe ratio portfolio as it was shown in Corollary 2. Also, we receive the increase of both mean and variance of portfolio returns with a larger amount of assets in the portfolio.

5.3 A comparison of several portfolio strategies

Finally, we compare the power utilities of several portfolio strategies with each other. We examine the naive portfolio with equal weights, the Sharpe ratio portfolio, and the optimal portfolio derived in Theorem 1. Note that the GMV portfolio was excluded from the study because it was too close to the Sharpe ratio portfolio for a considered data set (see, Remark 2). In Fig. 4 the empirical distribution functions of the power utility obtained for each of the considered methods are given for the portfolios consisting of \(k=9\) assets and for various values of \(\gamma \). The figure shows that the best results are obtained for the strategy derived in Theorem 1 as expected. Besides, as another visualisation of Corollary 2 result we have that the utility of the Sharpe ratio portfolio nearly coincides with the derived one in Theorem 1 as the coefficient of a relative risk aversion increases. Finally, very bad performance of the equally-weighted (naive) portfolio is observed.