## Abstract

A toolset beyond mean–variance portfolio optimization is appropriate for those instances where higher return moments might need to be taken into account, either for individual decisions or for pricing studies. Maximizing expected log surplus utility is superior for compounding returns in excess of financial obligations. Here, it is matched with a more flexible scenario representation of the investor’s joint probability distribution of returns and with an agnostic optimization engine. We show simple examples based on extrapolating historical stock and bond returns and then extended using hypothetical option prices. We clarify how Black–Scholes implied volatility anomalies can arise in a portfolio context.

This is a preview of subscription content, log in to check access.

## References

Adler, Timothy, and Mark Kritzman. 2007. Mean–Variance Versus Full-Scale Optimization: In and Out of Sample.

*Journal of Asset Management*7: 301–311.Black, Fischer, and Myron Scholes. 1973. The Pricing of Options and Corporate Liabilities.

*Journal of Political Economy*81: 637–654.Breiman, L. 1962.

*Optimal Gambling Systems For Favorable Games*. Institute of Engineering Research Series, Electronics Research Laboratory, University of California, Berkeley, California, Series No 60, Issue 467.Cover, Thomas M. 1991. Universal Portfolios.

*Mathematical Finance*1: 1–29.Derman, Emanuel, and Michael B. Miller. 2016.

*The Volatility Smile*. Hoboken, NJ: Wiley.diBartolomeo, Dan. 2019. Portfolio Risk and Optimization Inclusive of Higher Moments.

*Northfield Information Systems newsletter*, October: 1.Hakansson, N. 1970. Optimal Investment and Consumption Strategies Under Risk for a Class of Utility Functions.

*Econometrica*38: 587–607.Hakansson, N. 1971. Multi-Period Mean-Variance Analysis: Toward a General Theory of Portfolio Choice.

*Journal of Finance*26: 854–884.Harvey, C.R., J.C. Liechty, M.W. Liechty, and P. Müller. 2002. Portfolio Selection with Higher Moments.

*Working Paper.*Duke University.Kelly, J.L. 1956. A New Interpretation of Information Rate.

*Bell System Technical Journal*35: 917–926.Kroll, Yoram, Haim Levy, and Harry Markowitz. 1984. Mean-Variance Versus Direct Utility Maximization.

*Journal of Finance*39: 47–61.MacLean, L.C., and W.T. Ziemba. 1991. Growth-Security Profiles in Capital Accumulation Under Risk.

*Annals of Operations Research*31: 501–509.Markowitz, Harry. 1952. Portfolio Selection.

*Journal of Finance*7: 77–91.Michaud, R. 1989. The Markowitz Optimization Enigma: Is `Optimized’ Optimal?

*Financial Analysts Journal*45: 31–42.Rubinstein, Mark. 1976. The Strong Case for the Generalized Logarithmic Utility Model as the Premier Model of Financial Markets.

*Journal of Finance*31: 551–571.Wilcox, Jarrod. 2003. Harry Markowitz and the Discretionary Wealth Hypothesis.

*Journal of Portfolio Management*29: 58–65.

## Author information

### Affiliations

### Corresponding author

## Additional information

### Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Appendices

### Appendix 1

### Why mean–variance is not generally consistent with expected utility?

Consider both *U*(*x*), a von Neumann–Morgenstern utility function of investment outcome *x*, and also *P*(*x*), an outcome probability distribution. Expected utility then is the sum of *P*(*x*)*U*(*x*). For mean–variance optimization to be precisely equivalent, maximizing mean(*P*(*x*)) − *L**variance(*P*(*x*))/2 where *L* is a risk aversion parameter would have to give the same allocation result as maximizing Σ*P*(*x*)*U*(*x*). This is much more difficult requirement than apparent at first glance.

Consider the components *P*(*x*) and *U*(*x*) individually. For each, we have arguments that they alone could justify the mean–variance approach as fully compatible with expected utility maximization. Though the contrary suggestion is often repeated in the finance literature, one cannot take for granted that the approximation succeeds perfectly if *P*(*x*) is a normal distribution, because that eliminates the third return moment but not the fourth moment. The other justification, a quadratic utility *U*(*x*), would imply that utility goes down increasingly rapidly with increasing wealth once *U*(*x*) reaches a maximum, contrary to rational utility assumptions unless we don a blindfold to the possibility that utility might exceed that amount. These observations do not make the task impossible, but they narrow the possibility to non-obvious special cases.

### Maximizing medians in the long term

For long-term compounded risky results, the eventual wealth distribution approximates a log normal distribution, with a tiny probability of huge returns. For a finite-lived investor, this is less relevant than median results. Fortunately, growth optimal policies, if the generalized central limit theorem (GCLT) can be applied, eventually maximize median terminal wealth. The GCLT covers cases where variance varies and where simple forms of autocorrelation are permitted. Then:

- 1.
Maximizing the expected log return each period maximizes its sum over multiple periods,

- 2.
The sum of log returns converges to a normal distribution,

- 3.
Which is symmetric, with maximized mean equal to maximized median.

- 4.
Taking the antilog preserves rank order,

- 5.
Maximizing median terminal wealth.

In the text, this applies to terminal surplus, which, if financial obligations were to remain constant, maximizes terminal wealth.

### Taylor series analysis behind Eq. 1

In mathematics, many smooth functions can be expressed as an infinite series of polynomials of increasing powers of distances from some local point. We can often get a good approximation from the first few terms.

A Taylor series around the point *r *= *E*, where *r* is return and *E* is the expected return, looks like:

where *f*^{n}(*E*) is the nth-order derivative of the function evaluated at *E* and *n*! is 1 times 2 … times *n*.

We consider 0! = 1 and *f*^{0} (E) = *f*(*E*).

Let *f*(*r*) = ln(1 + *r*). We want to estimate it around the location *E*.

Then the zeroth term is ln(1 + *E*).

The first-order term is (*r *− *E*)/(1 + *E*).

Then deriving the successive-order derivatives, and canceling offsetting values, we get the following.

Taking the expected value of ln(1 + *r*), the first-order term is zeroed out, and we get

where *r* is return, *E* is expected *r*, *V*(*r*) is variance of *r*, *S*(*r*) is skewness of *r*, and *V*(*r*) is kurtosis of *r*.

Now, since we actually want expected return on surplus rather than expected return on investments, substitute *LE* for *E*, where *L* is the risk aversion parameter, and substitute *LV*^{1/2} for *V*^{1/2}. Noticing that a common factor in all but the zeroth term is *Q *= *LV*^{1/2}/(1 + *LE*), we finally arrive at

Note that Q determines the relative size of the impact of skewness and kurtosis. If it is too large, the series will not converge to a finite sum. A portfolio characterized by such a Taylor series should be avoided!

### Working Python code

Appendix 2, available for download here, contains a working Python 3.7 program for calculating optimal allocations starting with a comma delimited (CSV) file with its top row containing ticker labels and successive rows showing returns for each ticker symbol.

### Appendix 2

## Rights and permissions

## About this article

### Cite this article

Wilcox, J. Better portfolios with higher moments.
*J Asset Manag* (2020). https://doi.org/10.1057/s41260-020-00170-5

Revised:

Published:

### Keywords

- Asset allocation
- Portfolio optimization
- Higher moments
- Skewness
- Kurtosis
- Scenarios