Lecture Notes Munk

This document discusses dynamic asset allocation. It contains chapters on preferences, one-period models, discrete-time multi-period models, continuous-time modeling, asset allocation with constant investment opportunities, stochastic investment opportunities, and the martingale approach. The document contains graphs and is intended to be printed in color.

Uploaded by

Raul Pefaur
Copyright
© Attribution Non-Commercial (BY-NC)

Dynamic Asset Allocation

Claus Munk
Copenhagen Business School
e-mail: cm.fi@cbs.dk
this version: September 18, 2012
The document contains graphs in color; use a color printer for best results.
Contents
Preface
1 Introduction to asset allocation
1.1 Introduction
1.2 Investor classes and motives for investments
1.3 Typical investment advice
1.4 How do individuals allocate their wealth?
1.5 An overview of the theory of optimal investments
1.6 The future of investment management and services
1.7 Outline of the rest
1.8 Notation
2 Preferences
2.1 Introduction
2.2 Consumption plans and preference relations
2.3 Utility indices
2.4 Expected utility representation of preferences
2.5 Risk aversion
2.6 Utility functions in models and in reality
2.7 Preferences for multi-date consumption plans
2.8 Exercises
3 One-period models
3.1 Introduction
3.2 The general one-period model
3.3 Mean-variance analysis
3.4 A numerical example
3.5 Mean-variance analysis with constraints
3.6 Estimation
3.7 Critique of the one-period framework
3.8 Exercises
4 Discrete-time multi-period models
4.1 Introduction
4.2 A multi-period, discrete-time framework for asset allocation
4.3 Dynamic programming in discrete-time models
5 Introduction to continuous-time modeling
5.1 Introduction
5.2 The basic continuous-time setting
5.3 Dynamic programming in continuous-time models
5.4 Loss from suboptimal strategies
5.5 Exercises
6 Asset allocation with constant investment opportunities
6.1 Introduction
6.2 General utility function
6.3 CRRA utility function
6.4 Logarithmic utility
6.5 Discussion of the optimal investment strategy for CRRA utility
6.6 The life-cycle
6.7 Loss due to suboptimal investments
6.8 Infrequent rebalancing of the portfolio
6.9 Exercises
7 Stochastic investment opportunities: the general case
7.1 Introduction
7.2 General utility functions
7.3 CRRA utility
7.4 Logarithmic utility
7.5 How costly are deviations from the optimal investment strategy?
7.6 Exercises
8 The martingale approach
8.1 The martingale approach in complete markets
8.2 Complete markets and constant investment opportunities
8.3 Complete markets and stochastic investment opportunities
8.4 The martingale approach with portfolio constraints
8.5 Exercises
9 Numerical methods for solving dynamic asset allocation problems
10 Asset allocation with stochastic interest rates
10.1 Introduction
10.2 One-factor Vasicek interest rate dynamics
10.3 One-factor CIR dynamics
10.4 A numerical example
10.5 Two-factor Vasicek model
10.6 Other studies with stochastic interest rates
10.7 Exercises
11 Asset allocation with stochastic market prices of risk
11.1 Introduction
11.2 Mean reversion in stock returns
11.3 Stochastic volatility
11.4 More
11.5 Exercises
12 Inflation risk and asset allocation with no risk-free asset
12.1 Introduction
12.2 Real and nominal price dynamics
12.3 Constant investment opportunities
12.4 General stochastic investment opportunities
12.5 Hedging real interest rate risk without real bonds
13 Labor income
13.1 Introduction
13.2 A motivating example
13.3 Exogenous income in a complete market
13.4 Exogenous income in incomplete markets
13.5 Endogenous labor supply and income
13.6 More
14 Consumption and portfolio choice with housing
15 Other variations of the problem...
15.1 Multiple and/or durable consumption goods
15.2 Uncertain time of death; insurance
16 International asset allocation
17 Non-standard assumptions on investors
17.1 Preferences with habit formation
17.2 Recursive utility
17.3 Model/parameter uncertainty, incomplete information, learning
17.4 Ambiguity aversion
17.5 Other objective functions
17.6 Consumption and portfolio choice for non-price takers
17.7 Non-utility based portfolio choice
17.8 Allowing for bankruptcy
18 Trading and information imperfections
18.1 Trading constraints
18.2 Transaction costs
A Results on the lognormal distribution
B Stochastic processes and stochastic calculus
B.1 Introduction
B.2 What is a stochastic process?
B.3 Brownian motions
B.4 Diffusion processes
B.5 Itô processes
B.6 Stochastic integrals
B.7 Itô's Lemma
B.8 Important diffusion processes
B.9 Multi-dimensional processes
B.10 Change of probability measure
B.11 Exercises
C Solutions to Ordinary Differential Equations
References
Preface
INCOMPLETE!
Preliminary and incomplete lecture notes intended for use at an advanced master's level or an
introductory Ph.D. level. I appreciate comments and corrections from Simon Bonde, Kenneth
Brandborg, Jens Henrik Eggert Christensen, Heine Jepsen, Thomas Larsen, Jakob Nielsen, Nicolai
Nielsen, Kenneth Winther Pedersen, Carsten Sørensen, and in particular Linda Sandris Larsen.
Additional comments and suggestions are very welcome!
Claus Munk
Internet homepage: sites.google.com/site/munkfinance
CHAPTER 1
Introduction to asset allocation
1.1 Introduction
Financial markets offer opportunities to move money between different points in time and different
states of the world. Investors must decide how much to invest in the financial markets and
how to allocate that amount among the many available financial securities. Investors can
change their investments as time passes, and they will typically want to do so, for example, when
they obtain new information about the prospective returns on the financial securities. Hence, they
must figure out how to manage their portfolio over time. In other words, they must determine an
investment strategy or an asset allocation strategy. The term asset allocation is sometimes used for
the allocation of investments to major asset classes, e.g., stocks, bonds, and cash. In later chapters
we will often focus on this decision, but we will use the term asset allocation interchangeably with
the terms optimal investment or portfolio management.
It is intuitively clear that in order to determine the optimal investment strategy for an investor,
we must make some assumptions about the objectives of the investor and about the possible returns
on the financial markets. Different investors will have different motives for investments and hence
different objectives. In Section 1.2 we will discuss the motives and objectives of different types
of investors. We will focus on the asset allocation decisions of individual investors or households.
Individuals invest in the financial markets to finance future consumption from which they obtain
some felicity or utility. We discuss how to model the preferences of individuals in Chapter 2.
1.2 Investor classes and motives for investments
We can split the investors into individual investors (households; sometimes called retail investors)
and institutional investors (which include both financial intermediaries, such as pension funds, insurance
companies, mutual funds, and commercial banks, and manufacturing companies producing goods
or services). Different investors have different objectives. Manufacturing companies probably invest
mostly in short-term bonds and deposits in order to manage their liquidity needs and avoid the
deadweight costs of raising small amounts of capital very frequently. They will rarely set up long-term
strategies for investments in the financial markets, and their financial investments constitute
a very small part of the total investments.
Individuals can use their money either for consumption or savings. Here we use the term savings
synonymously with financial investments so that it includes both deposits in banks and investments
in stocks, bonds, and possibly other securities. Traditionally most individuals have saved in the form
of bank deposits and maybe government bonds, but in recent years individuals have shown an
increasing interest in investing in the stock market. Individuals typically save when they
are young by consuming less than the labor income they earn, primarily in order to accumulate
wealth they can use for consumption when they retire. Other motives for saving are to be able to
finance large future expenditures (e.g., purchase of real estate, support of children during their
education, expensive celebrations or vacations) or simply to build up a buffer for hard times
due to unemployment, disability, etc. We assume that the objective of an individual investor is
to maximize the utility of consumption throughout the lifetime of the investor. We will discuss
utility functions in Chapter 2.
A large part of the savings of individuals is indirect, through pension funds and mutual funds.
These funds are the major investors in today's markets. Some of these funds are non-profit funds
that are owned by the investors in the fund. The objective of such funds should represent the
objectives of the fund investors.
Let us look at pension funds. One could imagine a pension fund that determines the optimal
portfolio of each of the fund investors and aggregates over all investors to find the portfolio of
the fund. Each fund investor is then allocated the returns on her optimal portfolio, probably
net of some servicing fee. The purpose of forming the fund is then simply to save transaction
costs. A practical implementation of this is to let each investor allocate her funds among some
pre-selected portfolios, for example a portfolio mimicking the overall stock market index, various
portfolios of stocks in different industries, one or more portfolios of government bonds (e.g., one
in short-term and one in long-term bonds), portfolios of corporate bonds and mortgage-backed
bonds, portfolios of foreign stocks and bonds, and maybe also portfolios of derivative securities
and even non-financial portfolios of metals and real estate. Some pension funds operate in this way
and there seems to be a tendency for more and more pension funds to allow investor discretion
with regard to the way the deposits are invested.
However, in many pension funds some hired fund managers decide on the investment strategy.
Often all the deposits of different fund members are pooled together and then invested according
to a portfolio chosen by the fund managers (probably following some general guidelines set up by
the board of the fund). Once in a while the rate of return of the portfolio is determined and the
deposit of each investor is increased according to this rate of return less some servicing fee. In
many cases the returns on the portfolio of the fund are distributed to the fund members using more
complicated schemes. Rate of return guarantees, bonus accounts,.... The salary of the manager of
a fund is often linked to the return on the portfolio he chooses and some benchmark portfolio(s).
A rational manager will choose a portfolio that maximizes his utility and that portfolio choice may
be far from the optimal portfolio of the fund members....
Mutual funds...
These lecture notes focus on the decision problem of an individual investor and aim to analyze
and answer the following questions:
• What are the utility-maximizing dynamic consumption and investment strategies of an individual?
• What is the relation between optimal consumption and optimal investment?
• How are financial investments optimally allocated to different asset classes, e.g., stocks and bonds?
• How are financial investments optimally allocated to single securities within each asset class?
• How do the optimal consumption and investment strategies depend on, e.g., risk aversion, time horizon, initial wealth, labor income, and asset price dynamics?
• Are the recommendations of investment advisors consistent with the theory of optimal investments?
1.3 Typical investment advice
TO COME... References: Quinn (1997), Siegel (2002)
Concerning the value of analyst recommendations: Barber, Lehavy, McNichols, and Trueman
(2001), Jegadeesh and Kim (2006), Malmendier and Shanthikumar (2007), Elton and Gruber
(2000)
1.4 How do individuals allocate their wealth?
TO COME...
References: Friend and Blume (1975), Bodie and Crane (1997), Heaton and Lucas (2000),
Vissing-Jørgensen (2002), Ameriks and Zeldes (2004), Gomes and Michaelides (2005), Campbell
(2006), Calvet, Campbell, and Sodini (2007), Curcuru, Heaton, Lucas, and Moore (2009), Wachter
and Yogo (2010)
Christiansen, Joensen, and Rangvid (2008): differences due to education
Yang (2009): house owners vs. non-owners
1.5 An overview of the theory of optimal investments
TO COME...
1.6 The future of investment management and services
TO COME... References: Bodie (2003), Merton (2003)
1.7 Outline of the rest
1.8 Notation
Since we are going to deal simultaneously with many financial assets, it will often be mathematically
convenient to use vectors and matrices. All vectors are considered column vectors. The
superscript $\top$ on a vector or a matrix indicates that the vector or matrix is transposed. We will
use the notation $\mathbf{1}$ for a vector where all elements are equal to 1; the dimension of the vector will
be clear from the context. We will use the notation $e_i$ for a vector $(0, \dots, 0, 1, 0, \dots, 0)^\top$ where
the 1 is entry number $i$. Note that for two vectors $x = (x_1, \dots, x_d)^\top$ and $y = (y_1, \dots, y_d)^\top$ we
have $x^\top y = y^\top x = \sum_{i=1}^d x_i y_i$. In particular, $x^\top \mathbf{1} = \sum_{i=1}^d x_i$ and $e_i^\top x = x_i$. We also define
$\|x\|^2 = x^\top x = \sum_{i=1}^d x_i^2$.
If $x = (x_1, \dots, x_n)$ and $f$ is a real-valued function of $x$, then the (first-order) derivative of $f$
with respect to $x$ is the vector
\[
f'(x) \equiv f_x(x) = \left( \frac{\partial f}{\partial x_1}, \dots, \frac{\partial f}{\partial x_n} \right)^\top .
\]
This is also called the gradient of $f$. The second-order derivative of $f$ is the $n \times n$ Hessian matrix
\[
f''(x) \equiv f_{xx}(x) =
\begin{pmatrix}
\frac{\partial^2 f}{\partial x_1^2} & \frac{\partial^2 f}{\partial x_1 \partial x_2} & \cdots & \frac{\partial^2 f}{\partial x_1 \partial x_n} \\
\frac{\partial^2 f}{\partial x_2 \partial x_1} & \frac{\partial^2 f}{\partial x_2^2} & \cdots & \frac{\partial^2 f}{\partial x_2 \partial x_n} \\
\vdots & \vdots & \ddots & \vdots \\
\frac{\partial^2 f}{\partial x_n \partial x_1} & \frac{\partial^2 f}{\partial x_n \partial x_2} & \cdots & \frac{\partial^2 f}{\partial x_n^2}
\end{pmatrix} .
\]
If $x$ and $a$ are $n$-dimensional vectors, then
\[
\frac{\partial}{\partial x} \left( a^\top x \right) = \frac{\partial}{\partial x} \left( x^\top a \right) = a .
\]
If $x$ is an $n$-dimensional vector and $A$ is a symmetric [i.e., $A = A^\top$] $n \times n$ matrix, then
\[
\frac{\partial}{\partial x} \left( x^\top A x \right) = 2 A x .
\]
If $A$ is non-singular, then $(A A^\top)^{-1} = (A^\top)^{-1} A^{-1}$.
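The two differentiation rules above can be checked numerically. The following is a minimal sketch in pure Python; the matrix `A`, the vectors `a` and `x`, and the helper `numerical_gradient` (a central finite-difference approximation) are illustrative assumptions and do not come from the notes.

```python
def dot(x, y):
    """Inner product x^T y of two equally long vectors."""
    return sum(xi * yi for xi, yi in zip(x, y))


def matvec(A, x):
    """Matrix-vector product A x, with A stored as a list of rows."""
    return [dot(row, x) for row in A]


def numerical_gradient(f, x, h=1e-6):
    """Central finite-difference approximation of the gradient of f at x."""
    grad = []
    for i in range(len(x)):
        up = list(x)
        up[i] += h
        down = list(x)
        down[i] -= h
        grad.append((f(up) - f(down)) / (2 * h))
    return grad


# Illustrative inputs (assumptions, not taken from the notes).
A = [[2.0, 1.0, 0.0],
     [1.0, 3.0, 0.5],
     [0.0, 0.5, 1.0]]  # symmetric: A equals its transpose
a = [1.0, -2.0, 0.5]
x = [0.3, 1.2, -0.7]

# Gradient of the linear map x -> a^T x, which should be close to a.
grad_linear = numerical_gradient(lambda v: dot(a, v), x)

# Gradient of the quadratic form x -> x^T A x, which should be close to 2 A x.
grad_quadratic = numerical_gradient(lambda v: dot(v, matvec(A, v)), x)
two_Ax = [2 * value for value in matvec(A, x)]

print(grad_linear, a)
print(grad_quadratic, two_Ax)
```

The rule for the quadratic form relies on the symmetry of `A`; for a non-symmetric matrix the finite-difference gradient would instead approximate $(A + A^\top)x$.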
CHAPTER 2
Preferences
2.1 Introduction
In order to say anything concrete about the optimal investments of individuals we have to
formalize the decision problem faced by individuals. We assume that individuals have preferences
for consumption and must choose between different consumption plans, i.e., plans for how much to
consume at different points in time and in different states of the world. The financial market allows
individuals to reallocate consumption over time and over states and hence obtain a consumption
plan different from their endowment.
Although an individual will typically obtain utility from consumption at many different dates
(or in many different periods), we will first address the simpler case with consumption at only
one future point in time. In such a setting a consumption plan is simply a random variable
representing the consumption at that date. Even in one-period models individuals should be
allowed to consume both at the beginning of the period and at the end of the period, but we will
first ignore the influence of current consumption on the well-being of the individual. We do that
both because current consumption is certain, whereas we want to focus on how preferences for uncertain
consumption can be represented, and because it simplifies the notation and analysis somewhat. Since
we have in mind a one-period economy, we basically have to model preferences for end-of-period
consumption.
Sections 2.2-2.4 discuss how to represent individual preferences in a tractable way. We will
demonstrate that under some fundamental assumptions (axioms) on individual behavior, the
preferences can be modeled by a utility index which to each consumption plan assigns a real
number, with higher numbers assigned to the more preferred plans. Under an additional axiom we can
represent the preferences in terms of expected utility, which is even simpler to work with and used
in most models of financial economics. Section 2.5 defines and discusses the important concept
of risk aversion. Section 2.6 introduces the utility functions that are typically applied in models
of financial economics and provides a short discussion of which utility functions and levels of risk
aversion seem to be reasonable for representing the decisions of individuals. In Section 2.7
we discuss extensions to preferences for consumption at more than one point in time.
There is a large literature on how to model the preferences of individuals for uncertain outcomes
and the presentation here is by no means exhaustive. The literature dates back at least to the Swiss
mathematician Daniel Bernoulli in 1738 (see English translation in Bernoulli (1954)), but was put
on a firm formal footing by von Neumann and Morgenstern (1944). For some recent textbook
presentations on a similar level as the one given here, see Huang and Litzenberger (1988, Ch. 1),
Kreps (1990, Ch. 3), Gollier (2001, Chs. 1-3), and Danthine and Donaldson (2002, Ch. 2).
2.2 Consumption plans and preference relations
It seems fair to assume that whenever the individual compares two different consumption plans,
she will be able either to say that she prefers one of them to the other or to say that she is indifferent
between the two consumption plans. Moreover, she should make such pairwise comparisons in a
consistent way. For example, if she prefers plan 1 to plan 2 and plan 2 to plan 3, she should
prefer plan 1 to plan 3. If these properties hold, we can formally represent the preferences of the
individual by a so-called preference relation. A preference relation itself is not very tractable so
we are looking for simpler ways of representing preferences. First, we will find conditions under
which it makes sense to represent preferences by a so-called utility index which attaches a real
number to each consumption plan. If and only if plan 1 has a higher utility index than plan 2, the
individual prefers plan 1 to plan 2. Attaching numbers to each possible consumption plan is also not
easy so we look for an even simpler representation. We show that under an additional condition
we can represent preferences in an even simpler way in terms of the expected value of a utility
function. A utility function is a function defined on the set of possible levels of consumption. Since
consumption is random it then makes sense to talk about the expected utility of a consumption
plan. The individual will prefer consumption plan 1 to plan 2 if and only if the expected utility
from consumption plan 1 is higher than the expected utility from consumption plan 2. This
representation of preferences turns out to be very tractable and is applied in the vast majority of
asset pricing models.
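The expected-utility comparison described above can be illustrated with a small sketch. It assumes a hypothetical utility function $u(c) = \sqrt{c}$, made-up state probabilities, and two made-up consumption plans over three states; none of these choices come from the notes, which introduce specific utility functions only later.

```python
from math import sqrt


def expected_utility(probs, plan, u):
    """Expected utility: sum over states of probability times u(consumption)."""
    return sum(p * u(c) for p, c in zip(probs, plan))


# Illustrative assumptions: state probabilities and two consumption plans.
state_probs = [0.25, 0.25, 0.5]
plan_a = [4.0, 4.0, 4.0]  # same consumption in every state
plan_b = [1.0, 4.0, 9.0]  # consumption varies across states

eu_a = expected_utility(state_probs, plan_a, sqrt)
eu_b = expected_utility(state_probs, plan_b, sqrt)

# The plan with the higher expected utility is the preferred one.
preferred = "plan_b" if eu_b > eu_a else "plan_a"
print(eu_a, eu_b, preferred)
```

With a different utility function the ranking of the same two plans could change, which is why the choice of utility function matters for the analysis.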
Our main analysis is formulated under some simplifying assumptions that are not necessarily
appropriate. At the end of this section we will briefly discuss how to generalize the analysis and
also discuss the appropriateness of the axioms on individual behavior that need to be imposed in
order to obtain the expected utility representation.
We assume that there is uncertainty about how the variables affecting the well-being of an
individual (e.g., asset returns) turn out. We model the uncertainty by a probability space $(\Omega, \mathcal{F}, \mathbb{P})$.
In most of the chapter we will assume that the state space is finite, $\Omega = \{1, 2, \dots, S\}$, so that there
are $S$ possible states of which exactly one will be realized. For simplicity, think of this as a model
of a one-period economy with $S$ possible states at the end of the period. The set $\mathcal{F}$ of events that
can be assigned a probability is the collection of all subsets of $\Omega$. The probability measure $\mathbb{P}$ is
defined by the individual state probabilities $p_\omega = \mathbb{P}(\omega)$, $\omega = 1, 2, \dots, S$. We assume that all $p_\omega > 0$
and, of course, we have that $p_1 + \dots + p_S = 1$. We take the state probabilities as exogenously given
and known to the individuals.
Individuals care about their consumption. It seems reasonable to assume that when an individual
chooses between two different actions (e.g., portfolio choices), she only cares about the consumption
plans generated by these choices. For example, she will be indifferent between two choices that
generate exactly the same consumption plans, i.e., the same consumption levels in all states. In
order to simplify the following analysis, we will assume a bit more, namely that the individual
only cares about the probability distribution of consumption generated by each portfolio. This is
effectively an assumption of state-independent preferences.

state $\omega$               1     2     3
state prob. $p_\omega$       0.2   0.3   0.5
cons. plan 1, $c^{(1)}$      3     2     4
cons. plan 2, $c^{(2)}$      3     1     5
cons. plan 3, $c^{(3)}$      4     4     1
cons. plan 4, $c^{(4)}$      1     1     4
Table 2.1: The possible state-contingent consumption plans in the example.

We can represent a consumption plan by a random variable $c$ on $(\Omega, \mathcal{F}, \mathbb{P})$. We assume that
there is only one consumption good and since consumption should be non-negative, $c$ is valued in
$\mathbb{R}_+ = [0, \infty)$. As long as we are assuming a finite state space $\Omega = \{1, 2, \dots, S\}$ we can equivalently
represent the consumption plan by a vector $(c_1, \dots, c_S)$, where $c_\omega \in [0, \infty)$ denotes the consumption
level if state $\omega$ is realized, i.e., $c_\omega \equiv c(\omega)$. Let $\mathcal{C}$ denote the set of consumption plans that the
individual has to choose among. Let $Z \subseteq \mathbb{R}_+$ denote the set of all the possible levels of the
consumption plans that are considered, i.e., no matter which of these consumption plans we take,
its value will be in $Z$ no matter which state is realized. Each consumption plan $c \in \mathcal{C}$ is associated
with a probability distribution $\pi_c$, which is the function $\pi_c : Z \to [0, 1]$, given by
\[
\pi_c(z) = \sum_{\omega \in \Omega :\, c_\omega = z} p_\omega ,
\]
i.e., the sum of the probabilities of those states in which the consumption level equals $z$.
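The definition of $\pi_c$ can be sketched in a few lines of Python, using the state probabilities and consumption plans of Table 2.1 as data. The function name `distribution` and the dictionary layout are illustrative assumptions; exact fractions are used so that probabilities add up without rounding error.

```python
from fractions import Fraction

# State probabilities p_omega from Table 2.1.
state_probs = {1: Fraction(2, 10), 2: Fraction(3, 10), 3: Fraction(5, 10)}

# State-contingent consumption plans from Table 2.1 (state -> consumption level).
plans = {
    "c1": {1: 3, 2: 2, 3: 4},
    "c2": {1: 3, 2: 1, 3: 5},
    "c3": {1: 4, 2: 4, 3: 1},
    "c4": {1: 1, 2: 1, 3: 4},
}


def distribution(plan, probs):
    """Return pi_c as a dict: consumption level z -> total probability of states with c_omega = z."""
    pi = {}
    for state, level in plan.items():
        pi[level] = pi.get(level, Fraction(0)) + probs[state]
    return pi


distributions = {name: distribution(plan, state_probs) for name, plan in plans.items()}

for name, pi in distributions.items():
    print(name, {z: str(p) for z, p in sorted(pi.items())})
```

Levels of $Z$ that a plan never reaches are simply absent from its dictionary, which corresponds to a probability of zero.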
As an example consider an economy with three possible states and four possible state-contingent
consumption plans as illustrated in Table 2.1. These four consumption plans may be the product
of four different portfolio choices. The set of possible end-of-period consumption levels is
$Z = \{1, 2, 3, 4, 5\}$. Each consumption plan generates a probability distribution on the set $Z$. The
probability distributions corresponding to these consumption plans are as shown in Table 2.2. We
see that although the consumption plans $c^{(3)}$ and $c^{(4)}$ are different they generate identical probability
distributions. By assumption individuals will be indifferent between these two consumption
plans.

cons. level $z$                  1     2     3     4     5
cons. plan 1, $\pi_{c^{(1)}}$    0     0.3   0.2   0.5   0
cons. plan 2, $\pi_{c^{(2)}}$    0.3   0     0.2   0     0.5
cons. plan 3, $\pi_{c^{(3)}}$    0.5   0     0     0.5   0
cons. plan 4, $\pi_{c^{(4)}}$    0.5   0     0     0.5   0
Table 2.2: The probability distributions corresponding to the state-contingent consumption plans shown in Table 2.1.

Given these assumptions the individual will effectively choose between probability distributions
on the set of possible consumption levels $Z$. We assume for simplicity that $Z$ is a finite set, but the
results can be generalized to the case of infinite $Z$ at the cost of further mathematical complexity.
We denote by $\mathcal{P}(Z)$ the set of all probability distributions on $Z$ that are generated by consumption
plans in $\mathcal{C}$. A probability distribution $\pi$ on the finite set $Z$ is simply a function $\pi : Z \to [0, 1]$ with
the properties that $\sum_{z \in Z} \pi(z) = 1$ and $\pi(A \cup B) = \pi(A) + \pi(B)$ whenever $A \cap B = \emptyset$.
We assume that the preferences of the individual can be represented by a preference relation $\succeq$
on $\mathcal{P}(Z)$, which is a binary relation satisfying the following two conditions:
(i) if $\pi_1 \succeq \pi_2$ and $\pi_2 \succeq \pi_3$, then $\pi_1 \succeq \pi_3$ [transitivity]
(ii) $\forall \pi_1, \pi_2 \in \mathcal{P}(Z)$: either $\pi_1 \succeq \pi_2$ or $\pi_2 \succeq \pi_1$ [completeness]
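Here, $\pi_1 \succeq \pi_2$ is to be read as "$\pi_1$ is preferred to $\pi_2$". We write $\pi_1 \not\succeq \pi_2$ if $\pi_1$ is not preferred
to $\pi_2$. If both $\pi_1 \succeq \pi_2$ and $\pi_2 \succeq \pi_1$, we write $\pi_1 \sim \pi_2$ and say that the individual is indifferent
between $\pi_1$ and $\pi_2$. If $\pi_1 \succeq \pi_2$, but $\pi_2 \not\succeq \pi_1$, we say that $\pi_1$ is strictly preferred to $\pi_2$ and write
$\pi_1 \succ \pi_2$.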
Note that if $\pi_1, \pi_2 \in \mathcal{P}(Z)$ and $\alpha \in [0, 1]$, then $\alpha \pi_1 + (1 - \alpha) \pi_2 \in \mathcal{P}(Z)$. The mixed distribution
$\alpha \pi_1 + (1 - \alpha) \pi_2$ assigns the probability $(\alpha \pi_1 + (1 - \alpha) \pi_2)(z) = \alpha \pi_1(z) + (1 - \alpha) \pi_2(z)$ to the
consumption level $z$. We can think of the mixed distribution $\alpha \pi_1 + (1 - \alpha) \pi_2$ as the outcome of a
two-stage gamble. The first stage is to flip a coin which with probability $\alpha$ shows heads and with
probability $1 - \alpha$ shows tails. If heads comes out, the second stage is the consumption gamble
corresponding to the probability distribution $\pi_1$. If tails is the outcome of the first stage, the
second stage is the consumption gamble corresponding to $\pi_2$. When we assume that preferences
are represented by a preference relation on the set $\mathcal{P}(Z)$ of probability distributions, we have
implicitly assumed that the individual evaluates the two-stage gamble (or any multi-stage gamble)
by the combined probability distribution, i.e., the ultimate consequences of the gamble. This is
sometimes referred to as consequentialism.
Let $z$ be some element of $Z$, i.e., some possible consumption level. By $\mathbf{1}_z$ we will denote the probability distribution that assigns a probability of one to $z$ and a zero probability to all other elements in $Z$. Since we have assumed that the set $Z$ of possible consumption levels only has a finite number of elements, it must have a maximum element, say $z^u$, and a minimum element, say $z^l$. Since the elements represent consumption levels, it is certainly natural that individuals prefer higher elements to lower ones. We will therefore assume that the probability distribution $\mathbf{1}_{z^u}$ is preferred to any other probability distribution. Conversely, any probability distribution is preferred to the probability distribution $\mathbf{1}_{z^l}$. We assume that $\mathbf{1}_{z^u}$ is strictly preferred to $\mathbf{1}_{z^l}$ so that the individual is not indifferent between all probability distributions. For any $\pi \in \mathcal{P}(Z)$ we thus have that
$$\mathbf{1}_{z^u} \succ \pi \succ \mathbf{1}_{z^l} \quad\text{or}\quad \mathbf{1}_{z^u} \sim \pi \succ \mathbf{1}_{z^l} \quad\text{or}\quad \mathbf{1}_{z^u} \succ \pi \sim \mathbf{1}_{z^l}.$$
2.3 Utility indices
A utility index for a given preference relation $\succeq$ is a function $U : \mathcal{P}(Z) \to \mathbb{R}$ that to each probability distribution over consumption levels attaches a real-valued number such that
$$\pi_1 \succeq \pi_2 \quad\Leftrightarrow\quad U(\pi_1) \geq U(\pi_2).$$
Note that a utility index is only unique up to a strictly increasing transformation. If $U$ is a utility index and $f : \mathbb{R} \to \mathbb{R}$ is any strictly increasing function, then the composite function $V = f \circ U$, defined by $V(\pi) = f(U(\pi))$, is also a utility index for the same preference relation.
We will show below that a utility index exists under the following two axiomatic assumptions on the preference relation $\succeq$:
Axiom 2.1 (Monotonicity). Suppose that $\pi_1, \pi_2 \in \mathcal{P}(Z)$ with $\pi_1 \succ \pi_2$ and let $a, b \in [0,1]$. The preference relation $\succeq$ has the property that
$$a > b \quad\Leftrightarrow\quad a\pi_1 + (1-a)\pi_2 \succ b\pi_1 + (1-b)\pi_2.$$
This is certainly a very natural assumption on preferences. If you consider a weighted average of two probability distributions, you will prefer a high weight on the better of the two distributions.
Axiom 2.2 (Archimedean). The preference relation $\succeq$ has the property that for any three probability distributions $\pi_1, \pi_2, \pi_3 \in \mathcal{P}(Z)$ with $\pi_1 \succ \pi_2 \succ \pi_3$, numbers $a, b \in (0,1)$ exist such that
$$a\pi_1 + (1-a)\pi_3 \succ \pi_2 \succ b\pi_1 + (1-b)\pi_3.$$
The axiom basically says that no matter how good a probability distribution $\pi_1$ is, for any $\pi_2 \succ \pi_3$ we can find some mixed distribution of $\pi_1$ and $\pi_3$ to which $\pi_2$ is preferred. We just have to put a sufficiently low weight on $\pi_1$ in the mixed distribution. Similarly, no matter how bad a probability distribution $\pi_3$ is, for any $\pi_1 \succ \pi_2$ we can find some mixed distribution of $\pi_1$ and $\pi_3$ that is preferred to $\pi_2$. We just have to put a sufficiently low weight on $\pi_3$ in the mixed distribution.
We shall say that a preference relation has the continuity property if for any three probability distributions $\pi_1, \pi_2, \pi_3 \in \mathcal{P}(Z)$ with $\pi_1 \succ \pi_2 \succ \pi_3$, a unique number $\alpha \in (0,1)$ exists such that
$$\pi_2 \sim \alpha\pi_1 + (1-\alpha)\pi_3.$$
We can easily extend this to the case where either $\pi_1 \sim \pi_2$ or $\pi_2 \sim \pi_3$. For $\pi_1 \sim \pi_2 \succ \pi_3$, $\pi_2 \sim 1\pi_1 + (1-1)\pi_3$ corresponding to $\alpha = 1$. For $\pi_1 \succ \pi_2 \sim \pi_3$, $\pi_2 \sim 0\pi_1 + (1-0)\pi_3$ corresponding to $\alpha = 0$. In words, the continuity property means that for any three probability distributions there is a unique combination of the best and the worst distribution such that the individual is indifferent between the third "middle" distribution and this combination of the other two. This appears to be closely related to the Archimedean Axiom and, in fact, the next lemma shows that the Monotonicity Axiom and the Archimedean Axiom imply continuity of preferences.
Lemma 2.1. Let $\succeq$ be a preference relation satisfying the Monotonicity Axiom and the Archimedean Axiom. Then it has the continuity property.
Proof. Given $\pi_1 \succ \pi_2 \succ \pi_3$. Define the number $\alpha$ by
$$\alpha = \sup\{k \in [0,1] \mid \pi_2 \succeq k\pi_1 + (1-k)\pi_3\}.$$
By the Monotonicity Axiom we have that $\pi_2 \succ k\pi_1 + (1-k)\pi_3$ for all $k < \alpha$ and that $k\pi_1 + (1-k)\pi_3 \succeq \pi_2$ for all $k > \alpha$. We want to show that $\pi_2 \sim \alpha\pi_1 + (1-\alpha)\pi_3$. Note that by the Archimedean Axiom, there is some $k > 0$ such that $\pi_2 \succ k\pi_1 + (1-k)\pi_3$ and some $k < 1$ such that $k\pi_1 + (1-k)\pi_3 \succ \pi_2$. Consequently, $\alpha$ is in the open interval $(0,1)$.
Suppose that $\pi_2 \succ \alpha\pi_1 + (1-\alpha)\pi_3$. Then according to the Archimedean Axiom we can find a number $b \in (0,1)$ such that $\pi_2 \succ b\pi_1 + (1-b)\{\alpha\pi_1 + (1-\alpha)\pi_3\}$. The mixed distribution on the right-hand side has a total weight of $k = b + (1-b)\alpha = \alpha + (1-\alpha)b > \alpha$ on $\pi_1$. Hence we have found some $k > \alpha$ for which $\pi_2 \succ k\pi_1 + (1-k)\pi_3$. This contradicts the definition of $\alpha$. Consequently, we must have that $\pi_2 \not\succ \alpha\pi_1 + (1-\alpha)\pi_3$.
Now suppose that $\alpha\pi_1 + (1-\alpha)\pi_3 \succ \pi_2$. Then we know from the Archimedean Axiom that a number $a \in (0,1)$ exists such that $a\{\alpha\pi_1 + (1-\alpha)\pi_3\} + (1-a)\pi_3 \succ \pi_2$. The mixed distribution on the left-hand side has a total weight of $a\alpha < \alpha$ on $\pi_1$. Hence we have found some $k < \alpha$ for which $k\pi_1 + (1-k)\pi_3 \succ \pi_2$. This contradicts the definition of $\alpha$. We can therefore also conclude that $\alpha\pi_1 + (1-\alpha)\pi_3 \not\succ \pi_2$. In sum, we have $\pi_2 \sim \alpha\pi_1 + (1-\alpha)\pi_3$.
The next result states that a preference relation which satisfies the Monotonicity Axiom and has the continuity property can always be represented by a utility index. In particular this is true when $\succeq$ satisfies the Monotonicity Axiom and the Archimedean Axiom.
Theorem 2.1. Let $\succeq$ be a preference relation which satisfies the Monotonicity Axiom and has the continuity property. Then it can be represented by a utility index $U$, i.e., a function $U : \mathcal{P}(Z) \to \mathbb{R}$ with the property that
$$\pi_1 \succeq \pi_2 \quad\Leftrightarrow\quad U(\pi_1) \geq U(\pi_2).$$
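Proof. Recall that we have assumed a best probability distribution $\mathbf{1}_{z^u}$ and a worst probability distribution $\mathbf{1}_{z^l}$ in the sense that
$$\mathbf{1}_{z^u} \succ \pi \succ \mathbf{1}_{z^l} \quad\text{or}\quad \mathbf{1}_{z^u} \sim \pi \succ \mathbf{1}_{z^l} \quad\text{or}\quad \mathbf{1}_{z^u} \succ \pi \sim \mathbf{1}_{z^l}$$
for any $\pi \in \mathcal{P}(Z)$. For any $\pi \in \mathcal{P}(Z)$ we know from the continuity property that a unique number $\alpha_\pi \in [0,1]$ exists such that
$$\pi \sim \alpha_\pi \mathbf{1}_{z^u} + (1-\alpha_\pi)\mathbf{1}_{z^l}.$$
If $\mathbf{1}_{z^u} \sim \pi \succ \mathbf{1}_{z^l}$, $\alpha_\pi = 1$. If $\mathbf{1}_{z^u} \succ \pi \sim \mathbf{1}_{z^l}$, $\alpha_\pi = 0$. If $\mathbf{1}_{z^u} \succ \pi \succ \mathbf{1}_{z^l}$, $\alpha_\pi \in (0,1)$.
We define the function $U : \mathcal{P}(Z) \to \mathbb{R}$ by $U(\pi) = \alpha_\pi$. By the Monotonicity Axiom we know that $U(\pi_1) \geq U(\pi_2)$ if and only if
$$U(\pi_1)\mathbf{1}_{z^u} + (1-U(\pi_1))\mathbf{1}_{z^l} \succeq U(\pi_2)\mathbf{1}_{z^u} + (1-U(\pi_2))\mathbf{1}_{z^l},$$
and hence if and only if $\pi_1 \succeq \pi_2$. It follows that $U$ is a utility index.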
2.4 Expected utility representation of preferences
Utility indices are functions of probability distributions on the set of possible consumption levels. With many states of the world and many assets to trade in, the set of such probability distributions will be very, very large. This will significantly complicate the analysis of optimal choice using utility indices to represent preferences. To simplify the analysis, financial economists traditionally put more structure on the preferences so that they can be represented in terms of expected utility.
We say that a preference relation $\succeq$ on $\mathcal{P}(Z)$ has an expected utility representation if there exists a function $u : Z \to \mathbb{R}$ such that
$$\pi_1 \succeq \pi_2 \quad\Leftrightarrow\quad \sum_{z \in Z} \pi_1(z)u(z) \geq \sum_{z \in Z} \pi_2(z)u(z). \tag{2.1}$$
Here $\sum_{z \in Z} \pi(z)u(z)$ is the expected utility of end-of-period consumption given the consumption probability distribution $\pi$, so (2.1) says that $\pi_1 \succeq \pi_2$ exactly when $\mathrm{E}[u(c_1)] \geq \mathrm{E}[u(c_2)]$, where $c_i$ is the random variable representing end-of-period consumption with associated consumption probability distribution $\pi_i$. The function $u$ is called a von Neumann-Morgenstern utility function or simply a utility function. Note that $u$ is defined on the set $Z$ of consumption levels, which in general has a simpler structure than the set of probability distributions on $Z$. Given a utility function $u$, we can obviously define a utility index by $U(\pi) = \sum_{z \in Z} \pi(z)u(z)$.
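As a rough illustration of the formula $U(\pi) = \sum_{z \in Z} \pi(z)u(z)$, the sketch below evaluates it for the four distributions of Table 2.2. The utility function $u(z) = \ln z$ and the names `expected_utility`, `plans`, and `ranking` are assumptions made for this example only; the text does not commit to a particular $u$ at this point.

```python
import math

# Consumption levels and probability distributions taken from Table 2.2.
Z = [1, 2, 3, 4, 5]
plans = {
    "plan 1": [0.0, 0.3, 0.2, 0.5, 0.0],
    "plan 2": [0.3, 0.0, 0.2, 0.0, 0.5],
    "plan 3": [0.5, 0.0, 0.0, 0.5, 0.0],
    "plan 4": [0.5, 0.0, 0.0, 0.5, 0.0],
}

def expected_utility(pi, u, levels=Z):
    """Return the sum over z of pi(z) * u(z)."""
    return sum(p * u(z) for p, z in zip(pi, levels))

# Assumed utility function, chosen only for illustration.
u = math.log

# Rank the plans from highest to lowest expected utility.
ranking = sorted(plans, key=lambda name: expected_utility(plans[name], u), reverse=True)
for name in ranking:
    print(name, round(expected_utility(plans[name], u), 4))
```

Plans 3 and 4 receive the same expected utility because they generate the same distribution, in line with the indifference assumption made earlier in the chapter.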
2.4.1 Conditions for expected utility
When can we use an expected utility representation of a preference relation? The next lemma is a first step.
Lemma 2.2. A preference relation $\succeq$ has an expected utility representation if and only if it can be represented by a linear utility index $U$ in the sense that
$$U(a\pi_1 + (1-a)\pi_2) = aU(\pi_1) + (1-a)U(\pi_2)$$
for any $\pi_1, \pi_2 \in \mathcal{P}(Z)$ and any $a \in [0,1]$.
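Proof. Suppose that $\succeq$ has an expected utility representation with utility function $u$. Define $U : \mathcal{P}(Z) \to \mathbb{R}$ by $U(\pi) = \sum_{z \in Z} \pi(z)u(z)$. Then clearly $U$ is a utility index representing $\succeq$ and $U$ is linear since
$$\begin{aligned}
U(a\pi_1 + (1-a)\pi_2) &= \sum_{z \in Z} \left(a\pi_1(z) + (1-a)\pi_2(z)\right)u(z) \\
&= a\sum_{z \in Z} \pi_1(z)u(z) + (1-a)\sum_{z \in Z} \pi_2(z)u(z) \\
&= aU(\pi_1) + (1-a)U(\pi_2).
\end{aligned}$$
Conversely, suppose that $U$ is a linear utility index representing $\succeq$. Define a function $u : Z \to \mathbb{R}$ by $u(z) = U(\mathbf{1}_z)$. For any $\pi \in \mathcal{P}(Z)$ we have
$$\pi \sim \sum_{z \in Z} \pi(z)\mathbf{1}_z.$$
Therefore,
$$U(\pi) = U\left(\sum_{z \in Z} \pi(z)\mathbf{1}_z\right) = \sum_{z \in Z} \pi(z)U(\mathbf{1}_z) = \sum_{z \in Z} \pi(z)u(z).$$
Since $U$ is a utility index, we have $\pi_1 \succeq \pi_2 \Leftrightarrow U(\pi_1) \geq U(\pi_2)$, which the computation above shows is equivalent to $\sum_{z \in Z} \pi_1(z)u(z) \geq \sum_{z \in Z} \pi_2(z)u(z)$. Consequently, $u$ gives an expected utility representation of $\succeq$.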
$z$ | 1 | 2 | 3 | 4
$\pi_1$ | 0 | 0.2 | 0.6 | 0.2
$\pi_2$ | 0 | 0.4 | 0.2 | 0.4
$\pi_3$ | 1 | 0 | 0 | 0
$\pi_4$ | 0.5 | 0.1 | 0.3 | 0.1
$\pi_5$ | 0.5 | 0.2 | 0.1 | 0.2
Table 2.3: The probability distributions used in the illustration of the Substitution Axiom.
The question then is under what assumptions the preference relation $\succeq$ can be represented by a linear utility index. As shown by von Neumann and Morgenstern (1944) we need an additional axiom, the so-called Substitution Axiom.
Axiom 2.3 (Substitution). For all $\pi_1, \pi_2, \pi_3 \in \mathcal{P}(Z)$ and all $a \in (0,1]$, we have
$$\pi_1 \succ \pi_2 \quad\Leftrightarrow\quad a\pi_1 + (1-a)\pi_3 \succ a\pi_2 + (1-a)\pi_3$$
and
$$\pi_1 \sim \pi_2 \quad\Leftrightarrow\quad a\pi_1 + (1-a)\pi_3 \sim a\pi_2 + (1-a)\pi_3.$$
The Substitution Axiom is sometimes called the Independence Axiom or the Axiom of the Irrelevance of the Common Alternative. Basically, it says that when the individual is to compare two probability distributions, she needs only consider the parts of the two distributions which are different from each other. As an example, suppose the possible consumption levels are $Z = \{1, 2, 3, 4\}$ and consider the probability distributions on $Z$ given in Table 2.3. Suppose you want to compare the distributions $\pi_4$ and $\pi_5$. They only differ in the probabilities they associate with consumption levels 2, 3, and 4 so it should only be necessary to focus on these parts. More formally observe that
$$\pi_4 \sim 0.5\pi_1 + 0.5\pi_3 \quad\text{and}\quad \pi_5 \sim 0.5\pi_2 + 0.5\pi_3.$$
$\pi_1$ is the conditional distribution of $\pi_4$ given that the consumption level is different from 1 and $\pi_2$ is the conditional distribution of $\pi_5$ given that the consumption level is different from 1. The Substitution Axiom then says that
$$\pi_4 \succ \pi_5 \quad\Leftrightarrow\quad \pi_1 \succ \pi_2.$$
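The decomposition used in this example can be checked numerically. The sketch below rebuilds $\pi_4$ and $\pi_5$ from Table 2.3 as mixtures and, for one assumed utility function ($u(z) = \sqrt{z}$, chosen only for illustration), confirms that the expected utility gap between $\pi_4$ and $\pi_5$ is exactly half the gap between $\pi_1$ and $\pi_2$, so the two comparisons always have the same sign. The helper names `mix`, `same`, and `expected_utility` are hypothetical.

```python
import math

def mix(a, p, q):
    """Mixed distribution a*p + (1 - a)*q, computed level by level."""
    return [a * x + (1 - a) * y for x, y in zip(p, q)]

def same(p, q):
    """True if two distributions agree up to rounding error."""
    return all(math.isclose(x, y, abs_tol=1e-12) for x, y in zip(p, q))

def expected_utility(pi, u, levels):
    return sum(p * u(z) for p, z in zip(pi, levels))

# Distributions from Table 2.3 on Z = {1, 2, 3, 4}.
Z = [1, 2, 3, 4]
pi1 = [0.0, 0.2, 0.6, 0.2]
pi2 = [0.0, 0.4, 0.2, 0.4]
pi3 = [1.0, 0.0, 0.0, 0.0]
pi4 = [0.5, 0.1, 0.3, 0.1]
pi5 = [0.5, 0.2, 0.1, 0.2]

# pi4 and pi5 are 50/50 mixtures with the common alternative pi3.
print(same(mix(0.5, pi1, pi3), pi4), same(mix(0.5, pi2, pi3), pi5))

u = math.sqrt  # assumed utility function, for illustration only
gap_45 = expected_utility(pi4, u, Z) - expected_utility(pi5, u, Z)
gap_12 = expected_utility(pi1, u, Z) - expected_utility(pi2, u, Z)
print(gap_45, 0.5 * gap_12)
```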
The next lemma shows that the Substitution Axiom is more restrictive than the Monotonicity
Axiom.
Lemma 2.3. If a preference relation $\succeq$ satisfies the Substitution Axiom, it will also satisfy the Monotonicity Axiom.
Proof. Given $\pi_1, \pi_2 \in \mathcal{P}(Z)$ with $\pi_1 \succ \pi_2$ and numbers $a, b \in [0,1]$. We have to show that
$$a > b \quad\Leftrightarrow\quad a\pi_1 + (1-a)\pi_2 \succ b\pi_1 + (1-b)\pi_2.$$
Note that if $a = 0$, we cannot have $a > b$, and if $a\pi_1 + (1-a)\pi_2 \succ b\pi_1 + (1-b)\pi_2$ we cannot have $a = 0$. We can therefore safely assume that $a > 0$.
First assume that $a > b$. Observe that it follows from the Substitution Axiom that
$$a\pi_1 + (1-a)\pi_2 \succ a\pi_2 + (1-a)\pi_2$$
and hence that $a\pi_1 + (1-a)\pi_2 \succ \pi_2$. Also from the Substitution Axiom we have that for any $\pi_3 \succ \pi_2$,
$$\pi_3 \sim \left(1 - \frac{b}{a}\right)\pi_3 + \frac{b}{a}\pi_3 \succ \left(1 - \frac{b}{a}\right)\pi_2 + \frac{b}{a}\pi_3.$$
Due to our observation above, we can use this with $\pi_3 = a\pi_1 + (1-a)\pi_2$. Then we get
$$a\pi_1 + (1-a)\pi_2 \succ \frac{b}{a}\{a\pi_1 + (1-a)\pi_2\} + \left(1 - \frac{b}{a}\right)\pi_2 \sim b\pi_1 + (1-b)\pi_2,$$
as was to be shown.
Conversely, assuming that
$$a\pi_1 + (1-a)\pi_2 \succ b\pi_1 + (1-b)\pi_2,$$
we must argue that $a > b$. The above strict preference cannot hold if $a = b$ since the two combined distributions are then identical. If $b$ were greater than $a$, we could follow the steps above with $a$ and $b$ swapped and end up concluding that $b\pi_1 + (1-b)\pi_2 \succ a\pi_1 + (1-a)\pi_2$, which would contradict our assumption. Hence, we can have neither $a = b$ nor $a < b$ but must have $a > b$.
Next we state the main result:
Theorem 2.2. Assume that $Z$ is finite and that $\succeq$ is a preference relation on $\mathcal{P}(Z)$. Then $\succeq$ can be represented by a linear utility index if and only if $\succeq$ satisfies the Archimedean Axiom and the Substitution Axiom.
Proof. First suppose the preference relation $\succeq$ satisfies the Archimedean Axiom and the Substitution Axiom. Define a utility index $U : \mathcal{P}(Z) \to \mathbb{R}$ exactly as in the proof of Theorem 2.1, i.e., $U(\pi) = \alpha_\pi$, where $\alpha_\pi \in [0,1]$ is the unique number such that
$$\pi \sim \alpha_\pi \mathbf{1}_{z^u} + (1-\alpha_\pi)\mathbf{1}_{z^l}.$$
We want to show that, as a consequence of the Substitution Axiom, $U$ is indeed linear. For that purpose, pick any two probability distributions $\pi_1, \pi_2 \in \mathcal{P}(Z)$ and any number $a \in [0,1]$. We want to show that $U(a\pi_1 + (1-a)\pi_2) = aU(\pi_1) + (1-a)U(\pi_2)$. We can do that by showing that
$$a\pi_1 + (1-a)\pi_2 \sim \left(aU(\pi_1) + (1-a)U(\pi_2)\right)\mathbf{1}_{z^u} + \left(1 - \{aU(\pi_1) + (1-a)U(\pi_2)\}\right)\mathbf{1}_{z^l}.$$
This follows from the Substitution Axiom:
$$\begin{aligned}
a\pi_1 + (1-a)\pi_2 &\sim a\{U(\pi_1)\mathbf{1}_{z^u} + (1-U(\pi_1))\mathbf{1}_{z^l}\} + (1-a)\{U(\pi_2)\mathbf{1}_{z^u} + (1-U(\pi_2))\mathbf{1}_{z^l}\} \\
&\sim \left(aU(\pi_1) + (1-a)U(\pi_2)\right)\mathbf{1}_{z^u} + \left(1 - \{aU(\pi_1) + (1-a)U(\pi_2)\}\right)\mathbf{1}_{z^l}.
\end{aligned}$$
Now let us show the converse, i.e., if $\succeq$ can be represented by a linear utility index $U$, then it must satisfy the Archimedean Axiom and the Substitution Axiom. In order to show the Archimedean Axiom, we pick $\pi_1 \succ \pi_2 \succ \pi_3$, which means that $U(\pi_1) > U(\pi_2) > U(\pi_3)$, and must find numbers $a, b \in (0,1)$ such that
$$a\pi_1 + (1-a)\pi_3 \succ \pi_2 \succ b\pi_1 + (1-b)\pi_3,$$
i.e., that
$$U(a\pi_1 + (1-a)\pi_3) > U(\pi_2) > U(b\pi_1 + (1-b)\pi_3).$$
Define the number $a$ by
$$a = 1 - \frac{1}{2}\,\frac{U(\pi_1) - U(\pi_2)}{U(\pi_1) - U(\pi_3)}.$$
Then $a \in (0,1)$ and by linearity of $U$ we get
$$\begin{aligned}
U(a\pi_1 + (1-a)\pi_3) &= aU(\pi_1) + (1-a)U(\pi_3) \\
&= U(\pi_1) + (1-a)\left(U(\pi_3) - U(\pi_1)\right) \\
&= U(\pi_1) - \frac{1}{2}\left(U(\pi_1) - U(\pi_2)\right) \\
&= \frac{1}{2}\left(U(\pi_1) + U(\pi_2)\right) \\
&> U(\pi_2).
\end{aligned}$$
Similarly for $b$.
In order to show the Substitution Axiom, we take $\pi_1, \pi_2, \pi_3 \in \mathcal{P}(Z)$ and any number $a \in (0,1]$. We must show that $\pi_1 \succ \pi_2$ if and only if $a\pi_1 + (1-a)\pi_3 \succ a\pi_2 + (1-a)\pi_3$, i.e.,
$$U(\pi_1) > U(\pi_2) \quad\Leftrightarrow\quad U(a\pi_1 + (1-a)\pi_3) > U(a\pi_2 + (1-a)\pi_3).$$
This follows immediately by linearity of $U$:
$$\begin{aligned}
U(a\pi_1 + (1-a)\pi_3) &= aU(\pi_1) + (1-a)U(\pi_3) \\
&> aU(\pi_2) + (1-a)U(\pi_3) \\
&= U(a\pi_2 + (1-a)\pi_3)
\end{aligned}$$
with the inequality holding if and only if $U(\pi_1) > U(\pi_2)$. Similarly, we can show that $\pi_1 \sim \pi_2$ if and only if $a\pi_1 + (1-a)\pi_3 \sim a\pi_2 + (1-a)\pi_3$.
The next theorem shows which utility functions represent the same preference relation. The proof is left for the reader as Exercise 2.1.
Theorem 2.3. A utility function for a given preference relation is only determined up to a strictly increasing affine transformation, i.e., if $u$ is a utility function for $\succeq$, then $v$ will be so if and only if there exist constants $a > 0$ and $b$ such that $v(z) = au(z) + b$ for all $z \in Z$.
If one utility function is an affine function of another, we will say that they are equivalent. Note that an easy consequence of this theorem is that it does not really matter whether the utility is positive or negative. At first, you might find negative utility strange but we can always add a sufficiently large positive constant without affecting the ranking of different consumption plans.
Suppose $U$ is a utility index with an associated utility function $u$. If $f$ is any strictly increasing transformation, then $V = f \circ U$ is also a utility index for the same preferences, but $f \circ u$ is only the utility function for $V$ if $f$ is affine.
The expected utility associated with a probability distribution $\pi$ on $Z$ is $\sum_{z \in Z} \pi(z)u(z)$. Recall that the probability distributions we consider correspond to consumption plans. Given a consumption plan, i.e., a random variable $c$, the associated probability distribution $\pi$ is defined by the probabilities
$$\pi(z) = P(\{\omega \in \Omega \mid c(\omega) = z\}) = \sum_{\omega \in \Omega :\, c(\omega) = z} p_\omega.$$
The expected utility associated with the consumption plan $c$ is therefore
$$\mathrm{E}[u(c)] = \sum_{\omega \in \Omega} p_\omega u(c(\omega)) = \sum_{z \in Z} \sum_{\omega \in \Omega :\, c(\omega) = z} p_\omega u(z) = \sum_{z \in Z} \pi(z)u(z).$$
Of course, if $c$ is a risk-free consumption plan in the sense that a $z$ exists such that $c(\omega) = z$ for all $\omega$, then the expected utility is $\mathrm{E}[u(c)] = u(z)$. With a slight abuse of notation we will just write this as $u(c)$.
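The two ways of computing expected utility, summing over states or summing over consumption levels, can be compared in a few lines. Everything in the sketch below is hypothetical: the three states `w1`, `w2`, `w3`, their probabilities, the consumption plan, and the utility function $u(z) = \sqrt{z}$ are made up for illustration and are not taken from the text.

```python
import math
from collections import defaultdict

# Hypothetical state probabilities p_omega and state-contingent consumption c(omega).
state_prob = {"w1": 0.2, "w2": 0.3, "w3": 0.5}
plan = {"w1": 1.0, "w2": 4.0, "w3": 4.0}

def distribution(plan, state_prob):
    """pi(z): add up the probabilities of all states in which consumption equals z."""
    pi = defaultdict(float)
    for state, p in state_prob.items():
        pi[plan[state]] += p
    return dict(pi)

def eu_over_states(plan, state_prob, u):
    """Sum over states of p_omega * u(c(omega))."""
    return sum(p * u(plan[state]) for state, p in state_prob.items())

def eu_over_levels(pi, u):
    """Sum over consumption levels of pi(z) * u(z)."""
    return sum(p * u(z) for z, p in pi.items())

u = math.sqrt  # assumed utility function, for illustration only
pi = distribution(plan, state_prob)
print(pi)
print(eu_over_states(plan, state_prob, u), eu_over_levels(pi, u))
```

Both sums agree because states with the same consumption level are simply grouped together.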
2.4.2 Some technical issues
Infinite $Z$. What if $Z$ is infinite, e.g., $Z = \mathbb{R}_+ \equiv [0, \infty)$? It can be shown that in this case a preference relation has an expected utility representation if the Archimedean Axiom, the Substitution Axiom, an additional axiom ("the sure thing principle"), and "some technical conditions" are satisfied. Fishburn (1970) gives the details.
Expected utility in this case: $\mathrm{E}[u(c)] = \int_Z u(z)\pi(z)\,dz$, where $\pi$ is a probability density function derived from the consumption plan $c$.
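Boundedness of expected utility. Suppose $u$ is unbounded from above and $\mathbb{R}_+ \subseteq Z$. Then there exists $(z_n)_{n=1}^\infty \subseteq Z$ with $z_n \to \infty$ and $u(z_n) \geq 2^n$. Expected utility of consumption plan $\pi_1$ with $\pi_1(z_n) = 1/2^n$:
$$\sum_{n=1}^\infty u(z_n)\pi_1(z_n) \geq \sum_{n=1}^\infty 2^n \frac{1}{2^n} = \infty.$$
If $\pi_2, \pi_3$ are such that $\pi_1 \succ \pi_2 \succ \pi_3$, then the expected utility of $\pi_2$ and $\pi_3$ must be finite. But for no $b \in (0,1)$ do we have
$$\pi_2 \succ b\pi_1 + (1-b)\pi_3 \quad [\text{expected utility} = \infty].$$
• no problem if $Z$ is finite
• no problem if $\mathbb{R}_+ \subseteq Z$, $u$ is concave, and consumption plans have finite expectations:
$u$ concave $\Rightarrow$ $u$ is differentiable in some point $b$ and
$$u(z) \leq u(b) + u'(b)(z - b), \quad \forall z \in Z.$$
If the consumption plan $c$ has finite expectations, then
$$\mathrm{E}[u(c)] \leq \mathrm{E}[u(b) + u'(b)(c - b)] = u(b) + u'(b)\left(\mathrm{E}[c] - b\right) < \infty.$$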
$z$ | 0 | 1 | 5
$\pi_1$ | 0 | 1 | 0
$\pi_2$ | 0.01 | 0.89 | 0.1
$\pi_3$ | 0.9 | 0 | 0.1
$\pi_4$ | 0.89 | 0.11 | 0
Table 2.4: The probability distributions used in the illustration of the Allais Paradox.
Subjective probability. We have taken the probabilities of the states of nature as exogenously
given, i.e., as objective probabilities. However, in real life individuals often have to form their own
probabilities about many events, i.e., they form subjective probabilities. Although the analysis is
a bit more complicated, Savage (1954) and Anscombe and Aumann (1963) show that the results
we developed above carry over to the case of subjective probabilities. For an introduction to this
analysis, see Kreps (1990, Ch. 3).
2.4.3 Are the axioms reasonable?
The validity of the Substitution Axiom, which is necessary for obtaining the expected utility representation, has been intensively discussed in the literature. Some researchers have conducted experiments in which the decisions made by the participating individuals conflict with the Substitution Axiom.
The most famous challenge is the so-called Allais Paradox named after Allais (1953). Here is one example of the paradox. Suppose $Z = \{0, 1, 5\}$. Consider the consumption plans in Table 2.4. The Substitution Axiom implies that $\pi_1 \succ \pi_2 \Rightarrow \pi_4 \succ \pi_3$. This can be seen from the following:
$$0.11(\$1) + 0.89(\$1) \sim \pi_1 \succ \pi_2 \sim 0.11\left(\frac{1}{11}(\$0) + \frac{10}{11}(\$5)\right) + 0.89(\$1)$$
$$\Rightarrow\quad \underbrace{0.11(\$1) + 0.89(\$0)}_{\pi_4} \succ 0.11\left(\frac{1}{11}(\$0) + \frac{10}{11}(\$5)\right) + 0.89(\$0) \sim \underbrace{0.9(\$0) + 0.1(\$5)}_{\pi_3}.$$
Nevertheless, individuals preferring $\pi_1$ to $\pi_2$ often choose $\pi_3$ over $\pi_4$. Apparently people tend to over-weight small probability events, e.g., $(\$0)$ in $\pi_2$.
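One way to see why the two choices are tied together under expected utility is to note that $\pi_1 - \pi_2$ and $\pi_4 - \pi_3$ are the same vector of probability differences, so the two expected utility gaps coincide for any utility function. The sketch below checks this for the distributions of Table 2.4 and a handful of arbitrary utility functions; the candidate functions and the names `eu`, `candidates`, and `gaps` are assumptions made for illustration.

```python
import math

# Consumption levels and distributions from Table 2.4.
levels = [0, 1, 5]
pi1 = [0.00, 1.00, 0.00]
pi2 = [0.01, 0.89, 0.10]
pi3 = [0.90, 0.00, 0.10]
pi4 = [0.89, 0.11, 0.00]

def eu(pi, u):
    """Expected utility of the distribution pi under the utility function u."""
    return sum(p * u(z) for p, z in zip(pi, levels))

# Arbitrary increasing utility functions, chosen only for illustration.
candidates = {
    "linear": lambda z: z,
    "square root": math.sqrt,
    "log(1 + z)": math.log1p,
    "negative exponential": lambda z: -math.exp(-z),
}

gaps = {}
for name, u in candidates.items():
    gap_12 = eu(pi1, u) - eu(pi2, u)  # positive iff pi1 is strictly preferred to pi2
    gap_43 = eu(pi4, u) - eu(pi3, u)  # positive iff pi4 is strictly preferred to pi3
    gaps[name] = (gap_12, gap_43)
    print(name, gap_12, gap_43)
```

Since the two gaps are equal up to rounding, no expected utility maximizer can strictly prefer $\pi_1$ to $\pi_2$ and at the same time strictly prefer $\pi_3$ to $\pi_4$.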
Other problems:
• the framing of possible choices, i.e., the way the alternatives are presented, seems to affect decisions
• models assume individuals have unlimited rationality
2.5 Risk aversion
In this section we focus on the attitudes towards risk reflected by the preferences of an individual. We assume that the preferences can be represented by a utility function $u$ and that $u$ is strictly increasing so that the individual is "greedy", i.e., prefers high consumption to low consumption. We assume that the utility function is defined on some interval $Z$ of $\mathbb{R}$, e.g., $Z = \mathbb{R}_+ \equiv [0, \infty)$.
2.5.1 Risk attitudes
Fix a consumption level $c \in Z$. Consider a random variable $\varepsilon$ with $\mathrm{E}[\varepsilon] = 0$. We can think of $c + \varepsilon$ as a random variable representing a consumption plan with consumption $c + \varepsilon(\omega)$ if state $\omega$ is realized. Note that $\mathrm{E}[c + \varepsilon] = c$. Such a random variable $\varepsilon$ is called a fair gamble or a zero-mean risk.
An individual is said to be (strictly) risk-averse if she for all $c \in Z$ and all fair gambles $\varepsilon$ (strictly) prefers the sure consumption level $c$ to $c + \varepsilon$. In other words, a risk-averse individual rejects all fair gambles. Similarly, an individual is said to be (strictly) risk-loving if she for all $c \in Z$ (strictly) prefers $c + \varepsilon$ to $c$, and said to be risk-neutral if she for all $c \in Z$ is indifferent between accepting any fair gamble or not. Of course, individuals may be neither risk-averse, risk-neutral, nor risk-loving, for example if they reject fair gambles around some values of $c$ and accept fair gambles around other values of $c$. Individuals may be locally risk-averse, locally risk-neutral, and locally risk-loving. Since it is generally believed that individuals are risk-averse, we focus on preferences exhibiting that feature.
We can think of any consumption plan $c$ as the sum of its expected value $\mathrm{E}[c]$ and a fair gamble $\varepsilon = c - \mathrm{E}[c]$. It follows that an individual is risk-averse if she prefers the sure consumption $\mathrm{E}[c]$ to the random consumption $c$, i.e., if $u(\mathrm{E}[c]) \geq \mathrm{E}[u(c)]$. By Jensen's Inequality, this is true exactly when $u$ is a concave function, and the strict inequality holds if $u$ is strictly concave and $c$ is a non-degenerate random variable, i.e., it does not have the same value in all states. Recall that $u : Z \to \mathbb{R}$ concave means that for all $z_1, z_2 \in Z$ and all $a \in (0,1)$ we have
$$u(az_1 + (1-a)z_2) \geq au(z_1) + (1-a)u(z_2).$$
If the strict inequality holds in all cases, the function is said to be strictly concave. By the above argument, we have the following theorem:
Theorem 2.4. An individual with a utility function $u$ is (strictly) risk-averse if and only if $u$ is (strictly) concave.
Similarly, an individual is (strictly) risk-loving if and only if the utility function is (strictly) convex. An individual is risk-neutral if and only if the utility function is affine.
2.5.2 Quantitative measures of risk aversion
We will focus on utility functions that are continuous and twice differentiable on the interior of $Z$. By our assumption of greedy individuals, we then have $u' > 0$, and the concavity of the utility function for risk-averse investors is then equivalent to $u'' \leq 0$.
The certainty equivalent of the random consumption plan $c$ is defined as the $c^* \in Z$ such that
$$u(c^*) = \mathrm{E}[u(c)],$$
i.e., the individual is just as satisfied getting the consumption level $c^*$ for sure as getting the random consumption $c$. With $Z \subseteq \mathbb{R}$, $c^*$ uniquely exists due to our assumptions that $u$ is continuous and strictly increasing. From the definition of the certainty equivalent it is clear that an individual will rank consumption plans according to their certainty equivalents.
For a risk-averse individual, the certainty equivalent $c^*$ of a consumption plan is smaller than the expected consumption level $\mathrm{E}[c]$. The risk premium associated with the consumption plan $c$ is defined as $\lambda(c) = \mathrm{E}[c] - c^*$ so that
$$\mathrm{E}[u(c)] = u(c^*) = u(\mathrm{E}[c] - \lambda(c)).$$
The risk premium is the consumption the individual is willing to give up in order to eliminate the uncertainty.
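A small numerical sketch may help fix the two definitions. It assumes a power utility function $u(c) = c^{1-\gamma}/(1-\gamma)$ with $\gamma = 2$ (a family introduced later in the chapter) and a made-up consumption plan paying 50 or 150 with equal probability; both choices, and the function names, are illustrative only.

```python
def crra(c, gamma=2.0):
    """Power utility u(c) = c^(1 - gamma) / (1 - gamma), for gamma != 1."""
    return c ** (1.0 - gamma) / (1.0 - gamma)

def crra_inverse(x, gamma=2.0):
    """Inverse of crra: the consumption level whose utility equals x."""
    return ((1.0 - gamma) * x) ** (1.0 / (1.0 - gamma))

# Hypothetical consumption plan: 50 or 150 with equal probability.
outcomes = [50.0, 150.0]
probs = [0.5, 0.5]

expected_consumption = sum(p * c for p, c in zip(probs, outcomes))
expected_utility = sum(p * crra(c) for p, c in zip(probs, outcomes))

# Certainty equivalent c* solves u(c*) = E[u(c)]; the risk premium is E[c] - c*.
certainty_equivalent = crra_inverse(expected_utility)
risk_premium = expected_consumption - certainty_equivalent

print(expected_consumption, certainty_equivalent, risk_premium)
```

With these numbers the certainty equivalent works out to about 75, so the individual would give up roughly 25 units of expected consumption to remove the risk.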
The degree of risk aversion is associated with $u''$, but a good measure of risk aversion should be invariant to strictly positive, affine transformations. This is satisfied by the Arrow-Pratt measures of risk aversion defined as follows. The Absolute Risk Aversion is given by
$$\mathrm{ARA}(c) = -\frac{u''(c)}{u'(c)}.$$
The Relative Risk Aversion is given by
$$\mathrm{RRA}(c) = -\frac{c\,u''(c)}{u'(c)} = c\,\mathrm{ARA}(c).$$
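We can link the Arrow-Pratt measures to the risk premium in the following way. Let $\bar{c} \in Z$ denote some fixed consumption level and let $\varepsilon$ be a fair gamble. The resulting consumption plan is then $c = \bar{c} + \varepsilon$. Denote the corresponding risk premium by $\lambda(\bar{c}, \varepsilon)$ so that
$$\mathrm{E}[u(\bar{c} + \varepsilon)] = u(c^*) = u(\bar{c} - \lambda(\bar{c}, \varepsilon)). \tag{2.2}$$
We can approximate the left-hand side of (2.2) by
$$\mathrm{E}[u(\bar{c} + \varepsilon)] \approx \mathrm{E}\left[u(\bar{c}) + \varepsilon u'(\bar{c}) + \frac{1}{2}\varepsilon^2 u''(\bar{c})\right] = u(\bar{c}) + \frac{1}{2}\mathrm{Var}[\varepsilon]\,u''(\bar{c}),$$
using $\mathrm{E}[\varepsilon] = 0$ and $\mathrm{Var}[\varepsilon] = \mathrm{E}[\varepsilon^2] - \mathrm{E}[\varepsilon]^2 = \mathrm{E}[\varepsilon^2]$, and we can approximate the right-hand side of (2.2) by
$$u(\bar{c} - \lambda(\bar{c}, \varepsilon)) \approx u(\bar{c}) - \lambda(\bar{c}, \varepsilon)\,u'(\bar{c}).$$
Hence we can write the risk premium as
$$\lambda(\bar{c}, \varepsilon) \approx -\frac{1}{2}\mathrm{Var}[\varepsilon]\,\frac{u''(\bar{c})}{u'(\bar{c})} = \frac{1}{2}\mathrm{Var}[\varepsilon]\,\mathrm{ARA}(\bar{c}).$$
Of course, the approximation is more accurate for small gambles. Thus the risk premium for a small fair gamble around $\bar{c}$ is roughly proportional to the absolute risk aversion at $\bar{c}$. We see that the absolute risk aversion $\mathrm{ARA}(\bar{c})$ is constant if and only if $\lambda(\bar{c}, \varepsilon)$ is independent of $\bar{c}$.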
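To get a feel for the quality of the approximation, the following sketch compares the exact risk premium with $\frac{1}{2}\mathrm{Var}[\varepsilon]\,\mathrm{ARA}(\bar{c})$ for gambles of increasing size. It assumes logarithmic utility, for which $\mathrm{ARA}(c) = 1/c$, a base consumption level of 100, and a fair coin flip paying $\pm h$; these choices and the function names are illustrative assumptions.

```python
import math

def exact_premium(c_bar, h):
    """Exact risk premium of the fair gamble +/- h (equal odds) under u(c) = ln c."""
    expected_utility = 0.5 * math.log(c_bar + h) + 0.5 * math.log(c_bar - h)
    certainty_equivalent = math.exp(expected_utility)  # inverts u(c) = ln c
    return c_bar - certainty_equivalent

def approx_premium(c_bar, h):
    """Second-order approximation 0.5 * Var[eps] * ARA(c_bar)."""
    variance = h ** 2   # variance of a +/- h coin flip
    ara = 1.0 / c_bar   # ARA(c) = -u''(c)/u'(c) = 1/c for log utility
    return 0.5 * variance * ara

for h in (1.0, 10.0, 50.0):
    print(h, exact_premium(100.0, h), approx_premium(100.0, h))
```

The two numbers are nearly identical for the small gamble and drift apart as the gamble grows, which is the sense in which the approximation is local.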
Loosely speaking, the absolute risk aversion $\mathrm{ARA}(c)$ measures the aversion to a fair gamble of a given dollar amount around $c$, such as a gamble where there is an equal probability of winning or losing 1000 dollars. Since we expect that a wealthy investor will be less averse to that gamble than a poor investor, the absolute risk aversion is expected to be a decreasing function of wealth. Note that
$$\mathrm{ARA}'(c) = -\frac{u'''(c)u'(c) - u''(c)^2}{u'(c)^2} = \left(\frac{u''(c)}{u'(c)}\right)^2 - \frac{u'''(c)}{u'(c)} < 0 \quad\Rightarrow\quad u'''(c) > 0,$$
that is, a positive third-order derivative of $u$ is necessary for the utility function $u$ to exhibit decreasing absolute risk aversion.
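Now consider a multiplicative fair gamble around $\bar{c}$ in the sense that the resulting consumption plan is $c = \bar{c}(1 + \varepsilon) = \bar{c} + \bar{c}\varepsilon$, where $\mathrm{E}[\varepsilon] = 0$. The risk premium is then
$$\lambda(\bar{c}, \bar{c}\varepsilon) \approx \frac{1}{2}\mathrm{Var}[\bar{c}\varepsilon]\,\mathrm{ARA}(\bar{c}) = \frac{1}{2}\bar{c}^2\,\mathrm{Var}[\varepsilon]\,\mathrm{ARA}(\bar{c}) = \frac{1}{2}\bar{c}\,\mathrm{Var}[\varepsilon]\,\mathrm{RRA}(\bar{c}),$$
implying that
$$\frac{\lambda(\bar{c}, \bar{c}\varepsilon)}{\bar{c}} \approx \frac{1}{2}\mathrm{Var}[\varepsilon]\,\mathrm{RRA}(\bar{c}). \tag{2.3}$$
The fraction of consumption you require to engage in the multiplicative risk is thus (roughly) proportional to the relative risk aversion at $\bar{c}$. Note that utility functions with constant or decreasing (or even modestly increasing) relative risk aversion will display decreasing absolute risk aversion.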
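Relation (2.3) can be illustrated with a power utility function $u(c) = c^{1-\gamma}/(1-\gamma)$, whose relative risk aversion equals the constant $\gamma$. In the sketch below, $\gamma = 3$, the gamble $\varepsilon = \pm 0.1$ with equal odds, and the three base levels are arbitrary illustrative choices; `relative_premium` is a hypothetical helper name.

```python
def relative_premium(c_bar, eps, gamma):
    """Exact lambda / c_bar for the gamble c_bar * (1 +/- eps), equal odds, power utility."""
    def u(c):
        return c ** (1.0 - gamma) / (1.0 - gamma)

    expected_utility = 0.5 * u(c_bar * (1.0 + eps)) + 0.5 * u(c_bar * (1.0 - eps))
    # Invert the power utility function to obtain the certainty equivalent.
    certainty_equivalent = ((1.0 - gamma) * expected_utility) ** (1.0 / (1.0 - gamma))
    return (c_bar - certainty_equivalent) / c_bar

gamma, eps = 3.0, 0.1
approx = 0.5 * eps ** 2 * gamma  # right-hand side of (2.3); Var[eps] = eps^2 here

ratios = [relative_premium(c_bar, eps, gamma) for c_bar in (1.0, 100.0, 10_000.0)]
print(ratios, approx)
```

The exact relative premium is the same at every base level, as constant relative risk aversion suggests, and it lies close to the approximation in (2.3).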
Some authors use terms like risk tolerance and risk cautiousness. The absolute risk tolerance at $c$ is simply the reciprocal of the absolute risk aversion, i.e.,
$$\mathrm{ART}(c) = \frac{1}{\mathrm{ARA}(c)} = -\frac{u'(c)}{u''(c)}.$$
Similarly, the relative risk tolerance is the reciprocal of the relative risk aversion. The risk cautiousness at $c$ is defined as the rate of change in the absolute risk tolerance, i.e., $\mathrm{ART}'(c)$.
2.5.3 Comparison of risk aversion between individuals
An individual with utility function $u$ is said to be more risk-averse than an individual with utility function $v$ if for any consumption plan $c$ and any fixed $\bar{c} \in Z$ with $\mathrm{E}[u(c)] \geq u(\bar{c})$, we have $\mathrm{E}[v(c)] \geq v(\bar{c})$. So the $v$-individual will accept all gambles that the $u$-individual will accept and possibly some more. Pratt (1964) has shown the following theorem:
Theorem 2.5. Suppose $u$ and $v$ are twice continuously differentiable and strictly increasing. Then the following conditions are equivalent:
(a) $u$ is more risk-averse than $v$,
(b) $\mathrm{ARA}_u(c) \geq \mathrm{ARA}_v(c)$ for all $c \in Z$,
(c) a strictly increasing and concave function $f$ exists such that $u = f \circ v$.
Proof. First let us show (a) $\Rightarrow$ (b): Suppose $u$ is more risk-averse than $v$, but that $\mathrm{ARA}_u(\hat{c}) < \mathrm{ARA}_v(\hat{c})$ for some $\hat{c} \in Z$. Since $\mathrm{ARA}_u$ and $\mathrm{ARA}_v$ are continuous, we must then have that $\mathrm{ARA}_u(c) < \mathrm{ARA}_v(c)$ for all $c$ in an interval around $\hat{c}$. Then we can surely find a small gamble around $\hat{c}$, which the $u$-individual will accept, but the $v$-individual will reject. This contradicts the assumption in (a).
Next, we show (b) $\Rightarrow$ (c): Since $v$ is strictly increasing, it has an inverse $v^{-1}$ and we can define a function $f$ by $f(x) = u(v^{-1}(x))$. Then clearly $f(v(c)) = u(c)$ so that $u = f \circ v$. The first-order derivative of $f$ is
$$f'(x) = \frac{u'(v^{-1}(x))}{v'(v^{-1}(x))},$$
which is positive since $u$ and $v$ are strictly increasing. Hence, $f$ is strictly increasing. The second-order derivative is
$$\begin{aligned}
f''(x) &= \frac{u''(v^{-1}(x)) - v''(v^{-1}(x))\,u'(v^{-1}(x))/v'(v^{-1}(x))}{v'(v^{-1}(x))^2} \\
&= \frac{u'(v^{-1}(x))}{v'(v^{-1}(x))^2}\left(\mathrm{ARA}_v(v^{-1}(x)) - \mathrm{ARA}_u(v^{-1}(x))\right).
\end{aligned}$$
From (b), it follows that $f''(x) \leq 0$, hence $f$ is concave.
Finally, we show that (c) $\Rightarrow$ (a): assume that for some consumption plan $c$ and some $\bar{c} \in Z$, we have $\mathrm{E}[u(c)] \geq u(\bar{c})$ but $\mathrm{E}[v(c)] < v(\bar{c})$. We want to arrive at a contradiction.
$$\begin{aligned}
f(v(\bar{c})) = u(\bar{c}) \leq \mathrm{E}[u(c)] &= \mathrm{E}[f(v(c))] \\
&\leq f(\mathrm{E}[v(c)]) \\
&< f(v(\bar{c})),
\end{aligned}$$
where we use the concavity of $f$ and Jensen's Inequality to go from the first to the second line, and we use that $f$ is strictly increasing to go from the second to the third line. Now the contradiction is clear.
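The equivalences in Theorem 2.5 can be illustrated with two power utility functions $u$ and $v$ of the form $c^{1-\gamma}/(1-\gamma)$ with $\gamma_u = 5$ and $\gamma_v = 2$, so that $\mathrm{ARA}_u(c) = 5/c \geq 2/c = \mathrm{ARA}_v(c)$. The sketch below, with made-up numbers and hypothetical helper names, checks that the more risk-averse individual assigns a lower certainty equivalent to the same gamble and that $f = u \circ v^{-1}$ is increasing and concave on a grid.

```python
def crra(c, gamma):
    """Power utility c^(1 - gamma) / (1 - gamma), for gamma != 1."""
    return c ** (1.0 - gamma) / (1.0 - gamma)

def certainty_equivalent(outcomes, probs, gamma):
    expected_utility = sum(p * crra(c, gamma) for p, c in zip(probs, outcomes))
    return ((1.0 - gamma) * expected_utility) ** (1.0 / (1.0 - gamma))

gamma_u, gamma_v = 5.0, 2.0                     # illustrative risk aversion levels
outcomes, probs = [50.0, 150.0], [0.5, 0.5]     # hypothetical gamble

ce_u = certainty_equivalent(outcomes, probs, gamma_u)
ce_v = certainty_equivalent(outcomes, probs, gamma_v)
print(ce_u, ce_v)

def f(x):
    """f = u o v^{-1}. Here v(c) = -1/c, so v^{-1}(x) = -1/x for x < 0."""
    return crra(-1.0 / x, gamma_u)

# Evenly spaced grid of v-utility levels (all negative).
xs = [-0.02 + 0.001 * i for i in range(15)]
increasing = all(f(a) < f(b) for a, b in zip(xs, xs[1:]))
# Midpoint concavity on an evenly spaced grid: f(middle) >= average of the neighbours.
concave = all(f(b) >= 0.5 * (f(a) + f(c)) for a, b, c in zip(xs, xs[1:], xs[2:]))
print(increasing, concave)
```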
2.6 Utility functions in models and in reality
2.6.1 Frequently applied utility functions
CRRA utility. (Also known as power utility or isoelastic utility.) Utility functions $u(c)$ in this class are defined for $c \geq 0$:
$$u(c) = \frac{c^{1-\gamma}}{1-\gamma}, \tag{2.4}$$
where $\gamma > 0$ and $\gamma \neq 1$. Since
$$u'(c) = c^{-\gamma} \quad\text{and}\quad u''(c) = -\gamma c^{-\gamma-1},$$
the absolute and relative risk aversions are given by
$$\mathrm{ARA}(c) = -\frac{u''(c)}{u'(c)} = \frac{\gamma}{c}, \qquad \mathrm{RRA}(c) = c\,\mathrm{ARA}(c) = \gamma.$$
The relative risk aversion is constant across consumption levels $c$, hence the name CRRA (Constant Relative Risk Aversion) utility. Note that $u'(0+) \equiv \lim_{c \to 0} u'(c) = \infty$ with the consequence that an optimal solution will have the property that consumption/wealth $c$ will be strictly above 0 with probability one. Hence, we can ignore the very appropriate non-negativity constraint on consumption since the constraint will never be binding. Furthermore, $u'(\infty) \equiv \lim_{c \to \infty} u'(c) = 0$.
Some authors assume a utility function of the form $u(c) = c^{1-\gamma}$, which only makes sense for $\gamma \in (0,1)$. However, empirical studies indicate that most investors have a relative risk aversion above 1, cf. the discussion below. The absolute risk tolerance is linear in $c$:
$$\mathrm{ART}(c) = \frac{1}{\mathrm{ARA}(c)} = \frac{c}{\gamma}.$$
Except for a constant, the utility function
$$u(c) = \frac{c^{1-\gamma} - 1}{1-\gamma}$$
is identical to the utility function specified in (2.4). The two utility functions are therefore equivalent in the sense that they generate identical rankings of consumption plans and, in particular, identical optimal choices. The advantage in using the latter definition is that this function has a well-defined limit as $\gamma \to 1$. From l'Hôpital's rule we have that
$$\lim_{\gamma \to 1} \frac{c^{1-\gamma} - 1}{1-\gamma} = \lim_{\gamma \to 1} \frac{-c^{1-\gamma}\ln c}{-1} = \ln c,$$
which is the important special case of logarithmic utility. When we consider CRRA utility, we will assume the simpler version (2.4), but we will use the fact that we can obtain the optimal strategies of a log-utility investor as the limit of the optimal strategies of the general CRRA investor as $\gamma \to 1$.
Some CRRA utility functions are illustrated in Figure 2.1.
[Figure 2.1: Some CRRA utility functions, with curves for RRA = 0.5, RRA = 1, RRA = 2, and RRA = 5.]
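The convergence of the shifted power utility function to logarithmic utility as the relative risk aversion approaches one is easy to check numerically. The consumption level $c = 2$ and the sequence of risk aversion values below are arbitrary illustrative choices, and `shifted_crra` is a hypothetical name.

```python
import math

def shifted_crra(c, gamma):
    """The equivalent power utility (c^(1 - gamma) - 1) / (1 - gamma), for gamma != 1."""
    return (c ** (1.0 - gamma) - 1.0) / (1.0 - gamma)

c = 2.0
for gamma in (2.0, 1.5, 1.1, 1.01, 1.0001):
    # The gap to ln(c) shrinks as gamma approaches 1.
    print(gamma, shifted_crra(c, gamma), math.log(c))
```

The same convergence holds when the limit is approached from below, e.g., with values such as 0.9999.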
HARA utility. (Also known as extended power utility.) The absolute risk aversion for CRRA utility is hyperbolic in $c$. More generally a utility function is said to be a HARA (Hyperbolic Absolute Risk Aversion) utility function if
$$\mathrm{ARA}(c) = -\frac{u''(c)}{u'(c)} = \frac{1}{\alpha c + \beta}$$
for some constants $\alpha, \beta$ such that $\alpha c + \beta > 0$ for all relevant $c$. HARA utility functions are sometimes referred to as affine (or linear) risk tolerance utility functions since the absolute risk tolerance is
$$\mathrm{ART}(c) = \frac{1}{\mathrm{ARA}(c)} = \alpha c + \beta.$$
The risk cautiousness is $\mathrm{ART}'(c) = \alpha$.
What do the HARA utility functions look like? First, let us take the case $\alpha = 0$, which implies that the absolute risk aversion is constant (so-called CARA utility) and $\beta$ must be positive.
$$\frac{d(\ln u'(c))}{dc} = \frac{u''(c)}{u'(c)} = -\frac{1}{\beta}$$
implies that
$$\ln u'(c) = -\frac{c}{\beta} + k_1 \quad\Rightarrow\quad u'(c) = e^{k_1}e^{-c/\beta}$$
for some constant $k_1$. Hence,
$$u(c) = -\beta e^{k_1}e^{-c/\beta} + k_2$$
for some other constant $k_2$. Applying the fact that increasing affine transformations do not change decisions, the basic representative of this class of utility functions is the negative exponential utility function
$$u(c) = -e^{-ac}, \quad c \in \mathbb{R},$$
where the parameter $a = 1/\beta$ is the absolute risk aversion. Constant absolute risk aversion is certainly not very reasonable. Nevertheless, the negative exponential utility function is sometimes used for computational purposes in connection with normally distributed returns, e.g., in one-period models.
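The computational convenience comes from a standard fact about the normal distribution: if $c \sim N(\mu, \sigma^2)$, the moment generating function gives $\mathrm{E}[-e^{-ac}] = -\exp(-a\mu + \frac{1}{2}a^2\sigma^2)$, so the certainty equivalent is $\mu - \frac{1}{2}a\sigma^2$. The sketch below checks this closed form against a plain trapezoid-rule integration; the parameter values and the function name `cara_eu_normal` are illustrative assumptions, not part of the text.

```python
import math

def cara_eu_normal(a, mu, sigma, n=20001, width=10.0):
    """Trapezoid-rule estimate of E[-exp(-a*c)] for c ~ N(mu, sigma^2)."""
    lo = mu - width * sigma
    step = 2.0 * width * sigma / (n - 1)
    total = 0.0
    for i in range(n):
        c = lo + i * step
        density = math.exp(-0.5 * ((c - mu) / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))
        weight = 0.5 if i in (0, n - 1) else 1.0  # end points get half weight
        total -= weight * math.exp(-a * c) * density * step
    return total

a, mu, sigma = 0.5, 10.0, 2.0  # illustrative values only
eu_numeric = cara_eu_normal(a, mu, sigma)
eu_closed = -math.exp(-a * mu + 0.5 * a ** 2 * sigma ** 2)

# Invert u(c) = -exp(-a*c) to obtain the certainty equivalent.
certainty_equivalent = -math.log(-eu_numeric) / a
print(eu_numeric, eu_closed)
print(certainty_equivalent, mu - 0.5 * a * sigma ** 2)
```

In this setting the risk premium $\frac{1}{2}a\sigma^2$ does not depend on $\mu$, which is the constant absolute risk aversion property in action.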
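Next, consider the case $\alpha \neq 0$. Applying the same procedure as above we find
$$\frac{d(\ln u'(c))}{dc} = \frac{u''(c)}{u'(c)} = -\frac{1}{\alpha c + \beta} \quad\Rightarrow\quad \ln u'(c) = -\frac{1}{\alpha}\ln(\alpha c + \beta) + k_1$$
so that
$$u'(c) = e^{k_1}\exp\left\{-\frac{1}{\alpha}\ln(\alpha c + \beta)\right\} = e^{k_1}(\alpha c + \beta)^{-1/\alpha}. \tag{2.5}$$
For $\alpha = 1$ this implies that
$$u(c) = e^{k_1}\ln(c + \beta) + k_2.$$
The basic representative of such utility functions is the extended log utility function
$$u(c) = \ln(c - \bar{c}), \quad c > \bar{c},$$
where we have replaced $\beta$ by $-\bar{c}$. For $\alpha \neq 1$, Equation (2.5) implies that
$$u(c) = \frac{1}{\alpha}e^{k_1}\frac{1}{1 - \frac{1}{\alpha}}(\alpha c + \beta)^{1-1/\alpha} + k_2.$$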
For $\alpha < 0$, we can write the basic representative as
$$u(c) = -(\bar{c} - c)^{1-\gamma}, \quad c < \bar{c},$$
where $\gamma = 1/\alpha < 0$. We can think of $\bar{c}$ as a satiation level and call this subclass satiation HARA utility functions. The absolute risk aversion is
$$\mathrm{ARA}(c) = \frac{-\gamma}{\bar{c} - c},$$
which is increasing in $c$, conflicting with intuition and empirical studies. Some older financial models used the quadratic utility function, which is the special case with $\gamma = -1$ so that $u(c) = -(\bar{c} - c)^2$. An equivalent utility function is $u(c) = c - ac^2$.
For $\alpha > 0$ (and $\alpha \neq 1$), the basic representative is
\[ u(c) = \frac{(c - \bar{c})^{1-\gamma}}{1-\gamma}, \quad c > \bar{c}, \]
where $\gamma = 1/\alpha > 0$. The limit as $\gamma \to 1$ of the equivalent utility function
$\frac{(c - \bar{c})^{1-\gamma} - 1}{1-\gamma}$ is equal to the
extended log utility function $u(c) = \ln(c - \bar{c})$. We can think of $\bar{c}$ as a subsistence level of wealth or
consumption (which makes sense only if $\bar{c} \geq 0$) and refer to this subclass as subsistence HARA
utility functions. The absolute and relative risk aversions are
\[ \mathrm{ARA}(c) = \frac{\gamma}{c - \bar{c}}, \qquad \mathrm{RRA}(c) = \frac{\gamma c}{c - \bar{c}} = \frac{\gamma}{1 - (\bar{c}/c)}, \]
which are both decreasing in $c$. The relative risk aversion approaches $\infty$ for $c \to \bar{c}$ and decreases
to the constant $\gamma$ for $c \to \infty$. Clearly, for $\bar{c} = 0$, we are back to the CRRA utility functions so that
these also belong to the HARA family.
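As a quick numerical illustration of the subsistence HARA case, the following minimal sketch compares the closed-form risk aversions above with finite-difference approximations computed directly from the utility function. The function names, the step size `h`, and the parameter values are our own illustrative assumptions, not part of the text.

```python
def hara_utility(c, gamma, c_bar):
    """Subsistence HARA utility (c - c_bar)^(1-gamma) / (1-gamma), gamma != 1."""
    return (c - c_bar) ** (1.0 - gamma) / (1.0 - gamma)


def ara_closed_form(c, gamma, c_bar):
    """Absolute risk aversion gamma / (c - c_bar)."""
    return gamma / (c - c_bar)


def rra_closed_form(c, gamma, c_bar):
    """Relative risk aversion gamma * c / (c - c_bar)."""
    return c * ara_closed_form(c, gamma, c_bar)


def ara_numeric(c, gamma, c_bar, h=1e-4):
    """-u''(c)/u'(c) approximated by central finite differences (h is an assumption)."""
    up = hara_utility(c + h, gamma, c_bar)
    mid = hara_utility(c, gamma, c_bar)
    down = hara_utility(c - h, gamma, c_bar)
    u1 = (up - down) / (2.0 * h)
    u2 = (up - 2.0 * mid + down) / (h * h)
    return -u2 / u1


if __name__ == "__main__":
    gamma, c_bar = 2.0, 1.0  # illustrative values
    for c in (1.5, 2.0, 10.0, 1000.0):
        print(c, ara_closed_form(c, gamma, c_bar), ara_numeric(c, gamma, c_bar),
              rra_closed_form(c, gamma, c_bar))
```

For consumption far above the subsistence level, the printed relative risk aversion should be close to `gamma`, in line with the limit stated above.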
Mean-variance preferences. For some problems it is convenient to assume that the expected
utility associated with an uncertain consumption plan only depends on the expected value and the
variance of the consumption plan. This is certainly true if the consumption plan is a normally
distributed random variable since its probability distribution is fully characterized by the mean and
variance. However, it is generally not appropriate to use a normal distribution for consumption
(or wealth or asset returns).
For a quadratic utility function, $u(c) = c - a c^2$, the expected utility is
\[ E[u(c)] = E\left[ c - a c^2 \right] = E[c] - a E\left[ c^2 \right] = E[c] - a \left( \mathrm{Var}[c] + E[c]^2 \right), \]
which is indeed a function of the expected value and the variance of the consumption plan. Alas,
the quadratic utility function is inappropriate for several reasons. Most importantly, it exhibits
increasing absolute risk aversion.
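The identity for quadratic utility can be checked on any discrete distribution. The sketch below is only an illustration; the outcomes, probabilities, and the value of `a` are arbitrary assumptions. It computes the expected utility once directly and once from the mean and variance.

```python
def quadratic_expected_utility(outcomes, probs, a):
    """Return E[c - a*c^2] computed directly and via mean and variance."""
    direct = sum(p * (c - a * c * c) for c, p in zip(outcomes, probs))
    mean = sum(p * c for c, p in zip(outcomes, probs))
    var = sum(p * (c - mean) ** 2 for c, p in zip(outcomes, probs))
    via_moments = mean - a * (var + mean ** 2)
    return direct, via_moments


if __name__ == "__main__":
    # Illustrative three-point distribution (an assumption, not from the text).
    direct, via_moments = quadratic_expected_utility(
        outcomes=[5.0, 15.0, 30.0], probs=[1 / 3, 5 / 9, 1 / 9], a=0.01)
    print(direct, via_moments)
```

The two numbers agree up to floating-point rounding, whatever distribution is used, which is exactly what makes quadratic utility a mean-variance preference.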
For a general utility function the expected utility of a consumption plan will depend on all
moments. This can be seen by the Taylor expansion of $u(c)$ around the expected consumption,
$E[c]$:
\[ u(c) = u(E[c]) + u'(E[c])(c - E[c]) + \frac{1}{2} u''(E[c])(c - E[c])^2 + \sum_{n=3}^{\infty} \frac{1}{n!} u^{(n)}(E[c])(c - E[c])^n, \]
where $u^{(n)}$ is the $n$th derivative of $u$. Taking expectations, we get
\[ E[u(c)] = u(E[c]) + \frac{1}{2} u''(E[c]) \mathrm{Var}[c] + \sum_{n=3}^{\infty} \frac{1}{n!} u^{(n)}(E[c]) \, E\left[(c - E[c])^n\right]. \]
Here $E[(c - E[c])^n]$ is the central moment of order $n$. The variance is the central moment of order 2.
Obviously, a greedy investor (which just means that $u$ is increasing) will prefer higher expected
consumption to lower for fixed central moments of order 2 and higher. Moreover, a risk-averse
investor (so that $u'' < 0$) will prefer lower variance of consumption to higher for fixed expected
consumption and fixed central moments of order 3 and higher. But when the central moments
of order 3 and higher are not the same for all alternatives, we cannot just evaluate them on the
basis of their expectation and variance. With quadratic utility, the derivatives of $u$ of order 3
and higher are zero so there it works. In general, mean-variance preferences can only serve as an
approximation of the true utility function.
2.6.2 What do we know about individuals' risk aversion?
From our discussion of risk aversion and various utility functions we expect that individuals are
risk averse and exhibit decreasing absolute risk aversion. But can this be supported by empirical
evidence? Do individuals have constant relative risk aversion? And what is a reasonable level of
risk aversion for individuals?
You can get an idea of the risk attitudes of an individual by observing how they choose between
risky alternatives. Some researchers have studied this by setting up laboratory experiments in
which they present some risky alternatives to a group of individuals and simply see what they
prefer. Some of these experiments suggest that expected utility theory is frequently violated,
see e.g., Grether and Plott (1979). However, laboratory experiments are problematic for several
reasons. You cannot be sure that individuals will make the same choice in what they know is an
experiment as they would in real life. It is also hard to formulate alternatives that resemble the
rather complex real-life decisions. It seems more fruitful to study actual data on how individuals
have acted confronted with real-life decision problems under uncertainty. A number of studies do
that.
Friend and Blume (1975) analyze data on household asset holdings. They conclude that the
data is consistent with individuals having roughly constant relative risk aversion and that the
coefficients of relative risk aversion are "on average well in excess of one and probably in excess of
two" (quote from page 900 in their paper). Pindyck (1988) finds support of a relative risk aversion
between 3 and 4 in a structural model of the reaction of stock prices to fundamental variables.
Other studies are based on insurance data. Using U.S. data on so-called property/liability
insurance, Szpiro (1986) finds support of CRRA utility with a relative risk aversion coefficient
between 1.2 and 1.8. Cicchetti and Dubin (1994) work with data from the U.S. on whether
individuals purchased an insurance against the risk of trouble with their home telephone line.
They conclude that the data is consistent with expected utility theory and that a subsistence
HARA utility function performs better than log utility or negative exponential utility.
Ogaki and Zhang (2001) study data on individual food consumption from Pakistan and India
and conclude that relative risk aversion is decreasing for poor individuals, which is consistent with
a subsistence HARA utility function.
It is an empirical fact that even though consumption and wealth have increased tremendously
over the years, the magnitude of real rates of return has not changed dramatically. As indicated
by (2.3) relative risk premia are approximately proportional to the relative risk aversion. As
discussed in, e.g., Munk (2012), basic asset pricing theory implies that relative risk premia on
financial assets (in terms of expected real return in excess of the real risk-free return) will be
proportional to the average relative risk aversion in the economy. If the average relative risk
aversion were significantly decreasing (increasing) in the level of consumption or wealth, we should
have seen decreasing (increasing) real returns on risky assets in the past. The data seems to be
consistent with individuals having on average close to CRRA utility.
To get a feeling of what a given risk aversion really means, suppose you are confronted with
two consumption plans. One plan is a sure consumption of $\bar{c}$, the other plan gives you $(1 - \alpha)\bar{c}$
with probability 0.5 and $(1 + \alpha)\bar{c}$ with probability 0.5. If you have a CRRA utility function
$u(c) = c^{1-\gamma}/(1-\gamma)$, the certainty equivalent $c^*$ of the risky plan is determined by
\[ \frac{1}{1-\gamma} (c^*)^{1-\gamma} = \frac{1}{2} \frac{1}{1-\gamma} \left( (1 - \alpha)\bar{c} \right)^{1-\gamma} + \frac{1}{2} \frac{1}{1-\gamma} \left( (1 + \alpha)\bar{c} \right)^{1-\gamma}, \]
which implies that
\[ c^* = \left( \frac{1}{2} \right)^{1/(1-\gamma)} \left[ (1 - \alpha)^{1-\gamma} + (1 + \alpha)^{1-\gamma} \right]^{1/(1-\gamma)} \bar{c}. \]
The risk premium $\lambda(\bar{c}, \alpha)$ is
\[ \lambda(\bar{c}, \alpha) = \bar{c} - c^* = \left( 1 - \left( \frac{1}{2} \right)^{1/(1-\gamma)} \left[ (1 - \alpha)^{1-\gamma} + (1 + \alpha)^{1-\gamma} \right]^{1/(1-\gamma)} \right) \bar{c}. \]
Both the certainty equivalent and the risk premium are thus proportional to the consumption
level $\bar{c}$. The relative risk premium $\lambda(\bar{c}, \alpha)/\bar{c}$ is simply one minus the relative certainty equivalent
$c^*/\bar{c}$. These equations assume $\gamma \neq 1$. In Exercise 2.5 you are asked to find the certainty equivalent
and risk premium for log-utility corresponding to $\gamma = 1$.

Table 2.5: Relative risk premia for a fair gamble of the fraction $\alpha$ of your consumption.

$\gamma$ = RRA    $\alpha$ = 1%    $\alpha$ = 10%    $\alpha$ = 50%
0.5               0.00%            0.25%             6.70%
1                 0.01%            0.50%             13.40%
2                 0.01%            1.00%             25.00%
5                 0.02%            2.43%             40.72%
10                0.05%            4.42%             46.00%
20                0.10%            6.76%             48.14%
50                0.24%            8.72%             49.29%
100               0.43%            9.37%             49.65%

Table 2.5 shows the relative risk premium for various values of the relative risk aversion coefficient
$\gamma$ and various values of $\alpha$, the "size" of the risk. For example, an individual with $\gamma = 5$ is willing to
sacrifice 2.43% of the safe consumption in order to avoid a fair gamble of 10% of that consumption
level. Of course, even extremely risk averse individuals will not sacrifice more than they can lose
but in some cases it is pretty close. Looking at these numbers, it is hard to believe in $\gamma$-values
outside, say, $[1, 10]$. In Exercise 2.6 you are asked to compare the exact relative risk premia shown
in the table with the approximate risk premia given by (2.3).
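The entries of Table 2.5 can be reproduced with a few lines of code. The sketch below is a minimal illustration under the assumptions of the text (CRRA utility and a 50/50 gamble of the fraction alpha of consumption); the function name is ours, and the log-utility branch simply evaluates the certainty equivalent as the inverse utility of expected utility.

```python
import math


def relative_risk_premium(gamma, alpha):
    """Relative risk premium 1 - c*/c_bar for a 50/50 gamble of size alpha."""
    if gamma == 1.0:
        # Log utility: certainty equivalent is exp(E[ln c]).
        rel_ce = math.exp(0.5 * math.log(1.0 - alpha) + 0.5 * math.log(1.0 + alpha))
    else:
        e = 1.0 - gamma
        rel_ce = (0.5 * ((1.0 - alpha) ** e + (1.0 + alpha) ** e)) ** (1.0 / e)
    return 1.0 - rel_ce


if __name__ == "__main__":
    for gamma in (0.5, 1.0, 2.0, 5.0, 10.0, 20.0, 50.0, 100.0):
        row = [100.0 * relative_risk_premium(gamma, a) for a in (0.01, 0.10, 0.50)]
        print(gamma, ["%.2f%%" % x for x in row])
```

Since the premium is proportional to the consumption level, the level itself drops out and only gamma and alpha matter.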
2.6.3 Two-good utility functions and the elasticity of substitution
Consider an atemporal utility function $f(c, z)$ of the consumption of two different goods at
the same time. An indifference curve in the $(c, z)$-space is characterized by $f(c, z) = k$ for some
constant $k$. Changes in $c$ and $z$ along an indifference curve are linked by
\[ \frac{\partial f}{\partial c} \, dc + \frac{\partial f}{\partial z} \, dz = 0 \]
so that the slope of the indifference curve (also known as the marginal rate of substitution) is
\[ \frac{dz}{dc} = -\frac{\partial f / \partial c}{\partial f / \partial z}. \]
Unless the indifference curve is linear, its slope will change along the curve. Indifference curves are
generally assumed to be convex. The elasticity of substitution tells you by which percentage you
need to change $z/c$ in order to obtain a one percent change in the slope of the indifference curve. It
is a measure of the curvature or convexity of the indifference curve. If the indifference curve is very
curved, you only have to move a little along the curve before its slope has changed by one percent.
Hence, the elasticity of substitution is low. If the indifference curve is almost linear, you have to
move far away to change the slope by one percent. In that case the elasticity of substitution is
very high. Formally, the elasticity of substitution is defined as
\[ \psi = -\frac{d\left( \frac{z}{c} \right) \Big/ \frac{z}{c}}{d\left( \frac{\partial f / \partial z}{\partial f / \partial c} \right) \Big/ \frac{\partial f / \partial z}{\partial f / \partial c}} = -\frac{\frac{\partial f}{\partial z} \Big/ \frac{\partial f}{\partial c}}{z/c} \, \frac{d(z/c)}{d\left( \frac{\partial f}{\partial z} \Big/ \frac{\partial f}{\partial c} \right)}, \]
which is equivalent to
\[ \psi = -\frac{d \ln(z/c)}{d \ln\left( \frac{\partial f}{\partial z} \Big/ \frac{\partial f}{\partial c} \right)}. \]
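Assume now that
\[ f(c, z) = \left( a c^{\alpha} + b z^{\alpha} \right)^{1/\alpha}, \tag{2.6} \]
where $\alpha < 1$ and $\alpha \neq 0$. Then
\[ \frac{\partial f}{\partial c} = a c^{\alpha - 1} \left( a c^{\alpha} + b z^{\alpha} \right)^{\frac{1}{\alpha} - 1}, \qquad \frac{\partial f}{\partial z} = b z^{\alpha - 1} \left( a c^{\alpha} + b z^{\alpha} \right)^{\frac{1}{\alpha} - 1}, \]
and thus
\[ \frac{\partial f / \partial z}{\partial f / \partial c} = \frac{b}{a} \left( \frac{z}{c} \right)^{\alpha - 1}. \]
Computing the derivative with respect to $z/c$, we get
\[ \frac{d\left( \frac{\partial f / \partial z}{\partial f / \partial c} \right)}{d\left( \frac{z}{c} \right)} = \frac{b}{a} (\alpha - 1) \left( \frac{z}{c} \right)^{\alpha - 2} \]
and thus
\[ \psi = -\frac{\frac{b}{a} \left( \frac{z}{c} \right)^{\alpha - 1}}{\frac{z}{c}} \cdot \frac{1}{\frac{b}{a} (\alpha - 1) \left( \frac{z}{c} \right)^{\alpha - 2}} = -\frac{1}{\alpha - 1} = \frac{1}{1 - \alpha}, \]
which is independent of $(c, z)$. Therefore the utility function (2.6) is referred to as CES (Constant
Elasticity of Substitution) utility.
For the Cobb-Douglas utility function
\[ f(c, z) = c^{a} z^{1-a}, \quad 0 < a < 1, \tag{2.7} \]
the elasticity of substitution equals 1. In fact, the Cobb-Douglas utility function (2.7)
can be seen as the limit of the utility function (2.6) assuming $b = 1 - a$ as $\alpha \to 0$.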
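The constant elasticity of the CES function (2.6) can also be checked numerically. The sketch below is only a rough illustration: it approximates the two partial derivatives by central differences (the step sizes `h` and `rel` are our own assumptions) and then evaluates the log-derivative definition of the elasticity of substitution at a chosen point.

```python
import math


def ces(c, z, a, b, alpha):
    """CES utility (a*c^alpha + b*z^alpha)^(1/alpha), alpha < 1, alpha != 0."""
    return (a * c ** alpha + b * z ** alpha) ** (1.0 / alpha)


def marginal_ratio(c, z, a, b, alpha, h=1e-6):
    """(df/dz) / (df/dc) by central finite differences."""
    f_c = (ces(c + h, z, a, b, alpha) - ces(c - h, z, a, b, alpha)) / (2.0 * h)
    f_z = (ces(c, z + h, a, b, alpha) - ces(c, z - h, a, b, alpha)) / (2.0 * h)
    return f_z / f_c


def elasticity_of_substitution(c, z, a, b, alpha, rel=1e-3):
    """-d ln(z/c) / d ln(f_z/f_c), varying z while holding c fixed."""
    z_up, z_down = z * (1.0 + rel), z * (1.0 - rel)
    d_ln_ratio = math.log(z_up / c) - math.log(z_down / c)
    d_ln_marginal = (math.log(marginal_ratio(c, z_up, a, b, alpha))
                     - math.log(marginal_ratio(c, z_down, a, b, alpha)))
    return -d_ln_ratio / d_ln_marginal


if __name__ == "__main__":
    for alpha in (0.5, -1.0, -4.0):
        print(alpha, elasticity_of_substitution(1.0, 2.0, 0.4, 0.6, alpha),
              1.0 / (1.0 - alpha))
```

The numerical value should be close to $1/(1-\alpha)$ at any point $(c, z)$ with positive coordinates, which is the sense in which the elasticity is constant.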
2.7 Preferences for multi-date consumption plans
Above we implicitly considered preferences for consumption at one given future point in time.
We need to generalize the ideas and results to settings with consumption at several dates. In
one-period models individuals can consume both at time 0 (beginning-of-period) and at time 1
(end-of-period). In multi-period models individuals can consume either at each date in the discrete
time set $\mathcal{T} = \{0, 1, 2, \dots, T\}$ or at each date in the continuous time set $\mathcal{T} = [0, T]$. In any case a
consumption plan is a stochastic process $c = (c_t)_{t \in \mathcal{T}}$ where each $c_t$ is a random variable representing
the state-dependent level of consumption at time $t$.
Consider the discrete-time case and, for each $t$, let $Z_t \subseteq \mathbb{R}$ denote the set of all possible consumption
levels at date $t$ and define $Z = Z_0 \times Z_1 \times \dots \times Z_T \subseteq \mathbb{R}^{T+1}$. Then any consumption plan $c$ can
again be represented by a probability distribution on the set $Z$. For finite $Z$, we can again apply
Theorem 2.1 so that under the relevant axioms, we can represent preferences by a utility index $\mathcal{U}$,
which to each consumption plan $(c_t)_{t \in \mathcal{T}} = (c_0, c_1, \dots, c_T)$ attaches a real number $\mathcal{U}(c_0, c_1, \dots, c_T)$
with higher numbers to the more preferred consumption plans. If we further impose the Substitution
Axiom, Theorem 2.2 ensures an expected utility representation, i.e., the existence of a utility
function $U : Z \to \mathbb{R}$ so that consumption plans are ranked according to their expected utility, i.e.,
\[ \mathcal{U}(c_0, c_1, \dots, c_T) = E[U(c_0, c_1, \dots, c_T)] \equiv \sum_{\omega \in \Omega} p_{\omega} U\left( c_0, c_1(\omega), \dots, c_T(\omega) \right). \]
We can call $U$ a multi-date utility function since it depends on the consumption levels at all
dates. Again this result can be extended to the case of an infinite $Z$, e.g., $Z = \mathbb{R}_+^{T+1}$, but also
to continuous-time settings where $U$ will then be a function of the entire consumption process
$c = (c_t)_{t \in [0,T]}$.
2.7.1 Additively time-separable expected utility
Often time-additivity is assumed so that the utility the individual gets from consumption in
one period does not directly depend on what she consumed in earlier periods or what she plans to
consume in later periods. For the discrete-time case, this means that
\[ U(c_0, c_1, \dots, c_T) = \sum_{t=0}^{T} u_t(c_t) \]
where each $u_t$ is a valid "single-date" utility function. Still, when the individual has to choose her
current consumption rate, she will take her prospects for future consumption into account. The
continuous-time analogue is
\[ U\left( (c_t)_{t \in [0,T]} \right) = \int_0^T u_t(c_t) \, dt. \]
In addition it is typically assumed that $u_t(c_t) = e^{-\delta t} u(c_t)$ for all $t$. This is to say that the direct
utility the individual gets from a given consumption level is basically the same for all dates, but
the individual prefers to consume any given number of goods sooner than later. This is modeled by
the subjective time preference rate $\delta$, which we assume to be constant over time and independent
of the consumption level. More impatient individuals have higher $\delta$'s. In sum, the life-time utility
is typically assumed to be given by
\[ U(c_0, c_1, \dots, c_T) = \sum_{t=0}^{T} e^{-\delta t} u(c_t) \]
in discrete-time models and
\[ U\left( (c_t)_{t \in [0,T]} \right) = \int_0^T e^{-\delta t} u(c_t) \, dt \]
in continuous-time models. In both cases, $u$ is a "single-date" utility function such as those
discussed in Section 2.6.[1]
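To make the role of the time preference rate concrete, the following minimal sketch evaluates the discrete-time life-time utility for a given deterministic consumption plan with power utility. The function names and the two example plans are illustrative assumptions; with uncertainty one would in addition take the expectation over states.

```python
import math


def power_utility(c, gamma):
    """Single-date CRRA utility; log utility when gamma equals 1."""
    if gamma == 1.0:
        return math.log(c)
    return c ** (1.0 - gamma) / (1.0 - gamma)


def lifetime_utility(plan, delta, gamma):
    """Sum over t of exp(-delta*t) * u(c_t) for a deterministic plan (c_0, ..., c_T)."""
    return sum(math.exp(-delta * t) * power_utility(c, gamma)
               for t, c in enumerate(plan))


if __name__ == "__main__":
    early, late = [3.0, 2.0, 1.0], [1.0, 2.0, 3.0]  # same goods, different timing
    for delta in (0.0, 0.05, 0.5):
        print(delta, lifetime_utility(early, delta, 2.0),
              lifetime_utility(late, delta, 2.0))
```

With $\delta = 0$ the two plans give the same life-time utility, while any $\delta > 0$ makes the front-loaded plan strictly preferred, which is the sense in which $\delta$ measures impatience.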
Time-additivity is mostly assumed for tractability. However, it is important to realize that the
time-additive specification does not follow from the basic axioms of choice under uncertainty, but
is in fact a strong assumption, which most economists agree is not very realistic. One problem
is that time-additive preferences induce a close link between the reluctance to substitute consumption
across different states of the economy (which is measured by risk aversion) and the
willingness to substitute consumption over time (which can be measured by the so-called elasticity
of intertemporal substitution). Solving intertemporal utility maximization problems of individuals
with time-additive CRRA utility, it turns out that an individual with a high relative risk aversion
will also choose a very smooth consumption process, i.e., she will have a low elasticity of intertemporal
substitution. There is nothing in the basic theory of choice that links the risk aversion and
the elasticity of intertemporal substitution together. For one thing, risk aversion makes sense even
in an atemporal (i.e., one-date) setting where intertemporal substitution is meaningless and, conversely,
intertemporal substitution makes sense in a multi-period setting without uncertainty in
which risk aversion is meaningless. The close link between the two concepts in the multi-period
model with uncertainty is an unfortunate consequence of the assumption of time-additive expected
utility.
According to Browning (1991), non-additive preferences were already discussed in the 1890 book
"Principles of Economics" by Alfred Marshall. See Browning's paper for further references to the
critique on intertemporally separable preferences. Let us consider some alternatives that are more
general and still tractable.
2.7.2 Habit formation and state-dependent utility
The key idea of habit formation is to let the utility associated with the choice of consumption at
a given date depend on past choices of consumption. In a discrete-time setting the utility index of
a given consumption process $c$ is now given as $E\left[ \sum_{t=0}^{T} e^{-\delta t} u(c_t, h_t) \right]$, where $h_t$ is a measure of the
standard of living or the habit level of consumption, e.g., a weighted average of past consumption
rates such as
\[ h_t = h_0 e^{-\beta t} + \alpha \sum_{s=1}^{t-1} e^{-\beta (t-s)} c_s, \]
where $h_0$, $\alpha$, and $\beta$ are non-negative constants. It is assumed that $u$ is decreasing in $h$ so that
high past consumption generates a desire for high current consumption, i.e., preferences display
intertemporal complementarity. In particular, models where $u(c, h)$ is assumed to be of the power-linear
form,
\[ u(c, h) = \frac{1}{1-\gamma} (c - h)^{1-\gamma}, \quad \gamma > 0, \; c \geq h, \]
turn out to be computationally tractable. This is closely related to the subsistence HARA utility,
but with habit formation the "subsistence level" $h$ is endogenously determined by past consumption.
The corresponding absolute and relative risk aversions are
\[ \mathrm{ARA}(c, h) \equiv -\frac{u_{cc}(c, h)}{u_c(c, h)} = \frac{\gamma}{c - h}, \qquad \mathrm{RRA}(c, h) \equiv -c \, \frac{u_{cc}(c, h)}{u_c(c, h)} = \frac{\gamma c}{c - h}, \tag{2.8} \]
where $u_c$ and $u_{cc}$ are the first- and second-order derivatives of $u$ with respect to $c$. In particular,
the relative risk aversion is decreasing in $c$. Note that the habit formation preferences are still
consistent with expected utility.

[Footnote 1] Some utility functions are negative, including the frequently used power utility $u(c) = c^{1-\gamma}/(1-\gamma)$ with a
constant relative risk aversion $\gamma > 1$. When $\delta > 0$, we will then have that $e^{-\delta t} u(c)$ is in fact bigger (less negative)
than $u(c)$, which may seem to destroy the interpretation of $\delta$ stated in the text. However, for the decisions made by
the investor it is the marginal utilities that matter and, when $\delta > 0$ and $u$ is increasing, $e^{-\delta t} u'(c)$ will be smaller
than $u'(c)$ so that, other things equal, the individual will choose higher current than future consumption. Therefore,
it is fair to interpret $\delta$ as a time preference rate and expect it to be positive.
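The habit level and the resulting time-varying risk aversion are easy to compute for a given consumption history. The sketch below is a minimal illustration of the weighted-average habit formula and of (2.8); the function names, the list-based indexing of consumption by date, and the parameter values are our own assumptions.

```python
import math


def habit_level(t, consumption, h0, alpha, beta):
    """h_t = h0*exp(-beta*t) + alpha * sum_{s=1}^{t-1} exp(-beta*(t-s)) * c_s.

    consumption[s] holds c_s; as in the formula, the sum starts at s = 1.
    """
    past = sum(math.exp(-beta * (t - s)) * consumption[s] for s in range(1, t))
    return h0 * math.exp(-beta * t) + alpha * past


def habit_rra(c, h, gamma):
    """Relative risk aversion gamma*c/(c - h) of power-linear habit utility."""
    if c <= h:
        raise ValueError("consumption must exceed the habit level")
    return gamma * c / (c - h)


if __name__ == "__main__":
    consumption = [1.0, 1.0, 1.2, 1.5, 1.5]  # c_0, ..., c_4 (illustrative)
    h0, alpha, beta, gamma = 0.5, 0.3, 0.4, 2.0
    for t in range(1, 5):
        h = habit_level(t, consumption, h0, alpha, beta)
        print(t, h, habit_rra(consumption[t], h, gamma))
```

For dates $t \geq 1$ the formula implies the one-step update $h_{t+1} = e^{-\beta}(h_t + \alpha c_t)$, which is often more convenient than recomputing the full sum.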
A related line of extension of the basic preferences is to allow the preferences of an individual
to depend on some external factors, i.e., factors that are not fully determined by choices made
by the individual. One example that has received some attention is where the utility which some
individual attaches to her consumption plan depends on the consumption plans of other individuals
or maybe the aggregate consumption in the economy. This is often referred to as "keeping up
with the Joneses." If you see your neighbors consume at high rates, you want to consume at
a high rate too. Utility is state-dependent. Models of this type are sometimes said to have an
external habit, whereas the habit formation discussed above is then referred to as internal habit.
If we denote the external factor by $X_t$, a time-additive life-time expected utility representation
is $E\left[ \sum_{t=0}^{T} e^{-\delta t} u(c_t, X_t) \right]$, and a tractable version is $u(c, X) = \frac{1}{1-\gamma} (c - X)^{1-\gamma}$, very similar to the
subsistence HARA or the specific habit formation utility given above. In this case, however, the
"subsistence" level is determined by external factors. Another tractable specification is
$u(c, X) = \frac{1}{1-\gamma} (c/X)^{1-\gamma}$.
The empirical evidence of habit formation preferences is mixed. The time variation in risk
aversion induced by habits as shown in (2.8) will generate variations in the Sharpe ratios of risky
assets over the business cycle, which are not explained in simple models with CRRA preferences
and appear to be present in the asset return data. Campbell and Cochrane (1999) construct a
model with a representative individual having power-linear external habit preferences in which
the equilibrium Sharpe ratio of the stock market varies counter-cyclically in line with empirical
observations. However, a counter-cyclical variation in the relative risk aversion of a representative
individual can also be obtained in a model where each individual has a constant relative risk
aversion, but the relative risk aversions are different across individuals, as explained, e.g., by Chan
and Kogan (2002). Various studies have investigated whether a data set of individual decisions
on consumption, purchases, or investments are consistent with habit formation in preferences. To
mention a few studies, Ravina (2007) reports strong support for habit formation, whereas Dynan
(2000), Gomes and Michaelides (2003), and Brunnermeier and Nagel (2008) find no evidence of
habit formation at the individual level.
2.7.3 Recursive utility
Another preference specication gaining popularity is the so-called recursive preferences or
Epstein-Zin preferences, suggested and discussed by, e.g., Kreps and Porteus (1978), Epstein and
Zin (1989, 1991), and Weil (1989). The original motivation of this representation of preferences is
that it allows individuals to have preferences for the timing of resolution of uncertainty, which is not
consistent with the standard multi-date expected utility theory and violates the set of behavioral
axioms.
In a discrete-time framework Epstein and Zin (1989, 1991) assumed that life-time utility from
time $t$ on is captured by a utility index $U_t$ (in this literature sometimes called the "felicity")
satisfying the recursive relation
\[ U_t = f(c_t, z_t), \]
where $z_t = \mathrm{CE}_t(U_{t+1})$ is the certainty equivalent of $U_{t+1}$ given information available at time $t$ and
$f$ is an aggregator of the form
\[ f(c, z) = \left( a c^{\alpha} + b z^{\alpha} \right)^{1/\alpha}. \]
The aggregator is identical to the two-good CES utility specification (2.6) and, since $z_t$ here refers
to future consumption or utility, $\psi = 1/(1 - \alpha)$ is called the intertemporal elasticity of substitution.
An investor's willingness to substitute risk between states is modeled through $z_t$ as the certainty
equivalent of a constant relative risk aversion utility function. Recall that the certainty equivalent
for an atemporal utility function $u$ is defined as
\[ \mathrm{CE} = u^{-1}\left( E[u(x)] \right). \]
In particular for CRRA utility $u(x) = x^{1-\gamma}/(1-\gamma)$ we obtain
\[ \mathrm{CE} = \left( E[x^{1-\gamma}] \right)^{\frac{1}{1-\gamma}}, \]
where $\gamma > 0$ is the relative risk aversion.
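To sum up, Epstein-Zin preferences are specified recursively as
\[ U_t = \left[ a c_t^{\alpha} + b \left( E_t[U_{t+1}^{1-\gamma}] \right)^{\frac{\alpha}{1-\gamma}} \right]^{1/\alpha}. \tag{2.9} \]
Using the fact that $\alpha = 1 - \frac{1}{\psi}$, we can rewrite $U_t$ as
\[ U_t = \left[ a c_t^{1 - \frac{1}{\psi}} + b \left( E_t[U_{t+1}^{1-\gamma}] \right)^{\frac{1 - \frac{1}{\psi}}{1-\gamma}} \right]^{\frac{1}{1 - \frac{1}{\psi}}}. \]
Introducing $\theta = (1 - \gamma)/(1 - \frac{1}{\psi})$, we have
\[ U_t = \left[ a c_t^{\frac{1-\gamma}{\theta}} + b \left( E_t[U_{t+1}^{1-\gamma}] \right)^{\frac{1}{\theta}} \right]^{\frac{\theta}{1-\gamma}}. \tag{2.10} \]
When the time horizon is finite, we need to specify the utility index $U_T$ at the terminal date. If
we allow for consumption at the terminal date and for a bequest motive, a specification like
\[ U_T = \left( a c_T^{\alpha} + \varepsilon a W_T^{\alpha} \right)^{1/\alpha} \tag{2.11} \]
assumes a CES-type weighting of consumption and bequest in the terminal utility with the same
CES-parameter $\alpha$ as above. The parameter $\varepsilon \geq 0$ can be seen as a measure of the relative
importance of bequest compared to consumption. Note that (2.11) involves no expectation as
terminal wealth is known at time $T$. Alternatively, we can think of $c_{T-1}$ as being the consumption
over the final period and specify the terminal utility index as
\[ U_T = \left( \varepsilon a W_T^{\alpha} \right)^{1/\alpha} = (\varepsilon a)^{1/\alpha} W_T. \tag{2.12} \]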
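Bansal (2007) and other authors assume that $a = 1 - b$, but the value of $a$ is in fact unimportant
as it does not affect optimal decisions and therefore no interpretation can be given to $a$. At least
this is true for an infinite time horizon and for a finite horizon when the terminal utility takes the
form (2.11) or (2.12). In order to see this, first note that we can rewrite (2.9) as
\[ U_t = a^{1/\alpha} \left[ c_t^{\alpha} + b a^{-1} \left( E_t\left[ U_{t+1}^{1-\gamma} \right] \right)^{\frac{\alpha}{1-\gamma}} \right]^{1/\alpha} = a^{1/\alpha} \left[ c_t^{\alpha} + b \left( E_t\left[ \left( a^{-1/\alpha} U_{t+1} \right)^{1-\gamma} \right] \right)^{\frac{\alpha}{1-\gamma}} \right]^{1/\alpha}, \]
which implies that
\[ a^{-1/\alpha} U_t = \left[ c_t^{\alpha} + b \left( E_t\left[ \left( a^{-1/\alpha} U_{t+1} \right)^{1-\gamma} \right] \right)^{\frac{\alpha}{1-\gamma}} \right]^{1/\alpha}. \]
This suggests that the utility index $\tilde{U}$ defined for any $t$ by $\tilde{U}_t = a^{-1/\alpha} U_t$ is equivalent to the utility
index $U$, since it is just a scaling, and it does not involve $a$. With a finite time horizon and terminal
utility given by (2.11), we see that
\[ \tilde{U}_T = a^{-1/\alpha} U_T = \left( c_T^{\alpha} + \varepsilon W_T^{\alpha} \right)^{1/\alpha}, \]
which also does not involve $a$. The same holds when terminal utility is specified as in (2.12). Without loss of
generality we can therefore let $a = 1$.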
Time-additive power utility is the special case of recursive utility where $\gamma = 1/\psi$. In order to
see this, first note that with $\gamma = 1/\psi$, we have $\alpha = 1 - \gamma$ and $\theta = 1$ and thus
\[ U_t = \left[ a c_t^{1-\gamma} + b E_t[U_{t+1}^{1-\gamma}] \right]^{\frac{1}{1-\gamma}} \]
or
\[ U_t^{1-\gamma} = a c_t^{1-\gamma} + b E_t[U_{t+1}^{1-\gamma}]. \]
If we start unwinding the recursions, we get
\[ U_t^{1-\gamma} = a c_t^{1-\gamma} + b E_t\left[ a c_{t+1}^{1-\gamma} + b E_{t+1}[U_{t+2}^{1-\gamma}] \right] = a E_t\left[ c_t^{1-\gamma} + b c_{t+1}^{1-\gamma} \right] + b^2 E_t\left[ U_{t+2}^{1-\gamma} \right]. \]
If we continue this way and the time horizon is infinite, we obtain
\[ U_t^{1-\gamma} = a \sum_{s=0}^{\infty} E_t\left[ b^s c_{t+s}^{1-\gamma} \right], \]
whereas with a finite time horizon and the terminal utility index (2.11), we obtain
\[ U_t^{1-\gamma} = a \left( \sum_{s=0}^{T-t} b^s E_t\left[ c_{t+s}^{1-\gamma} \right] + \varepsilon b^{T-t} E_t\left[ W_T^{1-\gamma} \right] \right). \]
In any case, observe that
\[ V_t = \frac{1}{a(1-\gamma)} U_t^{1-\gamma} \]
is an increasing function of $U_t$ and will therefore represent the same preferences as $U_t$. Moreover,
$V_t$ is clearly equivalent to time-additive expected utility. Note that $b$ plays the role of the subjective
discount factor which we often represent by $e^{-\delta}$.
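The recursion (2.9) and its time-additive special case can be illustrated on a small consumption tree. The sketch below makes several simplifying assumptions of its own that are not in the text: consumption growth takes one of two values with equal probability each period, there is no bequest (the terminal index is (2.11) with $\varepsilon = 0$), and both $\gamma \neq 1$ and $\psi \neq 1$.

```python
def ez_utility(t, c, T, a, b, gamma, psi, growth=(1.1, 0.95)):
    """Utility index U_t from recursion (2.9) on a two-state consumption tree."""
    alpha = 1.0 - 1.0 / psi                # CES parameter; requires psi != 1
    if t == T:
        return a ** (1.0 / alpha) * c      # terminal index (2.11) without bequest
    # Utility indices at t+1 in the two equally likely successor states.
    nxt = [ez_utility(t + 1, c * g, T, a, b, gamma, psi, growth) for g in growth]
    # CRRA certainty equivalent of U_{t+1}; requires gamma != 1.
    ce = (0.5 * sum(x ** (1.0 - gamma) for x in nxt)) ** (1.0 / (1.0 - gamma))
    return (a * c ** alpha + b * ce ** alpha) ** (1.0 / alpha)


def additive_power_sum(c0, T, a, b, gamma, growth=(1.1, 0.95)):
    """a * sum_{s=0}^{T} b^s E[c_s^(1-gamma)] for the same i.i.d. growth tree."""
    m = 0.5 * sum(g ** (1.0 - gamma) for g in growth)  # E[growth^(1-gamma)]
    return a * c0 ** (1.0 - gamma) * sum((b * m) ** s for s in range(T + 1))


if __name__ == "__main__":
    gamma = 2.0
    u0 = ez_utility(0, 1.0, 5, 1.0, 0.95, gamma, psi=1.0 / gamma)
    print(u0 ** (1.0 - gamma), additive_power_sum(1.0, 5, 1.0, 0.95, gamma))
    # Changing psi away from 1/gamma breaks the link to time-additive utility.
    print(ez_utility(0, 1.0, 5, 1.0, 0.95, gamma, psi=1.5))
```

With $\psi = 1/\gamma$ the two numbers on the first output line coincide up to rounding, as the unwinding argument above predicts; for other values of $\psi$ no such additive representation is available.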
The Epstein-Zin preferences are characterized by three parameters:[2] the relative risk aversion $\gamma$,
the elasticity of intertemporal substitution $\psi$, and the subjective discount factor $b = e^{-\delta}$. Relative
to the standard time-additive power utility, the Epstein-Zin specification allows the relative risk
aversion (attitudes towards atemporal risks) to be disentangled from the elasticity of intertemporal
substitution (attitudes towards shifts in consumption over time). Moreover, Epstein and Zin (1989)
show that when $\gamma > 1/\psi$, the individual will prefer early resolution of uncertainty. If $\gamma < 1/\psi$,
late resolution of uncertainty is preferred. For the standard utility case $\gamma = 1/\psi$, the individual
is indifferent about the timing of the resolution of uncertainty. Note that in the relevant case of
$\gamma > 1$, the auxiliary parameter $\theta$ will be negative if and only if $\psi > 1$. Empirical studies disagree
about reasonable values of $\psi$. Some studies find $\psi$ smaller than one (for example Campbell 1999),
other studies find $\psi$ greater than one (for example Vissing-Jørgensen and Attanasio 2003).
The continuous-time equivalent of recursive utility is called stochastic differential utility and
studied by, e.g., Duffie and Epstein (1992). The utility index $U_t$ associated at time $t$ with a given
consumption process $c$ over the remaining lifetime $[t, T]$ is recursively given by
\[ U_t = E_t\left[ \int_t^T f(c_s, U_s) \, ds \right] \]
where we assume a zero utility of terminal wealth, $U_T = 0$. Here $f$ is a so-called normalized
aggregator. A somewhat tractable version of $f$ is
\[ f(c, U) = \begin{cases}
\frac{\delta}{1 - 1/\psi} c^{1 - 1/\psi} \left( [1 - \gamma] U \right)^{1 - 1/\theta} - \delta \theta U, & \text{for } \psi \neq 1, \ \gamma \neq 1, \\
(1 - \gamma) \delta U \ln c - \delta U \ln\left( [1 - \gamma] U \right), & \text{for } \psi = 1, \ \gamma \neq 1, \\
\frac{\delta}{1 - 1/\psi} c^{1 - 1/\psi} e^{-(1 - 1/\psi) U} - \frac{\delta}{1 - 1/\psi}, & \text{for } \gamma = 1, \ \psi \neq 1, \\
\delta \ln c - \delta U, & \text{for } \gamma = \psi = 1,
\end{cases} \tag{2.13} \]
where $\theta = (1 - \gamma)/(1 - \frac{1}{\psi})$. This can be seen as the continuous-time version of the discrete-time
Epstein-Zin preferences in (2.10). Again, $\delta$ is a subjective time preference rate, $\gamma$ reflects the
degree of risk aversion towards atemporal bets, and $\psi > 0$ reflects the intertemporal elasticity of
substitution towards deterministic consumption plans. It is also possible to define a normalized
aggregator for $\gamma = 1$ and for $0 < \gamma < 1$ but we focus on the empirically more reasonable case
of $\gamma > 1$. As in the discrete-time framework, the special case where $\psi = 1/\gamma$ (so that $\theta = 1$)
corresponds to the classic time-additive power utility specification. Let us confirm that for the
case $\psi = 1/\gamma \neq 1$, where the first definition in (2.13) applies. In this case
\[ U_t = E_t\left[ \int_t^T \left( \frac{\delta}{1-\gamma} c_s^{1-\gamma} - \delta U_s \right) ds \right] = \delta E_t\left[ \int_t^T \frac{1}{1-\gamma} c_s^{1-\gamma} \, ds \right] - \delta E_t\left[ \int_t^T U_s \, ds \right]. \]
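This recursive relation is satisfied by
\[ U_t = \delta E_t\left[ \int_t^T e^{-\delta (s-t)} \frac{1}{1-\gamma} c_s^{1-\gamma} \, ds \right], \tag{2.14} \]
because then
\begin{align*}
\delta E_t\left[ \int_t^T U_s \, ds \right] &= \delta E_t\left[ \int_t^T \left( E_s\left[ \delta \int_s^T e^{-\delta (v-s)} \frac{1}{1-\gamma} c_v^{1-\gamma} \, dv \right] \right) ds \right] \\
&= \delta E_t\left[ \int_t^T \left( \int_t^v \delta e^{-\delta (v-s)} \, ds \right) \frac{1}{1-\gamma} c_v^{1-\gamma} \, dv \right] \\
&= \delta E_t\left[ \int_t^T \left( 1 - e^{-\delta (v-t)} \right) \frac{1}{1-\gamma} c_v^{1-\gamma} \, dv \right],
\end{align*}
where the second equality follows by changing the order of integration (together with the law of
iterated expectations), and consequently
\begin{align*}
\delta E_t\left[ \int_t^T \frac{1}{1-\gamma} c_s^{1-\gamma} \, ds \right] - \delta E_t\left[ \int_t^T U_s \, ds \right]
&= \delta E_t\left[ \int_t^T \frac{1}{1-\gamma} c_s^{1-\gamma} \, ds \right] - \delta E_t\left[ \int_t^T \left( 1 - e^{-\delta (s-t)} \right) \frac{1}{1-\gamma} c_s^{1-\gamma} \, ds \right] \\
&= \delta E_t\left[ \int_t^T e^{-\delta (s-t)} \frac{1}{1-\gamma} c_s^{1-\gamma} \, ds \right] = U_t.
\end{align*}
The utility index in (2.14) is a positive multiple of (and therefore equivalent to) the traditional
time-additive power utility specification.

[Footnote 2] With a finite time horizon and a bequest motive, there is really a fourth parameter, namely the relative weight
of bequest and consumption, as represented by the constant $\varepsilon$ in (2.11) or (2.12).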
Note that, in general, recursive preferences are not consistent with expected utility since $U_t$
depends non-linearly on the probabilities of future consumption levels.
2.7.4 Two-good, multi-period utility
For studying some problems it is useful or even necessary to distinguish between different
consumption goods. Until now we have implicitly assumed a single consumption good which is perishable
in the sense that it cannot be stored. However, individuals spend large amounts on durable
goods such as houses and cars. These goods provide utility to the individual beyond the period
of purchase and can potentially be resold at a later date so that they also act as an investment.
Another important good is leisure. Individuals have preferences both for consumption of physical
goods and for leisure. A tractable two-good utility function is the Cobb-Douglas function:
\[ u(c_1, c_2) = \frac{1}{1-\gamma} \left( c_1^{\beta} c_2^{1-\beta} \right)^{1-\gamma}, \]
where $\beta \in [0, 1]$ determines the relative weighting of the two goods.
2.8 Exercises
Exercise 2.1. Give a proof of Theorem 2.3.
Exercise 2.2 (Adapted from Problem 3.3 in Kreps (1990).). Consider the following two probability
distributions of consumption. $\pi_1$ gives 5, 15, and 30 (dollars) with probabilities 1/3, 5/9,
and 1/9, respectively. $\pi_2$ gives 10 and 20 with probabilities 2/3 and 1/3, respectively.
(a) Show that we can think of $\pi_1$ as a two-step gamble, where the first gamble is identical to
$\pi_2$. If the outcome of the first gamble is 10, then the second gamble gives you an additional 5
(total 15) with probability 1/2 and an additional $-5$ (total 5) also with probability 1/2. If the
outcome of the first gamble is 20, then the second gamble gives you an additional 10 (total 30)
with probability 1/3 and an additional $-5$ (total 15) with probability 2/3.
(b) Observe that the second gamble has mean zero and that $\pi_1$ is equal to $\pi_2$ plus mean-zero
noise. Conclude that any risk-averse expected utility maximizer will prefer $\pi_2$ to $\pi_1$.
Exercise 2.3 (Adapted from Chapter 3 in Kreps (1990).). Imagine a greedy, risk-averse, expected
utility maximizing consumer whose end-of-period income level is subject to some uncertainty.
The income will be $Y$ with probability $p$ and $Y' < Y$ with probability $1 - p$. Think of
$\Delta = Y - Y'$ as some loss the consumer might incur due to an accident. An insurance company is
willing to insure against this loss by paying $\Delta$ to the consumer if she sustains the loss. In return,
the company wants an upfront premium of $\delta$. The consumer may choose partial coverage in the
sense that if she pays a premium of $a\delta$, she will receive $a\Delta$ if she sustains the loss. Let $u$ denote
the von Neumann-Morgenstern utility function of the consumer. Assume for simplicity that the
premium is paid at the end of the period.
(a) Show that the first order condition for the choice of $a$ is
\[ p \, \delta \, u'(Y - a\delta) = (1 - p)(\Delta - \delta) \, u'\left( Y - (1 - a)\Delta - a\delta \right). \]
(b) Show that if the insurance is actuarially fair in the sense that the expected payout $(1 - p)\Delta$
equals the premium $\delta$, then the consumer will purchase full insurance, i.e., $a = 1$ is optimal.
(c) Show that if the insurance is actuarially unfair, meaning $(1 - p)\Delta < \delta$, then the consumer
will purchase partial insurance, i.e., the optimal $a$ is less than 1.
Exercise 2.4. Consider a one-period choice problem with four equally likely states of the world
at the end of the period. The consumer maximizes expected utility of end-of-period wealth. The
current wealth must be invested in a single financial asset today. The consumer has three assets
to choose from. All three assets have a current price equal to the current wealth of the consumer.
The assets have the following end-of-period values:

state          1      2      3      4
probability    0.25   0.25   0.25   0.25
asset 1        100    100    100    100
asset 2        81     100    100    144
asset 3        36     100    100    225

(a) What asset would a risk-neutral individual choose?
(b) What asset would a power utility investor, $u(W) = \frac{1}{1-\gamma} W^{1-\gamma}$, choose if $\gamma = 0.5$? If $\gamma = 2$?
If $\gamma = 5$?
Now assume a power utility with $\gamma = 0.5$.
(c) Suppose the individual could obtain a perfect signal about the future state before she makes
her asset choice. There are thus four possible signals, which we can represent by $s_1 = \{1\}$, $s_2 = \{2\}$,
$s_3 = \{3\}$, and $s_4 = \{4\}$. What is the optimal asset choice for each signal? What is her expected
utility before she receives the signal, assuming that the signals have equal probability?
(d) Now suppose that the individual can receive a less-than-perfect signal telling her whether
the state is in $s_1 = \{1, 4\}$ or in $s_2 = \{2, 3\}$. The two possible signals are equally likely. What is
the expected utility of the investor before she receives the signal?
Exercise 2.5. Consider an individual with log utility, $u(c) = \ln c$. What is her certainty equivalent and risk premium for the consumption plan which with probability 0.5 gives her $(1-\alpha)\bar c$ and with probability 0.5 gives her $(1+\alpha)\bar c$? Confirm that your results are consistent with the numbers for $\gamma = 1$ shown in Table 2.5.
Exercise 2.6. Use Equation (2.3) to compute approximate relative risk premia for the consumption gamble underlying Table 2.5 and compare with the exact numbers given in the table.
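Exercise 2.7. Consider an atemporal setting in which an individual has a utility function $u$ of consumption. His current consumption is $c$. As always, the absolute risk aversion is $\mathrm{ARA}(c) = -u''(c)/u'(c)$ and the relative risk aversion is $\mathrm{RRA}(c) = -c\,u''(c)/u'(c)$.
Let $\varepsilon \in [0, c]$ and consider an additive gamble where the individual will end up with a consumption of either $c + \varepsilon$ or $c - \varepsilon$. Define the additive indifference probability $\pi(c, \varepsilon)$ for this gamble by
$$u(c) = \Bigl(\tfrac12 + \pi(c,\varepsilon)\Bigr) u(c + \varepsilon) + \Bigl(\tfrac12 - \pi(c,\varepsilon)\Bigr) u(c - \varepsilon). \tag{1}$$
Assume that $\pi(c, \varepsilon)$ is twice differentiable in $\varepsilon$.
(a) Argue that $\pi(c, \varepsilon) \ge 0$ if the individual is risk-averse.
(b) Show that the absolute risk aversion is related to the additive indifference probability by the following relation
$$\mathrm{ARA}(c) = 4 \lim_{\varepsilon \to 0} \frac{\partial \pi(c,\varepsilon)}{\partial \varepsilon} \tag{2}$$
and interpret this result. Hint: Differentiate twice with respect to $\varepsilon$ in (1) and let $\varepsilon \to 0$.
Now consider a multiplicative gamble where the individual will end up with a consumption of either $(1+\varepsilon)c$ or $(1-\varepsilon)c$, where $\varepsilon \in [0, 1]$. Define the multiplicative indifference probability $\Pi(c, \varepsilon)$ for this gamble by
$$u(c) = \Bigl(\tfrac12 + \Pi(c,\varepsilon)\Bigr) u\bigl((1+\varepsilon)c\bigr) + \Bigl(\tfrac12 - \Pi(c,\varepsilon)\Bigr) u\bigl((1-\varepsilon)c\bigr). \tag{3}$$
Assume that $\Pi(c, \varepsilon)$ is twice differentiable in $\varepsilon$.
(c) Derive a relation between the relative risk aversion $\mathrm{RRA}(c)$ and $\lim_{\varepsilon \to 0} \frac{\partial \Pi(c,\varepsilon)}{\partial \varepsilon}$ and interpret the result.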
CHAPTER 3
One-period models
3.1 Introduction
TO COME...
3.2 The general one-period model
We are given $d$ risky assets with (stochastic) rates of return $R = (R_1, \dots, R_d)^\top$ and a risk-free asset with a (certain) rate of return $r$ over the period of interest. Consider an investor having an initial wealth $W_0$ and no income from non-financial sources. If the investor invests amounts $\theta = (\theta_1, \dots, \theta_d)^\top$ in the risky assets and the remainder $\theta_0 = W_0 - \theta^\top \mathbf{1}$ in the risk-free asset, he will end up with wealth
$$W = W_0 + \theta^\top R + \theta_0 r = (1+r)W_0 + \theta^\top (R - r\mathbf{1})$$
at the end of the period. Letting $\pi_i = \theta_i / W_0$ denote the fraction of wealth invested in the $i$th asset, we can rewrite the terminal wealth as
$$W = W_0 \bigl[1 + r + \pi^\top (R - r\mathbf{1})\bigr],$$
where $\pi = (\pi_1, \dots, \pi_d)^\top$.
We assume that preferences can be represented by expected utility of end-of-period consumption or wealth so the decision problem is to choose $\theta$ or, equivalently, $\pi$ to maximize $E[u(W)]$, where $u$ is a utility function. We will assume throughout the chapter that $u$ is increasing and concave and is sufficiently smooth for all the relevant derivatives to exist. Note that we ignore any consumption decision at the beginning of the planning period, i.e., we assume that the consumption decision has already been taken independently of the investment decision.
The first-order condition for the problem
$$\sup_{\theta \in \mathbb{R}^d} E\bigl[u\bigl((1+r)W_0 + \theta^\top (R - r\mathbf{1})\bigr)\bigr]$$
is
$$E\bigl[u'\bigl((1+r)W_0 + \theta^\top (R - r\mathbf{1})\bigr)\,(R - r\mathbf{1})\bigr] = 0. \tag{3.1}$$
The second-order condition for a maximum will be satisfied since we assume that $u$ is concave. Hence, the first-order condition alone will characterize the optimal investment.
Without further assumptions, Arrow (1971), Pratt (1964), and others have shown a number of interesting results on the optimal portfolio choice. We will state only a few and refer to Merton (1992, Ch. 2) for further properties of the general solution to this utility maximization problem.
3.2.1 One risky asset
First we will specialize to the case with a single risky asset so that the first-order condition simplifies to
$$E\Bigl[u'\bigl(\underbrace{(1+r)W_0 + \theta(R - r)}_{W}\bigr)\,(R - r)\Bigr] = 0. \tag{3.2}$$
Assuming a single risky asset may seem very restrictive, but we will later see that under some conditions, all individuals will optimally combine the risk-free asset and a single portfolio of the available risky assets. In the results below, the only risky asset can thus be interpreted as that portfolio.
The first result concerns the sign of the optimal investment in the risky asset:
Theorem 3.1. Assume a single risky asset and a strictly increasing and concave utility function $u$. The optimal risky investment $\theta$ is positive/zero/negative if and only if the excess expected return $E[R] - r$ is positive/zero/negative.
Proof. Define $f(\theta) = E[u'((1+r)W_0 + \theta(R - r))\,(R - r)]$. The first-order condition (3.2) for $\theta$ is $f(\theta) = 0$. Note that $f'(\theta) = E[u''((1+r)W_0 + \theta(R - r))\,(R - r)^2]$, which is negative since $u'' < 0$. Hence, $f(\theta)$ is decreasing in $\theta$. Also note that $f(0) = E[u'((1+r)W_0)\,(R - r)] = u'((1+r)W_0)\,(E[R] - r)$. Since $u' > 0$, we have $f(0) > 0$ if and only if $E[R] > r$. For $E[R] > r$, the equation $f(\theta) = 0$ is therefore satisfied for a $\theta > 0$. The cases $E[R] = r$ and $E[R] < r$ follow in the same way from $f(0) = 0$ and $f(0) < 0$, respectively.
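As a numerical illustration of Theorem 3.1, the following sketch solves the first-order condition (3.2) by bisection for a risky asset with two equally likely return outcomes. The negative exponential utility with $a = 0.05$, the return scenarios, the bracket $[-1000, 1000]$ and the function names are illustrative assumptions and not part of the text.

```python
import math

def foc(theta, returns, probs, r, w0, marginal_utility):
    """f(theta) = E[u'((1+r)W0 + theta(R-r)) (R-r)] for a discrete return distribution."""
    return sum(p * marginal_utility((1 + r) * w0 + theta * (R - r)) * (R - r)
               for R, p in zip(returns, probs))

def solve_theta(returns, probs, r, w0, marginal_utility, lo=-1000.0, hi=1000.0):
    """Bisection on [lo, hi]; it works because f is decreasing in theta when u'' < 0.
    The bracket is assumed to contain the root."""
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if foc(mid, returns, probs, r, w0, marginal_utility) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

a = 0.05                                       # assumed absolute risk aversion
marginal_cara = lambda w: a * math.exp(-a * w)  # u'(W) for u(W) = -exp(-aW)
r, w0, probs = 0.02, 100.0, [0.5, 0.5]

theta_pos = solve_theta([0.30, -0.10], probs, r, w0, marginal_cara)  # E[R] = 0.10 > r
theta_neg = solve_theta([0.10, -0.10], probs, r, w0, marginal_cara)  # E[R] = 0.00 < r
print(theta_pos, theta_neg)
```

In this two-state setting the root can also be found by hand (for the first case $e^{0.4 a \theta} = 7/3$), which gives a simple check on the numerical solution: the position is long when $E[R] > r$ and short when $E[R] < r$.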
The next result describes how the optimal investment in the risky asset varies with initial wealth:
Theorem 3.2. Assume a single risky asset with $E[R] > r$ and assume a strictly increasing and concave utility function $u$. The optimal risky investment $\theta = \theta(W_0)$ has the following properties:
(i) If $\mathrm{ARA}(\cdot)$ is uniformly decreasing (respectively increasing; constant), then $\theta$ is increasing (respectively decreasing; constant) in $W_0$.
(ii) If $\mathrm{RRA}(\cdot)$ is uniformly decreasing (respectively increasing; constant), then $\pi = \theta/W_0$ is increasing (respectively decreasing; constant) in $W_0$.
Proof. (i) Suppose that ARA is decreasing; the other cases can be handled similarly. By the assumption $E[R] > r$ and Theorem 3.1, we have $\theta > 0$. For states in which the realized return on the risky asset exceeds the risk-free return, we will therefore have that end-of-period wealth satisfies $W > (1+r)W_0$. With decreasing ARA, this implies that $\mathrm{ARA}(W) \le \mathrm{ARA}((1+r)W_0)$ or, equivalently,
$$u''(W) \ge -\mathrm{ARA}((1+r)W_0)\, u'(W).$$
Multiplying by $R - r > 0$ gives
$$u''(W)(R - r) \ge -\mathrm{ARA}((1+r)W_0)\, u'(W)(R - r). \tag{3.3}$$
For states in which the realized return on the risky asset is smaller than the risk-free return, we obtain
$$u''(W) \le -\mathrm{ARA}((1+r)W_0)\, u'(W),$$
and multiplying by $R - r < 0$, we have to reverse the inequality, so that we again obtain (3.3), which is therefore true for all realized returns. Taking expectations, we have
$$E[u''(W)(R - r)] \ge -\mathrm{ARA}((1+r)W_0)\, E[u'(W)(R - r)] = 0, \tag{3.4}$$
due to the first-order condition (3.2).
Now, differentiating the first-order condition with respect to $W_0$ gives
$$E\Bigl[u''(W)(R - r)\Bigl(1 + r + \frac{\partial \theta}{\partial W_0}(R - r)\Bigr)\Bigr] = 0,$$
which implies that
$$\frac{\partial \theta}{\partial W_0} = \frac{(1+r)\, E[u''(W)(R - r)]}{-E[u''(W)(R - r)^2]}. \tag{3.5}$$
The denominator is strictly positive since $u'' < 0$ and the numerator is non-negative due to (3.4). Hence $\partial\theta/\partial W_0 \ge 0$.
(ii) Rewrite the first-order condition as
$$E\Bigl[u'\Bigl((1+r)W_0 + W_0 \frac{\theta}{W_0}(R - r)\Bigr)(R - r)\Bigr] = 0.$$
Then the proof of the result is similar to the proof of (i) with the relative risk aversion replacing the absolute risk aversion. The details are left for the reader (see Exercise 3.1).
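The constant cases of Theorem 3.2 are easy to check numerically. The sketch below solves the first-order condition (3.2) for two wealth levels, once with constant relative risk aversion (power utility) and once with constant absolute risk aversion (negative exponential utility). The two-state return distribution, the parameter values $\gamma = 2$ and $a = 0.05$, and the bisection bracket are assumptions made for illustration only.

```python
import math

def solve_theta(returns, probs, r, w0, marginal_utility, lo, hi):
    """Bisection for E[u'(W)(R-r)] = 0 with W = (1+r)W0 + theta(R-r)."""
    def f(theta):
        return sum(p * marginal_utility((1 + r) * w0 + theta * (R - r)) * (R - r)
                   for R, p in zip(returns, probs))
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        lo, hi = (mid, hi) if f(mid) > 0 else (lo, mid)
    return 0.5 * (lo + hi)

returns, probs, r = [0.30, -0.10], [0.5, 0.5], 0.02
gamma, a = 2.0, 0.05                     # assumed risk-aversion parameters
crra = lambda w: w ** (-gamma)           # u'(W) with constant RRA = gamma
cara = lambda w: a * math.exp(-a * w)    # u'(W) with constant ARA = a

results = {}
for w0 in (100.0, 1000.0):
    # The bracket [0, 5*W0] keeps terminal wealth positive in both states.
    th_crra = solve_theta(returns, probs, r, w0, crra, 0.0, 5.0 * w0)
    th_cara = solve_theta(returns, probs, r, w0, cara, 0.0, 5.0 * w0)
    results[w0] = (th_crra / w0, th_cara)
    print(f"W0 = {w0:7.1f}: CRRA fraction = {th_crra / w0:.4f}, CARA amount = {th_cara:.4f}")
```

Under these assumptions the fraction $\theta/W_0$ is the same at both wealth levels for the power utility investor, while the amount $\theta$ is the same for the negative exponential investor.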
The following results provide insights about how the optimal investments depend on returns. Differentiating the first-order condition (3.2) with respect to the risk-free rate $r$, we get
$$E\Bigl[u''(W)\Bigl(W_0 - \theta + \frac{\partial\theta}{\partial r}(R - r)\Bigr)(R - r) - u'(W)\Bigr] = 0,$$
which implies that
$$\frac{\partial\theta}{\partial r} = \frac{E[u'(W)]}{E[u''(W)(R - r)^2]} - (W_0 - \theta)\,\frac{E[u''(W)(R - r)]}{E[u''(W)(R - r)^2]}. \tag{3.6}$$
Applying (3.5), we arrive at
$$\frac{\partial\theta}{\partial r} = \frac{E[u'(W)]}{E[u''(W)(R - r)^2]} + \frac{W_0 - \theta}{1 + r}\,\frac{\partial\theta}{\partial W_0}.$$
The first term on the right-hand side can be interpreted as the substitution effect and is strictly negative. If the risk-free rate increases, the risk-free asset is more attractive, and the individual will invest more in the risk-free asset and less in the risky asset. The second term on the right-hand side is the income effect. Note that $W_0 - \theta$ is the investment in the risk-free asset. Assuming this is positive, an increase in the risk-free rate will make the individual wealthier. For a unit increase in the risk-free rate, the end-of-period wealth will increase by exactly $W_0 - \theta$, and the present value of that is $(W_0 - \theta)/(1 + r)$. This increase in present wealth is multiplied by the derivative $\partial\theta/\partial W_0$ to get the impact on the optimal risky investment. The income effect can be positive or negative.
If the income effect is negative, then the sum of the substitution and the income effects is clearly negative so that $\partial\theta/\partial r < 0$. This will be the case if $W_0 > \theta$ and $\partial\theta/\partial W_0 < 0$. The latter condition is satisfied when the absolute risk aversion is increasing in wealth, cf. Theorem 3.2, but this is an unrealistic assumption on preferences. A more interesting result is the following:
Theorem 3.3. Assume a single risky asset with limited liability so that the return satisfies $R \ge -1$. Assume a strictly increasing and concave utility function $u$ such that the relative risk aversion satisfies $\mathrm{RRA}(W) \le 1$ for all $W$. Then the optimal risky investment is strictly decreasing in the risk-free rate.
Proof. First note that we can rewrite (3.6) as
$$\begin{aligned}
\frac{\partial\theta}{\partial r} &= \frac{E[u'(W) - (W_0 - \theta)\,u''(W)(R - r)]}{E[u''(W)(R - r)^2]} \\
&= \frac{E[u'(W)\,(1 + \mathrm{ARA}(W)(W_0 - \theta)(R - r))]}{E[u''(W)(R - r)^2]} \\
&= \frac{E[u'(W)\,(1 - \mathrm{RRA}(W) + \mathrm{ARA}(W)\,W_0(1 + R))]}{E[u''(W)(R - r)^2]},
\end{aligned}$$
where the last equality uses $\theta(R - r) = W - (1+r)W_0$. The denominator is negative. Under the assumptions of the theorem, the numerator is surely non-negative. Hence $\partial\theta/\partial r \le 0$.
Under the assumptions of the theorem, the income effect is positive but it is dominated by the negative substitution effect. Note however that the relative risk aversion is generally believed to exceed 1. From the proof, we can see that if the relative risk aversion is sufficiently higher than 1, we will typically end up with the opposite conclusion, i.e., $\partial\theta/\partial r \ge 0$.
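A small worked case of Theorem 3.3 is log utility, which has $\mathrm{RRA}(W) = 1$. With a two-state risky return and excess returns $x = R - r$, the first-order condition $E[x/(1 + r + \pi x)] = 0$ can be solved by hand for the fraction $\pi = \theta/W_0$, giving $\pi = -(1+r)E[x]/(x_{\text{up}} x_{\text{down}})$. The sketch below evaluates this for a few risk-free rates; the return scenarios and the function name are assumptions chosen for illustration.

```python
def log_utility_fraction(r, r_up, r_down, p_up):
    """Optimal risky fraction theta/W0 for u(W) = ln W with a two-state return.

    With x = R - r, the first-order condition E[x / (1 + r + pi*x)] = 0 gives
    pi = -(1 + r) * E[x] / (x_up * x_down).
    """
    x_up, x_down = r_up - r, r_down - r
    mean_excess = p_up * x_up + (1 - p_up) * x_down
    return -(1 + r) * mean_excess / (x_up * x_down)

rates = [0.00, 0.02, 0.04, 0.06]
fractions = [log_utility_fraction(r, 0.30, -0.10, 0.5) for r in rates]
for r, pi in zip(rates, fractions):
    print(f"r = {r:.2f}: theta/W0 = {pi:.4f}")
```

In this example the return satisfies $R \ge -1$ and the optimal risky position falls as $r$ rises, in line with the theorem.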
How does the optimal investment depend on the expected return on the risky asset? Decompose the risky return as $R = \mu + \varepsilon$, where $\mu = E[R]$ so that $\varepsilon$ is the unexpected return. The first-order condition can then be rewritten as
$$E\Bigl[u'\bigl(\underbrace{(1+r)W_0 + \theta(\mu + \varepsilon - r)}_{W}\bigr)\,(\mu + \varepsilon - r)\Bigr] = 0.$$
If we differentiate with respect to $\mu$ and use (3.5), we find
$$\frac{\partial\theta}{\partial\mu} = -\frac{E[u'(W)]}{E[u''(W)(R - r)^2]} + \frac{\theta}{1 + r}\,\frac{\partial\theta}{\partial W_0}.$$
The first term on the right-hand side (the substitution effect) is positive for $u$ increasing and concave. The second term on the right-hand side (the income effect) will be positive if $\theta \ge 0$ and $\partial\theta/\partial W_0 \ge 0$, which is true if $\mu \ge r$ and the absolute risk aversion is decreasing in wealth, as we expect it to be. We summarize the conclusion as follows:
Theorem 3.4. Assume a single risky asset with $E[R] \ge r$. Assume that the utility function is strictly increasing and concave and exhibits a decreasing absolute risk aversion, $\mathrm{ARA}'(W) \le 0$. Then the optimal risky investment is increasing in the expected return on the risky asset.
3.2.2 Multiple risky assets
Now we return to the case with multiple risky assets. First we state a very intuitive result for general utility functions.
Theorem 3.5. An individual with strictly increasing and concave $u$ will undertake risky investments if and only if $E[R_j] > r$ for some $j \in \{1, \dots, d\}$.
Proof. Define $f(\theta) = E[u'((1+r)W_0 + \theta^\top (R - r\mathbf{1}))\,(R - r\mathbf{1})]$. As in the proof of Theorem 3.1, it can be shown that $f$ is decreasing in each $\theta_j$. If, and only if, the optimal portfolio has $\theta_j \le 0$ for all $j = 1, \dots, d$, then
$$E[u'((1+r)W_0)\,(R_j - r)] \le 0, \quad j = 1, \dots, d,$$
or, equivalently,
$$u'((1+r)W_0)\, E[R_j - r] \le 0, \quad j = 1, \dots, d.$$
Since $u'(\cdot) > 0$, this condition holds exactly when $E[R_j] \le r$ for all $j = 1, \dots, d$.
The optimal portfolio will contain a positive position in some risky asset $i$ as long as at least one of the risky assets, say asset $j$, has an expected return exceeding the risk-free rate. But, with multiple risky assets, you cannot be sure that $i = j$; that will depend on the correlation between the risky assets.
For the special case of HARA utility where the absolute risk aversion is of the form
$$\mathrm{ARA}(z) = -\frac{u''(z)}{u'(z)} = \frac{1}{\alpha z + \beta}$$
we can say more about the optimal investments. Recall from Section 2.6 that, ignoring unimportant constants, marginal utility is given either by
$$u'(z) = (\alpha z + \beta)^{-1/\alpha} \tag{3.7}$$
or by
$$u'(z) = a e^{-az} \tag{3.8}$$
where $a = 1/\beta$ and the parameter $\alpha$ in the absolute risk aversion is zero.
Theorem 3.6. For an investor with HARA utility, the amount optimally invested in each risky asset is affine in wealth, i.e.,
$$\theta^*(W_0) = \bigl(\alpha(1+r)W_0 + \beta\bigr)\, k \tag{3.9}$$
for some vector $k = (k_1, \dots, k_d)^\top$ independent of wealth and of the parameter $\beta$.
Note that the amount optimally invested in the risk-free asset is then also affine in wealth since
$$\theta_0^*(W_0) = W_0 - (\theta^*(W_0))^\top \mathbf{1} = \bigl(1 - \alpha(1+r)\,k^\top\mathbf{1}\bigr) W_0 - \beta\, k^\top\mathbf{1}.$$
We give a proof of the theorem for the case (3.7) and leave the case with negative exponential utility for the reader as Exercise 3.2.
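Proof. With marginal utility given by (3.7), the first-order condition (3.1) becomes
$$E\Bigl[\bigl(\alpha(1+r)W_0 + \beta + \alpha\theta^\top (R - r\mathbf{1})\bigr)^{-1/\alpha} (R - r\mathbf{1})\Bigr] = 0. \tag{3.10}$$
Fix some initial wealth $\bar W_0$. Then the corresponding optimal portfolio $\theta^*(\bar W_0)$ satisfies
$$E\Bigl[\bigl(\alpha(1+r)\bar W_0 + \beta + \alpha\,\theta^*(\bar W_0)^\top (R - r\mathbf{1})\bigr)^{-1/\alpha} (R - r\mathbf{1})\Bigr] = 0.$$
If we divide through by $(\alpha(1+r)\bar W_0 + \beta)^{-1/\alpha}$, we get
$$E\Bigl[\Bigl(1 + \frac{\alpha}{\alpha(1+r)\bar W_0 + \beta}\,\theta^*(\bar W_0)^\top (R - r\mathbf{1})\Bigr)^{-1/\alpha} (R - r\mathbf{1})\Bigr] = 0. \tag{3.11}$$
Next, we multiply through by $(\alpha(1+r)W_0 + \beta)^{-1/\alpha}$ and arrive at
$$E\Bigl[\Bigl(\alpha(1+r)W_0 + \beta + \alpha\,\frac{\alpha(1+r)W_0 + \beta}{\alpha(1+r)\bar W_0 + \beta}\,\theta^*(\bar W_0)^\top (R - r\mathbf{1})\Bigr)^{-1/\alpha} (R - r\mathbf{1})\Bigr] = 0.$$
Comparing this with (3.10), we see that the optimal portfolio with initial wealth $W_0$ is
$$\theta^*(W_0) = \frac{\alpha(1+r)W_0 + \beta}{\alpha(1+r)\bar W_0 + \beta}\,\theta^*(\bar W_0)$$
so that (3.9) is satisfied with $k = \theta^*(\bar W_0)/[\alpha(1+r)\bar W_0 + \beta]$. If we substitute $\theta^*(\bar W_0) = k[\alpha(1+r)\bar W_0 + \beta]$ into (3.11), we get that the vector $k$ satisfies
$$E\Bigl[\bigl(1 + \alpha k^\top (R - r\mathbf{1})\bigr)^{-1/\alpha} (R - r\mathbf{1})\Bigr] = 0$$
so that it cannot depend on $\beta$.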
3.2.3 Examples with explicit solutions
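For the special case of quadratic utility,
$$u(z) = -(\bar z - z)^2, \qquad u'(z) = 2(\bar z - z),$$
the first-order condition is
$$E\bigl[\bigl(\bar z - (1+r)W_0 - \theta^\top (R - r\mathbf{1})\bigr)(R - r\mathbf{1})\bigr] = 0,$$
which implies that
$$\bigl(\bar z - (1+r)W_0\bigr)\,(E[R] - r\mathbf{1}) - E\bigl[(R - r\mathbf{1})(R - r\mathbf{1})^\top\bigr]\theta = 0.$$
We then get the explicit solution
$$\theta = \bigl(\bar z - (1+r)W_0\bigr)\,\bigl(E\bigl[(R - r\mathbf{1})(R - r\mathbf{1})^\top\bigr]\bigr)^{-1} (E[R] - r\mathbf{1}),$$
which is (3.9) with $\alpha = -1$, $\beta = \bar z$, and $k = \bigl(E[(R - r\mathbf{1})(R - r\mathbf{1})^\top]\bigr)^{-1}(E[R] - r\mathbf{1})$.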
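The explicit quadratic-utility solution can be verified on a small discrete example. The sketch below uses two risky assets and four equally likely return scenarios, computes $\theta$ from the formula above, and then checks that the first-order condition $E[(\bar z - W)(R - r\mathbf{1})] = 0$ holds. The scenarios, the values of $r$, $W_0$ and $\bar z$, and the helper `solve_2x2` are assumptions made for illustration; the example does not check that wealth stays below $\bar z$, where quadratic utility is increasing.

```python
def solve_2x2(m, v):
    """Solve the 2x2 linear system m x = v by Cramer's rule."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [(v[0] * m[1][1] - m[0][1] * v[1]) / det,
            (m[0][0] * v[1] - m[1][0] * v[0]) / det]

# Assumed inputs: four equally likely scenarios for the two risky returns.
scenarios = [(0.25, 0.10), (0.05, 0.12), (-0.10, 0.02), (0.08, -0.04)]
prob, r, w0, z_bar = 0.25, 0.02, 100.0, 150.0

excess = [(R1 - r, R2 - r) for R1, R2 in scenarios]
mean_excess = [sum(prob * x[i] for x in excess) for i in range(2)]
second_moment = [[sum(prob * x[i] * x[j] for x in excess) for j in range(2)]
                 for i in range(2)]

# theta = (z_bar - (1+r)W0) * E[(R-r1)(R-r1)^T]^{-1} (E[R] - r1)
k = solve_2x2(second_moment, mean_excess)
theta = [(z_bar - (1 + r) * w0) * k_i for k_i in k]

# First-order condition E[(z_bar - W)(R - r1)], which should vanish at the optimum.
foc = [sum(prob * (z_bar - (1 + r) * w0 - theta[0] * x[0] - theta[1] * x[1]) * x[i]
           for x in excess) for i in range(2)]
print(theta, foc)
```

Because $k$ does not involve $W_0$, rerunning the calculation with another wealth level only rescales $\theta$ by the factor $\bar z - (1+r)W_0$, which is the affine dependence stated in Theorem 3.6.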
Under the assumption that the returns on the risky assets are normally distributed, we can also derive an explicit expression for the optimal portfolio for the special case of negative exponential utility, $u(W) = -e^{-aW}$. If $R \sim N(\mu, \Sigma)$ where $\mu$ is a $d$-dimensional vector of the expected rates of return and $\Sigma$ is the $d \times d$ variance-covariance matrix of these rates of return, then the end-of-period wealth for any given portfolio $\theta$ is also normally distributed, $W \sim N(\mu_W, \sigma_W^2)$, with mean and variance given by
$$\mu_W = W_0(1 + r) + \theta^\top(\mu - r\mathbf{1}), \qquad \sigma_W^2 = \theta^\top \Sigma\, \theta.$$
Therefore,
$$E[u(W)] = E\bigl[-e^{-aW}\bigr] = -e^{-a\mu_W + \frac12 a^2 \sigma_W^2}.$$
The function $x \mapsto -e^{-ax}$ is an increasing function so the portfolio that maximizes expected utility will also maximize
$$\mu_W - \frac{a}{2}\sigma_W^2 = W_0(1 + r) + \theta^\top(\mu - r\mathbf{1}) - \frac{a}{2}\,\theta^\top \Sigma\, \theta.$$
This is achieved by the portfolio
$$\theta = \frac{1}{a}\,\Sigma^{-1}(\mu - r\mathbf{1}),$$
which is independent of wealth. This is consistent with Theorem 3.6 since $\alpha = 0$ for negative exponential utility. With normally distributed returns and constant absolute risk aversion, the amount optimally invested in each risky asset is independent of wealth.
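The closed-form solution for negative exponential utility with normal returns is simple to evaluate. The sketch below computes $\theta = \frac{1}{a}\Sigma^{-1}(\mu - r\mathbf{1})$ for two risky assets and checks that nearby portfolios give a lower value of $\mu_W - \frac{a}{2}\sigma_W^2$. All numbers ($a$, $r$, $\mu$, $\Sigma$, $W_0$) and the helper names are assumed for illustration.

```python
def solve_2x2(m, v):
    """Solve the 2x2 linear system m x = v by Cramer's rule."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [(v[0] * m[1][1] - m[0][1] * v[1]) / det,
            (m[0][0] * v[1] - m[1][0] * v[0]) / det]

a = 0.04                          # assumed absolute risk aversion
r = 0.02                          # assumed risk-free rate
mu = [0.08, 0.05]                 # assumed expected returns
sigma = [[0.04, 0.006],
         [0.006, 0.01]]           # assumed variance-covariance matrix

excess = [m - r for m in mu]
theta = [x / a for x in solve_2x2(sigma, excess)]   # (1/a) Sigma^{-1} (mu - r 1)

def objective(th, w0=100.0):
    """mu_W - (a/2) sigma_W^2, the quantity maximized by the optimal portfolio."""
    mean = w0 * (1 + r) + th[0] * excess[0] + th[1] * excess[1]
    var = sum(th[i] * sigma[i][j] * th[j] for i in range(2) for j in range(2))
    return mean - 0.5 * a * var

print(theta, objective(theta))
```

The wealth level enters the objective only through the additive term $W_0(1+r)$, so changing `w0` shifts the objective but leaves the maximizing amounts unchanged.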
3.3 Mean-variance analysis
Mean-variance analysis was introduced by Markowitz (1952, 1959). Mean-variance analysis assumes that the portfolio choice of investors will depend only on the mean and variance of their end-of-period wealth and hence on the means and variances of the portfolios investors can form. A portfolio is said to be mean-variance efficient if it has the lowest return variance for a given expected return. The mean-variance efficient portfolios can thus be found by solving constrained optimization problems. We will follow Merton (1972) and use the Lagrangian optimization technique to solve for the efficient portfolios. For an alternative characterization see Hansen and Richard (1987) or Cochrane (2005, Ch. 5). Before we go into the derivations of optimal portfolios, let us discuss the theoretical foundation of mean-variance analysis.
3.3.1 Theoretical foundation
In general an individual's utility of wealth will depend on all moments of wealth. This can be seen by the Taylor expansion of $u(W)$ around the expected wealth, $E[W]$:
$$u(W) = u(E[W]) + u'(E[W])(W - E[W]) + \frac12 u''(E[W])(W - E[W])^2 + \sum_{n=3}^{\infty} \frac{1}{n!}\, u^{(n)}(E[W])\,(W - E[W])^n,$$
where $u^{(n)}$ is the $n$th derivative of $u$. Taking expectations, we get
$$E[u(W)] = u(E[W]) + \frac12 u''(E[W]) \operatorname{Var}(W) + \sum_{n=3}^{\infty} \frac{1}{n!}\, u^{(n)}(E[W])\, E[(W - E[W])^n].$$
Here $E[(W - E[W])^n]$ is the central moment of order $n$. The variance is the central moment of order 2. Obviously, a greedy investor (which just means that $u$ is increasing) will prefer higher expected wealth to lower for fixed central moments of order 2 and higher. Moreover, a risk averse investor (so that $u'' < 0$) will prefer lower variance of wealth to higher for fixed expected wealth and fixed central moments of order 3 and higher. But when the central moments of order 3 and higher are not the same for all alternatives, we cannot just evaluate them on the basis of their expectation and variance. Of course, with quadratic utility, the derivatives of $u$ of order 3 and higher are zero, so the higher order moments of wealth are irrelevant. However, quadratic utility is a very unrealistic model of investor preferences.
Mean-variance analysis is valid if the returns on the risky assets are multivariate normally distributed, $R \sim N(\mu, \Sigma)$. Here, $\mu$ is a vector of the expected rates of return on the risky assets, and $\Sigma = (\Sigma_{ij})$ is the variance-covariance matrix of these rates of return, so that $\Sigma_{ij}$ denotes the covariance between the returns on asset $i$ and asset $j$. Given that the returns on all individual assets are normally distributed, the return on any portfolio, being a weighted average of the returns on the assets in the portfolio, will also be normally distributed. A portfolio characterized by the portfolio weights $\pi = (\pi_1, \dots, \pi_d)^\top$ on the risky assets and the weight $\pi_0 = 1 - \pi^\top\mathbf{1}$ on the risk-free asset has a return of
$$R^\pi \equiv \pi_0 r + \pi^\top R = r + \pi^\top(R - r\mathbf{1}) = r + \sum_{i=1}^{d} \pi_i (R_i - r),$$
which is normally distributed with mean and variance given by
$$\mu(\pi) \equiv E[R^\pi] = \pi_0 r + \pi^\top\mu = r + \pi^\top(\mu - r\mathbf{1}) = r + \sum_{i=1}^{d} \pi_i(\mu_i - r),$$
$$\sigma^2(\pi) \equiv \operatorname{Var}[R^\pi] = \pi^\top \Sigma\, \pi = \sum_{i=1}^{d}\sum_{j=1}^{d} \pi_i \pi_j \Sigma_{ij}.$$
Consequently, the end-of-period wealth of each investor will also be normally distributed for any portfolio choice. All higher-order moments of wealth can be written in terms of mean and variance so that expected utility depends only on expected wealth and the variance of wealth.
An obvious shortcoming of the assumption of normally distributed returns is the possibility of rates of return smaller than -100%, which is inconsistent with limited liability of securities. It also allows for negative end-of-period wealth and hence negative consumption with positive probability, which is clearly unreasonable. An alternative which at first looks promising is to assume that the end-of-period prices of individual assets are lognormally distributed, ruling out negative prices and rates of return below -100%. The lognormal distribution is also fully described by its first two moments. Unfortunately, such an assumption is not tractable in a one-period setting since neither the value nor the return on a portfolio will then be lognormally distributed (the lognormal distribution is not stable under addition).
3.3.2 Mean-variance analysis with only risky assets
Assume that the variance-covariance matrix $\Sigma$ is non-singular, which is the case if none of the assets are redundant, i.e., no asset has a return which is a linear combination of the returns of other assets. The inverse of $\Sigma$ is denoted by $\Sigma^{-1}$. A portfolio is said to be mean-variance efficient if it has the minimum return variance among all the portfolios with the same mean return. Given the normality assumption on returns, greedy and risk averse investors will only choose among the mean-variance efficient portfolios. Assuming that there are no portfolio constraints, we can find a mean-variance efficient portfolio with expected return $\bar\mu$ by solving the quadratic minimization problem
$$\min_{\pi}\ \frac12\, \pi^\top \Sigma\, \pi \quad \text{s.t.} \quad \pi^\top \mu = \bar\mu, \quad \pi^\top \mathbf{1} = 1.$$
The $\frac12$ in the objective will be notationally convenient when we solve the problem. Clearly, the portfolio that minimizes half the variance will also minimize the variance.
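We solve the problem by the Lagrange technique. Letting $\alpha$ and $\beta$ denote the Lagrange multipliers of the two constraints, the Lagrangian is
$$L = \frac12\, \pi^\top \Sigma\, \pi + \alpha\,(\bar\mu - \pi^\top \mu) + \beta\,(1 - \pi^\top \mathbf{1}).$$
The first-order condition with respect to $\pi$ is
$$\frac{\partial L}{\partial \pi} = \Sigma\pi - \alpha\mu - \beta\mathbf{1} = 0,$$
which implies that
$$\pi = \alpha\,\Sigma^{-1}\mu + \beta\,\Sigma^{-1}\mathbf{1}. \tag{3.12}$$
The first-order conditions with respect to the multipliers simply give the two constraints to the minimization problem. Substituting the expression (3.12) for $\pi$ into the two constraints, we obtain the equations
$$\alpha\,\mu^\top\Sigma^{-1}\mu + \beta\,\mathbf{1}^\top\Sigma^{-1}\mu = \bar\mu, \qquad \alpha\,\mu^\top\Sigma^{-1}\mathbf{1} + \beta\,\mathbf{1}^\top\Sigma^{-1}\mathbf{1} = 1.$$
Defining
$$A = \mu^\top\Sigma^{-1}\mu, \quad B = \mu^\top\Sigma^{-1}\mathbf{1} = \mathbf{1}^\top\Sigma^{-1}\mu, \quad C = \mathbf{1}^\top\Sigma^{-1}\mathbf{1}, \quad D = AC - B^2, \tag{3.13}$$
we can write the solution to these two equations in $\alpha$ and $\beta$ as
$$\alpha = \frac{C\bar\mu - B}{D}, \qquad \beta = \frac{A - B\bar\mu}{D}.$$
Substituting this into (3.12) we obtain
$$\pi(\bar\mu) = \frac{C\bar\mu - B}{D}\,\Sigma^{-1}\mu + \frac{A - B\bar\mu}{D}\,\Sigma^{-1}\mathbf{1}. \tag{3.14}$$
Some tedious calculations show that the variance of the return on this portfolio is equal to
$$\sigma^2(\bar\mu) \equiv \pi(\bar\mu)^\top \Sigma\, \pi(\bar\mu) = \frac{C\bar\mu^2 - 2B\bar\mu + A}{D}. \tag{3.15}$$
This is to be shown in Exercise 3.3. We see that the combinations of variance and mean form a parabola in a (mean, variance)-diagram.
Traditionally the portfolios are depicted in a (standard deviation, mean)-diagram. The above relation can also be written as
$$\frac{\sigma^2(\bar\mu)}{1/C} - \frac{(\bar\mu - B/C)^2}{D/C^2} = 1,$$
from which it follows that the optimal combinations of standard deviation and mean form a hyperbola in the (standard deviation, mean)-diagram. This hyperbola is called the mean-variance frontier of risky assets. The mean-variance efficient portfolios are sometimes called frontier portfolios.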
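The frontier formulas can be checked numerically. The sketch below computes the constants $A$, $B$, $C$, $D$ of (3.13) for three risky assets, builds the frontier portfolio (3.14) for a target mean, and compares its variance with (3.15) and with another portfolio that has the same mean. The expected returns, the variance-covariance matrix, the target mean and the helper functions are assumptions chosen for illustration.

```python
def solve(m, v):
    """Solve m x = v by Gauss-Jordan elimination with partial pivoting."""
    n = len(v)
    a = [row[:] + [v[i]] for i, row in enumerate(m)]
    for col in range(n):
        piv = max(range(col, n), key=lambda i: abs(a[i][col]))
        a[col], a[piv] = a[piv], a[col]
        for i in range(n):
            if i != col:
                factor = a[i][col] / a[col][col]
                a[i] = [x - factor * y for x, y in zip(a[i], a[col])]
    return [a[i][n] / a[i][i] for i in range(n)]

def dot(x, y):
    return sum(p * q for p, q in zip(x, y))

mu = [0.06, 0.09, 0.12]                      # assumed expected returns
sigma = [[0.0225, 0.0060, 0.0040],
         [0.0060, 0.0400, 0.0120],
         [0.0040, 0.0120, 0.0900]]           # assumed variance-covariance matrix
ones = [1.0, 1.0, 1.0]

s_inv_mu = solve(sigma, mu)                  # Sigma^{-1} mu
s_inv_one = solve(sigma, ones)               # Sigma^{-1} 1
A, B, C = dot(mu, s_inv_mu), dot(mu, s_inv_one), dot(ones, s_inv_one)
D = A * C - B * B

def frontier_portfolio(mu_bar):
    """Portfolio (3.14) with the minimum variance among portfolios with mean mu_bar."""
    alpha = (C * mu_bar - B) / D
    beta = (A - B * mu_bar) / D
    return [alpha * x + beta * y for x, y in zip(s_inv_mu, s_inv_one)]

def variance(pi):
    return sum(pi[i] * sigma[i][j] * pi[j] for i in range(3) for j in range(3))

mu_bar = 0.10
pi = frontier_portfolio(mu_bar)
formula_variance = (C * mu_bar ** 2 - 2 * B * mu_bar + A) / D   # equation (3.15)
other = [0.0, 2.0 / 3.0, 1.0 / 3.0]          # another portfolio with mean 0.10
print(pi, variance(pi), formula_variance, variance(other))
```

With only two risky assets the two constraints would already pin down the portfolio, so three assets are used here to make the variance comparison meaningful.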
Before we proceed let us clarify a point in the derivation above. We have assumed that $D$ is non-zero. In fact, $D > 0$. To see this is true, first recall the following definition. A symmetric $d \times d$ matrix $\Sigma$ is said to be positive definite if $\pi^\top \Sigma\, \pi > 0$ for any non-zero $d$-vector $\pi$. Since in our case $\pi^\top \Sigma\, \pi$ equals the variance of the portfolio $\pi$ and all portfolios of risky assets will have a return with positive variance, the variance-covariance matrix is indeed a positive definite matrix. A result in linear algebra says that the inverse $\Sigma^{-1}$ is then also positive definite, i.e., $x^\top \Sigma^{-1} x > 0$ for any non-zero $d$-vector $x$. In particular we have $A > 0$ and $C > 0$. Also
$$AD = A(AC - B^2) = (B\mu - A\mathbf{1})^\top \Sigma^{-1} (B\mu - A\mathbf{1}) > 0$$
and since $A > 0$ we must have $D > 0$.
The minimum-variance portfolio is the portfolio that has the minimum variance among all portfolios. We can find this directly by solving the constrained minimization problem
$$\min_{\pi}\ \frac12\, \pi^\top \Sigma\, \pi \quad \text{s.t.} \quad \pi^\top \mathbf{1} = 1$$
where there is no constraint on the mean portfolio return. Alternatively, we can minimize the variance $\sigma^2(\bar\mu)$ in (3.15) over all $\bar\mu$. Taking the latter route, we find that the minimum variance is obtained when the mean return is $\mu_{\min} = B/C$ and the minimum variance is given by $\sigma^2_{\min} = \sigma^2(\mu_{\min}) = 1/C$. From (3.14) we get that the minimum-variance portfolio is
$$\pi_{\min} = \frac{1}{C}\,\Sigma^{-1}\mathbf{1} = \frac{1}{\mathbf{1}^\top\Sigma^{-1}\mathbf{1}}\,\Sigma^{-1}\mathbf{1}. \tag{3.16}$$
It can be shown that the portfolio
$$\pi_{\text{slope}} = \frac{1}{B}\,\Sigma^{-1}\mu = \frac{1}{\mathbf{1}^\top\Sigma^{-1}\mu}\,\Sigma^{-1}\mu \tag{3.17}$$
is the portfolio that maximizes the slope of a straight line between the origin and a point on the mean-variance frontier in the $(\sigma, \mu)$-diagram. (This follows as a special case of the tangency portfolio derived in the following subsection.) Let us call $\pi_{\text{slope}}$ the maximum slope portfolio. This portfolio has mean $A/B$ and variance $A/B^2$. From (3.14) we see that any mean-variance optimal portfolio can be written as a linear combination of the maximum slope portfolio and the minimum-variance portfolio:
$$\pi(\bar\mu) = \frac{(C\bar\mu - B)B}{D}\,\pi_{\text{slope}} + \frac{(A - B\bar\mu)C}{D}\,\pi_{\min}.$$
Note that the two multipliers of the portfolios sum to one. This is a two-fund separation result. If the investors can only form portfolios of the $d$ risky assets with normally distributed returns, any greedy and risk-averse investor will choose a combination of two special portfolios or funds, namely the maximum slope portfolio and the minimum-variance portfolio. These two portfolios are said to generate the mean-variance frontier of risky assets. In fact, it can be shown that any other two frontier portfolios generate the entire frontier.
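The two-fund separation result lends itself to a direct numerical check. The sketch below computes the minimum-variance portfolio (3.16) and the maximum slope portfolio (3.17) for three risky assets, verifies their means and variances, and confirms that a frontier portfolio from (3.14) equals the stated combination of the two funds. The input data, the target mean and the helper functions are assumptions chosen for illustration.

```python
def solve(m, v):
    """Solve m x = v by Gauss-Jordan elimination with partial pivoting."""
    n = len(v)
    a = [row[:] + [v[i]] for i, row in enumerate(m)]
    for col in range(n):
        piv = max(range(col, n), key=lambda i: abs(a[i][col]))
        a[col], a[piv] = a[piv], a[col]
        for i in range(n):
            if i != col:
                factor = a[i][col] / a[col][col]
                a[i] = [x - factor * y for x, y in zip(a[i], a[col])]
    return [a[i][n] / a[i][i] for i in range(n)]

def dot(x, y):
    return sum(p * q for p, q in zip(x, y))

def variance(pi, sigma):
    n = len(pi)
    return sum(pi[i] * sigma[i][j] * pi[j] for i in range(n) for j in range(n))

mu = [0.06, 0.09, 0.12]                      # assumed expected returns
sigma = [[0.0225, 0.0060, 0.0040],
         [0.0060, 0.0400, 0.0120],
         [0.0040, 0.0120, 0.0900]]           # assumed variance-covariance matrix
ones = [1.0, 1.0, 1.0]

s_inv_mu, s_inv_one = solve(sigma, mu), solve(sigma, ones)
A, B, C = dot(mu, s_inv_mu), dot(mu, s_inv_one), dot(ones, s_inv_one)
D = A * C - B * B

pi_min = [y / C for y in s_inv_one]          # equation (3.16)
pi_slope = [x / B for x in s_inv_mu]         # equation (3.17)

mu_bar = 0.10
w_slope = (C * mu_bar - B) * B / D           # weight on the maximum slope portfolio
w_min = (A - B * mu_bar) * C / D             # weight on the minimum-variance portfolio
combined = [w_slope * s + w_min * m for s, m in zip(pi_slope, pi_min)]
direct = [(C * mu_bar - B) / D * x + (A - B * mu_bar) / D * y
          for x, y in zip(s_inv_mu, s_inv_one)]   # equation (3.14)
print(pi_min, pi_slope, w_slope + w_min)
```

The two fund weights sum to one for any target mean, so the investor's choice along the frontier reduces to a single number.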
Figure 3.1 shows an example of the mean-variance frontier generated from 10 individual assets.
3.3.3 Mean-variance analysis with both risky assets and a risk-free asset
A risk-free asset corresponds to a point $(0, r)$ in the (standard deviation, mean)-diagram. The investors can combine any portfolio of risky assets with an investment in the risk-free asset. The (standard deviation, mean)-pairs that can be obtained by such a combination form a straight line between the point $(0, r)$ and the point corresponding to the portfolio of risky assets. Suppose for example that we invest a fraction $\alpha \le 1$ of wealth in the risk-free asset and the fraction $1 - \alpha \ge 0$ in a given portfolio of risky assets with some expected rate of return $\bar\mu$ and some standard deviation $\bar\sigma$. Then the mean and standard deviation of the combined portfolio are
$$\mu(\alpha) = \alpha r + (1 - \alpha)\bar\mu, \qquad \sigma(\alpha) = (1 - \alpha)\bar\sigma.$$
Consequently,
$$\mu(\alpha) = r + \frac{\bar\mu - r}{\bar\sigma}\,\sigma(\alpha)$$
so that the set of points $\{(\sigma(\alpha), \mu(\alpha)) \mid \alpha \le 1\}$ will form a straight line.[1]

Figure 3.1: The mean-variance frontier. The curve shows the mean-variance frontier generated from the 10 individual assets corresponding to the red x's. (Horizontal axis: standard deviation; vertical axis: expected return.)
Other things equal, greedy and risk-averse investors want high expected return and low standard deviation so they will move as far to the north-west as possible in the diagram. Therefore they will pick a point somewhere on the upward-sloping line that is tangent to the mean-variance frontier of risky assets and goes through the point $(0, r)$. The point where this line is tangent to the frontier of risky assets corresponds to a portfolio which we refer to as the tangency portfolio. This is a portfolio of risky assets only. It is the portfolio that maximizes the Sharpe ratio over all risky portfolios. The Sharpe ratio of a portfolio is the ratio $(\mu(\pi) - r)/\sigma(\pi)$ between the excess expected return of a portfolio and the standard deviation of the return.
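[1] For $\alpha > 1$, the standard deviation of the combined portfolio is $\sigma(\alpha) = -(1 - \alpha)\bar\sigma$ so that we get $\mu(\alpha) = r - [(\bar\mu - r)/\bar\sigma]\,\sigma(\alpha)$.

To determine the tangency portfolio we consider the problem
$$\max_{\pi}\ \frac{\pi^\top \mu - r}{(\pi^\top \Sigma\, \pi)^{1/2}} \quad \text{s.t.} \quad \pi^\top \mathbf{1} = 1.$$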
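Applying the constraint, the objective function can be rewritten as
$$f(\pi) = \frac{\pi^\top(\mu - r\mathbf{1})}{(\pi^\top \Sigma\, \pi)^{1/2}} = \pi^\top(\mu - r\mathbf{1})\,(\pi^\top \Sigma\, \pi)^{-1/2}.$$
The derivative is
$$\frac{\partial f}{\partial \pi} = (\mu - r\mathbf{1})\,(\pi^\top \Sigma\, \pi)^{-1/2} - (\pi^\top \Sigma\, \pi)^{-3/2}\,\pi^\top(\mu - r\mathbf{1})\,\Sigma\pi$$
and $\partial f/\partial \pi = 0$ implies that
$$\frac{\pi^\top(\mu - r\mathbf{1})}{\pi^\top \Sigma\, \pi}\,\pi = \Sigma^{-1}(\mu - r\mathbf{1}), \tag{3.18}$$
which we want to solve for $\pi$. Note that the equation has a vector on each side. If two vectors are identical, they will also be identical after a division by the sum of the elements of the vector. The sum of the elements of the vector on the left-hand side of (3.18) is
$$\mathbf{1}^\top\Bigl(\frac{\pi^\top(\mu - r\mathbf{1})}{\pi^\top \Sigma\, \pi}\,\pi\Bigr) = \frac{\pi^\top(\mu - r\mathbf{1})}{\pi^\top \Sigma\, \pi}\,\mathbf{1}^\top\pi = \frac{\pi^\top(\mu - r\mathbf{1})}{\pi^\top \Sigma\, \pi},$$
where the last equality is due to the constraint. The sum of the elements of the vector on the right-hand side of (3.18) is simply $\mathbf{1}^\top\Sigma^{-1}(\mu - r\mathbf{1})$. Dividing each side of (3.18) by the sum of the elements we obtain the tangency portfolio
$$\pi_{\tan} = \frac{\Sigma^{-1}(\mu - r\mathbf{1})}{\mathbf{1}^\top\Sigma^{-1}(\mu - r\mathbf{1})}. \tag{3.19}$$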
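The expectation and standard deviation of the rate of return on the tangency portfolio are given by
$$\mu_{\tan} = \mu^\top \pi_{\tan} = \frac{\mu^\top\Sigma^{-1}(\mu - r\mathbf{1})}{\mathbf{1}^\top\Sigma^{-1}(\mu - r\mathbf{1})},$$
$$\sigma_{\tan} = \bigl(\pi_{\tan}^\top \Sigma\, \pi_{\tan}\bigr)^{1/2} = \frac{\bigl((\mu - r\mathbf{1})^\top\Sigma^{-1}(\mu - r\mathbf{1})\bigr)^{1/2}}{\mathbf{1}^\top\Sigma^{-1}(\mu - r\mathbf{1})}.$$
The maximum Sharpe ratio, i.e., the slope of the line, is thus
$$\begin{aligned}
\frac{\mu_{\tan} - r}{\sigma_{\tan}} &= \frac{\dfrac{\mu^\top\Sigma^{-1}(\mu - r\mathbf{1})}{\mathbf{1}^\top\Sigma^{-1}(\mu - r\mathbf{1})} - r}{\dfrac{\bigl((\mu - r\mathbf{1})^\top\Sigma^{-1}(\mu - r\mathbf{1})\bigr)^{1/2}}{\mathbf{1}^\top\Sigma^{-1}(\mu - r\mathbf{1})}} \\
&= \frac{\mu^\top\Sigma^{-1}(\mu - r\mathbf{1}) - r\,[\mathbf{1}^\top\Sigma^{-1}(\mu - r\mathbf{1})]}{\bigl((\mu - r\mathbf{1})^\top\Sigma^{-1}(\mu - r\mathbf{1})\bigr)^{1/2}} \\
&= \frac{(\mu - r\mathbf{1})^\top\Sigma^{-1}(\mu - r\mathbf{1})}{\bigl((\mu - r\mathbf{1})^\top\Sigma^{-1}(\mu - r\mathbf{1})\bigr)^{1/2}} \\
&= \bigl((\mu - r\mathbf{1})^\top\Sigma^{-1}(\mu - r\mathbf{1})\bigr)^{1/2}.
\end{aligned}$$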
The upward-sloping straight line between the points $(0, r)$ and $(\sigma_{\tan}, \mu_{\tan})$ constitutes the mean-variance frontier of all assets. Again we have two-fund separation since all investors will combine just two funds, where one fund is simply the risk-free asset and the other is the tangency portfolio. This result is the basis for the famous Capital Asset Pricing Model (CAPM) developed by Sharpe (1964), Lintner (1965), and Mossin (1966). Note that also in this setting all investors will hold different risky assets in the same proportion to each other, i.e., for any $i, j \in \{1, \dots, d\}$ the ratio $\pi_i/\pi_j$ is the same for all investors.
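The tangency portfolio (3.19) and the maximum Sharpe ratio can be computed in a few lines. The sketch below does so for two risky assets and checks that the Sharpe ratio of the tangency portfolio equals $((\mu - r\mathbf{1})^\top\Sigma^{-1}(\mu - r\mathbf{1}))^{1/2}$ and exceeds that of an equally weighted portfolio. The inputs $\mu$, $\Sigma$, $r$ and the helper names are assumptions made for illustration, and $\mathbf{1}^\top\Sigma^{-1}(\mu - r\mathbf{1})$ is positive for these inputs.

```python
import math

def solve_2x2(m, v):
    """Solve the 2x2 linear system m x = v by Cramer's rule."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [(v[0] * m[1][1] - m[0][1] * v[1]) / det,
            (m[0][0] * v[1] - m[1][0] * v[0]) / det]

r = 0.02                          # assumed risk-free rate
mu = [0.08, 0.05]                 # assumed expected returns
sigma = [[0.04, 0.006],
         [0.006, 0.01]]           # assumed variance-covariance matrix

excess = [m - r for m in mu]
raw = solve_2x2(sigma, excess)                 # Sigma^{-1} (mu - r 1)
pi_tan = [x / sum(raw) for x in raw]           # equation (3.19)

def sharpe(pi):
    """Sharpe ratio (mu(pi) - r) / sigma(pi) of a portfolio of risky assets only."""
    mean_excess = sum(p * e for p, e in zip(pi, excess))
    var = sum(pi[i] * sigma[i][j] * pi[j] for i in range(2) for j in range(2))
    return mean_excess / math.sqrt(var)

max_sharpe = math.sqrt(sum(e * x for e, x in zip(excess, raw)))
print(pi_tan, sharpe(pi_tan), max_sharpe, sharpe([0.5, 0.5]))
```

For these inputs the tangency weights work out to one third and two thirds, and any investor mixing this portfolio with the risk-free asset holds the two risky assets in the ratio 1:2.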
Exactly which combination of the two generating portfolios a particular investor prefers is in general difficult to determine. For the unrealistic case of negative exponential utility (CARA) the optimal combination can be determined in closed form as shown in Section 3.2. For other utility functions numerical optimization is necessary. In this regard the only advantage of the mean-variance framework is the two-fund separation result since that allows us to look for a single portfolio weight (the fraction of wealth invested in the tangency portfolio) rather than portfolio weights of all risky assets. The numerical optimization is thus simpler assuming the mean-variance set-up.
Note that due to the assumption of normally distributed returns, the terminal wealth of the investor can go anywhere from $-\infty$ to $+\infty$ as long as some non-zero amount is invested in some risky asset. For utility functions with infinite marginal utility at a level higher than $-\infty$, the utility-maximizing decision will be to invest the entire wealth in the risk-free asset. This is for example the case for CRRA utility. The assumptions of the mean-variance analysis thus rule out its application for reasonable utility functions!
3.4 A numerical example
TO COME...
3.5 Mean-variance analysis with constraints
TO COME...
Elton, Gruber, and Padberg (1976), Alexander (1993), Best and Grauer (1991): non-negativity
constraints
Alexander, Baptista, and Yan (2007): Value-at-risk type constraints
3.6 Estimation
Mean-variance optimization is quite sensitive to the magnitudes of the inputs, i.e., expected returns, variances, and covariances. Chopra and Ziemba (1993) show that it is particularly important to obtain precise estimates of the expected returns. On the other hand, the expected returns are very hard to estimate precisely from historical returns, cf., e.g., Merton (1980).
For more on estimation and model uncertainty and how that affects optimal portfolio choice, see Garlappi, Uppal, and Wang (2007) and the references therein...
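The sensitivity to expected returns can be seen in a toy calculation. The sketch below computes the tangency portfolio (3.19) for two risky assets and then again after raising the expected return of the first asset by one percentage point. The inputs and helper names are assumptions for illustration only; they are not taken from the studies cited above.

```python
def solve_2x2(m, v):
    """Solve the 2x2 linear system m x = v by Cramer's rule."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [(v[0] * m[1][1] - m[0][1] * v[1]) / det,
            (m[0][0] * v[1] - m[1][0] * v[0]) / det]

def tangency(mu, sigma, r):
    """Tangency portfolio Sigma^{-1}(mu - r1) / (1^T Sigma^{-1}(mu - r1))."""
    raw = solve_2x2(sigma, [m - r for m in mu])
    return [x / sum(raw) for x in raw]

r = 0.02                          # assumed risk-free rate
sigma = [[0.04, 0.006],
         [0.006, 0.01]]           # assumed variance-covariance matrix

base = tangency([0.08, 0.05], sigma, r)      # baseline expected returns
bumped = tangency([0.09, 0.05], sigma, r)    # first expected return raised by 0.01
print(base, bumped)
```

In this example the weight on the first asset moves from one third to 0.4, a shift of almost seven percentage points caused by a one-percentage-point change in a single input.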
3.7 Critique of the one-period framework
• Investors typically get utility from consumption at many points in time and not simply from the wealth level at one particular date.
• Even in the case where the investor only obtains utility from wealth at one date, she has the opportunity to change her portfolio over time, which she would normally do as new information arises (e.g., when stock prices and interest rates change) or simply because time passes. Investors live in a dynamic model and will take decisions dynamically. Of course, the existence of transaction costs is a reason for not changing the portfolio too frequently, but if we are really worried about transaction costs we should explicitly model that imperfection; the analysis of such models is quite difficult, however.
• Consumption and investment decisions are generally not to be separated from each other. Investments are meant to generate future consumption!
• The normality (or similar sufficient distributional) assumption employed in the mean-variance analysis is not reasonable, neither from a theoretical nor an empirical point of view. For example, the normal distribution allocates a strictly positive probability to a return below -100%, which cannot happen for investments in securities with limited liability.
3.8 Exercises
Exercise 3.1. Provide the details of the proof of part (ii) in Theorem 3.2.
Exercise 3.2. Give a proof of Theorem 3.6 for the case of negative exponential utility where
marginal utility is given by (3.8).
Exercise 3.3. Show Equation (3.15).
Exercise 3.4. Let $R_\pi$ denote the return on a portfolio $\pi$ located on the mean-variance efficient frontier for risky assets only and suppose that $\pi$ is different from the minimum-variance portfolio. Show that there is a portfolio $z(\pi)$ also located on the mean-variance efficient frontier for risky assets only, which has the property that $\mathrm{Cov}[R_\pi, R_{z(\pi)}] = 0$. Show that $\mathrm{E}[R_{z(\pi)}] = (A - B\,\mathrm{E}[R_\pi])/(B - C\,\mathrm{E}[R_\pi])$, where $A$, $B$, and $C$ are the constants defined in (3.13). Hint: First show that the covariance between the return on the efficient portfolio with mean $m_1$ and the return on the efficient portfolio with mean $m_2$ is equal to $(C m_1 m_2 - B[m_1 + m_2] + A)/D$.
Exercise 3.5. Let $R_{\min}$ denote the return on the minimum-variance portfolio of risky assets. Let $R$ be the return on any risky asset or portfolio of risky assets, efficient or not. Show that $\mathrm{Cov}[R, R_{\min}] = \mathrm{Var}[R_{\min}]$. Hint: Consider a portfolio consisting of a fraction $a$ in this risky asset and a fraction $(1 - a)$ in the minimum-variance portfolio. Compute the variance of the return on this portfolio and realize that the variance has to be minimized for $a = 0$.
Exercise 3.6. Let $R_1$ denote the return on a mean-variance efficient portfolio of risky assets and let $R_2$ denote another, not necessarily efficient, portfolio of risky assets with $\mathrm{E}[R_2] = \mathrm{E}[R_1]$. Show that $\mathrm{Cov}[R_1, R_2] = \mathrm{Var}[R_1]$ and conclude that $R_1$ and $R_2$ are positively correlated.
CHAPTER 4
Discrete-time multi-period models
4.1 Introduction
To study dynamic consumption and investment decisions, several papers have looked at multi-
period, discrete-time models where the investor has the opportunity to consume and rebalance
her portfolio at a number of fixed dates. Certainly this is a valuable extension of the single-
period setting, but it is still a limitation that the investor can only change her decisions at
pre-specified points in time and not react to new information arriving between these points in time.
A continuous-time model seems more reasonable. Furthermore, the results on optimal consumption
and investment strategies are typically clearer in continuous-time models than in discrete-time
models, and the necessary mathematical computations are much more elegant in a continuous-
time framework. Therefore, we will not give much attention to multi-period, discrete-time models.
However, some aspects of the set-up of continuous-time models may be easier to understand if we
start by looking at a discrete-time model and then take the limit as the period length goes to zero.
The basic references for the discrete-time models are Samuelson (1969), Hakansson (1970), Fama
(1970, 1976), and Ingersoll (1987, Ch. 11).
4.2 A multi-period, discrete-time framework for asset allocation
We consider an individual living over the time interval [0, T] and assume that the individual can
revise consumption and investment decisions at time points $t_n = n\Delta t$, cf. the time line below. The terminal date $T$ is assumed to be a multiple of the decision frequency, $T = N\Delta t$. We define the set $\mathcal{T} = \{t_0, t_1, \dots, t_{N-1}\}$ of time points, where decisions are made. At the terminal date $T$ no decisions are made.
[Time line: $t_0 = 0$, $t_1$, $t_2$, ..., $t_{N-1}$, $t_N = T$, with consecutive dates a distance $\Delta t$ apart.]
We will assume that at any time $t \in \mathcal{T}$, the individual can invest in $d + 1$ assets. Asset 0 is an asset with a known return $r_t \Delta t$ over the next period, i.e., over the interval $[t, t + \Delta t]$, so that $r_t$ is the annualized short-term risk-free rate at time $t$. The returns on this asset in later periods are not necessarily known yet, but at least the asset is risk-free over the next period. The value at time $t$ of a dollar invested at time 0 and subsequently rolled over at the risk-free rate is denoted by $P^0_t$. We will refer to this investment as a unit bank account. The other assets $1, 2, \dots, d$ are risky assets, i.e., assets with unknown returns even over the next period. For any $t \in \mathcal{T}$ and for $t = T$, we denote by $P_t = (P^1_t, \dots, P^d_t)^\top$ the vector of prices of the $d$ risky assets at time $t$. We assume for notational simplicity that the assets do not pay intermediate dividends so that returns are given only by percentage price changes. Let $R^i_{t+\Delta t} = (P^i_{t+\Delta t} - P^i_t)/P^i_t$ denote the return on risky asset $i$ over the interval $[t, t + \Delta t]$ and let $R_{t+\Delta t} = (R^1_{t+\Delta t}, \dots, R^d_{t+\Delta t})^\top$ denote the vector of returns on all the risky assets over the same interval.
At any time $t \in \mathcal{T}$ the investor chooses a portfolio which is held unchanged until time $t + \Delta t$ and a consumption rate $c_t$ such that the total consumption in the interval $[t, t + \Delta t)$ is $c_t \Delta t$. (We assume that there is a single consumption good so that $c_t$ is one-dimensional.) This is subtracted from her wealth at time $t$. Of course, the portfolio and consumption chosen at time $t$ for the interval $[t, t + \Delta t]$ can only be based on the information known at time $t$. We assume that there is no consumption or investment beyond time $T$, which we can think of as the time of death (assumed to be known in advance!).
For the purposes of deriving the budget constraint we will first represent the portfolio by the number of units of each asset held. For any $t \in \mathcal{T}$, we let $M^i_t$ denote the number of units of asset $i = 0, 1, \dots, d$ held in the period $[t, t + \Delta t)$. We will allow for the case where the agent earns income from other sources than her financial investments. We let $y_t$ be the rate of income earned in the period $[t, t + \Delta t)$ such that the entire income in this period is $y_t \Delta t$. We assume that the agent receives this amount at time $t$. Note that we do not model the labor supply decision resulting in this income, but take $y_t$ as exogenously given.
The agent enters date $t \in \mathcal{T}$ with a wealth of
\[
W_t = \sum_{i=0}^{d} M^i_{t-\Delta t} P^i_t .
\]
This is the value of her portfolio chosen in the previous period. She then receives income $y_t \Delta t$ and simultaneously has to choose the consumption rate $c_t$ and the new portfolio represented by $M^0_t, M^1_t, \dots, M^d_t$. The budget restriction on these choices is that
\[
(y_t - c_t)\,\Delta t = \sum_{i=0}^{d} \left( M^i_t - M^i_{t-\Delta t} \right) P^i_t ,
\]
i.e., that income net of consumption equals the extra amount invested in the financial market. We then get that
\[
\begin{aligned}
W_{t+\Delta t} - W_t
&= \sum_{i=0}^{d} M^i_t P^i_{t+\Delta t} - \sum_{i=0}^{d} M^i_{t-\Delta t} P^i_t \\
&= \sum_{i=0}^{d} M^i_t \left( P^i_{t+\Delta t} - P^i_t \right) + \sum_{i=0}^{d} \left( M^i_t - M^i_{t-\Delta t} \right) P^i_t \\
&= \sum_{i=0}^{d} M^i_t \left( P^i_{t+\Delta t} - P^i_t \right) + (y_t - c_t)\,\Delta t .
\end{aligned}
\]
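Let $\theta^i_t = M^i_t P^i_t$ denote the amount invested in asset $i$ at time $t \in \mathcal{T}$ and let $\theta_t = (\theta^1_t, \dots, \theta^d_t)^\top$. Then the change in wealth can be rewritten as
\[
W_{t+\Delta t} - W_t = \theta^0_t r_t \Delta t + \theta_t^\top R_{t+\Delta t} + (y_t - c_t)\,\Delta t . \tag{4.1}
\]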
We can also represent the portfolio by the fractions of wealth invested in the different assets. After receiving income and consuming at time $t$, the funds invested will be $W_t + (y_t - c_t)\Delta t$. Assuming this is non-zero, we can define the portfolio weight of asset $i$ at time $t$ as
\[
\pi^i_t = \frac{\theta^i_t}{W_t + (y_t - c_t)\Delta t}, \quad i = 0, 1, \dots, d .
\]
The vector of portfolio weights in the risky assets is denoted by $\pi_t = (\pi^1_t, \dots, \pi^d_t)^\top$. By construction the portfolio weight of the bank account is given by $\pi^0_t = 1 - \pi_t^\top \mathbf{1} = 1 - \sum_{i=1}^{d} \pi^i_t$. The end-of-period wealth can then be restated as
\[
W_{t+\Delta t} = (W_t + y_t \Delta t - c_t \Delta t)\, R^W_{t+\Delta t}, \tag{4.2}
\]
where
\[
R^W_{t+\Delta t} = 1 + r_t \Delta t + \pi_t^\top \left( R_{t+\Delta t} - r_t \Delta t\, \mathbf{1} \right). \tag{4.3}
\]
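As a quick consistency check, the two representations of the one-period budget constraint, (4.1) in amounts and (4.2)-(4.3) in portfolio weights, can be compared numerically. The sketch below uses hypothetical function names and input values; it only illustrates that both routes give the same end-of-period wealth.

```python
def next_wealth_amounts(W, y, c, r, dt, theta, R):
    """End-of-period wealth via (4.1); theta holds the amounts in the risky assets."""
    invested = W + (y - c) * dt
    theta0 = invested - sum(theta)  # amount left in the bank account
    change = theta0 * r * dt + sum(th * Ri for th, Ri in zip(theta, R)) + (y - c) * dt
    return W + change

def next_wealth_weights(W, y, c, r, dt, pi, R):
    """End-of-period wealth via (4.2)-(4.3); pi holds the risky portfolio weights."""
    invested = W + (y - c) * dt
    gross = 1.0 + r * dt + sum(p * (Ri - r * dt) for p, Ri in zip(pi, R))
    return invested * gross

# Illustrative inputs (all values are assumptions)
W, y, c, r, dt = 100.0, 20.0, 30.0, 0.02, 0.25
pi = (0.4, 0.3)
R = (0.05, -0.02)  # realized risky returns over the period
invested = W + (y - c) * dt
theta = tuple(p * invested for p in pi)

w_amounts = next_wealth_amounts(W, y, c, r, dt, theta, R)
w_weights = next_wealth_weights(W, y, c, r, dt, pi, R)
print(w_amounts, w_weights)
```

Both functions take the amounts or weights as given; neither performs any optimization.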
Note that the only random variable (seen from time $t$) on the right-hand side of these wealth expressions is the return vector $R_{t+\Delta t}$. Let us decompose the return into an expected and an unexpected part,
\[
R_{t+\Delta t} = \mu_t \Delta t + \sigma_t \varepsilon_{t+\Delta t} \sqrt{\Delta t} . \tag{4.4}
\]
Here $\mu_t$ is the vector of expected rates of return per year, $\varepsilon_{t+\Delta t}$ is a vector of independent stochastic shocks all with mean zero and variance one, and $\sigma_t$ is a matrix determining how the returns are affected by these shocks. The values of $\mu_t$ and $\sigma_t$ are known at time $t$. The realization of the shock vector $\varepsilon_{t+\Delta t}$ will be known at time $t + \Delta t$, just before the consumption and portfolio decisions at that date are taken. It follows that, seen at time $t$, the variance-covariance matrix of $R_{t+\Delta t}$ is given by $\sigma_t \sigma_t^\top \Delta t$. The elements in $\sigma_t \sigma_t^\top$ are hence annualized variances and covariances.
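The wealth dynamics (4.1) can now be rewritten as
\[
W_{t+\Delta t} - W_t = \left[ \theta^0_t r_t + \theta_t^\top \mu_t + y_t - c_t \right] \Delta t + \theta_t^\top \sigma_t \varepsilon_{t+\Delta t} \sqrt{\Delta t} . \tag{4.5}
\]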
At time 0 the investor must choose the entire consumption rate process $c = (c_t)_{t \in \mathcal{T}}$ and the entire portfolio process represented by $\theta = (\theta_t)_{t \in \mathcal{T}}$ or $\pi = (\pi_t)_{t \in \mathcal{T}}$. In other words, she must choose the current values $c_0$ and $\pi_0$ and for each future date $t_n$ (with $n = 1, \dots, N - 1$) she must choose a consumption rate $c_{t_n}(\omega)$ and a portfolio $\pi_{t_n}(\omega)$ for each possible state $\omega$ of the world at day $t_n$.
We assume that the life-time utility of consumption and terminal wealth is given by
\[
U(c_{t_0}, c_{t_1}, \dots, c_{t_{N-1}}, W_T) = \sum_{n=0}^{N-1} e^{-\delta t_n} u(c_{t_n})\,\Delta t + e^{-\delta T} \bar{u}(W_T)
\]
as discussed in Section 2.7. The maximal obtainable expected life-time utility seen from time 0 is therefore
\[
J_0 = \sup_{(c_{t_n}, \pi_{t_n})_{n=0}^{N-1}} \mathrm{E}\left[ \sum_{n=0}^{N-1} e^{-\delta t_n} u(c_{t_n})\,\Delta t + e^{-\delta T} \bar{u}(W_T) \right],
\]
where the supremum is taken over all budget-feasible consumption and investment strategies. Similarly, for each $t = i\Delta t \in \mathcal{T}$, we define
\[
J_t = \sup_{(c_{t_n}, \pi_{t_n})_{n=i}^{N-1}} \mathrm{E}_t\left[ \sum_{n=i}^{N-1} e^{-\delta (t_n - t)} u(c_{t_n})\,\Delta t + e^{-\delta (T - t)} \bar{u}(W_T) \right], \tag{4.6}
\]
where the subscript on the expectations operator denotes that the expectation is taken conditional on the information known to the agent at time $t = t_i$. $J$ is often called the indirect or derived utility of wealth process or function, since it measures the highest attainable expected life-time utility the investor can derive from her current wealth in the current state of the world. Note that $J_T = \bar{u}(W_T)$.
4.3 Dynamic programming in discrete-time models
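In the definition of indirect utility in (4.6) the maximization is over both the current and all future consumption rates and portfolios. This is clearly a complicated maximization problem. We will now show that we can alternatively perform a sequence of simpler maximization problems. This result is based on the following manipulations, where $t = t_i = i\Delta t$ as before:
\[
\begin{aligned}
J_t &= \sup_{(c_{t_n}, \pi_{t_n})_{n=i}^{N-1}} \mathrm{E}_t\left[ \sum_{n=i}^{N-1} e^{-\delta (t_n - t)} u(c_{t_n})\,\Delta t + e^{-\delta (T - t)} \bar{u}(W_T) \right] \\
&= \sup_{(c_{t_n}, \pi_{t_n})_{n=i}^{N-1}} \mathrm{E}_t\left[ u(c_t)\,\Delta t + \sum_{n=i+1}^{N-1} e^{-\delta (t_n - t)} u(c_{t_n})\,\Delta t + e^{-\delta (T - t)} \bar{u}(W_T) \right] \\
&= \sup_{(c_{t_n}, \pi_{t_n})_{n=i}^{N-1}} \mathrm{E}_t\left[ u(c_t)\,\Delta t + \mathrm{E}_{t+\Delta t}\left[ \sum_{n=i+1}^{N-1} e^{-\delta (t_n - t)} u(c_{t_n})\,\Delta t + e^{-\delta (T - t)} \bar{u}(W_T) \right] \right] \\
&= \sup_{(c_{t_n}, \pi_{t_n})_{n=i}^{N-1}} \mathrm{E}_t\left[ u(c_t)\,\Delta t + e^{-\delta \Delta t}\, \mathrm{E}_{t+\Delta t}\left[ \sum_{n=i+1}^{N-1} e^{-\delta (t_n - [t+\Delta t])} u(c_{t_n})\,\Delta t + e^{-\delta (T - [t+\Delta t])} \bar{u}(W_T) \right] \right] \\
&= \sup_{c_t, \pi_t} \mathrm{E}_t\left[ u(c_t)\,\Delta t + e^{-\delta \Delta t} \sup_{(c_{t_n}, \pi_{t_n})_{n=i+1}^{N-1}} \mathrm{E}_{t+\Delta t}\left[ \sum_{n=i+1}^{N-1} e^{-\delta (t_n - [t+\Delta t])} u(c_{t_n})\,\Delta t + e^{-\delta (T - [t+\Delta t])} \bar{u}(W_T) \right] \right].
\end{aligned}
\]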
Here, the first equality is simply due to the definition of indirect utility, the second equality comes from separating out the first term of the sum, the third equality is valid according to the law of iterated expectations, the fourth equality comes from separating out the discount term $e^{-\delta \Delta t}$, and the final equality is due to the fact that only the inner expectation depends on future consumption rates and portfolios. Noting that the inner supremum is by definition the indirect utility at time $t + \Delta t$, we arrive at
\[
J_t = \sup_{c_t, \pi_t} \mathrm{E}_t\left[ u(c_t)\,\Delta t + e^{-\delta \Delta t} J_{t+\Delta t} \right] = \sup_{c_t, \pi_t} \left\{ u(c_t)\,\Delta t + e^{-\delta \Delta t}\, \mathrm{E}_t\left[ J_{t+\Delta t} \right] \right\}. \tag{4.7}
\]
This equation is called the Bellman equation, and the indirect utility J is said to have the
dynamic programming property. The decision to be taken at time t is split up in two: (1) the
consumption and portfolio decision for the current period and (2) the consumption and portfolio
decisions for all future periods. We take the decision for the current period assuming that we will
make optimal decisions in all future periods. Note that this does not imply that the decision for
the current period is taken independently from future decisions. We take into account the effect that our current decision has on the maximum expected utility we can get from all future periods. The expectation $\mathrm{E}_t[J_{t+\Delta t}]$ will depend on our choice of $c_t$ and $\pi_t$.[1]
The dynamic programming property is the basis for a backward iterative solution procedure.
First, we choose $c_{t_{N-1}}$ and $\pi_{t_{N-1}}$ to maximize
\[
u(c_{t_{N-1}})\,\Delta t + e^{-\delta \Delta t}\, \mathrm{E}_{t_{N-1}}\left[ \bar{u}(W_T) \right],
\]
where
\[
W_T = \left( W_{t_{N-1}} + y_{t_{N-1}} \Delta t - c_{t_{N-1}} \Delta t \right) \left[ 1 + r_{t_{N-1}} \Delta t + \pi_{t_{N-1}}^\top \left( R_T - r_{t_{N-1}} \Delta t\, \mathbf{1} \right) \right].
\]
This is done for each possible state at time $t_{N-1}$ and gives us $J_{t_{N-1}}$. Then we choose $c_{t_{N-2}}$ and $\pi_{t_{N-2}}$ to maximize
\[
u(c_{t_{N-2}})\,\Delta t + e^{-\delta \Delta t}\, \mathrm{E}_{t_{N-2}}\left[ J_{t_{N-1}} \right],
\]
and so on until we reach time zero. Since we have to perform a maximization for each state of the world at every point in time, we have to make assumptions on the possible states at each point in time before we can implement the recursive procedure. The optimal decisions at any time are expected to depend on the wealth level of the agent at that date, but also on the value of other time-varying state variables that affect future returns on investment (e.g., the interest rate level) and future income levels. To be practically implementable only a few state variables can be incorporated. Also, these state variables must follow Markov processes so only the current values of the variables are relevant for the maximization at a given point in time.
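Suppose that the relevant information is captured by a one-dimensional Markov process $x = (x_t)$ so that the indirect utility at any time $t \in \{0, \Delta t, \dots, N\Delta t\}$ can be written as $J_t = J(W_t, x_t, t)$. Then the dynamic programming equation (4.7) becomes
\[
J(W_t, x_t, t) = \sup_{c_t, \pi_t} \left\{ u(c_t)\,\Delta t + e^{-\delta \Delta t}\, \mathrm{E}_t\left[ J(W_{t+\Delta t}, x_{t+\Delta t}, t + \Delta t) \right] \right\}, \quad t \in \mathcal{T} .
\]
Doing the maximization we have to remember that $W_{t+\Delta t}$ will be affected by the choice of $c_t$ and $\pi_t$. From our analysis of the wealth dynamics we have that
\[
W_{t+\Delta t} = (W_t + y_t \Delta t - c_t \Delta t)\, R^W_{t+\Delta t}, \qquad R^W_{t+\Delta t} = 1 + r_t \Delta t + \pi_t^\top \left( R_{t+\Delta t} - r_t \Delta t\, \mathbf{1} \right),
\]
cf. (4.2) and (4.3). In particular, we see that
\[
\frac{\partial W_{t+\Delta t}}{\partial c_t} = -R^W_{t+\Delta t}\,\Delta t, \qquad \frac{\partial W_{t+\Delta t}}{\partial \pi_t} = (W_t + y_t \Delta t - c_t \Delta t) \left( R_{t+\Delta t} - r_t \Delta t\, \mathbf{1} \right).
\]
The first-order condition for the maximization with respect to $c_t$ is
\[
u'(c_t)\,\Delta t + e^{-\delta \Delta t}\, \mathrm{E}_t\left[ J_W(W_{t+\Delta t}, x_{t+\Delta t}, t + \Delta t)\, \frac{\partial W_{t+\Delta t}}{\partial c_t} \right] = 0,
\]
which implies that
\[
u'(c_t) = e^{-\delta \Delta t}\, \mathrm{E}_t\left[ J_W(W_{t+\Delta t}, x_{t+\Delta t}, t + \Delta t)\, R^W_{t+\Delta t} \right]. \tag{4.8}
\]
The first-order condition for the maximization with respect to $\pi_t$ is
\[
\mathrm{E}_t\left[ J_W(W_{t+\Delta t}, x_{t+\Delta t}, t + \Delta t)\, \frac{\partial W_{t+\Delta t}}{\partial \pi_t} \right] = 0,
\]
which implies that
\[
\mathrm{E}_t\left[ J_W(W_{t+\Delta t}, x_{t+\Delta t}, t + \Delta t) \left( R_{t+\Delta t} - r_t \Delta t\, \mathbf{1} \right) \right] = 0. \tag{4.9}
\]
[1] Readers familiar with option pricing theory may note the similarity to the problem of determining the optimal exercise strategy of a Bermudan/American option. However, for that problem the decision to be taken is much simpler (exercise or not) than for the consumption/portfolio problem.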
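While we cannot generally solve for the optimal decisions, we can show an interesting and important result, the so-called envelope condition. First note that for the optimal choice $\hat{c}_t, \hat{\pi}_t$ we have that
\[
J(W_t, x_t, t) = u(\hat{c}_t)\,\Delta t + e^{-\delta \Delta t}\, \mathrm{E}_t\left[ J(\hat{W}_{t+\Delta t}, x_{t+\Delta t}, t + \Delta t) \right],
\]
where $\hat{W}_{t+\Delta t}$ is next period's wealth using $\hat{c}_t, \hat{\pi}_t$. Taking derivatives with respect to $W_t$ in this equation, and acknowledging that $\hat{c}_t$ and $\hat{\pi}_t$ will in general depend on $W_t$, we get
\[
J_W(W_t, x_t, t) = u'(\hat{c}_t)\, \frac{\partial \hat{c}_t}{\partial W_t}\,\Delta t + e^{-\delta \Delta t}\, \mathrm{E}_t\left[ J_W(\hat{W}_{t+\Delta t}, x_{t+\Delta t}, t + \Delta t)\, \frac{\partial \hat{W}_{t+\Delta t}}{\partial W_t} \right],
\]
where
\[
\frac{\partial \hat{W}_{t+\Delta t}}{\partial W_t} = R^W_{t+\Delta t} \left( 1 - \frac{\partial \hat{c}_t}{\partial W_t}\,\Delta t \right) + (W_t + y_t \Delta t - \hat{c}_t \Delta t) \left( \frac{\partial \hat{\pi}_t}{\partial W_t} \right)^{\!\top} \left( R_{t+\Delta t} - r_t \Delta t\, \mathbf{1} \right).
\]
Inserting this and rearranging terms, we get
\[
\begin{aligned}
J_W(W_t, x_t, t) ={}& e^{-\delta \Delta t}\, \mathrm{E}_t\left[ J_W(\hat{W}_{t+\Delta t}, x_{t+\Delta t}, t + \Delta t)\, R^W_{t+\Delta t} \right] \\
&+ \left( u'(\hat{c}_t) - e^{-\delta \Delta t}\, \mathrm{E}_t\left[ J_W(\hat{W}_{t+\Delta t}, x_{t+\Delta t}, t + \Delta t)\, R^W_{t+\Delta t} \right] \right) \frac{\partial \hat{c}_t}{\partial W_t}\,\Delta t \\
&+ (W_t + y_t \Delta t - \hat{c}_t \Delta t)\, e^{-\delta \Delta t} \left( \frac{\partial \hat{\pi}_t}{\partial W_t} \right)^{\!\top} \mathrm{E}_t\left[ J_W(\hat{W}_{t+\Delta t}, x_{t+\Delta t}, t + \Delta t) \left( R_{t+\Delta t} - r_t \Delta t\, \mathbf{1} \right) \right].
\end{aligned}
\]
On the right-hand side the last two terms are zero due to the first-order conditions (4.8) and (4.9) so only the leading term remains, i.e.,
\[
J_W(W_t, x_t, t) = e^{-\delta \Delta t}\, \mathrm{E}_t\left[ J_W(\hat{W}_{t+\Delta t}, x_{t+\Delta t}, t + \Delta t)\, R^W_{t+\Delta t} \right].
\]
Combining this with (4.8) we obtain
\[
u'(c_t) = J_W(W_t, x_t, t), \tag{4.10}
\]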
which is the so-called envelope condition. As we will see, the condition also holds in the
continuous-time models. The intuition of the envelope condition is that the optimal decision must
be such that the marginal utility from consuming a bit more must be identical to the marginal utility
from investing that bit in an optimal way. If that were not the case, the allocation of wealth between consumption and investment should be reconsidered. For example, if $u'(c_t) > J_W(W_t, x_t, t)$, the consumption $c_t$ should be increased and the amount invested should be decreased.
Under some simplifying assumptions on the precise form of the utility functions $u$ and $\bar{u}$ and on the dynamics of asset returns and income, the backward iterative procedure yields an explicit solution to the maximization problem in the form of the optimal (possibly state- and time-dependent) consumption rate and portfolio process (and also the indirect utility of wealth $J_t$). Since we can obtain similar (and often clearer) results under similar assumptions in the more elegant and realistic
continuous-time setting, we will not go into these discrete-time examples.
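Although the notes do not pursue discrete-time examples, a small numerical sketch may help fix ideas about the first step of the backward procedure and the envelope condition (4.10). The sketch below solves the last-period problem by nested one-dimensional searches under assumptions that are not from the text: logarithmic utility for both $u$ and $\bar{u}$, no labor income, one risky asset with two equally likely returns, and a portfolio weight restricted to $[0, 1]$ so that wealth stays positive. All names and numbers are hypothetical.

```python
import math

DT = 1.0       # period length (years)
DELTA = 0.03   # time preference rate
R_F = 0.02     # risk-free rate over the last period
# (probability, risky return) pairs; purely illustrative numbers
STATES = [(0.5, 0.20), (0.5, -0.12)]
BETA = math.exp(-DELTA * DT)

def ternary_max(f, lo, hi, iters=100):
    """Maximize a concave function f on [lo, hi]; returns (argmax, max)."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if f(m1) < f(m2):
            lo = m1
        else:
            hi = m2
    x = 0.5 * (lo + hi)
    return x, f(x)

def objective(W, c, pi):
    """u(c) dt + e^{-delta dt} E[u_bar(W_T)] with log utility and no income."""
    invested = W - c * DT
    expected = sum(p * math.log(invested * (1.0 + R_F * DT + pi * (R - R_F * DT)))
                   for p, R in STATES)
    return math.log(c) * DT + BETA * expected

def solve_last_period(W):
    """Return (c*, pi*, J) at the last decision date for wealth W."""
    def best_over_pi(c):
        return ternary_max(lambda pi: objective(W, c, pi), 0.0, 1.0)
    c_star, J = ternary_max(lambda c: best_over_pi(c)[1], 1e-6, W / DT - 1e-6)
    pi_star = best_over_pi(c_star)[0]
    return c_star, pi_star, J

W = 100.0
c_star, pi_star, J = solve_last_period(W)
h = 1e-2
# Central finite difference for J_W at the last decision date
J_W = (solve_last_period(W + h)[2] - solve_last_period(W - h)[2]) / (2 * h)
print(c_star, pi_star, 1.0 / c_star, J_W)
```

With log utility the marginal utility of consumption is $1/c$, so the last two printed numbers should agree up to numerical error, in line with (4.10). A full backward induction would repeat this step on a grid of wealth levels (and state variables) for each earlier date, which this sketch does not attempt.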
CHAPTER 5
Introduction to continuous-time modeling
5.1 Introduction
An introduction to stochastic processes and stochastic calculus is given in Appendix B...
5.2 The basic continuous-time setting
The basic elements of mainstream continuous-time models can be seen as the limit of the multi-period discrete-time model elements. The basis is a probability space $(\Omega, \mathcal{F}, \mathbb{P})$ with an associated filtration $\mathbf{F} = (\mathcal{F}_t)_{t \in [0,T]}$ which is the formal model of the evolution of the relevant uncertainty for the investor.
The agent now has to choose a continuous-time process of consumption rates $c = (c_t)_{t \in [0,T]}$ and a continuous-time portfolio process. The portfolio process can be represented by $\theta = (\theta_t)_{t \in [0,T]}$, where $\theta_t$ is the $d$-dimensional vector of amounts invested at time $t$ in the $d$ risky assets, or (at least when wealth is non-zero) by $\pi = (\pi_t)_{t \in [0,T]}$, where $\pi_t$ is the $d$-dimensional vector of fractions of wealth invested at time $t$ in the $d$ risky assets. The remaining financial wealth is invested in the locally risk-free asset so $\theta^0_t = W_t - \theta_t^\top \mathbf{1} = W_t - \sum_{i=1}^{d} \theta_{it}$ and $\pi^0_t = 1 - \pi_t^\top \mathbf{1}$. We assume that there is a single consumption good in the economy and this good is used as a numeraire so that all prices are measured in units of this consumption good, i.e., in real terms. We will always require that $c_t \geq 0$ with probability one. We focus on unconstrained investors so that there are no constraints on the values $\theta_t$ or $\pi_t$ may take, i.e., they can take any value in $\mathbb{R}^d$; see references in Section 18.1 to problems with constraints on the portfolios, e.g., short-selling constraints or portfolio mix constraints. The stochastic variables $c_t$ and $\theta_t$ (or $\pi_t$) must be $\mathcal{F}_t$-measurable, i.e., they can only depend on information available at time $t$. In other words, the processes $c$ and $\theta$ (or $\pi$) are adapted. Other technical requirements should be added.[1] A consumption and investment strategy must also satisfy that the wealth process induced by the strategy always stays above a lower bound, say $K$, where $K \in \mathbb{R}$. This rules out doubling strategies, cf. the discussion in Duffie (2001, Ch. 6). In fact, we will typically require that wealth stays non-negative at all times, corresponding to $K = 0$. This is a natural requirement, at least for the case where the investor does not receive a minimum income from non-financial sources (labor). The set of all consumption and investment strategies that satisfy all these requirements on the interval $[t, T]$ is denoted by $\mathcal{A}_t$.
[1] The consumption process $c$ must be an $L^1$-process, i.e., $\int_0^T \|c_t\|\,dt < \infty$ with probability one. The portfolio strategy $\theta$ must satisfy that $\theta^\top \mu$ is an $L^1$-process and that $\theta^\top \sigma$ is an $L^2$-process, i.e., that $\int_0^T \|\theta_t^\top \sigma_t\|^2\,dt < \infty$ with probability one. Finally, $\theta$ must be a progressively measurable process which generally involves a bit more
Preferences: The objective is to maximize the expected life-time utility which is assumed to be on the additively time-separable form
\[
\mathrm{E}\left[ \int_0^T e^{-\delta t} u(c_t)\,dt + e^{-\delta T} \bar{u}(W_T) \right], \tag{5.1}
\]
where $u$ and $\bar{u}$ are increasing and concave von Neumann-Morgenstern utility functions. We will assume that $u$ and $\bar{u}$ are twice continuously differentiable on their domain. We will define the indirect utility process $J = (J_t)$ as
\[
J_t = \sup_{(c, \theta) \in \mathcal{A}_t} \mathrm{E}_t\left[ \int_t^T e^{-\delta (s - t)} u(c_s)\,ds + e^{-\delta (T - t)} \bar{u}(W_T) \right]. \tag{5.2}
\]
An optimal consumption and investment strategy $(c^*, \theta^*)$ has the property that it provides at least as high an expected life-time utility as any other feasible strategy. In particular,
\[
J_0 = \mathrm{E}\left[ \int_0^T e^{-\delta t} u(c^*_t)\,dt + e^{-\delta T} \bar{u}(W^*_T) \right],
\]
where $W^*_T$ is the terminal wealth level that follows from the strategy $(c^*, \theta^*)$. In other words, when an optimal strategy exists the supremum in the definition of $J$ is attained. Of course, $J_0$ will depend on the initial wealth $W_0$ of the investor. We shall assume that $J_0 < \infty$ for all $W_0 < \infty$. It can be shown that $J_0$ is an increasing and concave function of initial wealth $W_0$. See Exercise 5.1 at the end of the chapter.
Dynamics of prices and wealth: When the investor is about to choose consumption and
investment strategies she has to deal with a number of variables that can evolve stochastically over
time such as:
• the (locally) risk-free rate $r_t$ (i.e., the short-term interest rate),
• the prices, the expected rates of return, and the variance-covariance matrix of rates of return on the risky assets,
• the expected rate of change and variation in her income rate,
• covariances or correlations between all these variables.
Of course, in a fuller model we should also include uncertainty, e.g., about the time of death of the investor, relative prices of different consumption goods, etc., but we ignore such issues at this point.
[1, continued] than just being adapted.
We shall assume that all exogenous shocks to these variables can be represented by standard Brownian motions. A direct consequence is that we do not allow for any jumps in prices, except for points in time where the asset provides its owner with a lump-sum payment, e.g., a dividend payment of a stock or a coupon payment of a bond.[2] For simplicity, we assume that the assets provide no payments in the life of the investor and that the vector of risky asset prices $P_t$ follows a stochastic process of the form
\[
dP_t = \mathrm{diag}(P_t) \left[ \mu_t\,dt + \sigma_t\,dz_t \right], \tag{5.3}
\]
where $z = (z_1, \dots, z_d)^\top$ is a $d$-dimensional standard Brownian motion, i.e., a vector of $d$ independent one-dimensional standard Brownian motions. The term $\mathrm{diag}(P_t)$ denotes the $(d \times d)$-matrix with the vector $P_t$ along the main diagonal and zeros off the diagonal. We can write this componentwise as
\[
dP_{it} = P_{it} \left[ \mu_{it}\,dt + \sum_{j=1}^{d} \sigma_{ijt}\,dz_{jt} \right], \quad i = 1, \dots, d .
\]
The instantaneous rate of return on asset $i$ is given by $dP_{it}/P_{it}$. The $d$-vector $\mu_t = (\mu_{1t}, \dots, \mu_{dt})^\top$ contains the expected rates of return and the $(d \times d)$-matrix $\sigma_t = (\sigma_{ijt})_{i,j=1}^{d}$ measures the sensitivities of the risky asset prices with respect to exogenous shocks so that the $(d \times d)$-matrix $\sigma_t \sigma_t^\top$ contains the variance and covariance rates of instantaneous rates of return. We assume that $\sigma_t$ is non-singular. Of course, $\mu$ and $\sigma$ must be adapted to the information filtration $\mathbf{F} = (\mathcal{F}_t)$.[3] This way of modeling price dynamics in continuous time can be seen as the limit of (4.4) when $\varepsilon_{t+\Delta t}$ in that expression is assumed to be multivariate standard normally distributed.
Taking the limit of the wealth dynamics in (4.5) we get
\[
dW_t = \left[ \theta^0_t r_t + \theta_t^\top \mu_t + y_t - c_t \right] dt + \theta_t^\top \sigma_t\,dz_t .
\]
The amount invested in the (locally) risk-free asset can be expressed as total wealth minus the amounts invested in the risky assets,
\[
\theta^0_t = W_t - \theta_t^\top \mathbf{1} .
\]
Substituting this into the wealth dynamics above, we obtain
\[
dW_t = \left[ r_t W_t + \theta_t^\top (\mu_t - r_t \mathbf{1}) + y_t - c_t \right] dt + \theta_t^\top \sigma_t\,dz_t .
\]
Since $\sigma_t$ is assumed to be a non-singular $(d \times d)$-matrix, we can define the $d$-dimensional process $\lambda = (\lambda_t)$ by
\[
\lambda_t = \sigma_t^{-1} (\mu_t - r_t \mathbf{1}),
\]
so that
\[
\mu_t = r_t \mathbf{1} + \sigma_t \lambda_t ,
\]
i.e., $\mu_{it} = r_t + \sum_{j=1}^{d} \sigma_{ijt} \lambda_{jt}$. $\lambda$ has the interpretation of a vector of market prices of risk (corresponding to the shock process $z$) since it measures the excess rate of return relative to the standard deviation. For example, if asset $i$ is only sensitive to the first component of the exogenous shock $z_t$, it will have $\sigma_{i2t} = \dots = \sigma_{idt} = 0$ and hence an expected rate of return of $\mu_{it} = r_t + \sigma_{i1t} \lambda_{1t}$ so that $\lambda_{1t} = (\mu_{it} - r_t)/\sigma_{i1t}$, where $\sigma_{i1t}$ is identical to the volatility of the asset. We can now rewrite the price dynamics as
\[
dP_t = \mathrm{diag}(P_t) \left[ \left( r_t \mathbf{1} + \sigma_t \lambda_t \right) dt + \sigma_t\,dz_t \right].
\]
[2] See, e.g., Bardhan and Chao (1995), Wu (2003), and Jeanblanc-Picque and Pontier (1990) for utility maximization problems involving jump processes.
[3] Further technical requirements should be imposed, e.g., that the processes $r$, $\mu$, and $\sigma$ are progressively measurable, that $\mathrm{diag}(P_t)\mu_t$ is an $L^1$-process, and that $\mathrm{diag}(P_t)\sigma_t$ is an $L^2$-process; cf. footnote 1.
The wealth dynamics can be rewritten as
\[
dW_t = \left[ r_t W_t + \theta_t^\top \sigma_t \lambda_t + y_t - c_t \right] dt + \theta_t^\top \sigma_t\,dz_t . \tag{5.4}
\]
In terms of the portfolio weights $\pi$, the wealth dynamics can be written as
\[
dW_t = W_t \left[ r_t + \pi_t^\top \sigma_t \lambda_t \right] dt + \left[ y_t - c_t \right] dt + W_t \pi_t^\top \sigma_t\,dz_t . \tag{5.5}
\]
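The market price of risk vector defined above is simply the solution of a linear system. As a small illustration for $d = 2$, the sketch below solves $\sigma \lambda = \mu - r\mathbf{1}$ with Cramer's rule; the function name and all parameter values are hypothetical. The first asset is chosen to load only on the first shock, so its market price of risk should equal its excess expected return divided by its volatility.

```python
def market_price_of_risk(mu, sigma, r):
    """Solve sigma * lam = mu - r*1 for a non-singular 2x2 matrix sigma (Cramer's rule)."""
    (a, b), (c, d) = sigma
    det = a * d - b * c
    if det == 0.0:
        raise ValueError("sigma must be non-singular")
    e1, e2 = mu[0] - r, mu[1] - r  # expected excess rates of return
    return ((d * e1 - b * e2) / det, (a * e2 - c * e1) / det)

mu = (0.08, 0.05)          # expected rates of return (illustrative)
sigma = ((0.20, 0.00),     # asset 1 is sensitive to the first shock only
         (0.06, 0.08))
r = 0.02
lam = market_price_of_risk(mu, sigma, r)

# Recover mu from r*1 + sigma*lam as a check
mu_check = tuple(r + row[0] * lam[0] + row[1] * lam[1] for row in sigma)
print(lam, mu_check)
```

For larger $d$ one would use a general linear solver instead of Cramer's rule; the two-asset case is used here only to keep the example self-contained.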
Solution techniques: There are two major questions to be answered: (i) Under which assump-
tions do optimal strategies exist, and (ii) How can optimal strategies (and the indirect utility
function) be computed. In these notes we will focus on the second question. There are two major
approaches for solving this type of optimization problems: the dynamic programming approach
(also known as the stochastic control approach) and the martingale approach. In the following sec-
tion we consider the dynamic programming approach, while the martingale approach is introduced
in Section 8.1.
5.3 Dynamic programming in continuous-time models
In Section 4.3 we introduced the dynamic programming approach in a discrete-time multi-period
setting. Apparently, Merton (1969, 1971) was the rst to apply the dynamic programming approach
to a continuous-time optimal consumption/investment problem. The dynamic programming ap-
proach requires that a (possibly multi-dimensional) state variable exists so that this variable follows
a Markov process and all relevant objects can be written as functions of this state variable and
time. The theory of dynamic programming contains some results on the existence of optimal
strategies, but they often require that all admissible strategies take values in a compact set, an
assumption which is certainly unsuitable for most portfolio problems. Therefore, verification
theorems are typically applied. This involves solving the so-called Hamilton-Jacobi-Bellman (HJB)
equation associated with the control problem. Under some technical conditions the solution to the
HJB equation will give us both the optimal strategies and the indirect utility function. The HJB
equation is a fully non-linear second-order partial differential equation. Despite the complexity of
the equation, explicit solutions have been found in many interesting settings, as we shall see in the
following chapters.
Surely we must include the wealth $W_t$ of the agent as a state variable and then look for a process $x = (x_t)$, possibly multi-dimensional, such that the pair $(W_t, x_t)$ captures all relevant information for the agent's decision at time $t$. Basically, the pair of stochastic processes $(W, x)$ must constitute a Markov system, for any given consumption-portfolio choice $(c, \pi)$. If $r$, $\mu$, $\sigma$, and $y$ are all constant (or at least deterministic functions of time), then the wealth process is by itself a Markov process and we need not add any $x$. We will refer to this situation as the case of constant investment opportunities. We study portfolio and consumption choice under that assumption in detail in Chapter 6. However, we do know that for example the short-term interest rate varies stochastically over time. If $r = (r_t)$ is in itself a Markov process, we should include $r$ as a state variable, i.e., one of the elements of $x$ should be $r$. Maybe multiple state variables are needed to capture the interest rate dynamics. Then these variables should be included in $x$. We will study examples of such so-called stochastic investment opportunities in Chapters 7-13.
For simplicity we assume in the following that the agent receives no labor income, i.e., $y_t \equiv 0$. We assume further that there is a stochastically evolving state variable $x = (x_t)$ that captures the variations in $r$, $\mu$, and $\sigma$ over time, i.e.,
\[
r_t = r(x_t), \qquad \mu_t = \mu(x_t, t), \qquad \sigma_t = \sigma(x_t, t),
\]
where $r$, $\mu$, and $\sigma$ now (also) denote sufficiently well-behaved functions. The variations in the state variable $x$ determine the future expected returns and covariance structure in the financial market. The market price of risk is also given by the state variable:
\[
\lambda(x_t) = \sigma(x_t, t)^{-1} \left( \mu(x_t, t) - r(x_t) \mathbf{1} \right).
\]
Note that we have assumed that the short-term interest rate $r_t$ and the market price of risk vector $\lambda_t$ do not depend on calendar time directly. The fluctuations in $r_t$ and $\lambda_t$ over time are presumably not due to the mere passage of time, but rather due to variations in some more fundamental economic variables. In contrast, the expected rates of returns and the price sensitivities of some assets will depend directly on time, e.g., the volatility and the expected rate of return on a bond will depend on the time-to-maturity of the bond and therefore on calendar time.
For simplicity we will first assume that the state variable is one-dimensional and write it as $x$. Afterwards we turn to the case of multi-dimensional state variables. The wealth process for a given portfolio and consumption strategy now evolves as
\[
dW_t = W_t \left[ r(x_t) + \pi_t^\top \sigma(x_t, t) \lambda(x_t) \right] dt - c_t\,dt + W_t \pi_t^\top \sigma(x_t, t)\,dz_t .
\]
The state variable $x$ is assumed to follow a one-dimensional diffusion process
\[
dx_t = m(x_t)\,dt + v(x_t)^\top dz_t + \hat{v}(x_t)\,d\hat{z}_t ,
\]
where $\hat{z} = (\hat{z}_t)$ is a one-dimensional standard Brownian motion independent of $z = (z_t)$. Hence, if $\hat{v}(x_t) \neq 0$, there is an exogenous shock to the state variable that cannot be hedged by investments in the financial market. In other words, the financial market is incomplete. Conversely, if $\hat{v}(x_t)$ is identically equal to zero, the financial market is complete. We shall consider examples of both cases later. The $d$-vector $v(x_t)$ represents the sensitivity of the state variable with respect to the exogenous shocks to market prices. Note that the $d$-vector $\sigma(x, t) v(x)$ is the vector of instantaneous covariance rates between the returns on the risky assets and the state variable.
The pair $(W_t, x_t)$ forms a two-dimensional Markov diffusion process that contains all the information the investor needs for making her consumption/investment decision. The indirect utility at time $t$ is therefore $J_t = J(W_t, x_t, t)$, where the function $J$ is given by
\[
J(W, x, t) = \sup_{(c_s, \pi_s)_{s \in [t,T]}} \mathrm{E}_{W,x,t}\left[ \int_t^T e^{-\delta (s - t)} u(c_s)\,ds + e^{-\delta (T - t)} \bar{u}(W_T) \right],
\]
where $\mathrm{E}_{W,x,t}[\,\cdot\,]$ denotes the expectation given that $W_t = W$ and $x_t = x$. In a discrete-time approximation of this setting, it follows from (4.7) that
\[
J(W, x, t) = \sup_{c_t \geq 0,\ \pi_t \in \mathbb{R}^d} \left\{ u(c_t)\,\Delta t + e^{-\delta \Delta t}\, \mathrm{E}_{W,x,t}\left[ J(W_{t+\Delta t}, x_{t+\Delta t}, t + \Delta t) \right] \right\},
\]
where c
t
and
t
is held xed over the interval [t, t+t). If we multiply by e
t
, subtract J(W, x, t),
and then divide by t, we get
e
t
1
t
J(W, x, t) = sup
c
t
0,
t
R
d

e
t
u(c
t
) +
1
t
E
W,x,t
[J(W
t+t
, x
t+t
, t +t) J(W, x, t)]
_
.
(5.6)
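When we let $\Delta t \to 0$, we have that (by l'Hôpital's rule)
$$\frac{e^{\delta\Delta t}-1}{\Delta t} \to \delta,$$
and that (by definition of the drift of a process)
$$\frac{1}{\Delta t}\,\mathrm{E}_{W,x,t}\left[J(W_{t+\Delta t}, x_{t+\Delta t}, t+\Delta t) - J(W,x,t)\right]$$
will approach the drift of $J$ at time $t$, which according to Itô's Lemma is given by
$$\frac{\partial J}{\partial t}(W,x,t) + J_W(W,x,t)\left(W\left[r(x) + \pi_t^\top\sigma(x,t)\lambda(x)\right] - c_t\right) + \frac{1}{2}J_{WW}(W,x,t)W^2\pi_t^\top\sigma(x,t)\sigma(x,t)^\top\pi_t$$
$$\quad + J_x(W,x,t)m(x) + \frac{1}{2}J_{xx}(W,x,t)\left(v(x)^\top v(x) + \hat v(x)^2\right) + J_{Wx}(W,x,t)W\pi_t^\top\sigma(x,t)v(x).$$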
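The limit of (5.6) is therefore
$$\delta J(W,x,t) = \sup_{c\ge 0,\ \pi\in\mathbb{R}^d}\Big\{u(c) + \frac{\partial J}{\partial t}(W,x,t) + J_W(W,x,t)\left(W\left[r(x) + \pi^\top\sigma(x,t)\lambda(x)\right] - c\right)$$
$$\quad + \frac{1}{2}J_{WW}(W,x,t)W^2\pi^\top\sigma(x,t)\sigma(x,t)^\top\pi + J_x(W,x,t)m(x) + \frac{1}{2}J_{xx}(W,x,t)\left(v(x)^\top v(x) + \hat v(x)^2\right)$$
$$\quad + J_{Wx}(W,x,t)W\pi^\top\sigma(x,t)v(x)\Big\}. \tag{5.7}$$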
This is called the Hamilton-Jacobi-Bellman (HJB) equation corresponding to the dynamic optimization problem. Subscripts on $J$ denote partial derivatives; however, we write the partial derivative with respect to time as $\partial J/\partial t$ to distinguish it from the value $J_t$ of the indirect utility process. The HJB equation involves the supremum over the feasible time $t$ consumption rates and portfolios (not the supremum over the entire processes!) and is therefore a highly non-linear second-order partial differential equation.
Note that we can split up the maximization over $c$ and $\pi$ into separate maximization terms and rewrite the HJB equation (5.7) as
$$\delta J(W,x,t) = \mathcal{L}^c J(W,x,t) + \mathcal{L}^\pi J(W,x,t) + \frac{\partial J}{\partial t}(W,x,t) + r(x)WJ_W(W,x,t) + J_x(W,x,t)m(x) + \frac{1}{2}J_{xx}(W,x,t)\left(v(x)^\top v(x) + \hat v(x)^2\right), \tag{5.8}$$
where
$$\mathcal{L}^c J(W,x,t) = \sup_{c\ge 0}\left\{u(c) - cJ_W(W,x,t)\right\},$$
$$\mathcal{L}^\pi J(W,x,t) = \sup_{\pi\in\mathbb{R}^d}\left\{WJ_W(W,x,t)\pi^\top\sigma(x,t)\lambda(x) + \frac{1}{2}J_{WW}(W,x,t)W^2\pi^\top\sigma(x,t)\sigma(x,t)^\top\pi + J_{Wx}(W,x,t)W\pi^\top\sigma(x,t)v(x)\right\}.$$
From the analysis above we will expect that the indirect utility function $J(W,x,t)$ solves the HJB equation for all possible values of $W$ and $x$ and all $t \in [0,T)$ and that it satisfies the terminal condition
$$J(W,x,T) = \bar u(W) \tag{5.9}$$
for all $W$ and $x$. In the mathematical literature on stochastic control problems like the one we are looking at, there are a few results concerning when a solution to the HJB equation exists. However, these results are only valid under restrictive conditions, e.g., that the controls ($c$ and $\pi$ in our case) can only take values in a compact set. This is generally not true for the consumption/investment problems. We are mostly interested in finding a solution. Here, we can apply a verification result. Let us formulate the result for the problem with a one-dimensional state variable:
Theorem 5.1. Assume that $V(W,x,t)$ solves the HJB equation (5.8) with the terminal condition (5.9) and satisfies some technical conditions. Let $C(W,x,t)$ and $\Pi(W,x,t)$ be given by
$$C(W,x,t) = \arg\max_{c\ge 0}\left\{u(c) - cV_W(W,x,t)\right\},$$
$$\Pi(W,x,t) = \arg\max_{\pi\in\mathbb{R}^d}\left\{WV_W(W,x,t)\pi^\top\sigma(x,t)\lambda(x) + \frac{1}{2}V_{WW}(W,x,t)W^2\pi^\top\sigma(x,t)\sigma(x,t)^\top\pi + V_{Wx}(W,x,t)W\pi^\top\sigma(x,t)v(x)\right\}.$$
If the strategies
$$c^*_t = C(W^*_t, x_t, t),\qquad \pi^*_t = \Pi(W^*_t, x_t, t),$$
where $(W^*_t)$ is the wealth process that $(c^*,\pi^*)$ induces, are feasible (i.e., $(c^*,\pi^*) \in \mathcal{A}_0$), then they are optimal, and $V$ equals the indirect utility function, i.e.
$$J(W,x,t) = V(W,x,t) = \mathrm{E}_{W,x,t}\left[\int_t^T e^{-\delta(s-t)}u(c^*_s)\,ds + e^{-\delta(T-t)}\bar u(W^*_T)\right].$$
The verification theorem suggests a two-step procedure. First, solve the maximization problem embedded in the HJB equation, giving a candidate for the optimal strategies expressed in terms of the yet unknown indirect utility function and its derivatives. Second, substitute the candidate for the optimal strategies into the HJB equation, ignore the sup-operator, and solve the resulting partial differential equation for $J(W,x,t)$. Such a solution will then also give the candidate optimal strategies in terms of $W$, $x$, and $t$. However, there is really also a third step, namely to check that the assumptions made along the way and the technical conditions needed for the verification theorem to apply are all satisfied. The standard version of the verification theorem is precisely stated and proved in Øksendal (2003) or Fleming and Soner (1993). The technical conditions of the standard version are not always satisfied in concrete consumption-portfolio problems, but at least for some concrete problems a version with an appropriate set of conditions can be found; see, e.g., Korn and Kraft (2001) and Kraft (2009). In the current version of these lecture notes, we will generally ignore these technicalities and trust that a suitable verification theorem applies.
Suppose now that the state variable $x$ is $k$-dimensional and follows the diffusion process
$$dx_t = m(x_t)\,dt + v(x_t)^\top dz_t + \hat v(x_t)\,d\hat z_t,$$
where $m$ now is a $k$-vector valued function, $v$ is a $(d \times k)$-matrix valued function (see footnote 4), $\hat v$ is a $(k \times k)$-matrix valued function, and $\hat z$ is a $k$-dimensional standard Brownian motion independent of $z$. The basic derivation is the same as with a one-dimensional state variable, but the drift of $J$ now becomes more complicated and so does the HJB equation:
$$\delta J(W,x,t) = \mathcal{L}^c J(W,x,t) + \mathcal{L}^\pi J(W,x,t) + \frac{\partial J}{\partial t}(W,x,t) + r(x)WJ_W(W,x,t) + J_x(W,x,t)^\top m(x) + \frac{1}{2}\operatorname{tr}\left(J_{xx}(W,x,t)\left[v(x)^\top v(x) + \hat v(x)\hat v(x)^\top\right]\right),$$
where
$$\mathcal{L}^c J(W,x,t) = \sup_{c\ge 0}\left\{u(c) - cJ_W(W,x,t)\right\},$$
$$\mathcal{L}^\pi J(W,x,t) = \sup_{\pi\in\mathbb{R}^d}\left\{WJ_W(W,x,t)\pi^\top\sigma(x,t)\lambda(x) + \frac{1}{2}J_{WW}(W,x,t)W^2\pi^\top\sigma(x,t)\sigma(x,t)^\top\pi + W\pi^\top\sigma(x,t)v(x)J_{Wx}(W,x,t)\right\}.$$
Now, $J_x$ and $J_{Wx}$ are $k$-vectors and $J_{xx}$ is a $(k \times k)$-matrix. The notation $\operatorname{tr}(A)$ stands for the trace of the square matrix $A = (A_{ij})$, which is defined as the sum of the diagonal elements, $\operatorname{tr}(A) = \sum_i A_{ii}$.
In the special case of constant investment opportunities, the indirect utility is given by $J_t = J(W_t, t)$ and the corresponding HJB equation is simply
$$\delta J(W,t) = \mathcal{L}^c J(W,t) + \mathcal{L}^\pi J(W,t) + \frac{\partial J}{\partial t}(W,t) + rWJ_W(W,t) \tag{5.10}$$
with
$$\mathcal{L}^c J(W,t) = \sup_{c\ge 0}\left\{u(c) - cJ_W(W,t)\right\},$$
$$\mathcal{L}^\pi J(W,t) = \sup_{\pi\in\mathbb{R}^d}\left\{WJ_W(W,t)\pi^\top\sigma\lambda + \frac{1}{2}J_{WW}(W,t)W^2\pi^\top\sigma\sigma^\top\pi\right\}.$$
The terminal condition is
$$J(W,T) = \bar u(W).$$
In the next chapter we study this case in detail.
5.4 Loss from suboptimal strategies
The utility induced by the application of any given admissible strategy $(c,\pi)$ from time $t$ on is
$$V^{c,\pi}_t = \mathrm{E}_t\left[\int_t^T e^{-\delta(s-t)}u(c_s)\,ds + e^{-\delta(T-t)}\bar u(W^{c,\pi}_T)\right],$$
where $W^{c,\pi}_T$ is the terminal wealth generated by the strategy $(c,\pi)$. Suppose the dynamics of the investment opportunities is captured by a one-dimensional diffusion $x = (x_t)$ and that the strategy at any time $s$ at most depends on wealth $W_s$, on time, and on $x_s$. Then $V^{c,\pi}_t = V^{c,\pi}(W_t, x_t, t)$. By definition, the application of a suboptimal strategy $(c,\pi)$ leads to a lower level of utility, i.e.,
$$V^{c,\pi}(W_t, x_t, t) \le J(W_t, x_t, t) \equiv V^{c^*,\pi^*}(W_t, x_t, t).$$
4. In this multi-dimensional setting it would be natural to write the $dz_t$-term in the state dynamics on the form $v(x_t)\,dz_t$, but this would conflict with our notation in the one-dimensional case, where we have used the term $v(x_t)^\top dz_t$.
If we want to measure how bad the strategy $(c,\pi)$ is compared to the optimal strategy, we cannot just use the distance in utility $J(W_t, x_t, t) - V^{c,\pi}(W_t, x_t, t)$ since that distance is not stable to positive affine transformations of the utility function. A better measure is the wealth-equivalent percentage loss $\ell_t$ defined implicitly by (see footnote 5)
$$V^{c,\pi}(W_t, x_t, t) = J(W_t[1 - \ell_t], x_t, t). \tag{5.11}$$
We can interpret $\ell_t$ as the percentage of time $t$ wealth that the individual is willing to sacrifice in order to be able to apply the optimal strategy $(c^*,\pi^*)$ instead of the strategy $(c,\pi)$ from time $t$ on. Of course, $\ell_t$ depends on $(c,\pi)$ and generally also on $W_t$, $x_t$, and $t$, i.e., $\ell_t = \ell^{c,\pi}(W_t, x_t, t)$. An equivalent measure would be the percentage of extra wealth, $\tilde\ell_t$, needed to obtain the same utility with the suboptimal strategy $(c,\pi)$ as with the optimal strategy, i.e.,
$$V^{c,\pi}(W_t[1 + \tilde\ell_t], x_t, t) = J(W_t, x_t, t).$$
Again, $\tilde\ell_t = \tilde\ell^{c,\pi}(W_t, x_t, t)$.
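To make the implicit definition in (5.11) concrete, the following Python sketch computes the wealth-equivalent loss in a special case that is only treated in the next chapter: a CRRA investor with utility from terminal wealth only (no discounting), a single risky asset, and constant $r$, $\mu$, and $\sigma$. The closed-form expected utility of a constant-weight strategy used below is a standard lognormal calculation and is an assumption of this sketch, not a formula stated in this section; the function names and parameter values are hypothetical.

```python
import math

def ce_growth(pi, gamma, r, mu, sigma):
    """Certainty-equivalent growth rate of wealth for a constant risky weight pi."""
    return r + pi * (mu - r) - 0.5 * gamma * (pi * sigma) ** 2

def expected_utility(W, tau, pi, gamma, r, mu, sigma):
    """E[W_T^(1-gamma)/(1-gamma)] when a constant weight pi is held for tau years.

    Assumes gamma != 1 and a geometric Brownian motion stock price.
    """
    growth = ce_growth(pi, gamma, r, mu, sigma)
    return W ** (1.0 - gamma) / (1.0 - gamma) * math.exp((1.0 - gamma) * growth * tau)

def wealth_equivalent_loss(pi, tau, gamma, r, mu, sigma, W=1.0):
    """Solve V(W) = J(W * (1 - loss)) for loss by bisection, mirroring (5.11)."""
    pi_opt = (mu - r) / (gamma * sigma ** 2)   # optimal constant weight in this case
    target = expected_utility(W, tau, pi, gamma, r, mu, sigma)
    lo, hi = 0.0, 1.0 - 1e-12
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        # J is increasing in wealth: if J is still above the target, the loss is larger.
        if expected_utility(W * (1.0 - mid), tau, pi_opt, gamma, r, mu, sigma) > target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Hypothetical example: holding no stocks at all over a 10-year horizon.
loss = wealth_equivalent_loss(pi=0.0, tau=10.0, gamma=4.0, r=0.04, mu=0.09, sigma=0.20)
print(f"wealth-equivalent loss: {loss:.4f}")
```

In this special case the bisection is not strictly needed, because the loss equals one minus the exponential of minus the difference in certainty-equivalent growth rates times the horizon. The numerical route is shown because it follows the implicit definition directly and carries over to settings without closed forms.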
5.5 Exercises
Exercise 5.1. Show that the indirect utility, $J_t$, defined in (5.2) is an increasing and concave function of wealth, $W_t$. Hint: To show concavity, let $(c_1, \theta_1)$ be the optimal strategy with initial wealth $W_1$ and let $(c_2, \theta_2)$ be the optimal strategy with initial wealth $W_2$. Here, $c_i$ is the consumption rate and $\theta_i$ the vector of dollar amounts invested in the risky assets. The corresponding terminal wealth levels are denoted $W_{1T}$ and $W_{2T}$, respectively. For any $\alpha \in (0,1)$, you should first show that the strategy $(\alpha c_1 + (1-\alpha)c_2, \alpha\theta_1 + (1-\alpha)\theta_2)$ is a feasible strategy with initial wealth $\alpha W_{1t} + (1-\alpha)W_{2t}$ that results in the terminal wealth $\alpha W_{1T} + (1-\alpha)W_{2T}$. Then apply that $u$ and $\bar u$ are assumed concave.
5. An early example of calculations of the monetary costs associated with suboptimal intertemporal behavior was given by Cochrane (1989).
CHAPTER 6
Asset allocation with constant investment opportunities
6.1 Introduction
In this chapter we will consider the relatively simple case in which the short-term interest rate $r$, the expected rates of return $\mu$, and the volatility matrix $\sigma$ of the risky assets are all assumed to be constant through time. The market price of risk vector $\lambda$ is therefore also a constant. We shall also assume that the investor has no income other than the returns on the financial investments, i.e., $y \equiv 0$. This is the problem originally considered by Merton (1969). A direct consequence of these additional assumptions is that the risky asset price processes in (5.3) become geometric Brownian motions so that future risky asset prices are lognormally distributed, as is well-known from the Black-Scholes model for stock option pricing; see, e.g., Hull (2009). In this case the wealth dynamics for a given consumption strategy $c$ and a given portfolio weight process $\pi$ is
$$dW_t = \left(W_t\left[r + \pi_t^\top\sigma\lambda\right] - c_t\right)dt + W_t\pi_t^\top\sigma\,dz_t, \tag{6.1}$$
and the indirect utility function (sometimes called the value function) is a function of only current wealth and time
$$J(W,t) = \sup_{(c_s,\pi_s)_{s\in[t,T]}} \mathrm{E}_{W,t}\left[\int_t^T e^{-\delta(s-t)}u(c_s)\,ds + e^{-\delta(T-t)}\bar u(W_T)\right],$$
where $\mathrm{E}_{W,t}$ denotes the expectations operator given $W_t = W$ (and given the chosen consumption and investment strategies).
We will first attack this problem applying the dynamic programming approach and try to solve the HJB equation associated with the utility maximization problem. From (5.10), we have that the HJB equation is given by
$$\delta J(W,t) = \mathcal{L}^c J(W,t) + \mathcal{L}^\pi J(W,t) + \frac{\partial J}{\partial t}(W,t) + rWJ_W(W,t) \tag{6.2}$$
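with
$$\mathcal{L}^c J(W,t) = \sup_{c\ge 0}\left\{u(c) - cJ_W(W,t)\right\}, \tag{6.3}$$
$$\mathcal{L}^\pi J(W,t) = \sup_{\pi\in\mathbb{R}^d}\left\{WJ_W(W,t)\pi^\top\sigma\lambda + \frac{1}{2}J_{WW}(W,t)W^2\pi^\top\sigma\sigma^\top\pi\right\}. \tag{6.4}$$
The terminal condition is
$$J(W,T) = \bar u(W).$$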
In Section 6.2 we will see how far we can get for a general utility function. Then in Sections 6.3 and 6.4 we specialize to CRRA and logarithmic utility, respectively, for which explicit solutions can be obtained (in Section 8.2 we derive the same results using the martingale approach). In Section 6.5 we analyze further the optimal investment strategy for the CRRA investors. Section 6.6 discusses how wealth, investments, and consumption vary over the life-cycle. Section 6.7 explains how to quantify the loss from following a suboptimal strategy. Finally, Section 6.8 considers the importance of the frequency of portfolio rebalancing.
6.2 General utility function
We will try to solve our consumption and investment problem by an application of the verification theorem, Theorem 5.1, i.e., by solving the HJB equation (6.2). The first-order condition for the maximization in (6.3) leads to
$$u'(c) = J_W(W,t),$$
where we have used the fact that the non-negativity constraint on consumption will not be binding under the assumption that marginal utility is infinite for zero consumption (or even at a positive subsistence level of consumption). This optimality condition is called the envelope condition, which we also derived in a discrete-time framework in Chapter 4, cf. Equation (4.10). The condition says that the marginal utility from currently consuming one unit more must equal the marginal utility from investing that unit optimally. This is an intuitive optimality condition for intertemporal choice. If we let $I_u$ denote the inverse of marginal utility $u'(c)$, we can write our candidate for the optimal consumption strategy as
$$c^*_t = C(W^*_t, t),$$
where
$$C(W,t) = I_u(J_W(W,t)). \tag{6.5}$$
Substituting the maximizing $c$ back into (6.3), we get
$$\mathcal{L}^c J(W,t) = u\left(I_u(J_W(W,t))\right) - I_u(J_W(W,t))\,J_W(W,t).$$
The first-order condition for the (unconstrained) maximization in (6.4) leads to
$$J_W(W,t)W\sigma\lambda + J_{WW}(W,t)W^2\sigma\sigma^\top\pi = 0.$$
Isolating $\pi$, we get
$$\pi = -\frac{J_W(W,t)}{WJ_{WW}(W,t)}\left(\sigma^\top\right)^{-1}\lambda,$$
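so that our candidate for the optimal investment strategy can be written as
$$\pi^*_t = \Pi(W^*_t, t),$$
where
$$\Pi(W,t) = -\frac{J_W(W,t)}{WJ_{WW}(W,t)}\left(\sigma^\top\right)^{-1}\lambda = -\frac{J_W(W,t)}{WJ_{WW}(W,t)}\left(\sigma\sigma^\top\right)^{-1}(\mu - r\mathbf{1}). \tag{6.6}$$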
Note that the fraction $-J_W(W,t)/[WJ_{WW}(W,t)]$ is the relative risk tolerance (i.e., the reciprocal of the relative risk aversion) of the indirect utility function. The optimal risky investment is therefore given by the relative risk tolerance of the investor times a vector that is the same for all investors (assuming they have the same perceptions about $\sigma$, $\mu$, and $r$), namely the inverse of the variance-covariance matrix multiplied by the vector of excess expected rates of return. The second-order conditions for a maximum are satisfied since $J$ is concave in $W$ and $u$ is concave in $c$. Substituting the maximizing $\pi$ back into (6.4) and simplifying, we get
$$\mathcal{L}^\pi J(W,t) = -\frac{1}{2}\|\lambda\|^2\frac{J_W(W,t)^2}{J_{WW}(W,t)},$$
where $\|\lambda\|^2 = \lambda^\top\lambda$.
The HJB equation is thus transformed into the second order PDE
$$\delta J(W,t) = u\left(I_u(J_W(W,t))\right) - J_W(W,t)I_u(J_W(W,t)) + \frac{\partial J}{\partial t}(W,t) + rWJ_W(W,t) - \frac{1}{2}\|\lambda\|^2\frac{J_W(W,t)^2}{J_{WW}(W,t)}. \tag{6.7}$$
If this PDE has a solution $J(W,t)$ such that the strategy defined by (6.5) and (6.6) is feasible (satisfies the technical conditions), then we know from the verification theorem that this strategy is indeed the optimal consumption and investment strategy and the function $J(W,t)$ is indeed the indirect utility function. We shall sometimes consider problems with no utility from intermediate consumption, i.e., $u \equiv 0$. In that case, it is of course optimal not to consume, and it is relatively easy to see that the first two terms of the right-hand side of (6.7) will vanish, i.e., the equation simplifies to
$$\delta J(W,t) = \frac{\partial J}{\partial t}(W,t) + rWJ_W(W,t) - \frac{1}{2}\|\lambda\|^2\frac{J_W(W,t)^2}{J_{WW}(W,t)}.$$
In the following sections we shall obtain simple, closed-form solutions for problems with CRRA and logarithmic utility. In Exercise 6.4 at the end of the chapter we will consider the problem with a subsistence HARA utility function, where a simple solution also can be obtained. Semi-explicit solutions for other utility functions have been given by Karatzas, Lehoczky, Sethi, and Shreve (1986). Merton (1971, Sec. 6) claimed to have found a solution for the general class of HARA functions but, as noted by Sethi and Taksar (1988), this solution does not satisfy the non-negativity constraints on wealth and consumption.
Without further computations we can already note an important result: With constant $r$, $\mu$, and $\sigma$, two-fund separation is obtained in the continuous-time setting. This is obvious from the optimal investment strategy in (6.6).
Theorem 6.1 (Two-fund separation). In a financial market with constant $r$, $\mu$, and $\sigma$, the optimal investment strategy of any unconstrained investor with time-separable utility of the form (5.1) and
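no non-financial income is a combination of the risk-free asset and a single portfolio of risky assets given by the weights
$$\pi^{\tan} = \frac{1}{\mathbf{1}^\top\left(\sigma^\top\right)^{-1}\lambda}\left(\sigma^\top\right)^{-1}\lambda = \frac{1}{\mathbf{1}^\top\left(\sigma\sigma^\top\right)^{-1}(\mu - r\mathbf{1})}\left(\sigma\sigma^\top\right)^{-1}(\mu - r\mathbf{1}). \tag{6.8}$$
The investor will invest the fraction $-\frac{J_W(W,t)}{WJ_{WW}(W,t)}\mathbf{1}^\top\left(\sigma^\top\right)^{-1}\lambda$ of her wealth in the risky fund and the remaining wealth in the risk-free asset.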
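As a numerical illustration of (6.6) and (6.8), the following sketch computes the common risky-asset vector in both of its forms and the tangency weights for a two-asset market. All numbers are made up for illustration, and the sketch assumes the market price of risk satisfies $\sigma\lambda = \mu - r\mathbf{1}$, which is what makes the two expressions in (6.6) coincide.

```python
import numpy as np

# Hypothetical market with two risky assets (all numbers are illustrative).
r = 0.04
mu = np.array([0.09, 0.07])
sigma = np.array([[0.20, 0.00],
                  [0.06, 0.08]])          # volatility matrix
ones = np.ones(len(mu))

# Market price of risk, assuming sigma @ lam = mu - r * 1.
lam = np.linalg.solve(sigma, mu - r * ones)

# The two equivalent forms of the vector that is common to all investors.
via_lambda = np.linalg.solve(sigma.T, lam)                 # (sigma^T)^{-1} lambda
via_cov = np.linalg.solve(sigma @ sigma.T, mu - r * ones)  # (sigma sigma^T)^{-1} (mu - r 1)

pi_tan = via_cov / (ones @ via_cov)       # tangency weights, normalized to sum to one

# Two investors who differ only in relative risk tolerance (hypothetical values).
for risk_tolerance in (0.25, 0.50):
    pi = risk_tolerance * via_cov
    print(risk_tolerance, pi, pi / pi.sum())

print("tangency portfolio:", pi_tan)
```

Both investors hold the risky assets in the same proportions; only the total fraction of wealth placed in the risky fund differs.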
The portfolio $\pi^{\tan}$ is almost indistinguishable from the tangency portfolio (3.19) of the one-period mean-variance analysis, but in the continuous-time case the relevant expected rates of return and variances and covariances are measured over the next infinitesimal period of time. With this little modification of the interpretation we can again look at the investment problem graphically in a (standard deviation, mean)-diagram as we are used to from the static one-period setting. Also, we again have the conclusion that all investors should hold risky assets in the same proportion, i.e., $\pi_i/\pi_j$ is the same for all investors. Note that the necessary assumption of lognormal prices is much more realistic than the normality assumption in the one-period model. Analogous to the one-period setting, the two-fund separation result above is the basis for a capital market equilibrium result, which in the continuous-time case is referred to as the Intertemporal Capital Asset Pricing Model (ICAPM) or the Continuous-time CAPM; see, e.g., Merton (1973b), Duffie (2001), Cochrane (2005), and Munk (2012) for more on equilibrium asset pricing.
6.3 CRRA utility function
We will now focus on the case where the utility function exhibits constant relative risk aversion.
We are interested in three types of problems:
(1) utility from consumption only,
(2) utility from terminal wealth only,
(3) utility both from consumption and terminal wealth.
We can solve all three problems simultaneously by introducing two non-negative coefficients $\varepsilon_1$ and $\varepsilon_2$ and letting
$$u(c) = \varepsilon_1\frac{c^{1-\gamma}}{1-\gamma},\qquad \bar u(W) = \varepsilon_2\frac{W^{1-\gamma}}{1-\gamma}.$$
Situation (1) above corresponds to $\varepsilon_2 = 0$ and $\varepsilon_1 > 0$. The exact value of $\varepsilon_1$ has no impact on optimal decisions, but $\varepsilon_1 = 1$ would be the natural choice as notation is then simpler. Similarly, situation (2) corresponds to $\varepsilon_1 = 0$ and $\varepsilon_2 > 0$ with $\varepsilon_2 = 1$ being the natural choice (in that case we can disregard discounting and put $\delta = 0$). Finally, situation (3) requires both $\varepsilon_1 > 0$ and $\varepsilon_2 > 0$. The ratio $\varepsilon_2/\varepsilon_1$ determines the relative importance of terminal wealth and intermediate consumption and will therefore in general affect the optimal decisions, but we could fix one of the coefficients (to 1, for example) without loss of generality. In order to encompass all three situations, we will allow for general $\varepsilon_1 \ge 0$ and $\varepsilon_2 \ge 0$ with $\varepsilon_1 + \varepsilon_2 > 0$. The indirect utility function is
$$J(W,t) = \sup_{(c_s,\pi_s)_{s\in[t,T]}} \mathrm{E}_{W,t}\left[\varepsilon_1\int_t^T e^{-\delta(s-t)}\frac{c_s^{1-\gamma}}{1-\gamma}\,ds + \varepsilon_2 e^{-\delta(T-t)}\frac{W_T^{1-\gamma}}{1-\gamma}\right].$$
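The marginal utility for consumption is $u'(c) = \varepsilon_1 c^{-\gamma}$. If $\varepsilon_1 > 0$, marginal utility has the inverse function $I_u(a) = \varepsilon_1^{1/\gamma}a^{-1/\gamma}$. Consequently, we have that
$$u(I_u(a)) = \varepsilon_1\frac{I_u(a)^{1-\gamma}}{1-\gamma} = \varepsilon_1^{1/\gamma}\frac{a^{1-1/\gamma}}{1-\gamma}$$
and
$$u(I_u(a)) - aI_u(a) = \varepsilon_1^{1/\gamma}\frac{a^{1-1/\gamma}}{1-\gamma} - \varepsilon_1^{1/\gamma}a^{1-1/\gamma} = \varepsilon_1^{1/\gamma}\frac{\gamma}{1-\gamma}a^{1-1/\gamma}.$$
The first two terms on the right-hand side of Eq. (6.7) are thus equal to $\varepsilon_1^{1/\gamma}\frac{\gamma}{1-\gamma}J_W^{1-1/\gamma}$. This is also true if $\varepsilon_1 = 0$. Therefore, the HJB equation with or without intermediate consumption implies that
$$\delta J(W,t) = \varepsilon_1^{1/\gamma}\frac{\gamma}{1-\gamma}J_W(W,t)^{1-\frac{1}{\gamma}} + \frac{\partial J}{\partial t}(W,t) + rWJ_W(W,t) - \frac{1}{2}\|\lambda\|^2\frac{J_W(W,t)^2}{J_{WW}(W,t)}. \tag{6.9}$$
The terminal condition is that $J(W,T) = \varepsilon_2 W^{1-\gamma}/(1-\gamma)$.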
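Due to the linearity of the wealth dynamics in (6.1) it seems reasonable to conjecture that if the strategy $(c^*,\pi^*)$ is optimal with time $t$ wealth $W$ and the corresponding wealth process $W^*$, then the strategy $(kc^*,\pi^*)$ will be optimal with time $t$ wealth $kW$ and the corresponding wealth process $kW^*$. If this is true, then
$$J(kW,t) = \mathrm{E}_t\left[\varepsilon_1\int_t^T e^{-\delta(s-t)}\frac{(kc^*_s)^{1-\gamma}}{1-\gamma}\,ds + \varepsilon_2 e^{-\delta(T-t)}\frac{(kW^*_T)^{1-\gamma}}{1-\gamma}\right]$$
$$= k^{1-\gamma}\,\mathrm{E}_t\left[\varepsilon_1\int_t^T e^{-\delta(s-t)}\frac{(c^*_s)^{1-\gamma}}{1-\gamma}\,ds + \varepsilon_2 e^{-\delta(T-t)}\frac{(W^*_T)^{1-\gamma}}{1-\gamma}\right] = k^{1-\gamma}J(W,t),$$
i.e., the indirect utility function $J(W,t)$ is homogeneous of degree $1-\gamma$ in the wealth $W$. Inserting $k = 1/W$ and rearranging, we get
$$J(W,t) = \frac{g(t)^\gamma W^{1-\gamma}}{1-\gamma},$$
where $g(t)^\gamma = (1-\gamma)J(1,t)$. From the terminal condition $J(W,T) = \varepsilon_2 W^{1-\gamma}/(1-\gamma)$, we have that $g(T)^\gamma = \varepsilon_2$, hence $g(T) = \varepsilon_2^{1/\gamma}$.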
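The relevant derivatives of our guess $J(W,t)$ are
$$J_W(W,t) = g(t)^\gamma W^{-\gamma},\qquad J_{WW}(W,t) = -\gamma g(t)^\gamma W^{-\gamma-1},$$
$$\frac{\partial J}{\partial t}(W,t) = \frac{\gamma}{1-\gamma}g(t)^{\gamma-1}g'(t)W^{1-\gamma}.$$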
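As an intermediate step, and only as a sketch of the algebra, each term of (6.9) can be evaluated under the guess $J(W,t) = g(t)^\gamma W^{1-\gamma}/(1-\gamma)$ using the derivatives just listed:

```latex
\begin{align*}
\delta J &= \frac{\delta}{1-\gamma}\, g(t)^{\gamma} W^{1-\gamma},\\
\varepsilon_1^{1/\gamma}\frac{\gamma}{1-\gamma}\, J_W^{1-1/\gamma}
  &= \varepsilon_1^{1/\gamma}\frac{\gamma}{1-\gamma}\, g(t)^{\gamma-1} W^{1-\gamma},\\
\frac{\partial J}{\partial t}
  &= \frac{\gamma}{1-\gamma}\, g(t)^{\gamma-1} g'(t)\, W^{1-\gamma},\\
r W J_W &= r\, g(t)^{\gamma} W^{1-\gamma},\\
-\frac{1}{2}\|\lambda\|^2 \frac{J_W^2}{J_{WW}}
  &= \frac{1}{2\gamma}\|\lambda\|^2\, g(t)^{\gamma} W^{1-\gamma}.
\end{align*}
```

Every term carries the common factor $g(t)^{\gamma-1}W^{1-\gamma}$, which is why the partial differential equation collapses to an ordinary differential equation in $g$ alone.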
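Substituting into (6.9) and gathering terms, we get
$$\left\{\left(\frac{\delta}{1-\gamma} - r - \frac{1}{2\gamma}\|\lambda\|^2\right)g(t) - \varepsilon_1^{1/\gamma}\frac{\gamma}{1-\gamma} - \frac{\gamma}{1-\gamma}g'(t)\right\}g(t)^{\gamma-1}W^{1-\gamma} = 0.$$
Since this equation should hold for all $W$ and all $t \in [0,T)$, the term in the brackets must be equal to zero for all $t$, i.e., the function $g$ must satisfy the ordinary differential equation
$$g'(t) = Ag(t) - \varepsilon_1^{1/\gamma} \tag{6.10}$$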
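with the terminal condition $g(T) = \varepsilon_2^{1/\gamma}$. Here $A$ is the constant
$$A = \frac{\delta + r(\gamma-1)}{\gamma} + \frac{1}{2}\frac{\gamma-1}{\gamma^2}\|\lambda\|^2 = \frac{\delta + r(\gamma-1)}{\gamma} + \frac{1}{2}\frac{\gamma-1}{\gamma^2}(\mu - r\mathbf{1})^\top\left(\sigma\sigma^\top\right)^{-1}(\mu - r\mathbf{1}), \tag{6.11}$$
which we assume is different from zero. It can be checked that the solution is given by (see footnote 1)
$$g(t) = \frac{1}{A}\left(\varepsilon_1^{1/\gamma} + \left[\varepsilon_2^{1/\gamma}A - \varepsilon_1^{1/\gamma}\right]e^{-A(T-t)}\right).$$
We will generally assume that the relative risk aversion $\gamma$ exceeds 1 and that $\delta$ and $r$ are non-negative, and in that case we have $A > 0$.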
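Let us show that $g(t) \ge 0$ for all $t \in [0,T]$. It is sufficient to demonstrate that the function
$$G(\tau) = \frac{1}{A}\left(\varepsilon_1^{1/\gamma} + \left[\varepsilon_2^{1/\gamma}A - \varepsilon_1^{1/\gamma}\right]e^{-A\tau}\right)$$
is non-negative for all $\tau \ge 0$. Note that $G(0) = \varepsilon_2^{1/\gamma} \ge 0$ and $G'(\tau) = \left(\varepsilon_1^{1/\gamma} - \varepsilon_2^{1/\gamma}A\right)e^{-A\tau}$. We split the analysis into three cases:
(1) Suppose $\varepsilon_1^{1/\gamma} = \varepsilon_2^{1/\gamma}A$. Since $\varepsilon_1$ and $\varepsilon_2$ are not allowed both to be zero, this case is only possible if both $\varepsilon_1$ and $\varepsilon_2$ are strictly positive. The function $G$ is then constant, $G(\tau) = \varepsilon_1^{1/\gamma}/A = \varepsilon_2^{1/\gamma} > 0$ for all $\tau$.
(2) Suppose $\varepsilon_1^{1/\gamma} > \varepsilon_2^{1/\gamma}A$. Then $G'(\tau) > 0$ for all $\tau$ so that $G$ is monotonically increasing and, since $G(0) \ge 0$, we have $G(\tau) > 0$ for $\tau > 0$. For $A > 0$, the limit is $\lim_{\tau\to\infty}G(\tau) = \varepsilon_1^{1/\gamma}/A > \varepsilon_2^{1/\gamma}$. For $A < 0$, $G(\tau) \to \infty$ for $\tau \to \infty$.
(3) Suppose $\varepsilon_1^{1/\gamma} < \varepsilon_2^{1/\gamma}A$. Since both $\varepsilon_1$ and $\varepsilon_2$ are non-negative, this can only happen if $A > 0$. We have $G'(\tau) < 0$ so that $G$ is monotonically decreasing, but the limit $\lim_{\tau\to\infty}G(\tau) = \varepsilon_1^{1/\gamma}/A$ is non-negative. Hence, $G(\tau)$ stays non-negative.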
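We summarize our findings in the following theorem:
Theorem 6.2. Assume that the constant $A$ defined in (6.11) is different from zero. For the CRRA utility maximization problem in a market with constant $r$, $\mu$, and $\sigma$, we then have that the indirect utility function is given by
$$J(W,t) = \frac{g(t)^\gamma W^{1-\gamma}}{1-\gamma}$$
with
$$g(t) = \frac{1}{A}\left(\varepsilon_1^{1/\gamma} + \left[\varepsilon_2^{1/\gamma}A - \varepsilon_1^{1/\gamma}\right]e^{-A(T-t)}\right). \tag{6.12}$$
The optimal investment strategy is given by
$$\Pi(W,t) = \frac{1}{\gamma}\left(\sigma^\top\right)^{-1}\lambda = \frac{1}{\gamma}\left(\sigma\sigma^\top\right)^{-1}(\mu - r\mathbf{1}).$$
If the agent has utility from intermediate consumption ($\varepsilon_1 > 0$), her optimal consumption rate is
$$C(W,t) = \varepsilon_1^{1/\gamma}\frac{W}{g(t)} = A\left(1 + \left[(\varepsilon_2/\varepsilon_1)^{1/\gamma}A - 1\right]e^{-A(T-t)}\right)^{-1}W.$$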
1. For $A = 0$, the ODE (6.10) simplifies to $g'(t) = -\varepsilon_1^{1/\gamma}$, which with the terminal condition $g(T) = \varepsilon_2^{1/\gamma}$ has the solution $g(t) = \varepsilon_2^{1/\gamma} + \varepsilon_1^{1/\gamma}(T-t)$.
A similar result was first demonstrated by Merton (1969).
The optimal consumption strategy is to consume a time-varying fraction of wealth. It is easy to show that when $\varepsilon_2 > 0$, the consumption/wealth ratio approaches $(\varepsilon_1/\varepsilon_2)^{1/\gamma}$ as $t \to T$, whereas $c/W \to \infty$ for $t \to T$ when $\varepsilon_2 = 0$.
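The behavior of the consumption/wealth ratio can be checked numerically. The sketch below evaluates $A$ from (6.11), $g(t)$ from (6.12), and the ratio $\varepsilon_1^{1/\gamma}/g(t)$ for a hypothetical parameter set, once with and once without utility from terminal wealth. The function names and the numbers are illustrative assumptions, not part of the original notes.

```python
import math

def merton_A(gamma, delta, r, lam_sq):
    """The constant A in (6.11); lam_sq is the squared market price of risk."""
    return (delta + r * (gamma - 1.0)) / gamma + 0.5 * (gamma - 1.0) / gamma ** 2 * lam_sq

def g(t, T, gamma, delta, r, lam_sq, eps1=1.0, eps2=1.0):
    """The function g(t) in (6.12), with the A = 0 case handled separately."""
    A = merton_A(gamma, delta, r, lam_sq)
    e1, e2 = eps1 ** (1.0 / gamma), eps2 ** (1.0 / gamma)
    if A == 0.0:
        return e2 + e1 * (T - t)
    return (e1 + (e2 * A - e1) * math.exp(-A * (T - t))) / A

def consumption_wealth_ratio(t, T, gamma, delta, r, lam_sq, eps1=1.0, eps2=1.0):
    """Optimal c/W = eps1^(1/gamma) / g(t); only meaningful for eps1 > 0."""
    return eps1 ** (1.0 / gamma) / g(t, T, gamma, delta, r, lam_sq, eps1, eps2)

# Hypothetical parameters: 30-year horizon, gamma = 4, market price of risk 0.25.
params = dict(T=30.0, gamma=4.0, delta=0.03, r=0.04, lam_sq=0.25 ** 2)
for t in (0.0, 10.0, 20.0, 29.0, 29.9):
    with_bequest = consumption_wealth_ratio(t, eps2=1.0, **params)
    no_bequest = consumption_wealth_ratio(t, eps2=0.0, **params)
    print(f"t={t:5.1f}  c/W with terminal utility={with_bequest:.4f}  without={no_bequest:.4f}")
```

Without utility from terminal wealth the ratio grows without bound as $t$ approaches $T$, since the investor then wants to consume everything before the horizon.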
The higher the risk aversion coefficient $\gamma$, the lower the investment in the risky assets and the higher the investment in the risk-free asset. The optimal investment strategy is independent of the horizon of the investor. The fraction of wealth invested in each asset is to be kept constant over time. Note that this requires continuous rebalancing of the portfolio since the prices of individual assets vary all the time. Consider an asset which enters the optimal portfolio with a positive weight. If the price of this asset increases more than the prices of the other assets in the portfolio, the fraction of wealth made up by that asset will increase. Hence, the investor should reduce the number of units of that particular asset. So the optimal investment strategy is a "sell winners, buy losers" strategy. The fact that this asset has given a high return in the previous period has no consequence for the optimal position in that asset since the distribution of future returns is assumed to be constant over time. If the investor does not sell a recent winner stock, she will be too exposed to the risk of that stock.
Inserting the optimal strategy into the general expression for the dynamics of wealth, we find that
$$dW^*_t = W^*_t\left[\left(r + \frac{1}{\gamma}\|\lambda\|^2 - \varepsilon_1^{1/\gamma}g(t)^{-1}\right)dt + \frac{1}{\gamma}\lambda^\top dz_t\right]. \tag{6.13}$$
Therefore, optimal wealth evolves as a geometric Brownian motion (although with a time-dependent drift). Future values of wealth are lognormally distributed. In particular, wealth stays positive. The optimal strategy is to be further analyzed in Exercise 6.1 at the end of the chapter.
For the case where the agent only gets utility from terminal wealth ($\varepsilon_1 = 0$, $\varepsilon_2 = 1$ and $\delta = 0$), the function $g$ reduces to $g(t) = e^{-A(T-t)}$ and
$$A = \frac{\gamma-1}{\gamma}\left(r + \frac{1}{2\gamma}\|\lambda\|^2\right).$$
Hence, the indirect utility function can be written as
$$J(W,t) = \frac{1}{1-\gamma}e^{-\gamma A(T-t)}W^{1-\gamma} = \frac{1}{1-\gamma}e^{(1-\gamma)\left(r + \frac{1}{2\gamma}\|\lambda\|^2\right)(T-t)}W^{1-\gamma}.$$
The optimal investment strategy is unaltered. Exactly the same portfolio should be held whether or not the agent has utility from intermediate consumption. With constant investment opportunities and time-additive CRRA utility there is no clear link between investment and consumption. Of course, wealth will evolve differently over time if the agent withdraws money for consumption. Consequently, ceteris paribus, the value of the portfolio and the number of units held of the different assets will be different (smaller) with utility from intermediate consumption.
6.4 Logarithmic utility
The solution for the case of logarithmic utility is obtained by a similar procedure. This is the subject of Exercise 6.2 at the end of the chapter. The indirect utility function is here defined as
$$J(W,t) = \sup_{(c_s,\pi_s)_{s\in[t,T]}} \mathrm{E}_{W,t}\left[\varepsilon_1\int_t^T e^{-\delta(s-t)}\ln c_s\,ds + \varepsilon_2 e^{-\delta(T-t)}\ln W_T\right].$$
The result is:
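Theorem 6.3. For the logarithmic utility maximization problem in a market with constant $r$, $\mu$, and $\sigma$, we have that the indirect utility function is given by
$$J(W,t) = g(t)\ln W + h(t),$$
with
$$g(t) = \frac{1}{\delta}\left(\varepsilon_1 + \left[\varepsilon_2\delta - \varepsilon_1\right]e^{-\delta(T-t)}\right) \tag{6.14}$$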
and, for t < T,
h(t) =

r +
1
2
kk
2

2
e
(Tt)
_

2
+

1

(T t)
2
(T t)
_
g(t) ln g(t).
The optimal investment strategy is given by
$$\Pi(W,t) = \left(\sigma^\top\right)^{-1}\lambda = \left(\sigma\sigma^\top\right)^{-1}(\mu - r\mathbf{1}),$$
and if the agent has utility from intermediate consumption ($\varepsilon_1 > 0$) the optimal consumption strategy is
$$C(W,t) = \varepsilon_1 g(t)^{-1}W = \delta\left(1 + \left[\delta(\varepsilon_2/\varepsilon_1) - 1\right]e^{-\delta(T-t)}\right)^{-1}W.$$
Note that if we take the limit of $g(t)$ defined in Eq. (6.12) as $\gamma \to 1$, we get the expression given in Eq. (6.14). Also note that the optimal strategy for the logarithmic utility case can be obtained by taking limits of the optimal strategy for the CRRA case as $\gamma \to 1$.
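The limit can also be checked numerically. The following sketch evaluates $g(t)$ from (6.12) for risk aversion values approaching 1 and compares it with $g(t)$ from (6.14). The parameter values are hypothetical, and the sketch assumes $\delta > 0$ so that the limiting value of $A$ is nonzero.

```python
import math

def g_crra(t, T, gamma, delta, r, lam_sq, eps1, eps2):
    """g(t) from (6.12) for CRRA utility with relative risk aversion gamma != 1."""
    A = (delta + r * (gamma - 1.0)) / gamma + 0.5 * (gamma - 1.0) / gamma ** 2 * lam_sq
    e1, e2 = eps1 ** (1.0 / gamma), eps2 ** (1.0 / gamma)
    return (e1 + (e2 * A - e1) * math.exp(-A * (T - t))) / A

def g_log(t, T, delta, eps1, eps2):
    """g(t) from (6.14) for logarithmic utility."""
    return (eps1 + (eps2 * delta - eps1) * math.exp(-delta * (T - t))) / delta

# Hypothetical parameters; gamma approaches 1 from above.
for gamma in (2.0, 1.1, 1.01, 1.001):
    print(gamma, g_crra(5.0, 30.0, gamma, 0.03, 0.04, 0.0625, 1.0, 2.0))
print("log utility:", g_log(5.0, 30.0, 0.03, 1.0, 2.0))
```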
6.5 Discussion of the optimal investment strategy for CRRA utility
Many empirical studies have documented that in the past century long-term stock investments have in most cases outperformed (i.e., have given a higher return than) a long-term bond investment. Over short investment horizons, the dominance of stock investments is less clear. Referring to these empirical facts, many investment consultants recommend that long-term investors should place a large part of their wealth in stocks and then gradually shift from stocks to bonds as they get older and their investment horizon shrinks. This recommendation conflicts with the optimal portfolio strategy we have derived above. According to our analysis, the optimal portfolio weights of CRRA investors are independent of the investment horizon. Is this because our model of the financial asset prices is inconsistent with the empirical facts mentioned before? The answer is no. To see this let us consider the simplest case with a single stock (representing the stock index) with price dynamics
$$dP_t = P_t\left[\mu\,dt + \sigma\,dz_t\right],$$
where $\mu$ and $\sigma$ as well as the interest rate $r$ are constants. In other words, the price process is a geometric Brownian motion. This implies that
$$P_T = P_0\,e^{\left(\mu - \frac{1}{2}\sigma^2\right)T + \sigma z_T}.$$
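Since $z_T \sim N(0,T)$, the probability that a stock investment outperforms a risk-free investment over a period of $T$ years is equal to
$$\mathrm{Prob}\left(\frac{P_T}{P_0} > e^{rT}\right) = \mathrm{Prob}\left(\left(\mu - \tfrac{1}{2}\sigma^2\right)T + \sigma z_T > rT\right)$$
$$= \mathrm{Prob}\left(z_T > -\frac{\left(\mu - r - \frac{1}{2}\sigma^2\right)T}{\sigma}\right) = \mathrm{Prob}\left(z_T < \frac{\left(\mu - r - \frac{1}{2}\sigma^2\right)T}{\sigma}\right) = N\left(\frac{(\mu - r - \sigma^2/2)\sqrt{T}}{\sigma}\right),$$
where $N(\cdot)$ is the cumulative distribution function for a standard normally distributed random variable.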
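The outperformance probability is easy to evaluate. The sketch below implements the final expression above with the standard library only; the helper names are hypothetical, and the parameter values ($r = 4\%$, $\sigma = 20\%$, and four values of $\mu$) are the ones used for the curves in Figure 6.1.

```python
import math

def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def outperformance_probability(mu, r, sigma, T):
    """Probability that the stock beats the risk-free asset over T years."""
    return norm_cdf((mu - r - 0.5 * sigma ** 2) * math.sqrt(T) / sigma)

for mu in (0.06, 0.09, 0.12, 0.15):
    probs = [outperformance_probability(mu, 0.04, 0.20, T) for T in (1, 10, 40)]
    print(f"mu={mu:.2f}: " + ", ".join(f"{p:.3f}" for p in probs))
```

For $\mu = 6\%$ the drift of the log-price equals the interest rate, so the probability is one half at every horizon; for higher $\mu$ it rises with the horizon.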
Figure 6.1 illustrates the relation between the outperformance probability and the investment horizon. The curves differ with respect to the presumed expected rate of return on the stock, i.e., $\mu$, whereas the interest rate is 4% and the volatility of the stock is 20% for all curves. Empirical studies indicate that U.S. stocks over a 100-year period have had an average excess rate of return of 8-9% per year. A $\mu$-value of 15% corresponds to an expected excess rate of return of 9% per year since $0.15 - 0.04 - (0.20)^2/2 = 0.09$. However, it should be emphasized that historical estimates of expected rates of return, volatilities, and correlations are not necessarily good predictors of the future values of these quantities. In particular, the value of the excess expected rate of return on the stock market is frequently discussed both among practitioners and academics. There are several reasons to believe that the average return on the U.S. stock market over the past century is higher than what the stock market is currently offering in terms of expected returns. This discussion is also closely linked to the so-called equity premium puzzle. See, e.g., Mehra and Prescott (1985), Weil (1989), Welch (2000), Shiller (2000), Mehra (2003), and Ibbotson and Chen (2003). Probably the curves labeled $\mu = 9\%$ and $\mu = 12\%$ are more representative of the current investment opportunities. In any case, it is tempting to conclude from the graph that long-term investors should invest more in stocks than short-term investors. Why does the optimal portfolio derived previously not reflect this property?
It is important to realize that the optimal decision cannot be based just on the probabilities of gains and losses. After all, most individuals will reject a gamble with a 99% probability of winning 1 dollar and a 1% probability of losing a million dollars. The magnitudes of gains and losses are also important for the optimal investment decision. Let us look at the probability that a stock investment will provide a return which is $K$ percentage points lower than a risk-free investment over the same period, i.e.,
$$\mathrm{Prob}\left(\frac{P_T}{P_0} < e^{rT} - K\right) = \mathrm{Prob}\left(\left(\mu - \tfrac{1}{2}\sigma^2\right)T + \sigma z_T < \ln\left(e^{rT} - K\right)\right)$$
$$= \mathrm{Prob}\left(z_T < \frac{\ln\left(e^{rT} - K\right) - \left(\mu - \frac{1}{2}\sigma^2\right)T}{\sigma}\right) = N\left(\frac{\ln\left(e^{rT} - K\right) - \left(\mu - \frac{1}{2}\sigma^2\right)T}{\sigma\sqrt{T}}\right).$$
Figure 6.1: Outperformance probabilities. The figure shows the probability that a stock investment outperforms a risk-free investment over different investment horizons. For all curves the risk-free interest rate is 4%, and the volatility of the stock is 20%. Each curve corresponds to the value of the parameter $\mu$ shown beside the curve (6%, 9%, 12%, and 15%). Horizontal axis: investment horizon in years (0 to 40); vertical axis: outperformance probability (40% to 100%).

Table 6.1 shows such probabilities for various combinations of the return shortfall constant $K$ and the investment horizon. (The numbers in the row labeled 0% are equal to 100% minus the outperformance probabilities shown in Figure 6.1.) Over a 10-year period the return on a risk-free investment at a rate of 4% per year is
$$\left(e^{0.04\cdot 10} - 1\right)\cdot 100\% \approx 49.1\%.$$
The table shows that with a 22.2% probability a stock investment over a 10-year period will give a return which is lower than 49.1% − 25% = 24.1%, and there is a 5.7% probability that the stock return will be lower than 49.1% − 75% = −25.9%. Over a 40-year period the risk-free return is 395%. There is a 13% probability that a stock investment will give a return which is at least 100 percentage points lower, i.e., lower than 295%. Over longer periods the probability that stocks underperform bonds is lower, but the probability of extremely bad stock returns is larger than over short periods. The expected excess return on the stock increases with the length of the investment horizon, but so does the variance of the return. Any risk-averse investor has to consider this trade-off. For a CRRA investor in our simple financial model, the two effects offset each other exactly so that the optimal portfolio is independent of the investment horizon.
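The shortfall probabilities quoted above can be reproduced with a few lines of Python. This is a sketch using the parameter values stated in the caption of Table 6.1 ($\mu = 9\%$, $r = 4\%$, $\sigma = 20\%$); the function names are hypothetical, and the guard for a non-positive threshold is an addition of this sketch for shortfalls so large that they would require a negative gross return.

```python
import math

def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def shortfall_probability(K, T, mu=0.09, r=0.04, sigma=0.20):
    """P(P_T/P_0 < exp(r*T) - K) for a geometric Brownian motion stock price."""
    threshold = math.exp(r * T) - K
    if threshold <= 0.0:
        return 0.0   # the gross stock return cannot fall below zero
    z = (math.log(threshold) - (mu - 0.5 * sigma ** 2) * T) / (sigma * math.sqrt(T))
    return norm_cdf(z)

horizons = (1, 10, 40)
print("shortfall  " + "  ".join(f"{T:>3d} yr" for T in horizons))
for K in (0.0, 0.25, 0.50, 0.75, 1.00):
    row = "  ".join(f"{100 * shortfall_probability(K, T):5.1f}%" for T in horizons)
    print(f"{100 * K:8.0f}%  {row}")
```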
6.6 The life-cycle
Let us look at how wealth, consumption, and investments vary over the life-cycle. Of course,
these quantities all depend on the future shocks to the prices of the nancial assets and thus to
the wealth of the individual, but we can compute the expected future wealth, consumption, and
investment given the initial wealth.
Excess return on bond    1 year    10 years    40 years
0%                       44.0%     31.8%       17.1%
25%                       6.4%     22.2%       16.1%
50%                       0.0%     13.1%       15.1%
75%                       0.0%      5.7%       14.0%
100%                      0.0%      1.3%       13.0%

Table 6.1: Underperformance probabilities. The table shows the probability that a stock
investment over a period of 1, 10, and 40 years provides a percentage return which is at least 0,
25, 50, 75, or 100 percentage points lower than the risk-free return. The numbers are computed
using the parameter values $\mu = 9\%$, $r = 4\%$, and $\sigma = 20\%$.

First, consider consumption. Optimal consumption at time $t$ is given in terms of wealth and
time by
\[
c^*_t = \varepsilon_1^{1/\gamma}\,\frac{W^*_t}{g(t)}.
\]
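With the wealth dynamics in (6.13), the consumption dynamics follow from an application of Itô's
Lemma:
\[
\begin{aligned}
dc^*_t &= \frac{\varepsilon_1^{1/\gamma}}{g(t)}\,dW^*_t - \varepsilon_1^{1/\gamma}\,\frac{g'(t)}{g(t)^2}\,W^*_t\,dt \\
&= c^*_t\left[\left(r + \frac{1}{\gamma}\|\lambda\|^2 - A\right)dt + \frac{1}{\gamma}\lambda^\top dz_t\right] \\
&= c^*_t\left[\left(\frac{1}{\gamma}(r-\delta) + \frac{\gamma+1}{2\gamma^2}\|\lambda\|^2\right)dt + \frac{1}{\gamma}\lambda^\top dz_t\right],
\end{aligned}
\]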
where we have applied (6.10) and (6.11). Consequently, optimal consumption is a geometric
Brownian motion. In particular, the initial expectation of the future consumption is (see properties
of the geometric Brownian motion in Section B.8.1 of the appendix)
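\[
\begin{aligned}
\mathrm{E}[c^*_t] &= c^*_0 \exp\left\{\left(\frac{1}{\gamma}(r-\delta) + \frac{\gamma+1}{2\gamma^2}\|\lambda\|^2\right)t\right\} \\
&= \frac{W_0 A}{1 + \left[(\varepsilon_2/\varepsilon_1)^{1/\gamma}A - 1\right]e^{-AT}}\,\exp\left\{\left(\frac{1}{\gamma}(r-\delta) + \frac{\gamma+1}{2\gamma^2}\|\lambda\|^2\right)t\right\}.
\end{aligned}
\]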
Clearly, consumption is expected to increase with age, decrease with age, or to be age-independent
depending on whether $\frac{1}{\gamma}(r-\delta) + \frac{\gamma+1}{2\gamma^2}\|\lambda\|^2$ is positive, negative, or zero. With realistic parameters,
the constant is positive so that consumption should increase, on average, over life.
Empirical studies show a hump-shaped consumption pattern over the life-cycle (Browning and
Crossley 2001, Gourinchas and Parker 2002) so that consumption typically increases up to around
age 40-45 and then drops throughout the rest of life. The simple model considered in this chapter
cannot generate such a pattern. In fact, the more advanced models with closed-form solutions
that we will look at in subsequent chapters cannot match the hump either. Several explanations of
the hump have been suggested in the literature, including mortality risk (Hansen and İmrohoroglu
2008, Feigenbaum 2008), borrowing constraints (Thurow 1969, Gourinchas and Parker 2002), and
endogenous labor supply with a hump-shaped wage profile (Bullard and Feigenbaum 2007). However,
none of these additional features would preserve the explicitness of our solutions in this
model.[2] Numerical solutions that include mortality risk and borrowing constraints in a setting
with labor income can generate the consumption hump, cf., for example, Cocco, Gomes, and
Maenhout (2005).
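Next, consider wealth. From (6.13) it is clear that expected future wealth is
\[
\mathrm{E}[W^*_t] = W^*_0 \exp\left\{\left(r + \frac{1}{\gamma}\|\lambda\|^2\right)t - \varepsilon_1^{1/\gamma}\int_0^t \frac{1}{g(u)}\,du\right\},
\]
and it can be shown that
\[
\begin{aligned}
\varepsilon_1^{1/\gamma}\int_0^t \frac{1}{g(u)}\,du
&= A\int_0^t \frac{1}{1 + \left[(\varepsilon_2/\varepsilon_1)^{1/\gamma}A - 1\right]e^{-A[T-u]}}\,du \\
&= At - \ln\left(\frac{1 + \left[(\varepsilon_2/\varepsilon_1)^{1/\gamma}A - 1\right]e^{-A[T-t]}}{1 + \left[(\varepsilon_2/\varepsilon_1)^{1/\gamma}A - 1\right]e^{-AT}}\right)
\end{aligned}
\]
so that
\[
\mathrm{E}[W^*_t] = W^*_0 \exp\left\{\left(\frac{1}{\gamma}(r-\delta) + \frac{\gamma+1}{2\gamma^2}\|\lambda\|^2\right)t\right\}
\frac{1 + \left[(\varepsilon_2/\varepsilon_1)^{1/\gamma}A - 1\right]e^{-A[T-t]}}{1 + \left[(\varepsilon_2/\varepsilon_1)^{1/\gamma}A - 1\right]e^{-AT}}.
\]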
One can show that the sign of the derivative $\partial \mathrm{E}[W^*_t]/\partial t$ is equal to the sign of
\[
\left(r + \frac{1}{\gamma}\|\lambda\|^2\right)\left(1 + \left[(\varepsilon_2/\varepsilon_1)^{1/\gamma}A - 1\right]e^{-A[T-t]}\right) - A.
\]
For the special case with no utility of terminal wealth, $\varepsilon_2 = 0$, the sign will be negative at least
for $t$ very close to $T$, which makes sense since in that case the individual will consume all wealth
before the terminal date. More generally, the behavior of $\mathrm{E}[W^*_t]$ over life depends on the
relative weights on consumption and terminal wealth, on the time preference rate and relative risk
aversion (which affect $A$), and on the investment opportunities (via $r$ and $\|\lambda\|^2$).
The expected amounts invested in the financial assets in the future are simply
\[
\frac{1}{\gamma}\left(\sigma^\top\right)^{-1}\lambda\,\mathrm{E}[W^*_t],
\]
which obviously follow the same life-cycle pattern as wealth itself.
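The expected life-cycle paths of wealth and consumption derived above are easy to tabulate. The sketch below uses hypothetical parameter values ($r = 2\%$, $\delta = 3\%$, $\gamma = 2$, $\|\lambda\|^2 = 0.09$, $T = 40$) and an illustrative function name. The constant $A$ is not restated in this section, so the code backs it out from the two equivalent drift expressions in the consumption dynamics, $r + \|\lambda\|^2/\gamma - A = (r-\delta)/\gamma + (\gamma+1)\|\lambda\|^2/(2\gamma^2)$; it also assumes $\varepsilon_1 > 0$.

```python
import math

def life_cycle_expectations(t, T, W0, r, delta, gamma, lam2, eps1=1.0, eps2=1.0):
    """Return (E[W_t], E[c_t]) for the CRRA investor with constant opportunities.

    lam2 is the squared norm of the market price of risk vector.
    """
    # Expected growth rate of consumption (drift of the geometric Brownian motion).
    drift = (r - delta) / gamma + (gamma + 1.0) / (2.0 * gamma**2) * lam2
    # A implied by equating the two drift expressions (assumption stated above).
    A = r + lam2 / gamma - drift
    k = (eps2 / eps1) ** (1.0 / gamma) * A - 1.0

    def h(s):
        return 1.0 + k * math.exp(-A * (T - s))

    growth = math.exp(drift * t)
    expected_wealth = W0 * growth * h(t) / h(0.0)
    expected_consumption = W0 * A / h(0.0) * growth
    return expected_wealth, expected_consumption

# Hypothetical example: no utility of terminal wealth (eps2 = 0).
for t in (0, 10, 20, 30, 40):
    w, c = life_cycle_expectations(t, 40.0, 100.0, 0.02, 0.03, 2.0, 0.09, eps2=0.0)
    print(f"t = {t:2d}: E[W] = {w:7.2f}, E[c] = {c:6.3f}")
```

With $\varepsilon_2 = 0$ the expected wealth falls to zero at the terminal date, in line with the remark that the individual then consumes all wealth before $T$.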
6.7 Loss due to suboptimal investments
In this section we want to assess the importance of getting the portfolio exactly right, so we
disregard consumption and put $\delta = 0$, $\varepsilon_1 = 0$, and $\varepsilon_2 = 1$. We focus on the case with a single
risky asset in addition to the riskfree asset. For any fixed portfolio weight $\pi$ in the risky asset, the
wealth dynamics will be
\[
dW^\pi_t = W^\pi_t\left[(r + \pi\sigma\lambda)\,dt + \pi\sigma\,dz_t\right],
\]
so that wealth follows a geometric Brownian motion. It can be shown (see Exercise 6.3) that the
expected utility for a given $\pi$ is
\[
V^\pi(W,t) \equiv \mathrm{E}_t\left[\frac{1}{1-\gamma}\left(W^\pi_T\right)^{1-\gamma}\right] = \frac{1}{1-\gamma}\left(g^\pi(t)\right)^{\gamma} W^{1-\gamma}, \tag{6.15}
\]
[2] Labor supply flexibility is limited and thus induces constraints that, like borrowing constraints, prevent
closed-form solutions. Mortality risk effectively implies an increasing time preference rate over life which may produce a
consumption hump, but it also adds unspanned risk to the labor income impeding the computation of human wealth
in closed form, unless the investor can purchase full insurance against the loss of income in case of death (Kraft
and Steffensen 2008). However, the actual demand for such insurance contracts is much smaller than a theoretical
model would suggest, even for the simple constant-income life annuities relevant in retirement as reflected by the
discussion of the so-called annuity puzzle (Davidoff, Brown, and Diamond 2005, Inkmann, Lopes, and Michaelides
2011).
Figure 6.2: Welfare losses for different levels of risk aversion. The figure shows the
percentage wealth-equivalent utility loss $\ell^\pi_t$ from applying a suboptimal constant portfolio weight
$\pi$ instead of the optimal portfolio weight. The loss is depicted as a function of the suboptimal
portfolio weight $\pi$ with different curves for different levels of the relative risk aversion $\gamma$. The
investment horizon is $T - t = 10$ years, the Sharpe ratio of the stock is $\lambda = 0.3$, and the
volatility of the stock is $\sigma = 0.2$.
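where
\[
g^\pi(t) = \exp\left\{\frac{1-\gamma}{\gamma}\left(r + \pi\sigma\lambda - \frac{\gamma}{2}\pi^2\sigma^2\right)(T-t)\right\}.
\]
Moreover, the percentage wealth loss $\ell^\pi_t$ defined in (5.11) is
\[
\ell^\pi_t = 1 - e^{-\frac{1}{2}\gamma\sigma^2(\pi-\pi^*)^2(T-t)} \approx \frac{1}{2}\gamma\sigma^2(\pi-\pi^*)^2(T-t), \tag{6.16}
\]
where the approximation $e^x \approx 1 + x$ for $x$ near 0 is used.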
Figure 6.2 illustrates the wealth loss as a function of the portfolio weight $\pi$ for four different
levels of the relative risk aversion $\gamma$. The investment horizon is fixed to 10 years, the Sharpe ratio
of the stock is assumed to be $\lambda = 0.3$, and the volatility of the stock is assumed to be $\sigma = 0.2$
so that the excess expected return on the stock is $\sigma\lambda = 0.06 = 6\%$. We see that the losses are
relatively flat around the optimal portfolio weight. Large deviations from the optimal portfolio
weight are necessary to obtain substantial losses. Highly risk-averse individuals are more sensitive
to deviations from the optimal portfolio weight. Figure 6.3 depicts the wealth loss as a function
of $\pi$ for different investment horizons. Clearly, the individual suffers a bigger loss from following a
suboptimal strategy over longer periods.
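The loss formula (6.16) is simple enough to evaluate directly. The sketch below assumes the standard CRRA result $\pi^* = \lambda/(\gamma\sigma)$ for the optimal weight with a single risky asset (consistent with the 75% weight used in the example of Section 6.8) and uses the parameter values of Figure 6.2; the function name is illustrative.

```python
import math

def wealth_equivalent_loss(pi, gamma, lam=0.3, sigma=0.2, horizon=10.0):
    """Return (exact loss, first-order approximation) from (6.16).

    Assumption: optimal weight pi* = lam / (gamma * sigma).
    """
    pi_star = lam / (gamma * sigma)
    exponent = 0.5 * gamma * sigma**2 * (pi - pi_star) ** 2 * horizon
    return 1.0 - math.exp(-exponent), exponent

# Loss from holding no stocks at all, for different levels of risk aversion.
for gamma in (1.0, 2.0, 3.0, 6.0):
    exact, approx = wealth_equivalent_loss(0.0, gamma)
    print(f"gamma = {gamma:3.0f}: exact {exact:6.2%}, approximation {approx:6.2%}")
```

Comparing the two columns also shows where the linear approximation starts to overstate the loss.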
6.8 Infrequent rebalancing of the portfolio
The optimal investment strategy with CRRA utility and constant investment opportunities is to
keep a fixed portfolio weight in each asset. However, that requires continuous rebalancing of the
portfolio as the prices of the different assets do not move in parallel. Continuous rebalancing is not
practically possible. Moreover, even with tiny trading costs per transaction, continuous rebalancing
would be infinitely expensive. It is therefore interesting to see how bad it is to rebalance in a
non-continuous way. Let us disregard consumption in the following considerations and assume a single
risky asset.

Figure 6.3: Welfare losses for different investment horizons. The figure shows the
percentage wealth-equivalent utility loss $\ell^\pi_t$ from applying a suboptimal constant portfolio weight
$\pi$ instead of the optimal portfolio weight. The loss is depicted as a function of the suboptimal
portfolio weight $\pi$ with different curves for different investment horizons $T - t$. The relative risk
aversion is $\gamma = 2$, the Sharpe ratio of the stock is $\lambda = 0.3$, and the volatility of the stock is
$\sigma = 0.2$.
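A very simple strategy is to predetermine a finite number of trading dates. At each trading
date the portfolio is rebalanced so that the portfolio weights coincide with the solution for the
continuous-time case. In between trading dates, the portfolio weights will deviate from the truly
optimal weights. Suppose that $\Delta t > 0$ is the time period between any two adjacent trading dates.
Suppose the portfolio is rebalanced at time $t$ so that the total wealth $W_t$ is split into the amount
$\pi W_t$ invested in the stock and the amount $(1-\pi)W_t$ in the riskfree asset. The gross return on the
stock until the next rebalancing is
\[
\frac{S_{t+\Delta t}}{S_t} = \exp\left\{\left(r + \sigma\lambda - \frac{1}{2}\sigma^2\right)\Delta t + \sigma\left(z_{t+\Delta t} - z_t\right)\right\},
\]
and the gross return on the riskfree investment is $\exp\{r\Delta t\}$. The wealth at time $t + \Delta t$ is therefore
\[
\begin{aligned}
W_{t+\Delta t} &= \pi W_t \exp\left\{\left(r + \sigma\lambda - \frac{1}{2}\sigma^2\right)\Delta t + \sigma\left(z_{t+\Delta t} - z_t\right)\right\} + (1-\pi)W_t\exp\{r\Delta t\} \\
&= W_t e^{r\Delta t}\left(1 + \pi\left[\exp\left\{\left(\sigma\lambda - \frac{1}{2}\sigma^2\right)\Delta t + \sigma\left(z_{t+\Delta t} - z_t\right)\right\} - 1\right]\right).
\end{aligned}
\]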
Seen at time $t$, the only random variable on the right-hand side is $z_{t+\Delta t} - z_t \sim N(0, \Delta t)$. The
discrete rebalancing strategy can be evaluated by Monte Carlo simulation.[3] The wealth can be
simulated forward using the above relation by replacing $z_{t+\Delta t} - z_t$ by $\varepsilon_{t+\Delta t}\sqrt{\Delta t}$, where $\varepsilon_{t+\Delta t}$
is a draw from the standard normal distribution, $N(0, 1)$, with independent draws for different
time steps as the increments to the standard Brownian motion over non-overlapping intervals are
independent.[4] We can generate a simulated value of the terminal wealth $W_T$ and compute the
utility $u(W_T) = \frac{1}{1-\gamma}W_T^{1-\gamma}$. By generating a large number, $M$, of samples $W^m_T$ of terminal wealth,
we can take the average utility as an approximation of the expected utility of terminal wealth for
this discrete rebalancing strategy:
\[
\mathrm{E}[u(W_T)] \approx \frac{1}{M}\sum_{m=1}^{M} u\left(W^m_T\right).
\]
We can then compare that (approximation of the) expected utility with the value function and
compute a percentage wealth-equivalent loss $\ell_t$ as defined in (5.11) and used above.

[3] Monte Carlo simulation is described in most derivatives textbooks, e.g., Hull (2009) and Munk (2011).
As an example, assume $r = 0.02$, $\sigma = 0.2$, and $\lambda = 0.3$, and consider an investor with a relative
risk aversion of $\gamma = 2$ and an investment horizon of $T - t = 10$ years. The optimal strategy
is to have $\pi = 0.75 = 75\%$ of the wealth invested in the stock at any point in time. If we fix
initial wealth to 1, the indirect utility will be $-0.65377$. In a Monte Carlo simulation procedure
implemented in Microsoft Excel, 2000 antithetic pairs of terminal wealth were simulated using
quarterly rebalancing.[5] The average utility was $-0.65547$, which corresponds to a wealth-equivalent
loss of only 0.26% (in Exercise 6.5 you are asked to do similar experiments). This experiment
indicates that it is not important to rebalance the portfolio very frequently. Between two adjacent
rebalancing dates the portfolio weight of the stock can deviate somewhat from the optimal weight,
but the deviation is typically rather small, and we have already seen in the previous section that
expected utility is relatively insensitive to small deviations from the optimal strategy.
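The experiment can be replicated outside a spreadsheet. The following is a minimal sketch under the assumptions of this section (utility of terminal wealth only, a single risky asset, $\gamma \neq 1$, and $0 \le \pi \le 1$ so that simulated wealth stays positive). It uses Python's built-in generator rather than the Excel procedure mentioned above, so the simulated numbers will differ from those quoted. Function and variable names are illustrative, and the closed-form benchmark is (6.15) evaluated at the optimal weight $\pi^* = \lambda/(\gamma\sigma)$.

```python
import math
import random

def simulate_rebalancing_loss(r=0.02, sigma=0.2, lam=0.3, gamma=2.0,
                              horizon=10.0, trades_per_year=4,
                              n_pairs=2000, seed=1):
    """Return (indirect utility, simulated average utility, wealth-equivalent loss)."""
    rng = random.Random(seed)
    pi = lam / (gamma * sigma)            # continuous-time optimal weight
    steps = int(round(horizon * trades_per_year))
    dt = horizon / steps
    drift = (sigma * lam - 0.5 * sigma**2) * dt
    vol = sigma * math.sqrt(dt)
    riskfree_growth = math.exp(r * dt)

    def utility(w):
        return w ** (1.0 - gamma) / (1.0 - gamma)

    total = 0.0
    for _pair in range(n_pairs):
        w_plus = w_minus = 1.0            # initial wealth fixed to 1
        for _step in range(steps):
            eps = rng.gauss(0.0, 1.0)
            # Antithetic pair: the same draw is used with opposite signs.
            w_plus *= riskfree_growth * (1.0 + pi * (math.exp(drift + vol * eps) - 1.0))
            w_minus *= riskfree_growth * (1.0 + pi * (math.exp(drift - vol * eps) - 1.0))
        total += 0.5 * (utility(w_plus) + utility(w_minus))
    average_utility = total / n_pairs

    # Indirect utility with continuous rebalancing and initial wealth 1.
    indirect = math.exp((1.0 - gamma) * (r + lam**2 / (2.0 * gamma)) * horizon) / (1.0 - gamma)
    # Solve J(1 - loss) = average utility for the loss.
    loss = 1.0 - (average_utility / indirect) ** (1.0 / (1.0 - gamma))
    return indirect, average_utility, loss

indirect, average_utility, loss = simulate_rebalancing_loss()
print(f"indirect utility {indirect:.5f}, simulated {average_utility:.5f}, loss {loss:.2%}")
```

Because the true loss from quarterly rebalancing is small, the estimate is noisy at this sample size and can even come out slightly negative; increasing `n_pairs` narrows the sampling error.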
Rogers (2001) provides a more formal analysis of the impact of infrequent portfolio rebalancing.
Branger, Breuer, and Schlag (2010) perform a detailed Monte Carlo simulation study, also for some
models with stochastic investment opportunities that we will discuss in later chapters. Their study
confirms that for investment problems involving only stocks and bonds, relatively infrequent
rebalancing induces small wealth-equivalent losses. However, when derivatives are included, frequent
rebalancing is sometimes important.

[4] Some spreadsheet applications, programming environments, and other software tools may have a built-in
procedure for generating such draws, but not all of them are of a good quality, i.e., if you use the procedure for
generating a number of such draws, the distribution of these draws may be quite different from the standard normal
distribution. Alternatively, you can generate draws from the $N(0, 1)$ distribution by transforming draws from a
uniform distribution on the unit interval, a distribution we will denote by $U[0, 1]$. Most computer tools used for
financial applications have a built-in generator of random numbers from the $U[0, 1]$ distribution, but there are also
algorithms for generating these draws that can easily be implemented in any programming environment, cf., e.g.,
Press, Teukolsky, Vetterling, and Flannery (2007, Ch. 7). A popular choice is the so-called Box-Muller transformation
suggested by Box and Muller (1958). Given two draws $U_1$ and $U_2$ from the uniform $U[0, 1]$ distribution, $\varepsilon_1$
and $\varepsilon_2$ defined by
\[
\varepsilon_1 = \sqrt{-2\ln U_1}\,\cos(2\pi U_2), \qquad \varepsilon_2 = \sqrt{-2\ln U_1}\,\sin(2\pi U_2)
\]
are two independent draws from the standard normal distribution. An alternative approach is to transform a draw
$U$ from the $U[0, 1]$ distribution into a draw $\varepsilon$ from the $N(0, 1)$ distribution by
\[
\varepsilon = N^{-1}(U),
\]
where $N^{-1}(\cdot)$ denotes the inverse of the probability distribution function $N(\cdot)$ associated with the standard normal
distribution, i.e., $N(x) = \int_{-\infty}^{x} \frac{1}{\sqrt{2\pi}}\exp(-z^2/2)\,dz$. This follows from the fact that $P(\varepsilon < a) = P(N^{-1}(U) < a) =
P(U < N(a)) = N(a)$. Of course, this approach requires an implementation of the inverse normal distribution
$N^{-1}(\cdot)$, which is not known in closed form. Again, some software tools (such as Microsoft Excel) have a built-in
algorithm for computing the inverse normal distribution, but the precision of the algorithm is generally unknown to
the user, and the computation is bound to be more time-consuming than when using the Box-Muller transformation.

[5] The idea of antithetic variates is explained in most textbook presentations of Monte Carlo simulation, including
Hull (2009) and Munk (2011).
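The Box-Muller transformation described in the footnote on random number generation can be sketched as follows. This is an illustration only: the function names are hypothetical, and the uniform draws come from Python's standard generator, which returns values in $[0, 1)$, so the first draw is flipped to $(0, 1]$ to keep the logarithm finite.

```python
import math
import random

def box_muller(u1, u2):
    """Map two uniform draws (u1 in (0, 1], u2 in [0, 1)) to two N(0, 1) draws."""
    radius = math.sqrt(-2.0 * math.log(u1))
    angle = 2.0 * math.pi * u2
    return radius * math.cos(angle), radius * math.sin(angle)

def standard_normal_draws(n, seed=0):
    """Generate n standard normal draws via the Box-Muller transformation."""
    rng = random.Random(seed)
    draws = []
    while len(draws) < n:
        u1 = 1.0 - rng.random()   # lies in (0, 1], avoids log(0)
        u2 = rng.random()
        draws.extend(box_muller(u1, u2))
    return draws[:n]

sample = standard_normal_draws(20000)
mean = sum(sample) / len(sample)
variance = sum((x - mean) ** 2 for x in sample) / len(sample)
print(f"sample mean {mean:.3f}, sample variance {variance:.3f}")
```

A quick check of the sample mean and variance, as printed here, is a crude version of the quality test the footnote recommends before trusting any generator.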
6.9 Exercises
Exercise 6.1. Consider the optimal consumption and investment strategy for a CRRA investor
(with no labor income) in a market with constant $r$, $\mu$, and $\sigma$, cf. Theorem 6.2. How does the
optimal strategy depend on time and the parameters of the model? (You may assume that only
one risky asset is traded.)
Exercise 6.2. Give a proof of Theorem 6.3.
Exercise 6.3. Verify the expressions (6.15) and (6.16). Try to create figures like Figures 6.2 and 6.3.
Show that the alternative loss measure $\tilde{\ell}_t$ under the given assumptions becomes
\[
\tilde{\ell}_t = e^{\frac{1}{2}\gamma\sigma^2(\pi-\pi^*)^2(T-t)} - 1 \approx \frac{1}{2}\gamma\sigma^2(\pi-\pi^*)^2(T-t),
\]
so that the two loss measures are approximately the same for small deviations from the optimal
strategy.
Exercise 6.4. Assume a financial market with a constant risk-free rate $r$ and risky assets with
constant $\mu$ and $\sigma$. Consider an investor with no income from non-financial sources and an indirect
utility function
\[
J(W, t) = \sup_{(c_s,\pi_s)_{s\in[t,T]}} \mathrm{E}_{W,t}\left[\int_t^T e^{-\delta(s-t)}u(c_s)\,ds\right],
\]
where $u$ now is a subsistence HARA function,
\[
u(c) = \frac{(c - \bar{c})^{1-\gamma}}{1-\gamma}
\]
with $\bar{c}$ being the subsistence level of consumption. What is the optimal consumption and investment
strategy for this investor? Compare with the standard CRRA solution. Hint: How do you invest
to finance the subsistence level of consumption in the rest of your life? What is the cost of that
investment? The remaining wealth can be invested freely.
Exercise 6.5. Implement a Monte Carlo simulation to study the impact of infrequent trading
as explained in Section 6.8. Consider an investor with utility of terminal wealth only, a constant
relative risk aversion $\gamma$, and an investment horizon of $T - t$. The market consists of a riskfree asset
with a constant rate of return $r$ and a single risky asset with volatility $\sigma$ and a Sharpe ratio $\lambda$,
both assumed constant. Experiment with the frequency of trading, e.g., by considering 1, 4, 12,
and 52 trading dates per year. Compute wealth-equivalent losses for the discrete-trading strategies
compared to the continuous-time solution. How sensitive are the wealth-equivalent losses to the
parameters $r$, $\sigma$, $\lambda$, $\gamma$, and $T - t$?
CHAPTER 7
Stochastic investment opportunities: the general case
7.1 Introduction
In the previous chapter we analyzed the optimal investment/consumption decision under the
assumption of constant investment opportunities, i.e., constant interest rates, expected rates of
return, volatilities, and correlations. However, it is well-documented that some, if not all, of these
quantities vary over time in a stochastic manner. This situation is referred to as a stochastic
investment opportunity set. In this chapter we will study the dynamic investment/consumption
choice in a general financial market with stochastic investment opportunities. In later chapters we
will then focus on concrete models in which, for example, interest rates or expected excess stock
returns follow some specific dynamics.
The main effect of allowing investment opportunities to vary over time is easy to explain.
Risk-averse investors with time-additive utility are reluctant to substitute consumption over time, as
discussed in Section 2.7. To keep consumption stable across states and time, a (sufficiently)
risk-averse investor will therefore choose a portfolio with high positive returns in states with relatively
bad future investment opportunities (or bad future labor income) and conversely. This is what is
known as intertemporal hedging. The optimal investment strategy will thus be different from
the case with constant investment opportunities. From this argument, we also see that there will
be a close link between the optimal consumption strategy and the intertemporal hedging part of
the optimal investment strategy.
In the rest of this chapter we will formalize these issues in a general modeling framework. We will
continue to assume that the investor receives no non-financial income, i.e., no labor income, and
refer to Chapter 13 for the extension to the case with labor income. Throughout the chapter we
apply the dynamic programming approach, i.e., we focus on solving the Hamilton-Jacobi-Bellman
equation associated with the utility maximization problem.
7.2 General utility functions
7.2.1 One-dimensional state variable
As in Section 5.3 we assume that there is a stochastically evolving state variable $x = (x_t)$ that
captures the variations in $r$, $\mu$, and $\sigma$ over time. The variations in the state variable $x$ determine
the future expected returns and covariance structure in the financial market. For simplicity we will
first consider the case where $x$ is one-dimensional and afterwards turn to the multi-dimensional
case.
The dynamics of the $d$ risky asset prices is in this setting given by
\[
\begin{aligned}
dP_t &= \operatorname{diag}(P_t)\left[\mu(x_t, t)\,dt + \sigma(x_t, t)\,dz_t\right] \\
&= \operatorname{diag}(P_t)\left[\left(r(x_t)\mathbf{1} + \sigma(x_t, t)\lambda(x_t)\right)dt + \sigma(x_t, t)\,dz_t\right].
\end{aligned}
\]
We assume that $x$ follows a one-dimensional diffusion process
\[
dx_t = m(x_t)\,dt + v(x_t)^\top dz_t + \hat{v}(x_t)\,d\hat{z}_t, \tag{7.1}
\]
where $\hat{z}$ is a one-dimensional standard Brownian motion independent of $z$. If $\hat{v}(x_t) \neq 0$, the market
is incomplete; otherwise, it is complete. Let
\[
\Sigma_x(x) = v(x)^\top v(x) + \hat{v}(x)^2
\]
denote the instantaneous variance of the state variable. For a given consumption strategy $c = (c_t)$
and investment strategy $\pi = (\pi_t)$ the wealth evolves as
\[
dW_t = W_t\left[r(x_t) + \pi_t^\top\sigma(x_t, t)\lambda(x_t)\right]dt - c_t\,dt + W_t\pi_t^\top\sigma(x_t, t)\,dz_t,
\]
and the indirect utility function is defined by
\[
J(W, x, t) = \sup_{(c_s,\pi_s)_{s\in[t,T]}} \mathrm{E}_{W,x,t}\left[\int_t^T e^{-\delta(s-t)}u(c_s)\,ds + e^{-\delta(T-t)}\bar{u}(W_T)\right].
\]
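The HJB equation associated with this problem is
\[
\begin{aligned}
\delta J(W, x, t) = {} & \mathcal{L}^c J(W, x, t) + \mathcal{L}^\pi J(W, x, t) + \frac{\partial J}{\partial t}(W, x, t) + r(x)WJ_W(W, x, t) \\
& + J_x(W, x, t)m(x) + \frac{1}{2}J_{xx}(W, x, t)\Sigma_x(x),
\end{aligned} \tag{7.2}
\]
with the terminal condition $J(W, x, T) = \bar{u}(W)$. Here
\[
\mathcal{L}^c J(W, x, t) = \sup_{c \ge 0}\left\{u(c) - cJ_W(W, x, t)\right\}, \tag{7.3}
\]
\[
\begin{aligned}
\mathcal{L}^\pi J(W, x, t) = \sup_{\pi\in\mathbb{R}^d}\Big\{ & WJ_W(W, x, t)\pi^\top\sigma(x, t)\lambda(x) + \frac{1}{2}J_{WW}(W, x, t)W^2\pi^\top\sigma(x, t)\sigma(x, t)^\top\pi \\
& + J_{Wx}(W, x, t)W\pi^\top\sigma(x, t)v(x)\Big\}.
\end{aligned} \tag{7.4}
\]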
The first-order condition with respect to $c$ is
\[
u'(c) = J_W(W, x, t)
\]
so that the (candidate) optimal consumption strategy is
\[
c^*_t = C(W^*_t, x_t, t),
\]
where
\[
C(W, x, t) = I_u\left(J_W(W, x, t)\right) \tag{7.5}
\]
and, as before, $I_u(\cdot)$ is the inverse of $u'(\cdot)$. Substituting the maximizing $c$ back into (7.3), we get
\[
\mathcal{L}^c J(W, x, t) = u\left(I_u(J_W(W, x, t))\right) - I_u\left(J_W(W, x, t)\right)J_W(W, x, t). \tag{7.6}
\]
Note that these relations are exactly as in the case with constant investment opportunities studied
in Section 6.2 with the only exception that the indirect utility function now depends on the state
variable $x$.
The first-order condition with respect to $\pi$ is different than with constant investment
opportunities:
\[
J_W(W, x, t)W\sigma(x, t)\lambda(x) + J_{WW}(W, x, t)W^2\sigma(x, t)\sigma(x, t)^\top\pi + J_{Wx}(W, x, t)W\sigma(x, t)v(x) = 0
\]
so that the candidate optimal portfolio is
\[
\pi^*_t = \Pi(W^*_t, x_t, t),
\]
where
\[
\Pi(W, x, t) = -\frac{J_W(W, x, t)}{WJ_{WW}(W, x, t)}\left(\sigma(x, t)^\top\right)^{-1}\lambda(x) - \frac{J_{Wx}(W, x, t)}{WJ_{WW}(W, x, t)}\left(\sigma(x, t)^\top\right)^{-1}v(x). \tag{7.7}
\]
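Substituting the maximizing $\pi$ back into (7.4) and simplifying, we get
\[
\begin{aligned}
\mathcal{L}^\pi J(W, x, t) = {} & -\frac{1}{2}\|\lambda(x)\|^2\frac{J_W(W, x, t)^2}{J_{WW}(W, x, t)} - \frac{1}{2}\|v(x)\|^2\frac{J_{Wx}(W, x, t)^2}{J_{WW}(W, x, t)} \\
& - v(x)^\top\lambda(x)\frac{J_W(W, x, t)J_{Wx}(W, x, t)}{J_{WW}(W, x, t)}.
\end{aligned} \tag{7.8}
\]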
Let us take a closer look at the portfolio (7.7). As the horizon shrinks, the indirect utility
function $J(W, x, t)$ approaches the terminal utility function $\bar{u}(W)$ which is independent of the
state $x$. Consequently, the derivative $J_{Wx}(W, x, t)$ and hence the last term of the portfolio will
approach zero as $t \to T$. In other words, very short-term investors do not hedge. The last term
will also disappear for non-instantaneous investors in two special cases:
(1) $J_{Wx}(W, x, t) \equiv 0$: The state variable does not affect the marginal utility of the investor. As
we shall see below this is always true for investors with logarithmic utility. Such an investor
is not interested in hedging changes in the state variable.
(2) $v(x) \equiv 0$: The state variable is uncorrelated with instantaneous returns on the traded assets.
In this case the investor is not able to hedge changes in the state variable.
In all other cases the state variable induces an additional term to the optimal portfolio relative to
the case of constant investment opportunities. From (7.7) we have the following important result:
Theorem 7.1 (Three-fund separation). All investors will combine (1) the locally risk-free asset
(the bank account), (2) the tangency portfolio given by the weights
\[
\pi^{\tan}_t = \frac{1}{\mathbf{1}^\top\left(\sigma(x_t, t)^\top\right)^{-1}\lambda(x_t)}\left(\sigma(x_t, t)^\top\right)^{-1}\lambda(x_t),
\]
and (3) the hedge portfolio given by the weights
\[
\pi^{\mathrm{hdg}}_t = \frac{1}{\mathbf{1}^\top\left(\sigma(x_t, t)^\top\right)^{-1}v(x_t)}\left(\sigma(x_t, t)^\top\right)^{-1}v(x_t).
\]
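To see how the three funds arise from (7.7), one can rescale each of the two vectors in (7.7) so that its elements sum to one. A sketch of this step, assuming that the scalars $\mathbf{1}^\top(\sigma^\top)^{-1}\lambda$ and $\mathbf{1}^\top(\sigma^\top)^{-1}v$ are non-zero and suppressing the arguments $(W, x, t)$:

```latex
\[
\Pi = \underbrace{-\frac{J_W}{W J_{WW}}\,\mathbf{1}^\top\left(\sigma^\top\right)^{-1}\lambda}_{\text{fraction of wealth in the tangency portfolio}}\;\pi^{\tan}
\;+\;
\underbrace{\left(-\frac{J_{Wx}}{W J_{WW}}\right)\mathbf{1}^\top\left(\sigma^\top\right)^{-1}v}_{\text{fraction of wealth in the hedge portfolio}}\;\pi^{\mathrm{hdg}},
\]
```

with the remaining fraction $1 - \mathbf{1}^\top\Pi$ of wealth held in the bank account. Only the two scalar coefficients depend on the investor's preferences, through the derivatives of $J$.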
Note that the composition of the two risky funds varies over time due to fluctuations in the state
variable. It is no longer true that all investors will hold different risky assets in the same proportion,
i.e., the fractions $\pi_i/\pi_j$ will be investor-specific since different investors may put different weights on
the two portfolios of risky assets. The tangency portfolio has the same interpretation as previously.
The position in the portfolio $\pi^{\mathrm{hdg}}$ is the change in the optimal investment strategy due to the
stochastic variations in the investment opportunity set, hence the name hedge portfolio. The next
theorem shows that among all portfolios the hedge portfolio has the maximal absolute correlation
with the state variable. In that sense it is the portfolio that is best at hedging changes in the state
variable. In a complete market the maximal correlation is one and the hedge portfolio basically
replicates the dynamics of the state variable.
Theorem 7.2. The absolute value of the instantaneous correlation between the change in the value
of an investment strategy and the change in the state variable is maximized for the investment
strategy $\pi_t = \pi^{\mathrm{hdg}}_t$.
Proof. The value process of an investment strategy $\pi = (\pi_t)$ has dynamics
\[
dV^\pi_t = V^\pi_t\left[r(x_t) + \pi_t^\top\sigma(x_t, t)\lambda(x_t)\right]dt + V^\pi_t\pi_t^\top\sigma(x_t, t)\,dz_t.
\]
The instantaneous variance rate is $(V^\pi_t)^2\pi_t^\top\sigma(x_t, t)\sigma(x_t, t)^\top\pi_t$ and the instantaneous covariance
rate with the state variable is $V^\pi_t\pi_t^\top\sigma(x_t, t)v(x_t)$. Hence, the square of the instantaneous
correlation is
\[
\rho^2 = \frac{\left(V^\pi_t\pi_t^\top\sigma(x_t, t)v(x_t)\right)^2}{\left((V^\pi_t)^2\pi_t^\top\sigma(x_t, t)\sigma(x_t, t)^\top\pi_t\right)\Sigma_x(x_t)}
= \frac{\left(\pi_t^\top\sigma(x_t, t)v(x_t)\right)^2}{\left(\pi_t^\top\sigma(x_t, t)\sigma(x_t, t)^\top\pi_t\right)\Sigma_x(x_t)}.
\]
The portfolio that maximizes $\rho^2$ will also maximize the absolute correlation $|\rho|$. The first-order
condition for the maximization implies that
\[
\sigma(x_t, t)v(x_t)\left(\pi_t^\top\sigma(x_t, t)\sigma(x_t, t)^\top\pi_t\right) = \left(\pi_t^\top\sigma(x_t, t)v(x_t)\right)\sigma(x_t, t)\sigma(x_t, t)^\top\pi_t.
\]
Multiplying through by the inverse of $\sigma(x_t, t)\sigma(x_t, t)^\top$, we arrive at
\[
\left(\sigma(x_t, t)^\top\right)^{-1}v(x_t)\left(\pi_t^\top\sigma(x_t, t)\sigma(x_t, t)^\top\pi_t\right) = \left(\pi_t^\top\sigma(x_t, t)v(x_t)\right)\pi_t,
\]
which we want to solve for $\pi_t$. The sum of the elements of the vector on the left-hand side is
$\mathbf{1}^\top\left(\sigma(x_t, t)^\top\right)^{-1}v(x_t)\left(\pi_t^\top\sigma(x_t, t)\sigma(x_t, t)^\top\pi_t\right)$, while the sum of the elements of the right-hand
side vector is $\pi_t^\top\sigma(x_t, t)v(x_t)$ since $\mathbf{1}^\top\pi_t = 1$. Dividing each side by the sum of the elements, we
obtain
\[
\frac{\left(\sigma(x_t, t)^\top\right)^{-1}v(x_t)}{\mathbf{1}^\top\left(\sigma(x_t, t)^\top\right)^{-1}v(x_t)} = \pi_t,
\]
as was to be shown.
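Theorem 7.2 can be checked numerically in a small example. The sketch below uses two risky assets and purely hypothetical values for $\sigma$, $v$, and $\hat{v}$ (none of them from the text); it computes the hedge portfolio from Theorem 7.1 and compares its squared correlation with the state variable against randomly drawn fully invested portfolios. By the Cauchy-Schwarz inequality the maximal squared correlation should equal $\|v\|^2/\Sigma_x$.

```python
import random

def solve_2x2(a, b):
    """Solve a y = b for a 2x2 matrix a (list of rows) using Cramer's rule."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [(b[0] * a[1][1] - a[0][1] * b[1]) / det,
            (a[0][0] * b[1] - b[0] * a[1][0]) / det]

def squared_correlation(pi, sigma, v, v_hat):
    """Squared instantaneous correlation between portfolio value and state variable."""
    # s = sigma^T pi, the portfolio's sensitivity to each element of z.
    s = [sigma[0][0] * pi[0] + sigma[1][0] * pi[1],
         sigma[0][1] * pi[0] + sigma[1][1] * pi[1]]
    covariance = s[0] * v[0] + s[1] * v[1]          # pi^T sigma v
    variance = s[0] ** 2 + s[1] ** 2                # pi^T sigma sigma^T pi
    state_variance = v[0] ** 2 + v[1] ** 2 + v_hat ** 2
    return covariance ** 2 / (variance * state_variance)

# Hypothetical inputs (illustration only).
sigma = [[0.20, 0.00], [0.06, 0.15]]
v = [0.03, -0.02]
v_hat = 0.01

# Hedge portfolio: (sigma^T)^{-1} v, rescaled so that the weights sum to one.
sigma_T = [[sigma[0][0], sigma[1][0]], [sigma[0][1], sigma[1][1]]]
y = solve_2x2(sigma_T, v)
hedge = [y[0] / (y[0] + y[1]), y[1] / (y[0] + y[1])]
best = squared_correlation(hedge, sigma, v, v_hat)

rng = random.Random(0)
others = []
for _ in range(1000):
    w = rng.uniform(-5.0, 5.0)
    others.append(squared_correlation([w, 1.0 - w], sigma, v, v_hat))

print(f"hedge portfolio weights: {hedge[0]:.4f}, {hedge[1]:.4f}")
print(f"squared correlation: hedge {best:.4f}, best random draw {max(others):.4f}")
```

Setting `v_hat = 0.0` in this example corresponds to a complete market, in which the hedge portfolio's squared correlation with the state variable becomes one.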
Let us focus for a moment on the case with a single risky asset so that both $\sigma(x, t)$ and $v(x)$
are scalars. The hedge term in $\pi^*_t$ can then be written as $-\frac{J_{Wx}}{WJ_{WW}}\frac{v}{\sigma}$. Note that $J_{WW} < 0$ by
concavity. If $v$ and $\sigma$ have the same sign, then the return of the risky asset will be positively
correlated with changes in the state variable. In this case we see that the hedge demand for the
asset is positive if marginal utility $J_W$ is increasing in $x$ so that $J_{Wx} > 0$. This makes good sense:
relative to the situation with a constant investment opportunity set, the agent will devote a larger
fraction of wealth to a risky asset that has a high return in states of the world where marginal
utility is high. The converse holds if $v$ and $\sigma$ have opposite signs so that the asset return and the
state variable are negatively correlated.
Here is another interpretation of the optimal portfolio strategy, following Ingersoll (1987, p. 282):
Theorem 7.3. The optimal portfolio strategy $\pi^*$ is the one that minimizes fluctuations in
consumption over time among all portfolio strategies with the same expected rate of return as $\pi^*$.
Proof. The expected rate of return on the optimal portfolio in (7.7) is
\[
\mu^*(x, t) = r(x) + (\pi^*_t)^\top\left(\mu(x, t) - r(x)\mathbf{1}\right).
\]
The consumption rate is given by
\[
c_t = C(W_t, x_t, t).
\]
An application of Itô's Lemma yields
\[
dc_t = \dots\,dt + \left[C_W(W_t, x_t, t)W_t\pi_t^\top\sigma(x_t, t) + C_x(W_t, x_t, t)v(x_t)^\top\right]dz_t + C_x(W_t, x_t, t)\hat{v}(x_t)\,d\hat{z}_t,
\]
where we leave the drift term unspecified and the subscripts on $C$ denote partial derivatives. It
follows that the instantaneous variance rate of consumption is equal to
\[
\sigma_c^2 \equiv C_W(W, x, t)^2W^2\pi^\top\sigma(x, t)\sigma(x, t)^\top\pi + C_x(W, x, t)^2\Sigma_x(x) + 2C_W(W, x, t)C_x(W, x, t)W\pi^\top\sigma(x, t)v(x).
\]
Now consider the problem of minimizing $\sigma_c^2$ over all portfolios $\pi$ that have an expected rate of
return equal to $\mu^*(x, t)$, i.e., portfolios with $r(x) + \pi^\top\sigma(x, t)\lambda(x) = \mu^*(x, t)$. Forming the
Lagrangian
\[
\mathcal{L} = \sigma_c^2 + \nu\left[\mu^*(x, t) - r(x) - \pi^\top\sigma(x, t)\lambda(x)\right]
\]
we find the optimality condition
\[
\tilde{\pi} = \frac{\nu}{2C_W(W, x, t)^2W^2}\left(\sigma(x, t)^\top\right)^{-1}\lambda(x) - \frac{C_x(W, x, t)}{WC_W(W, x, t)}\left(\sigma(x, t)^\top\right)^{-1}v(x).
\]
Differentiating the envelope condition $u'(C(W, x, t)) = J_W(W, x, t)$ along the optimal consumption
path with respect to $W$ we get
\[
u''(C(W, x, t))C_W(W, x, t) = J_{WW}(W, x, t)
\]
and by differentiating with respect to $x$ we get
\[
u''(C(W, x, t))C_x(W, x, t) = J_{Wx}(W, x, t).
\]
Hence,
\[
\frac{C_x(W, x, t)}{WC_W(W, x, t)} = \frac{J_{Wx}(W, x, t)}{WJ_{WW}(W, x, t)}
\]
so that the second terms in $\tilde{\pi}$ and $\pi^*$ are identical. The first term in $\tilde{\pi}$ is proportional to the
first term in $\pi^*$ and since $\tilde{\pi}$ is chosen such that it has the same expected rate of return as $\pi^*$,
the first terms must also coincide. In total, $\tilde{\pi} = \pi^*$, which was to be shown.
On the other hand, if we minimize the instantaneous variance rate of wealth, i.e., $\sigma_W^2 =
\pi^\top\sigma(x, t)\sigma(x, t)^\top\pi$, over all portfolios having the same expected rate of return as $\pi^*$, we get
\[
\tilde{\pi} \propto \left(\sigma(x, t)^\top\right)^{-1}\lambda(x).
\]
This only involves the tangency portfolio. We can conclude that the investor is concerned about
fluctuations over time in consumption, not in wealth.
Above, we discussed the general expressions for the optimal consumption and investment strategy
in the presence of a state variable. But these were expressed in terms of the unknown indirect
utility function. How do we proceed to find concrete solutions?
Substituting (7.6) and (7.8) back into the HJB equation (7.2) and gathering terms, we get the
second order PDE
\[
\begin{aligned}
\delta J(W, x, t) = {} & u\left(I_u(J_W(W, x, t))\right) - J_W(W, x, t)I_u\left(J_W(W, x, t)\right) + \frac{\partial J}{\partial t}(W, x, t) + r(x)WJ_W(W, x, t) \\
& - \frac{1}{2}\frac{J_W(W, x, t)^2}{J_{WW}(W, x, t)}\|\lambda(x)\|^2 + J_x(W, x, t)m(x) + \frac{1}{2}J_{xx}(W, x, t)\Sigma_x(x) \\
& - \frac{1}{2}\frac{J_{Wx}(W, x, t)^2}{J_{WW}(W, x, t)}\|v(x)\|^2 - \frac{J_W(W, x, t)J_{Wx}(W, x, t)}{J_{WW}(W, x, t)}\lambda(x)^\top v(x).
\end{aligned} \tag{7.9}
\]
If this PDE has a solution $J(W, x, t)$ satisfying the terminal condition $J(W, x, T) = \bar{u}(W)$ and the
strategy defined by (7.5) and (7.7) is feasible (satisfies the technical conditions), then we know
from the verification theorem that this strategy is indeed the optimal consumption and investment
strategy and the function $J(W, x, t)$ is indeed the indirect utility function. With no utility from
intermediate consumption, i.e., $u \equiv 0$, the first two terms of the right-hand side of (7.9) vanish.
Although the PDE (7.9) looks very complicated, closed-form solutions can be found for a number
of interesting model specifications as we shall see later in this chapter and in other chapters.
7.2.2 Multi-dimensional state variable
Suppose now that the state variable $x$ is $k$-dimensional and follows the diffusion process
\[
dx_t = m(x_t)\,dt + v(x_t)^\top dz_t + \hat{v}(x_t)\,d\hat{z}_t,
\]
where $m$ now is a $k$-vector valued function, $v$ is a $(d \times k)$-matrix valued function, $\hat{v}$ is a
$(k \times k)$-matrix valued function, and $\hat{z}$ is a $k$-dimensional standard Brownian motion independent of $z$.
The instantaneous variance-covariance matrix of the state variable is the $(k \times k)$ matrix
\[
\Sigma_x(x) = v(x)^\top v(x) + \hat{v}(x)\hat{v}(x)^\top.
\]
As explained in Section 5.3, the HJB equation is then
\[
\begin{aligned}
\delta J(W, x, t) = {} & \mathcal{L}^c J(W, x, t) + \mathcal{L}^\pi J(W, x, t) + \frac{\partial J}{\partial t}(W, x, t) + r(x)WJ_W(W, x, t) \\
& + J_x(W, x, t)^\top m(x) + \frac{1}{2}\operatorname{tr}\left(J_{xx}(W, x, t)\Sigma_x(x)\right),
\end{aligned}
\]
where
\[
\mathcal{L}^c J(W, x, t) = \sup_{c \ge 0}\left\{u(c) - cJ_W(W, x, t)\right\},
\]
\[
\begin{aligned}
\mathcal{L}^\pi J(W, x, t) = \sup_{\pi\in\mathbb{R}^d}\Big\{ & WJ_W(W, x, t)\pi^\top\sigma(x, t)\lambda(x) + \frac{1}{2}J_{WW}(W, x, t)W^2\pi^\top\sigma(x, t)\sigma(x, t)^\top\pi \\
& + W\pi^\top\sigma(x, t)v(x)J_{Wx}(W, x, t)\Big\}.
\end{aligned}
\]
Analogously to the case with a one-dimensional state variable discussed in the previous section,
the (candidate) optimal consumption strategy is
\[
c^*_t = C(W^*_t, x_t, t),
\]
where
\[
C(W, x, t) = I_u\left(J_W(W, x, t)\right),
\]
so that
\[
\mathcal{L}^c J(W, x, t) = u\left(I_u(J_W(W, x, t))\right) - I_u\left(J_W(W, x, t)\right)J_W(W, x, t).
\]
Likewise, the candidate optimal portfolio is
\[
\pi^*_t = \Pi(W^*_t, x_t, t),
\]
where
\[
\Pi(W, x, t) = -\frac{J_W(W, x, t)}{WJ_{WW}(W, x, t)}\left(\sigma(x, t)^\top\right)^{-1}\lambda(x) - \left(\sigma(x, t)^\top\right)^{-1}v(x)\frac{J_{Wx}(W, x, t)}{WJ_{WW}(W, x, t)}, \tag{7.10}
\]
and
\[
\begin{aligned}
\mathcal{L}^\pi J(W, x, t) = {} & -\frac{1}{2}\frac{J_W(W, x, t)^2}{J_{WW}(W, x, t)}\|\lambda(x)\|^2 - \lambda(x)^\top v(x)\frac{J_W(W, x, t)J_{Wx}(W, x, t)}{J_{WW}(W, x, t)} \\
& - \frac{1}{2J_{WW}(W, x, t)}J_{Wx}(W, x, t)^\top v(x)^\top v(x)J_{Wx}(W, x, t).
\end{aligned}
\]
We can split up the last term of the optimal portfolio into $k$ terms, one for each element of the
state variable:
\[
\Pi(W, x, t) = -\frac{J_W(W, x, t)}{WJ_{WW}(W, x, t)}\left(\sigma(x, t)^\top\right)^{-1}\lambda(x)
- \sum_{j=1}^{k}\left(\sigma(x, t)^\top\right)^{-1}
\begin{pmatrix} v_{1j}(x) \\ v_{2j}(x) \\ \vdots \\ v_{dj}(x) \end{pmatrix}
\frac{J_{Wx_j}(W, x, t)}{WJ_{WW}(W, x, t)}.
\]
Each of the terms in the sum has the interpretation as a fund hedging changes in one element of
the state variable. Therefore, we have $(k + 2)$-fund separation: all investors are satisfied with
access to trade in the risk-free asset, the tangency portfolio, and $k$ hedge funds.
Substituting L
c
J and L

J back into the HJB equation and gathering terms, we get the second-
order PDE
J(W, x, t) = u(I
u
(J
W
(W, x, t))) J
W
(W, x, t)I
u
(J
W
(W, x, t)) +
J
t
(W, x, t)
+r(x)WJ
W
(W, x, t)
1
2
J
W
(W, x, t)
2
J
WW
(W, x, t)
k(x)k
2
+J
x
(W, x, t)
>
m(x)
+
1
2
tr
_
J
xx
(W, x, t)
x
(x)
_
(x)
>
v(x)
J
W
(W, x, t)J
Wx
(W, x, t)
J
WW
(W, x, t)

1
2J
WW
(W, x, t)
J
Wx
(W, x, t)
>
v(x)
>
v(x)J
Wx
(W, x, t).
(7.11)
92 Chapter 7. Stochastic investment opportunities: the general case
As before, the first two terms on the right-hand side are not present when the agent has no utility from intermediate consumption.
7.2.3 What risks are to be hedged?
It may appear from the analysis above that investors would want to hedge all variables affecting $r_t$, $\mu_t$, and $\sigma_t$, but this is actually not so. We will show that the only risks the agent will want to hedge are those affecting $r_t$ and $\lambda_t$.
Since $\sigma_t$ and thus $\sigma_t^\top$ are assumed to be non-singular, we can think of the investor choosing the volatility vector of wealth $\varsigma_t = \sigma_t^\top \pi_t$ directly rather than $\pi_t$. In these terms wealth evolves as
$$dW_t = W_t \left[ r_t + \varsigma_t^\top \lambda_t \right] dt - c_t\,dt + W_t \varsigma_t^\top\,dz_t.$$
The indirect utility function is
$$J_t = \sup_{(c,\varsigma)} E_t \left[ \int_t^T e^{-\delta(s-t)} u(c_s)\,ds + e^{-\delta(T-t)} u(W_T) \right].$$
Note that this optimization problem does not involve $\mu_t$ or $\sigma_t$. Assuming now that there is a variable $x_t$ so that
$$r_t = r(x_t), \qquad \lambda_t = \lambda(x_t),$$
then $J_t = J(W_t, x_t, t)$ and we can use the dynamic programming approach.
For a multidimensional $x$ we will get the optimal wealth volatility vector
$$\varsigma^*_t = -\frac{J_W(W_t,x_t,t)}{W_t J_{WW}(W_t,x_t,t)} \lambda(x_t) - v(x_t) \frac{J_{Wx}(W_t,x_t,t)}{W_t J_{WW}(W_t,x_t,t)}.$$
Hence, the optimal portfolio strategy is
$$\pi^*_t = -\frac{J_W(W_t,x_t,t)}{W_t J_{WW}(W_t,x_t,t)} \big(\sigma_t^\top\big)^{-1} \lambda(x_t) - \big(\sigma_t^\top\big)^{-1} v(x_t) \frac{J_{Wx}(W_t,x_t,t)}{W_t J_{WW}(W_t,x_t,t)}.$$
We can conclude from this analysis that the investor will only hedge the variables that affect the short-term interest rate and the market prices of risk (this is of course only true within the present framework; e.g., an investor with stochastic income will also want to hedge the income risk). Stochastic variations in $\mu_t$ and $\sigma_t$ are only interesting to the extent that they cause stochastic variations in the market price of risk! One could imagine a market where volatilities vary stochastically but expected rates of return follow the variations in volatilities so that the market price of risk is constant over time. In such a market no agent would hedge the variations in volatilities and expected rates of return. Similar observations were made by Detemple, Garcia, and Rindisbacher (2003) and Munk and Sørensen (2004). The volatility matrix $\sigma_t$ of the risky assets becomes relevant when the agent wants to find a portfolio $\pi_t$ that will generate the desired wealth volatility vector $\varsigma_t$.
In fact, the statement above can be strengthened slightly. Look at the PDE (7.11). Suppose that both $r$ and $\|\lambda\|^2$ are independent of $x$. Then the function $J(W,t)$ that satisfies the simple PDE
$$\delta J(W,t) = u\big(I_u(J_W(W,t))\big) - J_W(W,t)\, I_u\big(J_W(W,t)\big) + \frac{\partial J}{\partial t}(W,t) + r W J_W(W,t) - \frac{1}{2} \frac{J_W(W,t)^2}{J_{WW}(W,t)} \|\lambda\|^2$$
with $J(W,T) = u(W)$ will also solve the full HJB equation (7.9) as all derivatives with respect to $x$ will be zero. Consequently, the hedge term in (7.10) disappears. In other words, the investor will only hedge stochastic variations that affect the short-term interest rate $r_t$ and the squared market prices of risk¹
$$\|\lambda_t\|^2 = (\mu_t - r_t \mathbf{1})^\top \big( \sigma_t \sigma_t^\top \big)^{-1} (\mu_t - r_t \mathbf{1}).$$
Nielsen and Vassalou (2006) show that this result is also true for non-Markov dynamics of prices.
We summarize this in the following theorem:
Theorem 7.4. Investors with time-additive utility functions and no income from non-financial sources will only hedge stochastic variations in the short-term interest rate $r_t$ and in the squared market prices of risk $\|\lambda_t\|^2$.
There is a very intuitive interpretation of this result, which we can see after a few computations. The tangency portfolio is in general given by [see (6.8)]
$$\pi^{\tan}_t = \frac{1}{\mathbf{1}^\top \big(\sigma_t^\top\big)^{-1} \lambda_t} \big(\sigma_t^\top\big)^{-1} \lambda_t.$$
The expected excess rate of return on the tangency portfolio is
$$\big(\pi^{\tan}_t\big)^\top (\mu_t - r_t \mathbf{1}) = \frac{\|\lambda_t\|^2}{\mathbf{1}^\top \big(\sigma_t^\top\big)^{-1} \lambda_t}.$$
The volatility (instantaneous standard deviation) of the tangency portfolio is
$$\sqrt{ \big(\pi^{\tan}_t\big)^\top \sigma_t \sigma_t^\top \pi^{\tan}_t } = \frac{\|\lambda_t\|}{\mathbf{1}^\top \big(\sigma_t^\top\big)^{-1} \lambda_t}.$$
The slope of the instantaneous capital market line is therefore equal to $\|\lambda_t\|$. (In a setting with a single risky asset, $\lambda_t = (\mu_t - r_t)/\sigma_t$ and $\|\lambda_t\| = |\lambda_t|$.) In a static framework the optimal portfolio is determined by the position of the capital market line, i.e., (1) the intercept, which is equal to the risk-free rate of return, and (2) the slope, which is the Sharpe ratio of the tangency portfolio. It is therefore natural that investors in a dynamic framework are only concerned about the variations in these two variables.
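The three tangency-portfolio formulas can be checked numerically. The following is a minimal sketch, assuming a hypothetical two-asset market; the values of `r`, `mu`, and `sigma` are illustrative and not taken from the text. It computes $\lambda_t = \sigma_t^{-1}(\mu_t - r_t \mathbf{1})$, forms the tangency portfolio, and compares its Sharpe ratio with $\|\lambda_t\|$.

```python
import numpy as np

# Hypothetical market parameters (illustrative assumptions only).
r = 0.02                                   # short-term interest rate r_t
mu = np.array([0.06, 0.09])                # expected rates of return mu_t
sigma = np.array([[0.15, 0.00],
                  [0.06, 0.25]])           # volatility matrix sigma_t

one = np.ones(2)
lam = np.linalg.solve(sigma, mu - r * one)   # market prices of risk lambda_t
w = np.linalg.solve(sigma.T, lam)            # (sigma^T)^{-1} lambda
pi_tan = w / (one @ w)                       # tangency portfolio weights

excess = pi_tan @ (mu - r * one)                    # expected excess return
vol = np.sqrt(pi_tan @ sigma @ sigma.T @ pi_tan)    # instantaneous volatility
sharpe = excess / vol
lam_norm = np.linalg.norm(lam)

print(pi_tan, sharpe, lam_norm)
```

In this example $\mathbf{1}^\top(\sigma_t^\top)^{-1}\lambda_t$ is positive, so the Sharpe ratio of the tangency portfolio coincides with $\|\lambda_t\|$, the slope of the instantaneous capital market line.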
7.3 CRRA utility
In this section we assume that the investor has time-additive expected CRRA utility with a constant relative risk aversion $\gamma > 1$. The case $\gamma = 1$ that corresponds to logarithmic utility has to be analyzed separately (see Section 7.4). However, it turns out that when $\gamma$ is put equal to 1 in the optimal strategies derived for $\gamma > 1$ we obtain the optimal strategies derived for logarithmic utility.
7.3.1 One-dimensional state variable
Consider the indirect utility function with CRRA utility:
$$J(W,x,t) = \sup_{(c_s,\pi_s)_{s \in [t,T]}} E_{W,x,t} \left[ \varepsilon_1 \int_t^T e^{-\delta(s-t)} \frac{c_s^{1-\gamma}}{1-\gamma}\,ds + \varepsilon_2 e^{-\delta(T-t)} \frac{W_T^{1-\gamma}}{1-\gamma} \right],$$
where $\varepsilon_1$ and $\varepsilon_2$ are greater than or equal to zero with at least one of them being non-zero. We set up a conjecture for the form of $J$ using the same arguments as we did in the case of constant investment opportunities. Due to the linearity of the wealth dynamics it seems reasonable to guess that if the strategy $(c^*, \pi^*)$ is optimal with time $t$ wealth $W$ and state $x$ and the corresponding wealth process $W^*$, then the strategy $(kc^*, \pi^*)$ will be optimal with time $t$ wealth $kW$ and state $x$ and the corresponding wealth process $kW^*$. If this is true, then
$$\begin{aligned} J(kW,x,t) &= E_t \left[ \varepsilon_1 \int_t^T e^{-\delta(s-t)} \frac{(k c^*_s)^{1-\gamma}}{1-\gamma}\,ds + \varepsilon_2 e^{-\delta(T-t)} \frac{(k W^*_T)^{1-\gamma}}{1-\gamma} \right] \\ &= k^{1-\gamma} E_t \left[ \varepsilon_1 \int_t^T e^{-\delta(s-t)} \frac{(c^*_s)^{1-\gamma}}{1-\gamma}\,ds + \varepsilon_2 e^{-\delta(T-t)} \frac{(W^*_T)^{1-\gamma}}{1-\gamma} \right] \\ &= k^{1-\gamma} J(W,x,t), \end{aligned}$$
i.e., the indirect utility function is homogeneous of degree $1-\gamma$ in the wealth level. Inserting $k = 1/W$ and rearranging, we get
$$J(W,x,t) = \frac{1}{1-\gamma}\, g(x,t)^\gamma W^{1-\gamma},$$
where $g(x,t)^\gamma = (1-\gamma) J(1,x,t)$. From the terminal condition $J(W,x,T) = \varepsilon_2 W^{1-\gamma}/(1-\gamma)$, we have that $g(x,T)^\gamma = \varepsilon_2$.

¹ Examples where $\|\lambda\|^2$ is constant, but $\lambda$ itself is not, can be given [see Nielsen and Vassalou (2006)], but seem rather contrived.
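The relevant derivatives of $J$ are
$$\begin{aligned} J_W(W,x,t) &= g(x,t)^\gamma W^{-\gamma}, \\ J_{WW}(W,x,t) &= -\gamma\, g(x,t)^\gamma W^{-\gamma-1}, \\ J_x(W,x,t) &= \frac{\gamma}{1-\gamma}\, g(x,t)^{\gamma-1} g_x(x,t)\, W^{1-\gamma}, \\ J_{xx}(W,x,t) &= -\gamma\, g(x,t)^{\gamma-2} g_x(x,t)^2 W^{1-\gamma} + \frac{\gamma}{1-\gamma}\, g(x,t)^{\gamma-1} g_{xx}(x,t)\, W^{1-\gamma}, \\ J_{Wx}(W,x,t) &= \gamma\, g(x,t)^{\gamma-1} g_x(x,t)\, W^{-\gamma}, \\ \frac{\partial J}{\partial t}(W,x,t) &= \frac{\gamma}{1-\gamma}\, g(x,t)^{\gamma-1} \frac{\partial g}{\partial t}(x,t)\, W^{1-\gamma}. \end{aligned}$$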
Substituting into (7.7), the optimal investment strategy becomes
$$\Pi(W,x,t) = \frac{1}{\gamma} \big(\sigma(x,t)^\top\big)^{-1} \lambda(x) + \frac{g_x(x,t)}{g(x,t)} \big(\sigma(x,t)^\top\big)^{-1} v(x), \tag{7.12}$$
and from (7.5) the optimal consumption strategy becomes
$$C(W,x,t) = \varepsilon_1^{1/\gamma} \frac{W}{g(x,t)},$$
which, of course, is zero if the investor obtains no utility from intermediate consumption. It is optimal to consume a time- and state-dependent fraction of wealth. The optimal fractions of wealth allocated to the various risky assets are independent of the level of wealth, but depend on the state and time.
Note again the close link between the optimal consumption strategy and the intertemporal hedging term in the optimal investment strategy. With intermediate consumption the function $g(x,t)$ is the optimal wealth-to-consumption ratio. By Itô's Lemma, the dynamics will be
$$dg(x_t,t) = g(x_t,t) \left[ \dots\,dt + \frac{g_x(x_t,t)}{g(x_t,t)} v(x_t)^\top dz_t + \frac{g_x(x_t,t)}{g(x_t,t)} \hat v(x_t)\,d\hat z_t \right].$$
The dynamics of the value of a given portfolio is
$$dV^\pi_t = V^\pi_t \left[ \left( r(x_t) + \pi_t^\top \sigma(x_t,t) \lambda(x_t) \right) dt + \pi_t^\top \sigma(x_t,t)\,dz_t \right].$$
We see that the hedge portfolio is matching the sensitivity of the optimal wealth-to-consumption ratio with respect to the hedgeable shocks represented by $dz_t$.
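Inserting the derivatives above into (7.9) and simplifying, we get that $g(x,t)$ must solve the PDE
$$\begin{aligned} 0 ={}& \varepsilon_1^{1/\gamma} - \left( \frac{\delta}{\gamma} + \frac{\gamma-1}{\gamma} r(x) + \frac{\gamma-1}{2\gamma^2} \|\lambda(x)\|^2 \right) g(x,t) + \left( m(x) - \frac{\gamma-1}{\gamma} \lambda(x)^\top v(x) \right) g_x(x,t) \\ & + \frac{\partial g}{\partial t}(x,t) + \frac{1}{2} g_{xx}(x,t) \Sigma_x(x) + \frac{\gamma-1}{2} \hat v(x)^2 \frac{g_x(x,t)^2}{g(x,t)} \end{aligned} \tag{7.13}$$
with the terminal condition $g(x,T) = \varepsilon_2^{1/\gamma}$. In the case with no intermediate consumption we have $\varepsilon_1 = 0$, and we can without loss of generality assume $\varepsilon_2 = 1$ and $\delta = 0$. If we write
$$g(x,t) \equiv g(x,t;T) = \exp\left\{ -\frac{\gamma-1}{\gamma} H(x,T-t) \right\},$$
then $H(x,\tau)$ has to solve the simpler PDE
$$\begin{aligned} 0 ={}& r(x) + \frac{1}{2\gamma} \|\lambda(x)\|^2 - \frac{\partial H}{\partial \tau}(x,\tau) + \left( m(x) - \frac{\gamma-1}{\gamma} \lambda(x)^\top v(x) \right) H_x(x,\tau) \\ & + \frac{1}{2} \Sigma_x(x) H_{xx}(x,\tau) - \frac{\gamma-1}{2\gamma} \left( \Sigma_x(x) + (\gamma-1) \hat v(x)^2 \right) H_x(x,\tau)^2 \end{aligned} \tag{7.14}$$
with the condition $H(x,0) = 0$.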
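Theorem 7.5. With CRRA utility of terminal wealth only ($\varepsilon_1 = 0$, $\varepsilon_2 = 1$, $\delta = 0$), the indirect utility function is
$$J(W,x,t) = \frac{1}{1-\gamma}\, e^{-(\gamma-1) H(x,T-t)} W^{1-\gamma} = \frac{1}{1-\gamma} \left( W e^{H(x,T-t)} \right)^{1-\gamma},$$
and the optimal investment strategy in (7.12) can be rewritten as
$$\Pi(W,x,t) = \frac{1}{\gamma} \big(\sigma(x,t)^\top\big)^{-1} \lambda(x) - \frac{\gamma-1}{\gamma} H_x(x,T-t) \big(\sigma(x,t)^\top\big)^{-1} v(x),$$
where $H(x,\tau)$ solves the PDE (7.14) with initial condition $H(x,0) = 0$.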
When the market is complete so that $\hat v(x) \equiv 0$, the next theorem shows that the solution to the utility maximization problem with intermediate consumption follows from the solution to the problem of maximizing utility of wealth at a single point in time. The proof is left for Exercise 7.1.
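Theorem 7.6. Let $\tilde H(x,\tau)$ be the solution to the PDE
$$\begin{aligned} 0 ={}& r(x) + \frac{1}{2\gamma} \|\lambda(x)\|^2 - \frac{\partial \tilde H}{\partial \tau}(x,\tau) + \left( m(x) - \frac{\gamma-1}{\gamma} \lambda(x)^\top v(x) \right) \tilde H_x(x,\tau) \\ & + \frac{1}{2} \Sigma_x(x) \tilde H_{xx}(x,\tau) - \frac{\gamma-1}{2\gamma} \Sigma_x(x) \tilde H_x(x,\tau)^2 \end{aligned} \tag{7.15}$$
with initial condition $\tilde H(x,0) = 0$. Define
$$g(x,t;s) = \exp\left\{ -\frac{\delta}{\gamma}(s-t) - \frac{\gamma-1}{\gamma} \tilde H(x,s-t) \right\}.$$
Then the solution to the PDE (7.13) with $\hat v(x) \equiv 0$ is
$$g(x,t) = \varepsilon_1^{1/\gamma} \int_t^T g(x,t;s)\,ds + \varepsilon_2^{1/\gamma} g(x,t;T).$$
In a complete market ($\hat v(x) \equiv 0$), the maximization of CRRA utility of intermediate consumption and/or terminal wealth leads to the indirect utility $J(W,x,t) = \frac{1}{1-\gamma} g(x,t)^\gamma W^{1-\gamma}$, the optimal consumption strategy is
$$C(W,x,t) = \varepsilon_1^{1/\gamma} \frac{W}{g(x,t)} = \left( \int_t^T g(x,t;s)\,ds + \left( \frac{\varepsilon_2}{\varepsilon_1} \right)^{1/\gamma} g(x,t;T) \right)^{-1} W,$$
and the optimal investment strategy is
$$\Pi(W,x,t) = \frac{1}{\gamma} \big(\sigma(x,t)^\top\big)^{-1} \lambda(x) - \frac{\gamma-1}{\gamma} D(x,t,T) \big(\sigma(x,t)^\top\big)^{-1} v(x),$$
where
$$D(x,t,T) = \frac{ \int_t^T \tilde H_x(x,s-t)\, g(x,t;s)\,ds + (\varepsilon_2/\varepsilon_1)^{1/\gamma} \tilde H_x(x,T-t)\, g(x,t;T) }{ \int_t^T g(x,t;s)\,ds + (\varepsilon_2/\varepsilon_1)^{1/\gamma} g(x,t;T) }.$$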
The solution for the case with utility of intermediate consumption is thus obtained by simply integrating up the solution for the case with utility of wealth at each of the fixed time horizons over the remaining life-time $[t,T]$. In any specific case with complete markets, the key challenge is therefore to solve the PDE (7.15).
The PDE (7.14), and thus the special case (7.15), has a nice solution in a large class of interesting models as we will show below. This leads to closed-form solutions to the power utility maximization problem with terminal wealth only and, if the market is complete, with intermediate consumption. If the market is incomplete, the power utility maximization problem with intermediate consumption is generally intractable, but the PDE (7.13) can be solved numerically.
7.3.2 Affine models
In this and the following subsection we will look at models in which the optimal portfolio and consumption strategies of a CRRA investor can be derived in closed form. In some of these cases we can obtain explicit solutions; in other cases the solution involves time-dependent functions that can be found by numerically solving ordinary differential equations. Many of our concrete examples in the following chapters are special cases of these models. In this section we will discuss so-called affine models, while the next section focuses on the so-called quadratic models. The results presented are similar to those obtained by Liu (1999, 2007). For notational simplicity we shall assume that the state variable is one-dimensional with dynamics given by (7.1). We will briefly discuss solutions to problems with a multi-dimensional state variable in Section 7.3.4.
As explained above, the key is to solve the PDE (7.14) for $H(x,\tau)$ with the initial condition $H(x,0) = 0$. Let us consider when we can find a solution of the affine form
$$H(x,\tau) = A_0(\tau) + A_1(\tau) x,$$
where $A_0$ and $A_1$ are real-valued deterministic functions that have to satisfy $A_0(0) = A_1(0) = 0$ to meet the initial condition. Substituting into (7.14), we find
$$\begin{aligned} 0 ={}& r(x) + \frac{1}{2\gamma} \|\lambda(x)\|^2 - A_0'(\tau) - A_1'(\tau) x + \left( m(x) - \frac{\gamma-1}{\gamma} \lambda(x)^\top v(x) \right) A_1(\tau) \\ & - \frac{\gamma-1}{2\gamma} \left( \Sigma_x(x) + (\gamma-1) \hat v(x)^2 \right) A_1(\tau)^2. \end{aligned} \tag{7.16}$$
If $r(x)$, $\|\lambda(x)\|^2$, $m(x)$, $v(x)^\top \lambda(x)$, $\|v(x)\|^2$, and $\hat v(x)^2$ are all affine functions² of $x$, then we can find two ordinary differential equations for $A_0$ and $A_1$. In order to see this, suppose that
$$r(x) = r_0 + r_1 x, \tag{7.17}$$
$$m(x) = m_0 + m_1 x, \tag{7.18}$$
$$\hat v(x) = \sqrt{\hat v_0 + \hat v_1 x}$$
for some constants $r_0$, $r_1$, $m_0$, $m_1$, $\hat v_0$, and $\hat v_1$. Of course, we should have that $\hat v_0 + \hat v_1 x \ge 0$ for all possible values of $x$, which is easily satisfied if either $\hat v_0$ or $\hat v_1$ is zero and the other parameter is positive. The term $\|\lambda(x)\|^2$ will be affine in $x$ if each element of the vector $\lambda(x) = (\lambda_1(x), \dots, \lambda_d(x))^\top$ is of the form $\lambda_i(x) = \sqrt{\lambda_{i0} + \lambda_{i1} x}$ since then
$$\|\lambda(x)\|^2 = \sum_{i=1}^d \lambda_i(x)^2 = \sum_{i=1}^d (\lambda_{i0} + \lambda_{i1} x) = \left( \sum_{i=1}^d \lambda_{i0} \right) + \left( \sum_{i=1}^d \lambda_{i1} \right) x \equiv \Lambda_0 + \Lambda_1 x. \tag{7.19}$$
Similarly, the term $\|v(x)\|^2$ will be affine in $x$ if each element of the vector $v(x) = (v_1(x), \dots, v_d(x))^\top$ is of the form $v_i(x) = \sqrt{v_{i0} + v_{i1} x}$. Then we have
$$\|v(x)\|^2 = \sum_{i=1}^d v_i(x)^2 = \sum_{i=1}^d (v_{i0} + v_{i1} x) = \left( \sum_{i=1}^d v_{i0} \right) + \left( \sum_{i=1}^d v_{i1} \right) x \equiv V_0 + V_1 x. \tag{7.20}$$
In addition, we must have that $v(x)^\top \lambda(x)$ is affine in $x$. With the specifications of $\lambda(x)$ and $v(x)$ just given, we have
$$v(x)^\top \lambda(x) = \sum_{i=1}^d v_i(x) \lambda_i(x) = \sum_{i=1}^d \sqrt{ (v_{i0} + v_{i1} x)(\lambda_{i0} + \lambda_{i1} x) }.$$
This will only be affine in $x$ if, for each $i$, we have either
(i) $v_{i0} = \lambda_{i0} = 0$, or
(ii) $v_{i1} = \lambda_{i1} = 0$, or
(iii) $v_{i0} = \lambda_{i0}$ and $v_{i1} = \lambda_{i1}$.
To encompass all possible situations let us write
$$v(x)^\top \lambda(x) = K_0 + K_1 x, \tag{7.21}$$
where $K_0$ and $K_1$ are real-valued parameters. If we substitute (7.17)–(7.21) into (7.16) and use the fact that (7.16) must hold for all values of $x$ and all $\tau$, we obtain a system of two ordinary differential equations for $A_0$ and $A_1$:
$$A_0'(\tau) = r_0 + \frac{\Lambda_0}{2\gamma} + \left( m_0 - \frac{\gamma-1}{\gamma} K_0 \right) A_1(\tau) - \frac{\gamma-1}{2\gamma} (V_0 + \gamma \hat v_0) A_1(\tau)^2,$$
$$A_1'(\tau) = r_1 + \frac{\Lambda_1}{2\gamma} + \left( m_1 - \frac{\gamma-1}{\gamma} K_1 \right) A_1(\tau) - \frac{\gamma-1}{2\gamma} (V_1 + \gamma \hat v_1) A_1(\tau)^2. \tag{7.22}$$
These equations are to be solved with the initial conditions $A_0(0) = A_1(0) = 0$.

² A real-valued function is said to be an affine function of the $k$-vector $x$ if it can be written as $a_1 + a_2^\top x$, where $a_1$ is a constant scalar and $a_2$ is a constant $k$-vector (possibly zero so that a constant is also included in the set of affine functions). A vector- or matrix-valued function is said to be affine if all its elements are affine.
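First (7.22) is solved for $A_1(\tau)$. From Theorem C.2, we can make the following conclusion. Suppose that
$$\left( m_1 - \frac{\gamma-1}{\gamma} K_1 \right)^2 + 2\, \frac{\gamma-1}{\gamma} \left( r_1 + \frac{\Lambda_1}{2\gamma} \right) (V_1 + \gamma \hat v_1) > 0 \tag{7.23}$$
and define
$$\nu = \sqrt{ \left( m_1 - \frac{\gamma-1}{\gamma} K_1 \right)^2 + 2\, \frac{\gamma-1}{\gamma} \left( r_1 + \frac{\Lambda_1}{2\gamma} \right) (V_1 + \gamma \hat v_1) }.$$
Then the solution to (7.22) with $A_1(0) = 0$ is
$$A_1(\tau) = \frac{ 2 \left( r_1 + \frac{\Lambda_1}{2\gamma} \right) (e^{\nu\tau} - 1) }{ \left( \nu + \frac{\gamma-1}{\gamma} K_1 - m_1 \right) (e^{\nu\tau} - 1) + 2\nu }. \tag{7.24}$$
Since $A_0(\tau) = A_0(\tau) - A_0(0) = \int_0^\tau A_0'(s)\,ds$, we can afterwards compute $A_0(\tau)$ as
$$A_0(\tau) = \left( r_0 + \frac{\Lambda_0}{2\gamma} \right) \tau + \left( m_0 - \frac{\gamma-1}{\gamma} K_0 \right) \int_0^\tau A_1(s)\,ds - \frac{\gamma-1}{2\gamma} (V_0 + \gamma \hat v_0) \int_0^\tau A_1(s)^2\,ds. \tag{7.25}$$
Also from Theorem C.2, we have that
$$\int_0^\tau A_1(s)\,ds = -\frac{2\gamma}{(\gamma-1)(V_1 + \gamma \hat v_1)} \left[ \frac{1}{2} \left( \nu + \frac{\gamma-1}{\gamma} K_1 - m_1 \right) \tau + \ln \left( \frac{2\nu}{ \left( \nu + \frac{\gamma-1}{\gamma} K_1 - m_1 \right) (e^{\nu\tau} - 1) + 2\nu } \right) \right]$$
and
$$\int_0^\tau A_1(s)^2\,ds = \text{- ugly expression to be filled in -}$$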
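As a sanity check on (7.24) and on the expression for $\int_0^\tau A_1(s)\,ds$, the Riccati equation (7.22) can be integrated numerically and compared with the closed forms. The sketch below uses illustrative parameter values that are assumptions, not values from the text, and a hand-written classical Runge-Kutta scheme; the shorthand names `a`, `beta`, and `c` are introduced here only for readability.

```python
import numpy as np

# Illustrative parameters (assumptions chosen so that condition (7.23) holds).
gamma = 4.0
r1, Lam1 = 0.5, 0.8        # slopes of r(x) and ||lambda(x)||^2
m1, K1 = -0.3, 0.1         # slopes of m(x) and v(x)^T lambda(x)
V1, vhat1 = 0.04, 0.01     # slopes of ||v(x)||^2 and vhat(x)^2

a = r1 + Lam1 / (2 * gamma)                     # constant term of (7.22)
beta = m1 - (gamma - 1) / gamma * K1            # linear coefficient
c = (gamma - 1) / gamma * (V1 + gamma * vhat1)  # twice the quadratic coefficient
nu = np.sqrt(beta**2 + 2 * a * c)

def A1_closed(tau):
    """Closed-form solution (7.24)."""
    e = np.expm1(nu * tau)
    return 2 * a * e / ((nu - beta) * e + 2 * nu)

def int_A1_closed(tau):
    """Closed-form integral of A1 over [0, tau]."""
    e = np.expm1(nu * tau)
    return -(2 / c) * (0.5 * (nu - beta) * tau
                       + np.log(2 * nu / ((nu - beta) * e + 2 * nu)))

def rk4(tau, n=2000):
    """Integrate y = (A1, integral of A1) from 0 to tau with classical RK4."""
    f = lambda y: np.array([a + beta * y[0] - 0.5 * c * y[0] ** 2, y[0]])
    y, h = np.zeros(2), tau / n
    for _ in range(n):
        k1 = f(y)
        k2 = f(y + 0.5 * h * k1)
        k3 = f(y + 0.5 * h * k2)
        k4 = f(y + h * k3)
        y = y + h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
    return y

A1_num, int_num = rk4(5.0)
print(A1_num, A1_closed(5.0))
print(int_num, int_A1_closed(5.0))
```

The same numerical integration also gives $\int_0^\tau A_1(s)^2\,ds$, and hence $A_0(\tau)$ in (7.25), by adding one more component to the state vector.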
Combining these findings with Theorem 7.5, we arrive at the following conclusion:³
Theorem 7.7. Assume that $r(x)$, $\|\lambda(x)\|^2$, $m(x)$, $v(x)^\top \lambda(x)$, $\|v(x)\|^2$, and $\hat v(x)^2$ are all affine functions of $x$ and given by (7.17)–(7.21), and that the parameter condition (7.23) holds. For an investor with CRRA utility from terminal wealth only, the indirect utility function is then given by
$$J(W,x,t) = \frac{1}{1-\gamma} \left( W e^{A_0(T-t) + A_1(T-t) x} \right)^{1-\gamma},$$
where $A_1$ is given by (7.24) and $A_0$ is given by (7.25). The optimal investment strategy is given by
$$\Pi(W,x,t) = \frac{1}{\gamma} \big(\sigma(x,t)^\top\big)^{-1} \lambda(x) - \frac{\gamma-1}{\gamma} \big(\sigma(x,t)^\top\big)^{-1} v(x) A_1(T-t).$$
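In some important special cases, $A_0$ and $A_1$ simplify considerably. For example, if $V_1 + \gamma \hat v_1 = 0$ so that the second-order term in (7.22) vanishes, then (again see Theorem C.2), assuming $m_1 - \frac{\gamma-1}{\gamma} K_1 < 0$,
$$\nu = \left| m_1 - \frac{\gamma-1}{\gamma} K_1 \right| = \frac{\gamma-1}{\gamma} K_1 - m_1,$$
so that $A_1(\tau)$ reduces to
$$A_1(\tau) = \frac{ r_1 + \frac{\Lambda_1}{2\gamma} }{ \nu } \left( 1 - e^{-\nu\tau} \right).$$
In this case the integrals in (7.25) are also relatively simple:
$$\int_0^\tau A_1(u)\,du = \frac{1}{ m_1 - \frac{\gamma-1}{\gamma} K_1 } \left[ A_1(\tau) - \left( r_1 + \frac{\Lambda_1}{2\gamma} \right) \tau \right],$$
$$\int_0^\tau A_1(u)^2\,du = \left( \frac{ r_1 + \frac{\Lambda_1}{2\gamma} }{ m_1 - \frac{\gamma-1}{\gamma} K_1 } \right)^2 \left[ \tau - \frac{ A_1(\tau) }{ r_1 + \frac{\Lambda_1}{2\gamma} } + \frac{ \left( m_1 - \frac{\gamma-1}{\gamma} K_1 \right) A_1(\tau)^2 }{ 2 \left( r_1 + \frac{\Lambda_1}{2\gamma} \right)^2 } \right].$$
This special case is relevant in Chapter 10.

³ Note the close connection between the analysis above and the analysis for so-called affine models of the term structure of interest rates, see e.g., Duffie and Kan (1996), Dai and Singleton (2000), or Munk (2011).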
For the problem with utility of intermediate consumption, we can provide a solution for the complete markets case ($\hat v(x) \equiv 0$) by combining the above computations with Theorem 7.6. The only difference in the relevant ODEs, and thus in their solutions, is that we have to impose the restriction $\hat v_0 = \hat v_1 = 0$ because of the complete market assumption.
Theorem 7.8. Assume a complete financial market ($\hat v(x) \equiv 0$) in which $r(x)$, $\|\lambda(x)\|^2$, $m(x)$, $v(x)^\top \lambda(x)$, and $\|v(x)\|^2$ are all affine functions of $x$ and given by (7.17), (7.18), (7.19), (7.20), and (7.21). Imposing the restriction $\hat v_0 = \hat v_1 = 0$, assume that the parameter condition (7.23) holds, and let $A_1$ and $A_0$ be given by (7.24) and (7.25). Define
$$g(x,t;s) = \exp\left\{ -\frac{\delta}{\gamma}(s-t) - \frac{\gamma-1}{\gamma} \big( A_0(s-t) + A_1(s-t) x \big) \right\}.$$
For an investor with CRRA utility from intermediate consumption and possibly terminal wealth, the indirect utility function is then given by
$$J(W,x,t) = \frac{1}{1-\gamma} \left( \varepsilon_1^{1/\gamma} \int_t^T g(x,t;s)\,ds + \varepsilon_2^{1/\gamma} g(x,t;T) \right)^{\gamma} W^{1-\gamma},$$
the optimal consumption strategy is
$$C(W,x,t) = \left( \int_t^T g(x,t;s)\,ds + \left( \frac{\varepsilon_2}{\varepsilon_1} \right)^{1/\gamma} g(x,t;T) \right)^{-1} W,$$
and the optimal investment strategy is
$$\Pi(W,x,t) = \frac{1}{\gamma} \big(\sigma(x,t)^\top\big)^{-1} \lambda(x) - \frac{\gamma-1}{\gamma} D(x,t,T) \big(\sigma(x,t)^\top\big)^{-1} v(x),$$
where
$$D(x,t,T) = \frac{ \int_t^T A_1(s-t)\, g(x,t;s)\,ds + (\varepsilon_2/\varepsilon_1)^{1/\gamma} A_1(T-t)\, g(x,t;T) }{ \int_t^T g(x,t;s)\,ds + (\varepsilon_2/\varepsilon_1)^{1/\gamma} g(x,t;T) }.$$
Assuming for simplicity that $\varepsilon_2 = 0$ and $\varepsilon_1 = 1$, we can rewrite the ratio in the hedge term of the optimal investment strategy as
$$\frac{ \int_t^T A_1(s-t)\, g(x,t;s)\,ds }{ \int_t^T g(x,t;s)\,ds } = \int_t^T w(x,s-t) A_1(s-t)\,ds,$$
where we have defined $w(x,s-t) = g(x,t;s) / \int_t^T g(x,t;s)\,ds$. Since $w(x,s-t) > 0$ and $\int_t^T w(x,s-t)\,ds = 1$, we may interpret the hedging demand of an investor with utility of consumption and a time horizon of $T$ as a weighted average of the hedging demands of investors with time horizons of $s \in [t,T]$ and utility of terminal wealth only. If $A_1$ is either monotonically increasing or decreasing (as will be the case in many concrete settings), there will exist a $T^* \in [t,T]$ such that
$$\int_t^T w(x,s-t) A_1(s-t)\,ds = A_1(T^* - t),$$
in which case we can represent the hedging demand as $-\frac{\gamma-1}{\gamma} \big(\sigma(x,t)^\top\big)^{-1} v(x) A_1(T^* - t)$. Since this is exactly the hedging demand of an investor with time horizon $T^*$ and utility of terminal wealth only, we may interpret $T^*$ as the effective time horizon of the investor with time horizon $T$ and utility of consumption. Note the similarity to the concept of duration for fixed-income securities, cf. Munk (2011).
7.3.3 Quadratic models
The assumptions of the affine models cover some interesting settings, but not all. In this section we shall see that under another set of assumptions on the market parameter functions $r$, $m$, $v$, $\lambda$, and $\hat v$, we obtain an exponential-quadratic expression for the function $g(x,t)$. In Chapter 11, we will study an important example which is covered by these assumptions.
As before, the key is to solve the PDE (7.14) for $H(x,\tau)$ with the initial condition $H(x,0) = 0$. Let us consider when we can find a solution of the quadratic form
$$H(x,\tau) = A_0(\tau) + A_1(\tau) x + \frac{1}{2} A_2(\tau) x^2,$$
where $A_0$, $A_1$, and $A_2$ are real-valued deterministic functions that have to satisfy $A_0(0) = A_1(0) = A_2(0) = 0$ to ensure that $H(x,0) = 0$ for all $x$. Substituting the relevant derivatives into (7.14), we arrive at
$$\begin{aligned} 0 ={}& r(x) + \frac{1}{2\gamma} \|\lambda(x)\|^2 + \left( m(x) - \frac{\gamma-1}{\gamma} v(x)^\top \lambda(x) \right) \big( A_1(\tau) + A_2(\tau) x \big) \\ & - A_0'(\tau) - A_1'(\tau) x - \frac{1}{2} A_2'(\tau) x^2 + \frac{1}{2} \left( \|v(x)\|^2 + \hat v(x)^2 \right) A_2(\tau) \\ & - \frac{\gamma-1}{2\gamma} \left( \|v(x)\|^2 + \gamma \hat v(x)^2 \right) \big( A_1(\tau) + A_2(\tau) x \big)^2. \end{aligned} \tag{7.26}$$
To ensure that we only have powers of $x$ of order zero, one, and two, we can allow (i) $r(x)$ and $\|\lambda(x)\|^2$ to be quadratic⁴ in $x$, (ii) $m(x)$ and $v(x)^\top \lambda(x)$ to be affine in $x$, while (iii) $\|v(x)\|^2$ and $\hat v(x)^2$ have to be constant. Therefore, write $v(x) = v = (v_1, \dots, v_d)^\top$, $\hat v(x) = \hat v$, and
$$r(x) = r_0 + r_1 x + r_2 x^2, \tag{7.27}$$
$$m(x) = m_0 + m_1 x, \tag{7.28}$$
$$\lambda_i(x) = \lambda_{i0} + \lambda_{i1} x \tag{7.29}$$
for some constants $r_0$, $r_1$, $r_2$, $m_0$, $m_1$, $\lambda_{i0}$, $\lambda_{i1}$. Consequently,
$$\|\lambda(x)\|^2 = \sum_{i=1}^d \lambda_i(x)^2 = \left( \sum_{i=1}^d \lambda_{i0}^2 \right) + 2 \left( \sum_{i=1}^d \lambda_{i0} \lambda_{i1} \right) x + \left( \sum_{i=1}^d \lambda_{i1}^2 \right) x^2 \equiv \Lambda_0 + \Lambda_1 x + \Lambda_2 x^2, \tag{7.30}$$
$$v(x)^\top \lambda(x) = \sum_{i=1}^d v_i \lambda_i(x) = \left( \sum_{i=1}^d v_i \lambda_{i0} \right) + \left( \sum_{i=1}^d v_i \lambda_{i1} \right) x \equiv K_0 + K_1 x. \tag{7.31}$$

⁴ A real-valued function is said to be a quadratic function of the $k$-vector $x$ if it can be written as $a_1 + a_2^\top x + x^\top a_3 x$, where $a_1$ is a constant scalar, $a_2$ is a constant $k$-vector, and $a_3$ is a constant $(k \times k)$-matrix (either $a_2$ or $a_3$ or both can be zero so that a constant and an affine function are also considered quadratic). A vector- or matrix-valued function is said to be quadratic if all its elements are quadratic.
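If we substitute (7.27)–(7.31) into (7.26) and use the fact that (7.26) must hold for all values of $x$ and all $\tau$, we obtain a system of three ordinary differential equations for $A_0$, $A_1$, and $A_2$:
$$A_0'(\tau) = r_0 + \frac{\Lambda_0}{2\gamma} + \left( m_0 - \frac{\gamma-1}{\gamma} K_0 \right) A_1(\tau) + \frac{1}{2} \left( \|v\|^2 + \hat v^2 \right) A_2(\tau) - \frac{\gamma-1}{2\gamma} \left( \|v\|^2 + \gamma \hat v^2 \right) A_1(\tau)^2, \tag{7.32}$$
$$A_1'(\tau) = r_1 + \frac{\Lambda_1}{2\gamma} + \left( m_0 - \frac{\gamma-1}{\gamma} K_0 \right) A_2(\tau) + \left( m_1 - \frac{\gamma-1}{\gamma} K_1 - \frac{\gamma-1}{\gamma} \left( \|v\|^2 + \gamma \hat v^2 \right) A_2(\tau) \right) A_1(\tau), \tag{7.33}$$
$$A_2'(\tau) = 2 r_2 + \frac{\Lambda_2}{\gamma} + 2 \left( m_1 - \frac{\gamma-1}{\gamma} K_1 \right) A_2(\tau) - \frac{\gamma-1}{\gamma} \left( \|v\|^2 + \gamma \hat v^2 \right) A_2(\tau)^2. \tag{7.34}$$
These equations are to be solved with the initial conditions $A_0(0) = A_1(0) = A_2(0) = 0$.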
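The equations (7.33) and (7.34) can be solved using Theorem C.3. Suppose that
$$\left( m_1 - \frac{\gamma-1}{\gamma} K_1 \right)^2 + \frac{\gamma-1}{\gamma} \left( 2 r_2 + \frac{\Lambda_2}{\gamma} \right) \left( \|v\|^2 + \gamma \hat v^2 \right) > 0 \tag{7.35}$$
and define
$$\nu = 2 \sqrt{ \left( m_1 - \frac{\gamma-1}{\gamma} K_1 \right)^2 + \frac{\gamma-1}{\gamma} \left( 2 r_2 + \frac{\Lambda_2}{\gamma} \right) \left( \|v\|^2 + \gamma \hat v^2 \right) }.$$
Then the solution to (7.34) with $A_2(0) = 0$ is
$$A_2(\tau) = \frac{ 2 \left( 2 r_2 + \frac{\Lambda_2}{\gamma} \right) (e^{\nu\tau} - 1) }{ \left( \nu + 2 \frac{\gamma-1}{\gamma} K_1 - 2 m_1 \right) (e^{\nu\tau} - 1) + 2\nu }. \tag{7.36}$$
The solution to (7.33) with $A_1(0) = 0$ is
$$A_1(\tau) = \frac{ r_1 + \frac{\Lambda_1}{2\gamma} }{ 2 r_2 + \frac{\Lambda_2}{\gamma} } A_2(\tau) + \frac{4q}{\nu} \frac{ \left( e^{\nu\tau/2} - 1 \right)^2 }{ \left( \nu + 2 \frac{\gamma-1}{\gamma} K_1 - 2 m_1 \right) (e^{\nu\tau} - 1) + 2\nu }, \tag{7.37}$$
where
$$q = \left( m_0 - \frac{\gamma-1}{\gamma} K_0 \right) \left( 2 r_2 + \frac{\Lambda_2}{\gamma} \right) - \left( m_1 - \frac{\gamma-1}{\gamma} K_1 \right) \left( r_1 + \frac{\Lambda_1}{2\gamma} \right).$$
Finally, we can compute $A_0(\tau)$ by integrating up (7.32):
$$\begin{aligned} A_0(\tau) ={}& \left( r_0 + \frac{\Lambda_0}{2\gamma} \right) \tau + \left( m_0 - \frac{\gamma-1}{\gamma} K_0 \right) \int_0^\tau A_1(s)\,ds \\ & + \frac{1}{2} \left( \|v\|^2 + \hat v^2 \right) \int_0^\tau A_2(s)\,ds - \frac{\gamma-1}{2\gamma} \left( \|v\|^2 + \gamma \hat v^2 \right) \int_0^\tau A_1(s)^2\,ds. \end{aligned} \tag{7.38}$$
These integrals can be calculated explicitly and are generally quite complex, but simplify somewhat in relevant special cases.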
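The closed forms (7.36) and (7.37) can be cross-checked against a direct numerical integration of the ODEs (7.33) and (7.34). The following sketch uses made-up parameter values (assumptions for illustration only, chosen so that (7.35) holds) and a hand-written classical Runge-Kutta scheme; the shorthand names `a1`, `a2`, `b0`, `b1`, and `Q` are introduced here and do not appear in the text.

```python
import numpy as np

# Illustrative parameters (assumptions, not taken from the text).
gamma = 4.0
r1, r2 = 0.10, 0.05
Lam1, Lam2 = 0.20, 0.30
m0, m1 = 0.02, -0.40
K0, K1 = 0.01, 0.05
v_norm2, vhat2 = 0.03, 0.01          # ||v||^2 and vhat^2

kappa = (gamma - 1) / gamma
a1 = r1 + Lam1 / (2 * gamma)
a2 = 2 * r2 + Lam2 / gamma
b0 = m0 - kappa * K0
b1 = m1 - kappa * K1
Q = v_norm2 + gamma * vhat2
nu = 2 * np.sqrt(b1**2 + kappa * a2 * Q)     # requires condition (7.35)
q = b0 * a2 - b1 * a1

def A2_closed(tau):
    """Closed-form solution (7.36)."""
    e = np.expm1(nu * tau)
    return 2 * a2 * e / ((nu - 2 * b1) * e + 2 * nu)

def A1_closed(tau):
    """Closed-form solution (7.37)."""
    e = np.expm1(nu * tau)
    denom = (nu - 2 * b1) * e + 2 * nu
    return a1 / a2 * A2_closed(tau) + 4 * q / nu * np.expm1(nu * tau / 2) ** 2 / denom

def rk4(tau, n=4000):
    """Integrate y = (A2, A1) from 0 to tau using (7.34) and (7.33)."""
    def f(y):
        A2, A1 = y
        dA2 = a2 + 2 * b1 * A2 - kappa * Q * A2**2
        dA1 = a1 + b0 * A2 + (b1 - kappa * Q * A2) * A1
        return np.array([dA2, dA1])
    y, h = np.zeros(2), tau / n
    for _ in range(n):
        k1 = f(y)
        k2 = f(y + 0.5 * h * k1)
        k3 = f(y + 0.5 * h * k2)
        k4 = f(y + h * k3)
        y = y + h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
    return y

A2_num, A1_num = rk4(5.0)
print(A2_num, A2_closed(5.0))
print(A1_num, A1_closed(5.0))
```

Extending the state vector with the running integrals of $A_1$, $A_2$, and $A_1^2$ would give $A_0(\tau)$ in (7.38) without needing the explicit integral formulas.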
We summarize our findings in the following theorem.⁵
Theorem 7.9. Assume that $v(x) = v$, $\hat v(x) = \hat v$, and that $r(x)$, $m(x)$, and $\lambda(x)$ are given as in (7.27)–(7.29), and that the parameter condition (7.35) holds. For an investor with CRRA utility from terminal wealth only, the indirect utility function is then given by
$$J(W,x,t) = \frac{1}{1-\gamma} \left( W e^{A_0(T-t) + A_1(T-t) x + \frac{1}{2} A_2(T-t) x^2} \right)^{1-\gamma},$$
where $A_2$, $A_1$, and $A_0$ are given by (7.36), (7.37), and (7.38). The optimal investment strategy is given by
$$\Pi(W,x,t) = \frac{1}{\gamma} \big(\sigma(x,t)^\top\big)^{-1} \lambda(x) - \frac{\gamma-1}{\gamma} \big(\sigma(x,t)^\top\big)^{-1} v \left( A_1(T-t) + A_2(T-t) x \right).$$
For a complete market we can generalize the above results to encompass investors with utility from intermediate consumption. The relevant ODEs are the same, and thus their solutions are the same as above, except that we impose the condition $\hat v = 0$.
Theorem 7.10. Assume that the market is complete ($\hat v(x) \equiv 0$), that $v(x) = v$, and that $r(x)$, $m(x)$, and $\lambda(x)$ are given as in (7.27)–(7.29). Imposing the restriction $\hat v = 0$, assume that the parameter condition (7.35) holds, and let $A_2$, $A_1$, and $A_0$ be given by (7.36), (7.37), and (7.38). Define
$$g(x,t;s) = \exp\left\{ -\frac{\delta}{\gamma}(s-t) - \frac{\gamma-1}{\gamma} \left( A_0(s-t) + A_1(s-t) x + \frac{1}{2} A_2(s-t) x^2 \right) \right\}.$$
For an investor with CRRA utility from intermediate consumption and possibly terminal wealth, the indirect utility function is then given by
$$J(W,x,t) = \frac{1}{1-\gamma} \left( \varepsilon_1^{1/\gamma} \int_t^T g(x,t;s)\,ds + \varepsilon_2^{1/\gamma} g(x,t;T) \right)^{\gamma} W^{1-\gamma},$$
the optimal consumption strategy is
$$C(W,x,t) = \left( \int_t^T g(x,t;s)\,ds + \left( \frac{\varepsilon_2}{\varepsilon_1} \right)^{1/\gamma} g(x,t;T) \right)^{-1} W,$$
and the optimal investment strategy is
$$\Pi(W,x,t) = \frac{1}{\gamma} \big(\sigma(x,t)^\top\big)^{-1} \lambda(x) - \frac{\gamma-1}{\gamma} D(x,t,T) \big(\sigma(x,t)^\top\big)^{-1} v,$$
where
$$D(x,t,T) = \frac{ \int_t^T \left( A_1(s-t) + A_2(s-t) x \right) g(x,t;s)\,ds + \left( \frac{\varepsilon_2}{\varepsilon_1} \right)^{1/\gamma} \left( A_1(T-t) + A_2(T-t) x \right) g(x,t;T) }{ \int_t^T g(x,t;s)\,ds + \left( \frac{\varepsilon_2}{\varepsilon_1} \right)^{1/\gamma} g(x,t;T) }.$$

⁵ Note the close connection to the so-called quadratic models of the term structure of interest rates, see e.g., Ahn, Dittmar, and Gallant (2002) and Leippold and Wu (2003).
7.3.4 Multi-dimensional state variable
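With a multi-dimensional state variable $x$, a qualified guess on the indirect utility function is
$$J(W,x,t) = \frac{1}{1-\gamma}\, g(x,t)^\gamma W^{1-\gamma},$$
which indeed is a solution to the HJB equation (7.11) if the function $g(x,t)$ solves the PDE
$$\begin{aligned} 0 ={}& \varepsilon_1^{1/\gamma} - \left( \frac{\delta}{\gamma} + \frac{\gamma-1}{\gamma} r(x) + \frac{\gamma-1}{2\gamma^2} \|\lambda(x)\|^2 \right) g(x,t) + \frac{\partial g}{\partial t}(x,t) \\ & + \left( m(x) - \frac{\gamma-1}{\gamma} v(x)^\top \lambda(x) \right)^\top g_x(x,t) + \frac{1}{2} \operatorname{tr}\big( g_{xx}(x,t) \Sigma_x(x) \big) \\ & + \frac{1}{2} (\gamma-1)\, g(x,t)^{-1} g_x(x,t)^\top \hat v(x) \hat v(x)^\top g_x(x,t) \end{aligned} \tag{7.39}$$
with the terminal condition $g(x,T) = \varepsilon_2^{1/\gamma}$. The optimal investment strategy is
$$\Pi(W,x,t) = \frac{1}{\gamma} \big(\sigma(x,t)^\top\big)^{-1} \lambda(x) + \frac{1}{g(x,t)} \big(\sigma(x,t)^\top\big)^{-1} v(x)\, g_x(x,t),$$
and with intermediate consumption the optimal consumption rate is given by
$$C(W,x,t) = \varepsilon_1^{1/\gamma} \frac{W}{g(x,t)}.$$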
With no intermediate consumption ($\varepsilon_1 = 0$, $\varepsilon_2 = 1$, $\delta = 0$), we can write
$$g(x,t) \equiv g(x,t;T) = \exp\left\{ -\frac{\gamma-1}{\gamma} H(x,T-t) \right\},$$
and $H(x,\tau)$ then has to solve
$$\begin{aligned} 0 ={}& r(x) + \frac{1}{2\gamma} \|\lambda(x)\|^2 - \frac{\partial H}{\partial \tau}(x,\tau) + \left( m(x) - \frac{\gamma-1}{\gamma} v(x)^\top \lambda(x) \right)^\top H_x(x,\tau) \\ & + \frac{1}{2} \operatorname{tr}\big( \Sigma_x(x) H_{xx}(x,\tau) \big) - \frac{\gamma-1}{2\gamma} H_x(x,\tau)^\top \left( \Sigma_x(x) + (\gamma-1) \hat v(x) \hat v(x)^\top \right) H_x(x,\tau) \end{aligned} \tag{7.40}$$
with the condition $H(x,0) = 0$. In this case, the indirect utility function is
$$J(W,x,t) = \frac{1}{1-\gamma} \left( W e^{H(x,T-t)} \right)^{1-\gamma}, \tag{7.41}$$
and the optimal investment strategy is
$$\Pi(W,x,t) = \frac{1}{\gamma} \big(\sigma(x,t)^\top\big)^{-1} \lambda(x) - \frac{\gamma-1}{\gamma} \big(\sigma(x,t)^\top\big)^{-1} v(x) H_x(x,T-t).$$
As in the case of a one-dimensional state variable, the solution to the problem with utility of consumption can be stated in terms of various integrals involving $H$ under the assumption that the market is complete.
The results for affine and quadratic models with a one-dimensional state variable can be generalized to settings with a multi-dimensional state variable. We get exactly the same results as in Theorems 7.7–7.10 except that the $A_1$-function is now vector-valued and the $A_2$-function is matrix-valued. We get a larger system of differential equations to solve.
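Let us briefly summarize the results for the multi-dimensional affine case. The short rate is of the form
$$r(x) = r_0 + r_1^\top x,$$
the dynamics of the state variable $x$ is
$$dx_t = \left( m_0 + m_1 x_t \right) dt + \underbrace{ D \sqrt{V(x_t)} }_{v(x_t)}\,dz_t + \underbrace{ \hat D \sqrt{\hat V(x_t)} }_{\hat v(x_t)}\,d\hat z_t,$$
where $m_0$ is a $k$-vector, $m_1$ is a $k \times k$-matrix, $D$ is a $k \times d$-matrix, $\hat D$ is a $k \times k$-matrix, and the $d \times d$-matrix $V(x)$ and the $k \times k$-matrix $\hat V(x)$ are diagonal matrices with elements
$$[V(x)]_{ii} = \nu_i + V_i^\top x, \qquad [\hat V(x)]_{ii} = \hat\nu_i + \hat V_i^\top x.$$
Furthermore, we must have
$$v(x) \lambda(x) = D \sqrt{V(x)}\, \lambda(x) = K_0 + K_1 x \tag{7.42}$$
for some $k$-vector $K_0$ and $(k \times k)$-matrix $K_1$, and
$$\|\lambda(x)\|^2 = \Lambda_0 + \Lambda_1^\top x \tag{7.43}$$
for some scalar $\Lambda_0$ and $k$-vector $\Lambda_1$. Eqs. (7.42) and (7.43) are satisfied if $\lambda(x) = \sqrt{V(x)}\, \tilde\lambda$ for some $d$-vector $\tilde\lambda$, but slightly more general specifications of $\lambda(x)$ are also possible. In this case, the PDE (7.40) has a solution of the affine form
$$H(x,\tau) = A_0(\tau) + A_1(\tau)^\top x,$$
where $A_1(\tau)$ satisfies $A_1(0) = 0$ and the ODE
$$A_1'(\tau) = r_1 + \frac{\Lambda_1}{2\gamma} + \left( m_1 - \frac{\gamma-1}{\gamma} K_1 \right)^\top A_1(\tau) - \frac{\gamma-1}{2\gamma} \left( \sum_{i=1}^d [D^\top A_1(\tau)]_i^2 V_i + \gamma \sum_{i=1}^k [\hat D^\top A_1(\tau)]_i^2 \hat V_i \right),$$
and $A_0(\tau)$ satisfies $A_0(0) = 0$ and the ODE
$$A_0'(\tau) = r_0 + \frac{\Lambda_0}{2\gamma} + \left( m_0 - \frac{\gamma-1}{\gamma} K_0 \right)^\top A_1(\tau) - \frac{\gamma-1}{2\gamma} \left( \sum_{i=1}^d [D^\top A_1(\tau)]_i^2 \nu_i + \gamma \sum_{i=1}^k [\hat D^\top A_1(\tau)]_i^2 \hat\nu_i \right).$$
Given $A_1$, $A_0$ can be computed by integration:
$$\begin{aligned} A_0(\tau) = \int_0^\tau A_0'(s)\,ds ={}& \left( r_0 + \frac{\Lambda_0}{2\gamma} \right) \tau + \left( m_0 - \frac{\gamma-1}{\gamma} K_0 \right)^\top \int_0^\tau A_1(s)\,ds \\ & - \frac{\gamma-1}{2\gamma} \left( \sum_{i=1}^d \nu_i \int_0^\tau [D^\top A_1(s)]_i^2\,ds + \gamma \sum_{i=1}^k \hat\nu_i \int_0^\tau [\hat D^\top A_1(s)]_i^2\,ds \right). \end{aligned}$$
The optimal portfolio with utility of terminal wealth only is
$$\Pi(x,t) = \frac{1}{\gamma} \big(\sigma(x,t)^\top\big)^{-1} \lambda(x) - \frac{\gamma-1}{\gamma} \big(\sigma(x,t)^\top\big)^{-1} \sqrt{V(x)}\, D^\top A_1(T-t).$$
These results can be extended to utility of intermediate consumption as long as the market is complete.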
There are also cases in which the function $H(x,t)$ is the sum of a function which is affine in some of the individual state variables and quadratic in the others. For example, with a two-dimensional state variable $x = (x_1, x_2)^\top$, we will under some conditions get a solution of the form
$$H(x_1,x_2,t) = A_0(T-t) + A_{11}(T-t) x_1 + A_{12}(T-t) x_2 + \frac{1}{2} A_2(T-t) x_2^2,$$
and, consequently, the investment strategy
$$\Pi(W,x,t) = \frac{1}{\gamma} \big(\sigma(x,t)^\top\big)^{-1} \lambda(x) - \frac{\gamma-1}{\gamma} \big(\sigma(x,t)^\top\big)^{-1} \left[ v_1(x) A_{11}(T-t) + v_2(x) \left( A_{12}(T-t) + A_2(T-t) x_2 \right) \right],$$
where $v_i$ is the $d$-vector of sensitivities of $x_i$ with respect to the traded risks $dz_t$.
7.4 Logarithmic utility
Logarithmic utility is the special case of CRRA utility in which the relative risk aversion equals one. For notational simplicity, let us assume a one-dimensional state variable. Applying the same procedure to the problem with log utility as we did for CRRA utility, one can show that (this is Exercise 7.2)
$$J(W,x,t) = g(t) \ln W + h(x,t),$$
where $g(t)$ is again given by (6.14) and where $h(x,t)$ must satisfy a certain PDE. Since the cross derivative $J_{Wx}(W,x,t) = 0$, the optimal risky portfolio in (7.7) reduces to
$$\Pi(W,x,t) = \big(\sigma(x,t)^\top\big)^{-1} \lambda(x).$$
We can conclude that a logarithmic investor does not hedge stochastic variations in the investment opportunity set. She behaves myopically, i.e., as in a static one-period framework. Optimal consumption is again given by
$$C(W,x,t) = \varepsilon_1 \frac{W}{g(t)}.$$
Letting $\Pi_0(W,x,t)$ denote the fraction of wealth optimally invested in the instantaneously risk-free asset, we can summarize the entire investment strategy as
$$\begin{pmatrix} \Pi_0(W,x,t) \\ \Pi(W,x,t) \end{pmatrix} = \begin{pmatrix} 1 - \mathbf{1}^\top \big(\sigma(x,t)^\top\big)^{-1} \lambda(x) \\ \big(\sigma(x,t)^\top\big)^{-1} \lambda(x) \end{pmatrix}.$$
This portfolio is sometimes referred to as the log portfolio or the growth-optimal portfolio, since it is also the portfolio with the highest expected average compound growth rate of portfolio value. This average growth rate is defined as $\frac{1}{T-t} \ln\left( W_T / W_t \right)$.
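The growth-optimality property can be illustrated in the simplest possible setting. The sketch below assumes a single risky asset with constant $r$, $\mu$, and $\sigma$, no consumption, and a constant portfolio weight $\pi$, in which case the expected average log growth rate is $r + \pi(\mu - r) - \frac{1}{2}\pi^2\sigma^2$; these simplifying assumptions and all numerical values are illustrative and not taken from the text. A grid search recovers the log portfolio $\pi = \lambda/\sigma$.

```python
import numpy as np

# Illustrative constant market parameters (assumptions).
r, mu, sigma = 0.02, 0.08, 0.20
lam = (mu - r) / sigma              # market price of risk
pi_log = lam / sigma                # scalar version of (sigma^T)^{-1} lambda

def growth_rate(pi):
    """Expected average log growth rate for a constant weight pi."""
    return r + pi * (mu - r) - 0.5 * pi**2 * sigma**2

grid = np.linspace(0.0, 3.0, 3001)  # candidate portfolio weights
pi_best = grid[np.argmax(growth_rate(grid))]

print(pi_log, pi_best, growth_rate(pi_log))
```

The maximal growth rate in this setting equals $r + \frac{1}{2}\lambda^2$, so both the risk-free rate and the squared market price of risk enter, in line with Theorem 7.4.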
7.5 How costly are deviations from the optimal investment strategy?
The following results are taken from Larsen and Munk (2012).
We consider an investor with a power utility function of wealth at some future date $T$ and ignore both intermediate consumption and income other than financial returns. Any combination of an initial wealth $W$ and an investment strategy $\pi$ will give rise to a terminal wealth $W^\pi_T$ (a partially controlled random variable) and the expected utility associated with that investment strategy is thus
$$J^\pi(W,x,t) = E_t \left[ \frac{1}{1-\gamma} \left( W^\pi_T \right)^{1-\gamma} \right],$$
where $W$ is the initial (time $t$) wealth and $\gamma > 1$ is the constant relative risk aversion coefficient. It is well-known that no matter what assumptions are made about the dynamics of investment opportunities, the optimal investment strategy for a CRRA investor will be independent of her wealth level. Hence, we will focus on strategies of the form $\pi(x,t)$ that depend only on the state variable and time (and not on wealth). The next theorem characterizes the expected utility generated by such an investment strategy.
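Theorem 7.11. The expected utility generated by the investment strategy $\pi_t = \pi(x_t,t)$ is
$$J^\pi(W,x,t) = \frac{1}{1-\gamma} \left( W e^{H^\pi(x,t)} \right)^{1-\gamma}, \tag{7.44}$$
where the function $H^\pi(x,t)$ satisfies the PDE
$$\begin{aligned} & \frac{\partial H^\pi}{\partial t} + \left( m(x) - (\gamma-1)\, v(x) \sigma(x,t)^\top \pi(x,t) \right)^\top H^\pi_x + \frac{1}{2} \operatorname{tr}\big( H^\pi_{xx} \Sigma_x(x) \big) - \frac{\gamma-1}{2} \left( H^\pi_x \right)^\top \Sigma_x(x) H^\pi_x \\ & \quad + r(x) + \pi(x,t)^\top \sigma(x,t) \left( \lambda(x) - \frac{\gamma}{2} \sigma(x,t)^\top \pi(x,t) \right) = 0 \end{aligned}$$
with the terminal condition $H^\pi(x,T) = 0$.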
As explained in Section 5.4, we can associate a percentage wealth loss $\ell_t$ with any given suboptimal investment strategy $\pi$. The loss is implicitly defined by the relation
$$J^\pi(W_t, x_t, t) = J(W_t [1 - \ell_t], x_t, t).$$
With $\pi = \pi(x,t)$ and CRRA utility of terminal wealth only, it follows from Eqs. (7.41) and (7.44) that the loss can be stated as
$$\ell_t = 1 - \exp\left\{ -\left[ H(x_t,t) - H^\pi(x_t,t) \right] \right\} \approx H(x_t,t) - H^\pi(x_t,t).$$
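To get a feel for the magnitudes, consider a minimal sketch of case (iii) below under the simplifying assumption of constant investment opportunities with a single risky asset, so that nothing depends on $x$ and the PDE in Theorem 7.11 integrates to $H^\pi = \left( r + \pi\sigma\lambda - \frac{\gamma}{2}\pi^2\sigma^2 \right)(T-t)$ for a constant weight $\pi$. The parameter values and the size of the deviation are illustrative assumptions, not values from Larsen and Munk (2012).

```python
import math

# Illustrative constant market and preference parameters (assumptions).
gamma = 4.0
r, mu, sigma = 0.02, 0.08, 0.20
horizon = 10.0                       # remaining investment horizon T - t
lam = (mu - r) / sigma               # market price of risk

def H(pi, tau):
    """H^pi for a constant weight pi and constant r, lambda, sigma."""
    return (r + pi * sigma * lam - 0.5 * gamma * pi**2 * sigma**2) * tau

pi_opt = lam / (gamma * sigma)       # optimal (Merton) weight
pi_sub = pi_opt + 0.10               # hypothetical absolute deviation

gap = H(pi_opt, horizon) - H(pi_sub, horizon)
loss = 1.0 - math.exp(-gap)          # percentage wealth loss ell_t

print(gap, loss)
```

In this constant-opportunity case the gap equals $\frac{\gamma}{2}\sigma^2(\pi - \pi^*)^2(T-t)$, so the loss is of second order in the deviation from the optimal weight and grows with the horizon.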
Using these results, one can investigate various interesting suboptimal strategies, e.g.,
(i) the optimal strategy given that some assets are omitted from the portfolio,
(ii) the myopic, no hedge strategy, and
(iii) a certain absolute deviation from the optimal portfolio weights.
When the return dynamics have an affine or quadratic structure, the utility losses associated with these three suboptimal strategies can be derived from solving appropriate ordinary differential equations (ODEs). Obviously, case (i) allows us to evaluate the benefits of adding an extra asset class to the portfolio decision problem. Various recent academic papers have investigated portfolio choice models with various derivatives, corporate bonds, or other assets not traditionally included in a Merton-style model. From time to time innovative members of the financial industry promote investments in asset classes typically ignored. We provide a framework for a well-founded analysis of the investor welfare gains from expanding the investment universe. Case (ii) allows us to address the importance of intertemporal hedging. Some authors report that, for the specific model of return dynamics they consider, the intertemporal hedging demand is quite small; see, e.g., Aït-Sahalia and Brandt (2001), Ang and Bekaert (2002), Brandt (1999), and Chacko and Viceira (2005). However, it is not clear that a small change in the long-term investment strategy cannot have a significant impact on the expected life-time utility. In fact, in a model with a constant risk-free rate and a single stock index with constant expected return and time-varying volatility, Gomes (2007) reports small intertemporal hedging demands and significant (although not dramatically large) utility losses from ignoring the hedge term. Case (iii) allows us to gauge the robustness of the optimal investment strategy, e.g., to deviations from the truly optimal strategy due to applying a slightly mis-specified model or slightly inaccurate parameter values. The size of the utility loss from small perturbations of the optimal strategy will also indicate how frequently the portfolio should be rebalanced in practical implementations. Exercise 7.3 deals with case (iii).
For further discussions and examples see Larsen and Munk (2012).
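To get a feel for the magnitudes, the loss formula can be evaluated numerically. The following is a minimal sketch, not code from Larsen and Munk (2012): the function names and the sample inputs are hypothetical, and the second function assumes the constant-error case of Exercise 7.3, where the difference between the optimal and suboptimal $H$-functions equals $\frac{\gamma}{2}\|\varepsilon\|^2 (T-t)$.

```python
import math


def wealth_loss(h_opt: float, h_sub: float) -> float:
    """Percentage wealth loss 1 - exp(-(H - H^pi)) for given values of H and H^pi."""
    return 1.0 - math.exp(-(h_opt - h_sub))


def loss_constant_error(gamma: float, eps_norm: float, horizon: float) -> float:
    """Loss for a constant, state-independent error vector eps (assumption).

    Uses Delta = (gamma / 2) * ||eps||^2 * (T - t), i.e. the integral
    formula of Exercise 7.3 with a constant integrand.
    """
    delta = 0.5 * gamma * eps_norm ** 2 * horizon
    return wealth_loss(delta, 0.0)


if __name__ == "__main__":
    # Hypothetical inputs: gamma = 4, error norm 0.05, 10-year horizon.
    print(f"loss = {loss_constant_error(4.0, 0.05, 10.0):.4%}")
```

In this special case the loss grows with risk aversion, with the horizon, and with the squared size of the error, as noted in Exercise 7.3.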
7.6 Exercises
Exercise 7.1. Give a proof of Theorem 7.6.
Exercise 7.2. Verify the results stated in Section 7.4.
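Exercise 7.3. Consider a trading strategy $\pi^\varepsilon$ which is a perturbation of the optimal strategy $\pi^*$ in the sense that
\[ \pi^\varepsilon(x_t,t) = \pi^*(x_t,t) + \left( \sigma(x_t,t)^\top \right)^{-1}\varepsilon(x_t,t) \]
for some $\varepsilon(x,t)$ that can be interpreted as the error made in the assessment of the optimal sensitivity of wealth with respect to the shocks to asset prices. Let $\Delta^\varepsilon(x,t) = H(x,t) - H^\varepsilon(x,t)$ so that the wealth loss is $\ell^\varepsilon(x,t) = 1 - \exp\{-\Delta^\varepsilon(x,t)\} \approx \Delta^\varepsilon(x,t)$. Show that $\Delta^\varepsilon$ satisfies the PDE
\[ \left( m(x) - (\gamma-1)\,v(x)\left[ \frac{1}{\gamma}\lambda(x) + \varepsilon(x,t) \right] - (\gamma-1)\left[ \frac{1}{\gamma}\,v(x)v(x)^\top + \hat v(x)\hat v(x)^\top \right] H_x \right)^\top \Delta^\varepsilon_x + \frac{\partial \Delta^\varepsilon}{\partial t} + \frac{1}{2}\operatorname{tr}\left( \Delta^\varepsilon_{xx}\Sigma(x) \right) + \frac{\gamma-1}{2}\,(\Delta^\varepsilon_x)^\top\Sigma(x)\Delta^\varepsilon_x + \frac{\gamma}{2}\|\varepsilon(x,t)\|^2 = 0 \tag{7.45} \]
with the terminal condition $\Delta^\varepsilon(x,T) = 0$. In particular, show that if $\varepsilon(x,t)$ is independent of $x$, the solution $\Delta^\varepsilon(x,t) = \Delta^\varepsilon(t)$ to
\[ (\Delta^\varepsilon)'(t) + \frac{\gamma}{2}\|\varepsilon(t)\|^2 = 0, \qquad \Delta^\varepsilon(T) = 0, \]
will also solve the full PDE (7.45). Hence, the solution is
\[ \Delta^\varepsilon(t) = \frac{\gamma}{2}\int_t^T \|\varepsilon(s)\|^2\,ds. \]
Observe that the loss is increasing in the risk aversion, the time horizon, and the squared error $\|\varepsilon(s)\|^2$.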
Exercise 7.4. In the models considered so far we have assumed a single consumption good, but modern economies offer an enormous variety of different consumption goods. The purpose of this exercise is to perform a preliminary analysis of how the presence of multiple consumption goods may affect the optimal consumption and investment strategies of an individual investor.
For simplicity, assume that the investor cares about only two consumption goods and both goods are perishable (non-storable). For $i = 1, 2$, let $c_{it}$ denote the units of good $i$ consumed at time $t$. Let good 1 be the numeraire so that its price is normalized to one at all times. The time $t$ price
of good 2 is denoted by
t
. To focus on the impact of multiple consumption goods, let us assume
constant investment opportunities, i.e., we assume that the investor can invest in a risk-free asset
with a constant annualized rate of return equal to r and in d risky assets with price dynamics
dP
t
= diag(P
t
)
_
r1 +
_
dt + dz
t

in the usual notation. Furthermore, assume that the price of good 2 follows a diffusion process
d
t
=

(
t
) dt +

(
t
)
>
dz
t
+

(
t
) d z
t
.
Here z is a one-dimensional standard Brownian motion independent of the d-dimensional standard
Brownian motion z.
We consider an individual with time-additive expected utility (and, for simplicity, we disregard
any utility of terminal wealth) so that the indirect utility function is
J(W, , t) = sup
(c
1s
,c
2s
,
s
)
s[t,T]
E
t
_
_
T
t
e
(st)
u(c
1s
, c
2s
) ds
_
.
(a) Explain why the HJB-equation associated with this problem can be written as
J(W, , t) =L
c
J(W, , t) +L

J(W, , t) +
J
t
(W, , t) +rWJ
W
(W, , t)
+

()J

(W, , t) +
1
2
(k

()k
2
+

()
2
)J

(W, , t),
where
L
c
J = sup
c
1
,c
2
{u(c
1
, c
2
) (c
1
+c
2
)J
W
} ,
L

J = sup

WJ
W

>
+
1
2
W
2
J
WW

>

>
+WJ
W

>

_
.
(b) Show that the optimal consumption decisions at any point in time have the property that
u
2
(c
1
, c
2
)
u
1
(c
1
, c
2
)
= ,
where u
i
denotes the derivative of u with respect to c
i
. Interpret this result.
In the remainder of the exercise assume the Cobb-Douglas style utility function
u(c
1
, c
2
) =
1
1
_
c

1
c
1
2
_
1
,
where > 0 is the relative risk aversion and (0, 1) captures the relative preference weights of
the two goods.
(c) Show that the optimal consumption decisions imply that c
2
=
1

c
1
and interpret that
result.
(d) Show that L
c
J(W, , t) =

J
11/
W
for some constants and and determine those
constants.
(e) Express the optimal portfolio in terms of relevant derivatives of $J$ and interpret your findings. How does the presence of two consumption goods affect the optimal portfolio?
(f) Show that
L

J =
1
2
J
2
W
J
WW
kk
2

1
2
J
2
W
J
WW
k

k
2

J
W
J
W
J
WW

>

.
(g) Conjecture that J(W, , t) =
1
1
g(, t)

W
1
and derive a partial differential equation for
g.
(h) Is the market complete or incomplete?
In the remainder of the exercise assume that the price process for good 2 is a geometric Brownian
motion spanned by the traded assets, i.e.,
d
t
=
t
[ dt +
>
dz
t
] ,
where is a constant scalar and a constant vector.
(i) Show that
g(, t) =
(1)
1

h(t)
solves the relevant partial dierential equation for some constant and some function h(t).
(j) What is the optimal consumption and investment strategy in this case?
CHAPTER 8
The martingale approach
8.1 The martingale approach in complete markets
The dynamic programming approach requires the existence of a finite-dimensional Markov process $x = (x_t)$ such that the indirect utility function of the investor can be written as $J_t = J(W_t,x_t,t)$. In contrast, the martingale approach does not require additional assumptions on the stochastic processes that the investor cannot control beyond those outlined in Section 5.2. In particular, we do not have to assume that the interest rates, price variances etc. are fully described by a finite-dimensional Markov process. The dynamic programming approach does not allow many conclusions on problems where the PDE cannot be solved explicitly. For example, it is hard to tell whether an optimal strategy actually exists. This question is easier to study with the martingale approach. In this section we consider the case where the market is complete. The subsequent section incorporates various portfolio constraints.
We go back to the general model for risky asset prices stated in (5.3). We consider a complete market so that the variations in the risk-free rate of return $r_t$, expected rates of return $\mu_t$, and variances and covariances defined by $\sigma_t$ between rates of return are caused by the same $d$-dimensional standard Brownian motion $z$ that affects the risky asset prices. Therefore, the market price of risk vector $\lambda_t$ defined by
\[ \lambda_t = \sigma_t^{-1}(\mu_t - r_t\mathbf{1}) \]
summarizes the risk-return tradeoff of all risks. In a complete market there is a unique state-price deflator process (a.k.a. the pricing kernel) $\zeta = (\zeta_t)$ given by
\[ \zeta_t = \exp\left\{ -\int_0^t r_s\,ds - \int_0^t \lambda_s^\top dz_s - \frac{1}{2}\int_0^t \|\lambda_s\|^2\,ds \right\}. \tag{8.1} \]
Consequently (to be shown in Exercise 8.1), the state-price deflator evolves as
\[ d\zeta_t = -\zeta_t\left[ r_t\,dt + \lambda_t^\top dz_t \right]. \tag{8.2} \]
We also have a unique equivalent martingale measure (also known as the risk-neutral probability measure) $Q$ defined by the Radon-Nikodym derivative $dQ/dP = \exp\{\int_0^T r_s\,ds\}\,\zeta_T$. We assume that $\lambda$ is an $L^2[0,T]$ process. The time zero price of a stochastic payoff $X_T$ at some point $T$ is given by
\[ \mathrm{E}^Q\left[ e^{-\int_0^T r_s\,ds} X_T \right] = \mathrm{E}[\zeta_T X_T]. \]
Similarly, the time $t$ price is
\[ \mathrm{E}^Q_t\left[ e^{-\int_t^T r_s\,ds} X_T \right] = \mathrm{E}_t\left[ \frac{\zeta_T}{\zeta_t} X_T \right]. \]
For more information about state-price deflators, market prices of risk, and risk-neutral probabilities, see Björk (2009), Duffie (2001), Munk (2012) or other textbook presentations of modern asset pricing theory.
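The pricing relation $\mathrm{E}[\zeta_T X_T]$ can be checked by simulation in the simplest possible setting. The sketch below is an illustration under stated assumptions, not part of the notes: it assumes one risky asset, a constant interest rate `r`, a constant market price of risk `lam`, and a constant volatility `sigma`; all names and parameter values are hypothetical. With these assumptions the deflator prices a zero-coupon bond at $e^{-rT}$ and the stock at its current price.

```python
import math
import random


def simulate_terminal_values(r, lam, sigma, p0, T, n_paths, seed=0):
    """Draw (zeta_T, P_T) pairs driven by the same Brownian shock z_T."""
    rng = random.Random(seed)
    draws = []
    for _ in range(n_paths):
        z_T = math.sqrt(T) * rng.gauss(0.0, 1.0)
        # State-price deflator with constant r and lam.
        zeta_T = math.exp(-r * T - lam * z_T - 0.5 * lam ** 2 * T)
        # Stock price with drift r + sigma*lam and volatility sigma.
        p_T = p0 * math.exp((r + sigma * lam - 0.5 * sigma ** 2) * T + sigma * z_T)
        draws.append((zeta_T, p_T))
    return draws


def mean(values):
    values = list(values)
    return sum(values) / len(values)


r, lam, sigma, p0, T = 0.02, 0.3, 0.2, 100.0, 1.0
draws = simulate_terminal_values(r, lam, sigma, p0, T, n_paths=100_000)
bond_price = mean(z for z, _ in draws)        # estimate of E[zeta_T * 1]
stock_price = mean(z * p for z, p in draws)   # estimate of E[zeta_T * P_T]
print(f"bond: {bond_price:.4f} vs exp(-rT) = {math.exp(-r * T):.4f}")
print(f"stock: {stock_price:.2f} vs P_0 = {p0:.2f}")
```

The two estimates differ from the exact values only by Monte Carlo error, which shrinks as `n_paths` grows.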
For simplicity we assume that the investor receives no income from non-financial sources. Then a natural constraint on the investor's choice of consumption and portfolio strategy $(c,\pi)$ at time 0 is that
\[ \mathrm{E}\left[ \int_0^T \zeta_t c_t\,dt + \zeta_T W_T \right] \le W_0, \]
where $W_T$ is the terminal wealth induced by $(c,\pi)$ and $W_0$ is the initial wealth of the investor. This simply says that the time zero price of the strategy cannot exceed the initial wealth available. This is shown rigorously in the following theorem. But first we recall from (5.5) that wealth evolves as
\[ dW_t = W_t\left[ r_t + \pi_t^\top\sigma_t\lambda_t \right]dt - c_t\,dt + W_t\pi_t^\top\sigma_t\,dz_t. \]
From this, (8.2), and Itô's Lemma we get that
\[ d(\zeta_t W_t) = -\zeta_t c_t\,dt + \zeta_t W_t\left[ \pi_t^\top\sigma_t - \lambda_t^\top \right]dz_t, \]
or equivalently
\[ \zeta_t W_t + \int_0^t \zeta_s c_s\,ds = W_0 + \int_0^t \zeta_s W_s\left[ \pi_s^\top\sigma_s - \lambda_s^\top \right]dz_s. \tag{8.3} \]
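The Itô step behind (8.3) can be spelled out as follows. This is a sketch of the standard product-rule computation, writing $\zeta$ for the state-price deflator, $\lambda$ for the market price of risk, and $\pi$ for the portfolio weights; it is included for completeness and is not quoted from the notes.

```latex
\begin{aligned}
d(\zeta_t W_t) &= \zeta_t\,dW_t + W_t\,d\zeta_t + (d\zeta_t)(dW_t) \\
&= \zeta_t W_t\left[r_t + \pi_t^\top\sigma_t\lambda_t\right]dt - \zeta_t c_t\,dt
   + \zeta_t W_t\,\pi_t^\top\sigma_t\,dz_t \\
&\quad - \zeta_t W_t\left[r_t\,dt + \lambda_t^\top dz_t\right]
   - \zeta_t W_t\,\pi_t^\top\sigma_t\lambda_t\,dt \\
&= -\zeta_t c_t\,dt + \zeta_t W_t\left[\pi_t^\top\sigma_t - \lambda_t^\top\right]dz_t .
\end{aligned}
```

The drift terms involving $r_t$ and $\pi_t^\top\sigma_t\lambda_t$ cancel, so only the consumption term remains in the drift.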
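Theorem 8.1. If $(c,\pi)$ is a feasible strategy, then
\[ \mathrm{E}\left[ \int_0^T \zeta_t c_t\,dt + \zeta_T W_T \right] \le W_0, \]
where $W_T$ is the terminal wealth induced by $(c,\pi)$.
Proof. Define the stopping times $(\tau_n)_{n\in\mathbb{N}}$ by
\[ \tau_n = T \wedge \inf\left\{ t\in[0,T] \,\Big|\, \int_0^t \left\| \zeta_s W_s\left( \pi_s^\top\sigma_s - \lambda_s^\top \right) \right\|^2 ds \ge n \right\}. \]
Then the stochastic integral on the right-hand side of (8.3) is a martingale on $[0,\tau_n]$. Taking expectations in (8.3) leaves us with
\[ \mathrm{E}\left[ \zeta_{\tau_n} W_{\tau_n} \right] + \mathrm{E}\left[ \int_0^{\tau_n} \zeta_t c_t\,dt \right] = W_0. \]
Letting $n\to\infty$, we have $\tau_n \to T$, and it can be shown by use of Lebesgue's monotone convergence theorem that
\[ \mathrm{E}\left[ \int_0^{\tau_n} \zeta_t c_t\,dt \right] \to \mathrm{E}\left[ \int_0^T \zeta_t c_t\,dt \right]. \]
Furthermore, Fatou's lemma can be applied to show that
\[ \liminf_{n\to\infty} \mathrm{E}\left[ \zeta_{\tau_n} W_{\tau_n} \right] \ge \mathrm{E}[\zeta_T W_T]. \]
The claim now follows.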
The idea of the martingale approach is to focus on the static optimization problem
\[ \sup_{(c,W)} \mathrm{E}\left[ \int_0^T e^{-\delta t}u(c_t)\,dt + e^{-\delta T}\bar u(W) \right], \tag{8.4} \]
\[ \text{s.t. } \mathrm{E}\left[ \int_0^T \zeta_t c_t\,dt + \zeta_T W \right] \le W_0 \]
rather than the original dynamic problem
\[ \sup_{(c,\pi)} \mathrm{E}\left[ \int_0^T e^{-\delta t}u(c_t)\,dt + e^{-\delta T}\bar u(W_T) \right], \]
\[ \text{s.t. } dW_t = W_t\left[ r_t + \pi_t^\top\sigma_t\lambda_t \right]dt - c_t\,dt + W_t\pi_t^\top\sigma_t\,dz_t. \]
In the static problem the agent chooses the terminal wealth directly, whereas in the dynamic problem the terminal wealth follows from the portfolio strategy (and the consumption strategy). For the terminal wealth variable $W$, the agent is allowed to choose among the non-negative, integrable and $\mathcal{F}_T$-measurable random variables. This approach was suggested by Karatzas, Lehoczky, and Shreve (1987) and Cox and Huang (1989, 1991). Some preliminary aspects were addressed by Pliska (1986).
The Lagrangian for the constrained optimization problem (8.4) is given by
\[ \mathcal{L} = \mathrm{E}\left[ \int_0^T e^{-\delta t}u(c_t)\,dt + e^{-\delta T}\bar u(W) \right] + \psi\left( W_0 - \mathrm{E}\left[ \int_0^T \zeta_t c_t\,dt + \zeta_T W \right] \right) \]
\[ \phantom{\mathcal{L}} = \psi W_0 + \mathrm{E}\left[ \int_0^T \left( e^{-\delta t}u(c_t) - \psi\zeta_t c_t \right)dt + \left( e^{-\delta T}\bar u(W) - \psi\zeta_T W \right) \right], \]
where $\psi$ is a Lagrange multiplier. We can maximize the expectation in the last line by maximizing $\left( e^{-\delta T}\bar u(W) - \psi\zeta_T W \right)$ with respect to $W$ for each possible value of $\zeta_T$ and maximizing $\left( e^{-\delta t}u(c_t) - \psi\zeta_t c_t \right)$ with respect to $c_t$ for each $t$ and each possible value of $\zeta_t$. This results in the first-order conditions
\[ e^{-\delta t}u'(c_t) = \psi\zeta_t, \qquad e^{-\delta T}\bar u'(W) = \psi\zeta_T, \]
where $\psi$ is then chosen such that the inequality constraint holds as an equality. Let $I_u(\cdot)$ denote the inverse of the marginal utility function $u'(\cdot)$ and $I_{\bar u}(\cdot)$ the inverse of $\bar u'(\cdot)$. Then the candidates for the optimal consumption and the optimal terminal wealth can be written as
\[ c_t = I_u\left( \psi e^{\delta t}\zeta_t \right), \qquad W = I_{\bar u}\left( \psi e^{\delta T}\zeta_T \right). \]
The present value of this choice depends on the Lagrange multiplier $\psi$:
\[ H(\psi) = \mathrm{E}\left[ \int_0^T \zeta_t I_u(\psi e^{\delta t}\zeta_t)\,dt + \zeta_T I_{\bar u}(\psi e^{\delta T}\zeta_T) \right]. \tag{8.5} \]
We look for a multiplier $\psi$ such that $H(\psi) = W_0$ so that the entire budget is spent. Since marginal utility is decreasing, this is also the case for the inverse of marginal utility and hence also for the function $H$. We will assume that $H(\psi)$ is finite for all $\psi > 0$. This condition should be verified in concrete applications. Under this assumption, $H$ has an inverse denoted by $Y$, and the appropriate Lagrange multiplier is $\psi = Y(W_0)$. The next theorem says that the optimal policy in the static problem is feasible and optimal in the dynamic problem.
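Because the present value of the candidate plan is decreasing in the Lagrange multiplier, the multiplier that exhausts the budget can be found by bisection. The sketch below illustrates this under simplifying assumptions that are not in the notes: utility of terminal wealth only (no intermediate consumption), power utility with marginal-utility inverse $z^{-1/\gamma}$, a constant interest rate and market price of risk, and a Monte Carlo sample in place of the expectation. All function names and parameter values are hypothetical.

```python
import math
import random


def simulate_deflator(r, lam, T, n_paths, seed=0):
    """Sample zeta_T = exp(-rT - lam*z_T - 0.5*lam^2*T) with constant r and lam."""
    rng = random.Random(seed)
    root_T = math.sqrt(T)
    return [
        math.exp(-r * T - lam * root_T * rng.gauss(0.0, 1.0) - 0.5 * lam ** 2 * T)
        for _ in range(n_paths)
    ]


def budget_cost(psi, zetas, gamma, delta, T):
    """Sample analogue of the present value of the plan, terminal wealth only.

    Candidate wealth is I(psi * e^{delta T} * zeta_T) with I(z) = z^(-1/gamma).
    """
    scale = psi * math.exp(delta * T)
    return sum(z * (scale * z) ** (-1.0 / gamma) for z in zetas) / len(zetas)


def solve_multiplier(w0, zetas, gamma, delta, T, lo=1e-12, hi=1e12, iters=80):
    """Bisect on log(psi); the cost is decreasing in psi."""
    a, b = math.log(lo), math.log(hi)
    for _ in range(iters):
        mid = 0.5 * (a + b)
        if budget_cost(math.exp(mid), zetas, gamma, delta, T) > w0:
            a = mid  # plan costs more than W_0, so psi must be larger
        else:
            b = mid
    return math.exp(0.5 * (a + b))


gamma, delta, r, lam, T, w0 = 3.0, 0.03, 0.02, 0.3, 5.0, 100.0
zetas = simulate_deflator(r, lam, T, n_paths=5_000)
psi = solve_multiplier(w0, zetas, gamma, delta, T)

# For power utility the cost is psi^(-1/gamma) * g0, which gives a closed form
# for the multiplier on the same sample and serves as a cross-check.
g0 = sum(math.exp(-delta * T / gamma) * z ** (1.0 - 1.0 / gamma) for z in zetas) / len(zetas)
psi_closed = (g0 / w0) ** gamma
print(psi, psi_closed)
```

Bisection is used here only because it needs nothing beyond monotonicity; for power utility the closed form makes the numerical search unnecessary.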
Theorem 8.2. Assume that $H(\psi) < \infty$ for all $\psi > 0$. The optimal consumption rate is given by
\[ c_t^* = I_u\left( Y(W_0)e^{\delta t}\zeta_t \right). \]
Under the optimal portfolio strategy the terminal wealth level is
\[ W^* = I_{\bar u}\left( Y(W_0)e^{\delta T}\zeta_T \right). \]
The wealth process under the optimal policy is given by
\[ W_t^* = \frac{1}{\zeta_t}\mathrm{E}_t\left[ \int_t^T \zeta_s c_s^*\,ds + \zeta_T W^* \right]. \tag{8.6} \]
Proof. First note that for a concave and differentiable function $u$ we have that
\[ \frac{u(\tilde c) - u(c)}{\tilde c - c} \ge u'(\tilde c) \]
for any $\tilde c > c$ since the left-hand side is the slope of the line through the points $(c,u(c))$ and $(\tilde c,u(\tilde c))$ and the right-hand side is the slope of the tangent at $\tilde c$. It follows immediately that
\[ u(\tilde c) - u(c) \ge u'(\tilde c)(\tilde c - c). \]
A moment of reflection (maybe supported by a sketch of a graph) will convince you that the inequality holds even if $\tilde c \le c$. Let us take $\tilde c = I_u(z)$ for some $z$. Then $u'(\tilde c) = z$ so that we can conclude that
\[ u(I_u(z)) - u(c) \ge z\left( I_u(z) - c \right), \qquad \forall c, z > 0. \]
Analogously, we have
\[ \bar u(I_{\bar u}(z)) - \bar u(W) \ge z\left( I_{\bar u}(z) - W \right), \qquad \forall W, z > 0. \]
Hence, for any feasible strategy $(c,\pi)$ with associated terminal wealth $W$, we have that
\[ \mathrm{E}\left[ \int_0^T e^{-\delta t}\left( u(c_t^*) - u(c_t) \right)dt + e^{-\delta T}\left( \bar u(W^*) - \bar u(W) \right) \right] \ge \mathrm{E}\left[ \int_0^T Y(W_0)\zeta_t(c_t^* - c_t)\,dt + Y(W_0)\zeta_T(W^* - W) \right] \ge 0, \]
where the last inequality follows from the fact that, by Theorem 8.1,
\[ \mathrm{E}\left[ \int_0^T \zeta_t c_t\,dt + \zeta_T W \right] \le W_0, \]
and, per construction,
\[ \mathrm{E}\left[ \int_0^T \zeta_t c_t^*\,dt + \zeta_T W^* \right] = W_0. \]
Thus, if there is a portfolio strategy $\pi^*$ such that $(c^*,\pi^*)$ is feasible and gives a terminal wealth of $W^*$, then the strategy $(c^*,\pi^*)$ will be optimal. Define the process $W^*$ by (8.6). Obviously,
\[ \zeta_t W_t^* + \int_0^t \zeta_s c_s^*\,ds = \mathrm{E}_t\left[ \int_0^T \zeta_s c_s^*\,ds + \zeta_T W_T^* \right] \]
defines a martingale, so by the martingale representation theorem, an adapted $L^2[0,T]$ process $\eta$ exists such that
\[ \zeta_t W_t^* + \int_0^t \zeta_s c_s^*\,ds = W_0 + \int_0^t \eta_s^\top dz_s. \tag{8.7} \]
Define a portfolio process $\pi$ by
\[ \pi_t = \left( \sigma_t^\top \right)^{-1}\left( \frac{\eta_t}{W_t^*\zeta_t} + \lambda_t \right) \]
(with the remaining wealth $W_t^*(1 - \pi_t^\top\mathbf{1})$ invested in the bank account). A comparison of (8.7) and (8.3) shows that the wealth process corresponding to this strategy together with the consumption strategy $c^*$ is exactly $(W_t^*)$. From (8.6), it is clear that terminal wealth is $W_T^* = W^*$.
Note that the indirect utility at time 0 as a function of initial wealth $W_0$ is
\[ J(W_0) = \mathrm{E}\left[ \int_0^T e^{-\delta t}u(c_t^*)\,dt + e^{-\delta T}\bar u(W^*) \right] = \mathrm{E}\left[ \int_0^T e^{-\delta t}u\left( I_u(Y(W_0)e^{\delta t}\zeta_t) \right)dt + e^{-\delta T}\bar u\left( I_{\bar u}(Y(W_0)e^{\delta T}\zeta_T) \right) \right]. \]
We shall demonstrate how to apply the martingale approach on concrete consumption and investment choice problems in Sections 8.2 and 8.3. The martingale approach is in many aspects more elegant and it is better suited for answering the existence question under general conditions, cf. Cuoco (1997). However, the existence of an optimal portfolio strategy is based on the martingale representation theorem, which in itself does not give an explicit representation of the optimal portfolio, nor a way to compute it. In some settings the martingale approach can give an abstract characterization of both the optimal consumption and portfolio strategy even for non-Markov dynamics, but in order to obtain explicit expressions for the optimal strategies the setting is typically specialized to a Markov setting. So far, there are only a few examples of explicit solutions computed with the martingale approach where the solution could not have been easily found by an application of the dynamic programming approach. (See Munk and Sørensen (2004) for one example.) However, in some of the relatively simple problems, such as the complete markets case studied by Cox and Huang (1989), it can be shown that the optimal portfolio policies can be found by solving a partial differential equation (PDE), which has a simpler structure than the HJB equation.
8.2 Complete markets and constant investment opportunities
As discussed in Section 8.1, portfolio/consumption problems can also be analyzed using the so-called martingale approach instead of the dynamic programming approach used above. Recall that the application of the martingale approach is considerably more complex for incomplete markets, so we assume a complete market setting. We will try to get as far as possible without imposing constant investment opportunities so that we will not have to start all over when we generalize to stochastic investment opportunities.
According to Theorem 8.2, if $\varepsilon_1 > 0$, the optimal consumption rate is given by
\[ c_t^* = I_u\left( Y(W_0)e^{\delta t}\zeta_t \right) \]
and, if $\varepsilon_2 > 0$, the optimal level of terminal wealth is
\[ W^* = I_{\bar u}\left( Y(W_0)e^{\delta T}\zeta_T \right). \]
For the case of CRRA utility
\[ u(c) = \varepsilon_1\frac{c^{1-\gamma}}{1-\gamma}, \qquad \bar u(W) = \varepsilon_2\frac{W^{1-\gamma}}{1-\gamma}, \]
we have
\[ u'(c) = \varepsilon_1 c^{-\gamma}, \qquad \bar u'(W) = \varepsilon_2 W^{-\gamma} \]
with inverse functions
\[ I_u(z) = \varepsilon_1^{1/\gamma}z^{-1/\gamma}, \qquad I_{\bar u}(z) = \varepsilon_2^{1/\gamma}z^{-1/\gamma}, \]
assuming that $\varepsilon_1, \varepsilon_2 > 0$. It turns out to be useful to define a process $g = (g_t)$ by
\[ g_t = \mathrm{E}_t\left[ \int_t^T \varepsilon_1^{1/\gamma}e^{-\frac{\delta}{\gamma}(s-t)}\left( \frac{\zeta_s}{\zeta_t} \right)^{1-1/\gamma}ds + \varepsilon_2^{1/\gamma}e^{-\frac{\delta}{\gamma}(T-t)}\left( \frac{\zeta_T}{\zeta_t} \right)^{1-1/\gamma} \right]. \]
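Consequently, the function $H$ defined in (8.5) can be computed as
\[ H(\psi) = \mathrm{E}\left[ \int_0^T \zeta_t\varepsilon_1^{1/\gamma}\left( \psi e^{\delta t}\zeta_t \right)^{-1/\gamma}dt + \zeta_T\varepsilon_2^{1/\gamma}\left( \psi e^{\delta T}\zeta_T \right)^{-1/\gamma} \right] \]
\[ \phantom{H(\psi)} = \psi^{-1/\gamma}\,\mathrm{E}\left[ \int_0^T \varepsilon_1^{1/\gamma}e^{-\frac{\delta}{\gamma}t}\zeta_t^{1-\frac{1}{\gamma}}\,dt + \varepsilon_2^{1/\gamma}e^{-\frac{\delta}{\gamma}T}\zeta_T^{1-\frac{1}{\gamma}} \right] = \psi^{-1/\gamma}g_0 \]
with inverse function
\[ Y(W_0) = W_0^{-\gamma}g_0^{\gamma}. \]
Therefore, the optimal consumption policy is
\[ c_t^* = \varepsilon_1^{1/\gamma}\left( e^{\delta t}Y(W_0)\zeta_t \right)^{-1/\gamma} = \varepsilon_1^{1/\gamma}\frac{W_0}{g_0}e^{-\frac{\delta}{\gamma}t}\zeta_t^{-1/\gamma} = e^{-\frac{\delta}{\gamma}t}\zeta_t^{-1/\gamma}W_0\left( \mathrm{E}\left[ \int_0^T e^{-\frac{\delta}{\gamma}t}\zeta_t^{1-1/\gamma}\,dt + \left( \frac{\varepsilon_2}{\varepsilon_1} \right)^{1/\gamma}e^{-\frac{\delta}{\gamma}T}\zeta_T^{1-1/\gamma} \right] \right)^{-1}, \tag{8.8} \]
and the optimal terminal wealth level is
\[ W^* = \varepsilon_2^{1/\gamma}\left( e^{\delta T}Y(W_0)\zeta_T \right)^{-1/\gamma} = \varepsilon_2^{1/\gamma}\frac{W_0}{g_0}e^{-\frac{\delta}{\gamma}T}\zeta_T^{-1/\gamma} = e^{-\frac{\delta}{\gamma}T}\zeta_T^{-1/\gamma}W_0\left( \mathrm{E}\left[ \int_0^T \left( \frac{\varepsilon_1}{\varepsilon_2} \right)^{1/\gamma}e^{-\frac{\delta}{\gamma}t}\zeta_t^{1-1/\gamma}\,dt + e^{-\frac{\delta}{\gamma}T}\zeta_T^{1-1/\gamma} \right] \right)^{-1}. \]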
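The wealth process under the optimal policy is given by
\[ W_t^* = \frac{1}{\zeta_t}\mathrm{E}_t\left[ \int_t^T \zeta_s c_s^*\,ds + \zeta_T W^* \right] \]
\[ \phantom{W_t^*} = \frac{W_0}{g_0}\frac{1}{\zeta_t}\mathrm{E}_t\left[ \int_t^T \varepsilon_1^{1/\gamma}e^{-\frac{\delta}{\gamma}s}\zeta_s^{1-\frac{1}{\gamma}}\,ds + \varepsilon_2^{1/\gamma}e^{-\frac{\delta}{\gamma}T}\zeta_T^{1-\frac{1}{\gamma}} \right] \]
\[ \phantom{W_t^*} = \frac{W_0}{g_0}e^{-\frac{\delta}{\gamma}t}\zeta_t^{-1/\gamma}\,\mathrm{E}_t\left[ \int_t^T \varepsilon_1^{1/\gamma}e^{-\frac{\delta}{\gamma}(s-t)}\left( \frac{\zeta_s}{\zeta_t} \right)^{1-\frac{1}{\gamma}}ds + \varepsilon_2^{1/\gamma}e^{-\frac{\delta}{\gamma}(T-t)}\left( \frac{\zeta_T}{\zeta_t} \right)^{1-\frac{1}{\gamma}} \right] \]
\[ \phantom{W_t^*} = \frac{W_0}{g_0}e^{-\frac{\delta}{\gamma}t}\zeta_t^{-1/\gamma}g_t. \tag{8.9} \]
Consequently,
\[ \frac{W_t^*}{g_t} = \frac{W_0}{g_0}e^{-\frac{\delta}{\gamma}t}\zeta_t^{-1/\gamma}. \]
We see immediately from (8.8) that we can rewrite the optimal time $t$ consumption rate as
\[ c_t^* = \varepsilon_1^{1/\gamma}\frac{W_t^*}{g_t} \]
so that $g_t$ is proportional to the optimal wealth-to-consumption ratio. Moreover, for $s > t$, we have
\[ c_s^* = \frac{W_0}{g_0}\varepsilon_1^{1/\gamma}e^{-\frac{\delta}{\gamma}s}\zeta_s^{-1/\gamma} = \frac{W_0}{g_0}\varepsilon_1^{1/\gamma}e^{-\frac{\delta}{\gamma}t}\zeta_t^{-1/\gamma}e^{-\frac{\delta}{\gamma}(s-t)}\left( \frac{\zeta_s}{\zeta_t} \right)^{-1/\gamma} = \frac{W_t^*}{g_t}\varepsilon_1^{1/\gamma}e^{-\frac{\delta}{\gamma}(s-t)}\left( \frac{\zeta_s}{\zeta_t} \right)^{-1/\gamma}, \tag{8.10} \]
which states the uncertain consumption rate at time $s$ given information available at time $t$. Similarly, we can express the optimal terminal wealth as
\[ W^* = \frac{W_t^*}{g_t}\varepsilon_2^{1/\gamma}e^{-\frac{\delta}{\gamma}(T-t)}\left( \frac{\zeta_T}{\zeta_t} \right)^{-1/\gamma}. \tag{8.11} \]
The indirect utility at time $t$ is
\[ J_t = \mathrm{E}_t\left[ \int_t^T e^{-\delta(s-t)}u(c_s^*)\,ds + e^{-\delta(T-t)}\bar u(W^*) \right] \]
\[ \phantom{J_t} = \frac{1}{1-\gamma}\mathrm{E}_t\left[ \int_t^T e^{-\delta(s-t)}\varepsilon_1(c_s^*)^{1-\gamma}\,ds + e^{-\delta(T-t)}\varepsilon_2(W^*)^{1-\gamma} \right] \]
\[ \phantom{J_t} = \frac{1}{1-\gamma}\left( \frac{W_t^*}{g_t} \right)^{1-\gamma}\mathrm{E}_t\left[ \int_t^T e^{-\frac{\delta}{\gamma}(s-t)}\varepsilon_1^{1/\gamma}\left( \frac{\zeta_s}{\zeta_t} \right)^{1-1/\gamma}ds + e^{-\frac{\delta}{\gamma}(T-t)}\varepsilon_2^{1/\gamma}\left( \frac{\zeta_T}{\zeta_t} \right)^{1-1/\gamma} \right] \]
\[ \phantom{J_t} = \frac{1}{1-\gamma}g_t^{\gamma}(W_t^*)^{1-\gamma}, \]
where the third equality is due to (8.10) and (8.11), whereas the last equality follows from the definition of $g_t$.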
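The equations above are generally valid for CRRA utility. Now let us specialize to the case of constant investment opportunities, where the state-price deflator is
\[ \zeta_t = e^{-rt - \lambda^\top z_t - \frac{1}{2}\|\lambda\|^2 t}. \]
Consequently, future values of the state-price deflator are lognormally distributed. Note that for any $s > t$, we have$^1$
\[ \mathrm{E}_t\left[ e^{-\frac{\delta}{\gamma}(s-t)}\left( \frac{\zeta_s}{\zeta_t} \right)^{1-1/\gamma} \right] = \mathrm{E}_t\left[ e^{-\frac{\delta}{\gamma}(s-t)}\left( e^{-r(s-t) - \lambda^\top(z_s - z_t) - \frac{1}{2}\|\lambda\|^2(s-t)} \right)^{1-\frac{1}{\gamma}} \right] \]
\[ = e^{-\frac{\delta}{\gamma}(s-t)}e^{-(1-\frac{1}{\gamma})r(s-t) - \frac{1}{2}(1-\frac{1}{\gamma})\|\lambda\|^2(s-t)}\,\mathrm{E}_t\left[ e^{-(1-\frac{1}{\gamma})\lambda^\top(z_s - z_t)} \right] \]
\[ = e^{-\frac{\delta}{\gamma}(s-t)}e^{-(1-\frac{1}{\gamma})r(s-t) - \frac{1}{2}(1-\frac{1}{\gamma})\|\lambda\|^2(s-t)}e^{\frac{1}{2}(1-\frac{1}{\gamma})^2\|\lambda\|^2(s-t)} \]
\[ = \exp\left\{ -\left( \frac{\delta + r(\gamma-1)}{\gamma} + \frac{1}{2}\frac{\gamma-1}{\gamma^2}\|\lambda\|^2 \right)(s-t) \right\} = e^{-A[s-t]}, \]
where $A$ is again the constant given by (6.11). Now we can compute $g_t$ in closed form:
\[ g_t = \mathrm{E}_t\left[ \int_t^T \varepsilon_1^{1/\gamma}e^{-\frac{\delta}{\gamma}(s-t)}\left( \frac{\zeta_s}{\zeta_t} \right)^{1-1/\gamma}ds + \varepsilon_2^{1/\gamma}e^{-\frac{\delta}{\gamma}(T-t)}\left( \frac{\zeta_T}{\zeta_t} \right)^{1-1/\gamma} \right] \]
\[ \phantom{g_t} = \int_t^T \varepsilon_1^{1/\gamma}\,\mathrm{E}_t\left[ e^{-\frac{\delta}{\gamma}(s-t)}\left( \frac{\zeta_s}{\zeta_t} \right)^{1-1/\gamma} \right]ds + \varepsilon_2^{1/\gamma}\,\mathrm{E}_t\left[ e^{-\frac{\delta}{\gamma}(T-t)}\left( \frac{\zeta_T}{\zeta_t} \right)^{1-1/\gamma} \right] \]
\[ \phantom{g_t} = \int_t^T \varepsilon_1^{1/\gamma}e^{-A[s-t]}\,ds + \varepsilon_2^{1/\gamma}e^{-A[T-t]} = \frac{1}{A}\left( \varepsilon_1^{1/\gamma} + \left[ A\varepsilon_2^{1/\gamma} - \varepsilon_1^{1/\gamma} \right]e^{-A[T-t]} \right), \]
which is deterministic and identical to the function $g(t)$ defined in (6.12). Hence, for the case of constant investment opportunities, the formulas for the optimal consumption rate and the indirect utility derived above coincide with the results obtained by use of the dynamic programming approach.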
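The closed form for the deterministic function $g$ is easy to check against a direct numerical integration. The sketch below is an illustration only: it assumes $A \neq 0$, writes `eps1` and `eps2` for the utility weights on consumption and terminal wealth, takes the squared norm of the market price of risk as an input `lam_sq`, and uses hypothetical parameter values.

```python
import math


def merton_A(delta, r, gamma, lam_sq):
    """A = (delta + r*(gamma-1))/gamma + 0.5*(gamma-1)/gamma^2 * ||lambda||^2."""
    return (delta + r * (gamma - 1.0)) / gamma + 0.5 * (gamma - 1.0) / gamma ** 2 * lam_sq


def g_closed(t, T, A, eps1, eps2, gamma):
    """Closed form (1/A) * (e1 + (A*e2 - e1) * exp(-A*(T-t))), assuming A != 0."""
    e1, e2 = eps1 ** (1.0 / gamma), eps2 ** (1.0 / gamma)
    return (e1 + (A * e2 - e1) * math.exp(-A * (T - t))) / A


def g_quadrature(t, T, A, eps1, eps2, gamma, n=10_000):
    """Midpoint rule for the integral of e1*exp(-A*(s-t)) plus the terminal term."""
    e1, e2 = eps1 ** (1.0 / gamma), eps2 ** (1.0 / gamma)
    h = (T - t) / n
    integral = h * sum(math.exp(-A * (i + 0.5) * h) for i in range(n))
    return e1 * integral + e2 * math.exp(-A * (T - t))


A = merton_A(delta=0.03, r=0.02, gamma=4.0, lam_sq=0.09)
print(g_closed(0.0, 10.0, A, 1.0, 1.0, 4.0), g_quadrature(0.0, 10.0, A, 1.0, 1.0, 4.0))
```

At the horizon the function collapses to the terminal-wealth weight raised to the power $1/\gamma$, which is a convenient sanity check.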
It remains to derive the optimal investment strategy. The optimal wealth process is given in (8.9). Since we know by now that $g_t$ is deterministic, the only stochastic process on the right-hand side is the state-price deflator $\zeta_t$. With constant investment opportunities the dynamics of the state-price deflator is
\[ d\zeta_t = -\zeta_t\left[ r\,dt + \lambda^\top dz_t \right]. \]
Applying Itô's Lemma we can now derive the dynamics of the optimal wealth. Focusing on the volatility term, we get
\[ dW_t^* = \dots\,dt - \frac{1}{\gamma}\frac{W_t^*}{\zeta_t}\,d\zeta_t = \dots\,dt + W_t^*\frac{1}{\gamma}\lambda^\top dz_t. \]
If we compare with the dynamics of the wealth for any given investment strategy $\pi = (\pi_t)$ stated in (6.1), we see that the optimal wealth process is obtained with the investment strategy
\[ \pi_t = \frac{1}{\gamma}\left( \sigma^\top \right)^{-1}\lambda, \]
as we found out using the dynamic programming approach.
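For concrete numbers, the constant-opportunity portfolio can be computed from the expected returns, the risk-free rate, and the volatility matrix. The following sketch assumes the market price of risk is obtained as $\sigma^{-1}(\mu - r\mathbf{1})$ and the weights as $\frac{1}{\gamma}(\sigma^\top)^{-1}\lambda$; the small linear solver, the function names, and the two-asset inputs are hypothetical and chosen only to keep the example free of external libraries.

```python
def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda i: abs(m[i][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for i in range(col + 1, n):
            f = m[i][col] / m[col][col]
            for j in range(col, n + 1):
                m[i][j] -= f * m[col][j]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x


def merton_weights(mu, r, sigma, gamma):
    """Portfolio weights (1/gamma) * (sigma^T)^{-1} * lambda."""
    excess = [m_i - r for m_i in mu]
    lam = solve(sigma, excess)                     # lambda = sigma^{-1}(mu - r 1)
    sigma_T = [list(col) for col in zip(*sigma)]   # transpose of sigma
    return [x / gamma for x in solve(sigma_T, lam)]


mu = [0.08, 0.05]
sigma = [[0.20, 0.00], [0.06, 0.08]]
weights = merton_weights(mu, r=0.02, sigma=sigma, gamma=2.0)
print(weights)
```

The weights are inversely proportional to the risk aversion, so doubling `gamma` halves every position in the risky assets.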
$^1$ The third equality is due to the following result: For a random variable $x \sim N(m,s^2)$, $\mathrm{E}[e^{-ax}] = e^{-am + \frac{1}{2}a^2s^2}$. In our case $a = 1 - \frac{1}{\gamma}$ and $x = \lambda^\top(z_s - z_t) = \sum_{i=1}^d \lambda_i(z_{is} - z_{it})$ is normally distributed with mean zero and variance $\sum_{i=1}^d \lambda_i^2(s-t) = \|\lambda\|^2(s-t)$.
8.3 Complete markets and stochastic investment opportunities
In this section we will apply the martingale approach to solve the consumption/portfolio problem in a situation with stochastic investment opportunities. The martingale approach was introduced in Section 8.1. In Section 8.2 we used the martingale approach to solve the consumption-portfolio problem of a CRRA investor in the case of constant investment opportunities. Also in this section we will assume complete markets and CRRA preferences for both intermediate consumption and terminal wealth corresponding to $\varepsilon_1 = \varepsilon_2 = 1$.
We know already from Section 8.2 that the optimal time $t$ consumption rate is
\[ c_t^* = \frac{W_0}{g_0}e^{-\frac{\delta}{\gamma}t}\zeta_t^{-1/\gamma} = \frac{W_t^*}{g_t}, \]
where $W_t^*$ is the wealth at time $t$ if the optimal strategies are pursued, and the process $g = (g_t)$ is defined by
\[ g_t = \mathrm{E}_t\left[ \int_t^T e^{-\frac{\delta}{\gamma}(s-t)}\left( \frac{\zeta_s}{\zeta_t} \right)^{1-1/\gamma}ds + e^{-\frac{\delta}{\gamma}(T-t)}\left( \frac{\zeta_T}{\zeta_t} \right)^{1-1/\gamma} \right]. \]
The optimal terminal wealth level is
\[ W^* = \frac{W_0}{g_0}e^{-\frac{\delta}{\gamma}T}\zeta_T^{-1/\gamma}. \]
The indirect utility at time $t$ is
\[ J_t = \frac{1}{1-\gamma}g_t^{\gamma}(W_t^*)^{1-\gamma}. \]
Furthermore, the wealth process under the optimal policy is given by
\[ W_t^* = \frac{W_0}{g_0}e^{-\frac{\delta}{\gamma}t}\zeta_t^{-1/\gamma}g_t. \]
If $r$ and $\lambda$ are constant, $g_t$ is a deterministic function of time and the optimal investment strategy is given in Section 8.2. If the investment opportunities are stochastic in the sense that $r$ or $\lambda$ or both are stochastic processes, then $g$ is a stochastic process. Write the dynamics of $g$ as
\[ dg_t = g_t\left[ \mu_{gt}\,dt + \sigma_{gt}^\top dz_t \right], \]
for some drift process $\mu_g = (\mu_{gt})$ and some sensitivity process $\sigma_g = (\sigma_{gt})$. The optimal wealth is a function of $t$, $\zeta_t$, and $g_t$. Recall that the dynamics of the state-price deflator $\zeta_t$ is
\[ d\zeta_t = -\zeta_t\left[ r_t\,dt + \lambda_t^\top dz_t \right]. \]
An application of Itô's Lemma gives that the dynamics of optimal wealth is
\[ dW_t^* = \dots\,dt - \frac{1}{\gamma}\frac{W_t^*}{\zeta_t}\,d\zeta_t + \frac{W_t^*}{g_t}\,dg_t = \dots\,dt + W_t^*\left( \frac{1}{\gamma}\lambda_t + \sigma_{gt} \right)^\top dz_t. \]
Comparing with the dynamics of wealth for any given portfolio, we can conclude that an optimal investment strategy is
\[ \pi_t = \frac{1}{\gamma}\left( \sigma_t^\top \right)^{-1}\lambda_t + \left( \sigma_t^\top \right)^{-1}\sigma_{gt}. \]
This result was first derived by Munk and Sørensen (2004). It is a natural generalization of the results obtained in Markov settings using the dynamic programming approach. The hedge term of the portfolio is matching the volatility of the process $g$, which is important for consumption. Looking at the definition of $g$, we can see that only variations in the state-price deflator, i.e., in interest rates and market prices of risk, will be hedged. This is also in line with findings in Markov set-ups. Of course, $\sigma_g$ has to be identified in order for this result to be of practical relevance. This is possible in many concrete cases, primarily cases with Markov dynamics where the dynamic programming approach also applies, i.e., in affine or quadratic diffusion models. But Munk and Sørensen (2004) consider a relevant and non-trivial example with non-Markov dynamics.
For investors with logarithmic utility ($\gamma = 1$), we see that the process $(g_t)$ is always deterministic so that the volatility $\sigma_g$ is zero. The optimal portfolio of a log investor is therefore
\[ \pi_t = \left( \sigma_t^\top \right)^{-1}\lambda_t \]
as has already been shown for Markov settings.
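The split of the optimal portfolio into a speculative part and a hedge part can be made concrete in the one-asset case, where the matrix inverse reduces to a division. The sketch below assumes a single risky asset with volatility `sigma`, market price of risk `lam`, and a given sensitivity `sigma_g` of the process $g$; identifying `sigma_g` is the hard part in practice and is simply taken as an input here. Names and numbers are hypothetical.

```python
def portfolio_components(gamma, lam, sigma, sigma_g=0.0):
    """Return (speculative, hedge) weights for one risky asset.

    speculative = lam / (gamma * sigma), hedge = sigma_g / sigma.
    """
    speculative = lam / (gamma * sigma)
    hedge = sigma_g / sigma
    return speculative, hedge


# Log investor: g is deterministic, so sigma_g = 0 and there is no hedge demand.
log_spec, log_hedge = portfolio_components(gamma=1.0, lam=0.3, sigma=0.2)

# A more risk-averse investor with a hypothetical non-zero sensitivity of g.
spec, hedge = portfolio_components(gamma=4.0, lam=0.3, sigma=0.2, sigma_g=0.05)
total_weight = spec + hedge
print(log_spec, log_hedge, total_weight)
```

Only the speculative part scales with $1/\gamma$; the hedge part depends on risk aversion indirectly, through the sensitivity of $g$.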
8.4 The martingale approach with portfolio constraints
This note provides a short introduction to the martingale approach to dynamic consumption and portfolio choice problems in the case with constraints on the allowed portfolios. For details and further results, see the original work by He and Pearson (1991), Karatzas, Lehoczky, Shreve, and Xu (1991), Cvitanić and Karatzas (1992), Xu and Shreve (1992a, 1992b), Cuoco (1997), and Munk (1997b, Ch. 3), as well as the textbook presentations by Korn (1997, Ch. 4) and Karatzas and Shreve (1998, Ch. 6). Warning: all these references employ a lot of high-level mathematics.
8.4.1 A general representation of portfolio constraints
We consider a financial market where $d+1$ assets can potentially be traded, possibly with some constraints on the portfolios allowed. One of the assets will be denoted by asset 0 and represents a locally risk-free asset with return process $r = (r_t)$, i.e., price process
\[ P_{0t} = \exp\left\{ \int_0^t r_u\,du \right\}. \]
The other $d$ assets are risky with prices given by the vector $P_t = (P_{1t},\dots,P_{dt})^\top$ satisfying
\[ dP_t = \operatorname{diag}(P_t)\left[ \mu_t\,dt + \sigma_t\,dz_t \right], \]
where $z_t$ is a $d$-dimensional standard Brownian motion. $\sigma_t$ is assumed to have full rank $d$, implying the dynamic completeness of the market, at least potentially. None of the assets pay dividends over the period $[0,T]$ of interest to the investor considered below. Alternatively, we can think of $P_{it}$ as the time $t$ value that is obtained by purchasing one unit of asset $i$ at time 0 and reinvesting any dividends received from asset $i$ by purchasing additional units of the same asset.
A trading strategy is a pair $(\theta_0,\theta)$, where $\theta_0$ is a one-dimensional (adapted) and $\theta = (\theta_1,\dots,\theta_d)^\top$ is a $d$-dimensional (progressively measurable) stochastic process. $\theta_{0t}$ denotes the dollar amount invested in the savings account at time $t$. $\theta_{it}$ is the dollar amount invested at time $t$ in the $i$th risky asset, $i = 1,\dots,d$.
Let $K$ be a non-empty, closed, convex subset of $\mathbb{R}^{d+1}$. A trading strategy $(\theta_0,\theta)$ is called $K$-admissible if $(\theta_{0t},\theta_t)^\top \in K$ for all $t\in[0,T]$ and all states and $(\theta_0,\theta)$ satisfies some integrability conditions ensuring that the value of the trading strategy is well-defined. $K$ is called the portfolio constraint set. Various interesting specifications of $K$ are listed below. The set of $K$-admissible trading strategies is denoted by $\mathcal{P}(K)$. A consumption process is a non-negative (progressively measurable) process $c$ in $L^1[0,T]$. The set of consumption processes is denoted by $\mathcal{C}$.
Given a trading strategy $(\theta_0,\theta)\in\mathcal{P}(K)$ and a consumption process $c\in\mathcal{C}$, the dynamics of the investor's wealth $W_t = W_t^{\theta_0,\theta,c}$ is
\[ dW_t = \left( \theta_{0t}r_t + \theta_t^\top\mu_t + y_t - c_t \right)dt + \theta_t^\top\sigma_t\,dz_t. \tag{8.12} \]
Initial wealth is $W_0 = w$. Here $y$ is a non-negative (progressively measurable) stochastic process representing the endowment stream of the agent, e.g., labor income. Since $\theta_{0t} = W_t - \theta_t^\top\mathbf{1}$, we can rewrite the wealth dynamics as
\[ dW_t = \left[ r_tW_t + \theta_t^\top(\mu_t - r_t\mathbf{1}) + y_t - c_t \right]dt + \theta_t^\top\sigma_t\,dz_t, \]
which does not involve $\theta_0$ explicitly. Note, however, that there may be constraints on the investment in the instantaneously risk-free asset.
A triple $(\theta_0,\theta,c)$ is called $K$-admissible given the initial wealth $w$ if
(i) $(\theta_0,\theta)\in\mathcal{P}(K)$, $c\in\mathcal{C}$,
(ii) $W_t^{\theta_0,\theta,c} \ge -K$ at all times $t\le T$ for some positive constant $K$,
(iii) $W_T^{\theta_0,\theta,c} \ge 0$.
Let $\mathcal{A}(w;K)$ denote the set of triples $(\theta_0,\theta,c)$ which are $K$-admissible with initial wealth $w$.
In some situations, it is advantageous to let the agent choose a terminal wealth $W$ directly instead of choosing a trading strategy $(\theta_0,\theta)$. A consumption/terminal wealth pair $(c,W)$, where $c\in\mathcal{C}$ and $W$ is a non-negative $\mathcal{F}_T$-measurable random variable with finite expectations, is called $K$-admissible with initial wealth $w$ if there exists a trading strategy $(\theta_0,\theta)$ such that $(\theta_0,\theta,c)$ is $K$-admissible with $W_0^{\theta_0,\theta,c} = w$ and $W_T^{\theta_0,\theta,c} = W$. In that case $(\theta_0,\theta)$ is said to finance $(c,W)$. Let $\mathcal{A}'(w;K)$ denote the set of $K$-admissible consumption/terminal wealth pairs $(c,W)$. Clearly, if $(\theta_0,\theta,c)\in\mathcal{A}(w;K)$, then $(c,W_T^{\theta_0,\theta,c})\in\mathcal{A}'(w;K)$.
Note that we can model situations where the endowment stream is not spanned by traded assets, i.e., where $y$ is not adapted to the filtration generated by traded assets, by letting $y$ depend on, say, $P_d$ and then restricting the investor to a policy with values in (a subset of) $\mathbb{R}\times\mathbb{R}^{d-1}\times\{0\}$.
By restricting the individuals to $K$-admissible processes, a number of interesting situations can be examined. It turns out that the so-called support function of $K$ plays an important role. Let $\nu = (\nu_0,\nu_-)\in\mathbb{R}\times\mathbb{R}^d$. Then the support function $\delta:\mathbb{R}^{d+1}\to\mathbb{R}\cup\{-\infty,+\infty\}$ of $K$ is defined by
\[ \delta(\nu) = \sup_{(\theta_0,\theta)\in K}\, -\left( \theta_0\nu_0 + \theta^\top\nu_- \right). \]
The effective domain of $\delta$, i.e. the set of $\nu\in\mathbb{R}^{d+1}$ for which $\delta(\nu) < \infty$, is denoted by $\tilde K$. Next, we list a few interesting properties of $\delta$ and $\tilde K$. See, e.g., Rockafellar (1970, Sect. 13) for more on support functions.
(i) $\tilde K$ is a closed convex cone$^2$, called the barrier cone of $K$.
(ii) If $K$ is a cone, then $\delta \equiv 0$ on $\tilde K$.
(iii) $\delta$ is sub-additive, that is
\[ \delta(\nu_1) + \delta(\nu_2) \ge \delta(\nu_1 + \nu_2), \]
which follows from the corresponding property of the supremum operator.
(iv) If $(\theta_0,\theta)\in K$ and $\nu\in\tilde K$, then
\[ \theta_0\nu_0 + \theta^\top\nu_- + \delta(\nu) \ge 0. \tag{8.13} \]
Of course, this follows trivially from the definition of $\delta$.
$^2$ A set $D\subseteq\mathbb{R}^N$ is called a cone if $\alpha x\in D$ whenever $x\in D$ and $\alpha > 0$.
It turns out that we need to impose the following assumption on $K$.
Assumption 8.1. $K$ is such that $\delta$ is bounded from above on $\tilde K$, or, equivalently, $\delta$ is non-positive on $\tilde K$ and $\nu_0 \ge 0$ for all $\nu\in\tilde K$.
Note that we are considering constraints on the amounts invested in the different assets. Cvitanić and Karatzas (1992) started all this, but considered constraints on portfolio weights, which is less general than constraints on amounts invested. Munk (1997b) extended/adapted the results of Cvitanić and Karatzas (1992) to constraints on amounts invested, which is particularly important to cover labor income where portfolio weights might not be well-defined. Here are examples of interesting constraint sets:
Example 8.1. [Complete market] A complete market corresponds to having K = R
d+1
. This
implies that

K = {0}
d+1
and () = 0 for all

K. This is the standard market structure,
in which (in various degrees of generality) consumption/portfolio problems are studied by, e.g.,
Merton (1969, 1971), Karatzas, Lehoczky, and Shreve (1987), and Cox and Huang (1989, 1991).
2
Example 8.2. [Non-traded assets] A situation where there are only m < d tradable risky assets,
but otherwise no constraints on the tradable assets, can be modeled by letting K = R R
m

{0}
dm
. In that case,

K = {0} {0}
m
R
dm
and () = 0 on

K. 2
Example 8.3. [Short-sale constraints] To model prohibition of short-selling the risky assets number
m+1, . . . , d, let K = RR
m
R
dm
+
. Then

K = {0} {0}
m
R
dm
+
and again () = 0 on

K.
2
Example 8.4. [Buying constraints] With K = R R
m
R
dm

, the investor is not allowed to


have positive amounts invested in the last d m risky assets. Then

K = {0} {0}
m
R
dm

and
() = 0 on

K. 2
Example 8.5. [Portfolio mix constraints] $K=\{(\theta_0,\theta)\in\mathbb{R}^{d+1} \mid x\equiv\theta_0+\theta^\top\mathbf{1}\ge 0 \text{ and } v\in\hat K(x)\}$,
where $\hat K(x)$ is some non-empty, closed, convex subset of $\mathbb{R}^d$ containing the origin, and $v=\theta/x$
for $x>0$ and $v=0$ for $x=0$, models a portfolio mix constraint. In this case
$$\tilde K=\{\nu\in\mathbb{R}^{d+1} \mid \theta_0\nu_0+\theta^\top\nu\ge 0,\ \forall(\theta_0,\theta)\in K\}$$
and $\delta(\nu)=0$ on $\tilde K$. □
Example 8.6. [Collateral constraints] With $K=\{(\theta_0,\theta)\in\mathbb{R}^{d+1} \mid \alpha^\top(\theta_0,\theta)\ge 0\}$, where
$\alpha\in[0,1]^{d+1}$, we can model the situation where, using the $j$th security ($j=0,1,\dots,d$) as collateral,
it is only possible to borrow the fraction $\alpha_j$ of its value. In this case $\tilde K=\mathbb{R}_+\alpha=\{k\alpha \mid k\ge 0\}$ and
$\delta(\nu)=0$ on $\tilde K$. □
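Example 8.7. [Minimum capital requirements] Let $K=\{(\theta_0,\theta)\in\mathbb{R}^{d+1} \mid \theta_0+\theta^\top\mathbf{1}\ge k\}$, where
$k\in\mathbb{R}_+$. Then $\tilde K=\mathbb{R}_+\mathbf{1}_{d+1}=\{(\kappa,\dots,\kappa)\in\mathbb{R}^{d+1} \mid \kappa\ge 0\}$, and $\delta(\nu)=-k\nu_0$ for $(\nu_0,\nu)\in\tilde K$.
The special minimum capital requirement $k=0$ represents a borrowing constraint. □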
Example 8.8. [Combinations of constraints] Any combination of the above constraints, i.e., where
$K$ is the intersection of some of the $K$'s of the previous examples. □
8.4.2 The problem to solve
The general utility maximization problem to solve is
$$J(w)=\sup_{(\theta_0,\theta,c)\in\mathcal{A}(w;K)} V^{\theta_0,\theta,c}(w),$$
where
$$V^{\theta_0,\theta,c}(w)=E\left[\int_0^T U_1(c_s,s)\,ds+U_2\left(W_T^{\theta_0,\theta,c},T\right)\right]$$
and it is understood that the wealth process starts at $W_0^{\theta_0,\theta,c}=w$. Equivalently, we can solve
$$J(w)=\sup_{(c,W)\in\mathcal{A}'(w;K)} V^{c,W}(w),$$
where
$$V^{c,W}(w)=E\left[\int_0^T U_1(c_s,s)\,ds+U_2(W,T)\right].$$
We assume that the utility functions $U_1(\cdot,t)$ and $U_2(\cdot,T)$ have infinite marginal utility at zero, i.e.,
$U_1'(0,t)\equiv\lim_{c\to 0}U_1'(c,t)=\infty$ and similarly $U_2'(0,T)\equiv\lim_{W\to 0}U_2'(W,T)=\infty$. A technical aside:
we have to modify the definition of the set of admissible policies such that now $\mathcal{A}(w;K)$ denotes
the set of strategies $(\theta_0,\theta,c)$ which are admissible in the sense explained above and, further, satisfy
the condition
$$E\left[\int_0^T U_1(c_t,t)^-\,dt+U_2\left(W_T^{\theta_0,\theta,c},T\right)^-\right]<\infty,$$
where $X^-=\max\{0,-X\}$ denotes the negative part of $X$, and similarly for $\mathcal{A}'(w;K)$.
8.4.3 Auxiliary unconstrained problems
We will define a set of artificial, auxiliary unconstrained markets. Given a process $(\nu_0,\nu)$, where
$(\nu_{0t},\nu_t)\in\tilde K$ for any $t\in[0,T]$, we define a market $M_\nu$ where the short-term risk-free rate, the
expected returns on the risky assets, and the income rate are perturbed relative to the true market:
(i) the risk-free rate at time $t$ is $r_t+\nu_{0t}$,
(ii) the drift vector of the risky asset prices is $\mu_t+\nu_t$,
(iii) the income rate is $y_t+\delta(\nu_t)$.
There are no portfolio constraints in the artificial market $M_\nu$, i.e., it is a complete market. The
unique market price of risk is
$$\lambda_t^\nu=\sigma_t^{-1}\left(\mu_t+\nu_t-(r_t+\nu_{0t})\mathbf{1}\right),$$
the change of measure to the unique risk-neutral measure $Q^\nu$ is captured by $dQ^\nu/dP=Z_T^\nu$, where
$$Z_t^\nu=\exp\left\{-\int_0^t(\lambda_s^\nu)^\top dz_s-\frac{1}{2}\int_0^t(\lambda_s^\nu)^\top\lambda_s^\nu\,ds\right\},$$
and the unique state-price deflator is given by
$$\zeta_t^\nu=\exp\left\{-\int_0^t(r_s+\nu_{0s})\,ds\right\}Z_t^\nu.$$
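In general, $Z^\nu$ is a local martingale. For technical reasons, we have to restrict ourselves to $\nu$'s for
which $Z^\nu$ is a true martingale. Let $\mathcal{N}$ be the set of such processes $\nu$, i.e.,
$$\mathcal{N}=\left\{\nu\in L^2[0,T] \mid \nu(t,\omega)\in\tilde K,\ \forall(t,\omega)\in[0,T]\times\Omega, \text{ and } Z^\nu \text{ is a martingale}\right\}.$$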
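The wealth process in the auxiliary market $M_\nu$ corresponding to any investment/consumption
policy $(\theta_0,\theta,c)$ is the process $W_\nu^{\theta_0,\theta,c}$ given by
$$\begin{aligned}
dW_{\nu,t}^{\theta_0,\theta,c}&=\left(\theta_{0t}[r_t+\nu_{0t}]+\theta_t^\top[\mu_t+\nu_t]\right)dt-\left(c_t-y_t-\delta(\nu_t)\right)dt+\theta_t^\top\sigma_t\,dz_t\\
&=\left(\theta_{0t}r_t+\theta_t^\top\mu_t\right)dt-(c_t-y_t)\,dt+\theta_t^\top\sigma_t\,dz_t+\left(\delta(\nu_t)+\theta_{0t}\nu_{0t}+\theta_t^\top\nu_t\right)dt.
\end{aligned} \tag{8.14}$$
Note that, from (8.13),
$$\delta(\nu_t)+\theta_{0t}\nu_{0t}+\theta_t^\top\nu_t\ge 0,$$
so a comparison of (8.14) and (8.12) yields that
$$W_{\nu,t}^{\theta_0,\theta,c}\ge W_t^{\theta_0,\theta,c} \tag{8.15}$$
path-by-path: following a given strategy you will always end up with at least as high a terminal
wealth in any artificial market as in the true market.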
A triple $(\theta_0,\theta,c)$ consisting of a trading strategy $(\theta_0,\theta)$ and a consumption process $c$ is called
admissible in $M_\nu$ [with initial wealth $w$] if $(\theta_0,\theta,c)$ and $W_\nu^{\theta_0,\theta,c}$ satisfy the same conditions as a
$K$-admissible triple in the original market except for the requirement $(\theta_{0t},\theta_t)\in K$, $\forall t$. The set of
triples $(\theta_0,\theta,c)$ admissible in $M_\nu$ is denoted $\mathcal{A}_\nu(w)$, i.e.,
A

(w) =
_
(
0
, , c) P(R
d+1
) C

0
,,c
t
K, t [0, T], W

0
,,c
T
0, and
E
_
_
T
0
U
1
(c
t
, t)

dt +U
2
(W

0
,,c
T
, T)

_
<
_
.
The unconstrained utility maximization problem in $M_\nu$ is
$$J_\nu(w)=\sup_{(\theta_0,\theta,c)\in\mathcal{A}_\nu(w)} V^{\theta_0,\theta,c}(w).$$
We let $(\theta_0^\nu,\theta^\nu,c^\nu)$ denote the optimal strategy in the market $M_\nu$, i.e., $J_\nu(w)=V^{\theta_0^\nu,\theta^\nu,c^\nu}(w)$. As
before, we can also maximize over consumption and terminal wealth:
$$J_\nu(w)=\sup_{(c,W)\in\mathcal{A}'_\nu(w)} V^{c,W}(w).$$
Let $(c^\nu,W^\nu)$ denote the optimal consumption process and terminal wealth in the market $M_\nu$, i.e.,
$J_\nu(w)=V^{c^\nu,W^\nu}(w)$. Admissibility means budget-feasible in the sense that
$$E\left[\int_0^T\zeta_t^\nu\left(c_t-y_t-\delta(\nu_t)\right)dt+\zeta_T^\nu W\right]\le w,$$
plus some technical integrability conditions.
8.4.4 Linking the artificial markets to the true market
Due to (8.15), we can conclude that $(\theta_0,\theta,c)\in\mathcal{A}(w;K)\Rightarrow(\theta_0,\theta,c)\in\mathcal{A}_\nu(w)$. Consequently,
$$J(w)\le J_\nu(w),\quad\forall\nu\in\mathcal{N}. \tag{8.16}$$
The indirect utility obtainable in any of the artificial markets is at least as high as the indirect
utility in the true market. The main result of Cvitanic and Karatzas (1992) and Munk (1997b,
Ch. 3) is to provide the following four ways to characterize optimality in the true market via the
artificial markets:
1. Minimality of $\hat\nu$: The optimal trading strategy in an artificial market is not necessarily
$K$-valued and is therefore not necessarily admissible in the true market. If we can find an
artificial market $M_{\hat\nu}$ in which the optimal strategy $(\theta_0^{\hat\nu},\theta^{\hat\nu},c^{\hat\nu})$ is also admissible in the true
market, then it is clear that
$$J(w)\ge V^{\theta_0^{\hat\nu},\theta^{\hat\nu},c^{\hat\nu}}(w)=J_{\hat\nu}(w).$$
Combining that with (8.16), we can conclude that
$$J(w)=J_{\hat\nu}(w)=V^{\theta_0^{\hat\nu},\theta^{\hat\nu},c^{\hat\nu}}(w)$$
so that $(\theta_0^{\hat\nu},\theta^{\hat\nu},c^{\hat\nu})$ is the optimal strategy also in the true market. It is clear that $J(w)=J_{\hat\nu}(w)$
can only be satisfied in the least favorable artificially unconstrained market, i.e., we
should minimize the indirect utility over all artificial markets.
2. Financiability of $(c^{\hat\nu},W^{\hat\nu})$: Suppose we can find a $\hat\nu$ so that the optimal consumption and
terminal wealth $(c^{\hat\nu},W^{\hat\nu})$ is financed by a trading strategy $(\theta_0^{\hat\nu},\theta^{\hat\nu})$, which is $K$-valued and
satisfies
$$\delta(\hat\nu_t)+\theta_{0t}^{\hat\nu}\hat\nu_{0t}+(\theta_t^{\hat\nu})^\top\hat\nu_t=0$$
for all $t$ and all states. Then it follows from (8.14) that the strategy will generate the same
terminal wealth in the true market as in the artificial market $M_{\hat\nu}$. Since the strategy is
admissible in the true market, we have
$$J(w)\ge V^{\theta_0^{\hat\nu},\theta^{\hat\nu},c^{\hat\nu}}(w)=J_{\hat\nu}(w),$$
and again we can combine that with (8.16) and conclude that $(\theta_0^{\hat\nu},\theta^{\hat\nu},c^{\hat\nu})$ is optimal in the
true market.
3. Parsimony of $\hat\nu$: If we can find a $\hat\nu\in\mathcal{N}$ such that $(c^{\hat\nu},W^{\hat\nu})\in\mathcal{C}\times L_+^1$ satisfies
$$E\left[\int_0^T\zeta_t^\nu\left(c_t^{\hat\nu}-y_t-\delta(\nu_t)\right)dt+\zeta_T^\nu W^{\hat\nu}\right]\le w,\quad\forall\nu\in\mathcal{N},$$
then $(c^{\hat\nu},W^{\hat\nu})$ and the corresponding strategy $(\theta_0^{\hat\nu},\theta^{\hat\nu},c^{\hat\nu})$ are optimal in the true market.
The proof is complicated and will be skipped here. Note that the left-hand side of the above
inequality is the cost of implementing $(c^{\hat\nu},W^{\hat\nu})$ in the artificial market $M_\nu$. For $\nu=\hat\nu$,
the above inequality will be satisfied as an equality. The intuition is that if we can find
an artificial market for which the optimal strategy is budget-feasible in all other artificial
markets, then it is the least expensive and hence the least favorable of the solutions to the
artificial market problems.
4. Dual optimality of $\hat\nu$: The unconstrained maximization problem
$$J_\nu(w)=\sup_{(c,W)}E\left[\int_0^T U_1(c_s,s)\,ds+U_2(W,T)\right],$$
$$\text{s.t. } E\left[\int_0^T\zeta_t^\nu\left(c_t-y_t-\delta(\nu_t)\right)dt+\zeta_T^\nu W\right]\le w,$$
can be solved with the Lagrangian technique. If $\psi$ denotes the Lagrange multiplier on the budget
constraint, the solution can be written as $c_t=I_1(\psi\zeta_t^\nu,t)$, $W=I_2(\psi\zeta_T^\nu,T)$, where $I_1(\cdot,t)$
and $I_2(\cdot,T)$ are the inverse functions of $U_1'(\cdot,t)$ and $U_2'(\cdot,T)$, respectively. Substituting the
solution back into the objective function, we obtain $\tilde V_\nu(\psi)+\psi w$, where
$$\tilde V_\nu(\psi)=E\left[\int_0^T\tilde U_1(\psi\zeta_t^\nu,t)\,dt+\tilde U_2(\psi\zeta_T^\nu,T)\right]+\psi E\left[\int_0^T\zeta_t^\nu\left(y_t+\delta(\nu_t)\right)dt\right],$$
and $\tilde U_1$ and $\tilde U_2$ are the convex conjugates of $U_1$ and $U_2$, respectively, i.e.,
$$\tilde U_1(x,t)=\sup_{q>0}\{U_1(q,t)-qx\}=U_1(I_1(x,t),t)-xI_1(x,t),$$
and similarly for $\tilde U_2$. The problem
$$\tilde J(\psi)=\inf_{\nu\in\mathcal{N}}\tilde V_\nu(\psi)$$
is called the dual problem. The Lagrange multiplier in $M_\nu$ is related to initial wealth $w$ via
some function $\psi=Y_\nu(w)$ which ensures that the budget constraint is satisfied as an equality.
It can then be shown that the dual problem is linked to the original problem as follows: if
we can find a $\hat\nu\in\mathcal{N}$ such that
$$\tilde J(\psi)=\tilde V_{\hat\nu}(\psi)\quad\text{for } \psi=Y_{\hat\nu}(w),$$
then $M_{\hat\nu}$ is the least favorable market and $(\theta_0^{\hat\nu},\theta^{\hat\nu},c^{\hat\nu})$ is optimal for the original constrained
problem in the true market.
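To make the convex conjugate in the dual problem concrete, the following minimal sketch evaluates it for a CRRA utility function $U(q)=q^{1-\gamma}/(1-\gamma)$. The CRRA form, the function names, and the grid search are assumptions made here for illustration only; the text above allows general utility functions. For this utility, $I(x)=x^{-1/\gamma}$, and the closed form $U(I(x))-xI(x)$ can be compared with a brute-force evaluation of $\sup_{q>0}\{U(q)-qx\}$.

```python
import math

def crra_utility(q, gamma):
    """CRRA utility U(q) = q^(1-gamma)/(1-gamma), for gamma > 0, gamma != 1."""
    return q ** (1.0 - gamma) / (1.0 - gamma)

def inverse_marginal_utility(x, gamma):
    """I(x) solves U'(I) = x; for CRRA utility U'(q) = q^(-gamma)."""
    return x ** (-1.0 / gamma)

def conjugate(x, gamma):
    """Convex conjugate via the closed form U(I(x)) - x*I(x)."""
    q_star = inverse_marginal_utility(x, gamma)
    return crra_utility(q_star, gamma) - x * q_star

def conjugate_by_grid_search(x, gamma, q_max=20.0, steps=100_000):
    """Brute-force sup over q in (0, q_max] of U(q) - q*x (illustrative check)."""
    best = -math.inf
    for i in range(1, steps + 1):
        q = q_max * i / steps
        best = max(best, crra_utility(q, gamma) - q * x)
    return best
```

For example, `conjugate(0.5, 2.0)` and `conjugate_by_grid_search(0.5, 2.0)` should agree up to the grid resolution, and both should match $\frac{\gamma}{1-\gamma}x^{1-1/\gamma}$, which is the conjugate of CRRA utility.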
The dual problem leads to a way of proving the existence of an optimal consumption and
investment strategy in the constrained true market. If there is a solution to the dual problem, then
there is also a solution to the primal problem, i.e., the utility maximization problem in the true
market. Cvitanic and Karatzas (1992) state sufficient conditions for the existence of an optimal
solution to the dual problem, which are then also sufficient conditions for the existence of an
optimal consumption and investment strategy in the constrained true market. However, one of
the conditions is that the Arrow-Pratt relative risk aversion measures corresponding to the utility
functions $U_1$ and $U_2$ are smaller than or equal to one, whereas individuals are generally believed to
have a relative risk aversion greater than one. Cuoco (1997) attacks the primal problem directly
using alternative methods and is able to establish less restrictive conditions for the existence of an
optimal solution.
The results above provide important intuition for constrained utility maximization problems.
The results have been used to provide explicit solutions to some constrained utility maximization
problems, but only with simple constraints and simple preferences such as logarithmic utility. The
ideas of considering the dual problem and the artificially unconstrained markets have also been
used recently in various numerical solution techniques, cf. Haugh, Kogan, and Wang (2006) and
Bick, Kraft, and Munk (2012).
8.5 Exercises
Exercise 8.1. Show that (8.2) follows from (8.1).
CHAPTER 9
Numerical methods for solving dynamic asset allocation problems
If the problem features CRRA utility, no portfolio constraints, and return dynamics that fit into
neither the affine nor the quadratic class, then, with (i) utility of terminal wealth only or (ii)
complete markets and utility of intermediate consumption and/or terminal wealth, it suffices to
solve a PDE like (7.40) numerically. This can be done (when the dimension of the state variable is
three or lower) using standard methods, like finite difference methods. See, e.g., Wilmott, Dewynne,
and Howison (1993), Thomas (1995), Wilmott (1998), Tavella and Randall (2000), Seydel (2009),
and Munk (2011). With incomplete markets and intermediate consumption, the more complicated
PDE (7.39) has to be solved numerically. For more general preferences, we would normally have
to solve the even more complicated PDE (7.9) numerically.
In many realistic cases, the portfolios are constrained and these constraints have to be taken into
account when solving the HJB equation, and then closed-form solutions are generally impossible
to find. It is still possible (at least, for low-dimensional problems) to implement a finite difference
type recursive solution method to solve the relevant HJB equation (a variant is called the Markov
Chain Approximation Approach). See, e.g., Brennan, Schwartz, and Lagnado (1997), Fitzpatrick
and Fleming (1991), Munk (1997a, 2003), Van Hemert (2010), and Munk and Sørensen (2010). A
more or less equivalent approach used in some papers is to assume a discrete-time setting from
the beginning and then solve the Bellman equation by backwards recursive dynamic programming.
However, some authors use large time steps (e.g., allow only annual consumption and investment
decisions) and assume very simple distributions (binomial, trinomial) of the relevant state variables
over these long time steps. See, e.g., Campbell and Cocco (2003), Cocco (2005), Cocco, Gomes,
and Maenhout (2005), and Yao and Zhang (2005a).
An alternative which, at least potentially, can handle higher-dimensional problems is Monte
Carlo simulation based approaches to the HJB equation. Various versions have been proposed.
See, e.g., Detemple, Garcia, and Rindisbacher (2003, 2005), Cvitanic, Goukasian, and Zapatero
(2003), Brandt, Goyal, Santa-Clara, and Stroud (2005), van Binsbergen and Brandt (2007), and
Koijen, Nijman, and Werker (2007, 2010).
Yet another alternative for the solution of some consumption and portfolio choice problems
involving portfolio constraints and/or incomplete markets is suggested by Bick, Kraft, and Munk
(2012). The method applies to CRRA utility and return dynamics of the affine-quadratic type.
The method combines (i) the idea of artificially unconstrained and complete markets introduced in
connection with the martingale approach in Section 8.4 and (ii) the results on closed-form solutions
for unconstrained affine-quadratic settings and CRRA utility. The method considers a subset of
artificially unconstrained and complete markets for which relatively simple closed-form solutions
exist. Each of these strategies is transformed into a feasible strategy in the true, constrained
market. This gives a set of feasible strategies parameterized by a number of parameters. If this
number is fairly low, one can search for the best of the strategies, where the evaluation of the
strategy is done via Monte Carlo simulation. The method also provides an upper bound for the
true, unknown optimal expected utility (given by the worst of the considered artificial markets)
and thus an upper bound on the wealth-equivalent loss the individual might suffer by following the
best of the feasible strategies considered. Another numerical method building on the martingale
techniques was suggested by Haugh, Kogan, and Wang (2006).
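The step of evaluating a given feasible strategy by Monte Carlo simulation can be illustrated in a deliberately simplified setting. The sketch below is not the method of Bick, Kraft, and Munk (2012); it assumes constant investment opportunities, a single stock following a geometric Brownian motion, a constant portfolio weight, initial wealth of one, and CRRA utility of terminal wealth with $\gamma\neq 1$. All function names and parameter values are hypothetical. In this special case the certainty equivalent is known in closed form, which gives a check on the simulation.

```python
import math
import random

def simulate_certainty_equivalent(weight, gamma, r, mu, sigma, horizon,
                                  n_paths=50_000, seed=1):
    """Monte Carlo estimate of the certainty-equivalent terminal wealth
    of a constant-weight strategy (initial wealth 1, gamma != 1)."""
    rng = random.Random(seed)
    drift = (r + weight * (mu - r) - 0.5 * (weight * sigma) ** 2) * horizon
    vol = weight * sigma * math.sqrt(horizon)
    total = 0.0
    for _ in range(n_paths):
        # Terminal wealth is lognormal under a constant portfolio weight.
        terminal_wealth = math.exp(drift + vol * rng.gauss(0.0, 1.0))
        total += terminal_wealth ** (1.0 - gamma) / (1.0 - gamma)
    expected_utility = total / n_paths
    return ((1.0 - gamma) * expected_utility) ** (1.0 / (1.0 - gamma))

def exact_certainty_equivalent(weight, gamma, r, mu, sigma, horizon):
    """Closed-form certainty equivalent in the same simplified setting."""
    rate = r + weight * (mu - r) - 0.5 * gamma * (weight * sigma) ** 2
    return math.exp(rate * horizon)
```

In a search over a low-dimensional family of strategies, one would call the simulation for each candidate (preferably with the same seed, so that candidates are compared on common random numbers) and keep the candidate with the highest certainty equivalent.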
CHAPTER 10
Asset allocation with stochastic interest rates
10.1 Introduction
It is an empirical fact that both nominal and real interest rates and bond yields vary stochastically
over time. It is therefore natural to include the short-term interest rate $r_t$ as a state variable.
This was first done in a portfolio-choice context by Merton (1973b) who considered a general
one-factor dynamics for $r_t$, but he was not able to go beyond the general characterization of the
investment strategy in (7.7). We will focus on individuals with CRRA utility and on models in
which the interest rate dynamics is of an affine form, since then we can obtain closed-form solutions
for the optimal strategies as explained in Section 7.3.2. The affine class includes the well-known
models of Vasicek (1977) and Cox, Ingersoll, and Ross (1985). See, e.g., Munk (2011) for a comprehensive
analysis of dynamic models of the term structure of interest rates. We can also apply
the general results of Section 7.3 to cases where the dynamics of the term structure of interest rates
is given by a multi-factor affine or quadratic model.
Recall that investors are (or should be) concerned about real interest rates and hence they
would want to invest in real bonds. Indeed, we will assume that investors have access to trade
in a complete market of real bonds. (Exercise 10.1 at the end of the chapter discusses an optimal
investment problem with stochastic interest rates when no bonds are traded.) We will focus on
determining the optimal bond/stock mix so we assume that only a single stock is traded. We
interpret this stock as the entire stock market index. The results can be generalized to the case
with multiple stocks.
Investors with non-log utility will hedge variations in interest rates. Bonds carry a built-in hedge
against interest rate risk since bond prices are inversely related to interest rates. Over a period
where interest rates have fallen, indicating that future investment opportunities are worsened, bond
prices have risen and generated a positive return. The converse is also true. We will therefore
expect that interest rates are hedged by investing in bonds, but precisely how many bonds and
which bonds this hedge should involve has to be computed using concrete models.
In Section 10.2 we study the case where the real short-term interest rate behaves according to
the Vasicek model. Section 10.3 considers the CIR model of interest rates. Section 10.4 gives
a numerical example in which the quantitative effects of interest rate uncertainty on optimal
portfolios can be assessed. Section 10.5 briefly looks at optimal portfolio choice when interest rate
dynamics is given by a two-factor version of the Vasicek model. Other studies with stochastic
interest rates are briefly discussed in Section 10.6. In many countries there are no liquid markets
for real bonds, only for nominal bonds. Then we have to take the dynamics of consumer prices
and inflation into account. We consider those issues in Chapter 12.
10.2 One-factor Vasicek interest rate dynamics
Following Vasicek (1977), assume that $r_t$ follows the Ornstein-Uhlenbeck process
$$dr_t=\kappa[\bar r-r_t]\,dt-\sigma_r\,dz_{1t},$$
with an associated constant market price of risk $\lambda_1$. We assume that $\kappa$, $\bar r$, and $\sigma_r$ are positive
constants. The process exhibits mean reversion in the sense that, if $r_t<\bar r$, the short rate is
expected to increase over the next instant, whereas if $r_t>\bar r$, the short rate is expected to fall.
This is a very realistic feature of the model. Future values of the short rate are normally distributed
so, in particular, the short rate can become arbitrarily negative with positive probability, which is not realistic.
It is a consequence of these assumptions that the price of a zero-coupon bond with maturity
$\bar T$ is given by
$$B_t^{\bar T}=e^{-a(\bar T-t)-b(\bar T-t)r_t},$$
where
$$b(\tau)=\frac{1}{\kappa}\left(1-e^{-\kappa\tau}\right),$$
$$a(\tau)=y_\infty(\tau-b(\tau))+\frac{\sigma_r^2}{4\kappa}b(\tau)^2,$$
where $y_\infty\equiv\bar r+\frac{\lambda_1\sigma_r}{\kappa}-\frac{\sigma_r^2}{2\kappa^2}$ is the asymptotic zero-coupon yield as time-to-maturity goes to
infinity. From Itô's Lemma it follows that the dynamics of the zero-coupon bond price is
$$dB_t^{\bar T}=B_t^{\bar T}\left[\left(r_t+\lambda_1\sigma_r b(\bar T-t)\right)dt+\sigma_r b(\bar T-t)\,dz_{1t}\right],$$
and similarly the dynamics of any bond (or any other fixed-income security) is of the form
$$dB_t=B_t\left[\left(r_t+\lambda_1\sigma_B(r_t,t)\right)dt+\sigma_B(r_t,t)\,dz_{1t}\right]. \tag{10.1}$$
It is well-known that any bond (or other fixed-income security) can be generated from an appropriate
dynamic investment strategy in the bank account and in just one (arbitrary) bond (or other
long-lived term structure derivative). Let us for the present take an arbitrary bond with price $B_t$
and dynamics given by (10.1).
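The Vasicek pricing formulas above translate directly into code. The following is a minimal sketch with hypothetical function names; the parameter values used in the usage note are those of the numerical example in Section 10.4 and are otherwise arbitrary.

```python
import math

def vasicek_b(tau, kappa):
    """b(tau) = (1 - exp(-kappa*tau)) / kappa."""
    return (1.0 - math.exp(-kappa * tau)) / kappa

def asymptotic_yield(r_bar, kappa, sigma_r, lambda_1):
    """y_inf = r_bar + lambda_1*sigma_r/kappa - sigma_r^2/(2*kappa^2)."""
    return r_bar + lambda_1 * sigma_r / kappa - sigma_r ** 2 / (2.0 * kappa ** 2)

def vasicek_a(tau, r_bar, kappa, sigma_r, lambda_1):
    """a(tau) = y_inf*(tau - b(tau)) + sigma_r^2/(4*kappa) * b(tau)^2."""
    b = vasicek_b(tau, kappa)
    y_inf = asymptotic_yield(r_bar, kappa, sigma_r, lambda_1)
    return y_inf * (tau - b) + sigma_r ** 2 / (4.0 * kappa) * b ** 2

def zero_coupon_price(r, tau, r_bar, kappa, sigma_r, lambda_1):
    """Price exp(-a(tau) - b(tau)*r) of a zero-coupon bond maturing in tau years."""
    a = vasicek_a(tau, r_bar, kappa, sigma_r, lambda_1)
    return math.exp(-a - vasicek_b(tau, kappa) * r)

def zero_coupon_yield(r, tau, r_bar, kappa, sigma_r, lambda_1):
    """Continuously compounded yield -ln(price)/tau, for tau > 0."""
    return -math.log(zero_coupon_price(r, tau, r_bar, kappa, sigma_r, lambda_1)) / tau
```

With $\bar r=0.01$, $\kappa=0.4965$, $\sigma_r=0.05$, and $\lambda_1=0.11$, the yield for a very long maturity should be close to `asymptotic_yield(...)`, and the volatility $\sigma_r b(10)$ of the 10-year zero-coupon bond should be close to 10%.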
The price of the single stock (representing the stock market index) is assumed to follow the
process
$$dS_t=S_t\left[(r_t+\psi\sigma_S)\,dt+\rho\sigma_S\,dz_{1t}+\sqrt{1-\rho^2}\,\sigma_S\,dz_{2t}\right].$$
The parameter $\rho$ is the correlation between bond market returns and stock market returns, $\sigma_S$ is
the volatility of the stock, and $\psi$ is the Sharpe ratio of the stock which we assume constant.
The asset allocation problem of a CRRA investor under these assumptions was studied by
Sørensen (1999) and Bajeux-Besnainou, Jordan, and Portait (2001) for utility of terminal wealth
only. Korn and Kraft (2001) discuss some technical issues related to the application of the verification
theorem to this problem.
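To get this into the notation applied so far, we rewrite the price dynamics as
$$\begin{pmatrix}dB_t\\ dS_t\end{pmatrix}=\begin{pmatrix}B_t&0\\ 0&S_t\end{pmatrix}\left[\left(r_t\mathbf{1}+\begin{pmatrix}\sigma_B(r_t,t)&0\\ \rho\sigma_S&\sqrt{1-\rho^2}\,\sigma_S\end{pmatrix}\begin{pmatrix}\lambda_1\\ \lambda_2\end{pmatrix}\right)dt+\begin{pmatrix}\sigma_B(r_t,t)&0\\ \rho\sigma_S&\sqrt{1-\rho^2}\,\sigma_S\end{pmatrix}\begin{pmatrix}dz_{1t}\\ dz_{2t}\end{pmatrix}\right],$$
where
$$\lambda_2=(\psi-\rho\lambda_1)/\sqrt{1-\rho^2}. \tag{10.2}$$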
We are therefore in a complete market model with a single state variable ($x=r$). We can rewrite
the dynamics of $r$ as
$$dr_t=\kappa[\bar r-r_t]\,dt+(-\sigma_r,\ 0)\,dz_t,$$
where $z=(z_1,z_2)^\top$. In this model the state variable has an affine drift and a constant volatility,
and the market price of risk vector $\lambda=(\lambda_1,\lambda_2)^\top$ is also constant. Hence, Theorem 7.7 applies
with CRRA utility from terminal wealth only and Theorem 7.8 applies with CRRA utility from
intermediate consumption and possibly terminal wealth. In the notation used there, we have
r
0
= 0, r
1
= 1, m
0
= r, m
1
= ,

0
=
2
1
+
2
2
,
1
= 0, v
0
= 0, v
1
= 0,
V
0
=
2
r
, V
1
= 0, K
0
=
r

1
, K
1
= 0.
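In this case the ordinary differential equation (7.22) reduces to
$$A_1'(\tau)=1-\kappa A_1(\tau),$$
which with the initial condition $A_1(0)=0$ has the unique solution
$$A_1(\tau)=\frac{1}{\kappa}\left(1-e^{-\kappa\tau}\right)=b(\tau).$$
This result also follows from the discussion below Theorem 7.7. Next, $A_0$ follows from (7.25):
$$\begin{aligned}
A_0(\tau)&=\frac{1}{2\gamma}\left(\lambda_1^2+\lambda_2^2\right)\tau+\left(\kappa\bar r+\frac{\gamma-1}{\gamma}\sigma_r\lambda_1\right)\int_0^\tau b(s)\,ds-\frac{1}{2}\frac{\gamma-1}{\gamma}\sigma_r^2\int_0^\tau b(s)^2\,ds\\
&=\frac{1}{2\gamma}\left(\lambda_1^2+\lambda_2^2\right)\tau+\left(\bar r+\frac{\gamma-1}{\gamma}\frac{\sigma_r\lambda_1}{\kappa}-\frac{\gamma-1}{\gamma}\frac{\sigma_r^2}{2\kappa^2}\right)(\tau-b(\tau))+\frac{\gamma-1}{4\gamma\kappa}\sigma_r^2 b(\tau)^2,
\end{aligned}$$
where we have used that
$$\int_0^\tau b(s)\,ds=\frac{1}{\kappa}(\tau-b(\tau)),\qquad\int_0^\tau b(s)^2\,ds=\frac{1}{\kappa^2}(\tau-b(\tau))-\frac{1}{2\kappa}b(\tau)^2.$$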
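For the case with utility from terminal wealth only we have from Theorem 7.7 that the optimal
investment strategy is
$$\begin{aligned}
\pi(W,r,t)\equiv\begin{pmatrix}\pi_B(W,r,t)\\ \pi_S(W,r,t)\end{pmatrix}&=\frac{1}{\gamma}\left(\sigma(r,t)^\top\right)^{-1}\lambda-\frac{\gamma-1}{\gamma}\left(\sigma(r,t)^\top\right)^{-1}\begin{pmatrix}-\sigma_r\\ 0\end{pmatrix}b(T-t)\\
&=\frac{1}{\gamma}\left(\sigma(r,t)^\top\right)^{-1}\lambda+\frac{\gamma-1}{\gamma}\left(\sigma(r,t)^\top\right)^{-1}\begin{pmatrix}\sigma_r\\ 0\end{pmatrix}b(T-t).
\end{aligned} \tag{10.3}$$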
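We can see that the hedge portfolio only involves the bond, not the stock, which should not come
as a surprise since bonds seem more appropriate for hedging interest rate risk than stocks. The
higher the risk aversion $\gamma$, the lower the investment in the tangency portfolio and the higher the
investment in the hedge bond. The inverse of the transposed volatility matrix is
$$\begin{pmatrix}\sigma_B(r,t)&\rho\sigma_S\\ 0&\sqrt{1-\rho^2}\,\sigma_S\end{pmatrix}^{-1}=\frac{1}{\sqrt{1-\rho^2}\,\sigma_B(r,t)\sigma_S}\begin{pmatrix}\sqrt{1-\rho^2}\,\sigma_S&-\rho\sigma_S\\ 0&\sigma_B(r,t)\end{pmatrix}$$
so that we can write out the fraction of wealth invested in the stock and the bond as
$$\pi_S(W,r,t)=\frac{\lambda_2}{\gamma\sigma_S\sqrt{1-\rho^2}}, \tag{10.4}$$
$$\pi_B(W,r,t)=\frac{1}{\gamma\sigma_B(r,t)}\left(\lambda_1-\frac{\rho}{\sqrt{1-\rho^2}}\lambda_2\right)+\frac{\gamma-1}{\gamma}\frac{\sigma_r b(T-t)}{\sigma_B(r,t)}. \tag{10.5}$$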
If the bond in the portfolio is the zero-coupon bond maturing at the end of the investment
horizon of the investor, i.e., at time $T$, then $\sigma_B(r,t)=\sigma_r b(T-t)$, and we see that the hedge term
simply consists of a fraction $(\gamma-1)/\gamma$ in the zero-coupon bond. This is a natural choice of hedge
instrument since it is exactly the truly risk-free asset for an investor only interested in time $T$
wealth. The log utility investor ($\gamma=1$) does not hedge. The hedge position of a less risk averse
investor ($\gamma<1$) is negative, while a more risk averse investor ($\gamma>1$) takes a long position in the
bond in order to hedge interest rate risk. An infinitely risk averse investor ($\gamma\to\infty$) will invest her
entire wealth in the zero-coupon bond maturing at $T$.
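The allocation rules (10.4) and (10.5) are simple enough to evaluate directly. The sketch below is an illustration under the assumptions of this section (Vasicek short rate, CRRA utility of terminal wealth only); the function and argument names are hypothetical, and the bond volatility `sigma_B` is supplied by the caller for whichever bond is traded.

```python
import math

def vasicek_b(tau, kappa):
    """b(tau) = (1 - exp(-kappa*tau)) / kappa."""
    return (1.0 - math.exp(-kappa * tau)) / kappa

def optimal_weights(gamma, lambda_1, lambda_2, rho, sigma_S, sigma_B,
                    sigma_r, kappa, time_to_horizon):
    """Return (bond, stock, cash, bond_hedge) fractions of wealth.

    stock follows (10.4); bond is the sum of the speculative term and
    the hedge term in (10.5); cash is the residual."""
    root = math.sqrt(1.0 - rho ** 2)
    stock = lambda_2 / (gamma * sigma_S * root)
    bond_speculative = (lambda_1 - rho * lambda_2 / root) / (gamma * sigma_B)
    bond_hedge = ((gamma - 1.0) / gamma
                  * sigma_r * vasicek_b(time_to_horizon, kappa) / sigma_B)
    bond = bond_speculative + bond_hedge
    return bond, stock, 1.0 - bond - stock, bond_hedge
```

If the traded bond is the zero-coupon bond maturing at the horizon, i.e. `sigma_B = sigma_r * vasicek_b(time_to_horizon, kappa)`, the returned hedge term equals $(\gamma-1)/\gamma$ regardless of the other inputs, in line with the discussion above.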
If we continue to use the zero-coupon bond maturing at $T$ as the bond instrument, we see
from (10.3) that we can write the risky part of the optimal investment strategy as
$$\pi(W,r,t)\equiv\begin{pmatrix}\pi_B(W,r,t)\\ \pi_S(W,r,t)\end{pmatrix}=\frac{1}{\gamma}\left(\sigma(t)^\top\right)^{-1}\lambda+\frac{\gamma-1}{\gamma}\begin{pmatrix}1\\ 0\end{pmatrix}.$$
Consequently, the fraction of wealth invested in the bank account (i.e., the locally risk-free asset)
is
$$\begin{aligned}
\pi_0(W,r,t)&=1-\pi_B(W,r,t)-\pi_S(W,r,t)\\
&=1-\frac{1}{\gamma}\mathbf{1}^\top\left(\sigma(t)^\top\right)^{-1}\lambda-\frac{\gamma-1}{\gamma}\\
&=\frac{1}{\gamma}\left(1-\mathbf{1}^\top\left(\sigma(t)^\top\right)^{-1}\lambda\right).
\end{aligned}$$
Note that the term in the parenthesis is exactly what a log investor would hold in the bank account.
The entire investment strategy can be written as
$$\begin{pmatrix}\pi_0\\ \pi_B\\ \pi_S\end{pmatrix}=\frac{1}{\gamma}\begin{pmatrix}\pi_0^{\log}\\ \pi_B^{\log}\\ \pi_S^{\log}\end{pmatrix}+\frac{\gamma-1}{\gamma}\begin{pmatrix}0\\ 1\\ 0\end{pmatrix}.$$
The strategy is hence a simple combination of the log investor's portfolio and the zero-coupon bond
maturing at the investment horizon of the investor. Note that as the risk aversion increases, the
position in stocks will decrease, while the position in bonds will increase. Hence, the bond/stock
ratio increases with risk aversion, which is consistent with popular advice. However, the allocation
to stock is still independent of the investment horizon, which conflicts with the traditional advice that
the stock weight should increase with the investment horizon.
With utility from intermediate consumption only, it follows from Theorem 7.8 that the hedge
term of the optimal bond investment strategy is
$$\frac{\gamma-1}{\gamma}\,\frac{\sigma_r}{\sigma_B(r_t,t)}\,\frac{\int_t^T b(s-t)\,e^{-\frac{\delta}{\gamma}(s-t)-\frac{\gamma-1}{\gamma}A_0(s-t)-\frac{\gamma-1}{\gamma}b(s-t)r}\,ds}{\int_t^T e^{-\frac{\delta}{\gamma}(s-t)-\frac{\gamma-1}{\gamma}A_0(s-t)-\frac{\gamma-1}{\gamma}b(s-t)r}\,ds}, \tag{10.6}$$
where $\delta$ is the time preference rate and $\sigma_B(r_t,t)$ again represents the volatility of the bond chosen for implementing the strategy.
It can be shown that the time $t$ volatility of a coupon bond paying a continuous coupon at a
deterministic rate $K(s)$ up to time $T$ is given by
$$\sigma_B(r,t)=\frac{\int_t^T K(s)B_t^s\,\sigma_r b(s-t)\,ds}{\int_t^T K(s)B_t^s\,ds}.$$
Hence, we can interpret the time $t$ interest rate hedge as the fraction $(\gamma-1)/\gamma$ of wealth invested
in a bond with continuous coupon
$$K(s)=e^{a(s-t)-\frac{\gamma-1}{\gamma}A_0(s-t)-\frac{\delta}{\gamma}(s-t)+\frac{1}{\gamma}b(s-t)r}.$$
Munk and Sørensen (2004) show that this coupon is closely related to the expected consumption
rate at time $s$. For an investor with utility from consumption over the entire period $[t,T]$, the
zero-coupon bond maturing at $T$ is no longer the truly risk-free asset. Since the investor is interested
in payments at all dates in $[t,T]$, he hedges interest rate risk by investing in a combination of all
zero-coupon bonds maturing in this interval, i.e., in some sort of coupon bond.
10.3 One-factor CIR dynamics
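Consider the same set-up as above except that the short-term interest rate now is assumed to
follow the square-root process
$$dr_t=\kappa[\bar r-r_t]\,dt-\sigma_r\sqrt{r_t}\,dz_{1t}, \tag{10.7}$$
where $\kappa$, $\bar r$, and $\sigma_r$ are positive constants. The market price of the risk represented by $z_1$ is assumed
to be given by $\lambda_1(r,t)=\lambda_1\sqrt{r_t}/\sigma_r$. As shown by Cox, Ingersoll, and Ross (1985), zero-coupon
bond prices are of the form
$$B_t^{\bar T}=e^{-a(\bar T-t)-b(\bar T-t)r_t}$$
as in the Vasicek model, but $a$ and $b$ are now given by
$$b(\tau)=\frac{2\left(e^{\nu\tau}-1\right)}{(\nu+\hat\kappa)\left(e^{\nu\tau}-1\right)+2\nu},$$
$$a(\tau)=-\frac{2\kappa\bar r}{\sigma_r^2}\left(\frac{1}{2}(\hat\kappa+\nu)\tau+\ln\frac{2\nu}{(\nu+\hat\kappa)\left(e^{\nu\tau}-1\right)+2\nu}\right),$$
where $\hat\kappa=\kappa-\lambda_1$ and $\nu=\sqrt{\hat\kappa^2+2\sigma_r^2}$. The price evolves as
$$dB_t^{\bar T}=B_t^{\bar T}\left[\left(r_t+b(\bar T-t)\lambda_1 r_t\right)dt+b(\bar T-t)\sigma_r\sqrt{r_t}\,dz_{1t}\right].$$
We let $\sigma_B(r,t)$ denote the volatility of the bond which the investor trades in, and we see that for
a zero-coupon bond the volatility is $\sigma_B(r,t)=b(\bar T-t)\sigma_r\sqrt{r}$.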
We assume that the investor can also trade in a single stock with price $S_t$ evolving as
$$dS_t=S_t\left[\left(r_t+\psi(r_t)\sigma_S\right)dt+\rho\sigma_S\,dz_{1t}+\sqrt{1-\rho^2}\,\sigma_S\,dz_{2t}\right].$$
Here $\sigma_S$ is a positive constant, and $z_2$ is a one-dimensional standard Brownian motion independent
of $z_1$ so that the constant $\rho$ is the instantaneous correlation between stock returns and bond returns.
We assume that the market price of risk associated with $z_2$ is a constant $\lambda_2$ so that
$$\psi(r)=\rho\lambda_1\frac{\sqrt{r}}{\sigma_r}+\sqrt{1-\rho^2}\,\lambda_2. \tag{10.8}$$
Again we have an affine, complete market model of the type studied in Section 7.3.2. In this case
we have
r
0
= 0, r
1
= 1, m
0
= r, m
1
= ,

0
=
2
2
,
1
=

2
1

2
r
, v
0
= 0, v
1
= 0,
V
0
= 0, V
1
=
2
r
, K
0
= 0, K
1
=
1
.
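The solution is stated in terms of two deterministic functions $A_0$ and $A_1$. Let
$$\tilde\kappa=\kappa-\frac{\gamma-1}{\gamma}\lambda_1.$$
The ordinary differential equation (7.22) for $A_1$ becomes
$$A_1'(\tau)=\left(1+\frac{\lambda_1^2}{2\gamma\sigma_r^2}\right)-\tilde\kappa A_1(\tau)-\frac{\gamma-1}{2\gamma}\sigma_r^2A_1(\tau)^2$$
with the initial condition $A_1(0)=0$. Assuming
$$\tilde\kappa^2+2\sigma_r^2\frac{\gamma-1}{\gamma}\left(1+\frac{\lambda_1^2}{2\gamma\sigma_r^2}\right)>0,$$
which is certainly satisfied for $\gamma\ge 1$, the unique solution follows immediately from (7.24):
$$A_1(\tau)=\frac{2\left(1+\frac{\lambda_1^2}{2\gamma\sigma_r^2}\right)\left(e^{\tilde\nu\tau}-1\right)}{(\tilde\nu+\tilde\kappa)\left(e^{\tilde\nu\tau}-1\right)+2\tilde\nu},$$
where we have introduced the additional auxiliary parameter
$$\tilde\nu=\sqrt{\tilde\kappa^2+2\sigma_r^2\frac{\gamma-1}{\gamma}\left(1+\frac{\lambda_1^2}{2\gamma\sigma_r^2}\right)}.$$
$A_0$ can then be computed from (7.25):
$$A_0(\tau)=\frac{\lambda_2^2}{2\gamma}\tau+\kappa\bar r\int_0^\tau A_1(s)\,ds=\frac{\lambda_2^2}{2\gamma}\tau-\frac{\gamma}{\gamma-1}\frac{2\kappa\bar r}{\sigma_r^2}\left(\frac{1}{2}(\tilde\kappa+\tilde\nu)\tau+\ln\frac{2\tilde\nu}{(\tilde\nu+\tilde\kappa)\left(e^{\tilde\nu\tau}-1\right)+2\tilde\nu}\right).$$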
It follows from Theorem 7.7 that the optimal investment strategy for an investor with CRRA
utility from terminal wealth only is
$$\pi_B(W,r,t)=\frac{1}{\gamma\sigma_B(r,t)}\left(\frac{\lambda_1\sqrt{r}}{\sigma_r}-\frac{\rho\lambda_2}{\sqrt{1-\rho^2}}\right)+\frac{\gamma-1}{\gamma}\frac{\sigma_r\sqrt{r}}{\sigma_B(r,t)}A_1(T-t),$$
$$\pi_S(W,r,t)=\frac{\lambda_2}{\gamma\sigma_S\sqrt{1-\rho^2}}.$$
If the bond instrument used is the zero-coupon bond maturing at the end of the investor's horizon,
we have $\sigma_B(r,t)=\sigma_r\sqrt{r}\,b(T-t)$, and the hedge component will simplify to
$\frac{\gamma-1}{\gamma}A_1(T-t)/b(T-t)$.
As opposed to the Vasicek case we do not have $A_1(T-t)=b(T-t)$. This implies that the optimal
hedge consists of investing the time-varying fraction $\frac{\gamma-1}{\gamma}A_1(T-t)/b(T-t)$ in the zero-coupon
bond maturing at the end of the investor's horizon. A similar result was obtained by Deelstra,
Grasselli, and Koehl (2000) and Grasselli (2000) using the martingale approach for the case of
utility from terminal wealth only.
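The time-varying hedge fraction in the CIR case can be computed from the closed-form solutions of the two Riccati equations. The sketch below assumes the forms $b'=1-\hat\kappa b-\frac{1}{2}\sigma_r^2b^2$ and $A_1'=c-\tilde\kappa A_1-\frac{\gamma-1}{2\gamma}\sigma_r^2A_1^2$ with $c=1+\lambda_1^2/(2\gamma\sigma_r^2)$, as stated in this section; the function names and any parameter values are hypothetical.

```python
import math

def riccati_solution(tau, c, k, s):
    """Solution of A' = c - k*A - 0.5*s*A^2 with A(0) = 0,
    valid when k^2 + 2*s*c > 0."""
    nu = math.sqrt(k * k + 2.0 * s * c)
    growth = math.exp(nu * tau) - 1.0
    return 2.0 * c * growth / ((nu + k) * growth + 2.0 * nu)

def cir_b(tau, kappa, lambda_1, sigma_r):
    """Zero-coupon bond loading b(tau), with kappa_hat = kappa - lambda_1."""
    return riccati_solution(tau, 1.0, kappa - lambda_1, sigma_r ** 2)

def cir_A1(tau, gamma, kappa, lambda_1, sigma_r):
    """A_1(tau) for a CRRA investor with relative risk aversion gamma."""
    g = (gamma - 1.0) / gamma
    c = 1.0 + lambda_1 ** 2 / (2.0 * gamma * sigma_r ** 2)
    return riccati_solution(tau, c, kappa - g * lambda_1, g * sigma_r ** 2)

def hedge_fraction(tau, gamma, kappa, lambda_1, sigma_r):
    """Fraction of wealth held in the horizon-matched zero-coupon bond
    for hedging: (gamma-1)/gamma * A_1(tau)/b(tau)."""
    g = (gamma - 1.0) / gamma
    return g * cir_A1(tau, gamma, kappa, lambda_1, sigma_r) / cir_b(tau, kappa, lambda_1, sigma_r)
```

As the risk aversion grows without bound, `cir_A1` approaches `cir_b`, so the hedge fraction approaches one and the investor ends up fully invested in the zero-coupon bond maturing at the horizon, as in the Vasicek case.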
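For an investor with CRRA utility of intermediate consumption only, Theorem 7.8 applies. The
fraction of wealth optimally invested in the stock is the same as above, while the fraction of wealth
optimally invested in the bond instrument changes to
$$\pi_B(W,r,t)=\frac{1}{\gamma\sigma_B(r,t)}\left(\frac{\lambda_1\sqrt{r}}{\sigma_r}-\frac{\rho\lambda_2}{\sqrt{1-\rho^2}}\right)+\frac{\gamma-1}{\gamma}\frac{\sigma_r\sqrt{r}}{\sigma_B(r,t)}\,\frac{\int_t^T A_1(s-t)\,e^{-\frac{\delta}{\gamma}(s-t)-\frac{\gamma-1}{\gamma}A_0(s-t)-\frac{\gamma-1}{\gamma}A_1(s-t)r}\,ds}{\int_t^T e^{-\frac{\delta}{\gamma}(s-t)-\frac{\gamma-1}{\gamma}A_0(s-t)-\frac{\gamma-1}{\gamma}A_1(s-t)r}\,ds}.$$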
10.4 A numerical example
We will take historical estimates of mean returns, standard deviations, and correlations as
representative of future investment opportunities. These estimates are taken from Dimson, Marsh,
and Staunton (2002). All returns are measured per year. The historical average real return on
the U.S. stock market is $\mu_S=8.7\%$ with a standard deviation of $\sigma_S=20.2\%$, while the average
real return on bonds is $\mu_B=2.1\%$ with a standard deviation of $\sigma_B=10.0\%$. The average real
U.S. short-term interest rate is $\bar r=1.0\%$. The correlation between stock returns and bond returns
is $\rho=0.2$. Different bonds will have different average returns and different standard deviations of
the return. Similarly, the correlation between the return on a bond and the return on the stock
market index may not be identical for all bonds. It is not clear exactly what bond or bond index
the above estimates are based on, but we will assume that the estimates for $\mu_B$ and $\sigma_B$ apply to
a 10-year zero-coupon bond.
The volatility matrix of the bond and the stock is
$$\sigma=\begin{pmatrix}0.1&0\\ 0.0404&0.1979\end{pmatrix}.$$
The (average) Sharpe ratio of the bond is $\lambda_1=(2.1-1.0)/10.0=0.11$ and the (average) Sharpe
ratio of the stock market is $\psi=(8.7-1.0)/20.2\approx 0.3812$. Using (10.2) this corresponds to a
market price of risk of $\lambda_2\approx 0.3666$ on the exogenous shock that only affects the stock market. The
variance-covariance matrix of returns is $\Sigma=\sigma\sigma^\top$. From (6.8), the tangency portfolio of the bond
and the stock is given by
$$\pi^{\tan}=\begin{pmatrix}\pi_B^{\tan}\\ \pi_S^{\tan}\end{pmatrix}=\begin{pmatrix}0.1596\\ 0.8404\end{pmatrix},$$
so that the bond/stock ratio is approximately 0.19. Recall that this will be true for all agents who
have time-additive utility and who believe that investment opportunities are constant over time.
The tangency portfolio has a mean return of 7.65% and a standard deviation of 17.37%.
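The numbers in this paragraph can be reproduced with a few lines of code. The sketch below uses the estimates quoted above as inputs; the function name and return layout are hypothetical. Because the volatility matrix is lower triangular, the system $\sigma^\top x=\lambda$ is solved by back-substitution rather than with a general matrix inverse.

```python
import math

def tangency_portfolio(mu_B, mu_S, sigma_B, sigma_S, rho, r):
    """Return (weights, scale, mean, volatility) of the tangency portfolio.

    weights = (bond, stock) fractions summing to one; scale is
    1'(sigma')^{-1} lambda, i.e. the log investor's total risky position."""
    root = math.sqrt(1.0 - rho ** 2)
    lambda_1 = (mu_B - r) / sigma_B                 # Sharpe ratio of the bond
    psi = (mu_S - r) / sigma_S                      # Sharpe ratio of the stock
    lambda_2 = (psi - rho * lambda_1) / root        # equation (10.2)
    # Solve sigma' x = lambda, with sigma' = [[sigma_B, rho*sigma_S],
    #                                        [0, root*sigma_S]].
    x_S = lambda_2 / (root * sigma_S)
    x_B = (lambda_1 - rho * sigma_S * x_S) / sigma_B
    scale = x_B + x_S
    weights = (x_B / scale, x_S / scale)
    mean = weights[0] * mu_B + weights[1] * mu_S
    volatility = math.hypot(lambda_1, lambda_2) / scale
    return weights, scale, mean, volatility
```

Calling `tangency_portfolio(0.021, 0.087, 0.10, 0.202, 0.2, 0.01)` should give weights close to (0.1596, 0.8404), a mean return close to 7.65%, and a volatility close to 17.37%; dividing the returned scale by $\gamma$ gives the position in the tangency portfolio for an investor who ignores interest rate risk.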
CRRA investors ignoring the fluctuations of interest rates will choose a portfolio of risky assets
given by
$$\pi = \frac{1}{\gamma}\left[\mathbf{1}^\top (\sigma^\top)^{-1}\lambda\right]\pi^{\tan},$$
where $\gamma$ is the relative risk aversion of the agent. The portfolio is independent of the
investment horizon. In Table 10.1 we show the portfolio allocation for various $\gamma$-values. The
numbers in the column tangency denote the fraction of wealth invested in the tangency portfolio.
This investment is divided into the bond and the stock in the following two columns. The cash
position is determined residually so that weights sum to one. The last two columns show the
instantaneous expected rate of return and volatility of the portfolio. In Figure 10.1 the curved
line shows the mean-variance efficient portfolios of risky assets, i.e., the combinations of
expected returns and volatility that can be obtained by combining the bond and the stock. The
straight line corresponds to the optimal portfolios for investors assuming constant investment
opportunities with an interest rate equal to the long-term average.

$\gamma$   tangency  bond    stock   cash     exp. return  volatility
0.5        4.4079    0.7034  3.7045  -3.4079  0.3030       0.7655
1          2.2039    0.3517  1.8522  -1.2039  0.1565       0.3827
2          1.1020    0.1758  0.9261  -0.1020  0.0832       0.1914
2.2039     1.0000    0.1596  0.8404   0.0000  0.0765       0.1737
3          0.7346    0.1172  0.6174   0.2654  0.0588       0.1276
4          0.5510    0.0879  0.4631   0.4490  0.0466       0.0957
5          0.4408    0.0703  0.3704   0.5592  0.0393       0.0765
6          0.3673    0.0586  0.3087   0.6327  0.0344       0.0638
8          0.2755    0.0440  0.2315   0.7245  0.0283       0.0478
10         0.2204    0.0352  0.1852   0.7796  0.0246       0.0383
20         0.1102    0.0176  0.0926   0.8898  0.0173       0.0191
50         0.0441    0.0070  0.0370   0.9559  0.0129       0.0077
200        0.0110    0.0018  0.0093   0.9890  0.0107       0.0019
Table 10.1: Portfolio weights for CRRA investors ignoring interest rate fluctuations.
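The rows of Table 10.1 follow mechanically from the composition of the tangency portfolio. As a
minimal sketch (Python is used for illustration only; the function name `myopic_portfolio` and the
constant names are assumptions, and the inputs are read off the table's tangency row and the 1%
interest rate), the allocation for any risk aversion can be reproduced up to rounding:

```python
# Inputs taken from Table 10.1 (the row with tangency weight 1.0000) and the text.
R_BAR = 0.01         # interest rate, assumed constant at its long-term average
TAN_SCALE = 2.2039   # risk aversion at which all wealth is in the tangency portfolio
TAN_BOND = 0.1596    # bond share of the tangency portfolio
TAN_STOCK = 0.8404   # stock share of the tangency portfolio
TAN_MEAN = 0.0765    # expected return of the tangency portfolio
TAN_VOL = 0.1737     # volatility of the tangency portfolio


def myopic_portfolio(gamma):
    """Allocation of a CRRA investor who ignores interest rate risk."""
    tangency = TAN_SCALE / gamma
    return {
        "tangency": tangency,
        "bond": tangency * TAN_BOND,
        "stock": tangency * TAN_STOCK,
        "cash": 1.0 - tangency,  # residual position
        "exp_return": R_BAR + tangency * (TAN_MEAN - R_BAR),
        "volatility": abs(tangency) * TAN_VOL,
    }


for gamma in (0.5, 2, 10):
    row = myopic_portfolio(gamma)
    print(gamma, {name: round(value, 4) for name, value in row.items()})
```

Because every position scales with 1/$\gamma$, the bond/stock ratio is the same in every row, which
is the point made at the start of this example.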
Figure 10.1: The mean-variance frontiers. The figure shows the mean-variance frontier without the
risk-free asset (blue curve) and with the risk-free asset (straight black line). The tangency
portfolio of the risky assets is indicated with a red x. Axes: volatility (horizontal, 0% to 40%)
and expected rate of return (vertical, 0% to 16%).

Now let us look at investors who realize that interest rates vary over time and consequently
alter their investment strategy (except for log-utility investors). First, we assume that the real
short-term interest rate $r_t$ follows the one-factor Vasicek model so that the analysis and results
of Section 10.2 apply. The long-term average interest rate is $\bar{r} = 1.0\%$ and we take a
short-rate volatility of $\sigma_r = 5\%$, which is also consistent with the U.S. historical
estimate. We use the same values of the market prices of risk as above. We set the value of the mean
reversion rate $\kappa = 0.4965$ so that the volatility of a 10-year zero-coupon bond according to
the model is equal to the historical estimate of 10.0%. The current short rate is assumed to equal
the long-term level, $r_t = \bar{r}$.

Let us first consider investors with utility of terminal wealth only. Their optimal portfolios
are given by (10.4) and (10.5). Table 10.2 shows the optimal portfolios for CRRA investors with
different combinations of risk aversion and investment horizon. The numbers under the column
heading hedge are $\frac{\gamma-1}{\gamma}\, b(T)/b(10)$, which is the hedge demand for the 10-year
zero-coupon bond that the investors are allowed to trade in. While the weight on the tangency
portfolio and thus the stock is independent of the investment horizon, this is not true for the
weight on the hedge portfolio and hence not true for the total weight on the bond and on cash. The
ratio of the bond weight to the stock weight is shown in the column bond/stock. The bond-stock ratio
increases considerably with the risk aversion and, for investors with $\gamma > 1$, with the
investment horizon. The investor with a horizon of T will want to hedge interest rate risk by
investing in the T-period zero-coupon bond. That bond is replicated by a portfolio of b(T)/b(10)
units of the 10-year zero-coupon bond and a cash position. Since b is increasing in T, the hedge
demand for the 10-year bond increases with the horizon T. It is important to emphasize that the
portfolio weights on the bond and thus the bond/stock ratio will depend on the maturity (and payment
schedule) of the bond the investor is trading in. In particular, a recommendation of a particular
bond weight or bond/stock ratio should always be accompanied by a specification of what bond the
recommendation applies to.
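The hedge column of Table 10.2 can be checked with a few lines of code. The sketch below assumes
the standard one-factor Vasicek sensitivity function $b(\tau) = (1 - e^{-\kappa\tau})/\kappa$ (the
same form as $b_1$ in Section 10.5) and a hedge weight of $\frac{\gamma-1}{\gamma}\, b(T)/b(10)$ in
the traded 10-year bond; the function and constant names are illustrative.

```python
import math

KAPPA = 0.4965          # mean reversion rate used in the numerical example
SIGMA_R = 0.05          # short-rate volatility
TRADED_MATURITY = 10.0  # maturity of the bond the investors can trade


def b(tau, kappa=KAPPA):
    """Vasicek sensitivity of a tau-year zero-coupon bond to the short rate."""
    return (1.0 - math.exp(-kappa * tau)) / kappa


def hedge_weight(gamma, horizon):
    """Hedge demand for the traded bond with utility of terminal wealth only."""
    return (1.0 - 1.0 / gamma) * b(horizon) / b(TRADED_MATURITY)


# Calibration target: the 10-year bond volatility sigma_r * b(10).
print(round(SIGMA_R * b(TRADED_MATURITY), 4))

for horizon in (1, 5, 10, 30):
    weights = [round(hedge_weight(g, horizon), 4) for g in (0.5, 1, 2, 5, 10, 20)]
    print(horizon, weights)
```

Since b(T) levels off quickly at this mean reversion rate, the hedge weights for a 10-year and a
30-year horizon are nearly identical, as the table shows.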
Next, we consider investors with utility from intermediate consumption and no utility from
terminal wealth. In this case the hedge term in the bond weight (10.5) is replaced by (10.6). Now
the hedge demand depends on the current interest rate level, which we assume is equal to the
long-term average of 1%. Table 10.3 shows the optimal portfolios for investors with a 1-year and a
30-year horizon. We see the same overall picture as for investors with utility from terminal wealth
only, but for a given investment horizon the hedge demand for the bond and hence the bond/stock
ratio are smaller with utility of consumption since the optimal bond for hedging has a smaller
duration than the investment horizon.
horizon  $\gamma$  tangency  hedge    bond     stock   bond/stock  cash     exp. return  volatility
T = 1    0.5       4.4079    -0.3941   0.3093  3.7045   0.08       -3.0138  0.2986       0.7551
         1         2.2039     0.0000   0.3517  1.8522   0.19       -1.2039  0.1565       0.3827
         2         1.1020     0.1970   0.3729  0.9261   0.40       -0.2990  0.0854       0.1979
         5         0.4408     0.3153   0.3856  0.3704   1.04        0.2439  0.0428       0.0908
         10        0.2204     0.3547   0.3899  0.1852   2.11        0.4249  0.0286       0.0592
         20        0.1102     0.3744   0.3920  0.0926   4.23        0.5154  0.0214       0.0467
T = 5    0.5       4.4079    -0.9229  -0.2195  3.7045  -0.06       -2.4850  0.2928       0.7442
         1         2.2039     0.0000   0.3517  1.8522   0.19       -1.2039  0.1565       0.3827
         2         1.1020     0.4615   0.6373  0.9261   0.69       -0.5634  0.0883       0.2094
         5         0.4408     0.7383   0.8087  0.3704   2.18       -0.1791  0.0474       0.1207
         10        0.2204     0.8306   0.8658  0.1852   4.67       -0.0510  0.0338       0.1010
         20        0.1102     0.8768   0.8943  0.0926   9.66        0.0130  0.0270       0.0950
T = 10   0.5       4.4079    -1.0000  -0.2966  3.7045  -0.08       -2.4079  0.2920       0.7429
         1         2.2039     0.0000   0.3517  1.8522   0.19       -1.2039  0.1565       0.3827
         2         1.1020     0.5000   0.6758  0.9261   0.73       -0.6020  0.0887       0.2112
         5         0.4408     0.8000   0.8703  0.3704   2.35       -0.2408  0.0481       0.1256
         10        0.2204     0.9000   0.9352  0.1852   5.05       -0.1204  0.0345       0.1074
         20        0.1102     0.9500   0.9676  0.0926  10.45       -0.0602  0.0278       0.1022
T = 30   0.5       4.4079    -1.0070  -0.3036  3.7045  -0.08       -2.4009  0.2919       0.7428
         1         2.2039     0.0000   0.3517  1.8522   0.19       -1.2039  0.1565       0.3827
         2         1.1020     0.5035   0.6794  0.9261   0.73       -0.6055  0.0888       0.2114
         5         0.4408     0.8056   0.8760  0.3704   2.37       -0.2464  0.0482       0.1261
         10        0.2204     0.9063   0.9415  0.1852   5.08       -0.1267  0.0346       0.1080
         20        0.1102     0.9567   0.9743  0.0926  10.52       -0.0669  0.0278       0.1028
Table 10.2: Portfolio weights for CRRA investors who assume Vasicek interest rate
dynamics and have utility from terminal wealth only.

horizon  $\gamma$  tangency  hedge    bond     stock   bond/stock  cash     exp. return  volatility
T = 1    0.5       4.4079    -0.2253   0.4780  3.7045   0.1290     -3.1825  0.3005       0.7593
         1         2.2039     0.0000   0.3517  1.8522   0.1899     -1.2039  0.1565       0.3827
         2         1.1020     0.1114   0.2872  0.9261   0.3101     -0.2134  0.0845       0.1949
         5         0.4408     0.1787   0.2490  0.3704   0.6722      0.3805  0.0413       0.0835
         10        0.2204     0.2013   0.2365  0.1852   1.2766      0.5783  0.0269       0.0481
         20        0.1102     0.2126   0.2302  0.0926   2.4856      0.6772  0.0197       0.0324
T = 30   0.5       4.4079    -0.9639  -0.2605  3.7045  -0.0699     -2.4440  0.2924       0.7435
         1         2.2039     0.0000   0.3517  1.8522   0.1899     -1.2039  0.1565       0.3827
         2         1.1020     0.4428   0.6187  0.9261   0.6677     -0.5448  0.0881       0.2085
         5         0.4408     0.7254   0.7957  0.3704   2.1445     -0.1662  0.0473       0.1196
         10        0.2204     0.8245   0.8597  0.1852   4.6319     -0.0449  0.0337       0.1004
         20        0.1102     0.8751   0.8927  0.0926   9.6176      0.0147  0.0270       0.0948
Table 10.3: Portfolio weights for CRRA investors who assume Vasicek interest rate
dynamics and have utility from intermediate consumption only.

Let us now compare the current mean/variance trade-off chosen by different investors. As
discussed above, CRRA investors that either have a zero (or very, very short) investment horizon or
do not take interest rate risk into account will pick a portfolio that corresponds to a point on the
straight line in Figure 10.2. This is the instantaneous mean-variance efficient frontier. Similarly,
each of the other curves corresponds to the combinations chosen by CRRA investors with a given
non-zero horizon who take interest rate risk into account. Since these curves lie to the right of
the instantaneous mean-variance frontier, all these investors could obtain a higher instantaneous
expected rate of return for the same volatility by choosing a different portfolio. But the long-term
investors are willing to sacrifice some expected return in the short term in order to hedge changes
in interest rates and place themselves in a better position if interest rates should decline.

Table 10.4 shows the optimal portfolios for investors with a constant relative risk aversion equal
to 2, but with different investment horizons. Here we can clearly see the effect of the investment
horizon on the optimal bond holdings and the bond/stock ratio. Relative to the extreme short-term
investor, long-term investors have the same stock weight but shift wealth from cash to bonds. If
we look at the instantaneous risk/return trade-off, the longer-term investors choose more risky
portfolios, i.e., they take on more short-term risk. But the main point is that long-term investors
do not choose their portfolio according to the short-term risk/return trade-off.

horizon         tangency  hedge   bond    stock   bond/stock  cash     exp. return  volatility
T = 0           1.1020    0       0.1758  0.9261  0.19        -0.1020  0.0832       0.1914
T = 1, wealth   1.1020    0.1970  0.3729  0.9261  0.40        -0.2990  0.0854       0.1979
T = 5, wealth   1.1020    0.4615  0.6373  0.9261  0.69        -0.5634  0.0883       0.2094
T = 10, wealth  1.1020    0.5000  0.6758  0.9261  0.73        -0.6020  0.0887       0.2112
T = 30, wealth  1.1020    0.5035  0.6794  0.9261  0.73        -0.6055  0.0888       0.2114
T = 1, cons.    1.1020    0.1114  0.2872  0.9261  0.3101      -0.2134  0.0845       0.1949
T = 30, cons.   1.1020    0.4428  0.6187  0.9261  0.6677      -0.5448  0.0881       0.2085
Table 10.4: Portfolio weights for investors with a constant relative risk aversion of
$\gamma = 2$.
Figure 10.2: Optimal frontiers with Vasicek interest rates. Each curve contains the
combinations of current expected rate of return and volatility for CRRA investors with a given
investment horizon T. From the left, the curves represent (a) T = 0 (black straight line; identical
to the mean-variance frontier), (b) T = 1 and utility of consumption (blue curve), (c) T = 1 and
utility from terminal wealth only (red curve), (d) T = 30 and utility from consumption (grey
curve), and (e) T = 30 and utility from terminal wealth only (green curve). Axes: volatility
(horizontal, 0% to 40%) and expected rate of return (vertical, 0% to 16%).
Next, we want to investigate how sensitive the asset allocation choice is to the assumed interest
rate model. We do that by computing the optimal portfolios when interest rates follow the CIR
model (10.7). We want to make a reasonably fair comparison between the two models. For that
purpose we choose $\sigma_r = 0.5$ in the CIR model so that the average short rate volatility is
$\sigma_r\sqrt{\bar{r}} = 0.05$ as in the Vasicek model. We set $\lambda_1 = 0.55$ and
$\lambda_2 = 0.3666$ so that the model is consistent with the estimated mean stock and bond returns
when $r = \bar{r}$ is used to compute the Sharpe ratios of the bond market ($\lambda_1(r)$) and the
stock market ($\psi(r)$ in (10.8)). The mean reversion rate is set at $\kappa = 0.7994$ so that the
volatility of a 10-year zero-coupon bond according to the model is equal to the historical estimate
of 10.0%. The optimal portfolio in the CIR setting depends on the current interest rate level. In
the computations we put this equal to the long-term average of 1%.

In Table 10.5 we list the optimal portfolios for investors with CRRA utility of terminal wealth
both for the Vasicek and the CIR setting. We consider an investor with a 1-year horizon and
an investor with a 30-year horizon. The stock weight is identical in the two models. The hedge
demand for bonds and hence the total bond demand (and the cash position) do depend on the
interest rate model, but the differences are relatively small. The yield curves of the two models
are almost identical. The long-term yield is 1.601% in the Vasicek model and 1.600% in the CIR
model. With a current short rate of 1%, the yield curve is uniformly increasing in both models,
cf. the results on the shape of the yield curve in the two models reported by, e.g., Munk (2011).
horizon $\gamma$ tangency stock | Vasicek model: hedge bond cash | CIR model: hedge bond cash
T = 1 0.5 4.4079 3.7045 -0.3941 0.3093 -3.0138 -0.6374 0.0660 -2.7705
1 2.2039 1.8522 0.0000 0.3517 -1.2039 0 0.3517 -1.2039
2 1.1020 0.9261 0.1970 0.3729 -0.2990 0.2482 0.4241 -0.3502
5 0.4408 0.3704 0.3153 0.3856 0.2439 0.3653 0.4357 0.1939
10 0.2204 0.1852 0.3547 0.3899 0.4249 0.3979 0.4331 0.3817
20 0.1102 0.0926 0.3744 0.3920 0.5154 0.4129 0.4305 0.4769
T = 30 0.5 4.4079 3.7045 -1.0070 -0.3036 -2.4009 -1.0066 -0.3033 -2.4012
1 2.2039 1.8522 0.0000 0.3517 -1.2039 0 0.3517 -1.2039
2 1.1020 0.9261 0.5035 0.6794 -0.6055 0.5012 0.6771 -0.6032
5 0.4408 0.3704 0.8056 0.8760 -0.2464 0.8012 0.8715 -0.2420
10 0.2204 0.1852 0.9063 0.9415 -0.1267 0.9010 0.9362 -0.1214
20 0.1102 0.0926 0.9567 0.9743 -0.0669 0.9509 0.9685 -0.0611
Table 10.5: Portfolio weights with Vasicek or CIR dynamics for CRRA investors
with utility from terminal wealth only.
10.5 Two-factor Vasicek model
Brennan and Xia (2000) study a two-factor Vasicek interest rate model with utility from terminal
wealth only. Assume that the dynamics of the short-term interest rate $r_t$ is
$$dr_t = (\varphi_r + u_t - \kappa_r r_t)\,dt - \sigma_r\,dz_{1t},$$
$$du_t = -\kappa_u u_t\,dt - \sigma_u\rho_{ru}\,dz_{1t} - \sigma_u\sqrt{1-\rho_{ru}^2}\,dz_{2t},$$
where $z_1 = (z_{1t})$ and $z_2 = (z_{2t})$ are independent one-dimensional standard Brownian
motions. The one-factor Vasicek model is the special case where $u_t \equiv 0$, and then the short
rate $r_t$ is expected to move towards the long-run level $\varphi_r/\kappa_r$. The new state
variable u allows for variations in this long-run target for the short-term interest rate. Note
that we can rewrite the drift rate of u as $\kappa_u(0 - u_t)$, which shows that u exhibits mean
reversion around the long-run level 0. Future values of r and u are normally distributed so it is a
Gaussian model. The market prices of risk associated with the shocks represented by $z_1$ and $z_2$
are denoted by $\lambda_1$ and $\lambda_2$, respectively, and are assumed constant.

Beaglehole and Tenney (1991) and Hull and White (1994) studied such a model and its implications
for the pricing of bonds. They have shown that the time t price of the zero-coupon bond maturing at
time $\bar{T}$ is given by
$$B^{\bar{T}}(r,u,t) = e^{-a(\bar{T}-t) - b_1(\bar{T}-t)\,r - b_2(\bar{T}-t)\,u},$$
where
$$b_1(\tau) = \frac{1}{\kappa_r}\left(1 - e^{-\kappa_r\tau}\right),$$
$$b_2(\tau) = \frac{1}{\kappa_r\kappa_u} + \frac{1}{\kappa_r(\kappa_r-\kappa_u)}\,e^{-\kappa_r\tau} - \frac{1}{\kappa_u(\kappa_r-\kappa_u)}\,e^{-\kappa_u\tau},$$
and $a(\tau)$ is a quite complicated function which is not important for what follows. The dynamics
of the price of the zero-coupon bond maturing at time $\bar{T}$ is then
$$dB_t^{\bar{T}} = B_t^{\bar{T}}\left[\left(r_t + \psi_B(\bar{T}-t)\right)dt + \sigma_{B1}(\bar{T}-t)\,dz_{1t} + \sigma_{B2}(\bar{T}-t)\,dz_{2t}\right],$$
where, for all $\tau \geq 0$, we have defined
$$\psi_B(\tau) = \lambda_1\sigma_{B1}(\tau) + \lambda_2\sigma_{B2}(\tau),$$
$$\sigma_{B1}(\tau) = \sigma_r b_1(\tau) + \sigma_u\rho_{ru}\,b_2(\tau),$$
$$\sigma_{B2}(\tau) = \sigma_u\sqrt{1-\rho_{ru}^2}\;b_2(\tau).$$
Consider an investor with utility of wealth at time T exhibiting a constant relative risk aversion
$\gamma > 1$. The investor earns no labor income, has no preferences for consumption before time T,
and is not subject to portfolio constraints. The investor can trade in a single non-dividend paying
stock (representing the stock market index) with time t price $S_t$, which evolves according to
$$dS_t = S_t\left[(r_t + \sigma_S\psi_S)\,dt + \sigma_S k_1\,dz_{1t} + \sigma_S k_2\,dz_{2t} + \sigma_S k_3\,dz_{3t}\right],$$
where $z_3 = (z_{3t})$ is a one-dimensional standard Brownian motion independent of $z_1$ and
$z_2$, and $k_3 = \sqrt{1 - k_1^2 - k_2^2}$ so that $\sigma_S$ is the volatility of the stock. The
constant $\lambda_3$ is the market price of risk associated with $z_3$ so the Sharpe ratio of the
stock is
$$\psi_S = k_1\lambda_1 + k_2\lambda_2 + \sqrt{1 - k_1^2 - k_2^2}\;\lambda_3.$$
In total we assume that the investor can invest in the following four assets:
(1) the locally risk-free asset (aka. the bank account or cash deposits) providing a net rate of
return of $r_t$,
(2) a zero-coupon bond maturing at time $T_1$,
(3) a zero-coupon bond maturing at time $T_2 \neq T_1$,
(4) the stock.
Of course, both $T_1$ and $T_2$ must be greater than current time t, but they can be smaller or
larger than the investment horizon T. If one or both bonds mature before T, the investor will then
have to replace the matured bond with a new bond maturing further into the future. As the
term structure of interest rates is driven by two Brownian motions, it would not help the investor
to trade in additional default-free bonds. The dynamics of the three risky assets can be written
compactly as
$$d\begin{pmatrix} B^{T_1}(r,u,t) \\ B^{T_2}(r,u,t) \\ S_t \end{pmatrix} = \begin{pmatrix} B^{T_1}(r,u,t) & 0 & 0 \\ 0 & B^{T_2}(r,u,t) & 0 \\ 0 & 0 & S_t \end{pmatrix}\left[\left(r_t\mathbf{1} + \sigma(t)\lambda\right)dt + \sigma(t)\,dz_t\right],$$
where
$$\lambda = \begin{pmatrix} \lambda_1 \\ \lambda_2 \\ \lambda_3 \end{pmatrix}, \qquad \sigma(t) = \begin{pmatrix} \sigma_{B1}(T_1-t) & \sigma_{B2}(T_1-t) & 0 \\ \sigma_{B1}(T_2-t) & \sigma_{B2}(T_2-t) & 0 \\ \sigma_S k_1 & \sigma_S k_2 & \sigma_S k_3 \end{pmatrix}.$$
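This is a two-dimensional affine asset allocation model as defined in Section 7.3.4. By solving
the relevant ODEs, the indirect utility function turns out to be (the following results are to be
verified in Exercise 10.3)
$$J(W,r,u,t) = \frac{1}{1-\gamma}\,g(r,u,t)^{\gamma}\,W^{1-\gamma},$$
$$g(r,u,t) = \exp\left\{-A_0(T-t) - \frac{\gamma-1}{\gamma}A_{1r}(T-t)\,r - \frac{\gamma-1}{\gamma}A_{1u}(T-t)\,u\right\},$$
$$A_{1r}(\tau) \equiv b_1(\tau), \qquad A_{1u}(\tau) \equiv b_2(\tau),$$
where $b_1$ and $b_2$ are defined above. $A_0(\tau)$ is another deterministic function, which is not
important for the optimal portfolio if we disregard intermediate consumption. The optimal investment
strategy is [footnote 1]
$$\begin{pmatrix} \pi_{B1}(t) \\ \pi_{B2}(t) \\ \pi_S \end{pmatrix} = \frac{1}{\gamma}\left(\sigma(t)^\top\right)^{-1}\lambda + \frac{\gamma-1}{\gamma}\left(\sigma(t)^\top\right)^{-1}\begin{pmatrix} \sigma_r & \sigma_u\rho_{ru} \\ 0 & \sigma_u\sqrt{1-\rho_{ru}^2} \\ 0 & 0 \end{pmatrix}\begin{pmatrix} A_{1r}(T-t) \\ A_{1u}(T-t) \end{pmatrix}.$$
The optimal investment in the stock reduces to
$$\pi_S = \frac{1}{\gamma}\,\frac{\lambda_3}{\sigma_S k_3}.$$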
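The optimal investments in the two bonds can be rewritten as
$$\pi_{B1}(t) = \frac{1}{\gamma\sigma_r\sigma_u\sqrt{1-\rho_{ru}^2}\,d(t)}\left\{\sigma_r b_1(T_2-t)\left(\frac{k_2\lambda_3}{k_3} - \lambda_2\right) + \sigma_u b_2(T_2-t)\left[\left(\lambda_1 - \frac{k_1\lambda_3}{k_3}\right)\sqrt{1-\rho_{ru}^2} + \left(\frac{k_2\lambda_3}{k_3} - \lambda_2\right)\rho_{ru}\right]\right\}$$
$$\qquad\qquad + \frac{\gamma-1}{\gamma\,d(t)}\left(b_2(T_2-t)\,b_1(T-t) - b_1(T_2-t)\,b_2(T-t)\right),$$
$$\pi_{B2}(t) = \frac{1}{\gamma\sigma_r\sigma_u\sqrt{1-\rho_{ru}^2}\,d(t)}\left\{\sigma_r b_1(T_1-t)\left(\lambda_2 - \frac{k_2\lambda_3}{k_3}\right) + \sigma_u b_2(T_1-t)\left[\left(\frac{k_1\lambda_3}{k_3} - \lambda_1\right)\sqrt{1-\rho_{ru}^2} + \left(\lambda_2 - \frac{k_2\lambda_3}{k_3}\right)\rho_{ru}\right]\right\}$$
$$\qquad\qquad + \frac{\gamma-1}{\gamma\,d(t)}\left(b_1(T_1-t)\,b_2(T-t) - b_1(T-t)\,b_2(T_1-t)\right),$$
where
$$d(t) = b_1(T_1-t)\,b_2(T_2-t) - b_1(T_2-t)\,b_2(T_1-t).$$
Note that the portfolio weights are purely deterministic and thus independent of the state variables
r and u. Also note that if $T = T_1$, the hedge term for the $T_1$-bond reduces to
$\frac{\gamma-1}{\gamma}$, whereas the hedge term for the $T_2$-bond vanishes. The converse holds if
$T = T_2$.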
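To see how the two hedge terms behave as the horizon varies, the following sketch evaluates them
numerically. It assumes mean reversion rates of 0.5 for r and 0.1 for u and bond maturities of 5 and
20 years (the values used in the numerical example below), sets t = 0, and uses illustrative
function names; it is a check of the formulas rather than part of the derivation.

```python
import math

KAPPA_R, KAPPA_U = 0.5, 0.1   # mean reversion rates of r and u (assumed values)
T1, T2 = 5.0, 20.0            # maturities of the two traded bonds


def b1(tau):
    return (1.0 - math.exp(-KAPPA_R * tau)) / KAPPA_R


def b2(tau):
    diff = KAPPA_R - KAPPA_U
    return (1.0 / (KAPPA_R * KAPPA_U)
            + math.exp(-KAPPA_R * tau) / (KAPPA_R * diff)
            - math.exp(-KAPPA_U * tau) / (KAPPA_U * diff))


def hedge_weights(gamma, horizon):
    """Hedge demands for the T1-bond and the T2-bond at time t = 0."""
    d = b1(T1) * b2(T2) - b1(T2) * b2(T1)
    scale = (gamma - 1.0) / (gamma * d)
    hedge_1 = scale * (b2(T2) * b1(horizon) - b1(T2) * b2(horizon))
    hedge_2 = scale * (b1(T1) * b2(horizon) - b1(horizon) * b2(T1))
    return hedge_1, hedge_2


for horizon in (2, 5, 10, 20, 30):
    h1, h2 = hedge_weights(2.0, horizon)
    print(horizon, round(h1, 4), round(h2, 4))
```

When the horizon coincides with one of the bond maturities, the other bond's hedge term is zero;
for horizons outside the interval between the two maturities, one of the hedge terms turns negative.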
Footnote 1: When computing $(\sigma_t^\top)^{-1}$, it is useful to know that
$$\begin{pmatrix} M_{11} & M_{12} & M_{13} \\ M_{21} & M_{22} & M_{23} \\ 0 & 0 & M_{33} \end{pmatrix}^{-1} = \begin{pmatrix} \begin{pmatrix} M_{11} & M_{12} \\ M_{21} & M_{22} \end{pmatrix}^{-1} & -\dfrac{1}{M_{33}}\begin{pmatrix} M_{11} & M_{12} \\ M_{21} & M_{22} \end{pmatrix}^{-1}\begin{pmatrix} M_{13} \\ M_{23} \end{pmatrix} \\ \begin{pmatrix} 0 & 0 \end{pmatrix} & \dfrac{1}{M_{33}} \end{pmatrix}.$$

Now assume the following parameters:
$\varphi_r = 0.02$, $\kappa_r = 0.5$, $\sigma_r = 0.022$, $\lambda_1 = 0.11$,
$\kappa_u = 0.1$, $\sigma_u = 0.005$, $\rho_{ru} = 0$, $\lambda_2 = 0.07$,
$\sigma_S = 0.141$, $\psi_S = 0.284$, $k_1 = 0.15$, $k_2 = 0.2$.
Suppose in the following that at any point in time the two zero-coupon bonds which the individual
invests in mature in 5 years and 20 years, respectively. Hence, the sensitivity matrix $\sigma(t)$
is constant and so is the speculative part of the optimal investment strategy. With the listed
parameters, the 5-year bond has a volatility of 4.82% and an expected excess rate of return of
0.63%, whereas the 20-year bond has a volatility of 9.40% and an expected excess rate of return of
1.07%. The instantaneous correlations between the stock and the 5-year bond and 20-year bond are
0.235 and 0.247, respectively. The instantaneous correlation between the two bonds is 0.874. The
tangency portfolio (normalized so that the weights sum to 100%) consists of 62.5% in the 5-year
bond, -14.5% in the 20-year bond, and 52.0% in the stock.
Figures 10.3 and 10.4 depict the optimal investments in the risky assets as a function of
the investment horizon for a relative risk aversion of $\gamma = 2$ and $\gamma = 4$, respectively.
The lines corresponding to the speculative demands are flat as explained above, and the figures
confirm that for each asset the speculative demand for $\gamma = 4$ is exactly half of the
speculative demand for $\gamma = 2$. There is no hedge demand for the stock so the speculative stock
demand equals the total stock demand. The hedge demands for the two bonds are highly dependent on
the investment horizon of the investor. With a 5-year horizon, the 5-year bond is the perfect hedge
instrument so the 20-year bond drops out of the hedge portfolio. Conversely for a 20-year horizon.
For a horizon between 5 and 20 years, the hedge portfolio consists of long positions in both bonds
in order to replicate a zero-coupon bond with a maturity identical to the horizon. The same argument
explains the composition of the hedge portfolio for other horizons. For a horizon shorter than 5
years, this requires a large long position in the 5-year bond and a small short position in the
20-year bond to emulate the desired bond maturity. For a horizon longer than 20 years, this requires
a large long position in the 20-year bond and a small short position in the 5-year bond. A
consequence of these results is that the optimal portfolio weight in a bond of a given maturity is
non-monotonic in the horizon of the investor.
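The figures quoted for this example can be reproduced directly from the model formulas. The sketch
below is an illustration under stated assumptions: it takes the stock volatility to be 0.141 and
the stock's Sharpe ratio to be 0.284, backs out the market price of risk of the third shock from the
Sharpe ratio relation, and solves $\sigma^\top\pi = \lambda$ for the tangency portfolio. All
function and variable names are hypothetical.

```python
import math

KAPPA_R, SIGMA_R, LAMBDA_1 = 0.5, 0.022, 0.11
KAPPA_U, SIGMA_U, RHO_RU, LAMBDA_2 = 0.1, 0.005, 0.0, 0.07
SIGMA_S, SHARPE_S, K1, K2 = 0.141, 0.284, 0.15, 0.2
K3 = math.sqrt(1.0 - K1**2 - K2**2)
# Market price of risk of z3 implied by the stock's Sharpe ratio (assumption).
LAMBDA_3 = (SHARPE_S - K1 * LAMBDA_1 - K2 * LAMBDA_2) / K3


def b1(tau):
    return (1.0 - math.exp(-KAPPA_R * tau)) / KAPPA_R


def b2(tau):
    diff = KAPPA_R - KAPPA_U
    return (1.0 / (KAPPA_R * KAPPA_U)
            + math.exp(-KAPPA_R * tau) / (KAPPA_R * diff)
            - math.exp(-KAPPA_U * tau) / (KAPPA_U * diff))


def bond_loadings(tau):
    """Loadings of a tau-year zero-coupon bond on the shocks z1 and z2."""
    load_1 = SIGMA_R * b1(tau) + SIGMA_U * RHO_RU * b2(tau)
    load_2 = SIGMA_U * math.sqrt(1.0 - RHO_RU**2) * b2(tau)
    return load_1, load_2


def bond_stats(tau):
    """Volatility and expected excess return of a tau-year bond."""
    load_1, load_2 = bond_loadings(tau)
    return math.hypot(load_1, load_2), LAMBDA_1 * load_1 + LAMBDA_2 * load_2


def tangency_portfolio(t1=5.0, t2=20.0):
    """Normalized weights (t1-bond, t2-bond, stock) solving sigma^T pi = lambda."""
    a1, a2 = bond_loadings(t1)
    c1, c2 = bond_loadings(t2)
    pi_s = LAMBDA_3 / (SIGMA_S * K3)          # third equation involves the stock only
    rhs_1 = LAMBDA_1 - SIGMA_S * K1 * pi_s
    rhs_2 = LAMBDA_2 - SIGMA_S * K2 * pi_s
    det = a1 * c2 - c1 * a2
    pi_1 = (rhs_1 * c2 - c1 * rhs_2) / det    # Cramer's rule for the 2x2 bond block
    pi_2 = (a1 * rhs_2 - a2 * rhs_1) / det
    total = pi_1 + pi_2 + pi_s
    return pi_1 / total, pi_2 / total, pi_s / total


print([round(x, 4) for x in bond_stats(5.0)])
print([round(x, 4) for x in bond_stats(20.0)])
print([round(x, 3) for x in tangency_portfolio()])
```

With these inputs the tangency portfolio holds a short position in the 20-year bond, which is what
makes the three normalized weights add up to 100%.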
Figure 10.3: Optimal portfolios in the two-factor Vasicek model: risk aversion of 2. Axes:
investment horizon in years (horizontal, 0 to 50) and portfolio weight (vertical, -40% to 140%).
Series: 5Y bond (spec), 20Y bond (spec), stock, 5Y bond (hedge), 20Y bond (hedge).

Figure 10.4: Optimal portfolios in the two-factor Vasicek model: risk aversion of 4. Axes:
investment horizon in years (horizontal, 0 to 50) and portfolio weight (vertical, -40% to 140%).
Series: 5Y bond (spec), 20Y bond (spec), stock, 5Y bond (hedge), 20Y bond (hedge).

10.6 Other studies with stochastic interest rates
Brennan, Schwartz, and Lagnado (1997) apply the two-factor Brennan-Schwartz interest rate
dynamics in a model that also has stochastic dividends on stocks. They study the effect of the
length of the investment horizon for an investor with utility from terminal wealth only. Due to the
complexity of their model they must resort to numerical solution techniques.

Wachter (2003) shows that as risk aversion approaches infinity, an investor with utility only of
wealth at time T will invest solely in the real zero-coupon bond maturing at time T. This holds for
any utility function and for all well-behaved Itô processes for the returns of the available assets.
With utility of intermediate consumption, the infinitely risk-averse individual should invest in a
certain coupon bond with coupons related to expected future consumption.

Munk and Sørensen (2004) study the asset allocation problem when the term structure of interest
rates evolves according to models in the Heath-Jarrow-Morton (HJM) class. As shown by Heath,
Jarrow, and Morton (1992), any dynamic interest rate model is fully specified by the current term
structure and the forward rate volatilities. Therefore the HJM modeling framework is natural
when comparing the separate effects of the current term structure and the dynamics of the term
structure on the optimal interest rate hedging strategy. Term structure models in the HJM class
are not necessarily Markovian, but the class includes the well-known Markovian models such as the
Vasicek model. To cover the non-Markovian models the authors apply the martingale approach
to solve the utility maximization problem instead of the dynamic programming approach. Within
the HJM framework one may fix the current yield curve and vary its future dynamics to gauge the
effect of the interest rate dynamics. As in all term structure models one can fix the dynamics and
vary the initial yield curve (for absolute pricing models, such as the Vasicek and CIR models, not
all initial yield curves are possible). The paper compares the optimal portfolio and consumption
strategies for a standard one-factor Vasicek model and a three-factor model where the term structure
can exhibit three kinds of changes: a parallel shift, a slope change, and a curvature change. The
authors find that the form of the initial term structure is of crucial importance for the certainty
equivalents of future consumption and, hence, important for the relevant interest rate hedge, while
the specific dynamics of the term structure is of minor importance. Of course, further studies of
this kind are needed to find out whether this conclusion is generally valid.

Detemple and Rindisbacher (2010) derive a new and very general portfolio decomposition result.
Assuming utility of time T wealth only, the optimal portfolio is decomposed into three terms: (i)
the speculative term, (ii) a term hedging variations in the price of the zero-coupon bond maturing
at time T, and (iii) a term hedging against fluctuations in the density of the so-called T-forward
probability measure, i.e., the equivalent martingale measure corresponding to the use of the
zero-coupon bond maturing at T as the numeraire, cf. Björk (2009) or Munk (2011).
It has long been recognized that the volatility of interest rates varies over time in a
non-deterministic way. This is a key motivation behind models in which one or several of the state
variables follow square-root processes. In the basic interest rate models with stochastic volatility,
the zero-coupon bond prices will depend non-trivially on all state variables and thus in particular
on the volatility-determining state variables. Because the first-order partial derivatives of the
zero-coupon bond price with respect to these volatility factors are generally non-zero, it is possible
to set up a trading strategy in bonds of different maturities which is completely hedged against
volatility shocks, i.e., the stochastic volatility is spanned by the traded bonds. However, some
recent empirical studies document unspanned stochastic volatility in the sense that a part of the
stochastic volatility in the yield curve cannot be hedged away using only bonds. Simple fixed-income
derivatives like caps and swaptions, which obviously depend on the volatility of interest
rates, cannot be perfectly replicated by trading even a larger number of bonds. Bond markets are
incomplete. [footnote 2] Trolle (2009) studies the optimal demand for bonds and interest rate
derivatives in a model featuring unspanned stochastic volatility (such a model is necessarily quite
complex). Since interest rate derivatives (options, caps, floors, swaptions, etc.) will depend on
the volatility factors not spanned by the bonds, investing in derivatives allows you to pick up the
market price of risk associated with those factors and also to hedge against adverse shifts in the
factors. Trolle's empirical investigation shows that the market prices of risk of the unspanned
volatility factors and thus the Sharpe ratios of the interest rate derivatives are high (compared to
bonds). As a consequence, he finds substantial welfare gains from including interest rate
derivatives in the portfolio.

Footnote 2: For example, based on 1995-2000 data from the U.S., the U.K., and Japan, Collin-Dufresne
and Goldstein (2002) find that only a (small) part of the returns on at-the-money straddles can be
explained by changes in the underlying swap rates in a regression analysis. An at-the-money straddle
is a portfolio consisting of an at-the-money cap and an at-the-money floor. By construction, such a
straddle is neutral to small changes in the interest rate level, but very sensitive to changes in
volatility. The results thus show that variations in interest rate volatility are only partly due to
variations in the level of interest rates. Note that this is model-independent evidence of unspanned
stochastic volatility: no model is assumed for the pricing of the caps and floors involved. For
further empirical support of unspanned stochastic volatility, see Heidari and Wu (2003), Li and Zhao
(2006), Jarrow, Li, and Zhao (2007), and Trolle and Schwartz (2009).

A number of papers explore models with both stochastic interest rates and stochastic inflation
rates. We will study such a setting in Chapter 12. As an example, Campbell and Viceira (2001)
study a discrete-time consumption and portfolio choice problem of an infinitely-lived investor with
recursive utility of the Epstein-Zin type. They assume that the real short-term interest rate and
the expected inflation rate follow correlated AR(1) processes, i.e., discrete-time versions of the
Ornstein-Uhlenbeck process, similar to the dynamics of r and u in the two-factor Vasicek model
above. They derive an approximate analytic solution to the problem and compare the optimal
bond demand for a long-term inflation-indexed bond to the optimal bond demand for a long-term
nominal bond.

Another study with both stochastic interest rates and inflation is Sangvinatsos and Wachter
(2005). They assume that the nominal short-term interest rate and the expected inflation rate are
affine functions of a three-dimensional state variable, which follows an Ornstein-Uhlenbeck process.
The market prices of risk are allowed to be affine in the state variable as well so that excess
expected bond returns vary with the state variables in contrast to the one- and two-factor Vasicek
models considered earlier in the chapter. The model is still affine, so they can derive a
closed-form solution for the optimal portfolio of an investor with CRRA utility of terminal wealth.
Among other things they show that, when the investor has access to several long-term bonds of
different maturities, the optimal portfolio typically involves relatively extreme long and short
positions.

10.7 Exercises
Exercise 10.1. Consider a financial market where the only two assets traded are (1) a bank
account with a rate of return of $r_t$ and (2) a risky asset with price $P_t$ following the
geometric Brownian motion,
$$dP_t = P_t\left[\mu\,dt + \sigma\,dz_t\right].$$
The short-term interest rate is assumed to follow a Vasicek process:
$$dr_t = \kappa\left[\bar{r} - r_t\right]dt + \rho\sigma_r\,dz_t + \sqrt{1-\rho^2}\,\sigma_r\,d\hat{z}_t.$$
(a) Describe the model!
We look at an investor with CRRA utility of terminal wealth only,
$$J(W,r,t) = \sup_{\pi} E_{W,r,t}\left[\frac{W_T^{1-\gamma}}{1-\gamma}\right],$$
where the process $\pi$ denotes the fraction of wealth invested in the risky asset.
(b) State the HJB equation corresponding to this problem.
(c) Find the first-order condition for $\pi$.
(d) Show that the indirect utility function is of the form
$$J(W,r,t) = \frac{1}{1-\gamma}\left(W e^{A_0(T-t) + A_1(T-t)\,r + \frac{1}{2}A_2(T-t)\,r^2}\right)^{1-\gamma}.$$
What can you say about the functions $A_i$?
(e) Find the optimal portfolio strategy. Compare it with the solution for constant r.
Exercise 10.2. Consider an economy with a single agent. The agent owns a production plant that
generates units of the consumption good of the economy. The agent can choose to withdraw
consumption goods from the production or reinvest them in the production process. The productivity
of her plant depends on a state variable $Y_t$ that follows the process
$$dY_t = (b - \kappa Y_t)\,dt + k\sqrt{Y_t}\,dz_t, \qquad Y_0 = y,$$
where b, $\kappa$ and k are positive constants with $2b > k^2$. Let $c_t \geq 0$ denote the rate by
which the agent withdraws consumption goods from the production plant and let $X_t^c$ be the value
of the plant at time t given the consumption process c. We assume that
$$dX_t^c = (X_t^c\,hY_t - c_t)\,dt + X_t^c\,\varepsilon\sqrt{Y_t}\,dz_t, \qquad X_0^c = x,$$
where h and $\varepsilon$ are positive constants with $h > \varepsilon^2$. The agent has a log
utility of consumption over her life-time T, so that the indirect utility function is
$$V(x,y,t) = \sup_{c} E_{x,y,t}\left[\int_t^T e^{-\rho(s-t)}\ln c_s\,ds\right].$$
(a) State the HJB equation corresponding to the problem and find the first-order condition for
the optimal consumption rate.
(b) Verify that the function
$$V(x,y,t) = A_0(t)\ln x + A_1(t)\,y + A_2(t)$$
satisfies the HJB equation and find ordinary differential equations that the functions $A_0$, $A_1$
and $A_2$ must solve. Show that $A_0(t) = \frac{1}{\rho}\left(1 - e^{-\rho(T-t)}\right)$. Find an
explicit expression for the optimal consumption rate, $c_t^*$.
(c) We know from the martingale approach that the state-price deflator $\zeta_t$ satisfies
$\eta\zeta_t = u'(c_t^*, t)$, where $\eta$ is a constant, and where $u(c,t) = e^{-\rho t}\ln c$ in
our case. Use this and the expression for optimal consumption to show that
$$\zeta_t = \frac{1}{\eta}\,e^{-\rho t}\,\frac{A_0(t)}{X_t^*},$$
where $X_t^*$ is the optimal value of the production plant, i.e., $X_t^* = X_t^{c^*}$. Apply Itô's
Lemma in order to find the dynamics of $\zeta_t$.
(d) We also know that
$$d\zeta_t = -\zeta_t\left[r_t\,dt + \lambda_t\,dz_t\right],$$
where $r_t$ is the short-term interest rate. Conclude that $r_t = (h - \varepsilon^2)Y_t$. Show that
the dynamics of $r_t$ is of the form
$$dr_t = \kappa\left[\bar{r} - r_t\right]dt + \sigma_r\sqrt{r_t}\,dz_t,$$
where $\kappa$, $\bar{r}$ and $\sigma_r$ are positive constants. Appreciate this result!
Exercise 10.3. Verify the expressions stated in Section 10.5 for the indirect utility function and
the optimal investment strategy for the two-factor Vasicek model. If preferences for intermediate
consumption are included, what will the optimal consumption and investment strategy look like?
CHAPTER 11
Asset allocation with stochastic market prices of risk
11.1 Introduction
In this chapter we consider models where interest rates are constant, but the market price of
stock market risk varies stochastically over time.
11.2 Mean reversion in stock returns
Several empirical studies provide evidence of mean reversion in stock returns so that expected stock returns are high after a period of low realized returns and vice versa. See, e.g., Poterba and Summers (1988), Fama and French (1989), Campbell, Lo, and MacKinlay (1997, Ch. 7), and Cochrane (2005, Ch. 20). Formulated differently, stock returns appear to be predictable by factors related to the current stock price, such as the earnings/price ratio or the dividend/price ratio.$^1$ We have seen earlier that CRRA investors should have a constant fraction of wealth invested in the stock market index if the stock market risk premium is constant over time. Mean reversion in stock returns leads to lower variance of long-term stock returns, which intuitively should lead to larger investments in the stock. Moreover, we expect that CRRA investors should invest more [less] in the stock in periods where the expected future stock return is high [low].
Some recent papers have set up formal models studying the implications for portfolio decisions of mean reversion in stock returns. Both Kim and Omberg (1996) and Wachter (2002) obtain closed-form expressions for the optimal investment strategy in a set-up with a constant risk-free interest rate $r$ and a single risky asset (representing the stock market) with price $P_t$ evolving as
$$dP_t = P_t \left[ (r + \sigma \lambda_t) \, dt + \sigma \, dz_t \right], \tag{11.1}$$
where the volatility $\sigma$ is assumed to be a positive constant, but the market price of risk $\lambda_t$ follows a mean-reverting process. Note that in this setting the market price of risk is identical to the Sharpe ratio of the stock. Kim and Omberg (1996) consider an investor with a CRRA utility of terminal wealth only, which allows them to let $\lambda_t$ have an undiversifiable risk component. On the other hand, Wachter (2002) considers a time-separable CRRA utility function of consumption, so to obtain explicit solutions she assumes that the market price of risk is perfectly (negatively) correlated with the price level. Wachter argues that the assumption of a correlation of $-1$ is empirically not unreasonable. To allow for non-perfect correlation we write the dynamics of $\lambda$ as
$$d\lambda_t = \kappa \left[ \bar\lambda - \lambda_t \right] dt + \rho \sigma_\lambda \, dz_t + \sqrt{1 - \rho^2}\, \sigma_\lambda \, d\hat z_t. \tag{11.2}$$
All constants are assumed positive, except the correlation parameter $\rho$. The market price of risk is assumed to follow an Ornstein-Uhlenbeck process with long-term average $\bar\lambda$, mean reversion speed $\kappa$, and volatility $\sigma_\lambda$.

$^1$ There is also evidence that stock returns can be predicted by the current level of interest rates, cf., e.g., Ang and Bekaert (2007).
A negative value of the correlation $\rho$ will represent mean reversion in the returns on the stock in the following sense. A positive shock $dz_t$ will then affect the current stock return $\frac{dP_t}{P_t} = \frac{P_{t+dt} - P_t}{P_t}$ positively and the market price of risk $\lambda_{t+dt} = \lambda_t + d\lambda_t$ negatively. Hence the market price of risk and the expected stock return for a short period starting at $t + dt$ will be lower. So a high realized return in the current period will be followed by a low expected return in the following period. Likewise, a low realized return in the current period will be followed by a high expected return in the subsequent period.
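Let us first study how the distribution of future prices is affected by the mean reversion property. It follows from the price dynamics (11.1) that
$$P_T = P_t \exp\left\{ \int_t^T \left( r - \tfrac{1}{2}\sigma^2 + \sigma \lambda_s \right) ds + \int_t^T \sigma \, dz_u \right\}
= P_t \exp\left\{ \left( r - \tfrac{1}{2}\sigma^2 \right)(T - t) + \sigma \int_t^T \lambda_s \, ds + \sigma \int_t^T dz_u \right\}. \tag{11.3}$$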
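From (11.2), it follows that
$$\lambda_s = \bar\lambda + e^{-\kappa(s-t)} \left( \lambda_t - \bar\lambda \right) + \int_t^s \rho \sigma_\lambda e^{-\kappa(s-u)} \, dz_u + \int_t^s \sqrt{1 - \rho^2}\, \sigma_\lambda e^{-\kappa(s-u)} \, d\hat z_u$$
and, hence,
$$\int_t^T \lambda_s \, ds = \bar\lambda (T - t) + \left( \lambda_t - \bar\lambda \right) \int_t^T e^{-\kappa(s-t)} \, ds
+ \int_t^T \left( \int_t^s \rho \sigma_\lambda e^{-\kappa(s-u)} \, dz_u \right) ds + \int_t^T \left( \int_t^s \sqrt{1 - \rho^2}\, \sigma_\lambda e^{-\kappa(s-u)} \, d\hat z_u \right) ds.$$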
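To proceed, we interchange the order of integration in the two double integrals, which leaves us with
$$\begin{aligned}
\int_t^T \lambda_s \, ds &= \bar\lambda (T - t) + \left( \lambda_t - \bar\lambda \right) \int_t^T e^{-\kappa(s-t)} \, ds \\
&\quad + \int_t^T \left( \int_u^T \rho \sigma_\lambda e^{-\kappa(s-u)} \, ds \right) dz_u + \int_t^T \left( \int_u^T \sqrt{1 - \rho^2}\, \sigma_\lambda e^{-\kappa(s-u)} \, ds \right) d\hat z_u \\
&= \bar\lambda (T - t) + \left( \lambda_t - \bar\lambda \right) b(T - t) + \int_t^T \rho \sigma_\lambda b(T - u) \, dz_u + \int_t^T \sqrt{1 - \rho^2}\, \sigma_\lambda b(T - u) \, d\hat z_u \\
&= \bar\lambda (T - t) + \left( \lambda_t - \bar\lambda \right) b(T - t) + \int_t^T \rho \sigma_\lambda b(T - s) \, dz_s + \int_t^T \sqrt{1 - \rho^2}\, \sigma_\lambda b(T - s) \, d\hat z_s,
\end{aligned}$$
where we have introduced $b(\tau) = (1 - e^{-\kappa\tau})/\kappa$, and where the last line simply replaces $u$ by $s$ in the integrals. Next, we substitute this expression into (11.3) and combine the two $z$-integrals so that we end up with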
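$$P_T = P_t \exp\left\{ \left( r - \frac{\sigma^2}{2} + \sigma \bar\lambda \right)(T - t) + \sigma b(T - t) \left( \lambda_t - \bar\lambda \right)
+ \sigma \int_t^T \left( 1 + \rho \sigma_\lambda b(T - s) \right) dz_s + \sigma \sigma_\lambda \sqrt{1 - \rho^2} \int_t^T b(T - s) \, d\hat z_s \right\}.$$
Only the last two terms are stochastic and since the integrands are deterministic functions of time, the two stochastic integrals are normally distributed random variables. Hence, $P_T$ is lognormally distributed. Since the integrals have mean zero, we get
$$\mathrm{E}_t[\ln P_T] = \ln P_t + \left( r - \frac{\sigma^2}{2} + \sigma \bar\lambda \right)(T - t) + \sigma b(T - t) \left( \lambda_t - \bar\lambda \right).$$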
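The variance is
$$\begin{aligned}
\mathrm{Var}_t[\ln P_T] &= \mathrm{Var}_t\left[ \sigma \int_t^T \left( 1 + \rho \sigma_\lambda b(T - s) \right) dz_s + \sigma \sigma_\lambda \sqrt{1 - \rho^2} \int_t^T b(T - s) \, d\hat z_s \right] \\
&= \sigma^2 \left( \int_t^T \left( 1 + \rho \sigma_\lambda b(T - s) \right)^2 ds + \sigma_\lambda^2 (1 - \rho^2) \int_t^T b(T - s)^2 \, ds \right) \\
&= \sigma^2 \int_t^T \left( 1 + 2 \rho \sigma_\lambda b(T - s) + \sigma_\lambda^2 b(T - s)^2 \right) ds \\
&= \sigma^2 \left[ \left( 1 + \frac{2 \rho \sigma_\lambda}{\kappa} + \frac{\sigma_\lambda^2}{\kappa^2} \right)(T - t) - \left( \frac{2 \rho \sigma_\lambda}{\kappa} + \frac{\sigma_\lambda^2}{\kappa^2} \right) b(T - t) - \frac{\sigma_\lambda^2}{2\kappa} b(T - t)^2 \right],
\end{aligned}$$
where the last equality follows from the integrals
$$\int_t^T b(T - u) \, du = \frac{1}{\kappa} \left( T - t - b(T - t) \right),$$
$$\int_t^T b(T - u)^2 \, du = \frac{1}{\kappa^2} \left( T - t - b(T - t) \right) - \frac{1}{2\kappa} b(T - t)^2.$$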
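With a constant Sharpe ratio $\bar\lambda$, the stock price would follow a geometric Brownian motion so that the future price would be lognormally distributed with $\mathrm{E}_t[\ln P_T] = \ln P_t + \left( r - \frac{\sigma^2}{2} + \sigma \bar\lambda \right)(T - t)$ and $\mathrm{Var}_t[\ln P_T] = \sigma^2 (T - t)$. If we take the ratio of the variance of $\ln P_T$ with the mean reversion feature to the variance of $\ln P_T$ without mean reversion, we get
$$\frac{\mathrm{Var}_t[\ln P_T]}{\sigma^2 (T - t)} = 1 + \frac{2 \rho \sigma_\lambda}{\kappa} + \frac{\sigma_\lambda^2}{\kappa^2} - \left( \frac{2 \rho \sigma_\lambda}{\kappa} + \frac{\sigma_\lambda^2}{\kappa^2} \right) \frac{b(T - t)}{T - t} - \frac{\sigma_\lambda^2}{2\kappa} \frac{b(T - t)^2}{T - t}
\;\to\; 1 + \frac{2 \rho \sigma_\lambda}{\kappa} + \frac{\sigma_\lambda^2}{\kappa^2} \quad \text{for } T \to \infty.$$
The variations in the Sharpe ratio will therefore decrease the variance in the long run if
$$\frac{2 \rho \sigma_\lambda}{\kappa} + \frac{\sigma_\lambda^2}{\kappa^2} < 0 \quad \Leftrightarrow \quad \rho < -\frac{\sigma_\lambda}{2\kappa},$$
i.e., if the correlation between the Sharpe ratio and the stock price is sufficiently negative.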
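The variance ratio above is easy to check numerically. The following is a minimal sketch, not part of the original notes: the function names are made up for illustration, and the parameter values are the ones quoted in the caption of Figure 11.1 (as reconstructed there). It evaluates the closed-form ratio and compares it with a direct midpoint integration of $1 + 2\rho\sigma_\lambda b(T-s) + \sigma_\lambda^2 b(T-s)^2$.

```python
import math


def b(tau, kappa):
    """b(tau) = (1 - exp(-kappa * tau)) / kappa, as defined in the text."""
    return (1.0 - math.exp(-kappa * tau)) / kappa


def variance_ratio(tau, kappa, sigma_lam, rho):
    """Closed-form Var_t[ln P_T] / (sigma^2 * tau) for horizon tau = T - t."""
    c = 2.0 * rho * sigma_lam / kappa + sigma_lam ** 2 / kappa ** 2
    bt = b(tau, kappa)
    return 1.0 + c - c * bt / tau - sigma_lam ** 2 / (2.0 * kappa) * bt ** 2 / tau


def variance_ratio_numeric(tau, kappa, sigma_lam, rho, steps=2000):
    """Same ratio obtained by midpoint integration of the integrand over [t, T]."""
    h = tau / steps
    total = 0.0
    for i in range(steps):
        bs = b(tau - (i + 0.5) * h, kappa)  # b(T - s) at the midpoint of step i
        total += 1.0 + 2.0 * rho * sigma_lam * bs + sigma_lam ** 2 * bs ** 2
    return total * h / tau


if __name__ == "__main__":
    kappa, sigma_lam, rho = 0.02, 0.01, -0.8  # values assumed from Figure 11.1
    for tau in (1.0, 5.0, 30.0):
        print(tau, variance_ratio(tau, kappa, sigma_lam, rho),
              variance_ratio_numeric(tau, kappa, sigma_lam, rho))
    # Long-run limit of the ratio and the threshold correlation from the text
    print(1.0 + 2.0 * rho * sigma_lam / kappa + sigma_lam ** 2 / kappa ** 2)
    print(-sigma_lam / (2.0 * kappa))
```

With these inputs the correlation lies below the threshold $-\sigma_\lambda/(2\kappa)$, so the ratio falls below one and keeps decreasing with the horizon, in line with the discussion above.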
Figure 11.1 illustrates the effects of the mean reversion feature on the distribution of $\ln(P_T/P_0)$ for $T = 5$ and $T = 30$ years by comparing with the distribution under the assumption of the standard Merton model in which $\lambda_t$ is constant and the stock price follows a geometric Brownian motion (GBM). As expected, the distribution with mean reversion has thinner tails and a higher peak. Given the seemingly reasonable parameter values assumed when generating the figure, the differences between the two distributions are not visible for horizons lower than one year (not illustrated), still quite small for the 5-year horizon, while very clear for the 30-year horizon. This suggests that it is more important to take the mean reversion property of stock returns into account for investors with relatively long investment horizons. Figure 11.2 shows that the mean reversion feature increases the probability that a 100% stock market position outperforms a 100% risk-free position, but, with the given parameters, the increase is rather limited even for long investment horizons.

[Figure 11.1: Effects of mean reversion on the distribution of the log-return, $\ln(P_T/P_0)$. Panel (a): horizon of $T = 5$ years. Panel (b): horizon of $T = 30$ years. The graphs show the distribution of the log-return with mean reversion (black curve) and without mean reversion, i.e., assuming the stock price follows a geometric Brownian motion (red curve). The parameter values are $r = 0.03$, $\sigma = 0.2$, $\kappa = 0.02$, $\bar\lambda = 0.3$, $\sigma_\lambda = 0.01$, $\rho = -0.8$, and $\lambda_t = \bar\lambda$.]
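Since $\ln P_T$ is normally distributed in both models, the outperformance probability discussed above reduces to a normal distribution function evaluated at the expected excess log-return divided by its standard deviation. The sketch below is an illustration under stated assumptions rather than the computation behind Figure 11.2: it assumes $\lambda_t = \bar\lambda$ (so the $b(T-t)(\lambda_t - \bar\lambda)$ term in the mean vanishes), uses the parameter values quoted in the figure captions, and the function names are hypothetical.

```python
import math


def normal_cdf(x):
    """Standard normal distribution function via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def log_return_variance(tau, sigma, kappa, sigma_lam, rho):
    """Var_t[ln P_T] in the mean-reversion model for horizon tau = T - t."""
    bt = (1.0 - math.exp(-kappa * tau)) / kappa
    c = 2.0 * rho * sigma_lam / kappa + sigma_lam ** 2 / kappa ** 2
    return sigma ** 2 * ((1.0 + c) * tau - c * bt - sigma_lam ** 2 / (2.0 * kappa) * bt ** 2)


def outperformance_probability(tau, sigma, lam_bar, variance):
    """P(ln(P_T/P_t) > r * tau) when lambda_t equals its long-run level lam_bar.

    The risk-free rate cancels: the expected excess log-return is
    (sigma * lam_bar - sigma^2 / 2) * tau.
    """
    excess_mean = (sigma * lam_bar - 0.5 * sigma ** 2) * tau
    return normal_cdf(excess_mean / math.sqrt(variance))


if __name__ == "__main__":
    sigma, kappa, lam_bar, sigma_lam, rho = 0.2, 0.02, 0.3, 0.01, -0.8  # assumed values
    for tau in (1.0, 5.0, 10.0, 30.0):
        p_mr = outperformance_probability(
            tau, sigma, lam_bar, log_return_variance(tau, sigma, kappa, sigma_lam, rho))
        p_gbm = outperformance_probability(tau, sigma, lam_bar, sigma ** 2 * tau)
        print(tau, p_mr, p_gbm)
```

Because mean reversion lowers the variance while leaving the mean unchanged (for $\lambda_t = \bar\lambda$), the probability is higher with mean reversion whenever the expected excess log-return is positive.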
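Now, let us turn to the effect of mean reversion on the optimal investment strategy for investors with a constant relative risk aversion $\gamma > 1$. In the model introduced above the market price of risk is the only state variable, i.e., we put $x = \lambda$ in the notation of Chapter 7. For CRRA utility we have from Section 7.3 that the indirect utility function will be of the form
$$J(W, \lambda, t) = \frac{1}{1 - \gamma}\, g(\lambda, t)^{\gamma}\, W^{1 - \gamma}.$$
Since $\lambda$ is an affine function of itself, we have a quadratic model according to the classification in Chapter 7. For an investor with CRRA utility of terminal wealth only, we get from Theorem 7.9 that the indirect utility function is given by
$$J(W, \lambda, t) = \frac{1}{1 - \gamma} \left( W e^{A_0(T - t) + A_1(T - t)\lambda + \frac{1}{2} A_2(T - t)\lambda^2} \right)^{1 - \gamma}$$
and the optimal investment strategy in the stock is
$$\pi(W, \lambda, t) = \frac{\lambda}{\gamma \sigma} - \frac{\gamma - 1}{\gamma} \frac{\rho \sigma_\lambda}{\sigma} \left( A_1(T - t) + A_2(T - t)\lambda \right).$$
In the notation of Section 7.3.3 we have
$$r_0 = r, \quad r_1 = 0, \quad r_2 = 0, \quad m_0 = \kappa \bar\lambda, \quad m_1 = \kappa, \quad \Lambda_0 = 0, \quad \Lambda_1 = 0, \quad \Lambda_2 = 1,$$
$$K_0 = 0, \quad K_1 = \rho \sigma_\lambda, \quad \|v\|^2 = \rho^2 \sigma_\lambda^2, \quad \hat v^2 = (1 - \rho^2) \sigma_\lambda^2.$$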
[Figure 11.2: Outperformance probabilities as a function of the investment horizon. Horizontal axis: investment horizon in years (0 to 30); vertical axis: outperformance probability (0% to 100%). The graphs show the probability that a stock investment outperforms a risk-free investment for different investment horizons with mean reversion (black curve) and without mean reversion, i.e., assuming the stock price follows a geometric Brownian motion (red curve). The parameter values are $r = 0.03$, $\sigma = 0.2$, $\kappa = 0.02$, $\bar\lambda = 0.3$, $\sigma_\lambda = 0.01$, $\rho = -0.8$, and $\lambda_t = \bar\lambda$.]
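If we define
$$\tilde\kappa = \kappa + \frac{\gamma - 1}{\gamma} \rho \sigma_\lambda,$$
assume that$^2$
$$\tilde\kappa^2 + \frac{\gamma - 1}{\gamma^2} \sigma_\lambda^2 \left( \rho^2 + \gamma (1 - \rho^2) \right) > 0$$
and define
$$\nu = 2 \sqrt{ \tilde\kappa^2 + \frac{\gamma - 1}{\gamma^2} \sigma_\lambda^2 \left( \rho^2 + \gamma (1 - \rho^2) \right) },$$
it follows from Section 7.3.3 that
$$A_2(\tau) = \frac{2}{\gamma} \frac{e^{\nu\tau} - 1}{(\nu + 2\tilde\kappa)(e^{\nu\tau} - 1) + 2\nu},$$
$$A_1(\tau) = \frac{4 \kappa \bar\lambda}{\gamma \nu} \frac{\left( e^{\nu\tau/2} - 1 \right)^2}{(\nu + 2\tilde\kappa)(e^{\nu\tau} - 1) + 2\nu},$$
and
$$\begin{aligned}
A_0(\tau) &= r\tau + \kappa \bar\lambda \int_0^\tau A_1(s) \, ds + \frac{1}{2} \sigma_\lambda^2 \int_0^\tau A_2(s) \, ds - \frac{\gamma - 1}{2\gamma} \sigma_\lambda^2 \left( \rho^2 + \gamma (1 - \rho^2) \right) \int_0^\tau A_1(s)^2 \, ds \\
&= \left[ r + \frac{1}{\gamma} \left( \frac{2 \kappa^2 \bar\lambda^2}{\nu^2} + \frac{\sigma_\lambda^2}{\nu + 2\tilde\kappa} \right) \right] \tau
+ \frac{4 \kappa^2 \bar\lambda^2}{\gamma \nu^3} \, \frac{(\nu - 4\tilde\kappa) e^{-\nu\tau} + 8 \tilde\kappa e^{-\nu\tau/2} - 4\tilde\kappa - \nu}{2\nu - (\nu - 2\tilde\kappa)(1 - e^{-\nu\tau})} \\
&\quad + \frac{\gamma}{2(\gamma - 1)} \frac{1}{\rho^2 + \gamma (1 - \rho^2)} \ln\left( \frac{2\nu - (\nu - 2\tilde\kappa)(1 - e^{-\nu\tau})}{2\nu} \right),
\end{aligned}$$
where the last equality is adapted from Kim and Omberg (1996).

$^2$ This condition will be satisfied except for extreme combinations of $\kappa$, $\sigma_\lambda$, $\rho$, and $\gamma$. A discussion of the solution if this condition is not satisfied can be found in Kim and Omberg (1996).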
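To make the formulas for $A_1$, $A_2$ and the stock weight concrete, here is a minimal Python sketch. It is an illustration only: the function names are hypothetical, the parameter values are the ones quoted in the caption of Figure 11.3 (as reconstructed there), and the split into a myopic term $\lambda/(\gamma\sigma)$ and a hedge term follows the expression for $\pi(W, \lambda, t)$ given earlier in this section.

```python
import math


def kim_omberg_coeffs(tau, gamma, kappa, sigma_lam, rho, lam_bar):
    """Return (A1(tau), A2(tau)) for CRRA utility of terminal wealth only."""
    kappa_t = kappa + (gamma - 1.0) / gamma * rho * sigma_lam
    disc = kappa_t ** 2 + (gamma - 1.0) / gamma ** 2 * sigma_lam ** 2 * (
        rho ** 2 + gamma * (1.0 - rho ** 2))
    if disc <= 0.0:
        raise ValueError("the positivity condition of the text is violated")
    nu = 2.0 * math.sqrt(disc)
    denom = (nu + 2.0 * kappa_t) * math.expm1(nu * tau) + 2.0 * nu
    a2 = 2.0 / gamma * math.expm1(nu * tau) / denom
    a1 = 4.0 * kappa * lam_bar / (gamma * nu) * math.expm1(nu * tau / 2.0) ** 2 / denom
    return a1, a2


def stock_weight(lam, tau, gamma, sigma, kappa, sigma_lam, rho, lam_bar):
    """Return (myopic demand, hedge demand) as fractions of wealth."""
    a1, a2 = kim_omberg_coeffs(tau, gamma, kappa, sigma_lam, rho, lam_bar)
    myopic = lam / (gamma * sigma)
    hedge = -(gamma - 1.0) / gamma * rho * sigma_lam / sigma * (a1 + a2 * lam)
    return myopic, hedge


if __name__ == "__main__":
    # Parameter values assumed from the caption of Figure 11.3
    params = dict(gamma=2.0, sigma=0.2, kappa=0.02, sigma_lam=0.01, rho=-0.8, lam_bar=0.3)
    for horizon in (0.0, 5.0, 10.0, 30.0, 50.0):
        myopic, hedge = stock_weight(0.3, horizon, **params)
        print(horizon, myopic, hedge, myopic + hedge)
```

The hedge term is zero at a zero horizon and grows with the horizon, while the myopic term does not depend on the horizon at all.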
[Figure 11.3: Optimal portfolio weight of stock as a function of the investment horizon. Horizontal axis: investment horizon in years (0 to 50); vertical axis: portfolio weight (70% to 90%); curves: mean reversion and GBM. The parameter values are $\gamma = 2$, $r = 0.03$, $\sigma = 0.2$, $\kappa = 0.02$, $\bar\lambda = 0.3$, $\sigma_\lambda = 0.01$, $\rho = -0.8$, and $\lambda_t = \bar\lambda$.]
With $\gamma > 1$, it can be shown that $A_1(\tau)$ and $A_2(\tau)$ are positive$^3$ and increasing.$^4$ If the current value of the market price of risk is positive and the correlation is negative (consistent with empirical observations), it follows that the hedge term of the optimal portfolio is positive and increasing with the horizon of an investor with $\gamma > 1$. An investor with a long horizon should therefore invest a larger fraction of wealth in stocks than an investor with the same risk aversion, but a shorter horizon. This is consistent with typical recommendations of investment advisors. This is illustrated by Figure 11.3 for reasonable parameter values. Note, however, that the extra fraction of wealth invested in the stock due to the mean reversion of returns is relatively small even for long horizons. Figure 11.4 shows how the optimal stock allocation depends on the current market price of risk, both in the model with mean reversion and in the model where the market price of risk is assumed to be constant.
With utility from intermediate consumption and possibly terminal wealth, we must assume that either $\rho = 1$ or $\rho = -1$. We will stick to the latter, more realistic case. The restriction $\rho = -1$ affects all the functions $A_0$, $A_1$, and $A_2$ due to the presence of $\rho$ in $\tilde\kappa$ and $\nu$. For notational simplicity let us consider an investor with utility stemming only from intermediate consumption, i.e., $\varepsilon_2 = 0$. From Theorem 7.10, we get that the optimal investment strategy is
$$\pi(W, \lambda, t) = \frac{1}{\gamma} \frac{\lambda}{\sigma} + \frac{\gamma - 1}{\gamma} D(\lambda, t, T) \frac{\sigma_\lambda}{\sigma},$$
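$^3$ First note that $\rho^2 + \gamma (1 - \rho^2) > \rho^2 + 1 - \rho^2 = 1$ and, hence, $\nu + 2\tilde\kappa > 2\sqrt{\tilde\kappa^2} + 2\tilde\kappa \geq 0$. It is then clear that $A_1$ and $A_2$ are positive.

$^4$ Direct differentiation leads to $A_2'(\tau) = \frac{4}{\gamma} \nu^2 e^{\nu\tau} / [(\nu + 2\tilde\kappa)(e^{\nu\tau} - 1) + 2\nu]^2$, which is positive, and $A_1'(\tau) = \frac{4 \kappa \bar\lambda}{\gamma} e^{\nu\tau/2} (e^{\nu\tau/2} - 1) [(\nu + 2\tilde\kappa)(e^{\nu\tau/2} - 1) + 2\nu] / [(\nu + 2\tilde\kappa)(e^{\nu\tau} - 1) + 2\nu]^2$, which is also positive.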
[Figure 11.4: Optimal portfolio weight of stock as a function of the current market price of risk. Horizontal axis: current market price of risk (-0.2 to 0.7); vertical axis: portfolio weight (-50% to 200%); curves: mean reversion and GBM. The parameter values are $\gamma = 2$, $T - t = 30$, $r = 0.03$, $\sigma = 0.2$, $\kappa = 0.02$, $\bar\lambda = 0.3$, $\sigma_\lambda = 0.01$, $\rho = -0.8$, and $\lambda_t = \bar\lambda$.]
where
$$D(\lambda, t, T) = \frac{\int_t^T \left( A_1(s - t) + A_2(s - t)\lambda \right) g(\lambda, t; s) \, ds}{\int_t^T g(\lambda, t; s) \, ds},$$
$$g(\lambda, t; s) = \exp\left\{ -\frac{\delta}{\gamma}(s - t) - \frac{\gamma - 1}{\gamma} \left( A_0(s - t) + A_1(s - t)\lambda + \frac{1}{2} A_2(s - t)\lambda^2 \right) \right\},$$
and we must insert $\rho = -1$ in the expressions of the $A_i$'s. Again it can be shown that, for $\gamma > 1$ and $\lambda > 0$, the hedging component is positive and increasing with the time horizon $T$. With intermediate consumption the horizon effect on the stock investment is dampened relative to the case with utility from terminal wealth only since the effective investment horizon is lower than $T$. The optimal consumption rate is
$$C(W, \lambda, t) = \left( \int_t^T g(\lambda, t; s) \, ds \right)^{-1} W.$$
It can be shown that the consumption/wealth ratio is increasing in $\lambda$ when $\lambda > 0$ and $\gamma > 1$. To see this note that the derivative of the wealth/consumption ratio with respect to $\lambda$ is $-\frac{\gamma - 1}{\gamma} \int_t^T g(\lambda, t; s) \left( A_1(s - t) + A_2(s - t)\lambda \right) ds$, which also enters the hedging demand. In fact, whenever the hedging demand is positive, the wealth/consumption ratio will be decreasing in $\lambda$, and the consumption/wealth ratio will therefore be increasing in $\lambda$. The intuition for this result is as follows: An increase in $\lambda$ indicates better future investment opportunities. This gives an income effect that induces higher current consumption. On the other hand, investments are then more profitable, so there is a substitution effect. With $\gamma > 1$, the income effect dominates. To keep consumption stable across states, the investor must choose a portfolio which gives positive returns in states with relatively bad future investment opportunities, i.e., low $\lambda$. With $\rho = -1$, stocks have high returns exactly when $\lambda$ is low so the investor will hold more stocks relative to the case with constant investment opportunities.
Various empirical studies show that value (high book-to-market) stocks and growth (low book-to-market) stocks have risk-return characteristics that deviate considerably from the general stock market, cf., e.g., Fama and French (1992, 2007) and Campbell and Vuolteenaho (2004). In particular, short-term returns of value stocks have a higher average and lower standard deviation than growth stocks, but value stocks are riskier than growth stocks in the long run. Although the practical interest in value and growth stocks is immense, only a few papers have studied dynamic portfolio choice models taking into account the special characteristics of value and growth stocks. Lynch (2001) and Jurek and Viceira (2011) use a vector autoregression model of return dynamics incorporating predictive variables and allow for infrequent rebalancing of portfolios. While Lynch solves for the optimal portfolios by numerical dynamic programming, Jurek and Viceira suggest a recursive approach based on an approximation. Larsen and Munk (2012) set up a continuous-time model that leads to exact and relatively simple closed-form expressions for the optimal strategies and for the losses associated with selected suboptimal strategies. The model allows both for special return characteristics in growth and value stocks and for mean reversion in returns. They derive simple expressions (involving solutions to some Riccati-type ODEs) for the optimal investments in the different types of stocks and a risk-free asset for a power utility maximizing investor. The model is estimated using U.S. return data.
Further references: Barberis (2000), Lynch (2001), Pastor and Stambaugh (2012), Wachter and Warusawitharana (2009), and Branger, Larsen, and Munk (2012).
11.3 Stochastic volatility
As discussed in Section 7.2.3, stochastic volatility will only induce hedging to the extent that it affects the market prices of risk. Suppose, for example, that a CRRA investor can trade in a risk-free asset with a constant interest rate and in the stock market index following the process
$$dP_t = P_t \left[ (r + \lambda_1 \sigma_t) \, dt + \sigma_t \, dz_{1t} \right],$$
where the volatility $\sigma_t$ can follow any stochastic process. The Sharpe ratio of the stock, $\lambda_1$, is assumed constant. Then the optimal fraction of wealth invested in the stock index is
$$\pi_t = \frac{\lambda_1}{\gamma \sigma_t}$$
without any hedge term. The dynamics of wealth is then
$$dW_t = \left( W_t \left[ r + \pi_t \sigma_t \lambda_1 \right] - c_t \right) dt + W_t \pi_t \sigma_t \, dz_{1t}
= \left( W_t \left[ r + \frac{\lambda_1^2}{\gamma} \right] - c_t \right) dt + W_t \frac{\lambda_1}{\gamma} \, dz_{1t}$$
with the constant relative risk exposure that the CRRA investor prefers. The optimal combination of the risk-free asset and the stock index varies over time as the volatility varies, corresponding to movements along the instantaneous mean-variance frontier. Of course, to know exactly how the portfolio is to be rebalanced, it is necessary to model the fluctuations in volatility.
The more interesting case is when the Sharpe ratio of the stock depends on the level of the volatility. A tractable model is the Heston model, introduced by Heston (1993) for the pricing of stock options in the presence of stochastic volatility. The dynamics of the stock price and the instantaneous variance $V_t = \sigma_t^2$ of the stock are assumed to be
$$dP_t = P_t \left[ \left( r + \bar\lambda_1 V_t \right) dt + \sqrt{V_t} \, dz_{1t} \right],$$
$$dV_t = \kappa \left[ \bar V - V_t \right] dt + \rho \sigma_V \sqrt{V_t} \, dz_{1t} + \sqrt{1 - \rho^2}\, \sigma_V \sqrt{V_t} \, dz_{2t},$$
where $z_1$ and $z_2$ are independent standard Brownian motions. Hence, $\sigma_V \sqrt{V_t}$ is the volatility of the variance, $\rho$ is the instantaneous correlation between the stock and the variance, and the variance is assumed to mean-revert around a long-term level of $\bar V$ with $\kappa$ reflecting the speed of mean reversion. The market price of risk associated with $z_1$, which is identical to the Sharpe ratio of the stock, is $\lambda_1(V_t) = \bar\lambda_1 \sqrt{V_t}$. Of course, we can safely assume $\bar\lambda_1 > 0$.
If $|\rho| \neq 1$ and the investor can only trade in the stock and the risk-free asset, the market is incomplete. The optimal portfolio choice of a CRRA investor in this framework was derived by Liu (1999, 2007) and Kraft (2005) and follows as a special case of our analysis of affine models. The state variable is $V_t$, which obviously follows an affine process, and the squared market price of risk is proportional to $V_t$ and thus also affine. More precisely, in the notation of Section 7.3.2, we have
$$r_0 = r, \quad r_1 = 0, \quad m_0 = \kappa \bar V, \quad m_1 = \kappa, \quad V_0 = 0, \quad V_1 = \rho^2 \sigma_V^2,$$
$$\hat v_0 = 0, \quad \hat v_1 = (1 - \rho^2) \sigma_V^2, \quad \Lambda_0 = 0, \quad \Lambda_1 = \bar\lambda_1^2, \quad K_0 = 0, \quad K_1 = \rho \sigma_V \bar\lambda_1.$$
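Let
$$\tilde\kappa = \kappa + \frac{\gamma - 1}{\gamma} \rho \sigma_V \bar\lambda_1.$$
The condition (7.23) becomes
$$\tilde\kappa^2 + \frac{\gamma - 1}{\gamma^2} \bar\lambda_1^2 \sigma_V^2 \left( \rho^2 + \gamma [1 - \rho^2] \right) > 0,$$
which is certainly satisfied for $\gamma > 1$. Defining
$$\nu = \sqrt{ \tilde\kappa^2 + \frac{\gamma - 1}{\gamma^2} \bar\lambda_1^2 \sigma_V^2 \left( \rho^2 + \gamma [1 - \rho^2] \right) },$$
the key function $A_1(\tau)$ follows from (7.24):
$$A_1(\tau) = \frac{\bar\lambda_1^2}{\gamma} \frac{e^{\nu\tau} - 1}{(\nu + \tilde\kappa)(e^{\nu\tau} - 1) + 2\nu}.$$
$A_1$ is positive and increasing in $\tau$. For a CRRA investor with utility of time $T$ wealth only, the optimal fraction of wealth invested in the stock is then
$$\pi(t) = \frac{\bar\lambda_1}{\gamma} - \frac{\gamma - 1}{\gamma} \rho \sigma_V A_1(T - t)
= \frac{\bar\lambda_1}{\gamma} - \frac{\gamma - 1}{\gamma^2} \rho \sigma_V \bar\lambda_1^2 \, \frac{e^{\nu(T - t)} - 1}{(\nu + \tilde\kappa)(e^{\nu(T - t)} - 1) + 2\nu},$$
cf. Theorem 7.7. Note that the portfolio weight does not vary with the current volatility level.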
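The closed-form weight can be evaluated with a few lines of Python. This is a sketch under assumptions, not code from the notes: the function names are hypothetical, and the parameter values in the usage section are purely illustrative placeholders since the text does not report a calibration for this model.

```python
import math


def heston_a1(tau, gamma, lam1_bar, kappa, sigma_v, rho):
    """A1(tau) for the incomplete-market Heston setting (stock and risk-free asset only)."""
    kappa_t = kappa + (gamma - 1.0) / gamma * rho * sigma_v * lam1_bar
    disc = kappa_t ** 2 + (gamma - 1.0) / gamma ** 2 * lam1_bar ** 2 * sigma_v ** 2 * (
        rho ** 2 + gamma * (1.0 - rho ** 2))
    nu = math.sqrt(disc)
    growth = math.expm1(nu * tau)  # e^{nu * tau} - 1
    return lam1_bar ** 2 / gamma * growth / ((nu + kappa_t) * growth + 2.0 * nu)


def heston_stock_weight(tau, gamma, lam1_bar, kappa, sigma_v, rho):
    """Return (myopic demand, hedge demand); neither depends on the current variance V_t."""
    myopic = lam1_bar / gamma
    hedge = -(gamma - 1.0) / gamma * rho * sigma_v * heston_a1(
        tau, gamma, lam1_bar, kappa, sigma_v, rho)
    return myopic, hedge


if __name__ == "__main__":
    # Illustrative placeholder values (assumptions, not taken from the text)
    gamma, lam1_bar, kappa, sigma_v, rho = 3.0, 4.0, 5.0, 0.25, -0.4
    for horizon in (0.0, 1.0, 5.0, 30.0):
        print(horizon, heston_stock_weight(horizon, gamma, lam1_bar, kappa, sigma_v, rho))
```

With a negative correlation and $\gamma > 1$ the hedge demand comes out positive and increases with the horizon, but it flattens quickly when the mean reversion speed of the variance is high.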
Empirical estimates of the correlation between the stock and its instantaneous variance are negative. Volatility tends to go up when stock prices go down. Consequently, the hedge term is positive. A low variance represents a situation of bad investment opportunities since the market prices of risk are then also low. Due to the negative correlation, stocks have a built-in hedge: should investment opportunities deteriorate (falling variance), the stock will typically increase substantially in price.
When the volatility of the stock is stochastic and imperfectly correlated with the stock, options on the stock are non-redundant assets. By investing in an option, which is sensitive to the shock $z_2$, the investor can improve her welfare. Let $O_t = f(P_t, V_t, t)$ denote the price of such an option (or any asset/portfolio with non-zero exposure to $z_2$), where $f$ is assumed to be sufficiently differentiable. By Itô's Lemma
$$dO_t = \dots \, dt + f_P(P_t, V_t, t) P_t \sqrt{V_t} \, dz_{1t} + f_V(P_t, V_t, t) \left[ \rho \sigma_V \sqrt{V_t} \, dz_{1t} + \sqrt{1 - \rho^2}\, \sigma_V \sqrt{V_t} \, dz_{2t} \right]$$
$$\Rightarrow \quad \frac{dO_t}{O_t} = \dots \, dt + \left( f_P(P_t, V_t, t) P_t + f_V(P_t, V_t, t) \rho \sigma_V \right) \frac{\sqrt{V_t}}{O_t} \, dz_{1t} + f_V(P_t, V_t, t) \sqrt{1 - \rho^2}\, \sigma_V \frac{\sqrt{V_t}}{O_t} \, dz_{2t}.$$
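An important element of the model, which has to be specified at this point, is the market price of risk $\lambda_{2t}$ associated with $z_2$. Following Liu and Pan (2003), assume that $\lambda_{2t} = \bar\lambda_2 \sqrt{V_t}$, which will keep us in the affine model class. The expected rate of return of the option is then
$$\begin{aligned}
\mu_{Ot} &= r + \lambda_{1t} \left( f_P(P_t, V_t, t) P_t + f_V(P_t, V_t, t) \rho \sigma_V \right) \frac{\sqrt{V_t}}{O_t} + \lambda_{2t} f_V(P_t, V_t, t) \sqrt{1 - \rho^2}\, \sigma_V \frac{\sqrt{V_t}}{O_t} \\
&= r + \left( \bar\lambda_1 \left( f_P(P_t, V_t, t) P_t + f_V(P_t, V_t, t) \rho \sigma_V \right) + \bar\lambda_2 f_V(P_t, V_t, t) \sqrt{1 - \rho^2}\, \sigma_V \right) \frac{V_t}{O_t}.
\end{aligned}$$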
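We now have a complete markets model with two risky assets having a volatility matrix of
$$\underline{\underline{\sigma}}_t = \begin{pmatrix} \sqrt{V_t} & 0 \\ \left( f_P(P_t, V_t, t) P_t + f_V(P_t, V_t, t) \rho \sigma_V \right) \frac{\sqrt{V_t}}{O_t} & f_V(P_t, V_t, t) \sqrt{1 - \rho^2}\, \sigma_V \frac{\sqrt{V_t}}{O_t} \end{pmatrix},$$
and
$$\lambda(V_t) = \begin{pmatrix} \bar\lambda_1 \sqrt{V_t} \\ \bar\lambda_2 \sqrt{V_t} \end{pmatrix}, \qquad v(V_t) = \begin{pmatrix} \rho \sigma_V \sqrt{V_t} \\ \sqrt{1 - \rho^2}\, \sigma_V \sqrt{V_t} \end{pmatrix}, \qquad \hat v(V_t) = 0.$$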
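In the notation and terminology of Section 7.3.2, this is an affine model with
$$r_0 = r, \quad r_1 = 0, \quad m_0 = \kappa \bar V, \quad m_1 = \kappa, \quad V_0 = 0, \quad V_1 = \sigma_V^2,$$
$$\hat v_0 = 0, \quad \hat v_1 = 0, \quad \Lambda_0 = 0, \quad \Lambda_1 = \bar\lambda_1^2 + \bar\lambda_2^2, \quad K_0 = 0, \quad K_1 = \rho \sigma_V \bar\lambda_1 + \sqrt{1 - \rho^2}\, \sigma_V \bar\lambda_2.$$
Let
$$\tilde\kappa = \kappa + \frac{\gamma - 1}{\gamma} \sigma_V \left( \rho \bar\lambda_1 + \sqrt{1 - \rho^2}\, \bar\lambda_2 \right)$$
and note that the condition (7.23) is satisfied as long as $\gamma > 1$. Define
$$\nu = \sqrt{ \tilde\kappa^2 + \frac{\gamma - 1}{\gamma^2} \sigma_V^2 \left( \bar\lambda_1^2 + \bar\lambda_2^2 \right) }.$$
From (7.24) we get the relevant version of the $A_1$-function, which we now denote by $\tilde A_1$ to distinguish it from the $A_1$-function earlier in this section:
$$\tilde A_1(\tau) = \frac{\bar\lambda_1^2 + \bar\lambda_2^2}{\gamma} \frac{e^{\nu\tau} - 1}{(\nu + \tilde\kappa)(e^{\nu\tau} - 1) + 2\nu}.$$
Just like $A_1$ above, $\tilde A_1$ is also positive and increasing.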
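According to Theorem 7.7, the optimal portfolio for an investor with CRRA utility of time $T$ wealth is then
$$\begin{pmatrix} \pi_{St} \\ \pi_{Ot} \end{pmatrix} = \frac{1}{\gamma} \left( \underline{\underline{\sigma}}_t^\top \right)^{-1} \begin{pmatrix} \bar\lambda_1 \sqrt{V_t} \\ \bar\lambda_2 \sqrt{V_t} \end{pmatrix} - \frac{\gamma - 1}{\gamma} \left( \underline{\underline{\sigma}}_t^\top \right)^{-1} \begin{pmatrix} \rho \sigma_V \sqrt{V_t} \\ \sqrt{1 - \rho^2}\, \sigma_V \sqrt{V_t} \end{pmatrix} \tilde A_1(T - t),$$
which implies that the fraction of wealth optimally invested in the option is
$$\pi_{Ot} = \frac{O_t}{f_V(P_t, V_t, t)} \left( \frac{\bar\lambda_2}{\gamma \sigma_V \sqrt{1 - \rho^2}} - \frac{\gamma - 1}{\gamma} \tilde A_1(T - t) \right),$$
and the fraction of wealth optimally invested in the stock is
$$\begin{aligned}
\pi_{St} &= \frac{1}{\gamma} \left( \bar\lambda_1 - \bar\lambda_2 \left( \frac{\rho}{\sqrt{1 - \rho^2}} + \frac{f_P(P_t, V_t, t) P_t}{\sigma_V \sqrt{1 - \rho^2}\, f_V(P_t, V_t, t)} \right) \right) + \frac{\gamma - 1}{\gamma} \frac{f_P(P_t, V_t, t) P_t}{f_V(P_t, V_t, t)} \tilde A_1(T - t) \\
&= \frac{\bar\lambda_1}{\gamma} - \frac{\rho \bar\lambda_2}{\gamma \sqrt{1 - \rho^2}} - f_P(P_t, V_t, t) P_t \frac{\pi_{Ot}}{O_t}.
\end{aligned}$$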
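The two portfolio weights can be computed directly once the option's price and sensitivities are known. The sketch below is illustrative only: the option inputs (`option_price`, `dollar_delta` for $f_P P_t$, and `f_v` for $f_V$) are hypothetical numbers supplied by the user rather than outputs of an option pricing model, and the parameter values are placeholders chosen for the example, not a calibration from the text.

```python
import math


def a1_tilde(tau, gamma, kappa, sigma_v, rho, lam1_bar, lam2_bar):
    """A1-tilde(tau) for the complete-market case with a traded option."""
    root = math.sqrt(1.0 - rho ** 2)
    kappa_t = kappa + (gamma - 1.0) / gamma * sigma_v * (rho * lam1_bar + root * lam2_bar)
    lam_sq = lam1_bar ** 2 + lam2_bar ** 2
    nu = math.sqrt(kappa_t ** 2 + (gamma - 1.0) / gamma ** 2 * sigma_v ** 2 * lam_sq)
    growth = math.expm1(nu * tau)  # e^{nu * tau} - 1
    return lam_sq / gamma * growth / ((nu + kappa_t) * growth + 2.0 * nu)


def optimal_stock_option_weights(gamma, lam1_bar, lam2_bar, sigma_v, rho, a1,
                                 dollar_delta, f_v, option_price):
    """Return (pi_S, pi_O) given A1-tilde(T - t) and the option's sensitivities.

    dollar_delta is f_P * P_t, f_v is the derivative of the option price with
    respect to the variance, option_price is O_t (all hypothetical inputs).
    """
    root = math.sqrt(1.0 - rho ** 2)
    pi_o = option_price / f_v * (
        lam2_bar / (gamma * sigma_v * root) - (gamma - 1.0) / gamma * a1)
    pi_s = lam1_bar / gamma - rho * lam2_bar / (gamma * root) - dollar_delta * pi_o / option_price
    return pi_s, pi_o


if __name__ == "__main__":
    # Placeholder inputs (assumptions for illustration only)
    gamma, kappa, sigma_v, rho, lam1_bar, lam2_bar = 3.0, 5.0, 0.25, -0.4, 4.0, -6.0
    a1 = a1_tilde(5.0, gamma, kappa, sigma_v, rho, lam1_bar, lam2_bar)
    # A delta-neutral position has dollar_delta = 0
    print(optimal_stock_option_weights(gamma, lam1_bar, lam2_bar, sigma_v, rho, a1,
                                       dollar_delta=0.0, f_v=2.0, option_price=0.1))
```

For a delta-neutral option position the stock weight reduces to $\left( \bar\lambda_1 - \rho \bar\lambda_2 / \sqrt{1 - \rho^2} \right)/\gamma$, and with $f_V > 0$ and $\bar\lambda_2 < 0$ the option weight is negative.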
Let us assume that the option price is positively related to the stock volatility so that $f_V(P_t, V_t, t) > 0$. The hedge demand for the option is then negative. The hedge portfolio should increase in value when the variance $V_t$ drops as this implies deteriorating market prices of risk. As the option price increases with the variance, a short position in the option will give the desired hedge. The sign of the speculative demand for the option equals the sign of the constant $\bar\lambda_2$ in the market price of $z_2$-risk. According to most empirical studies, this market price of risk is negative; see, e.g., Bakshi and Kapadia (2003) and Chernov and Ghysels (2000).$^5$ A negative position in the option will give a negative exposure to the volatility-specific risk represented by $z_2$, which leads to a positive risk premium. Both the speculative demand and the hedging demand for the option are thus negative.
In their illustration of the solution, Liu and Pan (2003) assume that the option the investor trades in is a so-called delta-neutral straddle. A straddle is a combination of a long position in a call and a long position in a put with the same strike price and maturity date. The strike price is determined so that the delta of the call, i.e., the derivative of the call price with respect to the stock price, equals $\frac{1}{2}$. Then it follows from the put-call parity that the delta of the put equals $-\frac{1}{2}$ so that the delta of the straddle is equal to zero. The value of the straddle is thus insensitive to small changes in the stock price. On the other hand, the straddle will be highly sensitive to changes in the volatility, so it is an obvious instrument for trading volatility. In their numerical illustrations, Liu and Pan (2003) find for example that the optimal portfolio of an investor with a relative risk aversion of 3 and a horizon of 5 years consists of (approximately) 24% in the stock and -54% in the straddle, and thus 130% in the risk-free asset. This is certainly a non-standard investment recommendation.
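The delta-neutrality argument can be illustrated with a small calculation. The sketch below uses the Black-Scholes call delta $N(d_1)$ for a non-dividend-paying stock as a stand-in; this is an assumption made purely for illustration, since in the stochastic volatility model of this section the delta would come from the Heston option pricing formula. The function names and the numerical inputs are hypothetical. The last lines reproduce the risk-free weight implied by the 24% stock and -54% straddle positions quoted above.

```python
import math


def normal_cdf(x):
    """Standard normal distribution function via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def bs_call_delta(spot, strike, r, sigma, maturity):
    """Black-Scholes delta of a European call on a non-dividend-paying stock."""
    d1 = (math.log(spot / strike) + (r + 0.5 * sigma ** 2) * maturity) / (
        sigma * math.sqrt(maturity))
    return normal_cdf(d1)


def delta_neutral_strike(spot, r, sigma, maturity):
    """Strike for which the call delta is 1/2, i.e., d1 = 0 (Black-Scholes assumption)."""
    return spot * math.exp((r + 0.5 * sigma ** 2) * maturity)


if __name__ == "__main__":
    spot, r, sigma, maturity = 100.0, 0.03, 0.2, 0.5  # illustrative inputs
    strike = delta_neutral_strike(spot, r, sigma, maturity)
    call_delta = bs_call_delta(spot, strike, r, sigma, maturity)
    put_delta = call_delta - 1.0  # from put-call parity
    print(strike, call_delta, put_delta, call_delta + put_delta)

    # Risk-free weight implied by the stock and straddle weights quoted in the text
    stock_weight, straddle_weight = 0.24, -0.54
    print(1.0 - stock_weight - straddle_weight)
```

Note that the delta-neutral strike lies above the current stock price, so the call in the straddle is slightly out of the money.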
Liu and Pan (2003) and Larsen and Munk (2012) compute utility losses from ignoring options completely when determining the optimal investment or from including options in a suboptimal way. Both studies conclude that the utility losses from excluding options can be substantial. The results of Larsen and Munk (2012) indicate that inclusion of the option is mainly important because it gives access to the apparently sizeable volatility risk premium, whereas the benefits from volatility hedging are smaller. Of course, the attractiveness of the option depends heavily on the estimate of the parameter $\bar\lambda_2$.

$^5$ There is less consensus about the magnitude of the market price of volatility risk. Liu and Pan (2003) refer to $\bar\lambda_2 = -6$ as a conservative estimate. Note that it remains unclear whether the assumption that $\lambda_{2t}$ is proportional to $\sigma_t = \sqrt{V_t}$ is appropriate.
Liu and Pan (2003) extend the above setting to include jumps of a given size in the stock price, motivated by the observed stock market crashes. With the assumption that the intensity of jump arrivals is proportional to $V_t$, they are able to find a closed-form solution (this is an affine jump-diffusion setting). Their results indicate that the estimates of the jump size, the jump risk premium, and the jump intensity are highly important for the optimal option position. A put option provides protection against big drops in the stock price and thus becomes more attractive due to the inclusion of negative jumps in the model. Liu, Longstaff, and Pan (2003) and Branger, Schlag, and Schneider (2008) consider extensions where both the stock price and the variance may jump.
Chacko and Viceira (2005) consider a quite spurious model with stochastic volatility that does not fit into the cases where we have explicit solutions. They find explicit, approximate solutions for an investor with Epstein-Zin utility.
11.4 More
Mean reversion and momentum in stock prices: Koijen, Rodriguez, and Sbuelz (2009)
Correlation risk: Buraschi, Porchia, and Trojani (2010)
11.5 Exercises
Exercise 11.1. Throughout this exercise consider an individual with a time-additive expected power utility of consumption and/or terminal wealth, so that the objective of the individual at any time $t < T$ is to maximize
$$\mathrm{E}_t\left[ \int_t^T e^{-\delta(s - t)} \varepsilon_1 u(c_s) \, ds + e^{-\delta(T - t)} \varepsilon_2 u(W_T) \right],$$
where $\varepsilon_1, \varepsilon_2 \geq 0$ with $\varepsilon_1 + \varepsilon_2 > 0$, $\delta > 0$ is the subjective time preference rate, and $u(c) = \frac{1}{1 - \gamma} c^{1 - \gamma}$, where $\gamma > 1$ is the constant relative risk aversion. The individual has an initial (time $t$) wealth of $W_t$ and earns no income from non-financial sources. The individual has access to a financial market with a risk-free asset and a risky asset (a stock). The risk-free asset pays a constant continuously compounded annualized rate of return of $r$. The risky asset has a price process $P = (P_t)$ with dynamics
$$dP_t = P_t \left[ \mu_t \, dt + \sigma_t \, dz_t \right],$$
where $z = (z_t)$ is a one-dimensional standard Brownian motion, and $\mu_t$ and $\sigma_t$ are well-behaved stochastic processes. Below you are going to consider various models for $\mu_t$ and $\sigma_t$.
First, consider Model 1 in which $\mu_t = r + \lambda \sigma_t$ for a constant $\lambda > 0$ and any well-behaved $\sigma_t$.
(a) What is the optimal consumption and investment strategy of the individual?
Next, consider Model 2 in which $\mu_t = \mu > r$ and $\sigma_t = k P_t^{\beta}$ for some constants $k > 0$ and $\beta \in \mathbb{R}$.
(b) Show that the instantaneous return variance rate of the stock, $\frac{1}{dt} \mathrm{Var}_t[dP_t / P_t]$, has a constant elasticity with respect to the stock price.$^6$ (The elasticity of a function $f(x)$ is defined as $\frac{df/f}{dx/x} = \frac{df}{dx} \frac{x}{f} = f'(x) \frac{x}{f(x)}$.) Describe the impact of the sign of $\beta$ on the relation between the stock price and the volatility.
(c) Determine and describe the market price of the (stock) risk in this model.
The natural next step would be to note that the indirect utility must depend on the stock price, i.e., be of the form $J(W_t, P_t, t)$, and then write down and try to solve the associated HJB equation. However, the HJB equation for $J(W, P, t)$ turns out to be quite complicated (try it yourself!). A change of variable simplifies the equation to be solved. Define the process $x = (x_t)$ by
$$x_t = k^{-2} P_t^{-2\beta}.$$
(d) What is the dynamics of $x$ (if possible, express $dx_t$ without any $P_t$ in the equation)?
(e) Argue that the model fits into the affine setting of Chapter 7. State the optimal consumption and investment strategy. Find explicit expressions for the two deterministic functions entering the solution, i.e., $A_0$ and $A_1$ in the notation of Section 7.3 of the lecture notes.
(f) How does the optimal consumption and investment at a given point in time depend on the stock price at that date? Explain! How does the optimal investment depend on the time horizon? (When considering the optimal investment here, you may assume $\varepsilon_1 = 0$, $\varepsilon_2 > 0$.)
Next, consider Model 3: suppose that $\mu_t = \mu > r$ and $\sigma_t = 1/\sqrt{x_t}$, where
$$dx_t = \kappa (\bar x - x_t) \, dt + \sigma_x \sqrt{x_t} \left[ \rho \, dz_t + \sqrt{1 - \rho^2} \, d\hat z_t \right],$$
where $\hat z = (\hat z_t)$ is a one-dimensional standard Brownian motion independent of $z$ and $\rho \in [-1, +1]$.
(g) Determine the dynamics of the instantaneous stock variance $V_t = \sigma_t^2$; if possible, express $dV_t$ without any $x_t$ in the equation. What is the instantaneous correlation between the stock price and the instantaneous stock variance?
(h) For the case $\varepsilon_1 = 0$, $\varepsilon_2 > 0$, find the optimal investment strategy. Provide explicit expressions for any functions entering the optimal investment strategy.
(i) Compare models 2 and 3.
Assume the following parameter values:
$$r = 0.02, \quad \mu = 0.10, \quad \kappa = 0.34, \quad \bar x = 28, \quad \sigma_x = 0.65, \quad \rho = 0.5.$$
(j) Assume that the current (time 0) value of the state variable is equal to the long-run level, i.e., $x_0 = \bar x$. For all combinations of $\gamma \in \{1.01, 2, 4, 10, 20\}$ and $T \in \{1, 5, 10, 30\}$, compute the optimal fraction of wealth invested in the stock. How big is the hedge demand compared to the myopic demand?
(k) How sensitive is the optimal portfolio to the current volatility of the stock? Provide a few graphs illustrating your answer.
(l) For each of the three models, discuss whether the individual would benefit from having access to trade in an option on the stock.

$^6$ Therefore the stock price process is called a CEV process (CEV: Constant Elasticity of Variance).
Exercise 11.2. In Vasicek's original model the excess expected return of any zero-coupon bond is given by $\lambda_1 \sigma_r b(\bar T - t)$ and thus deterministic. However, empirical studies indicate that excess bond returns vary with the level of interest rates. We can obtain that by generalizing the Vasicek model to the so-called essentially affine Vasicek model in which the real-world short rate dynamics is still
$$dr_t = \kappa \left[ \bar r - r_t \right] dt - \sigma_r \, dz_{1t},$$
but the market price of risk associated with $z_1$ is now allowed to be an affine function of the short rate:
$$\lambda_{1t} = \bar\lambda_1 + \tilde\lambda_1 r_t,$$
where $\bar\lambda_1$ and $\tilde\lambda_1$ are constants. It turns out that the price of a zero-coupon bond is still of the exponential-affine form
$$B^{\bar T}_t = e^{-a(\bar T - t) - b(\bar T - t) r_t},$$
but $a$ and $b$ are different from the original Vasicek model. In particular,
$$b(\tau) = \frac{1}{\tilde\kappa} \left( 1 - e^{-\tilde\kappa \tau} \right), \qquad \tilde\kappa = \kappa - \sigma_r \tilde\lambda_1.$$
(a) State the bond price dynamics in this model.
There is also evidence that the excess expected return on the stock market vary (negatively)
with the level of short-term interest rates. Write the stock price dynamics as
dS
t
= S
t
_
(r
t
+
S

t
) dt +
S
dz
1t
+
_
1
2

S
dz
2t
_
.
Here
t
is the instantaneous Sharpe ratio of the stock. Assume that the market price of risk
associated with z
2
is of the ane form

2t
=

2
+

2
r
t
,
where

2
and

2
are constants.
(b) Determine
t
as a function of r
t
and check that the model can potentially capture the
explained predictability pattern.
Now think of the asset allocation model in which a CRRA investor (with no labor income and
utility of terminal wealth only) can invest in the bank account, a single bond, and the stock index
with price dynamics as stated above.
(c) Verify that the model fits into the quadratic framework.
(d) Determine the optimal investment strategy (including the necessary $A_i$-functions).
(e) How does the optimal strategy in this model differ from the strategy found in Section 10.2,
where we assumed the original Vasicek model and no return predictability?
CHAPTER 12
Inflation risk and asset allocation with no risk-free asset
12.1 Introduction
We should recognize that the models discussed in the previous chapters really use the consumption good as the numeraire and, hence, all asset prices are assumed to be formulated in real terms, i.e., the price of an asset is the number of units of the consumption good into which the asset can be exchanged. In particular, the bonds considered in the models have been real bonds that pay out in units of the consumption good. However, in many markets real bonds are not traded (at least not at a volume ensuring liquid prices). Furthermore, the risk-free asset in the previous models is assumed to be risk-free in real terms and the short-term interest rate is the real short rate. The risk-free asset has been modeled as a continuous roll-over strategy in deposits over infinitesimally short periods. While such a strategy is of course quite extreme, it may be seen as a reasonable approximation to a strategy of frequently rolling over short-term deposits. While it may be possible to lock in a risk-free nominal return over a short period, it seems more questionable to get someone to promise you a return which is risk-free in real terms. In this chapter we will discuss the effects of inflation risk on asset allocation, optimal asset allocation involving nominal bonds, and asset allocation without a truly risk-free asset.
12.2 Real and nominal price dynamics
In order to study the link between the real and the nominal return on an asset we have to model the dynamics of the nominal asset price and the price of the consumption good. Let $\tilde P_{it}$ denote the nominal price of asset $i$ at time $t$, i.e., the price in monetary units (like dollars). Let $\Phi_t$ denote the monetary price of the consumption good at time $t$. (With many consumption goods we may loosely think of $\Phi_t$ as the consumer price index.) Then the real price of asset $i$ is $P_{it} = \tilde P_{it}/\Phi_t$. Assume that
$$ d\tilde P_{it} = \tilde P_{it}\left[\tilde\mu_{it}\,dt + \tilde\sigma_{it}^\top\,d\tilde z_t\right] $$
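and that
$$ d\Phi_t = \Phi_t\left[\varphi_t\,dt + \sigma_{\Phi t}^\top\,d\tilde z_t + \hat\sigma_{\Phi t}\,d\hat z_t\right], $$
where $\hat z$ is a one-dimensional standard Brownian motion independent of $\tilde z$. We can interpret $d\Phi_t/\Phi_t$ as the realized inflation over the period $[t, t+dt]$, which in general will not be known before $t+dt$, and interpret $\varphi_t$ as the expected rate of inflation per year. Using Itô's Lemma, we can derive the dynamics of the real price of asset $i$ as
$$ dP_{it} = P_{it}\left[\left(\tilde\mu_{it} - \varphi_t - \tilde\sigma_{it}^\top\sigma_{\Phi t} + \|\sigma_{\Phi t}\|^2 + \hat\sigma_{\Phi t}^2\right)dt + (\tilde\sigma_{it} - \sigma_{\Phi t})^\top\,d\tilde z_t - \hat\sigma_{\Phi t}\,d\hat z_t\right]. $$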
It is important to realize that if there is uncertainty about the change in consumer prices over the deposit period, the real value of a nominal deposit will not be risk-free. Let $\tilde r_t$ denote the nominal risk-free short rate at time $t$. Then the nominal value of the nominally risk-free asset satisfies
$$ d\tilde A_t = \tilde r_t\tilde A_t\,dt. $$
The real value of the nominally risk-free asset is again obtained by deflating with the price of consumption, $A_t = \tilde A_t/\Phi_t$, and applying Itô's Lemma we get
$$ dA_t = A_t\left[\left(\tilde r_t - \varphi_t + \|\sigma_{\Phi t}\|^2 + \hat\sigma_{\Phi t}^2\right)dt - \sigma_{\Phi t}^\top\,d\tilde z_t - \hat\sigma_{\Phi t}\,d\hat z_t\right]. $$
Unless there is no uncertainty about the realized inflation, the nominally risk-free asset is risky in real terms.
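Assume now that we have $m$ nominally risky assets with nominal price dynamics of the form
$$ d\tilde P_t = \mathrm{diag}(\tilde P_t)\left[\tilde\mu_t\,dt + \tilde\sigma_t\,d\tilde z_t\right], $$
where $\tilde z$ is an $m$-dimensional standard Brownian motion. Suppose that we invest fractions of wealth given by the vector $\tilde\pi_t$ in the nominally risky assets and, consequently, the fraction $1 - \tilde\pi_t^\top 1$ in the nominally risk-free asset. Then the nominal wealth $\tilde W_t$ will evolve as
$$ d\tilde W_t = \left[\tilde r_t\tilde W_t + \tilde W_t\tilde\pi_t^\top(\tilde\mu_t - \tilde r_t 1) - c_t\Phi_t\right]dt + \tilde W_t\tilde\pi_t^\top\tilde\sigma_t\,d\tilde z_t, $$
where $c_t$ is the number of units consumed of the consumption good, so that $c_t\Phi_t$ is the nominal consumption expenditure. The real wealth is $W_t = \tilde W_t/\Phi_t$, which evolves as
$$ dW_t = \left[W_t\left(\tilde r_t - \varphi_t + \|\sigma_{\Phi t}\|^2 + \hat\sigma_{\Phi t}^2\right) + W_t\tilde\pi_t^\top\left(\tilde\mu_t - \tilde r_t 1 - \tilde\sigma_t\sigma_{\Phi t}\right) - c_t\right]dt + W_t\left(\tilde\pi_t^\top\tilde\sigma_t - \sigma_{\Phi t}^\top\right)d\tilde z_t - W_t\hat\sigma_{\Phi t}\,d\hat z_t. \tag{12.1} $$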
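Of course, we could also derive the dynamics of real wealth directly from the dynamics of real prices. We can see that any asset will have the same real sensitivity towards the shock process $\hat z$ so that it will be impossible to hedge against such a shock.
If $\tilde\sigma_t$ is non-singular, we can define
$$ \tilde\lambda_t = \tilde\sigma_t^{-1}\left(\tilde\mu_t - \tilde r_t 1\right), $$
which has the interpretation as the nominal market price of risk vector. Then the dynamics of real wealth can be rewritten as
$$ dW_t = \left[W_t\left(\tilde r_t - \varphi_t + \|\sigma_{\Phi t}\|^2 + \hat\sigma_{\Phi t}^2\right) + W_t\tilde\pi_t^\top\tilde\sigma_t\left(\tilde\lambda_t - \sigma_{\Phi t}\right) - c_t\right]dt + W_t\left(\tilde\pi_t^\top\tilde\sigma_t - \sigma_{\Phi t}^\top\right)d\tilde z_t - W_t\hat\sigma_{\Phi t}\,d\hat z_t. \tag{12.2} $$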
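If $\hat\sigma_{\Phi t} = 0$ and $\tilde\sigma_t$ is non-singular, the inflation uncertainty is spanned by the traded assets and we can obtain a risk-free real return by investing in the portfolio given by
$$ \tilde\pi_t^{\text{safe}} = \left(\tilde\sigma_t^\top\right)^{-1}\sigma_{\Phi t}. $$
The rate of return on this portfolio is then the real short-term interest rate, which will be
$$ r_t = \tilde r_t - \varphi_t + \|\sigma_{\Phi t}\|^2 + \hat\sigma_{\Phi t}^2 + \left(\tilde\pi_t^{\text{safe}}\right)^\top\left(\tilde\mu_t - \tilde r_t 1 - \tilde\sigma_t\sigma_{\Phi t}\right) = \tilde r_t - \varphi_t + \sigma_{\Phi t}^\top\tilde\lambda_t. $$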
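The construction of the real risk-free portfolio can be checked numerically. The following is a minimal sketch, assuming spanned inflation risk (so the unhedgeable inflation volatility is zero) and purely hypothetical parameter values; the variable names are illustrative and not part of the text.

```python
import numpy as np

# Hypothetical inputs (assumptions for illustration only):
# two nominally risky assets and inflation risk fully spanned by them.
r_nom = 0.05                                  # nominal short rate
phi = 0.03                                    # expected inflation rate
mu_nom = np.array([0.09, 0.06])               # nominal expected returns
sigma_nom = np.array([[0.20, 0.00],
                      [0.03, 0.08]])          # nominal volatility matrix (non-singular)
sigma_Phi = np.array([0.010, 0.015])          # price-index loadings on the traded shocks

# Nominal market price of risk: sigma_nom^{-1} (mu_nom - r_nom 1)
lam_nom = np.linalg.solve(sigma_nom, mu_nom - r_nom)

# Portfolio with zero real diffusion: (sigma_nom^T)^{-1} sigma_Phi
pi_safe = np.linalg.solve(sigma_nom.T, sigma_Phi)

# Real diffusion and real drift of wealth per unit of wealth (no consumption)
real_diffusion = pi_safe @ sigma_nom - sigma_Phi
real_drift = (r_nom - phi + sigma_Phi @ sigma_Phi
              + pi_safe @ (mu_nom - r_nom - sigma_nom @ sigma_Phi))

# Real short rate implied by the closed-form expression
real_rate = r_nom - phi + sigma_Phi @ lam_nom

print("safe portfolio:", pi_safe)
print("real short rate:", real_rate)
```

The real diffusion vector comes out as zero up to rounding, and the real drift of the portfolio coincides with the closed-form real short rate.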
The above set-up involves $m+1$ assets in total, all of them being risky in an inflation-adjusted sense except when $\hat\sigma_{\Phi t} = 0$ and $\tilde\sigma_t$ is non-singular. We have given special attention to one of these assets, namely the nominally risk-free asset. Since we can loosely interpret this asset as cash, it may make sense to include that asset in an asset allocation framework. However, there is really nothing especially attractive about that asset. Therefore we might as well collect all real risky asset prices in a vector $P_t$ with dynamics of the form
$$ dP_t = \mathrm{diag}(P_t)\left[\mu_t\,dt + \sigma_t\,dz_t\right] \tag{12.3} $$
and no other assets, in particular no risk-free asset. If we let $\pi_t$ denote the vector of portfolio weights invested in these assets, we have to require that $\pi_t^\top 1 = 1$. The real wealth dynamics is
$$ dW_t = \left[W_t\pi_t^\top\mu_t - c_t\right]dt + W_t\pi_t^\top\sigma_t\,dz_t. \tag{12.4} $$
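The earlier formulation of the price dynamics can be fitted into this more general framework by letting the nominally risk-free asset be one of the assets, say the one corresponding to the last element in the price vector. Furthermore, we must let
$$ \pi_t = \begin{pmatrix} \tilde\pi_t \\ 1 - \tilde\pi_t^\top 1 \end{pmatrix}, \tag{12.5} $$
$$ z_t = \begin{pmatrix} \tilde z_t \\ \hat z_t \end{pmatrix}, \qquad \mu_t = \begin{pmatrix} \tilde\mu_t - \tilde\sigma_t\sigma_{\Phi t} - \left(\varphi_t - \|\sigma_{\Phi t}\|^2 - \hat\sigma_{\Phi t}^2\right)1 \\ \tilde r_t - \varphi_t + \|\sigma_{\Phi t}\|^2 + \hat\sigma_{\Phi t}^2 \end{pmatrix}, $$
$$ \sigma_t = \begin{pmatrix} \tilde\sigma_t - 1\sigma_{\Phi t}^\top & -\hat\sigma_{\Phi t}1 \\ -\sigma_{\Phi t}^\top & -\hat\sigma_{\Phi t} \end{pmatrix}. \tag{12.6} $$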
12.3 Constant investment opportunities
In this section we solve our general utility maximization problem in the case where no asset is risk-free in real terms and real investment opportunities are constant.
12.3.1 General formulation
First, we consider the case where the real price dynamics of all available assets are given by (12.3) so that the real wealth dynamics for a given consumption process $c = (c_t)$ and a given portfolio process $\pi = (\pi_t)$ is represented by (12.4). The indirect utility function is
$$ J(W,t) = \sup_{(c_s,\pi_s)_{s\in[t,T]}} E_{W,t}\left[\int_t^T e^{-\delta(s-t)}\varepsilon_1 u(c_s)\,ds + e^{-\delta(T-t)}\varepsilon_2 u(W_T)\right], $$
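where the indicators $\varepsilon_1$ and $\varepsilon_2$ are either zero or one with at least one of them being equal to one, and where it is implicitly understood that $\pi_s^\top 1 = 1$ for all $s$. The HJB-equation associated with the utility maximization problem is
$$ \delta J(W,t) = \sup_{c\ge 0,\ \pi^\top 1 = 1}\left\{\varepsilon_1\left(u(c) - cJ_W(W,t)\right) + \frac{\partial J}{\partial t}(W,t) + J_W(W,t)W\pi^\top\mu + \frac{1}{2}J_{WW}(W,t)W^2\pi^\top\sigma\sigma^\top\pi\right\} $$
with the terminal condition $J(W,T) = \varepsilon_2 u(W)$. The first-order condition for consumption is the usual envelope condition. The first-order condition for the portfolio is now different since we have to maximize under the constraint $\pi^\top 1 = 1$. The Lagrangian for this constrained maximization problem is
$$ \mathcal{L} = J_W(W,t)W\pi^\top\mu + \frac{1}{2}J_{WW}(W,t)W^2\pi^\top\sigma\sigma^\top\pi - \nu\left(1 - \pi^\top 1\right), $$
where $\nu$ is the Lagrange multiplier. Solving for $\pi$, we get
$$ \pi = -\frac{J_W(W,t)}{WJ_{WW}(W,t)}\left(\sigma\sigma^\top\right)^{-1}\mu - \frac{\nu}{W^2J_{WW}(W,t)}\left(\sigma\sigma^\top\right)^{-1}1. $$
The constraint $\pi^\top 1 = 1^\top\pi = 1$ implies that
$$ 1 = -\frac{J_W(W,t)}{WJ_{WW}(W,t)}1^\top\left(\sigma\sigma^\top\right)^{-1}\mu - \frac{\nu}{W^2J_{WW}(W,t)}1^\top\left(\sigma\sigma^\top\right)^{-1}1, $$
so that
$$ -\frac{\nu}{W^2J_{WW}(W,t)} = \frac{1 + \frac{J_W(W,t)}{WJ_{WW}(W,t)}1^\top\left(\sigma\sigma^\top\right)^{-1}\mu}{1^\top\left(\sigma\sigma^\top\right)^{-1}1}. $$
The optimal portfolio is therefore
$$ \pi = -\frac{J_W(W,t)}{WJ_{WW}(W,t)}\left(\sigma\sigma^\top\right)^{-1}\mu + \frac{1 + \frac{J_W(W,t)}{WJ_{WW}(W,t)}1^\top\left(\sigma\sigma^\top\right)^{-1}\mu}{1^\top\left(\sigma\sigma^\top\right)^{-1}1}\left(\sigma\sigma^\top\right)^{-1}1. $$
This is a combination of two portfolios, namely the portfolio
$$ \pi^{\text{slope}} = \frac{1}{1^\top\left(\sigma\sigma^\top\right)^{-1}\mu}\left(\sigma\sigma^\top\right)^{-1}\mu $$
and the portfolio
$$ \pi^{\text{min}} = \frac{1}{1^\top\left(\sigma\sigma^\top\right)^{-1}1}\left(\sigma\sigma^\top\right)^{-1}1, $$
since the optimal portfolio can be written as
$$ \pi = -\frac{J_W(W,t)}{WJ_{WW}(W,t)}\left(1^\top\left(\sigma\sigma^\top\right)^{-1}\mu\right)\pi^{\text{slope}} + \left(1 + \frac{J_W(W,t)}{WJ_{WW}(W,t)}1^\top\left(\sigma\sigma^\top\right)^{-1}\mu\right)\pi^{\text{min}}. \tag{12.7} $$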
Again, we have a two-fund separation result. As discussed in the one-period mean-variance framework, see Equations (3.16) and (3.17), we can interpret $\pi^{\text{min}}$ as the minimum-variance portfolio and $\pi^{\text{slope}}$ as the portfolio with the largest mean-to-standard deviation ratio (i.e., the maximum slope in a $(\sigma,\mu)$-diagram).
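With CRRA utility of both consumption and terminal wealth, it can be shown (by solving the HJB-equation) that the indirect utility function is given by
$$ J(W,t) = \frac{1}{1-\gamma}g(t)^{\gamma}W^{1-\gamma}, \tag{12.8} $$
where
$$ g(t) = \frac{1}{A}\left(\varepsilon_1^{1/\gamma} + \left[\varepsilon_2^{1/\gamma}A - \varepsilon_1^{1/\gamma}\right]e^{-A[T-t]}\right), $$
and
$$ A = \frac{\delta}{\gamma} - \frac{1-\gamma}{2\gamma^2}\left(\mu^\top\left(\sigma\sigma^\top\right)^{-1}\mu - \gamma^2k^2\,1^\top\left(\sigma\sigma^\top\right)^{-1}1\right), $$
$$ k = \frac{1}{1^\top\left(\sigma\sigma^\top\right)^{-1}1}\left(1 - \frac{1}{\gamma}1^\top\left(\sigma\sigma^\top\right)^{-1}\mu\right). $$
The optimal portfolio is then
$$ \pi = \frac{1}{\gamma}\left(\sigma\sigma^\top\right)^{-1}\mu + k\left(\sigma\sigma^\top\right)^{-1}1, $$
while the optimal consumption rate is
$$ c = \varepsilon_1^{1/\gamma}\frac{W}{g(t)}. $$
The general structure of the solution is thus the same as for the case with a traded risk-free asset.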
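The CRRA portfolio without a risk-free asset and its two-fund decomposition can be illustrated numerically. The sketch below assumes hypothetical real drifts and volatilities for three risky assets; the function name and all parameter values are illustrative assumptions, not taken from the text.

```python
import numpy as np

def crra_portfolio_no_riskfree(mu, sigma, gamma):
    """Optimal CRRA weights when all assets are risky and weights must sum to one.

    Returns the optimal portfolio, the maximum-slope fund, the
    minimum-variance fund, and the constant k multiplying (sigma sigma^T)^{-1} 1.
    """
    Sigma = sigma @ sigma.T                    # variance-covariance matrix
    ones = np.ones(len(mu))
    Sinv_mu = np.linalg.solve(Sigma, mu)       # Sigma^{-1} mu
    Sinv_1 = np.linalg.solve(Sigma, ones)      # Sigma^{-1} 1
    pi_slope = Sinv_mu / (ones @ Sinv_mu)
    pi_min = Sinv_1 / (ones @ Sinv_1)
    k = (1.0 - (ones @ Sinv_mu) / gamma) / (ones @ Sinv_1)
    pi = Sinv_mu / gamma + k * Sinv_1
    return pi, pi_slope, pi_min, k

# Hypothetical real drifts and volatility matrix (assumptions for illustration)
mu = np.array([0.06, 0.03, 0.01])
sigma = np.array([[0.20, 0.00, 0.00],
                  [0.02, 0.08, 0.00],
                  [0.00, 0.01, 0.02]])
gamma = 4.0

pi, pi_slope, pi_min, k = crra_portfolio_no_riskfree(mu, sigma, gamma)

# Weight on the slope fund in the two-fund decomposition
Sigma = sigma @ sigma.T
w_slope = (np.ones(3) @ np.linalg.solve(Sigma, mu)) / gamma
print("optimal weights:", pi, "sum:", pi.sum())
```

The weights sum to one by construction, and the optimal portfolio equals `w_slope * pi_slope + (1 - w_slope) * pi_min`, which is the two-fund separation result for the relative risk tolerance $1/\gamma$.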
12.3.2 Formulation with a nominally risk-free asset
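Now let us consider the formulation where one of the assets is the nominally risk-free asset so that the dynamics of real wealth is of the form (12.1). The indirect utility function is now
$$ J(W,t) = \sup_{(c_s,\tilde\pi_s)_{s\in[t,T]}} E_{W,t}\left[\int_t^T e^{-\delta(s-t)}\varepsilon_1 u(c_s)\,ds + e^{-\delta(T-t)}\varepsilon_2 u(W_T)\right], $$
and the associated HJB-equation is
$$ \delta J(W,t) = \sup_{c,\tilde\pi}\Big\{\varepsilon_1\left(u(c) - cJ_W(W,t)\right) + \frac{\partial J}{\partial t}(W,t) + WJ_W(W,t)\left(\tilde r - \varphi + \|\sigma_\Phi\|^2 + \hat\sigma_\Phi^2 + \tilde\pi^\top\left[\tilde\mu - \tilde r 1 - \tilde\sigma\sigma_\Phi\right]\right) + \frac{1}{2}W^2J_{WW}(W,t)\left(\tilde\pi^\top\tilde\sigma\tilde\sigma^\top\tilde\pi + \|\sigma_\Phi\|^2 + \hat\sigma_\Phi^2 - 2\tilde\pi^\top\tilde\sigma\sigma_\Phi\right)\Big\} $$
with terminal condition $J(W,T) = \varepsilon_2 u(W)$. Here there is no constraint on the portfolio vector $\tilde\pi$. As always, if $\varepsilon_1 = 1$, the first-order condition for $c$ is the envelope condition, which implies that $c = I_u(J_W(W,t))$, where $I_u$ is the inverse of $u'$. The first-order condition for $\tilde\pi$ implies that
$$ \tilde\pi = \left(\tilde\sigma\tilde\sigma^\top\right)^{-1}\tilde\sigma\sigma_\Phi - \frac{J_W(W,t)}{WJ_{WW}(W,t)}\left(\tilde\sigma\tilde\sigma^\top\right)^{-1}\left(\tilde\mu - \tilde r 1 - \tilde\sigma\sigma_\Phi\right), \tag{12.9} $$
which gives the optimal portfolio weights in the nominally risky assets so that the optimal weight in the nominally risk-free asset is
$$ \pi^0_t = 1 - \tilde\pi^\top 1 = 1 - 1^\top\left(\tilde\sigma\tilde\sigma^\top\right)^{-1}\tilde\sigma\sigma_\Phi + \frac{J_W(W,t)}{WJ_{WW}(W,t)}1^\top\left(\tilde\sigma\tilde\sigma^\top\right)^{-1}\left(\tilde\mu - \tilde r 1 - \tilde\sigma\sigma_\Phi\right). \tag{12.10} $$
If $\tilde\sigma$ is non-singular, we may simplify these expressions to
$$ \tilde\pi = \left(\tilde\sigma^\top\right)^{-1}\sigma_\Phi - \frac{J_W(W,t)}{WJ_{WW}(W,t)}\left(\tilde\sigma^\top\right)^{-1}\left(\tilde\lambda - \sigma_\Phi\right), \tag{12.11} $$
$$ \pi^0_t = 1 - \tilde\pi^\top 1 = 1 - 1^\top\left(\tilde\sigma^\top\right)^{-1}\sigma_\Phi + \frac{J_W(W,t)}{WJ_{WW}(W,t)}1^\top\left(\tilde\sigma^\top\right)^{-1}\left(\tilde\lambda - \sigma_\Phi\right). $$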
Apparently, the optimal portfolio exhibits three-fund separation with the three funds being
(1) the portfolio of nominally risky assets given by the weights
$$ \tilde\pi = \frac{1}{1^\top\left(\tilde\sigma\tilde\sigma^\top\right)^{-1}\tilde\sigma\sigma_\Phi}\left(\tilde\sigma\tilde\sigma^\top\right)^{-1}\tilde\sigma\sigma_\Phi; $$
this basically mimics the inflation process as well as possible,
(2) the portfolio of nominally risky assets given by the weights
$$ \tilde\pi = \frac{1}{1^\top\left(\tilde\sigma\tilde\sigma^\top\right)^{-1}\left(\tilde\mu - \tilde r 1 - \tilde\sigma\sigma_\Phi\right)}\left(\tilde\sigma\tilde\sigma^\top\right)^{-1}\left(\tilde\mu - \tilde r 1 - \tilde\sigma\sigma_\Phi\right), $$
(3) the nominally risk-free asset.
However, since the formulation in this subsection is just a special case of that in the previous subsection, we know that two-fund separation obtains. In fact, using the link between the two formulations given by (12.5)-(12.6), one can verify (after some hours' work!) that the portfolio vector $(\tilde\pi_t^\top, \pi^0_t)^\top$ defined by (12.9) and (12.10) is identical to the portfolio vector $\pi_t$ defined by (12.7).
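The equivalence of the two formulations can at least be checked numerically for a CRRA investor, for whom $-J_W/(WJ_{WW}) = 1/\gamma$. The sketch below is only an illustration under hypothetical parameter values (all names and numbers are assumptions): it computes the weights from the formulation with a nominally risk-free asset, maps the nominal quantities into the general real-price formulation with the nominally risk-free asset as the last asset, and compares with the constrained solution.

```python
import numpy as np

# Hypothetical parameters (assumptions for illustration only)
gamma = 4.0
r_nom, phi = 0.05, 0.03
mu_nom = np.array([0.09, 0.06])
sigma_nom = np.array([[0.20, 0.00],
                      [0.03, 0.08]])
sigma_Phi = np.array([0.010, 0.015])   # inflation loadings on traded shocks
sigma_Phi_hat = 0.012                  # unhedgeable inflation volatility (non-zero)

m = len(mu_nom)
ones_m = np.ones(m)
infl_var = sigma_Phi @ sigma_Phi + sigma_Phi_hat**2

# Formulation with a nominally risk-free asset, using 1/gamma for -J_W/(W J_WW)
Sigma_nom = sigma_nom @ sigma_nom.T
adj_excess = mu_nom - r_nom * ones_m - sigma_nom @ sigma_Phi
pi_nom = (np.linalg.solve(Sigma_nom, sigma_nom @ sigma_Phi)
          + np.linalg.solve(Sigma_nom, adj_excess) / gamma)
pi_cash = 1.0 - pi_nom.sum()

# General formulation: real drifts and volatilities of all m + 1 assets
mu = np.append(mu_nom - sigma_nom @ sigma_Phi - (phi - infl_var) * ones_m,
               r_nom - phi + infl_var)
sigma = np.block([
    [sigma_nom - np.outer(ones_m, sigma_Phi), -sigma_Phi_hat * ones_m[:, None]],
    [-sigma_Phi[None, :],                     np.array([[-sigma_Phi_hat]])],
])
Sigma = sigma @ sigma.T
ones = np.ones(m + 1)
Sinv_mu = np.linalg.solve(Sigma, mu)
Sinv_1 = np.linalg.solve(Sigma, ones)
k = (1.0 - (ones @ Sinv_mu) / gamma) / (ones @ Sinv_1)
pi_general = Sinv_mu / gamma + k * Sinv_1

print("with nominal cash :", np.append(pi_nom, pi_cash))
print("general, constrained:", pi_general)
```

Both routes maximize the same mean-variance trade-off in real terms, so the two printed vectors agree up to rounding. Note that the general formulation needs a non-zero unhedgeable inflation volatility here; otherwise the real variance-covariance matrix of the $m+1$ assets is singular.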
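For CRRA utility, the indirect utility function is, of course, given by (12.8), but we can rewrite the constant $A$ as
$$ A = \frac{\delta}{\gamma} + \frac{\gamma-1}{\gamma}\Big(\tilde r - \varphi + \frac{1}{2\gamma}(\tilde\mu - \tilde r 1)^\top\left(\tilde\sigma\tilde\sigma^\top\right)^{-1}(\tilde\mu - \tilde r 1) + \frac{\gamma-1}{\gamma}(\tilde\mu - \tilde r 1)^\top\left(\tilde\sigma\tilde\sigma^\top\right)^{-1}\tilde\sigma\sigma_\Phi - \frac{\gamma-2}{2}\left(\|\sigma_\Phi\|^2 + \hat\sigma_\Phi^2\right) + \frac{(\gamma-1)^2}{2\gamma}\sigma_\Phi^\top\tilde\sigma^\top\left(\tilde\sigma\tilde\sigma^\top\right)^{-1}\tilde\sigma\sigma_\Phi\Big). $$
If $\tilde\sigma$ is non-singular, we can write $A$ as
$$ A = \frac{\delta}{\gamma} + \frac{\gamma-1}{\gamma}\left(\tilde r - \varphi + \frac{1}{2\gamma}\|\tilde\lambda\|^2 + \frac{\gamma-1}{\gamma}\sigma_\Phi^\top\tilde\lambda - \frac{\gamma-2}{2}\hat\sigma_\Phi^2 + \frac{1}{2\gamma}\|\sigma_\Phi\|^2\right), $$
where $\tilde\lambda = \tilde\sigma^{-1}(\tilde\mu - \tilde r 1)$ as defined earlier.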
12.4 General stochastic investment opportunities
We now turn to the case where investment opportunities are stochastic. Let us consider the setting in which the investor will invest in a nominally risk-free asset and a number of nominally risky assets so that the dynamics of her real wealth for a given consumption process $c = (c_t)$ and a given portfolio process $\tilde\pi = (\tilde\pi_t)$ is represented by (12.1).
MORE TO COME LATER: go to specific case in next subsection...
12.5 Hedging real interest rate risk without real bonds
It is sometimes claimed that stocks are appropriate for hedging inflation uncertainty so that the real returns on stocks are quite stable relative to the real returns on long-term nominal bonds. This could explain the popular advice that long-term investors should invest more in stocks than short-term investors.
If only nominal bonds are traded, the optimal investment strategy of an investor with utility of terminal wealth only is to combine the mean-variance portfolio and the portfolio that has the highest correlation with the return on an indexed bond with a maturity equal to the remaining horizon. The hedge portfolio generally involves both stocks and nominal bonds; the precise mix will be determined by the correlation structure. If inflation uncertainty is modest, nominal bonds are good substitutes for real bonds (true in the U.S. for the period 1983-2000; not true for 1950-1982) and nominal bonds will dominate the hedge portfolio. Estimates on U.S. data over the period of approximately 1950-2000 show that the stock index is slightly positively correlated with the real interest rate. Hence the stock will enter the hedge portfolio with a negative weight, contrary to the popular advice.
General aspects of the portfolio choice problem with uncertain inflation are discussed by Munk and Sørensen (2007). The effects of uncertain inflation on portfolio choice have been studied in concrete settings by, e.g., Brennan and Xia (2002), Munk, Sørensen, and Vinther (2004), and Campbell and Viceira (2001). Both Brennan and Xia (2002) and Munk, Sørensen, and Vinther (2004) consider investors with CRRA utility of wealth at the end of a finite horizon, whereas Campbell and Viceira (2001) allow for intermediate consumption and a more general recursive utility specification in an infinite horizon setting. The infinite horizon assumption, however, makes it difficult to address effects due to investors having different investment horizons. In both Brennan and Xia (2002) and Campbell and Viceira (2001) (a proxy for) the real interest rate is described by a one-factor Vasicek model and the expected inflation dynamics is given by an Ornstein-Uhlenbeck process. The term structure of nominal interest rates is therefore described by a two-factor model. Munk, Sørensen, and Vinther (2004) differ slightly by assuming a one-factor Vasicek model for the nominal interest rates, while the implied term structure of real interest rates is described by a two-factor model. In the model of Munk, Sørensen, and Vinther it is impossible to replicate a real bond by trading in any number of nominal bonds whereas this is possible in the other models. The main conclusions of Brennan and Xia (2002) and Munk, Sørensen, and Vinther (2004) are very close, however. For concreteness, let us follow the set-up of Munk, Sørensen, and Vinther.$^{1}$
We consider the investment problem of an investor who has CRRA utility of terminal (time $T$) real wealth only. As before $\gamma$ represents the relative risk aversion of the agent. The investor can hold cash (i.e., a money market bank account), nominal bonds, and stocks. The nominal short rate dynamics is described by an Ornstein-Uhlenbeck process,
$$ d\tilde r_t = \kappa(\bar r - \tilde r_t)\,dt - \sigma_r\,dz_{1t}, $$
as we have previously assumed to hold for the real short rate. The dynamics of the nominal price $\tilde B_t$ of any bond (or other fixed-income security) is of the form
$$ d\tilde B_t = \tilde B_t\left[\left(\tilde r_t + \lambda_1\sigma_B(\tilde r_t,t)\right)dt + \sigma_B(\tilde r_t,t)\,dz_{1t}\right], $$
where $\lambda_1$ is the (nominal) market price of risk induced by the exogenous shock process $z_1$. The nominal stock price or stock index value (with dividends reinvested) is assumed to evolve according to the stochastic differential equation
$$ d\tilde S_t = \tilde S_t\left[\left(\tilde r_t + \psi\sigma_S\right)dt + \rho_{BS}\sigma_S\,dz_{1t} + \sqrt{1-\rho_{BS}^2}\,\sigma_S\,dz_{2t}\right]. $$
1 The model of Munk, Sørensen, and Vinther (2004) also allows for mean reversion in stock returns in a similar way as studied in Chapter 11. We ignore that feature in the discussion here.
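The parameter $\rho_{BS}$ is the correlation between bond market returns and stock market returns, $\sigma_S$ is the volatility of the nominal stock price, and $\psi$ is the Sharpe ratio of the stock, which we assume constant. In total, the dynamics of nominal asset prices can be written as
$$ \begin{pmatrix} d\tilde B_t \\ d\tilde S_t \end{pmatrix} = \begin{pmatrix} \tilde B_t & 0 \\ 0 & \tilde S_t \end{pmatrix}\left[\left(\tilde r_t 1 + \begin{pmatrix} \sigma_B(\tilde r_t,t) & 0 \\ \rho_{BS}\sigma_S & \sqrt{1-\rho_{BS}^2}\,\sigma_S \end{pmatrix}\begin{pmatrix} \lambda_1 \\ \lambda_2 \end{pmatrix}\right)dt + \begin{pmatrix} \sigma_B(\tilde r_t,t) & 0 \\ \rho_{BS}\sigma_S & \sqrt{1-\rho_{BS}^2}\,\sigma_S \end{pmatrix}\begin{pmatrix} dz_{1t} \\ dz_{2t} \end{pmatrix}\right], $$
where $\lambda_2 = (\psi - \rho_{BS}\lambda_1)/\sqrt{1-\rho_{BS}^2}$. Letting $\pi = (\pi_B, \pi_S)^\top$ denote the fractions of wealth invested in the bond and the stock, the nominal wealth $\tilde W_t$ will evolve as
$$ d\tilde W_t = \tilde W_t\left[\left(\tilde r_t + \pi_t^\top\sigma(\tilde r_t,t)\lambda\right)dt + \pi_t^\top\sigma(\tilde r_t,t)\begin{pmatrix} dz_{1t} \\ dz_{2t} \end{pmatrix}\right], $$
where
$$ \sigma(\tilde r_t,t) = \begin{pmatrix} \sigma_B(\tilde r_t,t) & 0 \\ \rho_{BS}\sigma_S & \sqrt{1-\rho_{BS}^2}\,\sigma_S \end{pmatrix}, \qquad \lambda = \begin{pmatrix} \lambda_1 \\ \lambda_2 \end{pmatrix}. $$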
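The dynamics of the nominal price of the consumption good is given by the following system of differential equations:
$$ \frac{d\Phi_t}{\Phi_t} = \varphi_t\,dt + \sigma_{\Phi 1}\,dz_{1t} + \sigma_{\Phi 2}\,dz_{2t} + \sigma_{\Phi 3}\,dz_{3t}, $$
and
$$ d\varphi_t = \beta(\bar\varphi - \varphi_t)\,dt + \sigma_{\varphi 1}\,dz_{1t} + \sigma_{\varphi 2}\,dz_{2t} + \sigma_{\varphi 3}\,dz_{3t} + \sigma_{\varphi 4}\,dz_{4t}, $$
where $\varphi_t$ is the expected rate of inflation, $\bar\varphi$ describes the long-run mean of the rate of inflation, $\beta$ describes the degree of mean-reversion, and the volatility coefficients $\sigma_{\Phi k}$ and $\sigma_{\varphi k}$ are all constant. Define $\sigma_\Phi^2 = \sigma_{\Phi 1}^2 + \sigma_{\Phi 2}^2 + \sigma_{\Phi 3}^2$ and $\sigma_\varphi^2 = \sigma_{\varphi 1}^2 + \sigma_{\varphi 2}^2 + \sigma_{\varphi 3}^2 + \sigma_{\varphi 4}^2$. The instantaneous variance rates of the price index and the expected inflation rate are then $\sigma_\Phi^2\Phi_t^2$ and $\sigma_\varphi^2$, respectively.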
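The real wealth of the investor at time $t$ is $W_t = \tilde W_t/\Phi_t$, which by Itô's Lemma has the dynamics
$$ dW_t = W_t\left[\left(\tilde r_t - \varphi_t + \sigma_\Phi^2 + \pi_t^\top\sigma(\tilde r_t,t)\left(\lambda - \begin{pmatrix} \sigma_{\Phi 1} \\ \sigma_{\Phi 2} \end{pmatrix}\right)\right)dt + \pi_t^\top\sigma(\tilde r_t,t)\begin{pmatrix} dz_{1t} \\ dz_{2t} \end{pmatrix} - \begin{pmatrix} \sigma_{\Phi 1} \\ \sigma_{\Phi 2} \\ \sigma_{\Phi 3} \end{pmatrix}^\top\begin{pmatrix} dz_{1t} \\ dz_{2t} \\ dz_{3t} \end{pmatrix}\right], $$
which is just what equation (12.2) looks like in this specific model. The variables $W$, $\tilde r$, and $\varphi$ form a Markov system and provide sufficient information for the decisions of the investor. Hence, the indirect utility is given as a function $J(W,\tilde r,\varphi,t)$.
Let us focus on the utility maximization problem of an investor with utility of terminal wealth so that the indirect utility function is
$$ J(W,\tilde r,\varphi,t) = \sup_{(\pi_s)_{s\in[t,T]}} E_{W,\tilde r,\varphi,t}\left[u(W_T)\right]. $$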
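The associated HJB-equation is
$$ \begin{aligned} 0 = \sup_{\pi=(\pi_B,\pi_S)^\top\in\mathbb{R}^2}\Bigg\{ & J_t + WJ_W\left(\tilde r - \varphi + \sigma_\Phi^2 + \pi^\top\sigma(\tilde r,t)\left(\lambda - \begin{pmatrix} \sigma_{\Phi 1} \\ \sigma_{\Phi 2} \end{pmatrix}\right)\right) \\ & + \frac{1}{2}W^2J_{WW}\left(\pi^\top\sigma(\tilde r,t)\sigma(\tilde r,t)^\top\pi + \sigma_\Phi^2 - 2\pi^\top\sigma(\tilde r,t)\begin{pmatrix} \sigma_{\Phi 1} \\ \sigma_{\Phi 2} \end{pmatrix}\right) \\ & + \kappa(\bar r - \tilde r)J_{\tilde r} + \frac{1}{2}\sigma_r^2J_{\tilde r\tilde r} + \beta(\bar\varphi - \varphi)J_\varphi + \frac{1}{2}\sigma_\varphi^2J_{\varphi\varphi} \\ & - WJ_{W\tilde r}\,\sigma_r\left(\pi_B\sigma_B + \pi_S\rho_{BS}\sigma_S - \sigma_{\Phi 1}\right) - J_{\tilde r\varphi}\,\sigma_r\sigma_{\varphi 1} \\ & + WJ_{W\varphi}\left(\pi^\top\sigma(\tilde r,t)\begin{pmatrix} \sigma_{\varphi 1} \\ \sigma_{\varphi 2} \end{pmatrix} - \begin{pmatrix} \sigma_{\Phi 1} \\ \sigma_{\Phi 2} \\ \sigma_{\Phi 3} \end{pmatrix}^\top\begin{pmatrix} \sigma_{\varphi 1} \\ \sigma_{\varphi 2} \\ \sigma_{\varphi 3} \end{pmatrix}\right)\Bigg\}. \end{aligned} \tag{12.12} $$
Here the term multiplying $J_{W\tilde r}$ is the instantaneous covariance between real wealth and the nominal short rate, which involves the full loading $\pi_B\sigma_B + \pi_S\rho_{BS}\sigma_S$ of the portfolio on $z_1$. The boundary condition is $J(W,\tilde r,\varphi,T) = u(W)$. The first-order condition of the maximization problem in (12.12) provides the following characterization of the optimal risky asset proportions $\pi$:
$$ \pi = \begin{pmatrix} \pi_B \\ \pi_S \end{pmatrix} = \left(\sigma(\tilde r,t)^\top\right)^{-1}\begin{pmatrix} \sigma_{\Phi 1} \\ \sigma_{\Phi 2} \end{pmatrix} - \frac{J_W}{WJ_{WW}}\left(\sigma(\tilde r,t)^\top\right)^{-1}\left(\lambda - \begin{pmatrix} \sigma_{\Phi 1} \\ \sigma_{\Phi 2} \end{pmatrix}\right) + \frac{J_{W\tilde r}}{WJ_{WW}}\frac{\sigma_r}{\sigma_B(\tilde r,t)}\begin{pmatrix} 1 \\ 0 \end{pmatrix} - \frac{J_{W\varphi}}{WJ_{WW}}\left(\sigma(\tilde r,t)^\top\right)^{-1}\begin{pmatrix} \sigma_{\varphi 1} \\ \sigma_{\varphi 2} \end{pmatrix}. \tag{12.13} $$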
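The first two terms are also present in the setting with constant investment opportunities, cf. (12.11). The last two terms hedge variations in the two state variables, i.e., the nominal short-term interest rate $\tilde r$ and the expected inflation rate $\varphi$. Since the nominal bond price is perfectly correlated with the nominal short rate, only the nominal bond is used for hedging those variations. This is similar to the analysis in Chapter 10. On the other hand, both the nominal bond and the stock are generally used for hedging variations in the expected inflation rate with the weights determined by
$$ \left(\sigma(\tilde r,t)^\top\right)^{-1}\begin{pmatrix} \sigma_{\varphi 1} \\ \sigma_{\varphi 2} \end{pmatrix} = \begin{pmatrix} \dfrac{1}{\sigma_B(\tilde r,t)}\left(\sigma_{\varphi 1} - \dfrac{\rho_{BS}}{\sqrt{1-\rho_{BS}^2}}\sigma_{\varphi 2}\right) \\[2ex] \dfrac{\sigma_{\varphi 2}}{\sigma_S\sqrt{1-\rho_{BS}^2}} \end{pmatrix}. $$
Note that the values of $\sigma_{\varphi 1}$ and $\sigma_{\varphi 2}$ capture the correlations between the asset returns and the expected inflation:
$$ \sigma_{\varphi 1} = \rho_{B\varphi}\sigma_\varphi, \qquad \rho_{BS}\sigma_{\varphi 1} + \sqrt{1-\rho_{BS}^2}\,\sigma_{\varphi 2} = \rho_{S\varphi}\sigma_\varphi. $$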
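Now let us specialize to the case of CRRA utility, $u(W) = \frac{1}{1-\gamma}W^{1-\gamma}$. Note that the dynamics of the state variables $\tilde r$ and $\varphi$ have an affine structure. Given the analysis of Chapter 7, it should therefore come as no surprise that the indirect utility function of the CRRA investor is given by
$$ J(W,\tilde r,\varphi,t) = \frac{1}{1-\gamma}\left(We^{A_0(T-t) + A_1(T-t)\tilde r + A_2(T-t)\varphi}\right)^{1-\gamma}, $$
where
$$ A_1(\tau) = \frac{1}{\kappa}\left(1 - e^{-\kappa\tau}\right), \qquad A_2(\tau) = -\frac{1}{\beta}\left(1 - e^{-\beta\tau}\right), $$
and $A_0$ can be found explicitly, but is not important for the optimal portfolio choice. By substitution of the relevant derivatives into (12.13), the vector of optimal risky asset allocations at time $t$ is given by
$$ \begin{aligned} \begin{pmatrix} \pi_B \\ \pi_S \end{pmatrix} &= \left(\sigma(\tilde r,t)^\top\right)^{-1}\begin{pmatrix} \sigma_{\Phi 1} \\ \sigma_{\Phi 2} \end{pmatrix} + \frac{1}{\gamma}\left(\sigma(\tilde r,t)^\top\right)^{-1}\left(\lambda - \begin{pmatrix} \sigma_{\Phi 1} \\ \sigma_{\Phi 2} \end{pmatrix}\right) \\ &\quad + \left(1 - \frac{1}{\gamma}\right)A_1(T-t)\frac{\sigma_r}{\sigma_B(\tilde r,t)}\begin{pmatrix} 1 \\ 0 \end{pmatrix} - \left(1 - \frac{1}{\gamma}\right)A_2(T-t)\left(\sigma(\tilde r,t)^\top\right)^{-1}\begin{pmatrix} \sigma_{\varphi 1} \\ \sigma_{\varphi 2} \end{pmatrix} \\ &= \frac{1}{\gamma}\left(\sigma(\tilde r,t)^\top\right)^{-1}\lambda + \left(1 - \frac{1}{\gamma}\right)A_1(T-t)\frac{\sigma_r}{\sigma_B(\tilde r,t)}\begin{pmatrix} 1 \\ 0 \end{pmatrix} \\ &\quad + \left(1 - \frac{1}{\gamma}\right)\left(\sigma(\tilde r,t)^\top\right)^{-1}\left(\begin{pmatrix} \sigma_{\Phi 1} \\ \sigma_{\Phi 2} \end{pmatrix} - A_2(T-t)\begin{pmatrix} \sigma_{\varphi 1} \\ \sigma_{\varphi 2} \end{pmatrix}\right). \end{aligned} \tag{12.14} $$
The residual $1 - \pi_S - \pi_B$ is invested in the nominally risk-free bank account.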
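To get a feel for the three components in (12.14), the allocation can be evaluated for a given horizon. The following is a minimal sketch under assumed, purely hypothetical parameter values (they are not the estimates of Munk, Sørensen, and Vinther); the bond is taken to be a zero-coupon bond maturing at the investment horizon, so that its volatility equals the short-rate volatility times $A_1(T-t)$.

```python
import numpy as np

def A1(tau, kappa):
    """Loading on the nominal short rate; positive and increasing in tau."""
    return (1.0 - np.exp(-kappa * tau)) / kappa

def A2(tau, beta):
    """Loading on expected inflation; negative and decreasing in tau."""
    return -(1.0 - np.exp(-beta * tau)) / beta

# Hypothetical parameters (assumptions for illustration only)
gamma, tau = 4.0, 10.0                      # risk aversion, remaining horizon T - t
kappa, sigma_r = 0.2, 0.015                 # short-rate mean reversion and volatility
beta = 0.3                                  # mean reversion of expected inflation
lam1, psi = 0.25, 0.30                      # bond market price of risk, stock Sharpe ratio
rho_BS, sigma_S = 0.2, 0.18                 # bond-stock correlation, stock volatility
sigma_Phi12 = np.array([0.002, -0.001])     # price-index loadings on z1, z2
sigma_phi12 = np.array([0.004, -0.002])     # expected-inflation loadings on z1, z2

sigma_B = sigma_r * A1(tau, kappa)          # zero-coupon bond maturing at the horizon
sigma = np.array([[sigma_B, 0.0],
                  [rho_BS * sigma_S, np.sqrt(1.0 - rho_BS**2) * sigma_S]])
lam = np.array([lam1, (psi - rho_BS * lam1) / np.sqrt(1.0 - rho_BS**2)])

def solve_T(v):
    """Apply the inverse of the transposed volatility matrix to v."""
    return np.linalg.solve(sigma.T, v)

w = 1.0 - 1.0 / gamma
rate_hedge = w * A1(tau, kappa) * sigma_r / sigma_B * np.array([1.0, 0.0])

# First and second line of the allocation formula
pi_first = (solve_T(sigma_Phi12) + solve_T(lam - sigma_Phi12) / gamma
            + rate_hedge - w * A2(tau, beta) * solve_T(sigma_phi12))
pi_second = (solve_T(lam) / gamma + rate_hedge
             + w * solve_T(sigma_Phi12 - A2(tau, beta) * sigma_phi12))
cash = 1.0 - pi_second.sum()

print("bond, stock:", pi_second, "cash:", cash)
```

With the bond maturing at the horizon, the interest rate hedge collapses to a constant bond weight of $1 - 1/\gamma$, and the two ways of writing the allocation give the same numbers.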
The optimal portfolio weights for CRRA investors are linear combinations of the speculative portfolio and the different hedge portfolios. In particular, for investors with the same investment horizon $T$ the optimal portfolios are linear combinations of the speculative portfolio and a single hedge portfolio; the relative risk tolerance, $1/\gamma$, describes the weights on the two relevant portfolios.
The second term in (12.14) describes the hedge against changes in the nominal interest rate and consists entirely of a position in the bond. As noted in Section 10.2, the occurrence of this hedge term implies that the bond/stock ratio will increase with the risk aversion, consistent with popular recommendations. If the bond is a zero-coupon bond of the same maturity as the horizon of the investor, it is a well-known result from Vasicek's model that the volatility of the bond is $\sigma_B(\tilde r,t) = \sigma_rA_1(T-t)$. In that case this hedge term will be constant over time. The last hedge term in (12.14) describes the inflation hedge and involves the stock. This term depends on the investment horizon through the negative and decreasing function $A_2(T-t)$. In particular, the parameter $\beta$ determines the difference in the stock allocations for myopic and long-term investors with the same relative risk aversion. If $\beta$ is small, changes in the expected inflation rate are relatively permanent, and horizon effects may be significant. However, whether this horizon effect implies more or fewer stocks for the long-term investor depends on the sign of the correlation $\rho_{S\varphi}$ between stock returns and inflation, that is, whether the stock serves as a relatively good substitute for the real bond that should ideally be used for hedging changes in real rates in a complete market setting. Moreover, while the last term in (12.14) can potentially explain the typically recommended horizon dependence for stocks, it may also change the ratio between bonds and stocks.
Munk, Sørensen, and Vinther calibrate the model using historical US data from the period 1951-2001. The estimation is based on maximum likelihood and an application of the Kalman filter. The point estimate of the correlation parameter $\rho_{S\varphi}$ is slightly negative so that the optimal stock weight for $\gamma > 1$ is slightly decreasing with the investment horizon, in contrast to popular investment advice. The stock index is, in fact, positively correlated with the (proxy for the) real interest rate$^{2}$ and is therefore a bad substitute for the relevant real bond that should ideally be used as the instrument for hedging long-term inflation risk and real interest rate risk. However, when the capital market parameters are allowed to vary within intervals of plus-minus two standard deviations on the estimates (which could reflect reasonable uncertainty on the parameter estimates), the theoretical asset allocation results can closely mimic popular asset allocation advice. In particular, the model can generate both a bond/stock ratio which is increasing in the risk aversion coefficient and a stock investment that increases with the length of the investment horizon. The recommendations are quantitatively very difficult to match, however.
2 Under the assumptions of the model, the (proxy for the) real short-term interest rate is given by the nominal interest rate minus the expected inflation rate plus a constant.
CHAPTER 13
Labor income
13.1 Introduction
In the general description of the continuous-time model in Section 5.2 we allowed for the case where the agent receives income from non-financial sources at a rate $y_t$. But in all the concrete problems studied until now we have assumed $y_t \equiv 0$. We shall refer to income from non-financial sources as labor income although this may in general include gifts, welfare payments, etc. In this chapter we will study the influence of labor income on optimal portfolio and consumption choice. Intuitively, the effects of labor income will depend on the present value of the future labor income, which we will refer to as the human wealth, and on the riskiness of labor income. An investor will focus on the magnitude and riskiness of her total wealth, i.e., the sum of the current financial and the human wealth. The size of the human wealth will therefore affect how much to consume and how much to invest, and the riskiness of the human wealth will affect the allocation of financial wealth between risky assets and the risk-free asset.
13.2 A motivating example
Let us look at a small numerical example illustrating the main effects of labor income.$^{1}$ Assume that investment opportunities are constant and that a single risky financial asset (representing the stock market index) is traded. With constant interest rates the risk-free asset is equivalent to a bond. Consider an investor with a financial wealth of 500,000 dollars and a constant relative risk aversion of $\gamma = 2$. Assume that the risk-free interest rate is $r = 4\%$, the expected rate of return on stocks is $\mu = 10\%$, and the volatility of the stock is $\sigma = 20\%$. (The market price of risk is $\lambda = (\mu - r)/\sigma = 0.3$.) We know from the analysis in Chapter 6 that, in the absence of labor income, it is optimal for the investor to have $\lambda/(\gamma\sigma) = 75\%$ of her wealth invested in stocks and 25% in the risk-free asset, i.e., the bond. When the investor receives labor income it seems fair to conjecture that she will invest her financial wealth such that the riskiness of her total position corresponds to investing 75% of her total wealth in stocks. We will verify this in a following section.
1 The example is inspired by Jagannathan and Kocherlakota (1996).

                        Stock investment       Bond investment
Risk-free income        0 (0%)                 500,000 (100%)
  Financial inv.        750,000 (150%)         -250,000 (-50%)
  Total position        750,000 (75%)          250,000 (25%)
Quite risky income      250,000 (50%)          250,000 (50%)
  Financial inv.        500,000 (100%)         0 (0%)
  Total position        750,000 (75%)          250,000 (25%)
Very risky income       500,000 (100%)         0 (0%)
  Financial inv.        250,000 (50%)          250,000 (50%)
  Total position        750,000 (75%)          250,000 (25%)

Table 13.1: Investments with a relatively short horizon. The table shows the optimal investment strategy for three types of labor income. The financial wealth is 500,000 and the capitalized labor income is 500,000, corresponding to a relatively short investment horizon. Percentages in the income rows are fractions of human wealth, in the financial investment rows fractions of financial wealth, and in the total position rows fractions of total wealth.
Let us first assume that the investor has a labor income stream with a present value of 500,000 dollars and, hence, a total wealth of one million. It is then optimal to have a total position of 750,000 dollars in stocks and 250,000 dollars in the risk-free asset. How the financial wealth is to be allocated depends on the riskiness of her labor income. In Table 13.1 we consider three cases:
(a) If the labor income is completely risk-free, it is equivalent to a position of 0 dollars in stocks and 500,000 dollars in the risk-free asset. To obtain the desired overall riskiness, she has to allocate her financial wealth of 500,000 by investing 750,000 dollars in stocks and -250,000 dollars in the risk-free asset. This corresponds to a stock investment of 150% of the financial wealth, financed in part by borrowing 50% of the financial wealth. The certain labor income corresponds to the returns of a risk-free investment. Hence the financial wealth (and more) has to be invested in stocks to achieve the desired balance between risky and risk-free returns.
(b) If the labor income is quite risky and corresponds to an equal combination of stocks and bonds, the entire financial wealth (100%) is to be invested in stocks.
(c) If the labor income is extremely risky and corresponds to a 100% investment in stocks, the financial wealth is to be split equally between stocks and bonds.
Clearly, the optimal allocation of financial wealth is highly dependent on the risk profile of labor income.
Next, let us consider an investor with the same risk aversion, but a longer investment horizon and, consequently, a higher capitalized labor income, namely 1,500,000 dollars. Table 13.2 shows the allocation of the financial wealth that is needed to obtain the desired 75-25 split between risky and risk-free returns. Comparing with Table 13.1 we see that the younger investor in Table 13.2 will have a significantly higher fraction of financial wealth invested in stocks than the older investor in Table 13.1, except for the case where the income is extremely uncertain. The optimal stock weight in the portfolio clearly depends on the investment horizon.

                        Stock investment       Bond investment
Risk-free income        0 (0%)                 1,500,000 (100%)
  Financial inv.        1,500,000 (300%)       -1,000,000 (-200%)
  Total position        1,500,000 (75%)        500,000 (25%)
Quite risky income      750,000 (50%)          750,000 (50%)
  Financial inv.        750,000 (150%)         -250,000 (-50%)
  Total position        1,500,000 (75%)        500,000 (25%)
Very risky income       1,500,000 (100%)       0 (0%)
  Financial inv.        0 (0%)                 500,000 (100%)
  Total position        1,500,000 (75%)        500,000 (25%)

Table 13.2: Investments with a relatively long horizon. The table shows the optimal investment strategy for three types of labor income. The financial wealth is 500,000 and the capitalized labor income is 1,500,000, corresponding to a relatively long investment horizon.

According to empirical studies, the correlation between labor income and the stock market index is very small for most individuals.$^{2}$ In that case, labor income resembles a risk-free investment more than a stock investment, and the fraction of financial wealth invested in stocks should increase with the length of the investment horizon, in line with typical investment advice. However, for some investors the labor income may be highly correlated with the stock market, or at least some individual stocks, and in that case the weight of stocks in the financial portfolio should decrease with the length of the horizon.
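The arithmetic behind Tables 13.1 and 13.2 can be reproduced in a few lines. The sketch below assumes, as in the example, that labor income is equivalent to a fixed mix of stocks and the risk-free bond and that the investor targets 75% of total wealth in stocks; the function name and argument names are illustrative assumptions.

```python
import math

def financial_allocation(financial_wealth, human_wealth, income_stock_share,
                         target_stock_share=0.75):
    """Split financial wealth so that the total position hits the target stock share.

    income_stock_share is the fraction of human wealth that behaves like a stock
    position (0 for risk-free income, 1 for income as risky as the stock).
    Returns the dollar amounts of financial wealth in stocks and in bonds.
    """
    total_wealth = financial_wealth + human_wealth
    desired_stock = target_stock_share * total_wealth
    implicit_stock = income_stock_share * human_wealth
    stock = desired_stock - implicit_stock
    bond = financial_wealth - stock
    return stock, bond

# Stock share without labor income: (mu - r) / (gamma * sigma^2)
merton_share = (0.10 - 0.04) / (2.0 * 0.20**2)

for human_wealth in (500_000, 1_500_000):
    for label, share in (("risk-free", 0.0), ("quite risky", 0.5), ("very risky", 1.0)):
        stock, bond = financial_allocation(500_000, human_wealth, share)
        print(f"human wealth {human_wealth:>9,}, {label:11s} income: "
              f"stocks {stock:>11,.0f}, bonds {bond:>11,.0f}")
```

A negative bond amount means that the investor borrows at the risk-free rate to finance the stock position.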
13.3 Exogenous income in a complete market
13.3.1 General income and price dynamics
Now we will look at the problems more formally. We take our standard setting with an instantaneously risk-free asset with a rate of return of $r_t$ and $d$ risky assets with price dynamics
$$ dP_t = \mathrm{diag}(P_t)\left[\left(r_t1 + \sigma_t\lambda_t\right)dt + \sigma_t\,dz_t\right], $$
where $z = (z_1,\dots,z_d)^\top$ is a $d$-dimensional standard Brownian motion. We let $\theta_t$ be the vector of amounts invested in these risky assets at time $t$. The labor income rate is given by the process $y = (y_t)$. From (5.4) we have that wealth evolves as
$$ dW_t = \left[r_tW_t + \theta_t^\top\sigma_t\lambda_t + y_t - c_t\right]dt + \theta_t^\top\sigma_t\,dz_t, $$
where $c = (c_t)$ is the consumption rate process. We take a Markovian framework so that we can apply the dynamic programming approach. In this section we consider the case where the labor income rate is exogenously given. In a later section we incorporate explicitly the labor supply decision of the agent.
2 Davis and Willen (2000) find that, depending on the individual's sex, age, and educational level, the correlation between aggregate stock market returns and labor income shocks is between -0.25 and 0.3, while the correlation between industry-specific stock returns and labor income shocks is between -0.4 and 0.1. Campbell and Viceira (2002) report that the correlation between aggregate stock market returns and labor income shocks is between 0.328 and 0.516. Heaton and Lucas (2000) find that the labor income of entrepreneurs typically is more highly correlated with the overall stock market (0.14) than is the labor income of ordinary wage earners (-0.07).
Most studies of the effect of labor income on consumption and portfolio choice assume a process for the labor income rate such as
\[
dy_t = y_t\left[\alpha(y_t, t)\, dt + \xi(y_t, t)^\top dz_t + \hat\xi(y_t, t)\, d\hat z_t\right].
\]
If $\hat\xi \neq 0$, the income risk is not fully hedgeable in the financial market, which seems to be the realistic situation. However, this is a more difficult problem to analyze, so let us first look at the complete market case where $\hat\xi = 0$ so that the labor income process is spanned by the price processes of traded assets. It is well-documented that typical income growth rates and income volatility depend on the age of the individual, so that $\alpha$, $\xi$, and $\hat\xi$ in general should depend on time. Income growth rates tend to be high for young individuals and then slow down and eventually become slightly negative with age. See, e.g., Cocco, Gomes, and Maenhout (2005).
In the complete market case where $\hat\xi \equiv 0$, the income stream is fully hedgeable and can be valued as any financial asset. We can think of the income as the dividend stream from some (possibly strange) trading strategy in the traded financial assets. The time $t$ value of the income stream $(y_s)_{s \in [t,T]}$ must be
\[
\begin{aligned}
H(x, y, t) &= E^{\mathbb{Q}}_{x,y,t}\left[\int_t^T e^{-\int_t^s r(x_u)\, du}\, y_s\, ds\right] \\
&= E_{x,y,t}\left[\int_t^T \exp\left\{-\int_t^s r(x_u)\, du - \int_t^s \lambda(x_u)^\top dz_u - \frac{1}{2}\int_t^s \|\lambda(x_u)\|^2\, du\right\} y_s\, ds\right],
\end{aligned}
\]
where $\mathbb{Q}$ is the risk-neutral probability measure, and $x$ is a state variable affecting the short-term interest rate $r$ and the market price of risk vector $\lambda$. We refer to $H(x, y, t)$ as the human wealth of the agent at time $t$. In this situation we can think of the agent selling her future income in the financial market in exchange for the payment $H(x, y, t)$ so that she has a total wealth of $W + H(x, y, t)$ to invest. Intuitively, she will invest in a financial portfolio such that the riskiness of her total position of financial investments and labor income is similar to the riskiness of her optimal financial portfolio in the absence of labor income.
13.3.2 Constant investment opportunities and GBM income
For simplicity, we will in the following consider the classical Merton setting with constant investment opportunities, i.e., a constant interest rate $r$ and a constant market price of risk $\lambda$. Then the human wealth expression simplifies to
\[
H(y, t) = E_{y,t}\left[\int_t^T \exp\left\{-\left(r + \frac{1}{2}\|\lambda\|^2\right)(s - t) - \lambda^\top (z_s - z_t)\right\} y_s\, ds\right] \tag{13.1}
\]
and it is known from the Feynman-Kac theorem, frequently applied in the option pricing literature, that the function $H(y, t)$ satisfies the PDE
\[
H_t(y, t) + \left(\alpha(y, t) - \xi(y, t)^\top \lambda\right) y H_y(y, t) + \frac{1}{2}\|\xi(y, t)\|^2 y^2 H_{yy}(y, t) - r H(y, t) + y = 0. \tag{13.2}
\]
If we further assume that $\alpha$ and $\xi$ are constants, the labor income process is a geometric Brownian motion so that
\[
y_s = y_t \exp\left\{\left(\alpha - \frac{1}{2}\|\xi\|^2\right)(s - t) + \xi^\top (z_s - z_t)\right\}.
\]
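If we substitute this into (13.1), we can compute the human wealth in closed form as
\[
\begin{aligned}
H(y, t) &= y\, E_{y,t}\left[\int_t^T \exp\left\{-\left(r - \alpha + \frac{1}{2}\|\lambda\|^2 + \frac{1}{2}\|\xi\|^2\right)(s - t) + (\xi - \lambda)^\top (z_s - z_t)\right\} ds\right] \\
&= y \int_t^T E_{y,t}\left[\exp\left\{-\left(r - \alpha + \frac{1}{2}\|\lambda\|^2 + \frac{1}{2}\|\xi\|^2\right)(s - t) + (\xi - \lambda)^\top (z_s - z_t)\right\}\right] ds \\
&= y \int_t^T e^{-(r - \alpha + \xi^\top \lambda)(s - t)}\, ds \\
&= \begin{cases}
\dfrac{y}{r - \alpha + \xi^\top \lambda}\left(1 - e^{-(r - \alpha + \xi^\top \lambda)(T - t)}\right), & \text{if } r - \alpha + \xi^\top \lambda \neq 0, \\[2ex]
y (T - t), & \text{if } r - \alpha + \xi^\top \lambda = 0,
\end{cases} \\
&\equiv y M(t),
\end{aligned} \tag{13.3}
\]
i.e., the present value of the future income stream is given by the product of the current income and a time-dependent multiplier. The third equality in the above computation is due to the fact that $(\xi - \lambda)^\top (z_s - z_t) \sim N(0, \|\xi - \lambda\|^2 (s - t))$ and that for a random variable $x \sim N(m, s^2)$, we have $E[\exp\{-a x\}] = \exp\{-a m + \frac{1}{2} a^2 s^2\}$. Note that the human wealth itself depends on the riskiness of the labor income stream, in contrast to our numerical example in the previous section.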
Let us study the human wealth in a simple numerical example. The risk-free rate is $r = 0.02$, and we assume a single risky asset (the stock market index) with a volatility of $\sigma = 0.2$ and a Sharpe ratio of $\lambda = 0.3$. With a single risky asset, the sensitivity of the income rate is just a scalar, $\xi$, and since the income rate is assumed to be spanned, it must be either perfectly positively or perfectly negatively correlated with the price of the risky asset. It will be perfectly positively correlated with the asset price if $\xi$ is positive, and perfectly negatively correlated with the asset price if $\xi$ is negative. The volatility of the income rate is the absolute value, $|\xi|$. Let us assume that $\xi$ is either $+0.1$ or $-0.1$ so that the income rate volatility is 10%. In Figure 13.1 we illustrate how the income multiplier $M(t)$ and, hence, the human wealth depends on the time horizon for various values of the expected income growth rate $\alpha$ ranging from 1% to 6%. The human wealth naturally increases significantly with the expected growth rate. The left panel is for the case with an income-asset correlation of $+1$, while the right panel is for an income-asset correlation of $-1$. For young individuals with a long time horizon, the income multiplier is in all cases very large. For an individual with a 40-year income ahead with an expected annual growth rate of 4%, the human wealth is 33 times her current annual income if the correlation is $+1$ and 128 times current income if the correlation is $-1$! Clearly, the human wealth will dominate financial wealth for many young individuals. The income stream is more valuable if it is negatively correlated with the stock market than if it is positively correlated. The income is like the dividends from a traded asset and from the basic CAPM we know that assets that are positively correlated with the overall stock market have a high required expected return and a low present value. It follows from (13.3) that human wealth is decreasing in the term $\xi^\top \lambda$, which in the one asset framework is equal to $\rho |\xi| \lambda$, where the correlation $\rho$ is either $-1$ or $+1$. Consequently, the human wealth will be increasing in the income volatility $|\xi|$ if the income-asset correlation is negative.
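The multipliers quoted in this example can be checked with a few lines of Python. The following is a minimal sketch for the single-asset case, assuming the closed form (13.3) with the income multiplier written as $M = (1 - e^{-\kappa (T-t)})/\kappa$ and $\kappa = r - \alpha + \xi\lambda$ (here $\alpha$ is the expected income growth rate, $\xi$ the signed income volatility, and $\lambda$ the Sharpe ratio). The function names are illustrative only, and the quadrature routine is merely a cross-check of the closed form.

```python
import math

def income_multiplier(horizon, r, alpha, xi, lam):
    """Closed-form multiplier M = H/y from (13.3), single risky asset."""
    kappa = r - alpha + xi * lam          # risk-adjusted discount rate net of growth
    if abs(kappa) < 1e-12:                # degenerate case: M = T - t
        return horizon
    return (1.0 - math.exp(-kappa * horizon)) / kappa

def multiplier_by_quadrature(horizon, r, alpha, xi, lam, steps=100_000):
    """Midpoint-rule approximation of the integral of exp(-kappa*u) over [0, horizon]."""
    kappa = r - alpha + xi * lam
    h = horizon / steps
    return h * sum(math.exp(-kappa * (i + 0.5) * h) for i in range(steps))

# Parameters of the numerical example: r = 2%, Sharpe ratio 0.3, income growth 4%.
r, lam, alpha = 0.02, 0.3, 0.04
for xi in (0.1, -0.1):                    # correlation +1 and -1, respectively
    values = [round(income_multiplier(T, r, alpha, xi, lam), 2) for T in (10, 30, 40)]
    print(f"xi = {xi:+.1f}: multipliers for 10, 30, 40 years = {values}")
```

With a 40-year horizon the two multipliers round to the 33 and 128 quoted in the text; the sign of $\xi$ changes $\kappa$ from $0.01$ to $-0.05$, which is what drives the large difference.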
[Figure 13.1 (plots not reproduced). Panel (a): perfect positive correlation. Panel (b): perfect negative correlation. Horizontal axis: horizon, years (0 to 50); vertical axis: income multiplier.]

Figure 13.1: The income multiplier and the time horizon. The figures show how the present value of future income per unit of current income, i.e., $M(t)$, depends on the time horizon with either perfectly positive or perfectly negative correlation between the income rate and the asset price. The lowest curve is for $\alpha = 1\%$, the one just above is for $\alpha = 2\%$, etc., so that the top curve is for $\alpha = 6\%$.

We have from Theorem 6.2 that without labor income it is optimal for a CRRA utility investor to invest the proportions $\pi_t = \frac{1}{\gamma}\left(\underline{\underline{\sigma}}^\top\right)^{-1}\lambda$ or, equivalently, the amounts $\theta_t = \frac{W_t}{\gamma}\left(\underline{\underline{\sigma}}^\top\right)^{-1}\lambda$ in the risky assets. With the optimal investment strategy the wealth will evolve as
\[
dW_t = \dots\, dt + W_t \frac{1}{\gamma} \lambda^\top dz_t,
\]
cf. (6.13). An investor with labor income has a total wealth of $W_t + H(y, t)$. We conjecture that the investor will seek to invest such that the dynamics of total wealth is
\[
d\left(W_t + H(y_t, t)\right) = \dots\, dt + \left(W_t + H(y_t, t)\right) \frac{1}{\gamma} \lambda^\top dz_t.
\]
By Itô's Lemma, the dynamics of human wealth is
\[
dH(y_t, t) = \dots\, dt + H_y(y_t, t)\, y_t\, \xi^\top dz_t.
\]
So the dynamics of the optimally invested financial wealth must be given by
\[
\begin{aligned}
dW_t &= \dots\, dt + \left(W_t + H(y_t, t)\right) \frac{1}{\gamma} \lambda^\top dz_t - H_y(y_t, t)\, y_t\, \xi^\top dz_t \\
&= \dots\, dt + \left[\left(W_t + H(y_t, t)\right) \frac{1}{\gamma} \lambda - H_y(y_t, t)\, y_t\, \xi\right]^\top dz_t.
\end{aligned}
\]
This is the case for an investment strategy $\theta_t$ that satisfies
\[
\theta_t^\top \underline{\underline{\sigma}} = \left[\left(W_t + H(y_t, t)\right) \frac{1}{\gamma} \lambda - H_y(y_t, t)\, y_t\, \xi\right]^\top,
\]
i.e., the optimal amounts invested in the risky financial assets are given by the vector $\theta_t = \Theta(W_t, y_t, t)$, where
\[
\Theta(W, y, t) = \frac{1}{\gamma}\left(W + H(y, t)\right)\left(\underline{\underline{\sigma}}^\top\right)^{-1}\lambda - H_y(y, t)\, y \left(\underline{\underline{\sigma}}^\top\right)^{-1}\xi.
\]
Since $H_y(y, t) = H(y, t)/y$ under our assumptions, we can rewrite the optimal investment strategy as
\[
\Theta(W, y, t) = \frac{1}{\gamma} W \left(\underline{\underline{\sigma}}^\top\right)^{-1}\lambda + H(y, t)\left(\underline{\underline{\sigma}}^\top\right)^{-1}\left(\frac{1}{\gamma}\lambda - \xi\right). \tag{13.4}
\]
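The first term is identical to the optimal investment without labor income so that the second term represents the effect of labor income on the optimal investment strategy. The indirect utility function of the investor with constant relative risk aversion is
\[
J(W, y, t) = \frac{1}{1 - \gamma}\, g(t)^\gamma \left(W + H(y, t)\right)^{1 - \gamma}, \tag{13.5}
\]
where, exactly as in Chapter 6, $g(t)$ is given by
\[
g(t) = \frac{1}{A}\left(\varepsilon_1^{1/\gamma} + \left[\varepsilon_2^{1/\gamma} A - \varepsilon_1^{1/\gamma}\right] e^{-A(T - t)}\right) \tag{13.6}
\]
with
\[
A = \frac{\delta - r(1 - \gamma)}{\gamma} - \frac{1}{2}\frac{1 - \gamma}{\gamma^2}\|\lambda\|^2.
\]
The optimal consumption rate is $c^*_t = C(W_t, y_t, t)$, where
\[
C(W, y, t) = \varepsilon_1^{1/\gamma}\, \frac{W + H(y, t)}{g(t)} = \frac{A}{1 + \left[(\varepsilon_2/\varepsilon_1)^{1/\gamma} A - 1\right] e^{-A(T - t)}}\left(W + H(y, t)\right).
\]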
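Let us outline how to verify these findings. The indirect utility function is defined as
\[
J(W, y, t) = \sup_{(c_s, \theta_s)_{s \in [t,T]}} E_{W,y,t}\left[\varepsilon_1 \int_t^T e^{-\delta(s - t)} u(c_s)\, ds + \varepsilon_2 e^{-\delta(T - t)} u(W_T)\right],
\]
where $u(c) = c^{1 - \gamma}/(1 - \gamma)$. Given the dynamics of wealth and income, the associated HJB equation is
\[
\begin{aligned}
\delta J = \sup_{c, \theta}\Big\{ & \varepsilon_1 \frac{c^{1 - \gamma}}{1 - \gamma} + J_t + J_W\left(rW + \theta^\top \underline{\underline{\sigma}} \lambda + y - c\right) + \frac{1}{2} J_{WW}\, \theta^\top \underline{\underline{\sigma}}\, \underline{\underline{\sigma}}^\top \theta \\
& + J_y\, y\, \alpha + \frac{1}{2} J_{yy}\, y^2 \|\xi\|^2 + J_{Wy}\, y\, \theta^\top \underline{\underline{\sigma}}\, \xi \Big\}
\end{aligned}
\]
with the terminal condition $J(W, y, T) = \varepsilon_2 W^{1 - \gamma}/(1 - \gamma)$. The first-order conditions for the optimal controls imply that
\[
c = \varepsilon_1^{1/\gamma} J_W^{-1/\gamma}, \qquad
\theta = -\frac{J_W}{J_{WW}}\left(\underline{\underline{\sigma}}^\top\right)^{-1}\lambda - \frac{y J_{Wy}}{J_{WW}}\left(\underline{\underline{\sigma}}^\top\right)^{-1}\xi. \tag{13.7}
\]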
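As a hedged aside, the two first-order conditions can be reproduced in a few lines. The sketch below writes $\lambda$ for the market price of risk vector, $\xi$ for the income volatility vector, $\varepsilon_1$ for the utility weight on consumption, and $\sigma$ for the volatility matrix, which is assumed to be invertible; these symbol names are choices made for this illustration.

```latex
% Consumption: differentiate  \varepsilon_1 c^{1-\gamma}/(1-\gamma) - J_W c  with respect to c.
\begin{align*}
0 = \varepsilon_1 c^{-\gamma} - J_W
\quad\Longrightarrow\quad
c = \varepsilon_1^{1/\gamma} J_W^{-1/\gamma}.
\end{align*}
% Portfolio: the terms of the HJB equation involving \theta are
%   J_W \theta^\top \sigma \lambda
%   + (1/2) J_{WW} \theta^\top \sigma \sigma^\top \theta
%   + J_{Wy} y \theta^\top \sigma \xi .
\begin{align*}
0 &= J_W\, \sigma \lambda + J_{WW}\, \sigma \sigma^\top \theta + y J_{Wy}\, \sigma \xi \\
\Longrightarrow\quad
\theta &= -\frac{J_W}{J_{WW}} \left(\sigma^\top\right)^{-1} \lambda
          - \frac{y J_{Wy}}{J_{WW}} \left(\sigma^\top\right)^{-1} \xi .
\end{align*}
```

The second line follows by multiplying the first from the left by $(\sigma\sigma^\top)^{-1}$ and using $(\sigma\sigma^\top)^{-1}\sigma = (\sigma^\top)^{-1}$.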
Substituting these relations back into the HJB equation and removing the sup-operator, we arrive after some simplifications at the PDE
\[
\begin{aligned}
\delta J = {} & \varepsilon_1^{1/\gamma} \frac{\gamma}{1 - \gamma} J_W^{1 - \frac{1}{\gamma}} + J_t + rW J_W - \frac{1}{2}\frac{J_W^2}{J_{WW}}\|\lambda\|^2 + y J_W - \frac{1}{2}\frac{J_{Wy}^2}{J_{WW}}\, y^2 \|\xi\|^2 \\
& - \frac{J_W J_{Wy}}{J_{WW}}\, y\, \xi^\top \lambda + J_y\, y\, \alpha + \frac{1}{2} J_{yy}\, y^2 \|\xi\|^2,
\end{aligned}
\]
which extends (6.9) to the case with labor income. Then substitute in $J$ from (13.5) and the relevant partial derivatives. Rewrite the term $rW$ as $r(W + H(y, t)) - rH(y, t)$. There will now be two terms involving $(W + H)^{-\gamma - 1}$, but these terms cancel. There will be a number of terms involving $g^\gamma (W + H)^{-\gamma}$, but these cancel since
\[
H_t + \left(\alpha - \xi^\top \lambda\right) y H_y + \frac{1}{2}\|\xi\|^2 y^2 H_{yy} - rH + y = 0,
\]
cf. the PDE (13.2). By dividing all the remaining terms by $\frac{\gamma}{1 - \gamma}\, g(t)^{\gamma - 1}\left(W + H(y, t)\right)^{1 - \gamma}$, we arrive at
\[
g'(t) = \frac{1}{\gamma}\left(\delta + r(\gamma - 1) + \frac{1}{2}\frac{\gamma - 1}{\gamma}\|\lambda\|^2\right) g(t) - \varepsilon_1^{1/\gamma},
\]
and the associated terminal condition is $g(T) = \varepsilon_2^{1/\gamma}$. This is equivalent to the ODE (6.10) in the case without labor income. Hence, the solution for $g(t)$ is the same and therefore given by (13.6). The expressions for the optimal strategies follow from substituting (13.5) into the first-order conditions (13.7).
Note that we have expressed the investment strategy in terms of the amounts invested rather than in terms of portfolio weights. The reason is that portfolio weights are not suitable for the case where financial wealth does not stay strictly positive under all circumstances. The portfolio weights are undefined when financial wealth is zero and hard to relate to when financial wealth is negative. In the present case we can only be sure that the sum of financial wealth and human wealth stays positive, but the financial wealth by itself may very well be negative. For example, if you initially have no financial wealth but a sure future labor income, you will probably want to borrow funds in order to be able to consume goods right now.
If financial wealth is positive, we see from (13.4) that the optimal portfolio weights can be written as
\[
\Pi(W, y, t) = \frac{1}{\gamma}\left(\underline{\underline{\sigma}}^\top\right)^{-1}\lambda + \frac{H(y, t)}{W}\left(\underline{\underline{\sigma}}^\top\right)^{-1}\left(\frac{1}{\gamma}\lambda - \xi\right).
\]
With a single risky asset this reduces to
\[
\Pi(W, y, t) = \frac{\lambda}{\gamma \sigma} + \frac{H(y, t)}{W}\,\frac{1}{\sigma}\left(\frac{\lambda}{\gamma} - \xi\right).
\]
The human wealth is increasing in the horizon so the optimal portfolio weights will generally depend on the horizon of the investor. If $\lambda > \gamma \xi$, we see that the optimal portfolio weight will be increasing in the horizon. In that case it is optimal for investors to decrease their fraction of financial wealth invested in the stock market as they grow older. This is consistent with popular investment advice but not with the explanation that usually accompanies the advice, cf. the discussion in Section 6.5. And note that if $\lambda < \gamma \xi$, we get the opposite conclusion.
Let us again consider a numerical example with a single risky asset and market parameters $r = 2\%$, $\sigma = 20\%$, $\lambda = 0.3$. The individual has constant relative risk aversion with a time preference rate of $\delta = 3\%$ and an income process with an expected growth rate of $\alpha = 4\%$ and a volatility of $|\xi| = 10\%$. Table 13.3 illustrates the optimal strategies for the case with perfect positive correlation for the four combinations of a risk aversion of 2 or 10 and a time horizon of 10 or 30 years. For each combination the table shows the optimal fraction of financial wealth invested in the stock and in cash as well as the optimal consumption-to-income ratio for various values of the wealth-to-income ratio. When the wealth-to-income ratio is very high, human wealth becomes unimportant and the optimal portfolio is close to the one without any labor income, which is a 75-25 split between stocks and cash for a risk aversion of 2 and a 15-85 split for a risk aversion of 10. With lower and more reasonable wealth-to-income ratios, the optimal portfolios are very different from the no-income case. For $\gamma = 2$, the inequality $\lambda > \gamma \xi$ is satisfied so that the optimal stock investment is higher than without income and increasing in the time horizon. With $\gamma = 10$, the inequality is reversed, leading to a lower stock investment which decreases with the horizon. For low wealth-to-income ratios the optimal portfolios are in all cases extreme with either substantial borrowing or substantial short-selling of the stock.
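The entries of Table 13.3 can be recomputed directly from the closed-form expressions for the portfolio weight, the function g(t), and the consumption rule. The sketch below does this for the single-asset case. It assumes utility weights of 1 on both consumption and terminal wealth (written `EPS1` and `EPS2`); these weights are not stated in the example and are an assumption of this sketch, chosen because they reproduce the tabulated consumption ratios. All function and variable names are illustrative.

```python
import math

R, SIGMA, LAM = 0.02, 0.20, 0.30   # risk-free rate, stock volatility, Sharpe ratio
DELTA, ALPHA = 0.03, 0.04          # time preference rate, expected income growth
EPS1 = EPS2 = 1.0                  # ASSUMPTION: utility weights, not given in the text

def multiplier(horizon, xi):
    """Income multiplier M = H/y from (13.3); xi is the signed income volatility."""
    kappa = R - ALPHA + xi * LAM
    if abs(kappa) < 1e-12:
        return horizon
    return (1.0 - math.exp(-kappa * horizon)) / kappa

def g(horizon, gamma):
    """The function g from (13.6), evaluated at remaining horizon T - t."""
    a = (DELTA - R * (1 - gamma)) / gamma - 0.5 * (1 - gamma) / gamma**2 * LAM**2
    e1, e2 = EPS1 ** (1 / gamma), EPS2 ** (1 / gamma)
    return (e1 + (e2 * a - e1) * math.exp(-a * horizon)) / a

def strategy(w_over_y, horizon, gamma, xi):
    """Return (stock weight, cash weight, consumption-to-income ratio)."""
    m = multiplier(horizon, xi)
    # Single-asset portfolio weight: speculative term plus income-hedge term.
    stock = LAM / (gamma * SIGMA) + (m / w_over_y) * (LAM / gamma - xi) / SIGMA
    c_over_y = EPS1 ** (1 / gamma) * (w_over_y + m) / g(horizon, gamma)
    return stock, 1.0 - stock, c_over_y

for gamma in (2, 10):
    for horizon in (10, 30):
        stock, cash, c = strategy(1.0, horizon, gamma, xi=0.1)
        print(f"gamma={gamma:2d}, T-t={horizon}: stock={stock:.4f}, cash={cash:.4f}, c/y={c:.4f}")
```

Passing `xi=-0.1` instead gives the perfectly negatively correlated case, where both the multiplier and the hedge term are larger.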
In Table 13.4 we fix $\gamma = 2$ and $T - t = 30$ and compare the optimal strategies for an income-asset correlation of $+1$ (left panel) and $-1$ (right panel). The optimal portfolio is even more extreme with a negative correlation, both because the human wealth is larger and because the hedge term is much larger. Basically, the individual can take on much more financial risk since the income process provides an implicit hedge.

$\gamma = 2$ (for $T - t = 10$: $M = 9.52$; for $T - t = 30$: $M = 25.92$)

| W/y | stock, T−t=10 | cash, T−t=10 | c/y, T−t=10 | stock, T−t=30 | cash, T−t=30 | c/y, T−t=30 |
|---|---|---|---|---|---|---|
| 0.2 | 12.6453 | -11.6453 | 1.0696 | 33.1477 | -32.1477 | 1.4023 |
| 0.6 | 4.7151 | -3.7151 | 1.1136 | 11.5492 | -10.5492 | 1.4238 |
| 1 | 3.1291 | -2.1291 | 1.1577 | 7.2295 | -6.2295 | 1.4453 |
| 2 | 1.9395 | -0.9395 | 1.2678 | 3.9898 | -2.9898 | 1.4990 |
| 5 | 1.2258 | -0.2258 | 1.5980 | 2.0459 | -1.0459 | 1.6600 |
| 10 | 0.9879 | 0.0121 | 2.1484 | 1.3980 | -0.3980 | 1.9285 |
| 50 | 0.7976 | 0.2024 | 6.5518 | 0.8796 | 0.1204 | 4.0761 |
| 1000 | 0.7524 | 0.2476 | 111.1318 | 0.7565 | 0.2435 | 55.0825 |

$\gamma = 10$ (for $T - t = 10$: $M = 9.52$; for $T - t = 30$: $M = 25.92$)

| W/y | stock, T−t=10 | cash, T−t=10 | c/y, T−t=10 | stock, T−t=30 | cash, T−t=30 | c/y, T−t=30 |
|---|---|---|---|---|---|---|
| 0.2 | -16.5035 | 17.5035 | 1.0096 | -45.2068 | 46.2068 | 1.2112 |
| 0.6 | -5.4012 | 6.4012 | 1.0511 | -14.9689 | 15.9689 | 1.2298 |
| 1 | -3.1807 | 4.1807 | 1.0927 | -8.9214 | 9.9214 | 1.2483 |
| 2 | -1.5153 | 2.5153 | 1.1966 | -4.3857 | 5.3857 | 1.2947 |
| 5 | -0.5161 | 1.5161 | 1.5083 | -1.6643 | 2.6643 | 1.4338 |
| 10 | -0.1831 | 1.1831 | 2.0278 | -0.7571 | 1.7571 | 1.6657 |
| 50 | 0.0834 | 0.9166 | 6.1840 | -0.0314 | 1.0314 | 3.5207 |
| 1000 | 0.1467 | 0.8533 | 104.8929 | 0.1409 | 0.8591 | 47.5774 |

Table 13.3: Optimal strategies with positive income-asset correlation. The table shows how the optimal strategies vary with the wealth-to-income ratio $W/y$ for different combinations of the risk aversion coefficient $\gamma$ and the time horizon $T - t$. The numbers in the columns labeled stock and cash show the fractions of current financial wealth optimally invested in the stock and in cash (the bank account), respectively. The numbers in the column labeled c/y show the optimal consumption-to-income ratio. The income is assumed to be perfectly positively correlated with the stock price.

| W/y | stock, positive correlation (M = 25.92) | cash, positive correlation | c/y, positive correlation | stock, negative correlation (M = 69.63) | cash, negative correlation | c/y, negative correlation |
|---|---|---|---|---|---|---|
| 0.2 | 33.1477 | -32.1477 | 1.4023 | 435.9611 | -434.9611 | 3.7494 |
| 0.6 | 11.5492 | -10.5492 | 1.4238 | 145.8204 | -144.8204 | 3.7709 |
| 1 | 7.2295 | -6.2295 | 1.4453 | 87.7922 | -86.7922 | 3.7924 |
| 2 | 3.9898 | -2.9898 | 1.4990 | 44.2711 | -43.2711 | 3.8461 |
| 5 | 2.0459 | -1.0459 | 1.6600 | 18.1584 | -17.1584 | 4.0072 |
| 10 | 1.3980 | -0.3980 | 1.9285 | 9.4542 | -8.4542 | 4.2756 |
| 50 | 0.8796 | 0.1204 | 4.0761 | 2.4908 | -1.4908 | 6.4233 |
| 1000 | 0.7565 | 0.2435 | 55.0825 | 0.8370 | 0.1630 | 57.4297 |

Table 13.4: Optimal strategies with negative income-asset correlation. The table shows how the optimal strategies vary with the wealth-to-income ratio $W/y$ for a risk aversion of 2 and a time horizon of 30 years. The left [right] side of the table is for the case where the income and the stock price are perfectly positively [negatively] correlated. The numbers in the columns labeled stock and cash show the fractions of current financial wealth optimally invested in the stock and in cash (the bank account), respectively. The numbers in the column labeled c/y show the optimal consumption-to-income ratio.
It is not without loss of generality to assume a single risky asset. To obtain a spanned income with a single asset, it is clear that the income has to be perfectly correlated with the price of that asset. Perfect correlation is certainly unrealistic. If we add further risky assets, the income does not have to be perfectly correlated with any individual asset. Suppose $n$ risky assets are traded and assume that (i) the prices of any two risky assets have the same correlation given by $\rho_{PP}$, (ii) all the risky assets have the same volatility, and (iii) all assets have the same correlation $\rho_{Py}$ with the income rate. Then it can be shown that the income risk is spanned if the condition
\[
\rho_{Py}^2 = \rho_{PP} + \frac{1 - \rho_{PP}}{n}
\]
is satisfied. Clearly, this is decreasing in $n$. With many risky assets traded, the required correlation between the income process and each individual asset can be quite small.
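To get a feel for how quickly the required correlation falls with the number of assets, the condition can be evaluated numerically. The following is a small illustrative sketch; the function name and the chosen values of the asset-asset correlation are hypothetical and only serve as examples.

```python
import math

def required_income_correlation(n_assets, rho_pp):
    """Magnitude of the income-asset correlation implied by the spanning condition
    rho_Py^2 = rho_PP + (1 - rho_PP) / n for n equicorrelated assets."""
    squared = rho_pp + (1.0 - rho_pp) / n_assets
    if not 0.0 <= squared <= 1.0:
        raise ValueError("rho_PP is not admissible for this number of assets")
    return math.sqrt(squared)

for rho_pp in (0.0, 0.3):                      # hypothetical asset-asset correlations
    row = [round(required_income_correlation(n, rho_pp), 3) for n in (1, 2, 10, 100)]
    print(f"rho_PP = {rho_pp}: required |rho_Py| for n = 1, 2, 10, 100 -> {row}")
```

With a single asset the function returns 1, the perfect-correlation case discussed above. As the number of assets grows, the required correlation approaches the square root of the asset-asset correlation, so it becomes small only when the assets themselves are weakly correlated.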
Moreover, the labor income of a given individual may not be significantly correlated with the overall stock market, but highly correlated with a specific stock. One could imagine that the labor income of an employee of a corporation was positively correlated with the price of the company's stocks and maybe also with stock prices of other companies in the same industry. If this is true, the labor income will to some extent replace a financial investment in these stocks. Consequently, the individual should invest less of her financial wealth in these stocks. Following this line of thought, a pension fund with members in a given industry should perhaps underinvest in the stocks of the corporations in which the members work, simply to give the members a better diversified total position. The horror example is the case of the pension fund of Enron employees, which had 58% of the total fund invested in Enron stocks prior to the 98.8% drop in the Enron stock price in 2001. Not only did Enron employees lose their jobs, they also lost a major part of their pension savings.
13.3.3 Stochastic interest rates
Stochastic interest rates are the main source of shifts in the investment opportunity set, and the effect of interest rate uncertainty on the optimal strategies of an investor without labor income is by now relatively well-studied in the literature, cf. Chapter 10. In order to analyze how individuals should allocate their funds to various asset classes, e.g., cash, bonds, and stocks, it is important to combine stochastic interest rates and labor income. The relative allocation to bonds and stocks can be significantly affected by the presence of uncertain labor income for several reasons. First, bonds and stocks can be differently correlated with labor income shocks so that bonds may be better for hedging income rate shocks than stocks or vice versa. Second, risk-averse investors want to hedge total wealth against shifts in investment opportunities. When the short-term interest rate captures the investment opportunities, the appropriate asset for this hedging motive is the bond. Third, since human wealth is defined as the discounted value of the future income stream, it will in general be sensitive to the interest rate level like a bond and, hence, the income stream represents an implicit investment in a bond, so that the explicit bond investment is reduced. Moreover, the expected growth rate and variability of labor income may themselves vary over the business cycle, which we can approximate by the level of interest rates. Such dependencies between income and interest rates will also affect the asset allocation decision. These issues are formalized by Munk and Sørensen (2010), to which the reader is referred.
13.4 Exogenous income in incomplete markets
As seen in the earlier numerical examples, the optimal strategy outlined above may involve extensive borrowing by young investors who anticipate high future income rates. In practice, investors cannot actually sell their future income stream as slavery is forbidden these days. Moreover, young investors will find it extremely difficult to borrow substantial amounts for risky stock investments putting up only anticipated future income as implicit collateral or the acquired stocks as explicit collateral. This can be explained by the moral hazard and adverse selection features of labor income. In reality the income rate is not exogenously given, but reflects the abilities and the effort of the investor.

Some models take these problems partially into account by still assuming an exogenous income process, but restricting the agent to consumption and investment strategies that have the property that financial wealth $W_t$ always stays positive. The future income stream will then have a lower value than in the unrestricted, complete market case. See Duffie and Zariphopoulou (1993), Duffie, Fleming, Soner, and Zariphopoulou (1997), Koo (1998), and Munk (2000). For example, Duffie, Fleming, Soner, and Zariphopoulou (1997) and Munk (2000) study the case with a single risky asset with price process
\[
dP_t = P_t\left[\mu\, dt + \sigma\, dz_t\right],
\]
constant $r$, $\mu$, and $\sigma$, and where the income rate follows the geometric Brownian motion
\[
dy_t = y_t\left[\alpha\, dt + \rho \sigma_y\, dz_t + \sqrt{1 - \rho^2}\, \sigma_y\, d\hat z_t\right].
\]
Here $\rho$ is the correlation between the asset price and the labor income. The agent must keep financial wealth positive, $W_t > 0$, so that she faces a liquidity constraint. Furthermore, she faces undiversifiable income risk. The numerical results of Munk (2000) show that the implicit value the agent associates with her income stream can be considerably less than without the liquidity constraint and the undiversifiable part of the income risk, especially if she has a high preference for current consumption and a low current financial wealth. The results indicate that the reduction in human wealth is mainly due to the liquidity constraint, while the undiversifiability is of minor importance.
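The role of the two shocks in the income process can be illustrated by simulation. The sketch below draws exact log-changes of the asset price and the income rate over short time steps, with income driven partly by the asset's shock and partly by an independent shock, and checks that the sample correlation is close to the chosen correlation parameter. All parameter values (drifts, volatilities, a correlation of 0.4, monthly steps) and all names are hypothetical choices for illustration; they are not taken from the cited papers.

```python
import math
import random

def simulate_log_changes(n, dt, mu, sigma, alpha, sigma_y, rho, seed=0):
    """Simulate n pairs of log-changes (asset price, income rate) over steps of length dt.
    The income shock loads rho on the asset's Brownian increment and
    sqrt(1 - rho^2) on an independent (unspanned) increment."""
    rng = random.Random(seed)
    root_dt = math.sqrt(dt)
    pairs = []
    for _ in range(n):
        dz = rng.gauss(0.0, root_dt)        # shock to the traded asset
        dz_hat = rng.gauss(0.0, root_dt)    # independent, unhedgeable shock
        d_log_p = (mu - 0.5 * sigma**2) * dt + sigma * dz
        d_log_y = ((alpha - 0.5 * sigma_y**2) * dt
                   + sigma_y * (rho * dz + math.sqrt(1.0 - rho**2) * dz_hat))
        pairs.append((d_log_p, d_log_y))
    return pairs

def sample_correlation(pairs):
    """Plain sample correlation coefficient of a list of (x, y) pairs."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    var_x = sum((x - mean_x) ** 2 for x, _ in pairs)
    var_y = sum((y - mean_y) ** 2 for _, y in pairs)
    return cov / math.sqrt(var_x * var_y)

pairs = simulate_log_changes(20_000, 1 / 12, mu=0.08, sigma=0.2,
                             alpha=0.04, sigma_y=0.1, rho=0.4, seed=1)
print(f"sample correlation: {sample_correlation(pairs):.3f}")
```

Setting `rho` to 1 or -1 removes the independent shock and recovers the spanned (complete market) case of the previous section.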
A few papers find closed-form solutions in settings with unspanned (undiversifiable) income risk, but have to assume negative exponential utility and a Gaussian income process. Svensson and Werner (1993) solve for the optimal consumption and portfolio strategies in an infinite time horizon setting, whereas Henderson (2005) assumes a finite horizon and utility of terminal wealth only. Henderson (2005) also finds near-explicit solutions for more general income processes. Duffie and Jackson (1990) and Teplá (2000) derive similar solutions for investors receiving an unspanned income only at the terminal date. Christensen, Larsen, and Munk (2012) derive optimal consumption and portfolio strategies with an unspanned income stream and a finite time horizon.

Explicit solutions have only been found in the following special cases involving CARA (negative exponential) utility, a normally distributed income stream, a constant risk-free rate, and a constant drift and volatility of the stock price. Svensson and Werner (1993) and Wang (2006) consider infinite time horizon settings where a transversality condition has to be imposed on the utility maximization problem. The models of Svensson and Werner (1993) and Wang (2006) differ slightly with respect to the specification of the income process. Furthermore, in the model of Wang (2006) only a risk-free asset is traded, whereas Svensson and Werner (1993) allow for risky assets. In similar settings, Wang (2004, 2009) investigates the impact of unobservable or partially observable income growth on consumption and investment decisions. Henderson (2005) assumes a finite time horizon with utility of terminal wealth only, and she also derives near-explicit solutions for more general income processes. Duffie and Jackson (1990) and Teplá (2000) derive similar solutions for investors receiving an unspanned income only at the terminal date. Christensen, Larsen, and Munk (2012) generalize Henderson's explicit solution to the case of consumption over a finite lifetime. The transversality condition in the infinite horizon model mentioned above restricts the rate at which the debt of the investor can grow, but does not force the investor to ever pay back her debt and can thus lead to excessive borrowing compared to the more realistic finite horizon setting. In contrast, in their finite horizon model, Christensen, Larsen, and Munk ensure that the debt of the investor equals zero at the end of the horizon.

It seems difficult, if not impossible, to move beyond the assumptions of negative exponential utility and a Gaussian income process and still obtain closed-form solutions to the investor's utility maximization problem with unspanned income risk. Several recent papers have numerically solved for optimal consumption and portfolio strategies in more general settings, e.g., Cocco, Gomes, and Maenhout (2005), Koijen, Nijman, and Werker (2010), Lynch and Tan (2011), Munk and Sørensen (2010), Van Hemert (2010), and Viceira (2001).

For the case with stochastic interest rates, the reader is again referred to Munk and Sørensen (2010). Also see Van Hemert (2010).
13.5 Endogenous labor supply and income
13.5.1 The model and the solution
Bodie, Merton, and Samuelson (1992) endogenize the labor supply decision of the agent. Let us look at a version of their model. Let $\omega_t$ denote the wage rate, which is assumed to follow the geometric Brownian motion
\[
d\omega_t = \omega_t\left[m\, dt + v^\top dz_t\right].
\]
In particular, the wage rate is spanned by the financial securities traded. Let $\varphi_t \in [0, 1]$ denote the fraction of time working so that the total labor income over the interval $[t, t + dt]$ is $\varphi_t \omega_t\, dt$. Equivalently, we can let $l_t \equiv 1 - \varphi_t$ denote the fraction of time not working and think of the agent receiving $\omega_t$ and then paying $l_t \omega_t$ on the consumption good "leisure". The wage rate $\omega_t$ is the unit price of leisure measured in units of the consumption good. Assuming a constant interest rate and a constant market price of risk, the wealth of the investor will then follow
\[
\begin{aligned}
dW_t &= \left(rW_t + \theta_t^\top \underline{\underline{\sigma}} \lambda - c_t + \varphi_t \omega_t\right) dt + \theta_t^\top \underline{\underline{\sigma}}\, dz_t \\
&= \left(rW_t + \theta_t^\top \underline{\underline{\sigma}} \lambda + \omega_t - c_t - l_t \omega_t\right) dt + \theta_t^\top \underline{\underline{\sigma}}\, dz_t.
\end{aligned}
\]
For tractability assume a Cobb-Douglas type utility of consumption and leisure,
\[
u(c, l) = \frac{1}{1 - \gamma}\left(c^\beta l^{1 - \beta}\right)^{1 - \gamma},
\]
where $\beta$ is a constant between 0 and 1 determining the relative weights of consumption and leisure, and we can interpret $\gamma > 0$ as the coefficient of risk aversion with respect to the composite consumption $c^\beta l^{1 - \beta}$. We ignore utility of terminal wealth and define the indirect utility function as
\[
J(W, \omega, t) = \sup_{(c_s, \theta_s, l_s)_{s \in [t,T]}} E_{W,\omega,t}\left[\int_t^T e^{-\delta(s - t)} \frac{1}{1 - \gamma}\left(c_s^\beta l_s^{1 - \beta}\right)^{1 - \gamma} ds\right],
\]
where the supremum is taken over all non-negative consumption strategies $c$, all investment strategies $\theta$, and all labor-leisure strategies $l$ valued in $[0, 1]$.
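We demonstrate below that the indirect utility function is given in closed form by
\[
J(W, \omega, t) = \frac{1}{1 - \gamma}\, \beta^{\beta(1 - \gamma)} (1 - \beta)^{(1 - \beta)(1 - \gamma)}\, G(t)^\gamma\, \omega^{-(1 - \beta)(1 - \gamma)} \left(W + \omega F(t)\right)^{1 - \gamma}, \tag{13.8}
\]
where
\[
G(t) = \frac{1}{k}\left(1 - e^{-k(T - t)}\right), \tag{13.9}
\]
\[
F(t) = \frac{1}{r - m + v^\top \lambda}\left(1 - e^{-(r - m + v^\top \lambda)(T - t)}\right),
\]
\[
k = \frac{\delta}{\gamma} - r\,\frac{1 - \gamma}{\gamma} - \frac{1}{2}\frac{1 - \gamma}{\gamma^2}\lambda^\top \lambda + \frac{1 - \gamma}{\gamma}(1 - \beta)\left[m + \frac{1 - \gamma}{\gamma} v^\top \lambda - \frac{1}{2\gamma}\left(1 - \beta(1 - \gamma)\right) v^\top v\right].
\]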
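A quick consistency check, offered here as a side remark rather than as part of the original argument, is to let the weight on consumption in the Cobb-Douglas utility (written $\beta$ below) tend to one, so that leisure drops out of the utility function. Writing $\gamma$ for the risk aversion coefficient, $\delta$ for the time preference rate, and $\lambda$ for the market price of risk vector, the constant $k$ should then collapse to the constant $A$ of the exogenous-income case, and $G$ should coincide with $g$ in (13.6) when there is unit weight on consumption and no utility of terminal wealth.

```latex
% Limit beta -> 1 of the constant k: the whole bracket multiplied by (1 - beta) vanishes.
\begin{align*}
k \;\xrightarrow[\beta \to 1]{}\;
  \frac{\delta}{\gamma} - r\,\frac{1-\gamma}{\gamma}
  - \frac{1}{2}\,\frac{1-\gamma}{\gamma^{2}}\,\lambda^{\top}\lambda
  \;=\; \frac{\delta - r(1-\gamma)}{\gamma}
  - \frac{1}{2}\,\frac{1-\gamma}{\gamma^{2}}\,\|\lambda\|^{2}
  \;=\; A .
\end{align*}
% With weights (1, 0) on consumption and terminal wealth, (13.6) reads
\begin{align*}
g(t) = \frac{1}{A}\Bigl(1 + \bigl[0 \cdot A - 1\bigr] e^{-A(T-t)}\Bigr)
     = \frac{1}{A}\Bigl(1 - e^{-A(T-t)}\Bigr),
\end{align*}
% which has the same form as G(t) = (1/k)(1 - e^{-k(T-t)}) evaluated at k = A.
```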
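The optimal strategies are $c^*_t = C(W_t, \omega_t, t)$, $l^*_t = L(W_t, \omega_t, t)$, and $\theta^*_t = \Theta(W_t, \omega_t, t)$, where
\[
C(W, \omega, t) = \frac{\beta}{G(t)}\left(W + \omega F(t)\right), \tag{13.10}
\]
\[
L(W, \omega, t) = \frac{1 - \beta}{G(t)}\,\frac{W + \omega F(t)}{\omega}, \tag{13.11}
\]
\[
\begin{aligned}
\Theta(W, \omega, t) = {} & \frac{1}{\gamma}\left(W + \omega F(t)\right)\left(\underline{\underline{\sigma}}^\top\right)^{-1}\lambda - \omega F(t)\left(\underline{\underline{\sigma}}^\top\right)^{-1} v \\
& - \frac{(1 - \beta)(1 - \gamma)}{\gamma}\left(W + \omega F(t)\right)\left(\underline{\underline{\sigma}}^\top\right)^{-1} v.
\end{aligned} \tag{13.12}
\]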
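Here $\omega_t F(t)$ denotes the time $t$ value of the maximum labor income that the agent can receive. To see this, note that the future wage rate is
\[
\omega_s = \omega_t \exp\left\{\left(m - \frac{1}{2} v^\top v\right)(s - t) + v^\top (z_s - z_t)\right\}.
\]
Working at a maximum rate, $\varphi_s \equiv 1$ for all $s \in [t, T]$, the time $t$ value of future labor income is
\[
\begin{aligned}
& E_t\left[\int_t^T \exp\left\{-r(s - t) - \lambda^\top [z_s - z_t] - \frac{1}{2}\|\lambda\|^2 (s - t)\right\} \omega_s\, ds\right] \\
&\quad = \omega_t \int_t^T E_t\left[\exp\left\{\left(m - r - \frac{1}{2}\|\lambda\|^2 - \frac{1}{2}\|v\|^2\right)(s - t) + (v - \lambda)^\top (z_s - z_t)\right\}\right] ds \\
&\quad = \omega_t \int_t^T e^{(m - r - v^\top \lambda)(s - t)}\, ds \\
&\quad = \omega_t F(t).
\end{aligned}
\]
This solution is only valid if the leisure strategy $L(W, \omega, t)$ stated above is always valued in $[0, 1]$, which is not necessarily the case.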
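The closing caveat is easy to check numerically for given parameters. The sketch below evaluates the candidate strategies (13.10)-(13.12) for a single risky asset and reports whether the implied leisure fraction lies in [0, 1]. The expression used for the constant k is the one reconstructed from the verification argument and should be read as an assumption of this sketch, as should all parameter values; the function names are illustrative. In the code, `wage` is the current wage rate, `m` and `v` its drift and volatility, `beta` the utility weight on consumption, and `lam` the Sharpe ratio.

```python
import math

def annuity(rate, horizon):
    """(1 - exp(-rate * horizon)) / rate, with the limit `horizon` when rate is 0."""
    if abs(rate) < 1e-12:
        return horizon
    return (1.0 - math.exp(-rate * horizon)) / rate

def flexible_strategy(W, wage, horizon, r, sigma, lam, m, v, gamma, beta, delta):
    """Candidate optimal (consumption, leisure, stock amount, leisure_in_range)
    for an agent with flexible labor supply and one risky asset."""
    a = (1 - beta) * (1 - gamma)
    # ASSUMPTION: reconstructed constant k (see the lead-in above).
    k = (delta / gamma - r * (1 - gamma) / gamma
         - 0.5 * (1 - gamma) / gamma**2 * lam**2
         + a / gamma * (m + (1 - gamma) / gamma * v * lam
                        - 0.5 / gamma * (1 - beta * (1 - gamma)) * v**2))
    G = annuity(k, horizon)
    F = annuity(r - m + v * lam, horizon)
    total = W + wage * F                    # financial wealth plus maximal human wealth
    consumption = beta * total / G
    leisure = (1 - beta) * total / (G * wage)
    stock = (total * lam / (gamma * sigma)  # speculative demand on total wealth
             - wage * F * v / sigma         # hedge of wage risk in human wealth
             - a / gamma * total * v / sigma)
    return consumption, leisure, stock, 0.0 <= leisure <= 1.0

c, l, theta, ok = flexible_strategy(W=0.0, wage=1.0, horizon=30.0, r=0.02, sigma=0.2,
                                    lam=0.3, m=0.02, v=0.0, gamma=2.0, beta=0.6, delta=0.03)
print(f"consumption={c:.4f}, leisure={l:.4f}, stock amount={theta:.4f}, valid={ok}")
```

For a wealthy agent facing a low wage, the same function returns a leisure fraction above one, which signals that the unconstrained solution is not valid for those inputs.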
13.5.2 Verifying the solution
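Again we attack the problem by solving the associated HJB equation:
\[
\begin{aligned}
\delta J = \sup_{c, \theta, l}\Big\{ & \frac{1}{1 - \gamma}\, c^{\beta(1 - \gamma)} l^{(1 - \beta)(1 - \gamma)} + J_t + J_\omega\, \omega m + \frac{1}{2} J_{\omega\omega}\, \omega^2 \|v\|^2 \\
& + J_W\left(rW + \theta^\top \underline{\underline{\sigma}} \lambda + \omega - c - l\omega\right) + \frac{1}{2} J_{WW}\, \theta^\top \underline{\underline{\sigma}}\, \underline{\underline{\sigma}}^\top \theta + J_{W\omega}\, \omega\, \theta^\top \underline{\underline{\sigma}}\, v \Big\}.
\end{aligned}
\]
The first-order conditions for $c$ and $l$ are
\[
\beta\, c^{\beta(1 - \gamma) - 1} l^{(1 - \beta)(1 - \gamma)} = J_W,
\]
\[
(1 - \beta)\, c^{\beta(1 - \gamma)} l^{(1 - \beta)(1 - \gamma) - 1} = \omega J_W,
\]
which imply the simple relation
\[
c = \frac{\beta}{1 - \beta}\, \omega\, l
\]
between the optimal consumption rate and the optimal leisure rate. This relation ensures that the ratio of (i) the marginal utility with respect to leisure and (ii) the marginal utility with respect to consumption will equal the relative price $\omega$. Solving the two first-order conditions, we find that
\[
c = \beta^{1 + \frac{\beta(1 - \gamma)}{\gamma}} (1 - \beta)^{\frac{(1 - \beta)(1 - \gamma)}{\gamma}}\, \omega^{-\frac{(1 - \beta)(1 - \gamma)}{\gamma}}\, J_W^{-1/\gamma},
\]
\[
l = \beta^{\frac{\beta(1 - \gamma)}{\gamma}} (1 - \beta)^{\frac{1 - \beta(1 - \gamma)}{\gamma}}\, \omega^{-\frac{1 - \beta(1 - \gamma)}{\gamma}}\, J_W^{-1/\gamma}.
\]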
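Inserting the derivative of the candidate indirect utility function (13.8), these expressions will give (13.10) and (13.11). The first-order condition with respect to $\theta$ implies that
\[
\theta = -\frac{J_W}{J_{WW}}\left(\underline{\underline{\sigma}}^\top\right)^{-1}\lambda - \frac{\omega J_{W\omega}}{J_{WW}}\left(\underline{\underline{\sigma}}^\top\right)^{-1} v.
\]
Applying our candidate for $J$, we have
\[
-\frac{J_W}{J_{WW}} = \frac{1}{\gamma}\left(W + \omega F(t)\right), \qquad
\frac{J_{W\omega}}{J_{WW}} = F(t) + \frac{(1 - \beta)(1 - \gamma)}{\gamma}\,\frac{W + \omega F(t)}{\omega},
\]
and we obtain (13.12).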
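Substituting the maximizing values for $c$, $l$, and $\theta$ into the HJB equation and deleting the sup-operator, we arrive after some simplifications at the PDE
\[
\begin{aligned}
\delta J = {} & \frac{\gamma}{1 - \gamma}\, \beta^{\frac{\beta(1 - \gamma)}{\gamma}} (1 - \beta)^{\frac{(1 - \beta)(1 - \gamma)}{\gamma}}\, \omega^{-\frac{(1 - \beta)(1 - \gamma)}{\gamma}}\, J_W^{1 - \frac{1}{\gamma}} + J_t + J_\omega\, \omega m + \frac{1}{2} J_{\omega\omega}\, \omega^2 \|v\|^2 \\
& + r\left(W + \omega F(t)\right) J_W - r\omega F(t) J_W + \omega J_W - \frac{1}{2}\frac{J_W^2}{J_{WW}}\|\lambda\|^2 - \frac{1}{2}\frac{J_{W\omega}^2}{J_{WW}}\, \omega^2 \|v\|^2 - \frac{J_W J_{W\omega}}{J_{WW}}\, \omega\, v^\top \lambda.
\end{aligned}
\]
It remains to verify that our candidate (13.8) satisfies this PDE. Substituting the relevant derivatives into the PDE, we get a lot of terms. They all involve $(W + \omega F(t))$ raised either to the power $1 - \gamma$, the power $-\gamma$, or the power $-\gamma - 1$. First observe that the terms with the power $-\gamma - 1$ cancel. Next note that the terms with the power $-\gamma$ cancel due to the fact that the function $F(t)$ satisfies the ordinary differential equation $F'(t) - [r - m + v^\top \lambda] F(t) + 1 = 0$. Then only the terms involving $(W + \omega F(t))^{1 - \gamma}$ are left. Dividing through by $\frac{\gamma}{1 - \gamma}\, \beta^{\beta(1 - \gamma)} (1 - \beta)^{(1 - \beta)(1 - \gamma)}\, G(t)^{\gamma - 1}\, \omega^{-(1 - \beta)(1 - \gamma)} \left(W + \omega F(t)\right)^{1 - \gamma}$, we end up with the equation $G'(t) = kG(t) - 1$, which with the terminal condition $G(T) = 0$ has the solution stated in (13.9).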
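13.5.3 Inflexible labor supply
To study the effect of labor supply flexibility on optimal investments, let us look at an agent who once and for all fixes a constant labor supply rate $\bar\varphi \equiv 1 - \bar l$. For a given supply $\bar\varphi$, the agent finds the optimal consumption and investment strategies by solving the optimization problem
\[
\begin{aligned}
J(W, \omega, t; \bar\varphi) &= \sup_{(c, \theta)} E_{W,\omega,t}\left[\int_t^T e^{-\delta(s - t)} \frac{1}{1 - \gamma}\left(c_s^\beta (1 - \bar\varphi)^{1 - \beta}\right)^{1 - \gamma} ds\right] \\
&= (1 - \bar\varphi)^{(1 - \beta)(1 - \gamma)} \sup_{(c, \theta)} E_{W,\omega,t}\left[\int_t^T e^{-\delta(s - t)} \frac{1}{1 - \gamma}\, c_s^{\beta(1 - \gamma)}\, ds\right].
\end{aligned}
\]
The supremum in the last expression equals the indirect utility of an investor with a constant relative risk aversion of $1 - \beta(1 - \gamma)$ and an exogenously given labor income at the rate $y_t = \bar\varphi \omega_t$. Clearly the present value of future labor income will be $H(y_t, t) = \bar\varphi \omega_t F(t)$, where $F(t)$ is given above. Using the previously derived results for the case with exogenous income, we get
\[
J(W, \omega, t; \bar\varphi) = \frac{1}{1 - \gamma}(1 - \bar\varphi)^{(1 - \beta)(1 - \gamma)}\, g(t)^{1 - \beta(1 - \gamma)} \left(W + \bar\varphi \omega F(t)\right)^{\beta(1 - \gamma)},
\]
where $g(t)$ is given by (13.6). The optimal investment strategy for a given $\bar\varphi$ is given by
\[
\Theta(W, \omega, t; \bar\varphi) = \frac{1}{1 - \beta(1 - \gamma)}\left(W + \bar\varphi \omega F(t)\right)\left(\underline{\underline{\sigma}}^\top\right)^{-1}\lambda - \bar\varphi \omega F(t)\left(\underline{\underline{\sigma}}^\top\right)^{-1} v.
\]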
The optimal value of $\bar\varphi$ is found by maximizing $J(W_0, \omega_0, 0; \bar\varphi)$ and turns out to be
\[
\bar\varphi = \beta - \frac{(1 - \beta) W_0}{\omega_0 F(0)}.
\]
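The closed-form labor supply can be cross-checked by brute force. The indirect utility for a fixed supply depends on the supply only through the product of the leisure term and the total-wealth term, so it suffices to maximize $(1 - \bar\varphi)^{1 - \beta}\,(W_0 + \bar\varphi\,\omega_0 F(0))^{\beta}$, which is an increasing transformation of the indirect utility whether the risk aversion coefficient is below or above one. The sketch below compares the formula with a grid search; the input values and function names are hypothetical, and $\beta$ denotes the utility weight on consumption.

```python
def optimal_fixed_labor_supply(W0, wage0, F0, beta):
    """Closed-form constant labor supply: beta - (1 - beta) * W0 / (wage0 * F0).
    Note that the formula itself does not enforce a value in [0, 1]."""
    return beta - (1 - beta) * W0 / (wage0 * F0)

def objective(phi, W0, wage0, F0, beta):
    """Increasing transformation of the indirect utility as a function of phi."""
    return (1 - phi) ** (1 - beta) * (W0 + phi * wage0 * F0) ** beta

def grid_argmax(W0, wage0, F0, beta, steps=1000):
    """Brute-force maximizer over phi = 0, 1/steps, ..., (steps - 1)/steps."""
    grid = [i / steps for i in range(steps)]
    return max(grid, key=lambda phi: objective(phi, W0, wage0, F0, beta))

# Hypothetical inputs: wealth 5, wage rate 1, F(0) = 20, consumption weight 0.6.
W0, wage0, F0, beta = 5.0, 1.0, 20.0, 0.6
print("closed form:", optimal_fixed_labor_supply(W0, wage0, F0, beta))
print("grid search:", grid_argmax(W0, wage0, F0, beta))
```

The formula also shows that a wealthier agent (larger initial wealth relative to the value of maximal labor income) chooses to work less.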
13.5.4 Comparison of results
For easy comparison let us assume a deterministic wage rate, $v \equiv 0$. Then the optimal investment strategy of the agent with flexible labor supply is
\[
\Theta(W, \omega, t) = \frac{1}{\gamma}\left(W + \omega F(t)\right)\left(\underline{\underline{\sigma}}^\top\right)^{-1}\lambda,
\]
while the optimal investment strategy of the agent with fixed labor supply at a rate $\bar\varphi$ is
\[
\Theta(W, \omega, t; \bar\varphi) = \frac{1}{1 - \beta(1 - \gamma)}\left(W + \bar\varphi \omega F(t)\right)\left(\underline{\underline{\sigma}}^\top\right)^{-1}\lambda.
\]
First note that the amounts invested in any given asset by each of the two agents have the same sign; if one agent is long [short] a given asset so is the other agent. There are two differences between these two expressions: the relevant risk aversion coefficient and the valuation of future income. With flexible supply the labor income enters as the maximum value of future wages, which can only be obtained by working all the time. On the other hand, the total risk aversion $\gamma$ is relevant for the flexible supplier instead of the consumption risk aversion $1 - \beta(1 - \gamma)$ relevant for the fixed supplier.
Let us consider assets with positive amounts invested. If < 1, then < 1 (1 ), and
hence the exible supplier will unambiguously invest more in the risky assets. If is suciently
larger than 1, the relation between the amounts invested is ambiguous and will depend on the
exact parameter values, the remaining life-time, and the xed labor supply rate. For moderately
risk-averse investors at an early stage in their working life, the nancial investments of the exible
labor supplier tend to be more risky than those of the xed labor supplier. The intuition is that
investors incurring losses on their nancial investments may compensate by working harder and
drive up labor income. Labor supply exibility serves as a kind of insurance. Changes of labor
supply have the largest eect on capitalized labor income for young investors. The exibility of
labor supply may therefore amplify the horizon eect of labor income on risky investments which
is present already for an exogenously given labor income stream. With an uncertain wage rate
spanned by the risky nancial assets, this conclusion seems to hold as long as the wage rate is not
too risky, cf. the discussion in Bodie, Merton, and Samuelson (1992). Apparently, the eects of
labor supply exibility have not been studied in the more reasonable incomplete market setting,
where the wage rate is not fully diversiable.
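The comparison above can be illustrated with a small numerical sketch. It is only an illustration under stated assumptions: the symbols `gamma` (total risk aversion) and `beta` (weight on consumption in utility) follow the reading of the formulas above, while `H_max` (the value of future wages when working all the time) and `supply` (the fixed labor supply rate) are hypothetical names introduced here. With $v \equiv 0$ both strategies share the factor $(\sigma^\top)^{-1}\lambda$, so only the scalar multipliers are compared.

```python
def risk_aversions(gamma, beta):
    """Return (flexible, fixed): gamma and 1 - beta*(1 - gamma)."""
    return gamma, 1.0 - beta * (1.0 - gamma)


def risky_amounts(W, H_max, supply, gamma, beta):
    """Scalar multipliers of the common factor (sigma^T)^{-1} lambda when v = 0.

    W      : financial wealth
    H_max  : assumed value of future wages when working all the time
    supply : assumed fixed labor supply rate in [0, 1]
    """
    ra_flex, ra_fixed = risk_aversions(gamma, beta)
    flexible = (W + H_max) / ra_flex            # flexible supplier: full human capital
    fixed = (W + supply * H_max) / ra_fixed     # fixed supplier: scaled human capital
    return flexible, fixed


if __name__ == "__main__":
    for gamma in (0.5, 5.0):
        print(gamma, risky_amounts(W=100.0, H_max=50.0, supply=0.9, gamma=gamma, beta=0.6))
```

The gap between the two risk aversion coefficients is $(1-\beta)(1-\gamma)$, which is positive exactly when $\gamma < 1$; for $\gamma > 1$ the ranking of the two amounts depends on the size of human capital relative to financial wealth.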
13.6 More
Further references on labor income in portfolio and consumption choice: Cocco, Gomes, and
Maenhout (2005), El Karoui and Jeanblanc-Picqué (1998), Constantinides, Donaldson, and Mehra
(2002), Cuoco (1997), He and Pagès (1993), Koo (1995), Viceira (2001), Chan and Viceira (2000),
Heaton and Lucas (2000), Lynch and Tan (2011), Koijen, Nijman, and Werker (2010), Bick, Kraft,
and Munk (2012).
CHAPTER 14
Consumption and portfolio choice with housing
Brueckner (1997), Cocco (2005), Damgaard, Fuglsbjerg, and Munk (2003), Flavin and Yamashita
(2002), de Jong, Driessen, and Van Hemert (2008), Kraft and Munk (2011), Yao and Zhang (2005a,
2005b)
The purchase of a house serves a dual role by both generating consumption services and by
constituting an investment affecting future wealth and consumption opportunities.
Several recent papers include housing in life-cycle decision problems. Campbell and Cocco (2003)
study the mortgage choice in a life-cycle framework with stochastic house prices, labor income,
and interest rates. They do not allow housing investment to differ from housing consumption
and, furthermore, fix the house size (the number of housing units), so they cannot address the
interaction between housing decisions and portfolio decisions. Cocco (2005) considers a model in
which house prices and aggregate income shocks are perfectly correlated. Also in his model housing
consumption and housing investment cannot be disentangled, as renting is not possible and there
are no house price linked financial assets traded. The individual can only enjoy the consumption
benefits of a home by buying a house and is thus forced into home ownership. Since there is a
minimum choice of house size, a young individual has to tie up a large share of wealth in real estate
and will invest little in stocks (also because of borrowing constraints and an imposed stock market
entry cost). Cocco concludes that house price risk crowds out stock holdings and can therefore
help in explaining limited stock market participation.
Yao and Zhang (2005a) generalize Cocco's setting to an imperfect correlation between income
and house prices, and they show that there are substantial welfare gains from allowing renting
and that the renting/owning decision changes the optimal investment strategy. In their model, the
individual would prefer owning a house to renting, but cannot always do so because of constraints
(e.g., a down payment is required to buy a house). If the individual decides to rent a house of a
given size, that will be equal to her housing consumption and she will have zero wealth exposure
to house price risk. If the individual decides to own a house, the size of the house determines her
housing consumption and is identical to her housing investment position. Yao and Zhang (2005a)
find that home-owners invest less in stocks than home-renters.
Van Hemert (2010) generalizes the setting further by allowing for stochastic variations in interest
rates and thereby introducing a role for bonds, and his focus is on the interest rate exposure and
choice of mortgage over the life-cycle. Kraft and Munk (2011) disconnect the housing consumption
and investment positions further, as the individual can simultaneously rent and own, and her
investment position can be higher or lower than the housing consumption by renting out part of
the owned property or by investing in house price linked financial contracts. In the other models,
the housing investment position is closely linked to the demand for housing consumption, and
that level of housing investment will affect the investments in the other risky assets to obtain the
best overall level of risk-taking and exposure to different risks. In the Kraft-Munk setting, the
housing investment is more freely determined and, hence, does not have similar repercussions for
the stock and bond demand. Their results indicate that access to well-functioning markets for
financial assets linked to house prices will lead to welfare gains that are non-negligible, although of
a moderate magnitude. In related work, de Jong, Driessen, and Van Hemert (2008) conclude that
the welfare gains from having access to housing futures are small, but their model ignores labor
income risk, does not allow for renting, fixes the housing investment, and assumes utility only from
terminal wealth. Other papers addressing various aspects of housing in individual decision making
include Sinai and Souleles (2005), Li and Yao (2007), Cauley, Pavlov, and Schwartz (2007), and
Corradin, Fillat, and Vergara-Alert (2010).
1
Most of the papers listed above impose various realistic constraints on the investment decisions
of the individual and/or allow labor income to have an unspanned risk component. Therefore,
they solve the decision problems by numerical dynamic programming with a coarse discretization
of time and the state space (Van Hemert (2010) is able to handle a finer discretization by relying on
60 parallel computers). This computational procedure is highly time-consuming and cumbersome,
and little is known about the precision of the numerical results. Kraft and Munk (2011) derive
closed-form solutions that are much easier to analyze, interpret, and implement and thus facilitate
an understanding and a quantification of the economic forces at play. On the other hand, Kraft
and Munk (2011) must disregard unspanned components of income risk, housing transaction costs,
borrowing constraints, short-sales constraints, etc.
1
Papers on the impact of housing decisions and prices on financial asset prices include Piazzesi, Schneider, and
Tuzel (2007), Lustig and van Nieuwerburgh (2005), and Yogo (2006).
CHAPTER 15
Other variations of the problem...
15.1 Multiple and/or durable consumption goods
References: Several perishable: Breeden (1979), Wachter and Yogo (2010)
With durable: Grossman and Laroque (1990), Hindy and Huang (1993), Detemple and Giannikos
(1996), Cuoco and Liu (2000), Damgaard, Fuglsbjerg, and Munk (2003)
15.2 Uncertain time of death; insurance
References: See Richard (1975), Steffensen (2004), Kraft and Steffensen (2008)
CHAPTER 16
International asset allocation
TO COME...
References: Grubel (1968), Cooper and Kaplanis (1994), French and Poterba (1991), Lioui and
Poncet (2003), Ang and Bekaert (2002), Das and Uppal (2004), Larsen (2010)
CHAPTER 17
Non-standard assumptions on investors
17.1 Preferences with habit formation
It has long been recognized by economists that preferences may not be intertemporally separable.
According to Browning (1991), this idea dates back to the 1890 book Principles of Economics
by Alfred Marshall. See Browning's paper for further references to the critique on intertemporally
separable preferences. In particular, the utility associated with the choice of consumption at a
given date may depend on past choices of consumption. This is modeled by replacing $u(c_t, t)$ by
$u(c_t, h_t, t)$, where $u$ is decreasing in $h_t$, which is a measure of the standard of living or the habit
level of consumption, e.g., a weighted average of past consumption rates:
\[
h_t = h_0 e^{-\beta t} + \alpha \int_0^t e^{-\beta (t-s)} c_s \, ds,
\]
where $h_0$, $\alpha$, and $\beta$ are non-negative constants. High past consumption generates a desire for high
current consumption, so that preferences display intertemporal complementarity. As additional
motivation for such preferences, note that several papers have documented the importance of
allowing for habit formation in utilities when it comes to equilibrium asset pricing. Empirical facts
that seem puzzling relative to models with a representative agent having time-separable utility can
be resolved by introducing habit formation into the utility function. For example, Constantinides
(1990) and Sundaresan (1989) demonstrate that models with habit formation can obtain a high
equity premium with low risk aversion. Campbell and Cochrane (1999) and Wachter (2006) construct
representative agent models with habit formation that are consistent with observed variations in
expected returns on stocks and bonds over time. Detemple and Zapatero (1991) also study asset
pricing implications of habit formation preferences.
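As a rough illustration of the habit level defined above, the following sketch evaluates $h_t$ by the trapezoidal rule for a given consumption path. The function name `habit_level`, the step count, and the parameter values are assumptions made for this example only; the symbols `alpha` and `beta` follow the formula as reconstructed above.

```python
import math


def habit_level(t, h0, alpha, beta, consumption, steps=10_000):
    """Approximate h_t = h0*exp(-beta*t) + alpha * int_0^t exp(-beta*(t-s)) c(s) ds.

    consumption : function s -> c_s giving the past consumption rate
    steps       : number of trapezoidal sub-intervals (an arbitrary choice)
    """
    if t == 0:
        return h0
    ds = t / steps
    total = 0.0
    for i in range(steps + 1):
        s = i * ds
        weight = 0.5 if i in (0, steps) else 1.0   # trapezoidal end-point weights
        total += weight * math.exp(-beta * (t - s)) * consumption(s)
    return h0 * math.exp(-beta * t) + alpha * total * ds


if __name__ == "__main__":
    # With constant consumption c the integral has the closed form
    # h_t = h0*exp(-beta*t) + (alpha*c/beta)*(1 - exp(-beta*t)).
    print(habit_level(5.0, h0=1.0, alpha=0.1, beta=0.2, consumption=lambda s: 2.0))
```

With constant consumption $c$ the habit level converges to $\alpha c/\beta$ as $t$ grows, so a power-linear utility of $c - h$ stays well defined in the long run only if $\alpha < \beta$ in that case.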
1
Sundaresan (1989), Constantinides (1990), and Ingersoll (1992) all derive the optimal strategies
for an investor with an infinite time horizon under the assumption of a constant investment
opportunity set. In addition, Ingersoll (1992) considers a finite-horizon investor with log utility.
Detemple and Zapatero (1992) derive conditions under which optimal policies exist for an investor
with habit persistence in preferences. They are able to characterize the optimal consumption
strategy in a general setting, but, except for the case of deterministic investment opportunities,
they state the optimal portfolio in terms of an unknown stochastic process that comes out of the
martingale representation theorem. Detemple and Karatzas (2003) provide a similar analysis for
a preference structure that also involves habit formation but is more general in several respects.
Schroder and Skiadas (2002) show that the general decision problem of an investor with habit
persistence in preferences who can trade in a given financial market is equivalent to the decision
problem of an investor who does not exhibit habit formation, and who can trade in a financial
market with more complex dynamics of investment opportunities.
1
Both Campbell and Cochrane (1999) and Wachter (2006) consider utility with external habit formation in the
sense that the agent does not take into account the effect that the choice of current consumption has on future habit
levels. In the other papers referred to, these effects are considered.
Munk (2008) gives a precise characterization of the optimal portfolio in a general complete market
setting and derives explicit results in concrete settings with stochastic investment opportunities. The
assumed objective is
\[
J_t = \sup_{(c,\pi) \in \mathcal{A}(t)} \mathrm{E}_t \left[ \int_t^T e^{-\delta (s-t)} u(c_s, h_s) \, ds \right],
\]
where $\mathcal{A}(t)$ denotes the set of feasible consumption and portfolio strategies over the period $[t, T]$,
and the instantaneous utility function $u(c, h)$ is assumed to be power-linear,
\[
u(c, h) = \frac{1}{1-\gamma} (c - h)^{1-\gamma},
\]
where the constant $\gamma > 0$ is a risk aversion parameter. With this specification the consumption
rate is required to exceed the habit level, so that the habit level plays the role of a minimum or
subsistence consumption rate determined by past consumption rates. Let us briefly summarize the
main findings of that paper without going into the modeling details:
Mean-reverting stock returns. Stock returns are assumed to be predictable in the sense that
the market price of risk follows a mean-reverting process. Interest rates are assumed constant.
Under the assumption of perfect negative correlation between the stock price and the market price
of risk, Munk finds an explicit solution for the optimal strategies. This is a generalization of
the results of Wachter (2002), cf. Chapter 11, who assumes time-separable utility. The optimal
fraction of wealth invested in stocks is the sum of a myopic demand and a (positive) hedge demand.
Habit persistence has different effects on these two components, but in our numerical examples
the differences are very small. It is argued that, contrary to the case of time-additive utility, the
optimal fraction of wealth invested in stocks is not necessarily monotonically decreasing over the
life of an investor with habit persistence in preferences for consumption. Finally, relative to the
case of constant expected returns, mean-reverting returns support a higher consumption rate, but
in the numerical examples the increase is considerably smaller for investors with habit persistence
than for investors without.
Stochastic interest rates. The short-term interest rate is assumed to follow a square-root
process as suggested by Cox, Ingersoll, and Ross (1985) with the market prices of risk being fully
determined by the interest rate level. The assets available for investment are a stock (index), cash
(i.e., the bank account), and a single bond (without loss of generality). While the optimal stock
portfolio weight can be found in closed form, the optimal allocation to the bond and cash as well
as the optimal consumption rate involve a time and interest rate dependent function which is the
solution to a relatively simple partial differential equation (PDE). With time-additive preferences
the PDE has an explicit solution, cf. Section 10.3, but with habit preferences the PDE must be
solved numerically. The bond portfolio weight has all three components identified in the general
model: a myopic term, a hedge term, and a term ensuring that the future consumption at least
reaches the habit level. The stock portfolio weight, on the other hand, has only the myopic
component. The numerical experiments shown in the paper verify that habit formation has very
different effects on stock and bond investments and show that the effects on consumption are
ambiguous.
Labor income. The agent is assumed to receive a continuous stream of labor income. The
income stream has two effects. Firstly, the initial wealth is to be increased by the present value
of the future income stream, which implies that a larger fraction of financial wealth is to be
invested in the risky assets. Habit persistence in preferences dampens this effect. Secondly, a
labor income stream is implicitly equivalent to a stream of returns on a financial portfolio, so
the explicit investment strategy must be adjusted accordingly. This adjustment is independent of
the preference parameters and, hence, unaffected by habit persistence. Except for extreme habit
persistence and very low present value of income (relative to financial wealth), the effects of labor
income seem to dominate the effects of habit persistence.
In sum, habit persistence dampens the speculative investments of investors due to the fact that
some funds must be reserved for the purpose of ensuring that consumption in the future will meet
the habit level. The hedge investments may be affected differently by habit persistence, but in
the numerical examples given by Munk (2008) the differences are small. The main effect on the
relative allocations to different assets stems from the fact that some assets (bonds and cash) are
better investment objects than others (stocks) when it comes to ensuring that future consumption
will not fall below the habit level.
Further references: Hindy, Huang, and Zhu (1997)
17.2 Recursive utility
Schroder and Skiadas (1999) and Campbell and Viceira (1999, 2001) study consumption and
portfolio decisions with so-called recursive utility or stochastic differential utility... See also Bhamra
and Uppal (2006) and Cocco, Gomes, and Maenhout (2005).
Assume a single consumption good. We use a stochastic differential utility or recursive utility
specification for the preferences of the individual so that the utility index $V_t^{c,\pi}$ associated at time $t$
with a given consumption process $c$ and portfolio process $\pi$ over the remaining lifetime $[t, T]$ is
recursively given by
\[
V_t^{c,\pi} = \mathrm{E}_t \left[ \int_t^T f\left(c_u, V_u^{c,\pi}\right) du + \bar V_T^{c,\pi} \right]. \tag{17.1}
\]
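We assume that the so-called normalized aggregator $f$ is defined by
\[
f(c, V) =
\begin{cases}
\dfrac{\delta}{1 - 1/\psi} \, c^{1 - 1/\psi} \left([1-\gamma]V\right)^{1 - 1/\theta} - \delta \theta V, & \text{for } \psi \neq 1, \\[1ex]
(1-\gamma)\delta V \ln c - \delta V \ln\left([1-\gamma]V\right), & \text{for } \psi = 1,
\end{cases}
\]
where $\theta = (1-\gamma)/(1 - \frac{1}{\psi})$. The preferences are characterized by the three parameters $\delta$, $\gamma$, $\psi$. It is
well-known that $\delta$ is a time preference parameter, $\gamma > 1$ reflects the degree of relative risk aversion
towards atemporal bets (on the composite consumption level $z$ in our case), and $\psi > 0$ reflects
the elasticity of intertemporal substitution (EIS) towards deterministic consumption plans.$^2$ The
term $\bar V_T^{c,\pi}$ represents terminal utility and we assume that $\bar V_T^{c,\pi} = \frac{\varepsilon}{1-\gamma} \left(W_T^{c,\pi}\right)^{1-\gamma}$, where $\varepsilon \geq 0$ and
$W_T^{c,\pi}$ is the terminal wealth induced by the strategies $c, \pi$. The special case where $\psi = 1/\gamma$ (so
that $\theta = 1$) corresponds to the classic time-additive power utility. More precisely, with $\psi = 1/\gamma$
the recursion (17.1) is satisfied by
\[
V_t^{c,\pi} = \delta \, \mathrm{E}_t \left[ \int_t^T e^{-\delta(u-t)} \frac{1}{1-\gamma} c_u^{1-\gamma} \, du + \frac{1}{\delta} e^{-\delta(T-t)} \frac{\varepsilon}{1-\gamma} \left(W_T^{c,\pi}\right)^{1-\gamma} \right],
\]
which is a positive multiple of the traditional time-additive power utility specification. Note that
$\varepsilon = \delta$ would correspond to the case where utility of a terminal wealth of $W$ will count roughly as
much as the utility of consuming $W$ over the final year.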
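The two branches of the normalized aggregator can be checked against each other numerically. The sketch below is an illustration only (the function names `theta` and `aggregator` and all parameter values are chosen here, not taken from the text): it evaluates $f(c, V)$ and confirms that the $\psi \neq 1$ branch approaches the $\psi = 1$ branch as $\psi \to 1$ and collapses to $\frac{\delta}{1-\gamma}c^{1-\gamma} - \delta V$ when $\psi = 1/\gamma$.

```python
import math


def theta(gamma, psi):
    """theta = (1 - gamma) / (1 - 1/psi); undefined for psi = 1."""
    return (1.0 - gamma) / (1.0 - 1.0 / psi)


def aggregator(c, V, delta, gamma, psi):
    """Normalized aggregator f(c, V) for time preference delta, risk aversion gamma, EIS psi."""
    x = (1.0 - gamma) * V                      # must be positive (V < 0 when gamma > 1)
    if c <= 0 or x <= 0:
        raise ValueError("need c > 0 and (1 - gamma) * V > 0")
    if psi == 1.0:
        return (1.0 - gamma) * delta * V * math.log(c) - delta * V * math.log(x)
    th = theta(gamma, psi)
    return delta / (1.0 - 1.0 / psi) * c ** (1.0 - 1.0 / psi) * x ** (1.0 - 1.0 / th) - delta * th * V


if __name__ == "__main__":
    args = dict(c=2.0, V=-0.8, delta=0.03, gamma=3.0)
    print(aggregator(psi=1.0, **args))          # unit-EIS branch
    print(aggregator(psi=1.0 + 1e-6, **args))   # general branch close to psi = 1
    print(aggregator(psi=1.0 / 3.0, **args))    # power-utility case psi = 1/gamma
```

The agreement of the first two printed values reflects that the $\psi = 1$ branch is the limit of the general branch, which is why the two cases can share one indirect utility conjecture later on.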
The above utility specification is the continuous-time analogue of the Kreps-Porteus-Epstein-Zin
recursive utility defined in a discrete-time setting. Such utility specifications and their properties
have been discussed at a general level by, e.g., Kreps and Porteus (1978), Epstein and Zin (1989),
Duffie and Lions (1992), Duffie and Epstein (1992), Skiadas (1998), Schroder and Skiadas (1999),
and Kraft and Seifried (2010). Both the discrete-time and the continuous-time versions have been
applied in a few recent studies of utility maximization problems involving a single consumption
good, cf. Campbell and Viceira (1999), Campbell, Cocco, Gomes, Maenhout, and Viceira (2001),
and Chacko and Viceira (2005), and have also been applied in a two-good setting by Yao and Zhang
(2005b). Bhamra and Uppal (2006) provide a detailed analysis of the effects of the relative risk
aversion and the elasticity of intertemporal substitution parameters on the optimal portfolios in a
two-period model with stochastic interest rates.
17.2.1 Solution via dynamic programming
Let $\mathcal{A}_t$ denote the set of admissible control processes $(c, \pi)$ over the remaining lifetime $[t, T]$.
Constraints on the controls are reflected by $\mathcal{A}_t$. At any point in time $t < T$, the individual
maximizes $V_t^{c,\pi}$ over all admissible control processes given the values of the state variables at
time $t$. The indirect utility is defined as
\[
J_t = \sup_{(c,\pi) \in \mathcal{A}_t} V_t^{c,\pi}.
\]
Duffie and Epstein (1992) have demonstrated the validity of the dynamic programming solution
technique in the case of stochastic differential utility. For simplicity, we assume that the individual
does not receive any income from non-financial sources. Suppose the relevant information for the
decision problem is captured by wealth $W_t$ with dynamics
\[
dW_t = \left( W_t \left[ r(x_t) + \pi_t^\top \sigma(x_t, t) \lambda(x_t) \right] - c_t \right) dt + W_t \pi_t^\top \sigma(x_t, t) \, dz_t
\]
and a one-dimensional Markov process $x = (x_t)$ so that $J_t = J(W_t, x_t, t)$ and the dynamics of $x$
has the form
\[
dx_t = m(x_t) \, dt + v(x_t)^\top dz_t + \hat v(x_t) \, d\hat z_t.
\]
2
It is also possible to define a normalized aggregator for $\gamma = 1$ and for $0 < \gamma < 1$ but we focus on the empirically
more reasonable case of $\gamma > 1$.
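Then the Hamilton-Jacobi-Bellman (HJB) equation to solve is
\[
\begin{aligned}
0 = \sup_{c \geq 0, \, \pi \in \mathbb{R}^d} \Big\{ & f\big(c, J(W, x, t)\big) + \frac{\partial J}{\partial t}(W, x, t) + J_W(W, x, t) \left( W \left[ r(x) + \pi^\top \sigma(x, t) \lambda(x) \right] - c \right) \\
& + \frac{1}{2} J_{WW}(W, x, t) W^2 \pi^\top \sigma(x, t) \sigma(x, t)^\top \pi + J_x(W, x, t) m(x) \\
& + \frac{1}{2} J_{xx}(W, x, t) \left( v(x)^\top v(x) + \hat v(x)^2 \right) + J_{Wx}(W, x, t) W \pi^\top \sigma(x, t) v(x) \Big\}
\end{aligned}
\]
with the terminal condition $J(W, x, T) = \frac{\varepsilon}{1-\gamma} W^{1-\gamma}$. We rewrite the HJB-equation as
\[
\begin{aligned}
0 = {} & \mathcal{L}^\pi J(W, x, t) + \sup_{c \geq 0} \Big\{ f\big(c, J(W, x, t)\big) - c J_W(W, x, t) \Big\} + \frac{\partial J}{\partial t}(W, x, t) \\
& + J_W(W, x, t) W r(x) + J_x(W, x, t) m(x) + \frac{1}{2} J_{xx}(W, x, t) \left( v(x)^\top v(x) + \hat v(x)^2 \right),
\end{aligned}
\tag{17.2}
\]
where
\[
\begin{aligned}
\mathcal{L}^\pi J(W, x, t) = \sup_{\pi \in \mathbb{R}^d} \Big\{ & J_W(W, x, t) W \pi^\top \sigma(x, t) \lambda(x) + \frac{1}{2} J_{WW}(W, x, t) W^2 \pi^\top \sigma(x, t) \sigma(x, t)^\top \pi \\
& + J_{Wx}(W, x, t) W \pi^\top \sigma(x, t) v(x) \Big\}.
\end{aligned}
\tag{17.3}
\]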
The maximization with respect to $\pi$ is exactly as for the case with general time-additive expected
utility in Section 7.2.1. The maximizer is
\[
\pi = -\frac{J_W(W, x, t)}{W J_{WW}(W, x, t)} \left( \sigma(x, t)^\top \right)^{-1} \lambda(x) - \frac{J_{Wx}(W, x, t)}{W J_{WW}(W, x, t)} \left( \sigma(x, t)^\top \right)^{-1} v(x), \tag{17.4}
\]
which implies that
\[
\mathcal{L}^\pi J(W, x, t) = -\frac{1}{2} \frac{J_W(W, x, t)^2}{J_{WW}(W, x, t)} \|\lambda(x)\|^2 - \frac{1}{2} \frac{J_{Wx}(W, x, t)^2}{J_{WW}(W, x, t)} \|v(x)\|^2 - \frac{J_W(W, x, t) J_{Wx}(W, x, t)}{J_{WW}(W, x, t)} v(x)^\top \lambda(x). \tag{17.5}
\]
Note that the specification of the aggregator does not directly affect terms involving the portfolio
$\pi$. Hence, the above expressions for $\pi$ and $\mathcal{L}^\pi J$ are exactly as in the case with time-additive
power utility and are also the same whether $\psi = 1$ or not. The terms involving consumption will be
different from power utility and will depend on the value of $\psi$ and, therefore, the indirect utility
function solving the HJB-equation will also depend on the value of $\psi$, so we have to consider
different cases separately. Of course, when the indirect utility function is substituted into (17.4),
the optimal portfolio as a function of $W$, $x$, and $t$ is also going to depend on the value of $\psi$.
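The step from (17.3) to (17.4) and (17.5) is a quadratic maximization in $\pi$, which can be checked numerically. The sketch below does so for a single risky asset ($d = 1$), so that $\sigma$, $\lambda$, and $v$ are scalars; the function names and the numerical values of the derivatives are arbitrary assumptions for illustration, with $J_{WW} < 0$ so that the objective is concave.

```python
def portfolio_objective(pi, W, JW, JWW, JWx, sigma, lam, v):
    """Expression inside the supremum in (17.3) for d = 1."""
    return (JW * W * pi * sigma * lam
            + 0.5 * JWW * W ** 2 * pi ** 2 * sigma ** 2
            + JWx * W * pi * sigma * v)


def optimal_portfolio(W, JW, JWW, JWx, sigma, lam, v):
    """Maximizer (17.4) for d = 1: a speculative term plus a hedge term."""
    return -JW / (W * JWW) * lam / sigma - JWx / (W * JWW) * v / sigma


def maximized_value(JW, JWW, JWx, lam, v):
    """Right-hand side of (17.5) for d = 1."""
    return (-0.5 * JW ** 2 / JWW * lam ** 2
            - 0.5 * JWx ** 2 / JWW * v ** 2
            - JW * JWx / JWW * v * lam)


if __name__ == "__main__":
    p = dict(W=2.0, JW=1.5, JWW=-0.8, JWx=0.3, sigma=0.2, lam=0.4, v=-0.1)
    pi_star = optimal_portfolio(**p)
    print(pi_star, portfolio_objective(pi_star, **p),
          maximized_value(p["JW"], p["JWW"], p["JWx"], p["lam"], p["v"]))
```

The objective evaluated at the maximizer coincides with the closed-form value, and any perturbation of $\pi$ lowers it.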
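17.2.2 The case $\psi = 1$
When substituting the aggregator for $\psi = 1$ into (17.2), we can reformulate the HJB-equation
as
\[
\begin{aligned}
0 = {} & \mathcal{L}^\pi J(W, x, t) + \mathcal{L}^c J(W, x, t) - \delta J(W, x, t) \ln\left([1-\gamma] J(W, x, t)\right) + \frac{\partial J}{\partial t}(W, x, t) \\
& + J_W(W, x, t) W r(x) + J_x(W, x, t) m(x) + \frac{1}{2} J_{xx}(W, x, t) \left( v(x)^\top v(x) + \hat v(x)^2 \right),
\end{aligned}
\tag{17.6}
\]
where
\[
\mathcal{L}^c J(W, x, t) = \sup_{c \geq 0} \left\{ \delta (1-\gamma) J(W, x, t) \ln c - c J_W(W, x, t) \right\}.
\]
The first-order condition for the consumption choice is
\[
\delta (1-\gamma) J(W, x, t) \frac{1}{c} = J_W(W, x, t) \quad \Leftrightarrow \quad c = \delta (1-\gamma) J(W, x, t) J_W(W, x, t)^{-1},
\]
which implies that
\[
\begin{aligned}
\mathcal{L}^c J(W, x, t) &= \delta (1-\gamma) J(W, x, t) \left( \ln \delta + \ln\left([1-\gamma] J(W, x, t)\right) - \ln J_W(W, x, t) \right) - \delta (1-\gamma) J(W, x, t) \\
&= \delta (1-\gamma) J(W, x, t) \left\{ \ln \delta + \ln\left([1-\gamma] J(W, x, t)\right) - \ln J_W(W, x, t) - 1 \right\}.
\end{aligned}
\tag{17.7}
\]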
Substituting (17.5) and (17.7) into (17.6), we arrive at
\[
\begin{aligned}
0 = {} & -\frac{1}{2} \frac{J_W^2}{J_{WW}} \|\lambda(x)\|^2 - \frac{1}{2} \frac{J_{Wx}^2}{J_{WW}} \|v(x)\|^2 - \frac{J_W J_{Wx}}{J_{WW}} v(x)^\top \lambda(x) \\
& + \delta (1-\gamma) J \left\{ \ln \delta + \ln\left([1-\gamma] J\right) - \ln J_W - 1 \right\} - \delta J \ln\left([1-\gamma] J\right) + \frac{\partial J}{\partial t} \\
& + J_W W r(x) + J_x m(x) + \frac{1}{2} J_{xx} \left( v(x)^\top v(x) + \hat v(x)^2 \right),
\end{aligned}
\]
where $J$ and its derivatives are evaluated at $(W, x, t)$.
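We conjecture a solution of the form
\[
J(W, x, t) = \frac{1}{1-\gamma} G(x, t)^\gamma W^{1-\gamma} \tag{17.8}
\]
for some deterministic function $G$ to be determined. The terminal condition is $J(W, x, T) = \frac{\varepsilon}{1-\gamma} W^{1-\gamma}$
so we need $G(x, T) = \varepsilon^{1/\gamma}$ for all possible values of $x$. The relevant derivatives are
\[
\begin{aligned}
J_W(W, x, t) &= G(x, t)^\gamma W^{-\gamma}, \\
J_{WW}(W, x, t) &= -\gamma G(x, t)^\gamma W^{-\gamma-1}, \\
J_x(W, x, t) &= \frac{\gamma}{1-\gamma} G(x, t)^{\gamma-1} G_x(x, t) W^{1-\gamma}, \\
J_{xx}(W, x, t) &= -\gamma G(x, t)^{\gamma-2} G_x(x, t)^2 W^{1-\gamma} + \frac{\gamma}{1-\gamma} G(x, t)^{\gamma-1} G_{xx}(x, t) W^{1-\gamma}, \\
J_{Wx}(W, x, t) &= \gamma G(x, t)^{\gamma-1} G_x(x, t) W^{-\gamma}, \\
\frac{\partial J}{\partial t}(W, x, t) &= \frac{\gamma}{1-\gamma} G(x, t)^{\gamma-1} \frac{\partial G}{\partial t}(x, t) W^{1-\gamma}.
\end{aligned}
\]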
Our candidates for the optimal decisions then become
\[
c_t^* = \delta W_t, \qquad
\pi_t^* = \frac{1}{\gamma} \left( \sigma(x_t, t)^\top \right)^{-1} \lambda(x_t) + \frac{G_x(x_t, t)}{G(x_t, t)} \left( \sigma(x_t, t)^\top \right)^{-1} v(x_t).
\]
Compared to the case with time-additive power utility, the portfolio seems unchanged but, of
course, the function $G(x, t)$ might be different from the function $g(x, t)$ appearing with power
utility. The candidate for the optimal consumption rate is very different from the power utility
case. With power utility, it is optimal to consume a fraction $1/g(x, t)$ of wealth. With recursive
utility and $\psi = 1$, it is optimal to consume a constant fraction of wealth equal to the subjective
time preference rate $\delta$.
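As a quick sanity check of the candidate rules, the sketch below plugs the conjecture (17.8) into the first-order condition $c = \delta(1-\gamma)J/J_W$, using a finite-difference approximation of $J_W$, and also evaluates the two portfolio terms for a single risky asset. All function names and numbers are illustrative assumptions; in particular, the value of $G$ and the ratio $G_x/G$ are simply postulated rather than solved for.

```python
def indirect_utility(W, G, gamma):
    """Conjecture (17.8): J = G^gamma * W^(1 - gamma) / (1 - gamma)."""
    return G ** gamma * W ** (1.0 - gamma) / (1.0 - gamma)


def consumption_unit_eis(W, G, delta, gamma, h=1e-6):
    """c = delta*(1 - gamma)*J / J_W with J_W from a central finite difference."""
    J = indirect_utility(W, G, gamma)
    JW = (indirect_utility(W + h, G, gamma) - indirect_utility(W - h, G, gamma)) / (2.0 * h)
    return delta * (1.0 - gamma) * J / JW


def portfolio_terms(gamma, sigma, lam, Gx_over_G, v):
    """Myopic and hedge terms of the candidate portfolio for one risky asset (d = 1)."""
    myopic = lam / (gamma * sigma)
    hedge = Gx_over_G * v / sigma
    return myopic, hedge


if __name__ == "__main__":
    print(consumption_unit_eis(W=10.0, G=3.7, delta=0.03, gamma=4.0))   # close to 0.03 * 10
    print(portfolio_terms(gamma=4.0, sigma=0.2, lam=0.4, Gx_over_G=-0.5, v=-0.02))
```

The consumption rate does not depend on the postulated value of $G$, which is the numerical counterpart of the statement that a constant fraction $\delta$ of wealth is consumed.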
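With the conjecture (17.8), we get
\[
\begin{aligned}
\mathcal{L}^c J(W, x, t) &= \delta G(x, t)^\gamma W^{1-\gamma} \left\{ \ln \delta + \ln\left( G(x, t)^\gamma W^{1-\gamma} \right) - \ln\left( G(x, t)^\gamma W^{-\gamma} \right) - 1 \right\} \\
&= \delta G(x, t)^\gamma W^{1-\gamma} \left( \ln W + \ln \delta - 1 \right), \\
\mathcal{L}^\pi J(W, x, t) &= G(x, t)^\gamma W^{1-\gamma} \left( \frac{1}{2\gamma} \|\lambda(x)\|^2 + \frac{\gamma}{2} \left( \frac{G_x(x, t)}{G(x, t)} \right)^2 \|v(x)\|^2 + \frac{G_x(x, t)}{G(x, t)} v(x)^\top \lambda(x) \right).
\end{aligned}
\tag{17.9}
\]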
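Substitute into the HJB equation (17.6), multiply through by $\frac{1-\gamma}{\gamma} G(x, t)^{1-\gamma} W^{\gamma-1}$, and simplify.
Then you will get the following PDE for $G(x, t)$:
\[
\begin{aligned}
0 = {} & \frac{1}{2} \left( \|v(x)\|^2 + \hat v(x)^2 \right) G_{xx}(x, t) + \left( m(x) - \frac{\gamma-1}{\gamma} \lambda(x)^\top v(x) \right) G_x(x, t) + \frac{1}{2} (\gamma-1) \hat v(x)^2 \frac{G_x(x, t)^2}{G(x, t)} \\
& + \frac{\partial G}{\partial t}(x, t) - \left( \delta \ln G(x, t) + \frac{\gamma-1}{\gamma} \delta [\ln \delta - 1] + \frac{\gamma-1}{\gamma} r(x) + \frac{\gamma-1}{2\gamma^2} \|\lambda(x)\|^2 \right) G(x, t),
\end{aligned}
\tag{17.10}
\]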
which we have to solve with the terminal condition $G(x, T) = \varepsilon^{1/\gamma}$. We can obtain an explicit
solution to this PDE under some assumptions on the dependence of $r$, $\lambda$, $v$, and $\hat v$ on $x$, whereas
numerical solution techniques have to be implemented for other cases. Let us try a solution of the
form
\[
G(x, t) = \varepsilon^{1/\gamma} e^{-D_0(T-t) - D_1(T-t) x}.
\]
The terminal condition implies that $D_0(0) = D_1(0) = 0$. After substitution into the PDE (17.10)
and simplifications, we find that
\[
\begin{aligned}
0 = {} & \frac{1}{2} \left( \|v(x)\|^2 + \gamma \hat v(x)^2 \right) D_1(T-t)^2 - \left( m(x) - \frac{\gamma-1}{\gamma} \lambda(x)^\top v(x) - \delta x \right) D_1(T-t) + D_1'(T-t) x \\
& + D_0'(T-t) + \delta D_0(T-t) - \left( \frac{\delta}{\gamma} \ln \varepsilon + \frac{\gamma-1}{\gamma} \delta [\ln \delta - 1] + \frac{\gamma-1}{\gamma} r(x) + \frac{\gamma-1}{2\gamma^2} \|\lambda(x)\|^2 \right).
\end{aligned}
\]
If $\|v(x)\|^2$, $\hat v(x)^2$, $m(x)$, $\lambda(x)^\top v(x)$, $r(x)$, and $\|\lambda(x)\|^2$ are all affine functions of $x$, the above
equation can be decomposed into a system of two ordinary differential equations for $D_0$ and $D_1$.
Note that even though we are considering a case with utility of intermediate consumption, we can
allow for incomplete markets (i.e., $\hat v(x) \neq 0$), and the solution $G(x, t)$ does not involve an integral;
these findings contrast with the results for time-additive power utility.
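17.2.3 The case $\psi \neq 1$
When substituting the aggregator for $\psi \neq 1$ into (17.2), we can reformulate the HJB-equation
as
\[
\begin{aligned}
0 = {} & \mathcal{L}^\pi J(W, x, t) + \mathcal{L}^c J(W, x, t) - \delta \theta J(W, x, t) + \frac{\partial J}{\partial t}(W, x, t) \\
& + J_W(W, x, t) W r(x) + J_x(W, x, t) m(x) + \frac{1}{2} J_{xx}(W, x, t) \left( v(x)^\top v(x) + \hat v(x)^2 \right),
\end{aligned}
\tag{17.11}
\]
where $\mathcal{L}^c J$ is now defined by
\[
\mathcal{L}^c J(W, x, t) = \sup_{c \geq 0} \left\{ \frac{\delta}{1 - 1/\psi} c^{1 - 1/\psi} \left([1-\gamma] J\right)^{1 - 1/\theta} - c J_W(W, x, t) \right\}
\]
and $\mathcal{L}^\pi J$ is still defined by (17.3), which leads to (17.5). The first-order condition with respect to
consumption yields
\[
c = \delta^\psi J_W(W, x, t)^{-\psi} \left([1-\gamma] J(W, x, t)\right)^{\psi (1 - \frac{1}{\theta})},
\]
which implies that
\[
\mathcal{L}^c J(W, x, t) = \frac{1}{\psi - 1} \delta^\psi J_W(W, x, t)^{1-\psi} \left([1-\gamma] J(W, x, t)\right)^{\psi (1 - \frac{1}{\theta})}.
\]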
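Again, we conjecture that indirect utility is of the form (17.8) for some function $G$ to be
determined. If that is true, the optimal consumption rate is
\[
c_t^* = \delta^\psi \left( G(x_t, t)^\gamma W_t^{-\gamma} \right)^{-\psi} \left( G(x_t, t)^\gamma W_t^{1-\gamma} \right)^{\psi (1 - \frac{1}{\theta})} = \delta^\psi G(x_t, t)^{-\gamma\psi/\theta} W_t, \tag{17.12}
\]
which implies that
\[
\mathcal{L}^c J(W, x, t) = \frac{1}{\psi - 1} \delta^\psi \left( G(x, t)^\gamma W^{-\gamma} \right)^{1-\psi} \left( G(x, t)^\gamma W^{1-\gamma} \right)^{\psi (1 - \frac{1}{\theta})} = \frac{1}{\psi - 1} \delta^\psi G(x, t)^{\gamma (1 + \frac{\psi-1}{\gamma-1})} W^{1-\gamma}.
\]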
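By substituting that equation together with (17.9) into the HJB-equation (17.11), multiplying
through by $\frac{1-\gamma}{\gamma} G(x, t)^{1-\gamma} W^{\gamma-1}$, and simplifying, one arrives at the PDE
\[
\begin{aligned}
0 = {} & \frac{1}{2} \left( \|v(x)\|^2 + \hat v(x)^2 \right) G_{xx}(x, t) + \left( m(x) - \frac{\gamma-1}{\gamma} \lambda(x)^\top v(x) \right) G_x(x, t) + \frac{1}{2} (\gamma-1) \hat v(x)^2 \frac{G_x(x, t)^2}{G(x, t)} \\
& + \frac{\partial G}{\partial t}(x, t) + \frac{\theta \delta^\psi}{\gamma \psi} G(x, t)^{\frac{\gamma\psi-1}{\gamma-1}} - \left( \frac{\delta\theta}{\gamma} + \frac{\gamma-1}{\gamma} r(x) + \frac{\gamma-1}{2\gamma^2} \|\lambda(x)\|^2 \right) G(x, t),
\end{aligned}
\tag{17.13}
\]
which we have to solve with the terminal condition $G(x, T) = \varepsilon^{1/\gamma}$.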
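The term with $G^{\frac{\gamma\psi-1}{\gamma-1}}$ is a potential complication. In the case of power utility, i.e., $\psi = 1/\gamma$, the
power of $G$ reduces to 0 so the term is simply the constant $\delta^{1/\gamma}$. It is then well-known that we can
find closed-form solutions for $G(x, t)$ if the market is complete (so that $\hat v(x) \equiv 0$) and the model
has an affine or quadratic structure. For example, with an affine structure, the solution is of the
form
\[
\begin{aligned}
G(x, t) = {} & \int_t^T \delta^{1/\gamma} \exp\left\{ -\frac{\delta}{\gamma} (s-t) + \frac{1-\gamma}{\gamma} A_0(s-t) + \frac{1-\gamma}{\gamma} A_1(s-t) x \right\} ds \\
& + \varepsilon^{1/\gamma} \exp\left\{ -\frac{\delta}{\gamma} (T-t) + \frac{1-\gamma}{\gamma} A_0(T-t) + \frac{1-\gamma}{\gamma} A_1(T-t) x \right\}
\end{aligned}
\]
for some deterministic functions $A_0$ and $A_1$ that solve certain ODEs.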
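In other cases than power utility, the PDE (17.13) does not seem to have an explicit solution.
One way to proceed is to solve the PDE numerically. Another way is to introduce an approximation
so that the nasty term disappears. For simplicity, consider first the case of constant investment
opportunities, where we do not need any state variable $x$. Then we are searching for the function
$G(t)$ solving the non-linear ODE
\[
0 = G'(t) + \frac{\theta \delta^\psi}{\gamma \psi} G(t)^{\frac{\gamma\psi-1}{\gamma-1}} - A G(t), \qquad A = \frac{\delta\theta}{\gamma} + \frac{\gamma-1}{\gamma} r + \frac{\gamma-1}{2\gamma^2} \|\lambda\|^2, \tag{17.14}
\]
which we have to solve with the terminal condition $G(T) = \varepsilon^{1/\gamma}$. Following an idea originally
put forward by Campbell (1993) in a discrete-time setting and adapted to a continuous-time setting
by Chacko and Viceira (2005), we can obtain a closed-form approximate solution in the following
way. A Taylor approximation of $z \mapsto e^z$ around $\bar z$ gives $e^z \approx e^{\bar z} (1 + z - \bar z)$. When we apply that
to $z = \frac{\gamma(\psi-1)}{\gamma-1} \ln G(t)$, we get
\[
\begin{aligned}
G(t)^{\frac{\gamma\psi-1}{\gamma-1}} &= G(t) \, G(t)^{\frac{\gamma(\psi-1)}{\gamma-1}} = G(t) \, e^{\frac{\gamma(\psi-1)}{\gamma-1} \ln G(t)} \\
&\approx G(t) \, e^{\frac{\gamma(\psi-1)}{\gamma-1} \ln \hat G(t)} \left( 1 + \frac{\gamma(\psi-1)}{\gamma-1} \left[ \ln G(t) - \ln \hat G(t) \right] \right) \\
&= G(t) \, \hat G(t)^{\frac{\gamma(\psi-1)}{\gamma-1}} \left( 1 + \frac{\gamma(\psi-1)}{\gamma-1} \left[ \ln G(t) - \ln \hat G(t) \right] \right).
\end{aligned}
\tag{17.15}
\]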
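The quality of the linearization (17.15) depends on how close $\ln G$ is to $\ln \hat G$. The following sketch, with illustrative parameter values and hypothetical function names, compares the exact power $G^{(\gamma\psi-1)/(\gamma-1)}$ with its first-order approximation around a chosen expansion point `G_hat`.

```python
import math


def kappa(gamma, psi):
    """Exponent gamma*(psi - 1)/(gamma - 1), so that 1 + kappa = (gamma*psi - 1)/(gamma - 1)."""
    return gamma * (psi - 1.0) / (gamma - 1.0)


def exact_power(G, gamma, psi):
    """Exact non-linear term G^((gamma*psi - 1)/(gamma - 1))."""
    return G ** ((gamma * psi - 1.0) / (gamma - 1.0))


def linearized_power(G, G_hat, gamma, psi):
    """First-order approximation (17.15) around the expansion point G_hat."""
    k = kappa(gamma, psi)
    return G * G_hat ** k * (1.0 + k * (math.log(G) - math.log(G_hat)))


if __name__ == "__main__":
    gamma, psi, G_hat = 4.0, 1.5, 2.0
    for G in (2.0, 2.2, 3.0):
        exact = exact_power(G, gamma, psi)
        approx = linearized_power(G, G_hat, gamma, psi)
        print(G, exact, approx, (exact - approx) / exact)
```

Because $z \mapsto e^z$ is convex, the linearization never exceeds the exact value, and the relative error grows roughly quadratically in $\ln G - \ln \hat G$.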
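Using that approximation in the ODE (17.14), we get
\[
0 = G'(t) - a(t) G(t) - b(t) G(t) \ln G(t), \tag{17.16}
\]
where
\[
a(t) = A - \delta^\psi \hat G(t)^{\frac{\gamma(\psi-1)}{\gamma-1}} \left( \frac{\theta}{\gamma\psi} + \ln \hat G(t) \right), \qquad b(t) = \delta^\psi \hat G(t)^{\frac{\gamma(\psi-1)}{\gamma-1}}.
\]
The solution to (17.16) with $G(T) = \varepsilon^{1/\gamma}$ is
\[
G(t) = \varepsilon^{1/\gamma} e^{-D(t)}, \qquad D(t) = \int_t^T e^{-\int_t^s b(u) \, du} \left( a(s) + b(s) \frac{1}{\gamma} \ln \varepsilon \right) ds.
\]
Using the approximation to $G(t)$ in the optimal consumption rule (17.12), we get
\[
c_t^* = \delta^\psi \left( \varepsilon^{1/\gamma} e^{-D(t)} \right)^{-\gamma\psi/\theta} W_t = \delta^\psi \varepsilon^{-\psi/\theta} e^{\frac{\gamma\psi}{\theta} D(t)} W_t,
\]
i.e., the optimal consumption rate is a time-dependent fraction of wealth. It remains to decide
on the function $\hat G(t)$ in the approximation. We should make sure that $\ln G(t)$ is rather close to
$\ln \hat G(t)$. One idea is to presume that the optimal consumption/wealth ratio from (17.12) is close
to the optimal consumption/wealth ratio in the special case of $\psi = 1$, i.e.,
\[
\delta^\psi G(t)^{-\gamma\psi/\theta} \approx \delta \quad \Leftrightarrow \quad G(t) \approx \delta^{-(\gamma-1)/\gamma} \equiv \hat G(t).
\]
In that case, the functions $a$ and $b$ are simply constants,
\[
b = \delta, \qquad a = A - \delta \left( \frac{\theta}{\gamma\psi} + \ln \hat G \right),
\]
so that $D(t)$ reduces to
\[
D(t) = \left( \frac{a}{\delta} + \frac{1}{\gamma} \ln \varepsilon \right) \left( 1 - e^{-\delta (T-t)} \right).
\]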
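The closed-form approximation can be set against a direct numerical solution of the ODE (17.14). The sketch below is only an illustration under stated assumptions: a scalar market price of risk `lam`, the constant expansion point $\hat G = \delta^{-(\gamma-1)/\gamma}$ discussed above, a classical fourth-order Runge-Kutta scheme chosen here for convenience, and arbitrary parameter values. It requires $\psi \neq 1$ and $\varepsilon > 0$, and it makes no claim about how accurate the approximation is in general.

```python
import math


def coefficients(delta, gamma, psi, r, lam):
    """Return (theta, kappa, C, A) for the ODE 0 = G' + C*G^(1+kappa) - A*G; psi != 1."""
    theta = (1.0 - gamma) / (1.0 - 1.0 / psi)
    kappa = gamma * (psi - 1.0) / (gamma - 1.0)
    C = theta * delta ** psi / (gamma * psi)
    A = delta * theta / gamma + (gamma - 1.0) / gamma * r + (gamma - 1.0) / (2.0 * gamma ** 2) * lam ** 2
    return theta, kappa, C, A


def solve_ode_rk4(t, T, eps, delta, gamma, psi, r, lam, steps=2000):
    """Integrate G' = A*G - C*G^(1+kappa) backwards from G(T) = eps^(1/gamma) to time t."""
    _, kappa, C, A = coefficients(delta, gamma, psi, r, lam)

    def rhs(G):
        return A * G - C * G ** (1.0 + kappa)

    G = eps ** (1.0 / gamma)
    h = -(T - t) / steps                      # negative step: moving backwards in time
    for _ in range(steps):
        k1 = rhs(G)
        k2 = rhs(G + 0.5 * h * k1)
        k3 = rhs(G + 0.5 * h * k2)
        k4 = rhs(G + h * k3)
        G += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return G


def approximate_G(t, T, eps, delta, gamma, psi, r, lam):
    """Approximation G = eps^(1/gamma)*exp(-D(t)) with constant a and b = delta; returns (G, a)."""
    theta, _, _, A = coefficients(delta, gamma, psi, r, lam)
    ln_G_hat = -(gamma - 1.0) / gamma * math.log(delta)
    a = A - delta * (theta / (gamma * psi) + ln_G_hat)
    D = (a / delta + math.log(eps) / gamma) * (1.0 - math.exp(-delta * (T - t)))
    return eps ** (1.0 / gamma) * math.exp(-D), a


if __name__ == "__main__":
    params = dict(T=10.0, eps=0.03, delta=0.03, gamma=4.0, r=0.02, lam=0.4)
    for psi in (0.5, 1.5):
        numeric = solve_ode_rk4(0.0, psi=psi, **params)
        approx, _ = approximate_G(0.0, psi=psi, **params)
        print(psi, numeric, approx)
```

Running the comparison for a concrete parameter set gives a direct impression of the approximation error for that case; in the power utility case $\psi = 1/\gamma$ the ODE is linear, so the numerical solver can itself be checked against the exact solution.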
In the case of stochastic investment opportunities, the approximation (17.15) becomes
\[
G(x, t)^{\frac{\gamma\psi-1}{\gamma-1}} \approx G(x, t) \, \hat G(t)^{\frac{\gamma(\psi-1)}{\gamma-1}} \left( 1 + \frac{\gamma(\psi-1)}{\gamma-1} \left[ \ln G(x, t) - \ln \hat G(t) \right] \right).
\]
By substituting that into (17.13), we obtain
\[
\begin{aligned}
0 = {} & \frac{1}{2} \left( \|v(x)\|^2 + \hat v(x)^2 \right) G_{xx}(x, t) + \left( m(x) - \frac{\gamma-1}{\gamma} \lambda(x)^\top v(x) \right) G_x(x, t) + \frac{1}{2} (\gamma-1) \hat v(x)^2 \frac{G_x(x, t)^2}{G(x, t)} \\
& + \frac{\partial G}{\partial t}(x, t) + \frac{\theta \delta^\psi}{\gamma\psi} G(x, t) \, \hat G(t)^{\frac{\gamma(\psi-1)}{\gamma-1}} \left( 1 + \frac{\gamma(\psi-1)}{\gamma-1} \left[ \ln G(x, t) - \ln \hat G(t) \right] \right) \\
& - \left( \frac{\delta\theta}{\gamma} + \frac{\gamma-1}{\gamma} r(x) + \frac{\gamma-1}{2\gamma^2} \|\lambda(x)\|^2 \right) G(x, t),
\end{aligned}
\]
which is a PDE of the same form as the relevant PDE (17.10) for the case $\psi = 1$, except that we
now have an explicit time-dependence in the coefficient of the approximated term via $\hat G(t)$. If the
model has an affine structure, the approximated PDE will therefore have a solution of the form
\[
G(x, t) = \varepsilon^{1/\gamma} e^{-D_0(t, T) - D_1(t, T) x},
\]
where the deterministic functions $D_0$ and $D_1$ now depend separately on $t$ and $T$ because of the
time-dependent coefficients in the PDE. In particular, $D_0(t, T)$ and $D_1(t, T)$ will depend on the
values of $\hat G(u)$ for $u \in (t, T)$. Again, $D_0$ and $D_1$ solve some equations that depend on the specific
affine structure of the model. Intuitively, the approximation works best if $\hat G(t)$ is chosen so that
$\ln G(x_t, t)$ stays close to $\ln \hat G(t)$, which is now potentially harder due to the presence of the stochastic
process $x_t$. One idea is to determine $\hat G(t)$ so that
\[
\ln \hat G(t) = \mathrm{E}[\ln G(x_t, t)] = \frac{1}{\gamma} \ln \varepsilon - D_0(t, T) - D_1(t, T) \, \mathrm{E}[x_t].
\]
Since the right-hand side depends on all $\hat G(u)$ for $u \in (t, T)$, this involves a recursive procedure
moving backwards from $T$.
In any case, it seems impossible to say anything concrete about the precision of the
approximation. Of course, for a concrete problem the approximate solution could be compared to the
solution stemming from a numerical solution of the relevant PDE for $G$, but apparently no such
studies have been published.
17.3 Model/parameter uncertainty, incomplete information, learning
References: See Brennan (1998), Barberis (2000), Gennotte (1986), Karatzas and Xue (1991)
17.4 Ambiguity aversion
See Maenhout (2004)
17.5 Other objective functions
Portfolio choice problems of portfolio managers whose compensation depends on the performance
of the portfolio chosen and a benchmark portfolio. The compensation may include option elements.
See Carpenter (2000), Browne (1999).
17.6 Consumption and portfolio choice for non-price takers
References: See Cuoco and Cvitanic (1998), Basak (1997)
17.7 Non-utility based portfolio choice
References: See Cover (1991), Jamshidian (1992)
17.8 Allowing for bankruptcy
References: See Lehoczky, Sethi, and Shreve (1983), Sethi, Taksar, and Presman (1992),
Presman and Sethi (1996)
CHAPTER 18
Trading and information imperfections
18.1 Trading constraints
References: See Bardhan (1994), Cuoco (1997), Cvitanić (1996), Cvitanić and Karatzas (1992), Fleming and Zariphopoulou (1991), Grossman and Vila (1991), He and Pearson (1991), Shirakawa (1994), Sørensen (2007), Teplá (2000, 2001), Xu and Shreve (1992a), Xu and Shreve (1992b), Zariphopoulou (1992), Zariphopoulou (1994)
Value-at-risk constraints: Basak and Shapiro (2001), Cuoco, He, and Issaenko (2002), Cuoco and Liu (2006)
Drawdown constraints: Cvitanić and Karatzas (1995), Grossman and Zhou (1993)
18.2 Transaction costs
The simplest type of transaction costs to handle is proportional costs. Some initial, heuristic work was made by Magill and Constantinides (1976) and Constantinides (1979, 1986). A more formal analysis was provided by Davis and Norman (1990) and we follow their presentation.
Model set-up:
(1) Risk-free bank account with constant interest rate r (continuously compounded), traded
without transaction costs.
(2) A single risky asset (the stock). The listed unit price $P_t$ follows geometric Brownian motion:
\[ dP_t = P_t\left[\mu\,dt + \sigma\,dz_t\right]. \]
Buying one unit costs $(1+a)P_t$, selling one unit provides $(1-b)P_t$, where $a, b \geq 0$.
(3) Investment strategy in the stock is represented by the pair of processes $(L, U)$ with $L_t$ denoting the cumulative amounts of stock purchased on the time interval $[0,t]$ and $U_t$ the cumulative amounts of stock sold on $[0,t]$, where the amounts are measured by the listed price (if $x$ units of the stock are purchased at time $t$, $L_t$ increases by $xP_t$). Let $L_0 = U_0 = 0$. $L$ and $U$ are right-continuous and nondecreasing.
(4) Let $S_{0t}$ denote the balance of the bank account at time $t$ and let $S_{1t}$ denote the value of the stocks owned at time $t$ (measured at the listed unit price at time $t$). The dynamics are
\[ dS_{0t} = (rS_{0t} - c_t)\,dt - (1+a)\,dL_t + (1-b)\,dU_t, \qquad S_{00} = x, \]
\[ dS_{1t} = \mu S_{1t}\,dt + \sigma S_{1t}\,dz_t + dL_t - dU_t, \qquad S_{10} = y. \]
Here $c_t$ is the consumption rate at time $t$.
(5) The individual is required to stay solvent, so that after eliminating her position in the stock, she should have non-negative wealth. If $S_{1t} > 0$, the requirement is $S_{0t} + (1-b)S_{1t} \geq 0$, i.e., $S_{1t} \geq -\frac{1}{1-b}S_{0t}$. If $S_{1t} < 0$, the requirement is $S_{0t} + (1+a)S_{1t} \geq 0$, i.e., $S_{1t} \geq -\frac{1}{1+a}S_{0t}$. The solvency region is therefore
\[ \mathcal{S} = \left\{(x,y) \in \mathbb{R}^2 : x + (1-b)y \geq 0,\ x + (1+a)y \geq 0\right\}. \]
(6) The set of admissible consumption and trading strategies is
\[ \mathcal{U}(x,y) = \left\{(c,L,U) : (S_{0t},S_{1t}) \in \mathcal{S} \text{ for all } t \geq 0 \text{ (a.s.)},\ c_t \geq 0\right\}. \]
(7) For preferences, assume an infinite horizon and power utility with $\gamma > 1$ denoting the relative risk aversion. Let
\[ J(x,y) = \sup_{(c,L,U)\in\mathcal{U}(x,y)} E_{x,y}\left[\int_0^\infty e^{-\delta t}\,\frac{1}{1-\gamma}\,c_t^{1-\gamma}\,dt\right]. \]
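For the case without transaction costs ($a = b = 0$), we solved the similar problem for a finite time horizon in Section 6.3. Let $\lambda = (\mu - r)/\sigma$. If the constant
\[ A = \frac{\delta + r(\gamma-1)}{\gamma} + \frac{1}{2}\,\frac{\gamma-1}{\gamma^2}\,\lambda^2 \]
is positive, the limit as $T \to \infty$ of the solution is
\[ J(x,y) = \frac{1}{1-\gamma}A^{-\gamma}(x+y)^{1-\gamma}, \qquad c^* = A[x+y], \qquad \pi^* = \frac{\lambda}{\gamma\sigma}, \]
where $x + y$ is the total wealth. Since $\pi$ is the fraction of total wealth optimally invested in the stock, we have
\[ \pi_t = \frac{S_{1t}}{S_{0t}+S_{1t}} \]
and hence
\[ \frac{S_{1t}}{S_{0t}} = \frac{\pi^*}{1-\pi^*} = \frac{\lambda}{\gamma\sigma - \lambda}, \]
corresponding to a straight line through the origin in the $(S_0, S_1)$-space, the so-called Merton line.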
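The frictionless benchmark above is easy to evaluate numerically. The following is a minimal sketch, not part of the original notes; the function name and the parameter values are purely illustrative assumptions.

```python
def merton_infinite_horizon(mu, sigma, r, delta, gamma):
    """Frictionless benchmark: returns (A, pi_star, slope of the Merton line).

    mu, sigma : drift and volatility of the stock
    r         : risk-free rate
    delta     : time preference rate
    gamma     : relative risk aversion
    """
    lam = (mu - r) / sigma  # market price of risk
    A = (delta + r * (gamma - 1.0)) / gamma \
        + 0.5 * (gamma - 1.0) / gamma**2 * lam**2
    if A <= 0:
        raise ValueError("A must be positive for the infinite-horizon limit")
    pi_star = lam / (gamma * sigma)    # fraction of wealth in the stock
    slope = pi_star / (1.0 - pi_star)  # S_1 / S_0 (undefined if pi_star == 1)
    return A, pi_star, slope


# Illustrative (assumed) parameter values.
A, pi_star, slope = merton_infinite_horizon(
    mu=0.08, sigma=0.2, r=0.03, delta=0.05, gamma=2.0
)
print(A, pi_star, slope)
```

With these assumed inputs the investor holds 62.5% of wealth in the stock, so the Merton line has slope $0.625/0.375 = 5/3$ in the $(S_0, S_1)$-plane.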
Let us turn to the case with transaction costs. Here is the first result:
Theorem 18.1. The value function $J(x,y)$ has the following properties:
(a) $J$ is concave, i.e., for $\theta \in [0,1]$
\[ J\left(\theta x_1 + [1-\theta]x_2,\ \theta y_1 + [1-\theta]y_2\right) \geq \theta J(x_1,y_1) + [1-\theta]J(x_2,y_2). \]
(b) $J$ is homogeneous of degree $1-\gamma$, i.e., for $k > 0$
\[ J(kx,ky) = k^{1-\gamma}J(x,y). \]
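Proof. (a) For any variable or process $\xi$, define $\xi^\theta = \theta\xi_1 + [1-\theta]\xi_2$, where $\xi_i$ is associated with initial conditions $(x_i, y_i)$, $i = 1, 2$. Let $a = (c, L, U)$ denote the control process. Then
\[
\begin{aligned}
J(x^\theta, y^\theta) &= \sup_{a \in \mathcal{U}(x^\theta,y^\theta)} E_{x^\theta,y^\theta}\left[\int_0^\infty e^{-\delta t}\frac{1}{1-\gamma}c_t^{1-\gamma}\,dt\right] \\
&\geq \sup_{a^\theta \in \mathcal{U}(x^\theta,y^\theta)} E_{x^\theta,y^\theta}\left[\int_0^\infty e^{-\delta t}\frac{1}{1-\gamma}\left(\theta c_{1t} + [1-\theta]c_{2t}\right)^{1-\gamma}dt\right] \\
&\geq \sup_{\theta a_1 + [1-\theta]a_2 \in \mathcal{U}(x^\theta,y^\theta)} E_{x^\theta,y^\theta}\left[\int_0^\infty e^{-\delta t}\left(\theta\frac{1}{1-\gamma}c_{1t}^{1-\gamma} + [1-\theta]\frac{1}{1-\gamma}c_{2t}^{1-\gamma}\right)dt\right] \\
&= \theta \sup_{a_1 \in \mathcal{U}(x_1,y_1)} E_{x_1,y_1}\left[\int_0^\infty e^{-\delta t}\frac{1}{1-\gamma}c_{1t}^{1-\gamma}\,dt\right] + [1-\theta] \sup_{a_2 \in \mathcal{U}(x_2,y_2)} E_{x_2,y_2}\left[\int_0^\infty e^{-\delta t}\frac{1}{1-\gamma}c_{2t}^{1-\gamma}\,dt\right] \\
&= \theta J(x_1,y_1) + [1-\theta]J(x_2,y_2),
\end{aligned}
\]
where the first inequality holds due to the restriction to controls of the form $a^\theta$ instead of the general controls $a$, and the second inequality is due to the concavity of the power utility function.
(b) It is clear from the dynamics of $S_0$ and $S_1$ and the form of the solvency region that
\[ (c, L, U) \in \mathcal{U}(x,y) \quad\Leftrightarrow\quad (kc, kL, kU) \in \mathcal{U}(kx,ky). \]
Therefore
\[ J(kx,ky) = \sup_{(c,L,U)\in\mathcal{U}(x,y)} E_{x,y}\left[\int_0^\infty e^{-\delta t}\frac{1}{1-\gamma}(kc_t)^{1-\gamma}\,dt\right] = k^{1-\gamma}J(x,y). \]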
Of course, it follows from (b) that
\[ J(x,y) = k^{\gamma-1}J(kx,ky) \]
for any $k > 0$. Consequently,
\[ J_x(x,y) \equiv \frac{\partial J}{\partial x}(x,y) = \frac{\partial}{\partial x}\left(k^{\gamma-1}J(kx,ky)\right) = k^{\gamma}J_x(kx,ky) \]
and, similarly,
\[ J_y(x,y) \equiv \frac{\partial J}{\partial y}(x,y) = k^{\gamma}J_y(kx,ky). \]
It follows that
\[ \frac{J_y(kx,ky)}{J_x(kx,ky)} = \frac{J_y(x,y)}{J_x(x,y)} \]
for all $k > 0$. In other words, the ratio of the derivatives $J_y/J_x$ is constant along any straight line through the origin.
To derive and understand the optimal strategies, it is useful to apply some heuristic arguments by assuming that the trading strategies are of the form
\[ L_t = \int_0^t l_s\,ds, \qquad U_t = \int_0^t u_s\,ds; \qquad l_s, u_s \in [0,K] \]
for some constant $K$. In particular, $dL_t = l_t\,dt$ and $dU_t = u_t\,dt$. The HJB equation is then
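\[
\begin{aligned}
\delta J(x,y) &= \sup_{c \geq 0,\ l \in [0,K],\ u \in [0,K]} \left\{\frac{1}{1-\gamma}c^{1-\gamma} + J_x\left[rx - c - (1+a)l + (1-b)u\right] + J_y\left[\mu y + l - u\right] + \frac{1}{2}J_{yy}\sigma^2y^2\right\} \\
&= \sup_{c \geq 0}\left\{\frac{1}{1-\gamma}c^{1-\gamma} - cJ_x\right\} + \sup_{l \in [0,K]}\left\{\left(J_y - (1+a)J_x\right)l\right\} + \sup_{u \in [0,K]}\left\{\left((1-b)J_x - J_y\right)u\right\} \\
&\qquad + rxJ_x + \mu yJ_y + \frac{1}{2}\sigma^2y^2J_{yy}.
\end{aligned}
\]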
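The first-order conditions imply
\[
l = \begin{cases} K, & \text{if } J_y \geq (1+a)J_x, \\ 0, & \text{otherwise,} \end{cases}
\qquad
u = \begin{cases} 0, & \text{if } J_y > (1-b)J_x, \\ K, & \text{otherwise.} \end{cases}
\]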
Intuitively, purchasing stocks with a total listed price of one unit of account leads to an increase in utility equal to $J_y - (1+a)J_x$. As long as this is positive, it is optimal to purchase more stocks. So the optimal strategy can be described in the following way:
$J_y \geq (1+a)J_x$: buy stocks
$(1+a)J_x > J_y > (1-b)J_x$: do not trade stocks
$J_y \leq (1-b)J_x$: sell stocks.
This divides the solvency region into three regions: a buying region, a no trade region, and a selling region. The boundary $B$ between the buying region and the no trade region is the set of points $(x,y)$ for which $J_y(x,y) = (1+a)J_x(x,y)$, i.e., $J_y(x,y)/J_x(x,y) = 1+a$. According to our analysis above, these points form a straight line in the $(x,y)$-plane through the origin. Let the slope of this line be denoted by $1/\omega_B$. The boundary $S$ between the selling region and the no trade region is the set of points for which $J_y(x,y) = (1-b)J_x(x,y)$, i.e., $J_y(x,y)/J_x(x,y) = 1-b$, which again is true for points along a straight line through the origin. Denote the slope of this line by $1/\omega_S$. The $S$ line is steeper than the $B$ line, so we have $\omega_B > \omega_S$. The no trade region is a wedge in the $(x,y)$-plane bounded by the $S$ and $B$ lines.
In the selling region it is optimal to sell exactly the number of stocks needed to move to the selling boundary $S$. Similarly, in the buying region it is optimal to buy the number of stocks needed to move to the buying boundary $B$. If the initial holdings $(x,y)$ fall in the selling region or in the buying region, there will thus be an initial transaction to the nearest boundary. After that $(S_{0t}, S_{1t})$ will stay in the no trade region or on the boundaries $S$ and $B$. Even when no trades are made, the investments $(S_{0t}, S_{1t})$ will move around as the stock price moves. As soon as the selling boundary is reached, enough stocks must be sold so that $(S_{0t}, S_{1t})$ does not move beyond the boundary $S$ and into the interior of the selling region. Similarly when the buying boundary is reached from inside the no trade region. After a potential initial trade, we will have
\[ \frac{1}{\omega_B} \leq \frac{S_{1t}}{S_{0t}} \leq \frac{1}{\omega_S}. \]
The fraction of wealth invested in the stock is $\pi_t = S_{1t}/(S_{0t}+S_{1t})$, which will then satisfy
\[ \frac{1}{1+\omega_B} \leq \pi_t \leq \frac{1}{1+\omega_S}. \]
In the case without transaction costs, transactions are made continuously to keep $\pi_t$ constant. With transaction costs that strategy would be infinitely costly, and the solution shows that it is optimal to allow $\pi_t$ to vary in an interval without making any transactions. Under some, apparently reasonable, conditions, the Merton portfolio weight $\pi^*$ will fall in the interval between $1/(1+\omega_B)$ and $1/(1+\omega_S)$, cf. Davis and Norman (1990). Intuitively, the investor will allow some deviation from the Merton weight before trading to save on transaction costs. There are cases, however, in which the Merton weight is outside the interval, cf. Shreve and Soner (1994).
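The trading rule described above can be sketched in a few lines. Write $\omega = S_0/S_1$ for the ratio of the bank balance to the stock value, and let $\omega_S < \omega_B$ denote the values of that ratio on the selling and buying boundaries (the reciprocals of the two slopes). The boundary values passed in below are hypothetical inputs; in practice they come out of the numerical solution of the free-boundary problem. The function name is likewise an assumption made for illustration, and the sketch only covers a long stock position.

```python
def rebalance(x, y, omega_s, omega_b, a, b):
    """Move (bank balance x, stock value y) to the nearest boundary if needed.

    Returns (action, trade, x_new, y_new), where trade is the listed value
    of the stock bought or sold. Assumes y > 0 and 0 < omega_s < omega_b.
    """
    if y <= 0:
        raise ValueError("this sketch covers long stock positions only")
    omega = x / y
    if omega < omega_s:
        # Selling region: selling `trade` adds (1 - b) * trade to the bank.
        trade = (omega_s * y - x) / (1.0 - b + omega_s)
        return "sell", trade, x + (1.0 - b) * trade, y - trade
    if omega > omega_b:
        # Buying region: buying `trade` costs (1 + a) * trade from the bank.
        trade = (x - omega_b * y) / (1.0 + a + omega_b)
        return "buy", trade, x - (1.0 + a) * trade, y + trade
    return "hold", 0.0, x, y


# Hypothetical boundaries and cost rates, for illustration only.
sell_case = rebalance(10.0, 90.0, omega_s=0.5, omega_b=2.0, a=0.01, b=0.01)
buy_case = rebalance(90.0, 10.0, omega_s=0.5, omega_b=2.0, a=0.01, b=0.01)
hold_case = rebalance(50.0, 50.0, omega_s=0.5, omega_b=2.0, a=0.01, b=0.01)
```

The trade sizes follow from requiring that the post-trade ratio equals the boundary value, e.g. $(x + (1-b)\Delta)/(y - \Delta) = \omega_S$ for a sale of $\Delta$.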
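Inside the no trade region, the HJB equation simplifies to
\[
\begin{aligned}
\delta J &= \sup_{c \geq 0}\left\{\frac{1}{1-\gamma}c^{1-\gamma} - cJ_x\right\} + rxJ_x + \mu yJ_y + \frac{1}{2}\sigma^2y^2J_{yy} \\
&= \frac{\gamma}{1-\gamma}J_x^{1-\frac{1}{\gamma}} + rxJ_x + \mu yJ_y + \frac{1}{2}\sigma^2y^2J_{yy}.
\end{aligned}
\]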
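The maximization over consumption in the no trade region can be written out explicitly. The following short derivation is added for convenience and uses only the power utility specification from the model set-up, with $\gamma$ the relative risk aversion:

```latex
\[
\frac{\partial}{\partial c}\left(\frac{1}{1-\gamma}c^{1-\gamma} - cJ_x\right)
  = c^{-\gamma} - J_x = 0
  \quad\Longrightarrow\quad
  c^* = J_x^{-1/\gamma},
\]
\[
\frac{1}{1-\gamma}(c^*)^{1-\gamma} - c^*J_x
  = \left(\frac{1}{1-\gamma} - 1\right)J_x^{1-\frac{1}{\gamma}}
  = \frac{\gamma}{1-\gamma}J_x^{1-\frac{1}{\gamma}}.
\]
```

The objective is concave in $c$, so the first-order condition does identify the maximum.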
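We can reduce the dimensionality of this partial differential equation by exploiting the homogeneity of the value function, since
\[ J\left(\frac{x}{y}, 1\right) = \left(\frac{1}{y}\right)^{1-\gamma}J(x,y) \quad\Rightarrow\quad J(x,y) = y^{1-\gamma}J\left(\frac{x}{y}, 1\right) \equiv y^{1-\gamma}\psi\left(\frac{x}{y}\right). \]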
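We thus have that
\[
\begin{aligned}
J_x &= y^{-\gamma}\psi'\left(\frac{x}{y}\right), \\
J_y &= (1-\gamma)y^{-\gamma}\psi\left(\frac{x}{y}\right) - xy^{-\gamma-1}\psi'\left(\frac{x}{y}\right), \\
J_{yy} &= -\gamma(1-\gamma)y^{-\gamma-1}\psi\left(\frac{x}{y}\right) + 2\gamma xy^{-\gamma-2}\psi'\left(\frac{x}{y}\right) + x^2y^{-\gamma-3}\psi''\left(\frac{x}{y}\right).
\end{aligned}
\]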
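Substituting into the HJB equation, we arrive at an ordinary differential equation for $\psi$:
\[
\frac{1}{2}\sigma^2\omega^2\psi''(\omega) + \left(r - \mu + \gamma\sigma^2\right)\omega\psi'(\omega) - \left(\delta + (\gamma-1)\mu - \frac{1}{2}\gamma\sigma^2(\gamma-1)\right)\psi(\omega) + \frac{\gamma}{1-\gamma}\psi'(\omega)^{1-\frac{1}{\gamma}} = 0, \qquad \omega \in [\omega_S, \omega_B],
\]
where $\omega = x/y$.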
In the selling region, we must have $J(x,y)$ constant along any line of slope $-1/(1-b)$, so that $J(x,y) = F(x + [1-b]y)$ for some function $F$. Then $J_x = F'$ and $J_y = (1-b)F'$ so that $J_y = (1-b)J_x$. Inserting the above expressions for $J_x$ and $J_y$, we see that
\[ \psi'(\omega)(\omega + 1 - b) = (1-\gamma)\psi(\omega), \]
which is satisfied by
\[ \psi(\omega) = A\frac{1}{1-\gamma}(\omega + 1 - b)^{1-\gamma} \]
for a constant $A$. Hence, $J(x,y) = y^{1-\gamma}\psi(x/y) = A\frac{1}{1-\gamma}(x + [1-b]y)^{1-\gamma}$. Using similar arguments, it can be shown that
\[ \psi(\omega) = B\frac{1}{1-\gamma}(\omega + 1 + a)^{1-\gamma} \]
for some constant $B$ in the buying region, i.e., $J(x,y) = B\frac{1}{1-\gamma}(x + [1+a]y)^{1-\gamma}$.
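To sum up, in order to obtain the full solution to the problem we have to find constants $\omega_B$, $\omega_S$, $A$, $B$ and a function $\psi$ so that
\[
\begin{aligned}
&\frac{1}{2}\sigma^2\omega^2\psi''(\omega) + \left(r - \mu + \gamma\sigma^2\right)\omega\psi'(\omega) + \frac{\gamma}{1-\gamma}\psi'(\omega)^{1-\frac{1}{\gamma}} - \left(\delta + (\gamma-1)\mu - \frac{1}{2}\gamma\sigma^2(\gamma-1)\right)\psi(\omega) = 0, && \omega \in [\omega_S, \omega_B], \\
&\psi(\omega) = A\frac{1}{1-\gamma}(\omega + 1 - b)^{1-\gamma}, && \omega \leq \omega_S, \\
&\psi(\omega) = B\frac{1}{1-\gamma}(\omega + 1 + a)^{1-\gamma}, && \omega \geq \omega_B.
\end{aligned}
\]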
Theorem 4.2 in Davis and Norman (1990) shows that (under a technical condition) a solution to this problem will lead to the optimal strategies as described above. The optimal consumption rate will be
\[ c_t^* = S_{1t}\left(\psi'(S_{0t}/S_{1t})\right)^{-1/\gamma}. \]
Theorem 5.1 in Davis and Norman (1990) confirms that a solution to the problem exists. At the boundaries, we have the so-called value-matching conditions
\[ \psi(\omega_S) = A\frac{1}{1-\gamma}(\omega_S + 1 - b)^{1-\gamma}, \qquad \psi(\omega_B) = B\frac{1}{1-\gamma}(\omega_B + 1 + a)^{1-\gamma}. \]
The so-called smooth-pasting conditions ensure that the derivative of $\psi$ at $\omega_S$ is the same from the left and from the right, and likewise at $\omega_B$. Therefore
\[ \psi'(\omega_S) = A(\omega_S + 1 - b)^{-\gamma}, \qquad \psi'(\omega_B) = B(\omega_B + 1 + a)^{-\gamma}. \]
Numerical solution techniques are required!
Relevant extensions:
• finite time horizon: Gennotte and Jung (1994), Cvitanić and Karatzas (1996), Liu and Loewenstein (2002)
• proportional and fixed transaction costs: Øksendal and Sulem (2002)
• multiple risky assets: Akian, Menaldi, and Sulem (1996), Liu (2004), Muthuraman and Kumar (2006), Lynch and Tan (2010)
• predictable stock returns: Balduzzi and Lynch (1999), Lynch and Tan (2010)
• costs of trading durable consumption goods: Grossman and Laroque (1990), Cuoco and Liu (2000), Damgaard, Fuglsbjerg, and Munk (2003)
Further references: Taksar, Klass, and Assaf (1988), Duffie and Sun (1990), Dumas and Luciano (1991), Korn (1997), Framstad, Øksendal, and Sulem (2001), Chellathurai and Draviam (2007).
APPENDIX A
Results on the lognormal distribution
A random variable $Y$ is said to be lognormally distributed if the random variable $X = \ln Y$ is normally distributed. In the following we let $m$ be the mean of $X$ and $s^2$ be the variance of $X$, so that
\[ X = \ln Y \sim N(m, s^2). \]
The probability density function for $X$ is given by
\[ f_X(x) = \frac{1}{\sqrt{2\pi s^2}}\exp\left\{-\frac{(x-m)^2}{2s^2}\right\}, \qquad x \in \mathbb{R}. \]
Theorem A.1. The probability density function for $Y$ is given by
\[ f_Y(y) = \frac{1}{\sqrt{2\pi s^2}\,y}\exp\left\{-\frac{(\ln y - m)^2}{2s^2}\right\}, \qquad y > 0, \]
and $f_Y(y) = 0$ for $y \leq 0$.
This result follows from the general result on the distribution of a random variable which is given as a function of another random variable; see any introductory textbook on probability theory and distributions.
Theorem A.2. For $X \sim N(m, s^2)$ and $\gamma \in \mathbb{R}$ we have
\[ E\left[e^{-\gamma X}\right] = \exp\left\{-\gamma m + \frac{1}{2}\gamma^2s^2\right\}. \]
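Proof. By definition we have
\[ E\left[e^{-\gamma X}\right] = \int_{-\infty}^{+\infty} e^{-\gamma x}\frac{1}{\sqrt{2\pi s^2}}e^{-\frac{(x-m)^2}{2s^2}}\,dx. \]
Manipulating the exponent we get
\[
\begin{aligned}
E\left[e^{-\gamma X}\right] &= e^{-\gamma m + \frac{1}{2}\gamma^2s^2}\int_{-\infty}^{+\infty}\frac{1}{\sqrt{2\pi s^2}}e^{-\frac{1}{2s^2}\left[(x-m)^2 + 2\gamma(x-m)s^2 + \gamma^2s^4\right]}\,dx \\
&= e^{-\gamma m + \frac{1}{2}\gamma^2s^2}\int_{-\infty}^{+\infty}\frac{1}{\sqrt{2\pi s^2}}e^{-\frac{(x-[m-\gamma s^2])^2}{2s^2}}\,dx \\
&= e^{-\gamma m + \frac{1}{2}\gamma^2s^2},
\end{aligned}
\]
where the last equality is due to the fact that the function
\[ x \mapsto \frac{1}{\sqrt{2\pi s^2}}e^{-\frac{(x-[m-\gamma s^2])^2}{2s^2}} \]
is a probability density function, namely the density function for an $N(m - \gamma s^2, s^2)$ distributed random variable.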
Using this theorem, we can easily compute the mean and the variance of the lognormally distributed random variable $Y = e^X$. The mean is (let $\gamma = -1$)
\[ E[Y] = E\left[e^X\right] = \exp\left\{m + \frac{1}{2}s^2\right\}. \]
With $\gamma = -2$ we get
\[ E\left[Y^2\right] = E\left[e^{2X}\right] = e^{2(m+s^2)}, \]
so that the variance of $Y$ is
\[ \mathrm{Var}[Y] = E\left[Y^2\right] - (E[Y])^2 = e^{2(m+s^2)} - e^{2m+s^2} = e^{2m+s^2}\left(e^{s^2} - 1\right). \]
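As a sanity check, the closed-form mean and variance can be compared with a direct numerical integration against the normal density of $X$. The sketch below is illustrative only: the helper names, the midpoint rule, and the values $m = 0.1$, $s = 0.3$ are assumptions, not part of the original notes.

```python
import math


def lognormal_mean_var(m, s):
    """Closed-form mean and variance of Y = exp(X), X ~ N(m, s^2)."""
    mean = math.exp(m + 0.5 * s**2)
    var = math.exp(2.0 * m + s**2) * (math.exp(s**2) - 1.0)
    return mean, var


def exp_moment_quadrature(m, s, power, n=20000, width=12.0):
    """Midpoint-rule approximation of E[exp(power * X)], X ~ N(m, s^2)."""
    lo = m - width * s
    h = 2.0 * width * s / n
    total = 0.0
    for i in range(n):
        x = lo + (i + 0.5) * h
        density = math.exp(-(x - m) ** 2 / (2.0 * s**2)) / math.sqrt(2.0 * math.pi * s**2)
        total += math.exp(power * x) * density * h
    return total


m, s = 0.1, 0.3  # assumed illustrative values
mean, var = lognormal_mean_var(m, s)
q1 = exp_moment_quadrature(m, s, power=1)  # approximates E[Y]
q2 = exp_moment_quadrature(m, s, power=2)  # approximates E[Y^2]
print(mean, q1)
print(var, q2 - q1**2)
```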
The next theorem provides an expression for the truncated mean of a lognormally distributed random variable, i.e., the mean of the part of the distribution that lies above some level. We define the indicator variable $1_{\{Y>K\}}$ to be equal to 1 if the outcome of the random variable $Y$ is greater than the constant $K$ and equal to 0 otherwise.
Theorem A.3. If $X = \ln Y \sim N(m, s^2)$ and $K > 0$, then we have
\[ E\left[Y1_{\{Y>K\}}\right] = e^{m+\frac{1}{2}s^2}N\left(\frac{m - \ln K}{s} + s\right) = E[Y]\,N\left(\frac{m - \ln K}{s} + s\right). \]
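Proof. Because $Y > K \Leftrightarrow X > \ln K$, it follows from the definition of the expectation of a random variable that
\[
\begin{aligned}
E\left[Y1_{\{Y>K\}}\right] &= E\left[e^X1_{\{X>\ln K\}}\right] \\
&= \int_{\ln K}^{+\infty} e^x\frac{1}{\sqrt{2\pi s^2}}e^{-\frac{(x-m)^2}{2s^2}}\,dx \\
&= \int_{\ln K}^{+\infty}\frac{1}{\sqrt{2\pi s^2}}e^{-\frac{(x-[m+s^2])^2}{2s^2}}e^{\frac{2ms^2+s^4}{2s^2}}\,dx \\
&= e^{m+\frac{1}{2}s^2}\int_{\ln K}^{+\infty} f_{\bar{X}}(x)\,dx,
\end{aligned}
\]
where
\[ f_{\bar{X}}(x) = \frac{1}{\sqrt{2\pi s^2}}e^{-\frac{(x-[m+s^2])^2}{2s^2}} \]
is the probability density function for an $N(m+s^2, s^2)$ distributed random variable $\bar{X}$. The calculations
\[
\begin{aligned}
\int_{\ln K}^{+\infty} f_{\bar{X}}(x)\,dx &= \mathrm{Prob}(\bar{X} > \ln K) \\
&= \mathrm{Prob}\left(\frac{\bar{X} - [m+s^2]}{s} > \frac{\ln K - [m+s^2]}{s}\right) \\
&= \mathrm{Prob}\left(\frac{\bar{X} - [m+s^2]}{s} < -\frac{\ln K - [m+s^2]}{s}\right) \\
&= N\left(-\frac{\ln K - [m+s^2]}{s}\right) \\
&= N\left(\frac{m - \ln K}{s} + s\right)
\end{aligned}
\]
complete the proof.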


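Theorem A.4. If $X = \ln Y \sim N(m, s^2)$ and $K > 0$, we have
\[
\begin{aligned}
E[\max(0, Y - K)] &= e^{m+\frac{1}{2}s^2}N\left(\frac{m - \ln K}{s} + s\right) - KN\left(\frac{m - \ln K}{s}\right) \\
&= E[Y]\,N\left(\frac{m - \ln K}{s} + s\right) - KN\left(\frac{m - \ln K}{s}\right).
\end{aligned}
\]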
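Proof. Note that
\[ E[\max(0, Y - K)] = E\left[(Y - K)1_{\{Y>K\}}\right] = E\left[Y1_{\{Y>K\}}\right] - K\,\mathrm{Prob}(Y > K). \]
The first term is known from Theorem A.3. The second term can be rewritten as
\[
\begin{aligned}
\mathrm{Prob}(Y > K) &= \mathrm{Prob}(X > \ln K) \\
&= \mathrm{Prob}\left(\frac{X - m}{s} > \frac{\ln K - m}{s}\right) \\
&= \mathrm{Prob}\left(\frac{X - m}{s} < -\frac{\ln K - m}{s}\right) \\
&= N\left(-\frac{\ln K - m}{s}\right) \\
&= N\left(\frac{m - \ln K}{s}\right).
\end{aligned}
\]
The claim now follows immediately.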
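The formulas in Theorems A.3 and A.4 are straightforward to implement, with the standard normal distribution function $N(\cdot)$ expressed via the error function. The following sketch uses assumed helper names and illustrative parameter values, and cross-checks Theorem A.4 against a simple midpoint-rule integration over $x > \ln K$.

```python
import math


def norm_cdf(z):
    """Standard normal cumulative distribution function N(z)."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def truncated_mean(m, s, K):
    """E[Y 1{Y > K}] for ln Y ~ N(m, s^2), as in Theorem A.3."""
    d = (m - math.log(K)) / s
    return math.exp(m + 0.5 * s**2) * norm_cdf(d + s)


def expected_excess(m, s, K):
    """E[max(0, Y - K)] for ln Y ~ N(m, s^2), as in Theorem A.4."""
    d = (m - math.log(K)) / s
    return truncated_mean(m, s, K) - K * norm_cdf(d)


def expected_excess_quadrature(m, s, K, n=20000, width=12.0):
    """Midpoint-rule check: integrate (e^x - K) f_X(x) over x > ln K."""
    lo, hi = math.log(K), m + width * s
    h = (hi - lo) / n
    total = 0.0
    for i in range(n):
        x = lo + (i + 0.5) * h
        density = math.exp(-(x - m) ** 2 / (2.0 * s**2)) / math.sqrt(2.0 * math.pi * s**2)
        total += (math.exp(x) - K) * density * h
    return total


m, s, K = 0.0, 0.25, 1.1  # assumed illustrative values
closed_form = expected_excess(m, s, K)
numerical = expected_excess_quadrature(m, s, K)
print(closed_form, numerical)
```

The quadrature assumes that $\ln K$ lies below $m + 12s$, which holds for any reasonable truncation level.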
APPENDIX B
Stochastic processes and stochastic calculus
B.1 Introduction
Most interest rates and asset prices vary over time in a non-deterministic way. We can observe the price of a given asset today, but the price of the same asset at any future point in time will typically be unknown, i.e., a random variable. In order to describe the uncertain evolution in the price of the asset over time, we need a collection of random variables, namely one random variable for each point in time. Such a collection of random variables is called a stochastic process. Modern finance models therefore apply stochastic processes to represent the evolution in prices and rates over time.
This chapter gives an introduction to stochastic processes and the mathematical tools needed to do calculations with stochastic processes, the so-called stochastic calculus. We will omit many technical details that are not important for a reasonable level of understanding and focus on processes. For more details and proofs, the reader is referred to textbooks on stochastic processes such as, for example, Øksendal (2003) and Karatzas and Shreve (1988), and to more extensive and formal introductions to stochastic processes in the mathematical finance textbooks of Dothan (1990), Duffie (2001), and Björk (2009).
The outline of the remainder of the chapter is as follows. In Section B.2 we define the concept of a stochastic process more formally and introduce much of the terminology used. We define a particular process, the so-called Brownian motion, in Section B.3. This will be the basic building block in the definition of other processes. In Section B.4 we introduce the class of diffusion processes, which contains most of the processes used in popular fixed income models. Section B.5 gives a short introduction to the more general class of Itô processes. Both diffusions and Itô processes involve stochastic integrals, which are discussed in Section B.6. In Section B.7 we state the very important Itô's Lemma, which is frequently applied when handling stochastic processes. Three diffusions that are widely used in finance models are introduced and studied in Section B.8. Section B.9 discusses multi-dimensional processes. Finally, Section B.10 explains the change of probability measure which is often used in financial models.
B.2 What is a stochastic process?
B.2.1 Probability spaces and information filtrations
The basic object for studies of uncertain events is a probability space, which is a triple $(\Omega, \mathcal{F}, P)$. Let us look at each of the three elements.
$\Omega$ is the state space, which is the set of possible states or outcomes of the uncertain object. For example, if one studies the outcome of a throw of a dice (meaning the number of eyes on top of the dice), the state space is $\Omega = \{1, 2, 3, 4, 5, 6\}$. In our finance models an outcome is a realization of all relevant uncertain objects over the entire time interval studied in the model. Only one outcome, the true outcome, will be realized.
$\mathcal{F}$ is the set of events to which a probability can be assigned, i.e., the set of probabilizable events. Here, an event is a set of possible outcomes, i.e., a subset of the state space. In the example with the dice, some events are $\{1, 2, 3\}$, $\{4, 5\}$, $\{1, 3, 5\}$, $\{6\}$, and $\{1, 2, 3, 4, 5, 6\}$. In a finance model an event is some set of realizations of the uncertain object. For example, in a model of the uncertain dynamics of a given asset price over a period of 10 years, one event is that the asset price one year into the future is above 100. Since $\mathcal{F}$ is a set of events, it is really a set of subsets of the state space. It is required that
(i) the entire state space can be assigned a probability, i.e., $\Omega \in \mathcal{F}$;
(ii) if some event $F \subseteq \Omega$ can be assigned a probability, so can its complement $F^c \equiv \Omega \setminus F$, i.e., $F \in \mathcal{F} \Rightarrow F^c \in \mathcal{F}$; and
(iii) given a sequence of probabilizable events, the union is also probabilizable, i.e., $F_1, F_2, \dots \in \mathcal{F} \Rightarrow \bigcup_{i=1}^{\infty} F_i \in \mathcal{F}$.
Often $\mathcal{F}$ is referred to as a sigma-algebra.
$P$ is a probability measure, which formally is a function from the sigma-algebra $\mathcal{F}$ into the interval $[0, 1]$. To each event $F \in \mathcal{F}$, the probability measure assigns a number $P(F)$ in the interval $[0, 1]$. This number is called the $P$-probability (or simply the probability) of $F$. A probability measure must satisfy the following conditions:
(i) $P(\Omega) = 1$ and $P(\emptyset) = 0$, where $\emptyset$ denotes the empty set;
(ii) the probability of the state being in the union of disjoint sets is equal to the sum of the probabilities for each of the sets, i.e., given $F_1, F_2, \dots \in \mathcal{F}$ with $F_i \cap F_j = \emptyset$ for all $i \neq j$, we have $P\left(\bigcup_{i=1}^{\infty} F_i\right) = \sum_{i=1}^{\infty} P(F_i)$.
Many different probability measures can be defined on the same sigma-algebra, $\mathcal{F}$, of events. In the example of the dice, a probability measure $P$ corresponding to the idea that the dice is fair is defined by
\[ P(\{1\}) = P(\{2\}) = \dots = P(\{6\}) = 1/6. \]
Another probability measure, $Q$, can be defined by
\[ Q(\{1\}) = 1/12, \qquad Q(\{2\}) = \dots = Q(\{5\}) = 1/6, \qquad Q(\{6\}) = 3/12, \]
which may be appropriate if the dice is believed to be unfair in a particular way.
Two probability measures $P$ and $Q$ defined on the same state space $\Omega$ and sigma-algebra $\mathcal{F}$ are called equivalent if the two measures assign probability zero to exactly the same events, i.e., if $P(A) = 0 \Leftrightarrow Q(A) = 0$. The two probability measures in the dice example are equivalent. In the stochastic models of financial markets switching between equivalent probability measures turns out to be important.
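For a finite state space the two dice measures and their equivalence can be checked by brute force. The following sketch (the helper names are assumptions made for illustration) uses exact fractions and enumerates all $2^6$ events.

```python
from fractions import Fraction
from itertools import combinations

OMEGA = (1, 2, 3, 4, 5, 6)

# The fair-dice measure P and the alternative measure Q from the text.
P = {w: Fraction(1, 6) for w in OMEGA}
Q = {1: Fraction(1, 12), 2: Fraction(1, 6), 3: Fraction(1, 6),
     4: Fraction(1, 6), 5: Fraction(1, 6), 6: Fraction(3, 12)}


def prob(measure, event):
    """Probability of an event (a set of outcomes) under the given measure."""
    return sum((measure[w] for w in event), Fraction(0))


def all_events(omega):
    """All subsets of a finite state space, i.e., the largest sigma-algebra."""
    return [frozenset(c) for k in range(len(omega) + 1)
            for c in combinations(omega, k)]


def equivalent(p, q, omega):
    """True if p and q assign probability zero to exactly the same events."""
    return all((prob(p, e) == 0) == (prob(q, e) == 0) for e in all_events(omega))


print(prob(P, OMEGA), prob(Q, OMEGA), equivalent(P, Q, OMEGA))
```

Both measures give every single outcome a strictly positive probability, so the only null event under either measure is the empty set.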
In our models of the uncertain evolution of financial markets, the uncertainty is resolved gradually over time. At each date we can observe values of prices and rates that were previously uncertain so we learn more and more about the true outcome. We need to keep track of the information flow. Let us again consider the throw of a dice so that the state space is $\Omega = \{1, 2, 3, 4, 5, 6\}$ and the set $\mathcal{F}$ of probabilizable events consists of all subsets of $\Omega$. Suppose now that the outcome of the throw of the dice is not resolved at once, but sequentially. In the beginning, at time 0, we know nothing about the true outcome so it can be any element in $\Omega$. Then, at time 1, you will be told that the outcome is either in the set $\{1, 2\}$, in the set $\{3, 4, 5\}$, or in the set $\{6\}$. Of course, in the latter case you will know exactly the true outcome, but in the first two cases there is still uncertainty about the true outcome. Later on, at time 2, the true outcome will be announced.
We can represent the information available at a given point in time by a partition of $\Omega$. By a partition of a given set, we simply mean a collection of disjoint subsets of $\Omega$ so that the union of these subsets equals the entire set $\Omega$. At time 0, we only know that one of the six elements in $\Omega$ will be realized. This corresponds to the (trivial) partition $F_0 = \{\Omega\}$. The information at time 1 can be represented by the partition
\[ F_1 = \left\{\{1, 2\}, \{3, 4, 5\}, \{6\}\right\}. \]
At time 2 we know exactly the true outcome, corresponding to the partition
\[ F_2 = \left\{\{1\}, \{2\}, \{3\}, \{4\}, \{5\}, \{6\}\right\}. \]
As time passes we receive more and more information about the true path. This is reflected by the fact that the partitions become finer and finer in the sense that every set in $F_1$ is a subset of some set in $F_0$ and every set in $F_2$ is a subset of some set in $F_1$. The information flow in this simple example can then be represented by the sequence $(F_0, F_1, F_2)$ of partitions of $\Omega$. In more general models, the information flow can be represented by a sequence $(F_t)_{t \in \mathcal{T}}$ of partitions, where $\mathcal{T}$ is the set of relevant points in time in the model. Each $F_t$ consists of disjoint events and the interpretation is that at time $t$ we will know which of these events the true outcome belongs to. The fact that we learn more and more about the true outcome implies that the partitions will be increasingly fine meaning that, for $u > t$, every element in $F_t$ is a union of elements in $F_u$.
An alternative way of representing the information flow is in terms of an information filtration. Given a partition $F_t$ of $\Omega$, we can define $\mathcal{F}_t$ as the set of all unions of sets in $F_t$, including the empty union, i.e., the empty set $\emptyset$. Where $F_t$ contains the disjoint decidable events at time $t$, $\mathcal{F}_t$ contains all decidable events at time $t$. Each $\mathcal{F}_t$ is a sigma-algebra. For our example above we get
\[ \mathcal{F}_0 = \{\emptyset, \Omega\}, \]
\[ \mathcal{F}_1 = \left\{\emptyset, \{1, 2\}, \{3, 4, 5\}, \{6\}, \{1, 2, 3, 4, 5\}, \{1, 2, 6\}, \{3, 4, 5, 6\}, \Omega\right\}, \]
whereas $\mathcal{F}_2$ becomes the collection of all possible subsets of $\Omega$. The sequence $\mathbb{F} = (\mathcal{F}_0, \mathcal{F}_1, \mathcal{F}_2)$ is called an information filtration. In models involving the set $\mathcal{T}$ of points in time, the information filtration is written as $\mathbb{F} = (\mathcal{F}_t)_{t \in \mathcal{T}}$. We will always assume that the time 0 information is trivial, corresponding to $\mathcal{F}_0 = \{\emptyset, \Omega\}$, and that all uncertainty is resolved at or before some final date $T$ so that $\mathcal{F}_T$ is equal to the set $\mathcal{F}$ of all probabilizable events. The fact that we accumulate information dictates that $\mathcal{F}_t \subseteq \mathcal{F}_{t'}$ whenever $t < t'$, i.e., every set in $\mathcal{F}_t$ is also in $\mathcal{F}_{t'}$.
Above we constructed an information filtration from a sequence of partitions. We can also go from a filtration to a sequence of partitions. In each $\mathcal{F}_t$, simply remove all sets that are unions of other sets in $\mathcal{F}_t$. Therefore there is a one-to-one relationship between information filtrations and sequences of partitions. When we go to models with an infinite state space, the information filtration representation is preferable. Hence, our formal model of uncertainty and information is a filtered probability space $(\Omega, \mathcal{F}, P, \mathbb{F})$, where $(\Omega, \mathcal{F}, P)$ is a probability space and $\mathbb{F} = (\mathcal{F}_t)_{t \in \mathcal{T}}$ is an information filtration. We will always assume that all the uncertainty is resolved over time. Hence, $\mathcal{F}_T = \mathcal{F}$ in an economy where the terminal time point is $T$. We will also assume that to begin with we know nothing about the future realizations of uncertainty, i.e., $\mathcal{F}_0$ is the trivial sigma-algebra consisting of only the full state space $\Omega$ and the empty set $\emptyset$.
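On a finite state space, the passage from a partition to the sigma-algebra it generates, and back again, can be carried out mechanically. The sketch below (function names are illustrative assumptions) builds the sigma-algebra as the set of all unions of blocks and recovers the partition as the minimal non-empty events.

```python
from itertools import combinations


def sigma_algebra(partition):
    """All unions of blocks of the partition, including the empty union."""
    blocks = [frozenset(b) for b in partition]
    events = set()
    for k in range(len(blocks) + 1):
        for combo in combinations(blocks, k):
            events.add(frozenset().union(*combo))
    return events


def atoms(events):
    """Recover the partition: non-empty events with no smaller non-empty event inside."""
    nonempty = [e for e in events if e]
    return {e for e in nonempty if not any(f < e for f in nonempty)}


# The sequential dice example from the text.
part0 = [{1, 2, 3, 4, 5, 6}]
part1 = [{1, 2}, {3, 4, 5}, {6}]
part2 = [{w} for w in range(1, 7)]

F0, F1, F2 = sigma_algebra(part0), sigma_algebra(part1), sigma_algebra(part2)
print(len(F0), len(F1), len(F2))
print(F0 <= F1 <= F2)
```

A partition with $n$ blocks generates a sigma-algebra with $2^n$ events, which is why the three sigma-algebras above have 2, 8, and 64 elements.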
It might seem frightening to have to specify a certain ltered probability space in which the
behavior of interest rates, bond prices, etc., can be studied. However, in the models we are going
to consider, the relevant ltered probability space will be implicitly dened via assumptions about
the way the key variables can evolve over time.
In our models we will often deal with expectations of random variables, e.g., the expectation of the (discounted) payoff of an asset at a future point in time. In the computation of such an expectation we should take the information currently available into account. Hence we need to consider conditional expectations. One can generally write the expectation of a random variable $X$ given the $\sigma$-algebra $\mathcal{F}_t$ as $E[X|\mathcal{F}_t]$. For our purposes the $\sigma$-algebra $\mathcal{F}_t$ will always represent the information at time $t$ and we will write $E_t[X]$ instead of $E[X|\mathcal{F}_t]$. Since we assume that the information at time 0 is trivial, conditioning on time 0 information is the same as not conditioning on any information, hence $E_0[X] = E[X]$. If we assume that all uncertainty is resolved at time $T$, we have $E_T[X] = X$. We will sometimes use the following result:
Theorem B.1 (The Law of Iterated Expectations). If $\mathcal{F}$ and $\mathcal{G}$ are two $\sigma$-algebras with $\mathcal{F} \subseteq \mathcal{G}$ and $X$ is a random variable, then $E[E[X|\mathcal{G}]\,|\,\mathcal{F}] = E[X|\mathcal{F}]$. In particular, if $(\mathcal{F}_t)_{t \in \mathcal{T}}$ is an information filtration and $t' > t$, we have
\[ E_t[E_{t'}[X]] = E_t[X]. \]
Loosely speaking, the theorem says that what you expect today of some variable that will be realized in two days is equal to what you expect today that you will expect tomorrow about the same variable. This is a very intuitive result. For a more formal statement and proof, see Øksendal (2003).
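The Law of Iterated Expectations can be verified directly in the sequential dice example, where conditioning on the time $t$ information amounts to averaging over the block of the time $t$ partition that contains the true outcome. The sketch below assumes a fair dice and takes $X$ to be the number of eyes; the helper name is an illustrative assumption.

```python
from fractions import Fraction

OUTCOMES = range(1, 7)
P = {w: Fraction(1, 6) for w in OUTCOMES}  # fair dice (assumed)

part0 = [set(OUTCOMES)]             # time 0: no information
part1 = [{1, 2}, {3, 4, 5}, {6}]    # time 1: partial information


def cond_exp(x, partition, p=P):
    """E[x | partition], returned as a dict outcome -> conditional mean."""
    result = {}
    for block in partition:
        mass = sum(p[w] for w in block)
        mean = sum(p[w] * x[w] for w in block) / mass
        for w in block:
            result[w] = mean
    return result


X = {w: Fraction(w) for w in OUTCOMES}  # the number of eyes shown
E1 = cond_exp(X, part1)                 # E_1[X]
E0_of_E1 = cond_exp(E1, part0)          # E_0[E_1[X]]
E0 = cond_exp(X, part0)                 # E_0[X]
print(E1[1], E1[4], E1[6], E0[1], E0_of_E1[1])
```

Here $E_1[X]$ equals $3/2$, $4$, or $6$ depending on which set is announced at time 1, and averaging these values with weights $2/6$, $3/6$, and $1/6$ gives back $E_0[X] = 7/2$.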
We can define conditional variances, covariances, and correlations from the conditional expectation exactly as one defines (unconditional) variances, covariances, and correlations from (unconditional) expectations:
\[ \mathrm{Var}_t[X] = E_t\left[(X - E_t[X])^2\right] = E_t[X^2] - (E_t[X])^2, \]
\[ \mathrm{Cov}_t[X, Y] = E_t\left[(X - E_t[X])(Y - E_t[Y])\right] = E_t[XY] - E_t[X]\,E_t[Y], \]
\[ \mathrm{Corr}_t[X, Y] = \frac{\mathrm{Cov}_t[X, Y]}{\sqrt{\mathrm{Var}_t[X]\,\mathrm{Var}_t[Y]}}. \]
Again the conditioning on time $t$ information is indicated by a $t$ subscript.
B.2.2 Random variables and stochastic processes
A random variable is a function from $\Omega$ into $\mathbb{R}^K$ for some integer $K$. The random variable $x : \Omega \to \mathbb{R}^K$ associates to each outcome $\omega \in \Omega$ a value $x(\omega) \in \mathbb{R}^K$. Sometimes we will emphasize the dimension and say that the random variable is $K$-dimensional. With sequential resolution of the uncertainty the values of some random variables will be known before all uncertainty is resolved.
In the dice example with sequential information from before, suppose that your friend George will pay you 10 dollars if the dice shows either three, four, or five eyes and nothing in other cases. The payment from George is a random variable $x$. Of course, at time 2 you will know the true outcome, so the payment $x$ will be known at time 2. We say that $x$ is time 2 measurable or $\mathcal{F}_2$-measurable. At time 1 you will also know the payment $x$ because you will be told either that the true outcome is in $\{1, 2\}$, in which case the payment will be 0, or that the true outcome is in $\{3, 4, 5\}$, in which case the payment will be 10, or that the true outcome is 6, in which case the payment will be 0. So the random variable $x$ is also $\mathcal{F}_1$-measurable. Of course, at time 0 you will not know what payment you will get so $x$ is not $\mathcal{F}_0$-measurable. Suppose your friend John promises to pay you 10 dollars if the dice shows 4 or 5 and nothing otherwise. Represent the payment from John by the random variable $y$. Then $y$ is surely $\mathcal{F}_2$-measurable. However, $y$ is not $\mathcal{F}_1$-measurable, because if at time 1 you learn that the true outcome is in $\{3, 4, 5\}$, you still will not know whether you get the 10 dollars or not.
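With a finite state space, measurability with respect to the time $t$ information simply means that the random variable is constant on every block of the time $t$ partition. The following sketch checks this for the payments from George and John; the function name is an assumption made for illustration.

```python
def is_measurable(x, partition):
    """True if x (a dict outcome -> value) is constant on each block."""
    return all(len({x[w] for w in block}) == 1 for block in partition)


part0 = [{1, 2, 3, 4, 5, 6}]
part1 = [{1, 2}, {3, 4, 5}, {6}]
part2 = [{w} for w in range(1, 7)]

# George pays 10 on {3, 4, 5}; John pays 10 on {4, 5}.
george = {w: (10 if w in {3, 4, 5} else 0) for w in range(1, 7)}
john = {w: (10 if w in {4, 5} else 0) for w in range(1, 7)}

for name, payment in (("George", george), ("John", john)):
    print(name, [is_measurable(payment, p) for p in (part0, part1, part2)])
```

John's payment takes both the value 0 and the value 10 on the block $\{3, 4, 5\}$, which is exactly why it is not known at time 1.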
A stochastic process $x$ is a collection of random variables, namely one random variable for each relevant point in time. We write this as $x = (x_t)_{t \in \mathcal{T}}$, where each $x_t$ is a random variable. We still have an underlying filtered probability space $(\Omega, \mathcal{F}, P, \mathbb{F} = (\mathcal{F}_t)_{t \in \mathcal{T}})$ representing uncertainty and information flow. We will only consider processes $x$ that are adapted in the sense that for every $t \in \mathcal{T}$ the random variable $x_t$ is $\mathcal{F}_t$-measurable. This is just to say that the time $t$ value of the process will be known at time $t$. Some models consider the dynamic investment decisions of utility-maximizing investors (or other dynamic decisions under uncertainty). The investment decision is represented by a portfolio process characterizing the portfolio to be held at given points in time depending on the information of the investor at that date. Hence, it is natural to require that the portfolio process is adapted to the information filtration. You cannot base investment decisions on information you have not yet received.
By observing a given stochastic process $x$ adapted to a given filtered probability space $(\Omega, \mathcal{F}, P, \mathbb{F} = (\mathcal{F}_t)_{t \in \mathcal{T}})$, we obtain some information about the true state. In fact, we can define an information filtration $\mathbb{F}^x = (\mathcal{F}^x_t)_{t \in \mathcal{T}}$ generated by $x$. Here, $\mathcal{F}^x_t$ represents the information that can be deduced by knowing the values $x_s$ for $s \leq t$ (for technical reasons, this sigma-algebra is completed by including all sets of $\mathcal{F}$ that have zero $P$-probability). $\mathbb{F}^x$ is the smallest filtration with respect to which $x$ is adapted. By construction, $\mathcal{F}^x_t \subseteq \mathcal{F}_t$.
B.2.3 Other important concepts and terminology
Let $x = (x_t)_{t \in \mathcal{T}}$ denote a stochastic process defined on a filtered probability space $(\Omega, \mathcal{F}, P, \mathbb{F} = (\mathcal{F}_t)_{t \in \mathcal{T}})$. Each possible outcome $\omega \in \Omega$ will fully determine the value of the process at all points in time. We refer to this collection $(x_t(\omega))_{t \in \mathcal{T}}$ of realized values as a (sample) path of the process.
As time goes by, we can observe the evolution in the object which the stochastic process describes. At any given time $t'$, the previous values $(x_t)_{t \leq t'}$ will be known. These values constitute the history of the process up to time $t'$. The future values are (typically) still stochastic.
As time passes and we obtain new information about the true outcome, we will typically revise
our expectations of the future values of the process or, more precisely, revise the probability
distribution we attribute to the value of the process at any future point in time. Suppose we stand
at time $t$ and consider the value of a process $x$ at a future time $t' > t$. The distribution of the
value of $x_{t'}$ is characterized by probabilities $\mathbb{P}(x_{t'} \in A)$ for different sets $A$. If
for all $t, t' \in \mathcal{T}$ with $t < t'$ and all $A$, we have that
\[
\mathbb{P}\left( x_{t'} \in A \mid (x_s)_{s \in [0,t]} \right) = \mathbb{P}\left( x_{t'} \in A \mid x_t \right),
\]
then $x$ is called a Markov process. Broadly speaking, this condition says that, given the present,
the future is independent of the past. The history contains no information about the future value
that cannot be extracted from the current value. Markov processes are often used in financial
models to describe the evolution in prices of financial assets, since the Markov property is consistent
with the so-called weak form of market efficiency, which says that extraordinary returns cannot
be achieved by use of the precise historical evolution in the price of an asset.¹ If extraordinary
returns could be obtained in this manner, all investors would try to profit from it, so that prices
would change immediately to a level where the extraordinary return is non-existent. Therefore, it
is reasonable to model prices by Markov processes. In addition, models based on Markov processes
are often more tractable than models with non-Markov processes.
A stochastic process is said to be a martingale if, at all points in time, the expected change in
the value of the process over any given future period is equal to zero. In other words, the expected
future value of the process is equal to the current value of the process. Because expectations
depend on the probability measure, the concept of a martingale should be seen in connection with
the applied probability measure. More rigorously, a stochastic process $x = (x_t)_{t \ge 0}$ is a
$\mathbb{P}$-martingale if for all $t \in \mathcal{T}$ we have that
\[
\mathrm{E}^{\mathbb{P}}_t[x_{t'}] = x_t, \quad \text{for all } t' \in \mathcal{T} \text{ with } t' > t.
\]
Here, $\mathrm{E}^{\mathbb{P}}_t$ denotes the expected value computed under the $\mathbb{P}$-probabilities given the information
available at time t, that is, given the history of the process up to and including time t. Sometimes
the probability measure will be clear from the context and can be notationally suppressed.
We assume, furthermore, that all the random variables $x_t$ take on values in the same set $\mathcal{S}$,
which we call the value space of the process. More precisely this means that $\mathcal{S}$ is the smallest
set with the property that $\mathbb{P}(\{x_t \in \mathcal{S}\}) = 1$. If $\mathcal{S} \subseteq \mathbb{R}$, we
call the process a one-dimensional, real-valued process. If $\mathcal{S}$ is a subset of $\mathbb{R}^K$ (but
not a subset of $\mathbb{R}^{K-1}$), the process is called a $K$-dimensional, real-valued process, which can
also be thought of as a collection of $K$ one-dimensional, real-valued processes. Note that as long as we
restrict ourselves to equivalent probability measures, the value space will not be affected by changes
in the probability measure.
¹ This does not conflict with the fact that the historical evolution is often used to identify some
characteristic properties of the process, e.g., for estimation of means and variances.
B.2.4 Different types of stochastic processes
A stochastic process for the state of an object at every point in time in a given interval is called
a continuous-time stochastic process. This corresponds to the case where the set $\mathcal{T}$ takes the
form of an interval $[0, T]$ or $[0, \infty)$. In contrast, a stochastic process for the state of an object at
countably many separated points in time is called a discrete-time stochastic process. This
is, for example, the case when $\mathcal{T} = \{0, \Delta t, 2\Delta t, \dots, T \equiv N \Delta t\}$ or
$\mathcal{T} = \{0, \Delta t, 2\Delta t, \dots\}$ for some $\Delta t > 0$. If the process can take on all values
in a given interval (e.g., all real numbers), the process is called a continuous-variable stochastic
process. On the other hand, if the state can take on only countably many different values, the process
is called a discrete-variable stochastic
process.
What type of processes should we use in our financial models? Our choice will be guided both by
realism and tractability. First, let us consider the time dimension. The investors in the financial
markets can trade at more or less any point in time. Due to practical considerations and transaction
costs, no investor will trade continuously. However, it is not possible in advance to pick a fairly
moderate number of points in time where all trades take place. Also, with many investors there will
be some trades at almost any point in time, so that prices and interest rates etc. will also change
almost continuously. Therefore, it seems to be a better approximation of real life to describe
such economic variables by continuous-time stochastic processes than by discrete-time stochastic
processes. Continuous-time stochastic processes are in many respects also easier to handle than
discrete-time stochastic processes.
Next, consider the value dimension. Strictly speaking, most economic variables can only take on
countably many values in practice. Stock prices are multiples of the smallest possible unit (0.01
currency units in many countries), and interest rates are only stated with a given number of decimals.
But since the possible values are very close together, it seems reasonable to use continuous-variable
processes in the modeling of these objects. In addition, the mathematics involved in the analysis
of continuous-variable processes is simpler and more elegant than the mathematics for
discrete-variable processes. Integrals are easier to deal with than sums, derivatives are easier to handle
than differences, etc. Some models were originally formulated using discrete-time, discrete-variable
processes as, for example, the binomial option pricing model. For many years, the most significant
model developments have applied continuous-time, continuous-variable processes, and such
continuous-time term structure models are now standard in the financial industry and in academic
work. In sum, we will use continuous-time, continuous-variable stochastic processes throughout to
describe the evolution in prices and rates. Therefore the remaining sections of this chapter will be
devoted to that type of stochastic processes.
It should be noted that discrete-time and/or discrete-variable processes also have their virtues.
First, many concepts and results are more easily understood or illustrated in a simple framework.
Second, even if we have high-frequency data for many financial variables, we do not have continuous
data. When it comes to estimation of parameters in financial models, continuous-time processes
often have to be approximated by discrete-time processes. Third, although explicit results on asset
prices, optimal investment strategies, etc. are easier to obtain with continuous-time models, not
all relevant questions can be explicitly answered. Some problems are solved numerically by
computer algorithms and also for that purpose it is often necessary to approximate continuous-time,
continuous-variable processes with discrete-time, discrete-variable processes (see Chapter 9).
B.2.5 How to write up stochastic processes
Many financial models describe the movements and comovements of various variables simultaneously.
The standard modeling procedure is to assume that there is some common exogenous shock
that affects all the relevant variables and then model the response of all these variables to that
shock. First, consider a discrete-time framework with time set
$\mathcal{T} = \{0, t_1, t_2, \dots, t_N \equiv T\}$ where $t_n = n \Delta t$. The shock over any period
$[t_n, t_{n+1}]$ is represented by a random variable $\varepsilon_{t_{n+1}}$, which
in general may be multi-dimensional, but let us for now just focus on the one-dimensional case.
The sequence of shocks $\varepsilon_{t_1}, \varepsilon_{t_2}, \dots, \varepsilon_{t_N}$ constitutes the basic
or the underlying uncertainty in the model. Since the shock should represent some unexpected
information, assume that every $\varepsilon_{t_n}$ has mean zero.
A stochastic process $x = (x_t)_{t \in \mathcal{T}}$ representing the dynamics of a price, an interest rate, or
another interesting variable can then be defined by the initial value $x_0$ and the increments
$\Delta x_{t_{n+1}} \equiv x_{t_{n+1}} - x_{t_n}$, $n = 0, \dots, N-1$, which are typically assumed to be of the form
\[
\Delta x_{t_{n+1}} = \mu_{t_n} \Delta t + \sigma_{t_n} \varepsilon_{t_{n+1}}. \tag{B.1}
\]
In general $\mu_{t_n}$ and $\sigma_{t_n}$ can themselves be stochastic, but must be known at time $t_n$, i.e.,
they must be $\mathcal{F}_{t_n}$-measurable random variables. In fact, we can form adapted processes
$\mu = (\mu_t)_{t \in \mathcal{T}}$ and $\sigma = (\sigma_t)_{t \in \mathcal{T}}$. Given the information available
at time $t_n$, the only random variable on the right-hand side of (B.1) is $\varepsilon_{t_{n+1}}$, which is
assumed to have mean zero and some variance $\mathrm{Var}[\varepsilon_{t_{n+1}}]$.
Hence, the mean and variance of $\Delta x_{t_{n+1}}$, conditional on time $t_n$ information, are
\[
\mathrm{E}_{t_n}[\Delta x_{t_{n+1}}] = \mu_{t_n} \Delta t, \qquad
\mathrm{Var}_{t_n}[\Delta x_{t_{n+1}}] = \sigma^2_{t_n} \mathrm{Var}[\varepsilon_{t_{n+1}}].
\]
We can see that $\mu_{t_n}$ has the interpretation of the expected change in $x$ per time period.
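If the shocks $\varepsilon_{t_1}, \dots, \varepsilon_{t_N}$ are the only source of randomness in all the
quantities we care about, then the relevant information filtration is exactly
$\mathbf{F}^\varepsilon = (\mathcal{F}^\varepsilon_t)_{t \in \mathcal{T}}$, i.e., $\mathcal{F}_t = \mathcal{F}^\varepsilon_t$.
In that case $\mu_{t_n}$ and $\sigma_{t_n}$ are required to be measurable with respect to
$\mathcal{F}^\varepsilon_{t_n}$, i.e., they can depend on the realizations of
$\varepsilon_{t_1}, \dots, \varepsilon_{t_n}$. If $\sigma_{t_n}$ is non-zero at all times and for all states, we
can invert (B.1) to get
\[
\varepsilon_{t_{n+1}} = \frac{\Delta x_{t_{n+1}} - \mu_{t_n} \Delta t}{\sigma_{t_n}}.
\]
It is then clear that we learn exactly the same from observing the $x$-process as observing the
exogenous shocks directly, i.e., $\mathbf{F}^x = \mathbf{F}^\varepsilon = \mathbf{F}$. We can fix the set of
probabilizable events $\mathcal{F}$ to $\mathcal{F}^\varepsilon_T = \mathcal{F}^x_T$. The probability measure
$\mathbb{P}$ will be defined by specifying the probability distribution of each of the shocks
$\varepsilon_{t_n}$.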
From the sequence $\varepsilon_{t_1}, \varepsilon_{t_2}, \dots, \varepsilon_{t_N}$ of exogenous shocks we can
define a stochastic process $z = (z_t)_{t \in \mathcal{T}}$ by letting $z_0 = 0$ and
$z_{t_n} = \varepsilon_{t_1} + \dots + \varepsilon_{t_n}$. Consequently,
$\varepsilon_{t_{n+1}} = z_{t_{n+1}} - z_{t_n} \equiv \Delta z_{t_{n+1}}$. Now
the process $z$ captures the basic uncertainty in the model. The information filtration of the model
is then defined by the information that can be extracted from observing the path of $z$. Without
loss of generality we can assume that
$\mathrm{Var}[\Delta z_{t_{n+1}}] = \mathrm{Var}[\varepsilon_{t_{n+1}}] = \Delta t$ for any period $[t_n, t_{n+1}]$.
With the $z$-notation we can rewrite (B.1) as
\[
\Delta x_{t_{n+1}} = \mu_{t_n} \Delta t + \sigma_{t_n} \Delta z_{t_{n+1}} \tag{B.2}
\]
and now $\mathrm{Var}_{t_n}[\Delta x_{t_{n+1}}] = \sigma^2_{t_n} \Delta t$ so that $\sigma^2_{t_n}$ can be
interpreted as the variance of the change in $x$ per time period.
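The discrete-time recursion and its inversion can be illustrated with a short numerical sketch. The code below is not part of the original notes: the drift and volatility functions are hypothetical choices that depend only on the current value and time (so they are known at time $t_n$), and the shocks are assumed to be normally distributed, which the text has not yet imposed at this point. It simulates a path from the increments and then recovers the shocks from the observed path, in line with the statement that observing $x$ reveals the same information as observing the shocks when the volatility is non-zero.

```python
import numpy as np

def simulate_discrete(x0, mu, sigma, dt, n_steps, rng):
    """Simulate x via Delta x = mu(x_n, t_n)*dt + sigma(x_n, t_n)*Delta z."""
    x = np.empty(n_steps + 1)
    x[0] = x0
    # Shocks with mean zero and variance dt (normality is an assumption of this sketch).
    dz = rng.normal(0.0, np.sqrt(dt), size=n_steps)
    for n in range(n_steps):
        t_n = n * dt
        x[n + 1] = x[n] + mu(x[n], t_n) * dt + sigma(x[n], t_n) * dz[n]
    return x, dz

def recover_shocks(x, mu, sigma, dt):
    """Invert the recursion: Delta z = (Delta x - mu*dt) / sigma, valid when sigma != 0."""
    n_steps = len(x) - 1
    m = np.array([mu(x[n], n * dt) for n in range(n_steps)])
    s = np.array([sigma(x[n], n * dt) for n in range(n_steps)])
    return (np.diff(x) - m * dt) / s

rng = np.random.default_rng(0)
mu = lambda x, t: 0.5 * (1.0 - x)          # hypothetical mean-reverting drift
sigma = lambda x, t: 0.2 + 0.1 * abs(x)    # hypothetical, strictly positive volatility
x, dz = simulate_discrete(1.0, mu, sigma, dt=0.01, n_steps=250, rng=rng)
dz_hat = recover_shocks(x, mu, sigma, dt=0.01)
print(np.allclose(dz, dz_hat))             # shocks recovered from the path of x
```

The recovery step only works because the volatility never hits zero; with a zero volatility in some state, the division would be undefined and the path of $x$ would carry less information than the shocks.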
The distribution of $\Delta x_{t_{n+1}}$ will be determined by the distribution assumed for the shocks
$\varepsilon_{t_{n+1}} = \Delta z_{t_{n+1}}$. If the shocks are assumed to be normally distributed, the
increment $\Delta x_{t_{n+1}}$ will be normally distributed conditional on time $t_n$ information, but not
necessarily if we condition on earlier or no information.
We can loosely think of a continuous-time model as the result of taking a discrete-time model and
letting $\Delta t$ go to zero. In that spirit we will often define a continuous-time stochastic process
$x = (x_t)_{t \in \mathcal{T}}$ by writing
\[
dx_t = \mu_t \, dt + \sigma_t \, dz_t \tag{B.3}
\]
which is to be thought of as the limit of (B.2) as $\Delta t \to 0$. Hence, $dx_t$ represents the change in
$x$ over the infinitesimal (i.e., infinitely short) period after time $t$. Similarly for $dz_t$. The
interpretations of $\mu_t$ and $\sigma_t$ are also similar to the discrete-time case. While (B.3) might seem
very intuitive, it does not really make much sense to talk about the change of something over a period of
infinitesimal length. The expression (B.3) really means that the change in the value of $x$ over any time
interval $[t, t'] \subseteq \mathcal{T}$ is given by
\[
x_{t'} - x_t = \int_t^{t'} \mu_u \, du + \int_t^{t'} \sigma_u \, dz_u.
\]
The problem is that the right-hand side of this equation will not make sense before we define the
two integrals. The integral $\int_t^{t'} \mu_u \, du$ is simply defined as the random variable whose value
in any state $\omega \in \Omega$ is given by $\int_t^{t'} \mu_u(\omega) \, du$, which is an ordinary integral of
a real-valued function of time. If $\mu$ is adapted, the value of the integral $\int_t^{t'} \mu_u \, du$ will
become known at time $t'$. The definition of the integral $\int_t^{t'} \sigma_u \, dz_u$ is much more delicate.
We will return to that issue in Section B.6.
In almost all the continuous-time models studied in this book we will assume that the basic
exogenous shocks are normally distributed, i.e., that the change in the shock process z over any
time interval is normally distributed. A process z with this property is the so-called standard
Brownian motion. In the next section we will formally define this process and study some of its
properties. Then in later sections we will build various processes x from that basic process z.
B.3 Brownian motions
All the stochastic processes we shall apply in the financial models in the following chapters
build upon a particular class of processes, the so-called Brownian motions. A (one-dimensional)
stochastic process $z = (z_t)_{t \ge 0}$ is called a standard Brownian motion, if it satisfies the following
conditions:
(i) $z_0 = 0$,
(ii) for all $t, t' \ge 0$ with $t < t'$: $z_{t'} - z_t \sim N(0, t' - t)$ [normally distributed increments],
(iii) for all $0 \le t_0 < t_1 < \dots < t_n$, the random variables
$z_{t_1} - z_{t_0}, \dots, z_{t_n} - z_{t_{n-1}}$ are mutually independent [independent increments],
(iv) $z$ has continuous paths.
Here N(a, b) denotes the normal distribution with mean a and variance b.
If we suppose that a standard Brownian motion $z$ represents the basic exogenous shock to an
economy over a time interval $[0, T]$, then the relevant filtered probability space
$(\Omega, \mathcal{F}, \mathbb{P}, \mathbf{F})$ is implicitly given as follows. The state space $\Omega$ is the
set of all possible paths $(z_t)_{t \in [0,T]}$. The information filtration is the one generated by $z$, i.e.,
$\mathbf{F} = \mathbf{F}^z$. The set of probabilizable events $\mathcal{F}$ is equal to $\mathcal{F}^z_T$. The
probability measure $\mathbb{P}$ is defined by the requirement that
\[
\mathbb{P}\left( \frac{z_{t'} - z_t}{\sqrt{t' - t}} < h \right) = N(h) \equiv
\int_{-\infty}^{h} \frac{1}{\sqrt{2\pi}} e^{-a^2/2} \, da
\]
for all $t < t'$ and all $h \in \mathbb{R}$, where $N(\cdot)$ denotes the cumulative distribution function for
an $N(0, 1)$-distributed random variable.
Note that a standard Brownian motion is a Markov process, since the increment from today to
any future point in time is independent of the history of the process. A standard Brownian motion
is also a martingale, since the expected change in the value of the process is zero.
The name Brownian motion is in honor of the Scottish botanist Robert Brown, who in 1828
observed the apparently random movements of pollen submerged in water. The often used name
Wiener process is due to Norbert Wiener, who in the 1920s was the first to show the existence
of a stochastic process with these properties and who initiated a mathematically rigorous analysis
of the process. As early as in the year 1900, the standard Brownian motion was used in a model
for stock price movements by the French researcher Louis Bachelier, who derived the first option
pricing formula, cf. Bachelier (1900).
The choice of using standard Brownian motions to represent the underlying uncertainty has
an important consequence. All the processes defined by equations of the form (B.3) will then
have continuous paths, i.e., there will be no jumps. Stochastic processes which have paths with
discontinuities also exist. The jumps of such processes are often modeled by Poisson processes
or related processes. It is well-known that large, sudden movements in financial variables occur
from time to time, for example, in connection with stock market crashes. There may be many
explanations of such large movements, for example, a large unexpected change in the productivity
in a particular industry or the economy in general, perhaps due to a technological breakthrough.
Another source of sudden, large movements is changes in the political or economic environment
such as unforeseen interventions by the government or central bank. Stock market crashes are
sometimes explained by the bursting of a bubble. Whether such sudden, large movements can be
explained by a sequence of small continuous movements in the same direction, or whether jumps have
to be included in the models, is an empirical question, which is still open. Large movements over a short
period of time seem to be less frequent in interest rates and bond prices than in stock prices.
The defining characteristics of a standard Brownian motion look very nice, but they have some
drastic consequences. It can be shown that the paths of a standard Brownian motion are nowhere
differentiable, which broadly speaking means that the paths bend at all points in time and are
therefore strictly speaking impossible to illustrate. However, one can get an idea of the paths by
simulating the values of the process at different times. If $\varepsilon_1, \dots, \varepsilon_n$ are
independent draws from a standard $N(0, 1)$ distribution, we can simulate the value of the standard
Brownian motion at time $0 \equiv t_0 < t_1 < t_2 < \dots < t_n$ as follows:
\[
z_{t_i} = z_{t_{i-1}} + \varepsilon_i \sqrt{t_i - t_{i-1}}, \quad i = 1, \dots, n.
\]
With more time points and hence shorter intervals we get a more realistic impression of the paths
of the process. Figure B.1 shows a simulated path for a standard Brownian motion over the interval
[0, 1] based on a partition of the interval into 200 subintervals of equal length. Note that since
a normally distributed random variable can take on infinitely many values, a standard Brownian
motion has infinitely many paths that each has a zero probability of occurring. The figure shows
just one possible path.

Figure B.1: A simulated path of a standard Brownian motion based on 200 subintervals.
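The simulation recipe for a standard Brownian motion translates directly into a few lines of code. The following is a minimal sketch, assuming NumPy is available; the function name `simulate_standard_bm`, the seed, and the number of repetitions are illustrative choices and not taken from the notes. It draws one path on 200 equal subintervals of [0, 1] and then checks, across many paths, that the terminal value has mean close to 0 and variance close to 1, as the definition requires.

```python
import numpy as np

def simulate_standard_bm(times, rng):
    """Return z at the given increasing time points, starting from z = 0 at times[0] = 0."""
    times = np.asarray(times, dtype=float)
    eps = rng.standard_normal(len(times) - 1)       # independent N(0, 1) draws
    increments = eps * np.sqrt(np.diff(times))      # eps_i * sqrt(t_i - t_{i-1})
    return np.concatenate(([0.0], np.cumsum(increments)))

rng = np.random.default_rng(42)
grid = np.linspace(0.0, 1.0, 201)                   # 200 subintervals of [0, 1]
path = simulate_standard_bm(grid, rng)

# Sanity check across many paths: z_1 should have mean near 0 and variance near 1.
z1 = np.array([simulate_standard_bm(grid, rng)[-1] for _ in range(5000)])
print(path.shape, z1.mean(), z1.var())
```

Refining the grid changes how jagged a plotted path looks, but not the distribution of the value at any fixed time point.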
Another property of a standard Brownian motion is that the expected length of the path over any
future time interval (no matter how short) is infinite. In addition, the expected number of times
a standard Brownian motion takes on any given value in any given time interval is also infinite.
Intuitively, these properties are due to the fact that the size of the increment of a standard Brownian
motion over an interval of length $\Delta t$ is proportional to $\sqrt{\Delta t}$, in the sense that the
standard deviation of the increment equals $\sqrt{\Delta t}$. When $\Delta t$ is close to zero,
$\sqrt{\Delta t}$ is significantly larger than $\Delta t$, so the
changes are large relative to the length of the time interval over which the changes are measured.
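The scaling argument can be made slightly more concrete with a short calculation. This is a heuristic sketch rather than a proof, and the symbols $\tau$ (the length of the interval considered) and $n$ (the number of equal subintervals, so that $\Delta t = \tau / n$) are introduced here only for illustration. For an $N(0, \Delta t)$-distributed increment, the expected absolute value is $\sqrt{2 \Delta t / \pi}$, so the expected sum of absolute increments over the $n$ subintervals is

```latex
\[
\sum_{i=1}^{n} \mathrm{E}\bigl[\,|z_{t_i} - z_{t_{i-1}}|\,\bigr]
  = n \sqrt{\frac{2 \Delta t}{\pi}}
  = \tau \sqrt{\frac{2}{\pi \Delta t}}
  \;\longrightarrow\; \infty
  \quad \text{as } \Delta t \to 0,
\]
\[
\sum_{i=1}^{n} \mathrm{E}\bigl[(z_{t_i} - z_{t_{i-1}})^2\bigr] = n \, \Delta t = \tau .
\]
```

The first sum is a lower bound on the expected length of the piecewise linear approximation to the path and grows without bound as the grid is refined, whereas the expected sum of squared increments stays equal to the length of the interval.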
The expected change in an object described by a standard Brownian motion equals zero and
the variance of the change over a given time interval equals the length of the interval. This can
easily be generalized. As before let $z = (z_t)_{t \ge 0}$ be a one-dimensional standard Brownian motion
and define a new stochastic process $x = (x_t)_{t \ge 0}$ by
\[
x_t = x_0 + \mu t + \sigma z_t, \quad t \ge 0,
\]
where $x_0$, $\mu$, and $\sigma$ are constants. The constant $x_0$ is the initial value for the process $x$. It
follows from the properties of the standard Brownian motion that, seen from time 0, the value $x_t$
is normally distributed with mean $x_0 + \mu t$ and variance $\sigma^2 t$, i.e.,
$x_t \sim N(x_0 + \mu t, \sigma^2 t)$.
The change in the value of the process between two arbitrary points in time $t$ and $t'$, where
$t < t'$, is given by
\[
x_{t'} - x_t = \mu (t' - t) + \sigma (z_{t'} - z_t).
\]
The change over an infinitesimally short interval $[t, t + \Delta t]$ with $\Delta t \to 0$ is often written as
\[
dx_t = \mu \, dt + \sigma \, dz_t, \tag{B.4}
\]
where $dz_t$ can loosely be interpreted as a $N(0, dt)$-distributed random variable. As discussed earlier,
this must really be interpreted as a limit of the expression
\[
x_{t + \Delta t} - x_t = \mu \Delta t + \sigma (z_{t + \Delta t} - z_t)
\]
for $\Delta t \to 0$. The process $x$ is called a generalized Brownian motion, or an arithmetic Brownian
motion, or a generalized Wiener process. The parameter $\mu$ reflects the expected change in the
process per unit of time and is called the drift rate or simply the drift of the process. The
parameter $\sigma$ reflects the uncertainty about the future values of the process. More precisely,
$\sigma^2$ reflects the variance of the change in the process per unit of time and is often called the
variance rate of the process. $\sigma$ is a measure for the standard deviation of the change per unit of
time and is referred to as the volatility of the process.
A generalized Brownian motion inherits many of the characteristic properties of a standard
Brownian motion. For example, also a generalized Brownian motion is a Markov process, and the
paths of a generalized Brownian motion are also continuous and nowhere differentiable. However,
a generalized Brownian motion is not a martingale unless $\mu = 0$. The paths can be simulated by
choosing time points $0 \equiv t_0 < t_1 < \dots < t_n$ and iteratively computing
\[
x_{t_i} = x_{t_{i-1}} + \mu (t_i - t_{i-1}) + \varepsilon_i \sigma \sqrt{t_i - t_{i-1}}, \quad i = 1, \dots, n,
\]
where $\varepsilon_1, \dots, \varepsilon_n$ are independent draws from a standard normal distribution.
Figures B.2 and B.3 show simulated paths for different values of the parameters $\mu$ and $\sigma$. The
straight lines represent the deterministic trend of the process, which corresponds to imposing the
condition $\sigma = 0$ and hence ignoring the uncertainty. Both figures are drawn using the same sequence
of random numbers $\varepsilon_i$, so that they are directly comparable. The parameter $\mu$ determines the
trend, and the parameter $\sigma$ determines the size of the fluctuations around the trend.
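The iterative scheme for a generalized Brownian motion, and the idea of reusing one sequence of random numbers for different volatilities, can be sketched as follows. This is an illustration assuming NumPy; the function name `simulate_generalized_bm` and the seed are hypothetical, and the parameter values merely mimic the ones quoted in the figure captions. Because the shocks are shared, the deviation of each path from the trend line scales exactly with $\sigma$.

```python
import numpy as np

def simulate_generalized_bm(x0, mu, sigma, times, eps):
    """x_{t_i} = x_{t_{i-1}} + mu*(t_i - t_{i-1}) + eps_i*sigma*sqrt(t_i - t_{i-1})."""
    dt = np.diff(np.asarray(times, dtype=float))
    steps = mu * dt + sigma * np.asarray(eps) * np.sqrt(dt)
    return x0 + np.concatenate(([0.0], np.cumsum(steps)))

rng = np.random.default_rng(1)
grid = np.linspace(0.0, 1.0, 201)       # 200 subintervals of [0, 1]
eps = rng.standard_normal(200)          # one shared sequence of N(0, 1) draws

trend = simulate_generalized_bm(0.0, 0.2, 0.0, grid, eps)   # sigma = 0: deterministic trend
low = simulate_generalized_bm(0.0, 0.2, 0.5, grid, eps)
high = simulate_generalized_bm(0.0, 0.2, 1.0, grid, eps)

# With shared shocks, deviations from the trend are proportional to sigma.
print(np.allclose(high - trend, 2.0 * (low - trend)))
```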
If the parameters $\mu$ and $\sigma$ are allowed to be time-varying in a deterministic way, the process
$x$ is said to be a time-inhomogeneous generalized Brownian motion. In differential terms such a
process can be defined by
\[
dx_t = \mu(t) \, dt + \sigma(t) \, dz_t. \tag{B.5}
\]
Over a very short interval $[t, t + \Delta t]$ the expected change is approximately $\mu(t) \Delta t$, and the
variance of the change is approximately $\sigma(t)^2 \Delta t$. More precisely, the increment over any
interval $[t, t']$ is given by
\[
x_{t'} - x_t = \int_t^{t'} \mu(u) \, du + \int_t^{t'} \sigma(u) \, dz_u.
\]
The last integral is a so-called stochastic integral, which we will define and describe in a later
section. There we will also state a theorem, which implies that, seen from time $t$, the integral
$\int_t^{t'} \sigma(u) \, dz_u$ is a normally distributed random variable with mean zero and variance
$\int_t^{t'} \sigma(u)^2 \, du$.
B.4 Diffusion processes
For both standard Brownian motions and generalized Brownian motions, the future value is
normally distributed and can therefore take on any real value, i.e., the value space is equal to R.
Many economic variables can only have values in a certain subset of R. For example, prices of
financial assets with limited liability are non-negative. The evolution in such variables cannot be
well represented by the stochastic processes studied so far. In many situations we will instead use
so-called diffusion processes.

Figure B.2: Simulation of a generalized Brownian motion with $\mu = 0.2$ and $\sigma = 0.5$ or $\sigma = 1.0$.
The straight line shows the trend corresponding to $\sigma = 0$. The simulations are based on 200
subintervals.

Figure B.3: Simulation of a generalized Brownian motion with $\mu = 0.6$ and $\sigma = 0.5$ or $\sigma = 1.0$.
The straight line shows the trend corresponding to $\sigma = 0$. The simulations are based on 200
subintervals.
A (one-dimensional) diffusion process is a stochastic process $x = (x_t)_{t \ge 0}$ for which the change
over an infinitesimally short time interval $[t, t + dt]$ can be written as
\[
dx_t = \mu(x_t, t) \, dt + \sigma(x_t, t) \, dz_t, \tag{B.6}
\]
where $z$ is a standard Brownian motion, but where the drift $\mu$ and the volatility $\sigma$ are now
functions of time and the current value of the process.² This expression generalizes (B.4), where $\mu$
and $\sigma$ were assumed to be constants, and (B.5), where $\mu$ and $\sigma$ were functions of time only.
An equation like (B.6), where the stochastic process enters both sides of the equality, is called a
stochastic differential equation. Hence, a diffusion process is a solution to a stochastic differential
equation.
If both functions $\mu$ and $\sigma$ are independent of time, the diffusion is said to be time-homogeneous,
otherwise it is said to be time-inhomogeneous. For a time-homogeneous diffusion
process, the distribution of the future value will only depend on the current value of the process
and how far into the future we are looking, not on the particular point in time we are standing
at. For example, the distribution of $x_{t+\delta}$ given $x_t = x$ will only depend on $x$ and $\delta$, but
not on $t$. This is not the case for a time-inhomogeneous diffusion, where the distribution will also
depend on $t$.
In the expression (B.6) one may think of $dz_t$ as being $N(0, dt)$-distributed, so that the mean and
variance of the change over an infinitesimally short interval $[t, t + dt]$ are given by
\[
\mathrm{E}_t[dx_t] = \mu(x_t, t) \, dt, \qquad \mathrm{Var}_t[dx_t] = \sigma(x_t, t)^2 \, dt,
\]
where $\mathrm{E}_t$ and $\mathrm{Var}_t$ denote the mean and variance, respectively, conditionally on the
available information at time $t$. To be more precise, the change in a diffusion process over any interval
$[t, t']$ is
\[
x_{t'} - x_t = \int_t^{t'} \mu(x_u, u) \, du + \int_t^{t'} \sigma(x_u, u) \, dz_u, \tag{B.7}
\]
where $\int_t^{t'} \sigma(x_u, u) \, dz_u$ is a stochastic integral, which we will discuss in Section B.6.
However, we will continue to use the simple and intuitive differential notation (B.6). The drift rate
$\mu(x_t, t)$ and the variance rate $\sigma(x_t, t)^2$ are really the limits
\[
\mu(x_t, t) = \lim_{\Delta t \to 0} \frac{\mathrm{E}_t[x_{t + \Delta t} - x_t]}{\Delta t},
\]
\[
\sigma(x_t, t)^2 = \lim_{\Delta t \to 0} \frac{\mathrm{Var}_t[x_{t + \Delta t} - x_t]}{\Delta t}.
\]
A diffusion process is a Markov process as can be seen from (B.6), since both the drift and the
volatility only depend on the current value of the process and not on previous values. A diffusion
process is not a martingale, unless the drift $\mu(x_t, t)$ is zero for all $x_t$ and $t$. A diffusion process
will have continuous, but nowhere differentiable paths. The value space for a diffusion process and
the distribution of future values will depend on the functions $\mu$ and $\sigma$. If $\sigma(x, t)$ is
continuous and non-zero, the information generated by $x$ will be identical to the information generated
by $z$, i.e., $\mathbf{F}^x = \mathbf{F}^z$.
² For the process $x$ to be mathematically meaningful, the functions $\mu(x, t)$ and $\sigma(x, t)$ must
satisfy certain conditions. See, e.g., Øksendal (2003, Ch. 7) and Duffie (2001, App. E).
In Section B.8 we will give some important examples of diffusion processes which we shall use
in later chapters to model the evolution of some economic variables.
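A diffusion with state-dependent drift and volatility can be approximated on a time grid by replacing $dt$ with a small step and $dz_t$ with an $N(0, \Delta t)$ draw. The sketch below uses this simple Euler-type discretization, which is an approximation technique and not something the notes prescribe at this point. The coefficient functions and the parameter names `kappa`, `theta`, and `vol` are hypothetical choices made only to have a case where the exact conditional mean is known in closed form.

```python
import numpy as np

def euler_diffusion(x0, mu, sigma, T, n_steps, n_paths, rng):
    """Euler approximation of dx = mu(x, t) dt + sigma(x, t) dz on [0, T]."""
    dt = T / n_steps
    x = np.full(n_paths, float(x0))
    for n in range(n_steps):
        t = n * dt
        dz = rng.normal(0.0, np.sqrt(dt), size=n_paths)   # N(0, dt) shocks
        x = x + mu(x, t) * dt + sigma(x, t) * dz
    return x

# Illustrative coefficients (not from the text): mean-reverting drift, constant volatility.
kappa, theta, vol = 2.0, 0.05, 0.02
rng = np.random.default_rng(7)
x_T = euler_diffusion(
    0.10,
    lambda x, t: kappa * (theta - x),
    lambda x, t: vol * np.ones_like(x),
    T=1.0, n_steps=200, n_paths=20000, rng=rng,
)

# For this particular drift the exact mean is theta + (x0 - theta) * exp(-kappa * T).
exact_mean = theta + (0.10 - theta) * np.exp(-kappa * 1.0)
print(x_T.mean(), exact_mean)
```

The simulated mean differs from the exact one by a small discretization bias plus Monte Carlo noise; both shrink as the number of steps and paths increase.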
B.5 Itô processes
It is possible to define even more general continuous-variable stochastic processes than those
in the class of diffusion processes. A (one-dimensional) stochastic process $x_t$ is said to be an Itô
process, if the local increments are of the form
\[
dx_t = \mu_t \, dt + \sigma_t \, dz_t, \tag{B.8}
\]
where the drift $\mu$ and the volatility $\sigma$ themselves are stochastic processes. A diffusion process is
the special case where the values of the drift $\mu_t$ and the volatility $\sigma_t$ are given by $t$ and
$x_t$. For a general Itô process, the drift and volatility may also depend on past values of the $x$ process.
Or the drift and volatility can depend on another exogenous shock, for example, another standard
Brownian motion than $z$. It follows that Itô processes are generally not Markov processes. They
are generally not martingales either, unless $\mu_t$ is identically equal to zero (and $\sigma_t$ satisfies some
technical conditions). The processes $\mu$ and $\sigma$ must satisfy certain regularity conditions for the $x$
process to be well-defined. We will refer the reader to Øksendal (2003, Ch. 4).
The expression (B.8) gives an intuitive understanding of the evolution of an Itô process, but it
is more precise to state the evolution in the integral form
\[
x_{t'} - x_t = \int_t^{t'} \mu_u \, du + \int_t^{t'} \sigma_u \, dz_u, \tag{B.9}
\]
where the last term again is a stochastic integral.
B.6 Stochastic integrals
B.6.1 Definition and properties of stochastic integrals
In (B.7) and (B.9) and similar expressions a term of the form $\int_t^{t'} \sigma_u \, dz_u$ appears. An
integral of this type is called a stochastic integral or an Itô integral. We will only consider stochastic
integrals where the "integrator" $z$ is a standard Brownian motion, although stochastic integrals involving
more general processes can also be defined. For given $t < t'$, the stochastic integral
$\int_t^{t'} \sigma_u \, dz_u$ is a random variable. Assuming that $\sigma_u$ is known at time $u$, the value of
the integral becomes known at time $t'$. The process $\sigma$ is called the integrand.
The stochastic integral can be defined for very general integrands. The simplest integrands are
those that are piecewise constant. Assume that there are points in time
$t \equiv t_0 < t_1 < \dots < t_n \equiv t'$, so that $\sigma_u$ is constant on each subinterval
$[t_i, t_{i+1})$. The stochastic integral is then defined by
\[
\int_t^{t'} \sigma_u \, dz_u = \sum_{i=0}^{n-1} \sigma_{t_i} \left( z_{t_{i+1}} - z_{t_i} \right).
\]
If the integrand process $\sigma$ is not piecewise constant, a sequence of piecewise constant processes
$\sigma^{(1)}, \sigma^{(2)}, \dots$ exists, which converges to $\sigma$. For each of the processes
$\sigma^{(m)}$, the integral $\int_t^{t'} \sigma^{(m)}_u \, dz_u$ is defined as above. The integral
$\int_t^{t'} \sigma_u \, dz_u$ is then defined as a limit of the integrals of the approximating processes:
\[
\int_t^{t'} \sigma_u \, dz_u = \lim_{m \to \infty} \int_t^{t'} \sigma^{(m)}_u \, dz_u.
\]
We will not discuss exactly how this limit is to be understood and which integrand processes we can
allow. Again the interested reader is referred to Øksendal (2003). The distribution of the integral
$\int_t^{t'} \sigma_u \, dz_u$ will, of course, depend on the integrand process and can generally not be
completely characterized, but the following theorem gives the mean and the variance of the integral:
Theorem B.2. If $\sigma = (\sigma_t)$ satisfies some regularity conditions, the stochastic integral
$\int_t^{t'} \sigma_u \, dz_u$ has the following properties:
\[
\mathrm{E}_t\left[ \int_t^{t'} \sigma_u \, dz_u \right] = 0,
\]
\[
\mathrm{Var}_t\left[ \int_t^{t'} \sigma_u \, dz_u \right] = \int_t^{t'} \mathrm{E}_t[\sigma_u^2] \, du.
\]
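Proof. Suppose that $\sigma$ is piecewise constant and divide the interval $[t, t']$ into subintervals
defined by the time points $t \equiv t_0 < t_1 < \dots < t_n \equiv t'$ so that $\sigma$ is constant on each
subinterval $[t_i, t_{i+1})$ with a value $\sigma_{t_i}$ which is known at time $t_i$. Then
\[
\mathrm{E}_t\left[ \int_t^{t'} \sigma_u \, dz_u \right]
= \sum_{i=0}^{n-1} \mathrm{E}_t\left[ \sigma_{t_i} \left( z_{t_{i+1}} - z_{t_i} \right) \right]
= \sum_{i=0}^{n-1} \mathrm{E}_t\left[ \sigma_{t_i} \mathrm{E}_{t_i}\left[ z_{t_{i+1}} - z_{t_i} \right] \right] = 0,
\]
using the Law of Iterated Expectations. For the variance we have
\[
\mathrm{Var}_t\left[ \int_t^{t'} \sigma_u \, dz_u \right]
= \mathrm{E}_t\left[ \left( \int_t^{t'} \sigma_u \, dz_u \right)^2 \right]
  - \left( \mathrm{E}_t\left[ \int_t^{t'} \sigma_u \, dz_u \right] \right)^2
= \mathrm{E}_t\left[ \left( \int_t^{t'} \sigma_u \, dz_u \right)^2 \right]
\]
and
\[
\begin{aligned}
\mathrm{E}_t\left[ \left( \int_t^{t'} \sigma_u \, dz_u \right)^2 \right]
&= \mathrm{E}_t\left[ \sum_{i=0}^{n-1} \sum_{j=0}^{n-1} \sigma_{t_i} \sigma_{t_j}
   (z_{t_{i+1}} - z_{t_i})(z_{t_{j+1}} - z_{t_j}) \right] \\
&= \sum_{i=0}^{n-1} \mathrm{E}_t\left[ \sigma_{t_i}^2 (z_{t_{i+1}} - z_{t_i})^2 \right]
 = \sum_{i=0}^{n-1} \mathrm{E}_t\left[ \sigma_{t_i}^2 \right] (t_{i+1} - t_i)
 = \int_t^{t'} \mathrm{E}_t[\sigma_u^2] \, du.
\end{aligned}
\]
The cross terms with $i \neq j$ drop out by the Law of Iterated Expectations, since the later of the two
increments has conditional mean zero given the information at the start of its subinterval, and the third
equality uses $\mathrm{E}_{t_i}[(z_{t_{i+1}} - z_{t_i})^2] = t_{i+1} - t_i$.
If $\sigma$ is not piecewise constant, we can approximate it by a piecewise constant process and take
appropriate limits. We skip the details.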
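Theorem B.2 can be checked numerically for a simple stochastic integrand. The sketch below is an illustration under stated assumptions and not part of the notes: it takes the integrand to be the Brownian motion itself, $\sigma_u = z_u$ on $[0, 1]$, for which the theorem predicts mean zero and variance $\int_0^1 \mathrm{E}[z_u^2] \, du = \int_0^1 u \, du = 1/2$. The integral is approximated by the left-endpoint sum from the definition for piecewise constant integrands; the numbers of paths and steps are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(3)
n_paths, n_steps, T = 10000, 200, 1.0
dt = T / n_steps

dz = rng.normal(0.0, np.sqrt(dt), size=(n_paths, n_steps))   # Brownian increments
z = np.cumsum(dz, axis=1)                                    # z at t_1, ..., t_n
# Integrand evaluated at the left endpoint t_i of each subinterval (known at time t_i).
z_left = np.hstack([np.zeros((n_paths, 1)), z[:, :-1]])
integral = np.sum(z_left * dz, axis=1)      # sum of sigma_{t_i} * (z_{t_{i+1}} - z_{t_i})

# Theorem B.2 predicts mean 0 and variance 1/2 (up to discretization and sampling error).
print(integral.mean(), integral.var())
```

Evaluating the integrand at the left endpoint matters: it is what makes each term's integrand known before the corresponding increment is realized, which is the property the proof relies on.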
If the integrand is a deterministic function of time, $\sigma(u)$, the integral will be normally distributed,
so that the following result holds:
Theorem B.3. If $\sigma(u)$ is a deterministic function of time, the random variable
$\int_t^{t'} \sigma(u) \, dz_u$ is normally distributed with mean zero and variance
$\int_t^{t'} \sigma(u)^2 \, du$.
Proof. We present a sketch of the proof. Dividing the interval $[t, t']$ into subintervals defined by
the time points $t \equiv t_0 < t_1 < \dots < t_n \equiv t'$, we can approximate the integral with a sum,
\[
\int_t^{t'} \sigma(u) \, dz_u \approx \sum_{i=0}^{n-1} \sigma(t_i) \left( z_{t_{i+1}} - z_{t_i} \right).
\]
The increment of the Brownian motion over any subinterval is normally distributed with mean
zero and a variance equal to the length of the subinterval. Furthermore, the different terms in
the sum are mutually independent. It is well-known that a sum of independent normally distributed
random variables is itself normally distributed, and that the mean of the sum is equal to the sum of the
means, which in the present case yields zero. Due to the independence of the terms in the sum,
the variance of the sum is also equal to the sum of the variances, i.e.,
\[
\mathrm{Var}_t\left[ \sum_{i=0}^{n-1} \sigma(t_i) \left( z_{t_{i+1}} - z_{t_i} \right) \right]
= \sum_{i=0}^{n-1} \sigma(t_i)^2 \, \mathrm{Var}_t\left[ z_{t_{i+1}} - z_{t_i} \right]
= \sum_{i=0}^{n-1} \sigma(t_i)^2 (t_{i+1} - t_i),
\]
which is an approximation of the integral $\int_t^{t'} \sigma(u)^2 \, du$. The result now follows from an
appropriate limit where the subintervals shrink to zero length.
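As a worked illustration of Theorem B.3, consider an exponentially decaying integrand. The specific function and the constant $\kappa > 0$ are assumptions made for this example only and do not come from the text. With $\sigma(u) = e^{-\kappa (t' - u)}$ the theorem gives a normally distributed integral with mean zero and variance

```latex
\[
\int_t^{t'} \sigma(u)^2 \, du
  = \int_t^{t'} e^{-2\kappa (t' - u)} \, du
  = \frac{1 - e^{-2\kappa (t' - t)}}{2\kappa},
\qquad \text{so that} \qquad
\int_t^{t'} e^{-\kappa (t' - u)} \, dz_u
  \sim N\!\left( 0, \; \frac{1 - e^{-2\kappa (t' - t)}}{2\kappa} \right).
\]
```

For a short horizon the variance is approximately $t' - t$, as for the Brownian increment itself, while for a long horizon it levels off at $1 / (2\kappa)$.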
Note that the process $y = (y_t)_{t \ge 0}$ defined by $y_t = \int_0^t \sigma_u \, dz_u$ is a martingale (under
regularity conditions on $\sigma$), since
\[
\begin{aligned}
\mathrm{E}_t[y_{t'}]
&= \mathrm{E}_t\left[ \int_0^{t'} \sigma_u \, dz_u \right]
 = \mathrm{E}_t\left[ \int_0^{t} \sigma_u \, dz_u + \int_t^{t'} \sigma_u \, dz_u \right] \\
&= \mathrm{E}_t\left[ \int_0^{t} \sigma_u \, dz_u \right] + \mathrm{E}_t\left[ \int_t^{t'} \sigma_u \, dz_u \right]
 = \int_0^{t} \sigma_u \, dz_u = y_t,
\end{aligned}
\]
so that the expected future value is equal to the current value. More generally,
$y_t = y_0 + \int_0^t \sigma_u \, dz_u$, for some constant $y_0$, is a martingale. The converse is also true in
the sense that any martingale can be expressed as a stochastic integral. This is the so-called martingale
representation theorem:
Theorem B.4. Suppose the process $M = (M_t)$ is a martingale with respect to a filtered probability
space implicitly defined by the standard Brownian motion $z = (z_t)_{t \in [0,T]}$ so that, in particular, the
information filtration is $\mathbf{F} = \mathbf{F}^z$. Then a unique adapted process $\theta = (\theta_t)$ exists
such that
\[
M_t = M_0 + \int_0^t \theta_u \, dz_u
\]
for all $t$.
For a mathematically more precise statement of the result and a proof, see Øksendal (2003,
Thm. 4.3.4).
B.6.2 Leibnitz' rule for stochastic integrals
Leibnitz' differentiation rule for ordinary integrals is as follows: If $f(t, s)$ is a deterministic
function, and we define $Y(t) = \int_t^T f(t, s)\,ds$, then
\[
Y'(t) = -f(t, t) + \int_t^T \frac{\partial f}{\partial t}(t, s)\,ds.
\]
If we use the notation $Y'(t) = \frac{dY}{dt}$ and $\frac{\partial f}{\partial t} = \frac{df}{dt}$, we can rewrite this result as
\[
dY = -f(t, t)\,dt + \left(\int_t^T \frac{df}{dt}(t, s)\,ds\right) dt,
\]
and formally cancelling the $dt$-terms, we get
\[
dY = -f(t, t)\,dt + \int_t^T df(t, s)\,ds.
\]
We will now consider a similar result in the case where $f(t, s)$ and, hence, $Y(t)$ are stochastic
processes.
Theorem B.5. For any $s \in [t_0, T]$, let $f^s = (f^s_t)_{t \in [t_0, s]}$ be the Itô process defined by the dynamics
\[
df^s_t = \alpha^s_t\,dt + \beta^s_t\,dz_t,
\]
where $\alpha$ and $\beta$ are sufficiently well-behaved stochastic processes. Then the dynamics of the stochastic
process $Y_t = \int_t^T f^s_t\,ds$ is given by
\[
dY_t = \left[\left(\int_t^T \alpha^s_t\,ds\right) - f^t_t\right] dt + \left(\int_t^T \beta^s_t\,ds\right) dz_t.
\]
Since the result is usually not included in standard textbooks on stochastic calculus, a sketch
of the proof is included. The proof applies the generalized Fubini-rule for stochastic processes,
which was stated and demonstrated in the appendix of Heath, Jarrow, and Morton (1992). The
Fubini-rule says that the order of integration in double integrals can be reversed if the integrand
is a sufficiently well-behaved function; we will assume that this is indeed the case.
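Proof. Given any arbitrary $t_1 \in [t_0, T]$. Since
\[
f^s_{t_1} = f^s_{t_0} + \int_{t_0}^{t_1} \alpha^s_t\,dt + \int_{t_0}^{t_1} \beta^s_t\,dz_t,
\]
we get
\[
\begin{aligned}
Y_{t_1} &= \int_{t_1}^T f^s_{t_0}\,ds + \int_{t_1}^T \left[\int_{t_0}^{t_1} \alpha^s_t\,dt\right] ds + \int_{t_1}^T \left[\int_{t_0}^{t_1} \beta^s_t\,dz_t\right] ds \\
&= \int_{t_1}^T f^s_{t_0}\,ds + \int_{t_0}^{t_1} \left[\int_{t_1}^T \alpha^s_t\,ds\right] dt + \int_{t_0}^{t_1} \left[\int_{t_1}^T \beta^s_t\,ds\right] dz_t \\
&= Y_{t_0} + \int_{t_0}^{t_1} \left[\int_{t}^T \alpha^s_t\,ds\right] dt + \int_{t_0}^{t_1} \left[\int_{t}^T \beta^s_t\,ds\right] dz_t \\
&\qquad - \int_{t_0}^{t_1} f^s_{t_0}\,ds - \int_{t_0}^{t_1} \left[\int_{t}^{t_1} \alpha^s_t\,ds\right] dt - \int_{t_0}^{t_1} \left[\int_{t}^{t_1} \beta^s_t\,ds\right] dz_t \\
&= Y_{t_0} + \int_{t_0}^{t_1} \left[\int_{t}^T \alpha^s_t\,ds\right] dt + \int_{t_0}^{t_1} \left[\int_{t}^T \beta^s_t\,ds\right] dz_t \\
&\qquad - \int_{t_0}^{t_1} f^s_{t_0}\,ds - \int_{t_0}^{t_1} \left[\int_{t_0}^{s} \alpha^s_t\,dt\right] ds - \int_{t_0}^{t_1} \left[\int_{t_0}^{s} \beta^s_t\,dz_t\right] ds \\
&= Y_{t_0} + \int_{t_0}^{t_1} \left[\int_{t}^T \alpha^s_t\,ds\right] dt + \int_{t_0}^{t_1} \left[\int_{t}^T \beta^s_t\,ds\right] dz_t - \int_{t_0}^{t_1} f^s_s\,ds \\
&= Y_{t_0} + \int_{t_0}^{t_1} \left[\left(\int_{t}^T \alpha^s_t\,ds\right) - f^t_t\right] dt + \int_{t_0}^{t_1} \left[\int_{t}^T \beta^s_t\,ds\right] dz_t,
\end{aligned}
\]
where the Fubini-rule was employed in the second and fourth equality. The result now follows from
the final expression.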
B.7 Itô's Lemma
In our dynamic models of the term structure of interest rates, we will take as given a stochastic
process for the dynamics of some basic quantity such as the short-term interest rate. Many
other quantities of interest will be functions of that basic variable. To determine the dynamics of
these other variables, we shall apply Itô's Lemma, which is basically the chain rule for stochastic
processes. We will state the result for a function of a general Itô process, although we will most
frequently apply the result for the special case of a function of a diffusion process.
Theorem B.6. Let $x = (x_t)_{t \ge 0}$ be a real-valued Itô process with dynamics
\[
dx_t = \mu_t\,dt + \sigma_t\,dz_t,
\]
where $\mu$ and $\sigma$ are real-valued processes, and $z$ is a one-dimensional standard Brownian motion. Let
$g(x, t)$ be a real-valued function which is two times continuously differentiable in $x$ and continuously
differentiable in $t$. Then the process $y = (y_t)_{t \ge 0}$ defined by
\[
y_t = g(x_t, t)
\]
is an Itô process with dynamics
\[
dy_t = \left(\frac{\partial g}{\partial t}(x_t, t) + \frac{\partial g}{\partial x}(x_t, t)\,\mu_t + \frac{1}{2}\frac{\partial^2 g}{\partial x^2}(x_t, t)\,\sigma_t^2\right) dt + \frac{\partial g}{\partial x}(x_t, t)\,\sigma_t\,dz_t.
\]
The proof is based on a Taylor expansion of $g(x_t, t)$ combined with appropriate limits, but a
formal proof is beyond the scope of this book. Once again, we refer to Øksendal (2003, Ch. 4)
and similar textbooks. The result can also be written in the following way, which may be easier
to remember:
\[
dy_t = \frac{\partial g}{\partial t}(x_t, t)\,dt + \frac{\partial g}{\partial x}(x_t, t)\,dx_t + \frac{1}{2}\frac{\partial^2 g}{\partial x^2}(x_t, t)\,(dx_t)^2. \tag{B.10}
\]
Here, in the computation of $(dx_t)^2$, one must apply the rules $(dt)^2 = dt \cdot dz_t = 0$ and $(dz_t)^2 = dt$,
so that
\[
(dx_t)^2 = (\mu_t\,dt + \sigma_t\,dz_t)^2 = \mu_t^2\,(dt)^2 + 2\mu_t\sigma_t\,dt \cdot dz_t + \sigma_t^2\,(dz_t)^2 = \sigma_t^2\,dt.
\]
The intuition behind these rules is as follows: When $dt$ is close to zero, $(dt)^2$ is far less than
$dt$ and can therefore be ignored. Since $dz_t \sim N(0, dt)$, we get $\mathrm{E}[dt \cdot dz_t] = dt \cdot \mathrm{E}[dz_t] = 0$ and
$\mathrm{Var}[dt \cdot dz_t] = (dt)^2\,\mathrm{Var}[dz_t] = (dt)^3$, which is also very small compared to $dt$ and is therefore
ignorable. Finally, we have $\mathrm{E}[(dz_t)^2] = \mathrm{Var}[dz_t] + (\mathrm{E}[dz_t])^2 = dt$, and it can be shown that$^3$
$\mathrm{Var}[(dz_t)^2] = 2(dt)^2$. For $dt$ close to zero, the variance is therefore much less than the mean, so
$(dz_t)^2$ can be approximated by its mean $dt$.
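The rule $(dz_t)^2 = dt$ can be made tangible with a small simulation. The sketch below, which is an illustration rather than part of the formal argument, sums squared Brownian increments over $[0, T]$ and shows that the sum settles near $T$ as the grid is refined. The function name, the seed, and the grid sizes are arbitrary choices for the example.

```python
import math
import random


def sum_of_squared_increments(T, n, rng):
    """Sum of (z_{t_{i+1}} - z_{t_i})^2 over n equal subintervals of [0, T].

    Each increment is drawn as N(0, dt), so each squared increment has
    mean dt and variance 2 * dt^2.
    """
    dt = T / n
    return sum(rng.gauss(0.0, math.sqrt(dt)) ** 2 for _ in range(n))


rng = random.Random(0)  # fixed seed, chosen arbitrarily for reproducibility
for n in (10, 1000, 100000):
    print(n, sum_of_squared_increments(1.0, n, rng))
```

Since the sum has mean $T$ and variance $2T^2/n$, the dispersion around $T$ shrinks as $n$ grows, which is the sense in which $(dz_t)^2$ behaves like the deterministic quantity $dt$.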
In standard mathematics, the differential of a function $y = g(x, t)$ where $x$ and $t$ are real variables
is defined as $dy = \frac{\partial g}{\partial t}\,dt + \frac{\partial g}{\partial x}\,dx$. When $x$ is an Itô process, (B.10) shows that we have to add a
second-order term.
In Section B.8, we give examples of the application of Itô's Lemma, which is used extensively in
modern continuous-time finance.
3 This is based on the computation $\mathrm{Var}[(z_{t+\Delta t} - z_t)^2] = \mathrm{E}[(z_{t+\Delta t} - z_t)^4] - \left(\mathrm{E}[(z_{t+\Delta t} - z_t)^2]\right)^2 = 3(\Delta t)^2 - (\Delta t)^2 = 2(\Delta t)^2$ and a passage to the limit.
Figure B.4: Simulation of a geometric Brownian motion with initial value $x_0 = 100$, relative
drift rate $\mu = 0.1$, and a relative volatility of $\sigma = 0.2$ and $\sigma = 0.5$, respectively. The smooth
curve shows the trend corresponding to $\sigma = 0$. The simulations are based on 200 subintervals
of equal length, and the same sequence of random numbers has been used for the two $\sigma$-values.
(Horizontal axis: time from 0 to 1; vertical axis: value of the process, from 70 to 150.)
B.8 Important diffusion processes
In this section we will discuss particular examples of diffusion processes that are frequently
applied in modern financial models, such as those we consider in the following chapters.
B.8.1 Geometric Brownian motions
A stochastic process $x = (x_t)_{t \ge 0}$ is said to be a geometric Brownian motion if it is a solution
to the stochastic differential equation
\[
dx_t = \mu x_t\,dt + \sigma x_t\,dz_t, \tag{B.11}
\]
where $\mu$ and $\sigma$ are constants. The initial value for the process is assumed to be positive, $x_0 > 0$.
A geometric Brownian motion is the particular diffusion process that is obtained from (B.6) by
inserting $\mu(x_t, t) = \mu x_t$ and $\sigma(x_t, t) = \sigma x_t$. Paths can be simulated by computing
\[
x_{t_i} = x_{t_{i-1}} + \mu x_{t_{i-1}}(t_i - t_{i-1}) + \sigma x_{t_{i-1}}\,\varepsilon_i \sqrt{t_i - t_{i-1}}.
\]
Figure B.4 shows a single simulated path for $\sigma = 0.2$ and a path for $\sigma = 0.5$. For both paths we
have used $\mu = 0.1$ and $x_0 = 100$, and the same sequence of random numbers.
The expression (B.11) can be rewritten as
\[
\frac{dx_t}{x_t} = \mu\,dt + \sigma\,dz_t,
\]
which is the relative (percentage) change in the value of the process over the next infinitesimally
short time interval $[t, t + dt]$. If $x_t$ is the price of a traded asset, then $dx_t/x_t$ is the rate of return
on the asset over the next instant. The constant $\mu$ is the expected rate of return per period, while
$\sigma$ is the standard deviation of the rate of return per period. In this context it is often $\mu$ which is
called the drift (rather than $\mu x_t$) and $\sigma$ which is called the volatility (rather than $\sigma x_t$). Strictly
speaking, one must distinguish between the relative drift and volatility ($\mu$ and $\sigma$, respectively) and
the absolute drift and volatility ($\mu x_t$ and $\sigma x_t$, respectively). An asset with a constant expected
rate of return and a constant relative volatility has a price that follows a geometric Brownian
motion. For example, such an assumption is used for the stock price in the famous Black-Scholes-Merton
model for stock option pricing, and a geometric Brownian motion is also used to describe
the evolution in the short-term interest rate in some models of the term structure of interest rates,
cf. Munk (2011).
Next, we will find an explicit expression for $x_t$, i.e., we will find a solution to the stochastic
differential equation (B.11). We can then also determine the distribution of the future value
of the process. We apply Itô's Lemma with the function $g(x, t) = \ln x$ and define the process
$y_t = g(x_t, t) = \ln x_t$. Since
\[
\frac{\partial g}{\partial t}(x_t, t) = 0, \qquad \frac{\partial g}{\partial x}(x_t, t) = \frac{1}{x_t}, \qquad \frac{\partial^2 g}{\partial x^2}(x_t, t) = -\frac{1}{x_t^2},
\]
we get from Theorem B.6 that
\[
dy_t = \left(0 + \frac{1}{x_t}\,\mu x_t - \frac{1}{2}\frac{1}{x_t^2}\,\sigma^2 x_t^2\right) dt + \frac{1}{x_t}\,\sigma x_t\,dz_t = \left(\mu - \frac{1}{2}\sigma^2\right) dt + \sigma\,dz_t.
\]
Hence, the process $y_t = \ln x_t$ is a generalized Brownian motion. In particular, we have
\[
y_{t'} - y_t = \left(\mu - \frac{1}{2}\sigma^2\right)(t' - t) + \sigma (z_{t'} - z_t),
\]
which implies that
\[
\ln x_{t'} = \ln x_t + \left(\mu - \frac{1}{2}\sigma^2\right)(t' - t) + \sigma (z_{t'} - z_t).
\]
Taking exponentials on both sides, we get
\[
x_{t'} = x_t \exp\left\{\left(\mu - \frac{1}{2}\sigma^2\right)(t' - t) + \sigma (z_{t'} - z_t)\right\}. \tag{B.12}
\]
This is true for all $t' > t \ge 0$. In particular,
\[
x_t = x_0 \exp\left\{\left(\mu - \frac{1}{2}\sigma^2\right) t + \sigma z_t\right\}.
\]
Since exponentials are always positive, we see that $x_t$ can only have positive values, so that the
value space of a geometric Brownian motion is $S = (0, \infty)$.
Suppose now that we stand at time $t$ and have observed the current value $x_t$ of a geometric
Brownian motion. Which probability distribution is then appropriate for the uncertain future
value, say at time $t'$? Since $z_{t'} - z_t \sim N(0, t' - t)$, we see from (B.12) that the future value $x_{t'}$
(given $x_t$) will be lognormally distributed. The probability density function for $x_{t'}$ (given $x_t$) is
\[
f(x) = \frac{1}{x\sqrt{2\pi\sigma^2 (t' - t)}} \exp\left\{-\frac{1}{2\sigma^2 (t' - t)}\left(\ln\left(\frac{x}{x_t}\right) - \left(\mu - \frac{1}{2}\sigma^2\right)(t' - t)\right)^2\right\}, \quad x > 0,
\]
and the mean and variance are
\[
\mathrm{E}_t[x_{t'}] = x_t e^{\mu (t' - t)}, \qquad
\mathrm{Var}_t[x_{t'}] = x_t^2 e^{2\mu (t' - t)}\left[e^{\sigma^2 (t' - t)} - 1\right],
\]
cf. Appendix A.
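Because (B.12) gives the future value in closed form, it can be sampled without discretization error. The following sketch draws $x_{t'}$ directly from (B.12) and compares the sample mean with the formula for $\mathrm{E}_t[x_{t'}]$. Function names, the seed, and the sample size are illustrative assumptions.

```python
import math
import random


def gbm_exact_step(x_t, mu, sigma, tau, eps):
    """Draw x_{t'} given x_t from (B.12); tau = t' - t, eps is a standard normal draw."""
    return x_t * math.exp((mu - 0.5 * sigma ** 2) * tau + sigma * math.sqrt(tau) * eps)


def gbm_mean(x_t, mu, tau):
    """Conditional mean x_t * exp(mu * tau)."""
    return x_t * math.exp(mu * tau)


def gbm_variance(x_t, mu, sigma, tau):
    """Conditional variance x_t^2 * exp(2 mu tau) * (exp(sigma^2 tau) - 1)."""
    return x_t ** 2 * math.exp(2.0 * mu * tau) * (math.exp(sigma ** 2 * tau) - 1.0)


rng = random.Random(7)  # arbitrary seed
draws = [gbm_exact_step(100.0, 0.1, 0.2, 1.0, rng.gauss(0.0, 1.0)) for _ in range(100000)]
sample_mean = sum(draws) / len(draws)

print(sample_mean, gbm_mean(100.0, 0.1, 1.0))
print(min(draws) > 0.0)  # every draw is an exponential times x_t, hence positive
```

Unlike the Euler recursion, this sampler cannot produce non-positive values, in line with the value space $S = (0, \infty)$.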
The geometric Brownian motion in (B.11) is time-homogeneous, since neither the drift nor the
volatility are time-dependent. We will also make use of the time-inhomogeneous variant, which is
characterized by the dynamics
\[
dx_t = \mu(t) x_t\,dt + \sigma(t) x_t\,dz_t,
\]
where $\mu$ and $\sigma$ are deterministic functions of time. Following the same procedure as for the
time-homogeneous geometric Brownian motion, one can show that the inhomogeneous variant satisfies
\[
x_{t'} = x_t \exp\left\{\int_t^{t'}\left(\mu(u) - \frac{1}{2}\sigma(u)^2\right) du + \int_t^{t'} \sigma(u)\,dz_u\right\}.
\]
According to Theorem B.3, $\int_t^{t'} \sigma(u)\,dz_u$ is normally distributed with mean zero and variance
$\int_t^{t'} \sigma(u)^2\,du$. Therefore, the future value of the time-inhomogeneous geometric Brownian motion
is also lognormally distributed. In addition, we have
\[
\mathrm{E}_t[x_{t'}] = x_t\,e^{\int_t^{t'} \mu(u)\,du}, \qquad
\mathrm{Var}_t[x_{t'}] = x_t^2\,e^{2\int_t^{t'} \mu(u)\,du}\left(e^{\int_t^{t'} \sigma(u)^2\,du} - 1\right).
\]
B.8.2 Ornstein-Uhlenbeck processes
Another stochastic process we shall apply in models of the term structure of interest rates is the
so-called Ornstein-Uhlenbeck process. A stochastic process $x = (x_t)_{t \ge 0}$ is said to be an
Ornstein-Uhlenbeck process, if its dynamics is of the form
\[
dx_t = [\varphi - \kappa x_t]\,dt + \beta\,dz_t, \tag{B.13}
\]
where $\varphi$, $\beta$, and $\kappa$ are constants with $\kappa > 0$. Alternatively, this can be written as
\[
dx_t = \kappa[\theta - x_t]\,dt + \beta\,dz_t,
\]
where $\theta = \varphi/\kappa$. An Ornstein-Uhlenbeck process exhibits mean reversion in the sense that the drift
is positive when $x_t < \theta$ and negative when $x_t > \theta$. The process is therefore always pulled towards
a long-term level of $\theta$. However, the random shock to the process through the term $\beta\,dz_t$ may
cause the process to move further away from $\theta$. The parameter $\kappa$ controls the size of the expected
adjustment towards the long-term level and is often referred to as the mean reversion parameter
or the speed of adjustment.
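To determine the distribution of the future value of an Ornstein-Uhlenbeck process we proceed
as for the geometric Brownian motion. We will define a new process $y_t$ as some function of $x_t$
such that $y = (y_t)_{t \ge 0}$ is a generalized Brownian motion. It turns out that this is satisfied for
$y_t = g(x_t, t)$, where $g(x, t) = e^{\kappa t} x$. From Itô's Lemma we get
\[
\begin{aligned}
dy_t &= \left[\frac{\partial g}{\partial t}(x_t, t) + \frac{\partial g}{\partial x}(x_t, t)\,\kappa(\theta - x_t) + \frac{1}{2}\frac{\partial^2 g}{\partial x^2}(x_t, t)\,\beta^2\right] dt + \frac{\partial g}{\partial x}(x_t, t)\,\beta\,dz_t \\
&= \left[\kappa e^{\kappa t} x_t + \kappa e^{\kappa t}(\theta - x_t)\right] dt + e^{\kappa t}\beta\,dz_t \\
&= \kappa\theta e^{\kappa t}\,dt + \beta e^{\kappa t}\,dz_t.
\end{aligned}
\]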
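This implies that
\[
y_{t'} = y_t + \int_t^{t'} \kappa\theta e^{\kappa u}\,du + \int_t^{t'} \beta e^{\kappa u}\,dz_u.
\]
After substitution of the definition of $y_t$ and $y_{t'}$ and a multiplication by $e^{-\kappa t'}$, we arrive at the
expression
\[
\begin{aligned}
x_{t'} &= e^{-\kappa(t' - t)} x_t + \int_t^{t'} \kappa\theta e^{-\kappa(t' - u)}\,du + \int_t^{t'} \beta e^{-\kappa(t' - u)}\,dz_u \\
&= e^{-\kappa(t' - t)} x_t + \theta\left(1 - e^{-\kappa(t' - t)}\right) + \int_t^{t'} \beta e^{-\kappa(t' - u)}\,dz_u.
\end{aligned}
\]
This holds for all $t' > t \ge 0$. In particular, we get that the solution to the stochastic differential
equation (B.13) can be written as
\[
x_t = e^{-\kappa t} x_0 + \theta\left(1 - e^{-\kappa t}\right) + \int_0^t \beta e^{-\kappa(t - u)}\,dz_u.
\]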
According to Theorem B.3, the integral $\int_t^{t'} \beta e^{-\kappa(t' - u)}\,dz_u$ is normally distributed with mean
zero and variance $\int_t^{t'} \beta^2 e^{-2\kappa(t' - u)}\,du = \frac{\beta^2}{2\kappa}\left(1 - e^{-2\kappa(t' - t)}\right)$. We can thus conclude that $x_{t'}$ (given
$x_t$) is normally distributed, with mean and variance given by
\[
\mathrm{E}_t[x_{t'}] = e^{-\kappa(t' - t)} x_t + \theta\left(1 - e^{-\kappa(t' - t)}\right), \tag{B.14}
\]
\[
\mathrm{Var}_t[x_{t'}] = \frac{\beta^2}{2\kappa}\left(1 - e^{-2\kappa(t' - t)}\right). \tag{B.15}
\]
The value space of an Ornstein-Uhlenbeck process is $\mathbb{R}$. For $t' \to \infty$, the mean approaches $\theta$,
and the variance approaches $\beta^2/(2\kappa)$. For $\kappa \to \infty$, the mean approaches $\theta$, and the variance
approaches 0. For $\kappa \to 0$, the mean approaches the current value $x_t$, and the variance approaches
$\beta^2 (t' - t)$. The distance between the level of the process and the long-term level is expected to be
halved over a period of $t' - t = (\ln 2)/\kappa$, since $\mathrm{E}_t[x_{t'}] - \theta = \frac{1}{2}(x_t - \theta)$ implies that $e^{-\kappa(t' - t)} = \frac{1}{2}$
and, hence, $t' - t = (\ln 2)/\kappa$.
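The moment formulas (B.14) and (B.15) and the half-life property are easy to check numerically. The sketch below uses parameter values of the same order as those quoted for Figure B.5; the function names and the particular starting value $x_t = 0.04$ are choices made for the illustration.

```python
import math


def ou_mean(x_t, kappa, theta, tau):
    """Conditional mean (B.14) over a horizon tau = t' - t."""
    return math.exp(-kappa * tau) * x_t + theta * (1.0 - math.exp(-kappa * tau))


def ou_variance(kappa, beta, tau):
    """Conditional variance (B.15) over a horizon tau = t' - t."""
    return beta ** 2 / (2.0 * kappa) * (1.0 - math.exp(-2.0 * kappa * tau))


def half_life(kappa):
    """Horizon over which the expected distance to theta is halved."""
    return math.log(2.0) / kappa


x_t, kappa, theta, beta = 0.04, math.log(2.0), 0.08, 0.03
tau = half_life(kappa)  # equals one period when kappa = ln 2

# After one half-life the expected value lies midway between x_t and theta.
print(ou_mean(x_t, kappa, theta, tau), 0.5 * (x_t + theta))
# For a long horizon the variance approaches beta^2 / (2 kappa).
print(ou_variance(kappa, beta, 1000.0), beta ** 2 / (2.0 * kappa))
```

With $\kappa = \ln 2$ the half-life is exactly one period, which is presumably why this value is used as the base case in the figures.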
The effect of the different parameters can also be evaluated by looking at the paths of the process,
which can be simulated by
\[
x_{t_i} = x_{t_{i-1}} + \kappa[\theta - x_{t_{i-1}}](t_i - t_{i-1}) + \beta\,\varepsilon_i \sqrt{t_i - t_{i-1}}.
\]
Figure B.5 shows a single path for different combinations of $x_0$, $\kappa$, $\theta$, and $\beta$. In each sub-figure one
of the parameters is varied and the others fixed. The base values of the parameters are $x_0 = 0.08$,
$\theta = 0.08$, $\kappa = \ln 2 \approx 0.69$, and $\beta = 0.03$. All paths are computed using the same sequence
of random numbers $\varepsilon_1, \dots, \varepsilon_n$ and are therefore directly comparable. None of the paths shown
involve negative values of the process, but other paths will (see Figure B.6). As a matter of fact, it
can be shown that an Ornstein-Uhlenbeck process with probability one will sooner or later become
negative.
We will also apply the time-inhomogeneous Ornstein-Uhlenbeck process, where the constants $\varphi$
and $\beta$ are replaced by deterministic functions:
\[
dx_t = [\varphi(t) - \kappa x_t]\,dt + \beta(t)\,dz_t = \kappa[\theta(t) - x_t]\,dt + \beta(t)\,dz_t.
\]
Figure B.5: Simulated paths for an Ornstein-Uhlenbeck process over the time interval from 0 to 2. The basic
parameter values are $x_0 = \theta = 0.08$, $\kappa = \ln 2 \approx 0.69$, and $\beta = 0.03$.
(a) Different initial values $x_0$: 0.06, 0.08, 0.12.
(b) Different $\kappa$-values: 0.17, 0.69, 2.77; $x_0 = 0.04$.
(c) Different $\theta$-values: 0.04, 0.08, 0.12.
(d) Different $\beta$-values: 0.01, 0.03, 0.05.
Following the same line of analysis as above, it can be shown that the future value $x_{t'}$ given $x_t$ is
normally distributed with mean and variance given by
\[
\mathrm{E}_t[x_{t'}] = e^{-\kappa(t' - t)} x_t + \int_t^{t'} \varphi(u)\,e^{-\kappa(t' - u)}\,du, \qquad
\mathrm{Var}_t[x_{t'}] = \int_t^{t'} \beta(u)^2 e^{-2\kappa(t' - u)}\,du.
\]
One can also allow $\kappa$ to depend on time, but we will not make use of that extension.
One of the earliest (but still frequently applied) dynamic models of the term structure of interest
rates, the Vasicek model, is based on the assumption that the short-term interest rate follows an
Ornstein-Uhlenbeck process; see Section 10.2. In an extension of that model, the short-term interest
rate is assumed to follow a time-inhomogeneous Ornstein-Uhlenbeck process.
B.8.3 Square-root processes
Another stochastic process frequently applied in term structure models is the so-called
square-root process. A one-dimensional stochastic process $x = (x_t)_{t \ge 0}$ is said to be a square-root
process, if its dynamics is of the form
\[
dx_t = [\varphi - \kappa x_t]\,dt + \beta\sqrt{x_t}\,dz_t = \kappa[\theta - x_t]\,dt + \beta\sqrt{x_t}\,dz_t, \tag{B.16}
\]
where $\varphi = \kappa\theta$. Here, $\varphi$, $\theta$, $\beta$, and $\kappa$ are positive constants. We assume that the initial value of the
process $x_0$ is positive, so that the square root function can be applied. The only difference to the
dynamics of an Ornstein-Uhlenbeck process is the term $\sqrt{x_t}$ in the volatility. The variance rate
is now $\beta^2 x_t$, which is proportional to the level of the process. A square-root process also exhibits
mean reversion.
A square-root process can only take on non-negative values. To see this, note that if the value
should become zero, then the drift is positive and the volatility zero, and therefore the value of the
process will with certainty become positive immediately after (zero is a so-called reflecting barrier).
It can be shown that if $2\varphi \ge \beta^2$, the positive drift at low values of the process is so big relative
to the volatility that the process cannot even reach zero, but stays strictly positive.$^4$ Hence, the
value space for a square-root process is either $S = [0, \infty)$ or $S = (0, \infty)$.
Paths for the square-root process can be simulated by successively calculating
\[
x_{t_i} = x_{t_{i-1}} + \kappa[\theta - x_{t_{i-1}}](t_i - t_{i-1}) + \beta\sqrt{x_{t_{i-1}}}\,\varepsilon_i \sqrt{t_i - t_{i-1}}.
\]
Variations in the different parameters will have similar effects as for the Ornstein-Uhlenbeck process,
which is illustrated in Figure B.5. Instead, let us compare the paths for a square-root process
and an Ornstein-Uhlenbeck process using the same drift parameters $\kappa$ and $\theta$, but where the
$\beta$-parameter for the Ornstein-Uhlenbeck process is set equal to the $\beta$-parameter for the square-root
process multiplied by the square root of $\theta$, which ensures that the processes will have the same
variance rate at the long-term level. Figure B.6 compares two pairs of paths of the processes. In
part (a), the initial value is set equal to the long-term level, and the two paths continue to be
very close to each other. In part (b), the initial value is lower than the long-term level, so that
the variance rates of the two processes differ from the beginning. For the given sequence of random
numbers, the Ornstein-Uhlenbeck process becomes negative, while the square-root process of
course stays positive. In this case there is a clear difference between the paths of the two processes.
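The comparison described above can be sketched as follows. Both recursions are driven by the same shocks, and the volatility parameters are matched at the long-term level as in Figure B.6. One point is an assumption of this sketch rather than something stated in the notes: a discretized square-root path can dip slightly below zero even though the continuous-time process cannot, so the code takes the square root of `max(x, 0)`. The seed and function names are likewise arbitrary.

```python
import math
import random


def simulate_ou(x0, kappa, theta, beta, T, n, eps):
    """Euler recursion for dx = kappa*(theta - x) dt + beta dz."""
    dt = T / n
    path = [x0]
    for i in range(n):
        x = path[-1]
        path.append(x + kappa * (theta - x) * dt + beta * eps[i] * math.sqrt(dt))
    return path


def simulate_square_root(x0, kappa, theta, beta, T, n, eps):
    """Euler recursion for dx = kappa*(theta - x) dt + beta*sqrt(x) dz."""
    dt = T / n
    path = [x0]
    for i in range(n):
        x = path[-1]
        # Assumption: guard the square root against small negative values
        # produced by the discretization.
        vol = beta * math.sqrt(max(x, 0.0))
        path.append(x + kappa * (theta - x) * dt + vol * eps[i] * math.sqrt(dt))
    return path


kappa, theta = math.log(2.0), 0.08
beta_ou = 0.03
beta_sr = beta_ou / math.sqrt(theta)  # equal variance rates when x = theta

n = 200
rng = random.Random(3)  # arbitrary seed
eps = [rng.gauss(0.0, 1.0) for _ in range(n)]

ou_path = simulate_ou(0.06, kappa, theta, beta_ou, 2.0, n, eps)
sr_path = simulate_square_root(0.06, kappa, theta, beta_sr, 2.0, n, eps)
print(beta_sr, min(ou_path), min(sr_path))
```

Starting both paths at $x_0 = \theta$ instead of 0.06 keeps them close together, since the two variance rates then coincide initially.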
Since a square-root process cannot become negative, the future values of the process cannot be
normally distributed. In order to find the actual distribution, let us try the same trick as for the
Ornstein-Uhlenbeck process, that is, we look at $y_t = e^{\kappa t} x_t$. By Itô's Lemma,
\[
dy_t = \kappa e^{\kappa t} x_t\,dt + e^{\kappa t}\kappa(\theta - x_t)\,dt + e^{\kappa t}\beta\sqrt{x_t}\,dz_t = \kappa\theta e^{\kappa t}\,dt + \beta e^{\kappa t}\sqrt{x_t}\,dz_t,
\]
so that
\[
y_{t'} = y_t + \kappa\theta\int_t^{t'} e^{\kappa u}\,du + \beta\int_t^{t'} e^{\kappa u}\sqrt{x_u}\,dz_u.
\]
Computing the ordinary integral and substituting the definition of $y$, we get
\[
x_{t'} = x_t e^{-\kappa(t' - t)} + \theta\left(1 - e^{-\kappa(t' - t)}\right) + \beta\int_t^{t'} e^{-\kappa(t' - u)}\sqrt{x_u}\,dz_u.
\]
4 To show this, the results of Karlin and Taylor (1981, p. 226) can be applied.
Figure B.6: A comparison of simulated paths for an Ornstein-Uhlenbeck process and a square-root
process over the time interval from 0 to 2. For both processes, the parameters $\theta = 0.08$ and $\kappa = \ln 2 \approx 0.69$ are used, while
$\beta$ is set to 0.03 for the Ornstein-Uhlenbeck process and to $0.03/\sqrt{0.08} \approx 0.1061$ for the square-root
process.
(a) Initial value $x_0 = 0.08$, same random numbers as in Figure B.5.
(b) Initial value $x_0 = 0.06$, different random numbers.
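Since $x$ enters the stochastic integral we cannot immediately determine the distribution of $x_{t'}$ given
$x_t$ from this equation. We can, however, use it to obtain the mean and variance of $x_{t'}$. Due to the
fact that the stochastic integral has mean zero, cf. Theorem B.2, we easily get
\[
\mathrm{E}_t[x_{t'}] = e^{-\kappa(t' - t)} x_t + \theta\left(1 - e^{-\kappa(t' - t)}\right) = \theta + (x_t - \theta)\,e^{-\kappa(t' - t)}.
\]
To compute the variance we apply the second equation of Theorem B.2:
\[
\begin{aligned}
\mathrm{Var}_t[x_{t'}] &= \mathrm{Var}_t\left[\beta\int_t^{t'} e^{-\kappa(t' - u)}\sqrt{x_u}\,dz_u\right]
= \beta^2\int_t^{t'} e^{-2\kappa(t' - u)}\,\mathrm{E}_t[x_u]\,du \\
&= \beta^2\int_t^{t'} e^{-2\kappa(t' - u)}\left(\theta + (x_t - \theta)\,e^{-\kappa(u - t)}\right) du \\
&= \beta^2\theta\int_t^{t'} e^{-2\kappa(t' - u)}\,du + \beta^2 (x_t - \theta)\,e^{-2\kappa t' + \kappa t}\int_t^{t'} e^{\kappa u}\,du \\
&= \frac{\beta^2\theta}{2\kappa}\left(1 - e^{-2\kappa(t' - t)}\right) + \frac{\beta^2}{\kappa}(x_t - \theta)\left(e^{-\kappa(t' - t)} - e^{-2\kappa(t' - t)}\right) \\
&= \frac{\beta^2 x_t}{\kappa}\left(e^{-\kappa(t' - t)} - e^{-2\kappa(t' - t)}\right) + \frac{\beta^2\theta}{2\kappa}\left(1 - e^{-\kappa(t' - t)}\right)^2.
\end{aligned}
\]
Note that the mean is identical to the mean for an Ornstein-Uhlenbeck process, whereas the
variance is more complicated for the square-root process. For $t' \to \infty$, the mean approaches $\theta$,
and the variance approaches $\theta\beta^2/(2\kappa)$. For $\kappa \to \infty$, the mean approaches $\theta$, and the variance
approaches 0. For $\kappa \to 0$, the mean approaches the current value $x_t$, and the variance approaches
$\beta^2 x_t (t' - t)$.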
It can be shown that, conditional on the value $x_t$, the value $x_{t'}$ with $t' > t$ is given by the non-central
$\chi^2$-distribution. A non-central $\chi^2$-distribution is characterized by a number $a$ of degrees
of freedom and a non-centrality parameter $b$ and is denoted by $\chi^2(a, b)$. More precisely, the
distribution of $x_{t'}$ given $x_t$ is identical to the distribution of the random variable $Y/c(t' - t)$ where
$c$ is the deterministic function
\[
c(\tau) = \frac{4\kappa}{\beta^2\left(1 - e^{-\kappa\tau}\right)}
\]
and $Y$ is a $\chi^2(a, b(t' - t))$-distributed random variable with
\[
a = \frac{4\kappa\theta}{\beta^2}, \qquad b(\tau) = x_t\,c(\tau)\,e^{-\kappa\tau}.
\]
The density function for a $\chi^2(a, b)$-distributed random variable is
\[
f_{\chi^2(a,b)}(y) = \sum_{i=0}^{\infty} \frac{e^{-b/2}(b/2)^i}{i!}\,f_{\chi^2(a+2i)}(y)
= \sum_{i=0}^{\infty} \frac{e^{-b/2}(b/2)^i}{i!}\,\frac{(1/2)^{i+a/2}}{\Gamma(i + a/2)}\,y^{i-1+a/2}\,e^{-y/2},
\]
where $f_{\chi^2(a+2i)}$ is the density function for a central $\chi^2$-distribution with $a + 2i$ degrees of freedom.
Inserting this density in the first sum will give the second sum. Here $\Gamma$ denotes the so-called
gamma-function defined as $\Gamma(m) = \int_0^{\infty} x^{m-1} e^{-x}\,dx$. The probability density function for the
value of $x_{t'}$ conditional on $x_t$ is then
\[
f(x) = c(t' - t)\,f_{\chi^2(a, b(t' - t))}\left(c(t' - t)\,x\right).
\]
The mean and variance of a $\chi^2(a, b)$-distributed random variable are $a + b$ and $2(a + 2b)$, respectively.
This opens another way of deriving the mean and variance of $x_{t'}$ given $x_t$. We leave it for the
reader to verify that this procedure will yield the same results as given above.
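The verification left to the reader can also be done numerically. The sketch below computes the mean and variance of $Y/c$ from the $\chi^2(a, b)$ moments, using $\mathrm{E}[Y/c] = (a + b)/c$ and $\mathrm{Var}[Y/c] = 2(a + 2b)/c^2$, and compares them with the moment formulas derived from the stochastic integral. The parameter values are borrowed from Figure B.6; the horizon of 1.5 and the function names are arbitrary choices for the example.

```python
import math


def sr_mean(x_t, kappa, theta, tau):
    """Mean of the square-root process from the integral representation."""
    return theta + (x_t - theta) * math.exp(-kappa * tau)


def sr_variance(x_t, kappa, theta, beta, tau):
    """Variance of the square-root process from the integral representation."""
    e1 = math.exp(-kappa * tau)
    e2 = math.exp(-2.0 * kappa * tau)
    return (beta ** 2 * x_t / kappa * (e1 - e2)
            + beta ** 2 * theta / (2.0 * kappa) * (1.0 - e1) ** 2)


def chi2_moments(x_t, kappa, theta, beta, tau):
    """Mean and variance of Y / c(tau) with Y ~ non-central chi-square(a, b(tau))."""
    c = 4.0 * kappa / (beta ** 2 * (1.0 - math.exp(-kappa * tau)))
    a = 4.0 * kappa * theta / beta ** 2
    b = x_t * c * math.exp(-kappa * tau)
    return (a + b) / c, 2.0 * (a + 2.0 * b) / c ** 2


x_t, kappa, theta = 0.06, math.log(2.0), 0.08
beta = 0.03 / math.sqrt(0.08)
tau = 1.5

print(sr_mean(x_t, kappa, theta, tau), sr_variance(x_t, kappa, theta, beta, tau))
print(chi2_moments(x_t, kappa, theta, beta, tau))
```

The two printed pairs should agree up to floating-point rounding.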
A frequently applied dynamic model of the term structure of interest rates is based on the
assumption that the short-term interest rate follows a square-root process, cf. Section 10.3. Since
interest rates are positive and empirically seem to have a variance rate which is positively correlated
to the interest rate level, the square-root process gives a more realistic description of interest rates
than the Ornstein-Uhlenbeck process. On the other hand, models based on square-root processes
are more complicated to analyze than models based on Ornstein-Uhlenbeck processes.
B.9 Multi-dimensional processes
So far we have only considered one-dimensional processes, i.e., processes with a value space which
is $\mathbb{R}$ or a subset of $\mathbb{R}$. In many cases we want to keep track of several processes, e.g., price processes
for different assets, and we will often be interested in covariances and correlations between different
processes.
In a continuous-time model where the exogenous shock process $z = (z_t)_{t \in [0,T]}$ is one-dimensional,
the instantaneous increments of any two processes will be perfectly correlated. For example, if we
consider the two Itô processes $x$ and $y$ defined by
\[
dx_t = \mu_{xt}\,dt + \sigma_{xt}\,dz_t, \qquad dy_t = \mu_{yt}\,dt + \sigma_{yt}\,dz_t,
\]
then $\mathrm{Cov}_t[dx_t, dy_t] = \sigma_{xt}\sigma_{yt}\,dt$ so that the instantaneous correlation becomes
\[
\mathrm{Corr}_t[dx_t, dy_t] = \frac{\mathrm{Cov}_t[dx_t, dy_t]}{\sqrt{\mathrm{Var}_t[dx_t]\,\mathrm{Var}_t[dy_t]}} = \frac{\sigma_{xt}\sigma_{yt}\,dt}{\sqrt{\sigma_{xt}^2\,dt\;\sigma_{yt}^2\,dt}} = 1,
\]
where the last equality presumes that $\sigma_{xt}$ and $\sigma_{yt}$ have the same sign; with opposite signs the
correlation is $-1$, which is still a perfect correlation.
Increments over any non-infinitesimal time interval are generally not perfectly correlated, i.e., for
any $h > 0$ a correlation like $\mathrm{Corr}_t[x_{t+h} - x_t, y_{t+h} - y_t]$ is typically different from one but close to
one for small $h$.
To obtain non-perfectly correlated changes over the shortest time period considered by the
model we need an exogenous shock of a dimension higher than one, i.e., a shock vector. One can
without loss of generality assume that the different components of this shock vector are mutually
independent and generate non-perfect correlations between the relevant processes by varying the
sensitivities of those processes towards the different exogenous shocks. We will first consider the
case of two processes and later generalize.
B.9.1 Two-dimensional processes
In the example above, we can avoid the perfect correlation by introducing a second standard
Brownian motion so that
\[
dx_t = \mu_{xt}\,dt + \sigma_{x1t}\,dz_{1t} + \sigma_{x2t}\,dz_{2t}, \qquad dy_t = \mu_{yt}\,dt + \sigma_{y1t}\,dz_{1t} + \sigma_{y2t}\,dz_{2t},
\]
where $z_1 = (z_{1t})$ and $z_2 = (z_{2t})$ are independent standard Brownian motions. This generates an
instantaneous covariance of $\mathrm{Cov}_t[dx_t, dy_t] = (\sigma_{x1t}\sigma_{y1t} + \sigma_{x2t}\sigma_{y2t})\,dt$, instantaneous variances of
$\mathrm{Var}_t[dx_t] = (\sigma_{x1t}^2 + \sigma_{x2t}^2)\,dt$ and $\mathrm{Var}_t[dy_t] = (\sigma_{y1t}^2 + \sigma_{y2t}^2)\,dt$, and thus an instantaneous correlation of
\[
\mathrm{Corr}_t[dx_t, dy_t] = \frac{\sigma_{x1t}\sigma_{y1t} + \sigma_{x2t}\sigma_{y2t}}{\sqrt{\left(\sigma_{x1t}^2 + \sigma_{x2t}^2\right)\left(\sigma_{y1t}^2 + \sigma_{y2t}^2\right)}},
\]
which again can be anywhere in the interval $[-1, +1]$.
The shock coefficients $\sigma_{x1t}$, $\sigma_{x2t}$, $\sigma_{y1t}$, and $\sigma_{y2t}$ determine the two instantaneous variances
and the instantaneous correlation. But many combinations of the four shock coefficients will give
rise to the same variances and correlation. We have one degree of freedom in fixing the shock
coefficients. For example, we can put $\sigma_{x2t} \equiv 0$, which has the nice implication that it will simplify
various expressions and interpretations. If we thus write the dynamics of $x$ and $y$ as
\[
dx_t = \mu_{xt}\,dt + \sigma_{xt}\,dz_{1t}, \qquad dy_t = \mu_{yt}\,dt + \sigma_{yt}\left[\rho_t\,dz_{1t} + \sqrt{1 - \rho_t^2}\,dz_{2t}\right],
\]
$\sigma_{xt}^2$ and $\sigma_{yt}^2$ are the variance rates of $x_t$ and $y_t$, respectively, while the covariance is
$\mathrm{Cov}_t[dx_t, dy_t] = \rho_t\sigma_{xt}\sigma_{yt}\,dt$. If $\sigma_{xt}$ and $\sigma_{yt}$ are both positive, then $\rho_t$ will be the instantaneous correlation between
the two processes $x$ and $y$.
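A short sketch may help to see how the $\rho$-parametrization generates a desired correlation from two independent shocks. The first function evaluates the general correlation formula in terms of the four shock coefficients; the second draws a pair of increments under the simplified dynamics with the drift terms left out. The function names, the seed, and the numerical values ($\sigma_x = 0.2$, $\sigma_y = 0.3$, $\rho = -0.4$) are assumptions for the illustration.

```python
import math
import random


def instantaneous_correlation(sx1, sx2, sy1, sy2):
    """Correlation implied by the shock coefficients of dx and dy."""
    cov = sx1 * sy1 + sx2 * sy2
    return cov / math.sqrt((sx1 ** 2 + sx2 ** 2) * (sy1 ** 2 + sy2 ** 2))


def correlated_increments(sigma_x, sigma_y, rho, dt, rng):
    """Shock parts of (dx, dy) built from two independent Brownian increments."""
    dz1 = rng.gauss(0.0, math.sqrt(dt))
    dz2 = rng.gauss(0.0, math.sqrt(dt))
    dx = sigma_x * dz1
    dy = sigma_y * (rho * dz1 + math.sqrt(1.0 - rho ** 2) * dz2)
    return dx, dy


sigma_x, sigma_y, rho = 0.2, 0.3, -0.4

# With sigma_x2 = 0 the general formula collapses to rho.
print(instantaneous_correlation(sigma_x, 0.0,
                                sigma_y * rho, sigma_y * math.sqrt(1.0 - rho ** 2)))

rng = random.Random(11)  # arbitrary seed
pairs = [correlated_increments(sigma_x, sigma_y, rho, 0.01, rng) for _ in range(50000)]
sxy = sum(dx * dy for dx, dy in pairs)
sxx = sum(dx * dx for dx, _ in pairs)
syy = sum(dy * dy for _, dy in pairs)
print(sxy / math.sqrt(sxx * syy))  # sample correlation of the simulated increments
```

The sample correlation should be close to the chosen $\rho$, which illustrates why fixing $\sigma_{x2t} \equiv 0$ loses no generality for variances and correlations.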
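In many continuous-time models, one stochastic process is defined in terms of a function of two
other, not necessarily perfectly correlated, stochastic processes. For that purpose we need the
following two-dimensional version of Itô's Lemma.
Theorem B.7. Suppose $x = (x_t)$ and $y = (y_t)$ are two stochastic processes with dynamics
\[
dx_t = \mu_{xt}\,dt + \sigma_{x1t}\,dz_{1t} + \sigma_{x2t}\,dz_{2t}, \qquad dy_t = \mu_{yt}\,dt + \sigma_{y1t}\,dz_{1t} + \sigma_{y2t}\,dz_{2t}, \tag{B.17}
\]
where $z_1 = (z_{1t})$ and $z_2 = (z_{2t})$ are independent standard Brownian motions. Let $g(x, y, t)$ be a
real-valued function for which all the derivatives $\frac{\partial g}{\partial t}$, $\frac{\partial g}{\partial x}$, $\frac{\partial g}{\partial y}$, $\frac{\partial^2 g}{\partial x^2}$, $\frac{\partial^2 g}{\partial y^2}$, and $\frac{\partial^2 g}{\partial x \partial y}$ exist and are
continuous. Then the process $W = (W_t)$ defined by $W_t = g(x_t, y_t, t)$ is an Itô process with
\[
\begin{aligned}
dW_t &= \left[\frac{\partial g}{\partial t} + \frac{\partial g}{\partial x}\mu_{xt} + \frac{\partial g}{\partial y}\mu_{yt} + \frac{1}{2}\frac{\partial^2 g}{\partial x^2}\left(\sigma_{x1t}^2 + \sigma_{x2t}^2\right) + \frac{1}{2}\frac{\partial^2 g}{\partial y^2}\left(\sigma_{y1t}^2 + \sigma_{y2t}^2\right) \right. \\
&\qquad \left. + \frac{\partial^2 g}{\partial x \partial y}\left(\sigma_{x1t}\sigma_{y1t} + \sigma_{x2t}\sigma_{y2t}\right)\right] dt \\
&\quad + \left(\frac{\partial g}{\partial x}\sigma_{x1t} + \frac{\partial g}{\partial y}\sigma_{y1t}\right) dz_{1t} + \left(\frac{\partial g}{\partial x}\sigma_{x2t} + \frac{\partial g}{\partial y}\sigma_{y2t}\right) dz_{2t},
\end{aligned}
\]
where the dependence of all the partial derivatives on $(x_t, y_t, t)$ has been notationally suppressed.
Alternatively, the result can be written more compactly as
\[
dW_t = \frac{\partial g}{\partial t}\,dt + \frac{\partial g}{\partial x}\,dx_t + \frac{\partial g}{\partial y}\,dy_t + \frac{1}{2}\frac{\partial^2 g}{\partial x^2}(dx_t)^2 + \frac{1}{2}\frac{\partial^2 g}{\partial y^2}(dy_t)^2 + \frac{\partial^2 g}{\partial x \partial y}(dx_t)(dy_t),
\]
where it is understood that $(dt)^2 = dt \cdot dz_{1t} = dt \cdot dz_{2t} = dz_{1t} \cdot dz_{2t} = 0$, while $(dz_{1t})^2 = (dz_{2t})^2 = dt$ as in the one-dimensional case.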
Example B.1. Suppose that the dynamics of $x$ and $y$ are given by (B.17) and $W_t = x_t y_t$. In
order to find the dynamics of $W$, we apply the above version of Itô's Lemma with the function
$g(x, y) = xy$. The relevant partial derivatives are
\[
\frac{\partial g}{\partial t} = 0, \quad \frac{\partial g}{\partial x} = y, \quad \frac{\partial g}{\partial y} = x, \quad \frac{\partial^2 g}{\partial x^2} = 0, \quad \frac{\partial^2 g}{\partial y^2} = 0, \quad \frac{\partial^2 g}{\partial x \partial y} = 1.
\]
Hence,
\[
dW_t = y_t\,dx_t + x_t\,dy_t + (dx_t)(dy_t).
\]
In particular, if the dynamics of $x$ and $y$ are written on the form
\[
dx_t = x_t\left[m_{xt}\,dt + v_{x1t}\,dz_{1t} + v_{x2t}\,dz_{2t}\right], \qquad dy_t = y_t\left[m_{yt}\,dt + v_{y1t}\,dz_{1t} + v_{y2t}\,dz_{2t}\right], \tag{B.18}
\]
we get
\[
dW_t = W_t\left[\left(m_{xt} + m_{yt} + v_{x1t}v_{y1t} + v_{x2t}v_{y2t}\right) dt + \left(v_{x1t} + v_{y1t}\right) dz_{1t} + \left(v_{x2t} + v_{y2t}\right) dz_{2t}\right].
\]
For the special case, where both $x$ and $y$ are geometric Brownian motions so that $m_x$, $m_y$, $v_{x1}$, $v_{x2}$,
$v_{y1}$, and $v_{y2}$ are all constants, it follows that $W_t = x_t y_t$ is also a geometric Brownian motion. □
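Example B.2. Define $W_t = x_t/y_t$. In this case we need to apply Itô's Lemma with the function
$g(x, y) = x/y$ which has derivatives
\[
\frac{\partial g}{\partial t} = 0, \quad \frac{\partial g}{\partial x} = \frac{1}{y}, \quad \frac{\partial g}{\partial y} = -\frac{x}{y^2}, \quad \frac{\partial^2 g}{\partial x^2} = 0, \quad \frac{\partial^2 g}{\partial y^2} = 2\frac{x}{y^3}, \quad \frac{\partial^2 g}{\partial x \partial y} = -\frac{1}{y^2}.
\]
Then
\[
\begin{aligned}
dW_t &= \frac{1}{y_t}\,dx_t - \frac{x_t}{y_t^2}\,dy_t + \frac{x_t}{y_t^3}(dy_t)^2 - \frac{1}{y_t^2}(dx_t)(dy_t) \\
&= W_t\left[\frac{dx_t}{x_t} - \frac{dy_t}{y_t} + \left(\frac{dy_t}{y_t}\right)^2 - \frac{dx_t}{x_t}\frac{dy_t}{y_t}\right].
\end{aligned}
\]
In particular, if the dynamics of $x$ and $y$ are given by (B.18), the dynamics of $W_t = x_t/y_t$ becomes
\[
\begin{aligned}
dW_t = W_t\Big[&\left(m_{xt} - m_{yt} + \left(v_{y1t}^2 + v_{y2t}^2\right) - \left(v_{x1t}v_{y1t} + v_{x2t}v_{y2t}\right)\right) dt \\
&+ \left(v_{x1t} - v_{y1t}\right) dz_{1t} + \left(v_{x2t} - v_{y2t}\right) dz_{2t}\Big].
\end{aligned}
\]
Note that for the special case, where both $x$ and $y$ are geometric Brownian motions, $W = x/y$ is
also a geometric Brownian motion. □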
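The product and ratio rules of Examples B.1 and B.2 can be encoded directly for the case of constant coefficients. The sketch below returns the relative drift and the two shock sensitivities of $W$; as a consistency check, dividing the product $xy$ by $y$ should return the coefficients of $x$. The function names and the numerical inputs are hypothetical.

```python
def product_dynamics(m_x, v_x, m_y, v_y):
    """Relative drift and shock sensitivities of W = x * y (Example B.1).

    v_x and v_y are pairs (v_1, v_2) of sensitivities to dz_1 and dz_2.
    """
    drift = m_x + m_y + v_x[0] * v_y[0] + v_x[1] * v_y[1]
    return drift, (v_x[0] + v_y[0], v_x[1] + v_y[1])


def ratio_dynamics(m_x, v_x, m_y, v_y):
    """Relative drift and shock sensitivities of W = x / y (Example B.2)."""
    drift = (m_x - m_y
             + (v_y[0] ** 2 + v_y[1] ** 2)
             - (v_x[0] * v_y[0] + v_x[1] * v_y[1]))
    return drift, (v_x[0] - v_y[0], v_x[1] - v_y[1])


# Hypothetical coefficients for two geometric Brownian motions.
m_x, v_x = 0.08, (0.20, 0.00)
m_y, v_y = 0.03, (0.05, 0.10)

m_w, v_w = product_dynamics(m_x, v_x, m_y, v_y)
print(m_w, v_w)

# Dividing the product by y should give back the dynamics of x.
print(ratio_dynamics(m_w, v_w, m_y, v_y))
```

Note how the ratio picks up the extra drift term $v_{y1}^2 + v_{y2}^2$, so the relative drift of $x/y$ is not simply $m_x - m_y$ even when $x$ and $y$ are driven by independent shocks.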
We can apply the two-dimensional version of Itô's Lemma to prove the following useful result
relating expected discounted values and the drift rate.
Theorem B.8. Under suitable regularity conditions, the relative drift rate of an Itô process $x = (x_t)$
is given by the process $m = (m_t)$ if and only if $x_t = \mathrm{E}_t\left[x_T \exp\left\{-\int_t^T m_s\,ds\right\}\right]$.
Proof. Suppose first that the relative drift rate is given by $m$ so that $dx_t = x_t[m_t\,dt + v_t\,dz_t]$. Let
us use Itô's Lemma to identify the dynamics of the process $W_t = x_t \exp\left\{-\int_0^t m_s\,ds\right\}$ or $W_t = x_t y_t$,
where $y_t = \exp\left\{-\int_0^t m_s\,ds\right\}$. Note that $dy_t = -y_t m_t\,dt$ so that $y$ is a locally deterministic
stochastic process. From Example B.1, the dynamics of $W$ becomes
\[
dW_t = W_t\left[(m_t - m_t + 0)\,dt + v_t\,dz_t\right] = W_t v_t\,dz_t.
\]
Since $W$ has zero drift, it is a martingale. It follows that $W_t = \mathrm{E}_t[W_T]$, i.e.,
$x_t \exp\left\{-\int_0^t m_s\,ds\right\} = \mathrm{E}_t\left[x_T \exp\left\{-\int_0^T m_s\,ds\right\}\right]$ and hence $x_t = \mathrm{E}_t\left[x_T \exp\left\{-\int_t^T m_s\,ds\right\}\right]$.
If, on the other hand, $x_t = \mathrm{E}_t\left[x_T \exp\left\{-\int_t^T m_s\,ds\right\}\right]$ for all $t$, then the absolute drift of $x$ follows
from this computation:
\[
\begin{aligned}
\frac{1}{\Delta t}\,\mathrm{E}_t\left[x_{t+\Delta t} - x_t\right]
&= \frac{1}{\Delta t}\,\mathrm{E}_t\left[\mathrm{E}_{t+\Delta t}\left[x_T e^{-\int_{t+\Delta t}^T m_s\,ds}\right] - \mathrm{E}_t\left[x_T e^{-\int_t^T m_s\,ds}\right]\right] \\
&= \frac{1}{\Delta t}\,\mathrm{E}_t\left[x_T e^{-\int_{t+\Delta t}^T m_s\,ds} - x_T e^{-\int_t^T m_s\,ds}\right] \\
&= \mathrm{E}_t\left[x_T e^{-\int_t^T m_s\,ds}\,\frac{e^{\int_t^{t+\Delta t} m_s\,ds} - 1}{\Delta t}\right] \\
&\to m_t\,\mathrm{E}_t\left[x_T e^{-\int_t^T m_s\,ds}\right] = m_t x_t \quad \text{as } \Delta t \to 0,
\end{aligned}
\]
so that the relative drift rate equals $m_t$.
B.9.2 K-dimensional processes
Simultaneously modeling the dynamics of a lot of economic quantities requires the use of a lot
of shocks to those quantities. For that purpose we will represent shocks to the economy
by a vector standard Brownian motion. We define this below and state Itô's Lemma for processes
of a general dimension.
A K-dimensional standard Brownian motion $\mathbf{z} = (z_1, \dots, z_K)^\top$ is a stochastic process for
which the individual components $z_i$ are mutually independent one-dimensional standard Brownian
motions. If we let $\mathbf{0} = (0, \dots, 0)^\top$ denote the zero vector in $\mathbb{R}^K$ and let $I$ denote the identity
matrix of dimension $K \times K$ (the matrix with ones in the diagonal and zeros in all other entries),
then we can write the defining properties of a K-dimensional Brownian motion $\mathbf{z}$ as follows:
(i) $\mathbf{z}_0 = \mathbf{0}$,
(ii) for all $t, t' \ge 0$ with $t < t'$: $\mathbf{z}_{t'} - \mathbf{z}_t \sim N(\mathbf{0}, (t' - t) I)$ [normally distributed increments],
(iii) for all $0 \le t_0 < t_1 < \dots < t_n$, the random variables $\mathbf{z}_{t_1} - \mathbf{z}_{t_0}, \dots, \mathbf{z}_{t_n} - \mathbf{z}_{t_{n-1}}$ are mutually
independent [independent increments],
(iv) $\mathbf{z}$ has continuous sample paths in $\mathbb{R}^K$.
Here, $N(\mathbf{a}, \mathbf{b})$ denotes a K-dimensional normal distribution with mean vector $\mathbf{a}$ and
variance-covariance matrix $\mathbf{b}$.
A K-dimensional diusion process x = (x
1
, . . . , x
K
)
>
is a process with increments of the
form
dx
t
= (x
t
, t) dt + (x
t
, t) dz
t
,
where is a function from R
K
R
+
into R
K
, and is a function from R
K
R
+
into the space
of K K-matrices. As before, z is a K-dimensional standard Brownian motion. The evolution of
the multi-dimensional diusion can also be written componentwise as
dx
it
=
i
(x
t
, t) dt +
i
(x
t
, t)
>
dz
t
=
i
(x
t
, t) dt +
K

k=1

ik
(x
t
, t) dz
kt
, i = 1, . . . , K,
where
i
(x
t
, t)
>
is the ith row of the matrix (x
t
, t), and
ik
(x
t
, t) is the (i, k)th entry (i.e.,
the entry in row i, column k). Since dz
1t
, . . . , dz
Kt
are mutually independent and all N(0, dt)
distributed, the expected change in the ith component process over an innitesimal period is
$$\mathrm{E}_t[dx_{it}] = \mu_i(x_t, t)\,dt, \quad i = 1, \dots, K,$$
so that $\mu_i$ can be interpreted as the drift of the ith component. Furthermore, the covariance between changes in the ith and the jth component processes over an infinitesimal period becomes
$$\begin{aligned}
\mathrm{Cov}_t[dx_{it}, dx_{jt}] &= \mathrm{Cov}_t\left[ \sum_{k=1}^K \sigma_{ik}(x_t, t)\,dz_{kt}, \sum_{l=1}^K \sigma_{jl}(x_t, t)\,dz_{lt} \right] \\
&= \sum_{k=1}^K \sum_{l=1}^K \sigma_{ik}(x_t, t)\,\sigma_{jl}(x_t, t)\,\mathrm{Cov}_t[dz_{kt}, dz_{lt}] \\
&= \sum_{k=1}^K \sigma_{ik}(x_t, t)\,\sigma_{jk}(x_t, t)\,dt \\
&= \sigma_i(x_t, t)^\top \sigma_j(x_t, t)\,dt, \quad i, j = 1, \dots, K,
\end{aligned}$$
where we have applied the usual rules for covariances and the independence of the components of $z$. In particular, the variance of the change in the ith component process over an infinitesimal period is given by
$$\mathrm{Var}_t[dx_{it}] = \mathrm{Cov}_t[dx_{it}, dx_{it}] = \sum_{k=1}^K \sigma_{ik}(x_t, t)^2\,dt = \|\sigma_i(x_t, t)\|^2\,dt, \quad i = 1, \dots, K.$$
The volatility of the ith component is given by $\|\sigma_i(x_t, t)\|$. The variance-covariance matrix of changes of $x_t$ over the next instant is $\Sigma(x_t, t)\,dt = \sigma(x_t, t)\,\sigma(x_t, t)^\top dt$. The correlation between instantaneous increments in two component processes is
$$\mathrm{Corr}_t[dx_{it}, dx_{jt}] = \frac{\sigma_i(x_t, t)^\top \sigma_j(x_t, t)\,dt}{\sqrt{\|\sigma_i(x_t, t)\|^2\,dt\;\|\sigma_j(x_t, t)\|^2\,dt}} = \frac{\sigma_i(x_t, t)^\top \sigma_j(x_t, t)}{\|\sigma_i(x_t, t)\|\,\|\sigma_j(x_t, t)\|},$$
which can be any number in $[-1, 1]$ depending on the elements of $\sigma_i$ and $\sigma_j$.
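The formulas for the instantaneous covariance, volatility, and correlation can be evaluated directly from a given sensitivity matrix. The sketch below is a hedged illustration: the function name `instantaneous_moments` and the 2x2 matrix of numbers are assumptions made for this example, with the matrix standing in for $\sigma(x_t, t)$ evaluated at one point.

```python
import math


def instantaneous_moments(sigma):
    """Given the matrix sigma (row i holds the sensitivities of x_i to the
    shocks dz_1, ..., dz_K), return per unit of time the covariance matrix
    sigma * sigma^T, the volatilities ||sigma_i||, and the correlations."""
    K = len(sigma)
    cov = [
        [sum(sigma[i][k] * sigma[j][k] for k in range(len(sigma[i]))) for j in range(K)]
        for i in range(K)
    ]
    vol = [math.sqrt(cov[i][i]) for i in range(K)]
    corr = [[cov[i][j] / (vol[i] * vol[j]) for j in range(K)] for i in range(K)]
    return cov, vol, corr


# Illustrative numbers only.
sigma = [
    [0.20, 0.00],
    [0.09, 0.12],
]
cov, vol, corr = instantaneous_moments(sigma)
```

For these inputs the two volatilities are 0.20 and 0.15, and the instantaneous correlation is 0.018 / (0.20 * 0.15) = 0.6.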
Similarly, we can define a K-dimensional Itô process $x = (x_1, \dots, x_K)^\top$ to be a process with increments of the form
$$dx_t = \mu_t\,dt + \sigma_t\,dz_t,$$
where $\mu = (\mu_t)$ is a K-dimensional stochastic process and $\sigma = (\sigma_t)$ is a stochastic process with values in the space of $K \times K$ matrices.
Next, we state a multi-dimensional version of Itô's Lemma, where a one-dimensional process is defined as a function of time and a multi-dimensional process.
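Theorem B.9. Let $x = (x_t)_{t \geq 0}$ be an Itô process in $\mathbb{R}^K$ with dynamics $dx_t = \mu_t\,dt + \sigma_t\,dz_t$ or, equivalently,
$$dx_{it} = \mu_{it}\,dt + \sigma_{it}^\top dz_t = \mu_{it}\,dt + \sum_{k=1}^K \sigma_{ikt}\,dz_{kt}, \quad i = 1, \dots, K,$$
where $z_1, \dots, z_K$ are independent standard Brownian motions, and $\mu_i$ and $\sigma_{ik}$ are well-behaved stochastic processes.
Let $g(x, t)$ be a real-valued function for which all the derivatives $\frac{\partial g}{\partial t}$, $\frac{\partial g}{\partial x_i}$, and $\frac{\partial^2 g}{\partial x_i \partial x_j}$ exist and are continuous. Then the process $y = (y_t)_{t \geq 0}$ defined by $y_t = g(x_t, t)$ is also an Itô process with dynamics
$$\begin{aligned}
dy_t = {} & \left( \frac{\partial g}{\partial t}(x_t, t) + \sum_{i=1}^K \frac{\partial g}{\partial x_i}(x_t, t)\,\mu_{it} + \frac{1}{2} \sum_{i=1}^K \sum_{j=1}^K \frac{\partial^2 g}{\partial x_i \partial x_j}(x_t, t)\,\gamma_{ijt} \right) dt \\
& + \sum_{i=1}^K \frac{\partial g}{\partial x_i}(x_t, t)\,\sigma_{i1t}\,dz_{1t} + \dots + \sum_{i=1}^K \frac{\partial g}{\partial x_i}(x_t, t)\,\sigma_{iKt}\,dz_{Kt},
\end{aligned}$$
where $\gamma_{ij} = \sigma_{i1}\sigma_{j1} + \dots + \sigma_{iK}\sigma_{jK}$ is the covariance between the processes $x_i$ and $x_j$.
The result can also be written as
$$dy_t = \frac{\partial g}{\partial t}(x_t, t)\,dt + \sum_{i=1}^K \frac{\partial g}{\partial x_i}(x_t, t)\,dx_{it} + \frac{1}{2} \sum_{i=1}^K \sum_{j=1}^K \frac{\partial^2 g}{\partial x_i \partial x_j}(x_t, t)\,(dx_{it})(dx_{jt}),$$
where in the computation of $(dx_{it})(dx_{jt})$ one must use the rules $(dt)^2 = dt \cdot dz_{it} = 0$ for all i, $dz_{it} \cdot dz_{jt} = 0$ for $i \neq j$, and $(dz_{it})^2 = dt$ for all i. Alternatively, the result can be expressed using vector and matrix notation:
$$dy_t = \left( \frac{\partial g}{\partial t}(x_t, t) + \left( \frac{\partial g}{\partial x}(x_t, t) \right)^\top \mu_t + \frac{1}{2} \mathrm{tr}\left( \sigma_t^\top \left[ \frac{\partial^2 g}{\partial x^2}(x_t, t) \right] \sigma_t \right) \right) dt + \left( \frac{\partial g}{\partial x}(x_t, t) \right)^\top \sigma_t\,dz_t,$$
where
$$\frac{\partial g}{\partial x}(x_t, t) = \begin{pmatrix} \frac{\partial g}{\partial x_1}(x_t, t) \\ \vdots \\ \frac{\partial g}{\partial x_K}(x_t, t) \end{pmatrix}, \qquad
\frac{\partial^2 g}{\partial x^2}(x_t, t) = \begin{pmatrix}
\frac{\partial^2 g}{\partial x_1^2}(x_t, t) & \frac{\partial^2 g}{\partial x_1 \partial x_2}(x_t, t) & \dots & \frac{\partial^2 g}{\partial x_1 \partial x_K}(x_t, t) \\
\frac{\partial^2 g}{\partial x_2 \partial x_1}(x_t, t) & \frac{\partial^2 g}{\partial x_2^2}(x_t, t) & \dots & \frac{\partial^2 g}{\partial x_2 \partial x_K}(x_t, t) \\
\vdots & \vdots & \ddots & \vdots \\
\frac{\partial^2 g}{\partial x_K \partial x_1}(x_t, t) & \frac{\partial^2 g}{\partial x_K \partial x_2}(x_t, t) & \dots & \frac{\partial^2 g}{\partial x_K^2}(x_t, t)
\end{pmatrix},$$
and tr denotes the trace of a square matrix, i.e., the sum of the diagonal elements. For example, $\mathrm{tr}(A) = \sum_{i=1}^K A_{ii}$.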
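To see how the terms of Theorem B.9 combine, the drift and the shock sensitivities of $y_t = g(x_t, t)$ can be computed at a single point. The following Python sketch is an illustration under assumptions: the function name `ito_dynamics`, the example function $g(x, t) = x_1^2 x_2$, the evaluation point $x_t = (1, 2)$, and all numerical inputs are hypothetical choices for this example.

```python
def ito_dynamics(g_t, grad, hess, mu, sigma):
    """Evaluate the drift and the shock sensitivities of y = g(x, t) at one
    point, given dg/dt (g_t), the gradient, the Hessian, the drift vector mu
    and the sensitivity matrix sigma of the Ito process x."""
    K = len(mu)
    n_shocks = len(sigma[0])
    # Covariance terms: sum over k of sigma_ik * sigma_jk.
    gamma = [
        [sum(sigma[i][k] * sigma[j][k] for k in range(n_shocks)) for j in range(K)]
        for i in range(K)
    ]
    drift = (
        g_t
        + sum(grad[i] * mu[i] for i in range(K))
        + 0.5 * sum(hess[i][j] * gamma[i][j] for i in range(K) for j in range(K))
    )
    # Coefficient on dz_k: sum over i of (dg/dx_i) * sigma_ik.
    diffusion = [sum(grad[i] * sigma[i][k] for i in range(K)) for k in range(n_shocks)]
    return drift, diffusion


# Example: g(x, t) = x_1^2 * x_2 evaluated at x = (1, 2); no explicit time dependence.
x1, x2 = 1.0, 2.0
grad = [2 * x1 * x2, x1 ** 2]              # (dg/dx_1, dg/dx_2)
hess = [[2 * x2, 2 * x1], [2 * x1, 0.0]]   # second derivatives
mu = [0.05, 0.03]                          # illustrative drifts
sigma = [[0.20, 0.00], [0.09, 0.12]]       # illustrative sensitivities
drift, diffusion = ito_dynamics(0.0, grad, hess, mu, sigma)
```

Here the second-order term contributes 0.5 * (4 * 0.04 + 2 * 2 * 0.018) = 0.116 to the drift, on top of the first-order contribution 4 * 0.05 + 1 * 0.03 = 0.23.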
The probabilistic properties of a K-dimensional diffusion process are completely specified by the drift function $\mu$ and the variance-covariance function $\Sigma$. The values of the variance-covariance function are symmetric and positive-definite matrices. Above we had $\Sigma = \sigma\sigma^\top$ for a general $(K \times K)$-matrix $\sigma$. But from linear algebra it is well-known that a symmetric and positive-definite matrix can be written as $\hat\sigma\hat\sigma^\top$ for a lower-triangular matrix $\hat\sigma$, i.e., a matrix with $\hat\sigma_{ik} = 0$ for $k > i$. This is the so-called Cholesky decomposition. Hence, we may write the dynamics as
$$\begin{aligned}
dx_{1t} &= \mu_1(x_t, t)\,dt + \hat\sigma_{11}(x_t, t)\,dz_{1t} \\
dx_{2t} &= \mu_2(x_t, t)\,dt + \hat\sigma_{21}(x_t, t)\,dz_{1t} + \hat\sigma_{22}(x_t, t)\,dz_{2t} \\
&\;\;\vdots \\
dx_{Kt} &= \mu_K(x_t, t)\,dt + \hat\sigma_{K1}(x_t, t)\,dz_{1t} + \hat\sigma_{K2}(x_t, t)\,dz_{2t} + \dots + \hat\sigma_{KK}(x_t, t)\,dz_{Kt}
\end{aligned} \tag{B.19}$$
We can think of building up the model by starting with $x_1$. The shocks to $x_1$ are represented by the standard Brownian motion $z_1$ and its coefficient $\hat\sigma_{11}$ is the volatility of $x_1$. Then we extend the model to include $x_2$. Unless the infinitesimal changes to $x_1$ and $x_2$ are always perfectly correlated we need to introduce another standard Brownian motion, $z_2$. The coefficient $\hat\sigma_{21}$ is fixed to match the covariance between changes to $x_1$ and $x_2$ and then $\hat\sigma_{22}$ can be chosen so that $\sqrt{\hat\sigma_{21}^2 + \hat\sigma_{22}^2}$ equals the volatility of $x_2$. The model may be extended to include additional processes in the same manner.
Some authors prefer to write the dynamics in an alternative way with a single standard Brownian motion $\hat z_i$ for each component $x_i$ such as
$$\begin{aligned}
dx_{1t} &= \mu_1(x_t, t)\,dt + V_1(x_t, t)\,d\hat z_{1t} \\
dx_{2t} &= \mu_2(x_t, t)\,dt + V_2(x_t, t)\,d\hat z_{2t} \\
&\;\;\vdots \\
dx_{Kt} &= \mu_K(x_t, t)\,dt + V_K(x_t, t)\,d\hat z_{Kt}
\end{aligned} \tag{B.20}$$
Clearly, the coefficient $V_i(x_t, t)$ is then the volatility of $x_i$. To capture an instantaneous non-zero correlation between the different components the standard Brownian motions $\hat z_1, \dots, \hat z_K$ have to be mutually correlated. Let $\rho_{ij}$ be the correlation between $\hat z_i$ and $\hat z_j$. If (B.20) and (B.19) are meant to represent the same dynamics, we must have
$$V_i = \sqrt{\hat\sigma_{i1}^2 + \dots + \hat\sigma_{ii}^2}, \quad i = 1, \dots, K,$$
$$\rho_{ii} = 1; \qquad \rho_{ij} = \frac{\sum_{k=1}^{i} \hat\sigma_{ik}\hat\sigma_{jk}}{V_i V_j}, \quad \rho_{ji} = \rho_{ij}, \quad i < j.$$
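The link between the two representations (B.19) and (B.20) can be traced numerically in both directions. The Python sketch below is illustrative and rests on assumptions: the function names `cholesky` and `implied_vol_and_corr` and the volatilities and correlations used are hypothetical. Starting from volatilities $V_i$ and correlations $\rho_{ij}$, it builds the variance-covariance matrix, computes a lower-triangular factor by the standard Cholesky recursion, and then recovers the volatilities and correlations from that factor.

```python
import math


def cholesky(Sigma):
    """Lower-triangular L with L * L^T = Sigma, for a symmetric
    positive-definite matrix Sigma (standard Cholesky recursion)."""
    K = len(Sigma)
    L = [[0.0] * K for _ in range(K)]
    for i in range(K):
        for k in range(i + 1):
            partial = sum(L[i][m] * L[k][m] for m in range(k))
            if i == k:
                L[i][i] = math.sqrt(Sigma[i][i] - partial)
            else:
                L[i][k] = (Sigma[i][k] - partial) / L[k][k]
    return L


def implied_vol_and_corr(L):
    """Volatilities and correlations implied by a lower-triangular factor."""
    K = len(L)
    V = [math.sqrt(sum(entry * entry for entry in L[i])) for i in range(K)]
    rho = [
        [sum(L[i][k] * L[j][k] for k in range(K)) / (V[i] * V[j]) for j in range(K)]
        for i in range(K)
    ]
    return V, rho


# Illustrative volatilities and correlations (assumed values).
V = [0.20, 0.15, 0.10]
rho = [[1.0, 0.6, 0.3], [0.6, 1.0, 0.2], [0.3, 0.2, 1.0]]
Sigma = [[rho[i][j] * V[i] * V[j] for j in range(3)] for i in range(3)]
sigma_hat = cholesky(Sigma)
V_back, rho_back = implied_vol_and_corr(sigma_hat)
```

In this example the second row of `sigma_hat` is (0.09, 0.12, 0), so that the second volatility is the square root of 0.09² + 0.12², i.e. 0.15.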
B.10 Change of probability measure
When we represent the evolution of a given economic variable by a stochastic process and discuss the distributional properties of this process, we have implicitly fixed a probability measure $\mathbb{P}$. For example, when we use the square-root process $x = (x_t)$ in (B.16) for the dynamics of a particular interest rate, we have taken as given a probability measure $\mathbb{P}$ under which the stochastic process $z = (z_t)$ is a standard Brownian motion. Since the process $x$ is presumably meant to represent the uncertain dynamics of the interest rate in the world we live in, we refer to the measure $\mathbb{P}$ as the real-world probability measure. Of course, it is the real-world dynamics and distributional properties of economic variables that we are ultimately interested in. Nevertheless, it turns out that in order to compute and understand prices and rates it is often convenient to look at the dynamics and distributional properties of these variables assuming that the world was different from the world we live in. The prime example is a hypothetical world in which investors are assumed to be risk-neutral instead of risk-averse. Loosely speaking, a different world is represented mathematically by a different probability measure. Hence, we need to be able to analyze stochastic variables and processes under different probability measures. In this section we will briefly discuss how we can change the probability measure.
Consider first a state space $\Omega$ with finitely many elements, $\Omega = \{\omega_1, \dots, \omega_n\}$. As before, the set of events, i.e., subsets of $\Omega$, that can be assigned a probability is denoted by $\mathcal{F}$. Let us assume that the single-element sets $\{\omega_i\}$, $i = 1, \dots, n$, belong to $\mathcal{F}$. In this case we can represent a probability measure $\mathbb{P}$ by a vector $(p_1, \dots, p_n)$ of probabilities assigned to each of the individual elements:
$$p_i = \mathbb{P}(\{\omega_i\}), \quad i = 1, \dots, n.$$
Of course, we must have that $p_i \in [0, 1]$ and that $\sum_{i=1}^n p_i = 1$. The probability assigned to any other event can be computed from these basic probabilities. For example, the probability of the event $\{\omega_2, \omega_4\}$ is given by
$$\mathbb{P}(\{\omega_2, \omega_4\}) = \mathbb{P}(\{\omega_2\} \cup \{\omega_4\}) = \mathbb{P}(\{\omega_2\}) + \mathbb{P}(\{\omega_4\}) = p_2 + p_4.$$
Another probability measure $\mathbb{Q}$ on $\mathcal{F}$ is similarly given by a vector $(q_1, \dots, q_n)$ with $q_i \in [0, 1]$ and $\sum_{i=1}^n q_i = 1$. We are only interested in equivalent probability measures. In this setting, the two measures $\mathbb{P}$ and $\mathbb{Q}$ will be equivalent whenever $p_i > 0 \Leftrightarrow q_i > 0$ for all $i = 1, \dots, n$. With a finite state space there is no point in including states that occur with zero probability so we can assume that all $p_i$, and therefore all $q_i$, are strictly positive.
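We can represent the change of probability measure from $\mathbb{P}$ to $\mathbb{Q}$ by the vector $\xi = (\xi_1, \dots, \xi_n)$, where
$$\xi_i = \frac{q_i}{p_i}, \quad i = 1, \dots, n.$$
We can think of $\xi$ as a random variable that will take on the value $\xi_i$ if the state $\omega_i$ is realized. Sometimes $\xi$ is called the Radon-Nikodym derivative of $\mathbb{Q}$ with respect to $\mathbb{P}$ and is denoted by $d\mathbb{Q}/d\mathbb{P}$. Note that $\xi_i > 0$ for all i and that the $\mathbb{P}$-expectation of $\xi = d\mathbb{Q}/d\mathbb{P}$ is
$$\mathrm{E}^{\mathbb{P}}\left[ \frac{d\mathbb{Q}}{d\mathbb{P}} \right] = \mathrm{E}^{\mathbb{P}}[\xi] = \sum_{i=1}^n p_i \xi_i = \sum_{i=1}^n p_i \frac{q_i}{p_i} = \sum_{i=1}^n q_i = 1.$$
Consider a random variable $x$ that takes on the value $x_i$ if state i is realized. The expected value of $x$ under the measure $\mathbb{Q}$ is given by
$$\mathrm{E}^{\mathbb{Q}}[x] = \sum_{i=1}^n q_i x_i = \sum_{i=1}^n p_i \frac{q_i}{p_i} x_i = \sum_{i=1}^n p_i \xi_i x_i = \mathrm{E}^{\mathbb{P}}[\xi x].$$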
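The finite-state change of measure is easy to check by hand or by machine. The following Python sketch uses exact fractions; the three-state probabilities and payoffs are invented for this illustration, and the helper name `expectation` is likewise an assumption.

```python
from fractions import Fraction as F

# Illustrative three-state example (all numbers are assumptions).
p = [F(1, 4), F(1, 4), F(1, 2)]   # real-world probabilities p_i
q = [F(1, 2), F(1, 3), F(1, 6)]   # probabilities q_i under an equivalent measure
x = [F(10), F(4), F(-2)]          # values x_i of a random variable

# Radon-Nikodym derivative state by state: q_i / p_i.
xi = [q_i / p_i for q_i, p_i in zip(q, p)]


def expectation(prob, values):
    """Expected value of a random variable on a finite state space."""
    return sum(pr * v for pr, v in zip(prob, values))


mean_of_xi = expectation(p, xi)                                       # P-expectation of xi
e_q_direct = expectation(q, x)                                        # Q-expectation of x
e_q_via_p = expectation(p, [xi_i * x_i for xi_i, x_i in zip(xi, x)])  # P-expectation of xi * x
```

In this example the Radon-Nikodym derivative takes the values 2, 4/3, and 1/3, its expectation under the original measure is exactly 1, and both ways of computing the expectation of x under the new measure give 6.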
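Now let us consider the case where the state space $\Omega$ is infinite. Also in this case the change from a probability measure $\mathbb{P}$ to an equivalent probability measure $\mathbb{Q}$ is represented by a strictly positive random variable $\xi = d\mathbb{Q}/d\mathbb{P}$ with $\mathrm{E}^{\mathbb{P}}[\xi] = 1$. Again the expected value under the measure $\mathbb{Q}$ of a random variable $x$ is given by $\mathrm{E}^{\mathbb{Q}}[x] = \mathrm{E}^{\mathbb{P}}[\xi x]$, since
$$\mathrm{E}^{\mathbb{Q}}[x] = \int_\Omega x\,d\mathbb{Q} = \int_\Omega x \frac{d\mathbb{Q}}{d\mathbb{P}}\,d\mathbb{P} = \int_\Omega x\,\xi\,d\mathbb{P} = \mathrm{E}^{\mathbb{P}}[\xi x].$$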
In our economic models we will model the dynamics of uncertain objects over some time span $[0, T]$. For example, we might be interested in determining bond prices with maturities up to T years. Then we are interested in the stochastic process on this time interval, i.e., $x = (x_t)_{t \in [0,T]}$. The state space $\Omega$ is the set of possible paths of the relevant processes over the period $[0, T]$ so that all the relevant uncertainty has been resolved at time T and the values of all relevant random variables will be known at time T. The Radon-Nikodym derivative $\xi = d\mathbb{Q}/d\mathbb{P}$ is also a random variable and is therefore known at time T and usually not before time T. To indicate this the Radon-Nikodym derivative is often denoted by $\xi_T = \frac{d\mathbb{Q}}{d\mathbb{P}}$.
We can define a stochastic process $\xi = (\xi_t)_{t \in [0,T]}$ by setting
$$\xi_t = \mathrm{E}^{\mathbb{P}}_t\left[ \frac{d\mathbb{Q}}{d\mathbb{P}} \right] = \mathrm{E}^{\mathbb{P}}_t[\xi_T].$$
This definition is consistent with $\xi_T$ being identical to $d\mathbb{Q}/d\mathbb{P}$, since all uncertainty is resolved at time T so that the time T expectation of any variable is just equal to the variable. Note that the process $\xi$ is a $\mathbb{P}$-martingale, since for any $t < t' \leq T$ we have
$$\mathrm{E}^{\mathbb{P}}_t[\xi_{t'}] = \mathrm{E}^{\mathbb{P}}_t\left[ \mathrm{E}^{\mathbb{P}}_{t'}[\xi_T] \right] = \mathrm{E}^{\mathbb{P}}_t[\xi_T] = \xi_t.$$
Here the first and the third equalities follow from the definition of $\xi$. The second equality follows from the Law of Iterated Expectations, Theorem B.1. The following result turns out to be very useful in our dynamic models of the economy. Let $x = (x_t)_{t \in [0,T]}$ be any stochastic process. Then we have
$$\mathrm{E}^{\mathbb{Q}}_t[x_{t'}] = \mathrm{E}^{\mathbb{P}}_t\left[ \frac{\xi_{t'}}{\xi_t} x_{t'} \right]. \tag{B.21}$$
This is called Bayes' Formula. For a proof, see Björk (2009, Prop. B.41).
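Bayes' Formula (B.21) can be verified exactly in a small discrete setting. The sketch below is an illustration only: it assumes a hypothetical two-period binomial tree in which each move is "up" with probability 1/2 under the original measure and 1/3 under the new measure, and the function names and terminal values are invented for the example.

```python
from fractions import Fraction as F
from itertools import product

p_up, q_up = F(1, 2), F(1, 3)   # assumed up-probabilities under P and Q


def path_prob(path, up):
    """Probability of a path such as 'ud' when each move is 'u' with probability up."""
    prob = F(1)
    for move in path:
        prob *= up if move == "u" else 1 - up
    return prob


paths = ["".join(moves) for moves in product("ud", repeat=2)]
# Terminal Radon-Nikodym derivative, path by path.
xi_T = {w: path_prob(w, q_up) / path_prob(w, p_up) for w in paths}
# Illustrative terminal values of some process x.
x_T = {"uu": F(12), "ud": F(6), "du": F(6), "dd": F(3)}


def cond_exp(values, first_move, up):
    """Time-1 conditional expectation given the first move."""
    branch = [w for w in paths if w[0] == first_move]
    total = sum(path_prob(w, up) for w in branch)
    return sum(path_prob(w, up) * values[w] for w in branch) / total


results = {}
for first in "ud":
    xi_1 = cond_exp(xi_T, first, p_up)   # xi at time 1 as a P-conditional expectation
    lhs = cond_exp(x_T, first, q_up)     # conditional expectation under Q
    rhs = cond_exp({w: xi_T[w] / xi_1 * x_T[w] for w in paths}, first, p_up)
    results[first] = (lhs, rhs)
```

After an up-move both sides equal 8, and after a down-move both equal 4, in line with (B.21).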
Suppose that the underlying uncertainty is represented by a standard Brownian motion $z = (z_t)$ (under the real-world probability measure $\mathbb{P}$), as will be the case in all the models we will consider. Let $\lambda = (\lambda_t)_{t \in [0,T]}$ be any sufficiently well-behaved stochastic process.$^5$ Here, $z$ and $\lambda$ must have the same dimension. For notational simplicity, we assume in the following that they are one-dimensional, but the results generalize naturally to the multi-dimensional case. We can generate an equivalent probability measure $\mathbb{Q}^\lambda$ in the following way. Define the process $\xi^\lambda = (\xi^\lambda_t)_{t \in [0,T]}$ by
$$\xi^\lambda_t = \exp\left\{ -\int_0^t \lambda_s\,dz_s - \frac{1}{2} \int_0^t \lambda_s^2\,ds \right\}. \tag{B.22}$$
Then $\xi^\lambda_0 = 1$, $\xi^\lambda$ is strictly positive, and it can be shown that $\xi^\lambda$ is a $\mathbb{P}$-martingale (see Exercise B.6) so that $\mathrm{E}^{\mathbb{P}}[\xi^\lambda_T] = \xi^\lambda_0 = 1$. Consequently, an equivalent probability measure $\mathbb{Q}^\lambda$ can be defined by the Radon-Nikodym derivative
$$\frac{d\mathbb{Q}^\lambda}{d\mathbb{P}} = \xi^\lambda_T = \exp\left\{ -\int_0^T \lambda_s\,dz_s - \frac{1}{2} \int_0^T \lambda_s^2\,ds \right\}.$$
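From (B.21), we get that
$$\mathrm{E}^{\mathbb{Q}^\lambda}_t[x_{t'}] = \mathrm{E}^{\mathbb{P}}_t\left[ \frac{\xi^\lambda_{t'}}{\xi^\lambda_t} x_{t'} \right] = \mathrm{E}^{\mathbb{P}}_t\left[ x_{t'} \exp\left\{ -\int_t^{t'} \lambda_s\,dz_s - \frac{1}{2} \int_t^{t'} \lambda_s^2\,ds \right\} \right]$$
for any stochastic process $x = (x_t)_{t \in [0,T]}$. A central result is Girsanov's Theorem: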
$^5$ Basically, $\lambda$ must be square-integrable in the sense that $\int_0^T \lambda_t^2\,dt$ is finite with probability 1 and that $\lambda$ satisfies Novikov's condition, i.e., the expectation $\mathrm{E}^{\mathbb{P}}\left[ \exp\left\{ \frac{1}{2} \int_0^T \lambda_t^2\,dt \right\} \right]$ is finite.
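Theorem B.10 (Girsanov). The process $z^\lambda = (z^\lambda_t)_{t \in [0,T]}$ defined by
$$z^\lambda_t = z_t + \int_0^t \lambda_s\,ds, \quad 0 \leq t \leq T,$$
is a standard Brownian motion under the probability measure $\mathbb{Q}^\lambda$. In differential notation,
$$dz^\lambda_t = dz_t + \lambda_t\,dt.$$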
This theorem has the attractive consequence that the effects on a stochastic process of changing the probability measure from $\mathbb{P}$ to some $\mathbb{Q}^\lambda$ are captured by a simple adjustment of the drift. If $x = (x_t)$ is an Itô process with dynamics
$$dx_t = \mu_t\,dt + \sigma_t\,dz_t,$$
then
$$dx_t = \mu_t\,dt + \sigma_t \left( dz^\lambda_t - \lambda_t\,dt \right) = (\mu_t - \sigma_t \lambda_t)\,dt + \sigma_t\,dz^\lambda_t.$$
Hence, $\mu - \sigma\lambda$ is the drift under the probability measure $\mathbb{Q}^\lambda$, which is different from the drift $\mu$ under the original measure $\mathbb{P}$ unless $\sigma$ or $\lambda$ are identically equal to zero. In contrast, the volatility remains the same as under the original measure.
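Girsanov's Theorem can be illustrated by a small Monte Carlo experiment for a constant $\lambda$. The sketch below is an assumption-laden illustration, not a proof: the function name `girsanov_moments`, the values $\lambda = 0.5$ and $T = 1$, the seed, and the number of paths are all chosen for this example. It simulates $z_T$ under $\mathbb{P}$, weights each draw by the terminal Radon-Nikodym derivative from (B.22), and estimates the mean and second moment of $z^\lambda_T = z_T + \lambda T$ under $\mathbb{Q}^\lambda$ via the identity $\mathrm{E}^{\mathbb{Q}}[x] = \mathrm{E}^{\mathbb{P}}[\xi x]$.

```python
import math
import random


def girsanov_moments(lam, T, n_paths, seed):
    """Estimate E^P[xi_T], E^Q[z^lam_T] and E^Q[(z^lam_T)^2] for constant lam,
    using draws of z_T under P reweighted by xi_T."""
    rng = random.Random(seed)
    sum_xi = sum_first = sum_second = 0.0
    for _ in range(n_paths):
        z_T = math.sqrt(T) * rng.gauss(0.0, 1.0)           # z_T ~ N(0, T) under P
        xi_T = math.exp(-lam * z_T - 0.5 * lam * lam * T)  # (B.22) with constant lambda
        z_lam_T = z_T + lam * T                            # shifted process at time T
        sum_xi += xi_T
        sum_first += xi_T * z_lam_T
        sum_second += xi_T * z_lam_T ** 2
    return sum_xi / n_paths, sum_first / n_paths, sum_second / n_paths


mean_xi, q_mean, q_second_moment = girsanov_moments(
    lam=0.5, T=1.0, n_paths=200_000, seed=2012
)
```

Up to sampling error, `mean_xi` should be close to 1, `q_mean` close to 0, and `q_second_moment` close to T = 1, which is what a standard Brownian motion under the new measure requires at time T.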
In many financial models, the relevant change of measure is such that the distribution under $\mathbb{Q}^\lambda$ of the future value of the central processes is of the same class as under the original $\mathbb{P}$ measure, but with different moments. For example, consider the Ornstein-Uhlenbeck process
$$dx_t = (\varphi - \kappa x_t)\,dt + \beta\,dz_t$$
and perform the change of measure given by a constant $\lambda_t = \lambda$. Then the dynamics of $x$ under the measure $\mathbb{Q}^\lambda$ is given by
$$dx_t = (\hat\varphi - \kappa x_t)\,dt + \beta\,dz^\lambda_t,$$
where $\hat\varphi = \varphi - \beta\lambda$. Consequently, the future values of $x$ are normally distributed both under $\mathbb{P}$ and $\mathbb{Q}^\lambda$. From (B.14) and (B.15), we see that the variance of $x_{t'}$ (given $x_t$) is the same under $\mathbb{Q}^\lambda$ and $\mathbb{P}$, but the expected values will differ (recall that $\theta = \varphi/\kappa$, and correspondingly $\hat\theta = \hat\varphi/\kappa$):
$$\mathrm{E}^{\mathbb{P}}_t[x_{t'}] = e^{-\kappa(t'-t)} x_t + \theta \left( 1 - e^{-\kappa(t'-t)} \right),$$
$$\mathrm{E}^{\mathbb{Q}^\lambda}_t[x_{t'}] = e^{-\kappa(t'-t)} x_t + \hat\theta \left( 1 - e^{-\kappa(t'-t)} \right).$$
However, in general, a shift of probability measure may change not only some or all moments of future values, but also the distributional class.
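The effect of a constant $\lambda$ on the conditional mean of an Ornstein-Uhlenbeck process can be put into numbers. The sketch below is a hedged illustration: it assumes the process is written as $dx_t = (\varphi - \kappa x_t)\,dt + \beta\,dz_t$ with long-run level $\varphi/\kappa$, and the function name `ou_conditional_mean` and all parameter values are invented for the example.

```python
import math


def ou_conditional_mean(x_t, horizon, kappa, level):
    """Conditional mean of an Ornstein-Uhlenbeck process over the given horizon,
    with mean-reversion speed kappa and long-run level `level`."""
    decay = math.exp(-kappa * horizon)
    return decay * x_t + level * (1.0 - decay)


# Illustrative parameters (assumptions for this example only).
phi, kappa, beta, lam = 0.02, 0.5, 0.01, 0.4
x_t, horizon = 0.03, 2.0

theta = phi / kappa            # long-run level under P
phi_hat = phi - beta * lam     # drift constant after the change of measure
theta_hat = phi_hat / kappa    # long-run level under the new measure

mean_P = ou_conditional_mean(x_t, horizon, kappa, theta)
mean_Q = ou_conditional_mean(x_t, horizon, kappa, theta_hat)
```

The two means differ by $(\beta\lambda/\kappa)(1 - e^{-\kappa(t'-t)})$, so the gap grows with the horizon and levels off at $\beta\lambda/\kappa$, while the conditional variance is unaffected.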
B.11 Exercises
Exercise B.1. Suppose $x = (x_t)$ is a geometric Brownian motion, $dx_t = \mu x_t\,dt + \sigma x_t\,dz_t$. What is the dynamics of the process $y = (y_t)$ defined by $y_t = (x_t)^n$? What can you say about the distribution of future values of the $y$ process?
Exercise B.2. Let $y$ be a random variable and define a stochastic process $x = (x_t)$ by $x_t = \mathrm{E}_t[y]$. Show that $x$ is a martingale.
Exercise B.3 (Adapted from Björk (2009).) Define the process $y = (y_t)$ by $y_t = z_t^4$, where $z = (z_t)$ is a standard Brownian motion. Find the dynamics of $y$. Show that
$$y_t = 6 \int_0^t z_s^2\,ds + 4 \int_0^t z_s^3\,dz_s.$$
Show that $\mathrm{E}[y_t] \equiv \mathrm{E}[z_t^4] = 3t^2$, where $\mathrm{E}[\,\cdot\,]$ denotes the expectation given the information at time 0.
Exercise B.4 (Adapted from Björk (2009).) Define the process $y = (y_t)$ by $y_t = e^{a z_t}$, where $a$ is a constant and $z = (z_t)$ is a standard Brownian motion. Find the dynamics of $y$. Show that
$$y_t = 1 + \frac{1}{2} a^2 \int_0^t y_s\,ds + a \int_0^t y_s\,dz_s.$$
Define $m(t) = \mathrm{E}[y_t]$. Show that $m$ satisfies the ordinary differential equation
$$m'(t) = \frac{1}{2} a^2 m(t), \quad m(0) = 1.$$
Show that $m(t) = e^{a^2 t/2}$ and conclude that
$$\mathrm{E}\left[ e^{a z_t} \right] = e^{a^2 t/2}.$$
Exercise B.5. Consider the two general stochastic processes $x_1 = (x_{1t})$ and $x_2 = (x_{2t})$ defined by the dynamics
$$dx_{1t} = \mu_{1t}\,dt + \sigma_{1t}\,dz_{1t},$$
$$dx_{2t} = \mu_{2t}\,dt + \rho_t \sigma_{2t}\,dz_{1t} + \sqrt{1 - \rho_t^2}\,\sigma_{2t}\,dz_{2t},$$
where $z_1$ and $z_2$ are independent one-dimensional standard Brownian motions. Interpret $\mu_{it}$, $\sigma_{it}$, and $\rho_t$. Define the processes $y = (y_t)$ and $w = (w_t)$ by $y_t = x_{1t} x_{2t}$ and $w_t = x_{1t}/x_{2t}$. What is the dynamics of $y$ and $w$? Concretize your answer for the special case where $x_1$ and $x_2$ are geometric Brownian motions with constant correlation, i.e., $\mu_{it} = \mu_i x_{it}$, $\sigma_{it} = \sigma_i x_{it}$, and $\rho_t = \rho$ with $\mu_i$, $\sigma_i$, and $\rho$ being constants.
Exercise B.6. Find the dynamics of the process $\xi^\lambda$ defined in (B.22).
APPENDIX C
Solutions to Ordinary Differential Equations
Theorem C.1. The ordinary differential equation
$$A'(\tau) = a(\tau) - b(\tau) A(\tau), \quad A(0) = 0,$$
has the solution
$$A(\tau) = \int_0^\tau e^{-\int_u^\tau b(s)\,ds}\,a(u)\,du.$$
Theorem C.2. If $b^2 > 4ac$, then the ordinary differential equation
$$A'(\tau) = a - bA(\tau) + cA(\tau)^2, \quad A(0) = 0,$$
has the solution
$$A(\tau) = \frac{2a(e^{\nu\tau} - 1)}{(\nu + b)(e^{\nu\tau} - 1) + 2\nu},$$
where $\nu = \sqrt{b^2 - 4ac}$. Furthermore, if $c \neq 0$,
$$\int_0^\tau A(u)\,du = \frac{1}{c} \left( \frac{1}{2}(\nu + b)\tau + \ln\left[ \frac{2\nu}{(\nu + b)(e^{\nu\tau} - 1) + 2\nu} \right] \right)$$
and
$$\int_0^\tau A(u)^2\,du = \text{- ugly expression to be filled in -}.$$
In the special case in which $c = 0$, the solution is
$$A(\tau) = \frac{a}{b} \left( 1 - e^{-b\tau} \right),$$
and
$$\int_0^\tau A(u)\,du = \frac{1}{b} \left( a\tau - A(\tau) \right), \qquad \int_0^\tau A(u)^2\,du = \frac{a}{b^2} \left( a\tau - A(\tau) \right) - \frac{1}{2b} A(\tau)^2.$$
Of course, the special case $c = 0$ in Theorem C.2 can also be seen as the special case of Theorem C.1 in which $a$ and $b$ are constants.
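The closed-form expressions in Theorem C.2 can be checked numerically. The Python sketch below is an illustration under assumptions: the function names `riccati_solution`, `riccati_integral`, and `simpson`, as well as the parameter values, are hypothetical choices for this example. It compares a central finite-difference derivative of the closed form with the right-hand side of the differential equation, and compares a Simpson-rule integral with the closed-form integral.

```python
import math


def riccati_solution(tau, a, b, c):
    """Closed-form A(tau) from Theorem C.2 (requires b*b > 4*a*c)."""
    nu = math.sqrt(b * b - 4.0 * a * c)
    growth = math.exp(nu * tau) - 1.0
    return 2.0 * a * growth / ((nu + b) * growth + 2.0 * nu)


def riccati_integral(tau, a, b, c):
    """Closed-form integral of A over [0, tau], valid for c != 0."""
    nu = math.sqrt(b * b - 4.0 * a * c)
    denom = (nu + b) * (math.exp(nu * tau) - 1.0) + 2.0 * nu
    return (0.5 * (nu + b) * tau + math.log(2.0 * nu / denom)) / c


def simpson(func, lo, hi, n=1000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (hi - lo) / n
    total = func(lo) + func(hi)
    for k in range(1, n):
        total += (4.0 if k % 2 else 2.0) * func(lo + k * h)
    return total * h / 3.0


a, b, c, tau = 1.0, 0.3, -0.02, 5.0   # illustrative values with b*b > 4*a*c
step = 1e-5
A_tau = riccati_solution(tau, a, b, c)
# Left-hand side A'(tau) by central differences, right-hand side from the ODE.
lhs = (riccati_solution(tau + step, a, b, c) - riccati_solution(tau - step, a, b, c)) / (2 * step)
rhs = a - b * A_tau + c * A_tau ** 2
integral_numeric = simpson(lambda u: riccati_solution(u, a, b, c), 0.0, tau)
integral_closed = riccati_integral(tau, a, b, c)
```

Setting `c = 0.0` in `riccati_solution` reproduces the special case $(a/b)(1 - e^{-b\tau})$, since $\nu$ then equals $b$ for $b > 0$.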
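Theorem C.3. If $b^2 > 4ac$, the solution to the system of ordinary differential equations
$$A_2'(\tau) = a - bA_2(\tau) + cA_2(\tau)^2, \quad A_2(0) = 0,$$
$$A_1'(\tau) = d + fA_2(\tau) - \left( \frac{1}{2} b - cA_2(\tau) \right) A_1(\tau), \quad A_1(0) = 0$$
is given by
$$A_2(\tau) = \frac{2a(e^{\nu\tau} - 1)}{(\nu + b)(e^{\nu\tau} - 1) + 2\nu},$$
$$A_1(\tau) = \frac{d}{a} A_2(\tau) + \frac{2}{\nu} (db + 2fa) \frac{\left( e^{\nu\tau/2} - 1 \right)^2}{(\nu + b)(e^{\nu\tau} - 1) + 2\nu} = \left( \frac{d}{a} + \frac{db + 2af}{a\nu} \frac{\left( e^{\nu\tau/2} - 1 \right)^2}{e^{\nu\tau} - 1} \right) A_2(\tau),$$
where $\nu = \sqrt{b^2 - 4ac}$.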
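Proof. The expression for $A_2$ follows from Theorem C.2. From Theorem C.1 we get
$$\begin{aligned}
A_1(\tau) &= \int_0^\tau e^{-\int_u^\tau \left( \frac{b}{2} - cA_2(s) \right) ds} \left( d + fA_2(u) \right) du \\
&= \int_0^\tau e^{-\frac{b}{2}(\tau - u) + c \int_u^\tau A_2(s)\,ds} \left( d + fA_2(u) \right) du \\
&= \int_0^\tau e^{\frac{\nu}{2}(\tau - u)} \frac{(\nu + b)(e^{\nu u} - 1) + 2\nu}{(\nu + b)(e^{\nu\tau} - 1) + 2\nu} \left( d + fA_2(u) \right) du \\
&= \frac{d\,e^{\nu\tau/2}}{(\nu + b)(e^{\nu\tau} - 1) + 2\nu} \int_0^\tau \left( (\nu + b) e^{\frac{\nu}{2} u} + (\nu - b) e^{-\frac{\nu}{2} u} \right) du + \frac{2af\,e^{\nu\tau/2}}{(\nu + b)(e^{\nu\tau} - 1) + 2\nu} \int_0^\tau \left( e^{\frac{\nu}{2} u} - e^{-\frac{\nu}{2} u} \right) du \\
&= \frac{2d/\nu}{(\nu + b)(e^{\nu\tau} - 1) + 2\nu} \left( (\nu + b) e^{\nu\tau} - (\nu - b) - 2b e^{\nu\tau/2} \right) + \frac{4af/\nu}{(\nu + b)(e^{\nu\tau} - 1) + 2\nu} \left( e^{\nu\tau/2} - 1 \right)^2 \\
&= \frac{2d(e^{\nu\tau} - 1)}{(\nu + b)(e^{\nu\tau} - 1) + 2\nu} + \frac{2}{\nu} \frac{db + 2af}{(\nu + b)(e^{\nu\tau} - 1) + 2\nu} \left( e^{\nu\tau/2} - 1 \right)^2 \\
&= \frac{d}{a} A_2(\tau) + \frac{db + 2af}{a\nu} A_2(\tau) \frac{\left( e^{\nu\tau/2} - 1 \right)^2}{e^{\nu\tau} - 1} \\
&= \left( \frac{d}{a} + \frac{db + 2af}{a\nu} \frac{\left( e^{\nu\tau/2} - 1 \right)^2}{e^{\nu\tau} - 1} \right) A_2(\tau).
\end{aligned}$$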
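As a numerical cross-check of Theorem C.3, the system can be integrated with a standard fourth-order Runge-Kutta scheme and compared with the closed-form expressions. This Python sketch is illustrative only: the function names `a2_closed`, `a1_closed`, and `rk4_system` and the parameter values are assumptions made for this example, and the Runge-Kutta scheme is a generic textbook integrator rather than anything taken from the text.

```python
import math


def a2_closed(tau, a, b, c):
    """Closed-form A_2(tau)."""
    nu = math.sqrt(b * b - 4.0 * a * c)
    growth = math.exp(nu * tau) - 1.0
    return 2.0 * a * growth / ((nu + b) * growth + 2.0 * nu)


def a1_closed(tau, a, b, c, d, f):
    """Closed-form A_1(tau), written with the common denominator."""
    nu = math.sqrt(b * b - 4.0 * a * c)
    growth = math.exp(nu * tau) - 1.0
    half_sq = (math.exp(0.5 * nu * tau) - 1.0) ** 2
    denom = (nu + b) * growth + 2.0 * nu
    return 2.0 * d * growth / denom + (2.0 / nu) * (d * b + 2.0 * f * a) * half_sq / denom


def rk4_system(tau, a, b, c, d, f, n_steps=2000):
    """Integrate the two ODEs from 0 to tau with classical RK4."""

    def rhs(A1, A2):
        return (d + f * A2 - (0.5 * b - c * A2) * A1, a - b * A2 + c * A2 * A2)

    h = tau / n_steps
    A1 = A2 = 0.0
    for _ in range(n_steps):
        k1 = rhs(A1, A2)
        k2 = rhs(A1 + 0.5 * h * k1[0], A2 + 0.5 * h * k1[1])
        k3 = rhs(A1 + 0.5 * h * k2[0], A2 + 0.5 * h * k2[1])
        k4 = rhs(A1 + h * k3[0], A2 + h * k3[1])
        A1 += h * (k1[0] + 2.0 * k2[0] + 2.0 * k3[0] + k4[0]) / 6.0
        A2 += h * (k1[1] + 2.0 * k2[1] + 2.0 * k3[1] + k4[1]) / 6.0
    return A1, A2


# Illustrative parameters with b*b > 4*a*c (here nu = 0.5).
a, b, c, d, f, tau = 2.0, 0.3, -0.02, 0.5, 0.2, 5.0
A1_num, A2_num = rk4_system(tau, a, b, c, d, f)
A1_exact = a1_closed(tau, a, b, c, d, f)
A2_exact = a2_closed(tau, a, b, c)
```

Choosing `a` different from 1 makes the check sensitive to every factor of `a` in the closed-form expression for the first function.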
Bibliography
Ahn, D.-H., R. F. Dittmar, and A. R. Gallant (2002). Quadratic term structure models: Theory and evidence. Review of Financial Studies 15(1), 243-288.
Aït-Sahalia, Y. and M. Brandt (2001). Variable selection for portfolio choice. Journal of Finance 56, 1297-1351.
Akian, M., J. L. Menaldi, and A. Sulem (1996). On an investment-consumption model with transaction costs. SIAM Journal of Control and Optimization 34, 329-364.
Alexander, G. J. (1993). Short selling and efficient sets. Journal of Finance 48(4), 1497-1506.
Alexander, G. J., A. Baptista, and S. Yan (2007). Mean-variance portfolio selection with at-risk constraints and discrete distributions. Journal of Banking and Finance 31(12), 3761-3781.
Allais, M. (1953). Le comportement de l'homme rationnel devant le risque: critique des postulats et axiomes de l'école Américaine. Econometrica 21(4), 503-546.
Ameriks, J. and S. P. Zeldes (2004, September). How do household portfolio shares vary with age? Working paper, The Vanguard Group and Columbia University.
Ang, A. and G. Bekaert (2002). International asset allocation with regime shifts. Review of Financial Studies 15(4), 1137-1187.
Ang, A. and G. Bekaert (2007). Stock return predictability: Is it there? Review of Financial Studies 20(3), 651-707.
Anscombe, F. and R. Aumann (1963). A definition of subjective probability. Annals of Mathematical Statistics 34(1), 199-205.
Arrow, K. J. (1971). Essays in the Theory of Risk Bearing. North-Holland.
Bachelier, L. (1900). Théorie de la Spéculation, Volume 3 of Annales de l'École Normale Supérieure. Gauthier-Villars. English translation in Cootner (1964).
Bajeux-Besnainou, I., J. V. Jordan, and R. Portait (2001). The stock/bond ratio asset allocation puzzle: A comment. American Economic Review 91, 1170-1179.
Bakshi, G. S. and N. Kapadia (2003). Delta-hedged gains and the negative market volatility risk premium. Review of Financial Studies 16, 527-566.
Balduzzi, P. and A. M. Lynch (1999). Transaction costs and predictability: Some utility cost calculations. Journal of Financial Economics 52, 47-78.
Bansal, R. (2007). Long-run risks and financial markets. Federal Reserve Bank of St. Louis Review 89(4), 283-300.
Barber, B., R. Lehavy, M. McNichols, and B. Trueman (2001). Can investors profit from the prophets? Security analyst recommendations and stock returns. Journal of Finance 56(2), 531-563.
Barberis, N. (2000). Investing for the long run when returns are predictable. Journal of Finance 55, 225-264.
Bardhan, I. (1994). Consumption and investment under constraints. Journal of Economic Dynamics and Control 18, 909-929.
Bardhan, I. and X. Chao (1995). Martingale analysis for assets with discontinuous returns. Mathematics of Operations Research 20(1), 243-256.
Basak, S. (1997). Consumption choice and asset pricing with a non-price-taking agent. Economic Theory 10, 437-462.
Basak, S. and A. Shapiro (2001). Value-at-risk based risk management: Optimal policies and asset prices. Review of Financial Studies 14, 371-405.
Beaglehole, D. R. and M. S. Tenney (1991). General solutions of some interest rate-contingent claim pricing equations. Journal of Fixed Income 1(2), 69-83.
Bernoulli, D. (1954). Exposition of a new theory on the measurement of risk. Econometrica 22(1), 23-36. Translation of the 1738 version.
Best, M. J. and R. R. Grauer (1991). On the sensitivity of mean-variance-efficient portfolios to changes in asset means: Some analytical and computational results. Review of Financial Studies 4(2), 315-342.
Bhamra, H. S. and R. Uppal (2006). The role of risk aversion and intertemporal substitution in dynamic consumption-portfolio choice with recursive utility. Journal of Economic Dynamics and Control 30(6), 967-991.
Bick, B., H. Kraft, and C. Munk (2012). Solving constrained consumption-investment problems by simulation of artificial market strategies. Available at SSRN: http://ssrn.com/abstract=1357339. Management Science, forthcoming.
Björk, T. (2009). Arbitrage Theory in Continuous Time (Third ed.). Oxford University Press.
Bodie, Z. (2003). Thoughts on the future: Life-cycle investing in theory and practice. Financial Analysts Journal 59(1), 24-29.
Bodie, Z. and D. B. Crane (1997, May). Personal investing: Advice, theory, and evidence from a survey of TIAA-CREF participants. Working paper, Boston University and Harvard Business School.
Bodie, Z., R. C. Merton, and W. F. Samuelson (1992). Labor supply flexibility and portfolio choice in a life cycle model. Journal of Economic Dynamics and Control 16(3-4), 427-449.
Box, G. E. P. and M. E. Muller (1958). A note on the generation of random normal deviates. The Annals of Mathematical Statistics 29(2), 610-611.
Brandt, M. (1999). Estimating portfolio and consumption choice: A conditional Euler equations approach. Journal of Finance 54, 1609-1645.
Brandt, M. W., A. Goyal, P. Santa-Clara, and J. R. Stroud (2005). A simulation approach to dynamic portfolio choice with an application to learning about return predictability. Review of Financial Studies 18(3), 831-873.
Branger, N., B. Breuer, and C. Schlag (2010). Discrete-time implementation of continuous-time portfolio strategies. European Journal of Finance 16(2), 137-152.
Branger, N., L. S. Larsen, and C. Munk (2012). Robust portfolio choice with ambiguity and learning about return predictability. Journal of Banking and Finance, forthcoming.
Branger, N., C. Schlag, and E. Schneider (2008). Optimal portfolios when volatility can jump. Journal of Banking and Finance 32, 1087-1097.
Breeden, D. T. (1979). An intertemporal asset pricing model with stochastic consumption and investment opportunities. Journal of Financial Economics 7(3), 265-296.
Brennan, M. J. (1998). The role of learning in dynamic portfolio decisions. European Finance Review 1(3), 295-306.
Brennan, M. J., E. S. Schwartz, and R. Lagnado (1997). Strategic asset allocation. Journal of Economic Dynamics and Control 21(8-9), 1377-1403.
Brennan, M. J. and Y. Xia (2000). Stochastic interest rates and the bond-stock mix. European Finance Review 4(2), 197-210.
Brennan, M. J. and Y. Xia (2002). Dynamic asset allocation under inflation. Journal of Finance 57(3), 1201-1238.
Browne, S. (1999). Beating a moving target: Optimal portfolio strategies for outperforming a stochastic benchmark. Finance and Stochastics 3, 275-294.
Browning, M. (1991). A simple nonadditive preference structure for models of household behavior over time. Journal of Political Economy 99(3), 607-637.
Browning, M. and T. Crossley (2001). The life-cycle model of consumption and saving. Journal of Economic Perspectives 15, 3-22.
Brueckner, J. K. (1997). Consumption and investment motives and the portfolio choices of homeowners. Journal of Real Estate Finance and Economics 15(2), 159-180.
Brunnermeier, M. K. and S. Nagel (2008). Do wealth fluctuations generate time-varying risk aversion? Micro-evidence on individuals' asset allocation. American Economic Review 98(3), 713-736.
Bullard, J. and J. Feigenbaum (2007). A leisurely reading of the life-cycle consumption data. Journal of Monetary Economics 54(8), 2305-2320.
Buraschi, A., P. Porchia, and F. Trojani (2010). Correlation risk and optimal portfolio choice. Journal of Finance 65(1), 393-420.
Calvet, L. E., J. Y. Campbell, and P. Sodini (2007). Down or out: Assessing the welfare costs of household investment mistakes. Journal of Political Economy 115, 707-747.
Campbell, J. Y. (1993, November). Understanding risk and return. NBER Working Paper 4554, NBER and Woodrow Wilson School, Princeton University, Princeton, NJ 08544, USA.
Campbell, J. Y. (1999). Asset prices, consumption, and the business cycle. In J. B. Taylor and M. Woodford (Eds.), Handbook of Macroeconomics, Volume 1. Elsevier.
Campbell, J. Y. (2006). Household finance. Journal of Finance 61(4), 1553-1604.
Campbell, J. Y. and J. F. Cocco (2003). Household risk management and optimal mortgage choice. The Quarterly Journal of Economics 118(4), 1449-1494.
Campbell, J. Y., J. F. Cocco, F. Gomes, P. J. Maenhout, and L. M. Viceira (2001). Stock market mean reversion and the optimal equity allocation of a long-lived investor. European Finance Review 5(3), 269-292.
Campbell, J. Y. and J. H. Cochrane (1999). By force of habit: A consumption-based explanation of aggregate stock market behavior. Journal of Political Economy 107(2), 205-251.
Campbell, J. Y., A. W. Lo, and A. C. MacKinlay (1997). The Econometrics of Financial Markets. Princeton University Press.
Campbell, J. Y. and L. M. Viceira (1999). Consumption and portfolio decisions when expected returns are time varying. The Quarterly Journal of Economics 114(2), 433-495.
Campbell, J. Y. and L. M. Viceira (2001). Who should buy long-term bonds? American Economic Review 91(1), 99-127.
Campbell, J. Y. and L. M. Viceira (2002). Strategic Asset Allocation. New York: Oxford University Press.
Campbell, J. Y. and T. Vuolteenaho (2004). Bad beta, good beta. American Economic Review 94(5), 1249-1275.
Carpenter, J. (2000). Does option compensation increase managerial risk appetite? Journal of Finance 55(5), 2311-2331.
Cauley, S. D., A. D. Pavlov, and E. S. Schwartz (2007). Home ownership as a constraint on asset allocation. Journal of Real Estate Finance and Economics 34(3), 283-311.
Chacko, G. and L. M. Viceira (2005). Dynamic consumption and portfolio choice with stochastic volatility in incomplete markets. Review of Financial Studies 18, 1369-1402.
Chan, Y. L. and L. Kogan (2002). Catching up with the Joneses: Heterogeneous preferences and the dynamics of asset prices. Journal of Political Economy 110(6), 1255-1285.
Chan, Y. L. and L. M. Viceira (2000, December). Asset allocation with endogenous labor income: The case of incomplete markets. Unpublished working paper, Harvard Business School and NBER.
Chellathurai, T. and T. Draviam (2007). Dynamic portfolio selection with fixed and/or proportional transaction costs using non-singular stochastic optimal control theory. Journal of Economic Dynamics and Control 31(7), 2168-2195.
Chernov, M. and E. Ghysels (2000). A study towards a unified approach to the joint estimation of objective and risk neutral measures for the purposes of options valuation. Journal of Financial Economics 56, 407-458.
Chopra, V. K. and W. T. Ziemba (1993). The effect of errors in means, variances, and covariances on optimal portfolio choice. Journal of Portfolio Management 19(2), 6-11.
Christensen, P. O., K. Larsen, and C. Munk (2012). Equilibrium in securities markets with heterogeneous investors and unspanned income risk. Journal of Economic Theory 147(3), 1035-1063.
Christiansen, C., J. S. Joensen, and J. Rangvid (2008). Are economists more likely to hold stocks? Review of Finance 12(3), 465-496.
Cicchetti, C. J. and J. A. Dubin (1994). A microeconomic analysis of risk aversion and the decision to self-insure. Journal of Political Economy 102(1), 169-186.
Cocco, J. F. (2005). Portfolio choice in the presence of housing. Review of Financial Studies 18(2), 535-567.
Cocco, J. F., F. J. Gomes, and P. J. Maenhout (2005). Consumption and portfolio choice over the life cycle. Review of Financial Studies 18(2), 491-533.
Cochrane, J. H. (1989). The sensitivity of tests of the intertemporal allocation of consumption to near-rational alternatives. American Economic Review 79(3), 319-337.
Cochrane, J. H. (2005). Asset Pricing (Revised ed.). Princeton University Press.
Collin-Dufresne, P. and R. S. Goldstein (2002). Do bonds span the fixed income markets? Theory and evidence for unspanned stochastic volatility. Journal of Finance 57(4), 1685-1730.
Constantinides, G. M. (1979). Multiperiod consumption and investment behavior with convex transactions costs. Management Science 25(11), 1127-1137.
Constantinides, G. M. (1986). Capital market equilibrium with transaction costs. Journal of Political Economy 94(4), 842-862.
Constantinides, G. M. (1990). Habit formation: A resolution of the equity premium puzzle. Journal of Political Economy 98(3), 519-543.
Constantinides, G. M., J. B. Donaldson, and R. Mehra (2002). Junior can't borrow: A new perspective on the equity premium puzzle. Quarterly Journal of Economics 117(1), 269-296.
Cooper, I. and E. Kaplanis (1994). Home bias in equity portfolios, inflation hedging, and international capital market equilibrium. Review of Financial Studies 7(1), 45-60.
Cootner, P. H. (1964). The Random Character of Stock Market Prices. MIT Press.
Corradin, S., J. L. Fillat, and C. Vergara-Alert (2010). Optimal portfolio choice with predictability in house prices and transaction costs. Working paper QAU10-2, Federal Reserve Bank of Boston.
Cover, T. (1991). Universal portfolios. Mathematical Finance 1(1), 1-29.
Cox, J. C. and C.-f. Huang (1989). Optimal consumption and portfolio policies when asset prices follow a diffusion process. Journal of Economic Theory 49, 33-83.
Cox, J. C. and C.-f. Huang (1991). A variational problem arising in financial economics. Journal of Mathematical Economics 20, 465-487.
Cox, J. C., J. E. Ingersoll, Jr., and S. A. Ross (1985). A theory of the term structure of interest
rates. Econometrica 53(2), 385407.
Cuoco, D. (1997). Optimal consumption and equilibrium prices with portfolio constraints and
stochastic income. Journal of Economic Theory 71(1), 3373.
Cuoco, D. and J. Cvitanic (1998). Optimal consumption choices for a large investor. Journal
of Economic Dynamics and Control 22, 401436.
Cuoco, D., H. He, and S. Issaenko (2002, September). Optimal dynamic trading strategies with
risk limits. Working paper.
Cuoco, D. and H. Liu (2000). Optimal consumption of a divisible durable good. Journal of
Economic Dynamics and Control 24(4), 561613.
Cuoco, D. and H. Liu (2006). An analysis of VaR-based capital requirements. Journal of Finan-
cial Intermediation 15, 362394.
Curcuru, S., J. Heaton, D. Lucas, and D. Moore (2009). Heterogeneity and portfolio choice:
Theory and evidence. In Handbook of Financial Econometrics, Volume 1, Chapter 6, pp.
337382. North-Holland.
Cvitanic, J. (1996). Optimal trading under constraints. Lecture notes, Department of Statistics,
Columbia University.
Cvitanic, J., L. Goukasian, and F. Zapatero (2003). Monte Carlo computation of optimal
portfolios in complete markets. Journal of Economic Dynamics and Control 27(6), 971–986.
Cvitanic, J. and I. Karatzas (1992). Convex duality in constrained portfolio optimization. Annals
of Applied Probability 2(4), 767–818.
Cvitanic, J. and I. Karatzas (1995). On portfolio optimization under drawdown constraints. In
M. H. A. Davis, D. Duffie, W. H. Fleming, and S. E. Shreve (Eds.), Mathematical Finance,
Volume 65 of The IMA Volumes in Mathematics and Its Applications, pp. 35–45.
Springer-Verlag.
Cvitanic, J. and I. Karatzas (1996). Hedging and portfolio optimization under transactions costs:
A martingale approach. Mathematical Finance 6, 133–165.
Dai, Q. and K. J. Singleton (2000). Specification analysis of affine term structure models. Journal
of Finance 55(5), 1943–1978.
Damgaard, A., B. Fuglsbjerg, and C. Munk (2003). Optimal consumption and investment
strategies with a perishable and an indivisible durable consumption good. Journal of Economic
Dynamics and Control 28(2), 209–253.
Danthine, J.-P. and J. B. Donaldson (2002). Intermediate Financial Theory. Prentice Hall,
Pearson Education.
Das, S. R. and R. Uppal (2004). Systemic risk and international portfolio choice. Journal of
Finance 59(6), 2809–2834.
Davidoff, T., J. R. Brown, and P. Diamond (2005). Annuities and individual welfare. American
Economic Review 95(5), 1573–1590.
Davis, M. H. A. and A. R. Norman (1990). Portfolio selection with transaction costs.
Mathematics of Operations Research 15(4), 676–713.
Davis, S. J. and P. Willen (2000, March). Using financial assets to hedge labor income risks:
Estimating the benefits. Working paper, University of Chicago and Princeton University.
de Jong, F., J. Driessen, and O. Van Hemert (2008, July). Hedging house price risk: Portfolio
choice with housing futures. Available at SSRN: http://ssrn.com/abstract=740364.
Deelstra, G., M. Grasselli, and P.-F. Koehl (2000). Optimal investment strategies in a CIR
framework. Journal of Applied Probability 37, 936–946.
Detemple, J., R. Garcia, and M. Rindisbacher (2003). A Monte-Carlo method for optimal
portfolios. Journal of Finance 58(1), 401–446.
Detemple, J., R. Garcia, and M. Rindisbacher (2005). Intertemporal asset allocation: A
comparison of methods. Journal of Banking and Finance 29(11), 2821–2848.
Detemple, J. and I. Karatzas (2003). Non-addictive habits: Optimal consumption-portfolio
policies. Journal of Economic Theory 113, 265–285.
Detemple, J. and M. Rindisbacher (2010). Dynamic asset allocation: Portfolio decomposition
formula and applications. Review of Financial Studies 23(1), 25–100.
Detemple, J. B. and C. I. Giannikos (1996). Asset and commodity prices with multi-attribute
durable goods. Journal of Economic Dynamics and Control 20(8), 1451–1504.
Detemple, J. B. and F. Zapatero (1991). Asset prices in an exchange economy with habit
formation. Econometrica 59(6), 1633–1658.
Detemple, J. B. and F. Zapatero (1992). Optimal consumption-portfolio policies with habit
formation. Mathematical Finance 2(4), 251–274.
Dimson, E., P. Marsh, and M. Staunton (2002). Triumph of the Optimists: 101 Years of Global
Investment Returns. Princeton, NJ: Princeton University Press.
Dothan, M. U. (1990). Prices in Financial Markets. Oxford University Press.
Duffie, D. (2001). Dynamic Asset Pricing Theory (Third ed.). Princeton University Press.
Duffie, D. and L. G. Epstein (1992). Stochastic differential utility. Econometrica 60(2), 353–394.
Duffie, D., W. Fleming, H. M. Soner, and T. Zariphopoulou (1997). Hedging in incomplete
markets with HARA utility. Journal of Economic Dynamics and Control 21(4–5), 753–782.
Duffie, D. and M. O. Jackson (1990). Optimal hedging and equilibrium in a dynamic futures
market. Journal of Economic Dynamics and Control 14(1), 21–33.
Duffie, D. and R. Kan (1996). A yield-factor model of interest rates. Mathematical Finance 6(4),
379–406.
Duffie, D. and P.-L. Lions (1992). PDE solutions of stochastic differential utility. Journal of
Mathematical Economics 21, 577–606.
Duffie, D. and T.-s. Sun (1990). Transactions costs and portfolio choice in a
discrete-continuous-time setting. Journal of Economic Dynamics and Control 14, 35–51.
Duffie, D. and T. Zariphopoulou (1993). Optimal investment with undiversifiable income risk.
Mathematical Finance 3(2), 135–148.
Dumas, B. and E. Luciano (1991). An exact solution to a dynamic portfolio choice problem
under transaction costs. Journal of Finance 46(2), 577–595.
Dynan, K. E. (2000). Habit formation in consumer preferences: Evidence from panel data.
American Economic Review 90(3), 391–406.
El Karoui, N. and M. Jeanblanc-Picque (1998). Optimization of consumption with labor income.
Finance and Stochastics 2(4), 409–440.
Elton, E. J. and M. J. Gruber (2000). The rationality of asset allocation recommendations.
Journal of Financial and Quantitative Analysis 35(1), 27–41.
Elton, E. J., M. J. Gruber, and M. D. Padberg (1976). Simple criteria for optimal portfolio
selection. Journal of Finance 31(5), 1341–1357.
Epstein, L. G. and S. E. Zin (1989). Substitution, risk aversion, and the temporal behavior of
consumption and asset returns: A theoretical framework. Econometrica 57(4), 937–969.
Epstein, L. G. and S. E. Zin (1991). Substitution, risk aversion, and the temporal behavior of
consumption and asset returns: An empirical analysis. Journal of Political Economy 99(2),
263–286.
Fama, E. F. (1970). Multiperiod consumption-investment decisions. American Economic
Review 60(1), 163–174. Correction: Fama (1976).
Fama, E. F. (1976). Multiperiod consumption-investment decisions: A correction. American
Economic Review 66(4), 723–724.
Fama, E. F. and K. R. French (1989). Business conditions and expected returns on stocks and
bonds. Journal of Financial Economics 25(1), 23–49.
Fama, E. F. and K. R. French (1992). The cross-section of expected stock returns. Journal of
Finance 47(2), 427–465.
Fama, E. F. and K. R. French (2007). The anatomy of value and growth stock returns. Financial
Analysts Journal 63(6), 44–54.
Feigenbaum, J. (2008). Can mortality risk explain the consumption hump? Journal of
Macroeconomics 30(3), 844–872.
Fishburn, P. (1970). Utility Theory for Decision Making. John Wiley and Sons.
Fitzpatrick, B. G. and W. H. Fleming (1991). Numerical methods for an optimal
investment-consumption model. Mathematics of Operations Research 16(4), 823–841.
Flavin, M. and T. Yamashita (2002). Owner-occupied housing and the composition of the
household portfolio. American Economic Review 91(1), 345–362.
Fleming, W. H. and H. M. Soner (1993). Controlled Markov Processes and Viscosity Solutions,
Volume 25 of Applications of Mathematics. New York: Springer-Verlag.
Fleming, W. H. and T. Zariphopoulou (1991). An optimal investment/consumption model with
borrowing. Mathematics of Operations Research 16(4), 802–822.
Framstad, N. C., B. Øksendal, and A. Sulem (2001). Optimal consumption and portfolio in a
jump diffusion market with proportional transaction costs. Journal of Mathematical
Economics 35, 233–257.
French, K. R. and J. M. Poterba (1991). Investor diversification and international equity markets.
American Economic Review 81(2), 222–226.
Friend, I. and M. E. Blume (1975). The demand for risky assets. American Economic
Review 65(5), 900–922.
Garlappi, L., R. Uppal, and T. Wang (2007). Portfolio selection with parameter and model
uncertainty: A multi-prior approach. Review of Financial Studies 20(1), 41–81.
Gennotte, G. (1986). Optimal portfolio choice under incomplete information. Journal of
Finance 41, 733–746.
Gennotte, G. and A. Jung (1994). Investment strategies under transaction costs: The finite
horizon case. Management Science 40(3), 385–404.
Gollier, C. (2001). The Economics of Risk and Time. MIT Press.
Gomes, F. (2007). Exploiting short-run predictability. Journal of Banking and Finance 31,
1427–1440.
Gomes, F. and A. Michaelides (2003). Portfolio choice with internal habit formation: A life-cycle
model with uninsurable labor income risk. Review of Economic Dynamics 6(4), 729–766.
Gomes, F. and A. Michaelides (2005). Optimal life-cycle asset allocation: Understanding the
empirical evidence. Journal of Finance 60(2), 869–904.
Gourinchas, P.-O. and J. A. Parker (2002). Consumption over the life cycle. Econometrica 70(1),
47–89.
Grasselli, M. (2000, April). HJB equations with stochastic interest rates and HARA utility
functions. Working paper, CREST, Malakoff Cedex, France.
Grether, D. M. and C. R. Plott (1979). Economic theory of choice and the preference reversal
phenomenon. American Economic Review 69(4), 623–638.
Grossman, S. J. and G. Laroque (1990). Asset pricing and optimal portfolio choice in the presence
of illiquid durable consumption goods. Econometrica 58(1), 25–51.
Grossman, S. J. and J.-L. Vila (1991). Optimal dynamic trading with leverage constraints.
Journal of Financial and Quantitative Analysis 27(2), 151–168.
Grossman, S. J. and Z. Zhou (1993). Optimal investment strategies for controlling drawdowns.
Mathematical Finance 3(3), 241–276.
Grubel, H. G. (1968). Internationally diversified portfolios: Welfare gains and capital flows.
American Economic Review 58, 1299–1314.
Hakansson, N. H. (1970). Optimal investment and consumption strategies under risk for a class
of utility functions. Econometrica 38(5), 587–607.
Hansen, G. D. and S. İmrohoroğlu (2008). Consumption over the life cycle: The role of annuities.
Review of Economic Dynamics 11(3), 566–583.
Hansen, L. P. and S. F. Richard (1987). The role of conditioning information in deducing testable
restrictions implied by dynamic asset pricing models. Econometrica 55(3), 587–614.
Haugh, M. B., L. Kogan, and J. Wang (2006). Evaluating portfolio policies: A duality approach.
Operations Research 54(3), 405–418.
He, H. and H. F. Pagès (1993). Labor income, borrowing constraints, and equilibrium asset
prices. Economic Theory 3, 663–696.
He, H. and N. D. Pearson (1991). Consumption and portfolio policies with incomplete markets
and short-sale constraints: The infinite dimensional case. Journal of Economic Theory 54,
259–304.
Heath, D., R. Jarrow, and A. Morton (1992). Bond pricing and the term structure of interest
rates: A new methodology for contingent claims valuation. Econometrica 60(1), 77–105.
Heaton, J. and D. Lucas (2000). Portfolio choice and asset prices: The importance of
entrepreneurial risk. Journal of Finance 55(3), 1163–1198.
Heidari, M. and L. Wu (2003). Are interest rate derivatives spanned by the term structure of
interest rates? Journal of Fixed Income 13(1), 75–86.
Henderson, V. (2005). Explicit solutions to an optimal portfolio choice problem with stochastic
income. Journal of Economic Dynamics and Control 29(7), 1237–1266.
Heston, S. L. (1993). A closed-form solution for options with stochastic volatility with
applications to bond and currency options. Review of Financial Studies 6(2), 327–343.
Hindy, A. and C.-f. Huang (1993). Optimal consumption and portfolio rules with durability and
local substitution. Econometrica 61(1), 85–121.
Hindy, A., C.-f. Huang, and H. Zhu (1997). Optimal consumption and portfolio rules with
durability and habit formation. Journal of Economic Dynamics and Control 21(2–3),
525–550.
Huang, C.-f. and R. H. Litzenberger (1988). Foundations for Financial Economics. Prentice-Hall.
Hull, J. and A. White (1994). Numerical procedures for implementing term structure models II:
Two-factor models. Journal of Derivatives 2(2), 37–48.
Hull, J. C. (2009). Options, Futures, and Other Derivatives (7th ed.). Prentice-Hall, Inc.
Ibbotson, R. G. and P. Chen (2003). Long-run stock returns. Financial Analysts Journal 59(1),
88–98.
Ingersoll, Jr., J. E. (1987). Theory of Financial Decision Making. Lanham, MD: Rowman &
Littlefield.
Ingersoll, Jr., J. E. (1992). Optimal consumption and portfolio rules with intertemporally
dependent utility of consumption. Journal of Economic Dynamics and Control 16, 681–712.
Inkmann, J., P. Lopes, and A. Michaelides (2011). How deep is the annuity market participation
puzzle? Review of Financial Studies 24(1), 279–319.
Jagannathan, R. and N. R. Kocherlakota (1996). Why should older people invest less in stocks
than younger people? Federal Reserve Bank of Minneapolis Quarterly Review 20(3), 11–23.
Jamshidian, F. (1992). Asymptotically optimal portfolios. Mathematical Finance 2(2), 131–150.
Jarrow, R. A., H. Li, and F. Zhao (2007). Interest rate caps smile too! But can the LIBOR
market models capture the smile? Journal of Finance 62(1), 345–382.
Jeanblanc-Picque, M. and M. Pontier (1990). Optimal portfolio for a small investor in a market
model with discontinuous prices. Applied Mathematics and Optimization 22, 287–310.
Jegadeesh, N. and W. Kim (2006). Value of analyst recommendations: International evidence.
Journal of Financial Markets 9(3), 274–309.
Jurek, J. W. and L. M. Viceira (2011). Optimal value and growth tilts in long-horizon portfolios.
Review of Finance 15(1), 29–74.
Karatzas, I., J. P. Lehoczky, S. P. Sethi, and S. E. Shreve (1986). Explicit solution of a general
consumption/investment problem. Mathematics of Operations Research 11(2), 261–294.
Karatzas, I., J. P. Lehoczky, and S. E. Shreve (1987). Optimal portfolio and consumption
decisions for a small investor on a finite horizon. SIAM Journal on Control and
Optimization 25(6), 1557–1586.
Karatzas, I., J. P. Lehoczky, S. E. Shreve, and G.-L. Xu (1991). Martingale and duality methods
for utility maximization in an incomplete market. SIAM Journal on Control and
Optimization 29(3), 702–730.
Karatzas, I. and S. E. Shreve (1988). Brownian Motion and Stochastic Calculus, Volume 113 of
Graduate Texts in Mathematics. New York: Springer-Verlag.
Karatzas, I. and S. E. Shreve (1998). Methods of Mathematical Finance, Volume 39 of
Applications of Mathematics. New York: Springer-Verlag.
Karatzas, I. and X.-X. Xue (1991). A note on utility maximization under partial observations.
Mathematical Finance 1(2), 57–70.
Karlin, S. and H. M. Taylor (1981). A Second Course in Stochastic Processes. Academic Press,
Inc.
Kim, T. S. and E. Omberg (1996). Dynamic nonmyopic portfolio behavior. Review of Financial
Studies 9(1), 141–161.
Koijen, R. S. J., T. E. Nijman, and B. J. M. Werker (2007, August). Appendix describing the
numerical method used in "When can life-cycle investors benefit from time-varying bond risk
premia?". Available at SSRN: http://ssrn.com/abstract=945720.
Koijen, R. S. J., T. E. Nijman, and B. J. M. Werker (2010). When can life-cycle investors benefit
from time-varying bond risk premia? Review of Financial Studies 23(2), 741–780.
Koijen, R. S. J., J. C. Rodriguez, and A. Sbuelz (2009). Momentum and mean reversion in
strategic asset allocation. Management Science 55(7), 1199–1213.
Koo, H. K. (1995). Consumption and portfolio selection with labor income: Evaluation of human
capital. Working paper, Olin School of Business, Washington University.
Koo, H. K. (1998). Consumption and portfolio selection with labor income: A continuous time
approach. Mathematical Finance 8(1), 49–65.
Korn, R. (1997). Optimal Portfolios. World Scientific.
Korn, R. and H. Kraft (2001). A stochastic control approach to portfolio problems with stochastic
interest rates. SIAM Journal on Control and Optimization 40(4), 1250–1269.
Kraft, H. (2005). Optimal portfolios and Heston's stochastic volatility model. Quantitative
Finance 5, 303–313.
Kraft, H. (2009). Optimal portfolios with stochastic short rate: Pitfalls when the short rate is
non-Gaussian or the market price of risk is unbounded. International Journal of Theoretical
and Applied Finance 12(6), 767–796.
Kraft, H. and C. Munk (2011). Optimal housing, consumption, and investment decisions over
the life-cycle. Management Science 57(6), 1025–1041.
Kraft, H. and F. T. Seifried (2010). Foundations of continuous-time recursive utility:
Differentiability and normalization of certainty equivalents. Mathematics and Financial
Economics 3(3–4), 115–138.
Kraft, H. and M. Steffensen (2008). Optimal consumption and insurance: A continuous-time
Markov chain approach. ASTIN Bulletin 38(1), 231–257.
Kreps, D. M. (1990). A Course in Microeconomic Theory. Harvester Wheatsheaf.
Kreps, D. M. and E. Porteus (1978). Temporal resolution of uncertainty and dynamic choice
theory. Econometrica 46, 185–200.
Larsen, L. S. (2010). Optimal investment strategies in an international economy with stochastic
interest rates. International Review of Economics & Finance 19(1), 145–165.
Larsen, L. S. and C. Munk (2012). The costs of suboptimal dynamic asset allocation: General
results and applications to interest rate risk, stock volatility risk, and growth/value tilts.
Journal of Economic Dynamics and Control 36(2), 266–293.
Lehoczky, J. P., S. P. Sethi, and S. E. Shreve (1983). Optimal consumption and investment
policies allowing consumption constraints and bankruptcy. Mathematics of Operations
Research 8(4), 613–636.
Leippold, M. and L. Wu (2003). Estimation and design of quadratic term structure models.
Review of Finance 7(1), 47–73.
Li, H. and F. Zhao (2006). Unspanned stochastic volatility: Evidence from hedging interest rate
derivatives. Journal of Finance 61(1), 341–378.
Li, W. and R. Yao (2007). The life-cycle effects of house price changes. Journal of Money, Credit
and Banking 39(6), 1375–1409.
Lintner, J. (1965). The valuation of risky assets and the selection of risky investment in stock
portfolios and capital budgets. Review of Economics and Statistics 47(1), 13–37.
Lioui, A. and P. Poncet (2003). International asset allocation: A new perspective. Journal of
Banking and Finance 27, 2203–2230.
Liu, H. (2004). Optimal consumption and investment with transaction costs and multiple risky
assets. Journal of Finance 59(1), 289–338.
Liu, H. and M. Loewenstein (2002). Optimal portfolio selection with transaction costs and finite
horizons. Review of Financial Studies 14(3), 805–835.
Liu, J. (1999, August). Portfolio selection in stochastic environments. Working paper, Stanford
University.
Liu, J. (2007). Portfolio selection in stochastic environments. Review of Financial Studies 20(1),
1–39.
Liu, J., F. A. Longstaff, and J. Pan (2003). Dynamic asset allocation with event risk. Journal
of Finance 58(1), 231–259.
Liu, J. and J. Pan (2003). Dynamic derivative strategies. Journal of Financial Economics 69(3),
401–430.
Lustig, H. N. and S. G. van Nieuwerburgh (2005). Housing collateral, consumption insurance,
and risk premia: An empirical perspective. Journal of Finance 60(3), 1167–1219.
Lynch, A. W. (2001). Portfolio choice and equity characteristics: Characterizing the hedging
demands introduced by return predictability. Journal of Financial Economics 62(1),
67–130.
Lynch, A. W. and S. Tan (2010). Multiple risky assets, transaction costs and return
predictability: Allocation rules and implications for U.S. investors. Journal of Financial and
Quantitative Analysis 45(4), 1015–1053.
Lynch, A. W. and S. Tan (2011). Labor income dynamics at business-cycle frequencies:
Implications for portfolio choice. Journal of Financial Economics 101(2), 333–359.
Maenhout, P. (2004). Robust portfolio rules and asset pricing. Review of Financial Studies 17(4),
951–983.
Magill, M. J. P. and G. M. Constantinides (1976). Portfolio selection with transactions costs.
Journal of Economic Theory 13, 245–263.
Malmendier, U. and D. Shanthikumar (2007). Are small investors naive about incentives?
Journal of Financial Economics 85, 457–489.
Markowitz, H. (1952). Portfolio selection. Journal of Finance 7(1), 77–91.
Markowitz, H. (1959). Portfolio Selection: Efficient Diversification of Investment. Wiley.
Mehra, R. (2003). The equity premium: Why is it a puzzle? Financial Analysts Journal 59(1),
54–69.
Mehra, R. and E. C. Prescott (1985). The equity premium: A puzzle. Journal of Monetary
Economics 15(2), 145–162.
Merton, R. C. (1969). Lifetime portfolio selection under uncertainty: The continuous-time case.
Review of Economics and Statistics 51(3), 247–257. Reprinted as Chapter 4 in Merton (1992).
Merton, R. C. (1971). Optimum consumption and portfolio rules in a continuous-time model.
Journal of Economic Theory 3(4), 373–413. Erratum: Merton (1973a). Reprinted as
Chapter 5 in Merton (1992).
Merton, R. C. (1972). An analytic derivation of the efficient portfolio frontier. Journal of
Financial and Quantitative Analysis 7, 1851–1872.
Merton, R. C. (1973a). Erratum. Journal of Economic Theory 6(2), 213–214.
Merton, R. C. (1973b). An intertemporal capital asset pricing model. Econometrica 41(5),
867–887. Reprinted in an extended form as Chapter 15 in Merton (1992).
Merton, R. C. (1980). On estimating the expected return on the market: An exploratory
investigation. Journal of Financial Economics 8, 323–361.
Merton, R. C. (1992). Continuous-Time Finance. Padstow, UK: Basil Blackwell Inc.
Merton, R. C. (2003). Thoughts on the future: Theory and practice in investment management.
Financial Analysts Journal 59(1), 17–23.
Mossin, J. (1966). Equilibrium in a capital asset market. Econometrica 34(4), 768–783.
Munk, C. (1997a). Numerical methods for continuous-time, continuous-state stochastic control
problems. Publications from Department of Management 97/11, Odense University.
Munk, C. (1997b). Optimal Consumption-Portfolio Policies and Contingent Claims Pricing and
Hedging in Incomplete Markets. Ph. D. thesis, Odense University, DK-5230 Odense M,
Denmark.
Munk, C. (2000). Optimal consumption-investment policies with undiversifiable income risk and
liquidity constraints. Journal of Economic Dynamics and Control 24(9), 1315–1343.
Munk, C. (2003). Numerical methods for continuous-time, continuous-state stochastic
control problems: Experiences from Merton's problem. Applied Mathematics and
Computation 136(1), 47–77.
Munk, C. (2008). Portfolio and consumption choice with stochastic investment opportunities
and habit formation in preferences. Journal of Economic Dynamics and Control 32(11),
3560–3589.
Munk, C. (2011). Fixed Income Modelling. Oxford University Press.
Munk, C. (2012, January 30). Financial asset pricing theory. Lecture notes, Aarhus University.
To be published by Oxford University Press.
Munk, C. and C. Sørensen (2004). Optimal consumption and investment strategies with
stochastic interest rates. Journal of Banking and Finance 28(8), 1987–2013.
Munk, C. and C. Sørensen (2007). Optimal real consumption and investment strategies in
dynamic stochastic economies. In B. S. Jensen and T. Palokangas (Eds.), Stochastic Economic
Dynamics, Chapter 9, pp. 271–316. CBS Press.
Munk, C. and C. Sørensen (2010). Dynamic asset allocation with stochastic income and interest
rates. Journal of Financial Economics 96(3), 433–462.
Munk, C., C. Sørensen, and T. N. Vinther (2004). Dynamic asset allocation under mean-reverting
returns, stochastic interest rates and inflation uncertainty. International Review of Economics
and Finance 13(2), 141–166.
Muthuraman, K. and S. Kumar (2006). Multidimensional portfolio optimization with
proportional transaction costs. Mathematical Finance 16(2), 301–335.
Nielsen, L. T. and M. Vassalou (2006). The instantaneous capital market line. Economic
Theory 28(3), 651–664.
Ogaki, M. and Q. Zhang (2001). Decreasing relative risk aversion and tests of risk sharing.
Econometrica 69(2), 515–526.
Øksendal, B. (2003). Stochastic Differential Equations (Sixth ed.). Springer-Verlag.
Øksendal, B. and A. Sulem (2002). Optimal consumption and portfolio with fixed and
proportional transaction costs. SIAM Journal of Control & Optimization 40, 1765–1790.
Pastor, L. and R. F. Stambaugh (2012). Are stocks really less volatile in the long run? Journal
of Finance 67, 431–478.
Piazzesi, M., M. Schneider, and S. Tuzel (2007). Housing, consumption, and asset pricing.
Journal of Financial Economics 83(3), 531–569.
Pindyck, R. S. (1988). Risk aversion and the determinants of stock market behavior. Review of
Economic Studies 70(2), 183–190.
Pliska, S. R. (1986). A stochastic calculus model of continuous trading: Optimal portfolios.
Mathematics of Operations Research 11(2), 371–382.
Poterba, J. M. and L. H. Summers (1988). Mean reversion in stock prices: Evidence and
implications. Journal of Financial Economics 22, 27–59.
Pratt, J. (1964). Risk aversion in the small and the large. Econometrica 32(1–2), 122–136.
Presman, E. and S. Sethi (1996). Distribution of bankruptcy time in a consumption/investment
problem. Journal of Economic Dynamics and Control 20, 471–477.
Press, W. H., S. A. Teukolsky, W. T. Vetterling, and B. P. Flannery (2007). Numerical Recipes:
The Art of Scientific Computing (Third ed.). Cambridge University Press.
Quinn, J. B. (1997). Making the Most of Your Money (Second ed.). Simon & Schuster.
Ravina, E. (2007, November). Habit persistence and "Keeping up with the Joneses": Evidence
from micro data. Available at SSRN: http://ssrn.com/abstract=928248.
Richard, S. F. (1975). Optimal consumption, portfolio and life insurance rules for an uncertain
lived individual in a continuous time model. Journal of Financial Economics 2, 187–203.
Rockafellar, R. T. (1970). Convex Analysis. Princeton, New Jersey: Princeton University Press.
Rogers, L. (2001). The relaxed investor and parameter uncertainty. Finance and Stochastics 5,
131–154.
Samuelson, P. A. (1969). Lifetime portfolio selection by dynamic stochastic programming. Review
of Economics and Statistics 51(3), 239–246.
Sangvinatsos, A. and J. A. Wachter (2005). Does the failure of the expectations hypothesis
matter for long-term investors? Journal of Finance 60(1), 179–230.
Savage, L. J. (1954). The Foundations of Statistics. Wiley.
Schroder, M. and C. Skiadas (1999). Optimal consumption and portfolio selection with stochastic
differential utility. Journal of Economic Theory 89, 68–126.
Schroder, M. and C. Skiadas (2002). An isomorphism between asset pricing models with and
without linear habit formation. Review of Financial Studies 15(4), 1189–1221.
Sethi, S. P. and M. I. Taksar (1988). A note on Merton's "Optimum consumption and portfolio
rules in a continuous-time model". Journal of Economic Theory 46, 395–401.
Sethi, S. P., M. I. Taksar, and E. L. Presman (1992). Explicit solution of a general
consumption/portfolio problem with subsistence consumption and bankruptcy. Journal of
Economic Dynamics and Control 16, 747–768.
Seydel, R. U. (2009). Tools for Computational Finance (4 ed.). Springer Verlag.
Sharpe, W. (1964). Capital asset prices: A theory of market equilibrium under conditions of
risk. Journal of Finance 19(3), 425–442.
Shiller, R. J. (2000). Irrational Exuberance. Princeton, NJ: Princeton University Press.
Shirakawa, H. (1994). Optimal consumption and portfolio selection with incomplete markets and
upper and lower bound constraints. Mathematical Finance 4(1), 1–24.
Shreve, S. E. and H. M. Soner (1994). Optimal investment and consumption with transaction
costs. Annals of Applied Probability 4(3), 609–692.
Siegel, J. J. (2002). Stocks for the Long Run (Third ed.). McGraw Hill.
Sinai, T. and N. S. Souleles (2005). Owner-occupied housing as a hedge against rent risk. The
Quarterly Journal of Economics 120(2), 763–789.
Skiadas, C. (1998). Recursive utility and preferences for information. Economic Theory 12,
293–312.
Sørensen, C. (1999). Dynamic asset allocation and fixed income management. Journal of
Financial and Quantitative Analysis 34(4), 513–531.
Sørensen, C. (2007, February). Interest rate uncertainty and strategic asset allocation with
borrowing and short sales constraints. Available at SSRN:
http://ssrn.com/abstract=966207.
Steffensen, M. (2004). On Merton's problem for life insurers. ASTIN Bulletin 34(1), 5–25.
Sundaresan, S. M. (1989). Intertemporally dependent preferences and the volatility of
consumption and wealth. Review of Financial Studies 2(1), 73–89.
Svensson, L. E. O. and I. M. Werner (1993). Nontraded assets in incomplete markets. European
Economic Review 37(5), 1149–1168.
Szpiro, G. G. (1986). Measuring risk aversion: An alternative approach. Review of Economic
Studies 68(1), 156–159.
Taksar, M., M. J. Klass, and D. Assaf (1988). A diffusion model for optimal portfolio selection
in the presence of brokerage fees. Mathematics of Operations Research 13(2), 277–294.
Tavella, D. and C. Randall (2000). Pricing Financial Instruments: The Finite Difference Method.
Wiley.
Tepla, L. (2000). Optimal hedging and valuation of nontraded assets. European Finance
Review 4(3), 231–251.
Tepla, L. (2001). Optimal investment with minimum performance constraints. Journal of
Economic Dynamics and Control 25(10), 1629–1645.
Thomas, J. W. (1995). Numerical Partial Differential Equations, Volume 22 of Texts in Applied
Mathematics. Springer-Verlag.
Thurow, L. (1969). The optimum lifetime distribution of consumption expenditures. American
Economic Review 59(3), 324–330.
Trolle, A. B. (2009, September). The price of interest rate variance risk and optimal investments
in interest rate derivatives. Available at SSRN: http://ssrn.com/abstract=1342331.
Trolle, A. B. and E. S. Schwartz (2009). A general stochastic volatility model for the pricing of
interest rate derivatives. Review of Financial Studies 22(5), 2007–2057.
van Binsbergen, J. H. and M. W. Brandt (2007). Solving dynamic portfolio choice problems
by recursing on optimized portfolio weights or on the value function? Computational
Economics 29(3–4), 355–367.
Van Hemert, O. (2010). Household interest rate risk management. Real Estate Economics 38(3),
467–505.
Vasicek, O. (1977). An equilibrium characterization of the term structure. Journal of Financial
Economics 5(2), 177–188.
Viceira, L. M. (2001). Optimal portfolio choice for long-horizon investors with nontradable labor
income. Journal of Finance 56(2), 433–470.
Vissing-Jørgensen, A. (2002, March). Towards an explanation of household portfolio choice
heterogeneity: Nonfinancial income and participation cost structures. Available at SSRN:
http://ssrn.com/abstract=307121.
Vissing-Jørgensen, A. and O. P. Attanasio (2003). Stock market participation, intertemporal
substitution, and risk-aversion. American Economic Review 93(2), 383–391.
von Neumann, J. and O. Morgenstern (1944). Theory of Games and Economic Behavior. New
Jersey: Princeton University Press.
Wachter, J. A. (2002). Portfolio and consumption decisions under mean-reverting returns: An
exact solution for complete markets. Journal of Financial and Quantitative Analysis 37(1),
63–91.
Wachter, J. A. (2003). Risk aversion and allocation to long-term bonds. Journal of Economic
Theory 112, 325–333.
Wachter, J. A. (2006). A consumption-based model of the term structure of interest rates.
Journal of Financial Economics 79(2), 365–399.
Wachter, J. A. and M. Warusawitharana (2009). Predictable returns and asset allocation: Should
a skeptical investor time the market? Journal of Econometrics 148(2), 162–178.
Wachter, J. A. and M. Yogo (2010). Why do household portfolio shares rise in wealth? Review
of Financial Studies 23(11), 3929–3965.
Wang, N. (2004). Precautionary saving and partially observed income. Journal of Monetary
Economics 51(8), 1645–1681.
Wang, N. (2006). Generalizing the permanent-income hypothesis: Revisiting Friedman's
conjecture on consumption. Journal of Monetary Economics 53(4), 737–752.
Wang, N. (2009). Optimal consumption and asset allocation with unknown income growth.
Journal of Monetary Economics 56(4), 524–534.
Weil, P. (1989). The equity premium puzzle and the risk-free rate puzzle. Journal of Monetary
Economics 24(3), 401–421.
Welch, I. (2000). Views of financial economists on the equity premium and other issues. Journal
of Business 73(4), 501–537.
Wilmott, P. (1998). Derivatives: The Theory and Practice of Financial Engineering. Oxford
University Press.
Wilmott, P., J. Dewynne, and S. Howison (1993). Option Pricing: Mathematical Models and
Computation. Oxford Financial Press.
Wu, L. (2003). Jumps and dynamic asset allocation. Review of Quantitative Finance and
Accounting 20(3), 207–243.
Xu, G.-L. and S. E. Shreve (1992a). A duality method for optimal consumption and investment
under short-selling prohibition. I. General market coefficients. Annals of Applied
Probability 2(1), 87–112.
Xu, G.-L. and S. E. Shreve (1992b). A duality method for optimal consumption and investment
under short-selling prohibition. II. Constant market coefficients. Annals of Applied
Probability 2(2), 314–328.
Yang, F. (2009). Consumption over the life cycle: How different is housing? Review of Economic
Dynamics 12(3), 423–443.
Yao, R. and H. H. Zhang (2005a). Optimal consumption and portfolio choices with risky housing
and borrowing constraints. Review of Financial Studies 18(1), 197–239.
Yao, R. and H. H. Zhang (2005b, November). Optimal life-cycle asset allocation with housing
as collateral. Working paper, Baruch College and University of Texas at Dallas.
Yogo, M. (2006). A consumption-based explanation of expected stock returns. Journal of
Finance 61(2), 539–580.
Zariphopoulou, T. (1992). Investment-consumption models with transaction fees and
Markov-chain parameters. SIAM Journal on Control and Optimization 30(3), 613–636.
Zariphopoulou, T. (1994). Consumption-investment models with constraints. SIAM Journal on
Control and Optimization 32(1), 59–85.
