Optimal Portfolios and Pricing of Financial Derivatives Under Proportional Transaction Costs

Sass, Jörn; Schäl, Manfred

doi:10.1007/978-3-319-47766-4_21


Part of the book series: International Series in Operations Research & Management Science (ISOR, volume 248)


Abstract

A utility optimization problem is studied in discrete time 0 ≤ n ≤ N for a financial market with two assets, bond and stock. These two assets can be traded under transaction costs. A portfolio (Y _n, Z _n) at time n is described by the values Y _n and Z _n of the stock account and the bank account, respectively. The choice of (Y _n, Z _n) is controlled by a policy. Under concavity and homogeneity assumptions on the utility function U, the optimal policy has a simple cone structure. The final portfolio (Y _N ^∗, Z _N ^∗) under the optimal policy has an important property. It can be used for the construction of a consistent price system for the underlying financial market.


1 Introduction

We will start with discrete-time utility optimization which is now a classical subject and can be treated as a Markov decision process in discrete time 0 ≤ n ≤ N. Our main goal will be an application to adequate pricing of financial derivatives, in particular options, which is an important subject of financial mathematics. A financial market is studied where two assets, bond and stock, can be traded under transaction costs. A mutual fund is a good example for the stock. Under concavity and homogeneity assumptions on the utility function U, it is known that the optimal policy has a cone structure not only for models without but also for models with linear transaction costs, see below. In the present paper we will focus on such models.

An Explanatory Model

In order to describe the application of the optimal policy from utility maximization to pricing of financial derivatives, let us first consider a simple model with only one period [0, N] (starting in 0 and finishing in N = 1) and without transaction costs. Let B _N be the value on the bank account at N if we start with one unit of money B ₀ = 1. Then B _N ⁻¹ is the classical discount factor. For fixed initial wealth x, the policy can be described by a real number θ, the investment in the stock. Then the wealth at N is X _N ^θ = (x −θ)B _N + S ₀ ⁻¹ θ S _N = B _N(x −θ + S ₀ ⁻¹ θ B _N ⁻¹ S _N), where S ₀ and S _N are the stock prices at 0 and N and S ₀ ⁻¹ θ is the invested number of stocks.

The classical present value principle for pricing future incomes is based on the expectation of discounted quantities. According to this principle, an adequate price for a contingent claim offering S _N, i.e. one unit of stock, at N would be $\mathop{\mathrm{pr}}\nolimits (S_{N}) = E[B_{N}^{-1}S_{N}]$. But this answer may be wrong, because we know in the present situation of a financial market that S ₀ is the adequate price. Starting with S ₀ one is sure to have S _N at N. But in general one has E[B _N ⁻¹ S _N] ≠ B ₀ ⁻¹ S ₀ = S ₀ and not the equality one would like to have. Note that the equality means that the discounted stock price process {B ₀ ⁻¹ S ₀, B _N ⁻¹ S _N} is a martingale. It was a great discovery for the stochastic community when one realized that martingales come into play. This is the reason for a change of measure where the original real-world probability measure P is replaced by an artificial martingale measure Q with Radon-Nikodym density q w.r.t. P. One wants to study adequate prices $\mathop{\mathrm{pr}}\nolimits (C)$ for a contingent claim C depending on the underlying financial derivative and maturing at N. In the present simple model, one has C = f(S _N) for some function f, since S _N is the only random variable. In multiperiod models, C is contingent upon the whole development of the stock up to N. After a change of measure, one considers the present value principle under Q:

$$\displaystyle{ \mathop{\mathrm{pr}}\nolimits (C) = E[q\,B_{N}^{-1}C] = E_{ Q}[B_{N}^{-1}C]\quad \mbox{ with }\quad \mathop{\mathrm{pr}}\nolimits (S_{ N}) = E_{Q}[B_{N}^{-1}S_{ N}] = S_{0}. }$$

(21.1)

Then {B ₀ ⁻¹ S ₀, B _N ⁻¹ S _N} is a martingale under Q and $\mathop{\mathrm{pr}}\nolimits (\,\cdot \,)$ is called a consistent price system because of the relation $\mathop{\mathrm{pr}}\nolimits (S_{N}) = S_{0}$. In general, however, one has several choices for a martingale measure Q, and one has to specify an additional preference in order to single out one measure Q and thus one generally agreed price. Therefore, no preference-independent pricing of financial derivatives is possible.

Construction of a Price System

Now we explain the relations to utility optimization and how to construct a martingale measure Q and thus a consistent price system by the optimal investment θ ^∗. Let us consider the portfolio optimization problem where the wealth at N is $X_{N}^{\theta } = B_{N}(x -\theta +S_{0}^{-1}\theta \,B_{N}^{-1}S_{N})$ defined as above and where we study max_θ E[U(B _N ⁻¹ X _N ^θ)]. Then we get for the optimal investment θ ^∗ by differentiating:

$$\displaystyle{E\left [U'\left (B_{N}^{-1}X_{ N}^{\theta ^{{\ast}} }\right )\left (S_{0}^{-1}B_{ N}^{-1}S_{ N} - 1\right )\right ] = 0\quad \mbox{ or }\quad E\left [c\,U'(B_{N}^{-1}X_{ N}^{\theta ^{{\ast}} })B_{N}^{-1}S_{ N}\right ] = S_{0},}$$

if the constant c is chosen such that $E[c\,U'(B_{N}^{-1}X_{N}^{\theta ^{{\ast}} })] = 1$. By a simple calculation one obtains $c = x\,E[U^{{\ast}}(B_{N}^{-1}X_{N}^{\theta ^{{\ast}} })]^{-1}$ with U ^∗(w): = U′(w)w. Now we can set $q = c\,U'(B_{N}^{-1}X_{N}^{\theta ^{{\ast}} })$ for q as above and we get

$$\displaystyle{ \mathop{\mathrm{pr}}\nolimits (C) = x\,E\left [U^{{\ast}}\left (B_{ N}^{-1}X_{ N}^{\theta ^{{\ast}} }\right )\right ]^{-1}\,E\left [U'\left (B_{ N}^{-1}X_{ N}^{\theta ^{{\ast}} }\right )B_{N}^{-1}C\right ] }$$

(21.2)

where typically x = 1. In fact we then have E[q B _N ⁻¹ S _N] = S ₀ and q thus defines a martingale measure. By a ‘marginal rate of substitution’ argument it can be shown how this price depends in a traditional way on the investor’s preference or relative risk aversion (see Davis [7], Schäl [26, Introduction]).
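The simple calculation behind the constant c can be spelled out as follows. This is a sketch added for the reader, where the shorthand $W:= B_{N}^{-1}X_{N}^{\theta ^{{\ast}} } = x +\theta ^{{\ast}}(S_{0}^{-1}B_{N}^{-1}S_{N} - 1)$ is introduced only for this note. Using the first-order condition for θ ^∗, the second term below vanishes:

$$\displaystyle\begin{array}{rcl} E[U^{{\ast}}(W)] = E[U'(W)\,W]& =& x\,E[U'(W)] +\theta ^{{\ast}}E\left [U'(W)\left (S_{0}^{-1}B_{N}^{-1}S_{N} - 1\right )\right ] = x\,E[U'(W)].\end{array}$$

Hence the normalization E[c U′(W)] = 1 gives $c = E[U'(W)]^{-1} = x\,E[U^{{\ast}}(W)]^{-1}$.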

The Numeraire Portfolio

In the present paper, a special martingale measure Q is studied which is defined by the concept of the numeraire portfolio. Then the choice of Q can be justified by a change of numeraire (discount factor) in place of a change of measure. For this approach one has to choose for U the log-utility with U′(w) = w ⁻¹ and U ^∗(w) = 1 (see Becherer [2], Bühlmann and Platen [3], Christensen and Larsen [4], Goll and Kallsen [9], Karatzas and Kardaras [13], Korn et al. [17], Korn and Schäl [15, 16], Long [19], Platen [21], Schäl [25]). The optimal investment θ ^∗ is called log-optimal. In fact, then one obtains $q = c\,(B_{N}^{-1}X_{N}^{\theta ^{{\ast}} })^{-1}$ and $\mathop{\mathrm{pr}}\nolimits (C) = E[q\,B_{N}^{-1}C] = E[c\,(X_{N}^{\theta ^{{\ast}} })^{-1}C]$ and c = 1 for x = 1 since U ^∗(w) = 1. As a result we finally get

$$\displaystyle{ \mathop{\mathrm{pr}}\nolimits (C) = E[(X_{N}^{\theta ^{{\ast}} })^{-1}C]. }$$

(21.3)

Comparing ( 21.3) with the possibly wrong price $\mathop{\mathrm{pr}}\nolimits (C) = E[B_{N}^{-1}C]$ (see above) and with a consistent price ( 21.1), we see the following: In ( 21.3) we stick to the original probability measure but replace B _N with the wealth $X_{N}^{\theta ^{{\ast}} }$ which can be realized on the market when starting with x = 1 on the bank account and investing according to θ ^∗. When looking for a discount factor, we thus assume that we will use x = 1 in an optimal way instead of investing exclusively in the bank account. As a consequence, the (generalized) discount factor $(X_{N}^{\theta ^{{\ast}} })^{-1}$ is random.
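The pricing rule $\mathop{\mathrm{pr}}\nolimits (C) = E[(X_{N}^{\theta ^{{\ast}} })^{-1}C]$ can be checked numerically in a small example. The following is a minimal sketch under assumptions that are not part of the chapter: a one-period binomial market with B _N = 1, a return S _N ∕S ₀ taking the two values u and d, and purely illustrative parameter values. The function names are hypothetical.

```python
# One-period binomial market with B_N = 1 (discounted quantities).
# All numbers are illustrative assumptions, not taken from the chapter.

def log_optimal_investment(u, d, p):
    """Maximizer of E[log(1 + theta*(R - 1))] for R in {u, d}, P(R = u) = p."""
    a, b = u - 1.0, d - 1.0  # excess returns, a > 0 > b
    # First-order condition p*a/(1 + theta*a) + (1 - p)*b/(1 + theta*b) = 0
    return -(p * a + (1.0 - p) * b) / (a * b)

def numeraire_price(payoff, s0, u, d, p):
    """E[C / X_N] with X_N the log-optimal wealth started from x = 1."""
    theta = log_optimal_investment(u, d, p)
    price = 0.0
    for ret, prob in ((u, p), (d, 1.0 - p)):
        wealth = 1.0 + theta * (ret - 1.0)  # X_N = x - theta + theta*R with x = 1
        price += prob * payoff(s0 * ret) / wealth
    return price

S0, U, D, P = 100.0, 1.2, 0.9, 0.5
stock_price = numeraire_price(lambda s: s, S0, U, D, P)    # claim paying S_N
bond_price = numeraire_price(lambda s: 1.0, S0, U, D, P)   # claim paying one unit of money
call_price = numeraire_price(lambda s: max(s - 100.0, 0.0), S0, U, D, P)
print(stock_price, bond_price, call_price)
```

In this example the claim S _N is priced at S ₀ and one unit of money at 1 under the real-world probabilities, without any change of measure; the call price agrees with the usual binomial risk-neutral price.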

We think that it is easier to explain a change of the discount factor to a non-expert than a change of measure since we here have a financial market where we have more choices for investing one unit of money and not only the choice to invest in the bank account.

The General Model with Transaction Costs

The problem of the paper is to carry over this idea to multiperiod financial models (where N ≥ 1) in the presence of transaction costs. For such models, utility maximization and in particular log-optimality are also well studied. The wealth at stage n will be given by portfolios (Y _n, Z _n) with generic values (y, z) describing the value of the stock account and the bank account at time n, respectively. It is known that the log-optimal dynamic portfolio can be described by two Merton lines in the (y, z)-plane (see Kamin [12], Constantinides [5], Sass [22]) in place of one Merton line as in the setting without transaction costs. For results in continuous time see Davis and Norman [8], Magill and Constantinides [20] and Shreve and Soner [27].

Here we will contribute to that theory. We need a natural region for portfolios (y, z) and therefore allow for negative values of y and z (but with y + z > 0), i.e. for short selling and borrowing. For any stage n < N, the region of admissible portfolios will be the solvency region and it is divided by the two Merton lines into three cones where it is optimal either (i) to buy, (ii) to sell or (iii) not to trade, respectively. These properties simplify numerical studies considerably. Here ‘natural’ means that the region is as large as possible and that none of these three cones is empty. A cone can be empty if one restricts to nonnegative values of y and z. We will provide a moment condition (R3) on the returns which guarantees that the cones are not empty. Furthermore we will deal with open action spaces in order to be sure that the optimal action lies in the interior. This is needed for the argument that the derivative vanishes at a maximum point, which was also used above in the simple explanatory model.

Martingale Measures and the Numeraire Portfolio

Martingale measures and price systems are also discussed in the literature for models with transaction costs, see Jouini and Kallal [10], Koehl et al. [14], Kusuoka [18], Schachermayer [24]. As explained above, they are basic for the concept of a numeraire portfolio. Now the goal of the paper is the following: Study the log-optimal dynamic portfolio and show that it defines a numeraire portfolio. The definition of martingale measures is not so evident in the presence of transaction costs.

When maximizing the expected utility E[U(B _N ⁻¹(Y _N + Z _N))], we will use Y _N + Z _N as total wealth at time N as in Bäuerle and Rieder [1, Sect. 4.5] and Cvitanić and Karatzas [6]. A more general concept can also be used where one introduces liquidation costs L at time N and considers L(Y _N) + Z _N in place of Y _N + Z _N. For this problem we refer the reader to Sass and Schäl [23]. Since L is not differentiable, this case would cause a lot of additional problems and additional assumptions are needed. Indeed, this paper aims at providing the proof in the case without liquidation costs, since this case allows for much more straightforward arguments and requires fewer assumptions.

A contingent claim C, maturing in N, is split into a contingent claim Y ^C for the stock account and a contingent claim Z ^C for the bank account. Then a price for (Y ^C, Z ^C) turns out to be

$$\displaystyle{ \mathop{\mathrm{pr}}\nolimits (Y ^{C},Z^{C}) = E\left [(Y _{ N}^{{\ast}} + Z_{ N}^{{\ast}})^{-1}(Y ^{C} + Z^{C})\right ]. }$$

(21.4)

Here Y _N ^∗ + Z _N ^∗ is the wealth at N under the optimal dynamic portfolio. The role of (Y _N ^∗ + Z _N ^∗)⁻¹ is that of a generalized discount factor and (Y _N ^∗, Z _N ^∗) is then called a numeraire portfolio at N.

Main Result

As the main result, the log-optimal portfolio indeed turns out to define a numeraire portfolio also for models with transaction costs. As in the classical case without transaction costs, the message is the following: under very general conditions you don’t need to change the measure for pricing a contingent claim. You can stick to the probability measure P describing the real market and thus being open to statistical procedures. Instead of the bank account you must use the wealth of the log-optimal policy, starting with one unit of money as usual, as reference unit or benchmark (in the terminology of Platen [21]). Thus we see a contingent claim C relative to Y _N ^∗ + Z _N ^∗. Working with P is also extremely useful when integrating the modeling of risk into finance as in combined finance and insurance problems, see Bühlmann and Platen [3].

2 The Financial Model

The bond with prices B _n, n = 0, …, N, will be described by positive deterministic interest rates r _n − 1 ≥ 0 and the stock with prices S _n, n = 0, …, N, will be described by the relative return process consisting of positive independent random variables {R _n − 1, n = 1, …, N}. Let B ₀ = 1 and S ₀ > 0 be deterministic. Then

$$\displaystyle{ B_{n} = B_{n-1}r_{n},\quad B_{n}^{-1}S_{ n} = B_{n-1}^{-1}S_{ n-1}R_{n},\quad n = 1,\ldots,N. }$$

(21.5)

We write $\mathbb{F} =\{ \mathcal{F}_{n},n = 0,\ldots,N\}$ for the filtration generated by {R _n, n = 1, …, N} where $\mathcal{F}_{0}$ is trivial and $\mathcal{F} = \mathcal{F}_{N}$.

A trading strategy is given by a real valued $\mathbb{F}$-adapted stochastic process {Δ _n, 0 ≤ n < N} describing the amount of money (wealth) invested in the stock. For the transaction Δ _n, the total cost K(Δ _n) with transaction costs 0 ≤ μ < 1, λ ≥ 0 has to be paid, where

$$\displaystyle{ K(\theta ):= (1+\lambda )\theta \mbox{ for }\theta \geq 0,\quad K(\theta ):= (1-\mu )\theta \mbox{ for }\theta \leq 0. }$$

(21.6)

A trading strategy will define a dynamic portfolio $\{(Y _{n},Z_{n}),0 \leq n \leq N\}$ describing the wealth {Y _n} on the stock account and the wealth {Z _n} on the bank account. We get the budget equations

$$\displaystyle\begin{array}{rcl} Y _{n}& =& \overline{Y }_{n-1}r_{n}R_{n},\quad Z_{n} = \overline{Z}_{n-1}r_{n} {}\end{array}$$

(21.7)

$$\displaystyle\begin{array}{rcl}\overline{Y }_{n-1}& =& Y _{n-1} +\varDelta _{n-1},\quad \overline{Z}_{n-1} = Z_{n-1} - K(\varDelta _{n-1}),{}\end{array}$$

(21.8)

where $\overline{Y }_{n-1}$ and $\overline{Z}_{n-1}$ are the wealth on the stock account and the bank account after trading. We consider self-financing trading strategies where no additional wealth is added or consumed. Then we have K(y) ≥ y and $K(\alpha y) =\alpha K(y)$ (positive homogeneity).
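The cost function ( 21.6) and the budget equations ( 21.7) and ( 21.8) translate directly into code. The following is a minimal sketch; the function names and all parameter values are illustrative assumptions.

```python
def total_cost(theta, lam, mu):
    """K(theta): cash paid for buying theta >= 0; for theta < 0 the negative of the cash received."""
    return (1.0 + lam) * theta if theta >= 0 else (1.0 - mu) * theta

def budget_step(y, z, delta, bond_factor, stock_return, lam, mu):
    """One period: trade delta, then apply r_n to both accounts and R_n to the stock account."""
    y_bar = y + delta                        # stock account after trading
    z_bar = z - total_cost(delta, lam, mu)   # bank account after trading
    return y_bar * bond_factor * stock_return, z_bar * bond_factor

LAM, MU = 0.01, 0.02  # illustrative proportional costs for buying and selling
after_buy = budget_step(50.0, 50.0, 10.0, 1.0, 1.1, LAM, MU)
after_sell = budget_step(50.0, 50.0, -10.0, 1.0, 1.1, LAM, MU)
print(after_buy, after_sell)
```

Buying 10 units of wealth in stock costs 10.1 on the bank account, while selling 10 units yields only 9.8, which reflects K(θ) ≥ θ.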

We will only consider admissible trading strategies where the investor stays solvent at any time in the following sense:

$$\displaystyle{ (a)\quad Y _{N} + Z_{N}> 0\quad \mbox{ and }\quad (b)\quad Z_{n} - K(-Y _{n})> 0\quad \mbox{ for }\quad n <N. }$$

(21.9)

Note that ( 21.9) implies Y _n + Z _n > 0 for n ≤ N.

3 The Markov Decision Model

To ease notation we shall now assume r _n = 1 and thus B _n = 1, 1 ≤ n ≤ N. This is a usual assumption and means that one works directly with the discounted quantities B _n ⁻¹ S _n and B _n ⁻¹ B _n = 1 instead of S _n and B _n.

We will work with a Markov decision process where the state is described by (y, z) where y denotes the wealth on the stock account and z the wealth on the bank account.

Definition 21.3.1.

a.
The state space at n is $\mathcal{S}_{N}:=\{ (y,z)\,:\, y + z> 0\}$ for n = N and $\mathcal{S}:=\{ (y,z)\,:\, z - K(-y)> 0\} =\{ (y,z)\,:\, (1-\mu )y + z> 0,(1+\lambda )y + z> 0\}$ for n < N.
b.
An action θ will denote the transaction describing the amount of money (wealth) invested in the stock. The set of admissible actions will be defined below.
c.
The law of motion is defined by the budget Eqs. ( 21.7) and ( 21.8) where {R _n, n = 1, …, N} are independent (but not necessarily identically distributed) random variables. Thus, given the state (y, z) and the action θ at n − 1, the distribution of the state at n is that of
$$\displaystyle{\left ((y+\theta )R_{n},z - K(\theta )\right ).}$$

$\mathcal{S}_{N}$ is called the solvency region at stage N and $\mathcal{S}$ is called the solvency region at all stages n < N. Obviously $\mathcal{S}_{N}$ is defined as $\mathcal{S}$ replacing (λ, μ) by (0, 0). Thus, $\mathcal{S}_{N}$ and $\mathcal{S}$ are open convex cones and the boundaries are formed by half-lines. The condition ( 21.9) can be written as $(Y _{N},Z_{N}) \in \mathcal{S}_{N}$ and $(Y _{n},Z_{n}) \in \mathcal{S}$ for n < N. We will make the following assumptions on R _n.

Assumption 21.3.2.

We assume for n = 1, …, N that R _n is bounded by real constants $\underline{R}$, $\overline{R}$ with

$$\displaystyle\begin{array}{rcl} \mathrm{(R1)}& & 0 <\underline{ R} \leq R_{n} \leq \overline{R}, {}\\ \mathrm{(R2)}& & \underline{R} <1-\mu,\quad 1+\lambda <\overline{R}, {}\\ \mathrm{(R3)}& & E[(R_{n} -\underline{ R})^{-1}] = E[(\overline{R} - R_{ n})^{-1}] = \infty. {}\\ \end{array}$$

For convenience, we omit the index n for $\underline{R}$, $\overline{R}$. Assumption (R3) implies that $\underline{R}$, $\overline{R}$ are in the support of R _n. Then (R2) implies a no-arbitrage condition, i.e., there is a chance that one can lose money and that one can win money when investing in the stock. Assumption (R3) is far from necessary. Indeed, one only needs that $E[(R_{n} -\underline{ R})^{-1}]$ and $E[(\overline{R} - R_{n})^{-1}]$ are big enough. But it is complicated to quantify this property for each stage. Assumption (R3) is satisfied if P(R _n = r) > 0 for r = $\underline{R}$, $\overline{R}$ or if R _n has the uniform distribution on $[\underline{R},\overline{R}]$.

Definition 21.3.3.

$\varGamma:=\{ (y,z)\,:\, (y\,r,z) \in \mathcal{S}\mbox{ for }\underline{R} \leq r \leq \overline{R}\}$ and Γ _N are the pre-solvency regions where Γ _N is defined as Γ replacing $\mathcal{S}$ with $\mathcal{S}_{N}$ and thus (λ, μ) by (0, 0).

Obviously Γ _N contains all states at time N − 1 after trading such that the system is in $\mathcal{S}_{N}$ at time N for every possible value r of R _N. Assumption (R2) now guarantees that $\varGamma _{N} \subset \mathcal{S}$ and one can move from any state $(y,z) \in \mathcal{S}\setminus \varGamma _{N}$ to a state (y +θ, z − K(θ)) ∈ Γ _N by buying (θ > 0) or selling (θ < 0).

Lemma 21.3.4.

$\varGamma =\{ (y,z)\,:\, (1-\mu )\underline{R}\,y + z> 0,\,(1+\lambda )\overline{R}\,y + z> 0\}$ and $\varGamma _{N} =\{ (y,z)\,:\,\underline{ R}\,y + z> 0,\,\overline{R}\,y + z> 0\}$ . Γ and Γ _N are open convex cones and their boundaries are formed by two rays.
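The description of Γ by two inequalities can be compared with its definition on randomly sampled portfolios. The following is a minimal sketch, not a proof: the parameter values are illustrative assumptions satisfying (R2), the definition is only tested on a finite grid of returns, and the function names are hypothetical.

```python
import random

LAM, MU, R_LO, R_HI = 0.01, 0.02, 0.9, 1.2  # illustrative values with R_LO < 1 - MU and 1 + LAM < R_HI

def in_solvency(y, z, lam, mu):
    """(y, z) in S: both liquidation inequalities hold strictly."""
    return (1.0 - mu) * y + z > 0 and (1.0 + lam) * y + z > 0

def in_gamma_formula(y, z, lam, mu):
    """Gamma as described by the two inequalities of the lemma."""
    return (1.0 - mu) * (y * R_LO) + z > 0 and (1.0 + lam) * (y * R_HI) + z > 0

def in_gamma_definition(y, z, lam, mu, steps=50):
    """Gamma by definition: (y*r, z) in S for every r on a grid of [R_LO, R_HI]."""
    inner = [R_LO + (R_HI - R_LO) * k / steps for k in range(1, steps)]
    return all(in_solvency(y * r, z, lam, mu) for r in [R_LO] + inner + [R_HI])

random.seed(0)
points = [(random.uniform(-2, 2), random.uniform(-2, 2)) for _ in range(2000)]
mismatches = sum(
    in_gamma_formula(y, z, LAM, MU) != in_gamma_definition(y, z, LAM, MU)
    for y, z in points
)
# Gamma_N is Gamma with (lam, mu) replaced by (0, 0); it should lie inside S.
gamma_n_outside_s = sum(
    in_gamma_formula(y, z, 0.0, 0.0) and not in_solvency(y, z, LAM, MU)
    for y, z in points
)
print(mismatches, gamma_n_outside_s)
```

Since the inequalities are linear in r, only the extreme returns $\underline{R}$ and $\overline{R}$ matter, which is why two inequalities suffice.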

Definition 21.3.5.

The set of admissible actions θ at stage n < N − 1 will be chosen as

$$\displaystyle{\mathcal{A}(y,z):=\{\theta \,:\, (y+\theta,z - K(\theta )) \in \varGamma \},\quad (y,z) \in \mathcal{S},}$$

and at stage N − 1 as $\mathcal{A}_{N-1}(y,z)$ defined as $\mathcal{A}(y,z)$ replacing Γ with Γ _N.

Thus $\varDelta _{n-1} \in \mathcal{A}(Y _{n-1},Z_{n-1})$ implies $(Y _{n},Z_{n}) \in \mathcal{S}$ for n < N. Important quantities will depend on the state (y, z) only through y∕(y + z) and are thus independent of α on the ray {(α y, α z) : α > 0}. This fact will entail an important cone structure. Therefore we introduce the risky fraction

$$\displaystyle{ \varPi _{n}:= Y _{n}/(Y _{n} + Z_{n}). }$$

(21.10)

We will restrict attention to situations where Y _n + Z _n is strictly positive. Then Π _n is well-defined.

Convention 21.3.6.

If y, z, and π appear in the same context, then we always mean π = y∕(y + z).

By use of Assumption (R2), it is easy to prove the following lemma.

Lemma 21.3.7.

There exist some functions $\underline{\vartheta },\overline{\vartheta }: (-\lambda ^{-1},\mu ^{-1}) \rightarrow \mathbb{R}$ such that

$$\displaystyle{\mathcal{A}(y,z) =\{\theta \,;\,\underline{\vartheta }(\pi ) <\theta /(y + z) <\overline{\vartheta }(\pi )\}.}$$

The same result holds for $\mathcal{A}_{N-1}$ replacing (−λ ⁻¹ ,μ ⁻¹ ) by $\mathbb{R}$ , i.e. (λ,μ) by (0,0).

Then the interval $(\underline{\vartheta }(\cdot ),\overline{\vartheta }(\cdot ))$ will be a function of Π _n for $(Y _{n},Z_{n}) \in \mathcal{S}$. Note that $\overline{\vartheta }(\pi )$ may be negative (if π is too large) and $\underline{\vartheta }(\pi )$ may be positive (if π is too small).

We will use the log-utility and consider the following maximization problem:

$$\displaystyle{ G_{n}^{{\ast}}(y,z):=\sup \, E[\log (Y _{ N} + Z_{N})\,\vert \,Y _{n} = y,\,Z_{n} = z], }$$

(21.11)

where the supremum is taken over all admissible trading strategies. The expectation in ( 21.11) is well-defined. In fact, for given (y, z), the integrand log(Y _N + Z _N) is bounded from above. For that fact it is sufficient to consider the case without transaction costs which was treated in Korn and Schäl [15, Theorem 4.12]. From dynamic programming we know that we can restrict to Markov policies where Δ _n = δ _n(Y _n, Z _n). Here a trading strategy is described by a Markov policy {δ _n, n = 0, …, N − 1}, where each decision rule δ _n is a function on $\mathcal{S}$ with $\delta _{N-1}(y,z) \in \mathcal{A}_{N-1}(y,z)$ and $\delta _{n}(y,z) \in \mathcal{A}(y,z)$ for n < N − 1. Set

$$\displaystyle{ G_{n}(y,z):= E[G_{n+1}^{{\ast}}(y\,R_{ n+1},z)]. }$$

(21.12)

Then the following optimality equation holds:

$$\displaystyle{ G_{n}^{{\ast}}(y,z) = \text{max}_{\theta }G_{ n}(y+\theta,z - K(\theta )), }$$

(21.13)

where θ runs through $\mathcal{A}_{N-1}(y,z)$ for n = N − 1 and through $\mathcal{A}(y,z)$ for n < N − 1. The optimality criterion states (see e.g. [1, Theorem 2.3.8]): If there are maximizers $\theta ^{{\ast}} =\delta _{n}(y,z)$ such that

$$\displaystyle{ G_{n}(y +\theta ^{{\ast}},z - K(\theta ^{{\ast}})) = \text{max}_{\theta }G_{ n}(y+\theta,z - K(\theta )), }$$

(21.14)

then {δ _n} defines an optimal Markov policy.

Definition 21.3.8.

We call a line $\{(y+\theta,z - (1-\mu )\theta )\,:\,\theta \in \mathbb{R}\}$ a sell-line and a line $\{(y+\theta,z - (1+\lambda )\theta )\,:\,\theta \in \mathbb{R}\}$ a buy-line.

We can now state the main theorem on the structure of the optimal Markov policy.

Theorem 21.3.9.

For n = N − 1,…,1,0 we have

a.
There exist numbers $-1/\lambda <a_{n} \leq b_{n} <1/\mu$ such that the following holds: There exists an optimal Markov policy {δ _n } where {δ _n } is defined by
(i) δ _n = 0 on the no-trading cone $\mathcal{T}_{n}^{\mathrm{notr}}:=\{ (y,z) \in \mathcal{S}\,:\, a_{n} \leq \pi \leq b_{n}\}$,
(ii) δ _n (y,z) = θ < 0 on the sell cone $\mathcal{T}_{n}^{\mathrm{sell}}:=\{ (y,z) \in \mathcal{S}\,:\, b_{n} <\pi <1/\mu \}$ such that (y + θ,z − (1 −μ)θ) is situated on the ray {(αb _n ,α(1 − b _n )) : α ≥ 0},
(iii) δ _n (y,z) = θ > 0 on the buy cone $\mathcal{T}_{n}^{\mathrm{buy}}:=\{ (y,z) \in \mathcal{S}\,:\, -1/\lambda <\pi <a_{n}\}$ such that (y + θ,z − (1 + λ)θ) is situated on the ray {(αa _n ,α(1 − a _n )) : α ≥ 0}.
b.
G _n ^∗ (αy,αz) = log α + G _n ^∗ (y,z) for α > 0 and G _n ^∗ (y,z) is concave and isotone in each component.
c.
On the sell-line through (y,z), G _n attains its maximum in a point (αb _n ,α(1 − b _n )) for some $\alpha \in \mathbb{R}$ . On the buy-line through (y,z), G _n attains its maximum in a point (αa _n ,α(1 − a _n )) for some $\alpha \in \mathbb{R}$.
d.
The sell cone and the buy cone (and of course the no-trading cone) are not empty.

Condition (R3) is only used for part (d) in Theorem 21.3.9, but it will play an important role in Sects. 21.4 and 21.5. Now the theorem has the following interpretation. Selling can be interpreted as a walk on a sell-line in the (y, z)-plane. For (y, z) in the sell-cone, optimal selling then means to walk on a sell-line (starting in (y, z)) until one reaches the boundary of the no-trading-cone. The situation for the buy-cone is similar. $\mathcal{T}_{n}^{\mathrm{notr}} \cup \{ 0\}$ is a closed convex cone and $\mathcal{T}_{n}^{\mathrm{notr}}$ degenerates to the Merton-line if μ = λ = 0. In the present general case the boundaries of $\mathcal{T}_{n}^{\mathrm{notr}}$ may be called the two Merton-lines. The proof of the theorem is given in Appendix 21.6. A similar result holds for the power utility function U _γ(w) = γ ⁻¹ w ^γ, 0 ≠ γ < 1 (see Sass and Schäl [23]).
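For the last stage n = N − 1 the two Merton-lines can be computed explicitly in a simple example. The following is a minimal sketch under assumptions that are not part of the theorem: R _N takes only two values u and d, and the maximizer of E[log(y R _N + z)] along a buy-line or a sell-line is characterized by a vanishing derivative, which in terms of the risky fraction π reads E[(R _N − k)∕(1 + π(R _N − 1))] = 0 with k = 1 + λ for buying and k = 1 − μ for selling. Function names and parameter values are illustrative.

```python
def last_stage_boundary(k, u, d, p):
    """Root pi of E[(R - k) / (1 + pi*(R - 1))] = 0 for R in {u, d}, P(R = u) = p."""
    # Multiplying the condition by both denominators gives num + pi * den = 0.
    num = p * (u - k) + (1.0 - p) * (d - k)
    den = p * (u - k) * (d - 1.0) + (1.0 - p) * (d - k) * (u - 1.0)
    return -num / den

def no_trading_interval(lam, mu, u, d, p):
    """Risky fractions (a, b) bounding the no-trading cone at stage N - 1."""
    a = last_stage_boundary(1.0 + lam, u, d, p)  # reached by buying
    b = last_stage_boundary(1.0 - mu, u, d, p)   # reached by selling
    return a, b

U, D, P = 1.2, 0.9, 0.5  # illustrative two-point return with D < 1 - mu and 1 + lam < U
a, b = no_trading_interval(0.01, 0.01, U, D, P)
merton, merton_again = no_trading_interval(0.0, 0.0, U, D, P)
print(a, b, merton)
```

In this example the no-trading interval [a, b] contains the frictionless Merton fraction, and for λ = μ = 0 both boundaries collapse to that single fraction.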

4 Martingale Properties of the Optimal Markov Decision Process

Given the optimal policy {δ _n} from Theorem 21.3.9, the initial value (y, z), and the sequence R _n(ω), n ≥ 1, we can construct the state process (Y _n(ω), Z _n(ω)), n ≥ 0. In the sequel we will only consider this process {(Y _n, Z _n), n = 0, …, N} determined by the optimal policy. In this section we want to prove a martingale property of the optimal Markov decision process which is important for the financial application. In the model without transaction costs, {(Y _n + Z _n)⁻¹} is a martingale. In the presence of transaction costs one has to modify Y _n by a factor ρ _n which is close to one if the transaction costs are small. Our main goal will be to prove that {(ρ _n Y _n + Z _n)⁻¹} is a martingale then.

Besides the risky fraction Π _n we will consider the risky fraction after trading $\overline{\varPi }_{n}$ defined by

$$\displaystyle{ \overline{\varPi }_{n}:= \overline{Y }_{n}/(\overline{Y }_{n} + \overline{Z}_{n}). }$$

(21.15)

Further we introduce

$$\displaystyle{ \hat{\varPi }(\pi,r):= \frac{\pi r} {\pi r + 1-\pi }. }$$

(21.16)

Then we obtain from Theorem 21.3.9:

$$\displaystyle\begin{array}{rcl} \overline{\varPi }_{n}& =& 1_{\{\varPi _{n}\leq a_{n}\}}a_{n} + 1_{\{a_{n}<\varPi _{n}<b_{n}\}}\varPi _{n} + 1_{\{\varPi _{n}\geq b_{n}\}}b_{n} {}\end{array}$$

(21.17)

$$\displaystyle\begin{array}{rcl} \varPi _{n+1}& =& \overline{Y }_{n}R_{n+1}/(\overline{Y }_{n}R_{n+1} + \overline{Z}_{n}) =\hat{\varPi } (\overline{\varPi }_{n},R_{n+1}).{}\end{array}$$

(21.18)
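The two relations above describe the risky fraction as a process that is projected onto [a _n, b _n] by trading and then moved by the return. The following is a minimal sketch of these dynamics; the boundaries (0.4, 0.6) and the returns are hypothetical placeholders, not values computed from the model.

```python
def post_trade_fraction(pi, a, b):
    """Risky fraction after optimal trading: pi projected onto [a, b]."""
    return min(max(pi, a), b)

def hat_pi(pi, r):
    """Risky fraction after the stock account has been multiplied by the return r."""
    return pi * r / (pi * r + 1.0 - pi)

def simulate_fractions(pi0, bounds, returns):
    """List of (Pi_n, post-trade Pi_n) per stage, plus the fraction after the last return."""
    path, pi = [], pi0
    for (a, b), r in zip(bounds, returns):
        pi_bar = post_trade_fraction(pi, a, b)
        path.append((pi, pi_bar))
        pi = hat_pi(pi_bar, r)
    return path, pi

bounds = [(0.4, 0.6)] * 3   # hypothetical (a_n, b_n), chosen only for illustration
returns = [1.2, 0.9, 1.2]   # hypothetical realized returns R_1, R_2, R_3
path, final_pi = simulate_fractions(0.2, bounds, returns)
print(path, final_pi)
```

Starting below a _0, the fraction is first pushed up to the buy boundary and afterwards stays inside the no-trading interval as long as the returns do not move it out.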

By the definition of (Y _n, Z _n) above, we know that ( 21.11) becomes

$$\displaystyle{ G_{n}^{{\ast}}(y,z) = E[\log (Y _{ N} + Z_{N})\,\vert \,Y _{n} = y,Z_{n} = z]. }$$

(21.19)

Then we have $G_{N-1}^{{\ast}}(y,z) = G_{N-1}(y,z)$ for (y, z) in the no-trading cone $a_{N-1} \leq \pi \leq b_{N-1}$ where

$$\displaystyle{ G_{N-1}(y,z) = E[\log (y\,R_{N} + z)]. }$$

(21.20)

Definition 21.4.1.

We define H _N: = Y _N + Z _N = ρ _N Y _N + Z _N, where ρ _N: = 1, and for n = N − 1, …, 0

$$\displaystyle\begin{array}{rcl} \rho _{n}&:=& E[\rho _{n+1}R_{n+1}H_{n+1}^{-1}\,\vert \,\mathcal{F}_{ n}]/E[H_{n+1}^{-1}\,\vert \,\mathcal{F}_{ n}], {}\\ H_{n}&:=& \rho _{n}Y _{n} + Z_{n}. {}\\ \end{array}$$

Remark 21.4.2.

In Definition 21.4.1, ρ _n is well-defined since $H_{n+1}$ is positive and bounded away from zero given $(\overline{Y }_{n},\overline{Z}_{n}) = (y,z) \in \varGamma _{N}$ (and Γ, respectively).

Lemma 21.4.3.

One can write $\rho _{n} =\hat{\rho } _{n}(\varPi _{n})$ for some function $\hat{\rho }_{n}$ , i.e. ρ _n depends on the history only through Π _n.

a.
For a _n ≤π ≤ b _n
$$\displaystyle{\hat{\rho }_{n}(\pi ) = E[\hat{\rho }_{n+1}(\hat{\varPi }(\pi,R_{n+1}))R_{n+1}H_{n+1}^{-1}]/E[H_{ n+1}^{-1}],}$$
where $H_{n+1} =\hat{\rho } _{n+1}(\hat{\varPi }(\pi,R_{n+1}))\pi \,R_{n+1} + 1-\pi$.
b.
For π ≤ a _n we have $\hat{\rho }_{n}(\pi ) =\hat{\rho } _{n}(a_{n})$.
c.
For π ≥ b _n we have $\hat{\rho }_{n}(\pi ) =\hat{\rho } _{n}(b_{n})$.

Proof.

For n = N we set $\hat{\rho }_{N} = 1$. For the induction step n + 1 → n let $\overline{\varPi }_{n} =\pi$ and $\overline{Y }_{n} + \overline{Z}_{n} = x$ be fixed. Then $\rho _{n} = E[\hat{\rho }_{n+1}(\hat{\varPi }(\pi,R_{n+1}))R_{n+1}H_{n+1}^{-1}\,\vert \,\mathcal{F}_{n}]/E[H_{n+1}^{-1}\,\vert \,\mathcal{F}_{n}]$, where $H_{n+1} =\hat{\rho } _{n+1}(\varPi _{n+1})Y _{n+1} + Z_{n+1} = x\left (\hat{\rho }_{n+1}(\hat{\varPi }(\pi,R_{n+1}))\pi \,R_{n+1} + 1-\pi \right )$. Thus ρ _n is in fact a function of $\overline{\varPi }_{n} =\pi$ and thus $\hat{\rho }_{n}$ a function of Π _n.

Now (b) and (c) follow in view of (21.17). □

Lemma 21.4.4.

$\hat{\rho }_{n}$ is continuous.

Proof.

We know that $\hat{\rho }_{N} \equiv 1$ is continuous. We will prove now that $\hat{\rho }_{n}$ is continuous if $\hat{\rho }_{n+1}$ is continuous. By Lemma 21.4.3(b), (c), $\hat{\rho }_{n}$ is constant and hence continuous for π ≤ a _n and for π ≥ b _n. For a _n ≤ π ≤ b _n the statement follows from Lemma 21.4.3(a), since $\hat{\varPi }(\pi,r)$ is continuous in π. □

Theorem 21.4.5.

a.
{H _n ⁻¹ ,n = 0,…,N} is a martingale,
b.
1 −μ ≤ρ _n ≤ 1 + λ, n = 0,…,N.

The proof is given in Appendix 21.6.
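Both statements of Theorem 21.4.5 can be checked numerically for the last step N − 1 → N. The following is a minimal sketch, not a substitute for the proof: it assumes a two-point return R _N ∈ {u, d}, boundaries a _N−1, b _N−1 obtained from the first-order condition E[(R _N − k)∕(1 + π(R _N − 1))] = 0 with k = 1 + λ and k = 1 − μ, and illustrative parameter values. The function name is hypothetical.

```python
def check_last_step(y, z, lam, mu, u, d, p):
    """Return (rho_{N-1}, E[1/H_N] * H_{N-1}) under the optimal trade at stage N - 1."""
    def boundary(k):  # root of E[(R - k) / (1 + pi*(R - 1))] = 0 for R in {u, d}
        num = p * (u - k) + (1 - p) * (d - k)
        den = p * (u - k) * (d - 1) + (1 - p) * (d - k) * (u - 1)
        return -num / den

    a, b = boundary(1 + lam), boundary(1 - mu)
    pi = y / (y + z)
    if pi < a:    # buy until the risky fraction after trading equals a
        theta = (a * (y + z) - y) / (1 + a * lam)
        y_bar, z_bar = y + theta, z - (1 + lam) * theta
    elif pi > b:  # sell until the risky fraction after trading equals b
        theta = (b * (y + z) - y) / (1 - b * mu)
        y_bar, z_bar = y + theta, z - (1 - mu) * theta
    else:         # no trade
        y_bar, z_bar = y, z
    dist = ((u, p), (d, 1 - p))
    inv_h = sum(q / (y_bar * r + z_bar) for r, q in dist)        # E[1/H_N], H_N = Y_N + Z_N
    r_inv_h = sum(q * r / (y_bar * r + z_bar) for r, q in dist)  # E[R_N/H_N]
    rho = r_inv_h / inv_h                                        # rho_{N-1}, since rho_N = 1
    return rho, inv_h * (rho * y + z)                            # second entry should be 1

LAM, MU, U, D, P = 0.01, 0.01, 1.2, 0.9, 0.5
states = [(0.5, 0.5), (2.5, -1.5), (4.0, -3.0)]  # buy cone, no-trading cone, sell cone
results = [check_last_step(y, z, LAM, MU, U, D, P) for y, z in states]
print(results)
```

In this example ρ _N−1 equals 1 + λ in the buy cone and 1 − μ in the sell cone, lies in between in the no-trading cone, and the product E[H _N ⁻¹] H _N−1 equals 1 up to rounding in all three cases.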

5 Price Systems and the Numeraire Portfolio

Price Systems and Martingale Measures Q

In this section discount factors play an important role. The theory therefore seems to become more transparent if we write the discount factor B _n ⁻¹ explicitly. We are interested in an alternative probability measure Q with density q = dQ∕dP w.r.t. P, where Q has the same null sets as P, i.e. Q and P are equivalent. Then we have

$$\displaystyle{ q> 0\mbox{ a.s. and }\quad E[q] = 1,\quad Q(A) =\int _{A}q\,dP\quad \mbox{ for }A \in \mathcal{F}. }$$

(21.21)

Now consider a contingent claim (Y ^C, Z ^C) maturing in N and split into a contingent claim Y ^C for the stock account and a contingent claim Z ^C for the bank account. We want to find a price $\mathop{\mathrm{pr}}\nolimits (Y ^{C},Z^{C})$ for (Y ^C, Z ^C) and will use the following approach (ansatz) if (Y ^C, Z ^C) is bounded or if Y ^C + Z ^C ≥ 0:

$$\displaystyle{ \mathop{\mathrm{pr}}\nolimits (Y ^{C},Z^{C}) = E_{ Q}\left [B_{N}^{-1}(Y ^{C} + Z^{C})\right ] = E\left [q\,B_{ N}^{-1}(Y ^{C} + Z^{C})\right ]. }$$

(21.22)

Theorem 21.5.1.

$\mathop{\mathrm{pr}}\nolimits (\,\cdot \,)$ as given by ( 21.22 ) defines a price system, i.e. one has $\mathop{\mathrm{pr}}\nolimits (Y ^{C},Z^{C})> 0$ for any (Y ^C ,Z ^C ) with the properties

$$\displaystyle{ Y ^{C} + Z^{C} \geq 0\quad \mbox{ a.s.},\qquad P(Y ^{C} + Z^{C}> 0)> 0. }$$

(21.23)

The proof of Theorem 21.5.1 is given by Kusuoka [18] for finite probability spaces. There it is shown that the form ( 21.22) is also necessary for a consistent price system as defined in Theorem 21.5.3 below. See also Sass and Schäl [23]. We will write

$$\displaystyle{ q_{n}:= E[q\,\vert \,\mathcal{F}_{n}]. }$$

(21.24)

Then {q _n} is the density process and is a martingale under P by definition. Now we define {ρ _n} given q = q _N, ρ _N = 1. It will turn out that the process will agree with {ρ _n} as defined in Sect. 21.4.

Definition 21.5.2.

$q_{n}\rho _{n}B_{n}^{-1}S_{n}:= E[q\,B_{N}^{-1}S_{N}\,\vert \,\mathcal{F}_{n}]$, (i.e. $\rho _{n} = E_{Q}[R_{n+1}\cdots R_{N}\,\vert \,\mathcal{F}_{n}]$).

The equation in parentheses follows from Bayes’ rule. Then {q _n ρ _n B _n ⁻¹ S _n} is a martingale under P by definition which also means, in view of Bayes’ rule, that {ρ _n B _n ⁻¹ S _n} is a martingale under Q. If there are no transaction costs, i.e. λ = μ = 0, we have under condition ( 21.25) below ρ _n = 1, 1 ≤ n ≤ N. Then the discounted stock price process {B _n ⁻¹ S _n} forms a martingale under the probability measure Q with density q and density process {q _n}. That is the reason for calling Q a martingale measure then.
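The step behind the equation in parentheses can be sketched as follows, using $B_{N}^{-1}S_{N} = B_{n}^{-1}S_{n}R_{n+1}\cdots R_{N}$ from ( 21.5) and Bayes’ rule in the form $E_{Q}[X\,\vert \,\mathcal{F}_{n}] = E[q\,X\,\vert \,\mathcal{F}_{n}]/q_{n}$:

$$\displaystyle\begin{array}{rcl} E[q\,B_{N}^{-1}S_{N}\,\vert \,\mathcal{F}_{n}]& =& B_{n}^{-1}S_{n}\,E[q\,R_{n+1}\cdots R_{N}\,\vert \,\mathcal{F}_{n}] = q_{n}\,B_{n}^{-1}S_{n}\,E_{Q}[R_{n+1}\cdots R_{N}\,\vert \,\mathcal{F}_{n}].\end{array}$$

Dividing by $q_{n}B_{n}^{-1}S_{n}> 0$ and comparing with Definition 21.5.2 gives $\rho _{n} = E_{Q}[R_{n+1}\cdots R_{N}\,\vert \,\mathcal{F}_{n}]$.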

Now we define the notion of a consistent price system and give a condition in terms of {ρ _n}.

Theorem 21.5.3.

Assume for 1 ≤ n ≤ N

$$\displaystyle{ 1-\mu \leq \rho _{n} \leq 1 +\lambda. }$$

(21.25)

Then the price system $\mathop{\mathrm{pr}}\nolimits (\,\cdot \,)$ is consistent , i.e.

$$\displaystyle\begin{array}{rcl} & & \mathop{\mathrm{pr}}\nolimits (Y ^{C},Z^{C}) = 1\quad \mbox{ for }\quad (Y ^{C},Z^{C}) = (0,B_{ N});{}\end{array}$$

(21.26)

$$\displaystyle\begin{array}{rcl} & & (1-\mu )S_{0} \leq \mathop{\mathrm{pr}}\nolimits (Y ^{C},Z^{C}) \leq (1+\lambda )S_{ 0}\quad \mbox{ for }\quad (Y ^{C},Z^{C}) = (S_{ N},0);{}\end{array}$$

(21.27)

$$\displaystyle\begin{array}{rcl} & & \mathop{\mathrm{pr}}\nolimits (Y ^{C},Z^{C}) \leq 0\quad \mbox{ for }\quad (Y ^{C},Z^{C}) = (Y _{ N},Z_{N}),{}\end{array}$$

(21.28)

where (Y _N ,Z _N ) is the terminal portfolio under an arbitrary admissible policy with start in (Y ₀ ,Z ₀ ) = (0,0).

Relation ( 21.26) is natural. If one starts with 1 unit of bond, then one can be sure to have B _N on the bank account at N. Relation ( 21.27) is also natural. Let us only consider the case λ = μ = 0 without transaction costs. If one starts then with 1 unit of stock, then one can be sure to have S _N on the stock account at N. Relation ( 21.28) excludes a sort of arbitrage opportunity. Starting with nothing one can never reach a portfolio with a positive price. The proof of Theorem 21.5.3 is given by Kusuoka [18] for finite probability spaces. There it is shown that ( 21.25) is also necessary for a consistent price system.

The Numeraire Portfolio

Now we can explain the main purpose of the paper in terms of this section. We study the following problem: can we replace the discount factor B _N ⁻¹ by a more general one, H _N ⁻¹, where H _N is the terminal total wealth under some traded portfolio, and then keep the original (physical) probability measure in place of Q? Thus we want to find an admissible policy with start in (Y ₀, Z ₀) and with total wealth H _N = Y _N + Z _N at N such that E[q B _N ⁻¹(Y ^C + Z ^C)] = E[H _N ⁻¹(Y ^C + Z ^C)]. Then we have to define q by

$$\displaystyle{ B_{N}^{-1}q = c\,(Y _{ N} + Z_{N})^{-1} = c\,H_{ N}^{-1},\quad c = E[H_{ N}^{-1}B_{ N}]^{-1}, }$$

(21.29)

where the case c = 1 is of particular interest.

From now on, we return to the setting where B _n ≡ 1.

Lemma 21.5.4.

The definition of {ρ _n } in Sect. 21.4 agrees with Definition 21.5.2 and we have q _n = c H _n ⁻¹.

We will require that c = 1 in Corollary 21.1 below.

Proof.

Let (Y _N, Z _N) be the portfolio at N under the optimal policy as in Sect. 21.4. Set H _N: = Y _N + Z _N = ρ _N Y _N + Z _N, $\rho _{n}:= E[\rho _{n+1}R_{n+1}H_{n+1}^{-1}\,\vert \,\mathcal{F}_{n}]/E[H_{n+1}^{-1}\,\vert \,\mathcal{F}_{n}]$ as in Definition 21.4.1 and define H _n: = ρ _n Y _n + Z _n, n < N. Then we can conclude from Theorem 21.4.5(a) that

$$\displaystyle{ \{H_{n}^{-1}\}\quad \mbox{ is a martingale.} }$$

(21.30)

Upon setting q = q _N: = c H _N ⁻¹ as above, we obtain $q_{n} = E[c\,H_{N}^{-1}\,\vert \,\mathcal{F}_{n}] = c\,H_{n}^{-1}$ and $\rho _{n}H_{n}^{-1} =\rho _{n}E[H_{n+1}^{-1}\,\vert \,\mathcal{F}_{n}] = E[\rho _{n+1}R_{n+1}H_{n+1}^{-1}\,\vert \,\mathcal{F}_{n}]$. This yields

$$\displaystyle\begin{array}{rcl} q_{n}\rho _{n}S_{n}& =& c\,H_{n}^{-1}\rho _{ n}S_{n} = c\,S_{n}\,E[\rho _{n+1}R_{n+1}H_{n+1}^{-1}\,\vert \,\mathcal{F}_{ n}] {}\\ & =& c\,E[\rho _{n+1}S_{n+1}H_{n+1}^{-1}\,\vert \,\mathcal{F}_{ n}] = E[q_{n+1}\rho _{n+1}S_{n+1}\,\vert \,\mathcal{F}_{n}]. {}\\ \end{array}$$

Thus {q _n ρ _n S _n} is a martingale under P and the definition of ρ _n in Sect. 21.4 agrees with Definition 21.5.2. □

Now we are allowed to apply Theorem 21.4.5(b) and we get condition ( 21.25). Hence Theorem 21.5.3 applies and we know that $\mathop{\mathrm{pr}}\nolimits (Y ^{C},Z^{C}) = c\,E[H_{N}^{-1}(Y ^{C} + Z^{C})]$ is a consistent price system. For c we have 1 = E[q] = c E[H _N ⁻¹] = c H ₀ ⁻¹ by ( 21.30). Thus

$$\displaystyle{ c = H_{0} =\rho _{0}Y _{0} + Z_{0}. }$$

(21.31)

For models without transaction costs, one usually starts with one unit of money to get the discount factor. If we do the same in the present case, then we start with (Y ₀, Z ₀) = (0, 1) and thus with c = H ₀ = 1. Thus we get the following corollary as main result.

Corollary 21.1.

Let {(Y _n ,Z _n )} be generated by an optimal policy as in Sect. 21.4 . If we start with (Y ₀ ,Z ₀ ) = (0,1) or more generally with H ₀ = ρ ₀ Y ₀ + Z ₀ = 1, then a consistent price system is given by

$$\displaystyle{\mathop{\mathrm{pr}}\nolimits (Y ^{C},Z^{C}) = E[(Y _{ N} + Z_{N})^{-1}(Y ^{C} + Z^{C})].}$$

Definition 21.5.5.

In the situation of Corollary 21.1 we call the dynamic portfolio {(Y _n, Z _n)} a numeraire portfolio.
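As an illustration of Corollary 21.1 in the simplest setting, the following sketch assumes a hypothetical one-period binomial market without transaction costs (λ = μ = 0, B _n ≡ 1, S ₀ = 1). The return distribution and all names are illustrative assumptions. In this special case the log-optimal stock fraction has a closed form, and pricing with the factor (Y _N + Z _N) ⁻¹ under P should reproduce the bond price 1, the stock price S ₀ and the usual risk-neutral option price.

```python
# Hypothetical one-period binomial market, no transaction costs, B_n = 1, S_0 = 1.
returns = [(1.5, 0.5), (0.6, 0.5)]   # pairs (value of R_1, probability)

def expect(f):
    """Expectation of f(R_1) under the physical measure P."""
    return sum(prob * f(r) for r, prob in returns)

# Log-optimal stock fraction from E[(R - 1) / (1 + pi (R - 1))] = 0,
# solved in closed form for a two-point distribution.
(r_up, p_up), (r_dn, p_dn) = returns
x_up, x_dn = r_up - 1.0, r_dn - 1.0
pi_star = -(p_up * x_up + p_dn * x_dn) / (x_up * x_dn)

def H(r):
    """Terminal wealth Y_1 + Z_1 when starting from (Y_0, Z_0) = (0, 1)."""
    return 1.0 + pi_star * (r - 1.0)

def price(claim):
    """pr = E[(Y_N + Z_N)^{-1} (Y^C + Z^C)], claim given as total payoff."""
    return expect(lambda r: claim(r) / H(r))

bond_price = price(lambda r: 1.0)                  # claim (0, B_N)
stock_price = price(lambda r: r)                   # claim (S_N, 0)
call_price = price(lambda r: max(r - 1.0, 0.0))    # at-the-money call
```

With transaction costs the closed form for the optimal fraction is no longer available, and the stock price is only guaranteed to lie in the interval given by ( 21.27).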

6 Conclusive Remarks

Extension 21.6.1.

A similar result can be derived for power utility U _γ(x) = x ^γ∕γ with U′_γ(w) = w ^(γ−1) and U _γ ^∗(w) = U′_γ(w) w = w ^γ for 0 ≠ γ < 1, where γ = 0 would correspond to the log-utility. When starting again with (Y ₀, Z ₀) = (0, 1), one obtains a consistent price system (see Sass and Schäl [23]) by

$$\displaystyle{ \mathop{\mathrm{pr}}\nolimits ^{\gamma }(Y ^{C},Z^{C}) = E[U_{\gamma }^{{\ast}}(Y _{ N} + Z_{N})]^{-1}E[U'_{\gamma }(Y _{ N} + Z_{N})(Y ^{C} + Z^{C})], }$$

(21.32)

where {(Y _n, Z _n)} now is the optimal dynamic portfolio for U _γ. Then (R3) is to be replaced by $E[(R_{n} -\underline{ R})^{\gamma -1}] = E[(\overline{R} - R_{n})^{\gamma -1}] = \infty$. Now ( 21.32) formally corresponds to formula ( 21.2), but Y _N + Z _N still depends on the transaction costs. On the one hand, the power utility allows one to work with a more general relative risk aversion 1 −γ of the investor. On the other hand, we have to work with a probability measure Q _γ ≠ P. In fact, we then have

$$\displaystyle{Q_{\gamma }(A) =\int _{A}q_{\gamma }\,dP,\quad A \in \mathcal{F},\quad \mbox{ and }\quad q_{\gamma } = E[U_{\gamma }^{{\ast}}(Y _{ N} + Z_{N})]^{-1}U'_{\gamma }(Y _{ N} + Z_{N})\tilde{B}_{N}}$$

if we decide for $\tilde{B}_{N}^{-1}$ as discount factor. We can choose $\tilde{B}_{N} = B_{N}$ or $\tilde{B}_{N} = Y _{N} + Z_{N}$ or more generally $\tilde{B}_{N} = Y _{N}^{0} + Z_{N}^{0}$, where {(Y _n ⁰, Z _n ⁰)} is the dynamic portfolio under any admissible policy {δ _n ⁰}.
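Formula ( 21.32) can be tried out numerically. The sketch below assumes a hypothetical one-period binomial market without transaction costs and B _n ≡ 1; the value γ = 0.5, the return distribution, the bisection bracket and all names are illustrative assumptions. The optimal fraction for U _γ is found from the first order condition, and ( 21.32) is then evaluated for the bond and the stock, which should both be priced at 1 in this frictionless case.

```python
gamma = 0.5                           # illustrative; any 0 != gamma < 1
returns = [(1.5, 0.5), (0.6, 0.5)]    # pairs (value of R_1, probability)

def expect(f):
    """Expectation of f(R_1) under P."""
    return sum(prob * f(r) for r, prob in returns)

def marginal(w):
    """U'_gamma(w) = w^(gamma - 1)."""
    return w ** (gamma - 1.0)

def foc(pi):
    """Derivative of E[U_gamma(1 + pi (R - 1))] with respect to pi."""
    return expect(lambda r: marginal(1.0 + pi * (r - 1.0)) * (r - 1.0))

# Bisection on a bracket where wealth stays positive and foc changes sign.
lo, hi = 0.0, 2.0
for _ in range(200):
    mid = 0.5 * (lo + hi)
    if foc(mid) > 0.0:
        lo = mid
    else:
        hi = mid
pi_gamma = 0.5 * (lo + hi)

def wealth(r):
    """Y_1 + Z_1 under the U_gamma-optimal fraction, start (0, 1)."""
    return 1.0 + pi_gamma * (r - 1.0)

def price_gamma(claim):
    """Formula (21.32): E[U*(W)]^{-1} E[U'(W) C] with U*(w) = U'(w) w."""
    norm = expect(lambda r: marginal(wealth(r)) * wealth(r))
    return expect(lambda r: marginal(wealth(r)) * claim(r)) / norm
```

For example, `price_gamma(lambda r: max(r - 1.0, 0.0))` gives the price of an at-the-money call in this toy market.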

Algorithm 21.6.2.

The pricing of financial derivatives under proportional transaction costs can now be done efficiently as follows. First, by backward induction one can find numerically the boundaries a _N−1, …, a ₀ and b _N−1, …, b ₀ of the no-trade-region which exist according to Theorem 21.3.9(c). Second, having computed these constants, the dynamic portfolio (Y _n, Z _n), n = 0, …, N, under the optimal policy can then be computed forwardly for any path of the stock prices. These computations are independent of the specific claims we want to price. For any financial derivative C = (Y ^C, Z ^C) we find a price according to Corollary 21.1. Since this price system is consistent, the resulting price does not lead to arbitrage. This price is preference based. Since it depends on the log-optimal portfolio it corresponds to an investor with logarithmic utility which has relative risk aversion 1. Different relative risk aversions 1 −γ > 0 can be covered by using power utility functions as in Extension 21.6.1. Also for these the computation is efficient in the sense that the optimal policy can be computed first and then prices for any claim can be found by taking expectations as in ( 21.32).
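The three steps of the algorithm can be sketched for a single period (N = 1), where the backward induction reduces to its induction start. The sketch assumes a hypothetical binomial return, λ = μ = 0.01, B _n ≡ 1 and S ₀ = 1; it further assumes that the first order conditions of Lemma 21.7.10 have a root in [0, 1] and that the buy boundary is positive, which holds for the numbers chosen here. All names are illustrative.

```python
lam, mu = 0.01, 0.01                  # proportional costs for buying and selling
returns = [(1.5, 0.5), (0.6, 0.5)]    # pairs (value of R_1, probability)

def expect(f):
    """Expectation of f(R_1) under P."""
    return sum(prob * f(r) for r, prob in returns)

def boundary(k):
    """Solve E[(R - k) / (x R + 1 - x)] = 0 for x by bisection on [0, 1]."""
    def foc(x):
        return expect(lambda r: (r - k) / (x * r + 1.0 - x))
    lo, hi = 0.0, 1.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if foc(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Step 1 (backward): boundaries of the no-trade region at stage N - 1 = 0.
a0 = boundary(1.0 + lam)   # buy boundary
b0 = boundary(1.0 - mu)    # sell boundary

# Step 2 (forward): start in (Y_0, Z_0) = (0, 1). Here pi = 0 < a0, so the
# policy buys theta such that the stock fraction after trading equals a0.
theta = a0 / (1.0 + lam * a0)
y_post, z_post = theta, 1.0 - (1.0 + lam) * theta

def H(r):
    """Terminal total wealth Y_1 + Z_1 on the outcome R_1 = r."""
    return y_post * r + z_post

# Step 3: price any claim (Y^C, Z^C) via its total payoff Y^C + Z^C.
def price(claim):
    return expect(lambda r: claim(r) / H(r))
```

In this sketch `price(lambda r: 1.0)` returns 1 up to rounding, in line with ( 21.26), and the stock is priced at the upper bound (1 + λ)S ₀ of ( 21.27) because the policy buys at time 0.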

The formulation of a utility optimization problem in discrete time 0 ≤ n ≤ N for a financial market as a Markov decision model is now classical. This is also true for models with transaction costs (see Kamin [12], Constantinides [5]). However we add some new features. In particular, we use the first order condition of the optimal action as for ( 21.2). For that argument, it is necessary that the optimal action lies in the interior of the action space which is guaranteed by working with open action spaces. In fact, the first order condition leads to the martingale property in Theorem 21.4.5(a).

In Lemma 21.5.4, {H _n ⁻¹} is identified as the density process {q _n} and we see that the martingale property for {H _n ⁻¹} must necessarily hold. Moreover this property is also used in Lemma 21.5.4 to show that {H _n ⁻¹ ρ _n S _n} is a martingale as well.

The paper treats a financial model with one stock (and one bond). But models with d stocks (d > 1) and transaction costs play an important role and one can ask for extensions of the present results to models with several stocks. Numerical results show that for d > 1 the structure of the optimal policy may be complicated. Without knowing the structure of the optimal policy, one can however prove by use of the methods of Kallsen and Muhle-Karbe [11] that the main result remains true for models where the underlying probability space is finite. In fact, for such models the optimal policy defines a dynamic portfolio which is a numeraire portfolio. It seems to be unknown whether this extends to infinite probability spaces.

References

1. N. Bäuerle, U. Rieder, Markov Decision Processes with Applications in Finance (Springer, Berlin, 2011)
2. D. Becherer, The numeraire portfolio for unbounded semimartingales. Finance Stochast. 5, 327–341 (2001)
3. H. Bühlmann, E. Platen, A discrete time benchmark approach for insurance and finance. ASTIN Bull. 33, 153–172 (2003)
4. M.M. Christensen, K. Larsen, No arbitrage and the growth optimal portfolio. Stoch. Anal. Appl. 25, 255–280 (2007)
5. G.M. Constantinides, Multiperiod consumption and investment behaviour with convex transaction costs. Manag. Sci. 25, 1127–1137 (1979)
6. J. Cvitanić, I. Karatzas, Hedging and portfolio optimization under transaction costs: a martingale approach. Math. Financ. 6, 133–166 (1996)
7. M.H.A. Davis, Option pricing in incomplete markets, in Mathematics of Derivative Securities, ed. by M. Dempster, S. Pliska (Cambridge University Press, Cambridge, 1997), pp. 216–226
8. M.H.A. Davis, A.R. Norman, Portfolio selection with transaction costs. Math. Oper. Res. 15, 676–713 (1990)
9. T. Goll, J. Kallsen, A complete explicit solution to the log-optimal portfolio problem. Adv. Appl. Probab. 13, 774–779 (2003)
10. E. Jouini, H. Kallal, Martingales and arbitrage in securities markets with transaction costs. J. Econ. Theory 66, 178–197 (1995)
11. J. Kallsen, J. Muhle-Karbe, On the existence of shadow prices in finite discrete time. Math. Meth. Oper. Res. 73, 251–262 (2011)
12. J.H. Kamin, Optimal portfolio revision with a proportional transaction costs. Manag. Sci. 21, 1263–1271 (1975)
13. I. Karatzas, C. Kardaras, The numéraire portfolio in semimartingale financial models. Finance Stochast. 11, 447–493 (2007)
14. P.F. Koehl, H. Pham, N. Touzi, On super-replication in discrete time under transaction costs. Theory Probab. Appl. 45, 667–673 (2001)
15. R. Korn, M. Schäl, On value preserving and growth optimal portfolios. Math. Meth. Oper. Res. 50, 189–218 (1999)
16. R. Korn, M. Schäl, The numeraire portfolio in discrete time: existence, related concepts and applications. Radon Ser. Comput. Appl. Math. 8, 1–25 (2009). De Gruyter
17. R. Korn, F. Oertel, M. Schäl, The numeraire portfolio in financial markets modeled by a multi-dimensional jump diffusion process. Decisions Econ. Finan. 26, 153–166 (2003)
18. S. Kusuoka, Limit theorem on option replication with transaction costs. Ann. Appl. Probab. 5, 198–221 (1995)
19. J. Long, The numeraire portfolio. J. Financ. 44, 205–209 (1990)
20. M.J.P. Magill, M. Constantinides, Portfolio selection with transaction costs. J. Econ. Theory 13, 245–263 (1976)
21. E. Platen, A benchmark approach to finance. Math. Financ. 16, 131–151 (2006)
22. J. Sass, Portfolio optimization under transaction costs in the CRR model. Math. Meth. Oper. Res. 61, 239–259 (2005)
23. J. Sass, M. Schäl, Numeraire portfolios and utility-based price systems under proportional transaction costs. Decisions Econ. Finan. 37, 195–234 (2014)
24. W. Schachermayer, The fundamental theorem of asset pricing under proportional transaction costs in finite discrete time. Math. Financ. 14, 19–48 (2004)
25. M. Schäl, Portfolio optimization and martingale measures. Math. Financ. 10, 289–304 (2000)
26. M. Schäl, Price systems constructed by optimal dynamic portfolios. Math. Meth. Oper. Res. 51, 375–397 (2000)
27. S.E. Shreve, H.M. Soner, Optimal investment and consumption with transaction costs. Ann. Appl. Probab. 4, 609–692 (1994)

Author information

Authors and Affiliations

Fachbereich Mathematik, TU Kaiserslautern, Erwin-Schrödinger-Str., Kaiserslautern, D-67663, Germany
Jörn Sass
Institut für Angewandte Mathematik, Universität Bonn, Endenicher Allee 60, Bonn, D-53115, Germany
Manfred Schäl

Corresponding author

Correspondence to Jörn Sass.

Editor information

Editors and Affiliations

Stochastic Operations Research, University of Twente, Enschede, The Netherlands
Richard J. Boucherie
Stochastic Operations Research, University of Twente, Enschede, The Netherlands
Nico M. van Dijk

Appendices

1.1 Proof of Theorem 21.3.9

We will use backward induction in the dynamic programming procedure. Thus stage N − 1 will be the stage of the induction start. We set

$$\displaystyle\begin{array}{rcl} g_{N}(y,z)&:=& \log (y + z)\quad \mbox{ for }\quad (y,z) \in \mathcal{S}_{N}, {}\\ G_{N-1}(y,z)&:=& E[g_{N}(y\,R_{N},z)]\quad \mbox{ for }\quad (y,z) \in \varGamma _{N}. {}\\ \end{array}$$

For the induction, we now consider the following more general optimization problem: The gain function g(y, z) is any function on $\mathcal{S}_{N}$ satisfying the following hypotheses:

$$\displaystyle{ g\mbox{ is isotone in each component, concave, and }g(\alpha y,\alpha z)=\,\log (\alpha ) + g(y,z)\mbox{ for }\alpha>0. }$$

(21.33)

Moreover we will use the following technical assumption:

$$\displaystyle\begin{array}{rcl} & & \mbox{ For }0\neq (y',z') \in \partial \mathcal{S}_{N}\mbox{ there is a neighborhood }\mathcal{N}\mbox{ of }(y',z') \\ & & \mbox{ such that }g(y,z) =\log (y + z) + \mathrm{const}\mbox{ on }\mathcal{N}. {}\end{array}$$

(21.34)

Obviously ( 21.33) and ( 21.34) generalize the case where g = g _N. Define the objective function by G(y, z): = E[g(y R _N, z)], (y, z) ∈ Γ _N,

$$\displaystyle\begin{array}{rcl} G^{{\ast}}(y,z)&:=& \sup _{\theta \in \mathcal{A}_{N-1}(y,z)}G(y+\theta,z - K(\theta )) {}\\ & =& \sup _{\underline{\vartheta }(\pi )<\vartheta <\overline{\vartheta }(\pi )}G(y +\vartheta (y + z),z - K(\vartheta (y + z))) {}\\ \end{array}$$

for $(y,z) \in \mathcal{S}$. From dynamic programming we know that θ ^∗ = δ ^∗(y, z) is optimal in state (y, z) at stage N − 1 if G ^∗(y, z) = G(y +θ ^∗, z − K(θ ^∗)) where G ^∗ is the optimal gain function at stage N − 1 for the special case “g = g _N”. G ^∗ will inherit the properties of g.

Lemma 21.7.1.

a.
G(y,z) is concave and isotone in each component and
$$\displaystyle{G(\alpha y,\alpha z) =\log (\alpha ) + G(y,z)\mbox{ for }\alpha> 0.}$$
b.
(Concavity and Isotony of $\mathcal{A}_{N-1}$)
(i) If $\theta _{i} \in \mathcal{A}_{N-1}(y_{i},z_{i})$, γ _i > 0, i = 1,2, $\gamma _{1} +\gamma _{2} = 1$, then $\sum \gamma _{i}\theta _{i} \in \mathcal{A}_{N-1}(\sum \gamma _{i}(y_{i},z_{i}))$.
(ii) $\mathcal{A}_{N-1}$ is increasing in each component, i.e., $\mathcal{A}_{N-1}(y_{1},z_{1}) \subseteq \mathcal{A}_{N-1}(y_{2},z_{2})$ for y ₁ ≤ y ₂, z ₁ ≤ z ₂.
c.
G ^∗ (αy,αz) = log (α) + G ^∗ (y,z) for α > 0.

The simple proof is omitted. It makes use of the convexity of K and the relation

$$\displaystyle{\theta \in \mathcal{A}_{N-1}(\alpha y,\alpha z)\quad \mbox{ if and only if }\quad \theta \in \{\vartheta \,\alpha (y + z)\,:\,\underline{\vartheta } (\pi ) <\vartheta <\overline{\vartheta }(\pi )\}.}$$

The hypothesis ( 21.33) for G ^∗ in place of g will now follow from the following fact.

Proposition 21.7.2.

G ^∗ (y,z) is concave and isotone in each component.

The arguments of the proof are standard in dynamic programming (see Bäuerle and Rieder [1]). The proof of Lemma 21.7.1(c) (also standard) would show that α θ ^∗ is a maximizer for

$$\displaystyle{G^{{\ast}}(\alpha y,\alpha z) =\sup _{\theta \in \mathcal{A}_{N-1}(\alpha y,\alpha z)}G(\alpha y+\theta,\alpha z - K(\theta )),}$$

if θ ^∗ is a maximizer for G ^∗(y, z). Therefore we can restrict attention to the case y + z = 1 and we will consider $(y,z) = (\pi,1-\pi ) \in \mathcal{S}$. Now fix some π, say $\pi = \frac{1} {2}$, and consider the following sell-line ℓ ^sell and buy-line ℓ ^buy in the (y, z)-plane parametrized by ϑ:

$$\displaystyle\begin{array}{rcl} \ell^{\mathrm{sell}}& =& \left \{\left (\frac{1} {2}+\vartheta, \frac{1} {2} - (1-\mu )\vartheta \right )\,:\,\vartheta \in \mathbb{R}\right \}, {}\\ \ell^{\mathrm{buy}}& =& \left \{\left (\frac{1} {2}+\vartheta, \frac{1} {2} - (1+\lambda )\vartheta \right )\,:\,\vartheta \in \mathbb{R}\right \}. {}\\ \end{array}$$

Proposition 21.7.3.

The maxima of G on ℓ ^sell ∩Γ _N and on ℓ ^buy ∩Γ _N are attained.

Proof.

(i) We will only consider ℓ ^sell and set R: = R _N. We know that $(y_{N},z_{N}):= (\frac{1} {2} + \overline{\vartheta }, \frac{1} {2} - (1-\mu )\overline{\vartheta }) \in \partial \varGamma _{N}$ where $\overline{\vartheta }:= \overline{\vartheta }(\frac{1} {2})$. Now set $s:=\vartheta -\overline{\vartheta } <0$ and define the concave function

$$\displaystyle{I(\vartheta ):= G(\frac{1} {2}+\vartheta, \frac{1} {2} - (1-\mu )\vartheta ) = I(s + \overline{\vartheta }) = E[g((y_{N} + s)R,z_{N} - (1-\mu )s)].}$$

We will show below that the one-sided derivative $\frac{d^{-}} {d\vartheta } I(\vartheta ) = \frac{d^{-}} {ds} I(s + \overline{\vartheta })$ is negative if ϑ is close to $\overline{\vartheta }$. This fact implies that I(ϑ) is decreasing if ϑ approaches $\overline{\vartheta }$ and thus I(ϑ) cannot be close to sup I. We only consider the case where y _N > 0, z _N < 0. A similar argument will hold for the other boundary point of ℓ ^sell.

(ii) Now we study $\frac{d^{-}} {ds} I(s + \overline{\vartheta }) = E[\frac{d^{-}} {ds} g((y_{N} + s)R,z_{N} - (1-\mu )s)]$, where the equality follows from the monotone convergence theorem and the concavity. If $0 <\eta <y_{N} \wedge (1 -\mu -\underline{ R})$ is small, then ((y _N + s)r, z _N − (1 −μ)s) is close to $(y_{N}\underline{R},z_{N}) \in \partial \mathcal{S}_{N}$ for −η < s < 0 and $\underline{R} \leq r \leq \underline{ R}+\eta$. By hypothesis ( 21.34) we then may assume that

$$\displaystyle{ g(y,z) =\log (y + z) + \mathrm{const}\quad \mbox{ for }\quad (y,z) = ((y_{N} + s)r,z_{N} - (1-\mu )s). }$$

(21.35)

In order to use Fatou’s lemma we will show that $\frac{d^{-}} {ds} g((y_{N} + s)R,z_{N} - (1-\mu )s)$ is bounded from above by some c, say. Indeed we know from ( 21.33) that

$$\displaystyle{g((y_{N} + s)r,z_{N} - (1-\mu )s) =\log ((y_{N} + s)r) + g(1,q(s)/r)}$$

for q(s): = (z _N − (1 −μ)s)∕(y _N + s). Note that q(s) is decreasing. Now g(1, q(s)∕r) inherits this property since g is increasing; therefore its one-sided derivative $\frac{d^{-}} {ds}$ is bounded from above by zero. The derivative of log((y _N + s)r) is obviously bounded from above. Now we can conclude

$$\displaystyle{\limsup _{\vartheta \rightarrow \overline{\vartheta }}\frac{d^{-}} {d\vartheta } I(\vartheta ) \leq E[\limsup _{s\nearrow 0}\frac{d^{-}} {ds} g((y_{N} + s)R,z_{N} - (1-\mu )s)] \leq A + c\,P(R>\underline{ R}+\eta ),}$$

where $A:= E\left [1_{\{R\leq \underline{R}+\eta \}}(y_{N}R + z_{N})^{-1}(R - (1-\mu ))\right ]$ in view of ( 21.35). There we have $R - (1-\mu ) \leq (\underline{R}+\eta ) - (1-\mu ) \leq \underline{ R} - (1-\mu )+\eta <0$. Now (y _N, z _N) ∈ ∂ Γ _N implies $\underline{R}\,y_{N} + z_{N} = 0$ and thus $(y_{N}R + z_{N})^{-1} = (y_{N}(R -\underline{ R}))^{-1}$. From (R3) we then know that $E[1_{\{R\leq \underline{R}+\eta \}}(y_{N}\,R + z_{N})^{-1}] = \infty$. This finally implies $A = -\infty$. □

Definition 21.7.4.

Let (y ₋, z ₋) and (y ₊, z ₊) be maximum points of G on ℓ ^sell ∩Γ _N and ℓ ^buy ∩Γ _N, respectively. If there is more than one, define (y ₋, z ₋) (resp. (y ₊, z ₊)) such that the y-value y ₋ is maximal (resp. y ₊ is minimal). Set a: = y ₊∕(y ₊ + z ₊), b: = y ₋∕(y ₋ + z ₋).

Then in view of Lemma 21.7.1 we have for each α > 0

$$\displaystyle\begin{array}{rcl} G(\alpha y_{-},\alpha z_{-})& \geq & G(\alpha y_{-}+\theta,\alpha z_{-}- (1-\mu )\theta )\mbox{ for all }\theta \mbox{ and } \textquotedblleft \mbox{ $>$ }\textquotedblright \mbox{ if }\theta> 0 \\ G(\alpha y_{+},\alpha z_{+})& \geq & G(\alpha y_{+}+\theta,\alpha z_{+} - (1+\lambda )\theta )\mbox{ for all }\theta \mbox{ and } \textquotedblleft \mbox{ $>$ }\textquotedblright\mbox{ if }\theta <0.{}\end{array}$$

(21.36)

Lemma 21.7.5.

a ≤ b.

Since the proof is similar to the proofs in the literature (Sass and Schäl [23] applies literally), it will be omitted. We will now study the following non-empty cones.

Definition 21.7.6.

$\mathcal{T}^{\mathrm{sell}}:=\{ (y,z) \in \mathcal{S}\,;\,b <\pi <1/\mu \}$,

$\mathcal{T}^{\mathrm{buy}}:=\{ (y,z) \in \mathcal{S}\,;\,-1/\lambda <\pi <a\}$,

$\mathcal{T}^{\mathrm{notr}}:=\{ (y,z) \in \mathcal{S}\,;\,a \leq \pi \leq b\} = \mathcal{S}\setminus (\mathcal{T}^{\mathrm{sell}} \cup \mathcal{T}^{\mathrm{buy}})$.

By the definition of y _±, the interval [a, b] is chosen as large as possible. Thus one does not need to trade under the optimal policy if it is not absolutely necessary.

Proposition 21.7.7.

For $(y,z) \in \mathcal{T}^{\mathrm{notr}}$ , it is optimal not to buy and not to sell.

For $(y,z) \in \mathcal{T}^{\mathrm{sell}}$ it is optimal to sell |θ ₋ | where $\theta _{-} =\delta ^{{\ast}}(y,z)$ is defined by ( 21.37 ) below.

For $(y,z) \in \mathcal{T}^{\mathrm{buy}}$ it is optimal to buy θ ₊ where $\theta _{+} =\delta ^{{\ast}}(y,z)$ is defined by ( 21.38 ) below.

Proof.

If $(y,z) \in \mathcal{T}^{\mathrm{sell}}$, then

$$\displaystyle{ (y +\theta _{-},z - (1-\mu )\theta _{-}) =\alpha '(b,1 - b) =\alpha (y_{-},z_{-}) \in \alpha \,\ell^{\mathrm{sell}} }$$

(21.37)

for some α, α′ > 0, $\theta _{-} <0$. As a consequence

$$\displaystyle\begin{array}{rcl} G(y +\theta _{-},z - (1-\mu )\theta _{-})& =& G(\alpha y_{-},\alpha z_{-}) {}\\ & =& \text{max}_{\theta }G(\alpha y_{-}+\theta,\alpha z_{-}- (1-\mu )\theta ) {}\\ & =& \text{max}_{\theta '}G(y +\theta ',z - (1-\mu )\theta ') {}\\ & \geq & \text{max}_{\theta '\geq 0}G(y +\theta ',z - (1+\lambda )\theta ') {}\\ \end{array}$$

in view of Lemma 21.7.1(c) and ( 21.36). Since

$$\displaystyle{G^{{\ast}}(y,z) = \text{max}\{\sup _{\theta \geq 0}G(y+\theta,z - (1+\lambda )\theta ),\sup _{\theta \leq 0}G(y+\theta,z - (1-\mu )\theta )\},}$$

we conclude that G(y +θ ₋, z − (1 −μ)θ ₋) = G ^∗(y, z). Hence it is optimal to sell | θ ₋ | (i.e. buy θ ₋ < 0) in state (y, z).

Now let $(y,z)\notin \mathcal{T}^{\mathrm{sell}}$. Then $(y,z) = (\alpha y_{-} +\theta _{-},\alpha z_{-}- (1-\mu )\theta _{-})$ for some α > 0, $\theta _{-} \leq 0$. Now G(α y ₋ +θ, α z ₋− (1 −μ)θ) is concave in θ. Then for ε > 0 we know that G(α y ₋, α z ₋) ≥ G(y, z) ≥ G(y −ε, z − (1 −μ)(−ε)). Therefore “no selling” is at least as good as “selling any amount ε” in state (y, z).

Analogous results hold for $\mathcal{T}^{\mathrm{buy}}$ where we define θ ₊ for $(y,z) \in \mathcal{T}^{\mathrm{buy}}$ by

$$\displaystyle{ (y +\theta _{+},z - (1+\lambda )\theta _{+}) =\alpha '(a,1 - a) =\alpha (y_{+},z_{+}) }$$

(21.38)

for some α, α′ > 0, θ ₊ > 0. □
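The cone structure described by Proposition 21.7.7 together with ( 21.37) and ( 21.38) translates into a short decision rule. The following is a minimal sketch, assuming the boundaries a ≤ b are already known and that y + z > 0, so that π = y∕(y + z) is well defined. The function names are hypothetical. The scaling factor α′ is obtained by solving ( 21.37) or ( 21.38) for α′, as in the proof of Corollary 21.2.

```python
def cone_policy(y, z, a, b, lam, mu):
    """Return the trade theta (bought if > 0, sold if < 0) in state (y, z)."""
    pi = y / (y + z)                     # stock fraction, assumes y + z > 0
    if pi > b:                           # sell region: move to the ray through (b, 1 - b)
        alpha = ((1.0 - mu) * y + z) / (1.0 - mu * b)
        return alpha * b - y
    if pi < a:                           # buy region: move to the ray through (a, 1 - a)
        alpha = ((1.0 + lam) * y + z) / (1.0 + lam * a)
        return alpha * a - y
    return 0.0                           # no-trade region

def apply_trade(y, z, theta, lam, mu):
    """Portfolio after trading theta, with cost K(theta) charged to the bank account."""
    cost = (1.0 + lam) * theta if theta >= 0.0 else (1.0 - mu) * theta
    return y + theta, z - cost

# Usage with illustrative boundaries a = 0.2, b = 0.3 and 1% costs.
theta_sell = cone_policy(0.8, 0.2, 0.2, 0.3, 0.01, 0.01)
y_s, z_s = apply_trade(0.8, 0.2, theta_sell, 0.01, 0.01)
theta_buy = cone_policy(0.0, 1.0, 0.2, 0.3, 0.01, 0.01)
y_b, z_b = apply_trade(0.0, 1.0, theta_buy, 0.01, 0.01)
```

After the trade the stock fraction sits exactly on the nearest boundary of the no-trade region, which reflects the remark that one does not trade more than necessary.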

Corollary 21.2.

a.
Let (y,z) be in the closure of $\mathcal{T}^{\mathrm{sell}}$ . Then
$$\displaystyle{G^{{\ast}}(y,z) =\log ((1-\mu )y + z) + G(b,1 - b) -\log (1 -\mu b).}$$
b.
Let (y,z) be in the closure of $\mathcal{T}^{\mathrm{buy}}$ . Then
$$\displaystyle{G^{{\ast}}(y,z) =\log ((1+\lambda )y + z) + G(a,1 - a) -\log (1 +\lambda a)}$$
c.
For $(y,z) \in \mathcal{T}^{\mathrm{notr}}$ we have G ^∗ (y,z) = G(y,z).

Proof.

We only consider (a). By continuity it is sufficient to consider $(y,z) \in \mathcal{T}^{\mathrm{sell}}$. Then it is optimal to sell | θ ₋ | yielding according to ( 21.37)

$$\displaystyle{G^{{\ast}}(y,z) = G(y +\theta _{ -},z - (1-\mu )\theta _{-}) = G(\alpha b,\alpha (1 - b)) =\log (\alpha ) + G(b,1 - b).}$$

From $(y +\theta _{-},z - (1-\mu )\theta _{-}) = (\alpha b,\alpha (1 - b))$ we get $\alpha = ((1-\mu )y + z)/(1 -\mu b)$. □
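The closed form in Corollary 21.2(a) can be checked numerically for the special case g = g _N, i.e. G(y, z) = E[log(yR + z)]. The sketch below assumes a hypothetical binomial return and μ = 0.01; the state (0.8, 0.2), the search intervals and all names are illustrative. The boundary b is computed from the condition E[(R − (1 − μ))∕(bR + 1 − b)] = 0 for a maximum on the sell-line, and the supremum over selling is found by a ternary search, which is justified by the concavity of G along the sell-line.

```python
import math

mu = 0.01
returns = [(1.5, 0.5), (0.6, 0.5)]    # pairs (value of R_N, probability)

def expect(f):
    return sum(prob * f(r) for r, prob in returns)

def G(y, z):
    """G(y, z) = E[log(y R + z)] for the special case g = g_N."""
    return expect(lambda r: math.log(y * r + z))

def foc_sell(x):
    """Derivative of G along the sell-line at the point (x, 1 - x)."""
    return expect(lambda r: (r - (1.0 - mu)) / (x * r + 1.0 - x))

# Sell boundary b by bisection (assumes a sign change of foc_sell on [0, 1]).
lo, hi = 0.0, 1.0
for _ in range(200):
    mid = 0.5 * (lo + hi)
    if foc_sell(mid) > 0.0:
        lo = mid
    else:
        hi = mid
b = 0.5 * (lo + hi)

y, z = 0.8, 0.2                       # state with pi = 0.8 > b (sell region)

def after_selling(theta):
    return G(y + theta, z - (1.0 - mu) * theta)

# Ternary search for the maximum over theta in [-y, 0].
lo, hi = -y, 0.0
for _ in range(200):
    m1 = lo + (hi - lo) / 3.0
    m2 = hi - (hi - lo) / 3.0
    if after_selling(m1) < after_selling(m2):
        lo = m1
    else:
        hi = m2
numeric = after_selling(0.5 * (lo + hi))

closed_form = (math.log((1.0 - mu) * y + z) + G(b, 1.0 - b)
               - math.log(1.0 - mu * b))
```

The two values `numeric` and `closed_form` should agree up to the accuracy of the search.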

From the corollary we conclude that G ^∗ and $\mathcal{S}$ satisfy hypothesis ( 21.34) in place of g and $\mathcal{S}_{N}$.

Now we can start the induction step of dynamic programming in order to find an optimal trading strategy {δ _n, 0 ≤ n < N} which is known to be Markovian, i.e. δ _n is a function of the state $(y,z) \in \mathcal{S}$ in stage n. Upon choosing g = g _N, G = G _N−1 (defined as above), we obtain δ _N−1: = δ ^∗ where δ ^∗ is also defined as above. As G ^∗ satisfies the hypothesis imposed on g, we can now repeat the optimization step, if we replace $\mathcal{S}_{N}$ by $\mathcal{S}$ and $\mathcal{A}_{N-1}$ by $\mathcal{A}(y,z):=\{\theta \,:\, (y+\theta,z - K(\theta )) \in \varGamma \}$.

1.2 Proof of Theorem 21.4.5

From now on we use the notion martingale for a martingale under P (and not under Q) and we write $E_{n}[\,\cdot \,]:= E[\,\cdot \,\vert \,\mathcal{F}_{n}]$ for the conditional expectations given R ₁, …, R _n.

1.2.1 Induction Start

Set R = R _N, a = a _N−1, b = b _N−1, $\hat{G}(y,z) = G_{N-1}(y,z) = E[\log (y\,R + z)]$.

Lemma 21.7.8.

$\frac{\partial } {\partial \theta }\hat{G}(y+\theta,z - k\theta )\vert _{\theta =0} = E[(R - k)(yR + z)^{-1}]$ for $k> 0$.

Proof.

We will prove

$$\displaystyle{ \frac{\partial ^{\pm }} {\partial \theta } \hat{G}(y+\theta,z - k\theta )\vert _{\theta =0} = E[(R - k)(yR + z)^{-1}]\quad \mbox{ for }\quad k> 0. }$$

(21.39)

We know that log((y +θ)R + z − kθ) and thus $\hat{G}(y+\theta,z - k\theta )$ are concave in θ. In $\lim _{\theta \rightarrow 0\pm }\frac{1} {\theta } \left (\hat{G}(y+\theta,z - k\theta ) -\hat{ G}(y,z)\right )$ we only need to interchange lim and expectation which can be justified by monotone convergence. □
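The derivative formula of Lemma 21.7.8 is easy to confirm with a finite difference. The following sketch assumes a hypothetical two-point distribution for R; the state (0.3, 0.7), the value k = 1.01 and the step size are illustrative choices, and the function names are not from the paper.

```python
import math

returns = [(1.5, 0.5), (0.6, 0.5)]    # pairs (value of R, probability)

def expect(f):
    return sum(prob * f(r) for r, prob in returns)

def G_hat(y, z):
    """G_hat(y, z) = E[log(y R + z)]."""
    return expect(lambda r: math.log(y * r + z))

def derivative_formula(y, z, k):
    """Right-hand side of Lemma 21.7.8: E[(R - k) (y R + z)^{-1}]."""
    return expect(lambda r: (r - k) / (y * r + z))

def central_difference(y, z, k, h=1e-6):
    """Numerical derivative of theta -> G_hat(y + theta, z - k theta) at 0."""
    return (G_hat(y + h, z - k * h) - G_hat(y - h, z + k * h)) / (2.0 * h)

exact = derivative_formula(0.3, 0.7, 1.01)
approx = central_difference(0.3, 0.7, 1.01)
```

Since the map θ ↦ Ĝ(y + θ, z − kθ) is smooth in this finite example, the left and right derivatives in ( 21.39) coincide and a central difference is adequate.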

Lemma 21.7.9.

Let (Y _N−1 ,Z _N−1 ) = (y,z), a ≤π ≤ b. Then

a.
$E[R(y\,R + z)^{-1}] \leq (1+\lambda )E[(y\,R + z)^{-1}]$;
b.
$E[R(y\,R + z)^{-1}] \geq (1-\mu )E[(y\,R + z)^{-1}]$.

Proof.

(a) In (y, z) “not to order” is at least as good as “to buy”, hence

$$\displaystyle{0 \geq \frac{1} {\theta } \left (\hat{G}(y+\theta,z - (1+\lambda )\theta ) -\hat{ G}(y,z)\right )\quad \mbox{ for }\quad \theta> 0}$$

by the optimality criterion ( 21.14). Part (b) is similar. □

Lemma 21.7.10 (First Order Condition).

a.
$E[R(b\,R + 1 - b)^{-1}] = (1-\mu )E[(b\,R + 1 - b)^{-1}]$;
b.
$E[R(a\,R + 1 - a)^{-1}] = (1+\lambda )E[(a\,R + 1 - a)^{-1}]$.

Proof.

(a) By Theorem 21.3.9, (b, 1 − b) is a maximum point on the sell-line through (b, 1 − b) and (a, 1 − a) is a maximum point on the buy-line through (a, 1 − a). Now Lemma 21.7.8 applies. □

Lemma 21.7.11.

a.
1 −μ ≤ρ _N−1 ≤ 1 + λ;
b.
$\hat{\rho }_{N-1}(a) = 1+\lambda =\hat{\rho } _{N-1}(\pi )$ for π ≤ a; $\hat{\rho }_{N-1}(b) = 1-\mu =\hat{\rho } _{N-1}(\pi )$ for π ≥ b.

Proof.

In view of Lemma 21.4.3(b), (c), we only consider the case (Y _N−1, Z _N−1) = (y, z), a ≤ π ≤ b. Then we have H _N = y R + z.

We get $\hat{\rho }_{N-1}(\pi ) = E[RH_{N}^{-1}]/E[H_{N}^{-1}]$ from Lemma 21.4.3 and thus statement (a) from Lemma 21.7.9. In the same way we obtain (b) from Lemma 21.7.10. □

Theorem 21.7.12.

a.
E _N−1 [H _N ⁻¹ ] = H _N−1 ⁻¹ (martingale property of H ⁻¹ );
b.
E _N−1 [ρ _N R _N H _N ⁻¹ ] = ρ _N−1 H _N−1 ⁻¹.

Proof.

(a) We have

$$\displaystyle\begin{array}{rcl} 1& =& E_{N-1}[H_{N}H_{N}^{-1}] = E_{ N-1}[(\rho _{N}Y _{N} + Z_{N})H_{N}^{-1}] {}\\ & =& \overline{Y }_{N-1}E_{N-1}[\rho _{N}R_{N}H_{N}^{-1}] + \overline{Z}_{ N-1}E_{N-1}[H_{N}^{-1}] {}\\ & =& \left (\rho _{N-1}\overline{Y }_{N-1} + \overline{Z}_{N-1}\right )E_{N-1}[H_{N}^{-1}] {}\\ & =& \left (\rho _{N-1}(Y _{N-1} +\varDelta _{N-1}) + Z_{N-1} - K(\varDelta _{N-1})\right )E_{N-1}[H_{N}^{-1}] {}\\ & =& \left (H_{N-1} +\rho _{N-1}\varDelta _{N-1} - K(\varDelta _{N-1})\right )E_{N-1}[H_{N}^{-1}]. {}\\ \end{array}$$

From Lemma 21.7.11(b) we get ρ _N−1 Δ _N−1 = K(Δ _N−1) which yields (a).

Part (b) follows now from the definition of ρ _N−1. □

Corollary 21.3 (Induction Start).

For k > 0

$$\displaystyle{\frac{\partial } {\partial \theta }E_{N-1}[G_{N}^{{\ast}}((y+\theta )R_{ N},z - k\theta )]\vert _{\theta =0} = (\rho _{N-1} - k)H_{N-1}^{-1}}$$

where G _N ^∗ (y,z) = log (y + z).

Proof.

Lemma 21.7.8 applies directly, where H _N = y R _N + z. □

We thus know that the following induction hypothesis holds for n = N − 1:

Induction Hypothesis 21.7.13.

i.
For Y _n = y, Z _n = z, Π _n = π
$$\displaystyle{\frac{\partial } {\partial \theta }E[G_{n+1}^{{\ast}}((y+\theta )R_{ n+1},z - k\theta )]\vert _{\theta =0} = (\hat{\rho }_{n}(\pi ) - k)H_{n}^{-1}\mbox{ for }a_{ n} \leq \pi <b_{n};}$$
ii.
$\hat{\rho }_{n}(a_{n}) = 1+\lambda =\hat{\rho } _{n}(\pi )$ for π ≤ a _n; $\hat{\rho }_{n}(b_{n}) = 1-\mu =\hat{\rho } _{n}(\pi )$ for π ≥ b _n.

1.2.2 Induction Step “N > n → n − 1”

We assume throughout this section that the induction hypothesis holds for n < N. Suppose that Y _n−1 = y, Z _n−1 = z are given. We know that $\varPi _{n} =\hat{\varPi } (\pi,R_{n})$ where $\hat{\varPi }$ is defined by ( 21.16) and set G(y, z): = E[G _n ^∗(yR _n, z)], hence G _n−1 ^∗(y, z) = sup_θ G(y +θ, z − K(θ)), $\rho _{n}:=\hat{\rho } _{n}(\pi _{n})$. Then we have H _n = ρ _n yR _n + z for a _n−1 ≤ π ≤ b _n−1.

Proposition 21.7.14.

Suppose a _n−1 ≤π ≤ b _n−1 and k > 0. Then

$$\displaystyle{\frac{d} {d\theta }G(y+\theta,z - k\theta )\vert _{\theta =0} = E_{n-1}[(\rho _{n}R_{n} - k)H_{n}^{-1}] = (\hat{\rho }_{ n-1}(\pi ) - k)E_{n-1}[H_{n}^{-1}]}$$

Proof.

Let y, z be arbitrary. We consider one-sided derivatives. Since $\theta \mapsto G_{n}^{{\ast}}((y+\theta )R_{n},z - k\theta )$ is concave by Theorem 21.3.9, we can interchange lim (i.e. $\frac{d^{\pm }} {d\theta }$) and E[ ⋅ ] by the monotone convergence theorem. Consider first lim_{θ → 0+}.

Then we have to study for fixed R _n = s and hence for fixed Π _n = ys∕(ys + z)

$$\displaystyle{ \lim _{\theta \rightarrow 0+}\frac{1} {\theta } \left (G_{n}^{{\ast}}((y+\theta )s,z - k\theta ) - G_{ n}^{{\ast}}(ys,z)\right ). }$$

(21.40)

Case (i, ii): π _n ≥ b _n or π _n < a _n, respectively. We know (by Theorem 21.3.9) that $G_{n}^{{\ast}}(ys,z) =\log (\ell\,y\,s + z) + \mathrm{const}$ with ℓ = 1 −μ or ℓ = 1 +λ, respectively. By continuity this is also true for π _n = b _n and π _n = a _n. We can write for the limit in ( 21.40)

$$\displaystyle\begin{array}{rcl} \frac{d^{+}} {d\theta } \log (\ell(y+\theta )s + z - k\,\theta )\vert _{\theta =0}& =& (\ell\,s - k)(\ell\,y\,s + z)^{-1} {}\\ & & \mbox{ } = (\hat{\rho }_{n}(\varPi _{n})s - k)(\hat{\rho }_{n}(\varPi _{n})y\,s + z)^{-1} = (\hat{\rho }_{ n}(\varPi _{n})s - k)H_{n}^{-1}.{}\\ \end{array}$$

Case (iii) a _n ≤ π _n < b _n. Then $G_{n}^{{\ast}}(ys,z) = E_{n}[G_{n+1}^{{\ast}}(y\,s\,R_{n+1},z)]$ by the optimality properties ( 21.13), ( 21.14) and Theorem 21.3.9. Hence for small θ

$$\displaystyle\begin{array}{rcl} & & \mbox{ }\frac{1} {\theta } \left (G_{n}^{{\ast}}((y+\theta )R_{ n},z - k\theta ) - G_{n}^{{\ast}}(yR_{ n},z)\right ) {}\\ & =& E\left [\frac{1} {\theta } \left (G_{n+1}^{{\ast}}((y+\theta )sR_{ n+1},z - k\theta ) - G_{n+1}^{{\ast}}(ysR_{ n+1},z)\right )\right ] {}\\ & =& s\,E\left [\frac{1} {s\theta }\left (G_{n+1}^{{\ast}}((ys +\theta s)R_{ n+1},z -\frac{k} {s}s\theta ) - G_{n+1}^{{\ast}}(ysR_{ n+1},z)\right )\right ]. {}\\ \end{array}$$

The latter term converges for θ → 0+ by Induction Hypothesis 21.7.13 (i) to $s\,(\hat{\rho }_{n}(\pi _{n}) - k/s)H_{n}^{-1} = (s\hat{\rho }_{n}(\pi _{n}) - k)H_{n}^{-1}$.

Altogether for all cases:

$$\displaystyle{\lim _{\theta \rightarrow 0+}\frac{1} {\theta } \left (G_{n}^{{\ast}}(y\,s + s\,\theta,z - k\theta ) - G_{ n}^{{\ast}}(y\,s,z)\right ) = (\hat{\rho }_{ n}(\varPi _{n})s - k)H_{n}^{-1}.}$$

Thus we finally obtain

$$\displaystyle{\lim _{\theta \rightarrow 0+}\frac{1} {\theta } \left (G(y+\theta,z - k\theta ) - G(y,z)\right ) = E_{n-1}[(\hat{\rho }_{n}(\pi _{n}) \cdot R_{n} - k)H_{n}^{-1}].}$$

The case lim_{θ → 0−} is similar. □

Lemma 21.7.15.

a _n−1 < b _n−1 for (λ,μ)≠(0,0).

Proof.

We will write a = a _n−1, b = b _n−1. We must prove that a ≠ b since we know a ≤ b. Assume that a = b. Then a and b are maximum points on the buy-line and the sell-line through (a, 1 − a) = (b, 1 − b), respectively. From Proposition 21.7.14 we then obtain for y = a = b, k ∈ { 1 +λ, 1 −μ}

$$\displaystyle{\frac{d} {d\theta }G(y+\theta,z - k\theta )\vert _{\theta =0} = E_{n-1}[(\rho _{n}R_{n} - k)H_{n}^{-1}] = 0,}$$

hence $E_{n-1}[\rho _{n}R_{n}H_{n}^{-1}] = k\,E_{n-1}[H_{n}^{-1}]$. This equation cannot hold for two different values of $k \in \{ 1+\lambda,1-\mu \}$. Thus a < b. □
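The last step of this proof can be spelled out as follows. This is a sketch under the assumptions λ, μ ≥ 0 and $H_{n} > 0$, which the model suggests but which are not restated here. The left-hand side of the identity does not depend on k, so writing it for k = 1 + λ and for k = 1 − μ and subtracting gives

$$\displaystyle{0 = (1+\lambda )\,E_{n-1}[H_{n}^{-1}] - (1-\mu )\,E_{n-1}[H_{n}^{-1}] = (\lambda +\mu )\,E_{n-1}[H_{n}^{-1}].}$$

For (λ, μ) ≠ (0, 0) we have λ + μ > 0, and $E_{n-1}[H_{n}^{-1}] > 0$ under the positivity assumption, so the right-hand side is strictly positive. This contradicts the assumption a = b.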

Proposition 21.7.16.

a. $1-\mu \leq \rho _{n-1} \leq 1+\lambda$;
b. $\hat{\rho }_{n-1}(a_{n-1}) = 1+\lambda =\hat{\rho } _{n-1}(\pi )$ for $\pi \leq a_{n-1}$, and $\hat{\rho }_{n-1}(b_{n-1}) = 1-\mu =\hat{\rho } _{n-1}(\pi )$ for $\pi \geq b_{n-1}$.

Proof.

By use of Proposition 21.7.14, the proof is similar to that of Lemma 21.7.11. □

Proposition 21.7.17.

The martingale property of $\{H_{n-1}^{-1},H_{n}^{-1}\}$ holds: $E_{n-1}[H_{n}^{-1}] = H_{n-1}^{-1}$.

Proof.

By use of Propositions 21.7.14 and 21.7.16, the proof is the same as the proof of Theorem 21.7.12(a). □

In view of Propositions 21.7.16 and 21.7.17, we have thus proved Theorem 21.4.5 for n − 1, and the proof by induction is finished.

1.3 Notation

Since we have a non-stationary model and need some concepts (and their notation) from finance, our notation is not always standard. In this appendix we relate some of it to the concepts of the classical MDP.

$\mathcal{S}$ and $\mathcal{S}_{N}$	state space at time n < N and at time N, respectively,
$(y,z) \in \mathbb{R}^{2}$	state vector,
log(y + z)	final reward at time N depending on the final state (Y _N, Z _N) = (y, z); the reward at time n < N is 0,
θ	action,
E[log((y + θ)R _N + z − K(θ))]	expected one-step reward at time N − 1 in state (y, z) under action θ,
$\mathcal{A}(y,z),\mathcal{A}_{N-1}(y,z)$	set of actions available in state (y, z) at time n < N − 1 and at time N − 1, respectively,
δ _n	decision rule at time n,
δ _n(y, z)	action at time n under decision rule δ _n if in state (y, z),
{δ ₀, …, δ _N−1} = {δ _n}	policy with decision rule δ _n at time n = 0, 1, …, N − 1.

Further,

$$\displaystyle\begin{array}{rcl} P(B' \times B''\,\vert \,n,y_{n-1},z_{n-1},\theta _{n-1})& =& \int _{B'\times B''}P(dy_{n},dz_{n}\,\vert \,n,y_{n-1},z_{n-1},\theta _{n-1}) {}\\ & & \mbox{ } = \mathrm{Prob}\left (((y_{n-1} +\theta _{n-1})R_{n},z_{n-1} - K(\theta _{n-1})) \in B' \times B''\right ){}\\ \end{array}$$

for measurable $B' \times B'' \subseteq \mathbb{R}^{2}$ is the (non-stationary) transition probability, and

$$\displaystyle{E[\log (Y _{N} + Z_{N})\,\vert \,Y _{n} = y,Z_{n} = z]}$$

is the value function at time n in state (y, z) over N − n future steps under a Markov policy with decision rules {δ ₀, …, δ _N−1}, where (Y _m, Z _m) for n < m ≤ N is described by the random variables R _n+1, …, R _N according to

$$\displaystyle{Y _{m+1} = (Y _{m} +\delta _{m}(Y _{m},Z_{m}))R_{m+1}\quad \mbox{ and }\quad Z_{m+1} = Z_{m} - K(\delta _{m}(Y _{m},Z_{m})).}$$

Finally, the optimal value function at time n in state (y, z) over N − n future steps is

$$\displaystyle{G_{n}^{{\ast}}(y,z) =\sup \, E[\log (Y _{ N} + Z_{N})\,\vert \,Y _{n} = y,Z_{n} = z],}$$

where the supremum is taken over all admissible Markov policies.
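The dynamics and the value function above can be illustrated by a small simulation. The following is a minimal sketch under stated assumptions, not the authors' algorithm: the cost function K is assumed to be K(θ) = (1 + λ)θ for θ ≥ 0 and (1 − μ)θ for θ < 0, the returns are assumed i.i.d. with a hypothetical two-point distribution, and the cone rule uses hypothetical constant thresholds a, b on the ratio y∕(y + z). The Monte Carlo average approximates E[log(Y_N + Z_N) | Y_n = y, Z_n = z] for one fixed Markov policy, which is a lower bound for $G_{n}^{{\ast}}(y,z)$ up to sampling error.

```python
import math
import random

LAM, MU = 0.01, 0.02  # hypothetical proportional cost rates for buying and selling

def K(theta, lam=LAM, mu=MU):
    """Assumed cost function: amount debited from the bank account when the stock account changes by theta."""
    return (1 + lam) * theta if theta >= 0 else (1 - mu) * theta

def step(y, z, theta, r):
    """One transition: Y' = (y + theta) * r and Z' = z - K(theta)."""
    return (y + theta) * r, z - K(theta)

def no_trade(n, y, z):
    """Decision rule that never trades."""
    return 0.0

def make_cone_rule(a, b, lam=LAM, mu=MU):
    """Decision rule trading to the nearest boundary of the cone a <= y/(y+z) <= b (thresholds are hypothetical)."""
    def rule(n, y, z):
        pi = y / (y + z)
        if pi < a:   # buy until the post-trade ratio equals a
            return (a * (y + z) - y) / (1 + a * lam)
        if pi > b:   # sell until the post-trade ratio equals b
            return (b * (y + z) - y) / (1 - b * mu)
        return 0.0   # inside the cone: do nothing
    return rule

def estimate_value(rule, n, y, z, N, sample_return, runs=20000, seed=0):
    """Monte Carlo estimate of E[log(Y_N + Z_N) | Y_n = y, Z_n = z] under the given decision rule."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(runs):
        ym, zm = y, z
        for m in range(n, N):
            theta = rule(m, ym, zm)
            ym, zm = step(ym, zm, theta, sample_return(rng))  # uses R_{m+1}
        total += math.log(ym + zm)
    return total / runs

def two_point_return(rng):
    """Hypothetical return distribution: 1.2 or 0.9 with equal probability."""
    return 1.2 if rng.random() < 0.5 else 0.9

print(estimate_value(no_trade, 0, 0.5, 0.5, 5, two_point_return))
print(estimate_value(make_cone_rule(0.3, 0.6), 0, 0.5, 0.5, 5, two_point_return))
```

Passing the time index n to the decision rule keeps the sketch compatible with the non-stationary setting, even though the example rules ignore it.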


Cite this chapter

Sass, J., Schäl, M. (2017). Optimal Portfolios and Pricing of Financial Derivatives Under Proportional Transaction Costs. In: Boucherie, R., van Dijk, N. (eds) Markov Decision Processes in Practice. International Series in Operations Research & Management Science, vol 248. Springer, Cham. https://doi.org/10.1007/978-3-319-47766-4_21

DOI: https://doi.org/10.1007/978-3-319-47766-4_21
Published: 11 March 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-47764-0
Online ISBN: 978-3-319-47766-4
eBook Packages: Business and Management (R0)
