Finite-horizon optimal investment with transaction costs: construction of the optimal strategies

Belak, Christoph; Sass, Jörn

doi:10.1007/s00780-019-00404-4

Finite-horizon optimal investment with transaction costs: construction of the optimal strategies

Published: 05 September 2019

Volume 23, pages 861–888, (2019)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Finance and Stochastics Aims and scope Submit manuscript

Finite-horizon optimal investment with transaction costs: construction of the optimal strategies

Download PDF

Christoph Belak¹ &
Jörn Sass²

652 Accesses
5 Citations
Explore all metrics

Abstract

We revisit the problem of maximising expected utility of terminal wealth in a Black–Scholes market with proportional transaction costs. While it is known that the value function of this problem is the unique viscosity solution of the HJB equation and that the HJB equation admits a classical solution on a reduced state space, it has been an open problem to verify that these two coincide. We establish this result by devising a verification procedure based on superharmonic functions. In the process, we construct optimal strategies and provide a detailed analysis of the regularity of the value function.

Optimal investment with random endowments and transaction costs: duality theory and shadow prices

Article 01 September 2018

Almost Surely Optimal Portfolios Under Proportional Transaction Costs

Optimal Investment with Bounded VaR for Power Utility Functions

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The aim of this paper is to solve the problem of maximising expected utility of terminal wealth for an investor facing proportional transaction costs in a Black–Scholes market. While this problem has been studied extensively in the literature, it is still an open problem in the finite-horizon case to construct optimal strategies and verify their optimality. The aim of this paper is to close this gap.

Starting with the seminal contribution of Magill and Constantinides [15], the continuous-time optimal investment problem with proportional transaction costs has been studied extensively over the last decades, and three different approaches to tackle this type of problem have emerged: (1) The primal approach based on stochastic control and viscosity solution theory, in which one studies the Hamilton–Jacobi–Bellman (HJB) equation of the problem; (2) the dual approach based on shadow prices, in which one determines an auxiliary frictionless market with unfavourable price processes yielding the same optimal strategy as the original problem; and (3) asymptotics for vanishing costs.

For the problem of optimal consumption over an infinite horizon, the primal approach was utilised, among others, by Davis and Norman [23], Shreve and Soner [37], Akian et al. [1], Kabanov and Klüppelberg [32] and de Vallière et al. [25], whereas Kallsen and Muhle-Karbe [34], Choi et al. [13] and Herczegh and Prokaj [29] used the dual approach to solve this problem. Asymptotic optimality results were obtained by Janeček and Shreve [31] and Gerhold et al. [27]. Moreover, Akian et al. [3] use the primal approach and Gerhold et al. [28] and Gerhold et al. [26] the dual approach to determine closed-form solutions for the problem of maximising the asymptotic growth rate under (small) transaction costs.

In the present paper, we focus on the finite-horizon optimal terminal wealth problem without intermediate consumption in a Black–Scholes market, which was introduced in Akian et al. [2]. The HJB equation of this problem has also been studied in Davis et al. [24] in the context of utility indifference pricing. An adaptation of the results of Davis et al. [24] implies that the value function is a viscosity solution of the HJB equation, and uniqueness holds in the case of bounded utility functions. Belak et al. [9] extend the uniqueness result to more general utility functions including log and power utility. Dai and Yi [21] show that the HJB equation admits a classical solution in the case of log and power utility if the state space is reduced to positive stock holdings, and Dai et al. [20] and Chen et al. [12] extend this result to the problem with intermediate consumption and CARA utility, respectively. Moreover, Czichowsky et al. [19] and Czichowsky and Schachermayer [16, 17, 18] use the shadow price approach to establish existence of optimal strategies for general price processes extending beyond semimartingales. Numerical schemes in the Black–Scholes setting can be found in Kunisch and Sass [36], Dai and Zhong [22] and Herzog et al. [30]. Finally, Bichuch [11] in the Black–Scholes setting and Kallsen and Muhle-Karbe [35] and Kallsen and Li [33] for more general price processes solve the finite-horizon problem asymptotically for small transaction costs. Summing up the results on the finite-horizon problem, it is known that

1) the value function $\mathcal{V}$ is the unique viscosity solution of the HJB equation;

2) there exists a classical solution $V$ of the HJB equation if the state space is reduced to positive stock holdings; and

3) there exists a frictionless market in which the optimal strategy coincides with the optimal strategy in the transaction costs market.

It is, however, not immediate that the classical solution $V$ coincides with the value function $\mathcal{V}$, nor if the optimal strategy obtained from the dual approach is a reflected diffusion in the no-trading region implied by the HJB equation. More precisely:

1) $V$ is a classical solution on the reduced state space, whereas the value function $\mathcal{V}$ is a viscosity solution on the entire state space. While it is not particularly difficult to verify that the classical solution $V$ is also a viscosity solution on the reduced state space, this does not imply that $V = \mathcal{V}$ on the reduced state space since the existing literature only provides uniqueness results for viscosity solutions on the entire state space. Thus, in order to rigorously conclude that $V = \mathcal{V}$, one either needs to (a) prove uniqueness of viscosity solutions on the reduced state space, or (b) show that $V$ extends to a viscosity solution on the entire state space. In both cases, a careful inspection of the behaviour of $V$ and $\mathcal{V}$ at the boundary of the reduced state space is necessary.

2) No link between the auxiliary frictionless market and the HJB equation is known. Hence, while existence of an optimal strategy is guaranteed by the dual approach, it is an open question whether optimal strategies are determined by the trading regions implied by the HJB equation. In particular, without establishing this link, it is not clear if the trading regions obtained from solving the HJB equation numerically determine an optimal strategy.

3) The classical approach of verifying optimality of candidate strategies via the primal approach requires a sufficiently smooth value function $\mathcal{V}$ to justify the application of Itô’s formula along all controlled state processes (and hence on the entire state space). We shall see, however, that the value function is not of class $C^{1,2}$ everywhere (see Theorem 4.14 for the precise statement), and thus there is a need for a verification argument requiring less regularity on $\mathcal{V}$.

To show that $V$ and $\mathcal{V}$ coincide and that optimal strategies are determined by the HJB equation, we devise a novel verification procedure which only requires to evaluate the candidate value function along the uncontrolled state process. Since the uncontrolled state process naturally avoids states in which the HJB equation degenerates (i.e., regularity fails), this puts Itô’s formula at our disposal to establish the claims. For this, we characterise the value function as the smallest continuous function which is superharmonic with respect to the uncontrolled state process and nondecreasing in the direction of transactions. More precisely, we proceed as follows:

1) We define trading regions in terms of the classical solution $V$ of the HJB equation and show that for every initial state, there exists a trading strategy which turns the corresponding state process into a diffusion reflected at the boundaries between the trading regions.

2) We define a function $h_{0}$ which maps the initial state to the expected utility obtained by following the trading strategy constructed in Step 1) and show that $h_{0}$ is superharmonic and nondecreasing in the direction of transactions. A simple argument shows that $h_{0}$ coincides with the classical solution $V$ on the reduced state space, and by construction, $h_{0}$ is dominated by the value function $\mathcal{V}$.

3) We argue that every superharmonic function which is nondecreasing in the direction of transactions is a viscosity supersolution of the HJB equation. By the previous step and the comparison principle in Belak et al. [9], this implies that $h_{0}$ dominates the value function. By Step 2), this shows that $h_{0}$ and $\mathcal{V}$ coincide, yielding that $\mathcal{V}$ and $V$ are equal on the reduced state space and the trading strategies constructed in Step 1) are optimal.

In particular, as we characterise the value function as the smallest superharmonic function, our approach may be seen as an alternative duality theory for singular control problems. Moreover, since our approach naturally avoids points of singularity of the infinitesimal generator of the underlying stochastic process (which are typically the points at which it is difficult to verify regularity of the value function), it is conceivable that the verification argument can be applied to other singular control problems as well.

Our verification argument is inspired by recent results of Christensen [14] and Belak et al. [8] in the context of stochastic impulse control. However, since optimal trading regions for singular control problems are given in terms of first-order derivatives of the value function, as opposed to the value function itself for impulse control problems, the mathematical analysis in the present paper differs significantly from the corresponding impulse control results.

Moreover, our superharmonic function approach can be seen as a version of the stochastic Perron method; see Bayraktar and Sîrbu [4, 5, 6] for early developments and Bayraktar and Zhang [7] for the case of a singular control problem with transaction costs. In contrast to the stochastic Perron method, we require the superharmonicity property along the uncontrolled state process together with the monotonicity in the direction of transactions, whereas for the stochastic Perron method, one would typically ask for superharmonicity along every controlled state process (or at least a subset of state processes containing a maximising sequence). As a consequence, it is more involved in our setting to argue that the superharmonic functions dominate the value function (we rely on viscosity arguments for this, whereas it is immediate in the setting of [7]). On the other hand, our definition makes verification significantly easier since we only need to verify superharmonicity for the uncontrolled state process. We note, however, that our setup implies superharmonicity along any state processes obtained from a piecewise constant strategy (see Remark 4.2 below) and hence the two concepts coincide as soon as there exists a maximising sequence consisting of piecewise constant strategies.

The remainder of this article is structured as follows. In Sect. 2, we set up the market model, recall existing results from the literature, and discuss implications of our results. In Sect. 3, we construct the candidate optimal strategies as well as the corresponding reflected diffusions. Our main results can be found in Sect. 4, where we present the verification theorem to show that these candidate optimal strategies are indeed optimal, analyse the regularity of the value function in detail, and prove that the classical solution of Dai and Yi [20] coincides with the value function on the reduced state space.

2 Market model and problem formulation

2.1 The market model

We let $W = (W(t))_{t\geq 0}$ be a standard Brownian motion defined on the canonical Wiener space $(\Omega ,\mathcal{F},\mathbb{P})$. For each $t\geq 0$, we denote the augmented filtration generated by $(W(u)-W(t))_{u \geq t}$ by $\mathbb{F}^{t} = (\mathcal{F}^{t}(u))_{u\geq t}$ and set $\mathbb{F} := \mathbb{F}^{0}$. Moreover, we fix some terminal time $T>0$ as well as some initial time $t\in [0,T)$.

We consider a Black–Scholes market $(P^{0},P^{1}) = (P^{0}(u),P^{1}(u))_{u \in [t,T]}$ with

$$\begin{aligned} \mathrm{d}P^{0}(u) &= 0,\qquad u\in [t,T],\ P^{0}(t)=1, \\ \mathrm{d}P^{1}(u) &= \alpha P^{1}(u)\,\mathrm{d}u+\sigma P^{1}(u)\, \mathrm{d}W(u),\qquad u\in [t,T],\ P^{1}(t)=1. \end{aligned}$$

Here, $\alpha >0$ and $\sigma >0$ denote the excess return and volatility of the stock, respectively. With this, we assume that the investor buys shares of the stock at the ask price $(1+\lambda )P^{1}$, where $\lambda >0$, and sells shares of the stock at the bid price $(1-\mu )P^{1}$, where $\mu \in (0,1)$.

Next, to model trading strategies, we take $\mathbb{F}^{t}$-adapted, nondecreasing, càdlàg processes $L=(L(u))_{u\in [t,T]}$ and $M=(M(u))_{u \in [t,T]}$ with $L(t-)=M(t-)=0$. Here, $L$ and $M$ represent the cumulative units of money used for purchases and sales of the stock, respectively. With this, we denote by $B = B_{t,b}^{L,M} = (B_{t,b} ^{L,M}(u))_{u\in [t,T]}$ and $S = S_{t,s}^{L,M} = (S_{t,s}^{L,M}(u))_{u \in [t,T]}$ the investor’s wealth invested in the bond and the stock, respectively. Assuming that the strategy $(L,M)$ is self-financing, the evolution of $B$ and $S$ can be written as

$$\begin{aligned} \mathrm{d}B(u) &= -(1+\lambda )\,\mathrm{d}L(u) + (1-\mu )\,\mathrm{d}M(u),\qquad u \in [t,T], \\ \mathrm{d}S(u) &= \alpha S(u)\,\mathrm{d}u + \sigma S(u)\,\mathrm{d}W(u) + \mathrm{d}L(u) - \mathrm{d}M(u),\qquad u\in [t,T], \end{aligned}$$

where the initial values are given by $B(t-)=b$ and $S(t-)=s$, respectively. The net wealth $X = X_{t,b,s}^{L,M} = (X_{t,b,s}^{L,M}(u))_{u \in [t,T]}$ of the investor after liquidation of the stock position is then given by

We say that a trading strategy is admissible if the corresponding net wealth process is nonnegative. For this, we define the solvency cone

$$ \mathcal{S} := \{(b,s)\in \mathbb{R}^{2} : b+(1+\lambda )s>0, b+(1- \mu )s>0\}. $$

With this, whenever $(b,s)\in \overline{\mathcal{S}}$, the investor can liquidate her stock holdings to end up with nonnegative wealth. A trading strategy $(L,M)$ is therefore admissible for an initial position $(b,s)\in \overline{\mathcal{S}}$ if the corresponding pair $(B_{t,b}^{L,M},S_{t,s}^{L,M})$ takes values in $\overline{ \mathcal{S}}$. The set of all admissible trading strategies of this form is denoted by $\mathcal{A}(t,b,s)$.

The objective of the investor is to maximise expected utility of the net terminal wealth after liquidation, i.e.,

$$ \mathcal{V}(t,b,s) := \sup _{(L,M)\in \mathcal{A}(t,b,s)} \mathbb{E} \bigl[ U_{p}\bigl(X_{t,b,s}^{L,M}(T)\bigr)\bigr], $$

(2.1)

where the utility function $U_{p}:(0,\infty )\to \mathbb{R}$ is defined as

$$ U_{p}(x) := \textstyle\begin{cases} x^{p}/p &\quad \text{if }p< 1,p\neq 0, \\ \log x &\quad \text{if }p=0. \end{cases} $$

We extend $U_{p}$ to $[0,\infty )$ by setting $U_{p}(0):= \lim _{x\downarrow 0}U_{p}(x)$.

2.2 Overview of existing results

As pointed out in Sect. 1, the portfolio problem defined in (2.1) has received considerable interest in the past. In this section, we briefly summarise the results which will be needed in the sequel.

First, it is easy to see that the value function $\mathcal{V}$ defined in (2.1) is finite on $[0,T]\times \mathcal{S}$. More specifically, we have (see Belak et al. [9])

$$ U_{p}\big(b + \min \{(1-\mu )s,(1+\lambda )s\}\big) \leq \mathcal{V}(t,b,s) \leq \varphi _{p}(t,b,s), $$

where for $\gamma \in (1-\mu ,1+\lambda )$ and $K>1$, the function $\varphi _{p}:[0,T]\times \overline{\mathcal{S}}\to \mathbb{R}$ is given by

$$ \varphi _{p}(t,b,s) := U_{p}\bigl( (b+\gamma s) f(t) \bigr) $$

with $f:[0,T]\to \mathbb{R}$ given by

$$ f(t) := \exp \bigg(K\frac{1}{2(1-p)}\frac{\alpha ^{2}}{\sigma ^{2}}(T-t) \bigg). $$

Davis et al. [24] (with some adaptations) and Belak et al. [9, 10] establish that the value function $\mathcal{V}$ is continuous and the unique viscosity solution of the HJB equation

$$ 0 = \min \{ \mathcal{L}^{\mathrm{nt}}\mathcal{V}(t,b,s), \mathcal{L} ^{\mathrm{buy}}\mathcal{V}(t,b,s), \mathcal{L}^{\mathrm{sell}} \mathcal{V}(t,b,s)\} $$

(2.2)

on $[0,T)\times \mathcal{S}$ with terminal condition

$$ \mathcal{V}(T,b,s) = U_{p}\big(b + \min \{(1-\mu )s,(1+\lambda )s\} \big), \qquad (b,s)\in \overline{\mathcal{S}}, $$

and boundary condition

$$ \mathcal{V}(t,b,s) = U_{p}(0), \qquad (t,b,s)\in [0,T]\times \partial \mathcal{S}. $$

The differential operators $\mathcal{L}^{\mathrm{nt}}$, $\mathcal{L} ^{\mathrm{buy}}$ and $\mathcal{L}^{\mathrm{sell}}$ in (2.2) are given by

$$\begin{aligned} \mathcal{L}^{\mathrm{nt}}\mathcal{V}(t,b,s) &:= -\partial _{t} \mathcal{V}(t,b,s) - \alpha s \partial _{s}\mathcal{V}(t,b,s) - \frac{1}{2} \sigma ^{2} s^{2} \partial _{s}^{2}\mathcal{V}(t,b,s), \\ \mathcal{L}^{\mathrm{buy}}\mathcal{V}(t,b,s) &:= (1+\lambda )\partial _{b}\mathcal{V}(t,b,s) - \partial _{s}\mathcal{V}(t,b,s), \\ \mathcal{L}^{\mathrm{sell}}\mathcal{V}(t,b,s) &:= -(1-\mu )\partial _{b}\mathcal{V}(t,b,s) + \partial _{s}\mathcal{V}(t,b,s). \end{aligned}$$

The uniqueness of the value function is a consequence of the following comparison principle, obtained in Belak et al. [9, Theorem 4.4].

Theorem 1

Let$u,v:[0,T]\times \overline{\mathcal{S}}\to \overline{\mathbb{R}}$and fix$\varepsilon >0$. Assume that$u$is an upper semicontinuous viscosity subsolution and$v$is a lower semicontinuous viscosity supersolution of (2.2) such that

$$ U_{p}\big(b+\min \{(1-\mu )s,(1+\lambda )s\}\big) \leq u(t,b,s),v(t,b,s) \leq \varphi _{p}(t,b,s). $$

If$u(T,b,s)\leq v(T,b+\varepsilon ,s)$and$u(t,b,s)\leq U_{p}(0)$for every$(b,s)\in \partial \mathcal{S}$, then$u(t,b,s)\leq v(t,b+ \varepsilon ,s)$on$[0,T]\times \overline{\mathcal{S}}$.

It is expected that the operators $\mathcal{L}^{\mathrm{nt}}$, $\mathcal{L}^{\mathrm{buy}}$ and $\mathcal{L}^{\mathrm{sell}}$ determine the optimal strategies. To be more precise, given a smooth solution $v$ of (2.2), we define

$$\begin{aligned} {\mathcal{R}}^{\mathrm{buy}}(v) &:= \{(t,b,s) \in [0,T)\times \mathcal{S}: {\mathcal{L}}^{\mathrm{buy}}v(t,b,s) = 0\}, \\ {\mathcal{R}}^{\mathrm{sell}}(v) &:= \{(t,b,s) \in [0,T)\times \mathcal{S}: {\mathcal{L}}^{\mathrm{sell}}v(t,b,s) = 0\}, \\ {\mathcal{R}}^{\mathrm{nt}}(v) &:= ([0,T)\times \mathcal{S})\setminus ({\mathcal{R}}^{\mathrm{buy}}\cup {\mathcal{R}}^{\mathrm{sell}}). \end{aligned}$$

We expect that the optimal strategy keeps the process $(B,S)$ inside the no-trading region $\mathcal{R}^{\mathrm{nt}}(v)$ by reflecting $(B,S)$ at the boundary $\partial \mathcal{R}^{\mathrm{nt}}(v)$. Note that we do not include the terminal time $T$ in the definition of the trading regions since we require the investor to liquidate her holdings in the stock at time $T$, and we do not include the boundary $\partial \mathcal{S}$ since the only admissible, and hence optimal, strategy on $\partial \mathcal{S}$ is to instantly close the stock position and refrain from further trading; see Lemma 3.1 below.

While with the previous result, the value function $\mathcal{V}$ is completely characterised (and can be computed numerically), it does not suffice to construct and verify the optimal strategies. It is therefore necessary to study the HJB equation in more detail for the existence of a regular solution.

We denote by $\mathcal{S}_{0}$ and $\overline{\mathcal{S}_{0}}$ the restrictions to positive stock holdings of the solvency region and of its closure, respectively, i.e.,

$$ \mathcal{S}_{0} := \{(b,s)\in \mathcal{S} : s > 0\} \qquad \text{and} \qquad \overline{\mathcal{S}_{0}} := \{(b,s)\in \overline{\mathcal{S}} : s > 0\}. $$

Dai and Yi [21, Theorem 5.1] show that the HJB equation admits a classical solution on the restricted solvency region:

Theorem 2

There exists a continuous function$V:[0,T]\times \overline{ \mathcal{S}_{0}}\to \mathbb{R}$such that$V\in C^{1,2}(([0,T) \times \mathcal{S}_{0})\setminus F)$and$\partial _{t} V\leq 0$which solves the HJB equation (2.2) in the classical sense. Here, the set$F$is given by

$$ F := \textstyle\begin{cases} \emptyset &\quad \textit{if }\pi _{M} < 1, \\ \{(t,b,s)\in [0,T)\times \mathcal{S}_{0}: b = 0\} &\quad \textit{if } \pi _{M} = 1, \\ \{(t,b,s)\in [0,T)\times \mathcal{S}_{0}: b = 0, t = t^{\mathrm{up}} \} &\quad \textit{if }\pi _{M} > 1, \end{cases} $$

(2.3)

where

$$ t^{\mathrm{up}} := T - \frac{\log (1+\lambda ) - \log (1-\mu )}{ \alpha -(1-p)\sigma ^{2}} $$

(2.4)

and$\pi _{M} := \alpha /((1-p)\sigma ^{2})$denotes the Merton fraction.

Let us emphasise here that the combination of the fact that $\mathcal{V}$ is a viscosity solution of the HJB equation on $[0,T)\times \mathcal{S}$, the fact that $V$ is a classical solution of the HJB equation on $[0,T)\times \mathcal{S}_{0}$, and the uniqueness result implied by the comparison principle in Theorem 2.1 do not imply that $\mathcal{V} = V$ on $[0,T)\times \mathcal{S}_{0}$; some additional work is necessary to arrive at this conclusion. Indeed, while it is not too difficult to show that $V$ is also a viscosity solution on $[0,T)\times \mathcal{S}_{0}$ (which is in particular immediate outside the set $F$), one would need to extend $V$ to a continuous viscosity solution defined on the entire state space $[0,T)\times \mathcal{S}$ to apply the comparison theorem. Verifying that such an extension exists, however, requires additional work including a careful study of the behaviour of $V$ (and its partial derivatives) as $s\downarrow 0$. While we believe that it is possible to follow this direct approach, we shall nevertheless take a different route which has the advantage of additionally verifying optimality of our candidate optimal trading strategies. These candidate strategies are defined in terms of trading regions implied by the classical solution $V$. More precisely, Theorem 2.2 allows us to define the reduced trading regions

$$\begin{aligned} \mathcal{R}_{0}^{\mathrm{buy}} &:= \{(t,b,s) \in [0,T)\times \mathcal{S}_{0}: \mathcal{L}^{\mathrm{buy}} V(t,b,s) = 0\}, \\ \mathcal{R}_{0}^{\mathrm{sell}} &:= \{(t,b,s) \in [0,T)\times \mathcal{S}_{0}: \mathcal{L}^{\mathrm{sell}} V(t,b,s) = 0\}, \\ \mathcal{R}_{0}^{\mathrm{nt}} &:= ([0,T)\times \mathcal{S}_{0}) \setminus (\mathcal{R}_{0}^{\mathrm{buy}}\cup \mathcal{R}_{0}^{ \mathrm{sell}}). \end{aligned}$$

Note that we must have $\mathcal{L}^{\mathrm{nt}} V = 0$ on $\mathcal{R}_{0}^{\mathrm{nt}}$. In order to construct the optimal strategies, it is important to determine the geometry of these sets and the location of the boundaries between them. Dai and Yi [21, Theorems 4.3, 4.5 and 4.7] provide the following characterisation of these free boundaries.

Theorem 3

1) There exist nonnegative, nonincreasing functions$\underline{\pi }:[0,T)\to \mathbb{R}$and$\overline{\pi }:[0,T) \to \mathbb{R}$with$\underline{\pi }(t) < \overline{\pi }(t)$for all$t\in [0,T)$such that

$$\begin{aligned} \mathcal{R}_{0}^{\mathrm{buy}} & = \bigg\{ (t,b,s) \in [0,T)\times \mathcal{S}_{0}: \frac{s}{b+s} \leq \underline{\pi }(t) \bigg\} , \\ \mathcal{R}_{0}^{\mathrm{sell}} & = \bigg\{ (t,b,s) \in [0,T)\times \mathcal{S}_{0}: \frac{s}{b+s} \geq \overline{\pi }(t) \bigg\} , \\ \mathcal{R}_{0}^{\mathrm{nt}} & = \bigg\{ (t,b,s) \in [0,T)\times \mathcal{S}_{0}: \underline{\pi }(t) < \frac{s}{b+s} < \overline{ \pi }(t)\bigg\} . \end{aligned}$$

Moreover, $V$is of class$C^{\infty }$on$\mathcal{R}_{0}^{ \mathrm{nt}}$.

2) The function$\underline{\pi }$is continuous and satisfies

$$ \underline{\pi }(t) \textstyle\begin{cases} < 1 &\quad \textit{if }\pi _{M}\leq 1, \\ > 1 &\quad \textit{if }\pi _{M}>1, t < t^{\mathrm{up}}, \\ = 1 &\quad \textit{if }\pi _{M}>1, t = t^{\mathrm{up}}, \\ < 1 &\quad \textit{if }\pi _{M}>1, t > t^{\mathrm{up}}, \end{cases} $$

where$t^{\mathrm{up}}$is defined in (2.4). Furthermore, $\underline{\pi }(t) = 0$for$t\in [t^{\mathrm{down}},T)$, where

$$ t^{\mathrm{down}} := T - \frac{\log (1+\lambda ) - \log (1-\mu )}{ \alpha }. $$

3) It holds that

$$ \overline{\pi }(t) \textstyle\begin{cases} < 1 &\quad \textit{if }\pi _{M}< 1, \\ = 1 &\quad \textit{if }\pi _{M}=1, \\ > 1 &\quad \textit{if }\pi _{M}>1, \end{cases} $$

and$\overline{\pi }\in C^{\infty }([0,T))$.

Remark 4

A close inspection of the results in Dai and Yi [21] implies that

$$ \inf _{t\in [0,T)} |\overline{\pi }(t) - \underline{\pi }(t)| > 0. $$

Indeed, Dai and Yi [21, Sect. 5] show that the HJB equation (2.2) can be transformed into a double obstacle problem with obstacles given by $1/(x+1+\lambda )$ (determining $\underline{\pi }$) and $1/(x+1-\mu )$ (determining $\overline{\pi }$), respectively. Since $V$ is continuous and the distance between the obstacles is strictly positive, this implies that the above infimum is also strictly positive.

Figures 1–3 below sketch the different scenarios for the location of the free boundaries. Note that $t^{\mathrm{up}}$ is the time point at which the lower free boundary is equal to one, i.e., $\underline{\pi }(t^{\mathrm{up}}) = 1$ (this may only happen if $\pi _{M}>1$), and $t^{\mathrm{down}}$ is the time point from which onwards the lower free boundary is equal to zero, i.e., $\underline{\pi }(t) = 0$ for all $t\in [t^{\mathrm{down}},T]$.

For obvious reasons, we refer to $\underline{\pi }$ and $\overline{ \pi }$ as the buy and sell boundary, respectively. If our conjecture that the buy and sell boundaries characterise the optimal strategies is indeed correct (which will be rigorously proved in Sect. 4), Theorem 2.3 has the following implications:

1) If $\pi _{M}<1$ (cf. Fig. 1), i.e., if borrowing is not optimal in the absence of costs, then it is also not optimal in the presence of costs.

2) If $\pi _{M} = 1$ (cf. Fig. 2), i.e., if it is optimal to invest all money in the stock in the absence of transaction costs, then two cases must be distinguished in the presence of costs. If the initial position of the investor is such that $b\leq 0$, then the bond position is closed and all money is kept in the stock (since $\overline{\pi }=1$). However, if the initial position is such that $b>0$, then it is not optimal to close the bond position. This is because we force the investor to close the stock position at the terminal time $T$, and hence it is too expensive to first buy shares of the stock at the initial time just to liquidate the stock position once the investment horizon is reached.

3) If $\pi _{M}>1$ (cf. Fig. 3), i.e., if borrowing is optimal without costs, we need to distinguish three cases. Since the investor never switches from borrowing to no-borrowing or vice versa after the initial transaction ($\overline{\pi }>1$ and $\underline{\pi }$ is nonincreasing), the initial transaction determines whether borrowing or no-borrowing is optimal:

$t^{\mathrm{up}} > 0$. In this case, borrowing is optimal since $\underline{\pi}(0)>1$.

$t^{\mathrm{up}} = 0$. If the initial position is such that $b<0$, then borrowing is optimal. Otherwise the investor invests all her wealth in the stock (since $\underline{\pi}(0) = 1$).

$t^{\mathrm{up}} < 0$. In this case, borrowing is optimal if $b<0$ and no-borrowing is optimal if $b\geq 0$. This is because $\underline{\pi }(t)<1<\overline{\pi }(t)$ for all $t\in [0,T)$.

4) In any case, as soon as $t\geq t^{\mathrm{down}}$, the investor refrains from buying shares of the stock since $\underline{\pi }(t) = 0$.

5) If the initial position $(b,s)$ is such that $s\leq 0$, then whenever $\underline{\pi }(t)=0$, it is optimal to liquidate the stock position and refrain from further trading. Whenever $\underline{ \pi }(t)>0$, the investor performs an initial transaction which takes her position to the boundary of the no-trading region. This is proved in Sect. 4, but intuitively this behaviour is clear: Since the excess return $\alpha $ is positive and the investor has to liquidate her stock holdings at time $T$, it should never be optimal to have a short position in the stock.

3 Construction of the optimal strategies

We proceed with the construction of the candidate optimal strategies. For this, we fix an arbitrary initial datum $(t_{0},b_{0},s_{0}) \in [0,T)\times \overline{\mathcal{S}}$. We first observe that if $(b _{0},s_{0})\in \partial \mathcal{S}$, then the only admissible and hence optimal strategy is to immediately close the position and refrain from further trading. The proof follows as in Shreve and Soner [37, Remark 2.1].

Lemma 1

Let$(b_{0},s_{0})\in \partial \mathcal{S}$. Then the only admissible strategy is to instantly jump to the position$(0,0)$and remain there.

In what follows, we may thus assume$(b_{0},s_{0})\in \mathcal{S}$. For the construction of the optimal strategy, we need to prove the existence of nondecreasing processes $L^{*} = (L^{*}(u))_{u \in [t,T]}$ and $M^{*} = (M^{*}(u))_{u\in [t,T]}$ which turn the controlled wealth process $(B^{*},S^{*}) := (B^{L^{*},M^{*}}_{t_{0},b _{0}}, S^{L^{*},M^{*}}_{t_{0},s_{0}})$ into a diffusion reflected at the boundary of $\mathcal{R}_{0}^{\mathrm{nt}}$. We first observe that we can without loss of generality assume that the initial position $(t_{0},b_{0},s_{0})$ is an element of the closure of the no-trading region $\mathcal{R}_{0}^{\mathrm{nt}}$. Indeed, if $(t_{0},b_{0},s _{0})\not \in \overline{\mathcal{R}_{0}^{\mathrm{nt}}}$, then we can find $(b^{*},s^{*})$ and (minimal) $\ell ,m\geq 0$ such that $(t_{0}, b^{*}, s^{*})\in \partial \mathcal{R}_{0}^{\mathrm{nt}}$ and

$$ b^{*} = b_{0} - (1+\lambda )\ell + (1-\mu )m,\qquad s^{*} = s_{0} + \ell - m. $$

With this, if $(L^{*},M^{*})$ is the candidate optimal strategy for $(t_{0},b^{*},s^{*})$, the pair $(L^{*} + \ell , M^{*} + m)$ is the candidate optimal strategy for $(t_{0},b_{0},s_{0})$. In other words, by a suitable initial transaction, we can always ensure that we start within the closure of the no-trading region.

Next, we observe that in the following, we can rule out all cases in which the investor liquidates either the bond or the stock position at time $t_{0}$ and refrains from further transactions. Comparing with Figs. 1–3 and recalling that we may assume $(t_{0},b_{0},s_{0})\in \overline{\mathcal{R}_{0}^{\mathrm{nt}}}$, these cases are

(a)
$\pi _{M}$ arbitrary, $t_{0}\geq t^{\mathrm{down}}$ and $s_{0}=0$;
(b)
$\pi _{M} = 1$ with $s_{0}>0$ and $b_{0}= 0$;
(c)
$\pi _{M} > 1$ with $s_{0}>0$, $b_{0}=0$, and $t_{0}\geq t ^{\mathrm{up}}$;
(d)
$\pi _{M} > 1$ with $s_{0}=0$, $b_{0}>0$, and $t_{0} = t ^{\mathrm{up}}$.

The remaining cases are given by

(e)
$\pi _{M} < 1$ with $s_{0},b_{0}>0$;
(f)
$\pi _{M} = 1$ with $s_{0},b_{0}>0$;
(g)
$\pi _{M} > 1$ with $s_{0},b_{0}>0$ and $t_{0}>t^{ \mathrm{up}}$;
(h)
$\pi _{M} > 1$ with $s_{0}>0$, $b_{0}<0$.

The cases (e)–(g) are no-borrowing cases, whereas we expect borrowing to be optimal in the case (h). It turns out that for the construction of the reflected diffusions, it is advantageous to consider the change of variables $s/b$ in the no-borrowing case and $s/(-b)$ in the borrowing case. For this, we define

$$ {\mathcal{S}}_{+} := \left \{(b,s)\in \mathcal{S}: b>0, s>0\right \}, \qquad {\mathcal{S}}_{-} := \left \{(b,s)\in \mathcal{S}: b< 0, s>0\right \}. $$

In the sequel, we work on the reduced state space $[0,T]\times {\mathcal{S}} _{+}$ in the no-borrowing cases (e)–(g) and $[0,T]\times {\mathcal{S}} _{-}$ in the borrowing case (h).

3.1 Construction in the no-borrowing case (e)

The main idea for the construction of the optimal strategy is to find a suitable transformation of the state space so that the problem of constructing an obliquely reflected diffusion in an unbounded and time-dependent cone simplifies to normal reflection in a time-dependent interval. The transformation and construction is based on ideas from Gerhold et al. [26], and hence we keep the exposition to a minimum. We restrict ourselves to the case $p<1$, $p\neq 0$ (power utility) and remark that the construction for the case $p=0$ (log utility) follows similarly.

In the situation of case (e), i.e., $\pi _{M} < 1$ with $s_{0},b_{0}>0$, we first observe that $V\in C^{1,2}([0,T)\times \mathcal{S}_{0})$ by Theorem 2.2. We define

$$ \ell (t) := \frac{\underline{\pi }(t)}{1-\underline{\pi }(t)}\qquad \text{and}\qquad u(t) := \frac{\overline{\pi }(t)}{1-\overline{ \pi }(t)}, $$

(3.1)

and note that $\ell $ and $u$ constitute the buy and sell boundaries under the change of variables $(b,s)\mapsto s/b$, i.e.,

$$ \mathcal{R}_{0}^{\mathrm{nt}} = \bigg\{ (t,b,s)\in [0,T)\times \overline{ \mathcal{S}} : \ell (t) < \frac{s}{b} < u(t)\bigg\} . $$

By Theorem 2.3, we see that $\ell < u$, $\ell \in C([0,T))$ and $u\in C^{\infty }([0,T))$.

On the set $[0,T]\times \mathcal{S}_{+}$ we consider the transformation

$$ V(t,b,s) = b^{p}\exp \bigg( - p\int _{\log \frac{s}{bu(t)}}^{0} w(t,y)\, \mathrm{d}y \bigg) = b^{p}\exp \bigg( - p\int _{x}^{0} w(t,y)\, \mathrm{d}y \bigg), $$

where

$$ x = x(t,b,s) := \log \frac{s}{bu(t)}. $$

With this and using that $V$ satisfies $\mathcal{L}^{\mathrm{buy}}V \geq 0$ and $\mathcal{L}^{\mathrm{sell}} V\geq 0$, we see that $w$ satisfies

$$ 1-\mu \leq \frac{w(t,x)}{u(t)(1-w(t,x))e^{x}} \leq 1+\lambda , $$

(3.2)

and equality holds if and only if $\mathcal{L}^{\mathrm{sell}} V = 0$ or $\mathcal{L}^{\mathrm{buy}} V = 0$, respectively. Moreover, since $\mathcal{L}^{\mathrm{nt}}V = 0$ whenever $\mathcal{L}^{\mathrm{buy}} V > 0$ and $\mathcal{L}^{\mathrm{sell}} V > 0$, we have

$$\begin{aligned} 0 &=\int _{x}^{0} \partial _{t}w(t,y)\,\mathrm{d}y - \bigg(\alpha -\frac{1}{2} \sigma ^{2} - \frac{u'(t)}{u(t)}\bigg)w(t,x) \\ & \phantom{=:}- \frac{1}{2}p\sigma ^{2}w(t,x)^{2} - \frac{1}{2}\sigma ^{2}\partial _{x}w(t,x) \end{aligned}$$

whenever $w/(u(1-w)e^{x}) \not \in \{1-\mu ,1+\lambda \}$. Taking the derivative with respect to $x$ in the last equation, we obtain

$$\begin{aligned} \frac{1}{2}\sigma ^{2}\partial _{x}^{2}w(t,x) & = - \partial _{t}w(t,x) - \bigg(\alpha - \frac{1}{2}\sigma ^{2} - \frac{u'(t)}{u(t)}\bigg) \partial _{x}w(t,x) \\ & \phantom{=:}- p\sigma ^{2}w(t,x)\partial _{x}w(t,x). \end{aligned}$$

Consider again the fraction in (3.2), i.e.,

$$ f(t,x) := \frac{w(t,x)}{u(t)(1-w(t,x))e^{x}}. $$

Since by (3.1) the points $x = 0$ and $x = \log ( \ell (t)/u(t))$ constitute the boundary points of the no-trading region in the new variables, we must have

$$\begin{aligned} f(t,x) &= 1-\mu \qquad \text{if }x\geq 0, \end{aligned}$$

(3.3)

$$\begin{aligned} f(t,x) &= 1+\lambda \qquad \text{if } x\leq \log \frac{\ell (t)}{u(t)}. \end{aligned}$$

(3.4)

Moreover, note that for the point $x = \log (\ell (t)/u(t))$, these considerations are only valid for $t\in [0,t^{\mathrm{down}})$ since otherwise $\log (\ell (t)/u(t)) = -\infty $.

Remark 2

We have

$$ f(t,x) = \frac{w(t,x)}{u(t)(1-w(t,x))e^{x}} \in [1-\mu ,1+\lambda ] $$

and $f(t,x)\in \{1-\mu ,1+\lambda \}$ inside the buy and sell regions. This suggests that

$$ f\bigl(t,X^{*}(t)\bigr)P^{1}(t) \qquad \text{with} \qquad X^{*}(t) = \log \frac{S^{*}(t)}{B^{*}(t)u(t)} $$

(where $(B^{*},S^{*})$ is the optimally controlled portfolio process) is the shadow price in our problem. This can be confirmed as in Gerhold et al. [26].

The next step is to construct a reflected diffusion in the time-dependent interval $[\log (\ell /u),0]$.

Lemma 3

There exist a process$\Psi = (\Psi (t))_{t\in [t_{0},T)}$and nondecreasing processes$L = (L(t))_{t\in [t_{0},T)}$and$M = (M(t))_{t \in [t_{0},T)}$such that$L$is constant on$[t^{\mathrm{down}},T)$and

$$ \mathrm{d}\Psi (t) = \bigg(\alpha -\frac{1}{2}\sigma ^{2} - \frac{u'(t)}{u(t)}\bigg)\,\mathrm{d}t + \sigma \,\mathrm{d}W(t) + \mathrm{d}L(t) - \mathrm{d}M(t), $$

(3.5)

with

$$ \Psi (t_{0}) = \log \frac{s_{0}}{b_{0}u(t_{0})}, $$

and such that$\Psi $is a diffusion reflected on the boundaries of the time-dependent interval$[\log (\ell /u),0]$.

Proof

This follows from Słomiński and Wojciechowski [38, Theorem 3.3] together with Remark 2.4. □

Let us now define a process $N = (N(t))_{t\in [t_{0},T)}$ by $N({t_{0}}) = s_{0}/P^{1}({t_{0}})$ and, for $t\in [t_{0},T)$,

$$\begin{aligned} \mathrm{d}N(t) &= N(t)\biggl(1 - w\Bigl(t,\log \frac{\ell (t)}{u(t)} \Bigr)\biggr)\,\mathrm{d}L(t)- N(t)\bigl(1 - w(t,0)\bigr)\,\mathrm{d}M(t). \end{aligned}$$

(3.6)