1 Introduction

The classical result in Mathematical Finance states that in a complete frictionless market with no transaction costs and no portfolio constraints the "fair" (no-arbitrage) price of a derivative security equals the expectation, with respect to the unique martingale measure, of the security’s discounted payoff (see, for instance, Pliska (1997); Björk (1998), or Föllmer and Schied (2002)). This result has been extended to financial markets with transaction costs and trading constraints by numerous researchers, including Bensaid et al. (1992); Cvitanić and Karatzas (1996); Soner et al. (1995); Jouini and Kallal (1995a, 1995b, 1999); Jouini (2000, 2001); Föllmer and Kramkov (1997); Carassus et al. (2001); Napp (2001); Jouini and Napp (2001); Kabanov and Stricker (2001); Kabanov (1999, 2001); Stettner (2002); Schachermayer (2004); Roux (2011), and others.

Dempster et al. (2006) established the pricing principle which unifies previous results by leveraging the parallelism between paths of economic dynamics in the von Neumann-Gale model and hedging strategies in a dynamic model of a financial market. Von Neumann-Gale dynamical systems, originally introduced by Von Neumann (1937); Gale (1956), and Rockafellar (1967) for deterministic economic growth modelling, have been extended to stochastic settings by Dynkin, Radner, and others in the 1970 s Dynkin (1971, 1972); Dynkin and Yushkevich (1979); Radner (1971, 1972). Recent advancements (see, e.g., Evstigneev and Schenk-Hoppé 2006, 2008; Bahsoun et al. 2008; Zhitlukhin 2019) have surmounted earlier challenges, and shown that these stochastic systems provide a natural framework for modelling financial markets with frictions. Dempster et al. (2006) addressed no-arbitrage pricing and hedging with proportional transaction costs, and later was extended to a broader context by Evstigneev and Zhitlukhin (2013). Babaei et al. (2020a, 2020b, 2021) explored von Neumann-Gale dynamics in the capital growth theory under proportional transaction costs, further extending the results in this field.

The primary contribution of this paper lies in the extension of the hedging and pricing principle in multiple directions: incorporating fixed and proportional transaction costs, allowing short selling under specified constraints, including margin requirements, and accounting for assets paying dividends with potentially different rates for long and short positions. The main results of the paper present general hedging criteria expressed in terms of consistent discount factors and consistent valuation systems, extending the notion of an equivalent martingale measure.

Let us briefly mention other results related to the topic of the present paper. The equivalence between the absence of arbitrage and the existence of risk-neutral measures under fixed transaction costs has been studied by Jouini et al. (2001). Lépinette and Tran (2016, 2017) investigated the separation of risk measures in the same context. Brown and Zastawniak (2020) extended this analysis to situations where both fixed and proportional transaction costs apply concurrently. They demonstrated that the absence of arbitrage in a model consisting of one risky asset and one risk-free asset with both fixed and proportional transaction costs is equivalent to the existence of a family of absolutely continuous single-step probability measures. This family, together with an adapted process within the bid-ask intervals satisfying the martingale property for each measure, ensures the absence of arbitrage.

The paper’s structure is as follows. Section 2 describes the model. Section 3 states the hedging problem and provides some basic results about it in the model at hand. Section 4 establishes the hedging criteria in terms of consistent discount factors. Section 5 defines and discusses the notion of a consistent valuation system and its relation to the hedging problem. The Appendix assembles general mathematical facts used in this work.

2 The model

We consider a model of a financial market with fixed and proportional transaction costs, as well as portfolio constraints, which is based on von Neumann-Gale dynamical systems.

Let \((\Omega ,\mathcal {F},P)\) be a finite probability space and \(\mathcal {F}_{0}\subseteq \mathcal {F}_{1}\subseteq ...\subseteq \mathcal {F} _{T}=\mathcal {F}\) a sequence of algebras representing a filtration on this space. Sets in the algebra \(\mathcal {F}_{t}\) are interpreted as events observable prior to date t. Without loss of generality the probability of each \(\omega \in \Omega \) is positive. Equalities and inequalities for random variables will be understood to hold for all \(\omega \).

We consider a market where m assets are traded at dates \(t=0,1,\ldots , T\). The positions of a portfolio \(a\in \mathbb {R}^{m}\) will be expressed in terms of their value.

For each \(t=0,1, \ldots , T\) and \(i=1,\ldots ,m\) the following \(\mathcal {F}_{t} \)-measurable random variables are given: market prices \(s_{t,i}>0\), transaction cost rates for selling and buying assets \(0\le \lambda _{t,i} ^{+}<1\), \(\lambda _{t,i}^{-}\ge 0\), fixed transaction costs \(C_{t} \ge 0\), dividend yield for long and short positions \(0\le D_{t,i}^{+}\le D_{t,i}^{-}\). Let \(R_{t,i}=s_{t,i}/s_{t-1,i}\) denote the return of assets.

Portfolio constraints in the model are specified by the conesFootnote 1

$$\begin{aligned} X_{t}(\omega )=\left\{ a\in \mathbb {R}^{m}:\sum _{i=1}^{m}\left( 1-\lambda _{t,i} ^{+}(\omega )\right) a_{+}^{i}\ge \mu _{t}(\omega )\sum _{i=1}^{m}\left( 1+\lambda _{t,i}^{-} (\omega )\right) a_{-}^{i}\right\} , \end{aligned}$$
(1)

where \(\mu _{t}(\omega )>1\) are \(\mathcal {F}_t\)-measurable random variables which can be interpreted as margin requirement coefficients: a trader must be able to liquidate the long positions of her portfolio to cover the short positions with excess determined by \(\mu _{t}\).

Trading in the model at hand goes on as follows. At each date t, \(t=1, \ldots , T-1 \), a trader pays \(C_{t}\) as fixed transaction costs. We will assume that \(C_{0}=C_{T}=0 \). Then she receives the dividend on her portfolio \(a(\omega )\) that she purchased at the previous date. The amount of dividend is specified by the function \(d_{t}(\omega ,a)\) defined by the formula

$$\begin{aligned} d_{t}(a)=\sum _{i=1}^{m}\left( {D}^{+}_{t,i}a_{+}^{i}-{D}^{-}_{t,i}a_{-}^{i}\right) . \end{aligned}$$

Here \({D}^{\pm }_{t,i}\) specify the amount of dividend received or returnedFootnote 2 for each dollar invested in asset i. The amount of dividend received or returned for 1 physical unit of asset i will be \(D_{t,i}^{\pm }s_{t-1,i}\).

After that, the trader rearranges her portfolio \(a(\omega )\) with added dividend to a portfolio \(b(\omega )\) subject to the self-financing constraint. The possibility of rearrangement is specified by the inequality

$$\begin{aligned} \sum _{i=1}^{m}\left( 1-\lambda _{t,i}^{+}\right) \left( R_{t,i}a^{i}-b^{i}\right) _{+}+d_{t}(a)\ge \sum _{i=1}^{m}\left( 1+\lambda _{t,i}^{-}\right) \left( R_{t,i}a^{i}-b^{i}\right) _{-}+C_{t}.\nonumber \\ \end{aligned}$$
(2)

The left-hand side of (2) is the amount of money the trader receives for selling the assets plus the dividends, the right-hand side is the amount of money she pays for buying the assets, including fixed transaction costs. This inequality means that the trader does not use external funds to rearrange her portfolio, so it can be regarded as a self-financing condition (with a possibility of free disposal).

Let

$$\begin{aligned} \psi _{t}(\omega , a,b){} & {} =\sum _{i=1}^{m}\left( 1-\lambda _{t,i}^{+}\right) \left( R_{t,i}a^{i}-b^{i}\right) _{+}\nonumber \\{} & {} \quad \ -\sum _{i=1}^{m}\left( 1+\lambda _{t,i}^{-}\right) \left( R_{t,i}a^{i}-b^{i}\right) _{-}+d_{t}(a). \end{aligned}$$
(3)

Then inequality (2) can be stated as \(\psi _{t}(\omega ,a,b)\ge C_{t}(\omega )\). The above description of the model corresponds to the requirement that pairs of portfolios (ab) belong to the sets

$$\begin{aligned} {Z}_{t}(\omega ):=\left\{ (a,b)\in X_{t-1}(\omega )\times X_{t}(\omega ):\psi _{t}(\omega ,a,b)\ge C_{t}(\omega )\right\} . \end{aligned}$$
(4)

A natural instance of this model is when asset 1 represents cash deposited with a bank account, and the other assets represent holdings in shares of stock. Then it is natural to put \(s_{t,1}=1\), \(\lambda _{t,1}^{\pm }=0\) (value is expressed in terms of cash, and there are no transaction costs for cash). The random variables \(D_{t,1}^{\pm }\) may be \(\mathcal {F}_{t-1}\)-measurable and represent risk-free interest rates for lending and borrowing money. The random variables \(D_{t,i}^{\pm }\), \(i\ge 2\), may be \(\mathcal {F}_{t} \)-measurable and represent dividend yield rates on stock. The possibility to have different dividend rates for long and short positions may be used e.g. when some assets pay dividends in a currency different from asset 1 and there is a bid–ask spread in the exchange rates.

Observe that \({Z}_{t}(\omega )\) is a convex set but not a cone: it is convex, since the function \(\psi _{t}(a,b)\) is concave as follows from the representation

$$\begin{aligned} \begin{aligned} \psi _{t}(a,b)&=\sum _{i=1}^{m}\left[ \left( 1-\lambda _{t,i}^{+}\right) \left( R_{t,i}a^{i} -b^{i}\right) +D_{t,i}^{+}a^{i}\right] \\&-\sum _{i=1}^{m}\left[ \left( \lambda _{t,i}^{-}+\lambda _{t,i}^{+}\right) \left( R_{t,i}a^{i} -b^{i}\right) _{-}+\left( D_{t,i}^{-}-D_{t,i}^{+}\right) a_{-}^{i}\right] , \end{aligned} \end{aligned}$$

where the first sum is a linear function of ab and the second sum is a convex function of ab. However, it does not contain with any pair (ab) all pairs \(\lambda (a,b)\), where \(\lambda \ge 0\).

The sets \(X_{t}(\omega ) \) and \(Z_{t}(\omega ) \) described above generate a stochastic dynamical system over the time interval \(t=0,1, \ldots , T \). Let \(\mathcal {L}^{m}_{t}\) (\(t=0,1,...\)) be a linear space of \(\mathcal {F}_{t} \)-measurable vector functions \(x(\omega )\) with values in \(\mathbb {R}^{m}\). We say that a vector function \(x(\omega )\) is a random state of the system and write \(x\in \mathcal {X}_{t}\) if \(x\in \mathcal {L} ^{m}_{t}\) and \(x(\omega )\in X_{t}(\omega )\).

A sequence of random states \(x_{0}\in \mathcal {X}_{0},x_{1}\in \mathcal {X}_{1},...\) is called a feasible trading strategy if

$$\begin{aligned} (x_{t-1}(\omega ),x_{t}(\omega ))\in Z_{t}(\omega ), \ t=1, \ldots , T-1. \end{aligned}$$

Note that a feasible trading strategy in the model is nothing but a path of the dynamical system under consideration. Thus, this model is a version of a von Neumann-Gale model in which the sets \(X_t\) are cones, but the sets \(Z_t\) are just convex. This extends previous literature on applications of von Neumann-Gale models to mathematical finance which dealt with conic models (see, e.g., Dempster et al. 2006; Evstigneev and Zhitlukhin 2013).

3 The hedging problem

Let us define the following cones describing possibilities of constructing initial portfolios and liquidating terminal ones:

$$\begin{aligned} V_{0}(\omega )=\{(a,b)\in R_+\times {X}_{0}:a\ge -\psi _{0}(\omega ,0,b)\}, \end{aligned}$$
(5)

and

$$\begin{aligned} V_{T}(\omega )=\{(a,b)\in {X}_{T-1}(\omega )\times R:\psi _{T}(\omega ,a,0)\ge b\}, \end{aligned}$$
(6)

where \(\psi _0\) and \(\psi _{T}\) are defined by (3) (in the formula for \(\psi _0\) one can formally put \(R_{0,i} = 1\) and \(d_0(a)\equiv 0\), since we only consider \(\psi _0(\omega ,0, \cdot )\)).

Thus, we state that to create a portfolio b at time 0, it is essential to have at least an amount of money \(-\psi _{0}(\omega ,0,-b)\). When liquidating a portfolio a created at time \(T-1\) to obtain funds at time T, one must ensure that the resulting amount of money \(\psi _{T}(\omega ,a,0)\) exceeds the value of b. This rationale underlies the definitions of the cones \(V_{0}\) and \(V_{T}\) in equations (5) and (6).

A sequence \((v_{0},x_{0},x_{1},\ldots ,x_{T-1},v_{T})\) is called a hedging strategy if

  1. (a)

    \((x_{0},x_{1}, \ldots , x_{T-1}) \) is a feasible trading strategy,

  2. (b)

    \(v_{0} \in \mathcal {L}_0^1\) and \((v_{0}(\omega ),x_{0}(\omega ))\in V_{0}(\omega )\),

  3. (c)

    \(v_{T}\in \mathcal {L}_{T}^{1}\) and \((x_{T-1}(\omega ),v_{T}(\omega ))\in V_{T} (\omega )\).

We shall say that an initial endowment \(v_{0}\in \mathcal {L}_0^1\) allows the hedging of a contingent claim \(v_{T}\in \mathcal {L}_{T}^{1}\) if there exists a hedging strategy of the form \((v_{0},x_{0},x_{1},\ldots , x_{T-1},v_{T})\).

Denote by \(\mathcal {H}\) the set of pairs \((v_{0},v_{T})\in \mathcal {L}_{0}^{1} \times \mathcal {L}_{T}^{1}\) such that \(v_{0}\) allows the hedging of \(v_{T}\). The main question we are interested in is how to characterize the set \(\mathcal {H}\). This question is of fundamental importance for asset pricing: the smallest element of the set \( \lbrace v_0: (v_0, v_T) \in \mathcal {H} \rbrace \) represents the hedging price of the contingent claim \(v_T\).

Let us introduce two basic assumptions that will be assumed to hold throughout the paper. Define \(\Lambda _{t,i}^{+}(\omega )=1-\lambda _{t,i}^{+}\) and \(\Lambda _{t,i} ^{-}(\omega )=1+\lambda _{t,i}^{-}\). Then we require the following to hold.

(B1) For each t, there exist constants \(\underline{R}_{t}\), \(\overline{R}_{t}\), \(\underline{\Lambda }_{t}\), \(\overline{\Lambda }_{t}\), \(\overline{D}_{t}\), and \(\overline{C}_{t}\) such that \(0<\underline{R}_{t}\le R_{t,i}(\omega )\le \overline{R}_{t}\), \(0<\underline{\Lambda }_{t}\le \Lambda _{t,i}^{+} (\omega )\), \(\Lambda _{t,i}^{-}(\omega )\le \overline{\Lambda }_{t}\), \(D_{t,i} ^{-}(\omega )\le \overline{D}_{t}\), and \(C_{t}(\omega )\le \overline{C}_{t}\) for all i, \(\omega \).

(B2) For each t, we have \(\mu _{t}>\nu _{t}\) where

$$\begin{aligned} \nu _{t}:=\max \{(\overline{\Lambda }_{t+1}\overline{R}_{t+1}+\overline{D} _{t+1})/(\underline{\Lambda }_{t+1}\underline{R}_{t+1}+\underline{D} _{t+1});\overline{\Lambda }_{t}/\underline{\Lambda }_{t}\} \end{aligned}$$

and \(\underline{D}_{t}\ge 0\) is a constant such that \(\underline{D}_{t}\le D_{t,i}^{+}(\omega )\) for all \(\omega ,i\).

These assumptions are not restrictive, and in fact, (B1) is always satisfied since the probability space is finite.

The following propositions prove some technical properties of the model that will be used in what follows. For convenience of notation, denote

$$\begin{aligned} Z_{0}(\omega ):=V_{0}(\omega ),\quad Z_{T}(\omega ):=V_{T}(\omega ). \end{aligned}$$

The first proposition shows that the model has a property of "strict monotonicity".

Proposition 1

Suppose \((a, b) \in Z_{t}(\omega )\) for some \(t=0, \ldots , T\). Then for any \(a'\ge a\) such that \(a'\ne a\), there exists \(b'\) such that \(b'\ge b\), \(b'\ne b\) and \((a^{\prime }, b^{\prime }) \in Z_{t}(\omega )\).

Proof

We first prove it for \(t=0\). Let \((a,b)\in Z_{0}(\omega )=V_{0}(\omega )\), and \(a^{\prime }>a\). Let \(r=(\frac{a^{\prime } -a}{\overline{\Lambda }_{1}},0,\ldots ,0)\), and \(b^{\prime }=b+r\). As \(X_{t}(\omega )\supseteq R_{+}^{m}\) and is a cone, we have \(b^{\prime }\in X_{0}(\omega )\). Then

$$\begin{aligned} -\psi _{0}(\omega ,0,b^{\prime })=-\psi _{0}(\omega ,0,b+r)\le -\psi _{0} (\omega ,0,b)-\psi _{0}(\omega ,0,r)\le a^{\prime }, \end{aligned}$$

where the first inequality holds because \(\psi _{0}(\omega ,0,x)\) is concave and positively homogeneous (of degree one) in x, and the second inequality holds by using condition (B1) and the fact that \(-\psi _{0}\left( \omega ,0,r\right) =\left( 1+\lambda _{0,1}^{+}\right) \frac{a^{\prime } -a}{\overline{\Lambda }_{1}}\le a^{\prime }-a\). This proves that \((a^{\prime },b^{\prime })\in Z_{0}(\omega )\)

Let us prove it for \(t=T\). Let \((a,b)\in Z_{T}(\omega )=V_{T}(\omega )\), and \(a^{\prime }\ge a\), \(a'\ne a\). We can write \(a^{\prime }=a+r\), where \(r=\left( r^{1},\ldots ,r^{m}\right) \), \(r^{i}\ge 0\) for all i, and \(r^{j}>0\) for some j. Then

$$\begin{aligned} \psi _{T}\left( \omega ,a^{\prime },0\right) \ge \psi _{T}(\omega ,a,0) +\psi _{T}(\omega ,r,0) \ge b+r_{j}\left( \underline{D}_{T}+\underline{R}_{T} \underline{\Lambda }_{T}\right) >b. \end{aligned}$$

Let \(b^{\prime }=b+r_{j}\left( \underline{D}_{T}+\underline{R}_{T} \underline{\Lambda }_{T})\right) \). Then \(\left( a^{\prime },b^{\prime }\right) \in Z_{T}(\omega )\).

We now consider \(t=1,\ldots ,T-1\). Let \((a,b)\in Z_{t}(\omega )\), and \(a^{\prime }\ge a\), \(a'\ne a\). Then we have \(\psi _{t}(\omega ,a,b)\ge C_{t}\), where \(\psi _{t}\) is defined by (3), and \(a^{\prime }=a+r\), where \(r=(r^{1},\ldots ,r^{m})\), \(r^{i}\ge 0\) for all i, and \(r^{j}>0\) for some j. Let \(b^{\prime }=b+r^{\prime }\), where \(r^{\prime }=(0,\ldots ,\underline{R}_{t}r^{j},\ldots ,0)\). Note that \(b^{\prime }\in X_{t}(\omega )\). We will prove that \((a^{\prime },b^{\prime })\in Z_{t}(\omega )\). Indeed,

$$\begin{aligned} \psi _{t}(\omega ,a^{\prime },b^{\prime })\ge \psi _{t}(\omega ,a,b)+\psi _{t}(\omega ,r,r^{\prime } )>C_{t}(\omega ), \end{aligned}$$

where the first inequality holds because \(\psi _{t}(\omega ,x,y)\) is concave and positively homogeneous (of degree one) in (xy), and the second inequality holds in view of condition (B1) and the relation

$$\begin{aligned} \psi _{t}(\omega ,r,r^{\prime })=\sum _{i\ne j}^{m}(1-\lambda _{t,i}^{+})(R_{t,i} r^{i})+(1-\lambda _{t,j}^{+})(R_{t,j}r^{j}-\underline{R}_{t}r^{j})+d_{t}(r)>0. \end{aligned}$$

\(\square \)

Let \(\left| \cdot \right| \) denote the norm of a vector in a finite-dimensional space defined as the sum of the absolute values of its coordinates. For a finite-dimensional vector a, we will denote by \(\mathbb {B}(a,r)\ \)the ball \(\{b:|b-a|\le r\}\). In the next proposition, we prove that the sets \(Z_t\) have non-empty interior.

Proposition 2

For each \(t=1,2,\ldots ,T-1\), there exists an \(\mathcal {F}_{t}\)-measurable vector function \(\mathring{z}_{t}=\) \((\mathring{x}_{t},\mathring{y}_{t})\ \)such that for all \(\omega \in \Omega \), we have

$$\begin{aligned} \mathbb {B}(\mathring{z}_{t},\varepsilon _{t})\subseteq Z_{t}(\omega ), \end{aligned}$$
(7)

where \(\varepsilon _{t}>0\) is some constant.

We will need the following auxiliary result to prove Proposition 2.

Lemma 1

  1. (a)

    For each t there exists a constant \(\tau _{t}>0\) such that if \(a\in X_{t}(\omega )\) then \(|a_{+}|-\nu _{t}|a_{-}|\ge \tau _{t} |a|\).

  2. (b)

    For each t there exist positive constants \(\kappa _{t}^{1}\) and \(\kappa _{t}^{2}\) such that if \(a\in X_{t-1}(\omega )\), \(b\in X_{t}(\omega )\) and \(|b|\le \kappa _{t} ^{1}|a|-\kappa _{t}^{2}\), then \((a,b)\in Z_{t}(\omega )\).

Proof

(a) Observe that \(X_{t}(\omega )\subseteq \tilde{X}_{t}\), where \(\tilde{X}_{t}=\{a\in \mathbb {R}^{m}:\mu _{t}|a_{-}|\le |a_{+}|\}\), and since \(\mu _{t}>1\) we have \(\tilde{X}_{t}\cap (-\tilde{X}_{t})=\{0\}\). The continuous function \(h_{t}(a)=|a_{+}|-\nu _{t}|a_{-}|\) is strictly positive on the compact set \(\hat{X}_{t}=\tilde{X}_{t}\cap \{a:|a|=1\}\). Indeed, since \(h_{t} (a)\ge (\mu _{t}-\nu _{t})|a_{-}|\) on \(\tilde{X}_{t}\), then the equality \(h_{t}(a)=0\) would imply \(|a_{-}|=0\), and hence \(|a_{+}|=h_{t}(a)=0\), so that \(|a|=0\). Then \(h_{t}(a)\) attains a strictly positive minimum on \(\hat{X}_{t}\), which can be taken as \(\tau _{t}\).

(b) Let \(b\in X_{t}(\omega )\). It is straightforward to check that for any numbers xy we have \((x-y)_{+}\ge x_{+}-y_{+}\) and \((x-y)_{-}\le x_{-}+y_{+}\). Using this, we obtain for any \(a\in X_{t-1}(\omega )\)

$$\begin{aligned} \begin{aligned} \psi _{t}(a,b)&\ge \sum _{i=1}^m((\Lambda _{t,i}^{+}R_{t,i}+D_{t,i}^{+})a_{+} ^{i}-(\Lambda _{t,i}^{-}R_{t,i}+D_{t,i}^{-})a_{-}^{i})-\sum _{i=1}^m(\Lambda _{t,i}^{+}+\Lambda _{t,i}^{-})b_{+}^{i}\\&\ge (\underline{\Lambda }_{t}\underline{R}_{t}+\underline{D}_{t} )|a_{+}|-(\overline{\Lambda }_{t}\overline{R}_{t}+\overline{D}_{t} )|a_{-}|-2\overline{\Lambda }_{t}|b_{+}|\\&\ge (\underline{\Lambda }_{t}\underline{R}_{t}+\underline{D}_{t} )(|a_{+}|-\nu _{t-1}|a_{-}|)-2\overline{\Lambda }_{t}|b|\\&\ge \tau _{t-1}(\underline{\Lambda }_{t}\underline{R}_{t}+\underline{D} _{t})|a|-2\overline{\Lambda }_{t}|b|, \end{aligned} \end{aligned}$$
(8)

where the third inequality follows from (B2). Then statement (b) can be fulfilled with the constant \(\kappa _{t}^{1} =\tau _{t-1}(\underline{\Lambda }_{t}\underline{R}_{t}+\underline{D} _{t})/(2\overline{\Lambda }_{t})\) and \(\kappa _{t}^{2}=\overline{C} _{t}/2\overline{\Lambda }_{t}\), since in that case \(\psi _{t}(a,b)\ge C_{t}\), implying \((a,b)\in Z_{t}\). \(\square \)

Proof of Proposition 2

Let \(\mathring{x}_{t}=\frac{2\kappa _{t}^{2} }{\kappa _{t}^{1}m}(1,...,1)\in \mathbb {R}^{m}\). Put \(\mathring{z} _{t}=(\mathring{x}_{t},\mathring{y}_{t})\) with \(\mathring{y}_{t}=(\kappa _{t}^{1}/4)\mathring{x}_{t}\). Note that \(|\mathring{x}_{t}|=2\kappa _{t} ^{2}/\kappa _{t}^{1}\), and \(|\mathring{y}_{t}|=\kappa _{t}^{2}/2<\kappa _{t} ^{1}|\mathring{x}_{t}|-\kappa _{t}^{2}=\kappa _{t}^{2}\), thus statement (b) of Lemma 1 implies \(\mathring{z}_{t}\in Z_{t}\). Observe that there exists \(\delta _{t}>0\) such that \(\mathbb {B}(\mathring{z}_{t},\delta _{t} )\subset \mathbb {R}_{+}^{2m}\) and therefore \(\mathbb {B}(\mathring{z}_{t},\delta _{t})\subset X_{t-1}\times X_{t}\). Since \(|\mathring{y}_{t}|<\kappa _{t}^{1}|\mathring{x}_{t}|-\kappa _{t}^{2}\), then one can find \(0<\varepsilon _t \le \delta _t\) such that \(|b| \le \kappa _{t} ^{1}|a|-\kappa _{t}^{2}\) for any \((a,b) \in \mathbb {B}(\mathring{z_t},\varepsilon _t)\). Indeed, we have

$$\begin{aligned} |b|\le & {} |b-\mathring{y}_{t}|+|\mathring{y}_{t}| \le \varepsilon _t+\kappa _{t}^{2}/2=\varepsilon _t+2\kappa _{t}^{2}-3 \kappa _{t}^{2}/2\\\le & {} \varepsilon _t +\kappa _{t}^{1} |a|+\kappa _{t}^{1} \varepsilon _t -3 \kappa _{t}^{2}/2 \le \kappa _{t}^{1}|a|-\kappa _{t}^{2}. \end{aligned}$$

The third inequality holds because \( |a| \ge |\mathring{x}_{t}|-\varepsilon _t \), and the last inequality holds as long as \( \varepsilon _t \le \kappa _{t}^{2}/2(1+\kappa _{t}^{1}) \). Hence, \(\mathring{z}_{t}\ \)and \(\varepsilon _{t}\) satisfy conditions of proposition. \(\square \)

Proposition 3

For any \(b\in {X}_{t}(\omega )\), we have \(\psi _{t}(\omega ,0,-b)\ge 0\), and \(\psi _{t}(\omega ,0,b)\le 0\). If \( b \ne 0 \), then \(\psi _{t}(\omega ,0,-b) >0\), and \(\psi _{t}(\omega ,0,b) <0\).

Proof

This follows from conditions (B1), (B2) and Lemma 1. Indeed,

$$\begin{aligned} \psi _{t}(\omega ,0,-b)= & {} \sum _{i=1}^{m}(1-\lambda _{t,i} ^{+}(\omega ))b_{+}^{i}-\sum _{i=1}^{m}(1+\lambda _{t,i}^{-}(\omega ))b_{-}^{i}\\\ge & {} \underline{\Lambda }_{t} |b_{+}|-\overline{\Lambda }_{t}|b_{-}| \ge \underline{\Lambda }_{t}(|b_{+}|-\nu _{t}|b_{-}|)\ge \underline{\Lambda }_{t}\tau _{t} |b|. \end{aligned}$$

In this chain, the first inequality follows from (B1), the second follows from (B2), and the last one follows from Lemma 1.

The second claim of the proposition follows from the chain of relations

$$\begin{aligned} -\psi _{t}(\omega ,0,b)= & {} \sum _{i=1}^{m}(1+\lambda _{t,i}^{-}(\omega ))b_{+}^{i}-\sum _{i=1}^{m}(1-\lambda _{t,i} ^{+}(\omega ))b_{-}^{i}\\\ge & {} |b_{+}|-|b_{-}| \ge |b_{+}|-\nu _{t}|b_{-}|\ge \tau _{t} |b|. \end{aligned}$$

\(\square \)

4 Consistent discount factors and solution of the hedging problem

Let us call consistent discount factors with a transaction costs term (hereinafter, simply consistent discount factors) any triple \((q_0,q_T,r)\), where \(q_0\in \mathcal {L}_0^1\), \(q_T\in \mathcal {L}_T^1\) are strictly positive functions and \(r\ge 0\) is a constant such that

$$\begin{aligned} E q_0 v_{0}\ge Eq_{T}v_{T} + r\ \text {for any}\ (v_{0},v_{T})\in \mathcal {H}. \end{aligned}$$

The next theorem provides the general solution to the hedging problem in terms of consistent discount factors.

Theorem 1

(a) Consistent discount factors exist.

(b) We have \((v_0,v_T) \in \mathcal {H}\) if and only if \(E q_0v_0 \ge Eq_T v_T + r\) for all consistent discount factors \((q_0,q_T,r)\).

To prove the theorem, we will need several auxiliary definitions and preparatory results.

In what follows, let us fix the market prices, proportional transaction cost rates and dividend yields, but allow the transaction costs \(C_t\ge 0\) to vary. Introduce the set \(\mathcal {U} \subseteq \mathcal {L}_0^1\times \mathcal {L}_1^1\times \dots \times \mathcal {L}^1_{T-1}\times \mathcal {L}_T^1\) which consists of sequences \(u=(v_0,C_1,\dots ,C_{T-1},v_T)\) such that in the market with transaction costs \(C_t\) the initial endowment \(v_0\) allows the hedging of the contingent claim \(v_T\). Observe that \(\mathcal {U}\) is a cone.

Proposition 4

The cone \(\mathcal {U}\) is closed.

To prove this result, we first establish a lemma.

Lemma 2

There exist a constant \(\gamma >0\) not depending on the transaction costs \(C_t\) such that for any hedging strategy \((v_0, x_0,\ldots , x_{T-1}, v_T )\) it holds that \(|x_t| \le \gamma v_0\) for all \(t = 0,\ldots , T-1\).

Proof

First observe that there exists a constant \(\delta >0\) such that if \(b \in X_t(\omega )\), then \(|b| \le \delta \sum _{i=1}^m b^i\). Indeed, since the underlying probability space is finite, we can find a constant \(\underline{\mu }> 1\) such that \(\underline{\mu }\le \mu _t\) for all \(t=0,\dots ,T-1\). If \(b\in X_t(\omega )\), then, as follows from the definition of \(X_t\), we have \(|b_+| \ge \mu _t |b_-| \ge \underline{\mu }|b_-|\). Consequently, \(\sum _{i=1}^mb^i = |b_+| - |b_-| \ge (1-\frac{1}{\underline{\mu }})|b_+|\) and \(|b| = |b_+| + |b_-| \le (1+\frac{1}{\underline{\mu }})|b_+|\). Hence, we can take \(\delta = (\underline{\mu }+1)/(\underline{\mu }-1)\).

Next observe that the inequality \(\psi _t(a,b) \ge c\) implies

$$\begin{aligned} \sum _{i=1}^m (R_{t,i} a^i - b^i + D_{t,i}^- a^i) \ge c. \end{aligned}$$

Let \(\overline{R} = \max _t \overline{R}_t\) and \(\overline{D} = \max _t \overline{D}_t\) be constants such that \(R_{t,i} \le \overline{R}\) and \(D_{t,i}^- \le \overline{D}\) for all \(t=1,\dots ,T-1\) and \(i=1,\dots ,m\). Consequently, if \(\psi _t(a,b) \ge c\) and \(b\in X_t(\omega )\), then

$$\begin{aligned} |b| \le \delta \sum _{i=1}^m b^i \le \delta ((\bar{R} + \bar{D})|a| - c). \end{aligned}$$

Suppose \((v_0, x_0,\ldots , x_{T-1}, v_T )\) is a hedging strategy. For \(t=0\), from the condition \(\psi (0,x_0) \ge -v_0\), we find that \(|x_0| \le \delta v_0\). For \(t=1,\dots ,T-1\), the condition \(\psi _t(x_{t-1},x_t) \ge C_t\) implies \(|x_t| \le \delta (\bar{R} + \bar{D})|x_{t-1}|\), since it is assumed that \(C_t\ge 0\). Now the claim of the proposition follows by induction. \(\square \)

Proof of Proposition 4

Consider \(u^n=(v_0^n,C_1^n,\dots ,C_{T-1}^n,v_T^n)\in \mathcal {U}\) such that \(u^n\rightarrow u = (v_0,C_1,\dots ,C_{T-1},v_t)\). We have to show that \(u\in \mathcal {U}\).

Let \(h^n=(v_0^n,x_0^n,\dots ,x_{T-1}^n,v_T^n)\) be the corresponding hedging strategies. From Lemma 2, it follows that \(x_t^n\) are bounded sequences for each \(t=0,\dots ,T-1\). The sequences \(v_0^n\) and \(v_T^n\) are also bounded, which follows from the convergence \(u^n\rightarrow u\).

Consequently, it is possible to choose a convergent subsequence \(h^{n_k} \rightarrow h = (v_0,x_0,\dots ,x_{T-1},v_T)\). In view of the continuity of the functions \(\psi _t(\omega ,a,b)\), we have \(v_0 \ge -\psi _0(0,x_0)\), \(\psi _t(x_{t-1},x_t) \ge C_t\) for all \(t=1,\dots ,T-1\), and \(\psi _T(x_{T-1},0) \ge v_T\). Thus h is hedging strategy in the model with transaction costs \(C_t\). Consequently, \(u \in \mathcal {U}\). \(\square \)

Next introduce the closed cone

$$\begin{aligned} \mathcal {K}= & {} \{ (\tilde{v}_0,C_1,\dots ,C_{T-1},v_{T}):\\{} & {} \tilde{v}_0\in \mathcal {L}_0^1,\ \tilde{v}_0\le 0,\ C_t \in \mathcal {L}_t^1,\ C_t\ge 0\ \text {for all}\,t,\ v_{T} \in \mathcal {L}_{T}^{1}, \ v_{T} \ge 0 \}. \end{aligned}$$

The following result can be regarded as the no-arbitrage hypothesis for the model at hand.

Proposition 5

We have \(\mathcal {U} \cap \mathcal {K} = \{0\}\).

Proof

If \(u = (v_0,x_0,\dots ,x_{T-1},v_T) \in \mathcal {U} \cap \mathcal {K}\), then necessarily \(v_0=0\) as follows from the definition of a hedging strategy. Proposition 2 implies that in this case \(x_0=\dots =x_{T-1} = 0\). Consequently, the inequality \(\psi _t(x_{t-1},x_t) \ge C_t\) implies that \(C_t=0\) for all t, and \(v_T\le 0\). Thus \(u=0\). \(\square \)

Proposition 6

Let \(u=(v_0,C_1,\dots ,C_{T-1},v_T)\in \mathcal {U} - \mathcal {K}\) be such that \(C_t\ge 0\) for all \(t=1,\dots ,T\). Then \(u\in \mathcal {U}\).

Proof

Let \(u=u' - k\), where \(u'=(v_0',C_1',\dots ,C_{T-1}',v_T')\in \mathcal {U}\) and \(k\in \mathcal {K}\). Denote by \(h = (v_0',x_0,\dots ,x_{T-1},v_T')\) the corresponding hedging strategy which allows the hedging of the contingent claim \(v_T'\) from the initial endowment \(v_0'\) in the market with transaction costs \(C_t'\).

We have \(v_0' \ge -\psi _0(0,x_0)\), hence \(v_0\ge -\psi _0(0,x_0)\), since \(v_0\ge v_0'\). From the condition \(\psi _t(x_{t-1},x_t) \ge C_t'\), we find that \(\psi _t(x_{t-1},x_t) \ge C_t\) for all \(t=1,\dots ,T-1\). Finally, as \(\psi _T(x_{T-1},0) \ge v_T'\), we have \(\psi _T(x_{T-1},0) \ge v_T\) because \(v_T'\ge v_T\).

As a result, we see that h is a strategy which allows to hedge \(v_T\) from the initial endowment \(v_0\) in the market with transaction costs \(C_t\). Consequently, \(u\in \mathcal {U}\). \(\square \)

Proof of Theorem 1

We apply Proposition 7 (see Appendix) to the cones \(\mathcal {U}\) and \( \mathcal {K}\). The conditions of applicability of this proposition are met in view of the above results.

Both cones \(\mathcal {U}\) and \(\mathcal {K}\) are contained in the linear space \(\mathcal {L}_0^1\times \mathcal {L}_1^1\times \dots \times \mathcal {L}_T^1\). Since the probability space is finite, this space is finite dimensional.

Any linear functional l(u) on this space can be represented as

$$\begin{aligned} l(u) = -Eq_0v_0 + \sum _{t=1}^{T-1} E q_tC_t + E q_T v_T, \end{aligned}$$
(9)

where \(q_t \in \mathcal {L}_{t}^{1}\).

By virtue of Proposition 7, there exists a linear functional l of the form (9) such that \( l(u) > 0\) for any \(u\in \mathcal {K} \backslash \{0\} \), and \( l(u)\le 0 \) for any \( u \in \mathcal {U} \). The first property implies that all \(q_t>0\). Then the triple \((q_0,q_T,r)\) with \(r = \sum _{t=1}^{T-1} E q_t C_t\) is a consistent discount factor in the model with given transaction costs \(C_t\). This proves the first claim of the theorem.

Let us prove the second claim. If \((v_0,v_T) \in \mathcal {H}\), then \(E q_0 v_0\ge E q_Tv_T + r\) for any consistent discount factors as follows from their definition. Conversely, suppose \(E q_0 v_0\ge E q_Tv_T + r\) for any consistent discount factors. Then, in particular, this inequality holds for any \((q_0,q_T,v)\) for which there exists a functional l(u) defined through factors \((q_0,q_1,\dots ,q_T)\) by formula (9) such that \(r = \sum _{t=1}^{T-1} E q_t C_t\) and which is strictly positive on \(\mathcal {K}\setminus \{0\}\) and non-positive on \(\mathcal {U}\). Let \(u=(v_0,C_1,\dots ,C_{T-1},v_T)\) with given transaction costs \(C_t\). By Proposition 8, we have \(u\in \mathcal {U} - \mathcal {K}\). Then \(u\in \mathcal {U}\) by Proposition 6. Consequently, \((v_0,v_T) \in \mathcal {H}\). This proves the second claim. \(\square \)

5 Consistent valuation systems

Now we provide a solution of the hedging problem in terms of consistent valuation systems, a notion which extends the notion of consistent discount factors.

A consistent price system with transaction costs terms (which we will call simply a consistent price system) is pair consisting of a sequence of non-negative vector functions \(p_{0}(\omega ),\ldots ,p_{T}(\omega )\) such that \(p_{t}\in \mathcal {L}_{t}^{m}\), and a sequence of non-negative constants \(r_1,\dots ,r_{T-1}\) which satisfy the relations

$$\begin{aligned}&\bar{p}_{1}(\omega )a \le p_{0}(\omega )a\ \text {for all } a\in X_{0}(\omega ), \end{aligned}$$
(10)
$$\begin{aligned}&\bar{p}_{t+1}(\omega )b + r_t\le p_{t}(\omega )a\ \text {for all }(a,b)\in Z_{t}(\omega ),\ t=1,...,T-1, \end{aligned}$$
(11)

where \(\bar{p}_{t+1}(\omega ):=E_{t}p_{t+1}(\omega )\) and \(E_{t}(\cdot )=E(\cdot |\mathcal {F}_{t})\).

Note that by virtue of conditions (10) and (11), for any trading strategy \( x_0,x_1,\ldots ,x_{T-1} \), the sequence

$$\begin{aligned} p_0x_0, \ p_1x_0,\ p_2x_1 + r_1,\ p_3x_2 + r_1 + r_2,\ \ldots \, p_Tx_{T-1} + (r_1+\dots +r_{T-1}) \end{aligned}$$

is a non-negative supermartingale.

The idea of the notion of a consistent price system goes back to the notion of competitive prices supporting competitive paths maximizing profits over each time period \(t-1, \ t\) in the theory of economic dynamics – see, e.g. Malinvaud (1953); Radner (1967); Gale (1967); Peleg (1974); Dasgupta and Mitra (1999), and Clark (2008). Consistent price systems generalize the concept of an equivalent martingale measure in classical no-arbitrage criteria pertaining to frictionless markets, see Dempster et al. (2006).

Let \(q_{0}\in \mathcal {L}_{0}^{1}\) and \(q_{T}\in \mathcal {L}_{T}^{1}\) be strictly positive scalar functions, and \(((p_{0},\ldots , p_{T}),\ (r_1,\dots ,r_{T-1}))\) be a consistent price system. Then the pair of sequences \(((q_0,p_{0},\ldots , p_{T},q_T),\ (r_1,\dots ,r_{T-1}))\) is called a consistent valuation system if

$$\begin{aligned}&p_0(\omega )b \le q_0(\omega )a\ \text {for all}\ (a,b) \in V_0(\omega ), \end{aligned}$$
(12)
$$\begin{aligned}&p_T(\omega ) a \ge q_T(\omega )b\ \text {for all}\ (a,b) \in V_T(\omega ). \end{aligned}$$
(13)

According to (12) and (13), the prices and discount factors under consideration are such that the profit one can get in the course of portfolio creation or liquidation cannot be strictly positive.

Theorem 2

The following claims hold true.

  1. (a)

    If \(((q_0, p_0, \ldots , p_T, q_T),\ (r_1,\dots ,r_{T-1}))\) is a consistent valuation system, then \((q_0, q_T, r)\), where \(r=r_1+\ldots +r_{T-1}\), are consistent discount factors.

  2. (b)

    If \((q_0, q_T, r)\) are consistent discount factors, then there exists a consistent valuation system \(((q_0, p_0, \ldots , p_T, q_T),\ (r_1,\dots ,r_{T-1}))\) with \(r=r_1+\ldots +r_{T-1}\).

From Theorems 1 and 2, we immediately get the following corollary.

Corollary 1

Consistent valuation systems exist and we have \((v_0,v_T) \in \mathcal {H}\) if and only if \(E q_0 v_0 \ge Eq_T v_T + \sum _{t=1}^{T-1} r_t\) for all consistent valuation systems \(((q_0, p_0, \ldots , p_T, q_T),\ (r_1,\dots ,r_{T-1}))\).

In the course of proof of Theorem 2, the following notion will be needed. We shall call a sequence of vector functions

$$\begin{aligned} \xi := \lbrace v_0,x_0,y_0,x_1,y_1,\ldots , x_{T-1},y_{T-1},x_T,v_T \rbrace , \end{aligned}$$
(14)

where

$$\begin{aligned} v_0 \in \mathcal {L}_{0}^{1}, \ (x_t,y_t) \in \mathcal {L}_{t}^{m} \times \mathcal {L}_{t}^{m} \ [t=0,\ldots , T-1], \ x_T \in \mathcal {L}_{T}^{m},\ v_T\in \mathcal {L}_T^1, \end{aligned}$$
(15)

a generalized hedging strategy if

$$\begin{aligned}&(v_0,x_0) \in V_0(\omega ), \quad y_0 \in X_0(\omega ), \end{aligned}$$
(16)
$$\begin{aligned}&(x_t,y_t) \in Z_t(\omega ) \ [t=1,\ldots ,T-1],\end{aligned}$$
(17)
$$\begin{aligned}&(x_T,v_T) \in V_T(\omega ),\end{aligned}$$
(18)
$$\begin{aligned}&x_0 \ge y_0, \ y_0 \ge x_1, \ y_1 \ge x_2, \ldots , \ y_{T-1} \ge x_T. \end{aligned}$$
(19)

Note that if the inequalities in (19) hold as equalities, we obtain an ordinary hedging strategy.

Lemma 3

For any generalized hedging strategy, there is a hedging strategy \( (v'_0,y'_0,\ldots ,y'_{T-1},v'_T) \) such that \( v'_0=v_0, \ y'_t \ge y_t \ [t=0,\ldots ,T-1]\) and \( v'_T \ge v_T \).

Proof

This is easily proved by induction using Proposition 1. \(\square \)

Lemma 4

There exists a generalized hedging strategy

$$\begin{aligned} \hat{\xi }= \lbrace \hat{v}_0,\hat{x}_0,\hat{y}_0,\hat{x}_1,\hat{y}_1,\ldots , \hat{x}_{T-1},\hat{v}_{T-1},\hat{x}_T,\hat{v}_T \rbrace \end{aligned}$$

such that

$$\begin{aligned} \hat{x}_0> \hat{y}_0, \ \hat{y}_0>\hat{x}_1, \ \hat{y}_1> \hat{x}_2, \ldots , \ \hat{y}_{T-1} > \hat{x}_T. \end{aligned}$$

Furthermore, there exists a hedging strategy \( (v'_0,y'_0,\ldots ,y'_{T-1},v'_T) \) such that \( v'_0>0, \ y'_t >0 \ [t=0,\ldots ,T-1]\) and \( v'_T >0 \).

Proof

In the proof of Proposition 2, we have constructed constant vectors \(\mathring{x}_{t}\) and \(\mathring{y}_{t}\) for each \(t = 1,2,\ldots ,T-1\). These vectors can be expressed as \(\mathring{x}_{t} = \alpha _t(1,\ldots ,1)\) and \(\mathring{y}_{t} = \beta _t(1,\ldots ,1)\), where \(\alpha _t\) and \(\beta _t\) are positive constants. Importantly, for all \(\omega \in \Omega \), it holds that \((\mathring{x}_{t},\mathring{y}_{t})\in Z_{t}(\omega )\).

Now, let us proceed with a backward induction argument. We begin by setting \(( \hat{x}_{T-1},\hat{y}_{T-1} )=( \mathring{x}_{T-1},\mathring{y}_{T-1})\), \(\hat{x}_T=\frac{1}{2} \mathring{y}_{T-1}\), and \(\hat{v}_T= \psi _{T}(\omega ,\hat{x}_T,0)\). Note that \( ( \hat{x}_T,\hat{v}_T) \in V_T(\omega )\), \(\hat{y}_{T-1} > \hat{x}_T \), \( (\hat{x}_{T-1},\hat{y}_{T-1}) \in Z_{T-1}(\omega ) \), and according to Proposition 3, \( \hat{v}_T>0 \).

Now, if \( \mathring{y}_{T-2} > \mathring{x}_{T-1} \) (\( \beta _{T-2}>\alpha _{T-1} \)), we can simply set \( (\hat{x}_{T-2},\hat{y}_{T-2})=(\mathring{x}_{T-2},\mathring{y}_{T-2})\). However, if \( \mathring{y}_{T-2} \le \mathring{x}_{T-1} \), we can find a \(\lambda >1\) such that \( \lambda \mathring{y}_{T-2} > \mathring{x}_{T-1} \). In this case, we set \( (\hat{x}_{T-2},\hat{y}_{T-2})=\lambda (\mathring{x}_{T-2},\mathring{y}_{T-2})\). Since \( \lambda >1 \) and \((\mathring{x}_{T-2},\mathring{y}_{T-2})\in Z_{T-2}(\omega )\), it follows that \(\ (\hat{x}_{T-2},\hat{y}_{T-2}) \in Z_{T-2}(\omega ) \).

By induction, let us assume that we have already constructed vectors \( \hat{x}_{1} \) and \( \hat{y}_{1} \) such that \( ( \hat{x}_{1},\hat{y}_{1}) \in Z_{1}(\omega ) \), and \( \hat{y}_{1} > \hat{x}_{2} \). Now, let \( \hat{y}_{0} =2 \hat{x}_{1}\), \( \hat{x}_{0}=4\hat{x}_{1} \), and \( v_0=-\phi _{0}(\omega ,-\hat{x}_{0}) \). Notably, according to Proposition 3, \( \hat{v}_0>0 \). With this, we have successfully constructed

$$\begin{aligned} \hat{\xi }= \lbrace \hat{v}_0,\hat{x}_0,\hat{y}_0,\hat{x}_1,\hat{y}_1,\ldots ,\hat{x}_{T-1},\hat{v}_{T-1},\hat{x}_T,\hat{v}_T \rbrace \end{aligned}$$

that satisfies all the required conditions. The second part of the proof is immediate from Lemma 3. \(\square \)

Lemma 4 is an analogue of Slater’s constraint qualification. It says that there exists a generalized hedging strategy (which can be regarded as a hedging strategy with consumption) such that an investor following it can sell, with a view to consumption, strictly positive amounts of assets of each type at every date. In the literature, such models are called regular (see, e.g., Dempster et al. 2006).

Now we are ready to give a proof of Theorem 2.

Proof of Theorem 2

Claim (a) clearly follows from the definitions of consistent valuation systems and consistent discount factors.

Let us prove claim (b). We will mainly follow the lines of the proof of Theorem 7.1 in Dempster et al. (2006). Consider consistent discount factors \((q_0,q_T,r)\). By virtue of their definition, we have \(E q_Tv_T - E q_0v_0 + r \le 0\) for all \((v_0,v_T)\in \mathcal {H}\).

Denote by \(\Xi \) the set of generalized hedging strategies. For \(\xi \in \Xi \), define

$$\begin{aligned} F(\xi ) = Eq_Tv_T - Eq_0v_0 + r. \end{aligned}$$

In view of Lemma 3, we have \(F(\xi ) \le 0\) for any \(\xi \in \Xi \). Thus, the maximum of F over \(\Xi \) is equal to zero (the maximum is attained since one can increase \(v_0\) without changing the other components of \(\xi \)). By applying the Kuhn–Tucker theorem (see Proposition 9) to this maximization problem, we relax constraints (19) in the definition of a generalized hedging strategy. The Kuhn–Tucker theorem can be applied in view of the regularity of the model.

As a result, there exist non-negative functions \(p_t\in \mathcal {L}_t^m\) such that

$$\begin{aligned} E q_Tv_T - Eq_0v_0 + r + E p_0(x_0-y_0) + \sum _{t=1}^T E p_t(y_{t-1}-x_t) \le 0 \end{aligned}$$

for all sequences \((v_0,x_0,y_0,x_1,y_1,\dots ,x_{T-1},y_{T-1},x_T,v_T)\) satisfying conditions (15)–(18). Rearranging the terms, we obtain

$$\begin{aligned}{} & {} E (p_0x_0 - Eq_0v_0) + (Ep_1y_0 - Ep_0y_0) \nonumber \\{} & {} \quad + \sum _{t=1}^{T-1}(Ep_{t+1}y_t - Ep_tx_t) + (Eq_Tv_T -Ep_Tx_T) \le -r. \end{aligned}$$
(20)

Let \(-r_0\), \(-r_0'\), \(-r_1,\dots ,-r_{T-1}\), and \(-r_T\) denote the maximal values of the corresponding terms in the left-hand side of this inequality, which are taken over, respectively, \((x_0,y_0) \in V_0\), \(y_0\in X_0\), \((x_t,y_t)\in Z_t\), \(t=1,\dots ,T-1\), and \((x_T,v_T)\in V_T\). Since \(V_0\), \(X_0\) and \(V_T\) are cones, we have \(r_0=r_0'=r_T=0\). Therefore, \(\sum _{t=1}^{T-1} r_t \ge r\). Also observe that if \((x_t,y_t)\in Z_t\), then \((\alpha x_t, \alpha y_t)\in Z_t\) for any \(\alpha \ge 1\), which implies that \(r_1,\dots ,r_{T-1}\) should be non-negative.

Consequently, we have inequalities

$$\begin{aligned}&E p_0x_0 - Eq_0v_0 \le 0\ \text {for all}\ (v_0,x_0)\in V_0(\omega ), \end{aligned}$$
(21)
$$\begin{aligned}&E \bar{p}_1 y_0 - Ep_0y_0 \le 0\ \text {for all}\ y_0\in X_0(\omega ),\end{aligned}$$
(22)
$$\begin{aligned}&E \bar{p}_{t+1} y_t - Ep_tx_t \le -r_t\ \text {for all}\ (x_t,y_t)\in Z_t(\omega ),\ t=1,\dots ,T-1,\end{aligned}$$
(23)
$$\begin{aligned}&E q_Tv_T -Ep_Tx_T \le 0\ \text {for all}\ (x_T,v_T)\in V_T(\omega ). \end{aligned}$$
(24)

Then (21), (22), (23) and (24) imply (12), (10), (11) and (13), respectively. As a result, \(((q_0, p_0,\ldots , p_T , q_T ),\ (r_1,\dots ,r_{T-1}))\) is a consistent valuation system.

Finally, observe that if we decrease some of the values \(r_t\) (but keeping them non-negative), we will still have a consistent valuation system. Thus, we can choose \(r_1,\dots ,r_{T-1}\) such that \(r_1+\dots +r_{T-1}=r\). \(\square \)