Keywords

1 Introduction

In incomplete markets, a basic Black-Scholes perspective has to be complemented by the valuation of market imperfections. Otherwise this results in Black-Scholes Ponzi schemes, such as the ones at the core of the last global financial crisis, where always more derivatives need to be issued for remunerating the capital attracted by the already opened positions. In this paper we consider the sustainable Black-Scholes equations that arise for a portfolio of options if one adds to their trade additive Black-Scholes price, on top of a nonlinear funding cost, the cost of remunerating at a hurdle rate the residual risk left by imperfect hedging. We assess the impact of model uncertainty in this setup.

Section 2 revisits the pricing of a book of options accounting for cost of capital and cost of funding, which are material in incomplete markets. Section 3 specializes the pricing equations to a Markovian Black–Scholes setup. Section 4 assesses the impact of model risk in a UVM (uncertain volatility model) setup. Section 5 refines the model risk add-ons by accounting for calibrability constraints.

We consider a portfolio of options made of ω i vanilla call options of maturity T i and strike K i on a stock S, with 0 < T 1 < < T n = T. Note that, if a corporate holds a bank payable, it typically has an appetite to close it, receive cash, and restructure the hedge otherwise with a par contract (the bank would agree to close the deal as a market maker, charging fees for the new trade). Because of this natural selection, a bank is mostly in the receivables (i.e. “ω i ≥ 0”) in its derivative business with corporates.

We write x ± = max(±x, 0).

2 Cost of Capital and Cost of Funding

2.1 Cost of Capital

In presence of hedging imperfections resulting in a nonvanishing loss (and profit) process ϱ of the bank, a conditional risk measure EC = EC t (ϱ) must be dynamically computed and reserved by the bank as economic capital.

It is established in Albanese et al. (2016, Sect. 5) that the capital valuation adjustment (KVA) needed by the bank in order to remunerate its shareholders for their capital at risk at some average hurdle rate h (e.g. 10%) at any point in time in the future is:

$$\displaystyle\begin{array}{rcl} \begin{array}{l} \mbox{ KVA} = \mbox{ KVA}_{t}(\varrho ) = h\mathbb{E}_{t}\int _{t}^{T}e^{-(r+h)(s-t)}\mbox{ EC}_{s}(\varrho )ds,\end{array} & &{}\end{array}$$
(1)

where \(\mathbb{E}_{t}\) stands for the conditional expectation with respect to some probability measure \(\mathbb{Q}\) and model filtration.

In principle, the probability measure used in capital and cost of capital calculations should be the historical probability measure. But, in the present context of optimization of a portfolio of derivatives, the historical probability measure is hard to estimate in a relevant way, especially for long maturities. As a consequence, we do all our price and risk computations under a risk-neutral measure \(\mathbb{Q}\) calibrated to the market (or a family of pricing measures, in the context of model uncertainty later below), assuming no arbitrage.

2.2 Cost of Funding

Let r t denote a risk-free OIS short-term interest rate and \(\beta _{t} = e^{-\int _{0}^{t}r_{ s}ds}\) be the corresponding risk-neutral discount factor. We assume that the bank can invest at the risk-free rate r but can only obtain unsecured funding at a shifted rate r + λ > r. This entails funding costs over OIS and a related funding valuation adjustment (FVA) for the bank. Given our focus on capital and funding in this paper, we ignore counterparty risk for simplicity, so that λ is interpreted as a pure funding liquidity basis. In order to exclude arbitrages in the primary market of hedging instruments, we assume that the vector gain process \(\mathcal{M}\) of unit positions held in the hedging assets is a risk-neutral martingale. The bank “marks to the model” its derivative portfolio, assumed bought from the client at time 0, by means of an FVA-deducted value process \(\Theta\). The bank may also set up a (possibly imperfect) hedge ( −η) in the hedging assets, for some predictable row-vector process η of the same dimension as \(\mathcal{M}\). We assume that the depreciation of \(\Theta\), the funding expenditures and the loss \(\eta d\mathcal{M}\) on the hedge, minus the option payoffs as they mature, are instantaneously realized into the loss(-and-profit) process ϱ of the bank. In particular, at any time t, the amount on the funding account of the bank is \(\Theta _{t}.\) Moreover, we assume that the economic capital can be used by the trader for her funding purposes provided she pays to the shareholders the OIS rate on EC that they would make otherwise by depositing it (assuming it all cash for simplicity).

Note that the value process \(\Theta\) of the trade already includes the FVA as a deduction, but ignores the KVA, which is considered as a risk adjustment computed in a second step (in other words, we assume that the trader’s account and the KVA account are kept separate from each other). Rephrasing in mathematical terms the above description, the loss equation of the trader is written, for t ∈ (0, T], as (starting from ϱ 0 = y, the accrued loss of the portfolio):

$$\displaystyle\begin{array}{rcl} \begin{array}{rl} &d\varrho _{t} = -\mathop{\underbrace{\sum _{i}\omega _{i}(S_{T_{i}} - K_{i})^{+}\delta _{T_{i}}(dt)}}\limits _{\mbox{ call payoffs}} \\ & +\mathop{\underbrace{ r_{t}\mbox{ EC}_{t}(\varrho )dt}}\limits _{\mbox{ Payment of internal lending of the EC funding source at OIS rate}} \\ & +\mathop{\underbrace{ \Big((r_{t} +\lambda _{t})\big(\Theta _{t} -\mbox{ EC}_{t}(\varrho )\big)^{+} - r_{t}\big(\Theta _{t} -\mbox{ EC}_{t}(\varrho )\big)^{-}\Big)dt}}\limits _{\mbox{ portfolio funding costs/benefits}} \\ & +\mathop{\underbrace{ (-d\Theta _{t})}}\limits _{\mbox{ depreciation of}\ \Theta } +\mathop{\underbrace{ \eta _{t}d\mathcal{M}_{t}}}\limits _{\mbox{ loss on the hedge}} \\ & = -d\Theta _{t} -\sum _{i}\omega _{i}(S_{T_{i}} - K_{i})^{+}\delta _{T_{i}}(dt) +\Big (\lambda _{t}\big(\Theta _{t} -\mbox{ EC}_{t}(\varrho )\big)^{+} + r_{t}\Theta _{t}\Big)dt +\eta _{t}d\mathcal{M}_{t}.\end{array} & &{}\end{array}$$
(2)

Hence, a no-arbitrage condition that the loss process ϱ of the bank should follow a risk-neutral martingale (assuming integrability) and the terminal condition \(\Theta _{T} = 0\) lead to the following FVA-deducted risk-neutral valuation BSDE:

$$\displaystyle\begin{array}{rcl} \begin{array}{rl} &\Theta _{t} =\mathop{\underbrace{ \mathbb{E}_{t}\Big[\sum _{t<T_{i}}\beta _{t}^{-1}\beta _{T_{i}}\omega _{i}(S_{T_{i}} - K_{i})^{+}\Big]}}\limits _{\Theta _{t}^{0}} -\mathop{\underbrace{ \mathbb{E}_{t}\Big[\int _{t}^{T}\beta _{t}^{-1}\beta _{s}\lambda _{s}\big(\Theta _{s} -\mbox{ EC}_{s}(\varrho )\big)^{+}ds\Big]}}\limits _{\mbox{ FVA}_{ t}},\,t \in \left [0,T\right ]\end{array}& &{}\end{array}$$
(3)

(since we consider a portfolio of options with several maturities, we treat option pay-offs as cash-flows at their maturity times rather than a terminal condition in the equations, in particular \(\Theta _{T} = 0\)).

The funding source provided by economic capital creates a feedback loop from EC into FVA, which makes the FVA smaller.

Note that, in the usual case of a risk measure EC only affected by the time fluctuations of ϱ, the Eqs. (3) and in turn (1) are independent of the accrued loss y, which eventually does not affect \(\Theta\) nor the KVA.

If λ = 0, then, whatever the hedge η, \(\Theta\) reduces to \(\Theta ^{0},\) which corresponds to the usual trade additive (linear) no-arbitrage pricing formula for a portfolio of options, with zero FVA, but with a KVA given by (1), depending on the hedge η. 

If λ ≠ 0, we introduce the following backward SDE:

$$\displaystyle\begin{array}{rcl} \begin{array}{rl} &\Theta _{t}^{\star } = \mathbb{E}_{t}\Big[\sum _{t<T_{i}}\beta _{t}^{-1}\beta _{T_{i}}\omega _{i}(S_{T_{i}} - K_{i})^{+} -\int _{t}^{T}\beta _{t}^{-1}\beta _{s}\lambda _{s}(\Theta _{s}^{\star })^{+}ds\Big],\,t \in \left [0,T\right ].\end{array} & &{}\end{array}$$
(4)

This is a monotone driver backward SDE, admitting as such a unique square integrable solution \(\Theta ^{\star }\) (see, e.g., Kruse and Popier (2016, Sect. 4)), provided λ is bounded from below and \(\Theta ^{0}\) is square integrable. If there exists a replicating hedge η, i.e. η = η such that the corresponding ϱ is constant in (2), i.e. \(\eta _{t}^{\star }d\mathcal{M}_{t}\) coincides with the martingale part of \(\Theta ^{\star }\), then the resulting ϱ, EC and KVA vanish (since we assumed EC(0) = 0) and the ensuing FVA-deducted value process is given by \(\Theta ^{\star }.\)

Example 2.1 (Single option positions)

If n = 1 and ω 1 = 1 (one long call position), then, by application of the comparison theorem for BSDEs with a monotonic generator (see Kruse and Popier (2016, Sect. 4)), we have \(\Theta ^{\star } \geq 0\), hence

$$\displaystyle\begin{array}{rcl} \begin{array}{rl} &\Theta _{t}^{\star } = \mathbb{E}_{t}\big[\widetilde{\beta }_{t}^{-1}\widetilde{\beta }_{T_{1}}(S_{T_{1}} - K_{1})^{+}\big],\end{array} & &{}\end{array}$$
(5)

where \(\widetilde{\beta }_{t} = e^{-\int _{0}^{t}(r+\lambda _{ s})ds}.\) With respect to \(\Theta ^{(0)},\) the value \(\Theta ^{\star }\) corresponds to an FVA rebate on the buying price by the bank (since we assumed a positive liquidity basis λ).

If n = ω 1 = −1 (one short call position), then we deduce likewise that \(\Theta ^{\star } \leq 0,\) hence \(\Theta ^{\star } = \Theta ^{(0)}.\)

But, apart from the above special cases where λ = 0 or η = η , the BSDE (3) for \(\Theta\) is nonstandard due to the term EC = EC t (ϱ) in the FVA.

3 Markovian Black-Scholes Setup

In this section we assume a constant risk-free rate r and a Black-Scholes stock S with volatility σ and constant dividend yield q. The risk-neutral martingale \(\mathcal{M}\) is then taken as the gain process of a continuously rolled unit position on the stock S, assumed funded at the risk-free rate via a repo market, i.e. \(d\mathcal{M}_{t} = dS_{t} - (r - q)S_{t}dt.\) We denote by \(\mathcal{A}_{S}^{bs} = (r - q)S\partial _{S} + \frac{1} {2}\sigma ^{2}S^{2}\partial _{ S^{2}}^{2}\) the corresponding risk-neutral Black-Scholes generator.

Doing our modeling exercise in the context of the Black-Scholes model, where perfect replication, hence no KVA, is possible, may seem rather artificial. However, doing all the computations in a stylized Black-Scholes setup with a single risk factor S yields useful practical insights. In addition, this conveys the message that, in real-life incomplete markets, a basic Black-Scholes perspective has to be complemented by the valuation of market imperfections, otherwise this unavoidably results in Black-Scholes Ponzi schemes, such as the ones that have been involved in the global financial crisis, where always more derivatives are issued to remunerate the capital required by the already opened positions (if priced and risk-managed in a basic Black-Scholes way ignoring the cost of capital).

In the Black-Scholes setup and assuming a stylized Markovian specification

$$\displaystyle\begin{array}{rcl} \begin{array}{l} \mbox{ EC}_{t}(\varrho ) = f\sqrt{ \frac{d\langle \varrho \rangle } {dt}}\end{array} & &{}\end{array}$$
(6)

(the stylized VaR which is proportional to the instantaneous volatility of the loss process ϱ modulo a suitable “quantile level” f) as well as λ = λ(t, S t ), η t = η(t, S t ), then the above FVA and KVA equations can be reduced to the “sustainable Black-Scholes PDEs” (12), as follows (resulting in an FVA- and KVA-deducted price that would be sustainable for the bank even in the limit case of a portfolio held on a run-off basis, with no new trades ever entered in the future).

First, observe that given a tentative FVA-deducted price process of the form \(\Theta _{t} = u(t,S_{t})\) for some to-be-determined function u = u(t, S), we have, assuming (6):

$$\displaystyle\begin{array}{rcl} \begin{array}{l} \sqrt{ \frac{d\langle \varrho \rangle } {dt}} = \sigma S_{t}\big\vert \partial _{S}u(t,S_{t}) -\eta (t,S_{t})\big\vert.\end{array} & &{}\end{array}$$
(7)

Accordingly, let the function u be defined by u i (t, S) on each strip (T i−1, T i ] × (0, ), where (u i )1 ≤ in is the unique sequence of viscosity solutions, which can then be shown to be classical solutions, to the following PDE cascade, for i decreasing from n to 1 (closing the system by setting u n+1 = 0 and T 0 = 0):

$$\displaystyle{ \begin{array}{c} \left \{\begin{array}{l} u_{i}(T_{i},S) = u_{i+1}(T_{i},S) +\omega _{i}(S - K_{i})^{+}\mbox{ on }(0,\infty ) \\ \partial _{t}u_{i} + \mathcal{A}_{S}^{bs}u_{i} -\lambda \big (u_{i} - f\sigma S\vert \partial _{S}u_{i} -\eta \vert \big)^{+} - ru_{i} = 0\mbox{ on }[T_{i-1},T_{i}) \times (0,\infty ).\end{array} \right. \end{array} }$$
(8)

Itô calculus shows that the process \(\Theta = (u(t,S_{t}))_{t}\) solves the Markovian, monotonic driver (assuming λ bounded from below) BSDE

$$\displaystyle\begin{array}{rcl} \begin{array}{rl} &u(t,S_{t}) = \mathbb{E}_{t}\Big[\sum _{t<T_{i}}\beta _{t}^{-1}\beta _{T_{i}}\omega _{i}(S_{T_{i}} - K_{i})^{+} \\ & \quad \quad \quad -\int _{t}^{T}\beta _{t}^{-1}\beta _{s}\lambda _{s}\Big(u(s,S_{s}) - f\sigma S_{s}\big\vert \partial _{S}u(s,S_{s}) -\eta (s,S_{s})\big\vert \Big)^{+}ds\Big],\,t \in \left [0,T\right ],\end{array} & &{}\end{array}$$
(9)

which in view of (6)– (7) is precisely (3).

The ensuing FVA\(= \Theta ^{(0)} - \Theta\) and KVA processes are given as (cf. (3) and (1)):

$$\displaystyle\begin{array}{rcl} \begin{array}{rl} &\mbox{ FVA}_{t}(\varrho ) = \mathbb{E}_{t}\Big[\int _{t}^{T}e^{-r(s-t)}\lambda _{s}\Big(u(s,S_{s}) - f\sqrt{ \frac{d\langle \varrho \rangle } {ds}}\Big)^{+}ds\Big] \\ &\mbox{ KVA}_{t}(\varrho ) = h\mathbb{E}_{t}\Big[\int _{t}^{T}e^{-(r+h)(s-t)}f\sqrt{ \frac{d\langle \varrho \rangle } {ds}}ds\Big],\end{array} & &{}\end{array}$$
(10)

where \(\sqrt{ \frac{d\langle \varrho \rangle } {dt}}\) is given by (7). We set η = (1 −α) S u, where α in [0, 100%] is the mis-hedge parameter (noting that, for α = 0, the BSDE (9) reduces to the replication BSDE (4)), then the latter reduces to \(\alpha \sigma S_{t}\big\vert \partial _{S}u(t,S_{t})\big\vert\) and we have

$$\displaystyle\begin{array}{rcl} \begin{array}{rl} &\mbox{ FVA}_{t}(\varrho ) = \mathbb{E}_{t}\Big[\int _{t}^{T}e^{-r(s-t)}\lambda _{s}\big(u(s,S_{s}) -\alpha f\sigma S_{s}\big\vert \partial _{S}u(s,S_{s})\big\vert \big)^{+}ds\Big] \\ &\quad \quad \quad \quad \quad \quad = v(t,S_{t}) = u_{bs}(t,S_{t}) - u(t,S_{t}), \\ &\mbox{ KVA}_{t}(\varrho ) = h\mathbb{E}_{t}\Big[\int _{t}^{T}e^{-(r+h)(s-t)}\alpha f\sigma S_{s}\big\vert \partial _{S}u(s,S_{s})\big\vert ds\Big] = w(t,S_{t}),\end{array} & &{}\end{array}$$
(11)

where u bs is the trade additive Black-Scholes portfolio value and where the FVA and KVA pricing functions v and w satisfy

$$\displaystyle{ \begin{array}{c} \left \{\begin{array}{l} v(T,S) = w(T,S) = 0\mbox{ on }(0,\infty ) \\ \partial _{t}v + \mathcal{A}_{S}^{bs}v +\lambda \big (u_{bs} - v -\alpha f\sigma S\vert \Delta _{bs} - \partial _{S}v\vert \big)^{+} - rv = 0\mbox{ on }[0,T) \times (0,\infty ) \\ \partial _{t}w + \mathcal{A}_{S}^{bs}w +\alpha hf\sigma S\vert \Delta _{bs} - \partial _{S}v\vert - (r + h)w = 0\mbox{ on }[0,T) \times (0,\infty ), \end{array} \right. \end{array} }$$
(12)

in which \(\Delta _{bs} = \partial _{S}u_{bs}.\)

These “sustainable Black-Scholes PDEs” (12) allow computing an FVA and KVA deducted price

$$\displaystyle{u - w = u_{bs} - v - w}$$

that would be sustainable for the bank even in the limit case of a portfolio held on a run-off basis, with no new trades ever entered in the future.

4 With Volatility Uncertainty

An important and topical issue, referred to by the regulation as AVA (additional valuation adjustment), is the magnifying impact of model risk on the different XVA metrics.

In this section, we assess model risk from the angle of Avellaneda et al. (1995)’s uncertain volatility model (UVM). Namely, we only assume positive bounds σ and \(\overline{\sigma }\) but we do not assume any specific dynamic on the stock volatility process σ. Therefore, there is a model uncertainty about it. That is, we only consider \(d\mathcal{M}_{t}:=\sigma _{t}S_{t}dW_{t} = dS_{t} - (r - q)S_{t}dt,\) where \(\sigma _{t} \in [\underline{\sigma },\overline{\sigma }]\) for every t.

We call \(\mathcal{C}\) the space of continuous paths on \(\mathbb{R}_{+},\) C the canonical process on the space \(\mathcal{C},\) \(\mathbb{F} = (\mathcal{F}_{t})_{0\leq t\leq T}\) the canonical filtration generated by C and \(\mathcal{Q}\) the set of \(\mathbb{F}\) local martingale probability measures for C. We recall from Soner et al. (2012) that, for any probability measure \(\mathbb{Q} \in \mathcal{Q}\), the process C satisfies \(dC_{t} = a_{t}^{1/2}dW_{t}^{\mathbb{Q}}\), for some \(\mathbb{Q}\) Brownian motion \(W^{\mathbb{Q}}\), where a t is the Lebesgue density of the aggregated quadratic variation of C. In the following we restrict attention to the probability measures \(\mathbb{Q}\) such that \(a_{t}^{1/2} \in [\underline{\sigma },\overline{\sigma }]\) holds \(dt \times \mathbb{Q}\) almost surely, still denoting by \(\mathcal{Q}\) the (restricted) set of measures, and we model \(d\mathcal{M}_{t} = dS_{t} - (r - q)S_{t}dt\) as S t dC t . 

Under each \(\mathbb{Q}\), similarly to (2), the loss equation of the trader is written, for t ∈ (0, T], as:

$$\displaystyle\begin{array}{rcl} \begin{array}{rl} &d\varrho _{t}^{\mathbb{Q}} = -d\Theta _{t}^{\mathbb{Q}} -\sum _{i}\omega _{i}(S_{T_{i}} - K_{i})^{+}\delta _{T_{i}}(dt) +\Big (\lambda _{t}\big(\Theta _{t}^{\mathbb{Q}} -\mbox{ EC}_{t}^{\mathbb{Q}}(\varrho ^{\mathbb{Q}})\big)^{+} \\ & \qquad \quad + r_{t}\Theta _{t}^{\mathbb{Q}}\Big)dt +\eta _{t}d\mathcal{M}_{t}\end{array} & &{}\end{array}$$
(13)

where \(\mbox{ EC}^{\mathbb{Q}}\) is some conditional risk measure under \(\mathbb{Q}\). The ensuing equation for the \(\mathbb{Q}\) FVA-deducted value \(\Theta ^{\mathbb{Q}}\) appears as

$$\displaystyle\begin{array}{rcl} \begin{array}{rl} &\Theta _{t}^{\mathbb{Q}} = \mathbb{E}_{t}^{\mathbb{Q}}\Big[\sum _{t<T_{i}}\beta _{t}^{-1}\beta _{T_{i}}\omega _{i}(S_{T_{i}} - K_{i})^{+} -\int _{t}^{T}\beta _{t}^{-1}\beta _{s}\lambda _{s}\big(\Theta _{s}^{\mathbb{Q}} -\mbox{ EC}_{s}^{\mathbb{Q}}(\varrho ^{\mathbb{Q}})\big)^{+}ds\Big], \\ &\quad \qquad\,t \in \left [0,T\right ].\end{array} & &{}\end{array}$$
(14)

Under each \(\mathbb{Q}\), the trader should value the derivative portfolio \(\Theta _{0}^{\mathbb{Q}}\) at time 0 (or \(\Theta _{t}^{\mathbb{Q}}\) at time t). However, due to the model uncertainty, the trader values it \(\Theta _{0} =\mathop{\inf }\limits_{ \mathbb{Q} \in \mathcal{Q}}\Theta _{0}^{\mathbb{Q}}\) (or at time t, \(\Theta _{t} =\mathop{\mathop{ \mathrm{ess\,inf}}\limits }\limits_{ \mathbb{Q} \in \mathcal{Q}}\ \Theta _{t}^{\mathbb{Q}}\)), which is a robust non-arbitrage price in the sense of Biagini et al. (2015).

At time t, \(\mbox{ EC}_{t}^{\mathbb{Q}}(\varrho ^{\mathbb{Q}})\) may depend on the whole future of the process (\(\varrho _{s}^{\mathbb{Q}}\)), st. This makes (14) a so-called anticipated BSDE under \(\mathbb{Q}\) (ABSDE in the sense of Peng and Yang (2009)), with generator \(\lambda _{t}\big(\Theta _{t}^{\mathbb{Q}} -\mbox{ EC}_{t}^{\mathbb{Q}}(\varrho ^{\mathbb{Q}})\big)^{+}\), where \(\Theta ^{\mathbb{Q}}\) corresponds to the “Y -component” and (\(d\varrho _{s}^{\mathbb{Q}}-\eta _{s}S_{s}dC_{s}\)) to the “Z-component” of the solution. However, in the Markovian setting of Sect. 3, \(\mbox{ EC}_{t}^{\mathbb{Q}}(\varrho ^{\mathbb{Q}})\) only depends on (\(\varrho _{t}^{\mathbb{Q}}\)) at time t, so that the ABSDE (14) reduces to a BSDE.

For taking model risk (i.e. the impact of several \(\mathbb{Q}\)) into consideration, we need the notion of second order BSDE. Wellposedness results regarding second order anticipated BSDEs are not yet available in the literature. Hence, we only give heuristic formulations in this regard. Namely, by analogy with the second order BSDEs theory introduced by Soner et al. (2012), we should have the following representation, where \(\mathbb{F}_{+} = (\mathcal{F}_{t}^{+})_{0\leq t\leq T}\) the right limit of \(\mathbb{F}\), i.e. \(\mathcal{F}_{t}^{+} = \cap _{s>t}\mathcal{F}_{s}\) for all t ∈ [0, T) and \(\mathcal{F}_{T}^{+} = \mathcal{F}_{T}\):

There exists a process ϱ such that, for each \(\mathbb{Q} \in \mathcal{Q}\), ϱ is a \(\mathbb{Q}\)-local martingale and it \(\mathbb{Q} - a.s.\) holds that

$$\displaystyle\begin{array}{rcl} \begin{array}{rl} &d\varrho _{t} = -d\Theta _{t} -\sum _{i}\omega _{i}(S_{T_{i}} - K_{i})^{+}\delta _{T_{i}}(dt) \\ &\quad \quad \quad +\Big (\lambda _{t}\big(\Theta _{t} -\mbox{ EC}_{t}^{\mathbb{Q}}(\varrho )\big)^{+} + r_{t}\Theta _{t}\Big)dt +\eta _{t}d\mathcal{M}_{t} + dA_{t}^{\mathbb{Q}},\end{array} & &{}\end{array}$$
(15)

where \(\mbox{ EC}^{\mathbb{Q}}\) is some conditional risk measure and the family \(\{A^{\mathbb{Q}}\}\) of non-decreasing processes satisfies the minimality condition

$$\displaystyle{ A_{t}^{\mathbb{Q}} =\mathop{ \mathop{\mathrm{ess\,inf}}\limits ^{\mathbb{Q}}}\limits_{\mathbb{Q}^{'} \in \mathcal{Q}(t, \mathbb{Q}, \mathbb{F}_{ +})}\mathbb{E}^{\mathbb{Q}^{'}}\left [\left.A_{ T}^{\mathbb{Q}^{'}}\right \vert \mathcal{F}_{ t}^{\mathbb{Q}+}\right ],\text{ }0 \leq t \leq T,\text{ }\mathbb{Q} - a.s.,\text{ }\forall \mathbb{Q} \in \mathcal{Q}, }$$
(16)

where \(\mathcal{Q}(t, \mathbb{Q}, \mathbb{F}_{+}):= \left \{\mathbb{Q}^{'} \in \mathcal{Q},\ \mathbb{Q}^{'} = \mathbb{Q}\text{ on }\mathcal{F}_{t}^{+}\right \}\).

The corresponding equation for the FVA-deducted value \(\Theta\) would appear as

$$\displaystyle\begin{array}{rcl} \begin{array}{rl} &\Theta _{t} =\mathop{ \mathop{\mathrm{ess\,inf}}\limits }\limits_{\mathbb{Q}' \in \mathcal{Q}(t, \mathbb{Q}, \mathbb{F}_{+})}\ \mathbb{E}_{t}^{\mathbb{Q}'}\Big[\sum _{t<T_{i}}\beta _{t}^{-1}\beta _{T_{i}}\omega _{i}(S_{T_{i}} - K_{i})^{+} \\ & \quad \quad \quad -\int _{t}^{T}\beta _{t}^{-1}\beta _{s}\lambda _{s}\big(\Theta _{s} -\mbox{ EC}_{s}^{\mathbb{Q}'}(\varrho )\big)^{+}ds\Big],\,t \in \left [0,T\right ],\, \mathbb{Q} - a.s.\end{array} & &{}\end{array}$$
(17)

4.1 Equations in the Markovian Setting

By contrast, in the Markovian setting of Sect. 3 with VaR-like specification of Economic Capital, we can make rigorous statements. According to the second order BSDE theory introduced in Soner et al. (2012), the PDE (8) becomes:

$$\displaystyle{ \begin{array}{c} \left \{\!\!\begin{array}{l} u_{i}(T_{i},S) = u_{i+1}(T_{i},S) +\omega _{i}(S - K_{i})^{+}\mbox{ on }(0,\infty ) \\ \partial _{t}u_{i} +\mathop{ \inf }\limits_{\sigma \in [\underline{\sigma },\overline{\sigma }]}\left [\mathcal{A}_{S}^{bs}u_{i} -\lambda \big (u_{i} - f\sigma S\vert \partial _{S}u_{i} -\eta \vert \big)^{+}\right ] - ru_{i} = 0\mbox{ on }[T_{i-1},T_{i}) \times (0,\infty ).\end{array}\right. \end{array} }$$
(18)

Let u be defined by u i (t, S) on each strip (T i−1, T i ] × (0, ). The FVA can be defined as \(\Theta ^{\lambda =0} - \Theta\) and the ensuing KVA process is given as (cf. (3) and (1)):

$$\displaystyle\begin{array}{rcl} \begin{array}{rl} &\mbox{ KVA}_{t}(\varrho ) = h\ \mathop{\mbox{ esssup}}\limits_{\mathbb{Q}' \in \mathcal{Q}(t, \mathbb{Q}, \mathbb{F}_{+})}\ \mathbb{E}_{t}^{\mathbb{Q}'}\Big[\int _{t}^{T}e^{-(r+h)(s-t)}f\sqrt{ \frac{d\langle \varrho \rangle } {ds}}ds\Big],\, \mathbb{Q}\ a.s.,\end{array} & &{}\end{array}$$
(19)

where \(\sqrt{ \frac{d\langle \varrho \rangle } {dt}} = a_{t}^{1/2}S_{ t}\big\vert \partial _{S}u(t,S_{t}) -\eta (t,S_{t})\big\vert.\) In the case where η = (1 −α) S u, we obtain

$$\displaystyle{\mbox{ KVA}_{t}(\varrho ) = w(t,S_{t}),}$$

where

$$\displaystyle{ \begin{array}{c} \left \{\begin{array}{l} w(T,S) = 0\mbox{ on }(0,\infty ) \\ \partial _{t}w+\mathop{\sup }\limits_{\sigma \in [\underline{\sigma },\overline{\sigma }]}\left [\mathcal{A}_{S}^{bs}w +\alpha hf\sigma S\vert \partial _{S}u\vert \right ] - (r + h)w = 0\mbox{ on }[0,T) \times (0,\infty ), \end{array} \right. \end{array} }$$
(20)

in which (cf. (18))

$$\displaystyle{\left \{\begin{array}{l} u_{i}(T_{i},S) = u_{i+1}(T_{i},S) +\omega _{i}(S - K_{i})^{+}\mbox{ on }(0,\infty ) \\ \partial _{t}u_{i} +\mathop{ \inf }\limits_{\sigma \in [\underline{\sigma },\overline{\sigma }]}\left [\mathcal{A}_{S}^{bs}u_{i} -\lambda \big (u_{i} -\alpha f\sigma S\vert \partial _{S}u_{i}\vert \big)^{+}\right ] - ru_{i} = 0\mbox{ on }[T_{i-1},T_{i}) \times (0,\infty ). \end{array} \right.}$$

5 Optimal Transportation Approach

Since vanilla call options are liquidly traded, their time 0 price components

$$\displaystyle{\mathbb{E}^{\mathbb{Q}}\big[\beta _{ T_{i}}(S_{T_{i}} - K_{i})^{+}\big]}$$

should not be seen as subject to model risk, but calibrated to the market. Hence, we need to refine our preliminary UVM assessment of model risk in order to account for these calibration constraints. For simplicity we consider a single call option (T, K) and we set λ = 0, focusing on KVA in this section. Hence, the system (18) reduces to a single PDE with λ = 0, with solution denoted by u.

(Tan and Touzi (2013)) consider the optimal transportation problem consisting of minimizing a cost among all continuous semimartingales with given initial and terminal distributions. They show an extension of the Kantorovich duality to this context and suggest a finite-difference scheme combined with the gradient projection algorithm to approximate the dual value. Their results can be applied to our setup as follows.

Let \(\mu _{0} =\delta _{S_{0}}\) denote the Dirac measure on the initial value of S 0 and let μ T denote the marginal distribution of S T , inferred by calibration to the market prices of all European call options with maturity T (assuming quotations available for all strikes). Let

$$\displaystyle{ \begin{array}{l} \mathcal{Q}(\mu _{0}) =\{ \mathbb{Q} \in \mathcal{Q}: \mathbb{Q} \circ S_{0}^{-1} =\mu _{0}\},\,\mathcal{Q}(\mu _{0},\mu _{T}) =\{ \mathbb{Q} \in \mathcal{Q}(\mu _{0}): \mathbb{Q} \circ S_{T}^{-1} =\mu _{T}\}.\end{array} }$$

From the Remark 2.3 in Tan and Touzi (2013), \(\mathcal{Q}(\mu _{0},\mu _{T})\) is not empty in our setting.

The KVA with model uncertainty and terminal marginal constraint is defined as follows:

$$\displaystyle\begin{array}{rcl} \begin{array}{rl} &\mbox{ KVA}_{0}(\varrho ) = h\mathop{\sup }\limits_{\mathbb{Q} \in \mathcal{Q}(\mu _{0},\mu _{T})}\mathbb{E}^{\mathbb{Q}}\Big[\int _{0}^{T}e^{-(r+h)(s)}f\sqrt{ \frac{d\langle \varrho \rangle } {ds}}ds\Big],\end{array} & &{}\end{array}$$
(21)

where ϱ represents the portfolio loss in this setting, that is, the loss and profit of the bank in a world with uncertain volatility subject to the law of S T . However, it is not clear how to extrapolate the theory of Tan and Touzi (2013) to valuation at future time points when only the unconditional law of S T is known. Hence for the sake of tractability we conservatively assume that ϱ in (21) is the UVM one and we only apply the constraint to the outer expectation in (21) (as opposed to the conditional expectations that are hidden in ϱ).

With this understanding of (21), given any measure ν, we define

$$\displaystyle{ \nu (\phi ) =\int _{\mathbb{R}^{d}}\phi (x)\nu (dx) }$$

on the set \(C_{b}(\mathbb{R}^{d})\) of all bounded continuous functions ϕ on \(\mathbb{R}^{d}\). We can readily check that Assumptions 3.1–3.3 in Tan and Touzi (2013) are satisfied. Hence, by an application of their main duality result, we can rewrite the KVA as

$$\displaystyle\begin{array}{rcl} \begin{array}{rl} &\mbox{ KVA}_{0}(\varrho ) =\mathop{\inf }\limits_{\phi \in C_{b}(\mathbb{R}^{d})}\Big\{\mu _{0}(\Phi _{0}) - e^{-(r+h)T}\mu _{T}(\phi )\Big\},\end{array} & &{}\end{array}$$
(22)

where the “pseudo-payoff function” ϕ corresponds to a Lagrangian for the constrained optimization problem (21) and where

$$\displaystyle\begin{array}{rcl} \begin{array}{rl} &\Phi _{0}(x) =\mathop{\sup }\limits_{ \mathbb{Q} \in \mathcal{Q}(\delta _{x})}\mathbb{E}^{\mathbb{Q}}\Big[e^{-(r+h)T}\phi (S_{T}) +\int _{ 0}^{T}e^{-(r+h)s}hf\sqrt{ \frac{d\langle \varrho \rangle } {ds}}ds\Big].\end{array} & &{}\end{array}$$
(23)

Hence, the KVA in an optimal transportation (OT) setting can be represented as an infimum of KVAs in modified UVM setting.

5.1 Equations in the Markovian Setting

In the Markovian setting of Sect. 3, we consider the probability measures \(\mathbb{Q}\) on the canonical space \((\Omega,\mathcal{F}_{T})\), under which the canonical process C is a local martingale on [t, T]. Define \(\mathcal{Q}_{t}\) as the collection of all such martingale probability measures \(\mathbb{Q}\) such that \(a_{s}^{1/2} \in [\underline{\sigma },\overline{\sigma }]\) \(d\mathbb{Q} \times ds\mbox{ -}a.e.\) on \(\Omega \times [t,T]\). Denote \(\mathcal{Q}_{t,x}:=\{ \mathbb{Q} \in \mathcal{Q}_{t}: \mathbb{Q}[S_{s} = x,0 \leq s \leq t] = 1\}\). For any \(\phi \in C_{b}(\mathbb{R}^{d})\), let

$$\displaystyle\begin{array}{rcl} \begin{array}{rl} &\Phi (t,x) =\mathop{\sup }\limits_{ \mathbb{Q} \in \mathcal{Q}_{t,x}}\mathbb{E}^{\mathbb{Q}}\Big[e^{-(r+h)(T-t)}\phi (S_{T}) +\int _{ t}^{T}e^{-(r+h)(s-t)}hf\sqrt{ \frac{d\langle \varrho \rangle } {ds}}ds\Big],\end{array} & &{}\end{array}$$
(24)

where \(\sqrt{ \frac{d\langle \varrho \rangle } {dt}} = a_{t}^{1/2}S_{ t}\big\vert \partial _{S}u(t,S_{t}) -\eta (t,S_{t})\big\vert,\) in which u is the solution to (18) with λ = 0. 

Then, in the case where η = (1 −α) S u, \(\Phi\) is a viscosity solution to the dynamic programming equation

$$\displaystyle{ \begin{array}{c} \left \{\begin{array}{l} \Phi (T,S) =\phi (S)\mbox{ on }(0,\infty ) \\ \partial _{t}\Phi +\mathop{\sup }\limits_{\sigma \in [\underline{\sigma },\overline{\sigma }]}\Big[\mathcal{A}_{S}^{bs}\Phi +\alpha hf\sigma S\vert \partial _{S}u\vert \Big] - (r + h)\Phi = 0\mbox{ on }[0,T) \times (0,\infty ).\end{array} \right. \end{array} }$$
(25)

In view of (22), in the present OT setup, KVA0 is obtained as the minimum of

$$\displaystyle\begin{array}{rcl} \begin{array}{l} \Phi (0,S_{0}) - e^{-(r+h)T}\int _{\mathbb{R}}\phi (x)\mu _{T}(dx)\end{array} & &{}\end{array}$$
(26)

over \(\phi \in C_{b}(\mathbb{R}^{d}).\) This minimization is achieved numerically by the Nelder-Mead simplex algorithm.

As a sanity check, observe that, if μ T is Black-Scholes σ and \(\underline{\sigma }= \overline{\sigma } =\sigma,\) then (26) is exactly the time 0 KVA of Sect. 3, independent of ϕ. 

6 Numerical Results

Figure 1 shows the results obtained by solving the related PDEs (and minimizing (26) in the OT setup) without model uncertainty as of Sect. 3 (left panel), with UVM uncertainty as of Sect. 4.1 (middle panel) and with OT uncertainty as of Sect. 5.1 (right panel), for a level of the mis-hedge parameter α increasing from 0 to 100%. We used the following parameters:

Fig. 1
figure 1

XVAs and FTP as a function of the mis-hedge parameter α. Left: Without model uncertainty. Middle: With UVM uncertainty (\(\underline{\sigma }= 15\%,\overline{\sigma } = 60\%\)). Right: With OT uncertainty (\(\underline{\sigma }= 15\%,\overline{\sigma } = 60\%,\sigma = 30\%\))

$$\displaystyle\begin{array}{rcl} & & S_{0} = 100, r = 2\%, q = 0, \sigma = 30\%, {}\\ & & \lambda = 200\mbox{ bps}, f = 1.2, h = 10\% {}\\ \end{array}$$

and considered a single call option of maturity T = 5 years and strike K = 107.

The main observation from the left panel is that, unless the hedge is very good (of the order of 25% of mis-hedge or less), the KVA dominates the FVA, and becomes about ten times greater than the FVA in the absence of hedge (α = 1). This is logical given that EC has only an indirect reduction effect on the FVA, whereas it directly sizes the KVA.

Going to the middle panel, the FVA changes little, but both u and the KVA (unless the hedge is almost perfect) are tremendously impacted by the uncertainty on the volatility. Regarding the KVA this is in line with the fact that it is the cost of a risk measure, which nonlinearly amplifies the impact of perturbations to its input data.

In reality the time 0 price of a vanilla option such as the one considered in our numerics is given by the market, so there is no model risk on it, but only on the KVA. This is what is reflected by the OT right panel. The model risk on the KVA component however is essentially the same as in the UVM case, because it is conservatively assessed by using the UVM u in (25), fault of a developed theory of valuation at future time points under uncertain volatility subject to the unconditional law of S T .

XVA desks, KVA in particular, are the first consulted desks in all major trades today. Our results in a toy model where all the quantities of interest can be computed exactly (modulo the numerical error on the PDE solutions) emphasize that, accounting for model risk, the relative importance of the KVA should become even larger. Moreover one can easily imagine how to transpose these results to the setup of Albanese et al. (2016) where each option payoff \((S_{T_{i}} - K_{i})^{+}\) is replaced by the CVA exposure of the bank to the default at time of its counterparty i, at the (random) time T i , with corresponding position of the bank \(\omega _{i}S_{T_{i}}\) and margins received by the bank ω i K i . However in this case a relevant risk measure really needs to be computed at a 1-year horizon (as opposed to instantaneous in (6)), in order to leave time to credit events to develop. This points out to developments of a slightly different nature, which would be interesting to develop.