Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

In the wake of the big crisis one has witnessed a significant increase in the spreads between LIBORs of different tenors as well as the spread between a LIBOR and the discount curve (LIBOR-OIS). This has led to the construction of multicurve models where, typically, future cash flows are generated through curves associated to the underlying rates, but are discounted by another curve.

The majority of the models that have been considered reflects the usual classical distinction between

  1. (i)

    short rate models;

  2. (ii)

    HJM setup;

  3. (iii)

    BGM or LIBOR market models.

By analogy to credit risk we may call the first two categories of models as bottom-up models, while the third one could be classified as top-down. In addition, methodologies have appeared that are related to foreign exchange.

Here we consider only the first two setups. We begin by discussing some issues arising with the HJM methodology and concentrate then on short rate models. The third setup (top-down) is mainly present in work by F. Mercurio and co-authors (see e.g. Mercurio (2010a, b)), but also in other recent work such as Keller-Ressel et al. (2013). There are advantages and disadvantages with each setup. Among the possible advantages of short rate models is the fact that they lead more easily to a Markovian setting, which is convenient for various calculations (see Crépey et al. (2013b)). On the other hand, one of the major advantages of HJM over a direct short rate modeling is that the model is automatically calibrated to the initial term structure. Short rate models in a multi-curve setup have already appeared in the literature, e.g. Kijima et al. (2009), Kenyon (2010), Filipović and Trolle (2013).

To present the basic ideas in a simple way, here we consider a two-curve model, namely with a curve for discounting and one for generating future cash flows. The choice of the discount curve is not unique; we follow the common choice of considering the OIS swap curve. For the risky cash flows without collateral we consider a single LIBOR (i.e. for a given tenor structure).

We present an approach for the pricing of some basic LIBOR-related derivatives, namely FRAs and CAPs (linear/nonlinear) and consider only clean valuation formulas, namely without counterparty risk. Although real pricing problems require a more global approach (see e.g. the discussions in Fuji et al. (2009, 2011), Piterbarg (2010), Crépey et al. (2013a) as well as in recent work by D. Brigo and co-authors such as Pallavicini and Brigo (2013), Brigo, Morini and Pallavicini (2013)), clean valuation formulas are nevertheless useful for various reasons: as pointed out in Crépey et al. (2013b), market quotes typically reflect prices of fully collateralized transactions so that clean price formulas may turn out to be sufficient for calibration also when using the model to compute possible value adjustments; furthermore (see Crépey et al. (2013b)), TVA adjustments are often computed on top of clean prices. Concerning methodology, since our approach is of the bottom-up type that considers short rate modeling, we heavily exploit the advantages of an affine term structure. This is in contrast with top-down approaches, where (see Mercurio (2010a, b)) log-normal models are common (see however Keller-Ressel et al. (2013) and Grbac et al. (2014) for affine LIBOR models with general distributions in a multicurve context).

Traditionally, interest rates are defined to be coherent with the bond prices \(p(t,T)\), which represent the expectation of the market concerning the future value of money. For the discrete compounding forward LIBORs, which we denote here by \(L(t;T,S)\), this leads to (\(t<T<S\))

$$\begin{aligned} L(t;T,S)=\frac{1}{S-T}\,\left( \frac{p(t,T)}{p(t,S)}-1\right) \end{aligned}$$
(1)

which can also be justified as representing the fair value of the fixed rate in a FRA on the LIBOR. Since we consider only a single LIBOR that corresponds to a given tenor structure, we assume \(S=T+\Delta \) (for tenor \(\Delta \)). In this way one obtains a single curve for the term structure. The actual LIBOR rates, which in what follows we shall denote by \(\bar{L}(t;T,T+\Delta )\), are determined by the LIBOR panel that takes into account various factors such as credit risk, liquidity, etc. (see the discussion in Filipović and Trolle (2013)). Following some of the recent literature, in particular Crépey et al. (2012) (see also Kijima et al. (2009)), we keep the formal relationship (1) between LIBOR rates and bond prices, but replace the risk-free bond prices \(p(t,T)\) by fictitious “risky” bond prices \(\bar{p}(t,T)\) that are supposed to be affected by the same factors as the actual LIBORs and that, analogously to the risk-free bond prices, we define then as

$$\begin{aligned} \bar{p}(t,T)=E^Q\left\{ \exp \left[ -\int \limits _t^T(r_u+s_u)du\right] \mid \fancyscript{F}_t\right\} \end{aligned}$$
(2)

where \(r_t\) is the classical short rate, whereas \(s_t\) represents the short rate spread (hazard rate in case of only default risk). Notice that in this way the spread is introduced from the outset. Notice also that the fictitious bond prices \(\bar{p}(t,T)\) are not actual prices.

Since in what follows we are interested in FRAs and CAPs that are based on the \(T-\)spot LIBOR \(\bar{L}(T;T,T+\Delta )\), we actually postulate the relationship (1) only at the inception time \(t=T\). Our starting point is thus the following relationship

$$\begin{aligned} \bar{L}(T;T,T+\Delta )=\frac{1}{\Delta }\,\left( \frac{1}{\bar{p}(T,T+\Delta )}-1\right) \end{aligned}$$
(3)

where we have taken into account the fact that also for the “risky” bonds we have \(\bar{p}(T,T)=1\).

In addition to the pricing of FRAs and CAPs in our two-curve setup, our major goal here is to derive a relationship between theoretically risk-free and actual FRAs (possibly also CAPs) thereby exhibiting an adjustment factor which plays a role analogous to that of the quanto adjustments in the pricing of cross-currency derivatives or the “multiplicative forward basis” in Bianchetti (2012).

2 The Model

2.1 Preliminary Considerations

We start with some comments concerning HJM-like approaches to better motivate our short rate approach. Given the bond price processes \(p(t,T)\) and \(\bar{p}(t,T)\), in order to apply an HJM-approach, we need to introduce corresponding forward rate processes \(f^T(t)\) and \(\bar{f}^T(t)\) that lead to a forward rate spread expressed as \(g^T(t):=\bar{f}^T(t)-f^T(t)\). One then also obtains corresponding short rates and a short rate spread, namely \(r_t=f^t(t),\>\bar{r}_t=\bar{f}^t(t),\>s_t=g^t(t)=\bar{r}_t-r_t\). Notice that a consistent model should lead to \(\bar{p}(t,T)\le p(t,T)\), which implies \(\bar{f}^T(t)\ge f^T(t)\) or, equivalently \(g^T(t)\ge 0\>\>\forall t<T\le \bar{T},\) where \(\bar{T}\) is a given maximal maturity.

An extensive study within the multicurve HJM approach has appeared in Crépey et al. (2012). The driving random process is a Levy and a corresponding HJM drift condition is derived. Conditions are given for the non-negativity of rates and spreads; explicit formulas are obtained for various interest rate derivatives. What may not be fully satisfactory in Crépey et al. (2012) is that:

  1. (i)

    some difficulties arise when dealing not only with credit risk, but also other risks such as liquidity. In particular, when looking for a condition that corresponds to the defaultable HJM drift condition;

  2. (ii)

    a fictitious default has to be considered explicitly (with pre default bond prices).

The study in Crépey et al. (2012) is continued in the recent paper Crépey et al. (2013b) with the main purpose of taking into account also counterparty risk and funding costs and of determining various valuation adjustments on top of the clean prices. The methodology in Crépey et al. (2013b) is again based on an HJM approach, but with explicit ingredients for the induced short rate models in order to obtain a Markovian structure and to be able to actually perform the value adjustment calculations. In particular, the authors in Crépey et al. (2013b) use a Levy Hull & White extended Vasicek model for \(r_t\) and introduce an additional factor that can be interpreted as representing a short rate spread. In this latter sense it becomes analogous to the approach to be presented here.

Another HJM-based approach, limited to default risk, appears in Chiarella et al. (2007) with emphasis on obtaining Markovian models with state dependent volatilities. The driving processes are of the jump-diffusion type. The difficulties here appear to be given by the fact that, for convenient specifications of the volatilities, one obtains deterministic short rate spreads. For more general, stochastic volatilities the authors obtain only approximate Markovianity. These difficulties have been overcome in the subsequent paper Chiarella et al. (2010), where the authors obtain finite-dimensional Markovian realizations also with stochastic spreads and, in addition, obtain a correlation structure between credit spread, interest rate and the stochastic volatility. When trying to extend their approach to a multi curve setting, beyond that implied by credit risk alone, there appear though some computational difficulties due to the stochastic volatility.

Before coming now to describing our short rate model, we recall some basics concerning FRAs. We start from the

Definition 2.1

A FRA (forward rate agreement) is an OTC derivative that allows the holder to lock in at \(t<T\) the interest rate between the inception date \(T\) and the maturity \(T+\Delta \) at a fixed value \(K\). At maturity \(T+\Delta \), a payment based on \(K\) is made and one based on \(\bar{L}(T;T,T+\Delta )\) is received.

We shall denote the value of the FRA at \(t<T\) by \({ {FRA}}^T(t,K)\). In our two-curve risky setup, the fair price of a FRA in \(t<T\) with fixed rate \(K\) and notional \(N\) is

$$\begin{aligned} \begin{array}{lcl} { {FRA}}^T(t,K)&{}=&{}N\Delta p(t, T+\Delta ) E^{T+\Delta }\left[ \bar{L}(T;T,T+\Delta )-K\mid \fancyscript{F}_t\right] \\ &{}=&{}N p(t, T+\Delta )E^{T+\Delta }\left[ \frac{1}{\bar{p}(T, T+\Delta )}-(1+\Delta K)\mid \fancyscript{F}_t\right] \end{array} \end{aligned}$$
(4)

where \(E^{T+\Delta }\) denotes expectation under the \((T+\Delta )-\) forward measure \(Q^{T+\Delta }\). Notice that the simultaneous presence of \(p(t, T+\Delta )\) and \(\bar{p}(t, T+\Delta )\) does not allow for the convenient reduction of the formula to a simpler form as in the one-curve setup.

2.2 Description of the Model Itself

For the short-rate model approach we shall have to start by modeling directly the short rate \(r_t\) and the short rate spread \(s_t\) and we do it under the standard martingale measure \(Q\) (to be calibrated to the market) for the risk-free money market account as numeraire. In order to account for a possible (negative) correlation between \(r_t\) and \(s_t\) we introduce a factor model: given three independent affine factor processes \(\Psi ^i_t,\,i=1,2,3\) let

$$\begin{aligned} \left\{ \begin{array}{lcl} r_t&{}=&{}\Psi ^2_t-\Psi ^1_t\\ s_t&{}=&{}\kappa \Psi _t^1+\Psi _t^3\end{array}\right. \end{aligned}$$
(5)

where \(\kappa \) is a constant that measures the instantaneous correlation between \(r_t\) and \(s_t\) (negative correlation for \(\kappa >0\)). This setup could be generalized in various ways, in particular by using more factors to drive \(s_t\). In view of the existing literature one could, instead of using an affine model structure as we do it here, consider e.g. ambit-type processes as presented in Corcuera, Farkas, Schoutens and Valkeila (2013). Such a model, which is not of the semimartingale type, allows also for analytical computations and gives the possibility to take into account long-range dependence. Remaining within the pure credit risk setting where, see the comment after (2), the spread is given by the default intensity, some of the factors affecting the spread could be given a specific meaning as in Douady and Jeanblanc (2002) where, using an HJM-type approach, the authors consider a spread field process with one of the variables representing the rating of the issuer. The approach in Douady and Jeanblanc (2002) could possibly be generalized also to the present setting.

A common approach to modeling the factors in an affine context is to assume them of the type of a square root diffusion. This guarantees positivity of the spread, but the negative correlation comes at the expense of possibly negative interest rates (even if only with small probability). With such a model, by passing to the \((T+\Delta )-\)forward measure, one can compute the value of a FRA and of the fair fixed rate.

For various reasons, in particular in view of our main goal to obtain an adjustment factor, it is convenient to be able to have the same factor model for FRAs with different maturities. We therefore aim at performing the calculations under a single reference measure, namely the standard martingale measure \(Q\). More precisely, for the factor processes we assume the following affine diffusions under \(Q\) that are of the Vasicek type, namely

$$\begin{aligned} \left\{ \begin{array}{lcl} d\Psi _t^1&{}=&{}(a^1-b^1\Psi _t^1)dt+\sigma ^1\,dw_t^1\\ d\Psi _t^i&{}=&{}(a^i-b^i\Psi _t^i)dt+\sigma ^i\sqrt{\Psi _t^i}\,dw_t^i,\quad i=2,3\end{array}\right. \end{aligned}$$
(6)

where \(a^i, b^i, \sigma ^i\) are positive constants with \(a^i\ge (\sigma ^i)^2/2\) for \(i=2,3\), and \(w_t^i\) independent Wiener processes. We have chosen a Vasicek-type model for simplicity, but the results below can be easily extended to the Hull and White version of the Vasicek model. Notice that the factor \(\Psi _t ^1\) may take negative values implying that, not only \(r_t\), but also \(s_t\) may become negative (see however later under “comments on the main result”). Results completely analogous to those that we shall obtain here for the above pure diffusion model may be derived also for affine jump-diffusions at the sole expense of more complicated notation.

3 Main Result (FRAs)

3.1 Preliminary Notions and Results

Recalling the expression for a FRA under the forward measure, namely

$$\begin{aligned} \mathrm{{FRA}}^T(t,K)=N p(t, T+\Delta )E^{T+\Delta }\Bigg [\frac{1}{\bar{p}(T, T+\Delta )}-(1+\Delta K)\mid \fancyscript{F}_t\Bigg ], \end{aligned}$$
(7)

one has that the crucial quantity to compute is

$$\begin{aligned} \bar{\nu }_{ t,T}:=E^{T+\Delta }\Bigg [\frac{1}{\bar{p}(T, T+\Delta )}\mid \fancyscript{F}_t\Bigg ] \end{aligned}$$
(8)

and that the fixed rate to make the FRA a fair contract at time \(t\) is

$$\begin{aligned} \bar{K}_{t}:= \frac{1}{\Delta }(\bar{\nu }_{t,T }-1) \end{aligned}$$
(9)

In the classical single curve case we have instead

$$\begin{aligned} \nu _{t,T}:=E^{T+\Delta }\Bigg [\frac{1}{{p}(T, T+\Delta )}\mid \fancyscript{F}_t\Bigg ]=\frac{p(t,T)}{p(t,T+\Delta )} \end{aligned}$$
(10)

being \(\frac{p(t,T)}{p(t,T+\Delta )}\) an \(\fancyscript{F}_t\)—martingale under the \((T+\Delta )\)—forward measure. The fair fixed rate in the single curve case is then

$$\begin{aligned} K_t=\frac{1}{\Delta }\left( \nu _{t,T}-1\right) =\frac{1}{\Delta }\left( \frac{p(t,T)}{p(t,T+\Delta )}-1\right) \end{aligned}$$
(11)

and notice that, in order to compute \(K_t\), no interest rate model is needed (contrary to \(\bar{K}_t\)).

Due to the affine dynamics of \(\Psi _t^i\>(i=1,2,3)\) under \(Q\), we have for the risk-free bond

$$\begin{aligned} \begin{array}{lcl} p(t,T)&{}=&{}E^{Q}\Big \{\exp \left[ -\int \limits _{t}^{T}r_udu\right] \mid \fancyscript{F}_t\Big \}=E^{Q}\Big \{\exp \left[ \int \limits _{t}^{T}(\Psi _{u}^{1}-\Psi _{u}^{2})du\right] \mid \fancyscript{F}_t\Big \}\\ \\ {}&{}=&{}\exp \left[ A(t,T)-B^1(t,T)\Psi _t^1-B^2(t,T)\Psi _t^2\right] \end{array} \end{aligned}$$
(12)

The coefficients satisfy

$$\begin{aligned} \left\{ \begin{array}{ll} B^1_t-b^1B^1-1=0,&{}B^1(T,T)=0\\ B^2_t-b^2B^2-\frac{(\sigma ^2)^2}{2}(B^2)^2+1=0,&{}B^2(T,T)=0\\ A_t=a^1B^1-\frac{(\sigma ^1)^2}{2}(B^1)^2+a^2B^2\>,\>&{}A(T,T)\,\,\,=0\end{array}\right. \end{aligned}$$
(13)

leading, in particular, to

$$\begin{aligned} B^1(t,T)=\frac{1}{b^1}\,\left( e^{-b^1(T-t)}-1\right) . \end{aligned}$$
(14)

For the risky bond we have instead

$$\begin{aligned} \begin{array}{lcl} \bar{p}(t,T)&{}=&{}E^{Q}\Big \{\exp \left[ -\int \limits _{t}^{T}(r_u+s_u)du\right] \mid \fancyscript{F}_t\Big \}\\ \\ &{}=&{}E^{Q}\Big \{\exp \left[ - \int \limits _{t}^{T}((\kappa -1)\Psi _{u}^{1}+\Psi _{u}^{2}+\Psi _{u}^{3})du\right] \mid \fancyscript{F}_t\Big \}\\ \\ &{}=&{}\exp \left[ \bar{A}(t,T)-\bar{B}^1(t,T)\Psi _t^1-\bar{B}^2(t,T)\Psi _t^2-\bar{B}^3(t,T)\Psi _t^3\right] \end{array} \end{aligned}$$
(15)

This time the coefficients satisfy

$$\begin{aligned} {\left\{ \begin{array}{ll} \bar{B}^1_t-b^1\bar{B}^1+(\kappa -1)=0,&{}\bar{B}^1(T,T)=0\\ \bar{B}^2_t-b^2\bar{B}^2-\frac{(\sigma ^2)^2}{2}(\bar{B}^2)^2+1=0,&{}\bar{B}^2(T,T)=0\\ \bar{B}^3_t-b^3\bar{B}^3-\frac{(\sigma ^3)^2}{2}(\bar{B}^3)^2+1=0,&{}\bar{B}^3(T,T)=0\\ \bar{A}_t=a^1\bar{B}^1-\frac{(\sigma ^1)^2}{2}(\bar{B}^1)^2+a^2\bar{B}^2+a^3\bar{B}^3\>,\>&{}\bar{A}(T,T)\,\,\,=0\end{array}\right. } \end{aligned}$$
(16)

leading, in particular, to

$$\begin{aligned} \bar{B}^1(t,T)=\frac{1-\kappa }{b^1}\,\left( e^{-b^1(T-t)}-1\right) =(1-\kappa )\,B^1(t,T) \end{aligned}$$
(17)

From the above 1st order equations it follows that

$$\begin{aligned} \left\{ \begin{array}{lcl} \bar{B}^1(t,T)&{}=&{}(1-\kappa )\,B^1(t,T)\\ \bar{B}^2(t,T)&{}=&{}B^2(t,T)\\ \bar{A}(t,T)&{}=&{}A(t,T)-a^1\kappa \int \limits _t^TB^1(u,T)du\\ &{}{}&{}+\frac{(\sigma ^1)^2}{2}\kappa ^2\int \limits _t^T(B^1(u,T))^2du-(\sigma ^1)^2\kappa \int \limits _t^TB^1(u,T)du\\ &{}{}&{}-a^3\int \limits _t^T\bar{B}^3(u,T)du\end{array}\right. \end{aligned}$$
(18)

Letting then

$$\begin{aligned} \tilde{A}(t,T):=\bar{A}(t,T)-A(t,T) \end{aligned}$$
(19)

we obtain

$$\begin{aligned} \bar{p}(t,T)=&\exp \Big [\bar{A}(t,T)-B^1(t,T)\Psi _t^1-B^2(t,T)\Psi _t^2\nonumber \\&-\bar{B}^3(t,T)\Psi _t^3+\kappa B^1(t,T)\Psi _t^1\Big ] \nonumber \\ {}=&p(t,T)\,\exp \left[ \tilde{A}(t,T)+\kappa B^1(t,T)\Psi _t^1-\bar{B}^3(t,T)\Psi _t^3\right] \end{aligned}$$
(20)

so that, putting for simplicity \(\tilde{B}^1:=B^1(T,T+\Delta )\), one may write

$$\begin{aligned} \frac{p(T,T+\Delta )}{\bar{p}(T,T+\Delta )}=\exp \left[ -\tilde{A}(T,T+\Delta )-\kappa \tilde{B}^1\Psi _T^1+\bar{B}^3(T,T+\Delta )\Psi _T^3\right] . \end{aligned}$$
(21)

3.2 The Result Itself

We introduce the

Definition 3.1

We call adjustment factor the process

$$\begin{aligned} \mathrm{{Ad}}_t^{T,\Delta }:=E^Q\left\{ \frac{p(T,T+\Delta )}{\bar{p}(T,T+\Delta )}\mid \fancyscript{F}_t\right\} , \end{aligned}$$
(22)

and shall prove the following

Proposition 3.1

We have

$$\begin{aligned} \bar{\nu }_{t,T}=\nu _{t,T}\cdot \mathrm{{Ad}}_t^{T,\Delta }\cdot \exp \left[ \kappa \frac{(\sigma ^1)^2}{2(b^1)^3}\left( 1-e^{-b^1\Delta }\right) \left( 1-e^{-b^1(T-t)}\right) ^2\right] \end{aligned}$$
(23)

with two adjustment factors on the right, of which the first one can be expressed as

$$\begin{aligned} \begin{array}{lcl}\mathrm{{Ad}}_t^{T,\Delta }&{}=&{}e^{-\tilde{A}(T,T+\Delta )}E^Q\left\{ e^{-\kappa \tilde{B}^1\Psi _T^1+\bar{B}^3(T,T+\Delta )\Psi _T^3}\mid \fancyscript{F}_t\right\} \\ {} &{}:=&{}A(\theta ,\kappa , \Psi _t^1, \Psi _t^3)\end{array} \end{aligned}$$
(24)

with \(\theta :=(a^i,b^i,\sigma ^i,\>i=1,2,3).\)

One may notice the analogy here with the multiplicative forward basis in Bianchetti (2012).

As a consequence of the previous proposition we have the following relation between the fair value \(\bar{K}_t\) of the fixed rate in an actual FRA and the fair value \(K_t\) in a corresponding riskless one:

Corollary 3.1

The following relationship holds

$$\begin{aligned} \bar{K}_t=\left( K_t+\frac{1}{\Delta }\right) \cdot \mathrm{{Ad}}_t^{T,\Delta }\cdot \exp \left[ \kappa \frac{(\sigma ^1)^2}{2(b^1)^3}\left( 1-e^{-b^1\Delta }\right) \left( 1-e^{-b^1(T-t)}\right) ^2\right] -\frac{1}{\Delta } \end{aligned}$$
(25)

Notice that the factor given by the exponential is equal to \(1\) for zero correlation, i.e. for \((\kappa =0)\).

3.3 Comments on the Main Result

3.3.1 Comments Concerning the Adjustment Factors

An easy intuitive interpretation of the main result can be obtained in the case of \(\kappa =0\) (independence of \(r_t\) and \(s_t\)): in this case we have \(r_t+s_t>r_t\) implying \(\bar{p}(T,T+\Delta )<p(T,T+\Delta )\) so that \(Ad_t^{T,\Delta }\ge 1\) (the exponential adjustment factor is equal to \(1\)). As expected, from Proposition 3.1 and Corollary 3.1 it then follows that

$$\begin{aligned} \bar{\nu }_{t,T}\ge \nu _{t,T},\quad \bar{K}_t\ge K_t \end{aligned}$$
(26)

To gain some intuition for the cases when \(\kappa \not =0\), let \(\bar{p}^{\kappa }(t,T), \bar{\nu }_{t,T}^{\kappa }, Ad_t^{T,\Delta ,\kappa }\) denote the given quantities by stressing that the correlation parameter has value \(\kappa \). Notice that \(p(t,T)\) and thus also \(\nu _{t,T}\) do not depend on \(\kappa \). Consider then the case \(\kappa >0\), which is the standard case implying negative correlation between \(r_t\) and \(s_t\). (The case \(\kappa <0\) is analogous/dual). For illustrative purposes we distinguish between the two events \(\{\Psi _t^1>0,\>\forall t\in [T,T+\Delta ]\},\> \{\Psi _t^1<0,\>\forall t\in [T,T+\Delta ]\}\) where the latter occurs only with small probability (in reality, \(\Psi _t^1\) will be positive for certain values of \(t\) and negative for the remaining ones).

On \(\{\Psi _t^1>0,\>t\in [T,T+\Delta ]\}\) we now have

$$\begin{aligned} \begin{array}{l}\bar{p}^{\kappa }(T,T+\Delta )<\bar{p}^{0}(T,T+\Delta )\\ \Rightarrow \bar{\nu }_{t,T}^{\kappa }>\bar{\nu }_{t,T}^{0}\quad \Rightarrow \quad \bar{\nu }_{t,T}^{\kappa }/\nu _{t,T}>\bar{\nu }_{t,T}^{0}/\nu _{t,T}\end{array} \end{aligned}$$
(27)

Recalling then

$$\begin{aligned} \bar{\nu }_{t,T}^{\kappa }=\nu _{t,T}\cdot Ad_t^{T,\Delta ,\kappa }\cdot \exp \left[ \kappa \frac{(\sigma ^1)^2}{2(b^1)^3}\left( 1-e^{-b^1\Delta }\right) \left( 1-e^{-b^1(T-t)}\right) ^2\right] \end{aligned}$$
(28)

the last inequality in (27) can be seen to be in line with the fact that, in this case, in (28) the exponential factor is \(>\)1 and \(Ad_t^{T,\Delta ,\kappa }>Ad_t^{T,\Delta ,0}\) (recall Definition 3.1).

On the other hand, on \(\{\Psi _t^1<0,\>t\in [T,T+\Delta ]\}\), we have

$$\begin{aligned} \bar{p}^{\kappa }(T,T+\Delta )>\bar{p}^{0}(T,T+\Delta )\quad \Rightarrow \quad \bar{\nu }_{t,T}^{\kappa }/\nu _{t,T}<\bar{\nu }_{t,T}^{0}/\nu _{t,T} \end{aligned}$$
(29)

This inequality can be seen to be in line with the fact that, here, \(Ad_t^{T,\Delta ,\kappa }<Ad_t^{T,\Delta ,0}\), but the exponential factor is still \(>\)1. This can nevertheless be explained by noticing that, in this case, \(r_t\) is relatively large and \(r_t+s_t\) is closer to \(r_t\) (may be even \({<}r_t\)). This implies a push of \(\bar{\nu }_{t,T}^{\kappa }/\nu _{t,T}\) towards smaller values than in the previous case.

3.3.2 Comments Concerning the Use of the Results for Calibration

For what concerns calibration of our model to FRA and other available market data, notice that the coefficients \(a^ 1,a^ 2, b^ 1,b^ 2, \sigma ^ 1,\sigma ^ 2\) can be calibrated in the usual way on the basis of the observations of default-free bonds \(p(t,T)\) (if we had a Hull & White extension of our Vasicek-type model (6) then also for this model the calibration could be performed as in the standard case). To calibrate \(a^ 3,b^ 3,\sigma ^ 3\), notice that, contrary to \(p(t,T)\), the “risky” bonds \(\bar{p}(t,T)\) are not observable (relation (3) does not imply a unique inverse relationship to determine \(\bar{p}(t,T)\) from observations of the LIBORs). One can however observe \(K_t=\frac{1}{\Delta }\left( \frac{p(t,T)}{p(t,T+\Delta )}-1\right) \) as well as the “risky” FRA rate \(\bar{K}_t\). Recalling then Corollary 3.1 and the fact that \(Ad_t^{T,\Delta }=A(\theta ,\kappa , \Psi _t^1, \Psi _t^3),\) notice that, having calibrated \(a^i,b^i,\sigma ^i\>(i=1,2)\), from the observations of \(K_t\) and \(\bar{K}_t\) one could thus calibrate \(a^3,b^3,\sigma ^3\) as well as \(\kappa \). If there is a way to determine directly \(Ad_t^{T,\Delta }\) (e.g. by observing the FRA rates for uncorrelated \(r_t\) and \(s_t\)), then the relationship between \(K_t\) and \(\bar{K}_t\) as expressed in Corollary 3.1 would allow to calibrate separately \(\kappa \). We furthermore recall that, as pointed out in Crépey et al. (2013b), calibration of clean prices is sufficient also when using the model to compute possible value adjustments.

3.4 Proof of the Main Result

Since the quantities of interest, namely \(\bar{\nu }_{t,T}\) and \(\nu _{t,T}\) were defined under the forward measure (see (8) and (10)), as a first step we perform a change from the forward measure \(Q^{T+\Delta }\) to the standard martingale measure \(Q\). To this effect, putting \(b_t:=\exp \left[ \int _0^tr_udu\right] \), the density process for changing from \(Q\) to \(Q^{T+\Delta }\) is \(L_t=\frac{p(t,T+\Delta )}{p(0,T+\Delta )b_t}.\) We can thus write

$$\begin{aligned} \begin{array}{lcl} \bar{\nu }_{t,T}&{}=&{}E^{T+\Delta }\left\{ \frac{1}{\bar{p}(T,T+\Delta )}\mid \fancyscript{F}_t\right\} =L_t^{-1}E^Q\left\{ \frac{L_{T+\Delta }}{\bar{p}(T,T+\Delta )}\mid \fancyscript{F}_t\right\} \\ \\ &{}=&{}\frac{1}{p(t,T+\Delta )}E^Q\left\{ \exp [-\int \limits _t^{T}r_udu]\,\frac{p(T,T+\Delta )}{\bar{p}(T,T+\Delta )}\mid \fancyscript{F}_t\right\} \end{array} \end{aligned}$$
(30)

Recalling the expression for \(p(T,T+\Delta )/\bar{p}(T,T+\Delta )\) (see (21)) this becomes

$$\begin{aligned} \bar{\nu }_{t,T}=&\frac{1}{p(t,T+\Delta )}E^Q\Big \{e^{-\int \limits _t^{T}r_udu} \cdot \exp \left[ -\tilde{A}(T,T+\Delta )-\kappa \tilde{B}^1\Psi _T^1+\bar{B}^3(T,T+\Delta )\Psi _T^3\right] \mid \fancyscript{F}_t\Big \} \nonumber \\ =&\frac{1}{p(t,T+\Delta )}\exp \left[ -\tilde{A}(T,T+\Delta )\right] E^Q\left\{ e^{\bar{B}^3(T,T+\Delta )\Psi _T^3}\mid \fancyscript{F}_t\right\} \\&\cdot E^Q\left\{ e^{-\int \limits _t^T(-\Psi _u^1+\Psi _u^2)du}e^{-\kappa \tilde{B}^1\Psi _T^1}\mid \fancyscript{F}_t\right\} \nonumber \end{aligned}$$
(31)

To proceed, consider the process \(F_t\) given by the last factor in (31), namely

$$\begin{aligned} F_t:=E^Q\left\{ e^{-\int \limits _t^T(-\Psi _u^1+\Psi _u^2)du}e^{-\kappa \tilde{B}^1\Psi _T^1}\mid \fancyscript{F}_t\right\} \end{aligned}$$
(32)

Due to the affine dynamics of \(\Psi _t^i,\>i=1,2,\) and the independence of \(\Psi _t^1\) and \(\Psi _t^2\), we may write

$$\begin{aligned} F_t:=&\,E^Q\left\{ e^{\int \limits _t^T\Psi _u^1du}e^{-\kappa \tilde{B}^1\Psi _T^1}\mid \fancyscript{F}_t\right\} E^Q\left\{ e^{-\int \limits _t^T\Psi _u^2du}\mid \fancyscript{F}_t\right\} \nonumber \\ =&\exp \left[ \alpha ^1(t,T)-\beta ^1(t,T)\Psi _t^1\right] \,\exp \left[ \alpha ^2(t,T)-\beta ^2(t,T)\Psi _t^2\right] \end{aligned}$$
(33)

where the coefficients satisfy

$$\begin{aligned} \left\{ \begin{array}{ll} \beta _t^1-b^1\beta ^1-1=0,&{} \beta ^1(T,T)=\kappa \tilde{B}^1\\ \beta ^2_t-b^2\beta ^2-\frac{(\sigma ^2)^2}{2}(\beta ^2)^2+1=0,&{}\beta ^2(T,T)=0\\ \alpha ^1_t=-\frac{(\sigma ^1)^2}{2}(\beta ^1)^2+a^1\beta ^1,&{}\alpha ^1(T,T)=0\\ \alpha ^2_t=a^2\beta ^2,&{}\alpha ^2(T,T)=0\end{array}\right. \end{aligned}$$
(34)

Recalling also (12)–(14), the solutions of the system (34) can be expressed as

$$\begin{aligned} \left\{ \begin{array}{lcl}\beta ^1(t,T)&{}=&{}\frac{1}{b^1}\left[ (b^1\kappa \tilde{B}^1+1)e^{-b^1(T-t)}-1\right] =B^1(t,T)+\kappa \tilde{B}^1e^{-b^1(T-t)}\\ \beta ^2(t,T)&{}=&{}B^2(t,T)\\ \alpha ^1(t,T)&{}=&{}\frac{(\sigma ^1)^2}{2}\int \limits _t^T(\beta ^1(u,T))^2du-a^1\int \limits _t^T\beta ^1(u,T)du\\ &{}=&{}\frac{(\sigma ^1)^2}{2}\int \limits _t^T(B^1(u,T))^2du-a^1\int \limits _t^TB^1(u,T)du\\ &{}{}&{}+\,\frac{(\sigma ^1)^2}{2}(\kappa \tilde{B}^1)^2\int \limits _t^Te^{-2b^1(T-u)}du\\ &{}{}&{}+\,\kappa \tilde{B}^1(\sigma ^1)^2\int \limits _t^TB^1(u,T)e^{-b^1(T-u)}du-a^1\kappa \tilde{B}^1\int \limits _t^Te^{-b^1(T-u)}du\\ \alpha ^2(t,T)&{}=&{}-a^2\int \limits _t^TB^2(u,T)du\end{array}\right. \end{aligned}$$
(35)

Consequently

$$\begin{aligned} \begin{array}{l} F_t=\exp \Bigl [\frac{(\sigma ^1)^2}{2}\int \limits _t^T(B^1(u,T))^2du-a^1\int \limits _t^TB^1(u,T)du\\ \qquad \,\,\,-a^2\int \limits _t^TB^2(u,T)du-B^1(t,T)\Psi _t^1-B^2(t,T)\Psi _t^2\Big ]\\ \qquad \,\,\,\cdot \exp \Big [\frac{(\sigma ^1)^2}{2}(\kappa \tilde{B}^1)^2\int \limits _t^Te^{-2b^1(T-u)}du-a^1\kappa \tilde{B}^1\int \limits _t^Te^{-b^1(T-u)}du\\ \qquad \,\,\,-\kappa \tilde{B}^1e^{-b^1(T-t)}\Psi _t^1\Big ]\\ \qquad \,\,\,\cdot \exp \left[ \kappa \tilde{B}^1(\sigma ^1)^2\int \limits _t^TB^1(u,T)e^{-b^1(T-u)}du\right] \\ \\ \quad \,=p(t,T)\cdot \exp \Big [\frac{(\sigma ^1)^2}{2}(\kappa \tilde{B}^1)^2\int \limits _t^Te^{-2b^1(T-u)}du-a^1\kappa \tilde{B}^1\int \limits _t^Te^{-b^1(T-u)}du\\ \qquad \,\,\,-\kappa \tilde{B}^1e^{-b^1(T-t)}\Psi _t^1\Big ]\\ \qquad \,\,\,\cdot \exp \left[ \kappa \tilde{B}^1(\sigma ^1)^2\int \limits _t^TB^1(u,T)e^{-b^1(T-u)}du\right] \end{array} \end{aligned}$$
(36)

On the other hand, recalling (21), one obtains

$$\begin{aligned} \begin{array}{l} E^Q\left\{ \frac{p(T,T+\Delta )}{\bar{p}(T,T+\Delta )}\mid \fancyscript{F}_t\right\} \\ \quad =e^{-\tilde{A}(T,T+\Delta )}E^Q\left\{ e^{\bar{B}^3(T,T+\Delta )\Psi _T^3}\mid \fancyscript{F}_t\right\} E^Q\left\{ e^{-\kappa \tilde{B}^1\Psi _T^1}\mid \fancyscript{F}_t\right\} \end{array} \end{aligned}$$
(37)

where, due to the affine dynamics of \(\Psi _t^1\), we may write

$$\begin{aligned} E^Q\left\{ e^{-\kappa \tilde{B}^1\Psi _T^1}\mid \fancyscript{F}_t\right\} =\exp \left[ \bar{\alpha }(t,T)-\bar{\beta }(t,T)\Psi _t^1\right] \end{aligned}$$
(38)

with \(\bar{\alpha }(\cdot )\) and \(\bar{\beta }(\cdot )\) satisfying

$$\begin{aligned} \left\{ \begin{array}{ll} \bar{\beta }_t-b^1\bar{\beta }=0, &{}\bar{\beta }(T,T)=\kappa \tilde{B}^1\\ \bar{\alpha }_t=a^1\bar{\beta }-\frac{(\sigma ^1)^2}{2}(\bar{\beta })^2,&{}\bar{\alpha }(T,T)=0\end{array}\right. \end{aligned}$$
(39)

so that

$$\begin{aligned} \begin{array}{lcl} \bar{\beta }(t,T)&{}=&{}\kappa \tilde{B}^1e^{-b^1(T-t)}\\ \bar{\alpha }(t,T)&{}=&{}-a^1\kappa \tilde{B}^1\int \limits _t^Te^{-b^1(T-u)}du+\frac{(\sigma ^1)^2}{2}(\kappa \tilde{B}^1)^2\int \limits _t^Te^{-2b^1(T-u)}du\end{array} \end{aligned}$$
(40)

and, consequently,

$$\begin{aligned} \begin{array}{l}E^Q\left\{ e^{-\kappa \tilde{B}^1\Psi _T^1}\mid \fancyscript{F}_t\right\} =\exp \left[ -\kappa \tilde{B}^1e^{-b^1(T-t)}\Psi _t^1\right] \\ \quad \exp \left[ -a^1\kappa \tilde{B}^1\int \limits _t^Te^{-b^1(T-u)}du+\frac{(\sigma ^1)^2}{2}(\kappa \tilde{B}^1)^2\int \limits _t^Te^{-2b^1(T-u)}du\right] \end{array} \end{aligned}$$
(41)

Combining (31) with (36) as well as with (37) together with (41), we obtain

$$\begin{aligned} \begin{array}{l} \bar{\nu }_{t,T}\,=\frac{1}{p(t,T+\Delta )}\exp \left[ -\tilde{A}(T,T+\Delta )\right] E^Q\left\{ e^{\bar{B}^3(T,T+\Delta )\Psi _T^3}\mid \fancyscript{F}_t\right\} \cdot F_t\\ \\ \qquad \,\,=\frac{p(t,T)}{p(t,T+\Delta )}\,E^Q\left\{ \frac{p(T,T+\Delta )}{\bar{p}(T,T+\Delta )}\mid \fancyscript{F}_t\right\} \\ \qquad \quad \,\cdot \exp \left[ \kappa (\sigma ^1)^2 \tilde{B}^1\int \limits _t^TB^1(u,T)e^{-b^1(T-u)}du\right] .\end{array} \end{aligned}$$
(42)

The result then follows noticing that

$$\begin{aligned} \tilde{B}^1\int \limits _t^TB^1(u,T)e^{-b^1(T-u)}du=\frac{1}{2(b^1)^3}\left( 1-e^{-b^1\Delta }\right) \left( 1-e^{-b^1(T-t)}\right) ^2. \end{aligned}$$
(43)

4 Aspects of CAP Pricing

4.1 Preliminary Comments

This part is related to work in progress, but we want nevertheless to present some ideas on how our results obtained for FRAs (linear derivatives) can be extended to nonlinear derivatives. To discuss a specific case, we concentrate here on the pricing of a single Caplet, with strike \(K\), maturity \(T\) on the spot LIBOR for the period \([T,T+\Delta ]\). Using the forward measure \(Q^{T+\Delta }\), its price in \(t<T\) is then given by

$$\begin{aligned} Capl^{T,\Delta }(t)=&\,\Delta p(t,T+\Delta )E^{T+\Delta }\left\{ \left( \bar{L}(T;T,T+\Delta )-K\right) ^+\mid \fancyscript{F}_t\right\} \nonumber \\ =&p(t,T+\Delta )E^{T+\Delta }\left\{ \left( \frac{1}{\bar{p}(T,T+\Delta )}-\tilde{K}\right) ^+\mid \fancyscript{F}_t\right\} \end{aligned}$$
(44)

with \(\tilde{K}:=1+\Delta K\).

As model, we may use the same “ risky” short rate model as for the FRAs that we may consider as already calibrated (for the standard martingale measure \(Q\)). It may thus suffice to derive just a pricing algorithm that need not also be used for calibration. The most convenient way to price a Caplet is, as in (44) and as we do it below, to compute the expectations under the forward measure. Notice however that expectations with respect to a forward measure can be easily computed by performing a change to the standard martingale measure (see e.g. (30)), namely the one for which we may already have calibrated the model. Besides pricing, it may be desirable to obtain also here an “adjustment factor”.

4.2 A Possible Pricing Methodology

For the pricing, in the forward measure, we may use Fourier transform methods as in Crépey et al. (2012) and Crépey et al. (2013b) thereby representing the claim as

$$\begin{aligned} \left( e^X-\bar{K}\right) ^+\quad \text{ with }\quad X:=-\log \bar{p}(T,T+\Delta ) \end{aligned}$$
(45)

We then need only to compute the moment generating function of \(X\), which is a linear combination of the factors (this computation is feasible thanks to the affine structure) and use the Fourier transform of \(f(x)=\left( e^x-\bar{K}\right) ^+\), which is well-known.

Notice that one could possibly also apply a Gram-Charlier expansion as in Kijima et al. (2009).

With the Fourier transform method the price in \(t=0\) of the Caplet can then be obtained in the form (see Crépey et al. (2013b))

$$\begin{aligned} \mathrm{{Capl}}(0,T, T+\Delta )=\frac{p(0,T+\Delta )}{2\pi }\,\int \frac{\tilde{K}^{1-iv-R}\bar{M}_X^{T+\Delta }(R+iv)}{(R+iv)\,(R+iv-1)}\,dv \end{aligned}$$
(46)

where \(\bar{M}_X^{T+\Delta }(\cdot )\) is the moment generating function of \(X\) under the \((T+\Delta )\)—forward measure and \(R\) is such that \(\bar{M}_X^{T+\Delta }(R+iv)\) is finite. This moment generating function can be computed for each of the various forward measures in terms of the \(Q\)—characteristics of the factors, analogously to the computations in Sect. 3.4 (see, in particular, (30)). From these computations one can also see that the Radon-Nikodym-derivative to change from \(Q\) to \(Q^{T+\Delta }\) can in fact be expressed in explicit form and it preserves the affine structure, see Corollary 10.2 in Filipović (2009) (For a recent account on conditions for an absolutely continuous measure transformation to preserve the affine structure see Fontana and Montes (2014)).

If \(M_X^{T+\Delta }(z)\) is the moment generating function of \(X\) with \(p(T,T+\Delta )\) instead of \(\bar{p}(T,T+\Delta )\), then

$$\begin{aligned} \bar{M}_X^{T+\Delta }(z)=M_X^{T+\Delta }(z) A(z; \theta , \kappa , \Psi _0^1, \Psi _0^2,\Psi _0^3) \end{aligned}$$
(47)

where \(A(z; \theta , \kappa , \Psi _0^1, \Psi _0^2, \Psi _0^3)=E^{T+\Delta }\left\{ \left( M_X^{T+\Delta }(z)\right) ^{-1}\,e^{z\,X}\right\} .\) Now, from the expression for \(p(t,T)\) in (12) we obtain

$$\begin{aligned} M_X^{T+\Delta }(z)=&\, E^{T+\Delta }\left\{ e^{zX}\right\} =E^{T+\Delta }\left\{ e^{-z\log p(T,T+\Delta )}\right\} \nonumber \\ =&\, E^{T+\Delta }\left\{ \exp \left[ -zA(T,T+\Delta )+zB^1(T,T+\Delta )\Psi _T^1+zB^2(T,T+\Delta )\Psi _T^2\right] \right\} \end{aligned}$$
(48)

On the other hand, from the expression for \(\bar{p}(t,T)\) in (15) (see also the variant in (20), where the parameter \(\kappa \) appears explicitly) we obtain

$$\begin{aligned} \begin{array}{l} A(z; \theta , \kappa , \Psi _0^1, \Psi _0^2, \Psi _0^3)=\left( M_X^{T+\Delta }(z)\right) ^{-1}E^{T+\Delta }\left\{ e^{-z\log \bar{p}(T,T+\Delta )}\right\} \\ \quad =E^{T+\Delta }\left\{ \exp \left[ -z\bar{A}(\cdot )+z\bar{B}^1(\cdot )\Psi _T^1+z\bar{B}^2(\cdot )\Psi _T^2+z\bar{B}^3(\cdot )\Psi _T^3\right] \right\} \end{array} \end{aligned}$$
(49)

where \((\cdot )\) stands for \((T,T+\Delta )\). Given the affine nature of the factors, both expressions in (48) and (49) can be explicitly computed as a function of the parameters of the model and the initial values \(\Psi _0^1, \Psi _0^2, \Psi _0^3\) of the factors, as expressed by the symbol \(A(z; \theta , \kappa , \Psi _0^1, \Psi _0^2, \Psi _0^3)\). We may now consider \(A(z; \theta , \kappa , \Psi _0^1, \Psi _0^2, \Psi _0^3)\) as adjustment factor for this nonlinear example given by the Caplets. It is not as explicit as the adjustment factor for the FRAs in (23) and (25) and we are presently working on obtaining a more explicit form also in this case.