Pricing swaptions and zero-coupon futures options under the discrete-time arbitrage-free Nelson–Siegel model

Godin, Frédéric; Eghbalzadeh, Ramin; Gaillardetz, Patrice

doi:10.1007/s11147-023-09196-4

Pricing swaptions and zero-coupon futures options under the discrete-time arbitrage-free Nelson–Siegel model

Published: 04 October 2023

Volume 26, pages 171–206, (2023)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Review of Derivatives Research Aims and scope Submit manuscript

Pricing swaptions and zero-coupon futures options under the discrete-time arbitrage-free Nelson–Siegel model

Download PDF

Frédéric Godin ORCID: orcid.org/0000-0001-5097-5269^1,2,
Ramin Eghbalzadeh¹ &
Patrice Gaillardetz^1,2

221 Accesses
Explore all metrics

Abstract

The paper outlines pricing procedures for several interest rate derivatives under the discrete-time arbitrage-free Nelson–Siegel (DTAFNS) model of Eghbalzadeh et al. (The discrete-time arbitrage-free Nelson–Siegel model: a closed-form solution and applications to mixed funds representation, 2022). Derivatives considered include swaptions, zero-coupon futures, and options on such futures. Formulas for expected excess returns are also provided for options on futures. Whereas swaption pricing relies on Monte-Carlo simulation, closed-form formulas are obtained for all other derivatives.

Interest Rate Derivatives: One Factor Spot Rate Models

American options and stochastic interest rates

Article Open access 12 May 2022

Black’s model in a negative interest rate environment, with application to OTC derivatives

Article Open access 02 July 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Interest rate risk management is of paramount importance for financial institutions such as banks and insurance companies. Several financial derivatives can be used for this purpose, e.g. interest rate swaps, zero-coupon futures, and options on such contracts. Developing pricing procedures for these derivatives is therefore essential. Indeed, the calibration of interest rate dynamics models relies on these pricing procedures. Furthermore, a strand of literature is interested in studying the risk premium embedded in option prices, see for example Coval and Shumway (2001), Bakshi et al. (2023), Bakshi et al. (2022). In the context of interest rate options, Bakshi et al. (2023) recall the puzzling stylized fact of negative excess returns for both out-of-the-money call and put options on treasury futures, and propose a pricing kernel model explaining such feature. The ability to calculate option risk premia implied by interest rate models is therefore useful to better analyse whether a given model is consistent with observed properties of option prices.

The main objective of this study is to provide procedures and formulas to obtain prices for several interest derivatives, namely swaps, swaptions, zero-coupon futures and European options on such futures. Expected excess return formulas for options on zero-coupon futures are also provided. The formulas presented are based on the discrete-time arbitrage-free Nelson–Siegel (DTAFNS) model of Eghbalzadeh et al. (2022), which is a discrete-time version of the original arbitrage-free Nelson–Siegel model developed in Christensen et al. (2011). Such model has numerous advantages. Firstly, being within the family of affine term structure (ATS) models (see Duffie and Kan, 1996) where spot rates are linear combinations of risk factors, it is highly tractable. Secondly, it provides a clear interpretation for factors driving term structure movements: they respectively drive the yield curve’s level, its slope and its curvature. Finally it ensures absence of arbitrage.

Several other works also study the pricing of swaptions and other interest rate options in the context of multi-factor or ATS models. We list a few here. The pioneering works of Black et al. (1990) and Black and Karasinski (1991) show how to price zero-coupon options based respectively on a one-factor binomial tree model and a log-normal diffusion model. Munk (1999) demonstrates that the price of a European option on a coupon bond (e.g. a swaption) is roughly equal to some multiple of the price of a European option on a zero-coupon bond with maturity equal to the coupon bond’s stochastic duration. Collin-Dufresne and Goldstein (2002) propose to apply an Edgeworth expansion to approximate the density of the coupon bond price and obtain the price of a swaption. Singleton and Umantsev (2002) rely on Fourier inversion methods to calculate swaption prices in the ATS framework. Schrager and Pelsser (2006) propose to approximate the swap rate volatility under the swap measure, which is a low-variance martingale, by its time-zero value. Such strategy leads to a closed-form formula for the swaption price.

The paper is structured as follows. Section 2 provides a description of the DTAFNS term structure model. Section 3 presents pricing procedures for swaptions, whereas Sect. 4 provides formulas for prices and expected excess returns of options on zero-coupon futures. Section 5 briefly discusses calibration methods for the DTAFNS model relying on option prices. Section 6 concludes.

2 The DTAFNS model

This section discusses interest rate dynamics in the DTAFNS model of Eghbalzadeh et al. (2022). Dynamics are provided under three probability measures: the physical measure, the risk-neutral measure and the forward measure. All three measures are required for the computation of prices and associated risk premia of derivatives considered in this study.

2.1 Risk-neutral dynamics in the DTAFNS model

This section provides a description of risk-neutral dynamics in the DTAFNS interest rate term structure model. Consider a discrete-time setting with monthly time points $t=0,\ldots ,T$ and time elapse $\Delta$ year between each point. The filtration $\mathcal {F}=\{\mathcal {F}_t\}^{T}_{t=0}$ characterizes the information flow in the market. The DTAFNS model assumes that the term structure of interest rates is determined by three factors: the long-term level of interest rates, the slope of the yield curve and its curvature. The time-t short rate applying over period $[t,t+1)$ is

$$\begin{aligned} r_t= X^{(1)}_t + X^{(2)}_t, \end{aligned}$$

(2.1)

with $\{X_t\}^{T}_{t=0}$ denoting the term structure factor process, where time-t factors are the triplet $X_t= [X^{(1)}_t, X^{(2)}_t, X^{(3)}_t]^\top$.

Under the risk-neutral measure $\mathbb {Q}$, factors exhibit the following auto-regressive dynamics:

$$\begin{aligned} \left( {\begin{array}{c} X^{(1)}_{t+1}-X^{(1)}_t \\ X^{(2)}_{t+1}-X^{(2)}_t \\ X^{(3)}_{t+1}-X^{(3)}_t \\ \end{array} } \right)&= \underbrace{\left[ \begin{array}{ccc} 0 &{} 0 &{} 0 \\ 0 &{} \lambda &{} -\lambda \\ 0 &{} 0 &{} \lambda \end{array} \right] }_{ \kappa ^\mathbb {Q}} \underbrace{\left[ \begin{array}{c} \theta ^{\mathbb {Q}}_1-X^{(1)}_t \\ \theta ^{\mathbb {Q}}_2-X^{(2)}_t \\ \theta ^{\mathbb {Q}}_3-X^{(3)}_t \end{array} \right] }_{ \theta ^{\mathbb {Q}}-X_t } + \underbrace{\left( {\begin{array}{ccc} \Sigma _{1,1} &{} 0 &{} 0 \\ 0 &{} \Sigma _{2,2} &{} 0 \\ 0&{} 0 &{} \Sigma _{3,3} \\ \end{array} } \right) }_{\Sigma } \left( {\begin{array}{c} Z^{\mathbb {Q}}_{t+1,1} \\ Z^{\mathbb {Q}}_{t+1,2} \\ Z^{\mathbb {Q}}_{t+1,3} \\ \end{array} } \right) , \end{aligned}$$

(2.2)

where scalar $\lambda \in (0,1)$ and matrices $\theta ^{\mathbb {Q}}$, $\kappa ^{\mathbb {Q}}$ and $\Sigma$, with $\Sigma _{i,i}>0$, represent model parameters, and $\{ Z^{\mathbb {Q}}_{t,i} \}^{n}_{t=1}$, $i=1,2,3$ are $\mathcal {F}$-adapted standard Gaussian white noises with contemporaneous correlation $\text {Corr}[Z^{\mathbb {Q}}_{t_1,i},Z^{\mathbb {Q}}_{t_2,j}] = \mathbbm {1}_{ \{t_1=t_2\} } \rho _{i,j}$ represented by correlation matrix $\rho = \left[ \rho _{i,j} \right] ^3_{i,j=1}$. We set $\theta ^{\mathbb {Q}}_1=0$ since such parameter is unused.

As shown in Eghbalzadeh et al. (2022), the time-t price of a risk-free zero-coupon bond paying one dollar on maturity $\mathcal {T}>t$ is, under the such model,

$$\begin{aligned} P(t,\mathcal {T}) = A_\tau \exp \left[ -\Delta \mathcal {B}_\tau ^\top X_t \right] , \end{aligned}$$

(2.3)

where $\tau =\mathcal {T}-t$, $\mathcal {B}_\tau =\left[ \mathcal {B}^{(1)}_\tau , \,\, \mathcal {B}^{(2)}_\tau , \, \, \mathcal {B}^{(3)}_\tau \right] ^\top$ and

$$\begin{aligned} \mathcal {B}^{(1)}_\tau&= \tau , \quad \mathcal {B}^{(2)}_\tau = \dfrac{1-(1-\lambda )^\tau }{\lambda }, \quad \mathcal {B}^{(3)}_\tau = \frac{1-(1-\lambda )^{\tau -1}}{\lambda } - (\tau -1) (1-\lambda )^{\tau -1}, \end{aligned}$$

(2.4)

$$\begin{aligned} \log A_\tau&= -\Delta \theta ^{\mathbb {Q}}_2 \left( \mathcal {B}^{(1)}_\tau - \mathcal {B}^{(2)}_\tau \right) + \Delta \theta ^{\mathbb {Q}}_3 \mathcal {B}^{(3)}_\tau + \frac{1}{2} \Delta ^2\upsilon _\tau , \end{aligned}$$

(2.5)

with

$$\begin{aligned} \upsilon _\tau&= \left( \sum ^3_{i=1} \sum ^3_{j=1}\upsilon ^{(i,j)}_\tau \right) , \\ \upsilon ^{(1,1)}_\tau&= \Sigma ^2_{1,1}\dfrac{\tau (\tau -1)(2\tau -1)}{6}, \\ \upsilon ^{(2,2)}_\tau&= \frac{\Sigma ^2_{2,2}}{\lambda ^2} \left( \tau - 2 \left[ \frac{1-(1-\lambda )^\tau }{\lambda }\right] + \frac{1-(1-\lambda )^{2 \tau }}{1-(1-\lambda )^2}\right) , \\ \upsilon ^{(3,3)}_\tau&= \mathbbm {1}_{ \{ \tau> 1\} }\frac{\Sigma ^2_{3,3}}{\lambda ^2} \Bigg [ \tau -2 + \zeta _0\left( (1-\lambda )^2,\tau -1\right) +\lambda ^2 \zeta _2\left( (1-\lambda )^2,\tau -1\right) \\&\quad - 2\zeta _0\left( (1-\lambda ),\tau -1\right) - 2\lambda \zeta _1\left( (1-\lambda ),\tau -1\right) + 2\lambda \zeta _1\left( (1-\lambda )^2,\tau -1\right) \bigg ], \\ \upsilon ^{(1,2)}_\tau&= \upsilon ^{(2,1)}_t = \rho _{1,2}\Sigma _{1,1} \Sigma _{2,2} \frac{1}{\lambda }\left( \frac{\tau (\tau -1)}{2} - \zeta _1 ((1-\lambda ),\tau )\right) , \\ \upsilon ^{(1,3)}_\tau&= \upsilon ^{(3,1)}_t = \mathbbm {1}_{ \{ \tau> 1\} }\rho _{1,3}\Sigma _{1,1} \Sigma _{3,3} \frac{1}{\lambda } \\&\qquad \bigg [\frac{\tau (\tau -1)}{2}-1 -\zeta _0\left( (1-\lambda ),\tau -1\right) -(\lambda +1)\zeta _1\left( (1-\lambda ),\tau -1\right) \\&\quad -\lambda \zeta _2\left( (1-\lambda ),\tau -1\right) \bigg ], \\ \upsilon ^{(2,3)}_\tau&= \upsilon ^{(3,2)}_t = \mathbbm {1}_{ \{ \tau > 1\} } \rho _{2,3}\Sigma _{2,2} \Sigma _{3,3} \\&\quad \bigg ( \frac{\tau -2- (2-\lambda )\zeta _0\left( (1-\lambda ),\tau -1\right) + (1-\lambda )\zeta _0\left( (1-\lambda )^2,\tau -1\right) }{\lambda ^2} \\&\quad + \frac{- \zeta _1\left( (1-\lambda ),\tau -1\right) + (1-\lambda )\zeta _1\left( (1-\lambda )^2,\tau -1\right) }{\lambda } \bigg ), \end{aligned}$$

(2.6)

and

$$\begin{aligned} \zeta _0(r,\tau )&\equiv \sum _{u=1}^{\tau -1}r^{u} = \dfrac{r-r^{\tau }}{1-r}, \end{aligned}$$

(2.7)

$$\begin{aligned} \zeta _1(r,\tau )&\equiv \sum _{u=1}^{\tau -1}u r^{u} = \dfrac{r- \tau r^{\tau }+(\tau -1) r^{\tau +1}}{(1-r)^2}, \end{aligned}$$

(2.8)

$$\begin{aligned} \zeta _2(r,\tau )&\equiv \sum _{u=1}^{\tau -1}u^2 r^{u} = \dfrac{ -(\tau -1)^2 r^{\tau +2} + (2\tau ^2-2\tau -1)r^{\tau +1} - \tau ^2 r^{\tau } + r^2+r}{(1-r)^3}. \end{aligned}$$

(2.9)

Remark 2.1

For the rest of the paper, the convention $\mathcal {B}^{(1)}_0=\mathcal {B}^{(2)}_0=\mathcal {B}^{(3)}_0=0$, $A_0=1$ and $\upsilon _{0}=0$ is used, which makes (2.3) hold for $\tau =0$.

2.2 Forward measure dynamics in the DTAFNS model

Prices of financial derivatives are expressed as expected discounted payoffs under the risk-neutral measure. However, the interaction between the stochastic discount factor and the derivatives payoff is often non-trivial and complexifies the pricing. A common technique to ease the calculation of prices in the context of stochastic interest rates is the change of numéraire (see for instance Geman et al., 1995). This approach relies on the construction of a new probability measure, called the forward measure, under which the price of a zero-coupon is used a numéraire for discounting. This allows directly discounting with zero-coupon bond prices, thus circumventing the difficulty associated with representing the potentially complex dependence between the payoff and the stochastic discount factor.

The probability measure using the risk-free zero-coupon bond maturing at time $\mathcal {T}$ as a numéraire is known as the $\mathcal {T}$-forward measure and is denoted by $\mathbb {Q}^\mathcal {T}$. The Radon–Nikodym derivative allowing to pass from the risk-neutral to the $\mathcal {T}$-forward measure, which is provided by Jamshidian (1996) or Brigo and Mercurio (2007), is

$$\begin{aligned} \dfrac{d\mathbb {Q}^{\mathcal {T}}}{d\mathbb {Q}}=\dfrac{B(0) P(\mathcal {T},\mathcal {T})}{P(0,\mathcal {T})B(\mathcal {T})}=\dfrac{D(0,\mathcal {T})}{P(0,\mathcal {T})}, \end{aligned}$$

(2.10)

where $B(t)=\exp (\Delta \sum _{s=0}^{t-1}r_s)$ is the year-t bank account numéraire under the risk-neutral measure and $D(t_1,t_2) = B(t_1)/B(t_2)$ is the stochastic discount factor for any $0\le t_1 \le t_2 \le T$. Note that the Radon–Nikodym derivative allowing to go from the forward measure to the risk-neutral measure is $\dfrac{d\mathbb {Q}}{d\mathbb {Q}^{\mathcal {T}}} = \left( \dfrac{d\mathbb {Q}^{\mathcal {T}}}{d\mathbb {Q}}\right) ^{-1}$.

Let $\mathbb {E}^{\mathcal {T}}[\cdot ]$ represent the expectation under the $\mathcal {T}$-forward measure. Asset prices discounted by the zero-coupon price maturing at $\mathcal {T}$ are martingales under the forward measure (Geman, 1989). As a consequence, as discussed in Brigo and Mercurio (2007), the time-t price $H_t$ of an asset providing a payoff $H_\mathcal {T}$ at time $\mathcal {T}$ is

$$\begin{aligned} H_t= P(t,\mathcal {T})\mathbb {E}^{\mathcal {T}}\left[ H_\mathcal {T}|\mathcal {F}_t\right] . \end{aligned}$$

(2.11)

The following proposition defines so-called forward measure innovations and outlines their dynamics.

Proposition 2.1

For any $\mathcal {T} \in \{1,\ldots ,T \}$, $t<\mathcal {T}$ and $\tau =\mathcal {T}-t$, conditional on $\mathcal {F}_t$, the forward measure innovation defined as $Z^{\mathcal {T}}_{t+1}=Z^{\mathbb {Q}}_{t+1}+ \Delta \rho \Sigma \mathcal {B}_{\tau -1}$ follows the multivariate Gaussian distribution with mean vector zero and covariance matrix $\rho$ under the $\mathcal {T}$-forward measure.

Proof

See “Appendix”. $\square$

Corollary 2.1

Since the conditional distribution of $Z^{\mathcal {T}}_{t+1}$ with respect $\mathcal {F}_t$ under the $\mathcal {T}$-forward measure does not depend on $Z^{\mathcal {T}}_1, \dots ,Z^{\mathcal {T}}_t$, and since the latter variables characterize the information contained in $\mathcal {F}_t$, elements of the sequence $\{ Z^{\mathcal {T}}_{j} \}^{\mathcal {T}}_{j=1}$ are independent.

Based on the above results, we now provide an expression for the dynamics of term structure factors $\{ X_t\}^T_{t=0}$ analogous to (2.2), but using instead the $\mathcal {T}$-forward measure innovations. Define

$$\begin{aligned} \theta ^{\mathcal {T}} = \theta ^{\mathbb {Q}}, \quad \kappa ^{\mathcal {T}} = \kappa ^{\mathbb {Q}}, \quad \eta ^{\mathcal {T}}_t=\Delta \Sigma \rho \Sigma \mathcal {B}_{\tau -1} = \Delta \Sigma \rho \Sigma \mathcal {B}_{\mathcal {T}-t-1}. \end{aligned}$$

(2.12)

A direct consequence of the application of Proposition 2.1 into (2.2) is that

$$\begin{aligned} X_{t+1}&=X_t-\eta ^{\mathcal {T}}_t+\kappa ^{\mathcal {T}} (\theta ^{\mathcal {T}}-X_t)+\Sigma Z_{t+1}^{\mathcal {T}}. \end{aligned}$$

(2.13)

Representation (2.13), along with some additional lemmas provided in “Appendix”, allow obtaining the t-conditional distribution of $X_{\mathcal {T}}$ under the $\mathcal {T}$-forward measure.

Proposition 2.2

Under the $\mathcal {T}$-forward measure, conditionally on $\mathcal {F}_t$ and for $t+n \le \mathcal {T}$, factors $X_{t+n}$ follow the multivariate Gaussian distribution with mean vector $\mathcal {M}_{t,n}=\left[ \mathcal {M}^{(i)}_{t,n}\right] ^3_{i=1}$ and covariance matrix $\mathcal {V}_{n}=\left[ \mathcal {V}^{(i,j)}_{n}\right] ^3_{i,j=1}$ where

$$\begin{aligned} \mathcal {M}^{(1)}_{t,n}&=X^{(1)}_{t} - \sum _{l=0}^{n-1}\eta ^{\mathcal {T},(1)}_{t+l}, \\ \mathcal {M}^{(2)}_{t,n}&=X^{(2)}_{t}(1-\lambda )^{n}+(\theta ^\mathcal {T}_2-\theta ^\mathcal {T}_3)\left( 1-(1-\lambda )^n\right) - \sum _{l=0}^{n-1}\eta ^{\mathcal {T},(2)}_{t+l} (1- \lambda )^{n-1-l}\\&\quad + \lambda \bigg (nX^{(3)}_{t} (1-\lambda )^{n-1} + \theta ^{\mathcal {T}}_3 \left( \dfrac{1-(1-\lambda )^n}{\lambda }-n(1-\lambda )^{n-1} \right) \\&\quad -\sum _{l=0}^{n-1}(n-l-1) \eta ^{\mathcal {T},(3)}_{t+l} (1-\lambda )^{n-l-2}\bigg ),\\ \mathcal {M}^{(3)}_{t,n}&=X^{(3)}_{t}(1-\lambda )^{n}+ \theta ^\mathcal {T}_3\left( 1-(1-\lambda )^n\right) - \sum _{l=0}^{n-1}\eta ^{\mathcal {T},(3)}_{t+l} (1- \lambda )^{n-1-l}, \\ \mathcal {V}^{(1,1)}_{n}&= n \Sigma _{1,1}^2,\\ \mathcal {V}^{(2,2)}_{n}&= \Sigma _{2,2}^2 \left( 1+\zeta _0((1-\lambda )^2,n)\right) + \lambda ^2\Sigma _{3,3}^2(1-\lambda )^{-2}\zeta _2((1-\lambda )^2,n)\\&\quad +2\Sigma _{2,2} \lambda \Sigma _{3,3}\rho _{2,3}(1-\lambda )^{-1}\zeta _1\left( \left( 1-\lambda \right) ^2,n\right) ,\\ \mathcal {V}^{(3,3)}_{n}&= \Sigma _{3,3}^2 \left( 1+\zeta _0((1-\lambda )^2,n)\right) ,\\ \mathcal {V}^{(1,2)}_{n}&=\mathcal {V}^{(2,1)}_{n}=\Sigma _{1,1}\Sigma _{2,2}\rho _{1,2} \left( 1+\zeta _0(1-\lambda ,n) \right) +\lambda \Sigma _{1,1}\Sigma _{3,3}\rho _{1,3}\dfrac{\zeta _1(1-\lambda ,n)}{1-\lambda },\\ \mathcal {V}^{(1,3)}_{n}&=\mathcal {V}^{(3,1)}_{n}=\Sigma _{1,1}\Sigma _{3,3}\rho _{1,3} \left( 1+ \zeta _0(1-\lambda ,n)\right) ,\\ \mathcal {V}^{(2,3)}_{n}&=\mathcal {V}^{(3,2)}_{n}=\Sigma _{2,2}\Sigma _{3,3} \rho _{2,3}\left( 1+\zeta _0((1-\lambda )^2,n)\right) +\lambda \Sigma _{3,3}^2\dfrac{\zeta _1((1-\lambda )^2,n)}{1-\lambda }. \end{aligned}$$

Proof

See “Appendix”. $\square$

The following quantities appearing in Proposition 2.2 can be further simplified.

Lemma 2.1

Considering the case $n = \mathcal {T}-t$,

$$\begin{aligned} \sum _{l=0}^{\mathcal {T}-t-1}\eta ^{\mathcal {T},(1)}_{t+l}&=\Delta \Sigma _{1,1}\bigg [\Sigma _{1,1} \frac{(\mathcal {T}-t-1)(\mathcal {T}-t)}{2} \\&\quad +\dfrac{\Sigma _{2,2} \rho _{1,2}}{\lambda } \left( \mathcal {T}-t-1-\zeta _0\left( 1-\lambda ,\mathcal {T}-t\right) \right) \\&\quad +\Sigma _{3,3} \rho _{1,3}\bigg ( \dfrac{\mathcal {T}-t-1-\left( 1+\zeta _0\left( 1-\lambda ,\mathcal {T}-t-1\right) \right) }{\lambda }\\&\quad -\zeta _1\left( 1-\lambda , \mathcal {T}-t-1\right) \bigg )\bigg ]. \end{aligned}$$

Moreover, for $i=2,3$,

$$\begin{aligned}&\sum _{l=0}^{\mathcal {T}-t-1}\eta ^{\mathcal {T},(i)}_{t+l} (1 \!-\! \lambda )^{\mathcal {T}-t-1-l}=\Delta \Sigma _{i,i}\bigg [\Sigma _{1,1} \rho _{i,1} \zeta _1\left( 1-\lambda ,\mathcal {T}-t\right) \\&\quad +\dfrac{\Sigma _{2,2} \rho _{i,2}}{\lambda } \left( \zeta _0\left( 1-\lambda ,\mathcal {T}-t\right) -\zeta _0 \left( \left( 1-\lambda \right) ^{2},\mathcal {T}-t\right) \right) \\&\quad +\Sigma _{3,3} \rho _{i,3}\bigg (\dfrac{\zeta _0\left( 1-\lambda ,\mathcal {T}-t\right) -(1-\lambda )^{-1}\zeta _0\left( \left( 1-\lambda \right) ^{2},\mathcal {T}-t\right) }{\lambda }\\&\quad -(1+\lambda )\zeta _1\left( \left( 1-\lambda \right) ^{2},\mathcal {T}-t-1\right) \bigg )\bigg ]. \end{aligned}$$

Lastly,

$$\begin{aligned}&\sum _{l=0}^{\mathcal {T}-t-1}(\mathcal {T}-t-l-1) \eta ^{\mathcal {T},(3)}_{t+l} (1-\lambda )^{\mathcal {T}-t-l-2}\\&\quad = \frac{\Delta \Sigma _{3,3}}{1-\lambda } \bigg ( \Sigma _{1,1} \rho _{3,1} \zeta _2\left( 1-\lambda ,\mathcal {T}-t\right) \\&\qquad+ \frac{\Sigma _{2,2} \rho _{2,1}}{\lambda } \left[ \zeta _1\left( 1-\lambda ,\mathcal {T}-t\right) \right. \left. - \zeta _1\left( (1-\lambda )^2,\mathcal {T}-t\right) \right] \\&\quad \quad + \Sigma _{3,3} \rho _{3,1} \left[ \frac{\zeta _1\left( 1-\lambda ,\mathcal {T}-t\right) }{\lambda } - \frac{1}{\lambda }\zeta _1\left( (1-\lambda )^2,\mathcal {T}-t\right) \right. \\&\qquad - (1-\lambda )^{-1}\zeta _2\left( (1-\lambda )^2,\mathcal {T}-t\right) \bigg] \bigg ). \end{aligned}$$

Proof

See “Appendix”. $\square$

2.3 Physical measure dynamics in the DTAFNS model

To determine option risk premia and expected excess returns, dynamics of interest rates under the physical measure $\mathbb {P}$ must be specified. The $\mathbb {P}$-dynamics considered in Eghbalzadeh et al. (2022) are used here since they are shown in that paper to exhibit natural compatibility with the $\mathbb {Q}$-dynamics model; the form of the pricing kernel allowing to pass from such $\mathbb {P}$-measure to the $\mathbb {Q}$-measure outlined in Sect. 2.1 is provided in that paper.

Under the risk-neutral measure $\mathbb {P}$, factors are assumed to have the following auto-regressive dynamics:

$$\begin{aligned} \left( {\begin{array}{c} X^{(1)}_{t+1}-X^{(1)}_t \\ X^{(2)}_{t+1}-X^{(2)}_t \\ X^{(3)}_{t+1}-X^{(3)}_t \\ \end{array} } \right)&= \underbrace{\left[ \begin{array}{ccc} \kappa ^\mathbb {P}_{1,1} &{} 0 &{} 0 \\ 0 &{} \kappa ^\mathbb {P}_{2,2} &{} -\lambda \\ 0 &{} 0 &{} \kappa ^\mathbb {P}_{3,3} \end{array} \right] }_{ \kappa ^\mathbb {P}} \underbrace{\left[ \begin{array}{c} \theta ^{\mathbb {P}}_1-X^{(1)}_t \\ \theta ^{\mathbb {P}}_2-X^{(2)}_t \\ \theta ^{\mathbb {P}}_3-X^{(3)}_t \end{array} \right] }_{ \theta ^{\mathbb {P}}-X_t } + \underbrace{\left( {\begin{array}{ccc} \Sigma _{1,1} &{} 0 &{} 0 \\ 0 &{} \Sigma _{2,2} &{} 0 \\ 0&{} 0 &{} \Sigma _{3,3} \\ \end{array} } \right) }_{\Sigma } \left( {\begin{array}{c} Z^{\mathbb {P}}_{t+1,1} \\ Z^{\mathbb {P}}_{t+1,2} \\ Z^{\mathbb {P}}_{t+1,3} \\ \end{array} } \right) , \end{aligned}$$

(2.14)

where $\kappa ^\mathbb {P}_{1,1} \in [0,1)$, $\kappa ^\mathbb {P}_{i,i} \in (0,1)$, $i=2,3$, and $\{(Z^{\mathbb {P}}_{t,1},Z^{\mathbb {P}}_{t,2},Z^{\mathbb {P}}_{t,3})\}^T_{t=1}$ is again a 3-dimensional Gaussian standard white noise with $\text {Corr}[Z^{\mathbb {P}}_{t_1,i},Z^{\mathbb {P}}_{t_2,j}] = \mathbbm {1}_{ \{t_1=t_2\} } \rho _{i,j}$. The $\mathbb {P}$-dynamics are slightly more flexible than the $\mathbb {Q}$-dynamics since the components in the diagonal of $\kappa ^\mathbb {P}$ are not required to be either 0 or $\lambda$. A possibility would be to impose $\kappa ^\mathbb {P}_{1,1}=0$ to replicate the non-stationary dynamics of factor $X^{(1)}$ under $\mathbb {Q}$. Nevertheless Eghbalzadeh et al. (2022) argue that not imposing such restriction provides a better fit for their dataset.

Under the above $\mathbb {P}$-dynamics, Eghbalzadeh et al. (2022) provide the relationship between the long-term mean parameters $\theta ^\mathbb {P}$ and $\theta ^\mathbb {Q}$ for both measures:

$$\begin{aligned} \theta _2^{\mathbb {Q}} = \lambda ^{-1}(\kappa _{2,2}^{\mathbb {P}} \theta _2^{\mathbb {P}} + \kappa _{3,3}^{\mathbb {P}} \theta _3^{\mathbb {P}} - \lambda \theta _3^{\mathbb {P}}), \quad \theta _3^{\mathbb {Q}} = \lambda ^{-1} \kappa _{3,3}^{\mathbb {P}} \theta _3^{\mathbb {P}} \end{aligned}$$

and

$$\begin{aligned} \theta _3^{\mathbb {P}}=\lambda \theta _3^{\mathbb {Q}}/\kappa ^{\mathbb {P}}_{3,3}, \quad \theta _2^{\mathbb {P}}=\frac{\lambda }{\kappa ^{\mathbb {P}}_{2,2}} \left[ \theta _2^{\mathbb {Q}} -\frac{\theta _3^{\mathbb {Q}}}{\kappa ^{\mathbb {P}}_{3,3}}(\kappa ^{\mathbb {P}}_{3,3}-\lambda ) \right] , \end{aligned}$$

whereas the $\kappa ^{\mathbb {P}}_{i,i}$, $i=1,2,3$ are allowed to vary freely.

Remark 2.2

A straightforward possible extension of the model could consist in still using (2.14) for $\mathbb {P}$-dynamics, but instead considering a physical volatility matrix $\Sigma ^{\mathbb {P}}$ whose components are different (most likely lower) than those of the risk-neutral volatility $\Sigma$. This could help producing negative excess returns for out-of-the-money options (either calls or put) on zero-coupon futures, which are observed empirically as documented in Bakshi et al. (2023).

The following proposition, whose proof is in “Appendix”, is analogous to Proposition 2.2 and provides transition distributions for factors X under the physical measure $\mathbb {P}$. Such proposition is used subsequently in the derivation of option expected excess returns.

Proposition 2.3

Assume $\kappa ^{\mathbb {P}}_{2,2} \ne \kappa ^{\mathbb {P}}_{3,3}$ and define $\omega =\frac{1-\kappa ^{\mathbb {P}}_{3,3}}{1-\kappa ^{\mathbb {P}}_{2,2}}$. Under measure $\mathbb {P}$, conditionally on $\mathcal {F}_t$ and for $t+n \le T$, factors $X_{t+n}$ follow the multivariate Gaussian distribution with mean vector $\mathcal {M}^\mathbb {P}_{t,n}=\left[ \mathcal {M}^{\mathbb {P},(i)}_{t,n}\right] ^3_{i=1}$ and covariance matrix $\mathcal {V}^\mathbb {P}_{n}=\left[ \mathcal {V}^{\mathbb {P},(i,j)}_{n}\right] ^3_{i,j=1}$ where

$$\begin{aligned} \mathcal {M}^{\mathbb {P},(i)}_{t,n}&= X^{(i)}_{t}(1-\kappa ^{\mathbb {P}}_{i,i})^{n}+ \theta ^\mathbb {P}_i\left( 1-(1-\kappa ^{\mathbb {P}}_{i,i})^n\right) \\&\quad+ \mathbbm {1}_{ \{i=2\} } \lambda (X^{(3)}_{t}-\theta ^{\mathbb {P}}_3)\frac{1-\omega ^n}{1-\omega } (1-\kappa ^{\mathbb {P}}_{2,2} )^{n-1} \end{aligned}$$

and

$$\begin{aligned} \mathcal {V}^{\mathbb {P},(1,1)}_{n}&= {\left\{ \begin{array}{ll} \Sigma _{1,1}^2 \left( 1+\zeta _0((1-\kappa ^{\mathbb {P}}_{1,1})^2,n)\right) \quad \text { if } \kappa ^{\mathbb {P}}_{1,1}\in (0,1),\\ n \Sigma _{1,1}^2 \quad \text { if } \kappa ^{\mathbb {P}}_{1,1}=0, \end{array}\right. }\\ \mathcal {V}^{\mathbb {P},(2,2)}_{n}&= \Sigma _{2,2}^2 \left( 1+\zeta _0((1-\kappa ^{\mathbb {P}}_{2,2})^2,n)\right) \\&\quad + \lambda ^2 \Sigma ^2_{3,3} \frac{(1-\kappa ^{\mathbb {P}}_{2,2} )^{2n-2}}{(1-\omega )^2} \left[ \zeta _0\left( (1-\kappa ^{\mathbb {P}}_{2,2})^{-2},n\right) -2\omega ^n \zeta _0\left( \omega (1-\kappa ^{\mathbb {P}}_{3,3})^{-2},n\right) \right. \\&\quad \left. + \omega ^{2n} \zeta _0\left( (1-\kappa ^{\mathbb {P}}_{3,3})^{-2},n\right) \right] \\&\quad +2 \frac{\rho _{2,3} \lambda \Sigma _{2,2} \Sigma _{3,3}}{1-\omega } (1-\kappa ^{\mathbb {P}}_{2,2} )^{(2n-1)} \left[ \zeta _0\left( \frac{\omega }{(1-\kappa ^{\mathbb {P}}_{2,2})(1-\kappa ^{\mathbb {P}}_{3,3})},n\right) \right. \\&\quad \left. -\omega ^n \zeta _0\left( \frac{1}{(1-\kappa ^{\mathbb {P}}_{2,2})(1-\kappa ^{\mathbb {P}}_{3,3})},n\right) \right] \\ \mathcal {V}^{\mathbb {P},(3,3)}_{n}&=\Sigma _{3,3}^2 \left( 1+\zeta _0((1-\kappa ^{\mathbb {P}}_{3,3})^2,n)\right) \\ \mathcal {V}^{\mathbb {P},(1,2)}_{n}&=\Sigma _{1,1}\Sigma _{2,2}\rho _{1,2} \left[ 1+\zeta _0((1-\kappa ^{\mathbb {P}}_{1,1})(1-\kappa ^{\mathbb {P}}_{2,2}),n) \right] \\&\quad +\lambda \Sigma _{1,1}\Sigma _{3,3}\rho _{1,3} \frac{(1-\kappa ^{\mathbb {P}}_{1,1})^n (1-\kappa ^{\mathbb {P}}_{2,2})^{n-1}}{1-\omega } \times \\&\quad \left( \zeta _0 \left( \frac{\omega }{(1-\kappa ^{\mathbb {P}}_{1,1})(1-\kappa ^{\mathbb {P}}_{3,3})},n\right) - \omega ^n \zeta _0 \left( \frac{1}{(1-\kappa ^{\mathbb {P}}_{1,1})(1-\kappa ^{\mathbb {P}}_{3,3})},n\right) \right) ,\\ \mathcal {V}^{\mathbb {P},(1,3)}_{n}&=\Sigma _{1,1}\Sigma _{3,3}\rho _{1,3} \left[ 1+\zeta _0( (1-\kappa ^{\mathbb {P}}_{1,1})(1-\kappa ^{\mathbb {P}}_{3,3}),n) \right] ,\\ \mathcal {V}^{\mathbb {P},(2,3)}_{n}&=\Sigma _{2,2}\Sigma _{3,3} \rho _{2,3} \left[ 1+\zeta _0((1-\kappa ^{\mathbb {P}}_{2,2})(1-\kappa ^{\mathbb {P}}_{3,3}),n)\right] \\&\quad +\lambda \Sigma _{3,3}^2 \frac{(1-\kappa ^{\mathbb {P}}_{2,2} )^{n-1} (1-\kappa ^{\mathbb {P}}_{3,3} )^{n}}{1-\omega } \left( \zeta _0\left( \frac{1}{(1-\kappa ^{\mathbb {P}}_{2,2} )(1-\kappa ^{\mathbb {P}}_{3,3} )},n\right) \right. \\&\quad \left. - \omega ^n \zeta _0\left( (1-\kappa ^{\mathbb {P}}_{3,3})^{-2} ,n\right) \right) . \end{aligned}$$

Remark 2.3

Closed-form formulas for conditional moments of $X_{t+n}$ given $X_t$ under $\mathbb {P}$ could also be derived in the case $\kappa ^{\mathbb {P}}_{2,2} = \kappa ^{\mathbb {P}}_{3,3}$. However, since such case is very unlikely to occur in practice, it is omitted.

3 European swaption pricing

This section describes two pricing procedures for European swaptions and outlines their respective advantages. The first relies on a risk-neutral simulation, whereas the second uses the forward measure to perform the simulation.

3.1 Risk-neutral pricing of European swaptions

Swaptions are classified into three types: European, Bermudan, and American, which differ in their possible exercise dates. Whereas American and Bermudan swaptions allow the exercise of the option on multiple dates, the European swaption has a single possible exercise date. We shall focus on European swaptions in this study. The European swaption considered, which is a payer swaption, is a financial option that gives the holder the right to enter, at time $T_\alpha$, into a swap with payment dates $T_{\alpha +1},\dots , T_\beta$ on which the holder pays the strike rate as the fixed rate, and receives the prevailing floating rate on each payment date.^{Footnote 1} Typically, the floating rate is tied to an interbank offered rate, such as LIBOR in the United Kingdom or the CDOR in Canada.

As shown in Brigo and Mercurio (2007), for $t<T_\alpha$, the time-t price of a European payer swaption with maturity $T_\alpha$, strike K, nominal value N and payment dates $\{T_i\}^{\beta }_{i=\alpha +1}$ is

$$\begin{aligned} PS\left[ t; \{T_i\}^{\beta }_{i=\alpha };K;N \right] = \mathbb {E^Q}\left[ D(t,T_\alpha )\left( N \left( S_{\alpha ,\beta }(T_\alpha )-K \right) ^+ \sum _{i=\alpha +1}^{\beta } \delta _i P(T_\alpha ,T_i)\right) \bigg | \mathcal {F}_t \right] \end{aligned}$$

(3.1)

where $\delta _i=T_i-T_{i-1}$. The time-t forward swap rate $S_{\alpha ,\beta }(t)$ is

$$\begin{aligned} S_{\alpha ,\beta }(t)&=\dfrac{P(t,T_\alpha )-P(t,T_\beta )}{\sum _{i=\alpha +1}^\beta \delta _i P(t,T_i)}. \end{aligned}$$

(3.2)

The swap rate $S_{\alpha ,\beta }(t)$ corresponds to a value of the fixed rate which would make the time-t value of the swap nil. The rationale underlying (3.1) is that a market participant could, while exercising the option, enter without fee into a receiver swap with the swap rate as the fixed rate. Combining both positions would lead to a net payment being the difference between the swap rate and the strike rate at each payment date.

A straightforward approach to obtain the swaption price via (3.1) is to conduct a Monte-Carlo simulation of the term structure factors under the risk-neutral measure and to average discounted cash flows, thereby approximating the expectation in (3.1). Algorithm 1 summarizes this process. The risk-neutral approach for swaption pricing has the advantage of requiring a single simulation to price multiple swaptions at once, which could be desirable in a calibration exercise. The drawback of using such an approach is that it requires simulating the entire path of the term structure factors, which might not be needed if a single swaption needs to be priced, as explained in the following section.

3.2 Pricing swaptions under the forward measure

Calculating European swaption prices using Algorithm 1 requires simulating the entire path of risk-free rate factors, which might be numerically cumbersome in some situations. By applying a change of numéraire, we can obtain a pricing approach which is more time-efficient. Detailing such an approach in the context of the DTAFNS model is the objective of this subsection.

Considering the zero-coupon bond maturing at $\mathcal {T}=T_\alpha$ as the new numéraire makes the computation of the swaption price much more convenient. In such case, the payer swaption price may therefore be rewritten based on (2.11), (3.1) and (3.2) as

$$\begin{aligned}&PS\left[ t; \{T_i\}^{\beta }_{i=\alpha };K;N \right] =P(t,T_\alpha ) \mathbb {E}^{T_\alpha }\left[ \left( N \left( S_{\alpha ,\beta }(T_\alpha )-K \right) ^+ \sum _{i=\alpha +1}^{\beta } \delta _i P(T_\alpha ,T_i)\right) \bigg | \mathcal {F}_t \right] \\&\quad =P(t,T_\alpha ) \mathbb {E}^{T_\alpha }\left[ \left( N \left( 1 - P(T_\alpha ,T_\beta )-K \sum _{i=\alpha +1}^{\beta } \delta _i P(T_\alpha ,T_i) \right) ^+\right) \bigg | \mathcal {F}_t \right] . \end{aligned}$$

(3.3)

Equation (3.3) involves the t-conditional expectation of a function of time-$T_\alpha$ zero-coupon bond prices, which are fully characterized by term structure factors $X_{T_\alpha } =\left[ X^{(1)}_{T_\alpha }, X^{(2)}_{T_\alpha }, X^{(3)}_{T_\alpha }\right] ^\top$. As a result, Proposition 2.2 can be used to calculate (3.3).

Algorithm 2 highlights the procedure to price swaptions using such an approach. When pricing a single swaption, such $\mathcal {T}$-forward measure simulation is much quicker than the Algorithm 1 based on the risk-neutral measure, which requires computing expectations over entire paths of the term structure factors.

4 Zero-coupon futures and options on futures

This section discusses calculation steps for prices and expected excess returns associated with European options on risk-free zero-coupon futures.

4.1 Futures price

Consider a futures contract with maturity $\mathcal {T}_2$ on a zero-coupon bond maturing on $\mathcal {T}_3$. Its time-$\mathcal {T}_1$ price $F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}$ is given by

$$\begin{aligned} F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} = \mathbb {E^Q}\left[ P(\mathcal {T}_2,\mathcal {T}_3)\bigg | \mathcal {F}_{\mathcal {T}_1}\right] , \end{aligned}$$

see for instance Björk (2009). Such expression can be calculated in closed-form, as indicated by the following theorem whose proof is found in “Appendix”.

Theorem 4.1

The time-$\mathcal {T}_1$ price of a zero-coupon bond futures whose maturity is $\mathcal {T}_2$ and whose underlying risk-free zero-coupon bond matures at $\mathcal {T}_3$ is

$$\begin{aligned} F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} = \tilde{A}_{\tau _2,\tau _3} \exp \left[ -\Delta \sum ^3_{i=1} \tilde{\mathcal {B}}^{(i)}_{\tau _3} X^{(i)}_{\mathcal {T}_1} \right] \end{aligned}$$

(4.1)

with $\tau _2=\mathcal {T}_2-\mathcal {T}_1$, $\tau _3=\mathcal {T}_3-\mathcal {T}_2$, $\mathcal {V}_{\tau _2}$ being defined in Proposition 2.2 and

$$\begin{aligned} \tilde{A}_{\tau _2,\tau _3}&= A_{\tau _3} \exp \bigg [ \frac{\Delta ^2}{2} \mathcal {B}^\top _{\tau _3} \mathcal {V}_{\tau _2} \mathcal {B}_{\tau _3} -\Delta \mathcal {B}^{(2)}_{\tau _3} (\theta ^\mathbb {Q}_2-\theta ^\mathbb {Q}_3)\left( 1-(1-\lambda )^{\tau _2}\right) \\&\quad -\Delta \mathcal {B}^{(2)}_{\tau _3} \lambda \theta ^{\mathbb {Q}}_3\left( \dfrac{\zeta _0(1-\lambda ,\tau _2+1)}{1-\lambda }-\tau _2(1-\lambda )^{\tau _2-1} \right) \\&\quad -\Delta \mathcal {B}^{(3)}_{\tau _3} \theta ^\mathbb {Q}_3\left( 1-(1-\lambda )^{\tau _2}\right) \bigg ],\\ \tilde{\mathcal {B}}^{(1)}_{n}&= \mathcal {B}^{(1)}_{n}, \quad \tilde{\mathcal {B}}^{(2)}_{n} = \mathcal {B}^{(2)}_{n} (1-\lambda )^{n}, \quad \tilde{\mathcal {B}}^{(3)}_{n} = \mathcal {B}^{(3)}_{n} (1-\lambda )^{n} + \mathcal {B}^{(2)}_{n} \lambda n (1-\lambda )^{n-1}. \end{aligned}$$

4.2 Price for options on futures

The closed-form solution for futures prices leads to a Black-Scholes-type formula for the price of options on the zero-coupon futures. Indeed, denote by $\text {Call}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}(K)$ the time-t price of a European option with strike K maturing at $\mathcal {T}_1$ on a futures maturing at $\mathcal {T}_2$ whose underlying asset is a zero-coupon maturing at $\mathcal {T}_3$.

Recall the following result, see for instance Lemma A.1 from Godin (2019) for a proof. Suppose Y is a Gaussian random variable with mean $\mu$ and standard deviation $\sigma$. Then $\mathbb {E} \left[ e^Y \mathbbm {1}_{\left\{ Y> y\right\} }\right] = e^{\mu +\sigma ^2/2} \bar{\Phi }\left( \frac{y-\mu -\sigma ^2}{\sigma } \right)$ where $\bar{\Phi }$ is the survival function (i.e. one minus the CDF) of the Gaussian distribution.

Lemma 4.1

Suppose Y is a Gaussian random variable with mean $\mu$ and standard deviation $\sigma$. Then,

$$\begin{aligned} \mathbb {E}[ \max (0,e^Y -K) ]= & {} \mathbb {E}[ e^Y \mathbbm {1}_{\left\{ Y> \log (K)\right\} } ] -K \mathbb {E}[ \mathbbm {1}_{\left\{ Y> \log (K)\right\} } ]\\= & {} e^{\mu +\sigma ^2/2} \bar{\Phi }\left( \frac{\log (K)-\mu -\sigma ^2}{\sigma } \right) -K \bar{\Phi }\left( \frac{\log (K)-\mu }{\sigma } \right) . \end{aligned}$$

The following result is obtained by combining Proposition 2.2 with (4.1).

Lemma 4.2

The forward measure $\mathbb {Q}^{\mathcal {T}_1}$ distribution of time-$\mathcal {T}_1$ the log-futures price $\log F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}$ conditional on time-t information is Gaussian with mean $\nu _{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}$ and variance $\varsigma ^2_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}$, where

$$\begin{aligned} \nu _{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}= & {} \log \tilde{A}_{\tau _2,\tau _3} -\Delta \sum ^3_{i=1} \tilde{\mathcal {B}}^{(i)}_{\tau _3}\mathcal {M}^{(i)}_{t,\tau _1},\\ \varsigma ^2_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}= & {} \Delta \tilde{\mathcal {B}}^\top _{\tau _3} \mathcal {V}_{\tau _1} \tilde{\mathcal {B}}_{\tau _3}, \end{aligned}$$

with $\tau _1 = \mathcal {T}_1-t$ and $\tilde{\mathcal {B}}_\tau =\left[ \tilde{\mathcal {B}}^{(1)}_\tau , \,\, \tilde{\mathcal {B}}^{(2)}_\tau , \, \, \tilde{\mathcal {B}}^{(3)}_\tau \right] ^\top$.

Using Lemma 4.1, the time-t call option price is

$$\begin{aligned} \text {Call}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}(K)= & {} P(t,\mathcal {T}_1) \mathbb {E}^{\mathcal {T}_1}[ \max (0,F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} -K) \vert \mathcal {F}_t]\\= & {} P(t,\mathcal {T}_1) \bigg [e^{\nu _{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}+\varsigma ^2_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}/2} \bar{\Phi }\left( \frac{\log (K)-\nu _{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}-\varsigma ^2_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}}{\varsigma _{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}} \right) \\{} & {} \quad \quad \quad \quad -K \bar{\Phi }\left( \frac{\log (K)-\nu _{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}}{\varsigma _{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}} \right) \bigg ]. \end{aligned}$$

Furthermore, denoting by $\text {Put}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}(K)$ the corresponding European put option, the put-call parity leads to

$$\begin{aligned} \text {Put}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}(K) = \text {Call}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}(K) - ( \mathbb {E}^{\mathcal {T}_1}[ F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} | \mathcal {F}_t]-K) P(t, \mathcal {T}_1) \end{aligned}$$

where Lemma 4.2 leads to

$$\begin{aligned} \mathbb {E}^{\mathcal {T}_1}[ F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} | \mathcal {F}_t] = \exp \left( \nu _{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} + \varsigma ^2_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}/2\right) . \end{aligned}$$

4.3 Price for quadratic options on futures

A quadratic option on futures with time-$\mathcal {T}_1$ payoff $\left( \frac{F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}}{F_{t,\mathcal {T}_2,\mathcal {T}_3}}-1 \right) ^2$ can also be considered. Such option bears resemblance to a straddle option since it is more likely to produce higher payoffs in higher volatility environments for interest rates. Using Lemma 4.2, its time-t price is given by

$$\begin{aligned} \text {Quad}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}= & {} P(t,\mathcal {T}_1) \mathbb {E}^{\mathcal {T}_1} \left[ \left( \frac{F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}}{F_{t,\mathcal {T}_2,\mathcal {T}_3}}-1 \right) ^2 \bigg |\mathcal {F}_t\right] \\= & {} P(t,\mathcal {T}_1) \mathbb {E}^{\mathcal {T}_1} \left[ \frac{ \exp ( 2\log F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3})}{F^2_{t,\mathcal {T}_2,\mathcal {T}_3}} -2 \frac{ \exp ( \log F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} )}{F_{t,\mathcal {T}_2,\mathcal {T}_3}} + 1 \bigg |\mathcal {F}_t\right] \\= & {} P(t,\mathcal {T}_1) \left[ \frac{ \exp \left( 2\nu _{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} + 2\varsigma ^2_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}\right) }{F^2_{t,\mathcal {T}_2,\mathcal {T}_3}}\right. \\{} & {} \quad \left. -2 \frac{ \exp \left( \nu _{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} + \varsigma ^2_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}/2\right) }{F_{t,\mathcal {T}_2,\mathcal {T}_3}} + 1 \bigg |\mathcal {F}_t\right] . \end{aligned}$$

4.4 Option expected excess returns

Consider a European-type derivative whose time-t price is $\text {Price}_t$ and whose time $\mathcal {T}_1$ payoff is $\text {Payoff}_{\mathcal {T}_1}$. Its (periodic) expected excess return (EER) could be calculated in two ways:

$$\begin{aligned} \text {EER}^{\text {Approach 1}}_{t,\mathcal {T}_1}= & {} \frac{1}{\mathcal {T}_1-t} \log \frac{ \mathbb {E}^\mathbb {P} \left[ \text {Payoff}_{\mathcal {T}_1} |\mathcal {F}_t\right] }{\text {Price}_{t,\mathcal {T}_1}} - s(t,\mathcal {T}_1), \\ \text {EER}^{\text {Approach 2}}_{t,\mathcal {T}_1}= & {} \frac{1}{\mathcal {T}_1-t} \log \frac{ \mathbb {E}^\mathbb {P} \left[ D(t,\mathcal {T}_1) \text {Payoff}_{\mathcal {T}_1} |\mathcal {F}_t\right] }{\text {Price}_{t,\mathcal {T}_1}}, \end{aligned}$$

(4.2)

where the risk-free spot rate is obtained through $s(t,\mathcal {T}_1) = -\frac{1}{\mathcal {T}_1-t} \log P(t,\mathcal {T}_1)$. The first formulation relies on a future value perspective, whereas the second sees the expected excess return through the lens of a present value. The second approach has the conceptual advantage of producing an exactly null premium if $\mathbb {P}=\mathbb {Q}$. Nevertheless, it is more cumbersome to compute and as such we consider (4.2) in this work. Bakshi et al. (2023) also use a formulation similar to (4.2) in their work.

The expected excess return for the European call, the European put and the quadratic options presented above are therefore respectively

$$\begin{aligned} \text {EER}^\text {Call}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}(K)= & {} \frac{1}{\mathcal {T}_1-t} \log \frac{ \mathbb {E}^\mathbb {P} \left[ \max (0;F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}-K)|\mathcal {F}_t\right] }{\text {Call}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}(K)} - s(t,\mathcal {T}_1), \end{aligned}$$

(4.3)

$$\begin{aligned} \text {EER}^\text {Put}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}(K)= & {} \frac{1}{\mathcal {T}_1-t} \log \frac{ \mathbb {E}^\mathbb {P} \left[ \max (0;K-F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3})|\mathcal {F}_t\right] }{\text {Put}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}(K)} - s(t,\mathcal {T}_1), \end{aligned}$$

(4.4)

$$\begin{aligned} \text {EER}^\text {Quad}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}(K)= & {} \frac{1}{\mathcal {T}_1-t} \log \frac{ \mathbb {E}^\mathbb {P} \left[ \left( \frac{F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}}{F_{t,\mathcal {T}_2,\mathcal {T}_3}}-1 \right) ^2 \bigg |\mathcal {F}_t\right] }{\text {Quad}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}(K)} - s(t,\mathcal {T}_1). \end{aligned}$$

(4.5)

The following result is obtained by combining Proposition 2.3 with (4.1).

Lemma 4.3

Assuming $\kappa ^{\mathbb {P}}_{2,2} \ne \kappa ^{\mathbb {P}}_{3,3}$, the $\mathbb {P}$-distribution of time-$\mathcal {T}_1$ the log-futures price $\log F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}$ conditional on time-t information is Gaussian with mean $\nu ^\mathbb {P}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}$ and variance $(\varsigma ^\mathbb {P}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3})^2$, where

$$\begin{aligned} \nu ^\mathbb {P}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}= & {} \log \tilde{A}_{\tau _2,\tau _3} -\Delta \sum ^3_{i=1} \tilde{\mathcal {B}}^{(i)}_{\tau _3}\mathcal {M}^{\mathbb {P},(i)}_{t,\tau _1},\\ (\varsigma ^\mathbb {P}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3})^2= & {} \Delta \tilde{\mathcal {B}}^\top _{\tau _3} \mathcal {V}^\mathbb {P}_{\tau _1} \tilde{\mathcal {B}}_{\tau _3}, \end{aligned}$$

with $\tau _1 = \mathcal {T}_1-t$ and $\tilde{\mathcal {B}}_\tau =\left[ \tilde{\mathcal {B}}^{(1)}_\tau , \,\, \tilde{\mathcal {B}}^{(2)}_\tau , \, \, \tilde{\mathcal {B}}^{(3)}_\tau \right] ^\top$.

Again, using Lemma 4.1, the time-t European call and put option expected payoffs are respectively

$$\begin{aligned} \mathbb {E}^{\mathbb {P}}[ \max (0,F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} -K) \vert \mathcal {F}_t]= & {} \bigg [e^{\nu ^\mathbb {P}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}+(\varsigma ^\mathbb {P}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3})^2/2} \times \\{} & {} \quad \bar{\Phi }\left( \frac{\log (K)-\nu ^\mathbb {P}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}-(\varsigma ^\mathbb {P}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3})^2}{\varsigma ^\mathbb {P}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}} \right) \\{} & {} \quad -K \bar{\Phi }\left( \frac{\log (K)-\nu ^\mathbb {P}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}}{\varsigma ^\mathbb {P}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}} \right) \bigg ],\\ \mathbb {E}^{\mathbb {P}}[ \max (0,K-F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}) \vert \mathcal {F}_t]= & {} \mathbb {E}^{\mathbb {P}}[ \max (0,F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} -K) \vert \mathcal {F}_t] - ( \mathbb {E}^{\mathbb {P}}[ F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} | \mathcal {F}_t]-K) \end{aligned}$$

where, from (4.2),

$$\begin{aligned} \mathbb {E}^{\mathbb {P}}[ F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} | \mathcal {F}_t] = \exp \left( \nu ^\mathbb {P}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} + (\varsigma ^\mathbb {P}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3})^2/2\right) . \end{aligned}$$

Moreover, the quadratic option’s expected payoff under $\mathbb {P}$ is

$$\begin{aligned} \mathbb {E}^{\mathbb {P}} \left[ \left( \frac{F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}}{F_{t,\mathcal {T}_2,\mathcal {T}_3}}-1 \right) ^2 \bigg |\mathcal {F}_t\right]= & {} \mathbb {E}^{\mathbb {P}} \left[ \frac{ \exp ( 2\log F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3})}{F^2_{t,\mathcal {T}_2,\mathcal {T}_3}} -2 \frac{ \exp ( \log F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} )}{F_{t,\mathcal {T}_2,\mathcal {T}_3}} + 1 \bigg |\mathcal {F}_t\right] \\= & {} \left[ \frac{ \exp \left( 2\nu ^\mathbb {P}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} + 2(\varsigma ^\mathbb {P}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3})^2\right) }{F^2_{t,\mathcal {T}_2,\mathcal {T}_3}}\right. \\{} & {} \quad \left. -2 \frac{ \exp \left( \nu ^\mathbb {P}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} + (\varsigma ^\mathbb {P}_{t,\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3})^2/2\right) }{F_{t,\mathcal {T}_2,\mathcal {T}_3}} + 1 \bigg |\mathcal {F}_t\right] \!. \end{aligned}$$

Substituting the above formulas in (4.3)–(4.5) provides values for the option’s expected excess return.

5 Methods for the calibration of the DTANFS model to option prices

While a full-blown calibration of the model to interest rate derivatives prices is left out-of-scope, we briefly highlight potential approaches for such a purpose. Assume a set of d derivatives prices $Y_t = [Y^{(1)}_{t},\ldots ,Y^{(d)}_{t}]$ is available on any period t, each of which are associated with a set of deterministically chosen strike prices $K_t = [K^{(1)}_{t},\ldots ,K^{(d)}_{t}]$.^{Footnote 2} Observed derivatives prices are assumed to be a noisy version of their true prices, and thus we can consider the following system of equations to depict the dynamics of derivatives prices:

$$\begin{aligned} Y_t = G(X_t, K_t) + N_t, \quad X_{t+1} = \mathbf{a} + b X_t + \Sigma Z_{t+1}, \quad t=0,\ldots ,T, \end{aligned}$$

(5.1)

where G is the non-linear function mapping risk factors and strikes into option prices, and the process $N=\{ N_t\}^T_{t=0}$ is assumed to be a Gaussian d-dimensional white noise. Furthermore, $\mathbf{a}=\kappa ^{\mathbb {P}} \theta ^{\mathbb {P}}$ and $b= I-\kappa ^{\mathbb {P}}$ with I being the $3\times 3$ identity matrix.

Since (5.1) involves a non-linear transformation G of the Gaussian latent factors, the conventional Kalman filter cannot be applied. Nevertheless, the unscented Kalman filter (UKF) developed in Julier and Uhlmann (1997) can be used, as the non-linear system (5.1) is a special case of their equations (1)-(2). The UKF is a generalization of the Kalman filter allowing to tackle non-linearities through a deterministic sampling method leading to a better approximation of filtered moments of the observable quantities. An alternative approach consists in using particle filters, which instead apply stochastic sampling of latent quantities. See for instance Del Moral (1997), Creal (2012) or Remillard (2013) for more information about particle filters. Both the UKF and particle filters have been applied to term structure models estimation in the literature, for instance by Christoffersen et al. (2014).

6 Conclusion

This paper describes how to calculate prices of swaptions and European options (either conventional or quadratic) on zero-coupon futures under the DTAFNS model. Expressions for the option expected excess return associated with the European options are also provided. Whereas Monte-Carlo simulation is used for swaptions, closed-form solutions are provided for the zero-coupon futures options. All pricing expressions are obtained after deriving exact formulas for transition distributions of risk factors underlying the term structure dynamics under the following three respective measures: the physical measure, the risk-neutral measure and the forward measure. A potential future work could consist of studying option risk premia produced by the DTAFNS model and determining whether or not they are consistent with empirical stylized facts outlined for instance in Bakshi et al. (2023). Tackling American options could also be an interesting subsequent work.

Notes

For a payment date $T_{\xi +1}$, $\xi =\alpha ,\ldots ,\beta -1$, the floating rate is determined at the reset date $T_{\xi }$.
To produce more homogeneous errors $N_t$, it might be desirable to fix the moneyness of options rather than their strike price.

References

Bakshi, G., Crosby, J., Gao, X., & Hansen, J. W. (2023). Treasury option returns and models with unspanned risks. Journal of Financial Economics(forthcoming).
Bakshi, G., Crosby, J., & Gao, X. (2022). Dark matter in (volatility and) equity option risk premiums. Operations Research, 70(6), 3108–3124.
Article Google Scholar
Björk, T. (2009). Arbitrage theory in continuous time. Oxford University Press.
Google Scholar
Black, F., Derman, E., & Toy, W. (1990). A one-factor model of interest rates and its application to treasury bond options. Financial Analysts Journal, 46(1), 33–39.
Article Google Scholar
Black, F., & Karasinski, P. (1991). Bond and option pricing when short rates are lognormal. Financial Analysts Journal, 47(4), 52–59.
Article Google Scholar
Brigo, D., & Mercurio, F. (2007). Interest rate models-theory and practice: With smile, inflation and credit. Springer.
Google Scholar
Christensen, J. H., Diebold, F. X., & Rudebusch, G. D. (2011). The affine arbitrage-free class of Nelson–Siegel term structure models. Journal of Econometrics, 164(1), 4–20.
Article Google Scholar
Christoffersen, P., Dorion, C., Jacobs, K., & Karoui, L. (2014). Nonlinear Kalman filtering in affine term structure models. Management Science, 60(9), 2248–2268.
Article Google Scholar
Collin-Dufresne, P., & Goldstein, R. S. (2002). Pricing swaptions within an affine framework. The Journal of Derivatives, 10(1), 9–26.
Article Google Scholar
Coval, J. D., & Shumway, T. (2001). Expected option returns. The Journal of Finance, 56(3), 983–1009.
Article Google Scholar
Creal, D. (2012). A survey of sequential Monte Carlo methods for economics and finance. Econometric Reviews, 31(3), 245–296.
Article Google Scholar
Del Moral, P. (1997). Nonlinear filtering: Interacting particle resolution. Comptes Rendus de l’Académie des Sciences-Series I-Mathematics, 325(6), 653–658.
Article Google Scholar
Duffie, D., & Kan, R. (1996). A yield-factor model of interest rates. Mathematical Finance, 6(4), 379–406.
Article Google Scholar
Eghbalzadeh, R., Godin, F., & Gaillardetz, P. (2022). The discrete-time arbitrage-free Nelson–Siegel model: a closed-form solution and applications to mixed funds representation.
Geman, H. (1989). The importance of the forward neutral probability in a stochastic approach of interest rates. Technical report, Working paper, ESSEC.
Geman, H., El Karoui, N., & Rochet, J.-C. (1995). Changes of numeraire, changes of probability measure and option pricing. Journal of Applied Probability, 32(2), 443–458.
Article Google Scholar
Godin, F. (2019). A closed-form solution for the global quadratic hedging of options under geometric Gaussian random walks. The Journal of Derivatives, 26(3), 97–107.
Article Google Scholar
Jamshidian, F. (1996). Bond, futures and option evaluation in the quadratic interest rate model. Applied Mathematical Finance, 3(2), 93–115.
Article Google Scholar
Julier, S. J. & Uhlmann, J. K. (1997). New extension of the Kalman filter to nonlinear systems. In Signal processing, sensor fusion, and target recognition VI (Vol. 3068, pp. 182–193). SPIE.
Munk, C. (1999). Stochastic duration and fast coupon bond option pricing in multi-factor models. Review of Derivatives Research, 3(2), 157–181.
Article Google Scholar
Remillard, B. (2013). Statistical methods for financial engineering. CRC Press.
Google Scholar
Schrager, D. F., & Pelsser, A. A. (2006). Pricing swaptions and coupon bond options in affine term structure models. Mathematical Finance, 16(4), 673–694.
Article Google Scholar
Singleton, K. J., & Umantsev, L. (2002). Pricing coupon-bond options and swaptions in affine term structure models. Mathematical Finance, 12(4), 427–446.
Article Google Scholar

Download references

Acknowledgements

We thank the Natural Sciences and Engineering Research Council of Canada (Godin: RGPIN-2017-06837, Gaillardetz: RGPIN-2020-06821) for their financial support.

Author information

Authors and Affiliations

Department of Mathematics and Statistics, Concordia University, Montreal, Canada
Frédéric Godin, Ramin Eghbalzadeh & Patrice Gaillardetz
Quantact Laboratory, Centre de Recherches Mathématiques, Montreal, Canada
Frédéric Godin & Patrice Gaillardetz

Authors

Frédéric Godin
View author publications
You can also search for this author in PubMed Google Scholar
Ramin Eghbalzadeh
View author publications
You can also search for this author in PubMed Google Scholar
Patrice Gaillardetz
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Frédéric Godin.

Ethics declarations

Conflict of interest

Authors have no conflict of interests to report in relation to this work.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: Proofs

Before proving Proposition 2.1, several lemmas are presented.

Lemma 6.1

For $i=1,2,3$ and any integer $\tau >1$,

$$\begin{aligned} \mathcal {B}_{\tau }^{(i)}-1 &=\mathcal {B}_{\tau -1}^{(i)}\left( 1-\kappa ^{\mathbb {Q}}_{i,i}\right) -\mathbbm {1}_{ \{i=3\} } (1-\lambda )^{\tau -1} \\ &= \mathcal {B}_{\tau -1}^{(i)}(1-\lambda \mathbbm {1}_{ \{i>1\} } )-\mathbbm {1}_{ \{i=3\} } (1-\lambda )^{\tau -1}. \end{aligned}$$

Proof of Lemma 6.1

See Lemma A.2 of Eghbalzadeh et al. (2022).

Lemma 6.2

The following recursive relationship between the time-t and time-$t+1$ zero-coupon bond prices presented in (2.3) holds for any integer $\tau >0$:

$$\begin{aligned} P(t+1,t+\tau )&= P(t,t+\tau )e^{\Delta r_t} \exp \left[ \log \left( \dfrac{A_{\tau -1}}{A_\tau }\right) -\Delta \mathcal {B}_{\tau -1}^{\top }\left( \kappa ^\mathbb {Q}\theta ^\mathbb {Q}+ \Sigma Z_{t+1}^\mathbb {Q}\right) \right] . \end{aligned}$$

Proof of Lemma 6.2

The case $\tau =1$ is trivial. For $\tau >1$, using (2.1) and (2.3) for the first equality, and then Lemma 6.1 for the third one,

$$\begin{aligned}&\log \left( \dfrac{P(t+1,t+\tau )}{P(t,t+\tau )} \right) - \Delta r_t = \log \left( \dfrac{A_{\tau -1}}{A_\tau }\right) - \Delta \sum _{i=1}^3 \left( \mathcal {B}_{\tau -1}^{(i)} X_{t+1}^{(i)}\right. \\&\qquad \left. -\left( \mathcal {B}_{\tau }^{(i)}-1\right) X_t^{(i)}\right) +\Delta \sum _{i=1}^3 X_t^{(i)} - \Delta (X_t^{(1)}+X_t^{(2)}) \\&\quad = \log \left( \dfrac{A_{\tau -1}}{A_\tau }\right) - \Delta \sum _{i=1}^2 \mathcal {B}_{\tau -1}^{(i)}\left( X_{t+1}^{(i)}-\left( \dfrac{\mathcal {B}_{\tau }^{(i)}-1}{\mathcal {B}_{\tau -1}^{(i)}}\right) X_t^{(i)}\right) + \Delta X_t^{(3)} \\&\quad \quad - \Delta \left( \mathcal {B}_{\tau -1}^{(3)} X_{t+1}^{(3)}-\left( \mathcal {B}_{\tau }^{(3)}-1\right) X_t^{(3)}\right) \\&\quad = \log \left( \dfrac{A_{\tau -1}}{A_\tau }\right) - \Delta \sum _{i=1}^2 \mathcal {B}_{\tau -1}^{(i)}\left( X_{t+1}^{(i)}-\left( 1-\kappa ^{\mathbb {Q}}_{i,i}\right) X_t^{(i)}\right) + \Delta X_t^{(3)} \\&\quad \quad - \left( \Delta \mathcal {B}_{\tau -1}^{(3)} X_{t+1}^{(3)}-\left( \Delta \mathcal {B}_{\tau -1}^{(3)}(1-\lambda )-(1-\lambda )^{\tau -1}\right) X_t^{(3)}\right) \\&\quad =\log \left( \dfrac{A_{\tau -1}}{A_\tau }\right) - \Delta \sum _{i=1}^3 \mathcal {B}_{\tau -1}^{(i)}\left( X_{t+1}^{(i)}-\left( 1-\kappa ^{\mathbb {Q}}_{i,i}\right) X_t^{(i)}\right) \\&\quad \quad+ \Delta \left( 1-(1-\lambda )^{\tau - 1}\right) X_t^{(3)} \\&\quad = \log \left( \dfrac{A_{\tau -1}}{A_\tau }\right) - \Delta \mathcal {B}_{\tau -1}^{\top }\left( \kappa ^\mathbb {Q}\theta ^\mathbb {Q}+ \Sigma Z_{t+1}^\mathbb {Q}\right) + \Delta \left( 1-(1-\lambda )^{\tau - 1} - \mathcal {B}_{\tau -1}^{(2)}\lambda \right) X_t^{(3)}\\&\quad = \log \left( \dfrac{A_{\tau -1}}{A_\tau }\right) - \Delta \mathcal {B}_{\tau -1}^{\top }\left( \kappa ^\mathbb {Q}\theta ^\mathbb {Q}+ \Sigma Z_{t+1}^\mathbb {Q}\right) . \end{aligned}$$

Therefore,

$$\begin{aligned} P(t+1,t+\tau )&= P(t,t+\tau )e^{\Delta r_t} \exp \left[ \log \left( \dfrac{A_{\tau -1}}{A_\tau }\right) -\Delta \mathcal {B}_{\tau -1}^{\top }\left( \kappa ^\mathbb {Q}\theta ^\mathbb {Q}+ \Sigma Z_{t+1}^\mathbb {Q}\right) \right] . \end{aligned}$$

$\square$

Lemma 6.3

Consider any integer $\tau >0$ and any real number r. Using the convention $0^0=0$, for functions $\zeta _0(r,\tau ),\zeta _1(r,\tau )$ and $\zeta _2(r,\tau )$ defined in (2.7)–(2.9),

$$\begin{aligned} \zeta _0(r,\tau -1)-\zeta _0(r,\tau )&= -r^{\tau -1} , \end{aligned}$$

(6.1)

$$\begin{aligned} \zeta _1(r,\tau -1)-\zeta _1(r,\tau )&= -r^{\tau -1}(\tau -1) , \end{aligned}$$

(6.2)

$$\begin{aligned} \zeta _2(r,\tau -1)-\zeta _2(r,\tau )&= -r^{\tau -1}(\tau -1)^2 . \end{aligned}$$

(6.3)

Proof of Lemma 6.3

This result is a direct consequence of the sum representations of $\zeta _0(r,\tau ),\zeta _1(r,\tau )$ and $\zeta _2(r,\tau )$ provided in (2.7)–(2.9). $\square$

Lemma 6.4

The following recursive connection holds for the quantity $\upsilon _\tau$ defined in (2.6) for any integer $\tau >0$:

$$\begin{aligned} \upsilon _{\tau } = \mathcal {B}_{\tau -1}^{\top } \Sigma \rho ( \mathcal {B}_{\tau -1}^{\top } \Sigma )^\top + \upsilon _{\tau -1}. \end{aligned}$$

(6.4)

Proof of Lemma 6.4

The case $\tau =1$ is trivial as it leads to $0=0$. For $\tau >1$, first,

$$\begin{aligned} \mathcal {B}_{\tau -1}^{\top } \Sigma \rho ( \mathcal {B}_{\tau -1}^{\top } \Sigma )^\top&= \sum ^3_{i=1} \sum ^3_{j=1} \mathcal {B}_{\tau -1}^{(i)} \mathcal {B}_{\tau -1}^{(j)}\Sigma _{i,i}\Sigma _{j,j}\rho _{i,j}. \end{aligned}$$

Moreover, based on (2.6),

$$\begin{aligned} \upsilon _{\tau } - \upsilon _{\tau -1}&= \sum ^3_{i=1} \sum ^3_{j=1} a^{(i,j)}_\tau \Sigma _{i,i} \Sigma _{j,j} \rho _{i,j}, \end{aligned}$$

with $a^{(i,j)}_\tau = (\upsilon ^{(i,j)}_{\tau } - \upsilon ^{(i,j)}_{\tau -1})/(\Sigma _{i,i} \Sigma _{j,j} \rho _{i,j})$.

To complete the proof, we now show that $a^{(i,j)}_\tau =\mathcal {B}_{\tau -1}^{(i)} \mathcal {B}_{\tau -1}^{(j)}$ for any $i,j=1,2,3$.

First, for $i=j=1$,

$$\begin{aligned} a^{(1,1)}_\tau&= \dfrac{\tau (\tau -1)(2\tau -1)}{6} - \dfrac{(\tau -1)(\tau -2)(2\tau -3)}{6}. \\&=\dfrac{(\tau -1)(2\tau ^2-\tau -2\tau ^2+7\tau -6)}{6}\\&=(\tau -1)^2\\&=\mathcal {B}_{\tau -1}^{(1)} \mathcal {B}_{\tau -1}^{(1)}. \end{aligned}$$

Secondly, for $i=1$ and $j=2$, using (6.2),

$$\begin{aligned} a^{(1,2)}_\tau&= \frac{1}{\lambda }\left( \frac{\tau (\tau -1)}{2} - \zeta _1 (1-\lambda ,\tau ) - \frac{(\tau -1)(\tau -2)}{2} + \zeta _1 (1-\lambda ,\tau -1)\right) \\&= \dfrac{(\tau -1)-(1-\lambda )^{\tau -1}(\tau -1)}{\lambda }\\ &=\mathcal {B}_{\tau -1}^{(1)}\mathcal {B}_{\tau -1}^{(2)}. \end{aligned}$$

Thirdly, consider $i=1$ and $j=3$. For $\tau =2$, $a^{(1,3)}_2= \mathcal {B}_{1}^{(1)}\mathcal {B}_{1}^{(3)}=0$ since $\upsilon ^{(1,3)}_{2}=\upsilon ^{(1,3)}_{1}=0$. For $\tau >2$, using (6.1), (6.2), and (6.3),

$$\begin{aligned} a^{(1,3)}_\tau&= \frac{1}{\lambda } \bigg [\frac{\tau (\tau -1)}{2}-1 -\zeta _0\left( 1-\lambda ,\tau -1\right) -(1+\lambda )\zeta _1\left( 1-\lambda ,\tau -1\right) \\&\quad -\lambda \zeta _2\left( 1-\lambda ,\tau -1\right)- \frac{(\tau -1)(\tau -2)}{2}+1 +\zeta _0\left( 1-\lambda ,\tau -2\right) \\&\quad +(1+\lambda )\zeta _1\left( 1-\lambda ,\tau -2\right) +\lambda \zeta _2\left( 1-\lambda ,\tau -2\right) \bigg ]\\&=\dfrac{1}{\lambda }\bigg [\tau - 1 - (1-\lambda )^{\tau -2}-(1-\lambda )^{\tau - 2}(\tau -2)(1+\lambda )-\lambda (1-\lambda )^{\tau -2 }(\tau - 2)^2 \bigg ]\\&=\dfrac{1}{\lambda }\bigg [\tau - 1 - (1-\lambda )^{\tau -2}-(1-\lambda )^{\tau - 2}(\tau -1 - 1)(1+\lambda )\\&\quad -\lambda (1-\lambda )^{\tau -2 }(\tau - 2)(\tau - 1 -1) \bigg ]\\&=\dfrac{1}{\lambda }\bigg [\tau - 1 -(1-\lambda )^{\tau - 2}(\tau -1 )(1+\lambda )\\&\quad +\lambda (1-\lambda )^{\tau - 2}-\lambda (1-\lambda )^{\tau -2 }(\tau - 2)(\tau - 1 )+\lambda (1-\lambda )^{\tau -2 }(\tau - 1-1) \bigg ]\\&=\dfrac{\tau -1}{\lambda }\bigg [1 -(1-\lambda )^{\tau - 2}-\lambda (1-\lambda )^{\tau -2 }(\tau - 2) \bigg ]\\&=\mathcal {B}_{\tau -1}^{(1)}\mathcal {B}_{\tau -1}^{(3)}. \end{aligned}$$

Fourthly, for $i=j=2$,

$$\begin{aligned} a^{(2,2)}_\tau&=\frac{1}{\lambda ^2} \left( \tau - 2 \left[ \frac{1-(1-\lambda )^\tau }{\lambda }\right] + \frac{1-(1-\lambda )^{2 \tau }}{1-(1-\lambda )^2} - \tau + 1 + 2 \left[ \frac{1-(1-\lambda )^{\tau -1} }{\lambda }\right] \right. \\&\quad \left. -\frac{1-(1-\lambda )^{2 (\tau -1)}}{1-(1-\lambda )^2}\right) \\&=\frac{1}{\lambda ^2} \left( 1 -2(1-\lambda )^{\tau -1}+(1-\lambda )^{2(\tau -1)}\right) \\&=\mathcal {B}_{\tau -1}^{(2)} \mathcal {B}_{\tau -1}^{(2)}. \end{aligned}$$

Fifthly, consider $i=2$ and $j=3$. For $\tau =2$, $a^{(2,3)}_2= \mathcal {B}_{1}^{(2)}\mathcal {B}_{1}^{(3)}=0$ since $\upsilon ^{(2,3)}_{2}=\upsilon ^{(2,3)}_{1}=0$. For $\tau >2$, using (6.1) and (6.2),

$$\begin{aligned} a^{(2,3)}_\tau&= \frac{\tau -2- (2-\lambda )\zeta _0\left( 1-\lambda ,\tau -1\right) + (1-\lambda )\zeta _0\left( (1-\lambda )^2,\tau -1\right) }{\lambda ^2}\\&\quad + \frac{- \zeta _1\left( 1-\lambda ,\tau -1\right) + (1-\lambda )\zeta _1\left( (1-\lambda )^2,\tau -1\right) }{\lambda }\\&\quad - \frac{\tau -3- (2-\lambda )\zeta _0\left( 1-\lambda ,\tau -2\right) + (1-\lambda )\zeta _0\left( (1-\lambda )^2,\tau -2\right) }{\lambda ^2}\\&\quad - \frac{- \zeta _1\left( 1-\lambda ,\tau -2\right) + (1-\lambda )\zeta _1\left( (1-\lambda )^2,\tau -2\right) }{\lambda }\\&= \frac{1- (2-\lambda )(1-\lambda )^{\tau -2} + (1-\lambda )(1-\lambda )^{2(\tau -2)} }{\lambda ^2} \\&\quad +\frac{-(1-\lambda )^{\tau -2}(\tau -2)+(1-\lambda )(1-\lambda )^{2(\tau -2)}(\tau -2)}{\lambda }\\&= \frac{\left( 1-(1-\lambda )^{\tau -2}\right) ^2 +\lambda (1-\lambda )^{\tau -2}\left( 1- (1-\lambda )^{\tau -2}\right) }{\lambda ^2} \\&\quad +\frac{(1-\lambda )^{\tau -2}(\tau -2)\left( (1-\lambda )^{\tau -1}-1\right) }{\lambda }\\&= \frac{\left( 1-(1-\lambda )^{\tau -2}\right) \left( 1-(1-\lambda )^{\tau -2}(1-\lambda )\right) }{\lambda ^2} \\&\quad+ \frac{(1-\lambda )^{\tau -2}(\tau -2)\left( (1-\lambda )^{\tau -1}-1\right) }{\lambda }\\&= \frac{1-(1-\lambda )^{\tau -1}}{\lambda }\left( \frac{1-(1-\lambda )^{\tau -2}}{\lambda }-(1-\lambda )^{\tau -2}(\tau -2)\right) \\&=\mathcal {B}_{\tau -1}^{(2)}\mathcal {B}_{\tau -1}^{(3)}. \end{aligned}$$

Lastly, consider $i=j=3$. For $\tau =2$, $a^{(3,3)}_2= \mathcal {B}_{1}^{(3)}\mathcal {B}_{1}^{(3)}=0$ since $\upsilon ^{(3,3)}_{2}=\upsilon ^{(3,3)}_{1}=0$. For $\tau >2$, using (6.1), (6.2), and (6.3),

$$\begin{aligned} a^{(3,3)}_\tau&= \frac{1}{\lambda ^2} \bigg [ \tau -2 + \zeta _0\left( (1-\lambda )^2,\tau -1\right) +\lambda ^2 \zeta _2\left( (1-\lambda )^2,\tau -1\right) \\&\quad - 2\zeta _0\left( 1-\lambda ,\tau -1\right) - 2\lambda \zeta _1\left( 1-\lambda ,\tau -1\right) + 2\lambda \zeta _1\left( (1-\lambda )^2,\tau -1\right) \\&\quad - \tau +3 - \zeta _0\left( (1-\lambda )^2,\tau -2\right) -\lambda ^2 \zeta _2\left( (1-\lambda )^2,\tau -2\right) \\&\quad + 2\zeta _0\left( 1-\lambda ,\tau -2\right) + 2\lambda \zeta _1\left( 1-\lambda ,\tau -2\right) - 2\lambda \zeta _1\left( (1-\lambda )^2,\tau -2\right) \bigg ]\\&=\frac{1}{\lambda ^2} \bigg [1+(1-\lambda )^{2(\tau -2)}+\lambda ^2(1-\lambda )^{2(\tau -2)}(\tau -2)^2-2(1-\lambda )^{\tau -2}\\&\quad -2\lambda (1-\lambda )^{\tau -2}(\tau -2)+2\lambda (1-\lambda )^{2(\tau -2)}(\tau -2)\bigg ]\\&=\frac{1}{\lambda ^2} \bigg [\left( 1-(1-\lambda )^{\tau -2}\right) ^2+\lambda ^2(1-\lambda )^{2(\tau -2)}(\tau -2)^2\\&\quad -2\lambda (1-\lambda )^{\tau -2}(\tau -2)\left( 1-(1-\lambda )^{\tau -2}\right) \bigg ]\\&=\frac{1}{\lambda ^2} \bigg [\left( 1-(1-\lambda )^{\tau -2} -\lambda (1-\lambda )^{\tau -2}(\tau -2)\right) ^2\bigg ]\\&=\left( \dfrac{1-(1-\lambda )^{\tau -2}}{\lambda } -(1-\lambda )^{\tau -2}(\tau -2)\right) ^2\\&=\mathcal {B}_{\tau -1}^{(3)} \mathcal {B}_{\tau -1}^{(3)}. \end{aligned}$$

$\square$

Lemma 6.5

For $i=1,2,3$ and any integer $\tau >0$, the following recursive relationships between $\mathcal {B}^{(i)}_{\tau }$ and $\mathcal {B}^{(i)}_{\tau -1}$ hold:

$$\begin{aligned} \mathcal {B}^{(1)}_{\tau }= & {} \mathcal {B}^{(1)}_{\tau -1} + 1,\\ \mathcal {B}^{(2)}_{\tau }= & {} \mathcal {B}^{(2)}_{\tau -1}+(1-\lambda )^{\tau -1},\\ \mathcal {B}^{(3)}_{\tau }= & {} \mathcal {B}^{(3)}_{\tau -1}+(1-\lambda )^{\tau -2}\lambda (\tau -1). \end{aligned}$$

Proof of Lemma 6.5

The case $\tau =1$ is trivial. For $\tau >1$, based on (2.4), $\mathcal {B}^{(1)}_{\tau } =\tau -1+1=\mathcal {B}^{(1)}_{\tau -1}+1.$ Furthermore,

$$\begin{aligned} \mathcal {B}^{(2)}_{\tau }&= \dfrac{1-(1-\lambda )^{\tau }}{\lambda }-1+1 = (1-\lambda )\mathcal {B}^{(2)}_{\tau -1}+1\\&=\mathcal {B}^{(2)}_{\tau -1}-1+(1-\lambda )^{\tau -1}+1=\mathcal {B}^{(2)}_{\tau -1}+(1-\lambda )^{\tau -1}. \end{aligned}$$

Lastly,

$$\begin{aligned} \mathcal {B}^{(3)}_{\tau }&=\frac{1-(1-\lambda )^{\tau -1}}{\lambda } -1+1 - (\tau -2+1) (1-\lambda )^{\tau -1}\\&=(1-\lambda )\left( \frac{1-(1-\lambda )^{\tau -2}}{\lambda } +\dfrac{1}{1-\lambda } - (\tau -2) (1-\lambda )^{\tau -2}-(1-\lambda )^{\tau -2}\right) \\&=\mathcal {B}^{(3)}_{\tau -1} +\dfrac{1}{1-\lambda } -(1-\lambda )^{\tau -2} \\&\quad -1+(1-\lambda )^{\tau -2}-\dfrac{\lambda }{1-\lambda }+ \lambda (\tau -2) (1-\lambda )^{\tau -2}+\lambda (1-\lambda )^{\tau -2} \\&=\mathcal {B}^{(3)}_{\tau -1}+(1-\lambda )^{\tau -2}\lambda (\tau -1). \end{aligned}$$

$\square$

Lemma 6.6

The following recursive relationship holds for quantity $A_{\tau }$ defined in (2.3) and any integer $\tau >0$:

$$\begin{aligned} A_{\tau }= A_{\tau -1} \exp \left( \frac{1}{2} \Delta ^2\mathcal {B}_{\tau -1}^{\top } \Sigma \rho ( \mathcal {B}_{\tau -1}^{\top } \Sigma )^\top - \Delta \mathcal {B}_{\tau -1}^{\top } \kappa ^\mathbb {Q}\theta ^\mathbb {Q}\right) . \end{aligned}$$

Proof of Lemma 6.6

For $\tau =1$, the proof is trivial. For $\tau >1$, using Lemma 6.4, Lemma 6.5 and (2.4) to substitute into (2.5),

$$\begin{aligned}&\log \left( \dfrac{A_{\tau -1}}{A_{\tau }}\right) -\Delta \mathcal {B}_{\tau -1}^{\top } \kappa ^\mathbb {Q}\theta ^\mathbb {Q} = -\Delta \theta _2^\mathbb {Q}\left( \mathcal {B}_{\tau -1}^{(1)} - \mathcal {B}_{\tau }^{(1)} + \mathcal {B}_{\tau }^{(2)} - \mathcal {B}_{\tau -1}^{(2)} \right) \\&\qquad + \Delta \theta _3^\mathbb {Q}\left( \mathcal {B}_{\tau -1}^{(3)}-\mathcal {B}_{\tau }^{(3)}\right)-\Delta \mathcal {B}_{\tau -1}^{\top } \kappa ^\mathbb {Q}\theta ^\mathbb {Q}- \frac{1}{2} \Delta ^2\left( \upsilon _\tau -\upsilon _{\tau -1}\right) \\&\quad = -\Delta \theta _2^\mathbb {Q}\left( -1 + (1-\lambda )^{\tau -1} \right) - \Delta \theta _3^\mathbb {Q}(1-\lambda )^{\tau -2}\lambda (\tau -1) \\&\quad \quad -\Delta \mathcal {B}_{\tau -1}^{\top } \kappa ^\mathbb {Q}\theta ^\mathbb {Q}- \frac{1}{2} \Delta ^2\left( \upsilon _\tau -\upsilon _{\tau -1}\right) \\&\quad =\Delta \theta _2^\mathbb {Q}\lambda \mathcal {B}_{\tau -1}^{(2)} - \Delta \theta _3^\mathbb {Q}\lambda (1-\lambda )^{\tau -2}(\tau -1)-\Delta \mathcal {B}_{\tau -1}^{(2)} \lambda (\theta _2^\mathbb {Q}- \theta _3^\mathbb {Q}) -\Delta \mathcal {B}_{\tau -1}^{(3)} \lambda \theta _3^\mathbb {Q}\\&\quad \quad - \frac{1}{2} \Delta ^2\left( \upsilon _\tau -\upsilon _{\tau -1}\right) \\&\quad = - \Delta \theta _3^\mathbb {Q}\lambda (1-\lambda )^{\tau -2}(\tau -1) + \Delta \lambda \theta _3^\mathbb {Q}( \mathcal {B}_{\tau -1}^{(2)} - \mathcal {B}_{\tau -1}^{(3)}) - \frac{1}{2} \Delta ^2\mathcal {B}_{\tau -1}^{\top } \Sigma \rho ( \mathcal {B}_{\tau -1}^{\top } \Sigma )^\top \\&\quad =- \frac{1}{2} \Delta ^2\mathcal {B}_{\tau -1}^{\top } \Sigma \rho ( \mathcal {B}_{\tau -1}^{\top } \Sigma )^\top . \end{aligned}$$

$\square$

Proof of Proposition 2.1

The proof relies on calculating the moments generating function of innovations under the $\mathcal {T}$-forward measure. Consider the row vector $\Gamma =[\Gamma _1,\Gamma _2,\Gamma _3]$ with $\Gamma _i \in \mathbb {R}$ for $i=1,2,3$. Then

$$\begin{aligned}&\mathbb {E}^{\mathcal {T}}\left[ \exp (\Gamma Z^{\mathcal {T}}_{t+1})\bigg |\mathcal {F}_t\right] =\mathbb {E}^{\mathcal {T}}\left[ \exp (\Gamma Z^{\mathbb {Q}}_{t+1}+\Gamma \Delta \rho \Sigma \mathcal {B}_{\tau -1}) \bigg |\mathcal {F}_t\right] \\&\quad = \frac{\mathbb {E}^{{\mathbb {Q}}}\left[ \exp (\Gamma Z^{\mathbb {Q}}_{t+1}+\Gamma \Delta \rho \Sigma \mathcal {B}_{\tau -1}) \dfrac{d\mathbb {Q}^{\mathcal {T}}}{d\mathbb {Q}} \bigg |\mathcal {F}_t\right] }{\mathbb {E}^{{\mathbb {Q}}}\left[ \dfrac{d\mathbb {Q}^{\mathcal {T}}}{d\mathbb {Q}} \bigg |\mathcal {F}_t\right] }\\&\quad = \exp \left[ \Gamma \Delta \rho \Sigma \mathcal {B}_{\tau -1}\right] \dfrac{\mathbb {E^Q}\left[ \exp (\Gamma Z^{\mathbb {Q}}_{t+1})\dfrac{P(t+\tau ,t+\tau )B(0)}{P(0,t+\tau )B(t+\tau )} \bigg |\mathcal {F}_t \right] }{\mathbb {E^Q}\left[ \dfrac{P(t+\tau ,t+\tau )B(0)}{P(0,t+\tau )B(t+\tau )} \bigg |\mathcal {F}_t \right] }\\&\quad = \exp \left[ \Gamma \Delta \rho \Sigma \mathcal {B}_{\tau -1} \right] \frac{1}{P(0,t+\tau )}\dfrac{\mathbb {E^Q}\left[ \exp (\Gamma Z^{\mathbb {Q}}_{t+1})\dfrac{P(t+\tau ,t+\tau )}{B(t+\tau )} \bigg |\mathcal {F}_t \right] }{\dfrac{P(t,t+\tau )}{P(0,t+\tau )B(t)}}\\&\quad =\exp \left[ \Gamma \rho \Delta \Sigma \mathcal {B}_{\tau -1} \right] \dfrac{B(t)}{P(t,t+\tau )} \mathbb {E^Q}\left[ \exp (\Gamma Z^{\mathbb {Q}}_{t+1})\mathbb {E^Q}\left[ \dfrac{P(t+\tau ,t+\tau )}{B(t+\tau )} \bigg |\mathcal {F}_{t+1} \right] \bigg |\mathcal {F}_t \right] \\&\quad = \exp \left[ \Gamma \Delta \rho \Sigma \mathcal {B}_{\tau -1} \right] \mathbb {E^Q}\left[ \exp (\Gamma Z^{\mathbb {Q}}_{t+1}) \dfrac{B(t)}{P(t,t+\tau )}\dfrac{P(t+1,t+\tau )}{B(t+1)} \bigg |\mathcal {F}_t \right] , \end{aligned}$$

where the fourth and fifth equalities rely on the fact that $\frac{P(\cdot ,\mathcal {T})}{B(\cdot )}$ is a $\mathbb {Q}$-martingale.

Define

$$\begin{aligned} Y(Z_{t+1}^\mathbb {Q}) \equiv \Gamma Z^{\mathbb {Q}}_{t+1}+\log \left( \dfrac{A_{\tau -1}}{A_\tau }\right) -\Delta \mathcal {B}_{\tau -1}^{\top }\left( \kappa ^\mathbb {Q}\theta ^\mathbb {Q}+ \Sigma Z_{t+1}^\mathbb {Q}\right) . \end{aligned}$$

Using Lemma 6.2 therefore leads to

$$\begin{aligned} \mathbb {E}^{\mathcal {T}}\left[ \exp \left( \Gamma Z^{\mathcal {T}}_{t+1}\right) \bigg |\mathcal {F}_t\right] =\exp \left[ \Gamma \Delta \rho \Sigma \mathcal {B}_{\tau -1} \right] \mathbb {E^Q}\bigg [ \exp \left( Y(Z_{t+1}^\mathbb {Q}) \right) \bigg |\mathcal {F}_t \bigg ], \end{aligned}$$

where, given $\mathcal {F}_t$, $Y(Z_{t+1}^\mathbb {Q})$ follows the Gaussian distribution with conditional mean and variance

$$\begin{aligned} \mathbb {E^Q}\left[ Y(Z_{t+1}^\mathbb {Q}) \bigg |\mathcal {F}_t \right]&= \log \left( \dfrac{A_{\tau -1}}{A_\tau }\right) -\Delta \mathcal {B}_{\tau -1}^{\top } \kappa ^\mathbb {Q}\theta ^\mathbb {Q},\\ \text {Var}^{\mathbb {Q}} \left[ Y(Z_{t+1}^\mathbb {Q}) \bigg |\mathcal {F}_t \right]&= \Gamma \rho \Gamma ^\top + \Delta \mathcal {B}_{\tau -1}^{\top } \Sigma \rho (\Delta \mathcal {B}_{\tau -1}^{\top } \Sigma )^\top - 2 \Gamma \Delta \rho \Sigma \mathcal {B}_{\tau -1}. \end{aligned}$$

Thus,

$$\begin{aligned} \mathbb {E}^{\mathcal {T}}\left[ \exp (\Gamma Z^{\mathcal {T}}_{t+1})\bigg |\mathcal {F}_t\right]&=\dfrac{A_{\tau -1}}{A_\tau } \exp \left( -\Delta \mathcal {B}_{\tau -1}^{\top } \kappa ^\mathbb {Q}\theta ^\mathbb {Q}+ \dfrac{1}{2} \Delta \mathcal {B}_{\tau -1}^{\top } \Sigma \rho (\Delta \mathcal {B}_{\tau -1}^{\top } \Sigma )^\top \right)\times \\&\quad \exp \left( \dfrac{1}{2}\Gamma \rho \Gamma ^\top \right) . \end{aligned}$$

Therefore, using Lemma 6.6 leads to

$$\begin{aligned} \mathbb {E}^{\mathcal {T}}\left[ \exp (\Gamma Z^{\mathcal {T}}_{t+1})\bigg |\mathcal {F}_t\right] =\exp \left( \dfrac{1}{2}\Gamma \rho \Gamma ^\top \right) . \end{aligned}$$

$\square$

Lemma 6.7

Assume (2.13) holds. For $t=0,\ldots ,\mathcal {T}$ and $n=0,\ldots ,\mathcal {T}-t$,

$$\begin{aligned} X^{(i)}_{t+n}&=X^{(i)}_{t}(1-\kappa ^{\mathcal {T}}_{i,i})^{n}+\kappa ^{\mathcal {T}}_{i,i} \theta ^{\mathcal {T}}_i \sum _{l=1}^{n}(1-\kappa ^{\mathcal {T}}_{i,i})^{(n-l)}+\Sigma _{i,i}\sum _{l=1}^{n}Z^{\mathcal {T}}_{t+l,i}(1-\kappa ^{\mathcal {T}}_{i,i})^{(n-l)} \\&\quad +\sum _{l=1}^{n}\sum _{j\ne i}^3 \kappa ^{\mathcal {T}}_{i,j} (\theta ^{\mathcal {T}}_j-X^{(j)}_{t+l-1}) (1-\kappa ^{\mathcal {T}}_{i,i})^{(n-l)} - \sum _{l=0}^{n-1}\eta ^{\mathcal {T},(i)}_{t+l} (1- \kappa ^{\mathcal {T}}_{i,i})^{n-1-l}. \end{aligned}$$

(6.5)

Proof of Lemma 6.7

This proof is analogous to that of Lemma A.1 from Eghbalzadeh et al. (2022), which is based on induction. We apply the convention $\sum _{l=0}^{-1} x_l= \sum _{l=1}^{0} x_l \equiv 0$. The case $n=0$ is therefore trivial. Then, assume (6.5) holds for some $n \le \mathcal {T}-t-1$. Using (2.13), for any $i=1,2,3$,

$$\begin{aligned} X^{(i)}_{t+n+1}&=X^{(i)}_{t+n}+\sum _{j=1}^{3}\kappa ^{\mathcal {T}}_{i,j}(\theta ^{\mathcal {T}}_j-X^{(j)}_{t+n})+\Sigma _{i,i}Z^{\mathcal {T}}_{t+n+1,i} - \eta _{t+n}^{\mathcal {T},(i)}\\&=X^{(i)}_{t+n}(1-\kappa ^{\mathcal {T}}_{i,i})+ \kappa ^{\mathcal {T}}_{i,i}\theta ^{\mathcal {T}}_i +\sum _{j\ne i}^{3}\kappa ^{\mathcal {T}}_{i,j}(\theta ^{\mathcal {T}}_j-X^{(j)}_{t+n})+\Sigma _{i,i}Z^{\mathcal {T}}_{t+n+1,i} - \eta _{t+n}^{\mathcal {T},(i)}. \end{aligned}$$

Applying (6.5) in the latter equality yields

$$\begin{aligned} X^{(i)}_{t+n+1}&= X^{(i)}_{t}(1-\kappa ^{\mathcal {T}}_{i,i})^{n+1}+\kappa ^{\mathcal {T}}_{i,i}\theta ^{\mathcal {T}}_i \sum _{l=1}^{n}(1-\kappa ^{\mathcal {T}}_{i,i})^{(n+1-l)}+\Sigma _{i,i}\sum _{l=1}^{n}Z^{\mathcal {T}}_{t+l,i}(1-\kappa ^{\mathcal {T}}_{i,i})^{(n+1-l)}\\&\quad +\sum _{l=1}^{n}\sum _{j\ne i}^{3}\kappa ^{\mathcal {T}}_{i,j}(\theta ^{\mathcal {T}}_j-X^{(j)}_{t+l-1})(1-\kappa ^{\mathcal {T}}_{i,i})^{(n+1-l)} + \kappa ^{\mathcal {T}}_{i,i}\theta ^{\mathcal {T}}_i - \sum _{l=0}^{n-1}\eta ^{\mathcal {T},(i)}_{t+l} (1- \kappa ^{\mathcal {T}}_{i,i} )^{n-l}\\&\quad +\sum _{j\ne i}^{3}\kappa ^{\mathcal {T}}_{i,j}(\theta ^{\mathcal {T}}_j-X^{(j)}_{t+n})+\Sigma _{i,i}Z^{\mathcal {T}}_{t+n+1,i} - \eta ^{\mathcal {T},(i)}_{t+n}\\&= X^{(i)}_{t}(1-\kappa ^{\mathcal {T}}_{i,i})^{n+1}+\kappa ^{\mathcal {T}}_{i,i}\theta ^{\mathcal {T}}_i \sum _{l=1}^{n+1}(1-\kappa ^{\mathcal {T}}_{i,i})^{(n+1-l)}+\Sigma _{i,i}\sum _{l=1}^{n+1}Z^{\mathcal {T}}_{t+l,i}(1-\kappa ^{\mathcal {T}}_{i,i})^{(n+1-l)}\\&\quad +\sum _{l=1}^{n+1}\sum _{j\ne i}^{3}\kappa ^{\mathcal {T}}_{i,j}(\theta ^{\mathcal {T}}_j-X^{(j)}_{t+l-1})(1-\kappa ^{\mathcal {T}}_{i,i})^{(n+1-l)} - \sum _{l=0}^{n}\eta ^{\mathcal {T},(i)}_{t+l} (1- \kappa ^{\mathcal {T}}_{i,i} )^{n-l} , \end{aligned}$$

thereby finishing the induction. $\square$

Lemma 6.8

For $n=0,\ldots ,\mathcal {T}-t$, the factors $X^{(i)}_{t+n}$ can be expressed in terms of $X_{t}$ and innovations $\{ Z^{\mathcal {T}}_{t+l} \}^{\mathcal {T}-t}_{l=1}$ as follows:

$$\begin{aligned} X^{(1)}_{t+n}=& X^{(1)}_{t}+\Sigma _{1,1}\sum _{l=1}^{n}Z^{\mathcal {T}}_{t+l,1} - \sum _{l=0}^{n-1}\eta ^{\mathcal {T},(1)}_{t+l},\\ X^{(2)}_{t+n}=& X^{(2)}_{t}(1-\lambda )^{n}+(\theta ^\mathcal {T}_2-\theta ^\mathcal {T}_3)\left( 1-(1-\lambda )^n\right) +\Sigma _{2,2}\sum _{l=1}^{n}Z^{\mathcal {T}}_{t+l,2}(1-\lambda )^{(n-l)}\\&\quad + \lambda \bigg (nX^{(3)}_{t} (1-\lambda )^{n-1} + \theta ^{\mathcal {T}}_3 \left( \frac{\left( 1-(1-\lambda )^n\right) }{\lambda }-n(1-\lambda )^{n-1} \right) \\&\quad + \Sigma _{3,3} \sum _{k=1}^{n-1} (n-k)(1-\lambda )^{n-k-1} Z^{\mathcal {T}}_{t+k,3} -\sum _{k=0}^{n-1}(n-k-1) \eta ^{\mathcal {T},(3)}_{t+k} (1-\lambda )^{n-k-2}\bigg )\\&\quad -\sum _{l=0}^{n-1}\eta ^{\mathcal {T},(2)}_{t+l} (1- \lambda )^{n-1-l},\\ X^{(3)}_{t+n}=& X^{(3)}_{t}(1-\lambda )^{n}+ \theta ^\mathcal {T}_3\left( 1-(1-\lambda )^n\right) +\Sigma _{3,3}\sum _{l=1}^{n}Z^{\mathcal {T}}_{t+l,3}(1-\lambda )^{(n-l)}\\&\quad - \sum _{l=0}^{n-1}\eta ^{\mathcal {T},(3)}_{t+l} (1- \lambda )^{n-1-l}. \end{aligned}$$

Proof of Lemma 6.8

From (2.12), $\kappa ^{\mathcal {T}}_{1,1}=\kappa ^{\mathcal {T}}_{1,2}=\kappa ^{\mathcal {T}}_{1,3}=\kappa ^{\mathcal {T}}_{2,1}=\kappa ^{\mathcal {T}}_{3,1}=\kappa ^{\mathcal {T}}_{3,2}=0$, $\kappa ^{\mathcal {T}}_{2,2}=\kappa ^{\mathcal {T}}_{3,3}=\lambda$ and $\kappa ^{\mathcal {T}}_{2,3}=-\lambda$. When placed into (6.5), this leads to

$$\begin{aligned} X^{(1)}_{t+n}= & {} X^{(1)}_{t}+\Sigma _{1,1}\sum _{l=1}^{n}Z^{\mathcal {T}}_{t+l,1} - \sum _{l=0}^{n-1}\eta ^{\mathcal {T},(1)}_{t+l}, \\ X^{(2)}_{t+n}= & {} X^{(2)}_{t}(1-\lambda )^{n}+\lambda \theta ^\mathcal {T}_2 \sum _{l=1}^{n}(1-\lambda )^{(n-l)}+\Sigma _{2,2}\sum _{l=1}^{n}Z^{\mathcal {T}}_{t+l,2}(1-\lambda )^{(n-l)} \\{} & {} \quad -\lambda \sum _{l=1}^{n} (\theta ^\mathcal {T}_3-X^{(3)}_{t+l-1}) (1-\lambda )^{(n-l)} - \sum _{l=0}^{n-1}\eta ^{\mathcal {T},(2)}_{t+l} (1- \lambda )^{n-1-l}, \end{aligned}$$

(6.6)

$$\begin{aligned} X^{(3)}_{t+n}= & {} X^{(3)}_{t}(1-\lambda )^{n}+\lambda \theta ^\mathcal {T}_3 \sum _{l=1}^{n}(1-\lambda )^{(n-l)}+\Sigma _{3,3}\sum _{l=1}^{n}Z^{\mathcal {T}}_{t+l,3}(1-\lambda )^{(n-l)} \\{} & {} \quad - \sum _{l=0}^{n-1}\eta ^{\mathcal {T},(3)}_{t+l} (1- \lambda )^{n-1-l}. \end{aligned}$$

(6.7)

Furthermore,

$$\begin{aligned}\sum _{l=1}^{n} (1-\lambda )^{n-l} X^{(3)}_{t+l-1} &= \sum _{l=0}^{n-1}(1-\lambda )^{n-l-1} X^{(3)}_{t+l} \\& = \sum _{l=0}^{n-1} (1-\lambda )^{n-l-1} \bigg [ X^{(3)}_{t}(1-\lambda )^{l}+\lambda \theta ^{\mathcal {T}}_3 \sum _{k=1}^{l}(1-\lambda )^{(l-k)} \\&\quad \quad +\Sigma _{3,3}\sum _{k=1}^{l}Z^{\mathcal {T}}_{t+k,3}(1-\lambda )^{(l-k)}- \sum _{k=0}^{l-1}(1-\lambda )^{l-1-k}\eta ^{\mathcal {T},(3)}_{t+k}\bigg ] \\& = nX^{(3)}_{t} (1-\lambda )^{n-1} + \lambda \theta ^{\mathcal {T}}_3 \sum _{l=0}^{n-1} \frac{(1-\lambda )^{n-l-1}-(1-\lambda )^{n-1}}{\lambda } \\&\quad \quad + \Sigma _{3,3} \sum _{l=0}^{n-1} \sum _{k=1}^{n-1} \mathbbm {1}_{\{k \le l\}} Z^{\mathcal {T}}_{t+k,3} (1-\lambda )^{n-k-1} - \sum _{l=0}^{n-1} \sum _{k=0}^{n-1}\mathbbm {1}_{\{k \le l-1\}}(1-\lambda )^{n-k-2} \eta ^{\mathcal {T},(3)}_{t+k} \\ &= nX^{(3)}_{t} (1-\lambda )^{n-1} + \theta ^{\mathcal {T}}_3 \left( \dfrac{1-(1-\lambda )^n}{\lambda }-n(1-\lambda )^{n-1} \right) \\&\quad \quad + \Sigma _{3,3} \sum _{k=1}^{n-1} Z^{\mathcal {T}}_{t+k,3} \sum _{l=k}^{n-1} (1-\lambda )^{n-k-1}-\sum _{k=0}^{n-1} \eta ^{\mathcal {T},(3)}_{t+k} \sum _{l=k+1}^{n-1}(1-\lambda )^{n-k-2} \\& = nX^{(3)}_{t} (1-\lambda )^{n-1} + \theta ^{\mathcal {T}}_3 \left( \dfrac{1-(1-\lambda )^n}{\lambda }-n(1-\lambda )^{n-1} \right) \\&\quad \quad + \Sigma _{3,3} \sum _{k=1}^{n-1} (n-k)(1-\lambda )^{n-k-1} Z^{\mathcal {T}}_{t+k,3} -\sum _{k=0}^{n-1}(n-k-1) \eta ^{\mathcal {T},(3)}_{t+k} (1-\lambda )^{n-k-2}. \end{aligned}$$

(6.8)

Using (6.8) in (6.6),

$$\begin{aligned} X^{(2)}_{t+n}&=X^{(2)}_{t}(1-\lambda )^{n}+\theta ^\mathcal {T}_2\left( 1-(1-\lambda )^n\right) +\Sigma _{2,2}\sum _{l=1}^{n}Z^{\mathcal {T}}_{t+l,2}(1-\lambda )^{(n-l)}\\&\quad - \theta ^\mathcal {T}_3\left( 1-(1-\lambda )^n\right) \\&\quad + \lambda \bigg (n X^{(3)}_{t} (1-\lambda )^{n-1} + \theta ^{\mathcal {T}}_3 \left( \dfrac{1-(1-\lambda )^n}{\lambda }-n(1-\lambda )^{n-1} \right) \\&\quad + \Sigma _{3,3} \sum _{k=1}^{n-1} (n-k)(1-\lambda )^{n-k-1} Z^{\mathcal {T}}_{t+k,3} -\sum _{k=0}^{n-1}(n-k-1) \eta ^{\mathcal {T},(3)}_{t+k} (1-\lambda )^{n-k-2}\bigg ) \\&\quad - \sum _{l=0}^{n-1}\eta ^{\mathcal {T},(2)}_{t+l} (1- \lambda )^{n-1-l}. \end{aligned}$$

Moreover, using (2.7) in (6.7) leads to

$$\begin{aligned} X^{(3)}_{t+n}&=X^{(3)}_{t}(1-\lambda )^{n}+ \theta ^\mathcal {T}_3\left( 1-(1-\lambda )^n\right) +\Sigma _{3,3}\sum _{l=1}^{n}Z^{\mathcal {T}}_{t+l,3}(1-\lambda )^{(n-l)}\\&\quad - \sum _{l=0}^{n-1}\eta ^{\mathcal {T},(3)}_{t+l} (1- \lambda )^{n-1-l}. \end{aligned}$$

$\square$

Proof of Proposition 2.2

The joint normality of $X_{t+n}$ given $\mathcal {F}_t$ is a direct consequence of Lemma 6.8, which expresses $X_{t+n}$ as a linear combination of the $\mathcal {F}_t$-measurable elements of $X_t$ and of jointly Gaussian innovations $Z^\mathcal {T}_{t+1},\ldots , Z^\mathcal {T}_{t+n}$ that are independent of $\mathcal {F}_t$. The composition of $\mathcal {M}_{t,n}$ is also a direct consequence of Lemma 6.8, and of the null expectation of innovations $Z^\mathcal {T}_{t+1},\ldots , Z^\mathcal {T}_{t+n}$ given $\mathcal {F}_t$.

Components of $\mathcal {V}_{n}$ are also obtained through Lemma 6.8 and (2.7)–(2.9):

$$\begin{aligned} \mathcal {V}^{(1,1)}_{n}&=\text {Var}^\mathcal {T}(X^{(1)}_{t+n} |\mathcal {F}_t)= \Sigma _{1,1}^2 \sum _{l=1}^{n}\text {Var}^\mathcal {T}(Z^{\mathcal {T}}_{t+l,1}) =n \Sigma _{1,1}^2,\\ \mathcal {V}^{(2,2)}_{n}&=\text {Var}^\mathcal {T}(X^{(2)}_{t+n}|\mathcal {F}_t) = \Sigma ^2_{2,2} \sum _{l=1}^{n}(1-\lambda )^{2(n-l)} \text {Var}^\mathcal {T}(Z^{\mathcal {T}}_{t+l,2}) \\&\quad + \lambda ^2 \Sigma ^2_{3,3} \sum _{l=1}^{n-1} (n-l)^2(1-\lambda )^{2(n-l-1)} \text {Var}^\mathcal {T}(Z^{\mathcal {T}}_{t+l,3})\\&\quad +2\Sigma _{2,2} \lambda \Sigma _{3,3}\sum _{l=1}^{n-1} (n-l)(1-\lambda )^{2(n-l)-1} \text {Cov}^\mathcal {T}(Z^{\mathcal {T}}_{t+l,2},Z^{\mathcal {T}}_{t+l,3})\\&= \Sigma _{2,2}^2 \left( 1+\zeta _0((1-\lambda )^2,n)\right) + \lambda ^2\Sigma _{3,3}^2(1-\lambda )^{-2}\zeta _2((1-\lambda )^2,n)\\&\quad +2\Sigma _{2,2} \lambda \Sigma _{3,3}\rho _{2,3}(1-\lambda )^{-1}\zeta _1\left( \left( 1-\lambda \right) ^2,n\right) ,\\ \mathcal {V}^{(3,3)}_{n}&=\text {Var}^\mathcal {T}(X^{(3)}_{t+n}|\mathcal {F}_t)=\Sigma _{3,3}^2\sum _{l=1}^n(1-\lambda )^{2(n-l)}\text {Var}^\mathcal {T}(Z^{\mathcal {T}}_{t+l,3}) \\&=\Sigma _{3,3}^2 \left( 1+\zeta _0((1-\lambda )^2,n)\right) \end{aligned}$$

and

$$\begin{aligned} \mathcal {V}^{(1,2)}_{n}=\mathcal {V}^{(2,1)}_{n}&=\text {Cov}^\mathcal {T}(X^{(1)}_{t+n},X^{(2)}_{t+n}|\mathcal {F}_t)=\Sigma _{1,1}\Sigma _{2,2}\sum _{l=1}^{n}(1-\lambda )^{n-l}\text {Cov}^\mathcal {T}(Z^{\mathcal {T}}_{t+l,1},Z^{\mathcal {T}}_{t+l,2})\\&\quad +\lambda \Sigma _{1,1}\Sigma _{3,3}\sum _{l=1}^{n-1}(n-l)(1-\lambda )^{n-l-1}\text {Cov}^\mathcal {T}(Z^{\mathcal {T}}_{t+l,1},Z^{\mathcal {T}}_{t+l,3})\\&=\Sigma _{1,1}\Sigma _{2,2}\rho _{1,2}\left( 1+\zeta _0(1-\lambda ,n)\right) +\lambda \Sigma _{1,1}\Sigma _{3,3}\rho _{1,3}\dfrac{\zeta _1(1-\lambda ,n)}{1-\lambda },\\ \mathcal {V}^{(1,3)}_{n}=\mathcal {V}^{(3,1)}_{n}&=\text {Cov}^\mathcal {T}(X^{(1)}_{t+n},X^{(3)}_{t+n}|\mathcal {F}_t)=\Sigma _{1,1}\Sigma _{3,3}\sum _{l=1}^{n}(1-\lambda )^{n-l}\text {Cov}^\mathcal {T}(Z^{\mathcal {T}}_{t+l,1},Z^{\mathcal {T}}_{t+l,3})\\&=\Sigma _{1,1}\Sigma _{3,3}\rho _{1,3} \left( 1+\zeta _0(1-\lambda ,n)\right) ,\\ \mathcal {V}^{(2,3)}_{n}=\mathcal {V}^{(3,2)}_{n}&=\text {Cov}^\mathcal {T}(X^{(2)}_{t+n},X^{(3)}_{t+n}|\mathcal {F}_t)\\&=\Sigma _{2,2}\Sigma _{3,3}\sum _{l=1}^{n}(1-\lambda )^{2(n-l)}\text {Cov}^\mathcal {T}(Z^{\mathcal {T}}_{t+l,2},Z^{\mathcal {T}}_{t+l,3})\\&\quad +\lambda \Sigma _{3,3}^2\sum _{l=1}^{n-1}(n-l)(1-\lambda )^{2(n-l)-1}\text {Var}^\mathcal {T}(Z^{\mathcal {T}}_{t+l,3})\\&=\Sigma _{2,2}\Sigma _{3,3} \rho _{2,3}\left( 1+\zeta _0((1-\lambda )^2,n)\right) +\lambda \Sigma _{3,3}^2\dfrac{\zeta _1((1-\lambda )^2,n)}{1-\lambda }. \end{aligned}$$

$\square$

Proof of Lemma 2.1

From, (2.12), for $i=1,2,3$, $\eta ^{\mathcal {T},(i)}_t = \Delta \Sigma _{i,i}\sum ^3_{j=1} \Sigma _{j,j} \rho _{i,j} \mathcal {B}^{(j)}_{\mathcal {T}-t-1}$. Therefore,

$$\begin{aligned} \sum _{l=0}^{\mathcal {T}-t-1} \eta ^{\mathcal {T},(1)}_{t+l}&= \Delta \Sigma _{1,1}\sum ^3_{j=1} \Sigma _{j,j} \rho _{1,j} \sum _{l=0}^{\mathcal {T}-t-1} \mathcal {B}^{(j)}_{\mathcal {T}-t-l-1}\\&= \Delta \Sigma _{1,1}\sum ^3_{j=1} \Sigma _{j,j} \rho _{1,j} \sum _{l=0}^{\mathcal {T}-t-1} \mathcal {B}^{(j)}_{l}\\&= \Delta \Sigma _{1,1}\sum ^3_{j=1} \Sigma _{j,j} \rho _{1,j} \sum _{l=1}^{\mathcal {T}-t-1} \mathcal {B}^{(j)}_{l}\\&=\Delta \Sigma _{1,1}\bigg [\Sigma _{1,1} \frac{(\mathcal {T}-t-1)(\mathcal {T}-t)}{2} +\dfrac{\Sigma _{2,2} \rho _{1,2}}{\lambda }\left( \mathcal {T}-t-1 -\zeta _0\left( 1-\lambda ,\mathcal {T}-t\right) \right) \\&\quad +\Sigma _{3,3} \rho _{1,3}\bigg ( \dfrac{\mathcal {T}-t-1-\left( 1+\zeta _0\left( 1-\lambda ,\mathcal {T}-t-1\right) \right) }{\lambda }\\&\quad -\zeta _1\left( 1-\lambda , \mathcal {T}-t-1\right) \bigg )\bigg ]. \end{aligned}$$

Moreover, for $i=2,3$,

$$\begin{aligned}&\sum _{l=0}^{\mathcal {T}-t-1}\eta ^{\mathcal {T},(i)}_{t+l} (1- \lambda )^{\mathcal {T}-t-1-l}\\&\quad = \Delta \Sigma _{i,i}\sum _{l=0}^{\mathcal {T}-t-1}(1- \lambda )^{\mathcal {T}-t-1-l}\left( \Sigma _{1,1} \rho _{i,1} \mathcal {B}^{(1)}_{\mathcal {T}-t-l-1} \!+\! \Sigma _{2,2} \rho _{i,2} \mathcal {B}^{(2)}_{\mathcal {T}-t-l-1} \!\right. \\&\quad \quad \left. +\! \Sigma _{3,3} \rho _{i,3} \mathcal {B}^{(3)}_{\mathcal {T}-t-l-1} \right) \\&\quad = \Delta \Sigma _{i,i}\sum _{l=1}^{\mathcal {T}-t-1}(1- \lambda )^{l}\left( \Sigma _{1,1} \rho _{i,1} \mathcal {B}^{(1)}_{l} + \Sigma _{2,2} \rho _{i,2} \mathcal {B}^{(2)}_{l} + \Sigma _{3,3} \rho _{i,3} \mathcal {B}^{(3)}_{l} \right) \\&\quad =\Delta \Sigma _{i,i}\bigg [\Sigma _{1,1} \rho _{i,1} \zeta _1\left( 1-\lambda ,\mathcal {T}-t\right) \\&\quad \quad+\dfrac{\Sigma _{2,2} \rho _{i,2}}{\lambda }\left( \zeta _0\left( 1-\lambda ,\mathcal {T}-t\right) -\zeta _0\left( \left( 1-\lambda \right) ^{2},\mathcal {T}-t\right) \right) \\&\quad \quad +\Sigma _{3,3} \rho _{i,3}\bigg (\dfrac{\zeta _0\left( 1-\lambda ,\mathcal {T}-t\right) -(1-\lambda )^{-1}\zeta _0\left( \left( 1-\lambda \right) ^{2},\mathcal {T}-t\right) }{\lambda } \\&\qquad -(1+\lambda )\zeta _1\left( \left( 1-\lambda \right) ^{2},\mathcal {T}-t-1\right) \bigg )\bigg ]. \end{aligned}$$

Lastly,

$$\begin{aligned}&\sum _{k=0}^{\mathcal {T}-t-1}(\mathcal {T}-t-k-1) \eta ^{\mathcal {T},(3)}_{t+k} (1-\lambda )^{\mathcal {T}-t-k-2}\\&\quad = \sum _{k=0}^{\mathcal {T}-t-1}(\mathcal {T}-t-k-1) \Delta \Sigma _{3,3}\sum ^3_{j=1} \Sigma _{j,j} \rho _{3,j} \mathcal {B}^{(j)}_{\mathcal {T}-t-k-1} (1-\lambda )^{\mathcal {T}-t-k-2}\\&\quad = \Delta \Sigma _{3,3} \sum _{k=0}^{\mathcal {T}-t-1} k \sum ^3_{j=1} \Sigma _{j,j} \rho _{3,j} \mathcal {B}^{(j)}_{k} (1-\lambda )^{k-1}\\&\quad = \Delta \Sigma _{3,3} \sum _{k=1}^{\mathcal {T}-t-1} \sum ^3_{j=1} \Sigma _{j,j} \rho _{3,j} k \mathcal {B}^{(j)}_{k} (1-\lambda )^{k-1}\\&\quad = \frac{\Delta \Sigma _{3,3}}{1-\lambda } \bigg ( \Sigma _{1,1} \rho _{3,1} \zeta _2\left( 1-\lambda ,\mathcal {T}-t\right) + \frac{\Sigma _{2,2} \rho _{2,1}}{\lambda } \left[ \zeta _1\left( 1-\lambda ,\mathcal {T}-t\right) \right. \\&\qquad \left. - \zeta _1\left( (1-\lambda )^2,\mathcal {T}-t\right) \right] \\&\quad \quad + \Sigma _{3,3} \rho _{3,1} \left[ \frac{\zeta _1\left( 1-\lambda ,\mathcal {T}-t\right) }{\lambda } - \frac{1}{\lambda }\zeta _1\left( (1-\lambda )^2,\mathcal {T}-t\right) \right. \\&\qquad \left. - (1-\lambda )^{-1}\zeta _2\left( (1-\lambda )^2,\mathcal {T}-t\right) \right] \bigg ). \end{aligned}$$

Lemma 6.9

Under the risk-neutral measure $\mathbb {Q}$, conditionally on $\mathcal {F}_t$, factors $X_{t+n}$ follow the multivariate Gaussian distribution with mean vector $\tilde{\mathcal {M}}_{t,n}=\left[ \tilde{\mathcal {M}}^{(i)}_{t,n}\right] ^3_{i=1}$ and covariance matrix $\mathcal {V}_{n}=\left[ \mathcal {V}^{(i,j)}_{n}\right] ^3_{i,j=1}$, where

$$\begin{aligned} \tilde{\mathcal {M}}^{(1)}_{t,n}&=X^{(1)}_{t}, \\ \tilde{\mathcal {M}}^{(2)}_{t,n}&=X^{(2)}_{t}(1-\lambda )^{n}+(\theta ^\mathbb {Q}_2-\theta ^\mathbb {Q}_3)\left( 1-(1-\lambda )^n\right) \\&\quad + \lambda \bigg (nX^{(3)}_{t} (1-\lambda )^{n-1} + \theta ^{\mathbb {Q}}_3 \left( \dfrac{\zeta _0(1-\lambda ,n+1)}{1-\lambda }-n(1-\lambda )^{n-1} \right) \bigg ),\\ \tilde{\mathcal {M}}^{(3)}_{t,n}&=X^{(3)}_{t}(1-\lambda )^{n}+ \theta ^\mathbb {Q}_3\left( 1-(1-\lambda )^n\right) . \end{aligned}$$

Proof of Lemma 6.9

The proof is analogous to that of Proposition 2.2, replacing forward measure superscripts $\mathcal {T}$ by $\mathbb {Q}$ and setting the process $\eta$ to zero. $\square$

Proof of Proposition 2.3

From (2.14), For $t=0,\ldots ,T$ and $n=0,\ldots ,T-t$,

$$\begin{aligned} X^{(i)}_{t+n}&=X^{(i)}_{t}(1-\kappa ^{\mathbb {P}}_{i,i})^{n}+\kappa ^{\mathbb {P}}_{i,i} \theta ^{\mathbb {P}}_i \sum _{l=1}^{n}(1-\kappa ^{\mathbb {P}}_{i,i})^{(n-l)}+\Sigma _{i,i}\sum _{l=1}^{n}Z^{\mathbb {P}}_{t+l,i}(1-\kappa ^{\mathbb {P}}_{i,i})^{(n-l)} \\&\quad +\sum _{l=1}^{n}\sum _{j\ne i}^3 \kappa ^{\mathbb {P}}_{i,j} (\theta ^{\mathbb {P}}_j-X^{(j)}_{t+l-1}) (1-\kappa ^{\mathbb {P}}_{i,i})^{(n-l)}. \end{aligned}$$

(6.9)

Indeed, steps for the proof of (6.9) are identical to these for the proof of Lemma 6.7, setting the process $\eta$ to zero. This leads to

$$\begin{aligned} X^{(i)}_{t+n}&=X^{(i)}_{t}(1-\kappa ^{\mathbb {P}}_{i,i})^{n}+ \theta ^\mathbb {P}_i\left( 1-(1-\kappa ^{\mathbb {P}}_{i,i})^n\right) +\Sigma _{i,i}\sum _{l=1}^{n}Z^{\mathbb {P}}_{t+l,i}(1-\kappa ^{\mathbb {P}}_{i,i})^{(n-l)}, \quad i=1,3. \end{aligned}$$

(6.10)

and

$$\begin{aligned} X^{(2)}_{t+n}= & {} X^{(2)}_{t}(1-\kappa ^{\mathbb {P}}_{2,2})^{n}+\kappa ^{\mathbb {P}}_{2,2} \theta ^\mathbb {P}_2 \sum _{l=1}^{n}(1-\kappa ^{\mathbb {P}}_{2,2})^{(n-l)}+\Sigma _{2,2}\sum _{l=1}^{n}Z^{\mathbb {P}}_{t+l,2}(1-\kappa ^{\mathbb {P}}_{2,2})^{(n-l)} \\{} & {} \quad -\lambda \sum _{l=1}^{n} (\theta ^\mathbb {P}_3-X^{(3)}_{t+l-1}) (1-\kappa ^{\mathbb {P}}_{2,2})^{(n-l)}. \end{aligned}$$

(6.11)

Furthermore, assuming $\kappa ^{\mathbb {P}}_{2,2} \ne \kappa ^{\mathbb {P}}_{3,3}$ and denoting $\omega =\frac{1-\kappa ^{\mathbb {P}}_{3,3}}{1-\kappa ^{\mathbb {P}}_{2,2}}$,

$$\begin{aligned} -\sum _{l=1}^{n} (1-\kappa ^{\mathbb {P}}_{2,2} )^{n-l} \left( \theta ^\mathbb {P}_3- X^{(3)}_{t+l-1}\right)&= -\sum _{l=0}^{n-1}(1-\kappa ^{\mathbb {P}}_{2,2} )^{n-l-1} \left( \theta ^\mathbb {P}_3- X^{(3)}_{t+l} \right) \\&= \sum _{l=0}^{n-1} (1-\kappa ^{\mathbb {P}}_{2,2} )^{n-l-1} \bigg [ X^{(3)}_{t}(1-\kappa ^{\mathbb {P}}_{3,3} )^{l}- \theta ^{\mathbb {P}}_3 (1-\kappa ^{\mathbb {P}}_{3,3})^l \\&\quad +\Sigma _{3,3}\sum _{k=1}^{l}Z^{\mathbb {P}}_{t+k,3}(1-\kappa ^{\mathbb {P}}_{3,3} )^{(l-k)} \bigg ] \\&= (X^{(3)}_{t}-\theta ^{\mathbb {P}}_3)\frac{1-\omega ^n}{1-\omega } (1-\kappa ^{\mathbb {P}}_{2,2} )^{n-1} \\&\quad + \Sigma _{3,3} \sum _{l=0}^{n-1} \sum _{k=1}^{n-1} \mathbbm {1}_{\{k \le l\}} Z^{\mathbb {P}}_{t+k,3} \omega ^l\frac{ (1-\kappa ^{\mathbb {P}}_{2,2} )^{n-1}}{(1-\kappa ^{\mathbb {P}}_{3,3} )^{k}} \\&= (X^{(3)}_{t}-\theta ^{\mathbb {P}}_3)\frac{1-\omega ^n}{1-\omega } (1-\kappa ^{\mathbb {P}}_{2,2} )^{n-1} \\&\quad + \Sigma _{3,3} \sum _{k=1}^{n-1} Z^{\mathbb {P}}_{t+k,3} \frac{ (1-\kappa ^{\mathbb {P}}_{2,2} )^{n-1}}{(1-\kappa ^{\mathbb {P}}_{3,3} )^{k}} \sum _{l=k}^{n-1} \omega ^{l} \\&= (X^{(3)}_{t}-\theta ^{\mathbb {P}}_3)\frac{1-\omega ^n}{1-\omega } (1-\kappa ^{\mathbb {P}}_{2,2} )^{n-1} \\&\quad + \Sigma _{3,3} \sum _{k=1}^{n-1} Z^{\mathbb {P}}_{t+k,3} \frac{ (1-\kappa ^{\mathbb {P}}_{2,2} )^{n-1}}{(1-\kappa ^{\mathbb {P}}_{3,3} )^{k}} \frac{\omega ^{k}-\omega ^{n}}{1-\omega } . \end{aligned}$$

(6.12)

Substituting (6.12) into (6.11) leads to

$$\begin{aligned} X^{(2)}_{t+n}&= X^{(2)}_{t}(1-\kappa ^{\mathbb {P}}_{2,2})^{n}+ \theta ^\mathbb {P}_2 \left( 1-(1-\kappa ^{\mathbb {P}}_{2,2})^{n} \right) +\Sigma _{2,2}\sum _{l=1}^{n}Z^{\mathbb {P}}_{t+l,2}(1-\kappa ^{\mathbb {P}}_{2,2})^{(n-l)} \\&\quad + \lambda \bigg ( (X^{(3)}_{t}-\theta ^{\mathbb {P}}_3)\frac{1-\omega ^n}{1-\omega } (1-\kappa ^{\mathbb {P}}_{2,2} )^{n-1} + \Sigma _{3,3} \sum _{l=1}^{n-1} Z^{\mathbb {P}}_{t+l,3} \frac{ (1-\kappa ^{\mathbb {P}}_{2,2} )^{n-1}}{(1-\kappa ^{\mathbb {P}}_{3,3} )^{l}} \frac{\omega ^{l}-\omega ^{n}}{1-\omega } \bigg ). \end{aligned}$$

(6.13)

Combining (6.10) and (6.13) directly yields expressions for $\mathcal {M}^{\mathbb {P},(i)}_{t,n}$. Additionally,

$$\begin{aligned} \mathcal {V}^{\mathbb {P},(1,1)}_{n}&=\text {Var}^{\mathbb{P}}(X^{(1)}_{t+n}|\mathcal {F}_t)=\Sigma _{1,1}^2\sum _{l=1}^n (1-\kappa ^{\mathbb {P}}_{1,1})^{2(n-l)}\text {Var}^\mathbb {P}(Z^{\mathbb {P}}_{t+l,1})\\&= {\left\{ \begin{array}{ll} \Sigma _{1,1}^2 \left( 1+\zeta _0((1-\kappa ^{\mathbb {P}}_{1,1})^2,n)\right) \quad \text { if } \kappa ^{\mathbb {P}}_{1,1}\in (0,1),\\ n \Sigma _{1,1}^2 \quad \text { if } \kappa ^{\mathbb {P}}_{1,1}=0, \end{array}\right. }\\ \mathcal {V}^{\mathbb {P},(2,2)}_{n}&=\text {Var}^{\mathbb{P}}(X^{(2)}_{t+n}|\mathcal {F}_t) = \Sigma ^2_{2,2} \sum _{l=1}^{n}(1-\kappa ^{\mathbb {P}}_{2,2})^{2(n-l)} \text {Var}^\mathbb {P}(Z^{\mathbb {P}}_{t+l,2})\\&\quad + \lambda ^2 \Sigma ^2_{3,3} \sum _{l=1}^{n-1} \left( \frac{ (1-\kappa ^{\mathbb {P}}_{2,2} )^{n-1}}{(1-\kappa ^{\mathbb {P}}_{3,3} )^{l}} \frac{\omega ^{l}-\omega ^{n}}{1-\omega }\right) ^2 \text {Var}^\mathbb {P}(Z^{\mathbb {P}}_{t+l,3})\\&\quad +2\Sigma _{2,2} \lambda \Sigma _{3,3}\sum _{l=1}^{n-1} (1-\kappa ^{\mathbb {P}}_{2,2} )^{(2n-l-1)} \left( \frac{1}{1-\omega }\left( \frac{\omega }{1-\kappa ^{\mathbb {P}}_{3,3}} \right) ^l \right. \\&\quad \left. - \frac{\omega ^n}{1-\omega }\left( \frac{1}{1-\kappa ^{\mathbb {P}}_{3,3}} \right) ^l\right) \text {Cov}^\mathbb {P}(Z^{\mathbb {P}}_{t+l,2},Z^{\mathbb {P}}_{t+l,3})\\&= \Sigma _{2,2}^2 \left( 1+\zeta _0((1-\kappa ^{\mathbb {P}}_{2,2})^2,n)\right) + \lambda ^2 \Sigma ^2_{3,3} \frac{(1-\kappa ^{\mathbb {P}}_{2,2} )^{2n-2}}{(1-\omega )^2} \sum _{l=1}^{n-1} \left( \frac{ \omega ^{2l}-2\omega ^{l+n}+\omega ^{2n}}{(1-\kappa ^{\mathbb {P}}_{3,3} )^{2l}} \right) \\&\quad +2 \frac{\rho _{2,3} \lambda \Sigma _{2,2} \Sigma _{3,3}}{1-\omega } (1-\kappa ^{\mathbb {P}}_{2,2} )^{(2n-1)}\sum _{l=1}^{n-1} \left( \left( \frac{\omega }{(1-\kappa ^{\mathbb {P}}_{2,2})(1-\kappa ^{\mathbb {P}}_{3,3})} \right) ^l \right. \\&\quad \left. -\omega ^n\left( \frac{1}{(1-\kappa ^{\mathbb {P}}_{2,2})(1-\kappa ^{\mathbb {P}}_{3,3})} \right) ^l\right) \\&= \Sigma _{2,2}^2 \left( 1+\zeta _0((1-\kappa ^{\mathbb {P}}_{2,2})^2,n)\right) \\&\quad + \lambda ^2 \Sigma ^2_{3,3} \frac{(1-\kappa ^{\mathbb {P}}_{2,2} )^{2n-2}}{(1-\omega )^2} \left[ \zeta _0\left( (1-\kappa ^{\mathbb {P}}_{2,2})^{-2},n\right) -2\omega ^n \zeta _0\left( \omega (1-\kappa ^{\mathbb {P}}_{3,3})^{-2},n\right) \right. \\&\quad \left. + \omega ^{2n}\zeta _0\left( (1-\kappa ^{\mathbb {P}}_{3,3})^{-2},n\right) \right] \\&\quad +2 \frac{\rho _{2,3} \lambda \Sigma _{2,2} \Sigma _{3,3}}{1-\omega } (1-\kappa ^{\mathbb {P}}_{2,2} )^{(2n-1)} \left[ \zeta _0\left( \frac{\omega }{(1-\kappa ^{\mathbb {P}}_{2,2})(1-\kappa ^{\mathbb {P}}_{3,3})},n\right) \right. \\&\quad \left. -\omega ^n \zeta _0\left( \frac{1}{(1-\kappa ^{\mathbb {P}}_{2,2})(1-\kappa ^{\mathbb {P}}_{3,3})},n\right) \right] ,\\ \mathcal {V}^{\mathbb {P},(3,3)}_{n}&=\text {Var}^{\mathbb{P}}(X^{(3)}_{t+n}|\mathcal {F}_t)=\Sigma _{3,3}^2\sum _{l=1}^n(1-\kappa ^{\mathbb {P}}_{3,3})^{2(n-l)}\text {Var}^\mathbb {P}(Z^{\mathbb {P}}_{t+l,3})\\&=\Sigma _{3,3}^2 \left( 1+\zeta _0((1-\kappa ^{\mathbb {P}}_{3,3})^2,n)\right) \end{aligned}$$

and

$$\begin{aligned} \mathcal {V}^{\mathbb {P},(1,2)}_{n}&=\mathcal {V}^{\mathbb {P},(2,1)}_{n}=\text {Cov}^\mathbb {P}(X^{(1)}_{t+n},X^{(2)}_{t+n}|\mathcal {F}_t)\\&=\Sigma _{1,1}\Sigma _{2,2}\sum _{l=1}^{n}(1-\kappa ^{\mathbb {P}}_{1,1})^{n-l} (1-\kappa ^{\mathbb {P}}_{2,2})^{n-l} \text {Cov}^\mathbb {P}(Z^{\mathbb {P}}_{t+l,1},Z^{\mathbb {P}}_{t+l,2})\\&\quad +\lambda \Sigma _{1,1}\Sigma _{3,3}\sum _{l=1}^{n-1} (1-\kappa ^{\mathbb {P}}_{1,1})^{n-l} \frac{ (1-\kappa ^{\mathbb {P}}_{2,2} )^{n-1}}{(1-\kappa ^{\mathbb {P}}_{3,3} )^{l}} \frac{\omega ^{l}-\omega ^{n}}{1-\omega } \text {Cov}^\mathbb {P}(Z^{\mathbb {P}}_{t+l,1},Z^{\mathbb {P}}_{t+l,3})\\&=\Sigma _{1,1}\Sigma _{2,2}\rho _{1,2} \left[ 1+\zeta _0((1-\kappa ^{\mathbb {P}}_{1,1})(1-\kappa ^{\mathbb {P}}_{2,2}),n) \right] \\&\quad +\lambda \Sigma _{1,1}\Sigma _{3,3}\rho _{1,3} \frac{(1-\kappa ^{\mathbb {P}}_{1,1})^n (1-\kappa ^{\mathbb {P}}_{2,2})^{n-1}}{1-\omega } \times \\&\quad \left( \zeta _0 \left( \frac{\omega }{(1-\kappa ^{\mathbb {P}}_{1,1})(1-\kappa ^{\mathbb {P}}_{3,3})},n\right) - \omega ^n \zeta _0 \left( \frac{1}{(1-\kappa ^{\mathbb {P}}_{1,1})(1-\kappa ^{\mathbb {P}}_{3,3})},n\right) \right) ,\\ \mathcal {V}^{\mathbb {P},(1,3)}_{n}&=\mathcal {V}^{\mathbb {P},(3,1)}_{n}=\text {Cov}^\mathbb {P}(X^{(1)}_{t+n},X^{(3)}_{t+n}|\mathcal {F}_t)\\&=\Sigma _{1,1}\Sigma _{3,3}\sum _{l=1}^{n} (1-\kappa ^{\mathbb {P}}_{1,1})^{(n-l)} (1-\kappa ^{\mathbb {P}}_{3,3})^{(n-l)} \text {Cov}^\mathbb {P}(Z^{\mathbb {P}}_{t+l,1},Z^{\mathbb {P}}_{t+l,3})\\&=\Sigma _{1,1}\Sigma _{3,3}\rho _{1,3}\left[ 1 +\zeta _0( (1-\kappa ^{\mathbb {P}}_{1,1})(1-\kappa ^{\mathbb {P}}_{3,3}),n)\right] ,\\ \mathcal {V}^{\mathbb {P},(2,3)}_{n}&=\mathcal {V}^{\mathbb {P},(3,2)}_{n}=\text {Cov}^\mathbb {P}(X^{(2)}_{t+n},X^{(3)}_{t+n}|\mathcal {F}_t)\\&=\Sigma _{2,2}\Sigma _{3,3}\sum _{l=1}^{n}(1-\kappa ^{\mathbb {P}}_{2,2})^{n-l} (1-\kappa ^{\mathbb {P}}_{3,3})^{n-l} \text {Cov}^\mathbb {P}(Z^{\mathbb {P}}_{t+l,2},Z^{\mathbb {P}}_{t+l,3})\\&\quad +\lambda \Sigma _{3,3}^2\sum _{l=1}^{n-1}\frac{ (1-\kappa ^{\mathbb {P}}_{2,2} )^{n-1}}{(1-\kappa ^{\mathbb {P}}_{3,3} )^{l}} \frac{\omega ^{l}-\omega ^{n}}{1-\omega } (1-\kappa ^{\mathbb {P}}_{3,3})^{n-l}\text {Var}^\mathbb {P}(Z^{\mathbb {P}}_{t+l,3})\\&=\Sigma _{2,2}\Sigma _{3,3} \rho _{2,3} \left[ 1+\zeta _0((1-\kappa ^{\mathbb {P}}_{2,2})(1-\kappa ^{\mathbb {P}}_{3,3}),n)\right] \\&\quad +\lambda \Sigma _{3,3}^2 \frac{(1-\kappa ^{\mathbb {P}}_{2,2} )^{n-1} (1-\kappa ^{\mathbb {P}}_{3,3} )^{n}}{1-\omega } \left( \zeta _0\left( \frac{1}{(1-\kappa ^{\mathbb {P}}_{2,2} )(1-\kappa ^{\mathbb {P}}_{3,3} )},n\right) \right. \\&\quad - \omega ^n \zeta _0\left( (1-\kappa ^{\mathbb {P}}_{3,3})^{-2} ,n\right) \bigg) \!. \end{aligned}$$

$\square$

Proof of Theorem 4.1

Using Lemma (6.9), the futures prices can be obtained from the moment generating function of the normal distribution:

$$\begin{aligned} F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3}= & {} \mathbb {E^Q}\left[ P(\mathcal {T}_2,\mathcal {T}_3)\bigg | \mathcal {F}_{\mathcal {T}_1}\right] ,\\= & {} \mathbb {E^Q}\left[ A_{\tau _3}\exp \left[ -\Delta \mathcal {B}_{\tau _3}^\top X_{\mathcal {T}_2}\right] \bigg | \mathcal {F}_{\mathcal {T}_1}\right] \\= & {} A_{\tau _3} \exp \left[ -\Delta \sum ^3_{i=1} \mathcal {B}^{(i)}_{\tau _3} \tilde{\mathcal {M}}^{(i)}_{\mathcal {T}_1,\tau _2} +\frac{\Delta ^2}{2} \mathcal {B}^\top _{\tau _3} \mathcal {V}_{\tau _2} \mathcal {B}_{\tau _3} \right] . \end{aligned}$$

This implies

$$\begin{aligned} F_{\mathcal {T}_1,\mathcal {T}_2,\mathcal {T}_3} = \tilde{A}_{\tau _2,\tau _3} \exp \left[ -\Delta \sum ^3_{i=1} \tilde{\mathcal {B}}^{(i)}_{\tau _3} X^{(i)}_{\mathcal {T}_1} \right] \end{aligned}$$

where

$$\begin{aligned} \tilde{A}_{\tau _2,\tau _3}&= A_{\tau _3} \exp \bigg [ \frac{\Delta ^2}{2} \mathcal {B}^\top _{\tau _3} \mathcal {V}_{\tau _2} \mathcal {B}_{\tau _3} -\Delta \mathcal {B}^{(2)}_{\tau _3} (\theta ^\mathbb {Q}_2-\theta ^\mathbb {Q}_3)\left( 1-(1-\lambda )^{\tau _2}\right) \\&\quad \quad -\Delta \mathcal {B}^{(2)}_{\tau _3} \lambda \theta ^{\mathbb {Q}}_3 \left( \dfrac{\zeta _0(1-\lambda ,\tau _2+1)}{1-\lambda }-\tau _2(1-\lambda )^{\tau _2-1} \right) \bigg ) \\&\qquad -\Delta \mathcal {B}^{(3)}_{\tau _3} \theta ^\mathbb {Q}_3\left( 1-(1-\lambda )^{\tau _2}\right) \bigg ],\\ \tilde{\mathcal {B}}^{(1)}_{n}&= \mathcal {B}^{(1)}_{n}, \quad \tilde{\mathcal {B}}^{(2)}_{n} = \mathcal {B}^{(2)}_{n} (1-\lambda )^{n}, \quad \tilde{\mathcal {B}}^{(3)}_{n} = \mathcal {B}^{(3)}_{n} (1-\lambda )^{n} + \mathcal {B}^{(2)}_{n} \lambda n (1-\lambda )^{n-1}. \end{aligned}$$

$\square$

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Godin, F., Eghbalzadeh, R. & Gaillardetz, P. Pricing swaptions and zero-coupon futures options under the discrete-time arbitrage-free Nelson–Siegel model. Rev Deriv Res 26, 171–206 (2023). https://doi.org/10.1007/s11147-023-09196-4

Download citation

Accepted: 31 August 2023
Published: 04 October 2023
Issue Date: October 2023
DOI: https://doi.org/10.1007/s11147-023-09196-4

Keywords

JEL Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Pricing swaptions and zero-coupon futures options under the discrete-time arbitrage-free Nelson–Siegel model

Abstract

Similar content being viewed by others

Interest Rate Derivatives: One Factor Spot Rate Models

American options and stochastic interest rates

Black’s model in a negative interest rate environment, with application to OTC derivatives

1 Introduction

2 The DTAFNS model

2.1 Risk-neutral dynamics in the DTAFNS model

Remark 2.1

2.2 Forward measure dynamics in the DTAFNS model

Proposition 2.1

Proof

Corollary 2.1

Proposition 2.2

Proof

Lemma 2.1

Proof

2.3 Physical measure dynamics in the DTAFNS model

Remark 2.2

Proposition 2.3

Remark 2.3

3 European swaption pricing

3.1 Risk-neutral pricing of European swaptions

3.2 Pricing swaptions under the forward measure

4 Zero-coupon futures and options on futures

4.1 Futures price

Theorem 4.1

4.2 Price for options on futures

Lemma 4.1

Lemma 4.2

4.3 Price for quadratic options on futures

4.4 Option expected excess returns

Lemma 4.3

5 Methods for the calibration of the DTANFS model to option prices

6 Conclusion

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendix: Proofs

Appendix: Proofs

Lemma 6.1

Proof of Lemma 6.1

Lemma 6.2

Proof of Lemma 6.2

Lemma 6.3

Proof of Lemma 6.3

Lemma 6.4

Proof of Lemma 6.4

Lemma 6.5

Proof of Lemma 6.5

Lemma 6.6

Proof of Lemma 6.6

Proof of Proposition 2.1

Lemma 6.7

Proof of Lemma 6.7

Lemma 6.8

Proof of Lemma 6.8

Proof of Proposition 2.2

Proof of Lemma 2.1

Lemma 6.9

Proof of Lemma 6.9

Proof of Proposition 2.3

Proof of Theorem 4.1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

JEL Classification

Search

Navigation