Linear credit risk models

Ackerer, Damien; Filipović, Damir

doi:10.1007/s00780-019-00409-z

Published: 04 October 2019

Volume 24, pages 169–214, (2020)

Finance and Stochastics

Abstract

We introduce a novel class of credit risk models in which the drift of the survival process of a firm is a linear function of the factors. The prices of defaultable bonds and credit default swaps (CDS) are linear–rational in the factors. The price of a CDS option can be uniformly approximated by polynomials in the factors. Multi-name models can produce simultaneous defaults, generate positively as well as negatively correlated default intensities, and accommodate stochastic interest rates. A calibration study illustrates the versatility of these models by fitting CDS spread time series. A numerical analysis validates the efficiency of the option price approximation method.

1 Introduction

We introduce a novel class of flexible and tractable reduced-form models for the term structure of credit risk, the linear credit risk models. We directly specify the survival process of a firm, that is, its conditional survival probability given the economic background information. Specifically, we assume a multivariate factor process with a linear drift and let the drift of the survival process be linear in the factors. Prices of defaultable bonds and credit default swaps (CDS) are given in closed form by linear–rational functions in the factors. By linearity, the same result holds for the prices of CDSs on indices (CDISs). The implied default intensity is a linear–rational function of the factors. In contrast, the price of a CDS in an affine default intensity model is a sum of exponential-affine functions of the factor process whose coefficients solve nonlinear ordinary differential equations that, in general, have no closed-form solution. In addition, the linear credit risk models offer new tractable features such as a multi-name model with negatively correlated default intensities.

Within the linear framework, we define the linear hypercube (LHC) model which is a single-name model. The factor process is diffusive with quadratic diffusion function so that it takes values in a hypercube whose edges’ length is given by the survival process. The quadratic diffusion function is concave and bi-monotonic. This feature allows factors to virtually jump between low and high values. This facilitates the persistence and likelihood of term structure shifts. The factors’ volatility parameters do not enter the bond and CDS pricing formulas, yet they impact the volatility of CDS spreads and thus affect CDS option prices. This may facilitate the joint calibration of credit spread and option price time series. We discuss in detail the one-factor LHC model and compare it with the one-factor affine default intensity model. We provide an identifiable canonical representation and the market price of risk specifications that preserve the linear drift of the factors.

We present a price approximation methodology for European-style options on credit risky underlyings that exploits the compactness of the state space and the closed form of the conditional moments of the factor process. First, by the Stone–Weierstrass theorem, any continuous payoff function on the compact state space can be approximated by a polynomial to any given level of accuracy. Second, the conditional expectation of any polynomial in the factors is a polynomial in the prevailing factor values. In consequence, the price of a CDS option can be uniformly approximated by polynomials in the factors. This method also applies to the computation of credit valuation adjustments.

We build multi-name models by letting the survival processes be linear and polynomial combinations of independent LHC models. Bond and CDS prices are still linear–rational, but with respect to an extended factor representation. These direct extensions can easily accommodate the inclusion of new factors and new firms. Stochastic short-rate models with a similar specification as the survival processes can be introduced while preserving the setup tractability. Simultaneous defaults can be generated either by introducing a common jump process in the survival processes or a stochastic clock.

We perform an empirical and numerical analysis of the LHC model. Assuming a parsimonious cascading drift structure, we fit two-factor and three-factor LHC models to ten-year-long time series of weekly CDS spreads on an investment grade and a high yield firm. The three-factor model is able to capture the complex term structure dynamics remarkably well and performs significantly better than the two-factor model. We illustrate the numerical efficiency of the option pricing method by approximating the prices of CDS options with different moneyness. Polynomials of relatively low orders are sufficient to obtain accurate approximations for in-the-money options. Out-of-the-money options typically require a higher order. We derive the pricing formulas for CDIS options and tranches on a homogeneous portfolio to illustrate that their prices can also be approximated with similar techniques. In general, the pricing of CDIS options and tranches requires manipulating multivariate polynomial bases of possibly large dimensions. In practice, computationally efficient multi-name credit derivative pricing necessitates the use of special algorithms which are not studied in this paper.

We now review some of the related literature. Our approach follows a standard doubly stochastic construction of default times as described in Elliott et al. [21] or Bielecki and Rutkowski [7, Sect. 6.5]. The early contributions by Lando [38] and Duffie and Singleton [19] already make use of affine factor processes. In contrast, the factor process in the LHC model is a strictly non-affine polynomial diffusion, whose general properties are studied in [23]. The stochastic volatility models developed in Hull and White [31] and Ackerer et al. [1] are two other examples of non-affine polynomial models. Factors in the LHC models have a compact support and can exhibit jump-like dynamics similar to the multivariate Jacobi process introduced by Gourieroux and Jasiak [29]. Our approach has some similarities with the linearity-generating process by Gabaix [27] and the linear–rational models by Filipović et al. [25]. These models also exploit the tractability of factor processes with linear drift, but focus on the pricing of non-defaultable assets. To our knowledge, we are the first to model directly the survival process of a firm with linear drift characteristics.

Options on CDS contracts are complex derivatives and intricate to price. The pricing and hedging of credit derivatives in a generic hazard process framework is discussed in Bielecki et al. [4, Sect. 4], applied to CDS options in Bielecki et al. [5], and specialised to the square-root diffusion factor process in Bielecki et al. [6]. More recently Brigo and El-Bachir [10] developed a semi-analytical expression for CDS option prices in the context of a shifted square-root jump-diffusion default intensity model that was introduced in Brigo and Alfonsi [8]. Another strand of the literature has focused on developing market models in the spirit of LIBOR market models. We refer the interested reader to Schönbucher [48], Hull and White [32], Schönbucher [47], Jamshidian [34] and Brigo and Morini [11]. Black–Scholes-like formulas are then obtained for the prices of CDS options by assuming, for example, that the underlying CDS spread follows a geometric Brownian motion under the survival measure. Although offering more tractability, this approach makes it difficult, if not impossible, to consistently price multiple instruments exposed to the same source of credit risk. Di Graziano and Rogers [16] introduced a framework where they obtained closed-form expressions similar to ours for CDS prices, but under the assumption that the firm default intensity is driven by a continuous-time finite-state irreducible Markov chain.

Another important approach to default risk modelling is the use of subordinators to model the cumulative hazard process. In particular, it has been shown that time-inhomogeneous models can reproduce CDIS tranche prices well. For more details on these models we refer to Kokholm and Nicolato [37], Sun et al. [51], and references therein.

The simulation-based work by Peng and Kou [44] shows that a hazard-rate model with systemic and idiosyncratic risk factors can fit both CDS and CDIS tranches, and therefore confirms that a bottom-up model with common risk factors can yield an accurate and fully consistent risk-management framework. A tractable alternative to price multi-name credit derivatives is to model the dependence between defaults with a copula function, as for example in Li [41], Laurent and Gregory [40] and Ackerer and Vatter [2]. However, these models are by construction static, require repeated calibration and in general become intractable when combined with stochastic survival processes as in Schönbucher and Schubert [49].

The idea of approximating option prices by power series can be traced back to Jarrow and Rudd [35]. However, most of the previous literature has focussed on approximating the transition density function of the underlying process; see for example Corrado and Su [14] and Filipović et al. [26]. In contrast, we approximate directly the payoff function by a polynomial.

The remainder of the paper is structured as follows. Section 2 presents the linear credit risk framework along with generic pricing formulas. Section 3 describes the single-name LHC model. The numerical and empirical analysis of the LHC model is in Sect. 4. Multi-name models as well as models with stochastic interest rates are discussed in Sect. 5. Section 6 concludes. The proofs are collected in the Appendix, as well as some additional results on market price of risk specifications that preserve the linear drift of the factors, and on the two-dimensional Chebyshev interpolation.

2 The linear framework

We introduce the linear credit risk model framework and derive closed-form expressions for defaultable bond prices and credit default swap spreads. We also discuss the pricing of credit index tranches, credit default swap options and credit valuation adjustments.

2.1 Survival process specification

We fix a probability space $(\Omega , {\mathcal{F}}, {\mathbb{Q}})$ equipped with a right-continuous filtration ${{\mathbb{F}}=({\mathcal{F}} _{t})_{t\ge 0}}$ representing the economic background information, and where ${\mathbb{Q}}$ is the risk-neutral pricing measure. We consider $N$ firms and let $S^{i}$ be the survival process of firm $i$. This is a right-continuous ${\mathbb{F}}$-adapted and nonincreasing positive process with ${S_{0}^{i} = 1}$. Let $U_{1},\dots ,U_{N}$ be independent standard uniform random variables that are independent of ${\mathcal{F}}_{\infty }$. For each firm $i$, we define the random default time

$$ \tau _{i} = \inf \{t\geq 0 : S^{i}_{t} \leq U_{i} \}, $$

which is infinity if the set is empty. Let $({\mathcal{H}}^{i}_{t})_{t \ge 0}$ be the filtration generated by the indicator process which is one as long as firm $i$ has not defaulted by time $t$ and zero afterwards, $H_{t}^{i} = 1_{\{\tau _{i} > t\}}$ for $t\ge 0$. The default time $\tau _{i}$ is a stopping time in the enlarged filtration $({\mathcal{F}}_{t} \vee {\mathcal{H}}^{i}_{t})_{t\ge 0} $. It is ${\mathbb{F}}$-doubly stochastic in the sense that

$$ {\mathbb{Q}}[\tau _{i}>t \, | \, {\mathcal{F}}_{\infty }] = {\mathbb{Q}}[ S^{i}_{t} > U_{i} \, | \, {\mathcal{F}}_{\infty }] = S^{i}_{t}. $$

The filtration $({\mathcal{G}}_{t})_{t\ge 0}=({\mathcal{F}}_{t} \vee {\mathcal{H}}^{1}_{t} \vee \cdots \vee {\mathcal{H}}^{N}_{t})_{t \ge 0}$ contains all the information about the occurrence of firm defaults, as well as the economic background information. Henceforward we omit the index $i$ of the firm and refer to any of the $N$ firms as long as there is no ambiguity.
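The construction of $\tau _{i}$ can be checked numerically. The following is a minimal sketch, not part of the original analysis, assuming a deterministic survival curve $S_{t}=\mathrm{e}^{-0.1 t}$ observed on a time grid; the function names and the hazard value are hypothetical. It draws the uniform threshold $U$, records the first grid time at which $S_{t}\le U$, and compares the simulated frequency of $\{\tau > t\}$ with $S_{t}$.

```python
import math
import random

def default_time(survival, u, dt=0.05, horizon=10.0):
    """First grid time t with survival(t) <= u; math.inf if none up to horizon."""
    for k in range(int(round(horizon / dt)) + 1):
        t = k * dt
        if survival(t) <= u:
            return t
    return math.inf

def survival_frequency(survival, t, n_paths=20000, seed=0):
    """Monte Carlo estimate of Q[tau > t] for a deterministic survival curve."""
    rng = random.Random(seed)
    alive = sum(default_time(survival, rng.random()) > t for _ in range(n_paths))
    return alive / n_paths

hazard = 0.1  # hypothetical constant hazard rate, chosen for illustration only
survival = lambda t: math.exp(-hazard * t)

estimate = survival_frequency(survival, t=5.0)
print(estimate, survival(5.0))  # the two numbers should be close
```

With a stochastic survival process, the same threshold rule applies path by path, with $U$ drawn independently of the path.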

In a linear credit risk model, the survival process of a firm is defined by

$$ S_{t} = a^{\top }Y_{t}, \qquad t\ge 0, $$

(2.1)

for some firm specific parameter $a\in {\mathbb{R}}^{n}_{+}$ and some common factor process $(Y,X)$ taking values in ${\mathbb{R}}^{n}_{+} \times {\mathbb{R}}^{m}$ with linear drift of the form

$$\begin{aligned} dY_{t} &= ( c Y_{t} + \gamma X_{t})\, dt + dM^{Y}_{t}, \end{aligned}$$

(2.2)

$$\begin{aligned} dX_{t} &= ( b Y_{t} + \beta X_{t})\,dt + dM^{X}_{t} \end{aligned}$$

(2.3)

for some $c\in {\mathbb{R}}^{n\times n}$, $b\in {\mathbb{R}}^{m\times n}$, $\gamma \in {\mathbb{R}}^{n\times m}$, $\beta \in {\mathbb{R}} ^{m\times m}$, $m$-dimensional ${\mathbb{F}}$-martingale $M^{X}$ and $n$-dimensional ${\mathbb{F}}$-martingale $M^{Y}$. The process $S$ being positive and nonincreasing, we necessarily have that its martingale component $M^{S} = a^{\top }M^{Y}$ is of finite variation and thus purely discontinuous (see [33, Lemma I.4.14]) and that $- S_{t-} < \Delta M_{t}^{S} \le 0 $ for all $t\ge 0$ because $\Delta S_{t} = \Delta M_{t}^{S}$. This observation motivates the decomposition of the factor process into a component $X$ and a component $Y$ with finite variation. Although we do not specify further the dynamics of the factor process at the moment, it is important to emphasise that additional conditions should be satisfied to ensure that $S$ is a valid survival process.

Remark 1

In practice, we consider a componentwise nonincreasing process $Y$ with $Y_{0}=\mathbf{1}$. Survival processes can then easily be constructed by choosing any vector $a\in {\mathbb{R}}^{n}_{+}$ with $a^{\top }{\mathbf{1}} =1$.

The linear drift of the process $(Y,X)$ implies that the ${\mathcal{F}} _{t}$-conditional expectation of $(Y_{u},X_{u})$ is linear of the form

$$ {\mathbb{E}}\left[ \begin{pmatrix} Y_{u} \\ X_{u} \end{pmatrix} \, \middle| \, {\mathcal{F}}_{t} \right] = \mathrm{e}^{A(u-t)} \begin{pmatrix} Y_{t} \\ X_{t} \end{pmatrix}, \qquad t\le u, $$

(2.4)

where the $(m+n)\times (m+n)$-matrix $A$ is defined by

$$ A= \begin{pmatrix} c & \gamma \\ b & \beta \end{pmatrix} . $$

(2.5)
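As an illustration of (2.4), the conditional mean only requires a matrix exponential. The sketch below is a minimal pure-Python version under stated assumptions: the helper `mat_exp` is a simple scaled Taylor series suitable for small matrices, and the parameter values ($n=m=1$, $b=0$) are hypothetical. With $b=0$ the matrix $A$ is upper triangular, so the result can be compared with the explicit exponential of a triangular $2\times 2$ matrix.

```python
import math

def mat_mul(P, Q):
    """Product of two matrices stored as lists of rows."""
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]

def mat_exp(A, terms=30, squarings=8):
    """exp(A) by a Taylor series on A / 2**squarings followed by repeated squaring."""
    n = len(A)
    scaled = [[v / 2.0 ** squarings for v in row] for row in A]
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms + 1):
        term = [[v / k for v in row] for row in mat_mul(term, scaled)]
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    for _ in range(squarings):
        result = mat_mul(result, result)
    return result

def conditional_mean(A, state, horizon):
    """E[(Y_u, X_u) | F_t] = exp(A (u - t)) (Y_t, X_t), with horizon = u - t."""
    E = mat_exp([[v * horizon for v in row] for row in A])
    return [sum(E[i][j] * state[j] for j in range(len(state)))
            for i in range(len(state))]

# Hypothetical parameters: c, gamma in the first row, b, beta in the second.
c, gamma, b, beta = -0.05, -0.03, 0.0, -0.5
A = [[c, gamma], [b, beta]]
mean_y, mean_x = conditional_mean(A, state=[0.9, 0.2], horizon=2.0)
print(mean_y, mean_x)
```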

Remark 2

If $S$ is absolutely continuous, so that $a^{\top }dM^{Y}_{t}=0$ for all $t\ge 0$, the corresponding default intensity $\lambda $, which is derived from the relation $S_{t} = \mathrm{e}^{-\int _{0}^{t} \lambda _{s} \,ds}$, is linear–rational in $(Y,X)$ of the form

$$ \lambda _{t} = -\frac{a^{\top }( c Y_{t} + \gamma X_{t})}{S_{t}} . $$
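The expression for $\lambda $ follows in one step. Differentiating $S_{t} = \mathrm{e}^{-\int _{0}^{t} \lambda _{s} \,ds}$ and comparing with the drift of $S=a^{\top }Y$ obtained from (2.2), under the assumption $a^{\top }dM^{Y}_{t}=0$, gives

$$ dS_{t} = -\lambda _{t} S_{t}\,dt \quad \text{and} \quad dS_{t} = a^{\top }( c Y_{t} + \gamma X_{t})\,dt \quad \Longrightarrow \quad \lambda _{t} = -\frac{a^{\top }( c Y_{t} + \gamma X_{t})}{S_{t}}. $$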

In this framework, the default times are correlated because the survival processes are driven by common factors. Simultaneous defaults are possible and may be caused by the martingale component of $Y$ that forces the survival processes to jump downward at the same time. Additionally, and in contrast to affine default intensity models, the linear credit risk framework allows negative correlation between default intensities as illustrated by the following stylised example.

Example 3

Consider the factor process $(Y,X)$ taking values in ${\mathbb{R}} _{+}^{2}\times {\mathbb{R}}$ defined by

$$\begin{aligned} dY_{t} & = \frac{\epsilon }{2} \left ( \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix} Y_{t} + \begin{pmatrix} -1 \\ 1 \end{pmatrix} X_{t} \right ) dt, \\ dX_{t} & = -\kappa X_{t}\, dt + \sigma \sqrt{(\mathrm{e}^{-\epsilon t} - X_{t})(\mathrm{e}^{-\epsilon t} + X_{t})}\, dW_{t} \end{aligned}$$

for some $\kappa >\epsilon >0$, $\sigma >0$, $X_{0}\in [-1,1]$ and an ${\mathbb{F}}$-adapted univariate Brownian motion $W$. The process $X$ takes values in the interval $[-\mathrm{e}^{-\epsilon t}, \mathrm{e}^{-\epsilon t}]$ at time $t$. Let $N=2$ survival processes be defined by $S^{1}_{t} = Y_{1t}$ and $S^{2}_{t} = Y_{2t}$ for all $t\ge 0$, so that the implied default intensities of the two firms are given by

$$ \lambda ^{1}_{t} = \frac{\epsilon }{2}\left (1 + \frac{X_{t}}{Y_{1t}}\right ) \quad \text{and} \quad \lambda ^{2}_{t} = \frac{\epsilon }{2}\left (1-\frac{X_{t}}{Y_{2t}}\right ), \qquad t \ge 0. $$

This results in $d\langle \lambda ^{1},\lambda ^{2}\rangle _{t} \le 0$ and $d\langle \lambda ^{1},\lambda ^{2}\rangle _{t} < 0$ with positive probability, and $\lambda _{t}^{1},\lambda _{t}^{2}\le \epsilon $. Moreover, the default intensities $\lambda ^{1}$ and $\lambda ^{2}$ both mean-revert towards $\epsilon /2$. The proof of these statements is given in Appendix A.
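The opposite dependence of the two intensities on $X$ can be seen directly from the formulas of Example 3. The snippet below is a small sketch with hypothetical numerical values (state taken at $t=0$, where $Y_{0}=\mathbf{1}$ and $X_{0}\in [-1,1]$); it only evaluates the two expressions and is not a simulation of the model.

```python
def intensities(eps, y1, y2, x):
    """Default intensities of the two firms in Example 3 for a given state."""
    lam1 = 0.5 * eps * (1.0 + x / y1)
    lam2 = 0.5 * eps * (1.0 - x / y2)
    return lam1, lam2

eps = 0.04  # hypothetical value of epsilon
base = intensities(eps, y1=1.0, y2=1.0, x=0.0)
shifted = intensities(eps, y1=1.0, y2=1.0, x=0.4)

# Raising X pushes firm 1's intensity up and firm 2's intensity down.
print(base, shifted)
```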

2.2 Defaultable bonds

We consider securities with notional amount equal to one and exposed to the credit risk of a reference firm. We assume a constant risk-free interest rate equal to $r$ so that the time-$t$ price of the risk-free zero-coupon bond with maturity $t_{M}$ and notional amount one is given by $\mathrm{e}^{-r(t_{M}-t)}$. The following result gives a closed-form expression for the price of a defaultable bond with constant recovery rate at maturity.

Proposition 4

The time-$t$ price of a defaultable zero-coupon bond with maturity $t_{M}$ and recovery $\delta \in [0,1]$ at maturity is

$$\begin{aligned} B_{\mathrm{M}}(t,t_{M}) & = {\mathbb{E}}\big[ \mathrm{e}^{-r(t_{M}-t)} \big( 1_{\{\tau > t_{M}\}} + \delta 1_{\{\tau \le t_{M}\}} \big) \, \big| \, {\mathcal{G}}_{t} \big] \\ & = (1-\delta ) B_{\mathrm{Z}}(t,t_{M}) + 1_{\{\tau > t\}} \delta \mathrm{e}^{-r(t_{M}-t)}, \end{aligned}$$

where $B_{\mathrm{Z}}(t,t_{M}) = \mathrm{e}^{-r(t_{M}-t)} {\mathbb{E}}[ 1_{\{\tau > t_{M}\}} \, | \, {\mathcal{G}}_{t}]$ denotes the time-$t$ price of a defaultable zero-coupon bond with maturity $t_{M}$ and zero recovery. It is of the form

$$ B_{\mathrm{Z}}(t,t_{M}) = 1_{\{\tau > t\}} \frac{1}{a^{\top }Y_{t}} \psi _{\mathrm{Z}}(t,t_{M})^{\top } \begin{pmatrix} Y_{t} \\ X_{t} \end{pmatrix}, $$

(2.6)

where the vector $\psi _{\mathrm{Z}}(t,t_{M})\in {\mathbb{R}}^{n+m}$ is given by

$$ \psi _{\mathrm{Z}}(t,t_{M})^{\top } = \mathrm{e}^{-r(t_{M}-t)} \begin{pmatrix} a^{\top } & \mathbf{0}_{m}^{\top } \end{pmatrix} \mathrm{e}^{A(t_{M}-t)}, $$

where the $m$-dimensional vector $\mathbf{0}_{m}$ contains only zeros.
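Formula (2.6) can be evaluated with a few matrix operations. The following minimal sketch, under stated assumptions, uses the drift matrix of Example 3 with hypothetical parameter values and a simple pure-Python matrix exponential (`mat_exp` is an illustrative helper, adequate only for small matrices). For $X_{t}=0$ and $Y_{t}=\mathbf{1}$ the price reduces to $\mathrm{e}^{-(r+\epsilon /2)(t_{M}-t)}$, which serves as a check.

```python
import math

def mat_mul(P, Q):
    """Product of two matrices stored as lists of rows."""
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]

def mat_exp(A, terms=30, squarings=8):
    """exp(A) by a Taylor series on A / 2**squarings followed by repeated squaring."""
    n = len(A)
    scaled = [[v / 2.0 ** squarings for v in row] for row in A]
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms + 1):
        term = [[v / k for v in row] for row in mat_mul(term, scaled)]
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    for _ in range(squarings):
        result = mat_mul(result, result)
    return result

def zero_recovery_bond(a, A, r, y, x, gap):
    """B_Z(t, t_M) on {tau > t} from (2.6), with gap = t_M - t."""
    state = list(y) + list(x)
    lifted = list(a) + [0.0] * len(x)                      # (a^T, 0_m^T)
    E = mat_exp([[v * gap for v in row] for row in A])
    psi = [math.exp(-r * gap) * sum(lifted[i] * E[i][j] for i in range(len(state)))
           for j in range(len(state))]
    survival = sum(ai * yi for ai, yi in zip(a, y))        # S_t = a^T Y_t
    return sum(p * s for p, s in zip(psi, state)) / survival

# Hypothetical parameters for the drift of Example 3 (kappa > eps > 0).
eps, kappa, r = 0.04, 0.5, 0.02
A = [[-eps / 2, 0.0, -eps / 2],
     [0.0, -eps / 2, eps / 2],
     [0.0, 0.0, -kappa]]

flat = zero_recovery_bond([1.0, 0.0], A, r, y=[1.0, 1.0], x=[0.0], gap=5.0)
firm1 = zero_recovery_bond([1.0, 0.0], A, r, y=[1.0, 1.0], x=[0.3], gap=5.0)
firm2 = zero_recovery_bond([0.0, 1.0], A, r, y=[1.0, 1.0], x=[-0.3], gap=5.0)
print(flat, firm1, firm2)
```

A positive $X_{t}$ raises the intensity of firm 1, so its bond is cheaper than in the flat case; by symmetry of the drift, firm 2 with $-X_{t}$ has the same price.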

The next result shows that the price of a defaultable bond paying a constant recovery rate at default can also be retrieved in closed form.

Proposition 5

The time-$t$ price of a defaultable zero-coupon bond with maturity $t_{M}$ and recovery $\delta \in [0,1]$ at default is

$$ B_{\mathrm{D}}(t,t_{M}) = {\mathbb{E}}\big[ \mathrm{e}^{-r(t_{M}-t)} 1_{\{\tau > t_{M}\}} + \delta \mathrm{e}^{-r(\tau -t)} 1_{\{t < \tau \le t_{M}\}} \, \big| \, {\mathcal{G}}_{t} \big] = B_{\mathrm{Z}}(t,t_{M}) + \delta C_{\mathrm{D}}(t,t_{M}), $$

where $C_{\mathrm{D}}(t,t_{M}) = {\mathbb{E}}[ \mathrm{e}^{-r(\tau -t)} 1_{\{t < \tau \le t_{M}\}} \, | \, {\mathcal{G}}_{t}]$ denotes the time-$t$ price of a contingent claim paying one at default if this occurs between dates $t$ and $t_{M}$. It is of the form

$$ C_{\mathrm{D}}(t,t_{M}) = 1_{\{\tau > t\}} \frac{1}{a^{\top }Y_{t}} \psi _{\mathrm{D}}(t,t_{M})^{\top } \begin{pmatrix} Y_{t} \\ X_{t} \end{pmatrix}, $$

(2.7)

where the vector $\psi _{\mathrm{D}}(t,t_{M})\in {\mathbb{R}}^{n+m}$ is given by

$$ \psi _{\mathrm{D}}(t,t_{M})^{\top } = - a^{\top } \begin{pmatrix} c & \gamma \end{pmatrix} \int _{t}^{t_{M}} \mathrm{e}^{A_{*}(s-t)} \,ds, $$

(2.8)

where $A_{*}=A-r \operatorname{\mathrm{Id}}$.

The price of a security whose only cash flow is proportional to the default time is given in the following corollary. It is used to compute the expected accrued interest at default for some contingent securities such as CDSs.

Corollary 6

The time-$t$ price of a contingent claim paying $\tau $ at default if this occurs between dates $t$ and $t_{M}$ is of the form

$$ C_{\mathrm{D_{*}}}(t,t_{M}) = {\mathbb{E}}\big[ \tau \mathrm{e}^{-r(\tau -t)} 1_{\{\tau \le t_{M}\}} \, \big| \, {\mathcal{G}}_{t} \big] = 1_{\{\tau > t\}} \frac{1}{a^{\top }Y_{t}} \psi _{\mathrm{D_{*}}}(t,t_{M})^{\top } \begin{pmatrix} Y_{t} \\ X_{t} \end{pmatrix}, $$

(2.9)

where the vector $\psi _{\mathrm{D_{*}}}(t,t_{M})\in {\mathbb{R}}^{n+m}$ is given by

$$ \psi _{\mathrm{D_{*}}}(t,t_{M})^{\top } = - a^{\top } \begin{pmatrix} c & \gamma \end{pmatrix} \int _{t}^{t_{M}} s\, \mathrm{e}^{A_{*}(s-t)} \,ds. $$

(2.10)

Note the presence of the factor $s$ in the integrand on the right-hand side of (2.10), which is absent in (2.8).

Remark 7

By setting $r=0$ in (2.9), we get a closed-form expression for ${\mathbb{E}}[ \tau 1_{\{\tau \le t_{M}\}} \, | \, {\mathcal{G}}_{t}]$. This expression can be used to price a defaultable bond whose recovery value at maturity $t_{M}$ depends on the default time $\tau $ in a linear way, via

$$ B_{\mathrm{D_{0}}}(t,t_{0},t_{M}) = B_{\mathrm{Z}}(t,t_{M}) + \mathrm{e}^{-r(t_{M}-t)} {\mathbb{E}}\bigg[ \Big( \delta _{0} \frac{\tau - t_{0}}{t_{M}-t_{0}} + \delta _{1} \Big) 1_{\{\tau \le t_{M}\}} \, \bigg| \, {\mathcal{G}}_{t} \bigg] $$

for some parameters $\delta _{0},\delta _{1}\ge 0$ with $\delta _{0} + \delta _{1} \le 1$ and for some time $t_{0}\le t$.

The following lemma shows that the pricing formulas (2.7)–(2.10) further simplify under an additional condition.

Lemma 8

Assume that the matrix $A_{*}$ is invertible. Then we have the closed-form expressions

$$\begin{aligned} \psi _{\mathrm{D}}(t,t_{M})^{\top } & = - a^{\top } \begin{pmatrix} c & \gamma \end{pmatrix} A_{*}^{-1} \big( \mathrm{e}^{A_{*}(t_{M}-t)} - \operatorname{\mathrm{Id}} \big), \\ \psi _{\mathrm{D_{*}}}(t,t_{M})^{\top } & = - a^{\top } \begin{pmatrix} c & \gamma \end{pmatrix} \Big( (t_{M}-t) A_{*}^{-1} \mathrm{e}^{A_{*}(t_{M}-t)} \\ & \phantom{=:} + A_{*}^{-1} \big( \operatorname{\mathrm{Id}} t - A_{*}^{-1} \big) \big( \mathrm{e}^{A_{*}(t_{M}-t)} - \operatorname{\mathrm{Id}} \big) \Big), \end{aligned}$$

where $\operatorname{\mathrm{Id}}$ is the $(n+m)$-dimensional identity matrix.
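For the reader's convenience, the second expression can be recovered from (2.10) by a change of variable; this short derivation is an added illustration. With $u=s-t$ and $\Delta =t_{M}-t$, and using that all matrices involved are functions of $A_{*}$ and therefore commute,

$$\begin{aligned} \int _{t}^{t_{M}} s\, \mathrm{e}^{A_{*}(s-t)} \,ds & = \int _{0}^{\Delta } u\, \mathrm{e}^{A_{*}u} \,du + t \int _{0}^{\Delta } \mathrm{e}^{A_{*}u} \,du \\ & = \Delta A_{*}^{-1} \mathrm{e}^{A_{*}\Delta } - A_{*}^{-2} \big( \mathrm{e}^{A_{*}\Delta } - \operatorname{\mathrm{Id}} \big) + t A_{*}^{-1} \big( \mathrm{e}^{A_{*}\Delta } - \operatorname{\mathrm{Id}} \big) \\ & = \Delta A_{*}^{-1} \mathrm{e}^{A_{*}\Delta } + A_{*}^{-1} \big( \operatorname{\mathrm{Id}} t - A_{*}^{-1} \big) \big( \mathrm{e}^{A_{*}\Delta } - \operatorname{\mathrm{Id}} \big), \end{aligned}$$

where the second line uses integration by parts for the first integral. As a sanity check in the scalar case $n=1$, $m=0$, $a=1$, $c=-\lambda $ (a constant default intensity $\lambda $, an assumption made only for this check), the first expression gives the familiar value $\psi _{\mathrm{D}}(t,t_{M}) = \frac{\lambda }{\lambda +r} \big( 1-\mathrm{e}^{-(\lambda +r)(t_{M}-t)} \big)$.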

This is a remarkable result since the prices of contingent cash flows become closed-form expressions composed of basic matrix operations and are thus easily computed. Closed-form formulas for defaultable securities render the linear framework appealing for large-scale applications, for example with a large number of firms and contracts, in comparison to standard affine default intensity models that in general require the use of additional numerical methods. For illustration, assume that the survival process $S$ is absolutely continuous so that it admits the default intensity $\lambda $ as in Remark 2. Then $C_{\mathrm{D}}(t,t_{M})$ can be rewritten as

$$ C_{\mathrm{D}}(t,t_{M}) = 1_{\{\tau > t\}} \int _{t}^{t_{M}} \mathrm{e}^{-r(u-t)} {\mathbb{E}}\big[ \lambda _{u} \mathrm{e}^{-\int _{t}^{u} \lambda _{s} \,ds} \, \big| \, {\mathcal{F}}_{t} \big] \,du. $$

With affine default intensity models, the expectation to be integrated requires solving Riccati equations, which have a closed-form solution only when the default intensity is driven by a sum of independent univariate CIR processes. Numerical methods such as finite difference are usually employed to compute the expectation with time-$u$ cash flow for $u\in [t,t_{M}]$. The integral can then only be approximated by means of another numerical method such as quadrature, that necessitates solving the corresponding ordinary differential equations at many different points $u$. For more details on affine default intensity models, we refer to Duffie and Singleton [20, Sect. 3.4], Filipović [22, Sect. 12.3] and Lando [39, Sect. 5].

2.3 Credit default swaps

We derive closed-form expressions for credit default swaps (CDS) on a single firm and multiple firms. We conclude the section with a discussion of factors unspanned by bonds and CDS prices.

A single-name CDS is an insurance contract that pays at default the realised loss on a reference bond—the protection leg—in exchange for periodic payments that stop after default—the premium leg. We consider the discrete tenor structure $t \le t_{0} < t _{1} < \cdots < t_{M}$ and a contract offering default protection from date $t_{0}$ to date $t_{M}$. When $t< t_{0}$, the contract is usually called a knock-out forward CDS and generates cash flows only if the firm has not defaulted by time $t_{0}$. We consider a CDS contract with notional amount equal to one. The time-$t$ value of the premium leg with spread $k$ is given by $k V_{\mathrm{prem}}(t,t_{0},t_{M})$, where

$$ V_{\mathrm{prem}}(t,t_{0},t_{M}) = V_{\mathrm{coup}}(t,t_{0},t_{M})+ V _{\mathrm{ai}}(t,t_{0},t_{M}) $$

is the sum of the value of coupon payments before default,

$$ V_{\mathrm{coup}}(t,t_{0},t_{M}) = \sum _{j=1}^{M} {\mathbb{E}}\big[ \mathrm{e}^{-r(t_{j}-t)} (t_{j}-t_{j-1}) 1_{\{t_{j} < \tau \}} \, \big| \, {\mathcal{G}}_{t} \big], $$

and the value of the accrued coupon payment at the time of default,

$$ V_{\mathrm{ai}}(t,t_{0},t_{M}) = \sum _{j=1}^{M} {\mathbb{E}}\big[ \mathrm{e}^{-r(\tau -t)} (\tau -t_{j-1}) 1_{\{t_{j-1} < \tau \le t_{j}\}} \, \big| \, {\mathcal{G}}_{t} \big]. $$

The time-$t$ value of the protection leg is

$$ V_{\mathrm{prot}}(t,t_{0},t_{M}) = (1-\delta ) {\mathbb{E}}\big[ \mathrm{e}^{-r(\tau -t)} 1_{\{t_{0} < \tau \le t_{M}\}} \, \big| \, {\mathcal{G}}_{t} \big], $$

where $\delta \in [0,1]$ denotes the constant recovery rate at default. This specification of payments is in line with the ISDA model; see White [52]. The (forward) CDS spread $\mathrm{CDS}(t,t _{0},t_{M})$ is the spread $k$ that makes the premium leg and the protection leg equal in value at time $t$, that is,

$$ {\mathrm{CDS}}(t,t_{0},t_{M}) = \frac{V_{\mathrm{prot}}(t,t_{0},t_{M})}{V _{\mathrm{prem}}(t,t_{0},t_{M})}. $$

Proposition 9

The values of the protection and premium legs are given by

$$\begin{aligned} V_{\mathrm{prot}}(t,t_{0},t_{M}) & = 1_{\{\tau > t\}} \frac{1}{S_{t}} \psi _{\mathrm{prot}}(t,t_{0},t_{M})^{\top } \begin{pmatrix} Y_{t} \\ X_{t} \end{pmatrix}, \\ V_{\mathrm{prem}}(t,t_{0},t_{M}) & = 1_{\{\tau > t\}} \frac{1}{S_{t}} \psi _{\mathrm{prem}}(t,t_{0},t_{M})^{\top } \begin{pmatrix} Y_{t} \\ X_{t} \end{pmatrix}, \end{aligned}$$

where the vectors $\psi _{\mathrm{prot}}(t,t_{0},t_{M}), \psi _{\mathrm{prem}}(t,t_{0},t_{M})\in {\mathbb{R}}^{n+m}$ are given by

$$\begin{aligned} \psi _{\mathrm{prot}}(t,t_{0},t_{M}) & = (1-\delta )\big(\psi _{ \mathrm{D}}(t,t_{M}) - \psi _{\mathrm{D}}(t,t_{0}) \big), \\ \psi _{\mathrm{prem}}(t,t_{0},t_{M}) & = \sum _{j=1}^{M} (t_{j} - t _{j-1})\psi _{\mathrm{Z}}(t,t_{j}) + \psi _{\mathrm{D_{*}}}(t,t_{M}) - \psi _{\mathrm{D_{*}}}(t,t_{0}) \\ & \phantom{=:} - t_{M-1} \psi _{\mathrm{D}}(t,t_{M}) + \sum _{j=1}^{M-1} (t_{j} - t _{j-1}) \psi _{\mathrm{D}}(t,t_{j}) + t_{0} \psi _{\mathrm{D}}(t,t_{0}). \end{aligned}$$

As a consequence of Proposition 9, the CDS spread is given by a readily available linear–rational expression, namely

$$ {\mathrm{CDS}}(t,t_{0},t_{M}) = 1_{\{\tau > t\}} \frac{\psi _{\mathrm{prot}}(t,t_{0},t_{M})^{\top } \begin{pmatrix} Y_{t} \\ X_{t} \end{pmatrix}}{\psi _{\mathrm{prem}}(t,t_{0},t_{M})^{\top } \begin{pmatrix} Y_{t} \\ X_{t} \end{pmatrix}}. $$

This is a remarkably simple expression that allows us to see how the factors $(Y,X)$ affect the CDS spread through the vectors $\psi _{\mathrm{prot}}(t,t_{0},t_{M})$ and $\psi _{\mathrm{prem}}(t,t _{0},t_{M})$. For comparison, in an affine default intensity model, the two legs $V_{\mathrm{prot}}(t,t_{0},t_{M})$ and $V_{\mathrm{prem}}(t,t _{0},t_{M})$ are given as sums of exponential-affine terms that cannot be simplified further. In the following, we denote by $V_{ \mathrm{CDS}}(t,t_{0},t_{M},k)$ the time-$t$ price of a CDS contract starting at time $t_{0}$ with maturity $t_{M}$ and spread $k$,

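$$ V_{\mathrm{CDS}}(t,t_{0},t_{M},k) = V_{\mathrm{prot}}(t,t_{0},t_{M}) - k V_{\mathrm{prem}}(t,t_{0},t_{M}) = 1_{\{\tau > t\}} \frac{1}{S_{t}} \big( \psi _{\mathrm{prot}}(t,t_{0},t_{M}) - k \psi _{\mathrm{prem}}(t,t_{0},t_{M}) \big)^{\top } \begin{pmatrix} Y_{t} \\ X_{t} \end{pmatrix}. $$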

(2.11)
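To make the leg formulas concrete, the sketch below evaluates the CDS spread in the simplest special case, assumed here purely for illustration: a scalar model with $n=1$, $m=0$, $a=1$ and $c=-\lambda $, that is, a constant default intensity $\lambda $. The scalar versions of $\psi _{\mathrm{Z}}$, $\psi _{\mathrm{D}}$ and $\psi _{\mathrm{D_{*}}}$ come from (2.6) and Lemma 8, and the accrued coupon is assembled period by period from the definition of $V_{\mathrm{ai}}$. Function names and parameter values are hypothetical.

```python
import math

def psi_z(lam, r, gap):
    """Scalar psi_Z: discounted survival probability over a time gap."""
    return math.exp(-(r + lam) * gap)

def psi_d(lam, r, gap):
    """Scalar psi_D from Lemma 8 with A_* = -(lam + r), a = 1, c = -lam."""
    a_star = -(lam + r)
    return lam * (math.exp(a_star * gap) - 1.0) / a_star

def psi_d_star(lam, r, t, T):
    """Scalar psi_D* from Lemma 8; note the dependence on calendar time t."""
    a_star = -(lam + r)
    gap = T - t
    growth = math.exp(a_star * gap)
    return lam * (gap * growth / a_star + (t - 1.0 / a_star) * (growth - 1.0) / a_star)

def cds_spread(lam, r, recovery, t, tenor):
    """V_prot / V_prem on {tau > t}; tenor = [t_0, t_1, ..., t_M]."""
    t0, tM = tenor[0], tenor[-1]
    prot = (1.0 - recovery) * (psi_d(lam, r, tM - t) - psi_d(lam, r, t0 - t))
    periods = list(zip(tenor, tenor[1:]))
    coupons = sum((tj - tp) * psi_z(lam, r, tj - t) for tp, tj in periods)
    # Accrued coupon: E[e^{-r(tau-t)} (tau - t_{j-1}) 1{t_{j-1} < tau <= t_j}] per period.
    accrued = sum(
        psi_d_star(lam, r, t, tj) - psi_d_star(lam, r, t, tp)
        - tp * (psi_d(lam, r, tj - t) - psi_d(lam, r, tp - t))
        for tp, tj in periods
    )
    return prot / (coupons + accrued)

lam, r, recovery = 0.02, 0.03, 0.4                 # hypothetical inputs
tenor = [0.25 * j for j in range(21)]              # quarterly dates up to 5 years
spread = cds_spread(lam, r, recovery, t=0.0, tenor=tenor)
print(spread, (1.0 - recovery) * lam)
```

In this flat case the spread should be close to $(1-\delta )\lambda $, the value obtained with continuously paid premiums; the small difference comes from the discrete payment dates.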

A multi-name CDS, or credit default index swap (CDIS), is an insurance on a reference portfolio of $N$ firms with equal weight, which we assume to be $1/N$ so that the portfolio total notional amount is equal to one. The protection buyer pays a regular premium that is proportional to the current notional amount of the CDIS. Let $\delta \in [0,1]$ be the recovery rate determined at inception. Upon default of a firm, the protection seller pays $1-\delta $ to the protection buyer and the notional amount of the CDIS decreases by $1/N$. These steps are repeated until maturity or until all firms in the reference portfolio have defaulted, whichever comes first.

Denote by $S^{i}=a_{i}^{\top }Y$ the survival process of firm $i$ as defined in (2.1). The CDIS spread simplifies to a double linear–rational expression, i.e.,

$$ {\mathrm{CDIS}}(t,t_{0},t_{M}) = \frac{\sum _{i=1}^{N} 1_{\{\tau _{i} > t\}} \frac{1}{a_{i}^{\top }Y_{t}} \psi ^{i}_{\mathrm{prot}}(t,t_{0},t_{M})^{\top } \begin{pmatrix} Y_{t} \\ X_{t} \end{pmatrix}}{\sum _{i=1}^{N} 1_{\{\tau _{i} > t\}} \frac{1}{a_{i}^{\top }Y_{t}} \psi ^{i}_{\mathrm{prem}}(t,t_{0},t_{M})^{\top } \begin{pmatrix} Y_{t} \\ X_{t} \end{pmatrix}}, $$

where $\psi ^{i}_{\mathrm{prot}}(t,t_{0},t_{M})$ and $\psi ^{i}_{ \mathrm{prem}}(t,t_{0},t_{M})$ are defined as in Proposition 9 for each firm $i$.

Remark 10

The characteristics of the martingales $M^{Y}$ and $M^{X}$ do not appear explicitly in the bond, CDS and CDIS pricing formulas. This leaves the freedom to specify exogenous factors that feed into $M^{Y}$ and $M^{X}$. Such factors would be unspanned by the term structures of defaultable bonds and CDS and give rise to unspanned stochastic volatility, as described in Filipović et al. [25]. They provide additional flexibility for fitting time series of bond prices and CDS spreads. These unspanned stochastic volatility factors affect the distribution of the survival and factor processes and therefore can be recovered from the prices of credit derivatives such as those discussed later.

2.4 CDIS tranche

A CDIS tranche is a partial insurance on the losses of a reference portfolio in the sense that only losses larger than the attachment point $K_{a}$ and lower than the detachment point $K_{d}$ are insured. We assume the same tenor structure and reference portfolio as for the CDIS contract; the protection buyer pays a periodic premium that is proportional to the current notional amount of the tranche,

$$ T_{t} = \Big(K_{d}-K_{a}-\big(N_{t}(1-\delta )/N -K_{a}\big)^{+} \Big)^{+}, $$

(2.12)

where $N_{t} = \sum _{i=1}^{N} 1_{\{\tau _{i} \le t\}}$ is the total number of firms which have defaulted in the reference portfolio at time $t$. The values of the protection leg and the premium leg at time $t$ are respectively given by

$$\begin{aligned} V_{\mathrm{prot }}(t,t_{M},K_{a},K_{d}) & = {\mathbb {E}}^{} \bigg[ { \int _{t}^{t_{M}} \mathrm {e}^{-r u}\, dT_{u}} \, \bigg| \, {{\mathcal {G}}_{t}} \bigg], \end{aligned}$$

(2.13)

$$\begin{aligned} V_{\mathrm{prem}}(t,t_{M},K_{a},K_{d}) &= \sum _{j=1}^{M} {\mathrm{e}} ^{-rt_{j}} \int _{t_{j-1}}^{t_{j}} ( K_{d}-K_{a}- {\mathbb {E}}^{} \left [ { T_{u}} \, \middle | \, {{\mathcal {G}}_{t}} \right ] ) \,du. \end{aligned}$$

(2.14)

The value of the tranche is then simply given by the difference of the cash flow values,

$$ V_{\mathrm{T}}(t,t_{M},K_{a},K_{d},k) = V_{\mathrm{prot }}(t,t_{M},K _{a},K_{d}) - k V_{\mathrm{prem}}(t,t_{M},K_{a},K_{d}), $$

(2.15)

where $k$ is the tranche spread. The following proposition shows that the ${({\mathcal{F}}_{\infty }\vee {\mathcal{G}}_{t})}$-conditional distribution of the number of defaults at time $u>t$ can be exactly retrieved in closed form by applying the discrete Fourier transform as described in Ackerer and Vatter [2].

Proposition 11

The $({\mathcal{F}}_{\infty }\vee {\mathcal{G}}_{t})$-conditional distribution of the number of defaults $N_{u}$, for $u>t$, is given by

$$ {\mathbb{Q}}[N_{u}=n \, | \, {\mathcal{F}}_{\infty }\vee {\mathcal{G}}_{t}] = \frac{1}{N+1} \sum _{j=0}^{N} \zeta ^{-nj} \prod _{i=1}^{N} \bigg( \zeta ^{j} + (1-\zeta ^{j}) 1_{\{\tau _{i} > t\}} \frac{a_{i}^{\top }Y_{u}}{a_{i}^{\top }Y_{t}} \bigg) $$

(2.16)

for any $n=0,\dots ,N$, and where $\zeta =\exp (2{\mathrm{i}}\pi /(N+1))$ with the imaginary number $\mathrm{i}$.
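The discrete Fourier inversion behind (2.16) can be checked on a small portfolio. In the sketch below, which is illustrative and uses hypothetical names and inputs, `survival_probs[i]` stands for the conditional survival probability $1_{\{\tau _{i} > t\}} a_{i}^{\top }Y_{u} / a_{i}^{\top }Y_{t}$ of firm $i$. The product is the probability generating function of $N_{u}$ evaluated at $\zeta ^{j}$, and the sketch inverts it with the exponent $-nj$, the sign for which the sum recovers the coefficient of order $n$. The result is compared with brute-force enumeration of all survival scenarios.

```python
import cmath
from itertools import product

def default_count_distribution(survival_probs):
    """Distribution of the number of defaults among conditionally independent firms."""
    N = len(survival_probs)
    zeta = cmath.exp(2j * cmath.pi / (N + 1))
    dist = []
    for n in range(N + 1):
        total = 0j
        for j in range(N + 1):
            z = zeta ** j
            pgf = 1 + 0j
            for p in survival_probs:
                pgf *= z + (1 - z) * p        # firm survives w.p. p, defaults w.p. 1 - p
            total += zeta ** (-n * j) * pgf   # inverse transform at frequency j
        dist.append((total / (N + 1)).real)
    return dist

def brute_force_distribution(survival_probs):
    """Same distribution by enumerating every survival/default scenario."""
    N = len(survival_probs)
    dist = [0.0] * (N + 1)
    for alive in product([0, 1], repeat=N):
        prob = 1.0
        for flag, p in zip(alive, survival_probs):
            prob *= p if flag else 1.0 - p
        dist[N - sum(alive)] += prob
    return dist

probs = [0.95, 0.90, 0.0, 0.80]   # hypothetical; the third firm has already defaulted
fourier = default_count_distribution(probs)
direct = brute_force_distribution(probs)
print(fourier)
print(direct)
```

The Fourier route needs on the order of $N^{2}$ products per probability, whereas the enumeration grows like $2^{N}$ and is only usable as a check for small $N$.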

From (2.12), it follows immediately that the conditional expectation of $T_{u}$ can be expressed as a function of the conditional distribution of $N_{u}$. Assume for simplicity that $K_{a}=n_{a} (1- \delta )/N$ and $K_{d}=n_{d} (1-\delta )/N$ for some integers $0\le n_{a}< n_{d}\le N$. Then the conditional expectation of $T_{u}$ for $u>t$ is given by

$$ {\mathbb {E}}^{} \left [ {T_{u}} \, \middle | \, {{\mathcal {F}}_{\infty}\vee {\mathcal {G}}_{t}} \right ] = \sum _{j=1}^{N-n_{a}} \frac{(1- \delta ) \min (j, n_{d}-n_{a}) }{N} \, {\mathbb{Q}}[N_{u}=n_{a}+j \, | \, {\mathcal{F}}_{\infty }\vee {\mathcal{G}}_{t}]. $$

(2.17)

The tranche price (2.15) therefore has a closed-form expression as long as the conditional probability ${{\mathbb{Q}}[N _{u}=j\, | \, {\mathcal{G}}_{t}]}$ is available in closed form for all $t\le u \le t_{M}$ and $j=0,\dots ,N$. An example is given in Sect. 4.4 for a polynomial model.

2.5 CDS option and CDIS option

A CDS option with strike spread $k$ is a European call option on the CDS contract exercisable only if the firm has not defaulted before the option maturity date $t_{0}$. Its payoff at time $t_{0}$ is

$$ \big(V_{\mathrm{CDS}}(t_{0},t_{0},t_{M},k)\big)^{+} = \frac{1_{\{\tau >t_{0}\}}}{a^{\top }Y_{t_{0}}} \bigg( \psi _{\mathrm{cds}}(t_{0},t_{0},t_{M},k)^{\top } \begin{pmatrix} Y_{t_{0}} \\ X_{t_{0}} \end{pmatrix} \bigg)^{+} $$

with

$$ \psi _{\mathrm{cds}}(t,t_{0},t_{M},k) = \psi _{\mathrm{prot}}(t,t_{0},t _{M})-k \psi _{\mathrm{prem}}(t,t_{0},t_{M}). $$

(2.18)

Denote by $V_{\mathrm{CDSO}}(t,t_{0},t_{M},k)$ the price of the CDS option at time $t$,

$$ \begin{aligned} V_{\mathrm{CDSO}}(t,t_{0},t_{M},k) &= {\mathbb{E}}\bigg[ \mathrm{e}^{-r(t_{0}-t)} \frac{1_{\{\tau >t_{0}\}}}{a^{\top }Y_{t_{0}}} \bigg( \psi _{\mathrm{cds}}(t_{0},t_{0},t_{M},k)^{\top } \begin{pmatrix} Y_{t_{0}} \\ X_{t_{0}} \end{pmatrix} \bigg)^{+} \, \bigg| \, {\mathcal{G}}_{t} \bigg] \\ &= 1_{\{\tau >t\}} \frac{\mathrm{e}^{-r(t_{0}-t)}}{a^{\top }Y_{t}} {\mathbb{E}}\bigg[ \bigg( \psi _{\mathrm{cds}}(t_{0},t_{0},t_{M},k)^{\top } \begin{pmatrix} Y_{t_{0}} \\ X_{t_{0}} \end{pmatrix} \bigg)^{+} \, \bigg| \, {\mathcal{F}}_{t} \bigg], \end{aligned} $$

where the second equality follows directly from Lemma A.1.

A CDIS option gives the right at time $t_{0}$ to enter a CDIS contract with strike spread $k$ and maturity $t_{M}$ on the firms in the reference portfolio which have not defaulted and, simultaneously, to receive the losses realised before the exercise date $t_{0}$. Denote by $V_{\mathrm{CDISO}}(t,t_{0},t_{M},k)$ the price of the CDIS option at time $t \le t_{0}$, so that

$$ V_{\mathrm{CDISO}}(t,t_{0},t_{M},k) = \frac{\mathrm{e}^{-r(t_{0}-t)}}{N} {\mathbb{E}}\bigg[ \bigg( \sum _{i=1}^{N} V^{i}_{\mathrm{CDS}}(t_{0},t_{0},t_{M},k) + (1-\delta ) 1_{\{\tau _{i}\le t_{0}\}} \bigg)^{+} \, \bigg| \, {\mathcal{G}}_{t} \bigg], $$

where $V^{i}_{\mathrm{CDS}}(t_{0},t_{0},t_{M},k)$ is defined as in (2.11) for firm $i$.

Proposition 12

The price of a CDIS option is given by

$$ V_{\mathrm{CDISO}}(t,t_{0},t_{M},k)= \sum _{\alpha \in \{0,1\}^{N}} \frac{ \mathrm{e}^{-r(t_{0}-t)}}{N} {\mathbb {E}}^{} \big[ { \big(V_{*}(\alpha ,t_{0},t_{M},k)\big)^{+} q(\alpha ,t, t_{0})} \, \big| \, {{\mathcal {F}}_{t}} \big] $$

with the conditional payoffs

$$ V_{*}(\alpha ,t_{0},t_{M},k) = \sum _{i=1}^{N} \frac{\alpha _{i}}{a_{i} ^{\top }Y_{t_{0}}} \psi ^{i}_{\mathrm{cds}}(t_{0},t_{0},t_{M},k)^{ \top } \begin{pmatrix} Y_{t_{0}} \\ X_{t_{0}} \end{pmatrix} + (1-\delta )(1-\alpha _{i}) $$

and the conditional probabilities

$$ q(\alpha ,t,t_{0}) = \prod _{i=1}^{N} \bigg( \frac{(a_{i}^{\top }Y_{t_{0}})^{\alpha _{i}} \big(a_{i}^{\top }(Y_{t}-Y_{t_{0}})\big)^{1-\alpha _{i}}}{a_{i}^{\top }Y_{t}} 1_{\{\tau _{i}>t\}} + (1_{\{\tau _{i}\le t\}})^{1-\alpha _{i}} \bigg), $$

where $\alpha =(\alpha _{1},\dots ,\alpha _{N})$ and with the convention $0^{0}=0$.

The time-$t$ price of a CDS option, or of a CDIS option, is therefore given by the expected value of a non-smooth continuous function in $(Y_{t_{0}},X_{t_{0}})$, where $t< t_{0}$. A methodology to price such contracts is presented in Sect. 3.2.

2.6 Credit valuation adjustment

The unilateral credit valuation adjustment (UCVA) of a position in a bilateral contract is the present value of losses resulting from its cancellation when the counterparty defaults.

Proposition 13

The time-$t$ price of the UCVA with maturity $t_{M}$ and time-$u$ net positive exposure $f(u,Y_{u},X_{u})$, for some continuous function $f(u,y,x)$, is

$$ \begin{aligned} \mathrm{UCVA}(t,t_{M}) &= {\mathbb{E}}\big[ \mathrm{e}^{-r(\tau -t)} 1_{\{t<\tau \le t_{M}\}} f(\tau ,Y_{\tau },X_{\tau }) \, \big| \, {\mathcal{G}}_{t} \big] \\ &= \frac{1_{\{\tau >t\}}}{a^{\top }Y_{t}} \int _{t}^{t_{M}} \mathrm{e}^{-r(u-t)} {\mathbb{E}}\big[ f(u,Y_{u},X_{u}) \, a^{\top }(cY_{u}+\gamma X_{u}) \, \big| \, {\mathcal{F}}_{t} \big] \,du, \end{aligned} $$

where $\tau $ is the counterparty default time.

Computing the UCVA therefore boils down to a numerical integration of European-style option prices. As is the case for CDS and CDIS options, these option prices can be uniformly approximated as described in Sect. 3.2. We refer to Brigo et al. [9] for a thorough analysis of bilateral counterparty risk valuation in a doubly stochastic default framework.

3 The linear hypercube model

The linear hypercube (LHC) model is a single-name model, that is, $n=1$ so that $S=Y$. The survival process is absolutely continuous, as in Remark 2.2, and the factor process $X$ is diffusive and takes values in a hypercube whose edges’ length is given by $Y_{t}$, for all $t\ge 0$. More formally, the state space of $(Y,X)$ is given by

$$ E = \{ (y,x)\in {\mathbb{R}}^{1+m} : y\in (0,1]\text{ and } x \in [0,y]^{m} \}. $$

The dynamics of $(Y,X)$ is

$$ \begin{aligned} dY_{t} & = -\gamma ^{\top }X_{t} \,dt, \\ dX_{t} & = ( b Y_{t} + \beta X_{t})\,dt + \Sigma (Y_{t},X_{t})\,dW_{t} \end{aligned} $$

(3.1)

for some $\gamma \in {\mathbb{R}}^{m}_{+}$ and some $m$-dimensional Brownian motion $W$, and where the volatility matrix $\Sigma (y,x)$ is given by

$$ \Sigma (y,x)=\mathrm{diag}\big( \sigma _{1}\sqrt{x_{1}(y-x_{1})} , \dots , \sigma _{m}\sqrt{x_{m}( y-x_{m})} \big) $$

(3.2)

with volatility parameters $\sigma _{1}, \dots , \sigma _{m}\ge 0$.

Let $(Y,X)$ be an $E$-valued solution of (3.1). It is readily verified that $Y$ is nonincreasing and that the parameter $\gamma $ controls the speed at which it decreases, i.e.,

$$ 0 \le \gamma ^{\top }X_{t} \le \gamma ^{\top }\mathbf{1} Y_{t}, $$

which implies

$$ 0\le \lambda _{t}\le \gamma ^{\top }\mathbf{1} \quad \text{and} \quad Y _{t} \ge Y_{0} {\mathrm{e}}^{-\gamma ^{\top }\mathbf{1} t} > 0 \qquad \text{for any } t\ge 0. $$

Note that the default intensity upper bound $\gamma ^{\top }\mathbf{1}$ depends on $\gamma $, which is estimated from data. Therefore, a crucial step in the model validation procedure is to verify that the range of possible default intensities is sufficiently wide.

The following theorem gives conditions on the parameters such that the LHC model (3.1) is well defined.

Theorem 1

Assume that for all $i=1,\dots , m$, we have

$$\begin{aligned} b_{i} - \sum _{j\neq i} \beta _{ij}^{-} &\ge 0 , \end{aligned}$$

(3.3)

$$\begin{aligned} \gamma _{i} + \beta _{ii} + b_{i} + \sum _{j\neq i} ( \gamma _{j} + \beta _{ij}) ^{+} &\le 0. \end{aligned}$$

(3.4)

Then for any initial law of $(Y_{0},X_{0})$ with support in $E$, there exists a unique in law $E$-valued solution $(Y,X)$ of (3.1). It satisfies the boundary non-attainment, for any $i=1,\dots ,m$, that

(i)
$X_{it}>0$ for all $t\ge 0$ if $X_{i0}>0$ and
$$ b_{i} - \sum _{j\neq i} \beta _{ij}^{-} \ge \frac{\sigma _{i}^{2}}{2} ; $$
(3.5)
(ii)
$X_{it}< Y_{t}$ for all $t\ge 0$ if $X_{i0}< Y_{0}$ and
$$ \gamma _{i} + \beta _{ii} + b_{i} + \sum _{j\neq i} ( \gamma _{j} + \beta _{ij}) ^{+} \le - \frac{\sigma _{i}^{2}}{2}. $$
(3.6)
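The conditions of Theorem 1 are simple to check numerically for a candidate parameter set. The following sketch is only an illustration: the function name `check_lhc_conditions` and the dictionary keys are hypothetical, and it assumes the usual conventions $x^{-}=\max (-x,0)$ and $x^{+}=\max (x,0)$ for the negative and positive parts.

```python
def check_lhc_conditions(gamma, b, beta, sigma):
    """Check (3.3)-(3.6) factor by factor; beta is an m x m list of lists."""
    m = len(gamma)
    report = []
    for i in range(m):
        others = [j for j in range(m) if j != i]
        # Left-hand side of (3.3) and (3.5): b_i minus negative parts of beta_ij.
        lower = b[i] - sum(max(-beta[i][j], 0.0) for j in others)
        # Left-hand side of (3.4) and (3.6).
        upper = gamma[i] + beta[i][i] + b[i] + sum(
            max(gamma[j] + beta[i][j], 0.0) for j in others
        )
        half_var = 0.5 * sigma[i] ** 2
        report.append({
            "drift_inward_at_zero": lower >= 0.0,     # (3.3)
            "drift_inward_at_Y": upper <= 0.0,        # (3.4)
            "never_hits_zero": lower >= half_var,     # (3.5)
            "never_hits_Y": upper <= -half_var,       # (3.6)
        })
    return report
```

Conditions (3.3) and (3.4) guarantee existence in $E$, while (3.5) and (3.6) are the stronger requirements under which the corresponding boundary is not attained, so a parameter set may pass the first pair and fail the second.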

The state space $E$ is a regular $(m+1)$-dimensional hyperpyramid. Figure 1 shows $E$ when $m=1$ and illustrates the drift inward pointing conditions (3.3) and (3.4) at the boundaries of $E$.

In Appendix B, we describe all possible market price of risk specifications under which the drift function of $(Y,X)$ remains linear.

Remark 2

The volatility of $X_{i}$ is maximal at the center of its support when ${X_{i}=Y/2}$ and decreases to zero at its boundaries for $X_{i}\to 0$ and $X_{i}\to Y$. As a consequence, a factor may rapidly move from the lower to the upper part of its support without spending much time in the middle part; this may mimic a regime-shifting behaviour.

Remark 3

If we define the normalised process $Z = X/Y$, then the dynamics of $(Z, \lambda )$ is given by

$$\begin{aligned} dZ_{t} &= \Big(b + \big(\beta + \operatorname{diag}(\gamma ^{\top }Z_{t}) \big)Z _{t} \Big)\,dt + \Sigma (1, Z_{t})\,dW_{t}, \\ d\lambda _{t} &= \gamma ^{\top }\,dZ_{t}. \end{aligned}$$

We derive closed-form expressions for the stationary points of the drift of $(Z,\lambda )$ in Sects. 3.1 and 4.1 and in Example 2.3.

3.1 One-factor LHC model

The default intensity of the one-factor LHC model, $m=1$, has autonomous dynamics of the form

$$ d\lambda _{t} = ( \lambda _{t}^{2} + \beta \lambda _{t} + b \gamma )\,dt + \sigma \sqrt{\lambda _{t}(\gamma -\lambda _{t})}\,dW_{t}. $$

The diffusion function of $\lambda $ is the same as the diffusion function of a Jacobi process taking values in the compact interval $[0,\gamma ]$. However, the drift of $\lambda $ includes a quadratic term that is present neither in Jacobi nor in affine processes.^{Footnote 1} Conditions (3.3) and (3.4) in Theorem 3.1 can be rewritten as

$$ b \ge 0 \quad \text{and}\quad (\gamma + b + \beta ) \le 0. $$

In other words, the drift of $\lambda $ is nonnegative at $\lambda =0$ and nonpositive at $\lambda =\gamma $. We can factorise the drift as

$$ \lambda _{t}^{2} + \beta \lambda _{t} + b\gamma = (\lambda _{t}-\ell _{1})( \lambda _{t}-\ell _{2}) $$

for some roots $0\le \ell _{1}\le \gamma \le \ell _{2}$. Hence $\lambda $ drifts towards $\ell _{1}$ unless ${\lambda _{t}=\ell _{2}=\gamma }$, in which case the drift vanishes. The corresponding original parameters are given by $\beta =-(\ell _{1}+\ell _{2})$ and ${b\gamma =\ell _{1}\ell _{2}}$, so that the drift of the factor $X$ reads

$$ b Y_{t}+\beta X_{t} = (\ell _{1}+\ell _{2})\bigg(\frac{\ell _{1}\ell _{2}}{ \gamma (\ell _{1}+\ell _{2})} Y_{t} -X_{t}\bigg). $$

As a sanity check, we verify that the constant default intensity case, $\lambda _{t} = \gamma $ for all $t\ge 0$, is nested as a special case. This is equivalent to having $X= Y$, which can be obtained by specifying the dynamics $dX_{t} = -\gamma X_{t}\,dt $ for the factor process and the initial condition $X_{0}=1$. This corresponds to the stationary points $\ell _{1}=0$ and $\ell _{2}=\gamma $.
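The mapping between the roots $(\ell _{1},\ell _{2})$ and the drift parameters $(b,\beta )$ can be illustrated with a short sketch. The function names `one_factor_parameters` and `intensity_drift` are hypothetical and introduced only for this example.

```python
def one_factor_parameters(ell1, ell2, gamma):
    """Return (b, beta) with lam^2 + beta*lam + b*gamma = (lam - ell1)(lam - ell2)."""
    if not 0 <= ell1 <= gamma <= ell2:
        raise ValueError("roots must satisfy 0 <= ell1 <= gamma <= ell2")
    beta = -(ell1 + ell2)
    b = ell1 * ell2 / gamma
    return b, beta


def intensity_drift(lam, b, beta, gamma):
    """Drift of the default intensity in the one-factor LHC model."""
    return lam ** 2 + beta * lam + b * gamma
```

For instance, $\ell _{1}=0.05$, $\ell _{2}=1$ and $\gamma =0.25$ give $b=0.2$ and $\beta =-1.05$, so that $b\ge 0$ and $\gamma +b+\beta =-0.6\le 0$, and the drift vanishes at $\lambda =\ell _{1}$.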

The dynamics of the standard one-factor affine model on ${\mathbb{R}} _{+}$ is

$$ d\lambda _{t} = \ell _{2}(\ell _{1}-\lambda _{t})\,dt + \sigma \sqrt{ \lambda _{t}}\,dW_{t}, $$

where $\ell _{2}$ is the mean-reversion speed and $\ell _{1}$ the mean-reversion level of $\lambda $. Figure 2 shows the drift and diffusion functions of the default intensity for the one-factor LHC and affine models. The drift function is affine in the affine model, whereas it is quadratic in the LHC model. However, for reasonable parameter values, the drift functions look similar when the default intensity is smaller than the mean-reversion level, $\lambda < \ell _{1}$. On the other hand, when $\lambda >\ell _{1}$, the force of drifting towards $\ell _{1}$ is smaller and concave in the LHC model. The diffusion function is strictly increasing and concave for the affine model, whereas it has a concave semi-ellipse shape in the LHC model. The diffusion functions have the same shape on $[0,\gamma /2]$, but typically do not scale equivalently in the parameter $\sigma $. Note that the parameter $\gamma $ can always be set sufficiently large so that the likelihood of $\lambda $ going above $\gamma /2$ is arbitrarily small.

3.2 Option price approximation

We saw in Sects. 2.5 and 2.6 that the pricing of a CDS option, a CDIS option or a UCVA boils down to computing an ${\mathcal{F}}_{t}$-conditional expectation of the form

$$ \Phi (f;t,t_{M})={\mathbb {E}}^{} [ {f(Y_{t_{M}},X_{t_{M}})} \, | \, {{\mathcal {F}}_{t}} ] $$

for some continuous function $f(y,x)$ on $E$. We now show how to approximate $\Phi (f;t,t_{M})$ in closed form by means of a polynomial approximation of $f(y,x)$. The methodology presented hereinafter applies to any linear credit risk model which has a compact state space $E$ and for which the ${\mathcal{F}}_{t}$-conditional moments of $(Y_{t_{M}},X _{t_{M}})$ are computable.

To this end, we first recall how the ${\mathcal{F}}_{t}$-conditional moments of $(Y_{t_{M}},X_{t_{M}})$ for $t\le t_{M}$ can be obtained in closed form as described in Filipović and Larsson [23]. Denote by $\text{Pol}_{n}(E)$ the set of polynomials $p(y,x)$ on $E$ of degree $n$ or less. It is readily seen that the generator of $(Y,X)$,

$$ {\mathcal{G}}f(y,x)= \big( -\gamma ^{\top }x \quad (b y+\beta x)^{ \top }\big) \nabla f(y,x) +\frac{1}{2} \sum _{i=1}^{m} \frac{\partial ^{2} f(y,x)}{\partial x_{i}^{2}} \sigma _{i}^{2} x_{i}(y-x_{i}), $$

is polynomial in the sense that

$$ {\mathcal{G}}\text{Pol}_{n}(E) \subseteq \text{Pol}_{n}(E) \qquad \text{for any $n\in {\mathbb{N}}$.} $$

Let $N_{n}=\binom{n+1+m}{n}$ denote the dimension of $\text{Pol}_{n}(E)$ and fix a polynomial basis $\{h_{1},\dots ,h_{N_{n}}\}$ of $\text{Pol}_{n}(E)$. We define the function of $(y,x)$

$$ H_{n}(y,x)=\big(h_{1}(y,x),\dots ,h_{N_{n}}(y,x)\big)^{\top } $$

with values in ${\mathbb{R}}^{N_{n}}$. There exists a unique matrix representation $G_{n}$ of the restriction ${\mathcal{G}}|_{\text{Pol}_{n}(E)}$ with respect to this polynomial basis such that for any $p\in \text{Pol}_{n}(E) $, we can write

$$ {\mathcal{G}}p(y,x) = H_{n}(y,x)^{\top }G_{n} {\mathbf{{p}}}, $$

where $\mathbf{p}$ is the coordinate representation of $p$. This implies the moment formula

$$ {\mathbb {E}}^{} [ {p(Y_{t_{M}},X_{t_{M}})} \, | \, {{\mathcal {F}}_{t}} ] = H_{n}(Y_{t},X_{t})^{ \top }{\mathrm{e}}^{G_{n} (t_{M}-t)} {\mathbf{p}} $$

(3.7)

for any $t\le t_{M}$; see [23, Theorem 3.1].
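To make the moment formula (3.7) concrete, the sketch below treats the one-factor LHC model ($m=1$) and the monomials $y^{k-c}x^{c}$ of a fixed degree $k$. Applying the generator to such a monomial gives a combination of the monomials with exponents $c-1$, $c$ and $c+1$ of the same degree, so the corresponding block of $G_{n}$ is tridiagonal. All function names are hypothetical, and the matrix exponential is approximated by repeated truncated Taylor steps purely to keep the example free of dependencies; it is not the algorithm of the references cited in the text.

```python
import math


def generator_block(k, gamma, b, beta, sigma):
    """Generator restricted to the monomials y^(k-c) x^c, c = 0..k (column c = image)."""
    G = [[0.0] * (k + 1) for _ in range(k + 1)]
    for c in range(k + 1):
        half_var = 0.5 * sigma ** 2 * c * (c - 1)
        G[c][c] = beta * c - half_var            # coefficient of y^(k-c) x^c
        if c >= 1:
            G[c - 1][c] = b * c + half_var       # coefficient of y^(k-c+1) x^(c-1)
        if c < k:
            G[c + 1][c] = -gamma * (k - c)       # coefficient of y^(k-c-1) x^(c+1)
    return G


def expm_action(G, p, tau, steps=64, terms=20):
    """Approximate exp(G * tau) p with `steps` truncated Taylor steps."""
    n = len(p)
    h = tau / steps
    v = list(p)
    for _ in range(steps):
        term, acc = list(v), list(v)
        for j in range(1, terms + 1):
            term = [h / j * sum(G[r][c] * term[c] for c in range(n)) for r in range(n)]
            acc = [a + t for a, t in zip(acc, term)]
        v = acc
    return v


def conditional_moment(k, c, y_t, x_t, tau, gamma, b, beta, sigma):
    """E[Y^(k-c) X^c at time t + tau | F_t] via H(Y_t, X_t)' exp(G tau) p."""
    p = [0.0] * (k + 1)
    p[c] = 1.0
    w = expm_action(generator_block(k, gamma, b, beta, sigma), p, tau)
    H = [y_t ** (k - j) * x_t ** j for j in range(k + 1)]
    return sum(hj * wj for hj, wj in zip(H, w))


# Constant-intensity check: with b = 0, beta = -gamma and X_t = Y_t = 1,
# the factor stays equal to Y, so that Y at time t + tau equals exp(-gamma * tau).
check = conditional_moment(1, 0, 1.0, 1.0, 2.0, gamma=0.25, b=0.0, beta=-0.25, sigma=0.75)
assert abs(check - math.exp(-0.5)) < 1e-10
```

The closing lines reuse the constant default intensity case of Sect. 3.1 as a test, since the moments are then known explicitly.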

Remark 4

The choice for the basis $H_{n}(y,x)$ of $\text{Pol}_{n}(E)$ is arbitrary and one may simply consider the monomial basis,

$$ H_{n}(y,x)=\{1, y, x_{1},\dots , x_{m}, y^{2}, yx_{1}, x_{1}^{2}, \dots , x_{m}^{n}\}$$

in which $G_{n}$ is block-diagonal. There are efficient algorithms to compute the matrix exponential $\mathrm{e}^{G_{n} (t_{M}-t)}$; see for example Higham [30, Sect. 10]. Note that only the action of the matrix exponential is required, that is, $\mathrm{e}^{G_{n} (t_{M}-t)} {\mathbf{p}}$ for some $p\in \text{Pol}_{n}(E)$, for which specific algorithms exist as well; see for example Al-Mohy and Higham [3] and Sidje [50] and references therein.

Now let $\epsilon >0$. From the Stone–Weierstrass approximation theorem [45, Theorem 5.8], there exists a polynomial $p\in {\mathrm{Pol}}_{n}(E)$ for some $n$ such that

$$ \sup _{(y,x)\in E}\left \lvert f(y,x) - p(y,x) \right \rvert \le \epsilon . $$

(3.8)

Combining (3.7) and (3.8), we obtain the desired approximation of $\Phi (f;t,t_{M})$.

Theorem 5

Let $p\in {\mathrm{Pol}}_{n}(E)$ be as in (3.8). Then $\Phi (f;t,t _{M})$ is uniformly approximated by

$$ \sup _{t\le t_{M}}\| \Phi (f;t,t_{M})- H_{n}(Y_{t},X_{t})^{\top }{\mathrm{e}} ^{G_{n} (t_{M}-t)} {\mathbf{p}}\|_{L^{\infty }}\le \epsilon . $$

(3.9)

The approximating polynomial $p$ in (3.8) needs to be found case by case. We illustrate this for the CDS option in Sect. 4.2 and for the CDIS option on a homogeneous portfolio in Sect. 4.3.

Remark 6

Approximating the payoff function $f(y,x)$ on a strict subset of the state space $E$ is sufficient to approximate an option price. Indeed, for any times $t \le u \le s $, the process $(Y_{u},X_{u})_{t\le u \le s}$ takes values in

$$ \{(y,x)\in E : Y_{t} \ge y \ge {\mathrm{e}}^{-\gamma ^{\top }\mathbf{1}(s-t)}Y _{t} \} \subseteq E. $$

A polynomial approximation on a compact subset of $E$ can be expected to be more precise and, as a result, to produce a more accurate price approximation. See Sect. 4.2 for an implementation example.

4 Case studies

We show that the LHC model can reproduce complex term structure dynamics, that option prices can be accurately approximated, and that the prices of derivatives on homogeneous portfolios can similarly be computed. First, we fit a parsimonious LHC model specification to CDS data and discuss the estimated parameters and factors. Then we accurately approximate the price of CDS options at different moneyness. Finally, for a homogeneous portfolio, we derive closed-form expressions for the payoff function of a CDIS option and for the tranche prices.

4.1 CDS calibration

We calibrate the LHC model to a high-yield firm, Bombardier Inc., and also to an investment-grade firm, Walt Disney Co., in order to show that the model flexibly adjusts to different spread levels and dynamics. We also present a fast filtering and calibration methodology which is specific to LHC models.

Data description

The empirical analysis is based on composite CDS spread data from Markit which are essentially averaged quotes provided by major market makers. The sample starts on January 1, 2005 and ends on January 1, 2015. The data set contains 552 weekly observations summing up to 3620 observed CDS spreads for each firm. At each date, we include the available spreads with the modified restructuring clause on contracts with maturities of 1, 2, 3, 4, 5, 7 and 10 years.

Time series of the 1-year, 5-year and 10-year CDS spreads are displayed in Fig. 3, as well as the relative changes on the 5-year versus 1-year CDS spread. The two term structures of CDS spreads exhibit important fluctuations of their level, slope and curvature. The time series can be split into three time periods. The first period, before the subprime crisis, exhibits low spreads in contango and low volatility. The second period, during the subprime crisis, exhibits high volatility with skyrocketing spreads temporarily in backwardation. The crisis had a significantly larger impact on the high-yield firm for which the spreads have more than quadrupled. The third period is characterised by a steep contango and a lot of volatility. Figure 3 also shows that CDS spread changes are strongly correlated across maturities. Summary statistics are reported in Table 1.

Table 1 CDS spread summary statistics


Model specification

The risk-neutral dynamics of each survival process is given by the LHC model of Sect. 3 with two and three factors. We set $\gamma = \gamma _{1} \boldsymbol{e}_{1}$ for some $\gamma _{1}\ge 0$ and consider a cascading structure of the form

$$ dX_{it} = \kappa _{i}(\theta _{i} X_{(i+1)t}-X_{it})\,dt + \sigma _{i}\sqrt{X _{it}(Y_{t}-X_{it})}\,dW_{it} $$

(4.1)

for $i=1,\dots , m-1$ and

$$ dX_{mt} = \kappa _{m}(\theta _{m} Y_{t}-X_{mt})\,dt + \sigma _{m}\sqrt{X _{mt}(Y_{t}-X_{mt})}\,dW_{mt} $$

(4.2)

for some parameters $\kappa ,\theta ,\sigma \in {\mathbb{R}}^{m}_{+}$ satisfying

$$ \theta _{i} \le 1 - \frac{\gamma _{1}}{\kappa _{i}} $$

(4.3)

for $i=1,\dots , m$. We have $\beta _{ii}=-\kappa _{i}$, $\beta _{i,i+1}=\kappa _{i}\theta _{i}$ and $\beta _{ij}=0$ otherwise, ${b_{m}=\kappa _{m} \theta _{m}}$ and $b_{i}=0$ otherwise. It directly follows that

$$ 0 \le b_{i} - \sum _{j\neq i} \beta _{ij}^{-} = 1_{\{i=m\}} \kappa _{m}\theta _{m} = 1_{\{i=m\}} b_{m} $$

and for $i=1,\dots , m$ that

$$ \begin{aligned} 0 &\ge \gamma _{i} + \beta _{ii} + b_{i} + \sum _{j\neq i} (\gamma _{j}+\beta _{ij})^{+} = \gamma _{1} - \kappa _{i} + \kappa _{i}\theta _{i} \\ &= \gamma _{1} + \beta _{ii} + 1_{\{i\neq m\}} \beta _{i,i+1} + 1_{\{i=m\}} b_{m}. \end{aligned} $$

This shows that the parameter conditions (3.3) and (3.4) are satisfied. Note that (3.3) and (3.4) boil down to standard linear parameter constraints when expressed in terms of $\beta $ and $b$. They are therefore compatible with efficient optimisation algorithms.
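The translation of the cascade specification into the generic parameters of (3.1) can be sketched as follows. The function names `cascade_parameters` and `drift_condition_lhs` are hypothetical; the second one evaluates the left-hand sides of (3.3) and (3.4), assuming $x^{-}=\max (-x,0)$ and $x^{+}=\max (x,0)$, so that one can confirm numerically that (3.4) reduces to $\gamma _{1}-\kappa _{i}+\kappa _{i}\theta _{i}\le 0$, which is (4.3).

```python
def cascade_parameters(gamma1, kappa, theta):
    """Map the cascade parameters of (4.1)-(4.2) to (gamma, b, beta) of (3.1)."""
    m = len(kappa)
    gamma = [gamma1] + [0.0] * (m - 1)
    b = [0.0] * m
    beta = [[0.0] * m for _ in range(m)]
    for i in range(m):
        beta[i][i] = -kappa[i]
        if i < m - 1:
            beta[i][i + 1] = kappa[i] * theta[i]   # reverts towards theta_i * X_{i+1}
        else:
            b[i] = kappa[i] * theta[i]             # last factor reverts towards theta_m * Y
    return gamma, b, beta


def drift_condition_lhs(gamma, b, beta):
    """Left-hand sides of (3.3) and (3.4) for every factor."""
    m = len(gamma)
    lhs33, lhs34 = [], []
    for i in range(m):
        others = [j for j in range(m) if j != i]
        lhs33.append(b[i] - sum(max(-beta[i][j], 0.0) for j in others))
        lhs34.append(
            gamma[i] + beta[i][i] + b[i]
            + sum(max(gamma[j] + beta[i][j], 0.0) for j in others)
        )
    return lhs33, lhs34
```

With $\gamma _{1}=0.3$, $\kappa =(1,0.5)$ and $\theta =(0.6,0.3)$, constraint (4.3) holds strictly for both factors, and the left-hand sides of (3.4) are $-0.1$ and $-0.05$.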

This specification allows default intensity values to persistently be close to zero over extended periods of time. It also allows us to work with a multidimensional model parsimoniously, as the number of free parameters is equal to $3m+1$, whereas it is equal to $3m + m^{2}$ for the generic LHC model. The default intensity is then proportional to the first factor and given by $\lambda =\gamma _{1}X_{1}/Y$.

We denote the two- and three-factor linear hypercube cascade models by $\mathrm{LHCC }(2)$ and $\mathrm{LHCC }(3)$, respectively. In addition, we estimate a three-factor model, denoted by $\mathrm{LHCC }(3)^{*}$, where the parameter $\gamma _{1}$ is an exogenous fixed parameter. This parameter value is fixed so as to be about twice as large as the estimated $\gamma _{1}$ from the $\mathrm{LHCC }(3)$ model. We estimate the constrained model in order to determine whether the choice of the default intensity upper bound is critical for the empirical results.

We set the risk-free rate equal to the average 5-year risk-free yield over the sample, $r=2.52\%$. We make the usual assumption that the recovery rate is equal to $\delta =40\%$. We also use Lemma 2.8 to compute efficiently the CDS spreads, which is justified by the following result.

Lemma 1

Assume that $r>0$. Then the matrix $A^{*}=A-r\operatorname{\mathrm{Id}}$ with $A$ as in (2.5) is invertible for the cascade LHCC model defined in (4.1) and (4.2) and with $\gamma =\gamma _{1} {\mathrm{e}}_{1}$.

Remark 2

The drift of the normalised process $Z=X/Y$ admits the stationary points $\bar{\mu }_{t}$ given by the system of equations

$$ \bar{\mu }_{it} = (-1)^{m-i+1}\prod _{j=i}^{m} \frac{\kappa _{j}\theta _{j}}{\bar{\mu }_{1t}\gamma _{1} - \kappa _{j}}, \qquad i=1,\dots ,m, $$

(4.4)

as shown in Appendix A. In fact, $\bar{\mu }_{1t}$ implies the values of $\bar{\mu }_{it}$ for $i=2,\dots ,m$. The stationary point of the drift of $\lambda $ is given by $\gamma _{1}\bar{ \mu }_{1t}$.

Filtering and calibration

We present an efficient methodology to filter the factors from the CDS spreads. We recall that the CDS spread $\mathrm{CDS}(t,t_{0},t_{M})$ is the strike spread that renders the initial value of the CDS contract equal to zero. We therefore obtain the affine equation

$$ \psi _{\mathrm{cds}}\big(t,t_{0},t_{M},\mathrm{CDS}(t,t_{0},t_{M}) \big)^{\top } \begin{pmatrix} 1 \\ Z_{t} \end{pmatrix} = 0, $$

(4.5)

conditionally on $\{\tau >t\}$ and with the normalised process $Z = X / Y \in [0,1]^{m}$. Therefore, in theory, we could extract the value $Z_{t}$ from the observation of at least $m$ spreads with different maturities. The factor value $(S_{t},X_{t})$ at time $t$ can in turn be inferred, for example, by applying the Euler scheme to compute the survival-process value and then rescaling the pseudo factor $Z_{t}$, via

$$ Y_{t_{i}} = Y_{t_{i-1}} - \gamma ^{\top }X_{t_{i-1}} \Delta t \qquad \text{and} \qquad X_{t_{i}} = Y_{t_{i}} Z_{t_{i}} , $$

(4.6)

for the observation dates $t_{i}$ and with $Y_{t_{0}}=1$. In practice, there might not be a value $Z_{t}$ such that (4.5) is satisfied for all observed market spreads. Therefore, we consider all the observable spreads and minimise the weighted mean squared error, i.e.,

$$ \begin{aligned} \min _{z} \quad & \frac{1}{2} \sum _{k=1}^{n_{i}} \left ( \frac{\psi _{\mathrm{cds}}\big(t_{i},t_{i},t_{M}^{k},\mathrm{CDS}(t_{i},t_{i},t_{M}^{k})\big)^{\top } \begin{pmatrix} 1 \\ z \end{pmatrix}}{\psi _{\mathrm{prem}}(t_{i},t_{i},t_{M}^{k})^{\top } \begin{pmatrix} 1 \\ Z_{t_{i-1}} \end{pmatrix}} \right )^{2} \\ \text{such that} \quad & 0\le z_{i}\le 1, \quad i=1,\dots ,m, \end{aligned} $$

(4.7)

where $t_{M}^{1},\dots ,t_{M}^{n_{i}}$ are the maturities of the $n_{i}$ observed spreads at date $t_{i}$, and $t_{i-1}$ is the previous observation date. Dividing the CDS price error by an approximation of the CDS premium leg value gives an accurate approximation of the CDS spread error when $Z_{t_{i}}\approx Z_{t_{i-1}}$. The above minimisation problem is a linearly constrained quadratic optimisation problem which can be numerically solved virtually instantaneously.
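One filtering step, that is, solving (4.7) and then applying (4.6), can be sketched as below. This is a simplified illustration and not the solver used for the calibration study: the box-constrained least-squares problem is solved by plain projected gradient descent, the function names `filter_step` and `euler_update` are hypothetical, and the vectors $\psi _{\mathrm{cds}}$ (with the observed spread plugged in) as well as the premium-leg weights in the denominator of (4.7) are assumed to be precomputed inputs.

```python
def filter_step(psi_cds_rows, prem_weights, z_start, iters=500):
    """Approximate the minimiser of (4.7) over the box [0, 1]^m.

    psi_cds_rows[k] = (psi_0, psi_1, ..., psi_m) for the k-th observed maturity,
    prem_weights[k] = psi_prem' (1, Z_prev) for the same maturity.
    """
    m = len(z_start)
    # Scaled affine residuals r_k(z) = c_k + A_k . z
    c = [row[0] / w for row, w in zip(psi_cds_rows, prem_weights)]
    A = [[row[1 + j] / w for j in range(m)] for row, w in zip(psi_cds_rows, prem_weights)]
    # Step 1/L, where L (squared Frobenius norm) bounds the largest eigenvalue of A'A.
    L = sum(a * a for row in A for a in row) or 1.0
    z = list(z_start)
    for _ in range(iters):
        r = [ck + sum(ak[j] * z[j] for j in range(m)) for ck, ak in zip(c, A)]
        grad = [sum(A[k][j] * r[k] for k in range(len(r))) for j in range(m)]
        # Gradient step followed by projection onto the box.
        z = [min(1.0, max(0.0, z[j] - grad[j] / L)) for j in range(m)]
    return z


def euler_update(y_prev, x_prev, z_new, gamma, dt):
    """Apply (4.6): Euler step for Y, then rescale the normalised factor."""
    y_new = y_prev - sum(g * x for g, x in zip(gamma, x_prev)) * dt
    x_new = [y_new * z for z in z_new]
    return y_new, x_new
```

Because the objective is a convex quadratic and the constraints form a box, any standard bounded least-squares or quadratic programming routine could replace the projected gradient loop.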

For any parameter set, we can extract the observable factor process at each date by recursively solving (4.7) and applying (4.6). With the parameters and the factor-process values, we can in turn compute the difference between the model and market CDS spreads. Therefore, we search numerically for the parameter set that minimises the aggregated CDS spread root-mean-squared error (RMSE) by using the gradient-free Nelder–Mead algorithm together with a penalty term to enforce the parameter constraints and starting from several randomised initial parameter sets.

Note that we do not calibrate the volatility parameters $\sigma _{i}$ for $i=1,\dots ,m$ since CDS spreads do not depend on the martingale components with linear credit risk models and since the factor process is observable directly from the CDS spreads. Furthermore, we only fit the risk-neutral drift parameters $\kappa $ and $\theta $ implied by the CDS spreads. The total number of parameters for the $\mathrm{LHCC }(2)$, $\mathrm{LHCC }(3)$ and $\mathrm{LHCC }(3)^{*}$ models is therefore equal to 5, 7 and 6, respectively. Equipped with a fast filter and a low-dimensional parameter space, the calibration procedure is swift.

Remark 3

Alternatively, one could estimate the parameters by performing a quasi-maximum-likelihood estimation or a more advanced generalised method of moments estimation. This can be implemented in a straightforward manner with the LHC model if the market price of risk specification preserves the polynomial property of the factors, as the real-world conditional moments of $(Y,X)$ are then given in closed form; see Appendix B. The availability of conditional moments also enables direct usage of the unscented Kalman filter to recover the factor values at each date. However, this approach comes at the cost of more parameters and possibly more stringent conditions on them, as well as unnecessary computational costs if we are only interested in market prices.

Parameters, fitted spreads and factors

The fitted parameters are reported in Table 2. An important observation is that the parameter constraint in (4.3) is binding for each dimension in all the fitted models. The calibrated parameter values are similar across the different specifications which is comforting, and the calibrated default intensity upper bounds appear large enough to cover the high spread values observed during the subprime crisis.

Table 2 Fitted and fixed (in bold) parameters for the LHC models


The fitted factors extracted from the calibration are used as input to compute the fitted spreads. With these, we compute the fitting errors for each date and maturity. Not surprisingly, the more flexible specification $\mathrm{LHCC }(3)$ performs best. Estimating the default intensity upper bound $\gamma _{1}$ instead of setting an arbitrarily large value improves the calibration. Table 3 reports summary statistics of the errors by maturity. The $\mathrm{LHCC }(3)$ model has the smallest RMSE for each maturity. In particular, its overall RMSE is half that of the two-factor model. The $\mathrm{LHCC }(3)^{*}$ model faces difficulties in reproducing long-term spreads; for example, its RMSE is twice as large as that of the unconstrained $\mathrm{LHCC }(3)$ for the 10-year maturity spread for both firms. Figure 4 displays the fitted spreads and the RMSE time series. Again, the $\mathrm{LHCC }(3)$ appears to have the smallest level of errors over time. The two other models do not perform as well during the low-spreads period before the financial crisis, and during the recent volatile period. Overall, the fitted models appear to reproduce the observed CDS spread values relatively well.

Table 3 Comparison of CDS spread fits for the LHC models


Figure 5 shows the estimated factors. They are remarkably similar across the different specifications. The default intensity explodes and the survival process decreases rapidly during the financial crisis. The $m$th factor controls the long-term default intensity level. The second factor controls the medium-term behaviour of the term structure of credit risk in the $\mathrm{LHCC }(3)$ and $\mathrm{LHCC }(3)^{*}$ models. The $\mathrm{LHCC }(2)$ model requires a default intensity almost equal to zero to capture the steep contango of the term structure at the end of the sample period, even lower than before the financial crisis. This seems counterfactual and illustrates the limitations of the $\mathrm{LHCC }(2)$ model in capturing changing dynamics. The $m$th factor visits the second half of its support $[0,Y_{t}]$ and appears to stabilise in this region for the three models.

4.2 CDS option pricing

We describe an accurate and efficient methodology to price CDS options that builds on the payoff approximation approach presented in Sect. 3.2 and illustrate it with numerical examples. The model used for the numerical illustration is the one-factor LHC model from Sect. 3.1 with stylised but realistic parameters $\gamma =0.25$, $\ell _{1}=0.05$, $\ell _{2}=1$, $\sigma =0.75$, $X_{0}=0.2$ and $r=0$.

From Sect. 2.5, we know that the time-$t$ CDS option price with strike spread $k$ is of the form

$$ V_{\mathrm{CDSO}}(t,t_{0},t_{M},k) = 1_{\{\tau >t\}} {\mathbb{E}}\big[ f\big(Z(t_{0},t_{M},k)\big) \, \big| \, {\mathcal{F}}_{t} \big] $$

with the payoff function $f(z) = \mathrm{e}^{-r(t_{0}-t)} z^{+} / Y _{t}$ and where the random variable $Z(t_{0},t_{M},k)$ is defined by

$$ Z(t_{0},t_{M},k)=\psi _{\mathrm{cds}}(t_{0},t_{0},t_{M},k)^{\top } \begin{pmatrix} Y_{t_{0}} \\ X_{t_{0}} \end{pmatrix} $$

with $\psi _{\mathrm{cds}}(t_{0},t_{0},t_{M},k)$ as in (2.18). Furthermore, the random variable $Z(t_{0},t_{M},k)$ takes values in the interval $[b_{\min },b_{\max }]$, which for the LHC model is given by

$$\begin{aligned} b_{\min } &= \sum _{i=1}^{m+1} \min \big(0,\psi _{\mathrm{cds}}(t_{0},t _{0},t_{M},k)_{i}\big), \\ b_{\max } &= \sum _{i=1}^{m+1} \max \big(0,\psi _{\mathrm{cds}}(t_{0},t _{0},t_{M},k)_{i}\big). \end{aligned}$$

We now show how to approximate the payoff function $f$ with a polynomial by truncating its Fourier–Legendre series, and then how the conditional moments of $Z(t_{0},t_{M},k)$ can be computed recursively from the conditional moments of $(Y_{t_{0}},X_{t_{0}})$.

Let ${\mathcal{L}}e_{n}(x)$ denote the generalised Legendre polynomials defined on the closed interval $[b_{\min },b_{\max }]$ and given by

$$ \mathcal{L}e_{n}(x) = \sqrt{\frac{1+2n}{2\sigma }} Le_{n}\bigg(\frac{x- \mu }{\sigma }\bigg), $$

where $\mu =(b_{\max }+b_{\min })/2$, $\sigma =(b_{\max }-b_{\min })/2$ and the standard Legendre polynomials $Le_{n}(x)$ on $[-1,1]$ are defined recursively by

$$ Le_{n+1}(x) = \frac{2n+1}{n+1} x Le_{n}(x) - \frac{n}{n+1} Le_{n-1}(x) $$

with $Le_{0}\equiv 1$ and $Le_{1}(x)=x$. The generalised Legendre polynomials form a complete orthonormal system on $[b_{\min },b_{ \max }]$ in the sense that the mean squared error of the Fourier–Legendre series approximation $f^{(n)}(x)$ of any piecewise continuous function $f(x)$, defined by

$$ f^{(n)}(x) = \sum _{k=0}^{n} f_{k} {\mathcal{L}}e_{k}(x), \quad \text{where } f_{k} = \int _{b_{\min }}^{b_{\max }} f(x) {\mathcal{L}}e _{k}(x) \, dx, $$

(4.8)

converges to zero,

$$ \lim _{n\rightarrow \infty } \int _{b_{\min }}^{b_{\max }} \big(f(x) - f ^{(n)}(x) \big)^{2} \,dx = 0. $$

The coefficients for the CDS option payoff are given in closed form by

$$ f_{n} = 1_{\{\tau >t\}} \frac{\mathrm{e}^{-r(t_{0}-t)}}{Y_{t}} \int _{0}^{b_{\max }} z \, {\mathcal{L}}e_{n}(z) \,dz, $$

since the integrands are polynomial functions. Note that a similar approach is followed in Ackerer et al. [1] on the unbounded interval ℝ with a Gaussian weight function.
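The Fourier–Legendre coefficients of the hockey-stick function $z\mapsto z^{+}$ can be computed without numerical quadrature. The sketch below is an illustration under explicit assumptions: it takes $b_{\min }<0<b_{\max }$, omits the scaling factor $1_{\{\tau >t\}}\mathrm{e}^{-r(t_{0}-t)}/Y_{t}$, and normalises the rescaled polynomials by $\sqrt{(2n+1)/(2\sigma )}$, the constant that makes them orthonormal with respect to Lebesgue measure on $[b_{\min },b_{\max }]$. It uses the standard identities $(2n+1)Le_{n}=Le_{n+1}'-Le_{n-1}'$, $(2n+1)\,u\,Le_{n}(u)=(n+1)Le_{n+1}(u)+nLe_{n-1}(u)$ and $Le_{n}(1)=1$. All function names are hypothetical.

```python
import math


def legendre_values(n_max, u):
    """Le_0(u), ..., Le_{n_max}(u) by the three-term recursion."""
    vals = [1.0, u]
    for n in range(1, n_max):
        vals.append(((2 * n + 1) * u * vals[n] - n * vals[n - 1]) / (n + 1))
    return vals[: n_max + 1]


def hockey_stick_coefficients(n_max, b_min, b_max):
    """Coefficients of z^+ in the orthonormal Legendre basis on [b_min, b_max]."""
    mu, sig = 0.5 * (b_max + b_min), 0.5 * (b_max - b_min)
    u0 = -mu / sig                                   # image of the kink z = 0
    P = legendre_values(n_max + 2, u0)
    # I[n] = integral of Le_n over [u0, 1].
    I = [1.0 - u0] + [(P[n - 1] - P[n + 1]) / (2 * n + 1) for n in range(1, n_max + 2)]
    coeffs = []
    for n in range(n_max + 1):
        # Integral of u * Le_n(u) over [u0, 1].
        int_u = ((n + 1) * I[n + 1] + (n * I[n - 1] if n > 0 else 0.0)) / (2 * n + 1)
        norm = math.sqrt((2 * n + 1) / (2 * sig))
        coeffs.append(norm * sig * (mu * I[n] + sig * int_u))
    return coeffs


def approximate_payoff(coeffs, z, b_min, b_max):
    """Evaluate the truncated series at z."""
    mu, sig = 0.5 * (b_max + b_min), 0.5 * (b_max - b_min)
    P = legendre_values(len(coeffs) - 1, (z - mu) / sig)
    return sum(f * math.sqrt((2 * n + 1) / (2 * sig)) * P[n] for n, f in enumerate(coeffs))
```

On $[-1,1]$, the order-zero approximation is the constant $1/4$, the average of $z^{+}$ over the interval, and the largest error over the interval shrinks as the order increases, with the slowest convergence around the kink.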

The ${\mathcal{F}}_{t}$-conditional moments of $Z(t_{0},t_{M},k)$ can be computed recursively from the conditional moments of $(Y_{t_{0}},X _{t_{0}})$. Let $\pi :{\mathcal{E}}\to \{1,\dots ,N_{n}\}$ be an enumeration of the set of exponents with total order less than or equal to $n$, that is,

$$ {\mathcal{E}}= \bigg\{ \boldsymbol{\alpha } \in {\mathbb{N}}^{1+m} : \sum _{i=1}^{1+m} {\alpha }_{i} \le n \bigg\} . $$

Define the polynomials

$$ h_{\pi (\boldsymbol{\alpha })}(s,x)=s^{\alpha _{1}} \prod _{i=1}^{m} x_{i}^{ \mathbf{\alpha }_{1+i}}, $$

which form a basis of $\mathrm{Pol}_{n}(E)$. Denote by $\mathbf{1}$ the $(1+m)$-dimensional vector of ones and by $\mathrm{e}_{i}$ the $(1+m)$-dimensional vector whose $i$th coordinate is equal to one and zero otherwise.

Lemma 4

For all $n\ge 2$, we have

$$ {\mathbb {E}}^{} [ {Z(t_{0},t_{M},k)^{n}} \, | \, {{\mathcal {F}}_{t}} ] = \sum _{\boldsymbol{\alpha }^{\top }\mathbf{1}=n} c_{\pi (\boldsymbol{\alpha })} {\mathbb {E}}^{} [ {h_{\pi (\boldsymbol{\alpha} )}(Y_{t_{0}},X_{t_{0}})} \, | \, {{\mathcal {F}}_{t}} ], $$

where the coefficients $c_{\pi (\boldsymbol{\alpha })}$ are recursively given by

$$ c_{\pi (\boldsymbol{\alpha })} = \sum _{i=1}^{1+m} 1_{\{\alpha _{i}-1\ge 0\}} \, c_{\pi (\boldsymbol{\alpha }-\mathrm{e}_{i})} \, \psi _{\mathrm{cds}}(t_{0},t_{0},t_{M},k)_{i}. $$
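The recursion of Lemma 4 amounts to expanding the power of a linear form, multiplying by $\psi _{\mathrm{cds}}^{\top }v$ one degree at a time. The sketch below is a hypothetical illustration: `psi` stands for the vector $\psi _{\mathrm{cds}}(t_{0},t_{0},t_{M},k)$, exponent tuples play the role of $\boldsymbol{\alpha }$, and a dictionary replaces the enumeration $\pi $. It starts from the constant polynomial, so the degree-one coefficients are the entries of `psi`.

```python
def power_coefficients(psi, n):
    """Coefficients c_alpha with (psi . v)^n = sum over |alpha| = n of c_alpha * v^alpha."""
    d = len(psi)
    coeffs = {tuple([0] * d): 1.0}               # degree 0: the constant 1
    for _ in range(n):
        nxt = {}
        for alpha, c in coeffs.items():
            for i in range(d):
                # Multiplying by psi_i * v_i raises the i-th exponent by one.
                key = alpha[:i] + (alpha[i] + 1,) + alpha[i + 1:]
                nxt[key] = nxt.get(key, 0.0) + c * psi[i]
        coeffs = nxt
    return coeffs


def evaluate(coeffs, v):
    """Evaluate sum of c_alpha * v^alpha at the point v."""
    total = 0.0
    for alpha, c in coeffs.items():
        term = c
        for a, x in zip(alpha, v):
            term *= x ** a
        total += term
    return total
```

Replacing each monomial value in `evaluate` by the corresponding conditional moment of $(Y_{t_{0}},X_{t_{0}})$ yields the conditional moment of $Z(t_{0},t_{M},k)$, by linearity of the expectation.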

We now report the main numerical findings. We take $t_{0}=1$, $t_{M}=t_{0}+5$ and three reference strike spreads $k\in \{250,300,350 \}$ basis points that represent in-, at- and out-of-the-money CDS options. The first row in Fig. 6 shows the payoff approximation $f^{(n)}(z)$ in (4.8) for the polynomial orders $n\in \{1,5,30\}$ and the strike spreads $k\in \{250,300,350\}$. A more accurate approximation of the hockey-stick payoff function is naturally obtained by increasing the order $n$, especially around the kink. The width of the support $[b_{\min },b_{\max }]$ increases with the strike spread $k$; hence the uniform error bound should be expected to be larger for out-of-the-money options. This is confirmed by the second row of Fig. 6 that shows the error bound (3.9) as a function of the approximation order $n$ for the Fourier–Legendre approach described above. It also displays the error bound when the CDS option payoff function is interpolated by means of Chebyshev polynomials; see Appendix C for more details. The error bound is approximated by taking the maximum distance between the payoff function and the polynomial approximation on a regular grid of $10^{4}$ points over $[b_{\min },b_{\max }]$. We remark that the error bound of the Chebyshev approach is oscillating around the error bound of the Fourier–Legendre approach. This seems to be caused by variation of the polynomial approximation accuracy around the payoff kink as the Chebyshev nodes change. Note that the error bound is typically non-tight in practice, as illustrated in the following pricing application in which the pricing error is far lower than the error bound, at least for $n\le 20$.

Figure 7 shows the price approximation as a function of the polynomial order, up to $n=30$. The price approximations stabilise rapidly with the Fourier–Legendre approach so that a price approximation using the first $n=10$ moments appears to be accurate up to a basis point. On the other hand, the price approximations exhibit large oscillations with the Chebyshev approach. Figure 7 also shows that it takes a fraction of a second on a standard desktop to compute the price approximation. Note that almost all of the CPU time is spent on the computation of the moments of $Z(t_{0},t_{M},k)$.

We recall that the volatility parameter $\sigma $ of the LHC model does not affect the CDS spreads and can therefore be used to improve the joint calibration of CDS and CDS options. We illustrate this in the left panel of Fig. 8 where the CDS option price is displayed as a function of the volatility parameter for different strike spreads. As expected, the option price is an increasing function of the volatility parameter. The right panel of Fig. 8 also shows that $X_{0}$ has an almost linear impact on the CDS option price.

Note that the dimension $\binom{1+m+n}{n}$ of the polynomial basis becomes a programming and computational challenge when both the expansion order $n$ and the number of factors $1+m$ are large. For example, for $n=20$ and $1+m=2$, the basis has dimension 231, whereas it has dimension 10 626 when $1+m=4$. In practice, we successfully implemented examples with $1+m=4$ and $n=50$ on a standard desktop computer, in which case the basis dimension is 316 251.
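The dimensions quoted above follow directly from the binomial formula. A short Python check, in which the function name `basis_dimension` is a hypothetical helper, reads:

```python
from math import comb

def basis_dimension(num_factors, order):
    """Number of monomials of degree <= order in num_factors variables."""
    return comb(num_factors + order, order)

for num_factors, order in [(2, 20), (4, 20), (4, 50)]:
    print(num_factors, order, basis_dimension(num_factors, order))
```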

4.3 CDIS option pricing

We discuss the approximation of the payoff function by means of Chebyshev polynomials for a CDIS option on a homogeneous portfolio. Let $N_{t} = \sum _{i=1}^{N} 1_{\{\tau _{i}\le t\}}$ denote the number of firms which have defaulted by time $t$. Consider a CDIS option on a homogeneous portfolio so that $S_{t}^{i}=a^{\top }Y_{t}$ for all $i=1,\dots ,N$. From Proposition 2.12, it follows that the time-$t$ price of the CDIS option is given by

$$ V_{\mathrm{CDISO}}(t,t_{0},t_{M},k) = \frac{\mathrm{e}^{-r(t_{0}-t)}}{N} \sum _{j=0}^{N-N_{t}} {\mathbb{E}} \big[ \big(V_{*}(j,t_{0},t_{M})\big)^{+} q(j,t,t_{0}) \, \big| \, {\mathcal{F}}_{t} \big] $$

with the conditional payoffs

$$ V_{*}(j,t_{0},t_{M}) = \frac{j}{a^{\top }Y_{t_{0}}} \psi _{ \mathrm{cds}}(t_{0},t_{0},t_{M},k)^{\top } \begin{pmatrix} Y_{t_{0}} \\ X_{t_{0}} \end{pmatrix} + (1-\delta )(N-j) $$

and the conditional probabilities

$$ q(j,t,t_{0}) = \binom{N-N_{t}}{j} \frac{(a^{\top }Y_{t_{0}})^{j}(a ^{\top }Y_{t}-a^{\top }Y_{t_{0}})^{N-N_{t}-j}}{(a^{\top }Y_{t})^{N-N _{t}}}, $$

(4.9)

with the notable difference that now the summation contains at most $N+1$ terms because the defaults are symmetric and thus interchangeable. Define the random variables

$$ Y(t_{0}) = a^{\top }Y_{t_{0}}, \qquad X(t_{0},t_{M},k)= \psi _{\mathrm{cds}}(t_{0},t_{0},t_{M},k)^{\top } \begin{pmatrix} Y_{t_{0}} \\ X_{t_{0}} \end{pmatrix} . $$

The CDIS option price can then be rewritten as

$$ V_{\mathrm{CDISO}}(t,t_{0},t_{M},k) = {\mathbb {E}}^{} \big[ {f\big(Y(t_{0}),X(t_{0},t_{M},k)\big)} \, \big| \, {{\mathcal {F}}_{t} \vee N_{t}} \big], $$

where the payoff function $f(y,x)$ is given by

$$\begin{aligned} f(y,x) & = \frac{\mathrm{e}^{-r(t_{0}-t)}}{N (a^{\top }Y_{t})^{N-N _{t}}} \bigg( (1-\delta ) N (a^{\top }Y_{t}-y)^{N-N_{t}} \\ & \quad + \sum _{j=1}^{N-N_{t}} \binom{N-N_{t}}{j} \big(j x +y(1- \delta )(N-j)\big)^{+} y^{j-1}(a^{\top }Y_{t}-y)^{N-N_{t}-j} \bigg). \end{aligned}$$

The ${\mathcal{F}}_{t}$-conditional moments of $(Y(t_{0}), X(t_{0},t _{M},k))$ can be computed recursively in a similar way as in Lemma 4.4. The payoff function $f(y,x)$ can be approximated using Chebyshev polynomials and nodes, see Appendix C, or using its two-dimensional Fourier–Legendre series representation.

4.4 CDIS tranche pricing

As in Sect. 4.3, we consider a homogeneous portfolio so that $S^{i}=a^{\top }Y$ for all $i=1,\dots ,N$. In this case, a simpler expression for (2.16) can be derived, namely

$$\begin{aligned} {\mathbb{Q}}[ N_{u}=j \, | \, {\mathcal{F}}_{\infty }\vee {\mathcal{G}} _{t}] = {\mathbb{Q}}[ N-N_{u}=N -j \, | \, {\mathcal{F}}_{\infty } \vee {\mathcal{G}}_{t}] = q(N-j,t,u) \end{aligned}$$

(4.10)

for $u>t$ and $j=N_{t},\dots ,N$, and where $q(N-j,t,u)$ is defined as in (4.9). We fix the attachment point ${K_{a} = n _{a}(1-\delta )/N}$ and the detachment point ${K_{d} = n_{d}(1-\delta )/N}$, for some integers $0\le n_{a} < n_{d}\le N$. Assuming for simplicity that $N_{t} \le n_{a}$, we obtain from (2.17) and (4.10) that

$$ {\mathbb {E}}^{} \left [ {T_{u}} \, \middle | \, {{\mathcal {F}}_{\infty}\vee {\mathcal {G}}_{t}} \right ] = \sum _{{j=n_{a}+1}}^{N} \frac{(1- \delta ) \min (j -n_{a}, n_{d}-n_{a}) }{N} q(N-j, t, u), $$

and by differentiating with respect to $u$ that

$$\begin{aligned} \frac{d{\mathbb {E}}^{} \left [ {T_{u}} \, \middle | \, {{\mathcal {F}}_{\infty}\vee {\mathcal {G}}_{t}} \right ]}{du} &= \sum _{j=n_{a}+1}^{N} \frac{(1-\delta ) \min (j-n_{a}, n_{d}-n_{a}) }{N} \\ & \phantom{=:\sum _{j=n_{a}+1}} \times \binom{N-N_{t}}{N-j} \frac{ (a^{\top }Y_{u})^{N-j-1}(a^{\top }Y _{t}-a^{\top }Y_{u})^{j-N_{t}-1}}{(a^{\top }Y_{t})^{N-N_{t}}} \\ & \phantom{=:\sum _{j=n_{a}+1}} \times \big((N-j)a^{\top }Y_{t} - (N-N_{t})a^{\top }Y_{u}\big) a^{ \top }(c Y_{u} + \gamma X_{u}) \end{aligned}$$

for any $u>t$. The protection and premium legs in (2.13), (2.14) can thus in principle be computed in closed form using the moment formula (3.7).

5 Extensions

We present several model extensions offering additional features. We first construct multi-name models, then include stochastic interest rates possibly correlated with credit spreads, and conclude by discussing jumps and stochastic clocks to generate simultaneous defaults.

5.1 Multi-name models

We build upon the LHC model to construct multi-name models with correlated default intensities and which can easily accommodate the inclusion of new factors and firms. This approach can be applied to other linear credit risk models as long as they belong to the class of polynomial models. We consider $n$ independent LHC processes

$$ (Y^{1},X^{1}), \dots , (Y^{n},X^{n}), $$

(5.1)

with each $(\!Y^{j}\!,X^{j})$ as in (3.1), (3.2), and define the stacked processes ${Y\!\!=(Y^{1},\dots ,Y^{n})}$ with $Y_{0}=\mathbf{1}$ and ${X=(X^{1},\dots ,X^{n})}$ with $X_{0}\in [0,1]^{m}$, where $m=\sum _{j=1}^{n} m_{j}$. We denote by $E$ the state space of $(Y,X)$.

Let $h=(h^{1},\dots ,h^{n})$ be the ${\mathbb{R}}^{n}_{+}$-valued process whose $j$th component is given by

$$ h_{t}^{j} = \frac{{\gamma ^{j}}^{\top }X_{t}^{j}}{Y_{t}^{j}}, \qquad t \ge 0, $$

(5.2)

where the vector $\gamma ^{j}\in {\mathbb{R}}^{m_{j}}$ is the drift parameter of $Y^{j}$; see (3.1).

Linear construction

The survival process of the firm $i=1,\dots ,N$ can be defined as in (2.1) by $S^{i} = a_{i}^{\top }Y$ for some vector $a_{i}\in {\mathbb{R}}^{n}_{+}$ satisfying $a_{i}^{\top }{\mathbf{1}} =1$. The corresponding default intensity $\lambda ^{i}$ of firm $i$ is for all $t\ge 0$ given by a weighted sum of $h$, that is, $\lambda _{t}^{i} = {w^{i}_{t}}^{\top }h_{t}$ with stochastic weights $w^{i}_{jt} = a_{ij}Y ^{j}_{t} / S^{i}_{t}>0$ satisfying $\sum _{j=1}^{n} w_{jt}^{i}=1$.

Polynomial construction

Fix a degree $d$ and define the survival process $S^{i} $ of each firm $i=1,\dots ,N$ by $S_{t}^{i} = p_{i}(Y_{t})$ for all $t\ge 0$, for some polynomial ${p_{i}(y)\in {\mathrm{Pol}}_{d}([0,1]^{n})}$ which is componentwise nondecreasing and positive on $[0,1]^{n}$ and such that $p_{i}({\mathbf{1}})=1$. Let $H_{d}(y,x)$ be a polynomial basis of $\mathrm{Pol}_{d}(E)$ stacked in a row vector and of the form $H_{d}(y,x)= (H_{d}(y), H^{*}_{d}(y,x))$, where $H_{d}(y)$ is itself a polynomial basis of $\mathrm{Pol}_{d}([0,1]^{n})$. The survival process of firm $i$ then becomes $S^{i} = a_{i}^{\top }{\mathcal{Y}}$ with the finite variation process ${\mathcal{Y}}=H_{d}(Y)$, the factor process ${\mathcal{X}}=H_{d}^{*}(Y,X)$ and where the vector $a_{i}$ is given by the equation $p_{i}(y)=H_{d}(y) a_{i}$. It follows from the polynomial property that the process $({\mathcal{Y}},{\mathcal{X}})$ has a linear drift as in (2.2) and (2.3); see [24, Theorem 4.3]. The specific values for the drift of $({\mathcal{Y}},{\mathcal{X}})$ depend on the choice of the polynomial basis $H_{d}(y,x)$.

Example 1

Take $p(y)=y^{\alpha }=\prod _{i=1}^{n} y_{i}^{\alpha _{i}}$ for some $\alpha \in {\mathbb{N}}^{n}$; then the implied default intensity is a weighted sum $\lambda _{t} = \alpha ^{\top }h_{t}$ with $h_{t}$ as defined in (5.2). The weights are constant, as opposed to the stochastic weights in the linear construction.

Remark 2

The dimension of $H_{d}(y,x)$ is $\binom{d+n+m}{d}$ and may be large depending on the values of $m+n$ and $d$. However, given that the pairs $(Y_{t}^{i},X_{t}^{i})$ in (5.1) are independent, the conditional expectation of a monomial in $(Y_{u},X _{u})$ can be rewritten as

$$ {\mathbb{E}}\bigg[ \prod _{i=1}^{n}(Y^{i}_{u})^{\alpha _{i}}(X^{i}_{u})^{ \beta _{i}} \, \bigg|\, {\mathcal{F}}_{t}\bigg] = \prod _{i=1}^{n} {\mathbb{E}}[ (Y^{i}_{u})^{\alpha _{i}}(X^{i}_{u})^{\beta _{i}} \, |\, {\mathcal{F}}_{t}], \qquad u>t, $$

for some $\alpha _{i}\in {\mathbb{N}}$ and $\beta _{i}\in {\mathbb{N}} ^{m_{i}}$ for all $i=1,\dots ,n$. Hence, to compute bond and CDS prices, we only need to consider $n$ independent polynomial bases of total dimension equal to $\sum _{i=1}^{n} \binom{d+1+m_{i}}{d}$.

5.2 Stochastic interest rates

We next include stochastic interest rates possibly correlated with credit spreads. We denote the discount process by $D_{t} = \exp (-\int _{0}^{t} r_{s}\, ds)$ for $t\ge 0$, where $r_{s}$ is the short rate value at time $s$. We specify that $D=a_{r}^{\top }Y$ for some vector $a_{r}\in {\mathbb{R}}^{n}$. This is similar to the specification of the survival process of a firm, but we do not require that $D$ is nonincreasing. That is, we allow negative interest rates. We follow Sect. 5.1 and let $H_{2}(y,x)$ be a polynomial basis of $\mathrm{Pol}_{2}(E)$ which defines a new linear credit risk model $({\mathcal{Y}},{\mathcal{X}})=(H_{2}(Y),H_{2}^{*}(Y,X))$ whose linear drift is given by a matrix ${\mathcal{A}}$ as in (2.5).

Proposition 3

The pricing formulas (2.6), (2.7) and (2.9) also apply with $({\mathcal{Y}}_{t},{\mathcal{X}} _{t})$ in place of $(Y_{t},X_{t})$, by using the vector

$$ \psi _{\mathrm{Z}}(t,t_{M})^{\top }= (a_{\mathrm{Z}}^{\top }\>\> 0) \mathrm{e}^{ {\mathcal{A}}(t_{M}-t)}, $$

where the vector $a_{\mathrm{Z}}$ is given by $H_{2}(y) a_{ \mathrm{Z}} = (a_{r}^{\top }y)(a^{\top }y)$, and the vectors

$$\begin{aligned} \psi _{\mathrm{D}}(t,t_{M})^{\top } &= a_{\mathrm{D}} ^{\top }\int _{t} ^{t_{M}} {\mathrm{e}}^{{\mathcal{A}}(s-t)}\,ds , \\ \psi _{\mathrm{D_{*}}}(t,{t_{M}})^{\top } &= a_{\mathrm{D}} ^{\top } \int _{t}^{t_{M}} s \mathrm{e}^{{\mathcal{A}}(s-t)}\,ds, \end{aligned}$$

where the vector $a_{\mathrm{D}}$ is given by $H_{2}(y,x) a_{ \mathrm{D}} = (a_{r}^{\top }y ) ( -a^{\top }(cy + \gamma x))$.

In practice, it can be sufficient to consider a basis strictly smaller than $H_{2}(y,x)$, as the following example suggests.

Example 4

Consider two independent LHC processes $(Y^{j},X^{j})$ with $m_{j}=1$ for $j\in \{1,2\}$ and consider the linear credit risk model with stochastic interest rate given by

$$ D_{t} = Y^{1}_{t} \quad \text{and} \quad S_{t} = \nu Y_{t}^{1} + (1- \nu ) Y_{t}^{2} \qquad \text{for all}\ t \ge 0, $$

for some parameter $\nu \in (0,1)$. The calculation of bond and CDS prices only requires the subbases

$$ H_{0}(y,x) = \begin{pmatrix} y_{1}^{2} & y_{1}y_{2} \end{pmatrix}, \qquad H_{1}(y,x) = \begin{pmatrix} y_{1}x_{1} & y_{1}x_{2} & x_{1}y_{2} & x_{1}^{2} & x_{1}x_{2} \end{pmatrix}, $$

whose total dimension is $\dim ((H_{0}(y,x), H_{1}(y,x))) = 7 < \dim (\mathrm{Pol}_{2}(E))=15$. The drift term of the process $(H_{0}(Y,X), H_{1}(Y,X))$ is

$$ {\mathcal{A}}= \begin{pmatrix} 0 & 0 & -2 \gamma _{1} & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & -\gamma _{2} & -\gamma _{1} & 0 & 0 \\ b_{1} & 0 & \beta _{1} & 0 & 0 & -\gamma _{1} & 0 \\ 0 & b_{2} & 0 & \beta _{2} & 0 & 0 & -\gamma _{1} \\ 0 & b_{1} & 0 & 0 & \beta _{1} & 0 & 0 \\ \sigma _{1}^{2} & 0 & 2b_{1} - \sigma _{1}^{2} & 0 & 0 & 2 \beta _{1} & 0 \\ 0 & 0 & 0 & b_{1} & b_{2} & 0 & \beta _{1} + \beta _{2} \\ \end{pmatrix} , $$

where the parameters with subscripts $j\in \{1,2\}$ correspond to the LHC process $(Y^{j},X^{j})$. The pricing vectors in this basis are

$$ a_{\mathrm{Z}} = \begin{pmatrix} \nu & 1-\nu \end{pmatrix} \quad \text{and} \quad a_{\mathrm{D}} = \begin{pmatrix} 0 & 0 & -\nu \gamma _{1} & -(1-\nu )\gamma _{2} & 0 & 0 & 0 \end{pmatrix} . $$

5.3 Jumps and simultaneous defaults

There are two ways to include jumps in the survival process dynamics that may result in the simultaneous default of several firms. The first is to let the martingale part of $Y$ be driven by a jump process so that multiple survival processes may jump at the same time. The second is to let time run with a stochastic clock leaping forward, hence producing synchronous jumps in the factors and the survival processes.

The survival process remains defined as in (2.1), but the factors are extensions of the LHC process in what follows. For simplicity, we discuss a unique pair $(Y, X)$ as in (3.1) whose parameters $\gamma ,\beta ,B$ satisfy (3.3) and (3.4). Let $Z$ be a nondecreasing Lévy process with Lévy measure $\nu ^{Z}(d\zeta )$ and drift $b^{Z}\ge 0$ that is independent from the Brownian motion $W$ and the uniform random variables $U^{1},\dots ,U^{N}$.

Jump-diffusion model

Assume that $\Delta Z_{t} \le 1$ for all $t\ge 0$. We define the dynamics of the LHC model with jumps as

$$\begin{aligned} d \begin{pmatrix} Y_{t} \\ X_{t} \end{pmatrix} & = \begin{pmatrix} -c \>\>& -\gamma ^{\top }- \delta ^{\top }{\mathbb{E}}[Z_{1}] \\ b \>\>& \beta -\operatorname{diag}(\nu ) {\mathbb{E}}[Z_{1}] \end{pmatrix} \begin{pmatrix} Y_{t-} \\ X_{t-} \end{pmatrix} dt + \begin{pmatrix} 0 \\ \Sigma (Y_{t-},X_{t-}) \end{pmatrix} dW_{t} \\ & \qquad \> - \begin{pmatrix} c Y_{t-} + \delta ^{\top }X_{t-} \\ \operatorname{diag}(\nu ) X_{t-} \end{pmatrix} dN_{t} \end{aligned}$$

with the martingale $N$ given by $N_{t} = Z_{t} - {\mathbb{E}}[Z_{1}] t$ for $t\ge 0$, for some $c>0$, $\delta \in {\mathbb{R}}^{m}_{+}$ and $\nu \in {\mathbb{R}}^{m}_{+}$ such that

$$\begin{aligned} c + \delta ^{\top }{\mathbf{1}}< 1, \quad c+ \delta ^{\top }{\mathbf{1}} \le \nu _{i} \le 1, \qquad i=1,\dots ,m, \end{aligned}$$

(5.3)

$$\begin{aligned} \text{and $\nu _{i} < 1$ if (3.5) applies,}\quad i=1, \dots ,m. \end{aligned}$$

(5.4)

Conditions (5.3) and (5.4) ensure that the process always jumps inside its state space. Note that the same process $Z$ can affect the dynamics of multiple LHC processes $(Y^{i},X^{i})$.

Stochastic clock

We consider the time-changed process $(\bar{Y}_{t},\! \bar{X}_{t})_{t \ge 0}=(Y_{Z_{t}},\!X_{Z_{t}})_{t\ge 0}$ that directly feeds into (2.1) in place of $(Y_{t},X_{t})$ and whose factor dynamics is given by

$$ \begin{pmatrix} d\bar{Y}_{t} \\ d\bar{X}_{t} \end{pmatrix} = \bar{A} \begin{pmatrix} \bar{Y}_{t} \\ \bar{X}_{t} \end{pmatrix} dt + \begin{pmatrix} dM_{t}^{\bar{Y}} \\ dM_{t}^{\bar{X}} \end{pmatrix}, $$

where the $(m+n)\times (m+n)$-matrix $\bar{A}$ is now given by

$$ \bar{A} = b^{Z} A + \int _{0}^{\infty }( \mathrm{e}^{A\zeta } - \operatorname{\mathrm{Id}}) \nu ^{Z} (d\zeta ) $$

(5.5)

with the matrix $A$ as in (2.5); see [46, Chap. 6] and [24, Theorem 6.1]. The time-changed LHC model remains a linear credit risk model. The background filtration ${\mathbb{F}}$ is now the natural filtration of the process $(Y_{Z},X_{Z})$. Denote by $\Psi (\cdot )$ the Laplace exponent of $Z$ defined by $\mathbb{E}[\exp (-u Z_{t})]=\exp (-t\Psi (u))$. The following proposition shows that the matrix $\bar{A}$ may be computed in closed form.$^{2}$

Proposition 5

Assume that $A=UDU^{-1}$, where $U$ is a unitary matrix and $D$ a diagonal matrix with nonpositive entries. Then $\bar{A} = -U \Psi (-D) U^{-1}$.

In some cases, the expression for $\bar{A}$ simplifies and does not require factoring the matrix $A$ as shown in the following example.

Example 6

Let $Z$ be a gamma process so that $\nu ^{Z}(d\zeta )=\gamma _{Z} \zeta ^{-1} {\mathrm{e}}^{-\lambda _{Z} \zeta }d\zeta $ for some constants $\lambda _{Z},\gamma _{Z}>0$ and $b^{Z}=0$. If the eigenvalues of the matrix $A$ have nonpositive real parts, the drift of the time changed process $(Y_{Z},X_{Z})$ is then equal to

$$ \bar{A} = -\gamma _{Z} \log (\operatorname{\mathrm{Id}}- A \lambda _{Z}^{-1} ), $$

(5.6)

as shown in Appendix A.

Survival processes built from independent LHC models can be time-changed with the same stochastic clock $Z$ in order to generate simultaneous defaults and thus default correlation. Note that the idea of using a time change to generate simultaneous jumps in the cumulative hazard or the survival processes is not new; see for example Mendoza-Arriaga and Linetsky [43] for an earlier contribution where a multi-name unified credit–equity model with simultaneous defaults is developed.

Remark 7

One could use the additive subordinators presented in Li et al. [42] in order to increase the model’s flexibility. These subordinators are time-dependent and may therefore help to better fit term structures, at the cost of introducing additional parameters. In this case, the drift of the factor process $(\bar{Y}, \bar{X})$ remains linear, but the matrix $\bar{A}$ in (5.5) may then be time-dependent and need not have a closed-form representation, which would in turn lead to higher computational costs.

6 Conclusion

The class of linear credit risk models is rich and offers new modelling possibilities. The survival process and its drift are linear in the factor process whose drift is also linear. Consequently, the prices of defaultable bonds, credit default swaps (CDSs) and credit default index swaps (CDISs) become linear–rational expressions in the factors. We introduce and study the single-name linear hypercube (LHC) model which consists of a diffusive factor process with a quadratic diffusion function and taking values in a compact state space. These features are employed to develop an efficient European option pricing methodology. By building upon the LHC model, we construct parsimonious and versatile multi-name models. The setup can accommodate stochastic interest rates correlated with credit spreads by constructing the discount process similarly as a survival process. Jumps in the factor dynamics as well as stochastic clocks can be used to generate simultaneous defaults. An empirical analysis shows that the LHC model can reproduce complex CDS term structure dynamics. We numerically verify that CDS option prices at different moneyness can be accurately approximated for the LHC model. We also show that CDIS option prices and tranche prices on a homogeneous portfolio can be approximated with the same approach. Future research directions include the development of efficient algorithms to price multi-name credit derivatives, and the joint empirical study of single-name and multi-name credit contracts.

Notes

1. The Jacobi process has been used in Delbaen and Shirakawa [15] to model the short rate in which case the risk-free bond prices are given by weighted series of Jacobi polynomials in the short-rate value.
2. We thank an anonymous referee for suggesting this result.

References

1. Ackerer, D., Filipović, D., Pulido, S.: The Jacobi stochastic volatility model. Finance Stoch. 22, 667–700 (2018)
2. Ackerer, D., Vatter, T.: Dependent defaults and losses with factor copula models. Depend. Model. 5, 375–399 (2017)
3. Al-Mohy, A.H., Higham, N.J.: Computing the action of the matrix exponential, with an application to exponential integrators. SIAM J. Sci. Comput. 33, 488–511 (2011)
4. Bielecki, T.R., Jeanblanc, M., Rutkowski, M.: Hedging of defaultable claims. In: Carmona, R.A., et al. (eds.) Paris–Princeton Lectures on Mathematical Finance 2003. Lecture Notes in Math., vol. 1847, pp. 1–132. Springer, Berlin (2004)
5. Bielecki, T.R., Jeanblanc, M., Rutkowski, M.: Pricing and trading credit default swaps in a hazard process model. Ann. Appl. Probab. 18, 2495–2529 (2008)
6. Bielecki, T.R., Jeanblanc, M., Rutkowski, M.: Hedging of a credit default swaption in the CIR default intensity model. Finance Stoch. 15, 541–572 (2011)
7. Bielecki, T.R., Rutkowski, M.: Credit Risk: Modeling, Valuation and Hedging. Springer, Berlin (2002)
8. Brigo, D., Alfonsi, A.: Credit default swap calibration and derivatives pricing with the SSRD stochastic intensity model. Finance Stoch. 9, 29–42 (2005)
9. Brigo, D., Capponi, A., Pallavicini, A.: Arbitrage-free bilateral counterparty risk valuation under collateralization and application to credit default swaps. Math. Finance 24, 125–146 (2014)
10. Brigo, D., El-Bachir, N.: An exact formula for default swaptions’ pricing in the SSRD stochastic intensity model. Math. Finance 20, 365–382 (2010)
11. Brigo, D., Morini, M.: CDS market formulas and models. In: Presentation at the 18th Annual Warwick Options Conference (2005). Available online at www.researchgate.net/publication/228722682_CDS_market_formulas_and_models
12. Cheridito, P., Filipović, D., Kimmel, R.L.: Market price of risk specifications for affine models: theory and evidence. J. Financ. Econ. 83, 123–170 (2007)
13. Cheridito, P., Filipović, D., Yor, M.: Equivalent and absolutely continuous measure changes for jump-diffusion processes. Ann. Appl. Probab. 15, 1713–1732 (2005)
14. Corrado, C.J., Su, T.: Skewness and kurtosis in S&P 500 index returns implied by option prices. J. Financ. Res. 19, 175–192 (1996)
15. Delbaen, F., Shirakawa, H.: An interest rate model with upper and lower bounds. Asia-Pac. Financ. Mark. 9, 191–209 (2002)
16. Di Graziano, G., Rogers, L.: A dynamic approach to the modeling of correlation credit derivatives using Markov chains. Int. J. Theor. Appl. Finance 12, 45–62 (2009)
17. Duarte, J.: Evaluating an alternative risk preference in affine term structure models. Rev. Financ. Stud. 17, 379–404 (2004)
18. Duffee, G.R.: Term premia and interest rate forecasts in affine models. J. Finance 57, 405–443 (2002)
19. Duffie, D., Singleton, K.J.: Modeling term structures of defaultable bonds. Rev. Financ. Stud. 12, 687–720 (1999)
20. Duffie, D., Singleton, K.J.: Credit Risk: Pricing, Measurement, and Management. Princeton University Press, Princeton (2003)
21. Elliott, R.J., Jeanblanc, M., Yor, M.: On models of default risk. Math. Finance 10, 179–195 (2000)
22. Filipović, D.: Term-Structure Models: A Graduate Course. Springer, Berlin (2009)
23. Filipović, D., Larsson, M.: Polynomial diffusions and applications in finance. Finance Stoch. 20, 931–972 (2016)
24. Filipović, D., Larsson, M.: Polynomial jump-diffusion models. Swiss Finance Institute Research Paper No. 17-60 (2017). Available online at https://ssrn.com/abstract=3075520
25. Filipović, D., Larsson, M., Trolle, A.B.: Linear-rational term structure models. J. Finance 72, 655–704 (2017)
26. Filipović, D., Mayerhofer, E., Schneider, P.: Density approximations for multivariate affine jump-diffusion processes. J. Econom. 176, 93–111 (2013)
27. Gabaix, X.: Linearity-generating processes: a modelling tool yielding closed forms for asset prices. NBER Working Paper No. 13430 (2007). Available online at https://ssrn.com/abstract=1293140
28. Gaß, M., Glau, K., Mahlstedt, M., Mair, M.: Chebyshev interpolation for parametric option pricing. Finance Stoch. 22, 701–731 (2018)
29. Gourieroux, C., Jasiak, J.: Multivariate Jacobi process with application to smooth transitions. J. Econom. 131, 475–505 (2006)
30. Higham, N.J.: Functions of Matrices: Theory and Computation. SIAM, Philadelphia (2008)
31. Hull, J.C., White, A.D.: The pricing of options on assets with stochastic volatilities. J. Finance 42, 281–300 (1987)
32. Hull, J.C., White, A.D.: The valuation of credit default swap options. J. Deriv. 10(3), 40–50 (2003)
33. Jacod, J., Shiryaev, A.N.: Limit Theorems for Stochastic Processes, 2nd edn. Springer, Berlin (2003)
34. Jamshidian, F.: Valuation of credit default swaps and swaptions. Finance Stoch. 8, 343–371 (2004)
35. Jarrow, R., Rudd, A.: Approximate option valuation for arbitrary stochastic processes. J. Financ. Econ. 10, 347–369 (1982)
36. Karatzas, I., Shreve, S.E.: Brownian Motion and Stochastic Calculus, 2nd edn. Graduate Texts in Mathematics. Springer, Berlin (1991)
37. Kokholm, T., Nicolato, E.: Sato processes in default modelling. Appl. Math. Finance 17, 377–397 (2010)
38. Lando, D.: On Cox processes and credit risky securities. Rev. Deriv. Res. 2, 99–120 (1998)
39. Lando, D.: Credit Risk Modeling: Theory and Applications. Princeton University Press, Princeton (2009)
40. Laurent, J.-P., Gregory, J.: Basket default swaps, CDOs and factor copulas. J. Risk 7, 103–122 (2005)
41. Li, D.X.: On default correlation: a copula function approach. J. Fixed Income 9, 43–54 (2000)
42. Li, J., Li, L., Mendoza-Arriaga, R.: Additive subordination and its applications in finance. Finance Stoch. 20, 589–634 (2016)
43. Mendoza-Arriaga, R., Linetsky, V.: Multivariate subordination of Markov processes with financial applications. Math. Finance 26, 699–747 (2016)
44. Peng, X.H., Kou, S.S.: Connecting the top-down to the bottom-up: pricing CDO under a conditional survival model. In: Mason, S., et al. (eds.) Proceedings of the 40th Conference on Winter Simulation, pp. 578–586. IEEE Press, New York (2008)
45. Rudin, W.: Functional Analysis. McGraw-Hill, New York (1974)
46. Sato, K-i.: Lévy Processes and Infinitely Divisible Distributions. Cambridge University Press, Cambridge (1999)
47. Schönbucher, P.J.: A measure of survival. Risk 17, 79–85 (2004)
48. Schönbucher, P.J.: A Libor market model with default risk. Preprint (2000). Available online at https://ssrn.com/abstract=261051
49. Schönbucher, P.J., Schubert, D.: Copula-dependent default risk in intensity models (2001). Preprint, available online at https://ssrn.com/abstract=301968
50. Sidje, R.B.: Expokit: A software package for computing matrix exponentials. ACM Trans. Math. Softw. (TOMS) 24, 130–156 (1998)
51. Sun, Y., Mendoza-Arriaga, R., Linetsky, V.: Marshall–Olkin distributions, subordinators, efficient simulation, and applications to credit risk. Adv. Appl. Probab. 49, 481–514 (2017)
52. White, R.: The pricing and risk management of credit default swaps, with a focus on the ISDA model. OpenGamma Quantitative Research No. 16 (2013). Available online at https://developers.opengamma.com/quantitative-research


Acknowledgements

The authors would like to thank for useful comments Agostino Capponi, David Lando, Martin Larsson, Jongsub Lee, Andrea Pallavicini, Sander Willems and two anonymous referees, as well as participants from the 2015 AMaMeF and Swissquote conference in Lausanne, the 2016 Bachelier World Congress in New York, the 2016 EFA Annual Meeting in Oslo, the 2016 AFFI Paris December meeting and the 2017 CIB workshop “Dynamical Models in Finance”.

Author information

Authors and Affiliations

Swissquote Bank, Chemin de la Crétaux 33, 1196, Gland, Switzerland
Damien Ackerer
EPFL and Swiss Finance Institute, Quartier UNIL-Dorigny, Extranef 218, 1015, Lausanne, Switzerland
Damir Filipović


Corresponding author

Correspondence to Damien Ackerer.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The research leading to these results has received funding from the European Research Council under the European Union’s Seventh Framework Programme (FP/2007-2013)/ERC Grant Agreement n. 307465-POLYTE.

Appendices

Appendix A: Proofs

This appendix contains the proofs of all theorems and propositions in the main text.

Proof of (2.4)

This follows as in [25, Lemma 3]. □

Proof of Example 2.3

The autonomous process $X$ admits a solution which takes values in $[-e^{-\epsilon t}, e^{-\epsilon t}]$ at time $t$ with $\epsilon >0$ and $X_{0}\in [-1,1]$ if and only if $\kappa >\epsilon $; see [23, Theorem 5.1]. The two coordinates of $Y$ are bounded below by $X$. Indeed, we have for $i=1,2$ that

$$ \frac{dY_{it}}{dt} = -\frac{\epsilon }{2}(Y_{it} \pm X_{t}) \ge -\frac{ \epsilon }{2}(Y_{it} + \mathrm{e}^{-\epsilon t}), \qquad t \geq 0. $$

The solution of $dZ_{t} = -(\epsilon /2)(Z_{t} + \mathrm{e}^{-\epsilon t})\,dt$ with $Z_{0}=1$ is given by ${Z_{t}=\mathrm{e}^{-\epsilon t}}$, ${t\ge 0}$, which proves that $Y_{it}\ge Z_{t} \ge |X_{t}|$ for $i=1,2$. Finally, by applying Itô’s lemma, we obtain

$$ \frac{d\langle \lambda ^{1},\lambda ^{2}\rangle _{t}}{dt} = - \frac{ \epsilon ^{2}}{4} \frac{\sigma ^{2}(\mathrm{e}^{-\epsilon t}-X_{t})( \mathrm{e}^{-\epsilon t}+X_{t})}{Y_{1t}Y_{2t}}, $$

which is negative with positive probability. The dynamics of $\lambda ^{i}$ is given by

$$\begin{aligned} d\lambda ^{i}_{t} & = ({\epsilon ^{2}}/{4}) \big(\pm (1 - 2\kappa / \epsilon ) ({X_{t}}/{Y_{it}}) + ({X_{t}}/{Y_{it}})^{2}\big)\, dt \pm dM _{it} \\ & = \big( ({\epsilon }/{2}) (1 - 2\kappa /\epsilon ) (\lambda ^{i} _{t} - \epsilon /2) + (\lambda ^{i}_{t} - \epsilon /2)^{2} \big)\, dt \pm dM_{it}, \end{aligned}$$

where $dM_{it} = \epsilon \sigma /(2Y_{it})\sqrt{(\mathrm{e}^{- \epsilon t}-X_{t})(\mathrm{e}^{-\epsilon t}+X_{t})}\, dW_{t}$ and $\kappa >\epsilon $. The quadratic drift of $\lambda ^{i}$ has two positive roots $\kappa $ and $\epsilon /2$, is positive at zero and negative at $\epsilon $. Since $\kappa >\epsilon $, this shows that $\lambda ^{i}$ mean-reverts towards $\epsilon /2$ for $i=1,2$. □

Proof of Proposition 2.4

Proposition 2.4 is an immediate consequence of (2.4) and the following lemma. □

Lemma A.1

Let $Y$ be a nonnegative ${\mathcal{F}}_{\infty }$-measurable random variable. Then for any time ${t\le t_{M}<\infty }$, we have

$$ {\mathbb{E}}[ 1_{\{\tau >t_{M}\}} Y \, | \, {\mathcal{G}}_{t}] = 1_{\{\tau >t\}} \frac{1}{S_{t}} {\mathbb{E}}[ S_{t_{M}} Y \, | \, {\mathcal{F}}_{t}]. $$

Note that $t_{M}<\infty $ is essential unless we assume that $S_{\infty }= 0$.

Lemma A.1 follows from [7, Corollary 5.1.1]. For the convenience of the reader, we provide here a sketch of its proof. As in [7, Lemma 5.1.2], one can show that for any nonnegative random variable $Z$, we have

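$$ {\mathbb{E}}[ 1_{\{\tau >t\}} Z \, | \, {\mathcal{H}}_{t}\vee {\mathcal{F}}_{t}] = 1_{\{\tau >t\}} \frac{1}{S_{t}} {\mathbb{E}}[ 1_{\{\tau >t\}} Z \, | \, {\mathcal{F}}_{t}]. $$

Setting $Z = 1_{\{\tau >t_{M}\}} Y$, we can now derive

$$\begin{aligned} {\mathbb{E}}[ 1_{\{\tau >t_{M}\}} Y \, | \, {\mathcal{G}}_{t}] & = {\mathbb{E}}[ 1_{\{\tau >t\}} Y 1_{\{\tau >t_{M}\}} \, | \, {\mathcal{G}}_{t}] = 1_{\{\tau >t\}} \frac{1}{S_{t}} {\mathbb{E}}[ 1_{\{\tau >t_{M}\}} Y \, | \, {\mathcal{F}}_{t}] \\ & = 1_{\{\tau >t\}} \frac{1}{S_{t}} {\mathbb{E}}\big[ {\mathbb{E}}[ 1_{\{\tau >t_{M}\}} \, | \, {\mathcal{F}}_{\infty }] Y \, \big| \, {\mathcal{F}}_{t}\big] \\ & = 1_{\{\tau >t\}} \frac{1}{S_{t}} {\mathbb{E}}[ S_{t_{M}} Y \, | \, {\mathcal{F}}_{t}]. \end{aligned}$$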

□

Proof of Proposition 2.5

The subsequent proofs build on the following lemma that follows from [7, Proposition 5.1.1]. □

Lemma A.2

Let $Z$ be a bounded ${\mathbb{F}}$-predictable process. For any ${t\le {t_{M}}<\infty }$, we have

$$ {\mathbb{E}}[1_{\{t < \tau \le t_{M}\}} Z_{\tau } \mid {\mathcal{G}}_{t}] = 1_{\{t < \tau \}} \frac{1}{S_{t}}\, {\mathbb{E}}\bigg[\int _{(t,t_{M}]} -Z_{u}\, dS_{u} \,\bigg|\, {\mathcal{F}}_{t}\bigg]. $$

Note that ${t_{M}}<\infty $ is essential unless we assume that $S_{\infty }= 0$.

We can now proceed to the proof of Proposition 2.5. The value of the contingent cash flow is given by the expression

$$ C_{D}(t,t_{M}) = {\mathbb{E}}[\mathrm{e}^{-r(\tau -t)} 1_{\{t \le \tau \le t_{M}\}} \mid {\mathcal{G}}_{t}]. $$

By applying Lemma A.2, we get

$$\begin{aligned} C_{D}(t,t_{M}) &= \frac{1_{\{\tau > t\}}}{S_{t}}\, {\mathbb{E}}\bigg[\int _{t}^{t_{M}} -\mathrm{e}^{-r(s-t)}\, dS_{s} \,\bigg|\, {\mathcal{F}}_{t}\bigg] \\ &= \frac{1_{\{\tau > t\}}}{S_{t}} \int _{t}^{t_{M}} \mathrm{e}^{-r(s-t)}\, {\mathbb{E}}[-a^{\top }(c Y_{s} + \gamma X_{s}) \mid {\mathcal{F}}_{t}]\, ds \\ &= \frac{1_{\{\tau > t\}}}{S_{t}} \int _{t}^{t_{M}} \mathrm{e}^{-r(s-t)} \big(-a^{\top }\begin{pmatrix} c & \gamma \end{pmatrix}\big) \mathrm{e}^{A(s-t)} \begin{pmatrix} Y_{t} \\ X_{t} \end{pmatrix} ds, \end{aligned}$$

where the second equality comes from the fact that $\int {\mathrm{e}} ^{-ru}\,dM^{S}_{u}$ is a martingale. The third equality follows from (2.4). □

Proof of Corollary 2.6

The value of this contingent bond is given by

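$$ C_{D_{*}}(t,t_{M}) = {\mathbb{E}}[\tau \mathrm{e}^{-r(\tau -t)} 1_{\{t < \tau \le t_{M}\}} \mid {\mathcal{G}}_{t}] = \frac{1_{\{\tau > t\}}}{S_{t}}\, {\mathbb{E}}\bigg[\int _{t}^{t_{M}} -s\, \mathrm{e}^{-r(s-t)}\, dS_{s} \,\bigg|\, {\mathcal{F}}_{t}\bigg], $$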

and the result follows as in the proof of Proposition 2.5. □

Proof of Lemma 2.8

Observe that for any matrix $A$ and real $r$, we have $\mathrm{e}^{r}{\mathrm{e}}^{A}=\mathrm{e} ^{\operatorname{diag}(r)+A}$ and that, for invertible $A$, the integral of the matrix exponential can be computed in closed form as

$$\begin{aligned} \int _{0}^{u} {\mathrm{e}}^{As}\, ds & = \int _{0}^{u} \bigg(I + As + A ^{2} \frac{s^{2}}{2}+ \cdots \bigg)\, ds \\ &= Iu + A\frac{u^{2}}{2} + A^{2} \frac{u^{3}}{6}+ \cdots = A^{-1}( \mathrm{e}^{Au} -I ). \end{aligned}$$

By a change of variable $u=s-t$, we obtain

$$ \int _{t}^{t_{M}} s \mathrm{e}^{A_{*}(s-t)}\,ds = \int _{0}^{{t_{M}}-t}u \mathrm{e}^{A_{*}u}\,du + t \int _{0}^{{t_{M}}-t}{\mathrm{e}}^{A_{*}u}\,du, $$

where the second term on the right-hand side is given in Proposition 2.5. The first term can be derived using integration by parts as

$$\begin{aligned} \int _{0}^{{t_{M}}-t} u\mathrm{e}^{A_{*}u}\,du &= ({t_{M}}-t) A_{*}^{-1} {\mathrm{e}}^{A_{*}({t_{M}}-t)} - A_{*}^{-1} A_{*}^{-1} (\mathrm{e} ^{A_{*}({t_{M}}-t)} - I). \end{aligned}$$

□
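The two closed-form integrals used in this proof can be checked numerically. The following Python sketch is illustrative only: the $2\times 2$ matrix `A`, the horizon `T` and all helper names are hypothetical choices that do not come from the paper, and the matrix exponential is approximated by a truncated Taylor series rather than a production routine. It compares $A^{-1}(\mathrm{e}^{AT}-I)$ and $T A^{-1}\mathrm{e}^{AT} - A^{-1}A^{-1}(\mathrm{e}^{AT}-I)$ with a composite Simpson quadrature.

```python
def eye(n):
    """Identity matrix as nested lists."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def matmul(A, B):
    """Product of two square matrices."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def axpy(A, B, s=1.0):
    """Entrywise A + s * B."""
    return [[a + s * b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def scale(A, s):
    """Entrywise s * A."""
    return [[s * a for a in row] for row in A]

def expm(A, terms=40):
    """Truncated Taylor series for e^A; adequate for small matrix norms."""
    result = term = eye(len(A))
    for k in range(1, terms):
        term = scale(matmul(term, A), 1.0 / k)
        result = axpy(result, term)
    return result

def inv2(A):
    """Inverse of an invertible 2x2 matrix."""
    (a, b), (c, d) = A
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def int_exp(A, T):
    """Closed form of int_0^T e^{Au} du = A^{-1} (e^{AT} - I)."""
    return matmul(inv2(A), axpy(expm(scale(A, T)), eye(2), -1.0))

def int_u_exp(A, T):
    """Closed form of int_0^T u e^{Au} du obtained by integration by parts."""
    Ainv, E = inv2(A), expm(scale(A, T))
    first = scale(matmul(Ainv, E), T)
    second = matmul(Ainv, matmul(Ainv, axpy(E, eye(2), -1.0)))
    return axpy(first, second, -1.0)

def simpson(f, T, n=200):
    """Composite Simpson rule for a matrix-valued integrand on [0, T]."""
    h = T / n
    total = axpy(f(0.0), f(T))
    for k in range(1, n):
        total = axpy(total, f(k * h), 4.0 if k % 2 else 2.0)
    return scale(total, h / 3.0)

def max_abs_diff(A, B):
    """Largest entrywise absolute difference."""
    return max(abs(a - b) for ra, rb in zip(A, B) for a, b in zip(ra, rb))

A = [[-0.03, -0.25], [0.10, -0.53]]  # hypothetical stand-in for A_*
T = 5.0                              # hypothetical horizon t_M - t
err0 = max_abs_diff(int_exp(A, T), simpson(lambda u: expm(scale(A, u)), T))
err1 = max_abs_diff(int_u_exp(A, T),
                    simpson(lambda u: scale(expm(scale(A, u)), u), T))
print(err0, err1)  # both discrepancies should be at quadrature-error level
```

In practice one would replace the hand-written helpers by a linear algebra library; they are spelled out here only to keep the check self-contained.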

Proof of Proposition 2.9

The calculation of the coupon part $V^{i}_{\mathrm{coup}}(t,t_{0},t_{M})$ and the protection leg $V^{i}_{\mathrm{prot}}(t,t_{0},t_{M})$ follows from Propositions 2.4 and 2.5, respectively. The accrued interest $V^{i}_{\mathrm{ai}}(t,t_{0},t_{M})$ is given by the sum of contingent cash flows and of weighted zero-recovery coupon bonds, and thus its calculation follows from Proposition 2.5 and Corollary 2.6. The series of contingent cash flows is in fact equal to a single contingent payment paying $\tau $ at default, so that

$$ C_{D_{*}}(t,t_{M}) = \sum _{j=1}^{M} {\mathbb{E}}[\tau \mathrm{e}^{-r(\tau -t)} 1_{\{t_{j-1} < \tau \le t_{j}\}} \mid {\mathcal{G}}_{t}] = {\mathbb{E}}[\tau \mathrm{e}^{-r(\tau -t)} 1_{\{t < \tau \le t_{M}\}} \mid {\mathcal{G}}_{t}]. $$

Using the identity $1_{\{t_{j-1} < \tau \le t_{j}\}} = 1_{\{\tau > t_{j-1}\}} - 1_{\{\tau > t_{j}\}}$, we obtain that the second term of $V^{i}_{\mathrm{ai}}(t,t_{0},t_{M})$ is given by

$$\begin{aligned} - \sum _{j=1}^{M} {\mathbb{E}}[\mathrm{e}^{-r(\tau -t)} t_{j-1} 1_{\{t_{j-1} < \tau \le t_{j}\}} \mid {\mathcal{G}}_{t}] &= -\sum _{j=1}^{M} t_{j-1} \big(C_{D}(t,t_{j}) - C_{D}(t,t_{j-1})\big) \\ &= -t_{M-1} C_{D}(t,t_{M}) + t_{0} C_{D}(t,t_{0}) \\ &\quad {}+ \sum _{j=1}^{M-1} (t_{j} - t_{j-1}) C_{D}(t,t_{j}). \end{aligned}$$

□

Proof of Proposition 2.11

The conditional characteristic function of $N_{u}$ is given by

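$$\begin{aligned} \phi (t,\xi ) &= {\mathbb{E}}[\exp (\mathrm{i}\xi N_{u}) \mid {\mathcal{F}}_{\infty } \vee {\mathcal{G}}_{t}] \\ &= {\mathbb{E}}\bigg[\exp \bigg(\mathrm{i}\xi \sum _{i=1}^{N} 1_{\{\tau _{i} \le u\}}\bigg) \,\bigg|\, {\mathcal{F}}_{\infty } \vee {\mathcal{G}}_{t}\bigg] \\ &= {\mathbb{E}}\bigg[\prod _{i=1}^{N} \big(1_{\{\tau _{i} > u\}} + \mathrm{e}^{\mathrm{i}\xi } (1 - 1_{\{\tau _{i} > u\}})\big) \,\bigg|\, {\mathcal{F}}_{\infty } \vee {\mathcal{G}}_{t}\bigg] \\ &= \prod _{i=1}^{N} \bigg(\frac{1_{\{\tau _{i} > t\}}}{S^{i}_{t}} \big(S^{i}_{u} + \mathrm{e}^{\mathrm{i}\xi } (S^{i}_{t} - S^{i}_{u})\big) + 1_{\{\tau _{i} \le t\}} \mathrm{e}^{\mathrm{i}\xi }\bigg) \\ &= \prod _{i=1}^{N} \bigg(\mathrm{e}^{\mathrm{i}\xi } + 1_{\{\tau _{i} > t\}} (1 - \mathrm{e}^{\mathrm{i}\xi }) \frac{S^{i}_{u}}{S^{i}_{t}}\bigg), \end{aligned}$$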

where the third equality follows from [7, Lemma 9.1.3], which gives the expression

$$ {\mathbb{E}}[1_{\{\tau _{1} > t_{0}, \dots , \tau _{N} > t_{0}\}} \mid {\mathcal{F}}_{t_{0}} \vee {\mathcal{G}}_{t}] = \prod _{i=1}^{N} 1_{\{\tau _{i} > t\}} \frac{S^{i}_{t_{0}}}{S^{i}_{t}}. $$

(A.1)

The expression (2.16) then directly follows by applying the discrete Fourier transform; see [2, Sect. 3] for more details. □
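To illustrate the Fourier step, the following Python sketch recovers the conditional distribution of the number of defaults $N_{u}$ from the product form of $\phi (t,\xi )$ by a discrete Fourier inversion on $\{0,\dots ,N\}$. It is a minimal sketch under stated assumptions: the survival ratios $S^{i}_{u}/S^{i}_{t}$, the alive flags and all function names are hypothetical inputs, and the inversion is the textbook formula rather than a reproduction of (2.16). A direct convolution of Bernoulli distributions serves as a cross-check.

```python
import cmath
import math

def char_fn(xi, ratios, alive):
    """Product form of phi(t, xi): each factor is e^{i xi} + 1_{alive} (1 - e^{i xi}) S_u / S_t."""
    e = cmath.exp(1j * xi)
    phi = 1.0 + 0.0j
    for p, is_alive in zip(ratios, alive):
        phi *= e + (1.0 - e) * p if is_alive else e
    return phi

def default_count_pmf(ratios, alive):
    """Invert phi on the grid xi_k = 2 pi k / (N + 1) to obtain P(N_u = n), n = 0..N."""
    M = len(ratios) + 1
    phis = [char_fn(2.0 * math.pi * k / M, ratios, alive) for k in range(M)]
    pmf = []
    for n in range(M):
        total = sum(phis[k] * cmath.exp(-2j * math.pi * k * n / M) for k in range(M))
        pmf.append((total / M).real)
    return pmf

def default_count_pmf_direct(ratios, alive):
    """Cross-check: convolve the conditionally independent default indicators."""
    pmf = [1.0]
    for p, is_alive in zip(ratios, alive):
        q = 1.0 - p if is_alive else 1.0  # probability of default by time u
        new = [0.0] * (len(pmf) + 1)
        for n, w in enumerate(pmf):
            new[n] += w * (1.0 - q)
            new[n + 1] += w * q
        pmf = new
    return pmf

ratios = [0.95, 0.90, 0.80]   # hypothetical S_u^i / S_t^i
alive = [True, True, False]   # third name has already defaulted at time t
pmf_fft = default_count_pmf(ratios, alive)
pmf_ref = default_count_pmf_direct(ratios, alive)
print(pmf_fft)
print(pmf_ref)
```

The inversion uses $N+1$ evaluations of $\phi $, so the cost is driven by the product over names rather than by an enumeration of default scenarios.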

Proof of Proposition 2.12

The payoff at time $t_{0}$ of the CDIS option can always be decomposed into $2^{N}$ terms by conditioning on all the possible default events via writing

$$ q(\alpha ) = \prod _{i=1}^{N} \Big( (1_{\{\tau _{i} > t_{0}\}})^{\alpha _{i}} + (1_{\{\tau _{i} \le t_{0}\}})^{1-\alpha _{i}} \Big) $$

(A.2)

for $\alpha \in \{0,1\}^{N}$ and with the convention $0^{0}=0$, so that the payoff function can be rewritten as

$$\begin{aligned} &\bigg(\sum _{i=1}^{N} \frac{1_{\{\tau _{i} > t_{0}\}}}{S^{i}_{t_{0}}}\, \psi ^{i}_{\mathrm{cds}}(t_{0},t_{0},t_{M},k)^{\top } \begin{pmatrix} Y_{t_{0}} \\ X_{t_{0}} \end{pmatrix} + (1-\delta ) 1_{\{\tau _{i} \le t_{0}\}}\bigg)^{+} \\ &\quad = \sum _{\alpha \in \{0,1\}^{N}} \bigg(\sum _{i=1}^{N} \frac{\alpha _{i}}{S^{i}_{t_{0}}}\, \psi ^{i}_{\mathrm{cds}}(t_{0},t_{0},t_{M},k)^{\top } \begin{pmatrix} Y_{t_{0}} \\ X_{t_{0}} \end{pmatrix} + (1-\delta )(1-\alpha _{i})\bigg)^{+} q(\alpha ). \end{aligned}$$

We can apply [7, Lemma 9.1.3] to compute the probability (A.1) so that by writing (A.2) as a linear combination of indicator functions, we obtain

$$ q(\alpha ,t,t_{0}) = {\mathbb{E}}[q(\alpha ) \mid {\mathcal{F}}_{t_{0}} \vee {\mathcal{G}}_{t}] = \prod _{i=1}^{N} \bigg(\frac{(S^{i}_{t_{0}})^{\alpha _{i}} (S^{i}_{t} - S^{i}_{t_{0}})^{1-\alpha _{i}}}{S^{i}_{t}} 1_{\{\tau _{i} > t\}} + (1_{\{\tau _{i} \le t\}})^{1-\alpha _{i}}\bigg), $$

which completes the proof. □
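The scenario decomposition can be made concrete with a short Python sketch. It is an illustration under assumptions, not the paper's implementation: it reads $q(\alpha ,t,t_{0})$ as the conditional probability of the survival pattern $\alpha $ (a name alive at $t$ survives to $t_{0}$ with probability $S^{i}_{t_{0}}/S^{i}_{t}$, a name already in default has $\alpha _{i}=0$ with probability one), and it takes the per-name CDS values at $t_{0}$ as given numbers. All names and inputs below are hypothetical.

```python
from itertools import product

def scenario_weight(alpha, surv_ratio, alive):
    """Conditional probability of the survival pattern alpha, given survival ratios."""
    w = 1.0
    for a, p, is_alive in zip(alpha, surv_ratio, alive):
        if is_alive:
            w *= p if a == 1 else 1.0 - p
        else:
            w *= 0.0 if a == 1 else 1.0
    return w

def expected_payoff(cds_values, surv_ratio, alive, recovery):
    """Sum over the 2^N patterns of (positive part of payoff) times scenario weight."""
    total = 0.0
    for alpha in product((0, 1), repeat=len(cds_values)):
        inner = sum(a * v + (1.0 - recovery) * (1 - a)
                    for a, v in zip(alpha, cds_values))
        total += max(inner, 0.0) * scenario_weight(alpha, surv_ratio, alive)
    return total

cds_values = [0.02, -0.01, 0.0]   # hypothetical per-name CDS values at t_0
surv_ratio = [0.95, 0.90, 0.80]   # hypothetical S_{t_0}^i / S_t^i
alive = [True, True, False]       # third name defaulted before t
weights = [scenario_weight(al, surv_ratio, alive)
           for al in product((0, 1), repeat=3)]
value = expected_payoff(cds_values, surv_ratio, alive, recovery=0.4)
print(sum(weights), value)
```

The result is still conditional on the factor values at $t_{0}$; the remaining expectation over $(Y_{t_{0}},X_{t_{0}})$ is not addressed here. The enumeration grows as $2^{N}$, so this direct form is only practical for small portfolios.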

Proof of Theorem 3.1

We define the bounded continuous map ${({\mathcal{Y}},{\mathcal{X}})\!:\!{\mathbb{R}}^{1+m}\to \!{\mathbb{R}}^{1+m}}$ by

$$ {\mathcal{Y}}(y,x)=y^{+}\wedge 1,\qquad {\mathcal{X}}_{i}(y,x)=x_{i}^{+}\wedge y^{+}\wedge 1,\quad i=1, \dots ,m, $$

so that $({\mathcal{Y}},{\mathcal{X}})(y,x)=(y,x)$ on $E$. In a similar vein, extend the dispersion matrix $\Sigma (y,x)$ to a bounded continuous mapping $\Sigma (({\mathcal{Y}},{\mathcal{X}})(y,x))$ on ${\mathbb{R}}^{1+m}$. The stochastic differential equation (3.1) then extends to ${\mathbb{R}}^{1+m}$ by

$$ \begin{aligned} dY_{t} & = -\gamma ^{\top }{\mathcal{X}}(Y_{t},X_{t}) \,dt, \\ dX_{t} & = \big( b {\mathcal{Y}}(Y_{t})+ \beta {\mathcal{X}}(Y_{t},X_{t}) \big)\, dt + \Sigma \big(({\mathcal{Y}},{\mathcal{X}})(Y_{t},X_{t}) \big)\,dW_{t}. \end{aligned} $$

(A.3)

Since drift and dispersion of (A.3) are bounded and continuous on ${\mathbb{R}}^{1+m}$, there exists a weak solution $(Y,X)$ of (A.3) for any initial law of $(Y_{0},X_{0})$ with support in $E$; see [36, Theorem V.4.22].

We now show that any weak solution $(Y,X)$ of (A.3) with $(Y_{0},X_{0})\in E$ stays in $E$, i.e.,

$$ (Y_{t},X_{t})\in E\qquad \mbox{for all}\ t\ge 0. $$

(A.4)

To this end, for $i=1,\dots ,m$, note that

$$ \Sigma _{ii}\big(({\mathcal{Y}},{\mathcal{X}})(y,x)\big)= 0\qquad \mbox{for all}\ (y,x)\ \mbox{with}\ x_{i}\le 0\ \mbox{or}\ x_{i}\ge y. $$

(A.5)

Condition (3.3) implies that

$$ \big( b {\mathcal{Y}}(y)+ \beta {\mathcal{X}}(y,x) \big)_{i} \ge 0\qquad \mbox{for all}\ (y,x)\ \mbox{with}\ x_{i}\le 0. $$

(A.6)

For $\delta ,\epsilon >0$, we define

$$ \tau _{\delta ,\epsilon }=\inf \{ t\ge 0 : X_{it}\le -\epsilon \ \mbox{and}\ -\epsilon < X_{is}< 0\ \mbox{for all}\ s\in [t-\delta ,t)\} . $$

Then on $\{\tau _{\delta ,\epsilon }<\infty \}$, we have, in view of (A.5) and (A.6), that

$$ 0> X_{i\tau _{\delta ,\epsilon }} - X_{i\tau _{\delta ,\epsilon }- \delta } = \int _{\tau _{\delta ,\epsilon }-\delta }^{ \tau _{\delta ,\epsilon }} \big(b {\mathcal{Y}}(Y_{u})+ \beta {\mathcal{X}}(Y _{u},X_{u}) \big)_{i} \,du \ge 0, $$

which is absurd. Hence $\tau _{\delta ,\epsilon }=\infty $ a.s. and therefore $X_{it}\ge 0$ for all $t\ge 0$. Similarly, condition (3.4) implies that

$$ -\gamma ^{\top }{\mathcal{X}}(y,x)-\big(b {\mathcal{Y}}(y)+ \beta {\mathcal{X}}(y,x)\big)_{i}\ge 0\qquad \text{for all}\ (y,x)\ \mbox{with}\ x_{i}\ge y. $$

(A.7)

Using the same argument as above for $Y_{t}-X_{it}$ instead of $X_{it}$, and (A.7) instead of (A.6), we see that $Y_{t}-X_{it}\ge 0$ for all $t\ge 0$. Note that $0\le \gamma ^{\top }{\mathcal{X}}(y,x)\le \gamma ^{\top }\mathbf{1} y^{+}$ for all $(y,x)$, and thus $1\ge Y_{t}\ge {\mathrm{e}}^{-\gamma ^{ \top }\mathbf{1} t}>0$ for all $t\ge 0$. This proves (A.4) and thus the existence of an $E$-valued solution of (3.1).

Uniqueness in law of the $E$-valued solution $(Y,X)$ of (3.1) follows from [23, Theorem 4.2] and the fact that $E$ is relatively compact.

The boundary non-attainment conditions (3.5), (3.6) follow from [23, Theorem 5.7(i) and (ii)] for the polynomials $p(y,x)=x_{i}$ and $y-x_{i}$, for $i=1,\dots ,m$. □

Proof of Lemma 4.1

The matrix $A_{*}$ in the LHCC model is given by

$$ A_{*} = \begin{pmatrix} -r & -\gamma _{1} & 0 & 0 & \\ 0 & -(\kappa _{1} + r) & \kappa _{1}\theta _{1} & 0 &\vdots \\ \vdots & & & \ddots & \\ \kappa _{m}\theta _{m} & & & 0 & - (\kappa _{m}+r) \end{pmatrix} , $$

and its determinant is therefore equal to

$$\begin{aligned} \begin{aligned} |A_{*}| &= -r \begin{vmatrix} -(\kappa _{1} + r) & \kappa _{1}\theta _{1} & 0 &\vdots \\ \vdots & & \ddots & \\ 0 & & 0& - (\kappa _{m}+r) \end{vmatrix} \\ &\quad {}+ (-1)^{m} \begin{vmatrix} -\gamma _{1} & 0 & 0 & \\ -(\kappa _{1} + r) & \kappa _{1}\theta _{1} & 0 &\vdots \\ \vdots & & \ddots & \\ 0 & & - (\kappa _{m}+r) & \kappa _{m} \theta _{m} \end{vmatrix}. \end{aligned} \end{aligned}$$

With $r>0$, the first term on the right-hand side is nonzero with sign equal to $(-1)^{1+m}$, and the second term also has sign $(-1)^{1+m}$. This is because the determinant of a triangular matrix is equal to the product of its diagonal elements. As a result, the two terms cannot cancel and the determinant of $A_{*}$ is nonzero, which concludes the proof. □
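As a sketch, and assuming the chain structure of the LHCC drift (diagonal entries $-r$ and $-(\kappa _{i}+r)$, the entry $-\gamma _{1}$ in the first row, $\kappa _{i}\theta _{i}$ on the superdiagonal for $i<m$, and $\kappa _{m}\theta _{m}$ in the lower-left corner), the cofactor expansion along the first column can be written out explicitly:

$$ |A_{*}| = (-1)^{1+m}\bigg( r \prod _{i=1}^{m} (\kappa _{i} + r) + \gamma _{1} \prod _{i=1}^{m} \kappa _{i}\theta _{i} \bigg). $$

With $r>0$ and nonnegative parameters $\gamma _{1},\kappa _{i},\theta _{i}$, the bracket is strictly positive. For $m=1$, the expression reduces to $|A_{*}| = r(\kappa _{1}+r) + \gamma _{1}\kappa _{1}\theta _{1}$, which agrees with the determinant of the $2\times 2$ matrix computed directly.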

Proof of (4.4)

For $i=1,\dots ,m$, we have that $d(1/Y_{t}) = (\gamma _{1} Z_{1t}/Y_{t})\,dt$ for all $t\ge 0$. The dynamics of $Z$ is thus given by

$$ dZ_{it} = (\kappa _{i} \theta _{i} Z_{(i+1)t} - \kappa _{i} Z_{it} + \gamma _{1} Z_{1t} Z_{it})\,dt + \sigma _{i}\sqrt{Z_{it}(1-Z_{it})}\,dW _{it} $$

for $i=1,\dots ,m-1$ and

$$ dZ_{mt} = (\kappa _{m} \theta _{m} - \kappa _{m} Z_{mt} + \gamma _{1} Z _{1t} Z_{mt})\,dt + \sigma _{m}\sqrt{Z_{mt}(1-Z_{mt})}\,dW_{mt}. $$

Fixing $Z_{1t}=\bar{\mu }_{1t}$ and solving for the value of $Z_{mt}$ which cancels its drift, we obtain

$$ \bar{\mu }_{mt} = \frac{-\kappa _{m}\theta _{m}}{\bar{\mu }_{1t} \gamma _{1} - \kappa _{m}}, $$

and solving recursively for $i=m-1,\dots ,1$ gives (4.4). □
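The recursive step can be sketched as follows. This is a reading of the argument under the assumption that the same zero-drift condition is imposed on each $Z_{it}$ with $Z_{1t}$ held at $\bar{\mu }_{1t}$ in the quadratic term; it is not a reproduction of the displayed form of (4.4). For $i=m-1,\dots ,1$, setting the drift of $Z_{it}$ to zero at $Z_{(i+1)t}=\bar{\mu }_{(i+1)t}$ gives

$$ \bar{\mu }_{it} = \frac{\kappa _{i}\theta _{i}\, \bar{\mu }_{(i+1)t}}{\kappa _{i} - \gamma _{1}\bar{\mu }_{1t}}, \qquad \text{so that}\qquad \bar{\mu }_{it} = \prod _{j=i}^{m} \frac{\kappa _{j}\theta _{j}}{\kappa _{j} - \gamma _{1}\bar{\mu }_{1t}}, $$

where the product form follows by iterating the one-step relation down from $\bar{\mu }_{mt}$.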

Proof of Lemma 4.4

The $n$th power of $Z(t_{0},t_{M},k)$ is given by

$$\begin{aligned} Z(t_{0},t_{M},k)^{n} &= \bigg(\psi _{\mathrm{cds}}(t_{0},t_{0},t_{M},k)^{\top } \begin{pmatrix} Y_{t_{0}} \\ X_{t_{0}} \end{pmatrix}\bigg)^{n} \\ &= \psi _{\mathrm{cds}}(t_{0},t_{0},t_{M},k)^{\top } \begin{pmatrix} Y_{t_{0}} \\ X_{t_{0}} \end{pmatrix} \sum _{\alpha ^{\top }\mathbf{1} = n-1} c_{\pi (\alpha )} h_{\pi (\alpha )}(Y_{t_{0}},X_{t_{0}}) \\ &= \sum _{i=1}^{1+m} \sum _{\alpha ^{\top }\mathbf{1} = n-1} c_{\pi (\alpha )}\, \psi _{\mathrm{cds}}(t_{0},t_{0},t_{M},k)_{i}\, h_{\pi (\alpha + e_{i})}(Y_{t_{0}},X_{t_{0}}), \end{aligned}$$

which is a polynomial containing all and only the monomials in $(Y_{t_{0}},X_{t_{0}})$ of degree $n$. The lemma follows by rearranging the terms. □

Proof of Proposition 5.3

The time-$t$ price of the zero-coupon zero-recovery bond is now given by

$$\begin{aligned} B_{Z}(t,t_{M}) &= {\mathbb{E}}\bigg[\frac{D_{t_{M}}}{D_{t}} 1_{\{\tau > t_{M}\}} \,\bigg|\, {\mathcal{G}}_{t}\bigg] \\ &= \frac{1_{\{\tau > t\}}}{D_{t} S_{t}}\, {\mathbb{E}}[D_{t_{M}} S_{t_{M}} \mid {\mathcal{F}}_{t}] \\ &= \frac{1_{\{\tau > t\}}}{(a_{r}^{\top } Y_{t})(a^{\top } Y_{t})}\, {\mathbb{E}}[(a_{r}^{\top } Y_{t_{M}})(a^{\top } Y_{t_{M}}) \mid {\mathcal{F}}_{t}] \\ &= \frac{1_{\{\tau > t\}}}{a_{Z}^{\top } Y_{t}} \begin{pmatrix} a_{Z}^{\top } & 0 \end{pmatrix} \mathrm{e}^{A(t_{M}-t)} \begin{pmatrix} Y_{t} \\ X_{t} \end{pmatrix}, \end{aligned}$$

by applying Lemma A.1. Applying Lemma A.2, we show that the price of a security paying 1 or $\tau $ at the default time $\tau $ if default happens before maturity is given by

$$\begin{aligned} {\mathbb{E}}\bigg[\tau \frac{D_{\tau }}{D_{t}} 1_{\{t \le \tau \le t_{M}\}} \,\bigg|\, {\mathcal{G}}_{t}\bigg] &= \frac{1_{\{\tau > t\}}}{S_{t} D_{t}}\, {\mathbb{E}}\bigg[\int _{t}^{t_{M}} -s D_{s}\, dS_{s} \,\bigg|\, {\mathcal{F}}_{t}\bigg] \\ &= \frac{1_{\{\tau > t\}}}{(a_{r}^{\top } Y_{t})(a^{\top } Y_{t})} \int _{t}^{t_{M}} s\, {\mathbb{E}}[-(a_{r}^{\top } Y_{s})(c Y_{s} + \gamma X_{s}) \mid {\mathcal{F}}_{t}]\, ds \\ &= \frac{1_{\{\tau > t\}}}{a_{Z}^{\top } Y_{t}} \int _{t}^{t_{M}} s\, a_{D}^{\top } \mathrm{e}^{A(s-t)}\, ds \begin{pmatrix} Y_{t} \\ X_{t} \end{pmatrix}, \end{aligned}$$

which completes the proof. □

Proof of Proposition 5.5

The Lévy–Khintchine theorem shows that

$$ \Psi (u)=b^{Z} u + \int _{0}^{\infty }(1- \mathrm{e}^{-u\xi })\nu ^{Z} \,d \xi . $$

(A.8)

We conclude the proof by applying Sylvester’s formula $\mathrm{e}^{UDU ^{-1}}=U\mathrm{e}^{D}U^{-1}$ and by using (A.8) in (5.5) to get

$$\begin{aligned} \bar{A} &= b^{Z} UDU^{-1} + \int _{0}^{\infty }(\mathrm{e}^{UDU^{-1} \xi } - \operatorname{\mathrm{Id}})\nu ^{Z} \,d\xi \\ &= b^{Z} UDU^{-1} + \int _{0}^{\infty }(U\mathrm{e}^{D\xi }U^{-1} - UU ^{-1})\nu ^{Z} \,d\xi \\ &= -U\bigg( b^{Z} (-D) + \int _{0}^{\infty }\big(\operatorname{\mathrm{Id}}- \mathrm{e}^{-(-D) \xi }\big)\nu ^{Z} \,d\xi \bigg)U^{-1} \\ &= -U \Psi (-D) U^{-1}. \end{aligned}$$

□

Proof of (5.6)

The matrix $\bar{A}$ in (5.5) can be rewritten as

$$\begin{aligned} \bar{A} &= \int _{0}^{\infty }(\mathrm{e}^{At}-\operatorname{\mathrm{Id}})\gamma _{Z} t^{-1} {\mathrm{e}}^{-\lambda _{Z} t}\, dt = \gamma _{Z} \sum _{k=1}^{\infty }\frac{A ^{k}}{k!} \int _{0}^{\infty }t^{k-1} {\mathrm{e}}^{-\lambda _{Z} t}\, dt \\ &= \gamma _{Z} \sum _{k=1}^{\infty }\frac{A^{k}}{k!} \frac{\Gamma (k)}{ \lambda _{Z}^{k}} = \gamma _{Z} \sum _{k=1}^{\infty }\frac{(A \lambda _{Z}^{-1})^{k}}{k} = -\gamma _{Z} \log (\operatorname{\mathrm{Id}}- A \lambda _{Z}^{-1} ), \end{aligned}$$

where the second equality follows from the definition of the matrix exponential, the third from the definition of the gamma function and its values at positive integers, $\Gamma (k)=(k-1)!$, and the last one from the power series of the matrix logarithm. □
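The identity can be sanity-checked in the scalar case, which corresponds to one eigenvalue of $A$ when $A$ is diagonalisable. The Python sketch below is illustrative only: the values of `a`, `gamma_z` and `lambda_z` are hypothetical, the infinite integral is truncated at a finite upper limit, and the check requires $|a|<\lambda _{Z}$ so that the series converges. It compares a Simpson approximation of the integral, the partial sum of the series, and the logarithmic closed form.

```python
import math

def gamma_clock_integral(a, gamma_z, lambda_z, upper=40.0, n=40000):
    """Simpson approximation of int_0^upper (e^{at} - 1) gamma t^{-1} e^{-lambda t} dt."""
    def g(t):
        if t == 0.0:
            return gamma_z * a  # limit of (e^{at} - 1) / t as t -> 0
        return gamma_z * math.expm1(a * t) / t * math.exp(-lambda_z * t)
    h = upper / n
    total = g(0.0) + g(upper)
    for k in range(1, n):
        total += (4.0 if k % 2 else 2.0) * g(k * h)
    return total * h / 3.0

def series_form(a, gamma_z, lambda_z, terms=200):
    """Partial sum of gamma * sum_{k>=1} (a / lambda)^k / k."""
    return gamma_z * sum((a / lambda_z) ** k / k for k in range(1, terms + 1))

def closed_form(a, gamma_z, lambda_z):
    """Scalar version of -gamma * log(Id - A / lambda)."""
    return -gamma_z * math.log(1.0 - a / lambda_z)

a, gamma_z, lambda_z = -0.5, 1.5, 2.0  # hypothetical parameter values
integral = gamma_clock_integral(a, gamma_z, lambda_z)
series = series_form(a, gamma_z, lambda_z)
closed = closed_form(a, gamma_z, lambda_z)
print(integral, series, closed)
```

The three numbers should agree up to quadrature and truncation error, which is negligible here because the integrand decays like $\mathrm{e}^{-\lambda _{Z} t}$.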

Appendix B: Market price of risk specifications

We discuss market price of risk (MPR) specifications such that $X$ has a linear drift also under the real-world measure ${\mathbb{P}}\approx {\mathbb{Q}}$. This may further facilitate the empirical estimation of the LHC model.

Let $\Lambda (Y_{t},X_{t})$ denote the time-$t$ MPR such that the drift of $X$ under ${\mathbb{P}}$ becomes

$$ \mu ^{\mathbb{P}}_{t}= b Y_{t} +\beta X_{t} + \Sigma (Y_{t},X_{t}) \Lambda (Y_{t},X_{t}) . $$

This is linear in $(Y_{t},X_{t})$ of the form

$$ \mu ^{\mathbb{P}}_{t}= b^{\mathbb{P}}Y_{t} + \beta ^{\mathbb{P}}X_{t} $$

for some vector $b^{\mathbb{P}}\in {\mathbb{R}}^{m}$ and matrix $\beta ^{\mathbb{P}}\in {\mathbb{R}}^{m\times m}$ if and only if

$$ \Lambda _{i}(y,x) = \frac{ ( (b^{\mathbb{P}}-b) y+ (\beta ^{\mathbb{P}}- \beta ) x )_{i} }{\sigma _{i}\sqrt{x_{i}(y-x_{i})}},\qquad i=1, \dots ,m. $$

(B.1)

In order to have that $\Lambda (Y_{t},X_{t})$ is well defined and induces an equivalent measure change, that is, the candidate Radon–Nikodým density process

$$ \exp \bigg( \int _{0}^{t} \Lambda (Y_{u},X_{u})\,dW_{u} - \frac{1}{2} \int _{0}^{t} \left \| \Lambda (Y_{u},X_{u})\right \| ^{2}\,du\bigg) $$

(B.2)

is a uniformly integrable ${\mathbb{Q}}$-martingale, we need that $(Y,X)$ does not reach all parts of the boundary of $E$. This is clarified by the following theorem, which follows from Cheridito et al. [13].

Theorem B.1

The MPR $\Lambda (Y_{t},X_{t})$ in (B.1) is well defined and induces an equivalent measure ${\mathbb{P}}\approx {\mathbb{Q}}$ with Radon–Nikodým density process (B.2) if for all $i=1,\dots ,m$, we have $X_{i0} \in (0,Y_{0})$ and (3.5), (3.6) hold for the ${\mathbb{Q}}$-drift parameters $\beta ,b$ and for the ${\mathbb{P}}$-drift parameters $\beta ^{\mathbb{P}},b^{\mathbb{P}}$ instead of $\beta ,b$.

If for some $i=1,\dots ,m$, $\beta ^{\mathbb{P}}_{ij}=\beta _{ij}$ for all $j\neq i$ and

(i) $b^{\mathbb{P}}_{i}=b_{i}$ such that

$$ \Lambda _{i}(y,x) = \frac{ (\beta ^{\mathbb{P}}_{ii}-\beta _{ii}) \sqrt{x _{i}} }{\sigma _{i}\sqrt{y-x_{i}}}, $$

then it is enough if $X_{i0}\in [0,Y_{0})$ instead of $X_{i0}\in (0,Y_{0})$ and (3.3) instead of (3.5) holds for $\beta _{ij},b_{i}$, and thus for $\beta ^{\mathbb{P}}_{ij},b^{\mathbb{P}}_{i}$.

(ii) $b^{\mathbb{P}}_{i}-b_{i}=-(\beta ^{\mathbb{P}}_{ii}-\beta _{ii})$ such that

$$ \Lambda _{i}(y,x) = \frac{ (b^{\mathbb{P}}_{i}-b_{i}) \sqrt{y-x _{i}} }{\sigma _{i}\sqrt{x_{i}}}, $$

then it is enough if $X_{i0}\in (0,Y_{0}]$ instead of $X_{i0}\in (0,Y_{0})$ and (3.4) instead of (3.6) holds for $\beta _{ij},b_{i}$, and thus for $\beta ^{\mathbb{P}}_{ij},b^{\mathbb{P}}_{i}$.

The assumption of a linear-drift-preserving change of measure is often made for parsimony and to facilitate the empirical estimation procedure. For example, the specification of MPRs that preserve the affine nature of risk factors has been theoretically and empirically investigated in Duffee [18], Duarte [17] and Cheridito et al. [12], among others.

Appendix C: Chebyshev interpolation

This appendix describes how to perform a Chebyshev interpolation of an arbitrary function on a rectangle $[a,b]\times [c,d]\subseteq {\mathbb{R}} ^{2}$. The Chebyshev polynomials of the first kind are naturally defined on $[-1,1]$, where they take values in $[-1,1]$, but can be shifted and scaled so as to form a basis on $[a,b]$. In this case, they are given by the recursion formula

$$\begin{aligned} T_{0}^{a,b}(x) & = 1, \\ T_{1}^{a,b}(x) & = \frac{x-\mu }{\sigma }, \\ T_{n+1}^{a,b}(x) & = \frac{2(x-\mu )}{\sigma }T_{n}^{a,b}(x) - T_{n-1} ^{a,b}(x), \end{aligned}$$

with $\mu =(a+b)/2$ and $\sigma =(b-a)/2$. The Chebyshev nodes for the interval $[a,b]$ are then given by

$$ {x}^{a,b}_{j} = \mu + \sigma \cos (z_{j}), \quad z_{j}=\frac{(1/2+j) \pi }{N+1}, \qquad \text{for $j=0, \ldots , N$.} $$

The polynomial interpolation of order $N$ is

$$ p_{N}(s,x) = \sum _{n=0}^{N} \sum _{m=0}^{N} c_{n,m} T^{a,b}_{n}(s) T ^{c,d}_{m}(x), $$

where the coefficients are given by

$$ c_{n,m} = \frac{2^{1_{\{n\neq 0\}} + 1_{\{m\neq 0\}}}}{(N+1)^{2}} \sum _{i=0}^{N} \sum _{j=0}^{N} f\big(x^{a,b}_{i}, x^{c,d}_{j}\big) \cos (n z_{i}) \cos (m z_{j}). $$

The coefficients can be computed efficiently by applying Clenshaw’s method or the discrete cosine transform. This straightforward interpolation has the advantage of preventing Runge’s phenomenon. We refer to Gaß et al. [28] for more details on the multidimensional Chebyshev interpolation and for an interesting financial application of multivariate function interpolation in the context of fast model estimation or calibration.
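The interpolation formulas of this appendix can be sketched in a few lines of Python. This is a minimal illustration under assumptions: the test function `f`, the rectangle, the order `N` and all function names are hypothetical, and the coefficients are computed by the plain double sum rather than by a discrete cosine transform.

```python
import math

def cheb_nodes(a, b, N):
    """Chebyshev nodes x_j on [a, b] together with the angles z_j."""
    mu, sigma = (a + b) / 2.0, (b - a) / 2.0
    z = [(0.5 + j) * math.pi / (N + 1) for j in range(N + 1)]
    return [mu + sigma * math.cos(zj) for zj in z], z

def cheb_basis(x, a, b, N):
    """Values T_0(x), ..., T_N(x) of the shifted polynomials via the recursion."""
    t = (x - (a + b) / 2.0) / ((b - a) / 2.0)
    T = [1.0, t]
    for n in range(1, N):
        T.append(2.0 * t * T[n] - T[n - 1])
    return T[:N + 1]

def cheb_coeffs(f, a, b, c, d, N):
    """Coefficients c_{n,m} from function values on the tensor grid of nodes."""
    xs, zs = cheb_nodes(a, b, N)
    ys, _ = cheb_nodes(c, d, N)  # the angles z_j are the same on both axes
    vals = [[f(x, y) for y in ys] for x in xs]
    coeffs = []
    for n in range(N + 1):
        row = []
        for m in range(N + 1):
            s = sum(vals[i][j] * math.cos(n * zs[i]) * math.cos(m * zs[j])
                    for i in range(N + 1) for j in range(N + 1))
            row.append(2 ** ((n != 0) + (m != 0)) * s / (N + 1) ** 2)
        coeffs.append(row)
    return coeffs

def cheb_eval(coeffs, s, x, a, b, c, d):
    """Evaluate the interpolating polynomial p_N(s, x)."""
    N = len(coeffs) - 1
    Ts, Tx = cheb_basis(s, a, b, N), cheb_basis(x, c, d, N)
    return sum(coeffs[n][m] * Ts[n] * Tx[m]
               for n in range(N + 1) for m in range(N + 1))

def f(s, x):
    """Hypothetical smooth test function."""
    return math.exp(-s) * math.sin(x)

a, b, c, d, N = 0.0, 1.0, 0.0, 2.0, 10
coeffs = cheb_coeffs(f, a, b, c, d, N)
grid = [(a + (b - a) * i / 20, c + (d - c) * j / 20)
        for i in range(21) for j in range(21)]
max_err = max(abs(cheb_eval(coeffs, s, x, a, b, c, d) - f(s, x))
              for s, x in grid)
print(max_err)
```

The double sum costs $O(N^{4})$ operations in total, which is acceptable for small $N$; for larger orders the discrete cosine transform mentioned above would be the natural replacement.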

About this article

Cite this article

Ackerer, D., Filipović, D. Linear credit risk models. Finance Stoch 24, 169–214 (2020). https://doi.org/10.1007/s00780-019-00409-z

Received: 18 January 2019
Accepted: 09 July 2019
Published: 04 October 2019
Issue Date: January 2020
DOI: https://doi.org/10.1007/s00780-019-00409-z
