1 Introduction

The credit crunch crisis which started in August 2007 forced market practitioners and academics to review the methodologies used to price over-the-counter (OTC) derivatives consistently with the available market quotations. In particular, basis spreads between interest rate instruments characterised by different underlying rate tenors (e.g. IBOR 3 M, IBOR 6 M, overnight,Footnote 1 etc.) exploded from a few to hundreds of basis points. This change of regime led to the adoption of a “multi-curve” valuation framework, based on distinct yield curves to compute forward rates with different tenors,Footnote 2 and discounting curves consistent with the collateral remuneration rate.Footnote 3 The multi-curve framework is extensively discussed in the literature; a non-exhaustive list of references includes (Ametrano & Bianchetti, 2009; Bianchetti, 2010; Henrard, 2007, 2009; Kenyon, 2010; Mercurio, 2009; Piterbarg, 2012). This framework was recently simplified by the interest rate benchmark reform and the progressive replacement of many IBOR rates with the corresponding overnight rates. We refer to e.g. Scaringi and Bianchetti (2020) and references therein for a discussion.

The credit crunch crisis also forced market participants to extend the multi-curve valuation framework to include additional risk factors, such as counterparty and funding risk, leading to a set of valuation adjustments collectively named XVA, for which we refer to the extensive existing literature (see e.g. Brigo et al., 2013; Kjaer, 2018; Gregory, 2020). In particular, Credit and Debt Valuation Adjustments (CVA and DVA, respectively) take into account the bilateral counterparty default risk premium affecting derivative transactions, and are also required by international accounting standards (in the EU since the introduction of IFRS 13 in 2013; see IASB, 2011).

After the crisis, regulators pushed to mitigate counterparty default risk for OTC derivatives. With regard to non-cleared OTC derivatives, in 2015 the Basel Committee on Banking Supervision (BCBS) and the International Organization of Securities Commissions (IOSCO) finalized a framework (BCBS-IOSCO, 2015), introduced progressively from 2016, which requires derivatives counterparties to bilaterally post Variation Margin (VM) and Initial Margin (IM) on a daily basis at netting set level. VM aims at covering the current exposure stemming from changes in the value of the portfolio by reflecting its current size, while IM aims at covering the potential future exposure that could arise, in the event of default of the counterparty, from changes in the value of the portfolio in the period between the last VM exchange and the close-out of the position. In particular, in 2016 ISDA published the Standard Initial Margin Model (SIMM) (see ISDA, 2013, 2018), with the aim of providing market participants with a uniform risk-sensitive IM model, and of preventing both potential disputes and the overestimation of IM requirements due to the use of the BCBS-IOSCO non-risk-sensitive standardized model. The ISDA-SIMM is a parametric VaR model based on Delta, Vega and Curvature (i.e. “pseudo” Gamma) sensitivities, defined across risk factors by asset class, tenor and expiry, and computed according to specific definitions.
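To fix ideas, the following minimal sketch shows the kind of within-bucket Delta aggregation performed by a risk-sensitive parametric IM model of this type: weighted sensitivities are combined through a correlation matrix into a margin figure. The risk weights and correlations below are placeholders, not the official ISDA-SIMM calibration, and the full SIMM specification adds concentration factors and further aggregation layers across buckets and risk classes.

```python
import numpy as np

def bucket_margin(sensitivities, risk_weights, corr):
    """Within-bucket margin K = sqrt(sum_k WS_k^2 + sum_{k!=l} rho_kl WS_k WS_l),
    with weighted sensitivities WS_k = RW_k * s_k (concentration factors omitted)."""
    ws = risk_weights * sensitivities       # weighted sensitivities WS_k
    return float(np.sqrt(ws @ corr @ ws))   # quadratic form includes the k = l terms

# Toy example: three tenor sensitivities (per bp) with placeholder weights/correlations.
s = np.array([1200.0, -800.0, 300.0])
rw = np.array([50.0, 60.0, 55.0])
rho = np.array([[1.0, 0.7, 0.5],
                [0.7, 1.0, 0.8],
                [0.5, 0.8, 1.0]])
print(bucket_margin(s, rw, rho))
```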

In general, XVA pricing is subject to a significant model risk, since it depends on the many assumptions made for modelling and calculating the relevant quantities. Since computational constraints require limiting the number of floating-point calculations, model risk arises principally from the need to find an acceptable compromise between the accuracy of the XVA figures and the computational performance. In the EU, pricing model risk is envisaged in EU (2013, art. 105.10) and EC (2016, art. 11), and refers precisely to the valuation uncertainty of fair-valued positions linked to the “potential existence of a range of different models or model calibrations used by market participants”. This may occur when a unique model recognized as a clear market standard for computing the price of a certain financial instrument does not exist, or when a model allows for different parameterizations or numerical solution algorithms leading to different model prices. We stress that this measure of model risk does not refer to the universe of possible pricing models and model parameterizations, which is virtually unlimited; on the contrary, it intentionally restricts the range to those alternatives effectively used by market participants. While the pricing models and their parameterizations used by market participants are not, in general, easily observable, the situation for XVA pricing is slightly different, since market participants may occasionally infer some information from the XVA prices observed in the case of competitive corporate auctions, novations,Footnote 4 negotiations of collateral agreements, and also systematically from consensus pricing services (e.g. Totem).

In light of the considerations above, our paper is intended to answer the following three interconnected Research Questions:

  Q1. Which are the most critical model risk factors, to which exposure modelling and thus XVA are most sensitive?

  Q2. How to set the XVA calculation parameters in order to achieve an acceptable compromise between accuracy and performance?

  Q3. How to quantify the model risk affecting XVA figures?

We address these questions as follows. We identify all the relevant calculation parameters involved in the Monte Carlo simulation used to compute the exposure and the XVA figures under different collateralization schemes. For each parameter we quantify its relevance in terms of impacts on XVA figures and computational effort required. Putting all these results together, we may identify the parametrization which allows a compromise between accuracy and performance, i.e. leading to sufficiently robust XVA figures in a reasonable time, a very important feature for practical applications. As a consequence, we are also able to provide a quantification of the XVA model risk stemming from the existence of a range of different pricing model calibrations, numerical methods and their related parameterizations according to the EU provisions.

To these purposes we adopt an industry-standard, realistic and complete modelling framework, typically adopted by XVA trading desks, including both VM and ISDA-SIMM dynamic IM, based on real market data, i.e. distinct discounting and forwarding yield curves, CDS spread curves, and swaption volatility cube. We apply this framework to the most widely traded derivative financial instruments, i.e. interest rate Swaps and physically settled European Swaptions, both with different maturities and moneyness. The stochastic dynamics of the underlying risk factors is modelled with a multi-curve two-factor G2++ short rate model with time-dependent volatility, which, including 19 parameters (see Table 15), allows a richer yield curve dynamics and a better calibration of the market swaption cube. Our XVA numerical implementation is based on a multi-step Monte Carlo simulation with nested exposures calculated by means of analytical formulas for Swaps and semi-analytical formulas for Swaptions. Since the G2++ model allows for an analytical expression for the transition probability under the T-forward measure, we may use a parsimonious time simulation gridFootnote 5 able to capture the spikes arising in the collateralized exposure during the margin period of risk. The Monte Carlo XVA figures are also compared against the results obtained through the analytical XVA formulas available for Swaps. The latter require the valuation of a strip of co-terminal European Swaptions, for which we used both the G2++ and the SABR models, calibrated to the same market swaption cube.
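For the reader's convenience, we recall the standard G2++ short-rate dynamics under the risk-neutral measure, here with the time-dependent volatilities used in this work (the precise multi-curve formulation and all related formulas are reported in “Appendix A.3”):

$$\begin{aligned} r(t)&= x(t) + y(t) + \varphi (t), \\ dx(t)&= -a\, x(t)\, dt + \sigma (t)\, dW_1(t), \qquad x(0) = 0, \\ dy(t)&= -b\, y(t)\, dt + \eta (t)\, dW_2(t), \qquad y(0) = 0, \end{aligned}$$

with \(dW_1(t)\, dW_2(t) = \rho \, dt\), and where the deterministic shift \(\varphi (t)\) is chosen to exactly fit the initial term structure.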

Regarding collateral modelling, since VM and IM determine important mitigations of the XVA figures, it is crucial to correctly model their dynamics taking into account the most important collateral parameters, i.e. the margin threshold, the minimum transfer amount, and the margin period of risk. On the one hand, extensive literature exists regarding dynamic VM modelling (see e.g. Brigo et al., 2013, 2018, 2019). On the other hand, dynamic IM modelling of the ISDA-SIMM involves the simulation of several forward sensitivities and their aggregation according to a set of predefined rules, posing difficult implementation and computational challenges. Different methods have been proposed to overcome such challenges: approximations based on normal distribution assumptions (see Gregory, 2016; Andersen et al., 2017), approximated pricing formulas to speed up the calculation (see e.g. Zeron & Ruiz, 2018; Maran et al., 2021), adjoint algorithmic differentiation (AAD) for fast sensitivities calculation (see e.g. Capriotti & Giles, 2012; Huge & Savine, 2020) and regression techniques (see e.g. Anfuso et al., 2017; Caspers et al., 2017; Crépey & Dixon, 2020). We focus on an implementation of the ISDA-SIMM avoiding as much as possible any approximation, computing forward sensitivities by a classic finite-difference (“bump-and-run”) approach which, although computationally intensive, may be used when performance is not critical.
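As an illustration of the bump-and-run approach, the sketch below computes a single Delta by one-sided finite differences. Here price_fn is a hypothetical pricer mapping a vector of zero rates to a mark-to-market value; in the actual framework this revaluation is repeated per path, time step and risk factor, which explains the computational cost.

```python
import numpy as np

def bump_and_revalue_delta(price_fn, zero_rates, node, bump=1e-4):
    """One-sided finite-difference sensitivity: bump a single zero-rate node
    by 1bp (1e-4), reprice, and take the PV difference."""
    base = np.asarray(zero_rates, dtype=float)
    bumped = base.copy()
    bumped[node] += bump
    return price_fn(bumped) - price_fn(base)  # PV change per 1bp bump

# Toy usage with a hypothetical linear pricer (PV = cash-flow-weighted rates):
price_fn = lambda r: float(np.dot([1e5, 2e5, 3e5], r))
delta_mid = bump_and_revalue_delta(price_fn, [0.010, 0.012, 0.015], node=1)
```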

The choice of the risk factors dynamics is a crucial aspect and should be based on a careful balance between the model sophistication and the corresponding unavoidable calibration and computational constraints. Even though our G2++ model does not embed advanced features like stochastic volatility (see Bormetti et al., 2018), stochastic basis (see Konikov & McClelland, 2019) or a stochastic credit process (see Glasserman & Xu, 2014), it is commonly preferred by financial institutions for several reasons, as extensively argued in Green (2015) (Sects. 16.1.3, 16.3, 19.1.2) and Gregory (2020) (Sect. 15.4.2), which we summarize here: (i) more sophisticated models require the calibration of additional model parameters which are difficult to manage, particularly if one takes into account the complex covariance structure associated with multiple stochastic risk factors; (ii) more sophisticated models are typically much more computationally demanding and easily become unsustainable, e.g. because they do not allow for analytical formulas for transition probabilities and/or for the price of plain vanilla instruments; (iii) simpler models, like the G2++ adopted in this work, allow (semi-)analytical pricing formulas for the most widely traded plain vanilla instruments, like Swaps and Swaptions; (iv) since we deal with trades under collateral, which reduces and possibly neutralizes the corresponding exposures, the sophistication of the stochastic dynamics chosen for risk factors simulation is dominated in importance by the modelling choices adopted for the collateral dynamics, which we extensively discuss in this work; (v) in particular, for interest rate derivatives, the adoption of a stochastic basis between discounting and forward rates is not crucial, since the sensitivity w.r.t. discounting rates is much smaller and, historically, the volatility of the basis is typically much smaller than the volatility of the corresponding rates. A stochastic basis would play a role only in the case of basis swaps, as discussed in Konikov and McClelland (2019), which are typically traded on the OTC interbank market for hedging purposes. Moreover, the ongoing financial benchmark reform is gradually reducing the importance of such instruments due to the IBOR rates cessation.

Finally, it is worth noticing that, while the multi-curve single-factor G1++ model is commonly used, and may also be found in commercial software packages, the multi-curve two-factor G2++ model with time-dependent volatility parameters is less straightforward and, to the best of our knowledge, less widespread. For this reason, we report all the relevant G2++ equations in “Appendix A.3”. As a consequence, this work also serves as a handbook containing step-by-step instructions for the implementation of a complete, realistic and robust modelling framework for collateralized exposure and XVA.

The aforementioned considerations led us to the choice of our G2++ framework, in order to provide an XVA model risk investigation based on a realistic XVA pricing architecture, typically adopted by XVA trading desks, and consistent with the prescriptions of the EU prudent valuation framework.

The paper is organized as follows. In Sect. 2 we briefly review the XVA framework and the numerical steps involved in the calculation. In Sect. 3 we show the results both in terms of counterparty exposure and XVA figures for the selected financial instruments and collateralization schemes using the target parameterization of the framework, allowing an acceptable compromise between accuracy and performance. In Sect. 4 we report the analyses conducted on model parameters in order to answer research questions Q1 and Q2 above. In Sect. 5 we describe the calculation of the AVA MoRi, answering research question Q3 above. In Sect. 6 we draw the conclusions. Finally, Appendices A to D report many details related to the corresponding main sections.

2 XVA pricing framework

The framework required for XVA calculation including Variation and Initial Margins is a complex combination of many theoretical and numerical approaches that we summarize in the following list.

  1. The general no-arbitrage pricing formulas for financial instruments subject to XVA, discussed in “Appendix A.1”.

  2. The description of the financial instruments that we wish to test in our XVA calculations, i.e. interest rate Swaps and European Swaptions, discussed in “Appendix A.2”.

  3. The G2++ model adopted to describe the evolution of forward and discount curves, discussed in “Appendix A.3”, including: (i) the multi-curve, time-dependent volatility G2++ stochastic dynamics (“Appendix A.3.1”), (ii) the corresponding G2++ pricing formulas for Swaps and European Swaptions (“Appendix A.3.2”), (iii) the calibration procedure of G2++ model parameters to the available market data (“Appendix A.3.3”), and (iv) the G2++ dynamics under the forward measure, suitable for efficient Monte Carlo simulation (“Appendix A.3.4”).

  4. The model adopted to describe the XVA, discussed in “Appendix A.4”, including: (i) the XVA definition and pricing formulas (“Appendix A.4.1”), (ii) the discretized XVA formulas suitable for Monte Carlo simulation (“Appendix A.4.2”), and (iii) the analytical XVA formulas applicable to single, uncollateralized linear derivatives (“Appendix A.4.3”).

  5. The model adopted to describe the collateral dynamical evolution, discussed in “Appendix B”, including: (i) the formulas for the collateralised exposure with both VM and IM (“Appendix B.1”), (ii) the formulas to dynamically compute VM (“Appendix B.2”), and (iii) the formulas to dynamically compute IM (“Appendix B.3”) according to the ISDA Standard Initial Margin Model formulas (“Appendix B.3.2”).

  6. The market data set used to calibrate the G2++ model parameters and to compute the XVA, discussed in “Appendix D”.

The framework described above requires a precise sequence of calculation steps to compute XVA for the selected instruments, that can be summarized as follows.

  1. Calibration of the G2++ model parameters to market data (see Sect. A.3.3).

  2. Construction of the parsimonious time grid \(\left\{ t_i\right\} _{i=0}^N\) for MC simulation, which includes both the primary time grid \(\left\{ {\bar{t}}_i\right\} \) and the collateral time grid \(\left\{ {\hat{t}}_i\right\} \) (see Sect. 4.2.2).

  3. Simulation of the processes \([x_m(t_i), y_m(t_i)]\) for each time step \(t_i, i=1,\ldots ,N\) and Monte Carlo path \(m = 1,\dots ,N_{MC}\) according to the G2++ dynamics under the forward measure (see “Appendix A.3.4”).

  4. Calculation of the collateralized exposure \(H_m(t_i)\) for each time step \(t_i\) and simulated path m (see Eq. B1, and the stylized sketch after this list). This requires to:

    a. compute the instrument’s future mark-to-market values \(V_{0,m}({\bar{t}}_i)\) at time \({\bar{t}}_i\) on the primary time grid using the G2++ pricing formulas (see “Appendix A.3.2”);

    b. compute the Variation Margin \(\text {VM}_m({\bar{t}}_i)\) available at time \({\bar{t}}_i\) on the primary time grid, which is a function of \(V_{0,m}({\hat{t}}_i)\) at the previous time \({\hat{t}}_i = {\bar{t}}_i-l\) on the collateral time grid (see Eq. B2);

    c. compute the ISDA-SIMM dynamic Initial Margin \(\text {IM}_m({\bar{t}}_i)\) available at time \({\bar{t}}_i\) on the primary time grid, which is a function of the instrument’s Delta \(\Delta _m^c({\hat{t}}_i)\) and Vega \(\nu _m({\hat{t}}_i)\) sensitivities at the previous time \({\hat{t}}_i = {\bar{t}}_i-l\) on the collateral time grid (see Eq. B8).

  5. Calculation of EPE \({\mathcal {H}}^{+} (t;t_i)\) and ENE \({\mathcal {H}}^{-} (t;t_i)\) for each time step \(t_i\) (see Eqs. A60 and A61).

  6. Calculation of survival probabilities for each time step \(t_i\) from default curves built from market CDS quotes.

  7. Calculation of CVA and DVA (see Eqs. A58 and A59).
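As referenced in step 4, the following stylized sketch illustrates the lagged collateral mechanism per path and time step. It is a simplified, cash-only version of the mechanics in Eqs. B1, B2 and B8 (which remain the authoritative definitions): the VM depends on the mark-to-market observed one MPoR earlier, and the IM enters as a pre-computed amount.

```python
import numpy as np

def variation_margin(V_lag, K=0.0, MTA=0.0):
    """Stylized VM available at t_i, driven by the mark-to-market V(t_i - l)
    observed one MPoR earlier: collateralize the value exceeding the threshold
    K, suppressing transfers below the minimum transfer amount MTA."""
    vm = np.sign(V_lag) * np.maximum(np.abs(V_lag) - K, 0.0)
    return np.where(np.abs(vm) >= MTA, vm, 0.0)

def collateralized_exposure(V_now, V_lag, IM=0.0, K=0.0, MTA=0.0):
    """Residual exposure H = V - VM, further mitigated by the (lagged) IM:
    the positive part nets the IM received, the negative part the IM posted."""
    H = V_now - variation_margin(V_lag, K, MTA)
    return np.maximum(H - IM, 0.0), np.minimum(H + IM, 0.0)  # EPE/ENE contributions
```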

As discussed in the introduction, our multi-curve, time-dependent volatility G2++ model has a number of important characteristics for XVA calculation: (i) it allows perfect calibration of the market term structures of both discount and forward curves, (ii) it allows a better calibration of the market term structure of volatility, (iii) it allows a volatility skew that can be fitted, at least partially, to the market volatility skews, (iv) it allows, under the forward probability measure, an efficient Monte Carlo simulation, (v) it allows analytical pricing formulas for European Options (Caps/Floors/Swaptions).

3 XVA numerical calculations

In this section we report the results for exposure profiles and XVA figures for the financial instruments and the collateralization schemes considered in this work. In order to do so, we use the XVA pricing framework discussed in the previous Sect. 2, and the set of parameters meeting our acceptable compromise between accuracy and performance, discussed in the next Sect. 4 and summarized in Sect. 4.7.

We consider the most widely traded derivative instrument, i.e. interest rate Swaps, which are typically traded in at least two very common situations: (i) between banks and their corporate clients, frequently without collateralization, and (ii) between banks for hedging purposes, collateralized with variation margin and (frequently) with initial margin. We include both spot and forward starting Swaps with different maturities and moneyness. Besides linear derivatives, we also consider another common interest rate option, i.e. the physically settled European Swaption, with different moneyness. All the instruments considered are listed in “Appendix A.2.1” (Table 10). Accordingly, we consider three different collateralization schemes: without collateral, with Variation Margin (VM) only, and with both VM and Initial Margin (IM). The case with IM only is not analyzed since IM is typically associated with VM.

Overall, we consider 36 different cases, as shown in the following section reporting the XVA figures (Table 1).

Table 1 XVA figures for instruments in Table 10 for the three collateralization schemes considered

3.1 Exposure results

We show in Fig. 1 the most general and complex case of exposures with VM and IM, and we report in “Appendix C.1” the complete set of results for the instruments listed in Table 10 and the three collateralization schemes mentioned above, along with the corresponding detailed comments.

Fig. 1

EPE and ENE profiles (blue and red solid lines, respectively) for the 12 instruments in Table 10, full collateralization scheme with both VM and IM. To enhance plot readability, we excluded the initial time step \(t_0\), which shows a very high exposure not yet mitigated by the collateral exchanged MPoR days later. All quantities are expressed as a percentage of the nominal amount. Letters P and R distinguish between payer and receiver instruments, respectively. Model setup as in Sect. 4.7

Overall, we observe that the expected exposure profiles are broadly consistent with those found in the existing literature (see e.g. Brigo et al., 2013; Gregory, 2020). A closer inspection reveals that our approach is able to capture the detailed and complex shape of the exposure mitigated by VM and IM, an important feature for XVA calculation. In particular we notice in Fig. 1, for all the instruments and moneynesses considered, that EPE/ENE profiles display short and medium term flat shapes with spikes appearing with increasing magnitude close to maturity, suggesting that IM turns out to be inadequate to fully suppress the exposure because of its decreasing profile. These spikes originate from the semi-annual jumps in Swaps’ future simulated mark-to-market values at cash flow dates, due to the different frequency of the two legs, captured by VM with a delay equal to the length of the margin period of risk (MPoR, see Sect. 4.2.1 for details).

3.2 XVA results

We report in Table 1 the XVA figures for the full set of 36 cases considered in this work. Notice that we compute XVA from the point of view of the instruments’ holder (i.e. with positive nominal amount N). Accordingly, uncollateralized physically settled Swaptions show non-zero DVA figures.Footnote 6

Regarding uncollateralized XVA, we observe that Swaps display larger CVA (DVA) figures for ITM (OTM) instruments due to the greater probability of observing positive (negative) future simulated mark-to-market values, also reflected in lower Monte Carlo errors. Moreover, CVA is larger than DVA except for the OTM 15Y Swap, due to the simulated forward rates structure which causes expected floating leg values greater than those of the fixed leg. Finally, analysing the results for different maturities, the higher risk of 30-year Swaps leads to larger adjustments compared to those maturing in 15 years. Analogous results are obtained for forward Swaps. In this case, the asymmetric effect of simulated forward rates on opposite transactions causes larger CVA and smaller DVA for the OTM payer forward Swap with respect to the OTM receiver one. Slightly lower (absolute) CVA values are observed for the corresponding physically settled European Swaptions, since OTM paths are excluded after the exercise, while DVA values are considerably lower as negative exposure exists only after the expiry.

Regarding XVA with VM only, we observe that the adjustments are reduced on average by approx. two orders of magnitude with respect to the uncollateralized case, and are largely driven by spikes in exposure profiles. In general, DVA figures are greater compared to CVA ones, since for payer instruments the magnitude of the spikes in ENE becomes larger when approaching the maturity, where the default probability increases.

Finally, regarding XVA with VM and IM, we observe that the adjustments are reduced on average by approx. four orders of magnitude with respect to the uncollateralized case. Here, XVA figures are entirely driven by the spikes closest to maturity, which are not fully suppressed by IM, thus confirming the importance of a detailed simulation of the collateralized exposure.

4 XVA model validation

The previous Sects. 2 and 3 suggest that XVA calculation depends on a number of assumptions which affect the results in different ways. We can look at these assumptions as sources of model risk, which need to be properly addressed.

The purpose of this section is threefold: (i) we want to identify and analyze in detail the most significant sources of model risk; (ii) we want to validate our XVA framework by assessing its robustness and tuning the corresponding calculation parameters; (iii) we look for a strategy to set the acceptable compromise between accuracy and performance, a very important feature for practical applications. These analyses will also lead to a distribution of XVA values which will be the basis to compute a model risk measure in the following Sect. 5.

In order to ease the presentation we report the results only for a subset of the instruments listed in Table 10, namely the 15Y ATM payer Swap and the 5x10Y ATM physically settled European payer Swaption. Analogous results are obtained for the other instruments and are reported in the corresponding sections of “Appendix C”.

4.1 G2++ model calibration

Our XVA framework is based on the G2++ model, whose parameters p are calibrated on market ATM Swaption prices using the procedure described in “Appendix A.3.3”. We will refer to this calibration as the baseline calibration.

Calibrating the model to ATM swaption quotes is a standard approach for at least two reasons: (i) ATM quotes are the most liquid, in particular for the most frequently traded expiries/tenors, and (ii) there is less interest in the smile risk when one has to deal with many counterparties and large netting sets dominated by linear interest rate derivatives, a typical situation for XVA trading desks. Nevertheless, this choice is not unique, and different approaches can be adopted according to market conditions, specific trades and the relevance of smile risk. For example, a specific calibration approach could be adopted when structuring a new trade for a client, especially in the case of competitive auctions.

Since XVA figures depend on the G2++ model parameters, the G2++ model calibration is a source of model risk, and we address it by considering different alternative calibrations, also including the Swaption smile risk. In particular, we compare the baseline calibration (parameters denoted with p) with six alternative calibrations (parameters denoted with \(p_i\), with \(i=1,\dots ,6\)), which we define by tuning the following features: (i) the maximum expiry of market points used for calibration, (ii) the maximum/minimum strikes of market points used for calibration, and (iii) the imposition of flat G2++ volatility parameters.

We report in “Appendix C.2” all the details about the 7 different calibrations. We show here in Table 2 the XVA figures obtained with the seven different calibrations for the 15Y ATM payer Swap and the 5x10Y ATM physically settled European payer Swaption, both without collateral. Similar results are obtained for the other collateralization schemes. We observe that the range of values obtained using different model calibrations is always lower than the \(3\sigma \) statistical uncertainty due to MC simulation. Therefore we can conclude that the XVA model risk stemming from different G2++ model calibrations is limited, and that our choice of the baseline calibration on market ATM Swaptions is sufficiently robust for our purposes.

Table 2 XVA figures and 3\(\sigma \) confidence intervals (in parentheses) obtained with 7 different G2++ model calibrations for 15Y ATM payer Swap and 5x10Y ATM physically settled European payer Swaption, EUR 100 Mio nominal amount, no collateral

4.2 Time simulation grid

The numerical calculation of XVA in Eqs. A55 and A56 requires the discretization of the integral on a time grid in correspondence of which the exposure is computed using the Monte Carlo simulation, as shown in “Appendix A.4.2”.

The construction of this MC time simulation grid is a crucial step to find an acceptable compromise between accuracy and performance for at least two reasons: (i) a high granularity reduces the discretization error but increases the computational time, leading to poor performance, and (ii) the presence of the margin period of risk requires a careful distribution of the time simulation points in order to capture the spikes in collateralized exposures, which have a material impact on XVA. As a consequence, both the granularity and the distribution of the time simulation grid are an important model risk factor in XVA calculation.

In order to identify the optimal construction of the time simulation grid we proceed as follows: (i) we analyse the spikes in collateralized exposure using the most accurate choice, i.e. a daily grid; (ii) we propose a workaround which allows us to capture all the spikes using lower granularities; and (iii) we perform a convergence analysis looking for the granularity which ensures an acceptable compromise between accuracy and performance.

4.2.1 Spikes analysis

Spikes arising in collateralized exposure are due to the MPoR, since it implies that the collateral available at time step \(t_i\) depends on the instrument’s simulated mark-to-market values at time step \({\hat{t}}_i = t_i - l\), assumed to be the last date at which VM and IM are fully exchanged (see “Appendix B.1”). Clearly, the best possible choice in terms of accuracy is a daily time simulation grid, which both reduces the discretization error in the integrals in Eqs. A55 and A56, and automatically captures all the details of the exposure, including the spikes.

We show in Fig. 2 the EPE/ENE profiles obtained with a daily grid for the 15Y Swap (left-hand side panel) and the 5x10Y Swaption (right-hand side panel) for the three collateralization schemes considered.

Fig. 2

EPE and ENE profiles (blue solid lines) for 15Y ATM payer Swap (left-hand side) and 5x10Y ATM physically settled European payer Swaption (right-hand side), EUR 100 Mio nominal amount, on a daily grid for the three collateralization schemes considered (top: no collateral, mid: VM, bottom: VM and IM). Black crosses: floating leg cash flow dates (semi-annual frequency); red circles: fixed cash flow dates (annual frequency); green triangles: Swaption’s expiry. To enhance plot readability, we omit the collateralized EPE at time steps \(t_0\) and \(t_1\), where the exposure spikes since collateral is exchanged MPoR days later (mid and bottom right-hand panels). In particular, EPE(\(t_0\)) is equal to the present Swaption’s price (5,030,423 EUR, approx. \(5\%\) of the nominal amount). Other model parameters as in Table 7. Quantities expressed as a percentage of the nominal amount

As can be seen, when only VM is considered, spikes emerge at inception as no collateral is posted, and at cash flow dates as sudden changes in future simulated mark-to-market values are captured by VM with a delay due to MPoR. When also IM is considered, spikes closest to maturity persist due to the downward profile of IM. Further investigations on the nature of the exposure’s spikes are reported in “Appendix C.3”.

The impact of these spikes on XVA figures is significant: with VM only, the contributions to CVA and DVA are \(+7\%\) and \(+6\%\), respectively, for the 15Y Swap, and \(+3\%\) and \(+6\%\) for the 5x10Y Swaption. When IM is also present, the exposure between spikes is suppressed, therefore CVA and DVA are completely attributable to spikes. In other words, neglecting the spikes would significantly underestimate the (absolute) XVA figures. On the other hand, since using a daily grid is unfeasible in practice, in the next section we look for a possible solution.

4.2.2 Parsimonious time grid

Although a daily grid, as discussed in the previous Sect. 4.2.1, clearly represents the best discrete approximation to compute the XVA integrals in Eqs. A55 and A56, this choice is often unfeasible in practice because of the poor computational performance, as can be observed in the last column of Table 3. Notice that the most time-consuming component is the IM, which involves the calculation of several forward sensitivities for each path (see “Appendix B.3.3”). Another bottleneck is the numerical integration of the semi-analytical G2++ pricing formula for Swaptions (see Eq. A29). On the other hand, the adoption of simple, less granular, evenly spaced time grids would be inadequate to capture the spikes in collateralised exposure and could produce biased XVA figures.

Table 3 XVA convergence analysis for 15Y ATM payer Swap and 5x10Y ATM physically settled European payer Swaption, EUR 100 Mio nominal amount

In order to overcome these issues we build a parsimonious time simulation grid \(\left\{ t_i\right\} _{i=0}^N\), which is obtained by joining an initial time grid \(\left\{ {\bar{t}}_i\right\} \), evenly-spaced with time step \(\Delta t\), a cash flow grid, including the trade (or portfolio) cash flow dates, and a collateral time grid, where each previous date is shifted by the MPoR. The resulting final joint time grid depends on the initial time step \(\Delta t\) but is no longer evenly spaced. See “Appendix C.3” for more details.
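A minimal sketch of this construction, with times expressed as year fractions (the function and variable names are ours, for illustration only):

```python
import numpy as np

def joint_time_grid(maturity, dt, cash_flow_dates, mpor):
    """Parsimonious joint grid: evenly spaced primary grid of step dt, plus the
    trade (or portfolio) cash flow dates, plus each of the previous dates
    shifted by the MPoR (backwards, matching t_hat = t_bar - l in the text)."""
    primary = np.arange(0.0, maturity + 1e-12, dt)
    base = np.union1d(primary, np.asarray(cash_flow_dates, dtype=float))
    grid = np.union1d(base, base - mpor)  # add MPoR-shifted collateral dates
    return grid[grid >= 0.0]              # union1d sorts and removes duplicates

# Example: 15Y trade, monthly primary grid, semi-annual cash flows, 2-day MPoR.
grid = joint_time_grid(15.0, 1.0 / 12.0, np.arange(0.5, 15.0001, 0.5), 2.0 / 365.0)
```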

We show in Fig. 3 the exposure profiles for the 15Y Swap obtained with different grids for the three collateralization schemes considered. For testing purposes we compare the joint time grid with a standard time grid, obtained by joining the primary time grid and its corresponding collateral time grid, without including the cash flow time grid.

We observe that uncollateralized exposures are similar for both grids (top panels), but the standard grid fails to capture the spikes in exposures with VM (panel c vs d), because of the lack of the cash flow time grid. Adding the IM with the standard grid completely suppresses the residual exposure (panel e), thus leading to null XVA figures. Instead, the joint time grid allows us to correctly model all the spikes in the collateralized exposure (panel f), with considerable computational benefits with respect to the daily grid discussed in the previous section. In fact, the joint grid \(\Delta t =1M\) exposures in Fig. 3 (right-hand side) are very similar to the corresponding \(\Delta t =1D\) exposures in Fig. 2 (left-hand side). Similar results are obtained for the 5x10Y Swaption (see “Appendix C.3”).

The parsimonious time simulation grid discussed above is governed by two parameters: the constant granularity \(\Delta t\) used in the initial time grid and the number n of cash flows in the cash flow grid. Since the number of cash flows is fixed exogenously according to the trade or portfolio under analysis, the other parameter \(\Delta t\) can be used to tune the compromise between accuracy and computational performance in the XVA calculation.

We show in Table 3 the XVA results obtained for the 15Y Swap and the 5x10Y Swaption using the joint time grid with different granularities \(\Delta t\), taking the results obtained with the daily time grid as benchmark. We observe that uncollateralized XVA, without spikes, show a good convergence already for low granularities, i.e. \(\Delta t = 6\)M. In fact, the relative grid error is smaller than the MC \(3\sigma \) error. Instead, collateralized XVA require higher granularities, up to \(\Delta t = 1\)M, due to the exposure spikes. In particular, XVA with VM are dominated by the grid error for the Swap (except for DVA with \(\Delta t = 1M\)), and by the MC error for the Swaption (except for DVA with \(\Delta t = 12M,1M\)). XVA with both VM and IM, very small and highly spike dependent, are mainly dominated by the grid error, except for the Swaption’s CVA. The differences between collateralized Swaps and Swaptions are not surprising, since in the collateralized exposure for physical Swaptions (i) cash flows and spikes appear only after the Swaptions’ expiry and (ii) many MC paths after the Swaptions’ expiry date go OTM and give zero prices (e.g. the 5x10Y Swaption goes OTM for 45.2% of the MC paths). This is clearly visible in Fig. 10, where the spikes for the 5x10Y collateralized Swaptions (panels d–i) are much smaller than for the corresponding collateralized Swaps in Fig. 9.

Fig. 3
figure 3

EPE/ENE profiles for 15Y ATM payer Swap, EUR 100 Mio nominal amount, obtained with standard grid (left-hand side) and joint grid (right-hand side) with monthly granularity. The standard time grid is built by primary + collateral time grids (does not include the cash flow time grid). Other parameters as in Fig. 2. The joint grid \(\Delta t =1M\) exposures are very similar to \(\Delta t =1D\) exposures in Fig. 2 (left-hand side)

Looking at the computational performance (last column), we observe that, overall, the computational time is roughly proportional to the number of time simulation steps \(N_S\). In particular, the monthly grid is approx. 26–27 times faster than the daily grid and 2.3–2.5 times slower than the quarterly grid. Regarding the instruments, the Swaption is approx. 10–12 times slower than the Swap. Regarding the collateral, adding VM costs approx. a factor of 2, and adding also IM costs another factor of 14–15, in total approx. 28–30 times slower than the uncollateralized case, both for the Swap and the Swaption.

In light of this analysis we may confirm that the construction of the time simulation grid is a relevant source of model risk. For the purposes of the present work, we identify \(\Delta t = 1M\) as an acceptable compromise between accuracy and performance.

4.3 Monte Carlo convergence

The Monte Carlo simulation used in this work for XVA calculation (Eq. A60), although computationally intensive, allows us to manage the complexities inherent in XVA calculation, such as collateralization. Obviously, the most important parameter for MC is the number of MC scenarios, which has to be tuned to find an acceptable compromise between precision and computational effort.

Accordingly, we investigate the XVA convergence with respect to the number of Monte Carlo scenarios \(N_{MC}\). In order to do so, we assume the XVA figures calculated with a large number of MC scenarios (i.e. \(N_{MC} = 10^6\)) as proxies for the “exact” XVA figures, and we use them as benchmarks to assess the XVA convergence for smaller numbers of scenarios (always using the same seed in the pseudo-random number generator). Furthermore, in order to investigate the XVA Monte Carlo error, we use the upper and lower bounds on EPE/ENE in Eq. A64.
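As a reminder of the statistical error used throughout the tables (the exact EPE/ENE bounds are those of Eq. A64), a generic sketch of the \(3\sigma \) band for a Monte Carlo mean:

```python
import numpy as np

def mc_estimate_3sigma(samples):
    """Monte Carlo mean and 3-sigma half-width for a path-wise quantity, e.g.
    the discounted positive exposure at a given time step; the half-width
    shrinks as 1/sqrt(N_MC), driving the accuracy/performance trade-off."""
    x = np.asarray(samples, dtype=float)
    return x.mean(), 3.0 * x.std(ddof=1) / np.sqrt(x.size)
```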

To illustrate the MC convergence, we show in Fig. 4 the XVA convergence diagrams for the 5x10Y Swaption, for the three collateralization schemes considered.

Fig. 4

CVA (l.h.s) and DVA (r.h.s) convergence diagrams versus number of MC scenarios for the 5x10Y ATM physically settled European payer Swaption, EUR 100 Mio nominal amount, and the three collateralization schemes considered (top: no collateral, mid: VM, bottom: VM and IM). Left-hand scale, black line: simulated XVA; grey area: \(3\sigma \) confidence interval; dashed red line: “exact” value proxies (we omit their small confidence interval). Right-hand scale, blue line: convergence rate in terms of absolute percentage difference w.r.t. “exact” values. Model parameters other than \(N_{MC}\) as in Table 7

We observe that XVA converge, for all collateralization schemes, to the “exact” values with small absolute percentage differences already for a few paths (i.e. \(N_{MC} = 1000\)). As expected, higher differences can be observed for IM (bottom panels) due to the small XVA values; nevertheless, \(N_{MC} \ge 5000\) ensures an absolute percentage difference below 5%. Similar results are obtained for the 15Y Swap (see “Appendix C.4”). Regarding the computational effort, since it scales linearly with the number of simulated paths, we observe that beyond \(N_{MC} = 5000\) the benefits in terms of accuracy would be exceeded by the computational costs, particularly for IM.

In light of this analysis we may identify \(N_{MC} = 5000\) as an acceptable compromise between accuracy and performance for the purposes of the present work.

4.4 Forward vega sensitivity calculation

In this section we report the analyses conducted to validate the approach adopted to calculate the Vega sensitivity when simulating ISDA-SIMM dynamic IM (see “Appendix B.3”), which is a source of model risk for the Swaption’s XVA.

4.4.1 Sensitivity to G2++ parameters

ISDA-SIMM defines Vega sensitivity as the price change with respect to a 1% shift up in ATM shifted-Black implied volatility. Since the G2++ pricing formula for European Swaption does not depend explicitly on the Black implied volatility (see Eq. A29), in our framework Vega for Swaptions cannot be calculated at future time steps according to ISDA prescriptions. In “Appendix B.3.3” we propose an approximation scheme to calculate forward Vega by shifting up the G2++ model parameters governing the underlying process volatility. In order to validate this approach, we compare the Vega obtained at valuation date \(t_0\) through Eq. B31 with a “market” Vega and a “model” Vega, both consistent with ISDA prescriptions. Specifically, for a given combination of expiry and tenor, we computed the following three Vega sensitivities,

$$\begin{aligned} \nu _{1}(t_0)&= \frac{V^{\text {Blk}} \bigl ( t_0; \sigma ^{\text {Blk}}(t_0) + 0.01 \bigr )-V^{\text {Mkt}} (t_0) }{0.01} , \end{aligned}$$
(1)
$$\begin{aligned} \nu _{2}(t_0)&= \frac{V^{\text {G2++}} \bigl (t_0; {\hat{p}} \bigr )-V^{\text {G2++}} \bigl (t_0; p \bigr ) }{0.01} , \end{aligned}$$
(2)
$$\begin{aligned} \nu _{3}(t_0)&= \frac{V^{\text {G2++}} \bigl ( t_0;\sigma + \epsilon _{\sigma },\eta + \epsilon _{\eta } \bigr ) - V^{\text {G2++}} \bigl ( t_0; \sigma , \eta \bigr ) }{{\hat{\sigma }}^{\text {Blk/G2++}}(t_0) - \sigma ^{\text {Blk/G2++}}(t_0)}, \end{aligned}$$
(3)

where \(\nu _{1}\) denotes the “market” Vega obtained by shifting the ATM Black implied volatility by \(+1\%\) and re-pricing the Swaption via Black pricing formula; \(\nu _{2}\) denotes the “model” Vega obtained by shifting the ATM Black implied volatility matrix by \(+1\%\), re-pricing market Swaptions via Black pricing formula, re-calibrating the G2++ parameters p on these prices, and computing the Swaption price using the re-calibrated parameters \({\hat{p}}\); \(\nu _{3}\) denotes the Vega obtained according to the approximation outlined in “Appendix B.3.3”, i.e. by applying the shocks \(\epsilon _{\sigma }\) and \(\epsilon _{\eta }\) on the G2++ parameters \(\sigma \) and \(\eta \) governing the underlying process volatility, recomputing the G2++ Swaptions’ prices and the corresponding Black implied volatilities.

In addition, we also tested Eq. 3 against different values of the shocks \(\epsilon _{\sigma }, \epsilon _{\eta }\), considering both \(\epsilon _{\sigma } = \epsilon _{\eta }\) and \(\epsilon _{\sigma } \ne \epsilon _{\eta }\). In the latter case, we recovered the values for the shocks from the re-calibrated parameters \({\hat{\sigma }}\) and \({\hat{\eta }}\) of Eq. 2, i.e. \(\epsilon _{\sigma } = 1\%\) and \( \epsilon _{\eta } = 4\%\). The results of the comparison are reported in Table 4.

Table 4 5x10Y ATM physically settled European payer Swaption, EUR 100 Mio nominal amount, comparison between prices \(V(t_0)\) and \({\hat{V}}(t_0)\), Vega sensitivity \(\nu (t_0)\) and Vega Risk \(VR(t_0)\) at time step \(t_0\) for the different calculation approaches and parameterizations considered

We observe that the Vega sensitivities are fairly aligned among the three approaches and the different shock values examined. Hence, at the initial time step \(t_0\) our approximation produces Vega sensitivity and Vega Risk values consistent with those obtained by applying the ISDA definition. Therefore, we assume that this approach can be adopted also for future time steps. As regards the choice of shock sizes, in order to avoid any arbitrary element, we decided to compute forward Vega by using the re-calibrated \({\hat{\sigma }}\) and \({\hat{\eta }}\), corresponding to \(\epsilon _{\sigma } = 1\%\) and \( \epsilon _{\eta } = 4\%\).

4.4.2 Implied volatility calculation

Looking closely at the Monte Carlo simulation of forward swap rates, we find that some paths exhibit deeply negative rates, exceeding (in absolute terms) the value of the Black shift \(\lambda _{x}(t_0)\) used at the initial time step \(t_0\) in the calibration of the model parameters. This feature prevents the calculation of Black implied volatilities at future time steps, needed to compute the Vega sensitivity according to Eq. 3. In Fig. 5 we show the MC simulation of the 5x10Y forward swap rate, where in 2538 paths out of 5000 (51% of the total) the rate falls below \(-\lambda _{6\text {m}}(t_0)=-1\%\) for at least one time step.

Fig. 5

MC time simulation of the 5x10Y forward swap rate. The grey area represents the cloud of stochastic trajectories. Coloured paths are the ones for which the swap rate falls below \(-\lambda _{6\text {m}}(t_0)=-1\%\) for at least one time step

Table 5 5x10Y ATM physically settled European payer Swaption, EUR 100 Mio nominal amount

In light of this fact, in order to ensure the Vega sensitivity calculation for each time step and path, we are forced to use Black shift values larger than those necessary and sufficient at time step \(t_0\). To this end, we analysed the impact of different Black shifts on the shifted-Black implied volatility, Vega sensitivity and Vega Risk at \(t_0\). The results for the 5x10Y ATM Swaption are reported in Table 5. We observe that the shifted-Black implied volatility and the Vega sensitivity are highly impacted by the different Black shifts, but the Vega Risk, given by the product of the two quantities (see Eq. B16), is fairly stable, differing up to a maximum of 2% w.r.t. the case \(\lambda _{6\text {m}}=1\%\).

We conclude that Black shift values larger than those typically used at \(t_0\) ensure the inversion of the Black formula for each path with an acceptable accuracy in Vega Risk. For this reason we set \(\lambda _{6\text {m}} = 6\%\) in our calculations.
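For completeness, a minimal sketch of the shifted-Black pricing and implied volatility inversion involved (payer payoff, unit annuity by default; the root-search bounds are illustrative):

```python
import numpy as np
from scipy.stats import norm
from scipy.optimize import brentq

def shifted_black_payer(F, K, T, sigma, lam, annuity=1.0):
    """Shifted-Black payer price: Black formula applied to F + lam and K + lam.
    Requires F + lam > 0, hence the need for a large enough shift lam."""
    f, k = F + lam, K + lam
    d1 = (np.log(f / k) + 0.5 * sigma**2 * T) / (sigma * np.sqrt(T))
    d2 = d1 - sigma * np.sqrt(T)
    return annuity * (f * norm.cdf(d1) - k * norm.cdf(d2))

def implied_shifted_black_vol(price, F, K, T, lam, annuity=1.0):
    """Invert the shifted-Black formula by root search; it fails whenever
    F + lam <= 0, which is exactly the issue on deeply negative paths."""
    return brentq(lambda s: shifted_black_payer(F, K, T, s, lam, annuity) - price,
                  1e-6, 5.0)
```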

4.5 XVA sensitivities to CSA parameters

In order to establish the most relevant CSA parameters driving the exposure and the XVA, we analysed the XVA sensitivities with respect to the most important CSA parameters, i.e. the margin threshold K, the minimum transfer amount MTA, and the length of the margin period of risk (MPoR), keeping the other model parameters as in Table 7. In carrying out this analysis we distinguished between the following three collateralization schemes for both Swaps and Swaptions: (i) XVA with VM only; (ii) XVA with VM and IM, with K and MTA applied on VM only; (iii) XVA with VM and IM, with K and MTA applied on both VM and IM.

We report all the results in “Appendix C.5”. Overall, we found that the MPoR does not contribute significantly with respect to the K and MTA parameters. In particular, the threshold K is the most important parameter. As expected, for increasing values of K and MTA, the collateralized XVA converges to the uncollateralized value. We notice that this analysis does not identify XVA model risk factors, but it is very useful in practical situations, in particular when collateral agreements are negotiated, as widely happened during the financial benchmark reform.

4.6 Monte Carlo versus analytical XVA

As shown in “Appendix A.4.3”, in the case of uncollateralized Swaps, there exist analytical XVA formulas in terms of an integral over the values of co-terminal European Swaptions (see Eqs. A65–A66).

The corresponding numerical solution requires the discretization of the integrals on a time grid (see Eqs. A67–A68), whose granularity clearly introduces a model risk in the XVA figures. Therefore, we tested these formulas on time grids with different frequencies. Moreover, given the model-independent nature of this approach, we calculated the co-terminal Swaptions’ prices according to two different approaches:

  1. using our G2++ model, Eq. A29, with the G2++ parameters in Table 7;

  2. using the shifted-SABR model, i.e. shifted-Black formulas with shifted-lognormal SABR volatilities (see Hagan et al., 2002; Obloj, 2007) calibrated on the market swaption cube for each available smile section. We stress that this approach is not straightforward, since typically only a few of the co-terminal Swaptions entering into the XVA analytical formula correspond to quoted smile sections, where the SABR formula can be directly used. All the remaining Swaptions insisting on non-quoted smile sections require a delicate interpolation/extrapolation of the calibrated SABR parameters (see “Appendix A.4.3” for further details).

The purpose of this analysis is twofold: on the one hand, we want to test the results of the Monte Carlo approach against the analytical formulas; on the other hand, we want to quantify the model risk stemming from the use of alternative pricing models.

The results for the 15Y ATM and OTM payer Swaps are shown in Table 6. We observe that, with respect to the Monte Carlo approach (last column), the analytical formulas generally underestimate CVA and DVA values. The G2++ results (third column) are always consistent with the \(3\sigma \) Monte Carlo error (marked with an asterisk), with only one exception (the DVA of the OTM Swap with annual time grid granularity). This evidence confirms the robustness of the Monte Carlo simulation parameterized as discussed in the previous Sect. 4.3. Instead, the SABR results (fourth column) show considerable differences, particularly for the CVA, which is consistent with the \(3\sigma \) Monte Carlo error only in one case. This is not surprising, since we are using two completely different dynamics of the underlying risk factors (G2++ for MC vs SABR for analytical). In terms of computational performance, the analytical approach is obviously much faster than the Monte Carlo approach: even with a daily time grid robust results can be obtained almost immediately, meaning that, for an uncollateralized Swap, the analytical approach can replace the Monte Carlo approach when performance is critical.

4.7 Tuning accuracy versus performance

The model validations performed in the previous sections allowed to identify the most important model risk factors and the corresponding calculation parameters governing the XVA framework and affecting the XVA figures, both in terms of accuracy and computational performance, which we summarize in Table 7.

Table 6 XVA figures obtained for 15Y ATM/OTM payer Swap, EUR 100 Mio nominal amount
Table 7 XVA framework parameters and respective values adopted according to the model validations performed in the previous sections

In the last column we report the parameter values identified in our model validation analyses, which set our acceptable compromise between accuracy and performance. Regarding the CSA parameters, we considered a bilateral CSA with \(\text {K}=\text {MTA}=0\) both for VM and IM, with \(l=2\) days, which is common practice and also gets close to the perfect collateralization case.

Essentially, the most important parameters are the number of time steps \(N_S\) in the time simulation grid and the number \(N_{MC}\) of Monte Carlo scenarios. Their product \(N_C = N_S \times N_{MC}\) is proportional to the computational time \(T_C\) required for the XVA calculation, i.e. \(T_C = \alpha N_C\), where the proportionality coefficient \(\alpha \) depends on the hardware available, all the rest being equal. Hence, given a computational budget \(\Delta T\), i.e. the maximum time that one is willing to wait to compute the XVA figures, tuning precision vs performance roughly amounts to setting \(N_S\) and \(N_{MC}\) such that \(T_C \le \Delta T\).
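In code, this budget rule is elementary; the value of \(\alpha \) below is hypothetical and must be measured on the target hardware and portfolio:

```python
def max_mc_scenarios(budget_seconds, n_steps, alpha):
    """Largest N_MC compatible with T_C = alpha * N_S * N_MC <= budget, where
    alpha is the measured cost in seconds per (time step, path) pair."""
    return int(budget_seconds // (alpha * n_steps))

# E.g. alpha = 2e-4 s (hypothetical), N_S = 400 steps, 1-hour budget:
# max_mc_scenarios(3600, 400, 2e-4) -> 45000 paths.
```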

We stress that this choice is not unique, since it depends on the specific context, in particular: (i) the trades or portfolio under analysis, (ii) the presence of collateral, in particular the IM, (iii) the hardware available, (iv) the calculation time constraints, (v) the desired level of accuracy, (vi) the purpose of the XVA calculation, e.g. either structuring a single trade for a client, or end of day XVA revaluation, or end of quarter accounting fair value measurement, (vii) the purpose of the model validation, e.g. either for the Front Office quants developing the XVA engine for the XVA trading desk, or for the Model Validation quants challenging the Front Office framework.

The considerations above answer to our first and second research questions reported in Sect. 1.

5 XVA model risk

According to the EU regulation (see EU, 2013; EC, 2016), financial institutions are required to apply prudent valuation to fair-valued positions in order to mitigate their valuation risk, i.e. the risk of losses deriving from the valuation uncertainty in the exit price of financial instruments. The prudent value has to be computed on top of the fair value, including possible fair valuation adjustments accounted in the income statement, considering 9 different valuation risk factors at the \(90\%\) confidence level from a distribution of exit prices. The corresponding 9 differences between the prudent value and the fair value, called Additional Valuation Adjustments, are aggregated and finally deducted from the Common Equity Tier 1 (CET1) capital. In particular, the Model Risk (MoRi) AVA, envisaged in art. 11 of EC (2016), comprises the valuation uncertainty linked to the “potential existence of a range of different models or model calibrations used by market participants”. Accordingly, for the MoRi AVA the prudent value at a 90% confidence level corresponds to the \(10{\textrm{th}}\) percentile of the distribution of the plausible prices obtained from different models/parameterizations.Footnote 7

Hence, we compute a MoRi AVA based on the analyses described in the previous Sect. 4. In particular, we build the distribution of XVA exit prices by considering the following four sources of model risk: (i) G2++ model calibration approach (see Sect. 4.1), (ii) time grid construction approach and related granularity \(\Delta t\) (see Sect. 4.2), (iii) number of MC scenarios \(N_{MC}\) (see Sect. 4.3), and (iv) the fast analytical XVA formulas for uncollateralized Swaps with different time grid granularities and SABR pricing formulas for the strip of co-terminal Swaptions (see Sect. 4.6).

According to the ranges of parameter values examined in Sect. 4 for the four sources of model risk above, we would obtain a distribution of XVA exit prices including 1440 points for the uncollateralized SwapsFootnote 8 and 1400 points for the uncollateralized Swaption.Footnote 9 Since the production of such a high number of XVA exit prices is computationally prohibitive, we restrict our analysis by considering 236 pointsFootnote 10 for the Swap, and 196 pointsFootnote 11 for the Swaption. Therefore, we compute the MoRi AVA at time \(t_0\) asFootnote 12

$$\begin{aligned} \begin{aligned} \text {AVA}^{\text {MoRi}}(t_0)&= V(t_0;M) - \text {PV}(t_0;M^{*})\\&= \text {CVA}(t_0;M) - \text {CVA}(t_0;M^{*}), \end{aligned} \end{aligned}$$
(4)

where:

  • \(V(t_0;M) = V_0(t_0) + \text {XVA} \left( t_0; M \right) \) is the fair-value of the instrument, intended as the price obtained from our XVA framework, denoted here with M (see Table 7);

  • \(\text {PV}(t_0;M^{*}) = V_0(t_0) + \text {XVA}(t_0;M^{*})\) is the prudent value obtained from the prudent XVA framework, denoted by \(M^{*}\), determined as the \(10{\textrm{th}}\) percentile of the XVA exit price distribution. In other words, \(M^{*}\) ensures that one can exit the XVA at a price equal to or larger than \(\text {PV}(t_0;M^{*})\) with a degree of certainty equal to or larger than 90%Footnote 13;

  • the final formula reduces to the CVA only, since the EU regulation expressly excludes any own credit risk component, such as the DVA, which is filtered out from the CET1 capital; furthermore, we are not considering the valuation uncertainty related to the base value \(V_0(t_0)\).
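Operationally, once the distribution of exit prices is available, Eq. 4 reduces to a percentile computation, as in the following sketch (CVA signed as its contribution to the exit price, so that the prudent value sits in the left tail):

```python
import numpy as np

def mori_ava(cva_fair, cva_exit_prices):
    """AVA_MoRi = CVA(t0; M) - CVA(t0; M*): the prudent framework M* is read
    off the 10th percentile of the exit price distribution (90% confidence)."""
    cva_prudent = np.percentile(np.asarray(cva_exit_prices, dtype=float), 10.0)
    return cva_fair - cva_prudent
```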

We report in Fig. 6 the CVA distributions for the Swap and the Swaption. We observe that both distributions have a positive skew, with many points concentrated around the left tail. The less conservative points falling in the right tail are attributable to the analytical formulas and to MC simulations with a low number of scenarios and/or less granular time grids for both instruments, as visible in Tables 17 and 18, which detail these distributions.

Fig. 6

CVA distributions for the 15 years ATM payer Swap (left-hand side) and for the 5 \(\times \) 10 years ATM physically settled European payer Swaption (right-hand side) without collateral. Red lines denote the XVA framework (M) and the prudent XVA framework (\(M^{*}\))

In the following Tables 8 and 9 we show a summary of the full Tables 17 and 18. Looking at the 15Y Swap, the MoRi AVA, corresponding to the prudent XVA framework \({M_{24}}\) (calibration \(p_2\), joint grid, \(\Delta t = 1\)M and \(N_{MC} = 18{,}000\)), is equal to \(0.20\%\) of the CVA obtained from our XVA framework. For the 5x10Y Swaption, the MoRi AVA, corresponding to \({M_{20}}\) (calibration \(p_1\), joint grid, \(\Delta t = 1\)M and \(N_{MC} = 14{,}000\)), is equal to \(0.66\%\) of the CVA.

Table 8 15 years ATM payer swap, no collateral
Table 9 5x10 years ATM physically settled European payer Swaption, no collateral

In conclusion, we observe that the small relative AVA values are due to the fact that our XVA framework produces already conservative CVA figures, and most of the mass of the XVA distribution is concentrated in the left tail. A different compromise between accuracy and performance, e.g. faster but less accurate, may produce more significant relative AVA values, leading to non-negligible CET1 reductions in the case of large financial institutions with important XVA figures.

The considerations above answer to our third research question reported in Sect. 1.

6 Conclusions

In this work we investigated the XVA model risk. To this end we focused on an industry-standard, realistic and complete XVA modelling framework, typically used by XVA trading desks, based on a multi-curve time-dependent volatility G2++ stochastic dynamics calibrated on real market data, i.e. distinct discounting and forwarding yield curves, CDS spread curves, and swaption volatility cube. The numerical XVA calculation is based on a multi-step Monte Carlo simulation including both dynamic variation and initial margins, the latter under the ISDA Standard Initial Margin Model. We applied this framework to the most common linear and non-linear interest rate derivatives, i.e. Swaps and European Swaptions with different maturities and strikes. Within this context, we formulated in Sect. 1 three research questions, for which we report the corresponding answers below.

  A1. Within the XVA modelling framework above, we were able to identify and investigate the most important model risk factors to which exposure modelling and thus XVA are most sensitive, and to measure the associated computational effort. In particular, we showed that a crucial model risk factor is the construction of a MC time simulation grid able to capture the spikes arising in collateralized exposure during the margin period of risk, which have a material impact on XVA figures. To this end, we proposed a strategy to build a parsimonious and efficient grid which ensures capturing all the spikes while reducing the computational effort. Regarding the MC simulation, we observed a convergence of XVA figures even for a limited number of MC scenarios, leaving room for further savings of computational time if necessary. Regarding the simulation of the initial margin, further assumptions are required to compute G2++ forward Vega sensitivities according to the ISDA-SIMM prescriptions. The related model risk has been addressed by ensuring that our calculation strategy is aligned with two alternative approaches, both consistent with the ISDA-SIMM definition at time \(t_0\) (i.e. the valuation date), and by avoiding any arbitrary elements in the choice of the associated parameters. Finally, we showed that XVA analytical formulas for uncollateralized Swaps represent a useful tool for validating the MC results and for speeding up XVA calculations. In this case we found that model risk arises from the discrete time grid used in the XVA analytical formula and from the model used to price the corresponding strip of co-terminal European Swaptions. XVA figures obtained with the G2++ pricing formulas, consistent with the G2++ dynamics of the underlying risk factors, turned out to be superior to those obtained with the SABR model, which assumes a different dynamics.

  A2. The model risk analyses above allowed us to identify a parametrization of the XVA modelling framework achieving a compromise between accuracy and performance, i.e. leading to sufficiently robust XVA figures in a reasonable time, a very important feature for practical applications. Obviously, this choice is not unique, and our analyses make it possible to adapt the parameters to different contexts and purposes of the XVA calculation.

  A3. Finally, based on the large number of different parameterizations considered in the analyses above, we were able to estimate the XVA model risk using the Additional Valuation Adjustment (AVA) envisaged by the EU regulation as the 10th percentile of the XVA distribution, corresponding to the 90% confidence level for XVA.

Our framework is general and could be extended to include other valuation adjustments, e.g. Funding Valuation Adjustment (FVA) and Margin Valuation Adjustment (MVA), other financial instruments, and XVA calculation at portfolio level. The computational performance could be enhanced by using latest-generation high-dimensional scrambled Sobol sequence generators, which allow reducing the number of scenarios while keeping the MC error under control (see e.g. Atanassov & Kucherenko, 2020; Scoleri et al., 2021). Adjoint algorithmic differentiation (Capriotti & Giles, 2012; Huge & Savine, 2020) or Chebyshev decomposition (Maran et al., 2021) could be used to speed up and stabilize the sensitivities calculation for initial margin modelling.