The finite-difference approach with equidistant grids is easy to understand and straightforward to implement. The resulting uniform rectangular grids are convenient, but in many applications not flexible enough. Steep gradients of the solution require a locally finer grid such that the difference quotients provide good approximations of the differentials. On the other hand, a flat gradient may be well modeled on a coarse grid. Arranging such flexibility of the grid with finite-difference methods is possible but cumbersome.

An alternative class of methods for solving PDEs that does provide high flexibility is that of finite-element methods (FEM). A “finite element” designates a mathematical object such as an interval together with a piece of a function defined on it. There are alternative names, such as variational methods, weighted residuals, or Ritz–Galerkin methods. These names hint at underlying principles that serve to derive suitable equations. As the different names suggest, there are several different approaches leading to finite elements. The methods are closely related.

The flexibility of finite-element methods is favorable not only for approximating functions, but also for approximating domains of computation that are not rectangular. This is important for multifactor options. For the one-dimensional situation of standard options, the possible improvement of a finite-element method over the standard methods of the previous chapter is not significant. With the focus on standard options, Chap. 5 may be skipped on first reading. But options with several underlyings may lead to domains of computation that are more “fancy.”

For example, a two-asset basket with portfolio value \(\alpha _{1}S_{1} +\alpha _{2}S_{2}\) in the case of a call option leads to a payoff of the type \(\varPsi (S_{1},S_{2}) = (\alpha _{1}S_{1} +\alpha _{2}S_{2} - K)^{+}\). If such an option is endowed with barriers, then it is reasonable to set up the barriers such that the payoff takes a constant value along them. For the two-asset basket, this amounts to barrier lines \(\alpha _{1}S_{1} +\alpha _{2}S_{2} =\text{ constant}\). This naturally leads to trapezoidal shapes of domains. For a special case with two knock-out barriers the payoff and the domain are illustrated by Fig. 5.1. This example will be considered in Sect. 5.4; see the domain in Fig. 5.5. In more complicated examples, the domain may be elliptic (Exercise 5.1). In such situations of non-rectangular domains, finite elements are ideally applicable and highly recommendable.

Fig. 5.1

Payoff Ψ(S 1, S 2) of a call on a two-asset basket, with knock-out barrier (Example 5.6)

Faced with the huge field of finite-element methods, in this chapter we confine ourselves to a step-by-step exposition towards the solution of two-asset options. We start with an overview of basic approaches and ideas (Sect. 5.1). Then, in Sect. 5.2, we describe the approximation with the simplest finite elements, namely piecewise straight-line segments, and apply this to a stationary model problem. These approaches will be applied to the time-dependent situation of pricing standard options in Sect. 5.3. This sets the stage for the main application of FEM in financial engineering, options on two or more assets. Section 5.4 will present an application to an exotic option with two underlyings. Here we derive a weak form of the PDE, and discuss boundary conditions. Finally, in Sect. 5.5, we give an introduction to error estimates. Methods more subtle than a mere Taylor expansion of the discretization error are required to show that quadratic convergence is possible with unstructured grids and nonsmooth solutions. To keep the exposition of an error analysis short, we concentrate on the one-dimensional situation. But the ideas extend to multidimensional scenarios.

5.1 Weighted Residuals

Many of the principles on which finite-element methods are based can be interpreted as weighted residuals. What does this mean? The heading points at the way a discretization can be set up, and how an approximation can be defined. There lies a duality in a discretization. This is illustrated by means of Fig. 5.2, which shows a partition of an x-axis. This discretization is represented either by

  (a) discrete grid points \(x_{i}\), or by

  (b) a set of subintervals.

Fig. 5.2
Discretization of a continuum

The two ways to see a discretization lead to different approaches of constructing an approximation w. Let us illustrate this with the one-dimensional situation of Fig. 5.3. An approximation w based on finite differences is built on the grid points and primarily consists of discrete points (Fig. 5.3a). In contrast, finite elements are founded on subdomains (intervals in Fig. 5.3b) with piecewise functions, which are defined by suitable criteria and constitute a global approximation w. In a narrower sense, a finite element is a pair consisting of one piece of subdomain and the corresponding function defined thereupon, mostly a polynomial. Figure 5.3 reflects the respective basic approaches; in a second step the isolated points of a finite-difference calculation can well be extended to continuous piecewise functions by means of interpolation ( Appendix C.1).

Fig. 5.3

Two kinds of approximations (one-dimensional situation)

A two-dimensional domain can be partitioned into triangles, for example, where w is again represented by piecewise polynomials. Figure 5.4 depicts the simplest such situation, namely, a triangle in an (x, y)-plane, and a piece of a linear function defined thereupon. Figure 5.5 below will provide an example how triangles easily fill a seemingly “irregular” domain.

Fig. 5.4

A simple finite element in two dimensions, based on a triangle

As will be shown next, the approaches of finite-element methods use integrals. If set up properly, the integrals require less smoothness of the functions involved. This often matches applications better and adds to the flexibility of finite-element methods. The integrals can be derived in a natural way from minimum principles, or can be constructed artificially. Finite elements based on polynomials make the calculation of the integrals easy.

5.1.1 The Principle of Weighted Residuals

To explain the principle of weighted residuals we discuss the formally simple case of the differential equation

$$\displaystyle{ Lu = f\,. }$$
(5.1)

Here L symbolizes a linear differential operator. Important examples are

$$\displaystyle\begin{array}{rcl} Lu:& =& -u''\ \text{ for }\ u(x),\ \text{ or }{}\end{array}$$
(5.2)
$$\displaystyle\begin{array}{rcl} Lu:& =& -u_{xx} - u_{yy}\ \text{ for }\ u(x,y)\,.{}\end{array}$$
(5.3)

The right-hand side f is a problem-dependent function. Solutions u of the differential equation (5.1) are studied on a domain \(\mathcal{D}\subseteq \mathbb{R}^{n}\), with n = 1 in (5.2) and n = 2 in (5.3). The piecewise approach starts with a partition of the domain into a finite number m of subdomains \(\mathcal{D}_{k}\),

$$\displaystyle{ \mathcal{D} =\bigcup _{ k=1}^{m}\mathcal{D}_{ k}\,. }$$
(5.4)

All boundaries of \(\mathcal{D}\) should be included, and approximations to u are calculated on the closure of \(\mathcal{D}\). The partition is assumed disjoint up to the boundaries of the \(\mathcal{D}_{k}\), so \(\mathcal{D}_{j}^{\circ }\cap \mathcal{D}_{k}^{\circ } =\emptyset\) for \(j\neq k\). In the one-dimensional case (n = 1), for example, the \(\mathcal{D}_{k}\) are subintervals of a whole interval \(\mathcal{D}\). In the two-dimensional case, (5.4) may describe a partition into triangles, as illustrated in Fig. 5.5.

Fig. 5.5

A simple regular finite-element discretization of a domain \(\mathcal{D}\) into triangles \(\mathcal{D}_{k}\) (see Example 5.6)

The ansatz for approximations w to a solution u is a basis representation with N basis functions \(\varphi _{i}\),

$$\displaystyle{ w:=\sum _{ i=1}^{N}c_{ i}\,\varphi _{i}\,. }$$
(5.5)

The functions \(\varphi _{i}\) are also called trial functions. In the case of one independent variable x, the \(c_{i} \in \mathbb{R}\) are constant coefficients, and the \(\varphi _{i}\) are functions of x. Typically, N is chosen and \(\varphi _{1},\ldots,\varphi _{N}\) are prescribed. Depending on this choice, the free parameters \(c_{1},\ldots,c_{N}\) are to be determined such that \(w \approx u\). The ansatz (5.5) was suggested by Ritz in 1908.

We have m subdomains and N basis functions. In the one-dimensional situation (n = 1), nodes and subintervals interlace, and m and N can essentially be identified. For n = 1 the two numbers m and N differ by at most one, depending on whether the solution is known or unknown at the end points of the interval \(\mathcal{D}\). In the latter case it is convenient to have the summation index in (5.5) run as \(i = 0,\ldots,m\). For dimensions n > 1 the number m of subdomains (e.g. triangles in case n = 2) in general differs from the number N of basis functions (nodes). For example, in Fig. 5.5 we have 75 triangles and 51 nodes; 26 of the nodes are interior nodes and 25 are placed along the boundary. That is, 1 ≤ k ≤ 75. The number N refers to the number of nodes for which a value of u is to be approximated.

One strategy to determine the coefficients c i is based on the residual function

$$\displaystyle{ R(w):= Lw - f\,. }$$
(5.6)

We look for a w such that the residual R becomes “small.” Since the \(\varphi _{i}\) are considered prescribed, in view of (5.5) N conditions or equations must be established to define and calculate the unknown \(c_{1},\ldots,c_{N}\). To this end we weight the residual R by introducing N weighting functions (test functions) \(\psi _{1},\ldots,\psi _{N}\) and require

$$\displaystyle{ \int _{\mathcal{D}}R(w)\,\psi _{j}\,\mathrm{d}\mathcal{D} = 0\quad \text{ for }j = 1,\ldots,N\,. }$$
(5.7)

This amounts to the requirement that the residual be orthogonal to the set of weighting functions \(\psi _{j}\). The “\(\mathrm{d}\mathcal{D}\)” in (5.7) symbolizes the integration that matches \(\mathcal{D}\subseteq \mathbb{R}^{n}\), such as dx for n = 1. For ease of notation, we frequently drop the dx as well as the \(\mathcal{D}\) at the n-dimensional integral. For the model problem (5.1) the system of Eqs. (5.7) consists of the N equations

$$\displaystyle{ \int _{\mathcal{D}}Lw\,\psi _{j} =\int _{\mathcal{D}}f\,\psi _{j}\quad (\,j = 1,\ldots,N) }$$
(5.8)

for the N unknowns \(c_{1},\ldots,c_{N}\), which define w. Often the equations in (5.8) are written using a formulation with inner products,

$$\displaystyle{(Lw,\psi _{j}) = (\,f,\psi _{j})\,,}$$

defined as the corresponding integrals in (5.8). For linear L the ansatz (5.5) implies

$$\displaystyle{\int Lw\,\psi _{j} =\int \left (\sum _{i}c_{i}L\varphi _{i}\right )\psi _{j} =\sum _{i}c_{i}\mathop{\underbrace{ \int L\varphi _{i}\psi _{j}}}\limits _{=:a_{ij}}\,.}$$

The integrals \(a_{ij}\) constitute a matrix A. The \(r_{j}:=\int f\psi _{j}\) set up the elements of a vector r, and the coefficients \(c_{j}\) a vector \(c = (c_{1},\ldots,c_{N})^{\mathrm{tr}}\). In vector notation the system of equations is rewritten as

$$\displaystyle{ Ac = r\,. }$$
(5.9)

This outlines the general principle, but leaves open the questions of how to handle boundary conditions and how to select basis functions \(\varphi _{i}\) and weighting functions \(\psi _{j}\). The freedom to choose trial functions \(\varphi _{i}\) and test functions \(\psi _{j}\) allows one to construct several different methods. For the time being suppose that these functions are smooth enough to be differentiated or integrated as required. We will enter a discussion of relevant function spaces in Sect. 5.5.
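To make the principle concrete, here is a minimal numerical sketch, not from the text: Galerkin’s approach for \(Lu = -u''\) on (0, 1) with u(0) = u(1) = 0, using the assumed global trial functions \(\varphi _{i}(x) =\sin (i\pi x)\) and the assumed test case f ≡ 1, whose exact solution is x(1 − x)∕2.

```python
import numpy as np

def trap(y, x):                          # simple trapezoidal quadrature
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

N = 20
x = np.linspace(0.0, 1.0, 2001)
f = np.ones_like(x)                      # assumed test case: f == 1

A = np.zeros((N, N))
r = np.zeros(N)
for j in range(1, N + 1):
    psi = np.sin(j * np.pi * x)          # Galerkin: psi_j = phi_j
    r[j - 1] = trap(f * psi, x)          # r_j = int f psi_j
    for i in range(1, N + 1):
        Lphi = (i * np.pi) ** 2 * np.sin(i * np.pi * x)   # L phi_i = -phi_i''
        A[j - 1, i - 1] = trap(Lphi * psi, x)             # a_ij = int L phi_i psi_j

c = np.linalg.solve(A, r)                # the system (5.9), A c = r
w = sum(c[i - 1] * np.sin(i * np.pi * x) for i in range(1, N + 1))
u_exact = 0.5 * x * (1.0 - x)            # exact solution of -u'' = 1
print(np.max(np.abs(w - u_exact)))       # small truncation/quadrature error
```

With this basis A is nearly diagonal because the sine functions are orthogonal; for local bases such as the hat functions of Sect. 5.2, A is sparse instead.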

5.1.2 Examples of Weighting Functions

We postpone the choice of basis functions \(\varphi _{i}\) and begin by listing important examples of how to select weighting functions ψ:

  1.)

    Galerkin’s choice: Choose \(\psi _{j}:=\varphi _{j}\) for all j. Then \(a_{ij} =\int L\varphi _{i}\,\varphi _{j}\).

  2.)

    Collocation: Choose \(\psi _{j}:=\delta (x - x_{j})\). Here δ denotes Dirac’s delta function, which in \(\mathbb{R}^{1}\) satisfies \(\int f\,\delta (x - x_{j})\,\mathrm{d}x = f(x_{j})\). As a consequence,

    $$\displaystyle{\int Lw\,\psi _{j} = Lw(x_{j})\,,\quad \int f\psi _{j} = f(x_{j})\,.}$$

    That is, a system of equations \(Lw(x_{j}) = f(x_{j})\) results, which amounts to evaluating the differential equation at selected points \(x_{j}\).

  3.)

    Least squares:

    Choose

    $$\displaystyle{\psi _{j}:={ \partial R \over \partial c_{j}}\,.}$$

    This choice of test functions deserves the name least squares: to minimize the squared residual \(\int _{\mathcal{D}}R^{2}\), with \(R = R(c_{1},\ldots,c_{N})\), the necessary condition is a vanishing gradient with respect to the \(c_{j}\), so

    $$\displaystyle{\int _{\mathcal{D}}R{\partial R \over \partial c_{j}} = 0\quad \text{ for all }j\,.}$$
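The collocation choice 2.) above can be sketched as follows, under assumptions not in the text: the polynomial basis \(\varphi _{i}(x) = x^{i}(1 - x)\) (each satisfying the boundary conditions u(0) = u(1) = 0), the model operator Lu = −u″, and f ≡ 1. Since the exact solution x(1 − x)∕2 lies in the span of this basis, collocation recovers it exactly.

```python
import numpy as np

# Collocation for -u'' = f, u(0) = u(1) = 0, with the assumed basis
# phi_i(x) = x^i (1 - x), i = 1, ..., N, and N collocation points x_j.
N = 3
xj = np.linspace(0.2, 0.8, N)               # collocation points (a choice)

def Lphi(i, x):                             # -phi_i'' for phi_i = x^i - x^(i+1)
    return -(i * (i - 1) * x ** (i - 2) - (i + 1) * i * x ** (i - 1))

# square system  L w(x_j) = f(x_j)  with f == 1:
M = np.array([[Lphi(i, xj[j]) for i in range(1, N + 1)] for j in range(N)])
c = np.linalg.solve(M, np.ones(N))

# exact solution x(1-x)/2 = 0.5 * phi_1 lies in the span, so c = (0.5, 0, 0):
print(c)
```

Note that no integration is needed here; the delta test functions reduce (5.7) to pointwise evaluation.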

5.1.3 Examples of Basis Functions

The construction of suitable basis functions \(\varphi _{i}\) observes the underlying partition into subdomains \(\mathcal{D}_{k}\). Our concern will be to meet two aims: the resulting methods must be accurate, and their implementation should be efficient.

Efficiency hinges on the sparsity of the matrices involved. In particular, if the matrix A of the linear system is sparse, then the system can be solved efficiently even when it is large. In order to achieve sparsity we require \(\varphi _{i} \equiv 0\) on most of the subdomains \(\mathcal{D}_{k}\). Figure 5.6 illustrates an example for the one-dimensional case n = 1. The hat function of Fig. 5.6 is the simplest example related to finite elements. It is piecewise linear, and each function \(\varphi _{i}\) has a support consisting of only two subintervals; \(\varphi _{i}(x)\neq 0\) only for x in its support. A consequence is

$$\displaystyle{ \int _{\mathcal{D}}\varphi _{i}\varphi _{j} = 0\ \text{ for }\vert i - j\vert> 1\,, }$$
(5.10)

as well as an analogous relation for \(\int \varphi _{i}^{{\prime}}\varphi _{j}^{{\prime}}\). We will discuss hat functions in the following Sect. 5.2. Basis functions more advanced than the canonical hat functions are constructed from piecewise polynomials of higher degree. In this way, basis functions with \(\mathcal{C}^{1}\)- or \(\mathcal{C}^{2}\)-smoothness can be obtained (Exercise 5.2). Recall from interpolation (Appendix C.1) that polynomials of degree three can lead to \(\mathcal{C}^{2}\)-smooth splines.

Fig. 5.6

“Hat function”: simple choice of finite elements

5.1.4 Smoothness

We have left open how close an approximation w of (5.5)/(5.9) is to the solution u of (5.1). Clearly, R(u) = 0 and u satisfies (5.7). But w in general does not solve (5.1). The differential equation (5.1) is a stronger requirement than the integral relations (5.7).

The accuracy depends on the smoothness of the basis functions. Depending on the chosen method, different kinds of smoothness are relevant. Let us illustrate this matter on the model problem (5.2),

$$\displaystyle{Lu = -u'',\quad \text{with }\ u,\varphi,\psi \in \{\, u\,\mid \,u(0) = u(1) = 0\,\}\,.}$$

Integration by parts formally implies

$$\displaystyle{\int _{0}^{1}\varphi ''\psi = -\int _{ 0}^{1}\varphi '\psi ' =\int _{ 0}^{1}\varphi \psi ''\,,}$$

because the boundary conditions u(0) = u(1) = 0 make the boundary terms vanish. These three versions of the integral can be distinguished by the smoothness requirements on φ and ψ, and by the question whether the integrals exist. One will choose the integral version that corresponds to the underlying method and to the smoothness of the solution. For example, for Galerkin’s approach the elements \(a_{ij}\) of A consist of the integrals

$$\displaystyle{-\int _{0}^{1}\varphi _{ i}^{{\prime\prime}}\varphi _{ j} =\int _{ 0}^{1}\varphi _{ i}^{{\prime}}\varphi _{ j}^{{\prime}}\,.}$$

We will return to the topics of accuracy, convergence, and function spaces in Sect. 5.5 (with Appendix C.3).

5.2 Ritz–Galerkin Method with One-Dimensional Hat Functions

As mentioned before, the required flexibility is provided by finite-element methods. This holds to an even larger extent in higher-dimensional spaces. In this section, for simplicity, we stick to the one-dimensional situation, \(x \in \mathbb{R}\). The dependence on the time variable t will be postponed to Sect. 5.3.

Assume a partition of the x-domain by a set of increasing mesh points \(x_{0},\ldots,x_{m}\). A nonuniform spacing is advisable in several instances in order to improve the accuracy. For example, close to the strike a denser grid is appropriate to mollify the lack of smoothness of the payoff. In contrast, to model infinity, the nodes are spread out for larger x and the final node \(x_{m}\) is shifted to a large value. One strategy is to select a spacing such that locally (up to additional scaling and shifts) \(x_{i} =\sinh (\eta _{i})\), where the \(\eta _{i}\) are chosen equidistantly. A dense spacing is also advisable for barrier options close to the barrier, where the gradient of the option price is high.
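The sinh-based spacing can be sketched as follows; the helper name `sinh_mesh`, the clustering point `x_star`, and the scale parameter `c` are illustrative assumptions, not from the text.

```python
import numpy as np

# Sketch of a sinh-concentrated mesh: equidistant eta_i are mapped through
# sinh so that nodes cluster near a chosen point x_star (e.g. the strike);
# the assumed parameter c controls how strong the clustering is.
def sinh_mesh(x_min, x_max, x_star, m, c=20.0):
    a = np.arcsinh((x_min - x_star) / c)
    b = np.arcsinh((x_max - x_star) / c)
    eta = np.linspace(a, b, m + 1)          # equidistant eta_i
    return x_star + c * np.sinh(eta)        # nonuniform x_i

x = sinh_mesh(0.0, 200.0, 100.0, 40)        # 41 nodes, dense near x = 100
h = np.diff(x)
print(h.min(), h.max())                     # smallest steps near x_star
```

The endpoints map back exactly, and the step sizes grow monotonically away from `x_star`, which rarefies the grid towards the truncated “infinity.”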

5.2.1 Hat Functions

The prototype of a finite-element method makes use of the hat functions, which we define formally (compare Figs. 5.6 and 5.7).

Fig. 5.7

Special “hat functions” φ 0 and φ m

Definition 5.1 (Hat Functions)

For \(1 \leq i \leq m - 1\) set \(\varphi _{i}(x):= 0\) on all subintervals except two:

$$\displaystyle\begin{array}{rcl} \varphi _{i}(x):& =&{ x - x_{i-1} \over x_{i} - x_{i-1}}\quad \text{ for }x_{i-1} \leq x <x_{i}\,, {}\\ \varphi _{i}(x):& =&{ x_{i+1} - x \over x_{i+1} - x_{i}}\quad \text{ for }x_{i} \leq x <x_{i+1}\,, {}\\ \end{array}$$

and boundary functions φ 0, φ m nonzero on just one subinterval:

$$\displaystyle\begin{array}{rcl} & & \varphi _{0}(x):={ x_{1} - x \over x_{1} - x_{0}}\quad \text{ for }x_{0} \leq x <x_{1}\,, {}\\ & & \varphi _{m}(x):={ x - x_{m-1} \over x_{m} - x_{m-1}}\quad \text{ for }x_{m-1} \leq x \leq x_{m}\,. {}\\ \end{array}$$

For each node x i there is one hat function. These m + 1 hat functions satisfy the following properties.
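A direct transcription of Definition 5.1; the grid and the checks below are illustrative. The hats interpolate, \(\varphi _{i}(x_{j}) =\delta _{ij}\), and they sum to one.

```python
import numpy as np

def hat(i, x, nodes):
    """Hat function phi_i of Definition 5.1 on an arbitrary grid."""
    x = np.asarray(x, dtype=float)
    m = len(nodes) - 1
    y = np.zeros_like(x)
    if i > 0:                                   # rising edge on [x_{i-1}, x_i)
        hi = (x <= nodes[i]) if i == m else (x < nodes[i])
        mask = (x >= nodes[i - 1]) & hi
        y[mask] = (x[mask] - nodes[i - 1]) / (nodes[i] - nodes[i - 1])
    if i < m:                                   # falling edge on [x_i, x_{i+1}]
        mask = (x >= nodes[i]) & (x <= nodes[i + 1])
        y[mask] = (nodes[i + 1] - x[mask]) / (nodes[i + 1] - nodes[i])
    return y

nodes = np.array([0.0, 0.3, 0.5, 1.0])          # a nonuniform example grid
xx = np.linspace(0.0, 1.0, 501)
total = sum(hat(i, xx, nodes) for i in range(len(nodes)))
print(np.allclose(total, 1.0))                  # partition of unity
print(hat(1, nodes, nodes))                     # [0. 1. 0. 0.]: phi_i(x_j) = delta_ij
```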

Properties 5.2 (Hat Functions)

The following properties (a)–(e) hold:

  (a)

    The \(\varphi _{0},\ldots,\varphi _{m}\) form a basis of the space of polygons

    $$\displaystyle\begin{array}{rcl} \{\,g \in \mathcal{C}^{0}[x_{ 0},x_{m}]\ \mid \ & & g\text{ straight line on }\mathcal{D}_{k}:= [x_{k},x_{k+1}]\,, {}\\ & & \text{for all }k = 0,\ldots,m - 1\,\}\;. {}\\ \end{array}$$

    That is to say, for each polygon v on the union of \(\mathcal{D}_{0},\ldots,\mathcal{D}_{m-1}\) there are unique coefficients \(c_{0},\ldots,c_{m}\) such that

    $$\displaystyle{v =\sum _{ i=0}^{m}c_{ i}\varphi _{i}\;.}$$
  (b)

    On any \(\mathcal{D}_{k}\) only \(\varphi _{k}\) and \(\varphi _{k+1}\) are nonzero. Hence

    $$\displaystyle{\varphi _{i}\varphi _{j} = 0\ \text{ for }\ \vert i - j\vert> 1\;,}$$

    which explains (5.10).

  (c)

    A simple approximation of the integral \(\int _{x_{0}}^{x_{m}}f\varphi _{j}\,\mathrm{d}x\) can be calculated as follows: Substitute f by the interpolating polygon

    $$\displaystyle{f_{\mathrm{p}}:=\sum _{ i=0}^{m}f_{ i}\varphi _{i}\quad \text{, where}\quad f_{i}:= f(x_{i})\;,}$$

    and obtain for each j the approximating integral

    $$\displaystyle{I_{j}:=\int _{ x_{0}}^{x_{m} }f_{\mathrm{p}}\varphi _{j}\,\mathrm{d}x =\int _{ x_{0}}^{x_{m} }\sum _{i=0}^{m}f_{ i}\varphi _{i}\varphi _{j}\,\mathrm{d}x =\sum _{ i=0}^{m}f_{ i}\mathop{\underbrace{ \int _{x_{0}}^{x_{m}}\varphi _{ i}\varphi _{j}\,\mathrm{d}x}}\limits _{=:b_{ij}}\,.}$$

    The \(b_{ij}\) constitute a symmetric matrix B, and the \(f_{i}\) a vector \(\bar{f}\). If we arrange all integrals \(I_{j}\) (\(0 \leq j \leq m\)) into a vector, then all integrals can be written in a compact way in vector notation as

    $$\displaystyle{B\bar{f}\;.}$$

    This will approximate the vector r in (5.9).

  (d)

    The “large” \((m + 1)\times (m+1)\)-matrix \(B:= (b_{ij})\) can be set up \(\mathcal{D}_{k}\)-elementwise by (2 × 2)-matrices (discussed below in Sect. 5.2.2). The (2 × 2)-matrices contain those integrals that integrate only over a single subdomain \(\mathcal{D}_{k}\). For each \(\mathcal{D}_{k}\) in our one-dimensional setting exactly the four integrals \(\int \varphi _{i}\varphi _{j}\,\mathrm{d}x\) for \(i,j \in \{ k,k + 1\}\) are nonzero. They can be arranged into a (2 × 2)-matrix

    $$\displaystyle{\int _{x_{k}}^{x_{k+1} }\left (\begin{array}{*{10}c} \varphi _{k}^{2} & \varphi _{k}\varphi _{k+1} \\ \varphi _{k+1}\varphi _{k}& \varphi _{k+1}^{2}\\ \end{array} \right )\,\mathrm{d}x\,.}$$

    (The integral over a matrix is understood elementwise.) These are the integrals on \(\mathcal{D}_{k}\), where the integrand is a product of the factors

    $$\displaystyle{{ x_{k+1} - x \over x_{k+1} - x_{k}}\ \text{ and }\ { x - x_{k} \over x_{k+1} - x_{k}}\,.}$$

    The four numbers

    $$\displaystyle{{ 1 \over (x_{k+1} - x_{k})^{2}}\int _{x_{k}}^{x_{k+1} }\left (\begin{array}{*{10}c} (x_{k+1} - x)^{2} & (x_{k+1} - x)(x - x_{k}) \\ (x - x_{k})(x_{k+1} - x)& (x - x_{k})^{2}\\ \end{array} \right )\,\mathrm{d}x}$$

    result. With \(h_{k}:= x_{k+1} - x_{k}\), integration yields the element-mass matrix (Exercise 5.3)

    $$\displaystyle{{1 \over 6}h_{k}\left (\begin{array}{*{10}c} 2&1\\ 1 &2\\ \end{array} \right )\,.}$$
  (e)

    Analogously, integrating \(\varphi _{i}^{{\prime}}\varphi _{j}^{{\prime}}\) yields

    $$\displaystyle\begin{array}{rcl} & & \int _{x_{k}}^{x_{k+1} }\left (\begin{array}{*{10}c} \varphi _{k}^{{\prime}2} & \varphi _{k}^{{\prime}}\varphi _{k+1}^{{\prime}} \\ \varphi _{k+1}^{{\prime}}\varphi _{k}^{{\prime}}& \varphi _{k+1}^{{\prime}2}\\ \end{array} \right )\,\mathrm{d}x {}\\ & & ={ 1 \over h_{k}^{2}}\int _{x_{k}}^{x_{k+1} }\left (\begin{array}{*{10}c} (-1)^{2} & (-1)1 \\ 1(-1)& 1^{2}\\ \end{array} \right )\,\mathrm{d}x ={ 1 \over h_{k}}\left (\begin{array}{*{10}c} 1 &-1\\ -1 & 1\\ \end{array} \right )\,. {}\\ & {}\\ \end{array}$$

    These matrices are called element-stiffness matrices. They are used to set up the matrix A.
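The closed-form element matrices of (d) and (e) can be checked by direct quadrature on one subinterval; the chosen interval [1.0, 1.7] is an arbitrary test case.

```python
import numpy as np

def trap(y, x):                     # simple trapezoidal quadrature
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

xk, xk1 = 1.0, 1.7                  # one subinterval D_k (arbitrary choice)
hk = xk1 - xk
x = np.linspace(xk, xk1, 20001)

up = (xk1 - x) / hk                 # phi_k restricted to D_k
down = (x - xk) / hk                # phi_{k+1} restricted to D_k

# element-mass matrix: integrals of products of the two linear factors
mass = np.array([[trap(up * up, x),   trap(up * down, x)],
                 [trap(down * up, x), trap(down * down, x)]])
print(np.allclose(mass, hk / 6.0 * np.array([[2.0, 1.0], [1.0, 2.0]]),
                  atol=1e-8))       # matches (h_k/6) [[2,1],[1,2]]

# element-stiffness matrix: the derivatives are the constants -1/hk, +1/hk
d = np.array([-1.0 / hk, 1.0 / hk])
stiff = hk * np.outer(d, d)         # integrating a constant over D_k
print(np.allclose(stiff, (1.0 / hk) * np.array([[1.0, -1.0], [-1.0, 1.0]])))
```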

5.2.2 Assembling

The next step is to assemble the matrices A and B. It might be tempting to organize this task as follows: run a double loop over all basis indices i, j (N node indices) and check for each pair (i, j) on which \(\mathcal{D}_{k}\) the integral

$$\displaystyle{\int _{\mathcal{D}_{k}}\varphi _{i}\varphi _{j}}$$

is nonzero. Such a double loop has a complexity of \(O(N^{2}m)\). This is cumbersome compared to the alternative of running a single loop over the subdomain index k and benefiting from all relevant integrals on \(\mathcal{D}_{k}\), which were precalculated above (Fig. 5.8).

Fig. 5.8

Assembling in the one-dimensional setting

To this end, split the integrals

$$\displaystyle{\int _{x_{0}}^{x_{m} } =\sum _{ k=0}^{m-1}\int _{ \mathcal{D}_{k}}}$$

to construct the (m + 1) × (m + 1)-matrices A = (a ij ) and B = (b ij ) additively out of the small element matrices. For the case of the one-dimensional hat functions with subintervals

$$\displaystyle{\mathcal{D}_{k} =\{\, x\ \mid \ x_{k} \leq x \leq x_{k+1}\,\}}$$

the element matrices are (2 × 2), see above. In this case only those integrals of \(\varphi _{i}\varphi _{j}\) and \(\varphi _{i}^{{\prime}}\varphi _{j}^{{\prime}}\) are nonzero for which \(i,j \in \mathcal{I}_{k}\), where

$$\displaystyle{ i,j \in \mathcal{I}_{k}:=\{ k,k + 1\}\,. }$$
(5.11)

\(\mathcal{I}_{k}\) is the set of indices of those products of basis functions that are nonzero on \(\mathcal{D}_{k}\). The assembling algorithm performs a loop over the subdomain index \(k = 0,1,\ldots,m - 1\) and distributes the (2 × 2)-element matrices additively to the positions \(i,j \in \mathcal{I}_{k}\). Before the assembling is started, the matrices A and B must be initialized with zeros. For \(k = 0,\ldots,m - 1\) one obtains for A the \((m + 1)\times (m+1)\)-matrix

$$\displaystyle{ A = \left (\begin{array}{*{10}c} { 1 \over h_{0}} & -{ 1 \over h_{0}} & & & \\ -{ 1 \over h_{0}} &{ 1 \over h_{0}} +{ 1 \over h_{1}} & -{ 1 \over h_{1}} & & \\ & -{ 1 \over h_{1}} &{ 1 \over h_{1}} +{ 1 \over h_{2}} & -{ 1 \over h_{2}} & \\ & & -{ 1 \over h_{2}} & \ddots & \ddots \\ & & & \ddots &\\ \end{array} \right )\,. }$$
(5.12)

The matrix B is assembled in an analogous way. In the one-dimensional situation the matrices are tridiagonal. For an equidistant grid with h = h k the matrix A specializes to

$$\displaystyle{ A ={ 1 \over h}\left (\begin{array}{*{10}c} 1 &-1& & & & 0\\ -1 & 2 &-1 & & & \\ &-1& 2 &\ddots&\\ & & \ddots &\ddots& \ddots\\ & & &\ddots & 2 &-1 \\ 0 & & & &-1& 1\\ \end{array} \right ) }$$
(5.13)

and B to

$$\displaystyle{ B ={ h \over 6}\left (\begin{array}{*{10}c} 2&1& & & &0\\ 1 &4 &1 & & & \\ &1&4&\ddots&\\ & & \ddots &\ddots& \ddots\\ & & &\ddots &4&1 \\ 0& & & &1&2\\ \end{array} \right )\,. }$$
(5.14)
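The assembling loop described above can be sketched as follows; `assemble` is a hypothetical helper name. For an equidistant grid the result reproduces the patterns (5.13)/(5.14).

```python
import numpy as np

# Assemble A and B by a single loop over the subintervals D_k, adding each
# (2x2) element matrix at the index set I_k = {k, k+1}; cf. (5.11)-(5.14).
def assemble(nodes):
    m = len(nodes) - 1
    A = np.zeros((m + 1, m + 1))             # initialize with zeros
    B = np.zeros((m + 1, m + 1))
    for k in range(m):                       # one loop over subdomains
        h = nodes[k + 1] - nodes[k]
        Ak = (1.0 / h) * np.array([[1.0, -1.0], [-1.0, 1.0]])   # stiffness
        Bk = (h / 6.0) * np.array([[2.0, 1.0], [1.0, 2.0]])     # mass
        idx = np.ix_([k, k + 1], [k, k + 1])                    # positions I_k
        A[idx] += Ak
        B[idx] += Bk
    return A, B

# equidistant check against (5.13)/(5.14), with m = 4 and h = 0.25:
A, B = assemble(np.linspace(0.0, 1.0, 5))
print(A)       # tridiagonal, rows (1,-1), (-1,2,-1), ..., scaled by 1/h
print(B)       # tridiagonal, rows (2,1), (1,4,1), ..., scaled by h/6
```

The same routine handles a nonuniform grid unchanged, since each element matrix carries its own \(h_{k}\).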

5.2.3 A Simple Application

In order to demonstrate the procedure, let us consider the simple time-independent (“stationary”) model boundary-value problem

$$\displaystyle{ Lu:= -u'' = f\ \text{ with }\ u(x_{0}) = u(x_{m}) = 0\,. }$$
(5.15)

Substituting \(w:=\sum _{i=0}^{m}c_{i}\varphi _{i}\) into the differential equation, in view of (5.8), leads to

$$\displaystyle{\sum _{i=0}^{m}c_{ i}\int _{x_{0}}^{x_{m} }L\varphi _{i}\,\varphi _{j}\;\mathrm{d}x =\int _{ x_{0}}^{x_{m} }f\varphi _{j}\;\mathrm{d}x\,.}$$

This is the result of the Ritz–Galerkin approach. Next we apply integration by parts on the left-hand side, and invoke Property 5.2(c) on the right-hand side. The resulting system of equations is

$$\displaystyle{ \sum _{i=0}^{m}c_{ i}\mathop{\underbrace{ \int _{x_{0}}^{x_{m}}\varphi _{ i}^{{\prime}}\varphi _{ j}^{{\prime}}\;\mathrm{d}x}}\limits _{ a_{ij}} =\sum _{ i=0}^{m}f_{ i}\mathop{\underbrace{ \int _{x_{0}}^{x_{m}}\varphi _{ i}\varphi _{j}\;\mathrm{d}x}}\limits _{b_{ij}},\quad j = 0,1,\ldots,m\,. }$$
(5.16)

This system is preliminary because the homogeneous boundary conditions u(x 0) = u(x m ) = 0 are not yet taken into account.

At this stage, the preliminary system of Eqs. (5.16) can be written as

$$\displaystyle{ Ac = B\bar{f}\,. }$$
(5.17)

It is easy to see that the matrix A from (5.13) is singular, because

$$\displaystyle{A\,(1,1,\ldots,1)^{\mathrm{tr}} = 0\,.}$$

The singularity reflects the fact that the system (5.17) does not have a unique solution. This is consistent with the differential equation − u″ = f(x): if u(x) is a solution, then so is u(x) + α for arbitrary α. Unique solvability is attained by satisfying the boundary conditions; a solution u of − u″ = f must be fixed by at least one essential boundary condition. For our example (5.15) we know in view of \(u(x_{0}) = u(x_{m}) = 0\) the coefficients \(c_{0} = c_{m} = 0\). This information can be inserted into the system of equations in such a way that the matrix A changes to a nonsingular matrix without losing symmetry. To this end, cancel the first and the last of the m + 1 equations in (5.17), and make use of \(c_{0} = c_{m} = 0\). Then the inner part of size (m − 1) × (m − 1) of A remains. The matrix B shrinks to the rectangular size (m − 1) × (m + 1). Finally, for the special case of an equidistant grid, the system of equations is

$$\displaystyle{ \begin{array}{rcl} &&\left (\begin{array}{*{10}c} 2 &-1&& & 0\\ -1 & 2 &\ddots & & \\ & \ddots &\ddots& \ddots &\\ & &\ddots & 2&-1 \\ 0 & &&-1& 2\\ \end{array} \right )\left (\begin{array}{*{10}c} c_{1} \\ c_{2}\\ \vdots \\ c_{m-2} \\ c_{m-1}\\ \end{array} \right ) = \\ &&{h^{2} \over 6} \left (\begin{array}{*{10}c} 1&4&1& & & &0\\ &1 &4 &1 & & &\\ & & \ddots & \ddots & \ddots & & \\ & & &1&4&1&\\ 0 & & & &1 &4&1\\ \end{array} \right )\left (\begin{array}{*{10}c} \bar{f}_{0} \\ \bar{f}_{1}\\ \vdots \\ \bar{f}_{m-1} \\ \bar{f}_{m}\\ \end{array} \right )\,. \end{array} }$$
(5.18)

In (5.18) we have used an equidistant grid for the sake of a lucid exposition. Our main focus is the nonequidistant version, which is implemented just as easily. In case nonhomogeneous boundary conditions are prescribed, appropriate values of \(c_{0}\) or \(c_{m}\) are predefined. The importance of finite-element methods in structural engineering has led to calling the global matrix A the stiffness matrix; B is called the mass matrix.
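Putting the pieces together, here is a sketch of solving (5.15) via the system (5.18) for the assumed right-hand side f ≡ 1 (exact solution x(1 − x)∕2). For this particular f the nodal values of the finite-element solution turn out to be exact up to roundoff.

```python
import numpy as np

m = 50
h = 1.0 / m
nodes = np.linspace(0.0, 1.0, m + 1)

# inner (m-1) x (m-1) part of A after inserting c_0 = c_m = 0, cf. (5.18):
A_in = (1.0 / h) * (2.0 * np.eye(m - 1)
                    - np.eye(m - 1, k=1) - np.eye(m - 1, k=-1))

# full mass matrix B, assembled from the (2x2) element matrices:
B = np.zeros((m + 1, m + 1))
for k in range(m):
    B[np.ix_([k, k + 1], [k, k + 1])] += (h / 6.0) * np.array([[2.0, 1.0],
                                                               [1.0, 2.0]])

fbar = np.ones(m + 1)                    # assumed test case: f == 1 at the nodes
rhs = (B @ fbar)[1:m]                    # keep the equations j = 1, ..., m-1

c = np.linalg.solve(A_in, rhs)           # coefficients c_1, ..., c_{m-1}
u_exact = 0.5 * nodes[1:m] * (1.0 - nodes[1:m])
print(np.max(np.abs(c - u_exact)))       # exact at the nodes up to roundoff
```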

5.3 Application to Standard Options

Finite elements are especially advantageous in higher-dimensional spaces (several underlyings), but the approach also works for the one-dimensional case of standard options. This is the theme of this section. In contrast to the previous section, time must now be included.

5.3.1 European Options

We know that the valuation of single-asset European options with vanilla payoff can make use of the Black–Scholes formula. But for the sake of exposition, and for non-vanilla payoffs, let us briefly sketch a finite-element approach. Here we apply the FEM approach to the transformed version \(y_{\tau } = y_{xx}\) of the Black–Scholes equation with constant parameters. In view of the general basis representation in (5.5) one may think of starting from \(w =\sum w_{i}\varphi _{i}(x,\tau )\) with constant coefficients \(w_{i}\). This would require two-dimensional basis functions. (We shall come back to such functions in Sect. 5.4.) To make use of one-dimensional hat functions, apply a separation ansatz in the form \(\sum w_{i}(\tau )\varphi _{i}(x)\) with functions \(w_{i}(\tau )\). As a consequence of this simple approach, the same x-grid is applied for all τ, which results in a rectangular grid in the (x, τ)-plane. Dirichlet boundary conditions

$$\displaystyle{y(x_{\mathrm{min}},\tau ) =\alpha (\tau ),\ y(x_{\mathrm{max}},\tau ) =\beta (\tau )}$$

mean that in view of the shape of \(\varphi _{0},\varphi _{m}\) (Definition 5.1, Fig. 5.7) the values \(w_{0} =\alpha\) or \(w_{m} =\beta\) would be known. It is practical to separate the known terms and restrict the sum to the terms with unknown weights \(w_{i}\). This can be managed by introducing a special function \(\varphi _{\mathrm{b}}\) that compensates for the Dirichlet boundary conditions on y. The function \(\varphi _{\mathrm{b}}(x,\tau )\) is not a basis function, and is constructed in advance. For example,

$$\displaystyle{\varphi _{\mathrm{b}}(x,\tau ):= (\beta (\tau ) -\alpha (\tau ))\,{ x - x_{\mathrm{min}} \over x_{\mathrm{max}} - x_{\mathrm{min}}} +\alpha (\tau )}$$

does the job for the above boundary conditions. So \(\varphi _{\mathrm{b}}\) can be considered known, and the sum \(\sum w_{i}\varphi _{i}\) need not reflect any nonzero Dirichlet boundary conditions on y. Then the final ansatz is

$$\displaystyle{ \sum \limits _{i}w_{i}(\tau )\varphi _{i}(x) +\varphi _{\mathrm{b}}(x,\tau )\,, }$$
(5.19)

and the index i counts those nodes \(x_{i}\) for which no boundary conditions of the above type are prescribed; in case two Dirichlet boundary conditions are given, \(1 \leq i \leq m - 1\). The basis functions \(\varphi _{1},\ldots,\varphi _{N}\) are chosen to be the hat functions, which incorporate the discretization of the x-axis. Hence N = m − 1, \(x_{0}\) corresponds to \(x_{\mathrm{min}}\), and \(x_{m}\) to \(x_{\mathrm{max}}\). The functions \(w_{1},\ldots,w_{m-1}\) are unknown, and \(w_{0} = w_{m} = 0\).

Calculating derivatives of (5.19) and substituting into y τ = y xx leads to the Ritz–Galerkin approach

$$\displaystyle{\int \limits _{x_{0}}^{x_{m} }\left [\sum \limits _{i=1}^{m-1}\dot{w}_{ i}\varphi _{i} +\dot{\varphi } _{\mathrm{b}}\right ]\varphi _{j}\,\mathrm{d}x =\int \limits _{ x_{0}}^{x_{m} }\left [\sum \limits _{i=1}^{m-1}w_{ i}\varphi _{i}^{{\prime\prime}} +\varphi _{ \mathrm{ b}}^{{\prime\prime}}\right ]\varphi _{ j}\,\mathrm{d}x}$$

for \(j = 1,\ldots,m - 1\). The overdot represents differentiation with respect to τ, and the prime differentiation with respect to x. Arranging the terms that involve derivatives of \(\varphi _{\mathrm{b}}\) into vectors a(τ), b(τ),

$$\displaystyle{a(\tau ):= \left (\begin{array}{*{10}c} \int \varphi _{\mathrm{b}}^{{\prime\prime}}(x,\tau )\,\varphi _{1}(x)\,\mathrm{d}x\\ \vdots \\ \int \varphi _{\mathrm{b}}^{{\prime\prime}}(x,\tau )\,\varphi _{m-1}(x)\,\mathrm{d}x\\ \end{array} \right )\,,\quad b(\tau ):= \left (\begin{array}{*{10}c} \int \dot{\varphi }_{\mathrm{b}}(x,\tau )\,\varphi _{1}(x)\,\mathrm{d}x\\ \vdots \\ \int \dot{\varphi }_{\mathrm{b}}(x,\tau )\,\varphi _{m-1}(x)\,\mathrm{d}x\\ \end{array} \right )\,,}$$

and using the matrices A, B as in (5.13)/(5.14), we arrive after integration by parts at

$$\displaystyle{ B\dot{w} + b = -Aw - a\,. }$$
(5.20)

Note that for the specific \(\varphi _{\mathrm{b}}\) from above, \(\varphi _{\mathrm{b}}'' = 0\) and a = 0. For vanilla options, α and β can be drawn from (4.28), and b can be set up analytically; a and b can be considered as known. This completes the semidiscretization. Time τ is still continuous, and (5.20) defines the unknown vector function \(w(\tau ):= (w_{1}(\tau ),\ldots,w_{m-1}(\tau ))^{\mathrm{tr}}\) as the solution of a system of ordinary differential equations. This is a method-of-lines approach. The lines are defined by \(x = x_{i}\) for \(1 \leq i \leq m - 1\), and the approximations along the lines are given by \(w_{i}(\tau )\). Initial conditions for τ = 0 are derived from (5.19). Assume the initial condition from the payoff as y(x, 0) = γ(x); then

$$\displaystyle{\sum _{i=1}^{N}w_{ i}(0)\varphi _{i}(x) +\varphi _{\mathrm{b}}(x,0) =\gamma (x)\,.}$$

For vanilla payoff, γ is given by (4.5)/(4.6). Specifically for \(x = x_{j}\) the sum reduces to \(w_{j}(0)\cdot 1\), leading to

$$\displaystyle{w_{j}(0) =\gamma (x_{j}) -\varphi _{\mathrm{b}}(x_{j},0)\,.}$$

To complete the discretization, time τ must be discretized as well. Standard software for ODEs can be applied to (5.20), in particular codes for stiff systems. For discretization with difference quotients, consult Sect. 4.2.1. For example, apply the ODE trapezoidal rule as in (4.20) to \(\dot{w}\) in (5.20). We leave the derivation of the resulting Crank–Nicolson-type discretization as an exercise to the reader. With the usual notation of the vector w^(ν) approximating w(τ_ν), the result can be written

$$\displaystyle{ \begin{array}{rcl} (B +{ \varDelta \tau \over 2}A)\,w^{(\nu +1)} & =&(B -{ \varDelta \tau \over 2}A)\,w^{(\nu )} \\ & & -{ \varDelta \tau \over 2}\,(a^{(\nu )} + a^{(\nu +1)} + b^{(\nu )} + b^{(\nu +1)})\,. \end{array} }$$
(5.21)

The structure of (5.21) strongly resembles that of the finite-difference approach (4.24). This similarity suggests that the order of convergence is the same, because for the finite-element A and B we have (compare (5.13)/(5.14))

$$\displaystyle{A = O\left ({1 \over \varDelta x}\right )\,\,,\quad B = O(\varDelta x)\,.}$$

The separation of the variables x and τ in (5.19) allows us to investigate the orders of the discretizations separately. In Δτ, the order O(Δτ²) of the Crank–Nicolson-type approach (5.21) is clear from the ODE trapezoidal rule. It remains to derive the order of convergence with respect to the discretization in x. Because of the separation of variables, it suffices to derive the convergence for a one-dimensional model problem. This will be done in Sect. 5.5.

5.3.2 Variational Form of the Obstacle Problem

To warm up for the discussion of the American option case, let us return to the simple obstacle problem of Sect. 4.5.5 with the obstacle function g(x), or g(x, τ). This problem can be formulated as a variational inequality. The function u solving the obstacle problem can be characterized by comparing it to functions v out of a set \(\mathcal{K}\) of competing functions

$$\displaystyle\begin{array}{rcl} \mathcal{K}:=\{& & v \in \mathcal{C}^{0}[-1,1]\mid \ v(-1) = v(1) = 0\,, {}\\ & & v(x) \geq g(x)\ \text{ for } - 1 \leq x \leq 1,\ v\text{ piecewise } \in \mathcal{C}^{1}\,\}\,. {}\\ \end{array}$$

The requirements on u imply \(u \in \mathcal{K}\). For \(v \in \mathcal{K}\) we have vg ≥ 0 and in view of − u″ ≥ 0 also − u″(vg) ≥ 0. Hence for all \(v \in \mathcal{K}\) the inequality

$$\displaystyle{\int _{-1}^{1} - u''(v - g)\,\mathrm{d}x \geq 0}$$

must hold. By the LCP formulation (4.39) the integral

$$\displaystyle{\int _{-1}^{1} - u''(u - g)\,\mathrm{d}x = 0}$$

vanishes. Subtracting yields

$$\displaystyle{\int _{-1}^{1} - u''(v - u)\,\mathrm{d}x \geq 0\ \text{ for any }v \in \mathcal{K}\,.}$$

The obstacle function g does not occur explicitly in this formulation; the obstacle is implicitly defined in \(\mathcal{K}\). Integration by parts leads to

$$\displaystyle{[\mathop{\underbrace{-u'(v - u)}}\limits _{=0}]_{-1}^{1} +\int _{ -1}^{1}u'(v - u)'\,\mathrm{d}x \geq 0\,.}$$

The integral-free term vanishes because of u(−1) = v(−1),  u(1) = v(1). In summary, we have derived the statement:

$$\displaystyle{ \begin{array}{rcl} \text{If }u\text{ solves the}&&\text{obstacle problem (4.39), then} \\ &&\qquad \int _{-1}^{1}u'(v - u)'\,\mathrm{d}x \geq 0\quad \text{ for all }v \in \mathcal{K}\,.\end{array} }$$
(5.22)

Since v varies in the set \(\mathcal{K}\) of competing functions, an inequality such as (5.22) is called a variational inequality. The characterization of u by (5.22) can be used to construct an approximation w: instead of u, find a \(w \in \mathcal{K}\) such that the inequality (5.22) is satisfied for all \(v \in \mathcal{K}\),

$$\displaystyle{\int \limits _{-1}^{1}w'(v - w)'\,\mathrm{d}x \geq 0\quad \text{for all }v \in \mathcal{K}\,.}$$

The characterization (5.22) is related to a minimum problem, because the integral vanishes for v = u.

5.3.3 Variational Form of an American Option

Analogously to the simple obstacle problem, the problem of valuing American options can be formulated as a variational problem; compare Problem 4.7. The class of competing functions must be redefined as

$$\displaystyle{ \begin{array}{rcl} \mathcal{K}:=\{\,&&v \in \mathcal{C}^{0}[x_{\min },x_{\max }]\;\mid \;{\partial v \over \partial x}\ \text{ piecewise }\mathcal{C}^{0}\,, \\ &&v(x,\tau ) \geq g(x,\tau )\ \text{ for all }x,\tau \,,\ v(x,0) = g(x,0)\,, \\ &&v(x_{\max },\tau ) = g(x_{\max },\tau ),\ v(x_{\min },\tau ) = g(x_{\min },\tau )\,\}\,. \end{array} }$$
(5.23)

In the following, \(v \in \mathcal{K}\) with \(\mathcal{K}\) from (5.23). Let y denote the exact solution of Problem 4.7. As the solution of the partial differential inequality, y is \(\mathcal{C}^{2}\)-smooth on the continuation region, and \(y \in \mathcal{K}\). From

$$\displaystyle{v \geq g,\quad {\partial y \over \partial \tau } -{ \partial ^{2}y \over \partial x^{2}} \geq 0}$$

we deduce

$$\displaystyle{\int _{x_{\min }}^{x_{\max }}\left ({\partial y \over \partial \tau } -{ \partial ^{2}y \over \partial x^{2}}\right )(v - g)\;\mathrm{d}x \geq 0\,.}$$

Invoking the complementarity

$$\displaystyle{\int _{x_{\min }}^{x_{\max }}\left ({\partial y \over \partial \tau } -{ \partial ^{2}y \over \partial x^{2}}\right )(\,y - g)\,\mathrm{d}x = 0}$$

and subtraction gives

$$\displaystyle{\int _{x_{\min }}^{x_{\max }}\left ({\partial y \over \partial \tau } -{ \partial ^{2}y \over \partial x^{2}}\right )(v - y)\,\mathrm{d}x \geq 0\,.}$$

Integration by parts leads to the inequality

$$\displaystyle{\int _{x_{\min }}^{x_{\max }}\left ({\partial y \over \partial \tau } (v - y) +{ \partial y \over \partial x}\left ({\partial v \over \partial x} -{ \partial y \over \partial x}\right )\right )\,\mathrm{d}x -{ \partial y \over \partial x}(v - y)\Bigg\vert _{x_{\min }}^{x_{\max }} \geq 0\,.}$$

The nonintegral term vanishes, because at the boundary points x_min, x_max we have v = g and y = g, hence v = y. The final result is

$$\displaystyle{ I(\,y;v):=\int _{ x_{\min }}^{x_{\max }}\left ({\partial y \over \partial \tau } \cdot (v - y) +{ \partial y \over \partial x}\left ({\partial v \over \partial x} -{ \partial y \over \partial x}\right )\right )\,\mathrm{d}x \geq 0\quad \text{for all }v \in \mathcal{K}\,. }$$
(5.24)

The exact y is characterized by the fact that the inequality (5.24) holds for all comparison functions \(v \in \mathcal{K}\). For the special choice v = y the integral takes its minimal value,

$$\displaystyle{\min _{v\in \mathcal{K}}I(\,y;v) = I(\,y;y) = 0\,.}$$

A more general question is whether the inequality (5.24) holds for a \(\widehat{y} \in \mathcal{K}\) that is not \(\mathcal{C}^{2}\)-smooth on the continuation region. The aim is:

Problem 5.3 (Weak Version)

Construct a \(\widehat{y} \in \mathcal{K}\) such that \(I(\widehat{y};v) \geq 0\) for all \(v \in \mathcal{K}\).

This formulation of our problem is called the weak version, because it does not require \(\widehat{y} \in \mathcal{C}^{2}\). Solutions \(\widehat{y}\) of Problem 5.3 that are globally continuous but only piecewise \(\mathcal{C}^{1}\) are called weak solutions. The original partial differential equation requires \(y \in \mathcal{C}^{2}\) and hence more smoothness. Such \(\mathcal{C}^{2}\)-solutions are called strong solutions or classical solutions ( Sect. 5.5).

5.3.4 Implementation of Finite Elements

A discretized version of the weak problem is obtained by replacing the space \(\mathcal{K}\) by a finite-dimensional subspace \(\widehat{\mathcal{K}}\), which is spanned by a finite number of basis functions. That is, we search for a \(\widehat{y} \in \widehat{\mathcal{K}}\) such that

$$\displaystyle{I(\widehat{y};\widehat{v}) \geq 0\quad \text{ for all }\widehat{v} \in \widehat{\mathcal{K}}\,,}$$

where I( y; v) is defined in (5.24). This sets the arena for finite element methods.

As a first step toward solving the minimum problem approximately, assume as in Sect. 5.3.1 separation approximations for \(\widehat{y}\) and \(\widehat{v}\) of the similar forms

$$\displaystyle{ \begin{array}{rcl} &&\widehat{y} =\sum _{i}w_{i}(\tau )\varphi _{i}(x)\,, \\ &&\widehat{v} =\sum _{i}v_{i}(\tau )\varphi _{i}(x)\,. \end{array} }$$
(5.25)

Summation is over a finite number of terms, which represents \(\widehat{y},\,\widehat{v} \in \widehat{\mathcal{K}}\). The reduced smoothness of these expressions matches the requirements of \(\mathcal{K}\) from (5.23); time dependence is incorporated in the coefficient functions w_i and v_i. Since the basis functions φ_i represent the x_i-grid, we again perform a semidiscretization. Plugging the ansatz (5.25) into \(I(\widehat{y};\widehat{v})\) from (5.24) gives

$$\displaystyle\begin{array}{rcl} & & \int \left \{\left (\sum _{i}{\,\mathrm{d}w_{i} \over \,\mathrm{d}\tau } \varphi _{i}\right )\left (\sum _{j}(v_{j} - w_{j})\varphi _{j}\right )+\right. {}\\ & & \qquad \left.\quad \left (\sum _{i}w_{i}\varphi _{i}^{{\prime}}\right )\left (\sum _{ j}(v_{j} - w_{j})\varphi _{j}^{{\prime}}\right )\right \}\,\mathrm{d}x {}\\ & =& \sum _{i}\sum _{j}{\mathrm{d}w_{i} \over \mathrm{d}\tau } (v_{j} - w_{j})\int \varphi _{i}\varphi _{j}\,\mathrm{d}x +\sum _{i}\sum _{j}w_{i}(v_{j} - w_{j})\int \varphi _{i}^{{\prime}}\varphi _{ j}^{{\prime}}\,\mathrm{d}x \geq 0\,. {}\\ \end{array}$$

Translated into vector notation for the coefficient functions w_i(τ), v_i(τ), this is equivalent to

$$\displaystyle{\left ({\mathrm{d}w \over \mathrm{d}\tau } \right )^{\mbox{ $tr$}}B(v - w) + w^{\mbox{ $tr$}}A(v - w) \geq 0}$$

or

$$\displaystyle{(v - w)^{\mbox{ $tr$}}\left (B{\mathrm{d}w \over \mathrm{d}\tau } + Aw\right ) \geq 0\,.}$$

This is the (semi-)discretized weak version of \(I(\widehat{y};\widehat{v}) \geq 0\). The matrices A and B are defined via the assembling described above; for equidistant steps the special versions in (5.13), (5.14) arise.

As a second step, the time τ is discretized as well. To this end let us define the vectors

$$\displaystyle{w^{(\nu )}:= w(\tau _{\nu }),\quad v^{(\nu )}:= v(\tau _{\nu })\,.}$$

Upon substituting, and θ-averaging the Aw term as in Sect. 4.6.1, we arrive at the inequalities

$$\displaystyle{ \left (v^{(\nu +1)} - w^{(\nu +1)}\right )^{\mbox{ $tr$}}\left (B{ 1 \over \varDelta \tau } (w^{(\nu +1)} - w^{(\nu )}) +\theta Aw^{(\nu +1)} + (1-\theta )Aw^{(\nu )}\right ) \geq 0 }$$
(5.26)

for all ν. For θ = 1∕2 this is a Crank–Nicolson-type method. Rearranging (5.26) leads to

$$\displaystyle{\left (v^{(\nu +1)} - w^{(\nu +1)}\right )^{\mbox{ $tr$}}\left (\left (B +\varDelta \tau \,\theta A\right )w^{(\nu +1)} + \left (\varDelta \tau (1-\theta )A - B\right )w^{(\nu )}\right ) \geq 0\,.}$$

With the abbreviations

$$\displaystyle{ \begin{array}{rcl} r:& =&(B -\varDelta \tau (1-\theta )A)\,w^{(\nu )}\,, \\ C:& =&B +\varDelta \tau \,\theta A\,, \end{array} }$$
(5.27)

the inequality can be rewritten as

$$\displaystyle{ \left (v^{(\nu +1)} - w^{(\nu +1)}\right )^{\mbox{ $tr$}}\left (Cw^{(\nu +1)} - r\right ) \geq 0\,. }$$
(5.28)

This is the fully discretized version of \(I(\widehat{y};\widehat{v}) \geq 0\).

5.3.4.1 Side Conditions

To match the requirements of \(\mathcal{K}\), the inequalities \(\widehat{y} \geq g\) and \(\widehat{v} \geq g\) must hold. \(\widehat{y}(x,\tau ) \geq g(x,\tau )\) amounts to

$$\displaystyle{\sum w_{i}(\tau )\varphi _{i}(x) \geq g(x,\tau )\,.}$$

For hat functions φ_i (with φ_i(x_i) = 1 and φ_i(x_j) = 0 for j ≠ i) and x = x_j, this implies w_j(τ) ≥ g(x_j, τ). With τ = τ_ν we have

$$\displaystyle{w^{(\nu )} \geq g^{(\nu )};\quad \text{ analogously }v^{(\nu )} \geq g^{(\nu )}\,.}$$

For each time level ν we must find a solution that satisfies both the inequality (5.26)–(5.28) and the side condition

$$\displaystyle{w^{(\nu +1)} \geq g^{(\nu +1)}\ \text{ for all }\ v^{(\nu +1)} \geq g^{(\nu +1)}\,.}$$

In summary, the algorithm is

Algorithm 5.4 (Finite Elements for American Standard Options)

$$\displaystyle\begin{array}{rcl} & & \text{Choose }\theta \ (\theta = 1/2).\text{ Calculate }w^{(0)},\text{ and }C\text{ from (5.27)}. {}\\ & & \mathit{For\ }\nu = 1,\ldots,\nu _{\max }: {}\\ & & \quad \text{Calculate }r = (B -\varDelta \tau (1-\theta )A)w^{(\nu -1)}\text{ and }g = g^{(\nu )}\,. {}\\ & & \quad \text{Construct a }w\text{ such that for all }v \geq g {}\\ & & \qquad (v - w)^{\mbox{ $tr$}}(Cw - r) \geq 0,\quad w \geq g. {}\\ & & \quad \text{Set }w^{(\nu )}:= w\,. {}\\ \end{array}$$

This algorithm generates a discretized solution of the weak Problem 5.3: the vectors w define \(\widehat{y} \in \widehat{\mathcal{K}}\) via (5.25); \(\widehat{v}\) is not needed explicitly. Let us emphasize again the main step (FE), the kernel of this algorithm and its main labor: construct w such that

$$\displaystyle{ \begin{array}{rcl} \mathbf{(FE)}\quad &&\text{for all }v \geq g \\ &&(v - w)^{\mbox{ $tr$}}(Cw - r) \geq 0\,,\quad w \geq g\,. \end{array} }$$
(5.29)

This task (FE) can be reformulated into a task we already solved in Sect. 4.6. To this end recall the finite-difference equation (4.44), replacing A by C, and b by r. There the following holds for w:

$$\displaystyle{ \begin{array}{rcl} \mathbf{(FD)}\quad &&Cw - r \geq 0\,,\quad w \geq g\,, \\ &&(Cw - r)^{\mbox{ $tr$}}(w - g) = 0\,.\qquad \qquad \end{array} }$$
(5.30)

Theorem 5.5 (Equivalence)

The solution of the problem (FE) is equivalent to the solution of problem (FD).

Proof

 

  a) (FD) ⇒ (FE):

    Let w solve (FD), so w ≥ g, and

    $$\displaystyle{(v-w)^{\mbox{ $tr$}}(Cw-r) = (v-g)^{\mbox{ $tr$}}\mathop{\underbrace{(Cw - r)}}\limits _{\geq 0}-\mathop{\underbrace{(w - g)^{\mbox{ $tr$}}(Cw - r)}}\limits _{=0}}$$

    hence (v − w)^tr(Cw − r) ≥ 0 for all v ≥ g.

  b) (FE) ⇒ (FD):

    Let w solve (FE), so w ≥ g, and

    $$\displaystyle{v^{\mbox{ $tr$}}(Cw - r) \geq w^{\mbox{ $tr$}}(Cw - r)\quad \text{for all }v \geq g\,.}$$

    Suppose the kth component of Cw − r is negative, and make v_k arbitrarily large. Then the left-hand side becomes arbitrarily small, which is a contradiction. So Cw − r ≥ 0. Now

    $$\displaystyle{w \geq g\ \Longrightarrow\ (w - g)^{\mbox{ $tr$}}(Cw - r) \geq 0\,.}$$

    Setting v = g in (FE) gives (w − g)^tr(Cw − r) ≤ 0. Therefore (w − g)^tr(Cw − r) = 0.

5.3.4.2 Implementation

As a consequence of this equivalence, the solution of the finite-element problem (FE) can be calculated with the methods we applied to problem (FD) in Sect. 4.6. Following the exposition in Sect. 4.6.2, the kernel of the finite-element Algorithm 5.4 can be written as follows:

$$\displaystyle\begin{array}{rcl} \mathbf{(FE')}& & \text{Solve }Cw = r\text{ componentwise such that} {}\\ & & \text{the side condition }w \geq g\text{ is obeyed.} {}\\ \end{array}$$

The vector v is not calculated. Boundary conditions on w are set up in the same way as discussed in Sect. 4.4 and summarized in Algorithm 4.14. Consequently, in the special case of an equidistant x-grid, the finite-element algorithm closely parallels Algorithm 4.14; there is no need to repeat it ( Exercise 5.4). In the general nonequidistant case, the off-diagonal and diagonal elements of the tridiagonal matrix C vary with i, and the formulation of the SOR loop becomes more involved. The details of the implementation are technical and omitted. Algorithm 4.15 is the same in the finite-element case.
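The kernel (FE′) can be realized by a projected SOR iteration as in Sect. 4.6.2. The following Python sketch (our names `psor`, `omega`, `tol`; a dense matrix for brevity, although C is tridiagonal) illustrates the componentwise solution with projection onto w ≥ g:

```python
# Projected SOR sketch for step (FE'): solve C w = r componentwise while
# enforcing w >= g. A minimal illustration, not the book's code; psor,
# omega, tol are illustrative names.
import numpy as np

def psor(C, r, g, w0, omega=1.2, tol=1e-10, maxiter=20000):
    """Projected SOR for (v - w)^tr (C w - r) >= 0, w >= g."""
    w = np.maximum(np.asarray(w0, dtype=float).copy(), g)
    n = len(w)
    for _ in range(maxiter):
        w_old = w.copy()
        for i in range(n):
            # Gauss-Seidel value for component i ...
            gs = (r[i] - C[i, :i] @ w[:i] - C[i, i+1:] @ w[i+1:]) / C[i, i]
            # ... relaxed by omega, then projected onto w_i >= g_i
            w[i] = max(g[i], w[i] + omega * (gs - w[i]))
        if np.max(np.abs(w - w_old)) < tol:
            break
    return w
```

At the fixed point, each component either satisfies the equation (Cw − r)_i = 0 or sits on the obstacle w_i = g_i with (Cw − r)_i ≥ 0, which is exactly the complementarity (FD).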

The computational results match those of Chap. 4 and are not repeated. The costs of the simple finite-element version presented here are slightly lower than those of the finite-difference approach, because we can take advantage of an optimal spacing of the mesh points x_i. For arguments on the closeness of \(\widehat{y}\) to y, we refer to Sect. 5.5.

5.4 Two-Asset Options

In Sect. 3.5.5 we discussed an option based on two assets with prices S_1, S_2. There we applied Monte Carlo simulation to the GBM model; see Example 3.9. For the mathematical model we have chosen the Black–Scholes market. The corresponding PDE for the value function V(S_1, S_2, t) is

$$\displaystyle{ \begin{array}{rcl} {\partial V \over \partial t} &+&{1 \over 2}\sigma _{1}^{2}S_{ 1}^{2}{ \partial ^{2}V \over \partial S_{1}^{2}} + (r -\delta _{1})S_{1}{ \partial V \over \partial S_{1}} - rV \\ &+&{1 \over 2}\sigma _{2}^{2}S_{ 2}^{2}{ \partial ^{2}V \over \partial S_{2}^{2}} + (r -\delta _{2})S_{2}{ \partial V \over \partial S_{2}} +\rho \sigma _{1}\sigma _{2}S_{1}S_{2}{ \partial ^{2}V \over \partial S_{1}\partial S_{2}} = 0\,, \end{array} }$$
(5.31)

with dividend rates δ_1, δ_2. (For the general case see Sect. 6.2.) Notice that for S_2 = 0 the familiar one-dimensional Black–Scholes equation results. The model is completed by a payoff function Ψ(S_1, S_2) and the terminal condition V(S_1, S_2, T) = Ψ(S_1, S_2). The computational domain \(\mathcal{D}\) is two-dimensional, \(\mathcal{D}\subset \mathbb{R}^{2}\) (disregarding time t).

Example 5.6 (European Call on a Basket with Double Barrier)

We consider a call on a two-asset basket with two knock-out barriers. The payoff of this exotic European-style option is

$$\displaystyle{\varPsi (S_{1},S_{2}) = (S_{1} + S_{2} - K)^{+}\,,}$$

up to the barriers (see Fig. 5.1). In the underlying basket the two assets have equal weight. The two knock-out barriers are given by B_1 and B_2: down-and-out at B_1, up-and-out at B_2. That is, the option ceases to exist when S_1 + S_2 ≤ B_1 or when S_1 + S_2 ≥ B_2; in both cases V = 0. In this example, the computational domain \(\mathcal{D}\) is easy to define: the value function is zero outside the barriers. Hence the domain is bounded by the two lines S_1 + S_2 = B_1 and S_1 + S_2 = B_2. This shape of \(\mathcal{D}\) naturally suggests tiling the domain into a grid of triangular elements \(\mathcal{D}_k\). One possible triangulation is shown in Fig. 5.5, where a structured regular subdivision is applied. For this example we choose the parameters

$$\displaystyle\begin{array}{rcl} & & K = 1\,,\;T = 1\,,\;\sigma _{1} =\sigma _{2} = 0.25\,,\;\rho = 0.7\,,\;r = 0.05\,, {}\\ & & \delta _{1} =\delta _{2} = 0\,,\;\;B_{1} = 1\,,\;B_{2} = 2\,. {}\\ \end{array}$$

The values of V for S_1 → 0 and S_2 → 0 are known from the one-dimensional Black–Scholes equation; just set either S_1 = 0 or S_2 = 0 in (5.31). These values of single-asset double-barrier options for B_1 ≤ S ≤ B_2 can be evaluated by a closed-form formula, see [172]. We shall come back to this example below.

5.4.1 Analytical Preparations

It is convenient to solve the Black–Scholes equation in divergence form. To this end, use standard PDE variables x := S_1, y := S_2 for the independent variables and u(x, y, t) for the dependent variable, and derive the vector PDE for u

$$\displaystyle{ -\nabla \cdot (D(x,y)\nabla u) + b(x,y)^{\mbox{ $tr$}}\nabla u + ru = u_{t}\,. }$$
(5.32)

This makes use of the formal “nabla” vector \(\nabla:= ({ \partial \over \partial x},{ \partial \over \partial y})^{\mbox{ $tr$}}\), and

$$\displaystyle{ \begin{array}{rcl} &&D(x,y):={ 1 \over 2}\left (\begin{array}{*{10}c} \sigma _{1}^{2}x^{2}&\rho \sigma _{1}\sigma _{2}xy \\ \rho \sigma _{1}\sigma _{2}xy&\sigma _{2}^{2}y^{2}\\ \end{array} \right )\,, \\ &&b(x,y):= -\left (\begin{array}{*{10}c} (r -\delta _{1} -\sigma _{1}^{2} -\rho \sigma _{1}\sigma _{2}/2)\,x \\ (r -\delta _{2} -\sigma _{2}^{2} -\rho \sigma _{1}\sigma _{2}/2)\,y\\ \end{array} \right )\,. \end{array} }$$
(5.33)

∇u is the gradient of u, and the dot-product notation

$$\displaystyle{\nabla \cdot U ={ \partial U_{1} \over \partial x} +{ \partial U_{2} \over \partial y} }$$

for a vector function U denotes the divergence; the ⋅ corresponds to the scalar product, analogous to the ^tr notation for vectors. The reader is invited to check the equivalence with (5.31) ( Exercise 5.5). The advantage of version (5.32) over (5.31) lies in the simple treatment of the second-order derivatives: they can be removed, and a weak version can be derived. This will become apparent below.
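The equivalence check of Exercise 5.5 can also be delegated to a computer algebra system. The following sketch, assuming sympy is available, expands the spatial part of the divergence form, −∇·(D∇u) + b^tr∇u + ru with D, b from (5.33), and compares it with the spatial part of (5.31) solved for u_t:

```python
# Symbolic check of the divergence form (5.32)/(5.33) against the
# Black-Scholes PDE (5.31); a sketch using sympy, not from the book.
import sympy as sp

x, y, r, d1, d2, s1, s2, rho = sp.symbols('x y r delta1 delta2 sigma1 sigma2 rho')
u = sp.Function('u')(x, y)

D = sp.Rational(1, 2) * sp.Matrix([[s1**2 * x**2, rho*s1*s2*x*y],
                                   [rho*s1*s2*x*y, s2**2 * y**2]])
b = -sp.Matrix([(r - d1 - s1**2 - rho*s1*s2/2) * x,
                (r - d2 - s2**2 - rho*s1*s2/2) * y])

grad_u = sp.Matrix([u.diff(x), u.diff(y)])
flux = D * grad_u
div_flux = flux[0].diff(x) + flux[1].diff(y)

# spatial part of the divergence form (5.32): -div(D grad u) + b^tr grad u + r u
lhs = -div_flux + (b.T * grad_u)[0] + r*u

# spatial part of (5.31), solved for u_t
bs_rhs = -(sp.Rational(1, 2)*s1**2*x**2*u.diff(x, 2)
           + (r - d1)*x*u.diff(x)
           + sp.Rational(1, 2)*s2**2*y**2*u.diff(y, 2)
           + (r - d2)*y*u.diff(y)
           + rho*s1*s2*x*y*u.diff(x, y)) + r*u

difference = sp.simplify(lhs - bs_rhs)
print(difference)  # 0 confirms the equivalence
```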

5.4.2 Weighted Residuals

The partial differential equation (5.32) can be represented by R(u, x, y, t) = 0, where

$$\displaystyle\begin{array}{rcl} R(u,x,y,t):=& -& \nabla \cdot (D(x,y)\nabla u(x,y,t)) + b(x,y)^{\mbox{ $tr$}}\nabla u(x,y,t) {}\\ & +& ru(x,y,t) -{ \partial u(x,y,t) \over \partial t} {}\\ \end{array}$$

denotes the residual. As in Sect. 5.1, the residual is used to set up an integral equation. To this end, introduce weighting functions v, multiply the residual of the PDE by v(x, y, t), and require

$$\displaystyle{ \int _{\mathcal{D}}R(u,x,y,t)\,v\;\mathrm{d}x\,\mathrm{d}y = 0\,. }$$
(5.34)

This integral over the computational domain \(\mathcal{D}\subset \mathbb{R}^{2}\) is a double integral. It depends on t, and should vanish for all 0 ≤ tT and arbitrary v. We consider u to be a solution in case (5.34) holds for “all” v. This is a weak version of the PDE and requires less regularity of its “weak” solutions u. Aspects of accuracy are postponed to Sect. 5.5.

To exploit the potential of the integral version (5.34), we transform the second-order derivatives to first order, comparable to integration by parts. The leading integral over the second-order term is

$$\displaystyle{\int _{\mathcal{D}}-\nabla \cdot (D\nabla u)\,v\;\mathrm{d}x\,\mathrm{d}y\,.}$$

The reader may check, for the vector U := vD∇u, the formula for the divergence ∇⋅U, namely,

$$\displaystyle{\nabla \cdot \, (vD\nabla u) = (\nabla v)^{\mbox{ $tr$}}D\nabla u + v\,\nabla \cdot (D\nabla u)\,,}$$

and hence

$$\displaystyle{-\int _{\mathcal{D}}v\,\nabla \cdot (D\nabla u)\,\mathrm{d}x\,\mathrm{d}y =\int _{\mathcal{D}}(\nabla v)^{\mbox{ $tr$}}D\nabla u\,\mathrm{d}x\,\mathrm{d}y -\int _{\mathcal{D}}\nabla \cdot (vD\nabla u)\,\mathrm{d}x\,\mathrm{d}y\,.}$$

Next we quote the divergence theorem, here for the two-dimensional situation:

$$\displaystyle{ \int _{\mathcal{D}}\nabla \cdot U\,\mathrm{d}x\,\mathrm{d}y =\int _{\partial \mathcal{D}}U^{\mbox{ $tr$}}n\,\mathrm{d}s\,, }$$
(5.35)

where \(\partial \mathcal{D}\) denotes the boundary of \(\mathcal{D}\), and n is the outward unit normal vector on \(\partial \mathcal{D}\). (n is perpendicular to the curve \(\partial \mathcal{D}\) and points away from \(\mathcal{D}\).) The parameter s measures the arclength along the boundary \(\partial \mathcal{D}\). We apply the divergence theorem to the specific vector U := vD∇u, and arrive at the result for the second-order term

$$\displaystyle{-\int _{\mathcal{D}}v\,\nabla \cdot (D\nabla u)\,\mathrm{d}x\,\mathrm{d}y =\int _{\mathcal{D}}(\nabla v)^{\mbox{ $tr$}}D\nabla u\,\mathrm{d}x\,\mathrm{d}y-\int _{\partial \mathcal{D}}(vD\nabla u)^{\mbox{ $tr$}}n\,\mathrm{d}s\,.}$$

In (5.32)/(5.33) the matrix D is symmetric, D = D^tr. For symmetric D the integrand in the boundary integral is v(∇u)^tr Dn. After the above transformations of the leading integral, we rewrite (5.34) into

$$\displaystyle{ \int _{\mathcal{D}}\left [(\nabla v)^{\mbox{ $tr$}}D\nabla u + vb^{\mbox{ $tr$}}\nabla u + ruv -{ \partial u \over \partial t} v\right ]\,\mathrm{d}x\,\mathrm{d}y-\int _{\partial \mathcal{D}}v(\nabla u)^{\mbox{ $tr$}}Dn\,\mathrm{d}s = 0\,. }$$
(5.36)

Recall that u and v as well as ∇u and ∇v depend on x, y, t, and hence the integrals depend on t. This is the weak version of the PDE (5.32).

Next discretize the time 0 ≤ tT as in Chap. 4, say, with equidistant steps Δt. For the simplest implicit approach, the derivative with respect to time t is resolved by the first-order difference quotient,

$$\displaystyle{{\partial u(x,y,t) \over \partial t} \approx { u(x,y,t +\varDelta t) - u(x,y,t) \over \varDelta t} \,.}$$

For backward running time t,

$$\displaystyle{u_{\mathrm{pre}}:= u(x,y,t +\varDelta t)}$$

is known at time t from the calculation of the previous time level. The analogue of the fully implicit time-stepping method is then to solve (5.36) at time level t for \({\partial u \over \partial t}\) replaced by

$$\displaystyle{{1 \over \varDelta t} (u_{\mathrm{pre}} - u)\,,}$$

starting at t = T − Δt with the payoff, u_pre = Ψ. With this approximation, the function u in (5.36) approximates the value function V at time level t. Alternatively, a second-order time discretization can be applied, similarly as in Sect. 4.3. For the required regularity of the functions u and v, consult Sect. 5.5.

5.4.3 Boundary

Boundary conditions enter via the boundary integral along the boundary \(\partial \mathcal{D}\). In practice, the computational domain \(\mathcal{D}\) is defined by specifying \(\partial \mathcal{D}\). To this end, express the curve \(\partial \mathcal{D}\) as the union of a finite number of non-overlapping, piecewise smooth boundary curves \(\partial \mathcal{D}_{1},\partial \mathcal{D}_{2},\ldots\). Each of these curves is parameterized as in

$$\displaystyle{\partial \mathcal{D}_{1}:=\{\, (g_{1}(\xi ),h_{1}(\xi ))\,\mid \,a_{1} \leq \xi \leq b_{1}\,\}\,.}$$

In this way, an orientation is given by starting the curve at the parameter value ξ = a_1 and ending at ξ = b_1. By specifying parameter intervals a_1 ≤ ξ ≤ b_1 and parametric functions g_1, h_1, the entire boundary is defined. The convention is to orient the parameterizations such that the domain \(\mathcal{D}\) lies on the left-hand side as we run through them for increasing parameter values ξ.
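This orientation convention can be verified numerically: traversing a correctly oriented closed boundary yields a positive signed area by the shoelace formula. A sketch for the trapezoidal domain of Example 5.6, with corners (1, 0), (2, 0), (0, 2), (0, 1); the names `polyline` and `signed_area` are ours:

```python
# Orientation check for the trapezoidal domain of Example 5.6 (barriers
# x + y = 1 and x + y = 2). Traversing the four boundary pieces for
# increasing xi should leave the domain on the left, i.e. give a positive
# signed area by the shoelace formula. Illustrative sketch only.
import numpy as np

def polyline(n=50):
    """Sample the four boundary curves in order, each over 0 <= xi < 1."""
    xi = np.linspace(0.0, 1.0, n, endpoint=False)
    d1 = np.column_stack([1 + xi, 0*xi])    # (1,0) -> (2,0), lower edge
    d2 = np.column_stack([2 - 2*xi, 2*xi])  # (2,0) -> (0,2), along x+y=2
    d3 = np.column_stack([0*xi, 2 - xi])    # (0,2) -> (0,1), left edge
    d4 = np.column_stack([xi, 1 - xi])      # (0,1) -> (1,0), along x+y=1
    return np.vstack([d1, d2, d3, d4])

def signed_area(p):
    """Shoelace formula; positive for counterclockwise orientation."""
    x, y = p[:, 0], p[:, 1]
    return 0.5 * np.sum(x*np.roll(y, -1) - np.roll(x, -1)*y)
```

The trapezoid has area 2 − 1/2 = 3/2, and the positive sign confirms that the domain lies on the left of the parameterized curve.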

Now the curve \(\partial \mathcal{D}\) is defined and we address the boundary integral along that curve. It is split into a sum of integrals according to the piecewise smooth curves \(\partial \mathcal{D}_{1},\partial \mathcal{D}_{2},\ldots\). For example, the boundary of the domain in Fig. 5.5 consists of four such parts ( Exercise 5.6).

The product-type integrand f(x, y) := v(∇u)^tr Dn suggests placing emphasis on two specific kinds of boundary conditions, namely,

  • v is prescribed (Dirichlet boundary conditions) ,

  • (∇u)tr Dn is prescribed (Neumann boundary conditions ).

The boundary differential operator (∇u)^tr Dn = n^tr D∇u can be considered as a generalized directional derivative, since \({\partial u \over \partial n} = n^{\mbox{ $tr$}}\nabla u\). Mixed boundary conditions are possible as well. If we cast the components of the vector n^tr D into a vector (α_1, α_2), then all types of boundary conditions can be written in the form

$$\displaystyle{\alpha _{1}(x,y){\partial u \over \partial x} +\alpha _{2}(x,y){\partial u \over \partial y} =\alpha _{0}(x,y)\,u +\beta (x,y)}$$

with proper functions α 0 and β. Then

$$\displaystyle{v\,(\alpha _{0}(x,y)\,u +\beta (x,y))}$$

is substituted into the boundary integral, which is approximated numerically using the edges of the triangulation of \(\mathcal{D}\).

Fortunately, boundary conditions are frequently of simple form. In particular one encounters the two types

  • u = 0 (or v = 0), which is of Dirichlet type with α 1 = α 2 = β = 0 and α 0 ≠ 0.

  • (∇u)tr Dn = 0, which is of Neumann type with α 0 = β = 0 and nonzero vector (α 1, α 2).

The boundary \(\partial \mathcal{D}\) may consist, for example, of two parts \(\partial \mathcal{D}_{\mathrm{D}}\) and \(\partial \mathcal{D}_{\mathrm{N}}\) with \(\partial \mathcal{D} = \partial \mathcal{D}_{\mathrm{D}} \cup \partial \mathcal{D}_{\mathrm{N}}\), \(\partial \mathcal{D}_{\mathrm{D}} \cap \partial \mathcal{D}_{\mathrm{N}} =\emptyset\), with Dirichlet conditions on \(\partial \mathcal{D}_{\mathrm{D}}\) and Neumann conditions on \(\partial \mathcal{D}_{\mathrm{N}}\). Clearly, boundary integrals vanish for the special cases v = 0 or (∇u)^tr Dn = 0. Neumann conditions have the advantage that they need not be specified explicitly in weak formulations. This is an advantage of FEM over discretizing the PDE by finite differences, where all boundary conditions must be implemented; for FEM it suffices to implement Dirichlet conditions. Defining the right boundary conditions can be demanding. Besides being financially meaningful, they should make the problem well-posed, that is, such that it defines a unique solution. To some extent, defining proper boundary conditions is an art.

Example 5.7 (European Binary Put as in Example 3.9)

In Chap. 3 the binary put of Example 3.9 was simulated with Monte Carlo, and no boundary or boundary conditions were needed. Here we prepare the example to be solved by FEM. Again, x := S_1, y := S_2. As in Chap. 4, the domain 0 < x < ∞, 0 < y < ∞ must be truncated to finite size. A simple choice of a computational domain is a rectangle

$$\displaystyle{\mathcal{D} =\{\, (x,y)\mid \,0 \leq x \leq x_{\mathrm{max}},\ 0 \leq y \leq y_{\mathrm{max}}\,\}}$$

with x_max, y_max large enough that the zero boundary condition u = 0 can be chosen as an approximation for x = x_max or y = y_max. The rectangle is bounded by four straight lines, which can be parameterized, for example, by

$$\displaystyle\begin{array}{rcl} & & \partial \mathcal{D}_{1}:=\{\, x =\xi,\,y = 0\ \mid \quad 0 \leq \xi \leq x_{\mathrm{max}}\,\}\,, {}\\ & & \partial \mathcal{D}_{2}:=\{\, x = x_{\mathrm{max}},\,y =\xi \ \mid \quad 0 \leq \xi \leq y_{\mathrm{max}}\,\}\,, {}\\ & & \partial \mathcal{D}_{3}:=\{\, x = x_{\mathrm{max}}-\xi,\,y = y_{\mathrm{max}}\ \mid \quad 0 \leq \xi \leq x_{\mathrm{max}}\,\}\,, {}\\ & & \partial \mathcal{D}_{4}:=\{\, x = 0,\,y = y_{\mathrm{max}} -\xi \ \mid \quad 0 \leq \xi \leq y_{\mathrm{max}}\,\}\,. {}\\ \end{array}$$

Now \(\partial \mathcal{D} = \partial \mathcal{D}_{1} \cup \partial \mathcal{D}_{2} \cup \partial \mathcal{D}_{3} \cup \partial \mathcal{D}_{4}\), and the parameterized curve has the domain on the left. Dirichlet conditions are imposed on \(\partial \mathcal{D}_{2}\) and \(\partial \mathcal{D}_{3}\), where we have chosen to approximate the boundary values by requiring u = 0. For y = 0 the boundary conditions can be chosen as the values of the one-dimensional European binary put. An analytic formula for the one-dimensional case of a European binary put is

$$\displaystyle{V _{\mathrm{binP}}^{\mathrm{Eur}}(S,t):= c\,\mathrm{e}^{-r(T-t)}\,F\left (-{\log (S/K) + (r -\sigma ^{2}/2)(T - t) \over \sigma \sqrt{T - t}} \right )\,,}$$

for a face value c, with the standard normal distribution F [172]. For y = 0 we set S = x. The same formula applies on the boundary with x = 0, then with S = y. In this way, on \(\partial \mathcal{D}_{1}\) and \(\partial \mathcal{D}_{4}\) the boundary conditions are of Dirichlet type with u = V_binP^Eur. With this choice of boundary conditions, \(\partial \mathcal{D}_{\mathrm{D}} = \partial \mathcal{D}\) and \(\partial \mathcal{D}_{\mathrm{N}} =\emptyset\). But there is a simpler choice: as [300] points out, this Dirichlet condition is implicitly defined by the PDE, because the one-dimensional PDE is embedded in (5.31) for S_1 = 0 or S_2 = 0. So no boundary condition needs to be specified along \(\partial \mathcal{D}_{1}\) and \(\partial \mathcal{D}_{4}\); this amounts to zero Neumann conditions. Both the Dirichlet version and the Neumann version work. The latter has the advantage of avoiding the effort of evaluating V_binP^Eur.
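For the Dirichlet version, the formula for V_binP^Eur must be evaluated along \(\partial \mathcal{D}_{1}\) and \(\partial \mathcal{D}_{4}\). A sketch of this evaluation in Python, with the standard normal cdf F expressed via the error function; the function name and parameter values are illustrative:

```python
# Evaluating the one-dimensional European binary (cash-or-nothing) put
# formula quoted above; binary_put is an illustrative name. F is the
# standard normal cdf, written via math.erf.
import math

def binary_put(S, t, K, T, r, sigma, c=1.0):
    """c * exp(-r(T-t)) * F(-(log(S/K) + (r - sigma^2/2)(T-t)) / (sigma sqrt(T-t)))."""
    z = -(math.log(S / K) + (r - 0.5*sigma**2) * (T - t)) / (sigma * math.sqrt(T - t))
    F = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))
    return c * math.exp(-r * (T - t)) * F
```

On \(\partial \mathcal{D}_{1}\) one would call it with S = x, on \(\partial \mathcal{D}_{4}\) with S = y. For S → 0 the value approaches the discounted face value c e^{−r(T−t)}, and for large S it tends to zero, as expected for a put paying c when S_T < K.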

The implementation of the weak form (5.36) is straightforward when, for example, the package FreeFem++ is applied. Thereby a figure similar to Fig. 3.7 is produced easily.

5.4.4 Involved Matrices

The accuracy of FEM depends on how the grid is chosen. Algorithms for mesh generation and mesh adaption are needed, but these are demanding topics. It is cumbersome to implement a two-dimensional FEM yourself. For first results one may work with a fixed structured grid, but in general it is advisable and convenient to apply an FEM package to solve (5.36). Here we merely focus on how the two-dimensional analogue of the hat functions enters.

For the Ritz–Galerkin approach we apply the basis representation

$$\displaystyle{ w(x,y,t) =\sum _{i}w_{i}(t)\,\varphi _{i}(x,y) }$$
(5.37)

as an approximation for u, and set v = φ_j. This ansatz separates time t and “space” (x, y). The functions φ_i are defined on \(\mathcal{D}\).

For basis functions, we choose the two-dimensional hat functions, which perfectly match triangular elements. The situation is shown schematically in Fig. 5.9: the central node l belongs to several adjacent triangles, which constitute the support (shaded) on which φ_l is built from planar pieces. This defines a tent-like hat function φ_l, which is zero outside its support. By linear combination of such basis functions, piecewise planar surfaces above the computational domain are constructed. Locally, for one triangle, this may look like the element in Fig. 5.4.

Fig. 5.9

Two-dimensional hat function φ l (x, y) (zero outside the shaded area)
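On each triangle of its support, such a hat function is simply the unique linear function with value 1 at its own node and 0 at the two other vertices. A minimal Python sketch of this evaluation (function and variable names are our own):

```python
import numpy as np

def hat_on_triangle(p, tri, node):
    """Value at point p of the planar piece of the hat function that is
    centered at vertex `node` of the triangle `tri` (3x2 vertex coordinates)."""
    A = np.hstack([np.ones((3, 1)), tri])   # rows (1, x_i, y_i)
    w = np.zeros(3)
    w[node] = 1.0                           # 1 at its node, 0 at the others
    c = np.linalg.solve(A, w)               # plane c1 + c2*x + c3*y
    return c[0] + c[1] * p[0] + c[2] * p[1]

tri = np.array([[0., 0.], [1., 0.], [0., 1.]])
print(hat_on_triangle([0.0, 0.0], tri, 0))   # value 1 at its own node
print(hat_on_triangle([0.5, 0.5], tri, 0))   # vanishes on the opposite edge
```

Gluing these planar pieces over all triangles adjacent to a node yields the tent shape of Fig. 5.9.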

Notice that \(\nabla w =\sum w_{i}\nabla \varphi _{i}\). The weak form of (5.36) leads to

$$\displaystyle\begin{array}{rcl} & & \int _{\mathcal{D}}(\nabla \varphi _{j})^{\mbox{ $tr$}}D\sum w_{i}\nabla \varphi _{i} + {}\\ & & \qquad \varphi _{j}\left [b^{\mbox{ $tr$}}(\sum w_{i}\nabla \varphi _{i}) + r\sum w_{i}\varphi _{i} -\sum { \partial w_{i} \over \partial t} \varphi _{i}\right ]\,\mathrm{d}x\,\mathrm{d}y {}\\ & -& \int _{\partial \mathcal{D}}\varphi _{j}(\sum w_{i}\nabla \varphi _{i})^{\mbox{ $tr$}}Dn\,\mathrm{d}s = 0\,, {}\\ \end{array}$$

for all j. This is a system of ODEs

$$\displaystyle{ \begin{array}{rcl} &&\sum _{i}w_{i}\int _{\mathcal{D}}\left [(\nabla \varphi _{j})^{\mbox{ $tr$}}D\nabla \varphi _{i} +\varphi _{j}b^{\mbox{ $tr$}}\nabla \varphi _{i} +\varphi _{j}r\varphi _{i}\right ]\,\mathrm{d}x\,\mathrm{d}y \\ &&\quad -\sum _{i}{\partial w_{i} \over \partial t} \int _{\mathcal{D}}\varphi _{i}\varphi _{j}\,\mathrm{d}x\,\mathrm{d}y -\sum _{i}w_{i}\int _{\partial \mathcal{D}}\varphi _{j}(\nabla \varphi _{i})^{\mbox{ $tr$}}Dn\,\mathrm{d}s = 0\,. \end{array} }$$
(5.38)

As an exercise, the reader should rewrite this ODE system in matrix-vector notation. In summary, FEM needs the integrals over the domain \(\mathcal{D}\)

$$\displaystyle\begin{array}{rcl} & &\int (\nabla \varphi _{j})^{\mbox{ $tr$}}\,D\,\nabla \varphi _{i}\quad \mbox{ (``diffusion terms'')}\,, {}\\ & & \int \varphi _{j}b^{\mbox{ $tr$}}\nabla \varphi _{i}\quad \mbox{ (``convection terms'')}\,, {}\\ & & \int \gamma \varphi _{j}\varphi _{i}\quad \mbox{ (``reaction terms'')}\,, {}\\ \end{array}$$

where γ is chosen appropriately, and in addition boundary integrals along \(\partial \mathcal{D}\).

For each number k of a triangle, there are three vertices, with node numbers i, j, l as in Fig. 5.9. Hence the table \(\mathcal{I}\) of index sets that assigns nodes to triangles includes the entry

$$\displaystyle{\mathcal{I}_{k}:=\{ i,j,l\}\,.}$$

Only for the three node numbers \(i,j,l \in \mathcal{I}_{k}\) are the local integrals on \(\mathcal{D}_{k}\) nonzero. They can be arranged into 3 × 3 element matrices. For the derivation of the integrals, it makes sense to use a local numbering \(1_{k},2_{k},3_{k}\) for the nodes of \(\mathcal{D}_{k}\). The assembling loop over k distributes up to 27 local integrals calculated on \(\mathcal{D}_{k}\), nine integrals of each of the above three types, into the corresponding global matrices.
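To make the assembling concrete, here is a small Python sketch for the “diffusion” integrals with D = I; it uses the element matrix \(\vert F_{k}\vert \,G_{k}^{tr}G_{k}\) derived in Exercise 5.7 (the function names and the toy two-triangle grid are our own choices):

```python
import numpy as np

def element_stiffness(xy):
    """3x3 element stiffness matrix |F_k| G_k^tr G_k for one triangle
    with vertex coordinates xy (3x2 array), linear hat functions, D = I."""
    A = np.hstack([np.ones((3, 1)), xy])   # rows (1, x_i, y_i)
    Fk = 0.5 * np.linalg.det(A)            # signed area of the triangle
    G = np.linalg.inv(A)[1:, :]            # 2x3 matrix of hat-function gradients
    return abs(Fk) * G.T @ G

def assemble_stiffness(nodes, triangles):
    """Assembling loop over k: distribute the nine local 'diffusion'
    integrals of each triangle into the global matrix."""
    S = np.zeros((len(nodes), len(nodes)))
    for tri in triangles:                  # tri plays the role of I_k = {i, j, l}
        Se = element_stiffness(nodes[tri])
        for a, i in enumerate(tri):
            for b, j in enumerate(tri):
                S[i, j] += Se[a, b]
    return S

# two triangles tiling the unit square
nodes = np.array([[0., 0.], [1., 0.], [1., 1.], [0., 1.]])
triangles = [np.array([0, 1, 2]), np.array([0, 2, 3])]
S = assemble_stiffness(nodes, triangles)
```

The resulting global matrix is symmetric with zero row sums, since the gradients of the hat functions on each triangle sum to zero.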

Returning to Example 5.6, we solve (5.36) with FEM. Figure 5.10 shows a FEM solution with 192 triangles. Figure 5.11 illustrates a mesh structure for higher resolution obtained with FreeFem++. In the two-dimensional case, because of higher costs, we typically confine ourselves to a lower accuracy than in the one-dimensional situation. Based on our results we state

$$\displaystyle{V (1.25,\,0.25,\,0) \approx 0.2949\,.}$$
Fig. 5.10

Rough approximation of the value function V (S 1, S 2, 0) of a basket double-barrier call option, Example 5.6. With kind permission of Anna Kvetnaia

Fig. 5.11

Finer approximation of the value function V (S 1, S 2, 0) of a basket double-barrier call option, Example 5.6

Example 5.8 (Heston’s PDE)

In Example 1.16 Heston’s model was introduced, where v denotes a stochastic volatility. The corresponding PDE from [178] is

$$\displaystyle{ \begin{array}{rcl} {\partial V \over \partial t} &+&{1 \over 2}vS^{2}{\partial ^{2}V \over \partial S^{2}} +{ 1 \over 2}\sigma _{\mathrm{v}}^{2}v{\partial ^{2}V \over \partial v^{2}} +\rho \sigma _{\mathrm{v}}vS{ \partial ^{2}V \over \partial S\partial v} \\ &+&rS{\partial V \over \partial S} + [\kappa (\theta -v) -\lambda v]{\partial V \over \partial v} - rV = 0\,,\end{array} }$$
(5.39)

with parameters as in (1.59), and λ standing for the market price of volatility risk. Here we are interested in solutions V (S, v, t) on part of a two-dimensional (S, v)-plane. The PDE (5.39) can be cast into version (5.32). As an exercise, the reader is encouraged to derive D and b and, with the payoff of a call and a choice of parameters of one's own, to think about suitable boundary conditions and to do experiments with (5.39). Note that for a call, a reasonable requirement for maximal values of the volatility v is V = S. When in addition the interest rate r is replaced by a stochastic variable, the PDE is based on a three-dimensional domain [163].
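To see how the second-order terms of (5.39) fit the divergence form, one can verify symbolically that a candidate diffusion matrix reproduces them up to first-order corrections, which are then absorbed into the vector b. A sketch with sympy; the matrix D below is our own candidate for the principal part, not a formula taken from the text:

```python
import sympy as sp

S, v, rho, sig = sp.symbols('S v rho sigma_v')
V = sp.Function('V')(S, v)

# candidate symmetric diffusion matrix for the principal part of (5.39)
D = sp.Matrix([[v * S**2 / 2,       rho * sig * v * S / 2],
               [rho * sig * v * S / 2, sig**2 * v / 2]])

grad = sp.Matrix([V.diff(S), V.diff(v)])
flux = D * grad
div = flux[0].diff(S) + flux[1].diff(v)          # div(D grad V)

# the second-order terms of Heston's PDE
principal = (v * S**2 / 2 * V.diff(S, 2) + sig**2 * v / 2 * V.diff(v, 2)
             + rho * sig * v * S * V.diff(S, v))

correction = sp.expand(div - principal)
print(correction)   # only first-order derivatives of V remain
```

The leftover first-derivative terms are exactly what distinguishes the divergence form from the original PDE; they are moved into b, together with the rS and κ(θ − v) − λv coefficients.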

5.5 Error Estimates

The similarity of the finite-element equation (5.21) with the finite-difference equation (4.24) suggests that the errors may be of the same order. In fact, numerical experiments confirm that the finite-element approach with the linear basis functions from Definition 5.1 produces errors decaying quadratically with the mesh size. Applying the finite-element Algorithm 5.4 and entering the calculated data into a diagram such as Fig. 4.14 confirms the quadratic order experimentally. The proof of this order of the error is more difficult for finite-element methods because weak solutions assume less smoothness. For standard options, the separation of variables in (5.19) also separates the discussion of the order, and an analysis of the one-dimensional situation suffices. This section explains some basic ideas of how to derive error estimates. We begin by reconsidering some of the related topics that have been introduced in previous sections.

5.5.1 Strong and Weak Solutions

Our exposition will be based on the model problem (5.15). That is, the simple second-order differential equation

$$\displaystyle{ -u'' = f(x)\quad \text{ for }\alpha <x <\beta }$$
(5.40)

with given f, and homogeneous Dirichlet-boundary conditions

$$\displaystyle{ u(\alpha ) = u(\beta ) = 0 }$$
(5.41)

will serve as illustration. The differential equation is of the form Lu = f, compare (5.2). The domain \(\mathcal{D}\subseteq \mathbb{R}^{n}\) on which functions u are defined specializes for n = 1 to the open and bounded interval \(\mathcal{D} =\{\, x \in \mathbb{R}^{1}\mid \alpha <x <\beta \,\}\). For continuous f, solutions of the differential equation (5.40) satisfy \(u \in \mathcal{C}^{2}(\mathcal{D})\). In order to have operative boundary conditions, solutions u must be continuous on \(\mathcal{D}\) including its boundary, which is denoted \(\partial \mathcal{D}\). Therefore we require \(u \in \mathcal{C}^{0}(\overline{\mathcal{D}})\) where \(\overline{\mathcal{D}}:= \mathcal{D}\cup \partial \mathcal{D}\). In summary, classical solutions of second-order differential equations require

$$\displaystyle{ u \in \mathcal{C}^{2}(\mathcal{D}) \cap \mathcal{C}^{0}(\overline{\mathcal{D}})\,. }$$
(5.42)

The function space \(\mathcal{C}^{2}(\mathcal{D}) \cap \mathcal{C}^{0}(\overline{\mathcal{D}})\) must be reduced further to comply with the boundary conditions.

For weak solutions the function space is larger ( Appendix C.3). For functions u and v we define the inner product

$$\displaystyle{ (u,v):=\int _{\mathcal{D}}uv\,\mathrm{d}x\,. }$$
(5.43)

Strong solutions u of Lu = f satisfy also

$$\displaystyle{ (Lu,v) = (\,f,v)\quad \text{ for all }v\,. }$$
(5.44)

Specifically for the model problem (5.40)/(5.41) integration by parts leads to

$$\displaystyle{(Lu,v) = -\int _{\alpha }^{\beta }u''v\,\mathrm{d}x = -u'v\Big\vert _{\alpha }^{\beta } +\int _{ \alpha }^{\beta }u'v'\,\mathrm{d}x\,.}$$

The nonintegral term on the right-hand side of the equation vanishes if v also satisfies the homogeneous boundary conditions (5.41). The remaining integral is a bilinear form, which we abbreviate

$$\displaystyle{ b(u,v):=\int _{ \alpha }^{\beta }u'v'\,\mathrm{d}x\,. }$$
(5.45)

Bilinear forms such as b(u, v) from (5.45) are linear in each of the two arguments u and v. For example, b(u 1 + u 2, v) = b(u 1, v) + b(u 2, v) holds. The bilinear form (5.45) is symmetric, b(u, v) = b(v, u). For several classes of more general differential equations, analogous bilinear forms are obtained. Formally, (5.44) can be rewritten as

$$\displaystyle{ b(u,v) = (\,f,v)\,, }$$
(5.46)

where we assume that v satisfies the homogeneous boundary conditions (5.41).
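The identity (Lu, v) = b(u, v) behind (5.46) can be checked on a concrete pair of functions; a small sympy sketch for the model problem on (0, 1), with u and v of our own choosing (both vanish at the boundary, so the nonintegral term drops out):

```python
import sympy as sp

x = sp.symbols('x')
alpha, beta = 0, 1
u = sp.sin(sp.pi * x)    # smooth, u(0) = u(1) = 0
v = x * (1 - x)          # also vanishes at the boundary

# (Lu, v) with L = -d^2/dx^2, and the bilinear form b(u, v)
Lu_v = sp.integrate(-sp.diff(u, x, 2) * v, (x, alpha, beta))
b_uv = sp.integrate(sp.diff(u, x) * sp.diff(v, x), (x, alpha, beta))

print(sp.simplify(Lu_v - b_uv))   # the two integrals agree
```

This is exactly the integration by parts carried out above, specialized to one example.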

The Eq. (5.46) has been derived out of the differential equation, for the solutions of which we have assumed smoothness in the sense of (5.42). Many “solutions” of practical importance do not satisfy (5.42) and, accordingly, are not smooth. In several applications, u or derivatives of u have discontinuities. For instance, consider the obstacle problem of Sect. 4.5.5: The second derivative u″ of the solution fails to be continuous at α and β. Therefore \(u\notin \mathcal{C}^{2}(-1,1)\) no matter how smooth the data function is, compare Fig. 4.10. As mentioned earlier, integral relations require less smoothness.

In the derivation of (5.46) the integral version has resulted as a consequence of the primary differential equation. This is contrary to wide areas of applied mathematics, where an integral relation is based on first principles, and the differential equation is derived in a second step. For example, in the calculus of variations a minimization problem may be described by an integral performance measure, and the differential equation is a necessary criterion [350]. This situation suggests considering the integral relation as an equation in its own right rather than as offspring of a differential equation. This leads to the question, what is the maximal function space such that (5.46) with (5.43), (5.45) is meaningful? That means to ask, for which functions u and v do the integrals exist? For a more detailed background we refer to Appendix C.3. For the introductory exposition of this section it may suffice to sketch the maximal function space briefly. The suitable function space is denoted \(\mathcal{H}^{1}\), the version equipped with the boundary conditions is denoted \(\mathcal{H}_{0}^{1}\). This Sobolev space consists of those functions that are continuous on \(\mathcal{D}\) and that are piecewise differentiable and satisfy the boundary conditions (5.41). This function space corresponds to the class of functions \(\mathcal{K}\) in (5.23). By means of the Sobolev space \(\mathcal{H}_{0}^{1}\) a weak solution of Lu = f is defined, where L is a second-order differential operator and b the corresponding bilinear form.

Definition 5.9 (Weak Solution)

\(u \in \mathcal{H}_{0}^{1}\) is called weak solution [of Lu = f], if b(u, v) = ( f, v) holds for all \(v \in \mathcal{H}_{0}^{1}\,\).

This definition implicitly expresses the task: find a \(u \in \mathcal{H}_{0}^{1}\) such that b(u, v) = ( f, v) for all \(v \in \mathcal{H}_{0}^{1}\). This problem is called a variational problem. The model problem (5.40)/(5.41) serves as an example for Lu = f; the corresponding bilinear form b(u, v) is defined in (5.45) and ( f, v) in (5.43). For the integrals (5.43) to exist, we additionally require f to be square integrable \((\,f \in \mathcal{L}^{2}\), compare Appendix C.3). Then ( f, v) exists because of the Schwarzian inequality (C.16). In a similar way, weak solutions are introduced for more general problems; the formulation of Definition 5.9 applies.

5.5.2 Approximation on Finite-Dimensional Subspaces

For a practical computation of a weak solution the infinite-dimensional space \(\mathcal{H}_{0}^{1}\) is replaced by a finite-dimensional subspace. Such finite-dimensional subspaces are spanned by basis functions φ i . Simple examples are the hat functions of Sect. 5.2. Recalling the important role splines play as basis functions, the finite-dimensional subspaces are denoted \(\mathcal{S}\), and are called finite-element spaces. As stated in Property 5.2(a), the hat functions φ 0, …, φ m span the space of polygons. Recall that each such polygon v can be represented as linear combination

$$\displaystyle{v =\sum _{ i=0}^{m}c_{ i}\varphi _{i}\,.}$$

The coefficients c i are uniquely determined by the values of v at the nodes, c i = v(x i ). We call hat functions “linear elements” because they consist of piecewise straight lines. Apart from linear elements, quadratic or cubic elements are also used; these are piecewise polynomials of second or third degree [79, 335, 382]. The attainable accuracy is different for basis functions consisting of higher-degree polynomials.

Since by definition the functions of the Sobolev space \(\mathcal{H}_{0}^{1}\) fulfill the homogeneous boundary conditions, each subspace does so as well. Again the subscript 0 indicates the realization of the homogeneous boundary conditions (5.41). A finite-dimensional subspace of \(\mathcal{H}_{0}^{1}\) is defined by

$$\displaystyle{ \mathcal{S}_{0}:= \left \{\,v =\sum \limits _{ i=0}^{m}c_{ i}\varphi _{i}\,\mid \,\varphi _{i} \in \mathcal{H}_{0}^{1}\,\right \}\,. }$$
(5.47)

Properties of \(\mathcal{S}_{0}\) are determined by the basis functions φ i . As mentioned earlier, basis functions with small supports give rise to sparse matrices. The partition (5.4) of \(\mathcal{D}\) is implicitly included in the definition of \(\mathcal{S}_{0}\) because this information is contained in the definition of the φ i . For our purposes the hat functions suffice. The larger m is, the better \(\mathcal{S}_{0}\) approximates the space \(\mathcal{H}_{0}^{1}\), since a finer discretization (smaller \(\mathcal{D}_{k}\)) allows the functions from \(\mathcal{H}_{0}^{1}\) to be approximated better by polygons. We denote the largest diameter of the \(\mathcal{D}_{k}\) by h, and ask for convergence. That is, we study the behavior of the error for h → 0 (essentially m → ∞).

In analogy to the variational problem expressed in connection with Definition 5.9, a discrete weak solution w is defined by replacing the space \(\mathcal{H}_{0}^{1}\) by a finite-dimensional subspace \(\mathcal{S}_{0}\):

Problem 5.10 (Discrete Weak Solution)

Find a \(w \in \mathcal{S}_{0}\) such that b(w, v) = ( f, v) for all \(v \in \mathcal{S}_{0}\).

The quality of the approximation relies on the discretization fineness h of \(\mathcal{S}_{0}\), which is occasionally emphasized by writing w h .

5.5.3 Quadratic Convergence

Having defined a weak solution u and a discrete approximation w, we turn to the error u − w. To measure the distance between functions in \(\mathcal{H}_{0}^{1}\) we use the norm \(\Vert \;\Vert _{1}\) ( Appendix C.3). That is, our first aim is to construct a bound on \(\Vert u - w\Vert _{1}\). Let us suppose that the bilinear form is continuous and \(\mathcal{H}^{1}\)-elliptic:

Assumptions 5.11 (Continuous \(\mathcal{H}^{1}\)-Elliptic Bilinear Form)

  1. (a)

    There is a γ 1 > 0 such that \(\vert b(u,v)\vert \leq \gamma _{1}\Vert u\Vert _{1}\Vert v\Vert _{1}\) for all \(u,v \in \mathcal{H}^{1}\,\).

  2. (b)

    There is a γ 2 > 0 such that \(b(v,v) \geq \gamma _{2}\Vert v\Vert _{1}^{2}\) for all \(v \in \mathcal{H}^{1}\,\).

The assumption (a) is the continuity, and the property in (b) is called \(\mathcal{H}^{1}\)-ellipticity. Under the Assumptions 5.11, the problem of finding a weak solution following Definition 5.9 possesses exactly one solution \(u \in \mathcal{H}_{0}^{1}\); the same holds true for Problem 5.10. This is guaranteed by the Theorem of Lax–Milgram [53, 79]. In view of \(\mathcal{S}_{0} \subseteq \mathcal{H}_{0}^{1}\),

$$\displaystyle{b(u,v) = (\,f,v)\quad \text{ for all }v \in \mathcal{S}_{0}\,.}$$

Subtracting b(w, v) = ( f, v) and invoking the bilinearity implies

$$\displaystyle{ b(w - u,v) = 0\quad \text{ for all }v \in \mathcal{S}_{0}\,. }$$
(5.48)

The property (5.48) is called the error-projection property. The Assumptions 5.11 and the error projection are the basic ingredients to obtain a bound on the error \(\Vert u - w\Vert _{1}\):

Lemma 5.12 (Céa)

Suppose the Assumptions  5.11 are satisfied. Then

$$\displaystyle{ \Vert u - w\Vert _{1} \leq { \gamma _{1} \over \gamma _{2}}\inf _{v\in \mathcal{S}_{0}}\Vert u - v\Vert _{1}\,. }$$
(5.49)

Proof

\(v \in \mathcal{S}_{0}\) implies \(\tilde{v}:= w - v \in \mathcal{S}_{0}\). Applying (5.48) for \(\tilde{v}\) yields

$$\displaystyle{b(w - u,w - v) = 0\quad \text{ for all }v \in \mathcal{S}_{0}\,.}$$

Therefore

$$\displaystyle\begin{array}{rcl} b(w - u,w - u)& =& b(w - u,w - u) - b(w - u,w - v) {}\\ & =& b(w - u,v - u)\,. {}\\ \end{array}$$

Applying the assumptions shows

$$\displaystyle\begin{array}{rcl} \gamma _{2}\Vert w - u\Vert _{1}^{2}& \leq & \vert b(w - u,w - u)\vert =\vert b(w - u,v - u)\vert {}\\ & \leq & \gamma _{1}\Vert w - u\Vert _{1}\Vert v - u\Vert _{1}\,, {}\\ \end{array}$$

from which

$$\displaystyle{\Vert w - u\Vert _{1} \leq { \gamma _{1} \over \gamma _{2}}\Vert v - u\Vert _{1}}$$

follows. Since this holds for all \(v \in \mathcal{S}_{0}\), the assertion of the lemma is proven.

Let us check whether the Assumptions 5.11 are fulfilled by the model problem (5.40)/(5.41). For (a) this follows from the Schwarzian inequality  (C.16) with the norms

$$\displaystyle{\Vert u\Vert _{1} = \left (\int _{\alpha }^{\beta }(u^{2} + u{\prime}^{2})\,\mathrm{d}x\right )^{1/2}\,\,,\ \Vert u\Vert _{ 0} = \left (\int _{\alpha }^{\beta }u^{2}\,\mathrm{d}x\right )^{1/2}\,,}$$

because

$$\displaystyle{\left (\int _{\alpha }^{\beta }u'v'\,\mathrm{d}x\right )^{2} \leq \left (\int _{\alpha }^{\beta }u{\prime}^{2}\,\mathrm{d}x\right )\left (\int _{\alpha }^{\beta }v{\prime}^{2}\,\mathrm{d}x\right ) \leq \Vert u\Vert _{ 1}^{2}\ \Vert v\Vert _{ 1}^{2}\,.}$$

The Assumption 5.11(b) can be derived from an inequality of Poincaré type

$$\displaystyle{\int _{\alpha }^{\beta }v^{2}\,\mathrm{d}x \leq (\beta -\alpha )^{2}\int _{ \alpha }^{\beta }v{\prime}^{2}\,\mathrm{d}x\,,}$$

which in turn is proven with the Schwarzian inequality ( Exercise 5.10). Adding ∫v′ 2 dx on both sides leads to

$$\displaystyle{\Vert v\Vert _{1}^{2} \leq [(\beta -\alpha )^{2} + 1]\,b(v,v)\,,}$$

from which the constant γ 2 of Assumption 5.11(b) results. Hence Céa’s lemma applies to the model problem.

The next question is how small the infimum in (5.49) may be. This is equivalent to the question of how closely the subspace \(\mathcal{S}_{0}\) can approximate the space \(\mathcal{H}_{0}^{1}\) ( Fig. 5.12). We will show that for hat functions and \(\mathcal{S}_{0}\) from (5.47) the infimum is of the order O(h). Again h denotes the maximum mesh size, and the notation w h reminds us that the discrete solution depends on the grid with a spacing symbolized by h. To apply Céa’s lemma, we need an upper bound for the infimum of \(\Vert u - v\Vert _{1}\). Such a bound is found easily by a specific choice of v, which is taken as an arbitrary interpolating polygon u I. Then by (5.49)

$$\displaystyle{ \Vert u - w_{h}\Vert _{1} \leq { \gamma _{1} \over \gamma _{2}}\inf _{v\in \mathcal{S}_{0}}\Vert u - v\Vert _{1} \leq \ { \gamma _{1} \over \gamma _{2}}\Vert u - u_{\mathrm{I}}\Vert _{1}\,. }$$
(5.50)

It remains to bound the error of interpolating polygons. This bound is provided by the following lemma, which is formulated for \(\mathcal{C}^{2}\)-smooth functions u:

Fig. 5.12

Approximation spaces

Lemma 5.13 (Error of an Interpolating Polygon)

For \(u \in \mathcal{C}^{2}\) let u I be an arbitrary interpolating polygon and h the maximal distance between two consecutive nodes. Then

  1. (a)

     \(\mathop{\max }\limits _{x}\vert u(x) - u_{\mathrm{I}}(x)\vert \leq { h^{2} \over 8} \max \vert u''(x)\vert \,\) ,

  2. (b)

     \(\mathop{\max }\limits _{x}\vert u'(x) - u_{\mathrm{I}}^{{\prime}}(x)\vert \leq h\max \vert u''(x)\vert \,\).
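Both bounds are easy to check numerically; a sketch for u(x) = sin x on [0, π], where max | u″ | = 1 (the grid sizes are our own choice):

```python
import numpy as np

a, b = 0.0, np.pi            # u(x) = sin(x), so max |u''| = 1
x = np.linspace(a, b, 100001)

for m in (10, 20, 40):
    nodes = np.linspace(a, b, m + 1)
    h = (b - a) / m
    uI = np.interp(x, nodes, np.sin(nodes))   # interpolating polygon u_I
    err0 = np.max(np.abs(np.sin(x) - uI))
    assert err0 <= h**2 / 8 + 1e-12           # Lemma 5.13(a)

    # the derivative of the polygon is piecewise constant (the slopes)
    slopes = np.diff(np.sin(nodes)) / np.diff(nodes)
    idx = np.clip(np.searchsorted(nodes, x) - 1, 0, m - 1)
    err1 = np.max(np.abs(np.cos(x) - slopes[idx]))
    assert err1 <= h + 1e-12                  # Lemma 5.13(b)
```

The first error decays quadratically with h, the derivative error only linearly, in accordance with the two bounds of the lemma.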

We leave the proof to the reader ( Exercise 5.11). Lemma 5.13 asserts

$$\displaystyle{\Vert u - u_{\mathrm{I}}\Vert _{1} = O(h)\,,}$$

which together with (5.50) implies the claimed error statement

$$\displaystyle{ \Vert u - w_{h}\Vert _{1} = O(h)\,. }$$
(5.51)

Recall that this assertion is based on a continuous and \(\mathcal{H}^{1}\)-elliptic bilinear form and on hat functions φ i . The O(h)-order in (5.51) is dominated by the unfavorable O(h)-order of the first-order derivative in Lemma 5.13(b). This low order is at variance with the actually observed O(h 2)-order attained by the approximation w h itself (not its derivative). In fact, the quadratic order holds. The final result is

$$\displaystyle{ \Vert u - w_{h}\Vert _{0} \leq Ch^{2}\Vert u\Vert _{ 2} }$$
(5.52)

for a constant C. This result is proven with the following lemma, which is based on a tricky idea due to Nitsche.

Lemma 5.14 (Nitsche)

Assume b is a symmetric bilinear form satisfying Assumptions  5.11 , and u and w are defined as above. Then

$$\displaystyle{\Vert u - w\Vert _{1} \leq Kh^{1}\Vert \,f\Vert _{ 0}\;\mathit{\text{ implies }}\;\Vert u - w\Vert _{0} \leq Ch^{2}\Vert \,f\Vert _{ 0}\,.}$$

Proof

 Consider the auxiliary problem \(Lz =\tilde{ f}:= u - w\), with weak version

$$\displaystyle{b(z,\tilde{v}) = (\,\tilde{f},\tilde{v})_{0}\quad \text{for all }\tilde{v} \in \mathcal{H}_{0}^{1}\,,}$$

which defines z. Choose specifically \(\tilde{v} = u - w =\tilde{ f}\). Then

$$\displaystyle{b(z,u - w) = (u - w,u - w)_{0} =\Vert u - w\Vert _{0}^{2}\,.}$$

Invoking the error-projection property (5.48) we note

$$\displaystyle{0 = b(u - w,v) = b(v,u - w)\quad \text{ for all }v \in \mathcal{S}_{0}\,.}$$

Subtracting this yields

$$\displaystyle{b(z - v,u - w) =\Vert u - w\Vert _{0}^{2}\quad \text{ for all }\;v \in \mathcal{S}_{ 0}\,.}$$

We apply the continuity of b,

$$\displaystyle{\Vert u - w\Vert _{0}^{2} \leq \gamma _{ 1}\Vert z - v\Vert _{1}\;\Vert u - w\Vert _{1}\quad \text{for all }v \in \mathcal{S}_{0}\,,}$$

and choose specifically v as the finite-element approximation of z. Then

$$\displaystyle{\Vert u - w\Vert _{0}^{2} \leq \gamma _{ 1}K_{1}h^{1}\Vert \,\tilde{f}\Vert _{ 0} \cdot K_{2}h^{1}\Vert \,f\Vert _{ 0} = Ch^{2}\Vert u - w\Vert _{ 0}\;\Vert \,f\Vert _{0}\,,}$$

from which the assertion follows.

This error of the order h 2 can be observed for the examples of Sect. 5.4, but not easily. The error is somewhat hidden among the other errors, namely, localization error, interpolation error, and the error of the time discretization.
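For the model problem (5.40)/(5.41) itself, the quadratic order is clean and easy to observe; a Python sketch with linear elements on a uniform grid (as a simplification, a lumped load vector h f(x i ) replaces the exact integrals ( f, φ i )):

```python
import numpy as np

def fem_solve(m):
    """Linear FE for -u'' = f on (0,1), u(0) = u(1) = 0, uniform grid;
    f is chosen such that the exact solution is u(x) = sin(pi x)."""
    h = 1.0 / m
    x = np.linspace(0.0, 1.0, m + 1)
    f = np.pi**2 * np.sin(np.pi * x)
    n = m - 1                                  # number of interior nodes
    # stiffness matrix b(phi_i, phi_j): tridiagonal (1/h) * [-1, 2, -1]
    A = (np.diag(2.0 * np.ones(n)) + np.diag(-np.ones(n - 1), 1)
         + np.diag(-np.ones(n - 1), -1)) / h
    rhs = h * f[1:-1]                          # lumped load vector
    w = np.zeros(m + 1)
    w[1:-1] = np.linalg.solve(A, rhs)
    # discrete L2 error against the exact solution
    return np.sqrt(h * np.sum((w - np.sin(np.pi * x))**2))

e1, e2 = fem_solve(20), fem_solve(40)
print(e1 / e2)   # ratio close to 4: halving h quarters the error
```

Halving the mesh size reduces the error by a factor of about four, the h² decay of (5.52).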

The derivations of this section have been focused on the model problem (5.40)/(5.41) with a second-order differential equation and one independent variable x (n = 1), and have been based on linear elements. Most of the assertions can be generalized to higher-order differential equations, to higher-dimensional domains (n > 1), and to nonlinear elements. For example, in case the elements in \(\mathcal{S}\) are polynomials of degree k, and the differential equation is of order 2l, \(\mathcal{S}\subseteq \mathcal{H}^{l}\), and the corresponding bilinear form on \(\mathcal{H}^{l}\) satisfies the Assumptions 5.11 with norm ∥ ∥ l , then the inequality

$$\displaystyle{\Vert u - w_{h}\Vert _{l} \leq Ch^{k+1-l}\Vert u\Vert _{ k+1}}$$

holds. This general statement includes for k = 1,  l = 1 the special case of Eq. (5.52) discussed above. For the analysis of the general case, we refer to [79, 162]. This includes boundary conditions more general than the homogeneous Dirichlet conditions of (5.41).

5.6 Notes and Comments

On Sect. 5.1

As an alternative to piecewise defined finite elements one may use polynomials φ j that are defined globally on \(\mathcal{D}\), and that are pairwise orthogonal. The orthogonality is then the reason why many integrals vanish. Methods of this type are called spectral methods. Since the φ i are globally smooth on \(\mathcal{D}\), spectral methods can produce high accuracies. In another context, spectral methods were applied in [142]. For historical remarks on Ritz–Galerkin type methods, see [145].

Specifically designed basis functions can be generated by some low-dimensional approximation, comparable to PCA in finite dimensions ( Exercise 43). Functions that represent preferred patterns of the solution are suitable. Then the number N of modes φ i can be small. Such methods are described under the heading proper orthogonal decomposition (POD), or Karhunen–Loève expansion.

On Sect. 5.2

In the early stages of their development, finite-element methods were applied intensively in structural engineering. In this field, the stiffness matrix and the mass matrix have a physical meaning, which led to these names [382].

On Sect. 5.3

The approximation ∑w i (τ)φ i (x) for \(\hat{y}\) is a one-dimensional finite-element approach. The geometry of the grid and the accuracy resemble the finite-difference approach. A two-dimensional approach as in

$$\displaystyle{\sum w_{i}\varphi _{i}(x,\tau )}$$

with two-dimensional hat functions and constant w i is more involved and more flexible. Sections 5.3.2–5.3.4 largely follow [376].

On Sect. 5.4

For the calculation of the local integrals on an arbitrary triangle \(\mathcal{D}_{k}\) consult the special FEM literature, such as [335]. In general an irregular triangulation better exploits the potential adaptivity of FEM. In particular, close to the barriers a fine mesh is required for high accuracy [304]. Since the gradient of u varies with time, a dynamic mesh refinement might be advisable, provided accuracy or stability do not deteriorate. For American options, boundary conditions V = Ψ along the boundary are recommendable. For an illustration of assembling, see Topic 12 of the Topics fCF.

On Sect. 5.5

The assumption \(u \in \mathcal{C}^{2}\) in Lemma 5.13 can be weakened to \(u'' \in \mathcal{L}^{2}\) [351]. For domains \(\mathcal{D}\subseteq \mathbb{R}^{2}\) the claim of Lemma 5.13 holds analogously; the second-order derivative u″ is then replaced by the Hessian matrix of the second-order derivatives of u. This can be applied to mesh adaption, where one attempts to place nodes such that the Hessian is equilibrated across the mesh. The finite-dimensional function space \(\mathcal{S}_{0}\) in (5.47) is assumed to be a subspace of \(\mathcal{H}_{0}^{1}\). Elements with this property are called conforming elements. A more accurate notation for \(\mathcal{S}_{0}\) of (5.47) is \(\mathcal{S}_{0}^{1}\). In the general case, conforming elements are characterized by \(\mathcal{S}^{l} \subseteq \mathcal{H}^{l}\). In the representation of v in Eq. (5.47) we avoid discussing the technical issue of how to organize different types of boundary conditions.

There are also smooth basis functions φ, for example, cubic Hermite polynomials. For sufficiently smooth solutions, such basis functions produce higher accuracy than hat functions do. For the accuracy of finite-element methods consult, for example, [2, 19, 53, 79, 162, 351].

On Other Methods

Finite-element methods are frequently used for approximating exotic options, in particular in multidimensional situations. For different types of options special methods have been developed. For applications, computational results and accuracies see also [2, 361, 362]. Front-fixing has been applied with finite elements in [188]. The accuracy aspect is also treated in [144]. Ritz–Galerkin methods are used with wavelet functions in [185, 263]; the latter paper is specifically devoted to stochastic volatility. A penalty approach with FEM is discussed in [230], where rectangular subdomains are furnished with basis functions as products of one-dimensional hat functions of the type φ(x, y) = φ i (x)φ j ( y).

5.7 Exercises

5.1 (Elliptical Probability Curves).

Suppose the situation of two asset prices S 1(t) and S 2(t) for t > 0 governed by GBM (3.35), with initial price point (S 1(0), S 2(0)). Barriers of a barrier option can be aligned such that the probability of (S 1(t), S 2(t)) reaching the barrier has the same constant value. Define Y 1: = logS 1, Y 2: = logS 2.

  1. (a)

    Show that the curve of constant probability in the ( Y 1, Y 2)-plane has an elliptical shape.

  2. (b)

    Let the covariance matrix be

    $$\displaystyle{\varSigma = \left (\begin{array}{*{10}c} \sigma _{1}^{2} & \rho \sigma _{1}\sigma _{2} \\ \rho \sigma _{1}\sigma _{2} & \sigma _{2}^{2}\\ \end{array} \right )\,.}$$

    Calculate its eigenvalues and eigenvectors.

  3. (c)

    Sketch representative ellipses in a (Y 1, Y 2)-plane. How do they depend on ρ?

5.2 (Cubic B-Spline).

Suppose an equidistant partition of an interval is given with mesh size h = x k+1 − x k . Cubic B-splines have a support of four subintervals. In each subinterval the spline is a polynomial piece of degree three. Apart from special boundary splines, the cubic B-splines φ i are determined by the requirements

$$\displaystyle\begin{array}{rcl} \varphi _{i}(x_{i})& =& 1 {}\\ \varphi _{i}(x)& \equiv & 0\quad \text{ for }x <x_{i-2} {}\\ \varphi _{i}(x)& \equiv & 0\quad \text{ for }x> x_{i+2} {}\\ \varphi _{i}& \in & \mathcal{C}^{2}(-\infty,\infty )\,. {}\\ \end{array}$$

To construct these φ i proceed as follows:

  1. (a)

    Construct a spline S(x) that satisfies the above requirements for the special nodes

    $$\displaystyle{\tilde{x}_{k}:= -2 + k\quad \mbox{ for }k = 0,1,\ldots,4\,.}$$
  2. (b)

    Find a transformation T i (x), such that φ i = S(T i (x)) satisfies the requirements for the original nodes.

  3. (c)

    For which i, j does φ i φ j = 0 hold?

5.3 (Finite-Element Matrices).

For the hat functions φ from Sect. 5.2 calculate for arbitrary subinterval \(\mathcal{D}_{k}\) all nonzero integrals of the form

$$\displaystyle{\int \varphi _{i}\varphi _{j}\,\mathrm{d}x,\quad \int \varphi _{i}^{{\prime}}\varphi _{ j}\,\mathrm{d}x,\quad \int \varphi _{i}^{{\prime}}\varphi _{ j}^{{\prime}}\,\mathrm{d}x}$$

and represent them as local 2 × 2 matrices.

5.4 (Calculating Options with Finite Elements).

Design an algorithm for the pricing of standard options by means of finite elements. To this end proceed as outlined in Sect. 5.3. Start with a simple version using an equidistant discretization step Δx. If this works properly, change the algorithm to a version with a nonequidistant x-grid. Distribute the nodes x i more densely around x = 0. Always place a node at the strike.

5.5 (Black–Scholes Equation in Divergence Form).

  1. (a)

    Prove the equivalence of (5.31) and (5.32), where D and b are given by (5.33). Specialize this to the one-dimensional case of the Black–Scholes equation.

  2. (b)

    Show

    $$\displaystyle{b^{\mbox{ $tr$}}\nabla u + ru = \nabla \cdot (bu) +\gamma u}$$

    and determine γ for the two-dimensional case, and for the Black–Scholes equation.

  3. (c)

    With the transformation

    $$\displaystyle{x:=\log ({ S_{1} \over K_{1}}),\ y:=\log ({ S_{2} \over K_{2}})}$$

    and writing u(x, y, t) for V leads to the PDE

    $$\displaystyle\begin{array}{rcl} u_{t}& +&{ 1 \over 2}\sigma _{1}^{2}u_{ xx} + (r -\delta _{1} -{ 1 \over 2}\sigma _{1}^{2})u_{ x} - ru {}\\ & +&{ 1 \over 2}\sigma _{2}^{2}u_{ yy} + (r -\delta _{2} -{ 1 \over 2}\sigma _{2}^{2})u_{ y} +\rho \sigma _{1}\sigma _{2}u_{xy} = 0\,.\ {}\\ \end{array}$$

    What are the matrix D and the vector b such that we arrive at (5.32)?

5.6 (Outward Normals).

The boundary \(\partial \mathcal{D}\) of the trapezoidal domain \(\mathcal{D}\) in Fig. 5.5 consists of four straight lines. What are the four unit outward vectors n orthogonal to \(\partial \mathcal{D}\)? Give a parameter representation of the boundary.

5.7 (Gradient on a Triangle).

Consider hat functions φ on a triangular element \(\mathcal{D}_{k}\) with vertex nodes numbers \(\mathcal{I}_{k} =\{ i,j,l\}\), and the local plane on \(\mathcal{D}_{k}\) represented by

$$\displaystyle{w(x,y) = w_{i}\varphi _{i}(x,y) + w_{j}\varphi _{j}(x,y) + w_{l}\varphi _{l}(x,y)\,.}$$
  1. (a)

    In the three-dimensional (x, y, w)-space let the plane w(x, y) = c 1 + c 2x + c 3y interpolate the three points (x i , y i , w i ), i = 1, 2, 3 (local node numbering). That is,

    $$\displaystyle{\left (\begin{array}{*{10}c} 1&x_{1} & y_{1} \\ 1&x_{2} & y_{2} \\ 1&x_{3} & y_{3}\\ \end{array} \right )\left (\begin{array}{*{10}c} c_{1} \\ c_{2} \\ c_{3}\end{array} \right ) = \left (\begin{array}{*{10}c} w_{1} \\ w_{2} \\ w_{3}\end{array} \right ),}$$

    shortly Ac = w. Establish a formula for the gradient ∇w = (c 2, c 3)tr, showing that there is a (2 × 3)-matrix G k such that

    $$\displaystyle{\nabla w = G_{k}w\,.}$$

    Hint: Use Cramer’s rule; | F k | is the area of the triangle, where

    $$\displaystyle{F_{k}:={ 1 \over 2}\det (A)\,.}$$
  2. (b)

    Show

    $$\displaystyle{(\nabla \varphi _{i}\,\vert \,\nabla \varphi _{j}\,\vert \,\nabla \varphi _{l}) = G_{k}\,.}$$
  3. (c)

    Show

    $$\displaystyle{\int _{\mathcal{D}_{k}}\nabla \varphi _{i}^{\mbox{ $tr$}}\nabla \varphi _{ j}\,\mathrm{d}x\,\mathrm{d}y = \nabla \varphi _{i}^{\mbox{ $tr$}}\nabla \varphi _{ j}\,\vert F_{k}\vert \,,}$$

    and all nine integrals of the element stiffness matrix are obtained by

    $$\displaystyle{\vert F_{k}\vert G_{k}^{\mbox{ $tr$}}G_{ k}\,.}$$
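Parts (a)–(c) can be checked numerically. The following minimal Python sketch (the function name element_matrices is just illustrative) obtains \(G_{k}\) as the last two rows of \(A^{-1}\) — equivalent to applying Cramer’s rule — and forms the element stiffness matrix \(\vert F_{k}\vert G_{k}^{\mathrm{tr}}G_{k}\):

```python
import numpy as np

def element_matrices(p1, p2, p3):
    """Gradient matrix G_k and element stiffness matrix for a linear
    triangle with vertices p1, p2, p3 (each a pair (x, y))."""
    A = np.array([[1.0, p1[0], p1[1]],
                  [1.0, p2[0], p2[1]],
                  [1.0, p3[0], p3[1]]])
    Fk = 0.5 * np.linalg.det(A)   # signed area: F_k = det(A)/2
    G = np.linalg.inv(A)[1:, :]   # rows 2,3 of A^{-1}: since c = A^{-1}w,
                                  # (c_2, c_3)^tr = G w, i.e. G is G_k
    S = abs(Fk) * G.T @ G         # element stiffness |F_k| G_k^tr G_k
    return G, S, abs(Fk)
```

For the unit triangle with vertices (0, 0), (1, 0), (0, 1) this yields \(\vert F_{k}\vert = 1/2\), and the columns of \(G_{k}\) are the constant gradients \(\nabla \varphi _{i},\nabla \varphi _{j},\nabla \varphi _{l}\), as asserted in part (b).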

5.8 (Assembling).

Consider the domain \(\mathcal{D}:= \{(x,y)\,\vert \;x \geq 0,\,y \geq 0,\;1 \leq x + y \leq 2\}\) tiled by 12 triangles \(\mathcal{D}_{k}\), where triangles and vertices are numbered as in Fig. 5.13.

  1. (a)

    Set up the index set \(\mathcal{I}\) with entries \(\mathcal{I}_{k} = \{i_{k},j_{k},l_{k}\}\), which assigns node numbers to the kth triangle, for 1 ≤ k ≤ 12.

    Fig. 5.13 Specific triangulation and numbering, see Exercise 5.8

  2. (b)

    Formulate the assembling algorithm that builds up the global stiffness matrix out of the element stiffness matrices

    $$\displaystyle{\left (\begin{array}{*{10}c} s_{11}^{(k)} & s_{12}^{(k)} & s_{13}^{(k)} \\ s_{21}^{(k)} & s_{22}^{(k)} & s_{23}^{(k)} \\ s_{31}^{(k)} & s_{32}^{(k)} & s_{33}^{(k)}\\ \end{array} \right )}$$

    for a general index set \(\mathcal{I}\) and 1 ≤ k ≤ m.

  3. (c)

    The example of Fig. 5.13 leads to a banded stiffness matrix. What is the bandwidth?
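The assembling loop of part (b) can be sketched as follows (Python, with 0-based node indices; element_stiffness is a hypothetical callback returning the matrix \(s^{(k)}\)):

```python
import numpy as np

def assemble(n_nodes, index_sets, element_stiffness):
    """Add each (3 x 3) element stiffness matrix s^(k) into the global
    stiffness matrix, guided by the index sets I_k = {i_k, j_k, l_k}."""
    S = np.zeros((n_nodes, n_nodes))
    for k, nodes in enumerate(index_sets):
        s = element_stiffness(k)      # the (3 x 3) matrix s^{(k)}
        for a, p in enumerate(nodes):
            for b, q in enumerate(nodes):
                S[p, q] += s[a][b]    # scatter local entry to global position
    return S
```

Entries belonging to nodes shared by several triangles accumulate the corresponding local contributions, which is what produces the banded structure asked about in part (c).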

5.9 (Variable Volatility (Project)).

For variable volatility σ(S, t) and constant K, T, r, δ, PDEs of the type

$$\displaystyle{{\partial y \over \partial \tau } -{ 1 \over 2}\hat{\sigma }^{2}(x,\tau )\left ({\partial ^{2}y \over \partial x^{2}} -{ 1 \over 4}y\right ) = 0}$$

are to be solved, with τ = T − t and the transformations S ↦ x, V ↦ y from the Black–Scholes model given by (A.25), (A.26); consult Appendix A.6.

  1. (a)

    For an American put, apply these transformations to derive from \(V (S,t) \geq (K - S)^{+}\) an inequality y(x, τ) ≥ g(x, τ).

  2. (b)

    Carry out the finite-element formulation for the linear complementarity problem analogously as in Sect. 5.3.4.

  3. (c)

    Integrals will include local integrals

    $$\displaystyle{\int \sigma ^{2}(x,\tau )\varphi _{ i}\varphi _{j}\,\mathrm{d}x\,,\quad \int \sigma ^{2}(x,\tau )\varphi _{ i}^{{\prime}}\varphi _{ j}\,\mathrm{d}x\,.}$$

    Apply Simpson’s quadrature rule

    $$\displaystyle{\int _{a}^{b}f(x)dx \approx { b - a \over 6} \left [f(a) + 4f\left ({a + b \over 2} \right ) + f(b)\right ]}$$

    to approximate the above local integrals.

  4. (d)

    Set up a finite-element code, and test it with the artificial function [128]

    $$\displaystyle{\sigma (S):= 0.3 -{ 0.2 \over \log (S/K)^{2} + 1}\,.}$$
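For parts (c) and (d), Simpson’s rule and the test volatility can be coded directly. A minimal Python sketch (the default strike K = 1 is only an illustrative choice):

```python
import math

def simpson(f, a, b):
    """Simpson's quadrature rule on [a, b]."""
    return (b - a) / 6.0 * (f(a) + 4.0 * f(0.5 * (a + b)) + f(b))

def sigma(S, K=1.0):
    """Artificial volatility of part (d): 0.3 - 0.2/(log(S/K)^2 + 1)."""
    return 0.3 - 0.2 / (math.log(S / K) ** 2 + 1.0)
```

Simpson’s rule is exact for polynomials up to degree three, so on elements where σ is (approximated as) constant it integrates the quadratic products \(\varphi _{i}\varphi _{j}\) of the hat functions exactly.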

5.10.

Assume a function v(ζ) with \(\alpha \leq \zeta \leq \beta \) and v(α) = 0.

  1. (a)

    Show

    $$\displaystyle{(v(\zeta ))^{2} \leq (\zeta -\alpha )\int _{\alpha }^{\zeta }(v^{{\prime}}(x))^{2}\,\mathrm{d}x\,.}$$

    Hint: Recall \(v(\zeta ) =\int _{\alpha }^{\zeta }v^{{\prime}}(x)\,\mathrm{d}x\), and apply the Cauchy–Schwarz inequality (C.16).

  2. (b)

    Use (a) to show

    $$\displaystyle{\int _{\alpha }^{\beta }(v(\zeta ))^{2}\,\mathrm{d}\zeta \leq { 1 \over 2}(\beta -\alpha )^{2}\int _{ \alpha }^{\beta }(v^{{\prime}}(x))^{2}\,\mathrm{d}x\,.}$$
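As a pointer for part (a): applying the hint with the factors 1 and \(v^{{\prime}}\) gives

$$\displaystyle{(v(\zeta ))^{2} = \left (\int _{\alpha }^{\zeta }1 \cdot v^{{\prime}}(x)\,\mathrm{d}x\right )^{2} \leq \int _{\alpha }^{\zeta }1^{2}\,\mathrm{d}x\;\int _{\alpha }^{\zeta }(v^{{\prime}}(x))^{2}\,\mathrm{d}x = (\zeta -\alpha )\int _{\alpha }^{\zeta }(v^{{\prime}}(x))^{2}\,\mathrm{d}x\,.}$$

Part (b) then follows by integrating over ζ and bounding the inner integral by the integral over the full interval [α, β].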

5.11.

Prove Lemma 5.13, and for \(u \in \mathcal{C}^{2}\) the assertion \(\|u - w_{h}\|_{1} = O(h)\).