1 Introduction

The development of modern option pricing began with the publication of the Black–Scholes (BS) option pricing formula in 1973, which was used in computing the value of the European option pricing Black and Scholes (1973). This model is so important in financial mathematics, that Robert Merton and Myron Scholes shared the 1997 Nobel Prize for economics (Fischer Black having died in 1995) because of this innovative Merton (1973) and Myneni (1992). The BS formula computes the value of an European option pricing based on the underlying asset, strike price, volatility of the asset, and the expiration time of option pricing Hull (1997). The European option pricing has the ability of exercising only at expiring date, while the American option pricing could be permitted at any time during the life of option. So, the American option pricing is more complicated. Analytical solutions of BS models for American option pricing problems are seldom available, therefore a numerical techniques must be used. Using numerical methods for solving American option pricing problem has been the subject of numerous researches during the last decades Barraquand and Pudet (1994), Amin and Khanna (1994) and Broadie and Detemple (1996). Elementary introductions to the topic can be found out, in Kwok (1998), Kwok (2009), Ross (1999), San-Lin (2000), Meng et al. (2014), Wilmott (1998), Wilmott et al. (1993), Yonggeng et al. (2002) and Song and Yang (2014). Typically, at any time there is a particular value of the asset, which makes the boundary between two regions: in one side the option pricing should be hold and the other side one should exercise it. An extended BS model with a non-linear penalty source term is considered. The penalty method for solving option pricing problems was introduced by Zvan et al. (1998). This term allows applicability of BS model beyond the basic European option pricing. In the penalty method, the free boundary is removed by adding a small and continuous penalty term to the BS model. Consequently, it is converted to a non-linear problem and can be solved on a fixed domain. The non-linear problem will be dealt with by MOL. Afterward, the non-linear problem is solved by predictor–corrector method which depends on the parameters \(\theta \) and \(\phi \) Fitzsimons et al. (1992). The main idea of this paper is showing that for many test parameters the backward finite difference (FD) approximation is more stable than the central and forward techniques, which is different from what presented in Khaliq et al. (2006). Then we use a rational approximation to speed up the convergence rate in time variable with respect to \(\theta \)-method which is of order \(O(\varDelta t)+O(\varDelta S)^2\) for the American option pricing and it is shown in Fitzsimons et al. (1992) that the order of the given method is \(O(\varDelta t)^2+O(\varDelta S)^2\).

The remainder of the paper is organized as follows: In Sect. 2, the BS model for American put option pricing is introduced using the penalty method. The order of accuracy for the \(\theta \)-method and some stability conditions for the penalty method are given in Sect. 3. In Sect. 4, we use the MOL for converting partial differential equation (PDE) of BS model to an ordinary differential equation (ODE). Then we combine a particular predictor–corrector method Fitzsimons et al. (1992) with rational approximation to get high order of accuracy for American put option. In Sect. 5, the numerical results are presented for various time and space step sizes. Finally, a brief conclusion is given in Sect. 6.

2 BS Model for American Put Option Problem by Penalty Method

The BS model for American put option pricing takes the form of a free boundary problem. Letting \(S(t)\) be the price at time \(t\), then the value \(V(S,t)\) of an American put option is formulated as follows Neftci (2000)

$$\begin{aligned}&\frac{\partial V}{\partial t} +\frac{1}{2} \sigma ^2 S^2 \frac{\partial ^2V}{\partial S^2}+rS \frac{\partial V}{\partial S}-rV=0, \quad S >\overline{S}(t), \quad 0 \le t < T,\nonumber \\&V(S,T)=\max (E-S,0), \quad S \ge 0,\nonumber \\&V(\overline{S}(t),t)=E-\overline{S}(t),\nonumber \\&\lim _{s\rightarrow \infty } V(S,t)=0,\nonumber \\&\overline{S}(T)=E,\nonumber \\&V(S,t)=E-S, \quad 0\le S< \overline{S}(t), \end{aligned}$$
(1)

where \(\overline{S}(t)\) denotes the free boundary. The parameters \(\sigma \), \(r\) and \(E\) denote the volatility of the underlying asset, the interest rate and the exercise price of the option, respectively. Note that, the value \(V(S,t)\) of the option must satisfy the positivity constraint

$$\begin{aligned} V(S,t)\ge \max (E-S,0),\quad S\ge 0,\;\;0\le t\le T, \end{aligned}$$
(2)

because, for American put option an early exercise is permitted.

Analytical solution of the BS model of American option pricing problem is seldom available, therefore a numerical techniques must be used. Thus a penalty term is added to BS model of American put option and give a non-linear parabolic PDE on a fixed domain (see Appendices 1 and 2 for more details). The penalty term is given by Zvan et al. (1998)

$$\begin{aligned} \frac{\epsilon C}{V+\epsilon -q(S)}, \end{aligned}$$
(3)

where \(0 < \epsilon << 1 \) is a small regularization parameter, \(C \ge rE\) is a positive constant and \(q(S)\) is defined by:

$$\begin{aligned} q(S)=E-S. \end{aligned}$$
(4)

So the free boundary problem (1) is reduced to the following nonlinear PDE:

$$\begin{aligned}&\frac{\partial V}{\partial t}+\frac{1}{2}\sigma ^2 S^2 \frac{\partial ^2V}{\partial S^2}+rS\frac{\partial V}{\partial S}-rV+\frac{\epsilon C}{V+\epsilon -q(S)}=0, S\ge 0,t\in [0,T)\nonumber \\&V(S,T)=\text{ max }(E-S,0),\nonumber \\&V(0,t)=E,\nonumber \\&V(S,t)=0,\;\;\text{ as }\quad S\longrightarrow \infty . \end{aligned}$$
(5)

3 The \(\theta \)-Method and Its Consistency

Following discretization on the domain \([0,S_{\infty } ] \times [0,T],\) the well-known \(\theta \)-method applied to (5) takes the form Duffy (2006) (We apply forward for time steps and central for space steps):

$$\begin{aligned}&\frac{V^{n+1}_j\!-\!V^n_j}{\varDelta t}\,+\,\frac{1}{2}\sigma ^2 S^2_j\left[ \theta \frac{V^{n+1}_{j-1}\!-\!2V^{n+1}_j+V^{n+1}_{j+1}}{\varDelta S^2}\,+\,(1-\theta ) \frac{V^{n}_{j-1}\!-\!2V^{n}_j+V^{n}_{j+1}}{\varDelta S^2} \right] \nonumber \\&\quad \,+\, rS_j\left[ \theta \frac{V^{n+1}_{j+1}-V^{n+1}_{j-1}}{2\varDelta S}+(1-\theta )\frac{V^{n}_{j+1}-V^{n}_{j-1}}{2\varDelta S}\right] -r\left[ \theta V^{n+1}_j+(1-\theta )V^n_j\right] \nonumber \\&\quad \,+\,\theta \frac{\epsilon C}{V^{n+1}_j+\epsilon -q(S_j)} +(1-\theta )\frac{\epsilon C}{V^n_j+\epsilon -q(S_j)}=0. \end{aligned}$$
(6)

We consider \(\theta \)-methods that arise by treating the penalty term explicitly in (6), that is, by replacing \(V^n_j\) by \(V^{n+1}_j\) in that term, we obtain

$$\begin{aligned}&\frac{V^{n+1}_j\!-\!V^n_j}{\varDelta t}\,+\,\frac{1}{2}\sigma ^2 S^2_j\left[ \theta \frac{V^{n+1}_{j-1}\!-\!2V^{n+1}_j+V^{n+1}_{j+1}}{\varDelta S^2}\,+\,(1-\theta ) \frac{V^{n}_{j-1}\!-\!2V^{n}_j+V^{n}_{j+1}}{\varDelta S^2}\right] \nonumber \\&\quad \,+\, rS_j\left[ \theta \frac{V^{n+1}_{j+1}-V^{n+1}_{j-1}}{2\varDelta S}+(1-\theta )\frac{V^{n}_{j+1}-V^{n}_{j-1}}{2\varDelta S}\right] -r\left[ \theta V^{n+1}_j+(1-\theta )V^n_j\right] \nonumber \\&\quad +\,\frac{\epsilon C}{V^{n+1}_j+\epsilon -q(S_j)} =0 . \end{aligned}$$
(7)

A detailed analysis is provided in Nielsen et al. (2002) for certain schemes applied to (5) using upwind differencing where it was concluded that a time step constraint was needed to preserve the positivity constraint if the penalty term was treated explicitly. In particular, for the corresponding linear (or semi-) implicit backward Euler method the constraint is

$$\begin{aligned} \varDelta t\le \frac{\epsilon }{rE}, \end{aligned}$$
(8)

see Khaliq et al. (2006).

Now we prove that the \(\theta \)-method has the order \(O(\varDelta t)+O(\varDelta S)^2\) . By using Taylor expansion for \(V^{n}_{j+1},\;V^{n}_{j-1}\) and \(V^{n+1}_{j}\), we get

$$\begin{aligned} V^{n+1}_j&= V^n _j+\varDelta t\left( \frac{\partial V}{\partial t}\right) ^n_j+\frac{\left( \varDelta t\right) ^2}{2!}\left( \frac{\partial ^2 V}{\partial t^2}\right) ^n_j+\cdots , \nonumber \\ V^n_{j+1}&= V^n_j+\varDelta S \left( \frac{\partial V}{\partial S}\right) ^n_j+\frac{(\varDelta S)^2}{2!}\left( \frac{\partial ^2 V}{\partial S^2}\right) ^n_j+\cdots , \nonumber \\ V^n_{j-1}&= V^n_j-\varDelta S \left( \frac{\partial V}{\partial S}\right) ^n_j+\frac{(\varDelta S)^2}{2!}\left( \frac{\partial ^2 V}{\partial S^2}\right) ^n_j-\cdots . \end{aligned}$$
(9)

Substituting from (9) in (7) implies

$$\begin{aligned}&\frac{V^n _j+\varDelta t\left( \frac{\partial V}{\partial t}\right) ^n_j+\frac{(\varDelta t)^2}{2!}\left( \frac{\partial ^2 V}{\partial t^2}\right) ^n_j+\cdots - V^n_j}{\varDelta t} \nonumber \\&\quad +\,\frac{1}{2}\sigma ^2 S^2_j\bigg [\theta \frac{V^{n+1}_j-\varDelta S \left( \frac{\partial V}{\partial S}\right) ^{n+1}_j+\frac{(\varDelta S)^2}{2!}\left( \frac{\partial ^2 V}{\partial S^2}\right) ^{n+1}_j-\cdots -2V^{n+1}_j}{\varDelta S^2}\nonumber \\&\quad +\, \frac{V^{n+1}_j+\varDelta S \left( \frac{\partial V}{\partial S}\right) ^{n+1}_j+\frac{(\varDelta S)^2}{2!}\left( \frac{\partial ^2 V}{\partial S^2}\right) ^{n+1}_j+\cdots }{\varDelta S^2}\nonumber \\&\quad +\,(1-\theta )\frac{V^n_j-\varDelta S \left( \frac{\partial V}{\partial S}\right) ^n_j+\frac{(\varDelta S)^2}{2!}\left( \frac{\partial ^2 V}{\partial S^2}\right) ^n_j+\cdots }{\varDelta S^2} \nonumber \\&\qquad \frac{-2V^n_j+V^n_j+\varDelta S \left( \frac{\partial V}{\partial S}\right) ^n_j+\frac{(\varDelta S)^2}{2!}\left( \frac{\partial ^2 V}{\partial S^2}\right) ^n_j+\cdots }{\varDelta S^2}\bigg ]\nonumber \\&\quad +\,rS_j\bigg [\theta \frac{V^{n+1}_j+\varDelta S \left( \frac{\partial V}{\partial S}\right) ^{n+1}_j+\frac{(\varDelta S)^2}{2!}\left( \frac{\partial ^2 V}{\partial S^2}\right) ^{n+1}_j\cdots -V^{n+1}_j}{2\varDelta S}\nonumber \\&\quad \frac{+\,\varDelta S \left( \frac{\partial V}{\partial S}\right) ^{n+1}_j-\frac{(\varDelta S)^2}{2!}\left( \frac{\partial ^2 V}{\partial S^2}\right) ^{n+1}_j+\cdots }{2\varDelta S}\nonumber \\&\quad +\,(1-\theta )\frac{V^n_j+\varDelta S \left( \frac{\partial V}{\partial S}\right) ^n_j+\frac{(\varDelta S)^2}{2!}\left( \frac{\partial ^2 V}{\partial S^2}\right) ^n_j+\cdots }{2\varDelta S}\nonumber \\&\quad \frac{- V^n_j+\varDelta S \left( \frac{\partial V}{\partial S}\right) ^n_j-\frac{(\varDelta S)^2}{2!}\left( \frac{\partial ^2 V}{\partial S^2}\right) ^n_j+\cdots }{2\varDelta S}\bigg ]\nonumber \\&\quad -\,r\bigg [\theta \left( V^n _j+\varDelta t\left( \frac{\partial V}{\partial t}\right) ^n_j+\frac{(\varDelta t)^2}{2!}\left( \frac{\partial ^2 V}{\partial t^2}\right) ^n_j+\cdots \right) -(1-\theta ) V^n_j\bigg ]\nonumber \\&\quad +\,\frac{\epsilon C}{V^{n+1}_j+\epsilon -q(S_j)}=0, \end{aligned}$$
(10)

or the simplified form

$$\begin{aligned}&\left( \frac{\partial V}{\partial t}\right) ^n_j+O(\varDelta t)+\frac{1}{2}\sigma ^2 S^2_j\bigg [\theta \left( \left( \frac{\partial ^2 V}{\partial t^2}\right) ^{n+1}_j+O(\varDelta S)^2\right) \nonumber \\&\quad +\,(1-\theta )\left( \left( \frac{\partial ^2 V}{\partial t^2}\right) ^n_j+O(\varDelta S)^2\right) \bigg ]+rS_j\bigg [\theta \left( \left( \frac{\partial V}{\partial S}\right) ^{n+1}_j+O(\varDelta S)^2\right) \nonumber \\&\quad +\left( 1-\theta \right) \left( \left( \frac{\partial V}{\partial S}\right) ^{n}_j+O(\varDelta S)^2\right) \bigg ]-r\bigg [\theta \left( V^n _j+\varDelta t\left( \frac{\partial V}{\partial t}\right) ^n_j+O(\varDelta t)\right) \nonumber \\&\quad +\left( 1-\theta \right) \left( V^{n+1} _j+\varDelta t\left( \frac{\partial V}{\partial t}\right) ^{n+1}_j+O(\varDelta t)\right) \bigg ]+\frac{\epsilon C}{V^{n+1}_j+\epsilon -q(S_j)} =0.\nonumber \\ \end{aligned}$$
(11)

By adding and subtracting the term

$$\begin{aligned} \frac{\epsilon C}{V^n_j+\epsilon -q(S_j)}, \end{aligned}$$

and nothing that

$$\begin{aligned} \left( \frac{\partial V}{\partial t}\right) ^n_j+\frac{1}{2}\sigma ^2 S^2_j\left( \frac{\partial ^2 V}{\partial t^2}\right) ^n_j+rS_j \left( \frac{\partial V}{\partial S}\right) ^{n}_j-rV^n_j+\frac{\epsilon C}{V^n_j+\epsilon -q(S_j)} =0, \end{aligned}$$
(12)

we have

$$\begin{aligned}&\frac{1}{2}\sigma ^2 S^2_j\theta \left[ \left( \frac{\partial ^2 V}{\partial t^2}\right) ^{n+1}_j-\left( \frac{\partial ^2 V}{\partial t^2}\right) ^n_j\right] +rS_j \theta \left[ \left( \frac{\partial V}{\partial S}\right) ^{n+1}_j -(\frac{\partial V}{\partial S})^{n}_j\right] \nonumber \\&\quad -\,r\theta \left( V^{n+1}_j-V^n_j\right) +O(\varDelta t)+O(\varDelta S)^2\nonumber \\&\quad +\,\frac{\epsilon C}{V^{n+1}_j+\epsilon -q(S_j)}-\frac{\epsilon C}{V^n_j+\epsilon -q(S_j)} =0. \end{aligned}$$
(13)

Using Taylor expansion we have

$$\begin{aligned} \left( \frac{\partial ^2 V}{\partial t^2}\right) ^{n+1}_j&= \left( \frac{\partial ^2 V}{\partial t^2}\right) ^{n}_j+\varDelta t\frac{\partial \left( \frac{\partial ^2 V}{\partial t^2}\right) ^{n}_j}{\partial t}+\cdots , \nonumber \\ V^{n+1}_j&= V^n _j+\varDelta t\left( \frac{\partial V}{\partial t}\right) ^n_j+\frac{(\varDelta t)^2}{2!}\left( \frac{\partial ^2 V}{\partial t^2}\right) ^n_j+\cdots , \end{aligned}$$
(14)

then by substituting in (13), we obtain

$$\begin{aligned} \frac{\epsilon C}{V^{n+1}_j+\epsilon -q(S_j)}-\frac{\epsilon C}{V^n_j+\epsilon -q(S_j)}+O(\varDelta t)+O(\varDelta S)^2=0. \end{aligned}$$
(15)

On the other hand, for American put option we have

$$\begin{aligned} |V^{n+1}_j-q(S_j)|<\epsilon ,|V^{n}_j-q(S_j)|<\epsilon , \end{aligned}$$
(16)

thus we write

$$\begin{aligned}&\frac{\epsilon C}{V^{n+1}_j+\epsilon -q(S_j)}=\frac{C}{1+\frac{V^{n+1}_j-q(S_j)}{\epsilon }}\nonumber \\&\quad =C\left[ 1-\frac{V^{n+1}_j-q(S_j)}{\epsilon }+\left( \frac{V^{n+1}_j-q(S_j)}{\epsilon }\right) ^2-\cdots \right] ,\nonumber \\&\qquad \frac{\epsilon C}{V^{n}_j+\epsilon -q(S_j)}=\frac{C}{1+\frac{V^{n}_j-q(S_j)}{\epsilon }}\nonumber \\&\quad =C\left[ 1-\frac{V^{n}_j-q(S_j)}{\epsilon }+\left( \frac{V^{n}_j-q(S_j)}{\epsilon }\right) ^2-\cdots \right] , \end{aligned}$$
(17)

since \(\frac{\vert V^{n+1}_j-q(S_j)\vert }{\epsilon }<1\) and \(\frac{\vert V^{n}_j-q(S_j)\vert }{\epsilon }<1\).

By substituting

$$\begin{aligned} V^{n+1}_j=V^{n}_j+O(\varDelta t), \end{aligned}$$

in (17), we get

$$\begin{aligned}&\frac{\epsilon C}{V^{n+1}_j+\epsilon -q(S_j)}-\frac{\epsilon C}{V^n_j+\epsilon -q(S_j)}=O(\varDelta t). \end{aligned}$$
(18)

Therefore, the truncation error of the \(\theta \)-method is of order \(O(\varDelta t)+O(\varDelta S)^2\). In the next section, we use the MOL and predictor–corrector technique to get high accuracy for time and obtain parameters of this method using rational approximation.

4 Rational Approximation and Stability

By defining \(\tau := T-t\), we consider (6) in the more conventional form

$$\begin{aligned}&\frac{\partial V}{\partial \tau }=\frac{1}{2}\sigma ^2 S^2 \frac{\partial ^2V}{\partial S^2}+rS\frac{\partial V}{\partial S}-rV+\frac{\epsilon C}{V+\epsilon -q(S)},S\in [0,S_{\infty }],\tau \in (0,T] \nonumber \\&V(S,\tau =0) = \max (E-S,0),\nonumber \\&V(0,\tau )\rightarrow V_{\tau }=-rV+\frac{\epsilon C}{V+\epsilon -E},\nonumber \\&V(S ,\tau ) = 0, S\rightarrow \infty \nonumber \\&V(0,\tau )=E. \end{aligned}$$
(19)

Applying the MOL semi-discretization approach Duffy (2006) by using a central FD for the first order derivative, \(\frac{\partial V}{\partial S}\), it can be written in the compact form

$$\begin{aligned} \frac{\partial V}{\partial \tau }=F(\tau ,V),\quad V(0)=V_0, \end{aligned}$$
(20)

where \(F(\tau ,V)=AV+g(\tau ,V)\), in which \(g(\tau , V) \) is the non-linear discretized penalty term and \(A\) is the nonsymmetric tridiagonal matrix

$$\begin{aligned} A = \left( \begin{array}{ccccccc} -r&{} 0 &{}\\ a(2)&{} b(2) &{} c(2)&{} &{} &{} \\ &{}a(3)&{}b(3) &{} c(3)&{} \\ &{} &{}\ddots &{}\ddots &{}\ddots \\ &{} &{} &{}a(N-2)&{} b(N-2) &{} c(N-2)\\ &{} &{} &{} &{} a(N-1)&{} b(N-1) &{} c(N-1)\\ &{} &{} &{} &{} &{} 0&{} 0 \end{array} \right) , \end{aligned}$$

where

$$\begin{aligned}&a(i)=\frac{1}{2}\frac{\sigma ^2 S(i)^2}{\varDelta S^2}-\frac{rS(i)}{2\varDelta S},\\&b(i)=-\left( \frac{\sigma ^2 S(i)^2}{\varDelta S^2}+r\right) , \\&c(i)=\frac{1}{2}\frac{\sigma ^2 S(i)^2}{\varDelta S^2}+\frac{rS(i)}{2\varDelta S}, \end{aligned}$$

for \( i=2,\ldots ,N-1 \).

Now we use the predictor–corrector method for solving (20). For simplicity, we assume a uniform time step size \(k\). Then for the corrector

$$\begin{aligned} V^{n+1}-V^n=k\left[ \frac{1}{2}\{AV^n+g^n\}+\theta \{AV^{n+1}+g^{n+1}\}+\phi \{ A\widehat{V}^{n+1}+\widehat{g}^{n+1}\}\right] , \end{aligned}$$
(21)

we use the \(\theta \)-method to compute the predicted solution \(\widehat{V}^{n+1}\):

$$\begin{aligned} \widehat{V}^{n+1}-V^n=k\left[ (1-\theta )\{AV^n+g^n\}+\theta \{ A\widehat{V}^{n+1}+\widehat{g}^{n+1}\}\right] . \end{aligned}$$
(22)

For \(\theta =0\) and \(\phi =\frac{1}{2}\), (21) reduces to the modified Euler method. We treat the implicit non-linear term \(g(\tau , V)\), explicitly, in both the predictor and corrector throughout evaluating \(g\) at the current available values. This yields the following linear implicit method with tridiagonal coefficient matrix \(T=I-k\theta A\),

$$\begin{aligned} TV^{n+1}&= \left[ I+\frac{k}{2}A\right] V^n+k\left[ \frac{1}{2}g^n+(\phi +\theta )\widehat{g}^{n+1}\right] +\phi kA\widehat{V}^{n+1}\nonumber \\ T\widehat{V}^{n+1}&= \left[ I+k(1-\theta )A\right] V^n+kg^n. \end{aligned}$$
(23)

The following definitions and lemmas are recalled from Brayton et al. (1972), Lambert (1978) and Mock (1983). The linear stability properties of the linear implicit method (23) are determined through applying to the scalar equation \(\frac{\partial v}{\partial t}=\lambda v\), \(\; Re(\lambda )<0\). Applying (23) to this problem yields \(v_{n + 1} = R(q)v_n\), \(q = k\lambda \) with the rational stability function

$$\begin{aligned} R(q)_2^2=\frac{N(q)}{D(q)}=\frac{1+\left( \frac{1}{2}-\theta +\phi \right) q+\left( \phi -\theta \phi -\frac{1}{2}\theta \right) q^2}{1-2\theta q+\theta ^2 q^2}, \end{aligned}$$
(24)

(see Voss and Casper (1989), Voss and Khaliq (1999) and Fitzsimons et al. (1992)).

Definition 1

A numerical method is said to be A-stable if its region of absolute stability contains the whole of the left half-plane. A one-step numerical method is said to be L-stable if it is A-stable and if, in addition, when it is applied to the scalar test equation \(y ' = \lambda y\), where \(\lambda \) is a complex constant with \(\text{ Re }(\lambda ) < 0\), it has the form \(y_{k+1} = R(k\lambda )y_k\), where \(\mid R(k\lambda )\mid \rightarrow 0\) as \(\text{ Re }(k\lambda )\rightarrow -\infty \).

Definition 2

Let q be a complex number and let \(R^S_T(q)\), where \(S >0, T > 0, \) be given by

$$\begin{aligned} R^S_T(q)=\frac{\varSigma ^S_{i=0}a_iq^i }{\varSigma ^T_{i=0}b_iq^i }, \end{aligned}$$

where all the \(a_i\) and \(b_i\) are real and \(a_0 = b_0= 1\). We say that \(R^S_T(q)\) is an \((S, T)\) Rational approximation of order \(P\) to \(e^q \) if \(R^S_T(q) = e^q + O(q^{P+1})\). If \(P = S + T\), it is called the \(Pad\acute{e}\) approximation to \(e^q\).

Remark 1

Consider the \((2, 2)\) Rational approximation \(R^2_2(q)\) containing the free parameters \(\xi \) and \(\zeta \)

$$\begin{aligned} R^2_2(q; \xi ,\zeta )=\frac{1+\frac{1}{2}(1-\xi )q+\frac{1}{4}(\zeta -\xi )q^2}{1-\frac{1}{2}(1+\xi )q+\frac{1}{4}(\xi +\zeta )q^2}:=\frac{N(q)}{D(q)}. \end{aligned}$$
(25)

It is of order 2 in general and of order 3, if \(\xi \ne 0\), \(\zeta =\frac{1}{3}\). It is a \(Pad\acute{e}\) approximation of order 4, if \(\xi =0\), \(\zeta =\frac{1}{3}\).

Theorem 1

Butcher (2008) \(R^2_2(q; \xi ,\zeta )\) is A-stable iff

  1. (a)

    all poles of \(R^2_2(q; \xi ,\zeta )=\frac{N(q)}{D(q)}\) are in the right half-plane.

  2. (b)

    \( E(y)= \mid D(iy)\mid ^2-\mid N(iy)\mid ^2\ge 0\), for all real y.

Proof

The necessity of (a) follows from the fact that if \(q^*\) is a pole, then \(\lim _{q\rightarrow q^*} \mid R(q)\mid = \infty \), and hence \(|R(q)| > 1\), for \(q\) close enough to \(q^*\). The necessity of (b) follows from the fact that \(E(y) < 0\) implies that \(|R(iy)| > 1\), so that \(|R(q)| > 1\) for some \(q =-\epsilon +iy\) in the left half-plane. Sufficiency of these conditions follows from the fact that (a) implies that R is analytic in the left half-plane so that, by the maximum modulus principle, \( |R(q)| > 1\) in this region implies \( |R(q)| > 1 \) on the imaginary axis, which contradicts (b). \(\square \)

Lemma 1

\(R^2_2(q; \xi ,\zeta )\) is A-stable iff \(\xi \ge 0\), \(\zeta \ge 0\), and L-stable iff \(\xi =\zeta >0\).

Proof

Let \(R^2_2(q; \xi ,\zeta )\) be A-stable. Then by Theorem 1, (a) and (b) are true, thus both of the roots of \(D(q)=0\) must be in the right half-plane, that is,

$$\begin{aligned} q_1&= \text{ Re }\left( \frac{1+\xi +\sqrt{(1-2\xi +\xi ^2-4\zeta )}}{\xi +\zeta }\right) \ge 0,\\ q_2&= \text{ Re }\left( \frac{1+\xi -\sqrt{(1-2\xi +\xi ^2-4\zeta )}}{\xi +\zeta }\right) \ge 0, \end{aligned}$$

thus

$$\begin{aligned} q1=\left\{ \begin{array}{llllll} \frac{1+\xi }{\xi +\zeta }&{}\quad \text{ if } \;\;(1-2\xi +\xi ^2-4\zeta )<0,\\ \frac{1+\xi +\sqrt{(1-2\xi +\xi ^2-4\zeta )}}{\xi +\zeta }&{}\quad \text{ if } \;\;(1-2\xi +\xi ^2-4\zeta )\ge 0, \end{array}\right. \end{aligned}$$
(26)
$$\begin{aligned} q2=\left\{ \begin{array}{llllll} \frac{1+\xi }{\xi +\zeta }&{}\quad \text{ if } \;\;(1-2\xi +\xi ^2-4\zeta )<0,\\ \frac{1+\xi -\sqrt{(1-2\xi +\xi ^2-4\zeta )}}{\xi +\zeta }&{}\quad \text{ if } \;\;(1-2\xi +\xi ^2-4\zeta )\ge 0, \end{array}\right. \end{aligned}$$
(27)

and

$$\begin{aligned} E(y)=\frac{1}{4}y^4\xi \zeta \ge 0. \end{aligned}$$
(28)

From (28), we conclude that \(\xi \ge 0\) and \(\zeta \ge 0\), since otherwise, we should have \(\xi < 0\) and \(\zeta < 0\), this yields \(\xi +\zeta <0\) and \(1+\xi -\sqrt{(1-2\xi +\xi ^2-4\zeta )}\le 0\), which contradicts (27).

Conversely, let \(\xi \ge 0\), \(\zeta \ge 0\). Then we show that \(R^2_2(q; \xi ,\zeta )\) is A-stable. Since \(\xi \ge 0\), \(\zeta \ge 0\), then (28) and (26) are clearly true. To prove (27), it is clear that \(\frac{1+\xi }{\xi +\zeta }\ge 0, \) when \(1-2\xi +\xi ^2-4\zeta < 0\). Let \(1-2\xi +\xi ^2-4\zeta \ge 0\). Since \(\xi +\zeta \ge 0\), we should prove \(1+\xi -\sqrt{(1-2\xi +\xi ^2-4\zeta )}\ge 0\). If

$$\begin{aligned} 1+\xi -\sqrt{(1-2\xi +\xi ^2-4\zeta )}< 0, \end{aligned}$$

then

$$\begin{aligned} (1+\xi )^2 < 1-2\xi +\xi ^2-4\zeta , \end{aligned}$$

or equivalently

$$\begin{aligned} 4\xi <-4\zeta , \end{aligned}$$

which contradicts the hypothesis \(\xi \ge 0\), \(\zeta \ge 0\). Thus by Theorem 1, \(R^2_2(q; \xi ,\zeta )\) is A-stable.

To prove the second part, let \(\xi =\zeta >0\), then (25) is converted to

$$\begin{aligned} R^2_2(q; \xi ,\xi )=\frac{1+\frac{1}{2}(1-\xi )q}{1-\frac{1}{2}(1+\xi )q+\frac{1}{2}\xi q^2}. \end{aligned}$$
(29)

Therefore \(\mid R^2_2(q; \xi ,\xi )\mid \rightarrow 0\) as \(\text{ Re }(q)\rightarrow -\infty \), so according to the Definition 1 \(R^2_2(q; \xi ,\xi )\) is L-stable. Conversely, let \(R^2_2(q; \xi ,\xi )\) be L-stable, then by Definition 1, it is A-stable and so by the first part, we have \(\xi \ge 0\), \(\zeta \ge 0\), and since \(\mid R^2_2(q; \xi ,\xi )\mid \rightarrow 0\), as \(\text{ Re }(q)\rightarrow -\infty \), we get \(\xi =\zeta >0\), (see (25)). \(\square \)

According to the Remark 1, and Lemma 1, the predictor–corrector method is L-stable, if

$$\begin{aligned} \phi -\theta \phi -\frac{1}{2}\theta =0, \end{aligned}$$
(30)

and

$$\begin{aligned} 2\theta -2\phi =\xi ,\;\;\; 4\theta -1=\xi , \end{aligned}$$

which implies

$$\begin{aligned} \phi =\frac{-2\theta +1}{2}. \end{aligned}$$
(31)

Substituting from (31) in (30) and solving it, yields

$$\begin{aligned} \theta =1\pm \frac{ \sqrt{2}}{2},\;\;\;\phi =\frac{-1\mp \sqrt{2}}{2}. \end{aligned}$$

We choose

$$\begin{aligned} \theta =1-\frac{ \sqrt{2}}{2},\;\;\phi =\frac{-1+ \sqrt{2}}{2}, \end{aligned}$$

since this choice yields more accurate results on various test problems. We use a rational approximation for three different kinds of FD and draw the numerical results for different data of BS model as shown in Fig. 1a, b, c, d and e.

Fig. 1
figure 1

a Plots of \(R(q)\) for three kinds of FD with \( \sigma =0.25\) and \(r=0.05\). b Plots of \(R(q)\) for three kinds of FD with \( \sigma =0.3\) and \(r=0.02\). c Plots of \(R(q)\) for three kinds of FD with \( \sigma =0.2\) and \(r=0.1\). d Plots of \(R(q)\) for three kinds of FD with \( \sigma =0.15\) and \(r=0.07\). e lots of \(R(q)\) for three kinds of FD with \( \sigma =0.1\) and \(r=0.01\)

The Fig. 1a, b, c, d and e show that the backward FD is more stable than the forward and central FDs for the first derivative, \(\frac{\partial V}{\partial S}\), of BS model.

5 Numerical Results

The numerical simulations are run on a computer with an Intel(R) Pentium processor 2.00 GHz 8 GB RAM and the software programs are written in Matlab.

Let \(V_{PC}\) denote the approximate solution obtained by the MOL and predictor–corrector method developed in Sect. 4. The error on \(V_{PC}\) at the current time \((\tau = T )\) is computed as

$$\begin{aligned} \text{ Error } = \max _S \mid V_{PC}(S,T)-V(S,T)\mid . \end{aligned}$$
(32)

The exact solution \(V(S,T)\), needed in (32) is not available.

Remark 2

For a call (put) option, either European or American, when the current asset price is higher, it has a strictly higher (lower) chance to be exercised and when exercises, it induces higher (lower) cash inflow. Therefore, the call (put) option price functions are increasing (decreasing) of the asset price, that is,

$$\begin{aligned} C(S_2, \tau ; E) > C(S1, \tau ; E), \quad S_2 > S_1, \end{aligned}$$

or

$$\begin{aligned} P(S_2, \tau ; E) < P(S1, \tau ; E), \quad S_2 > S_1. \end{aligned}$$
(33)

It is shown in Fig. 2 that our results are true for \(T=\tau \), according to Remark (2).

Fig. 2
figure 2

American put option with \(\epsilon =0.0001\), \(N_{\tau }=1,000\), \(N_S=100\), \(T=1\) and \(E=1\)

Therefore, more accurate estimation is obtained using the FD method with a large number of time and space steps (\(N_{\tau }\), \(N_{S}\)).

To compute the error of approximate solution from (32), we save the solution for \(N_S=1{,}000\) and \(N_{\tau }=10, 100, 1{,}000,\) then we compare it with the results obtain for \(N_{S}=10, 50,200, 400, 800\). See the numerical results in Table 1a, b and c for the above mentioned values of \(N_{\tau }\) and \(N_{S},\) and \(\sigma = 2,\; T = 1\), \(S_{\infty } = 2,\; r = 0.1, \;E = 1\) and \(\epsilon =\frac{T}{N_{\tau }}rE\) (see 8).

Table 1 Estimation Error of American option pricing

In Table 1d, we save the approximate solution for \(N{\tau }=10\) and \(N_S=1{,}000\), then we change the parameters to \(N_{S}=10, 50, 200, 400, 800\), \(\sigma =0 .25,\; T = 1\), \(S_{\infty } = 2,\; r = 0.02\) and \(E = 1.\)

Finally, we save the approximate solution for \(N{\tau }=10\) and \(N_S=1{,}000\), then we change the parameters to \( \sigma =0 .3,\; T = 1\), \(S_{\infty } = 2,\; r = 0.02,\;E = 1\) and obtain the Table  1e for \(N_{S}=10, 50, 200, 400, 800\). In this section numerical solution showed which \(\varDelta t \rightarrow 0\) (\(\epsilon \longrightarrow 0\)) the estimated option values provided by the penalty approach, \(V_{PC}(S,T)\), converge towards the \(V(S,T)\) (in (32)) and \(V_{PC}(S,T)\) is obtained satisfied financial condition (Remark (2)).

6 Conclusion

We investigated the stability and consistency of American put option pricing and we showed the truncation error of \(\theta \)-method is of order \(O(\varDelta t)+O (\varDelta S)^2.\) We obtained a high accuracy by using MOL and predictor–corrector method which depends on the parameters \(\theta \) and \(\phi \). By using Rational approximation, we obtained \(\theta \) and \(\phi \) to guarantee L-stability for American put option under the popular BS model. We used three different kinds of FD, i.e. backward, central and forward FDs for the first order space derivative in BS model, \(\frac{\partial V}{\partial S}\) and showed by numerical results that the backward method is more stable and more accurate than the others for \(\frac{\partial V}{\partial S}\).