1 Introduction

In this paper, we focus on parareal, an algorithm introduced by J.L. Lions et al. [1] in 2001, which exploits parallelism in time for initial value problems. Over the last two decades, this algorithm has been studied for a range of applications, ranging from molecular dynamics simulations [2], unsteady hydrodynamic simulations [3], the kinetic neutron diffusion equation [4, 5], the Korteweg-de Vries-Burgers equation [6], and Hamiltonian systems [7, 8], to financial mathematics such as the Black-Scholes equation [9,10,11]. Its stability and convergence are studied in a series of papers, e.g., [12,13,14,15]. Given a time-dependent problem, parareal enables parallel-in-time integration by combining a fine propagator, which gives a very accurate approximation of the solution, with a coarse propagator, which is less expensive and gives a rough approximation of the solution. For this, the time domain is decomposed into a number of uniform time subdomains. Starting from an initial solution obtained by applying the coarse propagator sequentially, parareal iteratively corrects this solution by the difference between the fine solution, obtained in parallel with the fine propagator, and the coarse solution from the previous iteration.

Several different interpretations of parareal exist in the literature. A derivation of the parareal algorithm as a multiple shooting method is given in [13]. An investigation of the usage of spectral deferred corrections in the framework of parareal is given in [16, 17]. Coupling parareal in time with Schwarz waveform relaxation methods [18, 19] to exploit parallelism in both time and space is a promising direction of research as well. Parareal can also be interpreted as a multigrid method in time, referred to as MGRIT with F-relaxation [20, 21]. Following this interpretation, several different variants have been investigated, such as MGRIT with FCF-relaxation and MGRIT with F(CF)2-relaxation, e.g., [21, 22], where F refers to the F-relaxation and C refers to the C-relaxation.

Given that parareal relies on a decomposition of the time domain into subdomains, in this paper we study the connection between parareal and domain decomposition methods. Traditionally, domain decomposition methods are used for solving a linear system of equations \({\tilde {A}}\tilde {u} = {\tilde {f}}\), \({\tilde {A}} \in \mathbb {R}^{n \times n}\), arising from the discretization of a PDE by using, for example, the finite element method, and they rely on a decomposition of the space domain into subdomains. We consider here the case in which this linear system is solved by using an iterative method such as a Krylov subspace method, preconditioned by \({\tilde {M}}^{-1}\),

$$ {\tilde{M}}^{-1}{\tilde{A}}\tilde{u} = {\tilde{M}}^{-1}{\tilde{f}}, $$

where \({\tilde {M}}^{-1}\) is a domain decomposition method. One-level domain decomposition preconditioners such as additive and multiplicative Schwarz preconditioners are well known in the literature for domain decomposition in space, see, e.g., [23]. However, their convergence rate deteriorates when the number of subdomains becomes large, because of a lack of global information coupling the subdomains. In order to obtain a scalable domain decomposition algorithm, whose convergence depends only weakly on the number of subdomains, a coarse space can be used to share global information among all subdomains. This leads to the idea of two-level domain decomposition preconditioners. Given a spatial decomposition of the degrees of freedom of \({\tilde {A}}\) into \(\tilde {N}\) subdomains, the restriction of \({\tilde {A}}\) to a spatial subdomain i, for \(i=1, {\ldots } ,\tilde {N}\), is referred to as \({\tilde {A}}_{i}\) and is obtained by defining a restriction matrix \({\tilde {R}}_{i}\) together with a prolongation matrix \({\tilde {R}}_{i}^{T}\), such that \({\tilde {A}}_{i} = {\tilde {R}}_{i} {\tilde {A}} {\tilde {R}}_{i}^{T}\). By defining the coarse matrix \({\tilde {A}}_{0}\) and corresponding restriction and prolongation matrices \({\tilde {R}}_{0},{\tilde {R}}_{0}^{T}\), the two-level additive Schwarz preconditioner is defined as follows,

$$ {\tilde{M}}^{-1}_{AS2} = {\tilde{R}}^{T}_{0}{\tilde{A}}^{-1}_{0}{\tilde{R}}_{0} + \sum\limits_{i=1}^{\tilde{N}}{\tilde{R}}^{T}_{i}{\tilde{A}}^{-1}_{i}{\tilde{R}}_{i}, $$

and the two-level multiplicative Schwarz preconditioner is defined analogously as follows,

$$ {\tilde{M}}^{-1}_{MS2} =\left[{{\mathcal{I}}} - ({{\mathcal{I}}} - {\tilde{R}}^{T}_{0}{\tilde{A}}^{-1}_{0}{\tilde{R}}_{0}{\tilde{A}} )\prod\limits_{i=1}^{\tilde{N}}({{\mathcal{I}}} -{\tilde{R}}^{T}_{i}{\tilde{A}}^{-1}_{i}{\tilde{R}}_{i} {\tilde{A}}) \right] {\tilde{A}}^{-1}. $$
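To make the algebraic structure concrete, the following minimal sketch (our illustration in NumPy, not taken from the paper) applies the two-level additive Schwarz preconditioner \({\tilde{M}}^{-1}_{AS2}\) to a vector, assuming the restriction matrices are available as dense arrays:

```python
import numpy as np

def apply_as2(A, R0, Rs, r):
    """Apply the two-level additive Schwarz preconditioner to a vector r:
    z = R0^T A0^{-1} R0 r + sum_i Ri^T Ai^{-1} Ri r."""
    A0 = R0 @ A @ R0.T                          # coarse matrix
    z = R0.T @ np.linalg.solve(A0, R0 @ r)      # coarse correction
    for Ri in Rs:                               # one local solve per subdomain;
        Ai = Ri @ A @ Ri.T                      # these are independent and run
        z += Ri.T @ np.linalg.solve(Ai, Ri @ r) # in parallel in practice
    return z
```

The local solves in the loop are independent, which is what makes the preconditioner parallel; the coarse solve is the term that couples global information.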

To show the equivalence between parareal and two-level domain decomposition methods, we consider the linear time-dependent problem,

$$ \frac{d{u}}{dt} = {f}({u}), \quad {u}(0)= \mathrm{u}_{0}, \quad {u}(t) \in \mathbb{R}^{d}, \ t \in (0,T), $$
(1)

and an algebraic framework in which the solution of (1) can be obtained by solving, with a residual correction scheme, the linear system of equations,

$$ {A} {U}_{F} = {f}, $$
(2)

where the time domain (0,T) is decomposed into N uniform time slices, and \({A} \in \mathbb {R}^{(N+1)d\times (N+1)d}\) is block bidiagonal and denotes the time-stepping coefficient matrix of the form,

$$ {A}:= \begin{bmatrix} {\mathcal{I}} & & & \\ -{\phi} & {\mathcal{I}} & & \\ & {\ddots} &{\ddots} & \\ & & -{\phi} &{\mathcal{I}} \end{bmatrix}. $$

In this equation, \({\mathcal {I}} \in \mathbb {R}^{d\times d}\) is the identity matrix, \({\phi } \in \mathbb {R}^{d\times d}\) denotes an arbitrary stable discretization method in space and time, \({U}_{F} := [{u}_{0},\ldots,{u}_{N}]^{T}\) denotes the solution at the fine time steps, and \({f} := [\mathrm{u}_{0},0,\ldots,0]^{T}\) is the right-hand side. The matrix A includes all the time steps for the whole time domain. If N and d become large, (2) results in a very large and sparse system. This is the case where domain decomposition type methods show their advantages. We consider the problem on a uniform grid: the time steps and space steps do not change from one step to the next, so the discretization matrix ϕ is the same for every time step. We show that parareal is equivalent to using the preconditioned stationary iteration which computes a new approximate solution \({U}_{F}^{k+1}\) from \({U}_{F}^{k}\),

$$ {U}_{F}^{k+1} = {U}_{F}^{k} + {M}_{SC}^{-1}({f} - {A}{U}^{k}_{F}), $$

where \({M}_{SC}^{-1}\) is a two-level additive Schwarz in time preconditioner defined as follows,

$$ {M}_{SC}^{-1} = ({R}_{0}^{T}A_{0}^{-1}{R}_{0} + {\mathbb{I}} - {R}_{0}^{T}{R}_{0}) \left( \sum\limits_{i=1}^{\hat{N}}{R}_{i}^{T}{A}_{i}^{-1}{R}_{i} \right). $$
(3)

Remark 1

The preconditioner \({M}_{SC}^{-1}\) in (3) is different from the so-called hybrid preconditioner where subdomain preconditioning is applied additively, but the coarse solve is applied multiplicatively in the second stage, i.e.,

$$ {M}_{hybrid}^{-1} = {R}_{0}^{T} A_{0}^{-1}{R}_{0} + ({\mathbb{I}} - {R}_{0}^{T} A_{0}^{-1}{R}_{0}) \left( \sum\limits_{i=1}^{\hat{N}}{R}_{i}^{T}{A}_{i}^{-1}{R}_{i} \right). $$

A symmetrized version of this preconditioner appears as \(P_{hy1}\) in the standard reference on domain decomposition methods by Toselli and Widlund (2005) [24].

We give in Section 3 the exact definitions of the subdomain matrices \({A}_{i}\), for \(i=1, \ldots, \hat{N}\), the coarse time correction matrix \({A}_{0}\), as well as the restriction and prolongation matrices \({R}_{i}, {R}_{i}^{T}\), for \(i=0, \ldots, \hat{N}\), where \(\hat{N}\) is the number of subdomain matrices of A. The matrix \({\mathbb {I}} \in \mathbb {R}^{(N+1)d\times (N+1)d}\) is the identity matrix. The second factor of (3) is an additive Schwarz preconditioner in time, applied first and computed in parallel by using the fine propagators; it is followed by the coarse correction in time (the first factor), based on a coarse propagator, which is computed sequentially and transfers information globally between the different time subdomains.

Furthermore, we show that this two-level additive Schwarz in time preconditioner has the same error propagation at the coarse time points as MGRIT with F-relaxation, discussed in [20, 21, 25]. As expected, this shows that the three algorithms, parareal, MGRIT with F-relaxation, and the two-level additive Schwarz in time preconditioner from (3), are equivalent. We also show that applying additional fine or coarse propagation steps in the two-level additive Schwarz in time preconditioner is equivalent to MGRIT with FCF-relaxation and to MGRIT with F(CF)2-relaxation or overlapping parareal, discussed in [26]. Faster convergence can be achieved in some cases, but the additional cost per iteration must also be taken into account. To improve the convergence, a variant of the two-level domain decomposition method, referred to as the SCS2 two-level additive Schwarz in time preconditioner, provides a good alternative, since it increases the number of additive Schwarz in time steps while keeping only one coarse correction step, which is performed sequentially. Note that the notations S and C used here in the context of the two-level additive Schwarz in time preconditioner correspond to the use of the fine and coarse propagators; they are different from the F-relaxation and C-relaxation used in MGRIT. Specifically, the S and C propagation steps in the two-level additive Schwarz in time preconditioner start from the same coarse time points and propagate to obtain the approximate solution at the end of each time subdomain, whereas F-relaxation propagates from the coarse time points to obtain the approximate solution at the fine time points, and C-relaxation propagates from the preceding fine time points to obtain the approximate solution at the coarse time points; for more details, see [25]. We also explore the usage of Krylov subspace methods for solving the system (2). This gives promising numerical results, especially for solving the advection-reaction-diffusion equation with large advection and reaction terms.

The paper is organized as follows. Section 2 recalls the parareal algorithm and its formulation as a residual correction scheme. Section 3 introduces an interpretation of parareal as a two-level additive Schwarz in time preconditioner. Section 4 discusses several variants of this two-level additive Schwarz in time preconditioner and gives their convergence analysis. Furthermore, theoretical convergence bounds are given in Section 5. Several numerical experiments are presented in Section 6, where we consider the Dahlquist problem, the heat equation, and the advection-reaction-diffusion equation. Conclusions and perspectives are given in Section 7.

2 Parareal algorithm

In this section, we describe the parareal algorithm by following its presentation from, e.g., [13]. For simplicity of exposition, we consider the linear time-dependent problem,

$$ \frac{d{u}}{dt} = {f}({u}), \quad {u}(0)= \mathrm{u}_{0}, \quad {u}(t) \in \mathbb{R}^{d}, \ t \in (0,T). $$
(4)

The time interval [0,T] is decomposed into \(N_{C}\) uniform time subdomains \([T_{n},T_{n+1}]\) with \(n = 0,\dots,N_{C}-1\). Parareal uses two solvers, a fine solver \(\mathcal {F}(T_{n+1},T_{n},{U}_{n})\), which gives a very good approximation, and a coarse solver \(\mathcal {G}(T_{n+1},T_{n},{U}_{n})\), which gives a rough approximation of the solution at time \(T_{n+1}\) starting from the initial solution \({U}_{n}\) at time \(T_{n}\). The initial approximation \({U}^{0}_{n}\) at the coarse time points is typically obtained by applying the coarse solver sequentially,

$$ {U}^{0}_{n+1} = \mathcal{G}(T_{n+1},T_{n},{U}^{0}_{n}), \ {U}^{0}_{0} = \mathrm{u}_{0}. $$

From this initial solution in time, parareal iteratively computes a new approximation of the solution of (4) until some convergence criterion is met. At each iteration k + 1, k ≥ 0, a new approximation is computed as follows,

$$ {U}^{k+1}_{n+1} = \mathcal{G}(T_{n+1},T_{n},{U}^{k+1}_{n}) +\mathcal{F}(T_{n+1},T_{n},{U}^{k}_{n}) - \mathcal{G}(T_{n+1},T_{n},{U}^{k}_{n}). $$
(5)

The coarse and the fine solvers can be chosen in various ways. Very often a higher order approximation is used for the fine solver and a lower order approximation for the coarse solver. The coarse solver can also solve a different problem, which is simpler than the original one, as long as it gives an acceptable approximation of the solution. The coarse solver plays an important role in the convergence of the parareal algorithm: it should be cheap but accurate enough compared to the fine one, otherwise the parareal algorithm can converge slowly. One simple approach is to choose the same discretizations in both time and space for both coarse and fine solvers, but with a larger time step Δt for the coarse solver and a smaller time step δt for the fine solver. Furthermore, one can also use a coarsened spatial mesh for the coarse solver, see [27].
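To fix ideas, here is a minimal sketch of the parareal iteration (5), with the fine and coarse solvers passed as functions; the Dahlquist example with backward Euler at the end is our illustrative choice, not the exact experimental setup of the paper:

```python
import numpy as np

def parareal(F, G, u0, NC, K):
    """Parareal iteration (5); F, G map the solution at T_n to T_{n+1}."""
    U = [u0] + [None] * NC
    for n in range(NC):                       # initial sequential coarse sweep
        U[n + 1] = G(U[n])
    for k in range(K):
        Fk = [F(U[n]) for n in range(NC)]     # fine solves, parallel over n
        Unew = [u0] + [None] * NC
        for n in range(NC):                   # sequential coarse correction
            Unew[n + 1] = G(Unew[n]) + Fk[n] - G(U[n])
        U = Unew
    return U

# Illustrative choice: Dahlquist problem u' = -u with backward Euler.
dT, m = 0.1, 10
dt = dT / m
G = lambda u: u / (1.0 + dT)        # one coarse backward Euler step
F = lambda u: u / (1.0 + dt)**m     # m fine backward Euler steps
print(parareal(F, G, 1.0, NC=20, K=5)[-1])    # close to exp(-2)
```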

2.1 Parareal execution from an algebraic point of view

Consider the time-dependent problem from (4) for which the time interval [0,T] is divided into N uniform time slices \([t_{n},t_{n+1}]\) of length δt, for \(n = 0,\dots,N-1\). On the other hand, [0,T] is also partitioned into \(N_{C}\) uniform coarse time intervals \([T_{l},T_{l+1}]\) of length ΔT, for \(l = 0,\dots,N_{C}-1\). We denote by ϕ a stable discretization method in time, such as forward Euler, backward Euler, Runge-Kutta or higher order methods, and by \({\phi}_{\Delta T}\) the coarse solver, for which the same methods are used but with a larger time step, lower order methods, or spatial coarsening; in particular, \({\phi}_{\Delta T}\) approximates the fine solver \({\phi}^{m}\). Let δt be the fine time step and Δt = ΔT = mδt be the coarse time step (we use one coarse time step for the coarse solver on each coarse time interval), in which m denotes the number of fine time steps on each coarse time interval. We note that the error propagation and convergence analysis in Sections 3, 4, and 5 are based on the assumption that ϕ and \({\phi}_{\Delta T}\) can be diagonalized by the same set of eigenvectors, as is the case when ϕ and \({\phi}_{\Delta T}\) have the same spatial discretization, as stated in [25]. Furthermore, the analysis of spatial discretization can also be found in [28]. Without loss of generality, we consider in this work the same discretization methods in both time and space for both coarse and fine solvers, namely backward Euler in time and the centered finite difference method in space. However, discretizations such as forward Euler, Runge-Kutta or higher order methods can also be used in the same framework; we illustrate this by using Runge-Kutta 4 for the fine solver in Section 6.4. In this paper, we focus on linear constant-coefficient partial differential equations, in particular the heat equation and the advection-reaction-diffusion equation. Sequentially applying ϕ leads to the linear system of equations:

$$ {A}{U}_{F} :=\begin{bmatrix} {\mathcal{I}} & & & \\ -{\phi} & {\mathcal{I}} & & \\ & {\ddots} &{\ddots} & \\ & & -{\phi} &{\mathcal{I}} \end{bmatrix}\begin{bmatrix} {u}_{0}\\ {u}_{1}\\ \vdots\\ {u}_{N} \end{bmatrix} = \begin{bmatrix} \mathrm{u}_{0}\\ 0\\ \vdots\\ 0 \end{bmatrix}=:{f}, $$
(6)

where \({A} \in \mathbb {R}^{(N+1)d\times (N+1)d}\) denotes the time-stepping coefficient matrix, \({\mathcal {I}}\in \mathbb {R}^{d\times d}\) denotes the identity matrix, and \({\phi } \in \mathbb {R}^{d\times d}\) denotes the discretization matrix. This system of equations can be solved by using a direct method in which the solutions \({u}_{i}\), \(i = 0,\ldots,N\), at the different time steps are obtained sequentially. This results in a complexity of N time steps, each time step being solved by using ϕ. Instead of using ϕ alone, parareal combines the coarse and fine solvers to obtain a faster algorithm in which the fine solves are performed in parallel.
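The following short sketch (our illustration under the notation above) assembles the block-bidiagonal matrix A from (6) and performs the sequential solve by block forward substitution; in practice A is never formed explicitly, since each block row encodes one application of ϕ:

```python
import numpy as np

def assemble_A(phi, N):
    """Block-bidiagonal time-stepping matrix A of (6), size (N+1)d x (N+1)d."""
    d = phi.shape[0]
    A = np.eye((N + 1) * d)
    for n in range(1, N + 1):
        A[n*d:(n + 1)*d, (n - 1)*d:n*d] = -phi
    return A

def solve_sequential(phi, u0, N):
    """Solve A U_F = f by block forward substitution: u_{n+1} = phi u_n."""
    U = [np.asarray(u0, dtype=float)]
    for n in range(N):                 # N sequential applications of phi
        U.append(phi @ U[-1])
    return np.concatenate(U)           # U_F = [u_0, ..., u_N]
```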

We describe parareal by considering a simple two-level temporal mesh for which m = 2, as displayed in Fig. 1. With this choice, the fine nodes are defined at all time points {t0,t1,t2,…,tN}, while the coarse nodes are defined at the even time points {t0,t2,…,tN}. At the initial step k = 0, the initial approximation of the coarse solution is obtained by applying \({\phi}_{\Delta T}\) sequentially, and the fine solution is obtained by interpolation. Letting \(\mathcal {F}(T_{n+1},T_{n},{U}^{k}_{n}):= {\phi }^{2} {U}^{k}_{n}\) be the fine propagator and \(\mathcal {G}(T_{n+1},T_{n},{U}^{k}_{n}):= {\phi }_{\Delta T} {U}^{k}_{n}\) the coarse propagator, the parareal iteration from (5) becomes,

$$ {U}^{k+1}_{n+1} = {\phi}_{\Delta T} {U}^{k+1}_{n} +{\phi}^{2} {U}^{k}_{n} - {\phi}_{\Delta T}{U}^{k}_{n}, $$

where \({U}^{k}_{n}\) corresponds to \({u}^{k}_{2n}\), the parareal solution at coarse time point \(t_{2n}\), \(n= 0,\dots ,N/2\), and iteration k. In detail, parareal computes the approximate solutions at the fine time points as follows,

$$ {u}^{k+1}_{i} = \left\{\begin{array}{cccc} \mathrm{u}_{0}, & \text{if} \ i &=&0, \\ {\phi}{u}^{k}_{i-1}, & \text{for} \ i &=&1, 3,\dots, N-1, \\ {\phi}_{\Delta T}{u}^{k+1}_{i-2} + {\phi}{u}^{k+1}_{i-1} - {\phi}_{\Delta T}{u}^{k}_{i-2} , & \text{for} \ i &=&2, 4,\dots,N. \end{array}\right. $$
(7)
Fig. 1 Two-level temporal mesh and parareal execution

As can be seen from Fig. 1, the fine approximate solutions can be computed in parallel based on the coarse approximate solutions from the previous iteration. More generally, for arbitrary m ≥ 2 we have,

$$ {U}^{k+1}_{n+1} = {\phi}_{\Delta T} {U}^{k+1}_{n} +{\phi}^{m} {U}^{k}_{n} - {\phi}_{\Delta T}{U}^{k}_{n}. $$
(8)

2.2 Expression of the standard residual correction scheme

As presented in, e.g., [26], the parareal algorithm can be seen as a preconditioned residual correction scheme for a reduced system, obtained from the original system (6), which involves only the solutions at coarse time points. For this, a coarse matrix \({A}_{C}\) represents the time steps of the coarse level (here we keep every second time point on each time interval), \({U}_{C}\) represents the unknown solutions, and \({f}_{C}\) the right-hand side at coarse time points,

$$ {A}_{C}{U}_{C} :=\begin{bmatrix} {\mathcal{I}} & & & & \\ -{\phi}^{2} & {\mathcal{I}} & & & \\ & {\ddots} &{\ddots} & &\\ & &-{\phi}^{2} &{\mathcal{I}} & \\ & & & -{\phi}^{2} & {\mathcal{I}} \end{bmatrix}\begin{bmatrix} {u}_{0}\\ {u}_{2}\\ \vdots\\ {u}_{N-2}\\ {u}_{N} \end{bmatrix} = \begin{bmatrix} \mathrm{u}_{0}\\ 0\\ \vdots\\ 0\\ 0 \end{bmatrix}=: {f}_{C}. $$
(9)

The reduced system (9) produces exactly the same solutions as the original system (6) at the coarse time points. A preconditioner \({\tilde {M}}\) which approximates the coarse matrix \({A}_{C}\) is obtained by approximating each fine time integration propagator \({\phi}^{2}\) by one coarse integration propagator \({\phi}_{\Delta T}\),

$$ {\tilde{M}}:=\begin{bmatrix} {\mathcal{I}} & & & & \\ -{\phi}_{\Delta T} & {\mathcal{I}} & & & \\ & {\ddots} &{\ddots} & &\\ & &-{\phi}_{\Delta T} &{\mathcal{I}} & \\ & & & -{\phi}_{\Delta T} & {\mathcal{I}} \end{bmatrix}. $$

By using the preconditioned stationary iteration at the coarse level, we obtain at iteration k,

$$ {U}_{C}^{k+1} = {U}_{C}^{k} + {\tilde{M}}^{-1}({f}_{C} - {A}_{C}{U}^{k}_{C}), $$
(10)

which can be written as follows,

$$ {\tilde{M}}({U}_{C}^{k+1}-{U}_{C}^{k}) = {f}_{C} - {A}_{C}{U}^{k}_{C}, $$
(11)

or explicitly written as follows,

$$ \begin{bmatrix} {\mathcal{I}} & & & & \\ -{\phi}_{\Delta T} & {\mathcal{I}} & & & \\ & {\ddots} &{\ddots} & &\\ & &-{\phi}_{\Delta T} &{\mathcal{I}} & \\ & & & -{\phi}_{\Delta T} & {\mathcal{I}} \end{bmatrix} \begin{bmatrix} {u}_{0}^{k+1}-{u}_{0}^{k}\\ {u}_{2}^{k+1}-{u}_{2}^{k}\\ \vdots\\ {u}_{N-2}^{k+1}-{u}_{N-2}^{k}\\ {u}_{N}^{k+1}-{u}_{N}^{k} \end{bmatrix} = \begin{bmatrix} \mathrm{u}_{0} - {u}_{0}^{k} \\ {\phi}^{2}{u}_{0}^{k} - {u}_{2}^{k}\\ \vdots\\ {\phi}^{2}{u}_{N-4}^{k}-{u}_{N-2}^{k}\\ {\phi}^{2}{u}_{N-2}^{k}-{u}_{N}^{k} \end{bmatrix}. $$
(12)

It can be easily seen that the solutions \({u}_{2i}^{k+1}\) for i = 0,…,N/2 obtained by solving (12) are the same as the solutions obtained by parareal in (7).
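One iteration of (10)-(12) can be written compactly as a residual computation followed by a forward solve with \({\tilde{M}}\); in the following sketch (the function name is ours), Phi_f plays the role of \({\phi}^{2}\), or more generally \({\phi}^{m}\), and Phi_c that of \({\phi}_{\Delta T}\), both given as d × d arrays:

```python
import numpy as np

def coarse_parareal_step(Phi_f, Phi_c, Uc, u0):
    """One iteration of (10): solve M~ (U^{k+1} - U^k) = f_C - A_C U^k by
    forward substitution; Uc is the list of coarse values [U_0, ..., U_NC]."""
    NC = len(Uc) - 1
    r = [u0 - Uc[0]] + [Phi_f @ Uc[n] - Uc[n + 1] for n in range(NC)]
    dU = [r[0]]
    for n in range(NC):                # forward solve with the bidiagonal M~
        dU.append(r[n + 1] + Phi_c @ dU[n])
    return [Uc[n] + dU[n] for n in range(NC + 1)]
```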

We now consider solving the system \({A}{U}_{F} = {f}\) from (6) at the fine level. We introduce a matrix \({M}_{SC}\) and show that the parareal algorithm is equivalent to solving \({A}{U}_{F} = {f}\) by using a stationary iteration preconditioned by \({M}_{SC}^{-1}\). The preconditioned stationary iteration for solving \({A}{U}_{F} = {f}\) at the fine level becomes,

$$ {U}_{F}^{k+1} = {U}_{F}^{k} + {M}_{SC}^{-1}({f} - {A}{U}^{k}_{F}), $$
(13)

or equivalently,

$$ {M}_{SC}({U}_{F}^{k+1} - {U}_{F}^{k})= {f} - {A}{U}^{k}_{F}. $$
(14)

Note that (14) acts at the fine level, so \({M}_{SC}\) is different from \({\tilde {M}}\) in (11). In other words, \({M}_{SC}\) has to handle the unknowns at both the coarse and the fine time points. The matrix \({M}_{SC}\) is defined in the following lemma.

Lemma 1

Let \(\mathcal {F}(T_{n+1},T_{n},{u}^{k}_{n}):= {\phi }^{m}{u}^{k}_{n} \) and \(\mathcal {G}(T_{n+1},T_{n},{u}^{k}_{n}):= {\phi }_{\Delta T}{u}^{k}_{n}\) denote the fine and the coarse solvers, respectively. For m ≥ 2, (14) is equivalent to the parareal algorithm with \({M}_{SC}\) defined as follows,

$$ {M}_{SC}:=\begin{bmatrix} {}{\mathcal{I}} \\ \left.\begin{array}{ccccc} & {}{\mathcal{I}} \\ & {}-{\phi} ~ ~~{\mathcal{I}}\\ & {\kern20pt}{\ddots} \\ -{\phi}_{\Delta T} & &{}-{\phi} ~~~{\mathcal{I}} \end{array}\right\}& md\times md \\ & {}{\ddots} \\ & {\mathcal{I}} \\ & -{\phi} & {}{\mathcal{I}}\\ & & & {\ddots} \\ &{} -{\phi}_{\Delta T} & & -{\phi} & ~{\mathcal{I}}\\ \end{bmatrix}. $$
(15)

Proof

For m = 2, (14) becomes,

$$ \begin{bmatrix} {\mathcal{I}} & & & & & \\ & {\mathcal{I}}& & & & \\ -{\phi}_{\Delta T}& -{\phi} & {\mathcal{I}} & & & \\ & & & {\ddots} & & \\ & & & & {\mathcal{I}} &\\ & & & -{\phi}_{\Delta T}& -{\phi} & {\mathcal{I}} \end{bmatrix} \begin{bmatrix} {u}_{0}^{k+1} - {u}_{0}^{k}\\ {u}_{1}^{k+1} - {u}_{1}^{k}\\ {u}_{2}^{k+1} - {u}_{2}^{k}\\ \vdots\\ {u}_{N-1}^{k+1} - {u}_{N-1}^{k}\\ {u}_{N}^{k+1} - {u}_{N}^{k} \end{bmatrix} = \begin{bmatrix} \mathrm{u}_{0} - {u}_{0}^{k} \\ {\phi} {u}_{0}^{k} - {u}_{1}^{k}\\ {\phi}{u}_{1}^{k} - {u}_{2}^{k}\\ \vdots\\ {\phi}{u}_{N-2}^{k} - {u}_{N-1}^{k}\\ {\phi}{u}_{N-1}^{k} - {u}_{N}^{k} \end{bmatrix}. $$
(16)

Simplifying (16) gives

$$ {u}^{k+1}_{i} = \left\{\begin{array}{cccc} \mathrm{u}_{0}, & \!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\text{if} \ i= 0, \\ {\phi}{u}^{k}_{i-1}, &\text{~~~~~~~for} \ i =1, 3,\dots, N-1, \\ {\phi}_{\Delta T} {u}_{i-2}^{k+1} + {\phi}{u}_{i-1}^{k+1} - {\phi}_{\Delta T}{u}_{i-2}^{k} , & \text{for} \ i =2, 4,\dots,N. \end{array}\right. $$

Generalizing for m > 2, for \(j= 0, m, 2m,\dots ,N-m\) and by induction, we have,

$$ \begin{array}{@{}rcl@{}} {u}_{0}^{k+1} &=& \mathrm{u}_{0},\\ {u}_{j+1}^{k+1} &=& {\phi}{u}_{j}^{k},\\ {u}_{j+2}^{k+1} &=& {\phi}{u}_{j+1}^{k+1},\\ \vdots\\ {u}_{j+m-1}^{k+1} &=& {\phi}{u}_{j+m-2}^{k+1},\\ {u}_{j+m}^{k+1} &=& {\phi}_{\Delta T} {u}_{j}^{k+1} + {\phi}{u}_{j+m-1}^{k+1} - {\phi}_{\Delta T}{u}_{j}^{k} = {\phi}_{\Delta T} {u}_{j}^{k+1} + {\phi}^{m}{u}_{j}^{k} - {\phi}_{\Delta T}{u}_{j}^{k}, \end{array} $$

which is identical to (8) and concludes the proof. □

Hence, instead of solving the system (6) by using a direct method, the parareal algorithm uses the stationary iteration defined in (13) preconditioned by \({M}_{SC}\) as defined in (15). In addition, Krylov subspace methods such as GMRES can be used to accelerate the convergence of parareal. In the numerical experiments of Section 6.3, we present results obtained by using GMRES for solving the preconditioned linear system,

$$ {M}^{-1}_{SC}{A}{U}_{F} = {M}^{-1}_{SC}{f}. $$

It will be seen that GMRES slightly improves the convergence of parareal and allows one to solve problems for which parareal has difficulty converging, as in the case when the advection and reaction coefficients are large compared to the diffusion term in the advection-reaction-diffusion problem. However, in general it does not drastically improve the convergence of parareal for our test problems; this was also observed in previous works such as [29], which studied the acceleration of waveform relaxation methods.
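As an illustration of this acceleration, the following self-contained sketch (scalar case d = 1, with propagator values that are our illustrative placeholders, not the paper's test cases) solves the preconditioned system with SciPy's GMRES, applying \({M}^{-1}_{SC}\) through a LinearOperator; the dense triangular solve stands in for the forward substitution used in practice:

```python
import numpy as np
from scipy.sparse.linalg import LinearOperator, gmres

m, NC = 4, 8
N = NC * m
phi = 0.95                  # fine propagator over one fine step (placeholder)
phi_c = phi**m + 0.01       # coarse propagator approximating phi^m
A = np.eye(N + 1) - np.diag(phi * np.ones(N), -1)   # system (6)
Msc = np.eye(N + 1)                                 # preconditioner (15)
for i in range(NC):
    # rows i*m+2 .. (i+1)*m carry -phi; row i*m+1 (the first fine point of
    # each subdomain) stays an identity row, cf. (16)
    for j in range(i*m + 2, (i + 1)*m + 1):
        Msc[j, j - 1] = -phi
    Msc[(i + 1)*m, i*m] -= phi_c   # coarse coupling to the previous coarse point
f = np.zeros(N + 1); f[0] = 1.0
M = LinearOperator(A.shape, matvec=lambda r: np.linalg.solve(Msc, r))
U, info = gmres(A, f, M=M, atol=1e-12)
print(info, np.linalg.norm(A @ U - f))
```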

3 Interpretation of parareal as a two-level additive Schwarz in time preconditioner

In this section, we present an interpretation of parareal as a two-level domain decomposition method. For this, we show that the inverse of the preconditioner \({M}_{SC}\) from (15) can be expressed as a first-level additive Schwarz preconditioner, which relies on using the fine propagator \({\phi}^{m}\) in each time subdomain, followed by a coarse time correction based on the coarse propagator \({\phi}_{\Delta T}\).

We first introduce some notation. Let \({A} \in \mathbb {R}^{(N+1)d\times (N+1)d}\) be the time-stepping matrix as defined in Section 2.1. The matrices \({\mathbb {I}} \in \mathbb {R}^{(N+1)d\times (N+1)d}\) and \({\mathcal {I}}\in \mathbb {R}^{d\times d}\) are identity matrices. The matrix A is decomposed into \(N_{C} + 1\) non-overlapping subdomains \(\{{\Omega }_{i}\}_{1\leq i\leq N_{C}+1}\), where \(N_{C} = N/m\) denotes the number of coarse time intervals. The matrix A is a block matrix, with blocks \(\{A_{ij}\}_{1\leq i,j\leq N+1} \in \mathbb {R}^{d\times d}\). As displayed in (6), \(A_{ij}\) can be \(-{\phi}\), the identity, or the zero matrix. Let \(\mathfrak {N}= \{1,\dots ,N+1\}\) be the set of indices of A, which corresponds to the fine time steps {t0,…,tN}. Let \(\mathfrak {N}_{i}\), \(i \in \{1,\dots ,N_{C}+1\}\), be the subset of \(\mathfrak {N}\) containing the indices of subdomain i; we define \(\mathfrak {N}_{i}\) as follows,

$$ \mathfrak{N}_{i} = \left\{\begin{array}{cccc} \{1\}, & \text{if} \ i &=&1, \\ \{m(i-2)+2, \ldots, m(i-1)+1\}, & \text{for} \ i &=&2,\dots,N_{C}+1, \end{array}\right. $$
(17)

the restriction matrix Ri is defined as follows,

$$ {R}_{i} = \left\{\begin{array}{ll} ~~~~~{\mathcal{I}},~~~~~~~ \text{if} \ i =\quad\qquad1, \\ {\mathbb{I}}(\mathfrak{N}_{i},:), \text{for} \ i =2,\dots,N_{C}+1, \end{array}\right. $$
(18)

where \({\mathbb {I}}(\mathfrak {N}_{i},:)\) denotes the submatrix of \({\mathbb {I}}\) formed by the rows whose indices belong to \(\mathfrak {N}_{i}\). The prolongation matrix \({R}_{i}^{T}\) is the transpose of Ri. The subdomain matrices \(\{{A}_{i}\}_{1\leq i\leq N_{C}+1}\) are defined as follows,

$$ {A}_{i} = \left\{\begin{array}{lccc} \qquad\qquad\qquad{\mathcal{I}}, \qquad\qquad\qquad\qquad\text{if} \ i =\qquad1, \\ {R}_{i}A{R}_{i}^{T} = \begin{bmatrix} {\mathcal{I}} & & & & \\ -{\phi} & {\mathcal{I}} & & & \\ & {\ddots} &{\ddots} & &\\ & &-{\phi} &{\mathcal{I}} & \\ & & & -{\phi} & {\mathcal{I}} \end{bmatrix}, \text{for} \ i =2,\dots,N_{C}+1. \end{array}\right. $$

For i ≥ 2, \({A}_{i}= {R}_{i}{A}{R}_{i}^{T} \) is an md × md block matrix. The inverse of Ai can be computed as follows,

$$ {A}_{i}^{-1} = \left\{\begin{array}{cc} ~~~~~~~~~~~~{\mathcal{I}}, ~~~~~~~~~~~~~~~~~~~\quad\quad\quad\quad\text{if} \ i =\qquad\quad1, \\ \begin{bmatrix} {\mathcal{I}} & & & & \\ {\phi} & {\mathcal{I}} & & & \\ \phi^{2} & \phi &\mathcal{I}\\ \phi^{3} &\phi^{2} & \phi &\mathcal{I}\\ \vdots&{\vdots} & & {\ddots} &{\ddots} & &\\ \phi^{m-2}& \phi^{m-3} & &\ldots& {\phi} & &{\mathcal{I}} & \\ \phi^{m-1}&\phi^{m-2} & & \ldots& & {\phi} & & {\mathcal{I}} \end{bmatrix}, ~~~~ \text{for} \ i =2,\dots,N_{C}+1. \end{array}\right. $$

The first-level additive Schwarz in time preconditioner is \({\sum }_{i=1}^{N_{C}+1}{R}_{i}^{T} {A}_{i}^{-1} {R}_{i}\), as presented in Section 1. The second-level coarse time correction is defined as follows. Let \(\mathfrak {N}_{0} = \{1+ i m\}_{0\leq i\leq N_{C}}\) be the set of indices corresponding to the coarse time points and \(A_{0} \in \mathbb {R}^{(N_{C} +1)d \times (N_{C} +1 )d}\) be the coarse matrix that solves the reduced system from (6) at every coarse time point by using the coarse integration propagator \({\phi}_{\Delta T}\),

$$ A_{0} = \begin{bmatrix} {\mathcal{I}} & & & & \\ -{\phi}_{\Delta T} & {\mathcal{I}} & & & \\ & {\ddots} &{\ddots} & &\\ & &-{\phi}_{\Delta T} &{\mathcal{I}} & \\ & & & -{\phi}_{\Delta T} & {\mathcal{I}} \end{bmatrix}. $$
(19)

The coarse problem at coarse time points in the time domain is obtained by using a restriction matrix \({R}_{0} \in \mathbb {R}^{(N_{C} +1)d \times (N+1)d}\), defined such that the entries of R0 are identities at positions corresponding to the coarse time points and 0 elsewhere. In particular, R0 is defined as follows,

$$ {R}_{0} = {\mathbb{I}}(\mathfrak{N}_{0},:), $$
(20)

in which \(\mathfrak {N}_{0}= \{1,1+m,1+2m,\dots ,1+N_{C}m\}\) and the prolongation matrix for the coarse problem is the transpose of R0. The inverse of A0 can be computed as follows,

$$ A_{0}^{-1} = \begin{bmatrix} {\mathcal{I}} & & & & \\ {\phi_{\Delta T}} & {\mathcal{I}} & & & \\ \phi_{\Delta T}^{2} & \phi_{\Delta T} &\mathcal{I}\\ \phi_{\Delta T}^{3} &\phi_{\Delta T}^{2} & \phi_{\Delta T} &\mathcal{I}\\ \vdots& {\vdots} & & {\ddots} &{\ddots} & &\\ \phi_{\Delta T}^{N_{C}-1}& \phi_{\Delta T}^{N_{C}-2} & & \ldots&{\phi_{\Delta T}} &{\mathcal{I}} & \\ \phi_{\Delta T}^{N_{C}}&\phi_{\Delta T}^{N_{C}-1} & & \ldots& & {\phi_{\Delta T}} & &{\mathcal{I}} \end{bmatrix}. $$

Lemma 2

The matrix MSC defined in (15) can be factored as follows,

$$ {M}_{SC} = \left( \sum\limits_{i=1}^{N_{C}+1}{R}_{i}^{T}{A}_{i}{R}_{i} \right) ({R}_{0}^{T}A_{0}{R}_{0} + {\mathbb{I}} - {R}_{0}^{T}{R}_{0}), $$

and the additive Schwarz in time preconditioner \({M}_{SC}^{-1}\) is formed by the product of the additive Schwarz term \({\sum }_{i=1}^{N_{C}+1}{R}_{i}^{T}{A}_{i}^{-1}{R}_{i}\) and the coarse time correction term \({R}_{0}^{T}A_{0}^{-1}{R}_{0} + {\mathbb {I}} - {R}_{0}^{T}{R}_{0}\),

$$ {M}_{SC}^{-1} = ({R}_{0}^{T}A_{0}^{-1}{R}_{0} + {\mathbb{I}} - {R}_{0}^{T}{R}_{0}) \left( \sum\limits_{i=1}^{N_{C}+1}{R}_{i}^{T}{A}_{i}^{-1}{R}_{i} \right). $$
(21)

Proof

We have

$$ \left( \sum\limits_{i=1}^{N_{C}+1}{R}_{i}^{T}{A}_{i}{R}_{i} \right) ({R}_{0}^{T}A_{0}{R}_{0} + {\mathbb{I}} - {R}_{0}^{T}{R}_{0}) $$
$$ \begin{array}{@{}rcl@{}} \!\!&=&\!\! \begin{bmatrix} {}{\mathcal{I}} \\ \hspace{0.4cm}\left.\begin{array}{ccccc} & {\kern1pt}{\mathcal{I}} \\ & &{} -{\phi} {\kern4pt}{\mathcal{I}}\\ & &&{\kern1pt}{\ddots} \\ & &&&{} -{\phi} {\kern4pt}{\mathcal{I}} \end{array}\right\}& md\!\times\! md \\ & {\ddots} \\ & \hspace{0.2cm}{\mathcal{I}} \\ & -{\phi} & {\mathcal{I}}\\ & & & {\ddots} \\ & & & -{\phi} & {\mathcal{I}}\\ \end{bmatrix} \!\!\begin{bmatrix} {}{\mathcal{I}} \\ \left.\begin{array}{lllll} & {\mathcal{I}} \\ && {\mathcal{I}}\\ & &&{\ddots} \\ -{\phi}_{\Delta T} &&&& {\mathcal{I}} \end{array}\right\}& md\!\times\! md & \\ & {\ddots} \\ & {\mathcal{I}} \\ & & {\mathcal{I}}\\ & & & {}{\ddots} \\ & -{\phi}_{\Delta T}& & & {\mathcal{I}}\\ \end{bmatrix} \\ \!\!&=&\!\! \begin{bmatrix} {\mathcal{I}} \\ \left.\begin{array}{lllll} & {\mathcal{I}} \\ & -{\phi} & {\mathcal{I}}\\ & &&{\ddots} \\ -{\phi}_{\Delta T} &&& -{\phi}& {\mathcal{I}} \end{array}\right\}& md\times md \\ & {}{\ddots} \\ & {}{\mathcal{I}} \\ & {}-{\phi} & {}{\mathcal{I}}\\ & & {} {\ddots} \\ & -{\phi}_{\Delta T}& -{\phi} & {\mathcal{I}} \\ \end{bmatrix}= {M}_{SC}. \end{array} $$

We observe that the matrix \(({R}_{0}^{T}A_{0}{R}_{0} + {\mathbb {I}} - {R}_{0}^{T}{R}_{0})\) can be permuted to a matrix whose first diagonal block is A0 followed by an identity matrix. Additionally the term \({\sum }_{i=1}^{N_{C}+1}{R}_{i}^{T}{A}_{i}{R}_{i}\) is a block diagonal matrix. Thus, we obtain,

$$ \begin{array}{@{}rcl@{}} {M}_{SC}^{-1} &=& ({R}_{0}^{T}A_{0}{R}_{0} + {\mathbb{I}} - {R}_{0}^{T}{R}_{0})^{-1} \left( {\sum}_{i=1}^{N_{C}+1}{R}_{i}^{T}{A}_{i}{R}_{i} \right)^{-1}\\ &=& ({R}_{0}^{T}A_{0}^{-1}{R}_{0} + {\mathbb{I}} - {R}_{0}^{T}{R}_{0}) \left( {\sum}_{i=1}^{N_{C}+1}{R}_{i}^{T}{A}_{i}^{-1}{R}_{i} \right). \end{array} $$

The preconditioner \({M}_{SC}^{-1}\) is applied to a vector at each iteration of the residual correction scheme (13). The inverses \({A}_{i}^{-1}\) and \(A_{0}^{-1}\) are never formed explicitly; since \({A}_{i}\) and \(A_{0}\) are block lower bidiagonal, they are applied to a vector by a forward substitution. We refer to this preconditioner as the SC two-level additive Schwarz in time preconditioner. It is formed by the additive Schwarz preconditioner \({\sum }_{i=1}^{N_{C}+1}{R}_{i}^{T}{A}_{i}^{-1}{R}_{i}\), which corresponds to the use of the fine propagators computed in parallel, followed by a coarse time correction \({R}_{0}^{T}A_{0}^{-1}{R}_{0} + {\mathbb {I}} - {R}_{0}^{T}{R}_{0}\), which corresponds to the use of the coarse propagator computed sequentially.
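In code, applying \({M}_{SC}^{-1}\) to a residual vector thus consists of independent forward substitutions on the time subdomains, followed by one forward substitution on the coarse time points. The following sketch (our illustration of Lemma 2, with phi and phi_c given as d × d arrays) makes this explicit:

```python
import numpy as np

def apply_Msc_inv(phi, phi_c, r, N, m):
    """z = M_SC^{-1} r following Lemma 2: forward substitution with A_i on each
    time subdomain (parallel over subdomains), then the coarse correction,
    i.e., a forward substitution with A_0 on the coarse time points only."""
    d = r.size // (N + 1)
    NC = N // m
    z = r.reshape(N + 1, d).copy()
    for i in range(NC):                          # additive Schwarz fine solves
        for j in range(i*m + 2, (i + 1)*m + 1):
            z[j] += phi @ z[j - 1]
    for l in range(1, NC + 1):                   # coarse correction with A_0
        z[l*m] += phi_c @ z[(l - 1)*m]
    return z.reshape(-1)
```

Each fine solve costs one application of phi per fine time step inside its subdomain, and the subdomain loop is embarrassingly parallel; only the final loop over the coarse points is sequential.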

Corollary 1

Solving (6) by using parareal is equivalent to using the residual correction scheme from (13) at the fine level, preconditioned by the SC two-level additive Schwarz in time preconditioner. Each iteration becomes:

$$ {U}_{F}^{k+1} = {U}_{F}^{k} + ({R}_{0}^{T}A_{0}^{-1}{R}_{0} + {\mathbb{I}} - {R}_{0}^{T}{R}_{0})\sum\limits_{j=1}^{N_{C}+1}{R}_{j}^{T}{A}_{j}^{-1}{R}_{j}({f} - {A}{U}_{F}^{k}). $$
(22)

Proof

The proof follows by combining Lemmas 1 and 2. □

We illustrate these results by considering the simple linear time-dependent problem (4) with m = 2, as can be seen in Fig. 2. After discretization, the linear system from (6) needs to be solved. We first decompose the whole time domain into non-overlapping subdomains with indices \(\mathfrak {N}_{i}\) given by (17), with the restriction matrices \({R}_{1} \in \mathbb {R}^{d\times (N+1)d}, {R}_{i} \in \mathbb {R}^{md\times (N+1)d}\), and the prolongation matrices \({R}_{1}^{T} \in \mathbb {R}^{(N+1)d\times d}, {R}_{i}^{T} \in \mathbb {R}^{(N+1)d\times md}\) for \(i= 2,\dots ,N_{C}+1\) satisfying (18), such that their entries are \({\mathcal {I}}\) at positions corresponding to the i-th subdomain and 0 elsewhere, specifically,

$$ \begin{array}{@{}rcl@{}} {R}_{1} &=& \begin{bmatrix} {\mathcal{I}} & 0 & 0 & 0 &\dots& 0 & 0 \end{bmatrix}, {R}_{2} = \begin{bmatrix} 0 & {\mathcal{I}} & 0 & 0 & 0 &{\dots} & 0\\ 0 & 0 & {\mathcal{I}} & 0 & 0&{\dots} & 0\\ \end{bmatrix}, \ldots,\\ {R}_{N_{C}+1} &=& \begin{bmatrix} 0 & 0 & 0 & 0 & {\dots} & {\mathcal{I}} & 0\\ 0 & 0 & 0 & 0 &{\dots} & 0 & {\mathcal{I}} \end{bmatrix}. \end{array} $$
Fig. 2 Non-overlapping time subdomains with m = 2. The fine nodes are defined at all time points \(\{t_{0},t_{1},t_{2},\dots ,t_{N}\}\) and the coarse nodes are defined at even time points \(\{t_{0},t_{2},t_{4},\dots ,t_{N}\}\). The first time subdomain is always defined at {t0}, while the following time subdomains are defined at \(\{t_{n},t_{n+1}\}\) for \(n= 1,\dots ,N-1\)

The subdomain matrices \({A}_{i} = {R}_{i}{A}{R}_{i}^{T}\), for i = 1,…,NC + 1, become,

$$ {A}_{i} = \left\{\begin{array}{clc} {\mathcal{I}}, & \text{for} \ i =1, \\ \begin{bmatrix} {\mathcal{I}} & 0\\ -{\phi}& {\mathcal{I}} \end{bmatrix}, & \text{for} \ i =2,\dots,N_{C}+1. \end{array}\right. $$

Let \(\mathfrak {N}_{0} = \{1,3,5,\dots ,N+1\}\) be the set of indices corresponding to the coarse time points \(\{t_{0},t_{2},\ldots,t_{N}\}\), as displayed in Fig. 3. Let \(A_{0} \in \mathbb {R}^{(N_{C} +1)d \times (N_{C} +1 )d}\) be the coarse matrix as defined in (19) and let the restriction matrix \({R}_{0} \in \mathbb {R}^{(N_{C} +1) d\times (N+1)d}\) satisfy (20), namely,

$$ {R}_{0} = \begin{bmatrix} {\mathcal{I}} & 0 & 0 & 0 & 0 & {\dots} & 0\\ 0 & 0 & {\mathcal{I}} & 0 & 0& {\dots} & 0\\ 0 & 0 & 0 & 0 & {\mathcal{I}}& {\dots} & 0\\ \vdots&{\vdots} &{\vdots} &{\vdots} &{\vdots} & \ddots&\vdots\\ 0 & 0 & 0 & 0 & 0 & 0 & {\mathcal{I}} \end{bmatrix}. $$

The matrix MSC becomes,

$$ \begin{array}{@{}rcl@{}} {M}_{SC} &=& \sum\limits_{i=1}^{N_{C}+1}{R}_{i}^{T}{A}_{i}{R}_{i}({R}_{0}^{T}A_{0}{R}_{0} + {\mathbb{I}} - {R}_{0}^{T}{R}_{0})\\ &=&\begin{bmatrix} {\mathcal{I}} & & & & & & &\\ 0& {\mathcal{I}} & & & & & &\\ & -{\phi} &{\mathcal{I}} & & & & &\\ & &0 &{\mathcal{I}} & & & & \\ & & &-{\phi} &{\mathcal{I}} & & & \\ & & & &{\ddots} &{\ddots} & &\\ & & & & &0 &{\mathcal{I}} &\\ & & & & & &-{\phi} & &{\mathcal{I}}\\ \end{bmatrix} \begin{bmatrix} {\mathcal{I}} & & & & & & & \\ 0& {\mathcal{I}}& & & & & & \\ -{\phi}_{\Delta T}&0 & {\mathcal{I}} & & & &\\ & & 0 & {\mathcal{I}} & & & \\ & & -{\phi}_{\Delta T}& 0 & {\mathcal{I}}\\ & & & {\ddots} & &{\ddots} & & \\ & & & & 0 & 0 &{\mathcal{I}} & \\ & & & & &-{\phi}_{\Delta T}& 0 & & {\mathcal{I}} \end{bmatrix}\\ &=& \begin{bmatrix} {\mathcal{I}} & & & & & & & \\ 0& {\mathcal{I}}& & & & & & \\ -{\phi}_{\Delta T}&-{\phi} & {\mathcal{I}} & & & &\\ & & 0 & {\mathcal{I}} & & & \\ & & -{\phi}_{\Delta T}& -{\phi} & {\mathcal{I}}\\ & & & {\ddots} & &{\ddots} & & \\ & & & & 0 & 0 &{\mathcal{I}} & \\ & & & & &-{\phi}_{\Delta T}& -{\phi} & &{\mathcal{I}} \end{bmatrix}. \end{array} $$
(23)

It can be seen that \({M}_{SC}\) from (23) is the same as the matrix \({M}_{SC}\) defined in Lemma 1 for the case m = 2. The preconditioner \({M}_{SC}^{-1}\) is computed following Lemma 2,

$$ {M}_{SC}^{-1} = ({R}_{0}^{T}A_{0}^{-1}{R}_{0} + {\mathbb{I}} - {R}_{0}^{T}{R}_{0})\sum\limits_{i=1}^{N_{C}+1}{R}_{i}^{T}{A}_{i}^{-1}{R}_{i}, $$

and Corollary 1 then gives the residual correction scheme (22) for problem (4) at the fine level with the SC two-level additive Schwarz in time preconditioner \(M_{SC}^{-1}\), which is equivalent to parareal.

Fig. 3 Coarse time correction defined at even time points \(\{t_{0},t_{2},t_{4},\dots ,t_{N}\}\)

It was shown in a series of papers, e.g., [13, 21], that MGRIT with F-relaxation is equivalent to the parareal algorithm. We now show that MGRIT with F-relaxation is also equivalent to the SC two-level additive Schwarz in time preconditioner by computing the error propagation matrix at coarse time points. The error propagation of (22) is governed by

$$ {e}^{k+1} = ({\mathbb{I}} - {M}_{SC}^{-1}{A}){e}^{k}, $$
(24)

where \({e}^{k} := {U}_{F} - {U}_{F}^{k}\), and \({U}_{F}, {U}_{F}^{k}\) denote the exact solution and the approximate solution, respectively. The iteration matrix has the form,

$$ {\mathbb{I}} - {M}_{SC}^{-1}{A} = {\mathbb{I}} - ({R}_{0}^{T}A_{0}^{-1}{R}_{0} + {\mathbb{I}} - {R}_{0}^{T}{R}_{0})\sum\limits_{i=1}^{N_{C}+1}{R}_{i}^{T}{A}_{i}^{-1}{R}_{i}{A}. $$

Note that we consider \({M}_{SC}^{-1}\) as a two-level additive Schwarz preconditioner in the time domain and that the matrix A is not symmetric. Hence, we cannot exploit the theory of Schwarz-type algorithms for symmetric positive definite matrices, for which the preconditioned system \(M_{SC}^{-1}{A}\) can be expressed as a sum of orthogonal projection matrices \(P_{i}\), \(i =1,2,\dots,N_{C}+1\); for further details, see [23]. Instead, we study the error propagation matrix produced in the residual correction scheme (22). The following lemma shows that the error propagation matrix produces exactly the same error at the coarse time points after one iteration as MGRIT with F-relaxation, for which the error is given in [25, Lemma 3.1].

Remark 2

The error propagation matrix \({\mathbb {I}} - {M}_{SC}^{-1}{A}\) in (24) describes the propagation of errors of (22) at both the coarse and the fine levels. In the following sections, e.g., Lemmas 3, 4, 5, and 6, for the convenience of comparison with parareal and the variants, we only consider the error propagation matrices at the coarse level. We also remark that ϕ and \({\phi}_{\Delta T}\) commute due to the assumption that they can be diagonalized by the same set of eigenvectors.

Lemma 3

Let UF be the exact solution of (6), \({U}_{F}^{k}\) be an approximate solution from (13), \({e}^{k} := {U}_{F} - {U}_{F}^{k}\) and denote by \({e}^{k}_{j}\) the error at iteration k and time tj with \(j= 1, 2,\dots , N\). The error at coarse time points generated at iteration k + 1 of (13) with SC two-level additive Schwarz in time preconditioner defined in (21) satisfies:

$$ \begin{array}{@{}rcl@{}} {e}^{k+1}_{0} &=& 0, \\ {e}_{hm}^{k+1} &=& \sum\limits_{r=0}^{h-1}{\phi}_{\Delta T}^{h-1-r}({\phi}^{m}-{\phi}_{\Delta T}){e}_{rm}^{k}, \ \ h=1,2,\dots,N_{C}. \end{array} $$
(25)

Proof

We denote by \({e}_{C,SC}^{k}\) and \({e}_{C,SC}^{k+1}\) the errors at the coarse time points at iterations k and k + 1, respectively, and by \({E}_{SC}\) the restriction of \({\mathbb {I}} - {M}_{SC}^{-1}{A}\) to the coarse time points, i.e., the error propagation matrix at coarse time points for the SC two-level additive Schwarz in time preconditioner. The error propagation from (24) at the coarse time points yields,

$$ {e}_{C,SC}^{k+1} = {E}_{SC}{e}_{C,SC}^{k}, $$
(26)

Note that ϕ and \({\phi}_{\Delta T}\) commute, so (26) can also be written as follows,

$$ \begin{bmatrix} {e}^{k+1}_0\\ {e}^{k+1}_m\\ {e}^{k+1}_{2m}\\ {e}^{k+1}_{3m}\\ \vdots\\ {e}^{k+1}_{N_Cm} \end{bmatrix} = \begin{bmatrix} 0 & 0& 0& \dots&0& 0 \\ {\phi}^m - {\phi}_{\Delta T} & 0 & 0& \dots&0 & 0\\ {\phi}_{\Delta T}({\phi}^m - {\phi}_{\Delta T})& {\phi}^m - {\phi}_{\Delta T}& 0& \dots&0 & 0\\ {\phi}_{\Delta T}^2({\phi}^m - {\phi}_{\Delta T}) & {\phi}_{\Delta T}({\phi}^m - {\phi}_{\Delta T}) & {\phi}^m - {\phi}_{\Delta T} & {\dots} &0 & 0\\ {\vdots} &{\vdots} & {\vdots} & {\ddots} &\vdots&\vdots\\ {\phi}_{\Delta T}^{N_C-1}({\phi}^m - {\phi}_{\Delta T}) &{\phi}_{\Delta T}^{N_C-2}({\phi}^m - {\phi}_{\Delta T}) & {\phi}_{\Delta T}^{N_C-3}({\phi}^m - {\phi}_{\Delta T}) & {\dots} & {\phi}^m - {\phi}_{\Delta T} & 0 \end{bmatrix} \begin{bmatrix} {e}^{k}_0\\ {e}^{k}_m\\ {e}^{k}_{2m}\\ {e}^{k}_{3m}\\ \vdots\\ {e}^{k}_{N_Cm} \end{bmatrix}. $$

Equation (25) follows. □
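Lemma 3 can be checked numerically in the scalar case by assembling \({M}_{SC}\) from (15), forming \({\mathbb{I}} - {M}_{SC}^{-1}{A}\), and comparing its restriction to the coarse time points with the matrix above; the propagator values below are our illustrative placeholders:

```python
import numpy as np

m, NC = 3, 4
N = NC * m
phi, phi_c = 0.9, 0.9**m + 0.05     # illustrative scalar propagators
A = np.eye(N + 1) - np.diag(phi * np.ones(N), -1)
Msc = np.eye(N + 1)                 # assemble M_SC from (15)
for i in range(NC):
    for j in range(i*m + 2, (i + 1)*m + 1):
        Msc[j, j - 1] = -phi
    Msc[(i + 1)*m, i*m] -= phi_c
E = np.eye(N + 1) - np.linalg.solve(Msc, A)
Ec = E[::m][:, ::m]                 # restriction to the coarse time points
ref = np.zeros((NC + 1, NC + 1))    # entries predicted by Lemma 3
for h in range(1, NC + 1):
    for r in range(h):
        ref[h, r] = phi_c**(h - 1 - r) * (phi**m - phi_c)
print(np.allclose(Ec, ref))         # expected: True
```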

4 Variants of SC two-level additive Schwarz in time preconditioner and convergence analysis

In this section, we study several variants of SC two-level additive Schwarz in time preconditioner and discuss their equivalence with MGRIT with FCF-relaxation, MGRIT with F(CF)2-relaxation, or overlapping parareal. In addition, we derive a method, referred to as SCS2 two-level additive Schwarz in time preconditioner, and discuss its suitability for exploiting parallel computing.

We first describe the SCS variant of SC two-level additive Schwarz in time preconditioner. It is obtained by first applying SC two-level additive Schwarz in time preconditioner, that is one fine solve followed by one coarse solve, and then adding one more fine solve. In detail, one iteration of the residual correction scheme is performed as follows:

$$ \begin{array}{@{}rcl@{}} {U}_{F}^{k+\frac{1}{2}} &=& {U}_{F}^{k} + ({R}_{0}^{T}A_{0}^{-1}{R}_{0} + {\mathbb{I}} - {R}_{0}^{T}{R}_{0})\sum\limits_{j=1}^{N_{C}+1}{R}_{j}^{T}{A}_{j}^{-1}{R}_{j}({f} - {A}{U}_{F}^{k}),\\ {U}_{F}^{k+1} &=& {U}_{F}^{k+\frac{1}{2}} + \sum\limits_{j=1}^{N_{C}+1}{R}_{j}^{T}{A}_{j}^{-1}{R}_{j}({f} - {A}{U}_{F}^{k+\frac{1}{2}}). \end{array} $$

The error propagation matrix is defined as follows,

$$ \left[{\mathbb{I}} - \sum\limits_{j=1}^{N_{C}+1}{R}_{j}^{T}{A}_{j}^{-1}{R}_{j}{A}\right]\left[{\mathbb{I}} - ({R}_{0}^{T}A_{0}^{-1}{R}_{0} + {\mathbb{I}} - {R}_{0}^{T}{R}_{0})\sum\limits_{j=1}^{N_{C}+1}{R}_{j}^{T}{A}_{j}^{-1}{R}_{j}{A}\right]. $$

The following lemma gives the error propagation of the SCS variant of the SC two-level additive Schwarz in time preconditioner. It can be seen that the error propagation matrix produces exactly the same error at the coarse time points after one iteration as MGRIT with FCF-relaxation; the corresponding result for MGRIT with FCF-relaxation is described in [25, Lemma 3.2].

Lemma 4

Let \({U}_{F}\) be the exact solution of (6), \({U}_{F}^{k}\) be an approximate solution from (13), \({e}^{k} := {U}_{F} - {U}_{F}^{k}\), and denote by \({e}^{k}_{j}\) the error at iteration k and time \(t_{j}\) with \(j= 1, 2,\dots , N\). The error at coarse time points generated at iteration k + 1 of the residual correction scheme from (13) preconditioned by the SCS two-level additive Schwarz in time preconditioner satisfies:

$$ \begin{array}{@{}rcl@{}} {e}^{k+1}_{0} &=& 0,\\ {e}^{k+1}_{m} &=& 0,\\ {e}_{hm}^{k+1} &=& \sum\limits_{r=0}^{h-2}{\phi}_{\Delta T}^{h-2-r}({\phi}^{m}-{\phi}_{\Delta T}){\phi}^{m}{e}_{rm}^{k}, \ \ h=2,3,\dots,N_{C}. \end{array} $$
(27)

Proof

We denote by \({e}_{C,SCS}^{k}\) and \({e}_{C,SCS}^{k+1}\) the errors at coarse time points at iteration k and k + 1 respectively and by ESCS the error propagation matrix at coarse time points for SCS two-level additive Schwarz in time preconditioner. We have the relation,

$$ {e}_{C,SCS}^{k+1} = {E}_{SCS}{e}_{C,SCS}^{k}, $$
(28)

in which \({E}_{SCS} =\)

$$ \begin{bmatrix} 0 & 0& 0 &0&\dots&0 &0 \\ 0 & 0 & 0 &0&\dots&0 &0\\ ({\phi}^m - {\phi}_{\Delta T}){\phi}^m& 0 & 0 &0&\dots&0&0\\ {\phi}_{\Delta T}({\phi}^m - {\phi}_{\Delta T}){\phi}^m &({\phi}^m - {\phi}_{\Delta T}){\phi}^m&0&0&\dots&0&0\\ {\phi}_{\Delta T}^2({\phi}^m - {\phi}_{\Delta T}){\phi}^m &{\phi}_{\Delta T}({\phi}^m - {\phi}_{\Delta T}){\phi}^m&({\phi}^m - {\phi}_{\Delta T}){\phi}^m&0&\dots&0&0\\ {\vdots} & {\vdots} & {\vdots} & {\ddots} & \vdots& \vdots& {\vdots} \\ {\phi}_{\Delta T}^{N_C-2}({\phi}^m - {\phi}_{\Delta T}){\phi}^m &{\phi}_{\Delta T}^{N_C-3}({\phi}^m - {\phi}_{\Delta T}){\phi}^m&{\phi}_{\Delta T}^{N_C-4}({\phi}^m - {\phi}_{\Delta T}){\phi}^m & {\dots} & ({\phi}^m - {\phi}_{\Delta T}){\phi}^m& 0&0 \end{bmatrix}. $$

The relations in (27) follow. □

The SCS2 variant of SC two-level additive Schwarz in time preconditioner is obtained by adding one more fine solve based on additive Schwarz as follows:

$$ \begin{array}{@{}rcl@{}} {U}_{F}^{k+\frac{1}{3}} &=& {U}_{F}^{k} + ({R}_{0}^{T}A_{0}^{-1}{R}_{0} + {\mathbb{I}} - {R}_{0}^{T}{R}_{0})\sum\limits_{j=1}^{N_{C}+1}{R}_{j}^{T}{A}_{j}^{-1}{R}_{j}({f} - {A}{U}_{F}^{k}),\\ {U}_{F}^{k+\frac{1}{2}} &=& {U}_{F}^{k+\frac{1}{3}} + \sum\limits_{j=1}^{N_{C}+1}{R}_{j}^{T}{A}_{j}^{-1}{R}_{j}({f} - {A}{U}_{F}^{k+\frac{1}{3}}),\\ {U}_{F}^{k+1} &=& {U}_{F}^{k+\frac{1}{2}} + \sum\limits_{j=1}^{N_{C}+1}{R}_{j}^{T}{A}_{j}^{-1}{R}_{j}({f} - {A}{U}_{F}^{k+\frac{1}{2}}). \end{array} $$

The error propagation matrix is defined as follows,

$$ \left[{\mathbb{I}} - \sum\limits_{j=1}^{N_{C}+1}{R}_{j}^{T}{A}_{j}^{-1}{R}_{j}{A}\right]^{2}\left[{\mathbb{I}} - ({R}_{0}^{T}A_{0}^{-1}{R}_{0} + {\mathbb{I}} - {R}_{0}^{T}{R}_{0}){\sum}_{j=1}^{N_{C}+1}{R}_{j}^{T}{A}_{j}^{-1}{R}_{j}{A}\right]. $$

Lemma 5

Let \({U}_{F}\) be the exact solution of (6), \({U}_{F}^{k}\) be an approximate solution from (13), \({e}^{k} := {U}_{F} - {U}_{F}^{k}\), and denote by \({e}^{k}_{j}\) the error at iteration k and time \(t_{j}\) with \(j= 1, 2,\dots , N\). The error at coarse time points generated at iteration k + 1 of the residual correction scheme from (13) with the SCS2 two-level additive Schwarz in time preconditioner satisfies:

$$ \begin{array}{@{}rcl@{}} {e}^{k+1}_{0} &=& 0,\\ {e}^{k+1}_{m} &=& 0,\\ {e}^{k+1}_{2m} &=& 0,\\ {e}_{hm}^{k+1} &=& \sum\limits_{r=0}^{h-3}{\phi}_{\Delta T}^{h-3-r}({\phi}^{m}-{\phi}_{\Delta T}){\phi}^{2m}{e}_{rm}^{k}, \ \ h=3,4,\dots,N_{C}. \end{array} $$
(29)

Proof

Let \({e}_{C,SCS^{2}}^{k}\) and \({e}_{C,SCS^{2}}^{k+1}\) be the errors at coarse time points at iteration k and k + 1 respectively and let \({E}_{SCS^{2}}\) be the error propagation matrix at coarse time points for SCS2 two-level additive Schwarz in time preconditioner. We have the relation,

$$ {e}_{C,SCS^{2}}^{k+1} = {E}_{SCS^{2}}{e}_{C,SCS^{2}}^{k}, $$
(30)

in which \({E}_{SCS^2}=\)

$$ \begin{bmatrix} 0 & 0& 0 & 0& {\dots} & 0& 0& 0 \\ 0 & 0& 0 & 0& \dots& 0& 0& 0 \\ 0& 0 & 0 &0 &\dots& 0& 0& 0\\ ({\phi}^m - {\phi}_{\Delta T}){\phi}^{2m}& 0 & 0 &0 &\dots& 0& 0& 0\\ {\phi}_{\Delta T}({\phi}^m - {\phi}_{\Delta T}){\phi}^{2m} &({\phi}^m - {\phi}_{\Delta T}){\phi}^{2m}&0&0 &\dots& 0& 0& 0\\ {\phi}_{\Delta T}^2({\phi}^m - {\phi}_{\Delta T}){\phi}^{2m} &{\phi}_{\Delta T}({\phi}^m - {\phi}_{\Delta T}){\phi}^{2m}&({\phi}^m - {\phi}_{\Delta T}){\phi}^{2m}&0 &\dots& 0& 0& 0\\ \vdots&\vdots&\vdots&\ddots&\vdots&\vdots&\vdots&\vdots\\ {\phi}_{\Delta T}^{N_C-3}({\phi}^m - {\phi}_{\Delta T}){\phi}^{2m} &{\phi}_{\Delta T}^{N_C-4}({\phi}^m - {\phi}_{\Delta T}){\phi}^{2m}& {\phi}_{\Delta T}^{N_C-5}({\phi}^m - {\phi}_{\Delta T}){\phi}^{2m}& {\dots} &({\phi}^m - {\phi}_{\Delta T}){\phi}^{2m} &0& 0& 0 \end{bmatrix}. $$

Equation (29) follows. □

A variant known in the literature as MGRIT with F(CF)ν-relaxation or overlapping parareal has been shown to converge after at most k = [N/(ν + 1)] iterations [26, Theorem 5]. For the case ν = 2, in the framework of domain decomposition, this variant is referred to as the S(CS)2 two-level additive Schwarz in time preconditioner and it is obtained as follows:

$$ \begin{array}{@{}rcl@{}} {U}_{F}^{k+\frac{1}{3}} &=& {U}_{F}^{k} + ({R}_{0}^{T}A_{0}^{-1}{R}_{0} + {\mathbb{I}} - {R}_{0}^{T}{R}_{0})\sum\limits_{j=1}^{N_{C}+1}{R}_{j}^{T}{A}_{j}^{-1}{R}_{j}({f} - {A}{U}_{F}^{k}),\\ {U}_{F}^{k+\frac{1}{2}} &=& {U}_{F}^{k+\frac{1}{3}} + ({R}_{0}^{T}A_{0}^{-1}{R}_{0} + {\mathbb{I}} - {R}_{0}^{T}{R}_{0})\sum\limits_{j=1}^{N_{C}+1}{R}_{j}^{T}{A}_{j}^{-1}{R}_{j}({f} - {A}{U}_{F}^{k+\frac{1}{3}}),\\ {U}_{F}^{k+1} &=& {U}_{F}^{k+\frac{1}{2}} + \sum\limits_{j=1}^{N_{C}+1}{R}_{j}^{T}{A}_{j}^{-1}{R}_{j}({f} - {A}{U}_{F}^{k+\frac{1}{2}}). \end{array} $$

The error propagation matrix is defined as follows,

$$ \left[{\mathbb{I}} - \sum\limits_{j=1}^{N_{C}+1}{R}_{j}^{T}{A}_{j}^{-1}{R}_{j}{A}\right]\left[{\mathbb{I}} - ({R}_{0}^{T}A_{0}^{-1}{R}_{0} + {\mathbb{I}} - {R}_{0}^{T}{R}_{0})\sum\limits_{j=1}^{N_{C}+1}{R}_{j}^{T}{A}_{j}^{-1}{R}_{j}{A}\right]^{2}. $$

For completeness, we give in the following lemma the error of this variant.

Lemma 6

Let \({U}_{F}\) be the exact solution of (6), \({U}_{F}^{k}\) be an approximate solution from (13), \({e}^{k} := {U}_{F} - {U}_{F}^{k}\), and denote by \({e}^{k}_{j}\) the error at iteration k and time \(t_{j}\) with \(j= 1, 2,\dots , N\). The error at coarse time points generated at iteration k + 1 of (13) with the S(CS)2 two-level additive Schwarz in time preconditioner satisfies:

$$ \begin{array}{@{}rcl@{}} {e}^{k+1}_{0} &=&0,\\ {e}^{k+1}_{m} & =& 0,\\ {e}^{k+1}_{2m} & =& 0,\\ {e}_{hm}^{k+1} & =& \sum\limits_{r=0}^{h-3}(h-2-r){\phi}_{\Delta T}^{h-3-r}({\phi}^{m}-{\phi}_{\Delta T})^{2}{\phi}^{m}{e}_{rm}^{k}, \ \ h=3,4,\dots,N_{C}. \end{array} $$
(31)

Proof

We denote by \({e}_{C,S(CS)^{2}}^{k}\) and \({e}_{C,S(CS)^{2}}^{k+1}\) the errors at coarse time points at iteration k and k + 1 respectively and by \({E}_{S(CS)^{2}}\) the error propagation matrix at coarse time points for S(CS)2 two-level additive Schwarz in time preconditioner. We obtain the relation,

$$ {e}_{C,S(CS)^{2}}^{k+1} = {E}_{S(CS)^{2}}{e}_{C,S(CS)^{2}}^{k}, $$
(32)

in which \({E}_{S(CS)^{2}} = \)

$$ \begin{bmatrix} 0 & 0& 0 & 0& {\dots} & 0& 0& 0 \\ 0 & 0& 0 & 0& \dots& 0& 0& 0 \\ 0& 0 & 0 &0 &\dots& 0& 0& 0\\ {\Phi}& 0 & 0 &0 &\dots& 0& 0& 0\\ 2{\phi}_{\Delta T}{\Phi} &{\Phi}&0&0 &\dots& 0& 0& 0\\ 3{\phi}_{\Delta T}^2{\Phi} &{\phi}_{\Delta T}{\Phi}&{\Phi}&0 &\dots& 0& 0& 0\\ \vdots&\vdots&\vdots&\ddots&\vdots&\vdots&\vdots&\vdots\\ (N_C-2){\phi}_{\Delta T}^{N_C-3}{\Phi} &(N_C-3){\phi}_{\Delta T}^{N_C-4}{\Phi}& (N_C-4){\phi}_{\Delta T}^{N_C-5}{\Phi}& {\dots} &{\Phi} &0& 0& 0 \end{bmatrix} $$

from which (31) follows, where \({\Phi }:= ({\phi }^m - {\phi }_{\Delta T})^2{\phi }^{m}\). □

These different variants have different costs in a parallel environment. Given that the fine solve phase based on additive Schwarz is done in parallel while the coarse solve phase has to be done sequentially, the coarse solve is the major limiting factor. For that reason, the advantage becomes more noticeable when a variant uses more fine solve phases based on additive Schwarz and a single coarse solve phase, which is done sequentially. The impact of additional fine or coarse solve phases in the preconditioner on the error convergence, as well as the computational costs, will be discussed in more detail in the next section.

5 Convergence estimate

In this section, we estimate the convergence of the SC two-level additive Schwarz in time preconditioner and its variants by computing the norms of the error propagation matrices. The convergence is estimated based on an eigenvalue analysis, for which the coarse and the fine propagators must have the same eigenvectors. Under the assumption from Section 2 that ϕ and \({\phi}_{\Delta T}\) have the same eigenvectors, there exists a unitary matrix X, i.e., \({X}^{*}{X}={X}{X}^{*}={\mathcal {I}}\), such that ϕ and \({\phi}_{\Delta T}\) can be diagonalized as follows,

$$ \begin{array}{@{}rcl@{}} {\Lambda} &=& {X}^{*}{\phi} {X} = \text{diag}(\lambda_{1},\lambda_{2},\dots,\lambda_{d}),\\ {\Lambda}_{\Delta T} &=& {X}^{*}{\phi}_{\Delta T} {X} = \text{diag}(\mu_{1},\mu_{2},\dots,\mu_{d}), \end{array} $$

with \(\vert\lambda_{i}\vert < 1\) and \(\vert\mu_{i}\vert < 1\) for \(i = 1,2,\dots,d\), since ϕ and \({\phi}_{\Delta T}\) are stable time-stepping methods.

The matrix \({E}_{SC}\) defined in (26) is the error propagation matrix corresponding to the SC two-level additive Schwarz in time preconditioner; each element of this matrix is a block matrix of dimension d × d. The error propagation matrices \({E}_{SCS}, {E}_{SCS^{2}}, {E}_{S(CS)^{2}}\) corresponding to the variants of the SC two-level additive Schwarz in time preconditioner are defined in (28), (30), and (32). Let \({E}_{SC} = Y B Y^{*}\), where \(Y \in \mathbb {R}^{N_Cd\times N_Cd}\) is a block diagonal matrix, \(Y= \text{BlockDiag}(X,X,\dots ,X)\), and B has the form

$$ \begin{array}{@{}rcl@{}} B=\begin{bmatrix} 0 & 0& 0& \dots&0& 0 \\ {\Lambda}^m - {\Lambda}_{\Delta T} & 0 & 0& \dots&0 & 0\\ {\Lambda}_{\Delta T}({\Lambda}^m - {\Lambda}_{\Delta T})& {\Lambda}^m - {\Lambda}_{\Delta T}& 0& \dots&0 & 0\\ {\Lambda}_{\Delta T}^2({\Lambda}^m - {\Lambda}_{\Delta T}) & {\Lambda}_{\Delta T}({\Lambda}^m - {\Lambda}_{\Delta T}) & {\Lambda}^m - {\Lambda}_{\Delta T} & {\dots} &0 & 0\\ {\vdots} &{\vdots} & {\vdots} & {\ddots} &\vdots&\vdots\\ {\Lambda}_{\Delta T}^{N_C-1}({\Lambda}^m - {\Lambda}_{\Delta T}) &{\Lambda}_{\Delta T}^{N_C-2}({\Lambda}^m - {\Lambda}_{\Delta T}) & {\Lambda}_{\Delta T}^{N_C-3}({\Lambda}^m - {\Lambda}_{\Delta T}) & {\dots} & {\Lambda}^m - {\Lambda}_{\Delta T} & 0 \end{bmatrix}. \end{array} $$

We then have,

$$ \vert\vert B\vert \vert_1 = \vert\vert B\vert \vert_{\infty} = \max_{1\leq j \leq d} \sum\limits_{i=0}^{N_C-1} \vert \mu_j^i(\lambda_j^m-\mu_j)\vert. $$
(33)

On the other hand, we also have,

$$ \begin{array}{@{}rcl@{}} \vert\vert E_{SC}\vert \vert _2 = \vert\vert Y BY^{*}\vert \vert_2 = \vert\vert B\vert \vert_2 &\leq&\sqrt{\vert\vert B\vert\vert_1\vert\vert B\vert\vert_{\infty}}\\ &= &\max_{1\leq j \leq d} \sum\limits_{i=0}^{N_C-1} \vert \mu_j^i(\lambda_j^m-\mu_j)\vert\\ &\leq&\max_{1\leq j \leq d} \sum\limits_{i=0}^{N_C-1} \vert \mu_j^i\vert\vert\lambda_j^m-\mu_j\vert\\ &=&\max_{1\leq j \leq d} \left\{\vert\lambda_j^m-\mu_j\vert\sum\limits_{i=0}^{N_C-1} \vert \mu_j^i\vert\right\}\\ &=&\max_{1\leq j \leq d} \left\{\vert\lambda_j^m-\mu_j\vert\frac{1-\vert\mu_j\vert^{N_C}}{1-\vert\mu_j\vert}\right\}. \end{array} $$
(34)

Similarly we have,

$$ \begin{array}{@{}rcl@{}} \vert\vert{E}_{SCS}\vert\vert_{2} & \leq & {\underset{1\leq j\leq d}{\max}}\left\{{\vert\lambda_{j}^{m}} - \mu_{j}\vert\frac{1-\vert\mu_{j}\vert^{N_{C}-1}}{1-\vert\mu_{j}\vert}\vert\lambda_{j}\vert^{m}\right\}, \end{array} $$
(35)
$$ \begin{array}{@{}rcl@{}} \vert\vert{E}_{SCS^{2}}\vert\vert_{2} & \leq & {\underset{1\leq j\leq d}{\max}}\left\{{\vert\lambda_{j}^{m}} - \mu_{j}\vert\frac{1-\vert\mu_{j}\vert^{N_{C}-2}}{1-\vert\mu_{j}\vert}\vert\lambda_{j}\vert^{2m}\right\}, \end{array} $$
(36)
$$ \begin{array}{@{}rcl@{}} \vert\vert{E}_{S(CS)^{2}}\vert\vert_{2} & \leq & \max_{1\leq j\leq d} \mathcal{C}_{j}, \end{array} $$
(37)

in which

$$ \mathcal{C}_{j}= \vert{\lambda_{j}^{m}} - \mu_{j}\vert^{2}\frac{1 - (N_{C}-1)\vert\mu_{j}\vert^{N_{C}-2} + (N_{C}-2)\vert\mu_{j}\vert^{N_{C}-1}}{(1 - \vert\mu_{j}\vert)^{2}}\vert\lambda_{j}\vert^{m}. $$
(38)

The 2-norm of the errors is estimated in the following theorem.

Theorem 1

Let ϕ and \({\phi}_{\Delta T}\) be simultaneously diagonalizable by the same unitary matrix X and be stable time-stepping methods with eigenvalues \(\lambda_{i}\) and \(\mu_{i}\), respectively, i.e., \(\vert\lambda_{i}\vert < 1\) and \(\vert\mu_{i}\vert < 1\) for \(i= 1,\dots ,d\). The error at the coarse time points generated at iteration k + 1 of (13) satisfies:

$$ \begin{array}{@{}rcl@{}} \vert\vert{e}_{C,SC}^{k+1}\vert\vert_{2} & \leq & \max_{1\leq j\leq d}\left\{{\vert\lambda_{j}^{m}} - \mu_{j}\vert\frac{1-\vert\mu_{j}\vert^{N_{C}}}{1-\vert\mu_{j}\vert}\right\}\vert\vert{e}_{C,SC}^{k}\vert\vert_{2}, \end{array} $$
(39)
$$ \begin{array}{@{}rcl@{}} \vert\vert{e}_{C,SCS}^{k+1}\vert\vert_{2} & \leq & \max_{1\leq j\leq d}\left\{{\vert\lambda_{j}^{m}} - \mu_{j}\vert\frac{1-\vert\mu_{j}\vert^{N_{C}-1}}{1-\vert\mu_{j}\vert}\vert\lambda_{j}\vert^{m}\right\}\vert\vert{e}_{C,SCS}^{k}\vert\vert_{2}, \end{array} $$
(40)
$$ \begin{array}{@{}rcl@{}} \vert\vert{e}_{C,SCS^{2}}^{k+1}\vert\vert_{2} & \leq & \max_{1\leq j\leq d}\left\{{\vert\lambda_{j}^{m}} - \mu_{j}\vert\frac{1-\vert\mu_{j}\vert^{N_{C}-2}}{1-\vert\mu_{j}\vert}\vert\lambda_{j}\vert^{2m}\right\}\vert\vert{e}_{C,SCS^{2}}^{k}\vert\vert_{2}, \end{array} $$
(41)
$$ \begin{array}{@{}rcl@{}} \vert\vert{e}_{C,S(CS)^{2}}^{k+1}\vert\vert_{2} & \leq & \max_{1\leq j\leq d} \mathcal{C}_{j}\vert\vert{e}_{C,S(CS)^{2}}^{k}\vert\vert_{2}, \end{array} $$
(42)

where \(\mathcal {C}_{j}\) is defined in (38).

Proof

Combining (33), (34), (35), (36), and (37) leads to the desired results. □

The convergence bounds for SC from (39) and SCS from (40) are already given in the context of MGRIT with F-relaxation and with FCF-relaxation, see [25], in which the authors estimate the convergence by using the eigenvector expansion of the error to compute the error norm for each eigenmode. In this paper, we estimate the convergence of the SC two-level additive Schwarz in time preconditioner and its variants by directly computing the norms of the error propagation matrices generated at iteration k + 1 of the residual correction scheme from (13). The theoretical convergence bounds we obtain for the SC and SCS two-level additive Schwarz in time preconditioners are exactly the same as those for MGRIT with F-relaxation and with FCF-relaxation. This once again confirms the equivalence between parareal, MGRIT with F-relaxation, and the SC two-level additive Schwarz in time preconditioner.

As in the work presented in [25], these estimates have a removable singularity when |μj| tends to 1. They are also shown to be bounded independently of NC in many applications. Furthermore, the numerator \(1 -\vert \mu _{j}\vert ^{N_{C}}\) can be replaced by 1, since the estimates hold for all NC.
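To make the bounds concrete, the following minimal sketch evaluates the right-hand sides of (39)–(42) numerically; the function name and the Dahlquist-style eigenvalue inputs are our own illustration, not the paper's implementation, and |μj| < 1 is assumed so the geometric sums are well defined.

```python
import numpy as np

def convergence_bounds(lam, mu, m, N_C):
    """Evaluate the right-hand sides of the bounds (39)-(42) for arrays
    lam, mu of eigenvalues of the fine and coarse propagators."""
    lam = np.asarray(lam, dtype=complex)
    mu = np.asarray(mu, dtype=complex)
    diff = np.abs(lam**m - mu)                  # |lambda_j^m - mu_j|
    amu = np.abs(mu)                            # assumes |mu_j| < 1
    geo = lambda p: (1 - amu**p) / (1 - amu)    # truncated geometric sum
    b_SC    = np.max(diff * geo(N_C))                              # (39)
    b_SCS   = np.max(diff * geo(N_C - 1) * np.abs(lam)**m)         # (40)
    b_SCS2  = np.max(diff * geo(N_C - 2) * np.abs(lam)**(2 * m))   # (41)
    C_j = (diff**2 * (1 - (N_C - 1) * amu**(N_C - 2)
                        + (N_C - 2) * amu**(N_C - 1))
           / (1 - amu)**2 * np.abs(lam)**m)                        # (38)
    b_SCS_2 = np.max(C_j)                                          # (42)
    return b_SC, b_SCS, b_SCS2, b_SCS_2

# Dahlquist example with backward Euler, a0 = -1, T = 1, N_C = m = 20:
DT, m, N_C = 1 / 20, 20, 20
lam = np.array([1 / (1 + DT / m)])   # fine eigenvalue per fine step
mu  = np.array([1 / (1 + DT)])       # coarse eigenvalue per coarse step
print(convergence_bounds(lam, mu, m, N_C))
```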

As mentioned at the end of the previous section, these variants have different computational costs. To make this clear, we follow the setting of Section 2 and recall the speedup of the parareal algorithm from [30],

$$ \mathcal{S}({N_{C}}) = \frac{{N_{C}}m\tau_{f}}{{N_{C}}\tau_{C}+K({N_{C}}\tau_{C}+m\tau_{f})}, $$
(43)

in which the numerator is the runtime of the sequential fine propagator over the NC coarse time intervals, the denominator is the runtime of the parareal algorithm with NC processors and K iterations, and τC and τf denote the computational cost of one step of the coarse and fine propagator, respectively. Depending on the number of coarse and fine propagator phases in the preconditioner, the variants have different speedups; more precisely,

$$ \mathcal{S}_{SCS}({N_{C}}) = \frac{{N_{C}}m\tau_{f}}{{N_{C}}\tau_{C}+K({N_{C}}\tau_{C}+2m\tau_{f})}, $$
(44)
$$ \mathcal{S}_{SCS^{2}}({N_{C}}) = \frac{{N_{C}}m\tau_{f}}{{N_{C}}\tau_{C}+K({N_{C}}\tau_{C}+3m\tau_{f})}, $$
(45)
$$ \mathcal{S}_{S(CS)^{2}}({N_{C}}) = \frac{{N_{C}}m\tau_{f}}{{N_{C}}\tau_{C}+K(2{N_{C}}\tau_{C}+3m\tau_{f})}. $$
(46)
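The four formulas differ only in the number of coarse and fine phases per iteration, so they can be evaluated by one helper; the following sketch and its parameter names are our own illustration.

```python
def speedup(N_C, m, K, tau_C, tau_f, n_coarse=1, n_fine=1):
    """Speedup (43)-(46); n_coarse and n_fine count the coarse and fine
    propagation phases per iteration of the chosen variant."""
    sequential = N_C * m * tau_f                  # fine solver, end to end
    parallel = N_C * tau_C + K * (n_coarse * N_C * tau_C + n_fine * m * tau_f)
    return sequential / parallel

# SC: (1, 1); SCS: (1, 2); SCS^2: (1, 3); S(CS)^2: (2, 3), cf. (43)-(46)
print(speedup(100, 20, 5, 8.0, 8.0, n_coarse=2, n_fine=3))
```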

The speedup clearly degrades as the number of coarse or fine propagator phases increases. However, the fine propagator phases are performed entirely in parallel, which is an important characteristic that we can exploit. By adding one or two additional fine propagation steps in the preconditioner, the convergence bound of parareal from (39) is reduced by a factor of |λj|m or |λj|2m, as can be seen in (40) and (41); the gain is especially large when the eigenvalues are very small and the number of fine time steps per time slice m is very large.

We now estimate the computational cost of parareal with GMRES acceleration. In this case, we consider only scalar and 1D problems, and we denote by τCd and τfd the computational cost of one step of the coarse and fine propagators, respectively, to account for the cost being linear in d, where d denotes the spatial dimension of ϕ. The operation count is presented in Table 1 and the total cost can be computed as,

$$ \mathcal{P}_{\text{Parareal+GMRES}}(N_{C},K)= N_{C}\tau_{C}d+Kd(N_{C}\tau_{C}+m\tau_{f}) + \mathcal{O}(K^{2}(N_{C} + md)+Kmd), $$

where we have subsumed the least squares solve cost into the \(\mathcal {O}(K^{2}(N_{C} + md))\) term since (NC + md) ≫ 1. The normalized cost over sequential time-stepping can thus be computed as follows,

$$ \mathbb{S}_{\text{Parareal+GMRES}}(N_{C},K)= \frac{N_{C}\tau_{C} + K(N_{C}\tau_{C} + m\tau_{f}) + \mathcal{O}(K^{2}(N_{C}/d + m)+Km)}{N_{C}m\tau_{f}}, $$
(47)

where we cancelled the d in both the numerator and denominator. Formula (47) will be used for the plots in Sections 6.3 and 6.4.
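As a worked instance of (47), the sketch below evaluates the normalized cost; the \(\mathcal {O}(\cdot )\) term is taken with a unit constant, which is an assumption of this sketch rather than a statement of the paper.

```python
def normalized_cost_gmres(N_C, m, K, tau_C, tau_f, d):
    """Normalized cost (47) of parareal with GMRES acceleration over
    sequential fine time-stepping; the O(.) term is evaluated with a
    unit constant (our assumption)."""
    cost = (N_C * tau_C + K * (N_C * tau_C + m * tau_f)
            + K**2 * (N_C / d + m) + K * m)
    return cost / (N_C * m * tau_f)

print(normalized_cost_gmres(N_C=20, m=20, K=10, tau_C=8.0, tau_f=8.0, d=10))
```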

Table 1 Operation count for parareal with GMRES acceleration

6 Numerical results

In this section, we first discuss results that show the equivalence between parareal and the SC two-level additive Schwarz in time preconditioner for three different problems: the Dahlquist problem, the heat equation, and the advection-reaction-diffusion equation. Numerical experiments investigate the behavior of the convergence rates on short and long time intervals when NC and m vary. We then discuss the convergence of different variants of two-level domain decomposition preconditioners in time. A comparison between parareal (equivalently, the SC two-level additive Schwarz in time preconditioner) and parareal with GMRES acceleration is also conducted. The three linear test cases considered here are the Dahlquist problem with a0 = − 1, u0 = 1,

$$ \frac{du}{dt}= a_{0}u, \qquad u(0) = u_{0}, \qquad t \in [0,T], $$
(48)

the heat equation with a* = 3, L = 1, Δx = 0.1, and the exact solution \(u_{\text {exact}} = x(L-x)^{2}\exp (-2t)\),

$$ \left\{\begin{array}{llll} \frac{\partial u}{\partial t} &=& a^{*} \frac{\partial^{2} u}{\partial x^{2}} + f & \text{in} \ \ (0,L) \times (0,T), \\ u(x,0) &=& u_{0}(x) & x \in (0,L),\\ u(0,t) &=& u(L,t) = 0 & t \in (0,T), \end{array}\right. $$
(49)

and the advection-reaction-diffusion equation with a = 1, b = 1, c = 1, L = 1, Δx = 0.1, and the exact solution \(u_{\text {exact}}=\sin \limits (2\pi x)\exp (-2t)\),

$$ \left\{\begin{array}{llll} \frac{\partial u}{\partial t} &=& a \frac{\partial^{2} u}{\partial x^{2}} - b\frac{\partial u}{\partial x} + cu + f & \text{in} \ \ (0,L) \times (0,T), \\ u(x,0) &=& u_{0}(x) & x \in (0,L),\\ u(0,t) &=& u(L,t) = 0 & t \in (0,T), \end{array}\right. $$
(50)

in which the unknowns u(x,t) in (49) and (50) are considered in \((0,L) \times (0,T) \subset \mathbb {R}^{d}\times \mathbb {R}\), where d is the space dimension. The source term is denoted by f and is chosen to obtain the desired exact solution. For simplicity, we consider Dirichlet boundary conditions; however, periodic boundary conditions are also used in Section 6.4. Note that the same discretization methods are used for both coarse and fine solvers, namely centered finite differences in space and backward Euler in time, except at the end of Section 6.4, in which Runge-Kutta 4 is used for the fine solver.
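As an illustration of this setup, the following sketch assembles the centered finite-difference matrix and the backward Euler propagators for the heat equation (49); the function names and the relation between d and Δx are illustrative assumptions of ours, not the paper's code.

```python
import numpy as np

def heat_matrix(d, L=1.0, a_star=3.0):
    """Centered finite differences for a* u_xx on (0, L) with
    homogeneous Dirichlet BC and d interior unknowns."""
    dx = L / (d + 1)
    A = np.zeros((d, d))
    np.fill_diagonal(A, -2.0)
    A += np.diag(np.ones(d - 1), 1) + np.diag(np.ones(d - 1), -1)
    return a_star / dx**2 * A

def backward_euler(A, dt):
    """Matrix of one backward Euler step: u_{n+1} = (I - dt A)^{-1} u_n."""
    return np.linalg.inv(np.eye(A.shape[0]) - dt * A)

N_C, m, T, d = 20, 20, 1.0, 10
DT = T / N_C
A = heat_matrix(d)
phi_DT = backward_euler(A, DT)                                 # coarse step
phi_m  = np.linalg.matrix_power(backward_euler(A, DT / m), m)  # m fine steps
```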

6.1 Equivalence between parareal and SC two-level additive Schwarz in time preconditioner

In order to study the short time interval behavior, we use NC = 20, T = 1, while for the long time interval behavior, we use NC = 100, T = 100. With time steps Δt = T/NC, δt = Δt/m, for m = 20, d = 1 for the Dahlquist problem, and d = 10 for the heat and advection-reaction-diffusion equations, the 2-norm (spectral norm) of the error between the approximate solution and the fine sequential solution (obtained by sequentially using the fine solver) is displayed in Fig. 4 for the three test cases. We observe that the convergence curves of parareal and the SC two-level additive Schwarz in time preconditioner are almost the same, except for the last iterations, where differences may occur because of round-off errors. The bound for the SC two-level additive Schwarz in time preconditioner derived in (39) is sharp, in particular for long time intervals.
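Up to round-off, these curves can be reproduced by the plain parareal iteration below (equivalently, the SC preconditioner applied as a stationary iteration). The sketch reuses phi_DT, phi_m, d, and N_C from the earlier sketch; the initial condition is illustrative.

```python
def parareal(phi_m, phi_DT, u0, N_C, K):
    """Plain parareal: U_{n+1}^{k+1} = phi_DT U_n^{k+1}
    + phi_m U_n^k - phi_DT U_n^k.  Returns the final trajectory and
    the 2-norm errors against the fine sequential solution."""
    U = np.zeros((N_C + 1, len(u0))); U[0] = u0
    for n in range(N_C):                        # initial coarse sweep
        U[n + 1] = phi_DT @ U[n]
    ref = np.zeros_like(U); ref[0] = u0         # fine sequential reference
    for n in range(N_C):
        ref[n + 1] = phi_m @ ref[n]
    errors = []
    for k in range(K):
        F = U[:-1] @ phi_m.T                    # fine steps, parallel over n
        V = np.zeros_like(U); V[0] = u0
        for n in range(N_C):                    # sequential coarse correction
            V[n + 1] = phi_DT @ V[n] + F[n] - phi_DT @ U[n]
        U = V
        errors.append(np.linalg.norm(U - ref))
    return U, errors

u0 = np.ones(d)                                 # illustrative initial data
_, errs = parareal(phi_m, phi_DT, u0, N_C, K=10)
```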

Fig. 4 Error between approximate solution and fine sequential solution with m = 20 for Dahlquist problem (top), heat equation (middle), and advection-reaction-diffusion equation (bottom), T = 1, NC = 20 (first column), and T = 100, NC = 100 (second column)

For the Dahlquist problem, the errors of parareal and the SC two-level additive Schwarz in time preconditioner are in the superlinear convergence regime on short time intervals and in the linear convergence regime on long time intervals. This behavior is also outlined in [13]. In particular, on short time intervals they reach an error of 10−13 after 5 iterations, while with the same number of iterations the attained error is only 10−4 on long time intervals.

For the heat equation, convergence to 10−16 is observed on the short time interval after 18 iterations. On the long time interval, both parareal and the SC two-level additive Schwarz in time preconditioner converge to an error of 10−17 after 10 iterations.

For the advection-reaction-diffusion equation, on the short time interval both parareal and the SC two-level additive Schwarz in time preconditioner converge to an error of 10−14 after 18 iterations. On the long time interval, both converge to an error of 10−16 after 15 iterations.

In addition, numerical tests are performed for the case when m ≫ NC; specifically, we set NC = 20 and m = 500. The results are displayed in Fig. 5. We observe that the results are almost the same as in the case m = 20 of Fig. 4 for the short time interval. On the long time interval, linear convergence with the same rate is obtained for both parareal and the SC two-level additive Schwarz preconditioner.

Fig. 5 Error between approximate solution and fine sequential solution with m = 500 for Dahlquist problem (left), heat equation (right), and advection-reaction-diffusion equation (bottom), T = 1, NC = 20

6.2 Comparison between variants of SC two-level additive Schwarz in time preconditioner

Numerical experiments are performed to study the convergence of several variants of the SC two-level additive Schwarz in time preconditioner that use additional coarse or fine propagation steps. As in the previous section, d = 1 for the Dahlquist problem, d = 10 for the heat and advection-reaction-diffusion equations, the short time interval behavior uses NC = 20 and T = 1, and the long time interval behavior uses NC = 100 and T = 100. Figures 6, 8, and 10 display the error, in 2-norm, between the approximate solution and the fine sequential solution, with time steps Δt = T/NC, δt = Δt/m, and m ∈ {2,20}, for the Dahlquist problem, heat equation, and advection-reaction-diffusion equation, respectively.

Fig. 6 Error between approximate solution and fine sequential solution for Dahlquist problem with m = 2 (first column) and m = 20 (second column), T = 1, NC = 20 (first row), and T = 100, NC = 100 (second row)

For the Dahlquist problem (Fig. 6), on the short time interval the improvement of SCS and SCS2 over the SC two-level additive Schwarz in time preconditioner is not very important, except for S(CS)2, which converges faster than the others. On the long time interval, however, the improvement becomes more significant. In particular, for m = 2, the SC two-level additive Schwarz in time preconditioner converges to an error of 10−10 after 9 iterations, while SCS and SCS2 converge in 7 and 5 iterations, respectively, and S(CS)2 converges in just 4 iterations to the same error. For m = 20, after 9 iterations the SC two-level additive Schwarz in time preconditioner converges to an error of 10−6, while SCS, S(CS)2, and SCS2 converge to much smaller errors, between 10−10 and 10−14.

In Fig. 7, we additionally compare the different methods in terms of their computational costs for the Dahlquist problem (communication costs are neglected). For this purpose, the convergence is presented as a function of the computational cost, in which the x-axis corresponds to the computational cost normalized by the cost of sequential time-stepping, i.e., the inverse of formulas (43), (44), (45), and (46). We choose τC = τf = 8 since the same integrator is used and we solve a tridiagonal system of dimension d with computational cost 8d. On the short time interval, SC converges to an error of 10−15 at the lowest cost compared to the others, while SCS2 is the most expensive method. On the contrary, on the long time interval T = 100, NC = 100, SCS2 converges at the lowest cost and SC is the most expensive method. Specifically, for m = 20, SCS2 converges to an error of 10−14 at a cost lower than that of sequential time-stepping.

Fig. 7 Computational cost comparison of the error between approximate solution and fine sequential solution for Dahlquist problem with m = 2 (first column) and m = 20 (second column), T = 1, NC = 20 (first row), and T = 100, NC = 100 (second row)

For the heat problem (Fig. 8), on the short time interval SCS, SCS2, and S(CS)2 converge faster than the SC two-level additive Schwarz in time preconditioner. In particular, for m = 2, SC converges to an error around 10−16 after 13 iterations, while SCS, SCS2, and S(CS)2 need 9, 6, and 5 iterations, respectively. For m = 20, the improvement becomes more important: it takes 18 iterations for the SC two-level additive Schwarz in time preconditioner to converge to an error of 10−16, while SCS reaches this error in 10 iterations, and both SCS2 and S(CS)2 require only 7 iterations. We also observe that SCS2 has a convergence rate close to that of S(CS)2. On the long time interval, both SCS and SCS2 have a convergence rate close to that of S(CS)2, and SCS2 converges faster than the other variants. In particular, for m = 2, the SC two-level additive Schwarz in time preconditioner converges to an error of 10−17 after 10 iterations, while it takes 4 iterations for SCS and 3 iterations for both SCS2 and S(CS)2 to converge to the same error. For m = 20, SCS2 converges to an error around 10−17 after one iteration, SCS and S(CS)2 converge to the same error in 2 iterations, while the SC two-level additive Schwarz in time preconditioner requires 10 iterations.

Fig. 8 Error between approximate solution and fine sequential solution for heat equation with m = 2 (first column) and m = 20 (second column), T = 1, NC = 20 (first row), and T = 100, NC = 100 (second row)

Similarly to the previous test case, a comparison of the different methods in terms of their computational costs for the heat problem is displayed in Fig. 9. Specifically, SCS2 always converges at the lowest cost; SCS and S(CS)2 are slightly more expensive, and SC is the most expensive method.

Fig. 9 Computational cost comparison of the error between approximate solution and fine sequential solution for heat equation with m = 2 (first column) and m = 20 (second column), T = 1, NC = 20 (first row), and T = 100, NC = 100 (second row)

For the advection-reaction-diffusion problem (Fig. 10), the convergence behavior for m = 2 is similar to that of the heat equation. On the short time interval with m = 20, the SC two-level additive Schwarz in time preconditioner converges to an error of 10−14 in 18 iterations, while SCS, SCS2, and S(CS)2 converge to the same error in 9, 7, and 6 iterations, respectively. On the long time interval with m = 20, it takes 15 iterations for the SC two-level additive Schwarz in time preconditioner to converge to an error of 10−16, while SCS, S(CS)2, and SCS2 reach the same error in 4, 3, and 2 iterations, respectively.

Fig. 10 Error between approximate solution and fine sequential solution for advection-reaction-diffusion equation with m = 2 (first column) and m = 20 (second column), T = 1, NC = 20 (first row), and T = 100, NC = 100 (second row)

As in the previous test case, a comparison of the different methods in terms of their computational costs for the advection-reaction-diffusion problem is displayed in Fig. 11. We observe that SCS2 always converges at the lowest cost, except on the short time interval with m = 20, where SCS is slightly cheaper. S(CS)2 is slightly more expensive than SCS2 and SCS, while SC is the most expensive method.

Fig. 11 Computational cost comparison of the error between approximate solution and fine sequential solution for advection-reaction-diffusion equation with m = 2 (first column) and m = 20 (second column), T = 1, NC = 20 (first row), and T = 100, NC = 100 (second row)

In summary, the SC two-level additive Schwarz in time preconditioner with no additional coarse or fine propagation steps converges more slowly than the other variants for all our test cases. This indicates that the use of additional coarse or fine propagation steps leads to a more efficient preconditioner. The S(CS)2 variant, corresponding to overlapping parareal or MGRIT with F(CF)2-relaxation, converges faster than the other variants for short time interval simulations. The SCS2 variant converges faster than SCS for all our test cases. Its convergence rate is close to that of S(CS)2 for short time interval simulations, and it is even faster than S(CS)2 for the heat (49) and advection-reaction-diffusion (50) equations on long time intervals. It is efficient when m increases; for example, for m = 20, it reaches an error of 10−17 after one iteration. In terms of computational cost, SCS2 is the best candidate, since it converges at the lowest cost in almost all cases of the three problems, especially on long time intervals.

Furthermore, we perform numerical experiments for the case when m ≫ NC; specifically, NC = 20 and m = 500. The results are displayed in Fig. 12. For the Dahlquist test, S(CS)2 reaches the error 10−15 after 3 iterations, while SC, SCS, and SCS2 converge to errors of 10−13, 10−14, and 10−15 after 5 iterations, respectively. For the heat problem, S(CS)2 and SCS2 converge at nearly the same rate to an error of 10−15 after 6 iterations, while SC and SCS reach the same error after 17 and 9 iterations, respectively. For the advection-reaction-diffusion equation, we observe that S(CS)2, SCS2, SCS, and SC converge to an error of 10−15 after 6, 7, 9, and 18 iterations, respectively. In summary, the behavior of all methods in this case is quite similar to the case m = 20, in which S(CS)2 and SCS2 have an advantage over SCS and SC.

Fig. 12 Error between approximate solution and fine sequential solution with m = 500 for Dahlquist problem (left), heat equation (right), and advection-reaction-diffusion equation (bottom), T = 1, NC = 20

A comparison of the different methods in terms of their computational costs for the three problems on the short time interval is displayed in Fig. 13. For the Dahlquist test, SC is the cheapest, S(CS)2 and SCS are slightly more expensive, and SCS2 is the most expensive. For the heat and advection-reaction-diffusion problems, the behavior is the same and all methods converge at nearly the same computational cost; the difference is not very significant. In particular, for the heat problem (Fig. 12), SCS2 and S(CS)2 converge after 6 iterations while SC converges after 17 iterations to the error of 10−15, at nearly the same computational cost. On the long time interval T = 100, NC = 20 (Fig. 14), for the Dahlquist test SCS2 converges at the lowest computational cost, SCS and S(CS)2 are slightly more expensive, and SC is the most expensive. For the heat and advection-reaction-diffusion problems, the behaviors are similar: SCS converges at the cheapest computational cost, SCS2 and S(CS)2 converge at the same cost, and SC is the most expensive method.

Fig. 13 Computational cost comparison of the error between approximate solution and fine sequential solution with m = 500 for Dahlquist problem (left), heat equation (right), and advection-reaction-diffusion equation (bottom), T = 1, NC = 20

Fig. 14 Computational cost comparison of the error between approximate solution and fine sequential solution with m = 500 for Dahlquist problem (left), heat equation (right), and advection-reaction-diffusion equation (bottom), T = 100, NC = 20

6.3 Parareal with GMRES acceleration

In this section, we discuss the results obtained by parareal with GMRES acceleration. The tolerance for GMRES is set to 10−16. For the Dahlquist problem, the 2-norm of the error between the approximate solution and the fine sequential solution and of the relative residual are displayed in Fig. 15 for NC = 20, T = 1, and for NC = 100, T = 100. In both tests, Δt = T/NC, δt = Δt/m, m ∈ {5,20}, d = 1 for the Dahlquist problem, and d = 10 for the heat and advection-reaction-diffusion equations. On the short time interval, we observe that GMRES slightly improves the convergence of parareal. On the long time interval, the improvement becomes more noticeable. Specifically, for m = 20, parareal with GMRES acceleration converges to an error of 10−15 while parareal only converges to an error of 10−11 after 16 iterations. For the relative residual, parareal with GMRES acceleration converges to 10−15 after 20 iterations, while parareal converges to 10−11 after the same number of iterations. Since the convergence behavior of the error and the relative residual is similar for the heat equation and the advection-reaction-diffusion equation, we present only the convergence results for the latter. They are displayed in Fig. 16 for short and long time intervals. On the short time interval, we observe that the convergence rate of parareal with GMRES acceleration is slightly improved for both the error and the relative residual. Parareal with GMRES acceleration reaches the same error as parareal while requiring 2 fewer iterations. For example, for m = 20, parareal with GMRES acceleration converges to an error of 10−14 after 16 iterations, while parareal converges to the same error after 18 iterations. On the long time interval, the improvement is even less important.
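A minimal sketch of how this acceleration can be set up follows: the all-at-once coarse-point system is applied with the fine propagator phi_m, and the preconditioner is a forward substitution with the coarse propagator phi_DT, handed to SciPy's GMRES. The block structure shown is our schematic reading of (13), not the paper's code; phi_m, phi_DT, N_C, d, and u0 are reused from the earlier sketches.

```python
import numpy as np
from scipy.sparse.linalg import LinearOperator, gmres

def block_bidiag(prop, N_C, d):
    """Action u_n -> u_n - prop u_{n-1} of the all-at-once system."""
    def matvec(u):
        U = u.reshape(N_C, d).copy()
        U[1:] -= U[:-1] @ prop.T       # RHS evaluated before the update
        return U.ravel()
    return LinearOperator((N_C * d, N_C * d), matvec=matvec)

def coarse_solve(prop, N_C, d):
    """Preconditioner M^{-1}: forward substitution with coarse blocks."""
    def matvec(r):
        R = r.reshape(N_C, d); U = np.zeros_like(R); U[0] = R[0]
        for n in range(1, N_C):
            U[n] = R[n] + prop @ U[n - 1]
        return U.ravel()
    return LinearOperator((N_C * d, N_C * d), matvec=matvec)

A_op = block_bidiag(phi_m, N_C, d)
M_op = coarse_solve(phi_DT, N_C, d)
b = np.zeros(N_C * d); b[:d] = phi_m @ u0     # RHS carries u(0)
u, info = gmres(A_op, b, M=M_op, rtol=1e-16, maxiter=20)  # 'tol' in older SciPy
```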

Fig. 15 Error between approximate solution and fine sequential solution (first column) and relative residual (second column) in 2-norm for Dahlquist problem, T = 1, NC = 20 (first row), T = 100, NC = 100 (second row), with m = 20 in both cases

Fig. 16 Error between approximate solution and fine sequential solution (first column) and relative residual (second column) in 2-norm for advection-reaction-diffusion equation, T = 1, NC = 20, m = 20 (first row), and T = 100, NC = 100, m = 5 (second row)

It can be seen that GMRES slightly improves the convergence of parareal for the three test cases, as mentioned at the end of Section 2.2. In addition, a comparison of the two methods in terms of their computational costs for the Dahlquist problem and the advection-reaction-diffusion equation on short time intervals is displayed in Fig. 17. Specifically, for the Dahlquist problem, the difference is not very large at the beginning. It becomes more noticeable as the costs increase: the computational cost of plain parareal is about 0.7 times that of parareal with GMRES acceleration to reach an error of 10−15. For the advection-reaction-diffusion equation, the same behavior is observed for the two curves, and the computational cost of plain parareal is about 0.5 times that of parareal with GMRES acceleration to reach an error of 10−14.

Fig. 17 Computational cost comparison of the error between approximate solution and fine sequential solution in 2-norm for Dahlquist problem (left) and advection-reaction-diffusion equation (right), T = 1, NC = 20 with m = 20

6.4 Impact of GMRES acceleration for the advection-reaction-diffusion equation with different coefficients

We study in this section the convergence of parareal with GMRES acceleration for the advection-reaction-diffusion equation with different coefficients than at the beginning of Section 6. We consider the following setting: L = 1, T = 1, a = 0.01, b = 0.5, c = 100, NC = 20, Δx ∈ {0.2,0.05}, Δt = T/NC, δt = Δt/m, m = 2. The exact solution is \(u_{{exact}}=\sin \limits (2\pi x)\exp (-2t)\). The 2-norm of the error between the approximate solution and the fine sequential solution and of the relative residual are displayed in Fig. 18. We observe that both parareal and parareal with GMRES acceleration converge within 20 iterations. However, the error and relative residual of parareal seem to stagnate (Δx ∈ {0.2,0.05}) and even increase (Δx = 0.05), while those of parareal with GMRES acceleration always decrease. Specifically, for Δx = 0.2, parareal converges slowly within the first 5 iterations, then stagnates, and resumes converging after 16 iterations. Hence, GMRES acceleration provides a more robust approach on the short time interval T = 1. However, on the long time interval T = 100, both methods converge at the same rate. As in the previous section, a comparison of the two methods in terms of their computational costs is displayed in Fig. 19. For Δx = 0.2, the computational cost of parareal with GMRES acceleration is even lower than that of plain parareal at the beginning. However, to achieve an error of 10−15, the computational cost of parareal with GMRES acceleration is 1.7 times higher than that of plain parareal. For Δx = 0.05, while parareal stagnates and even blows up at the beginning, parareal with GMRES acceleration still converges. In particular, at almost the same computational cost, parareal with GMRES acceleration reaches an error below 10−4 while plain parareal only reaches an error of 10−3. Moreover, at a 1.2 times higher computational cost, parareal with GMRES acceleration achieves an error of 10−15, while plain parareal only reaches an error of 10−13.

Fig. 18 Error between approximate solution and fine sequential solution (first column) and relative residual (second column) in 2-norm for advection-reaction-diffusion equation with the Dirichlet boundary condition, T = 1, NC = 20, m = 2, Δx = 0.2 (first row), and Δx = 0.05 (second row), with backward Euler for both propagators

Fig. 19 Computational cost comparison of the error between approximate solution and fine sequential solution in 2-norm for advection-reaction-diffusion equation, T = 1, NC = 20, m = 2, with Δx = 0.2 (left) and Δx = 0.05 (right)

In this section, we also present numerical experiments for the advection-reaction-diffusion equation in two regimes: diffusion dominated and advection dominated. For both cases, we consider the advection-reaction-diffusion equation (50) with the periodic boundary condition

$$ \left\{\begin{array}{ll} u(0,t) = u(L-{\Delta} x,t),\\ u(L,t) = u({\Delta} x,t), \end{array}\right. $$

with L = 1, T = 1, NC = 20, Δx = 0.1, Δt = T/NC, δt = Δt/m, m = 5, and the exact solution \(u_{{exact}}=\sin \limits (2\pi (x-bt))\exp (-2t)\). For the advective case, we consider a = 0.0005, b = 1, c = 1, and for the diffusive case, a = 1, b = 0.0005, c = 1 (a sketch of the periodic discretization is given after this paragraph). The 2-norm of the error between the approximate solution and the fine sequential solution and of the relative residual are displayed in Fig. 20. We observe that parareal with GMRES acceleration always converges faster than plain parareal in both cases. In particular, for the advective case, parareal converges to an error of 10−14 after 17 iterations while parareal with GMRES acceleration reaches the same error after 15 iterations. For the diffusive case, we observe that both parareal and parareal with GMRES acceleration converge at a slower rate than in the advective case. In particular, parareal with GMRES acceleration needs 16 iterations to converge to an error of 10−14 while parareal needs 18 iterations to reach the same error. GMRES thus again slightly improves the convergence of parareal, as in the numerical results of Section 6.3. A comparison of the two methods in terms of their computational costs is also displayed in Fig. 21. We observe that the computational costs in the diffusive case are lower than those in the advective case. For the advective case, the difference is not significant at the beginning; however, to achieve an error of 10−14, the computational cost of parareal with GMRES acceleration is slightly higher. For the diffusive case, the difference is slightly larger, with a small advantage for parareal with GMRES acceleration at the beginning. Nevertheless, as in the advective case, the computational cost of parareal with GMRES acceleration is 1.3 times higher to reach an error of 10−14.
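The periodic wrap-around above replaces the Dirichlet rows of the centered-difference operator by circulant connections; the following sketch builds the resulting matrix with coefficient names matching (50) (the function itself is our illustration, not the paper's code).

```python
import numpy as np

def adr_matrix_periodic(d, L=1.0, a=0.0005, b=1.0, c=1.0):
    """Centered differences for a u_xx - b u_x + c u on a periodic
    grid of d points with spacing dx = L/d."""
    dx = L / d
    A = np.zeros((d, d))
    for i in range(d):
        ip, im = (i + 1) % d, (i - 1) % d       # periodic neighbours
        A[i, i]  = -2 * a / dx**2 + c
        A[i, ip] =  a / dx**2 - b / (2 * dx)    # u_{i+1} coefficient
        A[i, im] =  a / dx**2 + b / (2 * dx)    # u_{i-1} coefficient
    return A
```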

Fig. 20 Error between approximate solution and fine sequential solution (first column) and relative residual (second column) in 2-norm for advection-reaction-diffusion equation with the periodic boundary condition, T = 1, NC = 20, m = 5 for advective case (first row) and for diffusive case (second row), with backward Euler for both propagators

Fig. 21 Computational cost comparison of the error between approximate solution and fine sequential solution in 2-norm for advection-reaction-diffusion equation with the periodic boundary condition, T = 1, NC = 20, m = 5 for advective case (left) and for diffusive case (right), with backward Euler for both propagators

Additionally, at the end of this section, we show the convergence behavior of parareal and parareal with GMRES acceleration with a different method for the fine propagator. In particular, we keep backward Euler in time for the coarse propagator but use Runge-Kutta 4 for the fine propagator (see the sketch after this paragraph). For the discretization in space, we keep the same centered finite difference method for both coarse and fine propagators. Following the same setting with the periodic boundary condition, the convergence results are displayed in Fig. 22. Specifically, for the advective case, parareal converges to an error of 10−13 after 15 iterations while parareal with GMRES acceleration reaches the same error after 12 iterations. For the diffusive case, a slower convergence rate than in the advective case is observed for both methods. Specifically, it takes 15 iterations for parareal, while parareal with GMRES acceleration needs 13 iterations, to converge to an error of 10−14. We also observe that GMRES acceleration slightly improves the convergence in both cases, as in the previous results in Fig. 20. Moreover, with the more accurate Runge-Kutta 4 discretization for the fine propagator, the convergence is slightly faster than with backward Euler in Fig. 20. Specifically, in the diffusive case, parareal with GMRES acceleration converges to an error of 10−14 after 13 iterations, while it needs 16 iterations to reach the same error when backward Euler is used for the fine propagator, as can be seen in Fig. 20. We also give in Fig. 23 a comparison of the two methods in terms of their computational costs. The computational costs of both plain parareal and parareal with GMRES acceleration are slightly higher than those of the previous test case in Fig. 21 for the advective case. Specifically, the difference is not very significant at the beginning; however, to reach an error of 10−14, the computational cost of parareal with GMRES acceleration is 1.4 times higher than that of plain parareal. For the diffusive case, the same observation holds as in the previous test case in Fig. 21.
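For a linear system u' = Au, one classical Runge-Kutta 4 step coincides with the degree-4 Taylor polynomial of exp(δt A), so the fine propagator used here can be sketched as a matrix (our illustration, under that linearity assumption):

```python
import numpy as np

def rk4_propagator(A, dt):
    """Matrix of one RK4 step for u' = A u:
    I + dt A + (dt A)^2/2 + (dt A)^3/6 + (dt A)^4/24."""
    P = np.eye(A.shape[0]); term = np.eye(A.shape[0])
    for k in range(1, 5):
        term = term @ (dt * A) / k    # next Taylor term (dt A)^k / k!
        P = P + term
    return P
```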

Fig. 22 Error between approximate solution and fine sequential solution (first column) and relative residual (second column) in 2-norm for advection-reaction-diffusion equation with the periodic boundary condition, T = 1, NC = 20, m = 5 for advective case (first row) and for diffusive case (second row), with backward Euler for the coarse propagator and Runge-Kutta 4 for the fine propagator

Fig. 23 Computational cost comparison of the error between approximate solution and fine sequential solution in 2-norm for advection-reaction-diffusion equation with the periodic boundary condition, T = 1, NC = 20, m = 5 for advective case (left) and for diffusive case (right), with backward Euler for the coarse propagator and Runge-Kutta 4 for the fine propagator

We observe from the numerical experiments in this section that GMRES acceleration is helpful at the beginning, when the overhead is cheap, and becomes less helpful as the orthogonalization cost increases. One potential remedy is to use a restart strategy after a few iterations.

7 Conclusions and perspectives

In this paper, we propose an interpretation of the parareal algorithm based on a domain decomposition strategy, which we refer to as the SC two-level additive Schwarz in time preconditioner. This preconditioner in time is equivalent to MGRIT with F-relaxation. We study variants of this preconditioner and show that additional fine or coarse propagation steps lead to MGRIT with FCF-relaxation, MGRIT with F(CF)2-relaxation, or overlapping parareal. We also find that the SCS2 two-level additive Schwarz in time preconditioner converges faster than MGRIT with F(CF)2-relaxation or overlapping parareal on long time intervals and with a large number of subdomains. The efficiency of the variants as well as their computational costs has been shown in numerical experiments, especially on long time intervals. Theoretical convergence bounds are verified, and numerical results show that they are sharp, especially for long time intervals. We also propose using a Krylov subspace method, in particular GMRES, to accelerate the parareal algorithm. We find that for a specific case of the advection-reaction-diffusion equation, in which the advection and reaction coefficients are large compared to the diffusion term, the error of parareal stagnates or even increases during the first iterations, while GMRES provides a faster decrease of the error. This phenomenon as well as the convergence analysis of parareal with GMRES acceleration will be studied in our future work.