1 Introduction

We consider the following system of linear equations

$$ A\mathbf{x} = \mathbf{b},~~~A \in \mathbb{C}^{n\times n}, ~~ \mathbf{x},\mathbf{b} \in \mathbb{C}^{n}, $$
(1)

where A is a complex symmetric matrix of the form

$$ A = W + \imath T , $$
(2)

and \(\imath = \sqrt {-1} \) denotes the imaginary unit, and \( W,T \in \mathbb {R}^{n \times n}\) are real symmetric matrices with at least one of them positive definite. Hereafter, without loss of generality, we suppose that W is positive definite and T≠ 0 is positive semi-definite. Complex symmetric linear systems of this kind arise in many scientific problems, such as the fast Fourier transform-based (FFT) solution of certain time-dependent partial differential equations [9], diffuse optical tomography [1], quantum mechanics [20], molecular scattering [17], and structural dynamics [13]. For more applications of such complex symmetric systems, we refer to [8] and the references therein.

Let A = H(A) + S(A) be the Hermitian and skew-Hermitian splitting (HSS) of coefficient matrix A, where

$$ H(A) = \frac{1}{2}(A + A^{\ast}) = W,~~\text{and}~~ S(A) = \frac{1}{2}(A - A^{\ast}) = \imath T $$

are the Hermitian and skew-Hermitian parts of the matrix A, respectively. Here \(A^{\ast}\) denotes the conjugate transpose of the matrix A. Since A is non-Hermitian and positive definite, one can straightforwardly employ the HSS iteration method [6] to solve the complex symmetric system of linear equations (1). Nevertheless, the HSS iteration method suffers from a potential difficulty: a shifted skew-Hermitian system of linear equations has to be solved at each iteration step, which is time-consuming. To avoid this shortcoming, a modified HSS (MHSS) iteration method was presented in [3] for solving this kind of complex symmetric linear systems, in which the solution of the shifted skew-Hermitian linear system is replaced by the solution of a real symmetric positive definite system of linear equations. The iterative scheme of the MHSS iteration method can be written as

$$ \left. \begin{cases} (\alpha I+W) \mathbf{x}^{(k+\frac{1}{2})} = (\alpha I - \imath T)\mathbf{x}^{(k)} + \mathbf{b},\\ (\alpha I+T) \mathbf{x}^{(k+1)} = (\alpha I + \imath W)\mathbf{x}^{(k+\frac{1}{2})} - \imath \mathbf{b}, \end{cases}\right. $$
(3)

where α is a given positive constant, I is the identity matrix, k = 0,1,2,…, and x(0) is an initial guess.

The MHSS method is effective and reliable, and has thus been extended and developed into many variants, such as the preconditioned MHSS (PMHSS) iteration method [4, 5], the generalized PMHSS [12, 22], the lopsided PMHSS [16], and so on. In addition, the MHSS iteration method and its variants have been used to solve complex symmetric nonlinear equations [24] and complex singular linear systems [10, 25], and have also been used as smoothers in multigrid methods [15].

Recently, from another point of view, Yang, Cao and Wu [23] proposed a minimum residual HSS (MRHSS) iteration method, which improves the efficiency of the HSS iteration method by applying the minimum residual technique to the HSS iteration scheme. Numerical results show that the MRHSS method is much more effective than the HSS iteration method. However, when it is used to solve the complex symmetric system of linear equations (1), a shifted skew-Hermitian system of linear equations still needs to be solved at each step. To avoid this problem, we apply the minimum residual technique to the MHSS iteration scheme instead. The resulting method, named the minimum residual MHSS (MRMHSS) iteration method, is proposed in this work. In its implementation, the shifted skew-Hermitian system of linear equations is replaced by a symmetric positive definite system of linear equations. Thus, we may expect the MRMHSS iteration method to outperform the MRHSS iteration method.

Throughout this paper, we denote by \((\mathbf{x},\mathbf{y}) = \mathbf{y}^{\ast}\mathbf{x}\) the Euclidean inner product of complex vectors \(\mathbf {x},\mathbf {y}\in \mathbb {C}^{n}\), and by \(\| \mathbf {x} \| = \sqrt {(\mathbf {x},\mathbf {x})}\) the Euclidean norm of a complex vector \(\mathbf {x}\in \mathbb {C}^{n}\). For an arbitrary matrix \(X\in \mathbb {C}^{n\times n}\), ∥X∥ denotes the spectral norm of X, which is induced by the Euclidean vector norm. The remainder of this work is organized as follows. In Section 2, based on the residual form of the MHSS method, we construct the MRMHSS iteration method. Some essential properties of the involved parameters and the convergence of the MRMHSS method are discussed in Section 3. Numerical experiments are presented in Section 4 to test the efficiency and robustness of the MRMHSS iteration method. Finally, in Section 5, we draw a brief conclusion.

2 The MRMHSS iteration method

In this section, we first rewrite the MHSS method in an equivalent form, and then introduce the MRMHSS iteration method by applying the minimum residual technique to this new form of the MHSS method.

Note that \((\alpha I + W)^{-1}(\alpha I - \imath T) = I - (\alpha I + W)^{-1}A\) and \((\alpha I + T)^{-1}(\alpha I + \imath W) = I + \imath(\alpha I + T)^{-1}A\). Denoting \(\mathbf{r}^{(k)} = \mathbf{b} - A\mathbf{x}^{(k)}\) and \(\mathbf {r}^{(k+\frac {1}{2})} = \mathbf {b} - A\mathbf {x}^{(k+\frac {1}{2})}\), the MHSS iteration scheme (3) can be rewritten as

$$ \begin{cases} \mathbf{x}^{(k+\frac{1}{2})} = \mathbf{x}^{(k)} + (\alpha I + W)^{-1} \mathbf{r}^{(k)},\\ \mathbf{x}^{(k+1)} = \mathbf{x}^{(k+\frac{1}{2})} -\imath (\alpha I + T)^{-1} \mathbf{r}^{(k+\frac{1}{2})}, \end{cases} $$
(4)

which is mathematically equivalent to the direct-splitting scheme (3). However, the numerical behaviors of (3) and (4) are different, and the latter generally performs more efficiently than the former; see [7] for details.

Now, denote

$$ \mathbf{d}^{(k)} = (\alpha I + W )^{-1} \mathbf{r}^{(k)}~~\textup{and}~~\mathbf{d}^{(k+\frac{1}{2})} = (\alpha I + T )^{-1} \mathbf{r}^{(k + \frac{1}{2} )}, $$
(5)

(4) can be rewritten as

$$ \mathbf{x}^{(k + \frac{1}{2} )} = \mathbf{x}^{(k)} + \mathbf{d}^{(k)} ,\quad \mathbf{x}^{(k + 1 )} = \mathbf{x}^{(k + \frac{1}{2} )} - \imath \mathbf{d}^{(k + \frac{1}{2} )}. $$

It is obvious that the step sizes along the two updating directions d(k) and \(\mathbf {d}^{(k+\frac {1}{2})}\) are both equal to one. Naturally, we can place two generalized parameters \(\lambda _{k}, \theta _{k} \in \mathbb {C}\) in front of the updating directions d(k) and \(\mathbf {d}^{(k+\frac {1}{2})}\) so as to improve the convergence rate, in a way similar to other accelerating techniques such as minimum residual smoothing [14, 21, 26]. Then, an improved iteration scheme is derived as

$$ \mathbf{x}^{(k + \frac{1}{2} )} = \mathbf{x}^{(k)} +\lambda_{k} \mathbf{d}^{(k)} ,\quad \mathbf{x}^{(k + 1 )} = \mathbf{x}^{(k + \frac{1}{2} )} - \imath \theta_{k} \mathbf{d}^{(k + \frac{1}{2} )}. $$
(6)

For clarity of presentation, we define the two matrices

$$ G_{1}:=A(\alpha I+W)^{-1}~\text{and}~G_{2}:=A(\alpha I+T)^{-1}, $$
(7)

then the residual form of the iteration scheme (6) can be written as

$$ \mathbf{r}^{(k+\frac{1}{2})} = \mathbf{r}^{(k)} - \lambda_{k} G_{1} \mathbf{r}^{(k)}~~\textup{and}~~ \mathbf{r}^{(k+1)} = \mathbf{r}^{(k+\frac{1}{2})} +\imath \theta_{k} G_{2} \mathbf{r}^{(k+\frac{1}{2})}. $$
(8)

Our task now is to determine the values of the parameters λk and 𝜃k that minimize the residual norms \(\| \mathbf {r}^{(k+\frac {1}{2})} \|\) and ∥r(k+ 1)∥ at each iteration step, respectively. By a straightforward calculation, these norms can be expressed as

$$ \begin{aligned} \| \mathbf{r}^{(k+\frac{1}{2})} \|^{2} = &~ \| \mathbf{r}^{(k)} \|^{2} - 2Re(\lambda_{k}) (\mathcal{H}(G_{1})\mathbf{r}^{(k)},\mathbf{r}^{(k)} ) \\ &~ - 2Im(\lambda_{k}) (\imath \mathcal{S}(G_{1})\mathbf{r}^{(k)},\mathbf{r}^{(k)} ) + (Re(\lambda_{k})^{2} +Im(\lambda_{k})^{2})\| G_{1} \mathbf{r}^{(k)}\|^{2} \end{aligned} $$
(9)

and

$$ \begin{aligned} \| \mathbf{r}^{(k+1)} \|^{2} \!= &~ \!\| \mathbf{r}^{(k+\frac{1}{2})} \|^{2} - 2 Im(\theta_{k}) (\mathcal{H}(G_{2})\mathbf{r}^{(k+\frac{1}{2})},\mathbf{r}^{(k+\frac{1}{2})} ) \\ &~ + 2Re(\theta_{k}) (\imath \mathcal{S}(G_{2})\mathbf{r}^{(k+\frac{1}{2})},\mathbf{r}^{(k+\frac{1}{2})} ) + (Re(\theta_{k})^{2} + Im(\theta_{k})^{2})\! \| G_{2} \mathbf{r}^{(k+\frac{1}{2})}\|^{2}, \end{aligned} $$
(10)

where Re(⋅) and Im(⋅) represent the real and imaginary parts of a complex number, and \({\mathcal{H}}(\cdot )\) and \(\mathcal {S}(\cdot )\) denote the Hermitian and skew-Hermitian parts of a matrix, respectively. These two norms can be viewed as real-valued convex functions of the two variables Re(λk) and Im(λk), and of Re(𝜃k) and Im(𝜃k), respectively. The minimum point of each function can then be derived directly as

$$ Re(\lambda_{k}) = \frac{(\mathcal{H}(G_{1})\mathbf{r}^{(k)},\mathbf{r}^{(k)} )} {\| G_{1} \mathbf{r}^{(k)}\|^{2}},~~Im(\lambda_{k}) = \frac{(\imath \mathcal{S}(G_{1})\mathbf{r}^{(k)},\mathbf{r}^{(k)} )} {\| G_{1} \mathbf{r}^{(k)}\|^{2}} $$
(11)

and

$$ Re(\theta_{k}) = - \frac{ (\imath \mathcal{S}(G_{2})\mathbf{r}^{(k+\frac{1}{2})},\mathbf{r}^{(k+\frac{1}{2})} )} {\| G_{2} \mathbf{r}^{(k+\frac{1}{2})}\|^{2}},~~Im(\theta_{k}) = \frac{ (\mathcal{H}(G_{2})\mathbf{r}^{(k+\frac{1}{2})},\mathbf{r}^{(k+\frac{1}{2})} )} {\| G_{2} \mathbf{r}^{(k+\frac{1}{2})}\|^{2}}. $$
(12)

Although we now have formulas for the two introduced parameters, they can hardly be computed from (11) and (12) in actual implementations, because the matrix-vector products \({\mathcal{H}}(G_{1})\mathbf {r}^{(k)}\), \(\mathcal {S}(G_{1})\mathbf {r}^{(k)}\), \({\mathcal{H}}(G_{2})\mathbf {r}^{(k+\frac {1}{2})}\) and \(\mathcal {S}(G_{2})\mathbf {r}^{(k+\frac {1}{2})}\) are difficult to evaluate for given vectors r(k) and \(\mathbf {r}^{(k+\frac {1}{2})}\). Thus, we need a practical way of computing these parameters. Fortunately, they can be reformulated in the simpler forms

$$ \lambda_{k} = Re(\lambda_{k}) + \imath Im(\lambda_{k}) = \frac{(\mathcal{H}(G_{1})\mathbf{r}^{(k)},\mathbf{r}^{(k)} )} {\| G_{1} \mathbf{r}^{(k)}\|^{2}} + \imath \frac{(\imath \mathcal{S}(G_{1})\mathbf{r}^{(k)},\mathbf{r}^{(k)} )} {\| G_{1} \mathbf{r}^{(k)}\|^{2}} = \frac{(\mathbf{r}^{(k)}, G_{1}\mathbf{r}^{(k)})}{\| G_{1} \mathbf{r}^{(k)}\|^{2}} $$
(13)

and

$$ \begin{array}{llll} \theta_{k} &= Re(\theta_{k}) + \imath Im(\theta_{k}) = - \frac{ (\imath \mathcal{S}(G_{2})\mathbf{r}^{(k+\frac{1}{2})},\mathbf{r}^{(k+\frac{1}{2})} )} {\| G_{2} \mathbf{r}^{(k+\frac{1}{2})}\|^{2}} + \imath \frac{ (\mathcal{H}(G_{2})\mathbf{r}^{(k+\frac{1}{2})},\mathbf{r}^{(k+\frac{1}{2})} )} {\| G_{2} \mathbf{r}^{(k+\frac{1}{2})}\|^{2}} \\ &= \frac{\imath (\mathbf{r}^{(k+\frac{1}{2})},G_{2}\mathbf{r}^{(k+\frac{1}{2})})}{\| G_{2} \mathbf{r}^{(k+\frac{1}{2})}\|^{2}}. \end{array} $$
(14)

Using (7) and (5), the computing formulas (13) and (14) can be further rewritten as

$$ \lambda_{k} = \frac{(\mathbf{r}^{(k)}, A\mathbf{d}^{(k)})}{\| A\mathbf{d}^{(k)}\|^{2}},~~\theta_{k} =\imath\frac{(\mathbf{r}^{(k+\frac{1}{2})},A\mathbf{d}^{(k+\frac{1}{2})})}{\| A\mathbf{d}^{(k+\frac{1}{2})}\|^{2}}. $$
(15)
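
The optimality of (15) is easy to check numerically. The following sketch is an illustration only, using hypothetical random test data with NumPy (W symmetric positive definite, T symmetric positive semi-definite are constructed artificially); it verifies that λk computed as in (15) yields a smaller half-step residual norm than nearby complex values.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 50

# hypothetical random test data: W symmetric positive definite,
# T symmetric positive semi-definite
B = rng.standard_normal((n, n))
W = B @ B.T + n * np.eye(n)
C = rng.standard_normal((n, n))
T = C @ C.T
A = W + 1j * T

alpha = 1.0
b = rng.standard_normal(n) + 1j * rng.standard_normal(n)
r = b.copy()                                    # residual for the zero initial guess
d = np.linalg.solve(alpha * np.eye(n) + W, r)   # d^{(k)} = (alpha I + W)^{-1} r^{(k)}
g = A @ d                                       # G_1 r^{(k)} = A d^{(k)}

# lambda_k = (r^{(k)}, A d^{(k)}) / ||A d^{(k)}||^2 with (x, y) = y^* x
lam = (g.conj() @ r) / (g.conj() @ g)
best = np.linalg.norm(r - lam * g)

# nearby complex perturbations of lambda_k never give a smaller residual norm
for eps in (1e-3, -1e-3, 1e-3j, -1e-3j):
    assert best <= np.linalg.norm(r - (lam + eps) * g) + 1e-12
print("lambda_k =", lam, " residual:", np.linalg.norm(r), "->", best)
```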

Based on the above analysis of the iteration parameters λk and 𝜃k in the residual-updating iteration scheme (6), the MRMHSS iteration method can be formulated as follows.

Method 1 (The MRMHSS iteration method)

Given an initial guess \(\mathbf {x}^{(0)}\in \mathbb {C}^{n}\), compute \(\mathbf{r}^{(0)} = \mathbf{b} - A\mathbf{x}^{(0)}\). For k = 0,1,2,…, until {x(k)} converges:

  1.

    calculate \(\mathbf {x}^{(k+\frac {1}{2})}\) from the iteration formula \(\mathbf {x}^{(k+\frac {1}{2})} = \mathbf {x}^{(k)} + \lambda _{k} \mathbf {d}^{(k)}\), where

    $$ \mathbf{d}^{(k)} = (\alpha I + W )^{-1} \mathbf{r}^{(k)},~~\lambda_{k} = \frac{(\mathbf{r}^{(k)}, A\mathbf{d}^{(k)})}{\| A\mathbf{d}^{(k)}\|^{2}}; $$
  2.

    calculate x(k+ 1) from the iteration formula \(\mathbf {x}^{(k+1)} = \mathbf {x}^{(k+\frac {1}{2})} - \imath \theta _{k} \mathbf {d}^{(k+\frac {1}{2})}\), where

    $$ \mathbf{d}^{(k+\frac{1}{2})} = (\alpha I + T )^{-1} \mathbf{r}^{(k + \frac{1}{2} )},~~\theta_{k} =\imath\frac{(\mathbf{r}^{(k+\frac{1}{2})},A\mathbf{d}^{(k+\frac{1}{2})})}{\| A\mathbf{d}^{(k+\frac{1}{2})}\|^{2}}, $$

where α is a given positive constant and I is the identity matrix.
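
For concreteness, the following is a minimal Python/SciPy sketch of Method 1 (a sketch under our own implementation choices, not the authors' code): it assumes W and T are given as real sparse symmetric matrices and, purely for illustration, solves the two shifted subsystems with sparse LU factorizations computed once and reused; sparse Cholesky or (preconditioned) CG would serve equally well, as discussed below.

```python
import numpy as np
from scipy.sparse import identity, csc_matrix
from scipy.sparse.linalg import splu

def mrmhss(W, T, b, alpha, tol=1e-6, kmax=1000):
    """Minimal sketch of Method 1 (MRMHSS).

    W: real sparse symmetric positive definite matrix,
    T: real sparse symmetric positive semi-definite matrix, alpha > 0.
    The two shifted subsystems are solved here with sparse LU factorizations
    computed once and reused at every iteration step.
    """
    n = b.shape[0]
    A = csc_matrix(W + 1j * T)
    I = identity(n, format='csc')
    FW = splu(csc_matrix(alpha * I + W))
    FT = splu(csc_matrix(alpha * I + T))

    def solve_real(F, v):
        # the shifted matrices are real, so solve for the real and
        # imaginary parts of the complex right-hand side separately
        return F.solve(v.real) + 1j * F.solve(v.imag)

    x = np.zeros(n, dtype=complex)
    r = b - A @ x
    nb = np.linalg.norm(b)
    for k in range(kmax):
        if np.linalg.norm(r) <= tol * nb:
            return x, k
        # first half-step: x^{(k+1/2)} = x^{(k)} + lambda_k * d^{(k)}
        d = solve_real(FW, r)
        Ad = A @ d
        lam = (Ad.conj() @ r) / (Ad.conj() @ Ad)
        x = x + lam * d
        r = r - lam * Ad                       # r^{(k+1/2)}, cf. (8)
        # second half-step: x^{(k+1)} = x^{(k+1/2)} - i*theta_k * d^{(k+1/2)}
        d = solve_real(FT, r)
        Ad = A @ d
        theta = 1j * (Ad.conj() @ r) / (Ad.conj() @ Ad)
        x = x - 1j * theta * d
        r = r + 1j * theta * Ad                # r^{(k+1)}, cf. (8)
    return x, kmax
```

Replacing lam and theta by 1 in this sketch recovers the MHSS scheme (4), in agreement with Remark 1 below.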

Remark 1

When we choose λk = 𝜃k ≡ 1 at each iteration step of Method 1, the MRMHSS iteration method reduces to the classical MHSS iteration method.

It is evident that only two real symmetric linear subsystems need to be solved at each step of the MRMHSS iteration method, and their coefficient matrices are exactly the same as those in the MHSS method. Moreover, since \(W\in \mathbb {R}^{n\times n}\) is symmetric positive definite, \(T\in \mathbb {R}^{n\times n}\) is symmetric positive semi-definite, and α is a positive real scalar, both matrices αI + W and αI + T are symmetric positive definite. Thus, the two linear subsystems involved in each step of the MRMHSS iteration can be solved effectively by the Cholesky factorization or by the (preconditioned) conjugate gradient (CG) method.
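
When factorizations are too costly in time or memory, the two inner solves can instead be carried out inexactly with CG, as mentioned in the preceding paragraph. A possible sketch follows (illustrative only; the inner tolerance and its interplay with the outer MRMHSS iteration are not analyzed here, and the keyword for setting the CG tolerance differs between SciPy versions, so it is omitted):

```python
from scipy.sparse.linalg import cg

def solve_spd_cg(M, rhs):
    """Approximately solve the real SPD system M y = rhs for a complex
    right-hand side by applying CG to its real and imaginary parts
    separately (M stands for alpha*I + W or alpha*I + T)."""
    yr, info_r = cg(M, rhs.real)
    yi, info_i = cg(M, rhs.imag)
    if info_r != 0 or info_i != 0:
        raise RuntimeError("inner CG iteration did not converge")
    return yr + 1j * yi
```

In the sketch of Method 1 given earlier, the call solve_real(FW, r) could then simply be replaced by solve_spd_cg(alpha * I + W, r).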

Remark 2

The MRMHSS iteration method is also applicable to the case where the matrix W is symmetric positive semi-definite and T is symmetric positive definite. More generally, if there exist real numbers β and δ such that both matrices \(\tilde {W} := \beta W + \delta T\) and \(\tilde {T} := \beta T - \delta W\) are symmetric positive semi-definite with at least one of them positive definite, we can first multiply the complex symmetric system (1) by the complex number β − ıδ to obtain the equivalent system

$$ (\tilde{W} + \imath\tilde{T})\mathbf{x} = \tilde{\mathbf{b}},~~\textup{with}~\tilde{\mathbf{b}}:=(\beta-\imath\delta)\mathbf{b}, $$
(16)

and then employ the MRMHSS iteration method to solve (16) approximately.
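
As a small illustration of Remark 2 (a hypothetical helper, with dense NumPy arrays assumed), the equivalent system (16) can be formed by scaling the original system and splitting the result into real and imaginary parts:

```python
import numpy as np

def rescale_system(W, T, b, beta, delta):
    """Multiply (W + iT) x = b by (beta - i*delta) and return the real and
    imaginary parts of the scaled coefficient matrix together with the
    scaled right-hand side.  Whether the returned matrices satisfy the
    definiteness assumptions of Remark 2 depends on beta and delta."""
    M = (beta - 1j * delta) * (W + 1j * T)
    W_tilde = M.real                     # equals beta*W + delta*T
    T_tilde = M.imag                     # equals beta*T - delta*W
    b_tilde = (beta - 1j * delta) * b
    return W_tilde, T_tilde, b_tilde
```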

3 The convergence property of the MRMHSS iteration method

As we know, the iteration parameters λk and 𝜃k involved in the MRMHSS iteration method are chosen to minimize the corresponding residual norms \(\| \mathbf {r}^{(k+\frac {1}{2})} \|\) and ∥r(k+ 1)∥, respectively. In this section, we will show a closer relationship between λk, 𝜃k and ∥r(k+ 1)∥. Furthermore, the convergence property of the MRMHSS iteration method will also be discussed.

To this end, we first present an important lemma.

Lemma 1 (See [23])

Let \( C = (c_{ij}) \in \mathbb {C}^{n \times n } \) be a Hermitian matrix. For any vector \( \mathbf {z} \in \mathbb {C}^{n} \), we have

$$ \frac{ \partial (C\mathbf{z},\mathbf{z}) }{ \partial (\| \mathbf{z} \|^{2}) } = \frac{ 1 }{ \| \mathbf{z} \|^{2} } (C\mathbf{z},\mathbf{z}), $$

where ∂ denotes the partial derivative.

Now, by making use of the above lemma, we can obtain the following property of the iteration parameters λk and 𝜃k.

Theorem 1

The quadruple (Re(λk),Im(λk),Re(𝜃k),Im(𝜃k)) determined by (11) and (12) is the minimum point of the function ∥r(k+ 1)∥, which implies that the values of λk and 𝜃k defined by (13) and (14) are optimal in the complex field \(\mathbb {C}\).

Proof 1

From (8) and (10), the residual norm ∥r(k+ 1)∥2 can be viewed as a real-valued function of the complex variables λk and 𝜃k, or equivalently, of the real variables Re(λk), Im(λk), Re(𝜃k) and Im(𝜃k). Let \(\tilde {\mathbf {r}}(\xi _{1},\xi _{2}) := \mathbf {r}^{(k)} - (\xi _{1}+\imath \xi _{2})G_{1} \mathbf {r}^{(k)}\), and define a function ϕ as

$$ \begin{array}{@{}rcl@{}} \phi(\tilde{\mathbf{r}}(\xi_{1},\xi_{2}), \eta_{1}, \eta_{2}) &:=& \| \tilde{\mathbf{r}} \|^{2} - 2\eta_{2}(\mathcal{H}(G_{2})\tilde{\mathbf{r}}, \tilde{\mathbf{r}})\\ &&+ 2\eta_{1}(\imath\mathcal{S}(G_{2})\tilde{\mathbf{r}}, \tilde{\mathbf{r}}) + ({\eta_{1}^{2}} + {\eta_{2}^{2}})\| G_{2}\tilde{\mathbf{r}} \|^{2}. \end{array} $$
(17)

Since \(\| G_{2}\tilde {\mathbf {r}} \|^{2} = (G_{2}^{\ast } G_{2}\tilde {\mathbf {r}}, \tilde {\mathbf {r}})\), using (17) and Lemma 1 we have

$$ \begin{aligned} \frac{\partial{\phi}} {\partial{(\| \tilde{\mathbf{r}} \|^{2})}} &= \frac{\| \tilde{\mathbf{r}} \|^{2}} {\| \tilde{\mathbf{r}} \|^{2}} - \frac{2\eta_{2}}{\| \tilde{\mathbf{r}} \|^{2}} (\mathcal{H}(G_{2})\tilde{\mathbf{r}}, \tilde{\mathbf{r}}) \\&~~~~+ \frac{2\eta_{1}} {\| \tilde{\mathbf{r}} \|^{2}} (\imath\mathcal{S}(G_{2})\tilde{\mathbf{r}}, \tilde{\mathbf{r}}) + \frac{{\eta_{1}^{2}} + {\eta_{2}^{2}}}{\| \tilde{\mathbf{r}} \|^{2}}\| G_{2}\tilde{\mathbf{r}} \|^{2} \\&= \frac{1}{\| \tilde{\mathbf{r}} \|^{2}} \phi. \end{aligned} $$
(18)

Define

$$ \varphi (\xi_{1},\xi_{2},\eta_{1},\eta_{2}) := \phi(\tilde{\mathbf{r}}(\xi_{1},\xi_{2}),\eta_{1},\eta_{2}). $$
(19)

Then, the four first-order partial derivatives of φ(ξ1,ξ2,η1,η2) can be calculated as

$$\begin{aligned} & \frac{\partial{\varphi}} {\partial{\xi_{1}}} = \frac{\partial{\phi}} {\partial{(\| \tilde{\mathbf{r}} \|^{2})}} \frac{\partial{(\| \tilde{\mathbf{r}} \|^{2})}} {\partial{\xi_{1}}} = \frac{2\phi} {\| \tilde{\mathbf{r}} \|^{2}} \left( \xi_{1}\| G_{1} \mathbf{r}^{(k)} \|^{2} - (\mathcal{H}(G_{1}) \mathbf{r}^{(k)}, \mathbf{r}^{(k)}) \right), \\ & \frac{\partial{\varphi}} {\partial{\xi_{2}}} = \frac{\partial{\phi}} {\partial{(\| \tilde{\mathbf{r}} \|^{2})}} \frac{\partial{(\| \tilde{\mathbf{r}} \|^{2})}} {\partial{\xi_{2}}} = \frac{2\phi} {\| \tilde{\mathbf{r}} \|^{2}} \left( \xi_{2} \| G_{1} \mathbf{r}^{(k)} \|^{2} - (\imath \mathcal{S}(G_{1}) \mathbf{r}^{(k)}, \mathbf{r}^{(k)}) \right),\\ & \frac{\partial{\varphi}} {\partial{\eta_{1}}} = 2\left( \eta_{1}\| G_{2} \tilde{\mathbf{r}} \|^{2} + (\imath \mathcal{S}(G_{2}) \tilde{\mathbf{r}}, \tilde{\mathbf{r}}) \right),\\ & \frac{\partial{\varphi}} {\partial{\eta_{2}}} = 2\left( \eta_{2}\| G_{2} \tilde{\mathbf{r}} \|^{2} - (\mathcal{H}(G_{2}) \tilde{\mathbf{r}}, \tilde{\mathbf{r}}) \right). \end{aligned}$$

It is easy to see from (11) and (12) that the quadruple (Re(λk),Im(λk),Re(𝜃k), Im(𝜃k)) is the unique stationary point of the function φ defined by (19). Furthermore, in order to verify that this quadruple is the minimum point of φ, its second-order partial derivatives are required. For convenience, we first define the four quantities

$$ \begin{aligned} & {\Psi}_{1}(\xi_{1}) := \xi_{1} \| G_{1} \mathbf{r}^{(k)} \|^{2} - (\mathcal{H}(G_{1}) \mathbf{r}^{(k)},\mathbf{r}^{(k)} ), \\ & {\Psi}_{2}(\xi_{2}) := \xi_{2} \| G_{1} \mathbf{r}^{(k)} \|^{2} - (\imath \mathcal{S}(G_{1}) \mathbf{r}^{(k)}, \mathbf{r}^{(k)} ) ,\\ &{\Psi}_{3}(\xi_{1},\xi_{2},\eta_{1}) := \eta_{1} \| G_{2} \tilde{\mathbf{r}} \|^{2} + (\imath \mathcal{S}(G_{2}) \tilde{\mathbf{r}}, \tilde{\mathbf{r}} ) , \\ &{\Psi}_{4}(\xi_{1},\xi_{2},\eta_{2}) := \eta_{2} \| G_{2} \tilde{\mathbf{r}} \|^{2} - (\mathcal{H}(G_{2}) \tilde{\mathbf{r}},\tilde{\mathbf{r}} ) . \end{aligned} $$

Then, the second-order partial derivatives of φ(ξ1,ξ2,η1,η2) read

$$ \begin{array}{@{}rcl@{}} \frac{ \partial^{2} \varphi } { \partial {\xi_{1}^{2}} } \!= ~\!{\Psi}_{1}(\xi_{1}) \frac{ \partial }{ \partial \xi_{1} } \left( \frac{ 2\phi }{ \| \tilde{\mathbf{r}} \|^{2} } \right) +\frac{ 2\phi }{ \| \tilde{\mathbf{r}} \|^{2} } \| G_{1} \mathbf{r}^{(k)} \|^{2}, \quad ~~\frac{\partial^{2}\varphi} {\partial{\xi_{1}} \partial{\xi_{2}}} = {\Psi}_{1}(\xi_{1}) \frac{ \partial }{ \partial \xi_{2} } \left( \frac{ 2\phi }{ \| \tilde{\mathbf{r}} \|^{2} } \right), \\ \frac{ \partial^{2} \varphi } { \partial {\xi_{2}^{2}} } = ~{\Psi}_{2}(\xi_{2}) \frac{ \partial }{ \partial \xi_{2} } \left( \frac{ 2\phi }{ \| \tilde{\mathbf{r}} \|^{2} } \right) +\frac{ 2\phi }{ \| \tilde{\mathbf{r}} \|^{2} } \| G_{1} \mathbf{r}^{(k)} \|^{2}, \quad ~~\frac{\partial^{2}\varphi} {\partial{\xi_{2}} \partial{\xi_{1}}} = {\Psi}_{2}(\xi_{2}) \frac{ \partial }{ \partial \xi_{1} } \left( \frac{ 2\phi }{ \| \tilde{\mathbf{r}} \|^{2} } \right), \\ \frac{\partial^{2}\varphi} {\partial{\eta_{1}} \partial{\xi_{1}}} = ~4 {\Psi}_{3}(\xi_{1},\xi_{2},\eta_{1}) \frac{ {\Psi}_{1}(\xi_{1}) }{ \| \tilde{\mathbf{r}} \|^{2} },\quad ~~\frac{\partial^{2}\varphi} {\partial{\eta_{1}} \partial{\xi_{2}}} = 4{\Psi}_{3}(\xi_{1},\xi_{2},\eta_{1}) \frac{ {\Psi}_{2}(\xi_{2}) }{ \| \tilde{\mathbf{r}} \|^{2} }, \\ \frac{\partial^{2}\varphi} {\partial{\eta_{2}} \partial{\xi_{1}}} = ~4 {\Psi}_{4}(\xi_{1},\xi_{2},\eta_{2}) \frac{ {\Psi}_{1}(\xi_{1}) }{ \| \tilde{\mathbf{r}} \|^{2} },\quad ~~\frac{\partial^{2}\varphi} {\partial{\eta_{2}} \partial{\xi_{2}}} = 4 {\Psi}_{4}(\xi_{1},\xi_{2},\eta_{2}) \frac{ {\Psi}_{2}(\xi_{2}) }{ \| \tilde{\mathbf{r}} \|^{2} }, \\ \frac{\partial^{2}\varphi} {\partial{\xi_{1}} \partial{\eta_{1}}} = ~ {\Psi}_{1}(\xi_{1}) \frac{ \partial }{ \partial \eta_{1} } \left( \frac{ 2\phi }{ \| \tilde{\mathbf{r}} \|^{2} } \right), \quad ~~\frac{\partial^{2}\varphi} {\partial{\xi_{1}} \partial{\eta_{2}}} = {\Psi}_{1}(\xi_{1}) \frac{ \partial }{ \partial \eta_{2} } \left( \frac{ 2\phi }{ \| \tilde{\mathbf{r}} \|^{2} } \right), \\ \frac{\partial^{2}\varphi} {\partial{\xi_{2}} \partial{\eta_{1}}} = ~{\Psi}_{2}(\xi_{2}) \frac{ \partial }{ \partial \eta_{1} } \left( \frac{ 2\phi }{ \| \tilde{\mathbf{r}} \|^{2} } \right),\quad ~~\frac{\partial^{2}\varphi} {\partial{\xi_{2}} \partial{\eta_{2}}} = {\Psi}_{2}(\xi_{2}) \frac{ \partial }{ \partial \eta_{2} } \left( \frac{ 2\phi }{ \| \tilde{\mathbf{r}} \|^{2} } \right), \\ \frac{\partial^{2} \varphi} {\partial{\eta_{1}}^{2}} = ~\frac{\partial^{2} \varphi} {\partial{\eta_{2}}^{2}} = 2\| G_{2} \tilde{\mathbf{r}} \|^{2},\quad ~~\frac{\partial^{2}\varphi} {\partial{\eta_{1}}\partial{\eta_{2}}} = \frac{\partial^{2}\varphi} {\partial{\eta_{2}}\partial{\eta_{1}}} = 0. \end{array} $$

Meanwhile, it is easy to see that Ψ1(Re(λk)) = Ψ2(Im(λk)) = 0 and

$$ {\Psi}_{3}(Re(\lambda_{k}), Im(\lambda_{k}), Re(\theta_{k})) = {\Psi}_{4}(Re(\lambda_{k}), Im(\lambda_{k}), Im(\theta_{k})) = 0. $$

Hence, the Hessian matrix of the function φ at the unique stationary point (Re(λk), Im(λk),Re(𝜃k),Im(𝜃k)) reads

$$ \mathbf{H} = \left[ \begin{array}{cccc} \frac{ 2 \| \mathbf{r}^{(k + 1 )} \|^{2} \| G_{1} \mathbf{r}^{(k)} \|^{2} }{ \| \mathbf{r}^{(k + \frac{1}{2} )} \|^{2} } & 0 & 0 & 0 \\ 0 & \frac{ 2 \| \mathbf{r}^{(k + 1 )} \|^{2} \| G_{1} \mathbf{r}^{(k)} \|^{2} }{ \| \mathbf{r}^{(k + \frac{1}{2} )} \|^{2} } & 0 & 0 \\ 0 & 0 & 2 \| G_{2} \mathbf{r}^{(k + \frac{1}{2} )} \|^{2} & 0 \\ 0 & 0 & 0 & 2 \| G_{2} \mathbf{r}^{(k + \frac{1}{2} )} \|^{2} \end{array} \right], $$

which is a Hermitian and positive definite matrix.

Therefore, the quadruple (Re(λk),Im(λk),Re(𝜃k),Im(𝜃k)) determined by (11) and (12) is the minimum point of the residual norm ∥r(k+ 1)∥. □

Now, we study the convergence property of the MRMHSS iteration method. Denote by \({\mathcal{F}}(M)\) the field of values of a complex matrix \(M\in \mathbb {C}^{n\times n}\), i.e.,

$$ \mathcal{F}(M) = \{(M\mathbf{y},\mathbf{y})/(\mathbf{y},\mathbf{y}): \mathbf{0}\neq \mathbf{y}\in\mathbb{C}^{n} \}, $$

where (⋅,⋅) represents the Euclidean inner product of two vectors in \(\mathbb {C}^{n}\).

Theorem 2

The MRMHSS iteration method used for solving the complex symmetric system of linear equations (1) is convergent for any initial guess \(\mathbf {x}^{(0)}\in \mathbb {C}^{n}\) if and only if

$$ 0\notin\mathcal{F}(A(\alpha I + W)^{-1}) \cap \mathcal{F}(A(\alpha I + T)^{-1}). $$
(20)

Furthermore, when (20) holds, the residuals satisfy

$$ \| \mathbf{r}^{(k+1)} \| \leq \frac{ \sqrt{\| A(\alpha I+W)^{-1} \|^{2} - {\delta_{1}^{2}}} }{ \| A(\alpha I+W)^{-1} \| } \frac{ \sqrt{\| A(\alpha I+T)^{-1} \|^{2} - {\delta_{2}^{2}}} }{ \| A(\alpha I+T)^{-1} \| } \cdot \| \mathbf{r}^{(k)} \| $$
(21)

for any nonnegative integer k, where δ1 and δ2 represent the distances from the origin to \({\mathcal{F}}(A(\alpha I + W)^{-1})\) and \({\mathcal{F}}(A(\alpha I + T)^{-1})\), respectively.

Proof 2

Denote by the superscript ⊥ the orthogonal complement of a subspace of \(\mathbb {C}^{n}\). From the construction of the MRMHSS iteration method in Section 2, each step of the MRMHSS method consists of two residual projection procedures; that is, \(\mathbf {r}^{(k+\frac {1}{2})}\) and r(k+ 1) are the orthogonal projections of r(k) and \(\mathbf {r}^{(k+\frac {1}{2})}\) onto the subspaces \(\text{span}\{A(\alpha I+W)^{-1}\mathbf{r}^{(k)}\}^{\bot }\) and \(\text{span}\{A(\alpha I+T)^{-1}\mathbf{r}^{(k+\frac{1}{2})}\}^{\bot }\), respectively. Hence, \(\| \mathbf {r}^{(k+\frac {1}{2})} \| \leq \| \mathbf {r}^{(k)} \| \), with equality if and only if r(k) is already orthogonal to A(αI + W)− 1r(k), i.e.,

$$(\mathbf{r}^{(k)}, A(\alpha I+W)^{-1} \mathbf{r}^{(k)}) = 0.$$

This implies that \(\| \mathbf {r}^{(k+\frac {1}{2})} \| < \| \mathbf {r}^{(k)} \| \) if and only if \(0\notin {\mathcal{F}}((\alpha I+W)^{-\ast } A^{\ast } )\). Since, for any matrix \(M\in \mathbb {C}^{n\times n}\), the field of values of \(M^{\ast}\) is precisely the complex conjugate of the field of values of M, it follows that

$$ \| \mathbf{r}^{(k+\frac{1}{2})} \| < \| \mathbf{r}^{(k)} \| ~~\text{if and only if}~~0\notin\mathcal{F}(A(\alpha I+W)^{-1}). $$

Similarly, we can obtain that

$$ \| \mathbf{r}^{(k+1)} \| < \| \mathbf{r}^{(k+\frac{1}{2})} \| ~~\text{if and only if}~~0\notin\mathcal{F}(A(\alpha I+T)^{-1}). $$

Therefore, we can finally obtain that ∥r(k+ 1)∥ < ∥r(k)∥ if and only if

$$ 0\notin\mathcal{F}(A(\alpha I+W)^{-1}) \cap \mathcal{F}(A(\alpha I+T)^{-1}). $$
(22)

Recalling that the field of values is a closed set, the distances of \({\mathcal{F}}(A(\alpha I+W)^{-1})\) and \({\mathcal{F}}(A(\alpha I+T)^{-1})\) from the origin can be, respectively, written as

$$ \delta_{1} = \underset{0\neq y\in\mathbb{C}^{n} }{\min} \!\left| \frac{(A(\alpha I+W)^{-1} y,y)} {(y,y) } - 0 \right| ~~\text{and}~~ \delta_{2} = \underset{0\neq y\in\mathbb{C}^{n} }{\min} \!\left| \frac{(A(\alpha I+T)^{-1} y,y)} {(y,y)} - 0 \right|\!. $$

Substituting (11) and (12), respectively, into (9) and (10), we obtain

$$ \begin{aligned} \| \mathbf{r}^{(k+\frac{1}{2})} \|^{2} =& \| \mathbf{r}^{(k)} \|^{2} - \frac{ | (A(\alpha I+W)^{-1} \mathbf{r}^{(k)}, \mathbf{r}^{(k)}) |^{2} } { \| A(\alpha I+W)^{-1} \mathbf{r}^{(k)} \|^{2} } \\ =& \| \mathbf{r}^{(k)} \|^{2} \left( 1-\left| \frac{ (A(\alpha I+W)^{-1} \mathbf{r}^{(k)}, \mathbf{r}^{(k)}) }{ (\mathbf{r}^{(k)}, \mathbf{r}^{(k)}) } \right|^{2} \frac{ \| \mathbf{r}^{(k)} \|^{2} }{ \| A(\alpha I+W)^{-1} \mathbf{r}^{(k)} \|^{2} } \right) \\ \leq& \| \mathbf{r}^{(k)} \|^{2} \left( 1-\frac{ {\delta_{1}^{2}} }{ \| A(\alpha I+W)^{-1} \|^{2} } \right) \end{aligned} $$

and

$$ \| \mathbf{r}^{(k+1)} \|^{2} \leq \| \mathbf{r}^{(k+\frac{1}{2})} \|^{2} \left( 1- \frac{ {\delta_{2}^{2}} }{ \| A(\alpha I+T)^{-1} \|^{2} } \right). $$

Combining the above two inequalities, the result (21) is obtained, which completes the proof of the theorem. □
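
Condition (20) can also be inspected numerically for problems of moderate size. The distance from the origin to the field of values of a matrix M admits the support-function characterization dist(0, F(M)) = max(0, max over ω of λmin(H(e^{ıω}M))), so sampling the angle ω gives an estimate of δ1 and δ2. The sketch below is an illustration for small dense matrices only; the number of sampled angles is an arbitrary choice.

```python
import numpy as np

def fov_distance_from_origin(M, n_angles=720):
    """Estimate dist(0, F(M)) for a small dense matrix M by sampling the
    support-function characterization
        dist(0, F(M)) = max(0, max_w lambda_min(H(exp(i*w) * M))),
    where H(.) denotes the Hermitian part."""
    best = -np.inf
    for w in np.linspace(0.0, 2.0 * np.pi, n_angles, endpoint=False):
        Mw = np.exp(1j * w) * M
        Hw = (Mw + Mw.conj().T) / 2.0
        best = max(best, np.linalg.eigvalsh(Hw)[0])   # smallest eigenvalue
    return max(0.0, best)

# Condition (20) fails only when the origin lies in both fields of values,
# i.e. when both estimated distances vanish, e.g. (hypothetical dense data):
#   G1 = A @ np.linalg.inv(alpha * np.eye(n) + W)
#   G2 = A @ np.linalg.inv(alpha * np.eye(n) + T)
#   delta1, delta2 = fov_distance_from_origin(G1), fov_distance_from_origin(G2)
```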

Based on the above theorem, we now give a specific criterion guaranteeing the convergence of Method 1. To this end, we first give a useful lemma.

Lemma 2

[18, Proposition 1.18] Let \(M\in \mathbb {C}^{n\times n}\). Then the field of values of M is a convex set which contains the convex hull of its spectrum. It is equal to the convex hull of its spectrum when M is normal, i.e., \(\mathcal {F}(M) = Co(\sigma (M))\), where Co(⋅) denotes the convex hull of a given set and σ(⋅) indicates the spectrum of a matrix.

Corollary 1

Assume that the matrix A = W + ıT is normal; then the MRMHSS iteration method used for solving the complex symmetric system of linear equations (1) is convergent for any initial guess \(\mathbf {x}^{(0)}\in \mathbb {C}^{n}\).

Proof 3

Because \(A=W+\imath T\in \mathbb {C}^{n\times n}\) is normal, a straightforward computation shows that the real symmetric matrices W and T commute, i.e., WT = TW. Thus, we can easily verify that both matrices A(αI + W)− 1 and A(αI + T)− 1 are normal. From Lemma 2, it follows that

$$\mathcal{F}(A(\alpha I + W)^{-1}) = Co(\sigma(A(\alpha I + W)^{-1}) ) $$

and

$$\mathcal{F}(A(\alpha I + T)^{-1}) = Co(\sigma(A(\alpha I + T)^{-1}) ).$$

Hence, (20) reduces to

$$0\notin Co(\sigma(A(\alpha I + W)^{-1}) )\cap Co(\sigma(A(\alpha I + T)^{-1})),$$

which holds automatically, since in this situation the spectra of both matrices A(αI + W)− 1 and A(αI + T)− 1 lie in the open right half of the complex plane. Therefore, the MRMHSS iteration method used for solving the complex symmetric system of linear equations (1) is convergent for any initial guess \(\mathbf {x}^{(0)}\in \mathbb {C}^{n}\) whenever the system matrix A is normal. □

Corollary 2

Assume that the matrix A = W + ıT is normal; then the bound (21) on the convergence rate can be further estimated as

$$ \| \mathbf{r}^{(k+1)} \| \leq\frac{\sqrt{\kappa^{2} -1}}{\kappa}\| \mathbf{r}^{(k)} \|, $$
(23)

where κ1 and κ2 are the condition numbers of the matrices A(αI + W)− 1 and A(αI + T)− 1, respectively, and \(\kappa = \min \limits (\kappa _{1},\kappa _{2})\).

Proof 4

Since A = W + ıT is normal, it holds that WT = TW. Hence, there exists an orthogonal matrix \(Q\in \mathbb {R}^{n\times n}\) that simultaneously diagonalizes W and T,

$$ W=QDQ^{T} \quad \text{and} \quad T=Q{\Gamma} Q^{T}, $$

where

$$ D = diag(\xi_{1},\xi_{2},\ldots,\xi_{n}) \quad \text{and} \quad {\Gamma}=diag(\gamma_{1},\gamma_{2},\ldots,\gamma_{n}) $$

are diagonal matrices with ξj > 0 and γj ≥ 0 for j = 1,2,…,n. It follows that

$$ A(\alpha I + W)^{-1} = Q(D + \imath {\Gamma})(\alpha I + D)^{-1}Q^{T} $$

Letting \(\tilde {\mathbf {y}} = Q^{T}\mathbf {y}=(\tilde {y}_{1},\tilde {y}_{2},\ldots ,\tilde {y}_{n})^{T}\), we obtain

$$ \frac{(A(\alpha I + W)^{-1}\mathbf{y},\mathbf{y})}{(\mathbf{y},\mathbf{y})} = \frac{((D+\imath {\Gamma})(\alpha I + D)^{-1}\tilde{\mathbf{y}},\tilde{\mathbf{y}})}{(\tilde{\mathbf{y}},\tilde{\mathbf{y}})} =\sum\limits_{j=1}^{n}\mu_{j}\beta_{j}$$

where μj = (ξj + ıγj)(α + ξj)− 1 and \(\beta _{j} = |\tilde {y}_{j}|^{2}/{\sum }_{i=1}^{n}|\tilde {y}_{i}|^{2}\), j = 1,2,…,n. By straightforward computation, we have

$$|\sum\limits_{j=1}^{n} \mu_{j}\beta_{j}| \geq \underset{j}{\min}\{|\mu_{j}|\} =:\mu_{min},\quad\text{for any }\beta_{j}\in[0,1].$$

Thus, we can obtain δ1μmin, and then it follows that

$$\frac{ \sqrt{\| A(\alpha I+W)^{-1} \|^{2} - {\delta_{1}^{2}}} }{ \| A(\alpha I+W)^{-1} \| } \leq \frac{ \sqrt{\mu_{max}^{2} - \mu_{min}^{2}} }{ \mu_{max} } = \frac{\sqrt{{\kappa_{1}^{2}}-1}}{\kappa_{1}},$$

where \(\mu _{max} := \underset {j}{max}\{|\mu _{j}|\}\) is equal to the 2-norm of matrix A(αI + W)− 1 and \(\kappa _{1} = \frac {\mu _{max}}{\mu _{min}}\) denotes the condition number of matrix A(αI + W)− 1.

Similarly, we can estimate the second factor on the right-hand side of (21). Denoting by κ2 the condition number of the matrix A(αI + T)− 1, we have

$$\| \mathbf{r}^{(k+1)} \| \leq \frac{ \sqrt{{\kappa_{1}^{2}} -1} }{ \kappa_{1}} \frac{ \sqrt{{\kappa_{2}^{2}} -1} }{ \kappa_{2} } \cdot \| \mathbf{r}^{(k)} \|.$$

Furthermore, both \(({\kappa _{1}^{2}}-1)^{1/2}/\kappa _{1}\) and \(({\kappa _{2}^{2}}-1)^{1/2}/\kappa _{2}\) are less than unity, so their product does not exceed the smaller of the two; since \((\kappa ^{2}-1)^{1/2}/\kappa \) is increasing in κ, this smaller factor equals \((\kappa ^{2}-1)^{1/2}/\kappa \) with \(\kappa =\min (\kappa _{1},\kappa _{2})\), and the inequality (23) follows. The proof is completed. □

4 Numerical experiments

In this section, we use four examples to illustrate the feasibility and effectiveness of the MRMHSS iteration method.

We compare the numerical results, including the number of iteration steps (denoted by IT) and the elapsed CPU time (in seconds, denoted by CPU), of the MRMHSS iteration method with those of the MHSS, the generalized successive overrelaxation (GSOR) [19], the preconditioned MHSS (PMHSS) [4] and the MRHSS methods. In the tests, the MRMHSS, MRHSS, PMHSS and MHSS iteration methods are used to solve the original complex system (1), while the GSOR method is used to solve the equivalent real system of linear equations. For more details about the equivalent real system, one can refer to [2, 11, 19] and the references therein.

The subsystems of linear equations with coefficient matrices αI + W and αI + ıT in each step of the MRHSS iteration method are solved directly by the Cholesky factorization and the LU decomposition, respectively. The subsystems of linear equations in the MHSS, GSOR, PMHSS and MRMHSS iteration methods are solved directly by the Cholesky factorization, since their coefficient matrices are symmetric positive definite. It is worth mentioning that the symmetric approximate minimum degree (SYMAMD) reordering [18] is applied before the Cholesky or LU decomposition.

In actual implementations, we choose the zero vector as the initial guess, and all the computing processes are terminated once the current iterate x(k) satisfies the stopping criterion

$$ \frac{\| \mathbf{b} - A \mathbf{x}^{(k)} \|}{\| \mathbf{b} \|} \leq 10^{-6}, $$
(24)

or the number of iteration steps exceeds kmax = 1000. In addition, all the numerical experiments are performed in MATLAB [version 9.1.0.441655 (R2016b)] in double precision on a personal laptop with a 2.53 GHz central processing unit (Intel® Core™ i3 M380), 4.87 GB of memory and the Windows 10 operating system.

For the parameter values of the MHSS, MRHSS and MRMHSS methods, we use the experimentally found optimal ones, i.e., those leading to the least number of iteration steps; if the optimal parameter values form an interval, we use the value in this interval that leads to the least CPU time. For the GSOR method, the optimal iteration parameter values are chosen as presented in [19], and for the PMHSS method they are chosen as presented in [4], except for Example 4 and for the cases m = 512 in Examples 1, 2 and 3, where they are determined in the above-mentioned manner used for the MHSS, MRHSS and MRMHSS methods.
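
As an illustration only (not the procedure used to produce the tables below), the experimentally optimal α could in principle be reproduced by a simple sweep over a grid of candidate values with the mrmhss routine sketched after Method 1; the candidate grid itself is an arbitrary choice.

```python
import numpy as np

def sweep_alpha(W, T, b, candidates):
    """Run the mrmhss sketch for every candidate alpha and record the
    iteration counts; ties in the iteration count would then be broken
    by CPU time, as described in the text."""
    results = [(alpha, mrmhss(W, T, b, alpha)[1]) for alpha in candidates]
    best_alpha, best_its = min(results, key=lambda t: t[1])
    return best_alpha, results

# e.g. candidates = np.logspace(-3, 1, 25)
```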

Example 1

Consider the complex symmetric linear systems of the form (1) as below (cf. [3])

$$ \left[ \left( K + \frac{ 3 - \sqrt{3} }{ \tau } I \right) + \imath \left( K + \frac{ 3 + \sqrt{3} }{ \tau } I \right) \right] \mathbf{x} = \mathbf{b}, $$
(25)

where τ is the time step-size and K is the five-point centered difference matrix approximating the negative Laplacian operator with homogeneous Dirichlet boundary conditions on a uniform mesh in the two-dimensional unit square [0,1] × [0,1]. We set h = 1/(m + 1) and τ = h, where m is the number of inner grid points in one direction. The matrix \( K \in \mathbb {R}^{n \times n } \) then has the tensor-product form K = Vm ⊗ I + I ⊗ Vm, with \( V_{m} = h^{-2} tridiag(-1,2,-1) \in \mathbb {R}^{m \times m} \); hence K is an n × n block tridiagonal matrix with n = m². In our tests, we take

$$ W = K + \frac{ 3 - \sqrt{3} }{ \tau } I \text{~and~} T = K + \frac{ 3 + \sqrt{3} }{ \tau } I, $$

and the right-hand side vector b with its j-th entry bj given by

$$ b_{j} = \frac{ (1 - \imath ) j } { \tau (j+1)^{2} },~j = 1,2,\ldots,n. $$

In addition, we normalize the system by multiplying both sides by h². For further details, we refer to [3].
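
A possible way to assemble the test problem of Example 1 with SciPy sparse matrices is sketched below (an illustration under the stated discretization; the normalization by h² is included).

```python
import numpy as np
import scipy.sparse as sp

def example1(m):
    """Assemble W, T and b of Example 1 on an m x m interior grid (n = m^2),
    already normalized by h^2."""
    h = 1.0 / (m + 1)
    tau = h
    n = m * m
    Vm = sp.diags([-1.0, 2.0, -1.0], [-1, 0, 1], shape=(m, m)) / h**2
    Im = sp.identity(m)
    K = sp.kron(Vm, Im) + sp.kron(Im, Vm)       # K = Vm (x) I + I (x) Vm
    I = sp.identity(n)
    W = K + (3.0 - np.sqrt(3.0)) / tau * I
    T = K + (3.0 + np.sqrt(3.0)) / tau * I
    j = np.arange(1, n + 1)
    b = (1.0 - 1j) * j / (tau * (j + 1) ** 2)   # b_j = (1 - i) j / (tau (j+1)^2)
    return (h**2 * W).tocsc(), (h**2 * T).tocsc(), h**2 * b
```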

Example 2

Consider the complex symmetric linear systems of the form (1) as below (cf. [3])

$$ [(-\omega^{2}M + K ) + \imath (\omega C_{V} + C_{H} )] \mathbf{x} = \mathbf{b}, $$
(26)

where M is the inertia matrix, which is typically real symmetric and possibly singular (we take M = I); K is the (real symmetric) stiffness matrix; CV is the viscous damping matrix (real, diagonal, “highly singular” and possibly zero), and we choose CV = 10I; CH is the hysteretic damping matrix (also real symmetric), and we take CH = 𝜖K, where 𝜖 is a damping coefficient and ω is the circular frequency. Besides, we assume that K is the five-point centered difference matrix approximating the negative Laplacian operator with homogeneous Dirichlet boundary conditions on a uniform mesh in the unit square [0,1] × [0,1] with mesh-size h = 1/(m + 1). The matrix \( K \in \mathbb {R}^{n \times n }\) then has the tensor-product form K = Vm ⊗ I + I ⊗ Vm, where \( V_{m} = h^{-2}tridiag(-1,2,-1) \in \mathbb {R}^{m \times m } \) and I is the identity matrix in \( \mathbb {R}^{m \times m } \); thus, K is an n × n block tridiagonal matrix with n = m². Moreover, we take ω = π and 𝜖 = 0.02, and the right-hand side vector b as b = (1 + ı)A1, where 1 is the vector with all entries equal to one. Furthermore, we normalize (26) by multiplying both sides by h². For more details, we refer to [3].

Example 3

The system of linear equations (1) is of the form (W + ıT)x = b, where (cf. [3])

$$ T = I\otimes V + V\otimes I~~\textit{and}~~W = 10(I\otimes V_{c} + V_{c}\otimes I) + 9(\mathbf{e}_{1}\mathbf{e}_{m}^{T} + \mathbf{e}_{m}\mathbf{e}_{1}^{T}) \otimes I $$

with \(V=tridiag(-1,2,-1)\in \mathbb {R}^{m\times m},~V_{c} = V - \mathbf {e}_{1}\mathbf {e}_{m}^{T} - \mathbf {e}_{m}\mathbf {e}_{1}^{T}\in \mathbb {R}^{m\times m},\) and e1 and em are the first and the last unit vectors in \(\mathbb {R}^{m}\), respectively. We take the right-hand side vector b to be b = (1 + ı)A1, with 1 being the vector of all entries equal to one.

Here T and W correspond to the five-point centered difference matrices approximating the negative Laplacian operator with homogeneous Dirichlet boundary conditions and periodic boundary conditions, respectively, on a uniform mesh in the unit square [0,1] × [0,1] with mesh-size h = 1/(m + 1).

Example 4

We consider the complex Helmholtz equation (cf. [9])

$$ -{\Delta} u + \sigma_{1}u + \imath\sigma_{2}u = f, $$

where σ1 and σ2 are real coefficient functions and u satisfies Dirichlet boundary conditions on D = [0,1] × [0,1]. We discretize the problem with finite differences on an m × m grid with mesh-size h = 1/(m + 1). This leads to the system of linear equations

$$ ((K+\sigma_{1}I) + \imath\sigma_{2}I) \mathbf{x} = \mathbf{b}, $$

where K = I ⊗ Vm + Vm ⊗ I is the discretization of −Δ by means of centered differences, wherein \(V_{m} = h^{-2}tridiag(-1,2,-1)\in \mathbb {R}^{m\times m}\). The right-hand side vector b is taken to be b = (1 + ı)A1, with 1 being the vector with all entries equal to one. Furthermore, before solving the system we normalize the coefficient matrix and the right-hand side vector by multiplying both by h². For the numerical tests we set σ1 = σ2 = 100.
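
Analogously, a possible sketch for assembling the discretized complex Helmholtz problem of Example 4 (again only an illustration, with σ1 = σ2 = 100 and the h² normalization included):

```python
import numpy as np
import scipy.sparse as sp

def example4(m, sigma1=100.0, sigma2=100.0):
    """Assemble W, T and b of the discrete complex Helmholtz problem of
    Example 4 (n = m^2), already normalized by h^2."""
    h = 1.0 / (m + 1)
    n = m * m
    Vm = sp.diags([-1.0, 2.0, -1.0], [-1, 0, 1], shape=(m, m)) / h**2
    Im = sp.identity(m)
    K = sp.kron(Im, Vm) + sp.kron(Vm, Im)       # centered-difference -Laplacian
    I = sp.identity(n)
    W = (K + sigma1 * I).tocsc()
    T = (sigma2 * I).tocsc()
    A = W + 1j * T
    b = (1.0 + 1j) * (A @ np.ones(n))           # b = (1 + i) A 1
    return (h**2 * W).tocsc(), (h**2 * T).tocsc(), h**2 * b
```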

The experimentally found optimal iteration parameters αexp of the MHSS, GSOR, PMHSS, MRHSS and MRMHSS iteration methods for all the numerical examples with different mesh numbers are listed in Table 1. We observe that, for all the numerical examples and for all the iteration methods except the PMHSS iteration method, the optimal parameters αexp decrease as the mesh number increases. In addition, for almost all the numerical examples with different mesh numbers, the optimal parameter values of the MRHSS and MRMHSS iteration methods are much smaller than those of the MHSS, PMHSS and GSOR iteration methods.

Table 1 The optimal parameters αexp for the MHSS, GSOR, MRHSS and MRMHSS methods

In Table 2, we list the numerical results of the tested iteration methods for all four examples. From these results, we see that all the tested iteration methods converge within kmax iteration steps for all four examples with different mesh numbers m. For Examples 1, 2 and 4, the GSOR and MRHSS iteration methods perform much better than the MHSS iteration method, both in iteration counts and in computing time. However, among the PMHSS, GSOR and MRHSS iteration methods, the MRHSS becomes the worst one for Example 3 when the mesh number is large. Meanwhile, the PMHSS costs more computing time than the MHSS for Example 4, since in this example the matrix W is more complicated to decompose than the matrix T. Finally, we note that the MRMHSS iteration method discussed in this work is always the most efficient one for all four tested examples, as it requires the fewest iteration steps and the least computing time to achieve convergence. Moreover, for Examples 1, 2 and 4, the numbers of iteration steps of the MRMHSS, MRHSS and PMHSS methods are nearly h-independent.

Table 2 The numerical results of the tested iteration methods with respective optimal parameters

In Fig. 1, we plot the computing times of the three iteration methods MHSS, MRHSS and MRMHSS against the value of the iteration parameter α for m = 128. The curves clearly show that the MRMHSS iteration method always takes much less computing time than the MHSS and MRHSS iteration methods in all tested cases. In addition, compared with the MHSS method, the MRMHSS and MRHSS methods are not very sensitive to the value of the parameter α. The MRMHSS iteration method always attains its minimum computing time when α becomes very small, which is similar to the behavior of the MRHSS iteration method.

Fig. 1

Computing times versus the iteration parameter α for the MHSS, MRHSS and MRMHSS methods on the four tested examples; top-left: Example 1, top-right: Example 2, bottom-left: Example 3, bottom-right: Example 4

Therefore, we conclude that the MRMHSS iteration method discussed in this work is very efficient and robust when applied to the complex symmetric system of linear equations (1).

5 Conclusions

For the large sparse complex symmetric system of linear equations (1), we have proposed an efficient iteration method, named the minimum residual MHSS (MRMHSS) iteration method, obtained by applying the minimum residual technique to the MHSS iteration scheme. Although the MRMHSS method involves two more iteration parameters than the classical MHSS iteration scheme, these parameters can be computed easily and automatically. The convergence property of the MRMHSS iteration method was carefully studied, and the numerical results illustrate that the MRMHSS method is very robust and powerful.