1 Introduction

We consider iterative solutions of a consistent linear system of the following form:

$$\begin{aligned} \mathscr {A}\mathscr {X}=\left( \begin{array}{c@{\quad }c} A&{} B\\ -B^{T}&{} \mathbf {0} \end{array} \right) \left( \begin{array}{c} x\\ y \end{array} \right) =\left( \begin{array}{c} b\\ -q \end{array} \right) \end{aligned}$$
(1.1)

where \(A \in R^{m\times m}\) is positive real, i.e., \(v^TAv>0\) for any nonzero \(v\in R^m\), \( B \in R^{m\times n}\) is rank-deficient, i.e., \(rank(B)<n\), and \( b\in R^m\), \( q\in R^n\) with \( m\ge n\). We use \(B^{T}\) and \(B^*\) to denote the transpose and the conjugate transpose of the matrix B, respectively. The linear system (1.1) is often referred to as a saddle point problem; it arises in a wide variety of scientific and engineering applications such as computational fluid dynamics, optimization, optimal control, and constrained least-squares problems, see [1, 6, 7, 9, 12, 13]. For a wider class of (generalized) saddle point problems, the reader can refer to [10, 14, 15, 17, 18, 20–22].

In the case where A is symmetric positive definite and B has full column rank, a number of efficient iteration methods as well as their numerical properties have been studied. Bai et al. [3–7] proposed the Hermitian and skew-Hermitian splitting (HSS) iteration method and applied it to standard and generalized saddle point problems. Golub et al. [21] presented SOR-like methods for solving the linear system (1.1). Bai et al. [9, 10] developed the SOR-like methods further, presenting the generalized SOR (GSOR) method and the parameterized inexact Uzawa method. Wu et al. [26] proposed the modified symmetric SOR (MSSOR) method. Recently, Najari Saberi and Edalatpanah [24] proposed a triple-parameter modified SSOR (TMSSOR) method for solving saddle point problems based on a new splitting and new relaxation parameters.

In the linear system (1.1), when B is rank-deficient, the coefficient matrix is singular, and we call the linear system (1.1) a singular saddle point problem. Some iteration methods and preconditioning techniques for solving singular saddle point problems have been proposed in the recent literature, see, e.g., [2, 16, 23, 27, 28, 30, 31]. Zheng et al. [30] applied parameterized Uzawa methods to solve singular saddle point problems. Li and Huang [23] investigated the semi-convergence of the generalized SSOR methods. Bai et al. [8, 22] studied constraint preconditioners, and for non-Hermitian singular saddle point problems, Zhang and Shen [28] provided some constraint preconditioners to accelerate GMRES. In this paper, we study the TMSSOR method for solving singular saddle point problems.

The rest of this paper is organized as follows. In Sect. 2, we present the TMSSOR method for solving the singular saddle point problem (1.1). In Sect. 3, we establish the semi-convergence of this new method. In Sect. 4, we give the local optimal parameters of the TMSSOR method under certain restrictions on the parameters involved. In Sect. 5, several numerical experiments are given to show the efficiency of the method. Finally, conclusions are drawn in Sect. 6.

2 The triple-parameter modified SSOR method

For solving the singular saddle point problem (1.1), we consider the following matrix splitting [24]

$$\begin{aligned} \left( \begin{array}{c@{\quad }c} A&{} B\\ -B^{T}&{} \mathbf {0} \end{array} \right) = D-L-U \end{aligned}$$
(2.1)

where

$$\begin{aligned} D= \left( \begin{array}{c@{\quad }c} \beta A&{} \mathbf {0}\\ \mathbf {0}&{} Q \end{array} \right) ,\quad L~=~\left( \begin{array}{c@{\quad }c} \mathbf {0}&{} \mathbf {0}\\ B^{T}&{} \alpha Q \end{array}\right) ,\quad U= \left( \begin{array}{c@{\quad }c} (\beta -1)A&{} -B\\ \mathbf {0}&{} (1-\alpha )Q \end{array} \right) \end{aligned}$$

\(Q \in R^{n\times n}\) is symmetric positive definite, and \( \alpha ,\beta \) are two real parameters. Let \( c=(b^T,-q^T)^T\) and \(z^{(i)}=((x^{(i)})^T,(y^{(i)})^T)^T\) be the i-th approximate solution to (1.1).
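For concreteness, the splitting (2.1) is easy to form explicitly. The following NumPy sketch (the helper name `tmssor_splitting` and the random test blocks are our own illustrative choices, not part of [24]) builds D, L and U and checks that \(D-L-U\) reproduces the coefficient matrix of (1.1):

```python
import numpy as np

def tmssor_splitting(A, B, Q, alpha, beta):
    """Blocks D, L, U of the splitting (2.1) for given alpha, beta."""
    m, n = B.shape
    Zmn = np.zeros((m, n))
    D = np.block([[beta * A, Zmn], [Zmn.T, Q]])
    L = np.block([[np.zeros((m, m)), Zmn], [B.T, alpha * Q]])
    U = np.block([[(beta - 1) * A, -B], [Zmn.T, (1 - alpha) * Q]])
    return D, L, U

# Sanity check on a small random instance with m >= n.
rng = np.random.default_rng(0)
m, n = 6, 4
A = rng.standard_normal((m, m)); A = A @ A.T + m * np.eye(m)  # positive real
B = rng.standard_normal((m, n)); B[:, -1] = B[:, 0]           # rank(B) < n
Q = np.eye(n)                                                 # an SPD choice of Q
D, L, U = tmssor_splitting(A, B, Q, alpha=0.5, beta=1.2)
assert np.allclose(D - L - U, np.block([[A, B], [-B.T, np.zeros((n, n))]]))
```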

We compute the approximate solution \( z^{(i+1)}\) as follows.

$$\begin{aligned} z^{\left( i+\frac{1}{2}\right) }= & {} L_{\alpha ,\beta ,\omega }z^{(i)} +\omega (D-\omega L)^{-1}c \end{aligned}$$
(2.2)
$$\begin{aligned} z^{(i+1)}= & {} U_{\alpha ,\beta ,\omega }z^{\left( i+\frac{1}{2}\right) }+\omega (D-\omega U)^{-1}c \end{aligned}$$
(2.3)

where

$$\begin{aligned} L_{\alpha ,\beta ,\omega }= & {} (D-\omega L)^{-1}[(1-\omega )D+\omega U] \nonumber \\= & {} \left( \begin{array}{c@{\quad }c} (1-\dfrac{\omega }{\beta })I_m&{} -\dfrac{\omega }{\beta }A^{-1}B \\ \dfrac{\omega (\beta -\omega )}{\beta (1-\alpha \omega )}Q^{-1}B^{T} &{} I_n-\dfrac{\omega ^2}{\beta (1-\alpha \omega )}Q^{-1}B^{T}A^{-1}B \end{array} \right) \end{aligned}$$
(2.4)
$$\begin{aligned} U_{\alpha ,\beta ,\omega }= & {} (D-\omega U)^{-1}[(1-\omega )D+\omega L] \nonumber \\= & {} \left( \begin{array}{c@{\quad }c} \dfrac{(1-\omega )\beta }{\beta +\omega -\beta \omega }I_m-\dfrac{\omega ^2}{(\beta +\omega -\beta \omega )(1-\omega +\alpha \omega )}A^{-1}BQ^{-1}B^{T} &{} -\dfrac{\omega }{\beta +\omega -\beta \omega }A^{-1}B\\ \dfrac{\omega }{1-\omega +\alpha \omega }Q^{-1}B^{T}&{} I_n \end{array} \right) \nonumber \\ \end{aligned}$$
(2.5)

Notice

$$\begin{aligned} \quad \quad \quad D-\omega L= & {} \left( \begin{array}{c@{\quad }c}\beta A&{} \mathbf {0} \\ -\omega B^{T}&{}(1-\alpha \omega )Q\end{array}\right) , \\ D-\omega U= & {} \left( \begin{array}{c@{\quad }c}(\beta +\omega -\beta \omega )A&{}\omega B\\ \mathbf {0}&{}(1-\omega +\alpha \omega )Q \end{array}\right) \end{aligned}$$

Assume that \(\alpha \ne 0\), \(\alpha \ne 1\), \(\beta \ne 1\) and \(\omega \ne 0\). Since A is positive real and Q is symmetric positive definite, we obtain that

$$\begin{aligned} det(D-\omega L)=\beta ^m(1-\alpha \omega )^n det(A)det(Q)\ne 0 \end{aligned}$$

and

$$\begin{aligned} det(D-\omega U)=(\beta +\omega -\beta \omega )^m(1-\omega +\alpha \omega )^n det(A)det(Q)\ne 0 \end{aligned}$$

if and only if \(\beta (1-\alpha \omega )\ne 0 \), \((\beta +\omega -\beta \omega )(1-\omega +\alpha \omega )\ne 0 \) and \(\beta \ne 0 \), i.e., \( \omega \notin \left\{ 0, \dfrac{1}{\alpha },\dfrac{1}{1-\alpha },\dfrac{\beta }{\beta -1}\right\} \). From the above equations, we obtain the TMSSOR method as follows:

$$\begin{aligned} z^{(i+1)}=T_{\alpha ,\beta ,\omega }z^{(i)}+C_{\alpha ,\beta ,\omega } \end{aligned}$$
(2.6)

with

$$\begin{aligned} T_{\alpha ,\beta ,\omega }=U_{\alpha ,\beta ,\omega }L_{\alpha ,\beta ,\omega }= \left( \begin{array}{c@{\quad }c} T_{11}&{} T_{12}\\ T_{21}&{} T_{22} \end{array} \right) \end{aligned}$$
(2.7)

Here, \(T_{\alpha ,\beta ,\omega }\) is the iteration matrix of the TMSSOR iteration, where

$$\begin{aligned} T_{11}= & {} \frac{(1-\omega )(\beta -\omega )}{\beta +\omega -\beta \omega }I_m \\&-\frac{\omega ^2(\beta -\omega )(2-\omega )}{\beta (\beta +\omega -\beta \omega )(1-\omega +\alpha \omega ) (1-\alpha \omega )} A^{-1}BQ^{-1}B^{T}\\ T_{12}= & {} -\frac{\omega (2-\omega )}{\beta +\omega -\beta \omega } A^{-1}B \\&+ \frac{\omega ^3(2-\omega )}{\beta (\beta +\omega -\beta \omega )(1-\omega +\alpha \omega )(1-\alpha \omega )} A^{-1}BQ^{-1}B^{T}A^{-1}B \\ T_{21}= & {} \dfrac{\omega (\beta -\omega )(2-\omega )}{\beta (1-\omega +\alpha \omega )(1-\alpha \omega )}Q^{-1}B^{T}, \\ T_{22}= & {} I_n-\dfrac{\omega ^2(2-\omega )}{\beta (1-\omega +\alpha \omega )(1-\alpha \omega )}Q^{-1}B^{T}A^{-1}B \end{aligned}$$

and

$$\begin{aligned} \begin{aligned} C_{\alpha ,\beta ,\omega }&= \omega (D-\omega U)^{-1}[(1-\omega )D+\omega L](D-\omega L)^{-1} \left( \begin{array}{c} b \\ -q \end{array} \right) \\&\quad +\,\omega (D-\omega U)^{-1}\left( \begin{array}{c} b \\ -q \end{array} \right) \\&=\omega (2-\omega )(D-\omega U)^{-1}D(D-\omega L)^{-1} \left( \begin{array}{c} b \\ -q \end{array} \right) \\&=\omega (2-\omega ) \\&\quad \times \left( \begin{array}{c} \dfrac{1}{\beta +\omega -\beta \omega }A^{-1}b-\dfrac{\omega ^2}{\beta (\beta +\omega -\beta \omega )(1-\omega +\alpha \omega )(1-\alpha \omega )}A^{-1}BQ^{-1}B^{T}A^{-1}b\\ \quad \quad \quad \quad \quad \quad \quad \quad +\dfrac{\omega }{(\beta +\omega -\beta \omega )(1-\omega +\alpha \omega )(1-\alpha \omega )}A^{-1}BQ^{-1}q\\ \dfrac{\omega }{\beta (1-\omega +\alpha \omega )(1-\alpha \omega )}Q^{-1}B^{T}A^{-1}b-\dfrac{1}{(1-\omega +\alpha \omega )(1-\alpha \omega )}Q^{-1}q \end{array} \right) \end{aligned} \end{aligned}$$

Let

$$\begin{aligned} M_{\alpha ,\beta ,\omega }= & {} \left( \omega (2-\omega )(D-\omega U)^{-1}D(D-\omega L)^{-1}\right) ^{-1}\\ {}= & {} \dfrac{1}{\omega (2-\omega )}(D-\omega L)D^{-1}(D-\omega U)\\= & {} \dfrac{1}{\omega (2-\omega )} \\&\times \left( \begin{array}{c@{\quad }c} (\beta +\omega -\beta \omega )A&{} \omega B \\ \dfrac{-\omega (\beta +\omega -\beta \omega )}{\beta }B^{T}&{} -\dfrac{\omega ^2}{\beta }B^{T}A^{-1}B+(1-\alpha \omega )(1-\omega +\alpha \omega )Q \end{array}\right) \end{aligned}$$

Then \(C_{\alpha ,\beta ,\omega }=M_{\alpha , \beta , \omega }^{-1}\left( \begin{array}{c} b \\ -q \end{array}\right) \), and it is easy to see that the TMSSOR method can also be induced by the splitting

$$\begin{aligned} \mathscr {A}=M_{\alpha ,\beta ,\omega }-(M_{\alpha ,\beta ,\omega }-\mathscr {A})=M_{\alpha ,\beta ,\omega }-N_{\alpha ,\beta ,\omega } \end{aligned}$$
(2.8)

and

$$\begin{aligned} T_{\alpha ,\beta ,\omega }=M_{\alpha ,\beta ,\omega }^{-1}N_{\alpha ,\beta ,\omega } \end{aligned}$$
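The identity \(T_{\alpha ,\beta ,\omega }=M_{\alpha ,\beta ,\omega }^{-1}N_{\alpha ,\beta ,\omega }\) can be confirmed numerically. A dense sketch, reusing the hypothetical helper `tmssor_splitting` from the previous listing:

```python
import numpy as np

def tmssor_matrices(A, B, Q, alpha, beta, omega):
    """Form M, N of the splitting (2.8) and the iteration matrix T (dense sketch)."""
    D, L, U = tmssor_splitting(A, B, Q, alpha, beta)   # helper from the sketch above
    s = omega * (2 - omega)
    # M = (1/(w(2-w))) (D - wL) D^{-1} (D - wU), as derived for C_{alpha,beta,omega}
    M = (D - omega * L) @ np.linalg.solve(D, D - omega * U) / s
    N = M - (D - L - U)
    # T = U_{alpha,beta,omega} L_{alpha,beta,omega}, built from (2.4)-(2.5)
    T = np.linalg.solve(D - omega * U, (1 - omega) * D + omega * L) \
        @ np.linalg.solve(D - omega * L, (1 - omega) * D + omega * U)
    assert np.allclose(T, np.linalg.solve(M, N))       # consistency with (2.8)
    return M, N, T
```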

Obviously, \(M_{\alpha ,\beta ,\omega }\) can be regarded as a preconditioner for the linear system (1.1). It is easy to see that the TMSSOR method (2.6) can be written in the following form:

$$\begin{aligned} \left\{ \begin{array}{l@{\quad }l} y^{(i+1)}=y^{(i)}+\dfrac{\omega (2-\omega )}{\beta (1-\alpha \omega )(1-\omega +\alpha \omega )}Q^{-1}[(\beta -\omega )B^{T}x^{(i)}-\omega B^{T}A^{-1}By^{(i)}+\omega B^{T}A^{-1}b-\beta q]\\ x^{(i+1)}=\dfrac{(1-\omega )(\beta -\omega )}{\beta +\omega -\beta \omega }x^{(i)}+\dfrac{\omega (2-\omega )}{\beta +\omega -\beta \omega }A^{-1}b-\dfrac{\omega }{\beta +\omega -\beta \omega }A^{-1}B [y^{(i+1)}+(1-\omega )y^{(i)}] \end{array}\right. \end{aligned}$$
(2.9)
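A direct implementation of the two coupled updates in (2.9) clarifies the cost per sweep. The following sketch uses dense solves in place of \(A^{-1}\) and \(Q^{-1}\); in practice these would be factorizations or inexact inner solves, and the function name is our own illustration:

```python
import numpy as np

def tmssor_step(x, y, A, B, Q, b, q, alpha, beta, omega):
    """One TMSSOR sweep, following the componentwise form (2.9)."""
    s = omega * (2 - omega)
    c1 = s / (beta * (1 - alpha * omega) * (1 - omega + alpha * omega))
    d = beta + omega - beta * omega
    Ainv_b = np.linalg.solve(A, b)
    Ainv_By = np.linalg.solve(A, B @ y)
    # y-update of (2.9)
    r = (beta - omega) * (B.T @ x) - omega * (B.T @ Ainv_By) \
        + omega * (B.T @ Ainv_b) - beta * q
    y_new = y + c1 * np.linalg.solve(Q, r)
    # x-update of (2.9)
    x_new = ((1 - omega) * (beta - omega) / d) * x + (s / d) * Ainv_b \
        - (omega / d) * np.linalg.solve(A, B @ (y_new + (1 - omega) * y))
    return x_new, y_new
```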

Evidently, when the relaxation parameters satisfy \(\alpha = \frac{1}{2}\) and \(\beta =1\), the TMSSOR method reduces to the MSSOR method [26]; this also coincides with the GMSSOR method [31] when its two parameters \(\tau \) and \(\omega \) are equal.

3 The semi-convergence of the TMSSOR method

In this section, we discuss the semi-convergence of the TMSSOR method. It is well known that if \(\mathscr {A}\) is nonsingular, the iteration (2.6) is convergent when the spectral radius of the iteration matrix is less than 1. When \(\mathscr {A}\) is singular, \(I-T_{\alpha ,\beta ,\omega }\) is singular as well. Then \(T_{\alpha ,\beta ,\omega }\) has the eigenvalue 1, which means its spectral radius cannot be less than 1. So, for the iteration matrix \(T_{\alpha ,\beta ,\omega }\) of the singular linear system (1.1), we need to introduce its pseudo-spectral radius [11] \( \nu (T_{\alpha ,\beta ,\omega })\):

$$\begin{aligned} \nu (T_{\alpha ,\beta ,\omega })=\max \{|\lambda |:\lambda \in \sigma (T_{\alpha ,\beta ,\omega }),\lambda \ne 1\} \end{aligned}$$

If the iteration (2.6) converges to a solution of the linear system (1.1), we say that the iteration (2.6) is semi-convergent. It is known that the iteration (2.6) is semi-convergent if and only if the iteration matrix \(T_{\alpha ,\beta ,\omega }\) is semi-convergent. The semi-convergence of the matrix \(T_{\alpha ,\beta ,\omega }\) can be characterized by the following two conditions [11] (a numerical check of both is sketched after the list):

  (a) The elementary divisors of the iteration matrix \(T_{\alpha ,\beta ,\omega }\) associated with its eigenvalue \(\lambda =1\) are linear, i.e., \(rank((I-T_{\alpha ,\beta ,\omega })^2)=rank(I-T_{\alpha ,\beta ,\omega })\), or equivalently, \(index(I-T_{\alpha ,\beta ,\omega })=1\), where index(\(\cdot \)) denotes the index of the corresponding matrix;

  (b) \(\nu (T_{\alpha ,\beta ,\omega })<1 \).
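Both conditions can be tested numerically for a small dense iteration matrix. A minimal sketch, assuming T has been assembled explicitly (rank and eigenvalue tests use a tolerance, so this is a diagnostic rather than a proof):

```python
import numpy as np

def is_semiconvergent(T, tol=1e-10):
    """Numerically check conditions (a) and (b) for a dense iteration matrix T."""
    M = np.eye(T.shape[0]) - T
    # (a) index(I - T) = 1, i.e. rank((I - T)^2) = rank(I - T)
    cond_a = np.linalg.matrix_rank(M @ M, tol) == np.linalg.matrix_rank(M, tol)
    # (b) pseudo-spectral radius: largest |lambda| over eigenvalues != 1
    lam = np.linalg.eigvals(T)
    nu = max((abs(l) for l in lam if abs(l - 1) > tol), default=0.0)
    return cond_a and nu < 1
```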

Theorem 3.1

Let \(T_{\alpha ,\beta ,\omega }\) be the iteration matrix of the TMSSOR method with \(\omega \ne 0, 2\). For any eigenvalue \( \mu \) of \(Q^{-1}B^{T}A^{-1}B\), if \(\lambda \) satisfies

$$\begin{aligned} (\lambda -1)(1-\omega +\alpha \omega )(1-\alpha \omega )[(\beta -\omega )(1-\omega )-\lambda (\beta +\omega -\omega \beta )]=\lambda \omega ^2(2-\omega )^2 \mu \end{aligned}$$
(3.1)

then \( \lambda \) is an eigenvalue of \(T_{\alpha ,\beta ,\omega }\). Conversely, if \(\lambda \) is an eigenvalue of \(T_{\alpha ,\beta ,\omega }\) and \(\mu \) satisfies Eq. (3.1), then \( \mu \) is an eigenvalue of \(Q^{-1}B^{T}A^{-1}B\).

Proof

Let \(\lambda \) be an eigenvalue of \(T_{\alpha ,\beta ,\omega }\); without loss of generality, assume that \(\lambda \ne 0\), and let \(z=(x^T,y^T)^T\) be the corresponding eigenvector. By

$$\begin{aligned}T_{\alpha ,\beta ,\omega }\left( \begin{array}{c} x \\ y \end{array} \right) =\lambda \left( \begin{array}{c} x\\ y \end{array} \right) \end{aligned}$$

we have

$$\begin{aligned} \begin{aligned} (1-\lambda )(D-\omega U)z&= \omega (2-\omega )D(D-\omega L)^{-1}\left( \begin{array}{c@{\quad }c} A&{} B\\ -B^{T}&{} \mathbf {0} \end{array} \right) z \end{aligned} \end{aligned}$$

Then

$$\begin{aligned}&(1-\lambda )\left( \begin{array}{c@{\quad }c} (\beta +\omega -\beta \omega )A&{}\omega B\\ 0&{}(1-\omega +\alpha \omega )Q \end{array} \right) z\\&\quad =\omega (2-\omega )\left( \begin{array}{c@{\quad }c} A&{} B\\ \dfrac{(\omega -\beta )}{\beta (1-\alpha \omega )}B^{T}&{} \dfrac{\omega }{\beta (1-\alpha \omega )}B^{T}A^{-1}B \end{array} \right) z \end{aligned}$$

That is, it holds

$$\begin{aligned}&\left[ (1-\lambda )(\beta +\omega -\beta \omega )-\omega (2-\omega )\right] x=\omega (\lambda +1-\omega ) A^{-1}By \end{aligned}$$
(3.2)
$$\begin{aligned}&(1-\lambda )(1-\omega +\alpha \omega )(1-\alpha \omega )\beta y -\omega ^2(2-\omega )Q^{-1}B^{T}A^{-1}By \nonumber \\&\quad =\omega (2-\omega )(\omega -\beta )Q^{-1}B^{T}x \end{aligned}$$
(3.3)

If \((1-\lambda )(\beta +\omega -\beta \omega )-\omega (2-\omega )=0\), i.e., \(\lambda =\dfrac{(1-\omega )(\beta -\omega )}{\beta +\omega -\beta \omega }\), then \(\mu =0\) is an eigenvalue of \(Q^{-1}B^{T}A^{-1}B\) satisfying (3.1); indeed, since B is rank-deficient, \(B^{T}A^{-1}B\) is singular and hence 0 is an eigenvalue of \(Q^{-1}B^{T}A^{-1}B\). If \(\lambda \ne \dfrac{(1-\omega )(\beta -\omega )}{\beta +\omega -\beta \omega }\), then it is easy to see \(y\ne 0\). Substituting x from (3.2) into Eq. (3.3), we obtain

$$\begin{aligned}&[(1-\lambda )^2(\beta +\omega -\beta \omega )-\omega (2-\omega )(1-\lambda )](1-\omega +\alpha \omega )(1-\alpha \omega ) \\&\quad y =-\lambda \omega ^2(2-\omega )^2Q^{-1}B^{T}A^{-1}By \end{aligned}$$

and

$$\begin{aligned} Q^{-1}B^{T}A^{-1}By=\dfrac{[(1-\lambda )^2(\beta +\omega -\beta \omega )-\omega (2-\omega )(1-\lambda )](1-\omega +\alpha \omega )(1-\alpha \omega )}{-\lambda \omega ^2(2-\omega )^2}y \end{aligned}$$

For \(\lambda \ne 0\) and \(\omega \ne 0, 2\), by the above equation it is easy to see that \(\mu \) is an eigenvalue of \(Q^{-1}B^{T}A^{-1}B\) which satisfies (3.1).

Conversely, for any eigenvalue \(\mu \) of \(Q^{-1}B^{T}A^{-1}B\), if \(\lambda \) satisfies (3.1), then one can show in a similar way that \(\lambda \) is an eigenvalue of \(T_{\alpha ,\beta ,\omega }\); the details are omitted. \(\square \)

Remark 3.1

From Theorem 3.1, we know that \(\dfrac{(1-\omega )(\beta -\omega )}{\beta +\omega -\beta \omega }\) is an eigenvalue of \(T_{\alpha ,\beta ,\omega }\). In fact, if \(\lambda =\dfrac{(1-\omega )(\beta -\omega )}{\beta +\omega -\beta \omega }\ne 0\), then Eq. (3.2) is satisfied by taking \(y=0\). Moreover, for this \(\lambda \) we have

$$\begin{aligned} \quad \quad \frac{(1-\omega +\alpha \omega )(1-\alpha \omega )\beta }{\beta +\omega -\beta \omega }y -\omega Q^{-1}B^{T}A^{-1}By=(\omega -\beta )Q^{-1}B^{T}x \end{aligned}$$

Since B is rank-deficient, the equation \(B^{T}x=0\) has nonzero solutions, and with \(y=0\) the above equation is satisfied by any such x. Therefore, \(\dfrac{(1-\omega )(\beta -\omega )}{\beta +\omega -\beta \omega }\) is an eigenvalue of \(T_{\alpha ,\beta ,\omega }\) with the corresponding eigenvector \((x^T,0^T)^T\).

To obtain the semi-convergence conditions of the TMSSOR method, we introduce some lemmas which will be useful for our discussion.

Lemma 3.1

[9, 10] Both roots of the quadratic equation \(x^{2}-px+q=0\) are less than one in modulus if and only if \(|p-\overline{p}q|<1-|q|^2\), where \(\overline{p}\) is the complex conjugate of p.
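Lemma 3.1 is easy to illustrate numerically: for an arbitrarily chosen complex pair (p, q), the criterion agrees with directly computed roots (a sketch, not part of the original proof):

```python
import numpy as np

# Both roots of x^2 - p x + q = 0 lie strictly inside the unit circle
# exactly when |p - conj(p) q| < 1 - |q|^2 (Lemma 3.1).
p, q = 0.4 + 0.3j, 0.2 - 0.1j
criterion = abs(p - np.conj(p) * q) < 1 - abs(q) ** 2
roots = np.roots([1, -p, q])
assert criterion == all(abs(r) < 1 for r in roots)
```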

Lemma 3.2

[31] Let A be positive real, Q be symmetric positive definite and B be rank-deficient. Then for any nonzero eigenvalue \(\mu =\mu _{Re}+\mathbf{i} \mu _{Im}\) of \(Q^{-1}B^TA^{-1}B\), where \(\mu _{Re}\) and \(\mu _{Im}\) are the real part and imaginary part of \(\mu \), respectively, and \(\mathbf{i}=\sqrt{-1} \), it holds \(\mu _{Re}>0\).

Theorem 3.2

Let A be positive real, Q be symmetric positive definite and B be rank-deficient. Then \(\nu (T_{\alpha ,\beta ,\omega })<1\) if the following conditions hold:

  • (C1) For \(\omega \in (0,1) \cup (2, +\infty )\), it holds \(\dfrac{\omega ^{2}}{2(\omega -1 )}<\beta \) ; for \(\omega \in (-\infty , 0)\cup (1, 2) \), it holds \(\beta <\dfrac{\omega ^{2} }{2(\omega -1 )}\);

  • (C2) \(\omega (2-\omega )(1-\alpha \omega )(1-\omega +\alpha \omega )>0\);

  • (C3) \(\dfrac{\omega (2-\omega )}{\omega ^2-2\omega \beta +2\beta }\mu _{Re}^{2}\,+\,\dfrac{\omega ^2-2\omega \beta +2\beta }{\omega (2-\omega )}\mu _{Im}^{2}<\dfrac{2(1-\alpha \omega )(1-\omega +\alpha \omega )}{\omega (2-\omega )}\mu _{Re} \).

Proof

Making use of Eq. (3.1) and by some algebra, we have

$$\begin{aligned}&\lambda ^2-\left[ 2-\frac{\omega (2-\omega )}{\beta +\omega -\beta \omega }-\frac{\omega ^2(2-\omega )^2\mu }{(1-\alpha \omega )(1-\omega +\alpha \omega )(\beta +\omega -\beta \omega )}\right] \nonumber \\&\quad \times \,\lambda +1-\frac{\omega (2-\omega )}{\beta +\omega -\beta \omega }=0 \end{aligned}$$
(3.4)

From the above equation, we observe that \(\lambda =1\) or \(\lambda =\dfrac{(1-\omega )(\beta -\omega )}{\beta +\omega -\beta \omega } \) in the case of \(\mu =0\), and \(\lambda \ne 1\) in the case of \(\mu \ne 0\).

Case 1  When \(\lambda =\dfrac{(1-\omega )(\beta -\omega )}{\beta +\omega -\beta \omega }\), it holds

$$\begin{aligned} |\lambda |<1&\Leftrightarrow \left| 1-\frac{\omega (2-\omega )}{\beta +\omega -\beta \omega }\right| <1\\&\Leftrightarrow 0<\frac{\omega (2-\omega )}{\beta +\omega -\beta \omega }<2 \end{aligned}$$

Notice when \(\omega =1 \), the above inequality reduces to \(0<1<2\), which holds true trivially. Excluding \(\omega =1 \), the above inequality is equivalent to

$$\begin{aligned} \left\{ \begin{array}{l} \omega (2-\omega )>0\\ \beta +\omega -\beta \omega >0\\ \omega (2-\omega )<2(\beta +\omega -\beta \omega ) \end{array}\right. \quad or\quad \quad \left\{ \begin{array}{l} \omega (2-\omega )<0\\ \beta +\omega -\beta \omega <0\\ \omega (2-\omega )>2(\beta +\omega -\beta \omega )\end{array}\right. \end{aligned}$$
(3.5)

By some algebra we have

$$\begin{aligned} \left\{ \begin{array}{l@{\quad }l} 0<\omega <1, &{}\quad \beta > \dfrac{\omega ^2 }{2(\omega -1 )}\\ 1<\omega <2, &{}\quad \beta <\dfrac{\omega ^2 }{2(\omega -1 )}\\ \end{array}\right. \quad or\quad \quad \left\{ \begin{array}{l@{\quad }l} \omega < 0, &{}\quad \beta <\dfrac{\omega ^2 }{2(\omega -1 )}\\ \omega >2, &{}\quad \beta >\dfrac{\omega ^2 }{2(\omega -1 )}\end{array}\right. \end{aligned}$$
(3.6)

i.e.,

$$\begin{aligned} \left\{ \begin{array}{l} \omega \in (0, 1) \cup (2, +\infty )\\ \beta > \dfrac{\omega ^2 }{2(\omega -1 )} \end{array}\right. \quad or\quad \quad \left\{ \begin{array}{l} \omega \in (-\infty , 0)\cup (1, 2)\\ \beta < \dfrac{\omega ^2 }{2(\omega -1 )}\end{array}\right. \end{aligned}$$
(3.7)

which means the condition (C1) holds.

Case 2 When \(\lambda \notin \left\{ \dfrac{(1-\omega )(\beta -\omega )}{\beta +\omega -\beta \omega }, 1\right\} \), denote \(d=\dfrac{\omega (2-\omega )}{\beta +\omega -\beta \omega }\) and \(e=\dfrac{\omega (2-\omega )}{(1-\alpha \omega )(1-\omega +\alpha \omega )}\). Then, by Lemma 3.1, for any nonzero eigenvalue \(\lambda \) of \(T_{\alpha ,\beta ,\omega }\) it holds

$$\begin{aligned}&\quad |\lambda |<1\Leftrightarrow \left\{ \begin{array}{l} \left| 1-d\right| <1 \\ \left|2-d-de\mu -[2-d-de\overline{\mu }][1-d]\right| +\left| 1-d\right| ^2<1 \end{array}\right. \end{aligned}$$
(3.8)

The first inequality of (3.8) is equivalent to \(0<d<2\); as in Case 1, this immediately yields the condition (C1).

After some algebra, the second inequality of (3.8) is equivalent to

$$\begin{aligned}&\left| d\left[ 2-d+e\big [(1-d)\overline{\mu }-\mu \big ]\right] \right| <1-\left| 1-d\right| ^2 \Leftrightarrow \left| d\left[ 2-d+e\big [(1-d)\overline{\mu }-\mu \big ] \right] \right| \\&\quad <d\big (2-d\big ) \end{aligned}$$

Notice \(0<d<2\), \(\mu =\mu _{Re}+\mathbf{i}\mu _{Im}\). Then the above inequality is equivalent to

$$\begin{aligned} \left| 2-d+e[(1-d)\overline{\mu }-\mu ]\right| <2-d&\Leftrightarrow \left| 2-d-e[d\mu _{Re}+\mathbf{i} \big (2-d\big )\mu _{Im}]\right| <2-d \\&\Leftrightarrow [2-d-ed\mu _{Re}]^2+[e\left( 2-d\right) \mu _{Im}]^2 \\&<\left( 2-d\right) ^2 \end{aligned}$$

which is also equivalent to

$$\begin{aligned} \left[ de\mu _{Re}\right] ^2+\left[ \left( 2-d\right) e\mu _{Im}\right] ^2<2d\left( 2-d\right) e\mu _{Re} \end{aligned}$$
(3.9)

By Lemma 3.2, we have \(\mu _{Re}>0\); together with \(0<d<2\), inequality (3.9) can hold only if \(e=\dfrac{\omega (2-\omega )}{(1-\alpha \omega )(1-\omega +\alpha \omega )}>0\), which is equivalent to the condition (C2).

After some algebra, the inequality (3.9) can be simplified as

$$\begin{aligned} \dfrac{d}{2-d}\mu _{Re}^2+\dfrac{2-d}{d}\mu _{Im}^2 <2\dfrac{1}{e}\mu _{Re} \end{aligned}$$

i.e.,

$$\begin{aligned} \dfrac{\omega (2-\omega )}{\omega ^2-2\omega \beta +2\beta }\mu _{Re}^2+\dfrac{\omega ^2-2\omega \beta +2\beta }{\omega (2-\omega )}\mu _{Im}^2 <2\dfrac{(1-\alpha \omega )(1-\omega +\alpha \omega )}{\omega (2-\omega )}\mu _{Re} \end{aligned}$$

which is the condition (C3). \(\square \)
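Conditions (C1)–(C3) are straightforward to check for a concrete parameter triple once the nonzero eigenvalues of \(Q^{-1}B^TA^{-1}B\) are available. A hedged sketch (the function name is ours; the \(\omega =1\) branch follows the remark in Case 1 of the proof):

```python
import numpy as np

def satisfies_semiconvergence_conditions(alpha, beta, omega, mus):
    """Check (C1)-(C3) of Theorem 3.2 for the nonzero eigenvalues `mus`
    of Q^{-1} B^T A^{-1} B."""
    w = omega
    if w in (0.0, 2.0):
        raise ValueError("Theorem 3.2 assumes omega != 0, 2")
    s = w * (2 - w)
    # (C1); at w = 1 the underlying inequality 0 < d < 2 holds trivially
    if w == 1:
        c1 = True
    elif 0 < w < 1 or w > 2:
        c1 = beta > w ** 2 / (2 * (w - 1))
    else:  # w < 0 or 1 < w < 2
        c1 = beta < w ** 2 / (2 * (w - 1))
    # (C2)
    c2 = s * (1 - alpha * w) * (1 - w + alpha * w) > 0
    # (C3), required for every nonzero eigenvalue mu
    t = w ** 2 - 2 * w * beta + 2 * beta
    c3 = all(
        (s / t) * mu.real ** 2 + (t / s) * mu.imag ** 2
        < 2 * (1 - alpha * w) * (1 - w + alpha * w) / s * mu.real
        for mu in np.atleast_1d(np.asarray(mus, dtype=complex))
    )
    return c1 and c2 and c3
```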

Lemma 3.3

[29] \(index(I-T_{\alpha ,\beta ,\omega })=1\) if and only if, for any \(\mathbf {0}\ne Y\in \mathscr {R}(\mathscr {A})\), \(Y\notin \mathscr {N}(\mathscr {A}M_{\alpha , \beta , \omega }^{-1})\).

Theorem 3.3

Let A be positive real, Q be symmetric positive definite, and B be rank-deficient. Then \(index(I-T_{\alpha ,\beta ,\omega })=1\).

Proof

Let

$$\begin{aligned} \mathbf {0}\ne Y=\mathscr {A}X= \left( \begin{array}{c@{\quad }c} A&{} B\\ -B^{T}&{} \mathbf {0} \end{array} \right) \left( \begin{array}{c} z_1 \\ z_2 \end{array} \right) =\left( \begin{array}{c} Az_1+Bz_2\\ -B^{T}z_1 \end{array} \right) \end{aligned}$$
(3.10)

By Lemma 3.3, to prove \(index(I-T_{\alpha ,\beta ,\omega })=1\), it suffices to prove \(\mathscr {A}M_{\alpha , \beta , \omega }^{-1}Y\ne \mathbf {0}\). We argue by contradiction: suppose \(\mathscr {A}M_{\alpha , \beta , \omega }^{-1}Y=\mathbf {0}\). Notice that \(\mathscr {N}(\mathscr {A})=span\left\{ \left( \begin{array}{c} \mathbf {0}\\ \varphi \end{array} \right) \right\} \), where \(B\varphi =\mathbf {0}\).

Then there exists a vector \(\varphi _0\) such that

$$\begin{aligned} M_{\alpha , \beta , \omega }^{-1}Y= \left( \begin{array}{c} \mathbf {0}\\ \varphi _0 \end{array} \right) ,\quad B\varphi _0=\mathbf {0}. \end{aligned}$$

So

$$\begin{aligned} \begin{aligned} Y&=M_{\alpha , \beta , \omega }\left( \begin{array}{c} \mathbf {0} \\ \varphi _0 \end{array}\right) \\&= \dfrac{1}{\omega (2-\omega )}\left( \begin{array}{c@{\quad }c} (\beta +\omega -\beta \omega )A&{} \omega B \\ -\dfrac{\omega (\beta +\omega -\beta \omega )}{\beta }B^{T}&{} -\dfrac{\omega ^2}{\beta }B^{T}A^{-1}B+(1-\alpha \omega )(1-\omega +\alpha \omega )Q \end{array}\right) \\&\quad \left( \begin{array}{c} \mathbf {0}\\ \varphi _0 \end{array}\right) \\ {}&=\dfrac{1}{\omega (2-\omega )}\left( \begin{array}{c} \mathbf {0}\\ (1-\alpha \omega )(1-\omega +\alpha \omega )Q\varphi _0 \end{array}\right) \end{aligned} \end{aligned}$$
(3.11)

Making use of Eqs. (3.10) and (3.11), we see that \(\varphi _0\) is a nonzero scalar multiple of \(-Q^{-1}B^{T}z_1\), so \(B\varphi _0=\mathbf {0}\) yields \(z_1^TBQ^{-1}B^{T}z_1=0\). Noticing that \(Q^{-1}\) is symmetric positive definite, we obtain \(B^{T}z_1=\mathbf {0}\), which means \(\varphi _0=\mathbf {0}\). Hence,

$$\begin{aligned} Y=M_{\alpha , \beta , \omega }\left( \begin{array}{c} \mathbf {0} \\ \varphi _0 \end{array}\right) =\mathbf {0} \end{aligned}$$

which contradicts \(Y\ne \mathbf {0}\). So we conclude that \(index(I-T_{\alpha ,\beta ,\omega })=1\).

The proof is completed. \(\square \)

4 The optimal iteration parameters

In this section, we will discuss the optimal iteration parameters and the corresponding optimal semi-convergence factor for the TMSSOR method. Let \(A\in R^{m\times m}\) and \( Q\in R^{n\times n}\) be symmetric positive definite. Then all the eigenvalues of the matrix \(Q^{-1}B^TA^{-1}B\) are real and nonnegative. Denote the smallest and largest nonzero eigenvalues of the matrix \(Q^{-1}B^TA^{-1}B\) by \(\mu _{min}\) and \(\mu _{max}\), respectively. By the proof of Theorem 3.2, we see that the eigenvalues \(\lambda \) of \(T_{\alpha , \beta , \omega }\) can be \(\lambda =1\), \(\lambda =\dfrac{(1-\omega )(\beta -\omega )}{\beta +\omega -\beta \omega }\) or \(\lambda \) can be represented by the following \(\lambda _{1}, \lambda _{2}\):

$$\begin{aligned} \lambda _{1}(\alpha ,\beta ,\omega ,\mu )= & {} \frac{1}{2}\left( f(\alpha ,\beta ,\omega ,\mu )+\sqrt{f(\alpha ,\beta ,\omega ,\mu )^2-4g(\beta ,\omega )}\right) \\ \lambda _{2}(\alpha ,\beta ,\omega ,\mu )= & {} \frac{1}{2}\left( f(\alpha ,\beta ,\omega ,\mu )-\sqrt{f(\alpha ,\beta ,\omega ,\mu )^2-4g(\beta ,\omega )}\right) \end{aligned}$$

where

$$\begin{aligned} f(\alpha ,\beta ,\omega ,\mu )= & {} 2-\dfrac{\omega (2-\omega )}{\beta +\omega -\beta \omega }-\dfrac{\omega ^2(2-\omega )^2\mu }{(1-\alpha \omega )(1-\omega +\alpha \omega )(\beta +\omega -\beta \omega )} \\ g(\beta ,\omega )= & {} 1-\dfrac{\omega (2-\omega )}{\beta +\omega -\beta \omega } \end{aligned}$$

Consider the following two cases:

  1. If \({\varDelta }=f(\alpha ,\beta ,\omega ,\mu )^2-4g(\beta ,\omega )\le 0\), then

    $$\begin{aligned} |\lambda _{1}(\alpha ,\beta ,\omega ,\mu )|=|\lambda _{2}(\alpha ,\beta ,\omega ,\mu )|=\sqrt{1-\dfrac{\omega (2-\omega )}{\beta +\omega - \beta \omega }}=\sqrt{\dfrac{(1-\omega )(\beta -\omega )}{\beta +\omega -\beta \omega }} \end{aligned}$$
    (4.1)
  2. If \({\varDelta }>0\), then both \(\lambda _{1}(\alpha ,\beta ,\omega ,\mu )\) and \(\lambda _{2}(\alpha ,\beta ,\omega ,\mu )\) are real, and it holds

    $$\begin{aligned} \lambda (\alpha ,\beta ,\omega ,\mu )=\left\{ \begin{array}{l@{\quad }l} \lambda _{1}(\alpha ,\beta ,\omega ,\mu ) &{}\quad if ~f(\alpha ,\beta ,\omega ,\mu )>0 \\ -\lambda _{2}(\alpha ,\beta ,\omega ,\mu ) &{}\quad if ~f(\alpha ,\beta ,\omega ,\mu )\le 0 \end{array}\right. \end{aligned}$$

where \(\lambda (\alpha ,\beta ,\omega ,\mu )=\max \left\{ |\lambda _{1}(\alpha ,\beta ,\omega ,\mu )|,|\lambda _{2}(\alpha ,\beta ,\omega ,\mu )|\right\} \). From the quadratic equation (3.4), the product of the two roots satisfies

\(\lambda _{1}(\alpha ,\beta ,\omega ,\mu )\lambda _{2}(\alpha ,\beta ,\omega ,\mu )=1-\dfrac{\omega (2-\omega )}{\beta +\omega -\beta \omega }=\dfrac{(1-\omega )(\beta -\omega )}{\beta +\omega -\beta \omega }\)

So, it is easy to see

$$\begin{aligned} \lambda (\alpha ,\beta ,\omega ,\mu )\ge \sqrt{\left| 1-\dfrac{\omega (2-\omega )}{\beta +\omega -\beta \omega }\right| }=\sqrt{\left| \dfrac{(1-\omega )(\beta -\omega )}{\beta +\omega -\beta \omega }\right| } \end{aligned}$$
(4.2)

Hence, the pseudo-spectral radius of the TMSSOR iteration matrix can be defined by:

\(\nu (T_{\alpha ,\beta ,\omega })=\max \limits _{0\ne \mu \in \sigma (Q^{-1}B^TA^{-1}B) }\{\lambda (\alpha ,\beta ,\omega ,\mu )\}\)

Denote

\(\lambda _{i}(\alpha ,\beta ,\omega )=\max \limits _{0\ne \mu \in \sigma (Q^{-1}B^TA^{-1}B) }\{|\lambda _{i}(\alpha ,\beta ,\omega ,\mu )|\}, \quad i=1, 2\)

Obviously, it holds

$$\begin{aligned} \nu (T_{\alpha ,\beta ,\omega })=\max \left\{ \lambda _{1}(\alpha ,\beta ,\omega ),\lambda _{2}(\alpha ,\beta ,\omega )\right\} \end{aligned}$$
(4.3)

Under the conditions of Theorem 3.2, it follows that \(0<\dfrac{\omega (2-\omega )}{\beta +\omega -\beta \omega }<2\) and \(\dfrac{\omega (2-\omega )}{(1-\alpha \omega )(1-\omega +\alpha \omega )}>0\), which means

$$\begin{aligned} \dfrac{\omega ^2(2-\omega )^2}{(1-\alpha \omega )(1-\omega +\alpha \omega )(\beta +\omega -\beta \omega )}>0 \end{aligned}$$
(4.4)

From the above equations, it holds \(|\lambda _{1}(\alpha ,\beta ,\omega )|\ge |\lambda _{2}(\alpha ,\beta ,\omega )|\) when \({\varDelta }>0\) and \(f(\alpha ,\beta ,\omega ,\mu )>0\), and \(|\lambda _{2}(\alpha ,\beta ,\omega )|\ge |\lambda _{1}(\alpha ,\beta ,\omega )|\) when \({\varDelta }>0\) and \(f(\alpha ,\beta ,\omega ,\mu )\le 0\). So, we have

$$\begin{aligned} {\left\{ \begin{array}{ll} \lambda _{1}(\alpha ,\beta ,\omega )=\frac{1}{2}\left[ f(\alpha ,\beta ,\omega ,\mu _{min})+\sqrt{f(\alpha ,\beta ,\omega ,\mu _{min})^2-4g(\beta ,\omega )}\right] \\ \lambda _{2}(\alpha ,\beta ,\omega )=\frac{1}{2}\left[ -f(\alpha ,\beta ,\omega ,\mu _{max})+\sqrt{f(\alpha ,\beta ,\omega ,\mu _{max})^2-4g(\beta ,\omega )}\right] \end{array}\right. } \end{aligned}$$
(4.5)

Now we consider the optimal parameters which minimize the pseudo-spectral radius of the iteration matrix \(T_{\alpha ,\beta ,\omega }\). Generally, it is very difficult to determine the global optimal parameters of the TMSSOR method. So, here, we consider a special case and discuss the local optimal parameters; that is, we assume the three parameters satisfy

$$\begin{aligned} \alpha =\dfrac{\omega (2-\omega )}{\beta +\omega -\beta \omega }\in (0,1) \end{aligned}$$
(4.6)

Remark 4.1

By Eqs. (4.1) and (4.2), it is easy to see, to investigate the optimal parameters which minimize \(\nu (T_{\alpha ,\beta ,\omega })\), it suffices to consider the case of \({\varDelta }=f(\alpha ,\beta ,\omega ,\mu )^2-4g(\beta ,\omega )> 0\). So, from now on, we always assume \({\varDelta }> 0\).

Notice

$$\begin{aligned} \dfrac{\omega \alpha (2-\omega )}{(1-\alpha \omega )(1-\omega +\alpha \omega )}>0 \end{aligned}$$

Then it is easy to see there exist two variables \(\mu _{1}\) and \(\mu _{2}\) satisfying the following equations:

$$\begin{aligned}&2-\alpha -\dfrac{\omega \alpha (2-\omega )\mu _{1}}{(1-\alpha \omega )(1-\omega +\alpha \omega )}=2\sqrt{1-\alpha } \end{aligned}$$
(4.7)
$$\begin{aligned}&\quad 2-\alpha -\dfrac{\omega \alpha (2-\omega )\mu _{2}}{(1-\alpha \omega )(1-\omega +\alpha \omega )}=-2\sqrt{1-\alpha } \end{aligned}$$
(4.8)

where \(0\le \mu _{1}\le \mu _{2}\); furthermore, by (4.5), it is easy to see \(\mu _{1}, \mu _{2} \in [\mu _{min},\mu _{max}]\). Indeed, if \(\mu _{1}<\mu _{min} \) or \(\mu _{max}<\mu _{2}\), then \(-2\sqrt{1-\alpha }<f(\alpha ,\beta ,\omega ,\mu )<2\sqrt{1-\alpha }\), which leads to \({\varDelta }<0\), contradicting \({\varDelta }>0\).

From (4.7) and (4.8), by some algebra, it holds

$$\begin{aligned}&1-\alpha = \dfrac{(\sqrt{\mu _{1}}-\sqrt{\mu _{2}})^2}{(\sqrt{\mu _{1}}+\sqrt{\mu _{2}})^2} \end{aligned}$$
(4.9)
$$\begin{aligned}&\dfrac{\omega (2-\omega )}{(1-\alpha \omega )(1-\omega +\alpha \omega )} = \dfrac{1}{\sqrt{\mu _{1}\mu _{2}}} \end{aligned}$$
(4.10)

So, \(f(\alpha ,\beta ,\omega ,\mu )\) can be rewritten as

$$\begin{aligned} f(\alpha ,\beta ,\omega ,\mu )=\dfrac{2(\mu _1+\mu _2-2\mu )}{(\sqrt{\mu _1}+\sqrt{\mu _2})^2} \end{aligned}$$
(4.11)

Substituting \(f(\alpha ,\beta ,\omega ,\mu )\) to (4.5), therefore, we have

$$\begin{aligned} \left\{ \begin{array}{l} \lambda _{1}(\alpha ,\beta ,\omega )=\dfrac{(\sqrt{\mu _{2}-\mu _{min}}+\sqrt{\mu _{1}-\mu _{min}})^2}{(\sqrt{\mu _{1}}+\sqrt{\mu _{2}})^2} \\ \lambda _{2}(\alpha ,\beta ,\omega )=\dfrac{(\sqrt{\mu _{max}-\mu _{1}}+\sqrt{\mu _{max}-\mu _{2}})^2}{(\sqrt{\mu _{1}}+\sqrt{\mu _{2}})^2} \end{array}\right. \end{aligned}$$
(4.12)

For convenience, let \(\lambda _{1}(\alpha ,\beta ,\omega )=\lambda _{1}(\mu _{1},\mu _{2})\), and \(\lambda _{2}(\alpha ,\beta ,\omega )=\lambda _{2}(\mu _{1},\mu _{2})\). It is easy to see that the following results hold true.

$$\begin{aligned} \left\{ \begin{aligned}\;&\lambda _{1}(\mu _{1},\mu _{2})=\lambda _{2}(\mu _{1},\mu _{2})\quad \mathrm{if }\quad \mu _{1}+\mu _{2}=\mu _{max}+\mu _{min},\\&\lambda _{1}(\mu _{1},\mu _{2})>\lambda _{2}(\mu _{1},\mu _{2})\quad \mathrm{if }\quad \mu _{1}+\mu _{2}>\mu _{max}+\mu _{min},\\&\lambda _{1}(\mu _{1},\mu _{2})<\lambda _{2}(\mu _{1},\mu _{2})\quad \mathrm{if }\quad \mu _{1}+\mu _{2}<\mu _{max}+\mu _{min}. \end{aligned} \right. \end{aligned}$$
(4.13)

Then by Eqs. (4.3) and (4.13), it holds

$$\begin{aligned} \nu (T_{\alpha ,\beta ,\omega })=\left\{ \begin{array}{l@{\quad }l} \lambda _{1}(\mu _1,\mu _2) &{}\quad if ~\mu _{1}+\mu _{2} \ge \mu _{max}+\mu _{min} \\ \lambda _{2}(\mu _1,\mu _2) &{}\quad if ~\mu _{1}+\mu _{2} <\mu _{max}+\mu _{min} \end{array}\right. \end{aligned}$$
(4.14)

By some algebra, it holds that

$$\begin{aligned} \left\{ \begin{array}{l} \dfrac{\partial { \lambda _1(\mu _1, \mu _2)}}{\partial {\mu _1}}=\dfrac{\sqrt{\mu _2-\mu _{min}}+\sqrt{\mu _1-\mu _{min}}}{\sqrt{\mu _1}+\sqrt{\mu _2}}\cdot \dfrac{\sqrt{\mu _1 \mu _2}+\mu _{min}-\sqrt{(\mu _1-\mu _{min})(\mu _2-\mu _{min})}}{\sqrt{\mu _1(\mu _1-\mu _{min})}(\sqrt{\mu _1}+\sqrt{\mu _2})^2} \\ \dfrac{\partial { \lambda _2(\mu _1, \mu _2)}}{\partial {\mu _1}}=-\dfrac{\sqrt{\mu _{max}-\mu _1}+\sqrt{\mu _{max}-\mu _2}}{\sqrt{\mu _1}+\sqrt{\mu _2}}\cdot \dfrac{\sqrt{\mu _1 \mu _2}+\mu _{max}-\sqrt{(\mu _{max}-\mu _1)(\mu _{max}-\mu _2)}}{\sqrt{\mu _1(\mu _{max}-\mu _1)}(\sqrt{\mu _1}+\sqrt{\mu _2})^2} \end{array}\right. \end{aligned}$$

Hence, we have

$$\begin{aligned} \frac{ \partial { \lambda _1(\mu _1, \mu _2)}}{\partial {\mu _1}}>0, \quad \frac{\partial {\lambda _2(\mu _1, \mu _2)}}{\partial {\mu _1}}<0 \end{aligned}$$

In the same way, we obtain

$$\begin{aligned} \frac{\partial { \lambda _1(\mu _1, \mu _2)}}{\partial {\mu _2}}>0, \quad \frac{\partial {\lambda _2(\mu _1, \mu _2)}}{\partial {\mu _2}}<0 \end{aligned}$$

We now claim that \(\nu (T_{\alpha ,\beta ,\omega })\) attains no minimum when \(\mu _1+\mu _2 \ne \mu _{min}+\mu _{max}\). Consider the following two cases:

  1. Assume \(\nu (T_{\alpha ,\beta ,\omega })\) attains its minimum at a point with \(\mu _1+\mu _2 <\mu _{min}+\mu _{max}\). By (4.14), it holds that \(\nu (T_{\alpha ,\beta ,\omega })=\lambda _2(\mu _1, \mu _2)\). Let \(\overline{\mu }=\mu _{min}+\mu _{max}-\mu _2\). Then \(\mu _1<\overline{\mu }\). By the monotonicity of the function \(\lambda _2(\mu _1, \mu _2)\), we have \(\lambda _2(\mu _1, \mu _2)>\lambda _2(\overline{\mu }, \mu _2)\), which contradicts the assumption.

  2. Assume \(\nu (T_{\alpha ,\beta ,\omega })\) attains its minimum at a point with \(\mu _1+\mu _2 >\mu _{min}+\mu _{max}\). By (4.14), it holds that \(\nu (T_{\alpha ,\beta ,\omega })=\lambda _1(\mu _1, \mu _2)\). Let \(\overline{\mu }=\mu _{min}+\mu _{max}-\mu _2\). Then \(\mu _1>\overline{\mu }\). By the monotonicity of the function \(\lambda _1(\mu _1, \mu _2)\), we have \(\lambda _1(\mu _1, \mu _2)>\lambda _1(\overline{\mu }, \mu _2)\), which also contradicts the assumption.

So \(\nu (T_{\alpha ,\beta ,\omega })\) can attain its minimum only when \(\mu _1+\mu _2 = \mu _{min}+\mu _{max}\).

Now we give the local optimal parameters and the corresponding optimal semi-convergence factor of the TMSSOR method as follows.

Theorem 4.1

Let A and Q be symmetric positive definite, and B be rank-deficient. Assume Eq. (4.6) is satisfied. Then the optimal parameters of the TMSSOR method are given by

$$\begin{aligned} \alpha _{opt}= & {} \dfrac{4\sqrt{\mu _{max}\mu _{min}}}{(\sqrt{\mu _{max}}+\sqrt{\mu _{min}})^2}, \nonumber \\ \beta _{opt}= & {} \dfrac{(\sqrt{\mu _{max}}+\sqrt{\mu _{min}})^2\omega _{opt}(2-\omega _{opt})-4\omega _{opt}\sqrt{\mu _{max}\mu _{min}}}{4\sqrt{\mu _{max}\mu _{min}}(1-\omega _{opt})} \end{aligned}$$
(4.15)

and \(\omega _{opt}\) is the root of the equation

$$\begin{aligned} \sqrt{\mu _{max}\mu _{min}}(a_{1}+4a_{2}-16\sqrt{\mu _{min} \mu _{max}})\omega ^2-(1+2\sqrt{\mu _{max}\mu _{min}})a_{1}\omega +a_{1}=0 \end{aligned}$$

where

$$\begin{aligned} a_{1}=(\sqrt{\mu _{max}}+\sqrt{\mu _{min}})^4, \quad a_{2}=(\sqrt{\mu _{max}}-\sqrt{\mu _{min}})^2 \end{aligned}$$

Furthermore, the corresponding optimal semi-convergence factor of the TMSSOR method is

$$\begin{aligned} \min \limits _{\alpha , \beta , \omega }\nu (T_{\alpha ,\beta ,\omega })=\nu (T_{\alpha _{opt},\beta _{opt},\omega _{opt}})=\dfrac{\sqrt{\mu _{max}}-\sqrt{\mu _{min}}}{\sqrt{\mu _{max}}+\sqrt{\mu _{min}}} \end{aligned}$$
(4.16)

Proof

From the above analysis, we know that \(\nu (T_{\alpha ,\beta ,\omega })\) can attain its minimum only when \(\mu _1+\mu _2 = \mu _{min}+\mu _{max}\). When \(\mu _1+\mu _2 = \mu _{min}+\mu _{max}\), from (4.13) and (4.14), it holds

$$\begin{aligned} \begin{aligned} \nu (T_{\alpha ,\beta ,\omega })&=\lambda _1(\mu _1, \mu _2)=\lambda _2(\mu _1, \mu _2) \\&= \dfrac{\mu _{max}-\mu _{min}+2\sqrt{-\mu _1^2+(\mu _{min}+\mu _{max})\mu _1-\mu _{min}\mu _{max}}}{\mu _ {min}+\mu _{max}+2\sqrt{-\mu _1^2+(\mu _{min}+\mu _{max})\mu _1}} \end{aligned} \end{aligned}$$

By the technique in [19], let \(r=-\mu _1^2+(\mu _{min}+\mu _{max})\mu _1\ge \mu _{min}\mu _{max}\) and define

$$\begin{aligned} g(r)=\nu (T_{\alpha ,\beta ,\omega })=\dfrac{\mu _{max}-\mu _{min}+2\sqrt{r-\mu _{min}\mu _{max}}}{\mu _{min}+\mu _{max}+2\sqrt{r}} \end{aligned}$$

It is easy to see that g(r) attains its minimum at \(r=\mu _{min}\mu _{max}\), which means that \(\nu (T_{\alpha ,\beta ,\omega })\) attains its minimum \(\dfrac{\sqrt{\mu _{max}}-\sqrt{\mu _{min}}}{\sqrt{\mu _{max}}+\sqrt{\mu _{min}}}\) at \(\mu _1=\mu _{min}\) and \(\mu _2=\mu _{max}\). By Eqs. (4.6), (4.9) and (4.10), together with \(\mu _1=\mu _{min}\) and \(\mu _2=\mu _{max}\), after some simple algebra we obtain the following optimal parameters:

$$\begin{aligned} \alpha _{opt}= & {} \dfrac{4\sqrt{\mu _{max}\mu _{min}}}{(\sqrt{\mu _{max}}+\sqrt{\mu _{min}})^2},\\ \beta _{opt}= & {} \dfrac{(\sqrt{\mu _{max}}+\sqrt{\mu _{min}})^2\omega _{opt}(2-\omega _{opt})-4\omega _{opt}\sqrt{\mu _{max}\mu _{min}}}{4\sqrt{\mu _{max}\mu _{min}}(1-\omega _{opt})} \end{aligned}$$

and \(\omega _{opt}\) is the root of the equation

$$\begin{aligned} \sqrt{\mu _{max}\mu _{min}}(a_{1}+4a_{2}-16\sqrt{\mu _{min} \mu _{max}})\omega ^2-(1+2\sqrt{\mu _{max}\mu _{min}})a_{1}\omega +a_{1}=0 \end{aligned}$$

where \(a_{1}=(\sqrt{\mu _{max}}+\sqrt{\mu _{min}})^4\), \(a_{2}=(\sqrt{\mu _{max}}-\sqrt{\mu _{min}})^2\). \(\square \)
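In practice, the quantities of Theorem 4.1 can be computed directly from \(\mu _{min}\) and \(\mu _{max}\). A sketch (we solve the quadratic for \(\omega _{opt}\) with numpy.roots and pick a root heuristically; the chosen root should be verified against the constraint (4.6)):

```python
import numpy as np

def tmssor_optimal_parameters(mu_min, mu_max):
    """Local optimal parameters of Theorem 4.1 from the extreme nonzero
    eigenvalues of Q^{-1} B^T A^{-1} B."""
    r = np.sqrt(mu_max * mu_min)
    a1 = (np.sqrt(mu_max) + np.sqrt(mu_min)) ** 4
    a2 = (np.sqrt(mu_max) - np.sqrt(mu_min)) ** 2
    alpha = 4 * r / (np.sqrt(mu_max) + np.sqrt(mu_min)) ** 2
    # quadratic for omega: r(a1 + 4 a2 - 16 r) w^2 - (1 + 2 r) a1 w + a1 = 0
    roots = np.roots([r * (a1 + 4 * a2 - 16 * r), -(1 + 2 * r) * a1, a1])
    omega = min(roots, key=abs)          # heuristic choice; check (4.6) holds
    beta = ((np.sqrt(mu_max) + np.sqrt(mu_min)) ** 2 * omega * (2 - omega)
            - 4 * omega * r) / (4 * r * (1 - omega))
    nu_opt = (np.sqrt(mu_max) - np.sqrt(mu_min)) / (np.sqrt(mu_max) + np.sqrt(mu_min))
    return alpha, beta, omega, nu_opt    # cf. (4.15) and (4.16)
```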

Remark 4.2

Theorem 4.1 shows that the TMSSOR method has the same asymptotic convergence rate as the GSOR method [9] when each is run with its own optimal iteration parameters. For other parameter choices, the two methods may exhibit different convergence behavior. Since the GSOR method has a lower computational cost per iteration, it is more effective in practical applications.

5 Numerical experiments

In this section, some numerical experiments are given to compare the performance of the TMSSOR method with the GSOR, MSSOR, MSSOR-like [25] and GMSSOR methods. We denote the number of iteration steps by IT, the elapsed CPU time in seconds by CPU, and the norm of the residual vectors by RES.

All the computations are implemented in MATLAB 7.0.19920 (R14) on a PC with a 1.86 GHz 64-bit processor and 2 GB memory. In actual computations, we choose the right-hand side vector \((b^T,-q^T)^T\in R^{m+n}\) such that the exact solution of the augmented linear system (1.1) is \(((x^*)^T,(y^*)^T)^T=(1,1,\ldots ,1)^T\in R^{m+n}\), and all runs are started from the initial vector \(((x^{(0)})^T,(y^{(0)})^T)^T=(0,0,\ldots ,0)^T\in R^{m+n}\). The iteration is stopped as soon as \(RES\le 10^{-6}\), where

$$\begin{aligned} RES:=\frac{\sqrt{||b-Ax^{(k)}-By^{(k)}||_2^2+||q-B^{T}x^{(k)}||_2^2}}{\sqrt{||b-Ax^{(0)}-By^{(0)}||_2^2+||q-B^{T}x^{(0)}||_2^2}} \end{aligned}$$

with \(((x^{(k)})^T, (y^{(k)})^T)^T\) being the final approximate solution.
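In code, the stopping test is one line per residual block; a sketch matching the formula above:

```python
import numpy as np

def res(A, B, b, q, x, y, x0, y0):
    """Relative residual RES; the iteration stops once RES <= 1e-6."""
    num = np.hypot(np.linalg.norm(b - A @ x - B @ y),
                   np.linalg.norm(q - B.T @ x))
    den = np.hypot(np.linalg.norm(b - A @ x0 - B @ y0),
                   np.linalg.norm(q - B.T @ x0))
    return num / den
```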

Example 5.1

Consider the Oseen equation:

$$\begin{aligned} \left\{ \begin{array}{l@{\quad }l} -\nu {\varDelta } \pmb {\mu }+(\pmb {\omega }\cdot \nabla )\pmb {\mu }+\nabla p=f, &{}\quad \text {in}~{\varOmega } \\ -\nabla \cdot \pmb {\mu }=0,&{}\quad \text {in}~{\varOmega }\\ \nabla \cdot \pmb {\omega }=0. \end{array}\right. \end{aligned}$$
(5.1)

where \({\varOmega }\) is an open bounded domain in \( R^{2}\) or \( R^{3}\), the vector field \(\pmb {\mu }\) represents the velocity in \({\varOmega }\), p denotes the pressure, and the scalar \(\nu \) is the viscosity, which is inversely proportional to the Reynolds number. The test problem is a leaky two-dimensional lid-driven cavity problem in the square domain \({\varOmega }=(0,1)\times (0,1)\), where \(\pmb {\mu }=(\mu , \upsilon )^{T}\) denotes the velocity field and \(\pmb {\omega }=(a,b)^{T}\) denotes the wind. The boundary conditions are \(\mu =\upsilon =0\) on the three walls \((x=0, y=0, x=1)\), and \(\mu =1, \upsilon =0\) on the moving wall \((y=1)\). We take the constant “wind” \(a=1, b=2\). To discretize (5.1), we use the “marker and cell” (MAC) finite difference scheme [20]. Divide \({\varOmega }\) into a uniform \(l\times l\) grid of cells of width \(h=\frac{1}{l}\). The discrete velocities and pressures are defined on a staggered grid in which the discrete values of \(\mu \) lie in the centers of the cell boundaries orthogonal to the x-axis, the discrete values of \(\upsilon \) lie in the centers of the cell boundaries orthogonal to the y-axis, and the discrete pressures lie in the cell centers. Then we obtain the matrix representation of the Oseen equations (5.1):

$$\begin{aligned} \left( \begin{array}{c@{\quad }c} A&{} B\\ -B^{T}&{} \mathbf {0} \end{array} \right) \left( \begin{array}{c} \mu \\ p \end{array} \right) =\left( \begin{array}{c} f\\ 0 \end{array} \right) \end{aligned}$$
(5.2)

with the matrix blocks of the following form:

\(A=\left( \begin{array}{c@{\quad }c}F_1&{}0\\ 0&{}F_2\end{array}\right) \in R^{2l(l-1)\times 2l(l-1)}\), \(B=(B_1,B_2)\in R^{2l(l-1)\times l^2}\),

Fig. 1 The distribution of the eigenvalues of \(\mathscr {A}\) and \(M_{\alpha , \beta ,\omega }^{-1}\mathscr {A}\) with different parameters. a The distribution of the eigenvalues of \(\mathscr {A}\). b The distribution of the eigenvalues of \(M_{\alpha , \beta , \omega }^{-1}\mathscr {A}\) with \(\omega =0.426, \alpha =0.952, \beta =0.485, l=8\). c The distribution of the eigenvalues of \(M_{\alpha , \beta , \omega }^{-1}\mathscr {A}\) with \(\omega =0.477, \alpha =0.916, \beta =0.604, l=16\). d The distribution of the eigenvalues of \(M_{\alpha , \beta , \omega }^{-1}\mathscr {A}\) with \(\omega =0.501,\alpha =0.898,\beta =0.671, l=24\)

\(F_{i}=\nu A_{i}+N_{i}\in R^{l(l-1)\times l(l-1)}\), \((i=1,2)\). For convenience, let \(m=2l(l-1)\) and \(n=l^2\) in this example; then A is positive real, \(rank(B)=l^2-1\), and \(\mathscr {N}(B)=span\{e_{n}\}\) with \(e_{n}=(1, 1, \ldots , 1)^T\in R^{n}\). To ensure that the (1,1)-block matrix is symmetric positive definite, we finally take the test coefficient matrix as follows:

$$\begin{aligned} \left( \begin{array}{l@{\quad }l} {\bar{A}} &{} B \\ -B^{T} &{} {\mathbf{0}} \end{array}\right) \end{aligned}$$

where \(\bar{A}=\frac{1}{2}(A+A^{T})\). For the preconditioner of the TMSSOR method, notice

$$\begin{aligned}&M_{\alpha ,\beta ,\omega }^{-1}\\&\quad =t\left( \begin{array}{c@{\quad }c} (1-\omega +\alpha \omega )(1-\alpha \omega )\bar{A}^{-1}-\dfrac{\omega ^2}{\beta }\bar{A}^{-1}BQ^{-1}B^{T}\bar{A}^{-1}&{}-\omega \bar{A}^{-1}BQ^{-1}\\ \dfrac{\omega (\beta +\omega -\beta \omega )}{\beta }Q^{-1}B^{T}\bar{A}^{-1}&{}(\beta +\omega -\beta \omega )Q^{-1} \end{array}\right) \end{aligned}$$

and \(t=\dfrac{\omega (2-\omega )}{(\beta +\omega -\beta \omega )(1-\omega +\alpha \omega )(1-\alpha \omega )}\). We approximate the actions of \(\bar{A}^{-1}\) and \(Q^{-1}\) in \(M_{\alpha , \beta , \omega }^{-1}\) by the incomplete LU factorization with drop tolerance 0.001. We take the viscosity \(\nu =0.5\) in our experiments and choose the matrix Q as in Table 1. For this example, we consider the four cases \(l = 8\), \(l = 16\), \(l = 24\) and \(l = 32\).
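For the inner solves, SciPy's incomplete LU offers a close analogue of the setup just described; a sketch (spilu and its drop_tol argument are standard SciPy, but the wrapper itself is our own illustration):

```python
import scipy.sparse as sp
import scipy.sparse.linalg as spla

def make_inexact_solvers(A_bar, Q, drop_tol=1e-3):
    """Approximate the actions of A_bar^{-1} and Q^{-1} inside
    M_{alpha,beta,omega}^{-1} by incomplete LU factors."""
    iluA = spla.spilu(sp.csc_matrix(A_bar), drop_tol=drop_tol)
    iluQ = spla.spilu(sp.csc_matrix(Q), drop_tol=drop_tol)
    return iluA.solve, iluQ.solve  # each maps v to an approximate solve
```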

In Fig. 1, we choose Q as Case II in Table 1 and display the distribution of the eigenvalues of the matrix \(\mathscr {A}\) and of the preconditioned matrices \(M_{\alpha , \beta , \omega }^{-1}\mathscr {A}\) for different iteration parameters. We find that the eigenvalues of the preconditioned matrices are quite clustered. In Tables 2, 3, 4 and 5, we choose Q as Case III in Table 1 and present the numerical results for the different iteration methods, both as solvers and as preconditioners to accelerate the restarted GMRES(k). In Table 6, we choose Q as Case I and Case II in Table 1, respectively, and list the optimal parameters and the corresponding optimal convergence factors of the different iteration methods. When the optimal parameters are employed, the results reveal that the TMSSOR method is effective; in particular, it requires the smallest number of iterations.

Table 1 Choices of matrix Q, with \(\widehat{Q}=Diag(B^T\widehat{A}^{-1}B,B^T B)\)
Table 2 Numerical results for Case III with l = 8
Table 3 Numerical results for Case III with l = 16
Table 4 Numerical results for Case III with l = 24
Table 5 Numerical results for Case III with l = 32
Table 6 Numerical results for Case I and Case II

6 Conclusion

In this paper, we propose a triple-parameter modified SSOR (TMSSOR) iteration method for solving a class of large sparse singular saddle point problems. We prove the semi-convergence of the TMSSOR method under suitable restrictions on the iteration parameters, and we obtain the local optimal parameters which minimize the pseudo-spectral radius of the associated iteration matrix. The determination of the global optimal parameters remains a topic for further study.