1 Introduction

The conjugate gradient method (CGM) is widely used to solve the following unconstrained optimization problem

$$ \begin{array}{@{}rcl@{}} \underset{x{\in}R^{n}}{\min} f(x), \end{array} $$
(1)

where \(f:R^{n}\rightarrow R\) is continuously differentiable and g(x) denotes its gradient at x, i.e., g(x) := ∇f(x). Generally, the CGM generates iterates of the form

$$ \begin{array}{@{}rcl@{}} x_{k+1}=x_{k}+ \alpha_{k}d_{k} \end{array} $$
(2)

and

$$ \begin{array}{@{}rcl@{}} d_{k}=\left\{ \begin{array}{lc} -g_{k}, & ~\text{if}~k=1, \\ -g_{k}+\beta_{k}d_{k-1}, & ~\text{if}~k\geq 2, \end{array} \right. \end{array} $$

where αk > 0 is the step-length and dk is the search direction determined by the conjugate parameter βk, with gk := g(xk). It is well known that the convergence and numerical performance of the CGM depend on the conjugate parameter, and different choices of this parameter lead to different conjugate gradient methods (CGMs). The classical CGMs include the Hestenes-Stiefel (HS) method [1], the Fletcher-Reeves (FR) method [2], the Polak-Ribière-Polyak (PRP) method [3, 4], the Conjugate Descent (CD) method [5], the Liu-Storey (LS) method [6] and the Dai-Yuan (DY) method [7], whose conjugate parameters are, respectively, given by

$$ \begin{array}{@{}rcl@{}} \beta_{k}^{\text{HS}}&=&\frac{{g_{k}^{T}}y_{k-1}}{d_{k-1}^{T}y_{k-1}}, \beta_{k}^{\text{FR}}=\frac{\|g_{k}\|^{2}}{\|g_{k-1}\|^{2}}, \beta_{k}^{\text{PRP}}=\frac{{g_{k}^{T}}y_{k-1}}{\|g_{k-1}\|^{2}},\\ \beta_{k}^{\text{CD}}&=&\frac{\|g_{k}\|^{2}}{-g_{k-1}^{T} d_{k-1}}, \beta_{k}^{\text{LS}}=\frac{{g_{k}^{T}}y_{k-1}}{-g_{k-1}^{T}d_{k-1}}, \beta_{k}^{\text{DY}}=\frac{\|g_{k}\|^{2}}{d_{k-1}^{T}y_{k-1}}, \end{array} $$

where ∥⋅∥ denotes the Euclidean norm and \(y_{k-1}:=g_{k}-g_{k-1}\). More well-known CGMs can be found in Refs. [8,9,10,11].
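For readers who prefer code, a minimal NumPy sketch of these formulas (our own illustration, not the authors' Matlab implementation) is given below; the argument names g, g_prev and d_prev stand for gk, gk−1 and dk−1.

```python
import numpy as np

def classical_betas(g, g_prev, d_prev):
    """Classical conjugate parameters; g, g_prev, d_prev play the roles of
    g_k, g_{k-1}, d_{k-1} (all 1-D NumPy arrays)."""
    y = g - g_prev                                 # y_{k-1} = g_k - g_{k-1}
    return {
        "HS":  (g @ y) / (d_prev @ y),
        "FR":  (g @ g) / (g_prev @ g_prev),
        "PRP": (g @ y) / (g_prev @ g_prev),
        "CD":  (g @ g) / -(g_prev @ d_prev),
        "LS":  (g @ y) / -(g_prev @ d_prev),
        "DY":  (g @ g) / (d_prev @ y),
    }
```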

In the past two decades, CGMs have been studied extensively. In particular, by varying the structure of the search direction of the classical CGMs, many new CGMs have been proposed, for example, the preconditioned CGM [12], the spectral CGM [13], the three-term CGM (TTCGM) [14, 15], and the spectral three-term CGM [16], to name just a few.

On the other hand, besides solving (1), CGMs have been applied to problems arising in other areas, such as tensor optimization [17], stochastic optimization [18], optimization on Riemannian manifolds [19], and sparse optimization [20]. In particular, using CGMs to solve image restoration problems arising in sparse optimization has received wide attention. Specifically, by adopting smoothing functions, Chen and Zhou [21] proposed a TTCGM to deal with the nonsmooth and nonconvex optimization problems arising from image restoration. Recently, Yin et al. [22] transformed the nonsmooth convex problem in image restoration into a system of nonlinear monotone equations, as in [23, 24], and then designed an efficient hybrid three-term conjugate gradient projection method to solve it. More related works can be found in Refs. [25,26,27].

In this paper, we focus on efficient TTCGMs and their application to image restoration. To this end, we first explore a family of hybrid three-term CGMs (HTTCGMs); we then design a new conjugate parameter for this family and apply the resulting method to image restoration problems. The main contributions of this paper are as follows.

  • A new family of HTTCGMs is proposed, in which the conjugate parameter \(\beta _{k}^{\text {new}}\) in the search direction is a hybrid of \(\beta _{k}^{\text {DY}}\) and any other conjugate parameter. The search direction is the sum of the negative gradient direction \(-g_{k}\) and a convex combination of \(\beta _{k}^{\text {new}}d_{k-1}\) and \(\theta_{k}g_{k-1}\), where \(\theta_{k}\) is an appropriate parameter; see (7) below.

  • The search direction generated by the proposed family is a descent direction at each iteration, regardless of the choice of conjugate parameter and line search criterion. Furthermore, this family is proved to be globally convergent under standard assumptions.

  • Motivated by the idea of the hybrid CGM introduced in [28], an efficient conjugate parameter is designed for the family.

  • Numerical experiments on problem (1) and on image restoration problems are carried out for the proposed family, and the corresponding results show that the new methods are efficient and promising.

The rest of this article is organized as follows. In Section 2, a family of HTTCGMs is proposed and the corresponding algorithmic framework for problem (1) is given. In Section 3, the descent property and global convergence of the new family are established. In Section 4, a new conjugate parameter is designed for the family, and its effectiveness is verified by solving medium- and large-scale unconstrained optimization and image restoration problems. Section 5 concludes this work.

2 Motivation and algorithm

Beale [29] observed that restarting frequently along the negative gradient direction may not be optimal. He therefore suggested defining the search direction of the restarted CGM by

$$ \begin{array}{@{}rcl@{}} d_{k}=-g_{k}+\beta_{k}d_{k-1}+\gamma_{k}d_{t}, \end{array} $$

where \(\gamma _{k}=\frac {{g_{k}^{T}} y_{t}}{{d_{t}^{T}} y_{t}}\) and 1 ≤ t < k. This is an early form of the TTCGM. Clearly, whenever the conjugate parameter βk equals zero, the next iteration restarts along the direction dk = −gk + γkdt. Since then, the TTCGM has received much attention, and many TTCGMs have been proposed with the following direction structure

$$ \begin{array}{@{}rcl@{}} d_{k}=-g_{k}+\beta_{k}d_{k-1}+\theta_{k}y_{k-1}, \end{array} $$
(3)

where θk is a parameter to be determined. Letting \(\beta _{k}:=\beta _{k}^{\text {PRP}}\) in (3) and then solving the equation \({g_{k}^{T}}d_{k}=-\|g_{k}\|^{2}\) to yield θk, Zhang et al. [30] proposed an improved TTCGM as follows:

$$ \begin{array}{@{}rcl@{}} d_{k}=\left\{ \begin{array}{lc} -g_{k}, & ~\text{if}~k=1, \\ -g_{k}+\beta_{k}^{\text{PRP}}d_{k-1}-\frac{{g_{k}^{T}}d_{k-1}}{\|g_{k-1}\|^{2}}y_{k-1}, & ~\text{if}~k\geq 2. \end{array} \right. \end{array} $$

Based on the form of (3), Kou and Dai [31] proposed a TTCGM with restarting procedure (KD method for short), where

$$ \begin{array}{@{}rcl@{}}\beta_{k}&=&\max\left\{\frac{{g_{k}^{T}} y_{k-1}}{d_{k-1}^{T} y_{k-1}}-\left( \frac{s_{k-1}^{T} y_{k-1}}{\|s_{k-1}\|^{2}}+\frac{\|y_{k-1}\|^{2}}{s_{k-1}^{T} y_{k-1}}\right) \frac{{g_{k}^{T}} s_{k-1}}{d_{k-1}^{T} y_{k-1}},\zeta\frac{{g_{k}^{T}}d_{k-1}}{\|d_{k-1}\|^{2}}\right\},\\ 0&<&\zeta<1, \theta_{k}=\xi_{k}\frac{{g_{k}^{T}}d_{k-1}}{d_{k-1}^{T}y_{k-1}},\ s_{k-1}=x_{k}-x_{k-1},\ 0\leq\xi_{k}\leq1.\end{array} $$

Due to the introduction of the restarting procedure, the KD method achieves much better numerical performance, especially on hard problems. In [32], Narushima et al. proposed a family of TTCGMs that always generate a search direction satisfying the sufficient descent condition, a property that is independent of the choices of βk and line searches. Specifically, the search direction in [32] is determined by

$$ \begin{array}{@{}rcl@{}} d_{k}=\left\{ \begin{array}{ll} -g_{k}, & ~\text{if}~k=0 ~\text{or} ~ {g_{k}^{T}}p_{k}=0, \\ -g_{k}+\beta_{k}d_{k-1}-\beta_{k}\frac{{g_{k}^{T}}d_{k-1}}{{g_{k}^{T}}p_{k}}p_{k}, & ~\text{otherwise}, \end{array} \right. \end{array} $$

where \(p_{k}\in \mathbb {R}^{n}\) is an arbitrary vector; it could be \(g_{k}\), \(d_{k-1}\), \(y_{k-1}\), etc. Recently, inspired by [33], Liu et al. [34] proposed four efficient TTCGMs with the following direction structure

$$ \begin{array}{@{}rcl@{}} d_{k}= -g_{k}+\beta_{k}d_{k-1}+\theta_{k}g_{k-1}. \end{array} $$
(4)

Setting \(\beta _{k}:=\beta _{k}^{\text {LS}}-\frac {\|g_{k-1}\|^{2}{g_{k}^{T}}s_{k-1}}{(d_{k-1}^{T}g_{k-1})^{2}}\) and \(\theta _{k}:=\frac {{g_{k}^{T}}d_{k-1}}{-d_{k-1}^{T}g_{k-1}}\) in (4), they established the global convergence of the resulting algorithm under the strong Wolfe line search.

Based on the convex combination technique, Yuan et al. [25] gave an efficient strategy to design the search direction as follows:

$$ \begin{array}{@{}rcl@{}} d_{k}=\left\{ \begin{array}{lc} -g_{k}, & ~\text{if}~k=0, \\ -N_{k}g_{k}+(1-N_{k})\frac{{g_{k}^{T}}y_{k-1}d_{k-1}-d_{k-1}^{T}g_{k}y_{k-1}}{\max\{ 2\chi\|d_{k-1}\|\|y_{k-1}\|,-d_{k-1}^{T}g_{k-1}\}}, & ~\text{if}~k\geq 1, \end{array} \right. \end{array} $$
(5)

where \(\chi \in (0, 1)\), \(N_{k}=\frac {y_{k-1}^{T}y_{k-1}}{y_{k-1}^{T}s_{k-1}^{\ast }}\in (0, 1]\), \(s_{k-1}^{\ast }=s_{k-1}+(\max \{0,\frac {-s_{k-1}^{T}y_{k-1}}{\|y_{k-1}\|^{2}}\}+1)y_{k-1}\), and \(y_{k-1}=g_{k}-g_{k-1}\). The numerical results for image restoration problems reported in [25] verify the effectiveness of the resulting method.

On the other hand, to guarantee that \(\beta _{k}^{\text {HS}}\) is nonnegative and that the HS CGM is globally convergent, Dai and Yuan [35] proposed a hybrid CGM (HD CGM for brevity), whose conjugate parameter is given by

$$ \begin{array}{@{}rcl@{}} \beta_{k}^{\text{HD}}=\max\left\{0,\min\left\{\beta_{k}^{\text{HS}},\beta_{k}^{\text{DY}}\right\}\right\}. \end{array} $$

Under usual assumptions, the authors of [35] showed the global convergence of the HD CGM with the weak Wolfe line search, and a large number of numerical experiments illustrate its efficiency.

In this paper, we are devoted to making fuller use of the information of the objective function at the current iteration to construct efficient algorithms for large-scale unconstrained optimization and image restoration problems. Motivated by the studies in [32, 34], the convex combination technique used in [25], and the idea of the hybrid CGM proposed in [35], we present a new family of hybrid TTCGMs with the following search direction:

$$ \begin{array}{@{}rcl@{}} d_{k}&=&\left\{ \begin{array}{lc} -g_{k}, & ~\text{if}~k=1, \\ -g_{k}+(1-\lambda_{k})\beta_{k}^{\text{new}}d_{k-1}+\lambda_{k}\theta_{k}g_{k-1}, & ~\text{if}~k\geq 2, \end{array} \right. \end{array} $$
(6)
$$ \begin{array}{@{}rcl@{}} \beta_{k}^{\text{new}}&=&\max\left\{0,\min\left\{\beta_{k},\beta_{k}^{\text{DY}}\right\}\right\},\ \lambda_{k}=\frac{|{g_{k}^{T}}d_{k-1}|}{\|g_{k}\|\|d_{k-1}\|},\ \theta_{k}=-\eta\frac{{g_{k}^{T}}g_{k-1}}{\|g_{k-1}\|^{2}}, \end{array} $$
(7)

where βk is any conjugate parameter and 0 < η < 1. It is easy to see from the definition of λk that λk ∈ [0, 1] for all k ≥ 2. In particular, if λk = 0 in (6) and \(\beta _{k}:=\beta _{k}^{\text {HS}}\) in (7), then the search direction (6) reduces to that of the HD CGM. It is worth mentioning that, in the forthcoming analysis (see Lemma 1 and Theorem 1 below), the descent property of the search direction (6) and the global convergence of the proposed family are independent of the choices of βk and line searches. These facts allow more flexibility from both a theoretical and a practical viewpoint.
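As an illustration of how (6)-(7) can be evaluated in practice, here is a minimal NumPy sketch of ours (not the authors' Matlab code); the function name and the default eta=0.4 (the value used later in Section 4) are our own choices, and beta_k denotes any conjugate parameter supplied by the caller.

```python
import numpy as np

def httcgm_direction(g, g_prev, d_prev, beta_k, eta=0.4):
    """Search direction (6)-(7): a hedged sketch, not the authors' code.
    g, g_prev: gradients at x_k and x_{k-1}; d_prev: previous direction;
    beta_k: any conjugate parameter; 0 < eta < 1."""
    y = g - g_prev
    beta_dy = (g @ g) / (d_prev @ y)                      # beta_k^{DY}
    beta_new = max(0.0, min(beta_k, beta_dy))             # hybridization in (7)
    lam = abs(g @ d_prev) / (np.linalg.norm(g) * np.linalg.norm(d_prev))
    theta = -eta * (g @ g_prev) / (g_prev @ g_prev)
    return -g + (1.0 - lam) * beta_new * d_prev + lam * theta * g_prev
```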

Now, based on the search direction (6) and the weak Wolfe line search (8), we formally present the detailed steps of the family (FHTTCGMs).

Algorithm 1 (FHTTCGMs)

Step 0. Choose an initial point x1 ∈ Rn and parameters 0 < δ < σ < 1, 0 < η < 1, ε ≥ 0; set k := 1.

Step 1. If ∥gk∥ ≤ ε, stop.

Step 2. Compute the search direction dk by (6) and (7).

Step 3. Determine the step-length αk by the weak Wolfe line search

$$ \begin{array}{@{}rcl@{}} f(x_{k}+\alpha_{k}d_{k})&\leq& f(x_{k})+\delta\alpha_{k}{g_{k}^{T}}d_{k}, \\ g(x_{k}+\alpha_{k}d_{k})^{T}d_{k}&\geq&\sigma {g_{k}^{T}}d_{k}. \end{array} $$
(8)

Step 4. Set xk+1 = xk + αkdk according to (2), let k := k + 1, and go to Step 1.
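The following Python sketch assembles the pieces into the FHTTCGM loop, using a simple bisection-type routine for the weak Wolfe conditions (8); it reuses httcgm_direction from the sketch above. All function names, the bisection strategy and the defaults (δ = 0.01, σ = 0.1, η = 0.4, tolerance 1e-6, at most 2000 iterations, matching the settings reported in Section 4) are our own assumptions rather than the authors' implementation.

```python
import numpy as np

def weak_wolfe(f, grad, x, d, gTd, delta=0.01, sigma=0.1, max_iter=50):
    """Bisection-type search for a step satisfying the weak Wolfe conditions (8).
    A generic sketch; returns the last trial step if the budget is exhausted."""
    lo, hi, alpha = 0.0, np.inf, 1.0
    fx = f(x)
    for _ in range(max_iter):
        if f(x + alpha * d) > fx + delta * alpha * gTd:      # Armijo fails: shrink
            hi = alpha
        elif grad(x + alpha * d) @ d < sigma * gTd:          # curvature fails: grow
            lo = alpha
        else:
            return alpha
        alpha = 0.5 * (lo + hi) if np.isfinite(hi) else 2.0 * lo
    return alpha

def fhttcgm(f, grad, x0, beta_rule, eta=0.4, tol=1e-6, max_iter=2000):
    """FHTTCGM framework sketch: direction (6)-(7) plus weak Wolfe step (8).
    beta_rule(g, g_prev, d_prev) returns the chosen conjugate parameter beta_k."""
    x, g = np.asarray(x0, dtype=float), grad(x0)
    d = -g                                                   # d_1 = -g_1
    for _ in range(max_iter):
        if np.linalg.norm(g) <= tol:
            break
        alpha = weak_wolfe(f, grad, x, d, g @ d)
        x_new = x + alpha * d
        g_new = grad(x_new)
        beta_k = beta_rule(g_new, g, d)
        d = httcgm_direction(g_new, g, d, beta_k, eta)       # from the sketch above
        x, g = x_new, g_new
    return x
```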

3 The descent property and global convergence

In this section, we firstly analyze the descent property for the search direction generated by FHTTCGMs. Subsequently, we focus on proving its global convergence.

The following lemma shows that dk defined in (6) is a descent direction, and provides an estimate of \((1-\lambda _{k})\beta _{k}^{\text {new}}\), which is critical for the subsequent convergence analysis.

Lemma 1

Let {dk} be the sequence of search directions generated by FHTTCGMs. Then the following relation holds:

$$ \begin{array}{@{}rcl@{}} {g_{k}^{T}}d_{k}< 0,\ k\geq 1, \end{array} $$
(9)

which implies that the search direction generated by FHTTCGMs is a descent direction. Furthermore, for all k ≥ 2, we have

$$ \begin{array}{@{}rcl@{}} 0\leq(1-\lambda_{k})\beta_{k}^{\text{new}}\leq \frac{{g_{k}^{T}}d_{k}}{g_{k-1}^{T}d_{k-1}}. \end{array} $$
(10)

Proof

We prove the first assertion by induction. When k = 1, it follows from (6) that \({g_{1}^{T}}d_{1}=-\|g_{1}\|^{2}< 0\). Suppose now that (9) holds for k − 1 with k ≥ 2. Recall that λk ∈ [0, 1] for all k ≥ 2. Next, we prove that (9) also holds for k by considering the following three cases.

Case I: \(\beta _{k}^{\text {new}}=0\). Multiplying both sides of (6) by \({g_{k}^{T}}\), it follows that

$$ \begin{array}{@{}rcl@{}} {g_{k}^{T}}d_{k}&=&-\|g_{k}\|^{2}-\eta\lambda_{k}\frac{\left( {g_{k}^{T}}g_{k-1}\right)^{2}}{\|g_{k-1}\|^{2}} =-\left( 1+\eta\lambda_{k}\cos^{2}\vartheta_{k}\right)\|g_{k}\|^{2}<0, \end{array} $$

where 𝜗k is the angle between gk and gk− 1.

Case II: \(\beta _{k}^{\text {new}}=\beta _{k}^{\text {DY}}\). By the definition of \(\beta _{k}^{\text {new}}\) in (7), we have \(\beta _{k}^{\text {DY}}>0\). Further, multiplying both sides of (6) by \({g_{k}^{T}}\), we get

$$ \begin{array}{@{}rcl@{}} {g_{k}^{T}}d_{k}\!&=&-\|g_{k}\|^{2}+(1-\lambda_{k})\beta_{k}^{\text{new}}{g_{k}^{T}}d_{k-1}-\eta\lambda_{k}\frac{\left( {g_{k}^{T}}g_{k-1}\right)^{2}}{\|g_{k-1}\|^{2}}\\ &=&-\left( 1+\eta\lambda_{k}\cos^{2}\vartheta_{k}\right)\|g_{k}\|^{2}+(1-\lambda_{k})\beta_{k}^{\text{DY}}{g_{k}^{T}}d_{k-1}\\ &=\!& - \left( 1 + \eta\lambda_{k}\cos^{2}\vartheta_{k}\right)\|g_{k}\|^{2} + (1 - \lambda_{k}) \left( \frac{\|g_{k}\|^{2}}{d_{k-1}^{T}y_{k-1}}d_{k-1}^{T}y_{k-1} + \beta_{k}^{\text{DY}}g_{k-1}^{T}d_{k-1}\!\right)\\ &=&-\left( 1+\eta\cos^{2}\vartheta_{k}\right)\lambda_{k}\|g_{k}\|^{2}+(1-\lambda_{k})\beta_{k}^{\text{DY}}g_{k-1}^{T}d_{k-1}. \end{array} $$

The above relation can be rewritten as

$$ \begin{array}{@{}rcl@{}} {g_{k}^{T}}d_{k}=\left\{ \begin{array}{ll} (1-\lambda_{k})\beta_{k}^{\text{new}}g_{k-1}^{T}d_{k-1}, & \text{if}~\lambda_{k}=0, \\ -\left( 1+\eta\cos^{2}\vartheta_{k}\right)\|g_{k}\|^{2}, & \text{if}~\lambda_{k}=1, \\ -\left( 1+\eta\cos^{2}\vartheta_{k}\right)\lambda_{k}\|g_{k}\|^{2}+(1-\lambda_{k})\beta_{k}^{\text{new}}g_{k-1}^{T}d_{k-1}, & \text{if}~0<\lambda_{k}<1. \end{array} \right. \end{array} $$
(11)

This together with the induction hypothesis yields \({g_{k}^{T}}d_{k}<0\).

Case III: \(\beta _{k}^{\text {new}}=\beta _{k}\). Again, using the definition of \(\beta _{k}^{\text {new}}\) in (7) leads us to the relation \(0<\beta _{k} \leq \beta _{k}^{\text {DY}}\), which further implies \(d_{k-1}^{T}y_{k-1}>0\) from the definition of \(\beta _{k}^{\text {DY}}\). Hence, we obtain from (6) that

$$ \begin{array}{@{}rcl@{}} {g_{k}^{T}}d_{k}\!&=&-\|g_{k}\|^{2}+(1-\lambda_{k})\beta_{k}^{\text{new}}{g_{k}^{T}}d_{k-1}-\eta\lambda_{k}\frac{\left( {g_{k}^{T}}g_{k-1}\right)^{2}}{\|g_{k-1}\|^{2}}\\ &=&-\left( 1+\eta\lambda_{k}\cos^{2}\vartheta_{k}\right)\|g_{k}\|^{2}+(1-\lambda_{k})\beta_{k}\left( d_{k-1}^{T}y_{k-1}+g_{k-1}^{T}d_{k-1}\right)\\ &\leq& - \left( 1 + \eta\lambda_{k}\cos^{2}\vartheta_{k}\right)\|g_{k}\|^{2}+(1 - \lambda_{k})\left( \frac{\|g_{k}\|^{2}}{d_{k-1}^{T}y_{k-1}} d_{k-1}^{T}y_{k-1}+\beta_{k}g_{k-1}^{T}d_{k-1}\right)\\ &=&-\left( 1+\eta\cos^{2}\vartheta_{k}\right)\lambda_{k}\|g_{k}\|^{2}+(1-\lambda_{k})\beta_{k}g_{k-1}^{T}d_{k-1}. \end{array} $$

Similarly, we conclude that the relation in (11) still holds. Combining this with the induction hypothesis yields \({g_{k}^{T}}d_{k}<0\). Thus, we have shown that the relation in (9) holds.

Now, we establish the second assertion. If \(\beta _{k}^{\text {new}}=0\), we have from (9) that

$$ 0=(1-\lambda_{k})\beta_{k}^{\text{new}}< \frac{{g_{k}^{T}}d_{k}}{g_{k-1}^{T}d_{k-1}}. $$

If \(\beta _{k}^{\text {new}}=\beta _{k}^{\text {DY}}\) or βk, we deduce from (9) and (11) that

$$ 0\leq(1-\lambda_{k})\beta_{k}^{\text{new}}\leq \frac{{g_{k}^{T}}d_{k}}{g_{k-1}^{T}d_{k-1}}. $$

Thus, the proof is complete. □

From the above proof, it can be seen that the descent property of the search direction defined in (6) does not depend on any specific conjugate parameter or line search criterion.

To analyze the global convergence property of the FHTTCGMs, the following assumptions for the objective function are required:

  • A1 The level set \({\Lambda} = \{x\in R^{n} \mid f(x) \leq f(x_{1})\}\) is bounded;

  • A2 In a neighborhood U of Λ, f(x) is differentiable and its gradient g(x) is Lipschitz continuous, namely, there exists a constant L > 0 such that

$$ \|g(x)-g(y)\|\leq L\|x-y\|,\ \forall\ x,y\in U. $$

From the weak Wolfe line search (8), we know that the sequence {f(xk)} is monotonically nonincreasing. Combining this with assumption A1, we conclude that the sequence {xk} is bounded.

The following lemma is the well-known Zoutendijk condition. It is very important for the convergence analysis of the CGM; see [36] for details.

Lemma 2

Consider an iteration of the form (2), where dk satisfies the descent condition \({g_{k}^{T}}d_{k}<0\) and αk satisfies the weak Wolfe line search (8). If assumptions A1-A2 hold, then \(\sum\limits _{k=1}^{\infty }\frac {({g_{k}^{T}}d_{k})^{2}}{\|d_{k}\|^{2}}<+\infty \).

With Lemmas 1 and 2 at hand, we give the convergence analysis of FHTTCGMs.

Theorem 1

Let {xk} be a sequence generated by FHTTCGMs. If assumptions A1-A2 hold, then it holds that \(\underset {k\to \infty }{\liminf } \|g_{k}\|=0\).

Proof

We prove the claim by contradiction. Suppose that there exists γ > 0 such that ∥gk∥ ≥ γ for all k ≥ 1. From assumption A2 and the boundedness of {xk}, we know that {gk} is bounded; that is, there is a positive constant \(\widetilde {\gamma }\) such that

$$ \begin{array}{@{}rcl@{}} \gamma\leq \|g_{k}\|\leq \widetilde{\gamma}, \ \forall\ k\geq 1. \end{array} $$
(12)

From (6), we immediately obtain

$$ \begin{array}{@{}rcl@{}} d_{k}+g_{k}-\lambda_{k}\theta_{k}g_{k-1}=(1-\lambda_{k})\beta_{k}^{\text{new}}d_{k-1}. \end{array} $$

Next, taking the squared Euclidean norm on both sides of the above equality, we obtain

$$ \begin{array}{@{}rcl@{}} \|d_{k}\|^{2}+2{g_{k}^{T}}d_{k}+2\eta\lambda_{k}\frac{{g_{k}^{T}}g_{k-1}}{\|g_{k-1}\|^{2}}g_{k-1}^{T}d_{k}+\|g_{k}\|^{2}+ 2\eta\lambda_{k}\frac{{g_{k}^{T}}g_{k-1}}{\|g_{k-1}\|^{2}}{g_{k}^{T}}g_{k-1} \\ + \eta^{2}{\lambda_{k}^{2}}\left( \frac{{g_{k}^{T}}g_{k-1}}{\|g_{k-1}\|^{2}}\right)^{2}\|g_{k-1}\|^{2}=(1-\lambda_{k})^{2}\left( \beta_{k}^{\text{new}}\right)^{2}\|d_{k-1}\|^{2}. \end{array} $$
(13)

In addition, multiplying both sides of (6) by \(g_{k-1}^{T}\), we know that

$$ \begin{array}{@{}rcl@{}} g_{k-1}^{T}d_{k} &=& -{g_{k}^{T}}g_{k-1}+(1-\lambda_{k})\beta_{k}^{\text{new}}g_{k-1}^{T}d_{k-1}-\eta\lambda_{k}\|g_{k-1}\|^{2}\frac{{g_{k}^{T}}g_{k-1}}{\|g_{k-1}\|^{2}} \\ &=& -(1+\eta\lambda_{k}){g_{k}^{T}}g_{k-1}+(1-\lambda_{k})\beta_{k}^{\text{new}}g_{k-1}^{T}d_{k-1}. \end{array} $$
(14)

Substituting (14) into (13) and rearranging terms, we deduce that

$$ \begin{array}{@{}rcl@{}} \|d_{k}\|^{2}& =&(1-\lambda_{k})^{2}\left( \beta_{k}^{\text{new}}\right)^{2}\|d_{k-1}\|^{2}-2{g_{k}^{T}}d_{k}-2\eta\lambda_{k}(1-\lambda_{k})\frac{{g_{k}^{T}}g_{k-1}}{\|g_{k-1}\|^{2}} \beta_{k}^{\text{new}}g_{k-1}^{T}d_{k-1} \\ & &+2\eta\lambda_{k}\left( 1+\eta\lambda_{k}\right)\frac{\left( {g_{k}^{T}}g_{k-1}\right)^{2}}{\|g_{k-1}\|^{2}}-\|g_{k}\|^{2}-2\eta\lambda_{k} \frac{\left( {g_{k}^{T}}g_{k-1}\right)^{2}}{\|g_{k-1}\|^{2}}-\eta^{2}{\lambda_{k}^{2}}\frac{\left( {g_{k}^{T}}g_{k-1}\right)^{2}}{\|g_{k-1}\|^{2}} \\ &=& (1-\lambda_{k})^{2}\left( \beta_{k}^{\text{new}}\right)^{2}\|d_{k-1}\|^{2}-2{g_{k}^{T}}d_{k}-2\eta\lambda_{k}(1-\lambda_{k})\frac{{g_{k}^{T}}g_{k-1}}{\|g_{k-1}\|^{2}} \beta_{k}^{\text{new}}g_{k-1}^{T}d_{k-1} \\ && +\eta^{2}{\lambda_{k}^{2}}\frac{\left( {g_{k}^{T}}g_{k-1}\right)^{2}}{\|g_{k-1}\|^{2}}-\|g_{k}\|^{2}. \end{array} $$

Combining this with (9), (10), λk ∈ [0, 1] and the Cauchy-Schwarz inequality, we conclude that

$$ \begin{array}{@{}rcl@{}} \|d_{k}\|^{2} &\leq& (1-\lambda_{k})^{2}\left( \beta_{k}^{\text{new}}\right)^{2}\|d_{k-1}\|^{2}-2{g_{k}^{T}}d_{k}-2\eta\lambda_{k}(1-\lambda_{k})\frac{|{g_{k}^{T}}g_{k-1}|}{\|g_{k-1}\|^{2}} \beta_{k}^{\text{new}}g_{k-1}^{T}d_{k-1} \\ &&-\left( 1-\eta^{2}\right)\|g_{k}\|^{2} \\ &\leq& \frac{\left( {g_{k}^{T}}d_{k}\right)^{2}}{\left( g_{k-1}^{T}d_{k-1}\right)^{2}}\|d_{k-1}\|^{2}-2{g_{k}^{T}}d_{k}- 2\eta\lambda_{k}\frac{|{g_{k}^{T}}g_{k-1}|}{\|g_{k-1}\|^{2}}{g_{k}^{T}}d_{k} -\left( 1-\eta^{2}\right)\|g_{k}\|^{2} \\ &\leq& \frac{\left( {g_{k}^{T}}d_{k}\right)^{2}}{\left( g_{k-1}^{T}d_{k-1}\right)^{2}}\|d_{k-1}\|^{2}-2{g_{k}^{T}}d_{k}- 2\frac{\eta\widetilde{\gamma}^{2}}{\gamma^{2}}{g_{k}^{T}}d_{k} -\left( 1-\eta^{2}\right)\|g_{k}\|^{2} \\ &=& \frac{\left( {g_{k}^{T}}d_{k}\right)^{2}}{\left( g_{k-1}^{T}d_{k-1}\right)^{2}}\|d_{k-1}\|^{2}- 2\left( 1+\frac{\eta\widetilde{\gamma}^{2}}{\gamma^{2}}\right){g_{k}^{T}}d_{k}- \left( 1-\eta^{2}\right)\|g_{k}\|^{2}, \end{array} $$

where we made use of (12) for the third inequality. Letting \(P=1+\frac {\eta \widetilde {\gamma }^{2}}{\gamma ^{2}}>1\) and Q = 1 − η2 ∈ (0, 1), we then obtain

$$ \begin{array}{@{}rcl@{}} \|d_{k}\|^{2}\leq\frac{\left( {g_{k}^{T}}d_{k}\right)^{2}}{\left( g_{k-1}^{T}d_{k-1}\right)^{2}}\|d_{k-1}\|^{2}-2P{g_{k}^{T}}d_{k}-Q\|g_{k}\|^{2}. \end{array} $$

Dividing both sides of the above inequality by \(\left ({g_{k}^{T}}d_{k}\right )^{2}\) yields

$$ \begin{array}{@{}rcl@{}} \frac{\|d_{k}\|^{2}}{\left( {g_{k}^{T}}d_{k}\right)^{2}}&\leq& \frac{\|d_{k-1}\|^{2}}{\left( g_{k-1}^{T}d_{k-1}\right)^{2}} -\frac{2P}{{g_{k}^{T}}d_{k}}-Q\frac{\|g_{k}\|^{2}}{\left( {g_{k}^{T}}d_{k}\right)^{2}} \\ &=& \frac{\|d_{k-1}\|^{2}}{\left( g_{k-1}^{T}d_{k-1}\right)^{2}}- \left( \frac{P}{\sqrt{Q}\|g_{k}\|}+\frac{\sqrt{Q}\|g_{k}\|}{{g_{k}^{T}}d_{k}}\right)^{2}+ \frac{P^{2}}{Q\|g_{k}\|^{2}} \\ &\leq& \frac{\|d_{k-1}\|^{2}}{\left( g_{k-1}^{T}d_{k-1}\right)^{2}}+\frac{P^{2}}{Q}\frac{1}{\|g_{k}\|^{2}} \\ &\leq& \frac{\|d_{k-2}\|^{2}}{\left( g_{k-2}^{T}d_{k-2}\right)^{2}}+\frac{P^{2}}{Q}\frac{1}{\|g_{k-1}\|^{2}}+ \frac{P^{2}}{Q}\frac{1}{\|g_{k}\|^{2}} \\ &\leq& \frac{\|d_{1}\|^{2}}{({g_{1}^{T}}d_{1})^{2}}+\frac{P^{2}}{Q}\sum^{k}_{i=2}\frac{1}{\|g_{i}\|^{2}}. \end{array} $$

Hence, using d1 = −g1, P > 1, Q ∈ (0, 1) and (12), we obtain

$$ \begin{array}{@{}rcl@{}} \frac{\|d_{k}\|^{2}}{\left( {g_{k}^{T}}d_{k}\right)^{2}}\leq \frac{P^{2}}{Q}\sum^{k}_{i=1}\frac{1}{\|g_{i}\|^{2}} \leq\frac{P^{2}}{Q}\frac{k}{\gamma^{2}}, \end{array} $$

which further implies that \(\frac {({g_{k}^{T}}d_{k})^{2}}{\|d_{k}\|^{2}}\geq \frac {Q\gamma ^{2}}{P^{2}}\frac {1}{k}\). It is not hard to see that \(\sum\limits _{k=1}^{\infty } \frac {({g_{k}^{T}}d_{k})^{2}}{\|d_{k}\|^{2}}=\infty \). This contradicts Lemma 2, and the proof is therefore complete. □

4 Numerical experiments

To verify the effectiveness and efficiency of FHTTCGMs, we first design a new conjugate parameter for \(\beta _{k}^{\text {new}}\) in the FHTTCGMs. Subsequently, we apply the FHTTCGMs to solve unconstrained optimization and image restoration problems.

In [28], Shi and Guo proposed a family of CGMs, in which the conjugate parameter is defined by

$$ \begin{array}{@{}rcl@{}} \beta_{k}^{\text{SG}}=\frac{{g_{k}^{T}}\left( g_{k}-g_{k-1}\right)}{(1-\mu)\|g_{k-1}\|^{2}-\mu g_{k-1}^{T}d_{k-1}}, \end{array} $$
(15)

where the hybrid parameter μ ∈ [0, 1]. Clearly, the conjugate parameter above is a hybrid of \(\beta _{k}^{\text {PRP}}\) and \(\beta _{k}^{\text {LS}}\). If μ ≠ 1, then formula (15) can be rewritten as

$$ \begin{array}{@{}rcl@{}} \beta_{k}^{\text{SG}}=\frac{1}{1-\mu}\cdot\frac{{g_{k}^{T}}(g_{k}-g_{k-1})}{\|g_{k-1}\|^{2}-\frac{\mu}{1-\mu} g_{k-1}^{T}d_{k-1}}. \end{array} $$
(16)

Inspired by (16), and to avoid the difficulty of choosing the hybrid parameter μ, we design a new conjugate parameter as follows:

$$ \begin{array}{@{}rcl@{}} \beta_{k}^{\mathrm{N}}=\frac{{g_{k}^{T}}(g_{k}-g_{k-1})}{\|g_{k-1}\|^{2}-\sigma g_{k-1}^{T}d_{k-1}}, \end{array} $$
(17)

where σ is the same scalar as that in the weak Wolfe line search (8). Substituting \(\beta _{k}:=\beta _{k}^{\mathrm {N}}\) into (7) to obtain \(\beta _{k}^{\text {new}}\), and then embedding it in the FHTTCGMs, we call the resulting algorithm FHTTCGM-N.
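As a small illustration (ours, not the authors' code), \(\beta _{k}^{\mathrm {N}}\) from (17) can be written so that it plugs directly into the fhttcgm loop sketched in Section 2; the default sigma=0.1 matches the line search parameter used in the experiments below.

```python
def beta_N(g, g_prev, d_prev, sigma=0.1):
    """Conjugate parameter (17); sigma is the weak Wolfe parameter from (8)."""
    return g @ (g - g_prev) / (g_prev @ g_prev - sigma * (g_prev @ d_prev))
```

With the earlier sketches, a rough FHTTCGM-N run would then look like x_star = fhttcgm(f, grad, x0, beta_rule=beta_N).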

4.1 Unconstrained optimization problems

In this subsection, two groups of experiments are conducted. The first group is used to verify the effectiveness of FHTTCGMs. To this end, the conjugate parameter βk in (7) is set to \(\beta _{k}^{\text {HS}}\), \(\beta _{k}^{\text {PRP}}\) and \(\beta _{k}^{\text {LS}}\), and the corresponding methods are denoted by FHTTCGM-HS, FHTTCGM-PRP and FHTTCGM-LS, respectively; they are compared with the original HS, PRP and LS CGMs. The second group is used to show that the proposed FHTTCGM-N is efficient, so some state-of-the-art methods are chosen for comparison: the well-known CG-DESCENT method (HZ) [9], the three-term CGM with restart direction (KD) [31], the three-term CGM with the modified direction structure (MTTLS) [34] and the HD CGM [35].

In both groups of experiments, the same 100 unconstrained problems are tested, where problems 1-53 are taken from the CUTE library [37] and the others come from the unconstrained problem collections [38, 39]. The dimensions of the test problems vary from 11 to 800,000. For fairness, all compared methods use the weak Wolfe line search (8) to compute the step-length αk, with parameters δ = 0.01 and σ = 0.1. For our methods, we set η = 0.4. Moreover, we adopt the strategy described in [40] to compute the initial step-length. The termination criterion is (1) \(\|g_{k}\|\leq 10^{-6}\) or (2) Itr > 2000, where "Itr" denotes the number of iterations. When (2) occurs, we declare that the corresponding algorithm has failed on that test problem and mark it with "F". All codes are written in Matlab 2018b and run on a Lenovo PC with a 3.60 GHz CPU, 8 GB of RAM and the Windows 10 operating system. For the two groups of experiments, we report the number of iterations (Itr), the CPU time (Tcpu) and the final value of ∥gk∥ (∥g∥) in Tables 1, 2, 3 and 4, and use the performance profiles proposed by Dolan and Moré [41] to visually compare the algorithms in terms of Tcpu and Itr. For the interpretation of performance profiles, roughly speaking, the higher a curve lies, the better the corresponding method performs; see [41] for more details.
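For completeness, a performance profile in the sense of Dolan and Moré [41] can be generated from a cost table as in the following sketch (our own script, assuming matplotlib is available; the names T, labels and tau_max are our choices, not from the paper).

```python
import numpy as np
import matplotlib.pyplot as plt

def performance_profile(T, labels, tau_max=10.0):
    """Dolan-More performance profiles. T[i, s] is the cost (e.g. Tcpu or Itr)
    of solver s on problem i; failures can be encoded as np.inf. Plots
    rho_s(tau) = fraction of problems where solver s is within a factor tau
    of the best solver on that problem."""
    ratios = T / T.min(axis=1, keepdims=True)          # performance ratios r_{p,s}
    taus = np.linspace(1.0, tau_max, 200)
    for s, name in enumerate(labels):
        rho = [(ratios[:, s] <= t).mean() for t in taus]
        plt.plot(taus, rho, label=name)
    plt.xlabel("tau"); plt.ylabel("rho(tau)"); plt.legend()
    plt.show()
```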

Table 1 Numerical results for the first group
Table 2 Numerical results for the first group (continued)
Table 3 Numerical results for the second group
Table 4 Numerical results for the second group (continued)

Tables 1 and 2 and Figs. 1 and 2 show that FHTTCGM-PRP, FHTTCGM-HS and FHTTCGM-LS perform better than the corresponding original algorithms, which directly indicates that the proposed FHTTCGM family is effective. From Tables 3 and 4 and Figs. 3 and 4, we see that the curve of the proposed FHTTCGM-N stays at the top and that it solves about 97% of the test problems successfully. Hence, in the second group of experiments, the numerical performance of FHTTCGM-N is superior to that of the other four methods on the given test problems.

Fig. 1
figure 1

Performance profiles on Tcpu of the first group

Fig. 2
figure 2

Performance profiles on Itr of the first group

Fig. 3
figure 3

Performance profiles on Tcpu of the second group

Fig. 4
figure 4

Performance profiles on Itr of the second group

4.2 Image restoration problems

In this part, we use the presented FHTTCGM-N to deal with the image restoration problems.

In [42], a two-phase scheme was utilized to restore images corrupted by impulse noise. In the first phase, a median filter is used to detect noisy pixels. Let X be an image of size M-by-N and \(A =\{1,2,\dots ,M\}\times \{1,2,\dots ,N\}\) be the index set of the image X. Denote by \(\mathcal {N}\subset A\) the set of indices of the noisy pixels detected in the first phase, and let \(|\mathcal {N}|\) denote the number of elements of \(\mathcal {N}\). Let \(\mathcal {V}_{i,j}\) be the set of the four closest neighbors of the pixel at location (i,j) ∈ A, i.e., \(\mathcal {V}_{i,j}=\{(i,j-1),(i,j+1),(i-1,j),(i+1,j)\}\), and let yi,j be the observed pixel value at location (i,j). In the second phase, the noisy pixels are restored by solving the nonsmooth minimization problem

$$ \begin{array}{@{}rcl@{}} \underset{\mathbf{u}}{\min} \sum_{(i, j) \in \mathcal{N}}\left[|u_{i, j}-y_{i, j}|+\frac{\beta}{2}\left( 2 \cdot S_{i, j}^{1}+S_{i, j}^{2}\right)\right], \end{array} $$
(18)

where

$$ S_{i, j}^{1} =\sum_{(m, n) \in \mathcal{V}_{i, j} \backslash \mathcal{N}} \varphi_{\alpha}\left( u_{i, j}-y_{m, n}\right),\ S_{i, j}^{2} =\sum_{(m, n) \in \mathcal{V}_{i, j} \cap \mathcal{N}} \varphi_{\alpha}\left( u_{i, j}-u_{m, n}\right). $$

Here, \(\varphi _{\alpha }(t)=\sqrt {t^{2}+\alpha }\) is an edge-preserving function with parameter α > 0, and \(\mathbf {u} = [u_{i,j}]_{(i,j)\in \mathcal {N}}\) is a column vector of length \(|\mathcal {N}|\) ordered lexicographically. However, solving the nonsmooth minimization problem (18) exactly is time-consuming. Cai et al. [43] removed the nonsmooth term and obtained the following smooth unconstrained optimization problem:

$$ \begin{array}{@{}rcl@{}} \!\!\!\!\!\!\!\!\!\!\underset{\mathbf{u}}{\min} F_{\alpha}(\mathbf{u}):=\sum_{(i,j)\in \mathcal{N}} \left( 2\sum_{(m,n)\in \mathcal{V}_{i,j}\backslash \mathcal{N}} \varphi_{\alpha}(u_{i,j}-y_{m,n})+\sum_{(m,n)\in \mathcal{V}_{i,j}\cap \mathcal{N}} \varphi_{\alpha}(u_{i,j}-u_{m,n})\right). \end{array} $$
(19)

Obviously, the higher the noise ratio, the larger the scale of (19). The authors of [43] discovered that contaminated images can be restored efficiently by applying the CGM to problem (19), even when the noise ratio is as high as 90%. We refer the reader to [44,45,46] and the references therein for further applications of CGMs in image restoration.
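To make the structure of (19) concrete, the sketch below (ours, under our own data-layout assumptions: Y is the observed image, noisy_mask marks the pixels detected in the first phase, and u stacks the unknowns in row-major order; the default alpha=100.0 is only a placeholder) evaluates F_α and its gradient, which is all that a conjugate gradient method needs.

```python
import numpy as np

def F_alpha_and_grad(u, Y, noisy_mask, alpha=100.0):
    """Smoothed objective (19) and its gradient: a sketch under our own
    data-layout assumptions (see the text above)."""
    u = np.asarray(u, dtype=float)
    U = Y.astype(float).copy()
    idx = np.argwhere(noisy_mask)              # noisy-pixel coordinates, row-major
    U[noisy_mask] = u                          # plug the current estimate into the image
    phi  = lambda t: np.sqrt(t * t + alpha)    # edge-preserving function phi_alpha
    dphi = lambda t: t / np.sqrt(t * t + alpha)
    M, N = Y.shape
    F, grad = 0.0, np.zeros_like(u)
    for p, (i, j) in enumerate(idx):
        for (m, n) in ((i, j - 1), (i, j + 1), (i - 1, j), (i + 1, j)):
            if not (0 <= m < M and 0 <= n < N):
                continue                       # neighbour falls outside the image
            t = U[i, j] - U[m, n]
            if noisy_mask[m, n]:
                F += phi(t)                    # symmetric term is added when (m, n) is the centre
            else:
                F += 2.0 * phi(t)              # clean neighbours carry weight 2 in (19)
            grad[p] += 2.0 * dphi(t)           # derivative w.r.t. u_{i,j} in both cases
    return F, grad
```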

Now, we focus on applying the two-phase scheme to remove salt-and-pepper noise, which is a special case of impulse noise. In the first phase, we use the adaptive median filter [47] to detect noisy pixels. In the second phase, we use FHTTCGM-N to solve (19) and compare it with the PRP CGM used in [43], the classical PRP CGM and the HZ method. Notice that both the classical PRP CGM and the HZ method use the weak Wolfe line search (8) to compute the step-length αk, while the PRP CGM used in [43] adopts an explicit expression to obtain αk. For convenience, we denote the PRP CGM used in [43] by PRP, and the classical PRP CGM with the weak Wolfe line search by PRP-W. The step-length formula and the related parameters for PRP are the same as those in [43].

The test images are Boat(512 × 512), Hill(512 × 512), Lena(512 × 512) and Man(512 × 512). All the compared methods use the following stopping criterion

$$ \text{Itr}>300\ \text{or}\ \frac{|F_{\alpha}(\mathbf{u}_{k})-F_{\alpha}(\mathbf{u}_{k-1})|}{|F_{\alpha}(\mathbf{u}_{k})|}\leq 10^{-4}. $$

Throughout this part, the operating environment is the same as in Section 4.1. To assess the restoration quality quantitatively, we adopt the peak signal-to-noise ratio (PSNR, see [48]), defined as

$$ \text{PSNR} = 10 \log_{10}\frac{255^{2}}{\frac{1}{MN}{\sum}_{i,j}\left( x_{i,j}^{r}-x_{i,j}^{*}\right)^{2}}, $$

where \(x_{i,j}^{r}\) and \(x_{i,j}^{*}\) denote the pixel values of the restored image and the original one, respectively.
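A direct NumPy transcription of this definition for 8-bit images might read as follows (our own sketch; the argument names are assumptions).

```python
import numpy as np

def psnr(restored, original):
    """Peak signal-to-noise ratio as defined above (8-bit grayscale images)."""
    mse = np.mean((restored.astype(float) - original.astype(float)) ** 2)
    return 10.0 * np.log10(255.0 ** 2 / mse)
```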

Table 5 reports the number of iterations (Itr), the CPU time (Tcpu) and the PSNR values of the restored images. To save space, we only plot the original, noisy and restored images for the four algorithms when the salt-and-pepper noise ratio is 70% and 90%; see Figs. 5 and 6 for the corresponding results. From Table 5, we observe that the proposed FHTTCGM-N usually requires less time than the other three algorithms. Moreover, the PSNR values of the images restored by FHTTCGM-N are often higher than those of the other three algorithms, except in a few cases. It is noticeable that the HZ method is usually superior to PRP and PRP-W in terms of Itr, Tcpu and PSNR; in this regard, we believe that the numerical performance of the HZ method would improve further if its own line search were used to compute the step-length. Overall, the proposed FHTTCGM-N is superior to the other three methods on the given test images.

Fig. 5
figure 5

The original images (first row), the noisy images with 70% salt-and-pepper noise (second row) and the images restored by FHTTCGM-N (third row), PRP (fourth row), PRP-W (fifth row) and HZ (last row)

Fig. 6
figure 6

The original images (first row), the noisy images with 90% salt-and-pepper noise (second row) and the images restored by FHTTCGM-N (third row), PRP (fourth row), PRP-W (fifth row) and HZ (last row)

Table 5 Numerical results for image restoration problems

5 Conclusions

In this work, we propose a family of hybrid three-term conjugate gradient methods (FHTTCGMs), in which the search direction always satisfies the descent condition, independently of the choices of the conjugate parameter and the line search. Under some mild conditions, the global convergence of the proposed family is established. By embedding the classical HS, PRP and LS conjugate parameters in the FHTTCGMs, respectively, the numerical comparisons show that the resulting methods, and hence the family, are very promising. Moreover, we design a specific conjugate parameter for the family and thus obtain a concrete method. Finally, we apply it to medium- and large-scale unconstrained optimization and image restoration problems; the numerical results illustrate its encouraging efficiency and applicability, even compared with state-of-the-art methods.