1 Introduction

Blind image deblurring has become an important research topic in the field of image processing and computer vision. It is a challenging and interesting problem and has been applied widely in various fields, including biomedicine, aerospace, and public safety. A common type of blurring is motion blurring, which is caused by the motion of an object relative to the camera during the exposure time.

When the motion blurring is uniform and spatially invariant, the relationship between the latent sharp image \( L \) and the observed blurred image \( B \) can be established by using the following model:

$$ B = L * k + n, $$
(1)

where *, \( k \), and \( n \) represent the convolution operator, blur kernel, and additive noise, respectively. According to (1), we need to restore both \( L \) and \( k \), with the blurred image \( B \) as the only input. This problem is challenging and ill-posed because infinitely many solution pairs (\( L \), \( k \)) correspond to the same \( B \). In addition, the presence of noise makes blind restoration even more difficult. Therefore, additional constraints on \( L \) and \( k \) are required to ensure that the final optimized solution is close to the true solution.
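As a concrete illustration of (1), the short NumPy sketch below synthesizes a blurred observation from a stand-in sharp image; the box kernel and noise level are arbitrary choices of ours, not values from any reference implementation.

```python
import numpy as np
from scipy.signal import convolve2d

rng = np.random.default_rng(0)
L = rng.random((64, 64))                    # stand-in for the latent sharp image
k = np.full((5, 5), 1.0 / 25.0)             # simple normalized box blur kernel
n = 0.01 * rng.standard_normal(L.shape)     # weak additive Gaussian noise

B = convolve2d(L, k, mode="same", boundary="wrap") + n   # observed image, Eq. (1)
```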

1.1 Previous Work

In recent decades, many scholars have carried out in-depth research on blind deblurring and have proposed numerous fruitful blind deblurring methods [2,3,4, 7, 11, 14, 18, 19, 21, 22, 25,26,27,28, 32, 33, 35,36,37,38,39,40,41,42,43,44,45, 49, 51, 52]. These methods can be classified into three main categories: explicit edge prediction strategies, image statistical priors, and deep-learning-based methods.

1.1.1 Explicit Edge Prediction

Because strong edges have a significant beneficial effect on blur kernel estimation, many methods based on explicit edge prediction have been proposed. Joshi et al. [18] directly detected and predicted latent sharp edges to improve blur kernel estimation. Cho et al. [4] used a combination of bilateral filtering, shock filtering, and edge gradient thresholding to predict salient edges. Xu and Jia [49] developed an effective salient edge selection strategy and proposed a two-phase kernel estimation framework. Subsequently, Pan et al. [37] proposed a self-adaptive edge selection algorithm. Although explicit edge prediction methods are effective for blind image recovery, they rely heavily on heuristic filters. These methods are likely to amplify noise and over-sharpen images, and salient edges are not always present in natural images.

1.1.2 Prior Based on Image Statistics

To mitigate the negative impact of heuristic filters on the selection of strong edges, numerous methods based on image statistical priors have been proposed. To model the heavy-tailed distribution of natural image gradients, Fergus et al. [11] applied a Gaussian mixture model and performed kernel estimation through a variational Bayesian framework. Shan et al. [42] connected two piecewise functions to approximate the image gradient distribution. Levin et al. [25, 26] analyzed the limitations of the naive maximum a posteriori approach, which favors the trivial solution of a delta kernel paired with the blurred image itself, and proposed an effective maximum-marginalization optimization algorithm. Krishnan et al. [21] proposed the normalized sparsity prior and provided a new L1/L2 regularization function to estimate the kernel.

To enhance the search for strong edges, Xu et al. [51] proposed a generalized L0 sparse gradient prior, which can extract strong edges quickly and efficiently. Inspired by this work, many methods utilize L0 regularization to enhance the sparsity of image gradients and incorporate various image priors to improve kernel estimation. Pan et al. [35] further adopted L0-regularized priors on intensity and gradient to deblur text images. Li et al. [27] applied L0 regularization to the image gradient and the kernel intensity. Interestingly, Pan et al. [38] observed that the dark channel of a clear image is sparser than that of the blurred image, and that L0 regularization of the dark channel can further improve restoration performance. However, not all images have obvious dark channels. Therefore, Yan et al. [52] proposed an extreme channel prior, which integrates the dark channel and bright channel priors of the image. Recently, Li et al. [28] combined a convolutional neural network (CNN) with the L0-regularized gradient for blind deblurring. Chen et al. [3] developed a local maximum gradient prior and combined it with the L0-regularized gradient to obtain good results, but this method is sensitive to noise.

Most of the above methods consider only the relationship between adjacent pixel pairs, the simplicity of pixel intensities, or both, and ignore the fact that salient structures depend on a larger range of pixels. Consequently, these methods cannot fully and effectively restore images with complex geometries. To avoid such restrictions, image patch-based methods have been proposed for image restoration [10, 14, 31, 32, 43, 44, 55]. Zoran et al. [55] proposed a Gaussian mixture prior, which is learned from natural image patches and restores images using patch likelihoods. Sun et al. [43] learned two types of patch priors, from natural images and from synthetic structures, to model image edge primitives. Michaeli et al. [32] used the internal patch recurrence property for kernel estimation, because cross-scale internal patches recur in clear images but are significantly reduced in blurred images. Guo et al. [14] proposed an adaptive edge-based patch prior to reconstruct salient edges and other features. Tang et al. [44] used external patch priors, combined with a sparse representation method, for kernel estimation. In general, a patch-based prior covers more pixels than an image gradient or intensity prior can. Therefore, patch-based priors are more beneficial for complex structure extraction and noise suppression.

To extend the effectiveness of patch-based priors further, low-rank matrix approximation (LRMA) methods, based on non-local similar patches, have been extensively studied and successfully applied to many vision tasks, such as image or video denoising [9, 13, 16, 48], non-blind deblurring [8, 30, 53], blind deblurring [7, 40], and other tasks [1, 9, 12, 17, 29, 48, 53]. LRMA methods can be classified into two categories: nuclear norm minimization (NNM) methods and low-rank matrix factorization methods. In this paper, we mainly study the former type as a regularization term. Although conventional NNM has been used widely in image restoration, it still has some limitations. To pursue convex solutions, standard NNM penalizes each singular value equally. However, this is unreasonable and limits its flexibility in dealing with various practical problems. Because the singular values of a matrix differ in meaning and importance, they should be treated differently.

To promote the flexibility and effectiveness of NNM, the weighted nuclear norm minimization (WNNM) model was proposed by Gu et al. [12, 13]; this model assigns a different weight to each singular value. WNNM imposes less penalty on larger singular values than on smaller ones, thereby retaining the main structure of the image more rationally. Subsequently, Ma et al. [30] used WNNM and total variation regularization to perform non-blind deblurring of the images. Ren et al. [40] exploited the WNNM low-rank prior to perform blind deblurring.

Inspired by Schatten p-norm minimization (\( 0 \le p \le 1 \)) sparse optimization algorithms [34, 56] and WNNM [13], Xie et al. [48] proposed a novel low-rank prior, weighted Schatten p-norm minimization (WSNM), and applied it to background subtraction and image denoising. When considering different rank components, WSNM has more flexibility than WNNM and can better approximate the original LRMA problem. It is worth noting that WNNM is simply the special case of WSNM in which \( p = 1 \). Zha et al. [53] further developed an alternating direction method of multipliers to solve the WSNM model, and applied this method to non-blind restoration.

In blind image deblurring, low-rank priors have inherent advantages. They favor sharp images over blurred ones [7] and produce sharper intermediate latent images for kernel estimation, by eliminating harmful subtle details while preserving the main structures [40]. Although two low-rank-based methods [7, 40] have been proposed for blind deblurring, both have limitations. The method of [7] combines explicit salient edge extraction with conventional NNM for kernel estimation; its edge extraction relies on a conventional structure-texture decomposition strategy and heuristic filtering, which significantly increases the complexity of the algorithm. In addition, the negative influence of heuristic filtering and the inflexibility of NNM reduce the quality of the restoration. The method of [40] employs the WNNM of image intensity and gradient to recover the image. However, the low-rank minimization of the gradient greatly increases the computational complexity of the algorithm. Moreover, this method employs the low-rank prior only in the finest pyramid layer, reducing the role of the low-rank prior in the entire blind restoration. In addition, the WNNM prior still lacks sufficient flexibility and accuracy, compared with the more recent WSNM prior.

1.1.3 Method Based on Deep Learning

Recently, many deblurring methods [2, 22, 28, 33, 41, 45] based on CNNs have been proposed. However, because of the variability and complexity of real-world blurred images, most existing CNN-based methods have difficulty restoring real-world blurred images as effectively as conventional optimization-based methods, especially under large-scale motion blur. Therefore, this introduction focuses mainly on the closely related optimization-based methods.

1.2 Motivation and Proposed Approach

The motivation for our work is twofold. First, in light of recent research, low-rank priors show clear advantages and great potential for image restoration. However, existing low-rank blind deblurring methods [7, 40] still have some limitations. Therefore, we can develop better blind deblurring methods based on the latest low-rank priors. Second, several state-of-the-art blind deblurring algorithms adopt a strategy that effectively combines the L0-regularized gradient prior with different image priors, to further improve kernel estimation. However, these image priors have limitations, which reduce the universality and effectiveness of the methods. Therefore, we can develop a more universal, sparse, and efficient prior to replace the existing image priors.

Based on the above two aspects, we propose a new blind deblurring method by extending the application of the flexible WSNM prior and the L0-regularized gradient prior. Our main contributions are summarized as follows:

  1. We propose a new blind image deblurring model based on the low-rank prior and the L0-regularized gradient prior. The L0-regularized gradient prior can extract the main edges quickly, and our low-rank prior can further eliminate harmful subtle details while preserving the main edges. To implement the low-rank regularization effectively, the more accurate and flexible weighted Schatten p-norm minimization (WSNM) is employed.

  2. To solve our model effectively, an iterative optimization algorithm, based on the half-quadratic splitting (HQS) strategy and the generalized soft-thresholding (GST) algorithm, is developed.

  3. The validity of the WSNM prior is demonstrated by extensive experiments. Our low-rank prior enhances the sparsity and self-similarity of the intermediate latent image, thereby improving kernel estimation.

  4. Our algorithm achieves outstanding results on both natural images and domain-specific images. In addition, our method can be extended naturally to non-uniform deblurring.

The remainder of this paper is organized as follows. Section 2 describes the WSNM prior employed in our model. Section 3 shows the proposed model and derives a numerical optimization algorithm, based on HQS and GST, to solve the non-convex regularization terms. Section 4 shows that our algorithm can be effectively extended to non-uniform deblurring. Section 5 presents experimental comparisons with other state-of-the-art methods. Section 6 analyzes and discusses the effectiveness of the proposed algorithm. Section 7 concludes the paper.

2 Weighted Schatten p-norm Minimization Prior

To enhance the low-rank prior, inspired by Schatten p-norm minimization [34, 56] and WNNM [13], Xie et al. [48] proposed a weighted Schatten p-norm minimization (WSNM) model:

$$ \mathop {{\mathrm{min}} }\limits_{X} \left\| {Y - X} \right\|_{F}^{2} + \left\| X \right\|_{{w,S_{p} }}^{p} , $$
(2)

where \( X \) and \( Y \) are the desired low-rank approximation matrix and the degraded observation matrix, respectively. The first term in (2) is the F-norm data fidelity term, and the second term is the low-rank regularization term. \( \left\| X \right\|_{{w,S_{p} }} \) represents the weighted Schatten p-norm of matrix \( X \in \Re^{m \times n} \), defined as

$$ \left\| X \right\|_{{w,S_{p} }} = \left( {\sum\nolimits_{i = 1}^{{{\mathrm{min}} \left\{ {n,m} \right\}}} {w_{i} } \sigma_{i}^{p} } \right)^{{\frac{1}{p}}} , $$
(3)

where \( 0 \le p \le 1 \); \( w = \left\{ {w_{1} , \ldots ,w_{{{\mathrm{min}} (n,m)}} } \right\} \) is a nonnegative weight vector with \( w_{i} \ge 0 \); and \( \sigma_{i} \) is the ith singular value of \( X \). \( \left\| X \right\|_{{w,S_{p} }} \) raised to the power \( p \) is

$$ \left\| X \right\|_{{w,S_{p} }}^{p} = \sum\nolimits_{i = 1}^{{{\mathrm{min}} \left\{ {n,m} \right\}}} {w_{i} } \sigma_{i}^{p} = {\mathrm{tr}}(W\Delta^{p} ), $$
(4)

where \( \Delta \) and \( W \) are diagonal matrices composed of all values of \( \sigma_{i} \) and \( w_{i} \), respectively. Because \( \left\| X \right\|_{{w,S_{p} }}^{p} \) contains the weight vector \( w \) and the non-convex Schatten p-norm, problem (2) is difficult to solve effectively. To obtain the optimal solution, we introduce the following theorem:

Theorem 1

Let \( Y = U\Sigma V^{{\mathrm{T}}} \) be the singular value decomposition (SVD) of \( Y \in \Re^{m \times n} \), with \( \Sigma = {\text{diag}}(\delta_{1} , \ldots ,\delta_{r} ) \) and \( r = {\mathrm{min}} \left( {m,n} \right) \); then the optimal solution to problem (2) is \( X = U\Delta V^{{\mathrm{T}}} \), where \( \Delta = {\text{diag}}(\sigma_{1} , \ldots ,\sigma_{r} ) \), and each singular value \( \sigma_{i} \) of \( X \) can be obtained by solving the following problem:

$$ \left\{ {\begin{array}{*{20}c} {\mathop {{\mathrm{min}} }\limits_{{\sigma_{i} }} \sum\limits_{i = 1}^{r} {\left[ {(\delta_{i} - \sigma_{i} )^{2} + w_{i} \sigma_{i}^{p} } \right]} } \\ {{\text{s.t.}}\,\sigma_{i} \ge 0,\quad \sigma_{i} \ge \sigma_{j},\quad {\text{for}}\,i \le j} \\ \end{array} } \right.. $$
(5)

(The detailed proof can be found in [48].) Therefore, solving problem (2) can be transformed to solving problem (5); the GST algorithm [56] can solve problem (5) as follows:

$$ \sigma_{i} = GST(\delta_{i} ,w_{i} ,p,t), $$
(6)

where \( t \) represents the number of iterations. The weights \( w_{i} \) are in non-descending order: \( 0 \le w_{1} \le \cdots \le w_{r} \). This means that larger singular values represent the main components and should be penalized less, whereas smaller values represent harmful details and noise and should be penalized more. Therefore, the preservation of the main data components can be guaranteed. More details can be found in [48, 56]. In blind deblurring, this weight setting is favorable for preserving significant edges for kernel estimation.
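For illustration, the following minimal Python sketch implements the GST operator of [56] for the scalar problem \( \min_x \frac{1}{2}(x - y)^2 + \lambda |x|^p \); because (5) omits the factor 1/2, each weighted subproblem in (5) can be matched by setting \( \lambda = w_i / 2 \). The function name and default iteration count are our own choices, not from [56].

```python
import numpy as np

def gst(y, lam, p, n_iter=3):
    """Generalized soft-thresholding for min_x 0.5*(x - y)**2 + lam*|x|**p."""
    # Threshold below which the minimizer is exactly zero, per [56].
    tau = (2.0 * lam * (1.0 - p)) ** (1.0 / (2.0 - p)) \
        + lam * p * (2.0 * lam * (1.0 - p)) ** ((p - 1.0) / (2.0 - p))
    if abs(y) <= tau:
        return 0.0
    x = abs(y)                               # initialize at |y|
    for _ in range(n_iter):                  # fixed point of x = |y| - lam*p*x^(p-1)
        x = abs(y) - lam * p * x ** (p - 1.0)
    return float(np.sign(y) * x)
```

For \( p = 1 \), the threshold reduces to \( \lambda \) and the operator becomes ordinary soft thresholding, consistent with WNNM being the \( p = 1 \) special case of WSNM.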

3 Model and Optimization

Based on the above introduction and analysis, we propose a novel image blind deblurring model, which is based on the WSNM prior and L0-regularized gradient prior, and develop an effective optimization algorithm to estimate the kernels. Our model is defined as follows:

$$ \mathop {{\mathrm{min}}}\limits_{L,k} \left\| {L * k - B} \right\|_{2}^{2} + \gamma \left\| k \right\|_{2}^{2} + \mu \left\| {\nabla L} \right\|_{0} + \lambda R(L), $$
(7)

where \( \gamma \), \( \mu \), and \( \lambda \) are the weight parameters corresponding to each term. \( \nabla = (\nabla_{{\mathrm{h}}} ,\nabla_{{\mathrm{v}}} ) \) is the gradient operator; \( \nabla_{{\mathrm{v}}} \) and \( \nabla_{{\mathrm{h}}} \) denote the vertical and horizontal directions of the gradient operator, respectively.

The first term in (7) denotes data fidelity, which constrains the convolution of the clear image \( L \) with the blur kernel \( k \) to be close to the blurred image \( B \). The second term constrains the estimated kernel, to obtain a stable solution. The third term is the L0-regularized gradient prior, which preserves large gradients and removes harmful fine structures. The last term is the WSNM prior, which uses the low-rank property of non-local self-similar patches to further eliminate harmful micro-textures and noise. \( R(L) \) is defined as

$$ R(L) = \sum\limits_{i = 1}^{d} {\left\| {L_{i} } \right\|_{{w,S_{p} }}^{p} } . $$
(8)

Image \( L \in \Re^{N} \) is divided into \( d \) overlapping patches \( l_{i} \) of size \( \sqrt n \times \sqrt n \), \( i = 1, \ldots ,d \). For each exemplar patch \( l_{i} \), the \( m \) most similar patches are collected within an \( S \times S \) search window and stacked into the matrix \( L_{i} \in \Re^{n \times m} \), whose columns are the \( m \) vectorized similar patches, i.e., \( L_{i} = \left\{ {l_{i1} ,l_{i2} , \ldots ,l_{im} } \right\} \), where \( l_{im} \) is the mth similar patch of the ith group \( L_{i} \). Because the \( m \) similar patches have consistent geometric structures, the matrix \( L_{i} \) formed by stacking them has a low-rank property, and our model (7) can be reformulated as

$$ \mathop {{\mathrm{min}}}\limits_{L,k} \left\| {L * k - B} \right\|_{2}^{2} + \gamma \left\| k \right\|_{2}^{2} + \mu \left\| {\nabla L} \right\|_{0} + \lambda \sum\limits_{i = 1}^{d} {\left\| {L_{i} } \right\|_{{w,S_{p} }}^{p} } . $$
(9)
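Before turning to the optimization, the grouping step underlying the last term can be made concrete. The sketch below is illustrative code of ours (the paper uses the block matching of [6], and the value of `m` here is a placeholder): it collects the \( m \) patches most similar to an exemplar patch within the search window and stacks them as the columns of \( L_i \).

```python
import numpy as np

def group_similar_patches(img, top, left, patch=9, m=16, search=30):
    """Stack the m patches most similar to img[top:top+patch, left:left+patch]."""
    ref = img[top:top + patch, left:left + patch]
    t0 = max(0, top - search // 2); t1 = min(img.shape[0] - patch, top + search // 2)
    l0 = max(0, left - search // 2); l1 = min(img.shape[1] - patch, left + search // 2)
    cands, dists = [], []
    for t in range(t0, t1 + 1):
        for l in range(l0, l1 + 1):
            q = img[t:t + patch, l:l + patch]
            cands.append(q.reshape(-1))
            dists.append(np.sum((q - ref) ** 2))   # Euclidean patch distance
    order = np.argsort(dists)[:m]                  # indices of the m most similar patches
    return np.stack([cands[j] for j in order], axis=1)   # n x m matrix L_i
```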

Problem (9) involves solving for two unknown variables, \( L \) and \( k \). To optimize (9), we divide it into two subproblems (of \( L \) and \( k \)) to minimize alternately, where the subproblem for \( L \) is

$$ \mathop{{\mathrm{min}}} \limits_{L} \left\| {L * k - B} \right\|_{2}^{2} + \mu \left\| {\nabla L} \right\|_{0} + \lambda \sum\limits_{i = 1}^{d} {\left\| {L_{i} } \right\|_{{w,S_{p} }}^{p} } , $$
(10)

and the subproblem for \( k \) is

$$ \mathop {{\mathrm{min}}}\limits_{k} \left\| {L * k - B} \right\|_{2}^{2} + \gamma \left\| k \right\|_{2}^{2} . $$
(11)

3.1 Intermediate Latent Image \( L \) Estimation

Because subproblem (10) includes both the L0-regularized gradient and WSNM terms, which are non-convex, performing the calculation directly is difficult. We adopt the HQS strategy [50] and the GST algorithm [56] to optimize these two non-convex terms.

First, we adopt the HQS strategy, where \( u \) and \( g = \left( {g_{{\mathrm{h}}} ,g_{{\mathrm{v}}} } \right) \) are introduced as two new auxiliary variables; \( u \) and \( g \) correspond to the latent image \( L \) and image gradient \( \nabla L \), respectively. Subproblem (10) can be reformulated as

$$ \mathop {{\mathrm{min}} }\limits_{L,g,u} \left\| {L * k - B} \right\|_{2}^{2} + \alpha \left\| {\nabla L - g} \right\|_{2}^{2} + \beta \left\| {L - u} \right\|_{2}^{2} + \mu \left\| g \right\|_{0} + \lambda \sum\limits_{i = 1}^{d} {\left\| {u_{i} } \right\|_{{w,S_{p} }}^{p} } , $$
(12)

where \( \alpha \) and \( \beta \) are positive regularization parameters. We can solve (12) by alternately minimizing \( L \), \( u \), and \( g \), to avoid solving the non-convex L0 gradients and the weighted Schatten p-norm directly.

We fix \( u \) and \( g \), to solve the intermediate latent image \( L \) by optimizing the following objective function:

$$ \mathop {{\mathrm{min}} }\limits_{L} \left\| {L * k - B} \right\|_{2}^{2} + \alpha \left\| {\nabla L - g} \right\|_{2}^{2} + \beta \left\| {L - u} \right\|_{2}^{2} . $$
(13)

This is a least squares problem, and its closed-form solution can be computed efficiently via the fast Fourier transform (FFT):

$$ L = F^{ - 1} \left( {\frac{{\overline{F(k)} F(B) + \alpha F_{G} + \beta F(u)}}{{\overline{F(k)} F(k) + \alpha \overline{F(\nabla )} F(\nabla ) + \beta }}} \right), $$
(14)

where \( F( \cdot ) \) and \( F^{ - 1} ( \cdot ) \) denote FFT and inverse FFT, respectively, \( \overline{F( \cdot )} \) denotes the complex conjugate operator, and \( F_{G} = \overline{{F(\nabla_{{\mathrm{h}}} )}} F(g_{{\mathrm{h}}} ) + \overline{{F(\nabla_{{\mathrm{v}}} )}} F(g_{{\mathrm{v}}} ) \).
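A NumPy sketch of the update (14) follows; `psf2otf` zero-pads the kernel to the image size and centers it at the origin before the FFT, and the gradient operators are represented by small forward-difference kernels. Function names and boundary conventions are ours, for illustration only.

```python
import numpy as np

def psf2otf(k, shape):
    """Zero-pad kernel k to `shape`, center it at the origin, and take its FFT."""
    pad = np.zeros(shape)
    pad[:k.shape[0], :k.shape[1]] = k
    pad = np.roll(pad, (-(k.shape[0] // 2), -(k.shape[1] // 2)), axis=(0, 1))
    return np.fft.fft2(pad)

def update_L(B, k, g_h, g_v, u, alpha, beta):
    """Closed-form solution (14) of the least squares problem (13) via FFT."""
    Fk = psf2otf(k, B.shape)
    Fdh = psf2otf(np.array([[1.0, -1.0]]), B.shape)   # horizontal forward difference
    Fdv = psf2otf(np.array([[1.0], [-1.0]]), B.shape) # vertical forward difference
    FG = np.conj(Fdh) * np.fft.fft2(g_h) + np.conj(Fdv) * np.fft.fft2(g_v)
    num = np.conj(Fk) * np.fft.fft2(B) + alpha * FG + beta * np.fft.fft2(u)
    den = np.abs(Fk) ** 2 + alpha * (np.abs(Fdh) ** 2 + np.abs(Fdv) ** 2) + beta
    return np.real(np.fft.ifft2(num / den))
```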

Fixing \( u \) and \( L \), we can solve \( g \) by

$$ \mathop {{\mathrm{min}} }\limits_{g} \alpha \left\| {\nabla L - g} \right\|_{2}^{2} + \mu \left\| g \right\|_{0} . $$
(15)

Because (15) is a pixel-wise minimization problem, it can be solved element-wise, following [50], to obtain \( g \):

$$ g = \left\{ \begin{array}{ll} \nabla L,&\quad \left\| {\nabla L} \right\|^{2} \ge \frac{\mu }{\alpha } \hfill \\ 0,&\quad {\text{otherwise}}. \hfill \\ \end{array} \right. $$
(16)
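In code, (16) reduces to a single mask: compute the forward differences of \( L \) and zero out every pixel whose squared gradient magnitude falls below \( \mu / \alpha \). The circular differences below are an assumption made to stay consistent with the FFT formulation.

```python
import numpy as np

def update_g(L, mu, alpha):
    """Pixel-wise hard thresholding (16) of the gradients of L."""
    g_h = np.diff(L, axis=1, append=L[:, :1])   # horizontal circular forward difference
    g_v = np.diff(L, axis=0, append=L[:1, :])   # vertical circular forward difference
    mask = (g_h ** 2 + g_v ** 2) >= mu / alpha  # keep only sufficiently strong gradients
    return g_h * mask, g_v * mask
```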

Fixing \( g \) and \( L \), we can solve \( u \) by

$$ \mathop {{\mathrm{min}} }\limits_{u} \beta \left\| {L - u} \right\|_{2}^{2} + \lambda \sum\limits_{i = 1}^{d} {\left\| {u_{i} } \right\|_{{w,S_{p} }}^{p} } . $$
(17)

However, the second term is severely non-convex and has a complex structure, so optimizing (17) directly is difficult. To solve it more efficiently, we introduce the following result, based on [53, 54].

Theorem 2

Define \( L,u \in \Re^{N} \), \( L_{i} ,u_{i} \in \Re^{n \times m} \), and error vector \( e = L - u \); \( e(j) \) is each element of \( e \) and \( j = 1, \ldots,N \). Suppose that \( e(j) \) satisfies the independent zero-mean distribution and that its variance is \( \sigma^{2} \). Then, for any \( \varepsilon > 0 \), the following property describes the relationship between \( \left\| {L - u} \right\|_{2}^{2} \) and \( \sum\limits_{i = 1}^{d} {\left\| {L_{i} - u_{i} } \right\|_{F}^{2} } \):

$$ \mathop {\lim }\limits_{\begin{subarray}{l} N \to \infty \\ R \to \infty \end{subarray} } P\left\{ {\left| {\frac{1}{N}\left\| {L - u} \right\|_{2}^{2} - \frac{1}{R}\sum\limits_{i = 1}^{d} {\left\| {L_{i} - u_{i} } \right\|_{F}^{2} } } \right| < \varepsilon } \right\} = 1, $$
(18)

where \( P( \cdot ) \) denotes the probability, and \( R = d \times n \times m \).

Proof

Each \( e(j) \) is assumed to be an independent zero-mean distribution with variance \( \sigma^{2} \), i.e., \( Var[e(j)] = \sigma^{2} \) and \( E[e(j)] = 0 \). Therefore, each \( e(j)^{2} \) is also independent, and the mean of each \( e(j)^{2} \) is

$$ E[e(j)^{2} ] = [E[e(j)]]^{2} + Var[e(j)] = \sigma^{2} ,\quad j = 1,2, \ldots ,N. $$
(19)

By invoking the law of large numbers in probability theory, for any \( \varepsilon > 0 \), \( \mathop {\lim }\limits_{N \to \infty } P\left\{ {\left| {\frac{1}{N}\sum\nolimits_{j = 1}^{N} {e(j)^{2} } - \sigma^{2} } \right|} \right.\left. { < \frac{\varepsilon }{2}} \right\} = 1 \), i.e.,

$$ \mathop {\lim }\limits_{N \to \infty } P\left\{ {\left| {\frac{1}{N}\left\| {L - u} \right\|_{2}^{2} - \sigma^{2} } \right|} \right.\left. { < \frac{\varepsilon }{2}} \right\} = 1. $$
(20)

Further, we let \( {\mathbf{L}} \) and \( {\mathbf{u}} \) denote the concatenation of all groups \( L_{i} \) and \( u_{i} \), \( i = 1, \ldots,d \), respectively, and denote the error of each element of \( {\mathbf{L}} - {\mathbf{u}} \) by \( e(r) \), \( r = 1,2, \ldots ,R \). Suppose \( e(r) \) also follows an independent zero-mean distribution with variance \( \sigma^{2} \).

Therefore, a process similar to that mentioned above is applied to \( e(r)^{2} \) to obtain \( \mathop {\lim }\limits_{R \to \infty } P\left\{ {\left| {\frac{1}{R}\sum\nolimits_{r = 1}^{R} {e(r)^{2} } - \sigma^{2} } \right|} \right.\left. { < \frac{\varepsilon }{2}} \right\} = 1 \), i.e.,

$$ \mathop {\lim }\limits_{R \to \infty } P\left\{ {\left| {\frac{1}{R}\sum\nolimits_{i = 1}^{d} {\left\| {L_{i} - u_{i} } \right\|}_{F}^{2} - \sigma^{2} } \right|} \right.\left. { < \frac{\varepsilon }{2}} \right\} = 1. $$
(21)

Considering (20) and (21) together, we prove (18).□

Therefore, according to Theorem 2, in each iteration we obtain an equality that holds with high probability (approaching 1):

$$ \frac{1}{N}\left\| {L - u} \right\|_{2}^{2} = \frac{1}{R}\sum\limits_{i = 1}^{d} {\left\| {L_{i} - u_{i} } \right\|_{F}^{2} } . $$
(22)

From (17) and (22), we obtain the following equation:

$$ \mathop {{\mathrm{min}} }\limits_{u} \left\| {L - u} \right\|_{2}^{2} + \frac{\lambda }{\beta }\sum\limits_{i = 1}^{d} {\left\| {u_{i} } \right\|_{{w,S_{p} }}^{p} } = \mathop {{\mathrm{min}} }\limits_{u} \sum\limits_{i = 1}^{d} {\left( {\left\| {L_{i} - u_{i} } \right\|_{F}^{2} + \frac{\lambda R}{\beta N}\left\| {u_{i} } \right\|_{{w,S_{p} }}^{p} } \right)} . $$
(23)

According to (2), (5), and (6), the optimal solution of (23) can be obtained by the GST algorithm [56], as follows:

$$ \sigma_{ij} (u_{i} ) = GST\left( {\delta_{ij} (L_{i} ),\eta w_{ij} ,p,t} \right), $$
(24)

where \( \delta_{ij} (L_{i} ) \) is the jth singular value of the stacked similar-patch matrix \( L_{i} \), \( \sigma_{ij} (u_{i} ) \) is defined analogously for \( u_{i} \), and \( \eta = \lambda R/(\beta N) \). Because large singular values contain the main structure of the image, they should be penalized less, whereas small ones mainly contain harmful details and should be penalized more. Therefore, following [48], we define the weight \( w_{ij} \) as

$$ w_{ij} = \frac{{2\sqrt {2m} }}{{(\sigma_{ij}^{{{1 \mathord{\left/ {\vphantom {1 p}} \right. \kern-0pt} p}}} (u_{i} ) + \varepsilon )}}, $$
(25)

where \( \varepsilon \) is a very small constant and \( m \) is the number of similar patches. Because \( \sigma_{ij} (u_{i} ) \) is unknown before \( u_{i} \) is estimated, we follow [40, 48] and initialize \( \sigma_{ij} (u_{i} ) \) as

$$ \sigma_{ij} (u_{i} ) = \sqrt {{\mathrm{max}} (\delta_{ij}^{2} (L_{i} ) - ms^{2} ,0)} . $$
(26)

where \( s \) in (26) denotes the size of the blur kernel. Each singular value of \( u_{i} \) is calculated from (24) to form \( u_{i} = U\Delta V^{{\mathrm{T}}} \), where \( \Delta = {\text{diag}}(\sigma_{i1} (u_{i} ), \ldots ,\sigma_{ir} (u_{i} )) \), \( r = {\mathrm{min}} \left( {n,m} \right) \), and \( i = 1, \ldots ,d \). Finally, all groups \( u_{i} \) are aggregated, to reconstruct \( u \).
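Combining (24)-(26), one WSNM proximal step on a single stacked group \( L_i \) can be sketched as follows; `gst` is the earlier sketch, and the factor 1/2 in the call maps the unweighted objective of (5) onto that sketch's convention. Names and defaults are ours.

```python
import numpy as np

def wsnm_group(Li, eta, p, s, eps=1e-8):
    """WSNM shrinkage of one patch group: u_i = U diag(sigma) V^T via (24)-(26).
    eta = lam*R/(beta*N); s is the blur kernel size."""
    m = Li.shape[1]                                              # number of similar patches
    U, delta, Vt = np.linalg.svd(Li, full_matrices=False)
    sigma0 = np.sqrt(np.maximum(delta ** 2 - m * s ** 2, 0.0))   # initialization (26)
    w = 2.0 * np.sqrt(2.0 * m) / (sigma0 ** (1.0 / p) + eps)     # weights (25)
    sigma = np.array([gst(d, eta * wi / 2.0, p)                  # shrinkage (24)
                      for d, wi in zip(delta, w)])
    return (U * sigma) @ Vt
```

Aggregating the shrunken groups back into the image, averaging overlapping pixels, then yields \( u \).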

In addition, we add a stopping criterion to the internal iterative process, to enhance the fast convergence of our algorithm:

$$ {{\left\| {L^{(x)} - L^{(x - 1)} } \right\|_{2} } \mathord{\left/ {\vphantom {{\left\| {L^{(x)} - L^{(x - 1)} } \right\|_{2} } {\left\| {L^{(x)} } \right\|_{2} }}} \right. \kern-0pt} {\left\| {L^{(x)} } \right\|_{2} }} < tol, $$
(27)

where \( x \) is the number of internal iterations and \( tol = 10^{ - 5} \).

3.2 Blur Kernel \( k \) Estimation

After the intermediate latent image \( L \) has been obtained, the closed-form solution of subproblem (11) could be computed directly by FFT. However, (11) is based on image intensities, and solving it directly does not yield good results [4, 26, 35, 51]. Estimating the blur kernel \( k \) from image gradients is more effective, so we reformulate (11) as

$$ \mathop {{\mathrm{min}}}\limits_{k} \left\| {\nabla L * k - \nabla B} \right\|_{2}^{2} + \gamma \left\| k \right\|_{2}^{2} . $$
(28)

The objective function (28) can be solved directly by FFT:

$$ k = F^{ - 1} \left( {\frac{{\overline{F(\nabla L)} F(\nabla B)}}{{\overline{F(\nabla L)} F(\nabla L) + \gamma }}} \right). $$
(29)
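As an illustration, the NumPy sketch below implements (29) together with the post-processing described in the next paragraph (setting negative elements to zero and normalizing); it assumes an odd kernel size, uses circular forward differences, and glosses over boundary handling. It reuses the `psf2otf`-style FFT conventions of the earlier sketches.

```python
import numpy as np

def update_kernel(L, B, gamma, ksize):
    """Frequency-domain kernel update (29), followed by projection and normalization."""
    num = np.zeros(B.shape, dtype=complex)
    den = np.zeros(B.shape)
    for axis in (0, 1):                          # vertical and horizontal gradients
        gL = np.diff(L, axis=axis, append=np.take(L, [0], axis=axis))
        gB = np.diff(B, axis=axis, append=np.take(B, [0], axis=axis))
        FL, FB = np.fft.fft2(gL), np.fft.fft2(gB)
        num += np.conj(FL) * FB
        den += np.abs(FL) ** 2
    k_full = np.fft.fftshift(np.real(np.fft.ifft2(num / (den + gamma))))
    c0, c1, r = B.shape[0] // 2, B.shape[1] // 2, ksize // 2
    k = k_full[c0 - r:c0 + r + 1, c1 - r:c1 + r + 1]  # centered crop to kernel support
    k = np.maximum(k, 0.0)                            # negative elements set to 0
    return k / max(k.sum(), 1e-12)                    # normalize to unit sum
```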

After obtaining the blur kernel \( k \), we set its negative elements to 0 and normalize it. In our implementation, similar to other state-of-the-art algorithms [4, 38, 51], a coarse-to-fine multiscale framework based on the image pyramid [4] is adopted in the whole deblurring process; this helps our algorithm estimate large-scale blur kernels. We alternately solve \( L \) and \( k \) at each layer of the pyramid and then perform up-sampling, using the estimated \( k \) as the initial value for the next layer. Algorithm 1 presents the main steps of the proposed deblurring algorithm at each layer of the pyramid.

[Algorithm 1: the proposed deblurring algorithm at one pyramid layer]
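For readers without access to the figure, the loop structure of Algorithm 1 can be reconstructed from the text roughly as follows. This is a hedged sketch: the helper functions are the earlier illustrative ones (`wsnm_update` is a hypothetical helper that applies `wsnm_group` to every patch group and aggregates the results), and the doubling schedule for \( \alpha \) and \( \beta \) is an assumption, not the authors' exact schedule.

```python
import numpy as np

def deblur_one_level(B, k, prm):
    """Alternating L- and k-updates at one pyramid layer (our reconstruction)."""
    for _ in range(prm["T_max"]):                         # outer iterations (T_max = 5)
        L, alpha, beta = B.copy(), prm["alpha0"], prm["beta0"]
        while alpha < prm["alpha_max"] and beta < prm["beta_max"]:
            g_h, g_v = update_g(L, prm["mu"], alpha)          # (16)
            u = wsnm_update(L, prm)                           # (24)-(26) over all groups
            L_new = update_L(B, k, g_h, g_v, u, alpha, beta)  # (14)
            done = np.linalg.norm(L_new - L) / np.linalg.norm(L_new) < 1e-5  # (27)
            L, alpha, beta = L_new, 2.0 * alpha, 2.0 * beta   # assumed continuation
            if done:
                break
        k = update_kernel(L, B, prm["gamma"], prm["ksize"])   # (29)
    return L, k
```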

3.3 Non-blind Deconvolution

When using our blind deblurring algorithm (Algorithm 1) to restore the final latent image, the image details may be lost. Therefore, after obtaining the final blur kernel, various state-of-the-art non-blind deconvolution algorithms can be used to restore the final latent image. In this paper, the sparse deconvolution algorithm [24] is adopted, unless otherwise stated.

4 Extension to Non-uniform Deblurring

The proposed deblurring method can be easily extended to non-uniform deblurring, where the blur kernel is spatially variant. Based on the geometric model of camera motion [47], we model the blurred image \( B \) as the sum of all the differently transformed views of the scene:

$$ B = \sum\limits_{t} {k_{t} H_{t} L + n} , $$
(30)

where \( t \) indexes the views, \( H_{t} \) is a homography matrix, and \( k_{t} \) is the weight of the tth view, with \( k_{t} \ge 0 \) and \( \sum\nolimits_{t} {k_{t} = 1} \). Similarly to [47], (30) can be rewritten as

$$ B = KL + n = A{\text{k}} + n, $$
(31)

where \( K = \sum\nolimits_{t} {k_{t} H_{t} } \), \( A = [H_{1} L,H_{2} L, \ldots ,H_{t} L] \), and \( {\text{k}} = [k_{1} ,k_{2} , \ldots ,k_{t} ]^{{\mathrm{T}}} \). We can solve the non-uniform blur problem by alternately minimizing

$$ \mathop {{\mathrm{min}}}\limits_{L} \left\| {KL - B} \right\|_{2}^{2} + \mu \left\| {\nabla L} \right\|_{0} + \lambda \sum\limits_{i = 1}^{d} {\left\| {L_{i} } \right\|_{{w,S_{p} }}^{p} } $$
(32)

and

$$ \mathop{{\mathrm{min}}}\limits_{\text{k}} \left\| {A{\text{k}} - B} \right\|_{2}^{2} + \gamma \left\| {\text{k}} \right\|_{2}^{2} . $$
(33)

Similarly to the case of uniform deblurring, (32) can be rewritten as

$$ \mathop {{\mathrm{min}} }\limits_{L,g,u} \left\| {KL - B} \right\|_{2}^{2} + \alpha \left\| {\nabla L - g} \right\|_{2}^{2} + \beta \left\| {L - u} \right\|_{2}^{2} + \mu \left\| g \right\|_{0} + \lambda \sum\limits_{i = 1}^{d} {\left\| {u_{i} } \right\|_{{w,S_{p} }}^{p} } . $$
(34)

For (34), we use the same optimization strategies as (15) and (17) to solve for \( g \) and \( u \).

The minimization problem for \( L \) is

$$ \mathop {{\mathrm{min}} }\limits_{L} \left\| {KL - B} \right\|_{2}^{2} + \alpha \left\| {\nabla L - g} \right\|_{2}^{2} + \beta \left\| {L - u} \right\|_{2}^{2} . $$
(35)

Obviously, (35) cannot be solved directly by FFT. Because the blur kernel varies little within a small region, we approximate the non-uniform blur with locally uniform blur. Based on the fast forward approximation method [15], the blurred image is divided into \( Q \) overlapping patches, and the matrix \( K \) can be expressed as

$$ K = \sum\limits_{r = 1}^{Q} {C_{r}^{ - 1} } (F^{ - 1} ({\text{diag}}(F(a_{r} )))F(C_{r} {\text{diag}}(M_{r} ))), $$
(36)

where \( C_{r} ( \cdot ) \) denotes the operator that crops the rth patch from the image, \( C_{r}^{ - 1} ( \cdot ) \) denotes the operator that puts the patch back into the reconstructed image, and \( {\text{diag}}( \cdot ) \) denotes a diagonal matrix. The matrix \( M_{r} \) is a window function of the same size as \( L \). In addition, \( a_{r} = C_{r} {J}_{r} {\text{k}} \) denotes the blur kernel of the rth patch, and the elements of each matrix \( {J}_{r} \) are simply a rearrangement of the elements of \( H_{t} \) [46].
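In effect, (36) says that applying \( K \) amounts to summing windowed, locally uniform convolutions. The sketch below is our simplification of this idea: it windows the full-size image instead of cropping patches, and assumes the window masks \( M_r \) and local kernels \( a_r \) have been precomputed; `psf2otf` is the earlier sketch.

```python
import numpy as np

def apply_nonuniform_K(L, windows, local_kernels):
    """Approximate KL in (36) as a sum of windowed, locally uniform blurs."""
    out = np.zeros_like(L)
    for M_r, a_r in zip(windows, local_kernels):
        Fk = psf2otf(a_r, L.shape)                    # local kernel a_r of this region
        out += np.real(np.fft.ifft2(Fk * np.fft.fft2(M_r * L)))  # blur windowed region
    return out
```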

Based on (36), the optimized solution of (35) can be calculated quickly by frequency-domain convolutions and correlations. For the blur kernel optimization problem (33), we use the method of [46] to estimate the kernel. The main difference between the proposed method and that of [46] is that we do not employ a bilateral filter and shock filter to predict significant edges. This is because the effective combination of WSNM and L0-regularized gradient priors can eliminate harmful structures and obtain the intermediate latent images with sharp edges.

5 Experimental Results

We evaluated our method on three natural image datasets [20, 25, 43] and real-world images and compared it with state-of-the-art methods. We also evaluated our method on specific domain datasets, such as text [35], face [23], and saturated images [23] and compared it with related specially designed methods. In addition, we evaluated the robustness of our algorithm to blurred images with Gaussian noise. Finally, we tested non-uniform blurred images and other types of blurred images.

In all uniform deblurring experiments, we empirically set the parameters of the proposed algorithm as follows: \( \lambda = 0.005 \), \( \mu = 0.004 \), \( \gamma = 40 \), \( \alpha_{\max} = 10^{5} \), \( \beta_{\max} = 0.06 \), \( p = 0.9 \), and \( T_{\max} = 5 \). We collected \( 9 \times 9 \) non-local similar patches through a \( 30 \times 30 \) search window by using the block matching algorithm [6], and the overlap of adjacent example patches was set to one pixel. For a fair comparison, the other methods were tested with the default parameters of the authors' original code. Our numerical experiments were performed in MATLAB R2017a on a desktop computer with an Intel Core i7-8770 CPU at 3.20 GHz and 32 GB RAM.

5.1 Natural Images

5.1.1 Dataset of Levin et al.

First, we performed a quantitative evaluation on the dataset of Levin et al. [25], which comprises 32 images generated from four grayscale images and eight uniform blur kernels. The sizes of the kernels ranged from \( 13 \times 13 \) to \( 27 \times 27 \). We compared the proposed method with other state-of-the-art methods [4, 7, 11, 26, 28, 38,39,40, 43, 49] and uniformly adopted the non-blind method [24], to ensure a fair comparison. We adopted three measurement criteria to evaluate the recovery results: the cumulative error ratio [25], the peak signal-to-noise ratio (PSNR), and the structural similarity index (SSIM). The error ratio is

$$ r = \frac{{\left\| {L_{t} - L} \right\|_{2}^{2} }}{{\left\| {L_{t} - L_{k} } \right\|_{2}^{2} }}, $$
(37)

where \( L_{t} \), \( L \), and \( L_{k} \) represent the ground-truth sharp image, the recovered latent image, and the deblurred image obtained by the ground-truth kernel deconvolution, respectively. When the error ratio \( r \) is reduced, the recovered image is closer to the ground-truth sharp image. According to [25], when \( r \le 2 \), the deconvolution result is usually visually plausible.
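In code, (37) is simply a ratio of squared reconstruction errors; a one-function sketch (our naming):

```python
import numpy as np

def error_ratio(L_t, L, L_k):
    """Cumulative error ratio (37): r <= 2 is usually visually plausible [25]."""
    return np.sum((L_t - L) ** 2) / np.sum((L_t - L_k) ** 2)
```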

We first compared our method quantitatively with our method without WSNM. Figure 1 shows the average PSNR and average SSIM of four images with eight kernels. Our method achieved a higher PSNR and SSIM, on average. Figure 2a shows that our method achieved a higher success rate for each error ratio. It is worth noting that, when our method omits WSNM, it contains only the L0 gradient term, which is equivalent to the method of [51]. Compared with other competing methods, our method achieved the highest PSNR and SSIM values, on average, as shown in Table 1. Figure 2b shows that our method also achieved the highest success rate when the error ratio was 2.

Fig. 1

Quantitative evaluation of our method and our method without WSNM on the dataset of [25]. a Comparison of average PSNR. b Comparison of average SSIM. The rightmost column shows the average PSNR and average SSIM of all images

Fig. 2

Comparison of cumulative error ratios of various methods on the dataset of [25]. a Comparison of our two methods. b Comparison of our method with other competing methods

Table 1 Comparison of average PSNR (dB) and average SSIM of the results of deblurring, using various methods, on the dataset of [25]

For a better comparison, Fig. 3 visually shows an example image and the results recovered by different methods. Our method estimated an accurate blur kernel and recovered comparable, or even better, image quality. However, the kernel estimated by our method without WSNM produced an interruption in the upper right part of the image. Table 2 shows that our method achieved the highest PSNR and the second-best SSIM for the image of Fig. 3.

Fig. 3

Visual comparison on an example image from the dataset of [25]

Table 2 Comparison of PSNR (dB) and SSIM values of the deblurring results corresponding to Fig. 3

5.1.2 Dataset of Sun et al.

Second, we conducted a quantitative evaluation on the benchmark dataset of Sun et al. [43], which produced 640 blurred images from 80 natural images and eight kernels. Our method was compared with seven optimization-based methods [4, 21, 26, 38, 39, 43, 49] and two CNN-based methods [2, 28]. To ensure fairness, we uniformly used the non-blind deconvolution method [55] to recover the final results and employed the error ratio criterion for evaluation. According to [32], if the error ratio \( r \le 5 \), the final recovery result is satisfactory. Figure 4a shows that our method yielded a higher success rate than our method without WSNM, for every error ratio. Figure 4b shows that our method achieved a higher success rate than the other methods for \( r \ge 2 \).

Fig. 4

Comparison of cumulative error ratios of various methods on the dataset of [43]. a Comparison of our two methods. b Comparison of our method with other competing methods

We then chose two challenging images for comparison. They have few significant edges but many small details, which is not conducive to blur kernel estimation. As shown in Figs. 5 and 6, our method estimated the most accurate blur kernel, yielding the most pleasing recovered images. However, the other methods estimated either poor blur kernels or kernels containing noise, resulting in serious ringing artifacts. It is worth noting that the kernel estimated by our method without WSNM deviated seriously in both images. The reason is that the main advantage of the L0 gradient is the extraction of significant edges; when an image has few edges or complex structures, the L0 gradient prior cannot extract edges well. In contrast, the WSNM prior in our model can use the low-rank property of groups of similar patches to remove harmful small details, so that useful information is better preserved for kernel estimation. Table 3 also shows that our method produced the highest PSNR and SSIM values in the deblurring results of both Figs. 5 and 6.

Fig. 5

Visual comparison on an example image from the dataset of [43]

Fig. 6

Visual comparison on another example image from the dataset of [43]. Part of the recovered image is enlarged for comparison

Table 3 Comparison of PSNR (dB) and SSIM values of the deblurring results corresponding to Figs. 5 and 6

5.1.3 Dataset of Köhler et al.

Third, we tested the proposed algorithm against other advanced methods [4, 7, 11, 15, 21, 28, 38, 40, 42, 46, 49, 52] on the real dataset [20], which contains 48 blurred images generated from 12 blur kernels and four images. The mean structural similarity index (MSSIM) and PSNR were selected as the evaluation metrics. The result of each restoration was compared to 199 sharp images taken along the camera trajectory, and the highest PSNR and MSSIM values were calculated.

Figure 7 shows that our method achieved a higher PSNR and MSSIM, on average, than our method without WSNM. The average PSNR and MSSIM differences between the two methods were 0.533 dB and 0.025, respectively. This proves that the WSNM prior can improve the deblurring performance of real images. Table 4 shows that our method achieved a competitive MSSIM and the highest PSNR values, on average, compared to other state-of-the-art methods.

Fig. 7

Quantitative evaluation of our method and our method without WSNM on the dataset of [20]. a Comparisons of average PSNR. b Comparisons of average MSSIM. The rightmost column shows the average PSNR and average MSSIM of all images

Table 4 Comparison of average PSNR (dB) and average MSSIM of the results deblurred using various methods on the dataset of [20]

Figure 8 illustrates a challenging visual example with severe motion blur. Our method produced the best visual results. Although two methods [28, 38] achieved acceptable deblurring results, the result of the method of [38] has severe ringing artifacts, and the result of the method of [28] is over-smooth. The deblurring results of the other two low-rank methods show severe distortion. Table 5 shows that our method obtained the highest PSNR and MSSIM values in the deblurring results of Fig. 8.

Fig. 8

Visual comparison of deblurring results using a severely blurred image on the benchmark dataset of [20]

Table 5 Comparison of PSNR (dB) and MSSIM values of the deblurring results corresponding to Fig. 8

5.1.4 Other Real-World Images

We further compared our approach with the two other low-rank methods on real-world images, using the same non-blind method [24] to ensure fairness. As shown in Fig. 9, the method of [40] estimated a kernel with large error and produced a poor-quality image. The kernel estimated by the method of [7] contained substantial noise, resulting in obvious artifacts in the restored image, whereas our method recovered a clearer result. It is worth noting that our method without WSNM estimated an incorrect blur kernel. Figure 10a shows an example of large-scale motion blur. Our method is the only one that recovered a good visual result; the images recovered by the other methods contain serious errors.

Fig. 9

Visual comparison of our method with other low-rank methods on a real-world image

Fig. 10

Visual comparison of our method with other low-rank methods on another real-world image

5.2 Domain-Specific Images

5.2.1 Text Images

Text images mostly contain two tones (white and black) that do not comply with the heavy-tailed distribution of natural images; processing them is a daunting task for most deblurring methods. Our approach was compared with the other methods [7, 21, 26, 28, 35, 38, 40, 52] on the text dataset of [35], consisting of 15 images and eight kernels. It is worth noting that the method of [35] was designed specifically for text images. We employed the same non-blind approach [24] to ensure fairness. Table 6 shows that our method yielded the highest PSNR, on average, whereas our method without WSNM yielded a lower value. Figure 11 shows a typical case, for which our method recovered the best quality image, whereas most other methods produced severe artifacts and blur residues. Table 7 shows the PSNR values of all the images compared in Fig. 11.

Table 6 Comparison of average PSNR (dB) of the results deblurred using various methods on the text dataset of [35]
Fig. 11

Visual comparison on an example image from the text dataset of [35]. Part of the recovered image is enlarged for comparison

Table 7 Comparison of PSNR (dB) values of the deblurring results corresponding to Fig. 11

Figure 12 shows another challenging image from [35], on which our method yielded the best visual result. Among the other methods, only the result of the method of [35] was acceptable. In particular, our method without WSNM produced a poor result.

Fig. 12

Visual comparison on another challenging image from [35]. The results were obtained by the same non-blind method [35]

5.2.2 Face Images

Face images contain fewer edge structures, so they are not conducive to blur kernel estimation. Our method was compared with other methods [7, 26, 28, 38, 40, 43, 49, 52] on the face dataset of Lai et al. [23], which contains 20 blurred images generated from four kernels and five images. The same non-blind deconvolution algorithm [24] was used to restore the results. The average PSNR values of the deblurring results are shown in Table 8; our method achieved the highest PSNR value, on average. Figure 13 shows that our method recovered clearer face structures and background details than others. However, our method without WSNM produced serious distortions in the face. The PSNR values for the deblurring results in Fig. 13 are shown in Table 9.

Table 8 Comparison of average PSNR (dB) of the results deblurred using various methods on the face dataset of [23]
Fig. 13

Visual comparison on an example image from the face dataset of [23]

Table 9 Comparison of PSNR (dB) values of the deblurring results corresponding to Fig. 13

In addition, we selected a real blurred face image for intuitive comparison. As shown in Fig. 14, our method produced the best result, whereas our method without WSNM produced large errors, especially in the eye region.

Fig. 14

Visual comparison on a real face-blurred image. The results were obtained by the same non-blind method [24]

5.2.3 Saturated Images

The recovery of saturated images is a challenging problem. This is because saturated pixels greatly affect the selection of significant edges, making it difficult for conventional methods to obtain good results. We performed comparative experiments using a saturated image dataset of Lai et al. [23], which produced 20 blurred images by convolution of four kernels and five images. To ensure fairness, the same non-blind algorithm [5] was adopted. Table 10 shows that our method achieved a better PSNR, on average, than the algorithm of [36], which was designed specifically for saturated pixels. Our method without WSNM achieved a lower average value. Figure 15 shows a visual comparison of the deblurring results on a challenging example. The deblurring result of our method obviously has fewer artifacts and clearer details; Table 11 shows that our method achieved the highest PSNR value.

Table 10 Comparison of average PSNR (dB) of the results deblurred using various methods on the saturated dataset of [23]
Fig. 15

Visual comparison on an example image from the saturated images dataset of [23]

Table 11 Comparison of PSNR (dB) values of the deblurring results corresponding to Fig. 15

5.2.4 Blurred Images with Gaussian Noise

Noise has a significant impact on the restoration of blurred images. We compared our approach with other advanced methods [28, 35, 38, 52] involving the L0 gradient prior on blurred images with Gaussian noise. We added Gaussian noise, with a variance ranging from 1 to 10, to the images in the dataset of Levin et al. [25] and performed comparative experiments. The same non-blind deconvolution algorithm [24] was employed. As Fig. 16a shows, at different noise levels, our method produced better results than the other methods, in terms of average PSNR. Figure 16b shows that our method without WSNM achieved a lower average PSNR value. The comparison data in Fig. 16 demonstrate that our WSNM prior is robust to Gaussian noise. This is mainly because the WSNM prior is based on local similar patches, the influence of noise on the similar patch structure is relatively small, and the WSNM penalty for singular values reduces the influence of noise further. In contrast, the other algorithms are based on pixel intensity, and weak noise will seriously affect the intensity of pixels.

Fig. 16

Quantitative evaluation of blurred images with Gaussian noise. a Comparison of average PSNR of various methods. b Comparison of average PSNR of our two methods

Figure 17a shows an example of a blurred image with a noise level of 5. Figure 17 shows that our method achieved the best visual effect, although some noise points remain in the estimated kernel. Our method also obtained the highest PSNR and SSIM values, as shown in Table 12. When the noise level approaches 10, all methods fail to recover the image well. This is mainly caused by two factors: these blind deblurring methods cannot estimate the blur kernel effectively under severe noise, and the non-blind method is also sensitive, rather than robust, to severe noise. For future work, we will delve more deeply into the restoration of images with severe noise, develop effective blind and non-blind methods for severe noise, and integrate deblurring and denoising into one recovery framework.

Fig. 17

Visual comparison on a blurred image with a noise level of 5

Table 12 Comparison of PSNR (dB) and SSIM values of the deblurring results corresponding to Fig. 17

5.3 Non-uniform Images

In this section, we show that our method can be extended naturally to deblurring non-uniformly blurred images. The proposed method was compared with the other advanced non-uniform deblurring methods [15, 35, 47, 49, 51]. Figure 18 shows that our method estimated the exact kernels and obtained the best restoration results. In Fig. 19, our method achieved results that were comparable to, or even better than, the results of the other methods.

Fig. 18

Visual comparison on a non-uniform blurred image (best viewed on high-resolution display)

Fig. 19

Visual comparison on another non-uniform blurred image (best viewed on high-resolution display)

5.4 Other Types of Blurred Images

The proposed method is designed mainly for motion blur. We also quantitatively evaluated it on other types of blur, such as Gaussian blur, out-of-focus blur, and average blur. For a fair comparison, we uniformly employed sparse deconvolution [24] for final non-blind restoration. Figures 20, 21, and 22 show that our method could accurately estimate these types of blur kernels and finally recover high-quality images. Table 13 shows that our method achieved the highest PSNR and SSIM values for all three types of deblurred images.

Fig. 20

Visual comparison on a Gaussian-blurred image. Barbara image blurred by a Gaussian blur kernel with a size of 23 × 23 and a standard deviation of 3

Fig. 21

Visual comparison on an out-of-focus blurred image. Cameraman image blurred by an out-of-focus blur kernel with a radius of 7

Fig. 22

Visual comparison on an average blurred image. House image blurred by an average blur kernel with a size of 11 × 11

Table 13 Comparison of PSNR (dB) and SSIM values of the deblurring results corresponding to Figs. 20, 21, and 22

6 Analysis and Discussion

In this section, we further analyze and discuss the effectiveness of the proposed method, the effect of similar patch sizes, computational complexity and execution time of the algorithm, the sensitivity of the main parameters, and the limitations of our algorithm.

6.1 Effectiveness of the Proposed Method

Numerous experimental comparisons, reported in Sect. 5, have proved that our method has excellent deblurring performance. In this section, we select two further challenging examples for deblurring comparison, to verify the effectiveness of the algorithm.

Figure 23 shows that our method estimated the exact blur kernel and achieved a pleasing restoration result, whereas the other two low-rank methods estimated poor kernels, causing the restoration results to contain serious artifacts and residual blur. Figure 24a shows a blurred image with few edge contours, making it difficult for general optimization methods to extract the main edges. Our method estimated the most accurate kernel and yielded the best recovery results, whereas the results of the other advanced methods all contained some artifacts. Tables 14 and 15 show that our method achieved the highest PSNR values for the images in both Figs. 23 and 24. When only one prior is employed in our model, the blur kernel estimation fails, and residual blur is apparent in the restored image. This proves that the two priors complement each other in the process of blind deblurring, thereby yielding better kernel estimation. The results shown in Fig. 24 also clarify that the WSNM prior has better universality and effectiveness than the other image priors [28, 38, 52] combined with the L0 gradient. In the following, we further analyze the role of each prior in our model separately.

Fig. 23

Visual comparison of the results deblurred using three low-rank methods. The results were obtained by the same non-blind method [47]

Fig. 24

Visual comparison of the results deblurred using different methods. The results were obtained by the same non-blind method [24]. Part of the recovered image is enlarged for comparison

Table 14 Comparison of PSNR (dB) values of the deblurring results corresponding to Fig. 23
Table 15 Comparison of PSNR (dB) values of the deblurring results corresponding to Fig. 24

6.1.1 Effectiveness of WSNM Prior

The two variants of our model (with and without the WSNM prior) have been compared experimentally on five datasets [20, 23, 25, 35, 43], and the variant with the WSNM prior achieved better results, proving that the WSNM prior improves the deblurring effect.

Figure 25 shows a challenging example from [20], comparing the intermediate latent images of our method (with and without WSNM) and two well-known methods [35, 38] involving the L0 gradient. As shown in Fig. 25b–e, our method (with WSNM) estimated the most accurate kernel and achieved the highest PSNR and MSSIM, as shown in Table 16, whereas our method without WSNM and the method of [35] estimated kernels with large errors, resulting in poor recovery results. Figure 25h, i shows that, as the number of iterations increases, WSNM effectively promotes the formation of sharper intermediate latent images, thereby improving kernel estimation. Figure 25g, i shows that, although the method of [38] ultimately yielded a good recovery result, our method produced sharp intermediate latent images and a precise kernel more quickly.

Fig. 25

Comparison of the deblurred images and intermediate results with those of the other methods. be Deblurring results of the corresponding methods. fi Intermediate latent results of the corresponding methods; the number of iterations increases from left to right

Table 16 Comparison of PSNR (dB) and MSSIM values of the deblurring results corresponding to Fig. 25

To better illustrate the validity of the WSNM prior, we compared the NNM, WNNM, and WSNM priors, each both alone and within our method, on the dataset of [25]. To ensure fairness, we used the same patch size and number of patches. Figure 26 shows that the method with only WSNM achieved a higher average PSNR and success rate than the methods with only NNM or only WNNM. Similarly, our method with WSNM achieved better results than our method with either NNM or WNNM. This is mainly because NNM penalizes all singular values uniformly, and although WNNM applies penalties that differ according to the importance of the singular values, it is only a special case of WSNM. As a result, WSNM can penalize singular values more reasonably and flexibly, thereby better eliminating harmful details and retaining significant edges. However, the method with only the WSNM prior does not achieve excellent deblurring results on the dataset of [25]. This is mainly because the WSNM prior cannot extract edges for kernel estimation directly. Therefore, it is not appropriate to perform deblurring using WSNM alone.

Fig. 26

Quantitative evaluation on the dataset of [25]. a Comparison of average PSNR. b Comparison of cumulative error ratio

6.1.2 Effectiveness of L0-regularized Gradient Prior

Because of the strong sparsity of L0 regularization, several deblurring methods [28, 35, 38, 52] with an L0-regularized gradient have been developed. In the numerous experimental comparisons reported above, our method demonstrated superior deblurring performance compared with these methods. This is because they are based mainly on simple pixel intensities [28, 35] or extreme channels (dark and bright) [38, 52]. When blurred images have complex structures or lack sufficient extreme-channel pixels, these methods cannot perform the deblurring task well. As shown in Fig. 25f, i, our approach is better and forms a sharp intermediate latent image more rapidly.

In our method, the L0 gradient prior is used mainly for the extraction of strong edges, and the WSNM prior eliminates further harmful details and better preserves the main edges, thereby providing a powerful guarantee for the main edge extraction performed in the next iteration. Numerous experiments have proved that, if only the L0-regularized gradient is used, this method is unable to deblur effectively when the blurred image is complex or lacks sufficient strong edges. In addition, Fig. 26 shows that satisfactory results cannot be obtained without the L0-regularized gradient. Therefore, the combination of the two priors (L0-regularized gradient and WSNM) can perform the deblurring task better than either of them alone.

To further illustrate the effectiveness of the L0 gradient, we adopted different norms for image gradients in our algorithm, for comparison on the dataset of [25]. Figure 27a shows that the combination of the L0 gradient prior and the WSNM prior achieved the best results. This proves that the L0 constraint is more suitable for our model than the other norm constraints.

Fig. 27

Quantitative evaluation on the dataset of [25]. a Comparison of cumulative error ratios under different norm constraints. The red curve shows that our method with L0 gradient achieved the best results. b Comparison of average PSNR of different similar patch sizes (Color figure online)

6.2 Effect of Similar Patch Size

Because WSNM involves a search for similar patches, the size of the patch is an important factor. We evaluated different patch sizes on the dataset of [25]. Figure 27b shows that, within a certain range, the patch size had little effect on the average PSNR; this proves that our algorithm is insensitive to patch size, within a reasonable range. With a patch size of \( 9 \times 9 \), the average PSNR obtained was relatively high; therefore, to ensure fairness, we used a \( 9 \times 9 \) patch size in all experiments.

6.3 Computational Complexity and Execution Time

A comparison of the execution times of our method, with and without the low-rank prior, is shown in Table 17. The dominant computational cost of the proposed algorithm is the calculation of the WSNM prior, that is, the calculation of the intermediate auxiliary variable \( u \) in Algorithm 1. This step mainly involves a search for similar patches, and the SVD and GST operation of the group structure \( L_{i} \in \Re^{n \times m} \). We assume that the number of pixels is \( N \), and the average time to search for \( m \) similar patches for each example patch \( l_{i} \in \Re^{\sqrt n \times \sqrt n } \) is \( T_{S} \). The complexity of the SVD and GST operations for each similar group \( L_{i} \) is \( O(nm^{2} ) \). Therefore, the complexity of the low-rank prior is \( O(N(nm^{2} + T_{S} )) \). The other steps of Algorithm 1, such as the calculation of intermediate latent images and kernel estimation, can be accelerated by FFTs and inverse FFTs.

Table 17 Comparison of execution time (in s). All methods are implemented in MATLAB unless otherwise specified

We also measured the execution time of other competing algorithms [7, 26, 38, 40, 51, 52] on blurred images of different sizes, as shown in Table 17. The execution time of our method was greater than that of the methods of [26, 38, 51, 52]. However, compared with the other two low-rank methods [7, 40], the execution time of our algorithm was much shorter. Because we adopt the L0 gradient prior, our method can quickly extract the main edges, so only a few outer iterations involving WSNM operations are required to maintain the significant edges.

6.4 Main Parameter Analysis

Our model (7) includes the three regularized weight parameters \( \mu \), \( \lambda \), and \( \gamma \), and the \( p \) in Schatten p-norm minimization. We analyzed the sensitivity of these parameters in our algorithm, on the dataset of [25], by changing each parameter and fixing the others.

We evaluated the sensitivity of the three weight parameters \( \mu \), \( \lambda \), and \( \gamma \) by measuring the accuracy of estimated kernels using the kernel similarity criterion. As shown in Fig. 28a–c, our algorithm was insensitive to the setting of weight parameters within a certain range.

Fig. 28
figure 28

Sensitivity analysis of the main parameters \( \mu \), \( \lambda \), \( \gamma \), and \( p \) in our algorithm

Because of the uncertainty and complexity of real-world blur kernels, different values of \( p \) (\( 0 \le p \le 1 \)) could be chosen to suit the situation at hand. However, we must set \( p \) uniformly, to ensure fairness of comparison. As shown in Fig. 28d, a higher average PSNR value was obtained when \( p = 0.9 \). Therefore, we set \( p = 0.9 \) in all the experiments.

6.5 Limitations

Our method is ineffective for blurred images whose degradation is nonlinear, because our model (1) assumes linear convolution. For example, blur caused by out-of-focus optics or by object motion in dynamic scenes is usually severely nonlinear and non-uniform; existing algorithms primarily use depth estimation or image segmentation to recover such images. As shown in Figs. 29 and 30, the blurred regions of the two example images remain blurred after deblurring. Our next task is to design a robust, dedicated algorithm for such highly nonlinear and non-uniform blurred images.

Fig. 29

An out-of-focus blurred image in the real world and our restoration result

Fig. 30

A dynamic scene blurred image and our restoration result

7 Conclusions

In this paper, we present a new blind restoration model based on the low-rank prior and the L0-regularized gradient prior. We adopt the flexible non-convex weighted Schatten p-norm minimization as our low-rank prior. In the coarse-to-fine multiscale framework, we propose an efficient optimization algorithm to perform blur kernel estimation, which combines the HQS strategy with the GST algorithm. Experimental analysis shows that the WSNM prior in our model further promotes accurate kernel estimation, while ensuring the sparsity and continuity of the estimated kernel and improving the robustness of our method. In addition, our method does not require additional complex kernel estimation or a salient edge prediction strategy. Moreover, our method can be naturally extended to non-uniform deblurring. Extensive experiments show that our method achieves excellent visual results on blurred images from various specific scenarios. The results of qualitative and quantitative evaluations indicate that the proposed method performs favorably against other state-of-the-art methods.