1 Introduction

Hazy weather seriously degrades the visual quality of outdoor images. Haze, fog, water droplets, and other suspended particles are different forms of bad weather, and outdoor images captured under these conditions have poor color, contrast, and visibility [7, 33, 38]. These turbid media form because aerosols present in the atmosphere scatter and absorb light [22, 36]. Image dehazing is therefore highly desirable in computational photography, image processing, and computer vision [10, 32]: it can significantly improve the color, visibility, and contrast of hazy images. Because both the distance from the camera to each scene point and the airlight are unknown, removing haze from a single input image is difficult, and many algorithms have been proposed for this purpose. Tan et al. proposed a local contrast optimization-based haze removal method [38]. However, its results exhibit color distortion and halo artifacts. He et al. proposed a novel dark channel prior (DCP) [12] method for single image dehazing. However, this method is not applicable to large sky regions, and halo artifacts as well as color distortions persist in both flat and sharp regions. To reduce halo artifacts, different edge-preserving image filters have been proposed [13, 23, 26, 39, 44]. Bilateral filters and their improved versions are widely used in image processing [39, 44], but they cannot keep pixels consistent near edges, so edge information is not preserved appropriately. To overcome this drawback of the bilateral filter [39], He et al. developed the guided image filter (GIF) [13], which is based on a local linear model. It is a good edge-preserving filter in which the filter input can be either the guidance image itself or a different image. It has wide applications in image processing, such as detail enhancement, reduction in edge smoothing, image feathering, and denoising. However, halo artifacts and over-smoothing persist in the sharp regions. Further, a weighted guided image filter (WGIF) [26] was proposed to reduce the halo artifacts by introducing an edge-aware weighting factor into the existing GIF [13]. However, due to the local linear model, it is unable to preserve edge information in the sharp regions. In this paper, we propose an effective scale-aware edge-smoothing weighting constraint-based weighted guided image filter (ESAESWC-WGIF) for single image dehazing. The main contributions of this work are summarized as follows:

  • In this paper, we propose a new effective scale-aware edge-smoothing weighting constraint-based weighted guided image filter (ESAESWC-WGIF) for single image dehazing.

  • The proposed edge-smoothing weighting is a multi-scale-based local linear filter, and it is less sensitive to the regularization parameter than the GIF, WGIF, GGIF, and EGIF methods.

  • This filter is obtained by incorporating the ESAESWC into the cost function of GIF.

  • It is an excellent edge-preserving filter and removes halo artifacts and over-smoothing strongly in both flat and sharp regions.

  • To analyze the effectiveness of the proposed method, we evaluate the PSNR, SSIM, FADE, and CIEDE2000 metrics on the Fattal, NYU2, D-HAZY, Haze-RD, and O-HAZE datasets. Experimental results show that the proposed method achieves favorable performance against the existing haze removal methods.

The remainder of this paper is organized as follows. In Sect. 2, we briefly review the related works. Section 3 covers the preliminary work essential for understanding the GIF and related edge-preserving filtering concepts. The proposed dehazing algorithm is detailed in Sect. 4. Experimental outcomes are discussed in Sect. 5, and Sect. 6 concludes the paper.

2 Related Work

This work is related to prior-based, edge-preserving filter-based, and deep-learning-based haze removal methods.

2.1 Prior-Based Haze Removal Methods

Various prior-based haze removal methods exist. Well-known examples are DCP [12], CAP [48], DSPP [14], CEP [4], BDPK [19], IDGCP [20], SIPSID [29], and IDBP [21]. He et al. proposed a dark channel prior (DCP) [12] method for single image dehazing. The prior states that, in most non-sky patches of a haze-free image, at least one color channel has very low intensity. However, it fails when bright objects are present in the scene. Moreover, it is computationally inefficient due to soft matting, and it generates halo artifacts and color distortions at depth discontinuities. Zhu et al. proposed a color attenuation prior (CAP) method in [48] for single image dehazing. Here, a linear model-based supervised learning method is used to estimate the depth information of the input hazy image. Next, He et al. proposed an optimal transmission map-based difference structure preservation prior [14] method for single image dehazing. This method represents an image patch as a sparse linear combination of elements to obtain an accurate transmission map. Next, Bui et al. proposed a color ellipsoid prior [4] method for single image dehazing. This method uses color ellipsoid geometry to calculate the transmission map, which increases the contrast of the restored image pixels while preventing over-saturated pixels. Ju et al. proposed a Bayesian dehazing algorithm (BDPK) [19] based on an adaptive and more reliable atmospheric scattering model (RASM). This method directly converts the image dehazing process into an optimization function using Bayesian theory and prior knowledge, and restores the scene albedo with an alternating minimizing technique (AMT). Next, an effective gamma correction prior (GCP)-based atmospheric scattering model (ASM) is proposed in [20] for image dehazing. In this model, the input image is first transformed into a virtual image, which is then combined with the input image to calculate the scene depth. Next, Lu et al. proposed a saturation-based iterative dark channel prior (IDCP) [29] method for single image dehazing. In IDCP, the dark channel is reformulated in terms of saturation and brightness, and the transmission map is estimated without computing the dark channel. In [21], Lu et al. proposed a novel blended prior model for single image dehazing (IDBP). This method has two modules, atmospheric light estimation (ALE) and a multiple prior constraint (MPC), to remove haze from the input image. The prior-based methods remove haze efficiently. However, they fail to preserve edge information in the sharp regions, and halo artifacts are observed in the dehazed image.

2.2 Edge-Preserving Filtering-Based Haze Removal Methods

Halo artifacts are a major problem in haze removal algorithms. Therefore, He et al. developed a novel guided image filter (GIF) [13] for single image dehazing. In this algorithm, a local linear model is used to represent the filtered output with the help of a guidance image. This method reduces the halo artifacts and preserves the edge information more accurately. However, GIF [13] fails to preserve edge information in sharp regions due to its local linear model and large computational complexity. Next, Li et al. proposed a weighted guided image filter (WGIF) [26] for image dehazing. It removes halo artifacts strongly and preserves the edge information in the sharp regions more accurately than the GIF [13]. However, due to the local linear model and the fixed regularization parameter, over-smoothing takes place in the sharp regions. Next, Kou et al. proposed a multi-scale edge-aware weighting-based gradient domain guided image filter (GGIF) [23] to avoid over-smoothed images in flat regions and to reduce halo artifacts more strongly than WGIF [26]. However, over-smoothing in the sharp regions increases with an increase in the regularization parameter. In EGIF [28], the average of local variances for all pixels is incorporated in the cost function of GIF [13]. It removes halo artifacts effectively and preserves edge information more precisely than the GIF [13], WGIF [26], and GGIF [23] methods. However, it also over-smooths images in the sharp regions, depending on the value of the regularization parameter. Next, Geethu et al. proposed a weighted guided image filter for image dehazing [11]. In [15], Hong et al. developed a local stereo matching algorithm based on weighted guided image filtering to improve the scene depth of hazy images. Chen et al. proposed a weighted aggregation model using the guided image filter for single image dehazing in [6]. In [16], Hong et al. proposed a fast guided image filtering-based real-time local stereo matching algorithm for image dehazing.

2.3 Deep-Learning-Based Haze Removal Methods

With the popularity of convolutional neural networks (CNNs), many deep-learning-based haze removal methods such as DehazeNet [5], AOD-Net [24], Proximal DehazeNet [43], PDR-Net [25], FFA-Net [34], and RefineDNet [47] have been proposed for image dehazing. In [5], Cai et al. proposed a trainable deep learning architecture called DehazeNet to estimate the transmission map and then remove haze using the atmospheric scattering model (ATSM). In [5], a nonlinear activation function called the Bilateral Rectified Linear Unit (BReLU) is also proposed to restore the haze-free image more accurately. Next, Li et al. proposed an end-to-end CNN-based deep learning model called AOD-Net [24] to remove haze. It is designed by re-formulating the ATSM and directly generates the haze-free image through a lightweight CNN. Yang et al. [43] presented a CNN-based deep architecture for single image dehazing that learns the dark channel prior and the transmission map. This method uses a proximal learning-based iterative deep learning algorithm called Proximal DehazeNet. Next, Li et al. proposed a deep CNN for single image dehazing named PDR-Net [25]. Here, a perception-driven image dehazing sub-network is designed for single image dehazing, and a refinement sub-network improves the contrast and visual quality of the dehazed image. Next, Qin et al. proposed an end-to-end feature fusion attention network (FFA-Net) [34], which combines channel attention and pixel attention to directly recover the haze-free image. This method performs outstandingly in the case of thick haze and rich texture details. Zhao et al. proposed a two-stage weakly supervised framework named RefineDNet [47] for single image dehazing. In [47], the prior-based DCP is first used to recover visibility, and a generative adversarial network (GAN) is then introduced to enhance the contrast and realness of the dehazed image. However, these methods are often impractical: most of the models are trained on synthetic hazy datasets and often fail when tested on real hazy datasets. They also require enormous computation and memory resources, especially as the network depth increases.

3 Background

In GIF [13], the filtered output is linearly related to the guidance image, and it is expressed as

$$\begin{aligned} {q_i}={a_k}{I_i}+{b_k},\quad \forall i\in {\omega }_{\zeta _1}(k), \end{aligned}$$

where \( I_{i}\) represents the guidance image and \( q_{i}\) represents its linear transform at the ith pixel position in window \({\omega }_{\zeta {_1}}(k)\) with radius \(\zeta _{1}\), and (\({ a_{k},b_k}\)) represent linear coefficients that are constant in the window \( {{\omega }_{\zeta _1}}(k)\) centered at the kth pixel position.

The cost function to be minimized in window \(\omega _{\zeta _1}(k)\) can be expressed as

$$\begin{aligned} {E(a_{{k}},b_{{k}})}=\sum _{i\in \omega _{\zeta {_1}}(k)}{((a_{{k}}I_{{i}}+b_{{k}}-p_{{i}})^{2}+{\varepsilon }{a_k^{2}}}), \end{aligned}$$

where \( p_{i}\) represents a filter input and \(\varepsilon \) represents a regularization parameter used to penalize large \(a_{k}\). The cost function in Eq. (2) is a linear ridge regression model [8, 49], and its optimal solution is given in terms of linear coefficients \((a_k, b_k)\) as

$$\begin{aligned} {a_{k}}= & {} \frac{\frac{1}{\vert {\omega }\vert }{\sum _{i\in \omega _{\zeta {_1}}(k)}}{I_{i}p_{i}-\mu _{k}\overline{p}_{k}}}{\sigma _k^{2}+\varepsilon }, \end{aligned}$$
$$\begin{aligned} b_{k}= & {} {\overline{p}_{k}}-{a_{k}\mu _{k}}. \end{aligned}$$

where \({\vert {\omega }\vert }\) represents the number of pixels in window \({\omega _{\zeta {_1}}(k)}\), \(\mu _{k}\) and \({\sigma _k^2}\) are the mean and variance of I in that window, and \({\overline{p}}_{k}\) is the mean of p in the same window. The regularization parameter \(\varepsilon \) and the variance \({\sigma _k^{2}}\) play a vital role in how smooth and sharp regions are treated. Specifically, where \(\varepsilon \) is larger than \({\sigma _k^{2}}\) (smooth regions), \(a_{k}\) is small and the window is smoothed toward its mean, whereas where \(\varepsilon \) is smaller than \({\sigma _k^{2}}\) (sharp regions), \(a_{k}\) remains large and the edges of the guidance image are preserved in the output.
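The closed-form solution in Eqs. (3) and (4) can be sketched in a few lines of NumPy. This is only an illustrative sketch, not the implementation used in the experiments: the helper names `box_mean` and `guided_filter` are hypothetical, images are assumed to be 2-D floating-point arrays, and borders are handled by edge replication, which is an assumption that the text does not specify.

```python
import numpy as np


def box_mean(x, r):
    """Mean of x over a (2r+1) x (2r+1) window; borders use edge replication (assumption)."""
    k = 2 * r + 1
    h, w = x.shape
    padded = np.pad(x, r, mode="edge")
    # Integral image with a leading row and column of zeros.
    s = np.pad(padded, ((1, 0), (1, 0))).cumsum(axis=0).cumsum(axis=1)
    total = s[k:k + h, k:k + w] - s[:h, k:k + w] - s[k:k + h, :w] + s[:h, :w]
    return total / (k * k)


def guided_filter(I, p, r, eps):
    """Guided image filter with guidance I, input p, window radius r, regularization eps."""
    mu_I = box_mean(I, r)
    mu_p = box_mean(p, r)
    var_I = box_mean(I * I, r) - mu_I * mu_I
    cov_Ip = box_mean(I * p, r) - mu_I * mu_p
    a = cov_Ip / (var_I + eps)   # Eq. (3)
    b = mu_p - a * mu_I          # Eq. (4)
    # Average the coefficients of all windows that cover each pixel.
    return box_mean(a, r) * I + box_mean(b, r)
```

With `eps = 0` and `p = I`, the sketch returns the guidance image unchanged, while a large `eps` drives \(a_k\) toward 0 and the output toward the local mean, which matches the role of \(\varepsilon \) described above.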

In order to keep the edge information more accurately than the GIF [13], a weighted guided image filter (WGIF) is proposed in [26]. In WGIF [26], local variance is replaced by a new edge-aware weighting \(\Gamma _{I}(k)\), and it can be expressed as

$$\begin{aligned} {\Gamma _{I}(k)=\frac{1}{N}\sum _{i = 1}^{N}\frac{\sigma _{I,1}^{2}(k)+\lambda }{\sigma _{I,1}^{2}(i)+\lambda }}, \end{aligned}$$

where N indicates the number of pixels of the guidance image I, and the parameter \(\lambda \) is a small constant whose value is selected as \((0.001\times M)^2\), with M being the dynamic intensity range of the image. \({\sigma _{I,1}^{2}(k)}\) and \({\sigma _{I,1}^{2}(i)}\) are the local variances of I in the \({3 \times 3}\) windows \(\omega _k\) and \(\omega _i\), respectively. The cost function to be minimized for WGIF [26] can be expressed as

$$\begin{aligned} {E(a_{{k}},b_{{k}})}= \sum _{i(x,y)\in \omega _{\zeta _{1}}(k)}{\left\{ (a_{{k}}I_{i}+b_{{k}}-p_{{i}})^2+{a_{k}^{2}\frac{\varepsilon }{\Gamma _{I}(k)}}\right\} }. \end{aligned}$$

The optimal value of \( a_{k}\) and \( b_{k}\) is obtained by the following expression:

$$\begin{aligned} {a_{{k}}}= & {} \frac{\mu _{{{I*p}},{\zeta _{1}}}(k)-\mu _{{{I}}, {\zeta _{1}}}(k)\mu _{{{p}}, {\zeta _{1}}}(k)}{\sigma ^{2}_{{{I}}, {\zeta _{1}}}(k)+\frac{\varepsilon }{\Gamma _{{I}}(k)}}, \end{aligned}$$
$$\begin{aligned} b_{{k}}= & {} {\mu _{{{p}}, {\zeta _{1}}}(k)}-{a_{{k}}\mu _{{{I}}, {\zeta _{1}}}(k)}. \end{aligned}$$

where \(\mu _{I*p}\) represents the mean of \(({I*p})\).
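The edge-aware weighting of Eq. (5) and the WGIF coefficients of Eqs. (7) and (8) can be illustrated as follows. This is a sketch under stated assumptions: the function names are hypothetical, the guidance image is a 2-D float array scaled to `dynamic_range`, borders use edge replication, and the double sum in Eq. (5) is factored as \((\sigma _{I,1}^{2}(k)+\lambda )\) times the image-wide mean of \(1/(\sigma _{I,1}^{2}(i)+\lambda )\).

```python
import numpy as np


def box_mean(x, r):
    """Mean of x over a (2r+1) x (2r+1) window; borders use edge replication (assumption)."""
    k = 2 * r + 1
    h, w = x.shape
    padded = np.pad(x, r, mode="edge")
    s = np.pad(padded, ((1, 0), (1, 0))).cumsum(axis=0).cumsum(axis=1)
    total = s[k:k + h, k:k + w] - s[:h, k:k + w] - s[k:k + h, :w] + s[:h, :w]
    return total / (k * k)


def local_variance(x, r):
    """Local variance of x in a window of radius r, clipped at zero against round-off."""
    mu = box_mean(x, r)
    return np.maximum(box_mean(x * x, r) - mu * mu, 0.0)


def edge_aware_weight(I, dynamic_range=1.0):
    """Gamma_I(k) of Eq. (5), computed from 3 x 3 local variances."""
    lam = (0.001 * dynamic_range) ** 2
    var1 = local_variance(I, 1)
    # (1/N) * sum_i (var(k) + lam) / (var(i) + lam), factored per pixel k.
    return (var1 + lam) * np.mean(1.0 / (var1 + lam))


def wgif_coefficients(I, p, r, eps, dynamic_range=1.0):
    """Linear coefficients a_k and b_k of Eqs. (7) and (8)."""
    gamma = edge_aware_weight(I, dynamic_range)
    mu_I, mu_p = box_mean(I, r), box_mean(p, r)
    cov_Ip = box_mean(I * p, r) - mu_I * mu_p
    a = cov_Ip / (local_variance(I, r) + eps / gamma)
    b = mu_p - a * mu_I
    return a, b
```

On a step-edge image, the weight is larger than 1 at the edge and smaller than 1 in flat areas, so the effective regularization \(\varepsilon /\Gamma _{I}(k)\) shrinks at edges and grows in flat regions.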

Fig. 1 The basic framework of the proposed haze removal method

4 Proposed Method

In this paper, an effective scale-aware edge-smoothing weighting constraint-based weighted guided image filter (ESAESWC-WGIF) is proposed for single image dehazing. The basic framework of the proposed method is shown in Fig. 1. The proposed method has three main steps. In the first step, the dark channel prior (DCP) [12] method is used to compute the atmospheric map and the raw transmission map. In the second step, the raw transmission map is refined by the ESAESWC-WGIF algorithm to remove halo artifacts and over-smoothing and to preserve edge information more accurately in both flat and sharp regions. Finally, the dehazed image is recovered as the scene radiance.

4.1 Dark Channel Prior (DCP)-Based Atmospheric Map and Transmission Map Estimation

The popular Koschmieder’s law [18] is generally used to represent the haze formation. However, McCartney [30] improved Koschmieder’s law by estimating the atmospheric map as well as transmission map more accurately. In this model, haze formation is represented by the following expression:

$$\begin{aligned} I(x)=J(x)t(x)+A(1-t(x)), \end{aligned}$$

where x is the pixel position in the image, I(x) is the input hazy image, J(x) is the output dehazed image or scene radiance, t(x) is the medium transmission map, and A is the global atmospheric light or map. In Eq. (9), the first term J(x)t(x) and the second term \({A(1-t(x))}\) are called direct attenuation and airlight, respectively.

The relation of medium transmission map t(x) with the object’s distance d(x) can be expressed as

$$\begin{aligned} t(x)=\exp (-\beta {d(x)})\le 1, \end{aligned}$$

where \({0\le d(x)\le \infty }\) is the depth (distance) of the scene point (pixel) from the camera and \(\beta \) is the scattering coefficient, which is related to the wavelength of light. The transmission is thus exponentially attenuated with the scene depth d(x). The single image haze removal result can be obtained by substituting the values of t(x) and A into Eq. (9).
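To make the haze formation model concrete, the following sketch synthesizes a hazy image from a clear image and a depth map using Eq. (9), taking the transmission in its standard form \(t(x)=\exp (-\beta d(x))\). The function names and the default value of `beta` are illustrative assumptions; `J` is assumed to be an (H, W, 3) float array, `depth` an (H, W) array, and `A` a length-3 vector.

```python
import numpy as np


def transmission_from_depth(depth, beta):
    """t(x) = exp(-beta * d(x)): equals 1 at zero depth and decays with distance."""
    return np.exp(-beta * depth)


def add_haze(J, depth, A, beta=1.0):
    """Hazy image I = J * t + A * (1 - t), i.e. direct attenuation plus airlight."""
    t = transmission_from_depth(depth, beta)[..., None]  # broadcast over color channels
    return J * t + A * (1.0 - t)
```

At zero depth the hazy image equals the scene radiance, and for very distant points it converges to the atmospheric light, which is why the airlight term dominates in far regions.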

According to the dark channel prior (DCP) [12] method, J can be estimated after assuming some prior information. In DCP [12], the dark pixel (lowest-intensity pixel) concept is used to calculate the transmission map t(x) and the atmospheric map A. The atmospheric map A is estimated from the top 0.1% brightest pixels in the dark channel of the hazy image. For a given atmospheric map A, Eq. (9) can be modified as

$$\begin{aligned} {\frac{I^{c}(x)}{A^{c}} = t(x)\frac{J^{c}(x)}{A^{c}}+ 1-t(x)}, \end{aligned}$$

where c denotes the color channel (r, g, or b), and \(A^{c}\) and \(J^{c}\) represent the atmospheric map and the dehazed image for color channel c, respectively. Since the transmission map t(x) is assumed to be constant in a local patch \(\Omega (x)\), it is denoted by \({\tilde{t}}(x)\) [12]. The dark channel is computed by applying the minimum operator to both sides of Eq. (11):

$$\begin{aligned} {\min _{\ y \in \Omega (x)}}\Big ({\min _{\ c\in \{ r,g,b\}} {\frac{I^{c}(y)}{A^{c}}\Big )= {{\tilde{t}}}(x){\min _{\ y \in \Omega (x)}}\Big ({\min _{\ c\in \{r,g,b\}}\frac{J^{c}(y)}{A^{c}}\Big )+ {1-{{\tilde{t}}}(x)}}}}. \end{aligned}$$

According to DCP [12], to restore the scene radiance J as haze-free image, the dark channel of the scene radiance should be zero, and it can be expressed as

$$\begin{aligned} J^{dark}(x)={\min _{\ y \in \Omega (x)}}\Big ({\min _{\ c}\frac{J^{c}(y)}{A^{c}}\Big )}=0, \end{aligned}$$

where \(J^{dark}\) is the dark channel of the scene radiance. Since \(A^{c}\) is always positive, Eq. (12) can be modified as

$$\begin{aligned} {\min _{\ y \in \Omega (x)}}\Big ({\min _{\ c} {\frac{I^{c}(y)}{A^{c}}\Big )= 1-{{\tilde{t}}}(x)}}. \end{aligned}$$

After simplification, \({{\tilde{t}}}(x)\) can be expressed as

$$\begin{aligned} {\tilde{t}}(x) = 1-\min _{y \in \Omega (x)}\Big (\min _{c} \frac{I^{c}(y)}{A^{c}}\Big ). \end{aligned}$$

We know that the DCP [12] method is not valid for large sky, sea, or white regions because the color of sky or ocean during haze is mostly similar to atmospheric map. Due to that, the transmission map becomes close to 0 [12, 41, 42]. Finally, the transmission map can be expressed as

$$\begin{aligned} {{{\tilde{t}}}(x)} = 1-w{\min _{\ y \in \Omega (x)}}\Big ({\min _{\ c} {\frac{I^{c}(y)}{A^{c}}\Big )}} . \end{aligned}$$

Here, a constant parameter w \((0 < w \le 1)\) is used to retain a very limited amount of haze for distant objects.
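The estimation steps of this subsection can be sketched as follows. This is a minimal illustration rather than the authors' implementation: the function names, the edge-replicated borders, and the default `w = 0.95` are assumptions (the text only requires \(0 < w \le 1\)), and the atmospheric light follows the common DCP recipe of picking, among the brightest 0.1% of dark-channel pixels, the one with the highest intensity in the hazy image. `I` is assumed to be an (H, W, 3) float array.

```python
import numpy as np


def min_filter(x, r):
    """Minimum of x over a (2r+1) x (2r+1) patch; borders use edge replication (assumption)."""
    padded = np.pad(x, r, mode="edge")
    windows = np.lib.stride_tricks.sliding_window_view(padded, (2 * r + 1, 2 * r + 1))
    return windows.min(axis=(-2, -1))


def dark_channel(I, r):
    """Minimum over color channels followed by a minimum over the local patch."""
    return min_filter(I.min(axis=2), r)


def estimate_atmospheric_light(I, r, fraction=0.001):
    """Brightest input pixel among the top `fraction` of dark-channel pixels."""
    dark = dark_channel(I, r).ravel()
    n = max(1, int(dark.size * fraction))
    brightest = np.argsort(dark)[-n:]
    candidates = I.reshape(-1, 3)[brightest]
    return candidates[candidates.sum(axis=1).argmax()]


def estimate_transmission(I, A, r, w=0.95):
    """Raw transmission map of Eq. (16): 1 - w * dark channel of I / A."""
    return 1.0 - w * dark_channel(I / A, r)
```

For a scene whose radiance has a zero dark channel and a constant true transmission, calling `estimate_transmission` with `w = 1.0` recovers that transmission exactly, which mirrors the derivation of Eq. (15).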


4.2 Effective Scale-Aware Edge-Smoothing Weighting Constraint-Based Weighted Guided Image Filter (ESAESWC-WGIF)

In this paper, a new effective weighted guided image filter based on a multi-scale edge-aware weighting constraint is proposed for single image dehazing. The new multi-scale edge-weighting constraint is incorporated in the cost function of the GIF [13]. The proposed method removes halo artifacts and the over-smoothing effect strongly and preserves edge information appropriately in both flat and sharp regions.

In GIF [13], the regularization parameter \(\varepsilon \) is identical for all local windows. As a result, GIF cannot preserve sharp edges appropriately, and halo artifacts appear near edges in the output images. To overcome this problem, a single-scale edge-aware weighting was initially proposed in the weighted guided image filter (WGIF) [26]. In this filter, a \({3 \times 3}\) window is considered for every pixel when computing the local variance, and the regularization parameter \(\varepsilon \) is replaced with \(\varepsilon /\Gamma _{I}(k)\). It removes halo artifacts and preserves edge information more accurately than GIF. However, the over-smoothing effect persists in the sharp regions due to the single-scale edge-aware weighting. In this paper, we overcome this issue by introducing a new effective scale-aware edge-smoothing weighting constraint-based weighted guided image filter (ESAESWC-WGIF). Its multi-scale edge-aware weighting constraint is defined using the local variances of both the \({3 \times 3}\) and \({(2\zeta _1+1) \times (2\zeta _1+1)}\) windows of each pixel in the guidance image I. The weighting, normalized by the average of the local variances over all pixels, can be expressed as

$$\begin{aligned} {\overline{\psi }}_{I}(k) =\frac{\sigma _{I,1}(k)}{{\overline{\sigma }}^{2}}, \end{aligned}$$

and \({\overline{\sigma }}^{2}\) is expressed as

$$\begin{aligned} {{\overline{\sigma }}^{2} =\frac{1}{N}\sum _{k = 1}^{N}{\sigma ^{2}_{I, {\zeta _{1}}}}(k)}. \end{aligned}$$

where N indicates the number of pixels of the guidance image I, and \({\sigma _{I,1}(k)}\) and \({\sigma ^{2}_{I, {\zeta _{1}}}(k)}\) are the local variances of I in the \({3 \times 3}\) windows and \({(2\zeta _1+1) \times (2\zeta _1+1)}\) windows of all pixels.

The cost function to be minimized for the proposed ESAESWC-WGIF can be expressed as

$$\begin{aligned} {E(a_{k},b_{k})}=\sum _{i(x,y)\in \omega _{\zeta {_1}}(k)}{\left\{ (a_{k}I_{i}+b_{k}-p_{i})^2+{a_{k}^{2}\frac{\varepsilon }{{\overline{\psi }}_{I}(k)}}\right\} }. \end{aligned}$$

The modified \(a_{k}\) value is calculated by the following expression:

$$\begin{aligned} {a_{k}}=\frac{\mu _{I*p,{\zeta _{1}}}(k)-\mu _{I, {\zeta _{1}}}(k)\mu _{p, {\zeta _{1}}}(k)}{\sigma ^{2}_{I, {\zeta _{1}}}(k)+\frac{\varepsilon }{{\overline{\psi }}_{I}(k)}}, \end{aligned}$$

and \(b_{k}\) is calculated as in Eq. (8). To analyze the behavior of the filter, I and p can be assumed identical, which gives

$$\begin{aligned} \mu _{{{I*p},{\zeta _{1}}}}(k)-\mu _{I, {\zeta _{1}}}(k)\mu _{p, {\zeta _{1}}}(k) = {\sigma ^{2}_{I, {\zeta _{1}}}(k)} \end{aligned}$$


$$\begin{aligned} \mu _{p, {\zeta _{1}}}(k) = \mu _{I, {\zeta _{1}}}(k). \end{aligned}$$

After substituting Eqs. (21) and (22) in Eq. (20), we obtain

$$\begin{aligned} {a_{k}}=\frac{{\sigma ^{2}_{{I}, {\zeta _{1}}}}(k)}{\sigma ^{2}_{{{I}, {\zeta _{1}}}}(k)+\frac{\varepsilon }{{\overline{\psi }}_{{I}}(k)}}, \end{aligned}$$


$$\begin{aligned} {b_{k}=(1-{a_{k}})\mu _{I, {\zeta _{1}}}(k)}, \end{aligned}$$

or it is simply written as

$$\begin{aligned} {a_{k}}=\frac{{\sigma ^{2}_{{I}, {\zeta _{1}}}}(k)}{\sigma ^{2}_{{{I}, {\zeta _{1}}}}(k)+\frac{\varepsilon }{{\overline{\psi }}_{{I}}(k)}}. \end{aligned}$$

After dividing the numerator and denominator of Eq. (25) by \({\sigma ^{2}_{I, {\zeta _{1}}}(k)}\), we get

$$\begin{aligned} {a_{k}}=\frac{1}{1+\frac{\varepsilon }{\sigma ^{2}_{I, {\zeta _{1}}}(k)\,{\overline{\psi }}_{I}(k)}}. \end{aligned}$$

Substituting the value of \({\overline{\psi }}_I(k)\) from Eq. (17) into Eq. (26) gives

$$\begin{aligned} {a_{{k}}}= & {} \frac{1}{1+{\varepsilon }{\frac{1}{{\Big \{\sigma ^{2}_{{{I}}, {\zeta _{1}}}(k)\Big \}}*\Big \{\frac{\sigma _{{{I}},1}(k)}{{\overline{\sigma }}^{2}}\Big \}}}} \end{aligned}$$
$$\begin{aligned} {a_{{k}}}= & {} \frac{1}{1+{\varepsilon }{\frac{{\overline{\sigma }}^{2}}{{\sigma ^{2}_{{{I}}, {\zeta _{1}}}(k)}*{\sigma _{{{I}},1}(k)}}}}. \end{aligned}$$
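The weighting of Eqs. (17) and (18) and the resulting coefficient \(a_{k}\) of Eq. (20) can be sketched as below. Several choices here are assumptions and not taken from the source: the function names are hypothetical, borders use edge replication, \(\sigma _{I,1}(k)\) is read literally from Eq. (17) as the \(3 \times 3\) local standard deviation, and a small constant `tiny` guards the divisions on perfectly flat images.

```python
import numpy as np


def box_mean(x, r):
    """Mean of x over a (2r+1) x (2r+1) window; borders use edge replication (assumption)."""
    k = 2 * r + 1
    h, w = x.shape
    padded = np.pad(x, r, mode="edge")
    s = np.pad(padded, ((1, 0), (1, 0))).cumsum(axis=0).cumsum(axis=1)
    total = s[k:k + h, k:k + w] - s[:h, k:k + w] - s[k:k + h, :w] + s[:h, :w]
    return total / (k * k)


def local_variance(x, r):
    """Local variance of x in a window of radius r, clipped at zero against round-off."""
    mu = box_mean(x, r)
    return np.maximum(box_mean(x * x, r) - mu * mu, 0.0)


def esaeswc_weight(I, r, tiny=1e-12):
    """psi_I(k) of Eq. (17): 3 x 3 local deviation over the mean radius-r variance."""
    sigma_1 = np.sqrt(local_variance(I, 1))      # sigma_{I,1}(k), read as a standard deviation
    mean_var = local_variance(I, r).mean()       # Eq. (18)
    return (sigma_1 + tiny) / (mean_var + tiny)  # tiny avoids division by zero (assumption)


def esaeswc_coefficients(I, p, r, eps):
    """a_k from Eq. (20) and b_k from Eq. (8)."""
    psi = esaeswc_weight(I, r)
    mu_I, mu_p = box_mean(I, r), box_mean(p, r)
    cov_Ip = box_mean(I * p, r) - mu_I * mu_p
    a = cov_Ip / (local_variance(I, r) + eps / psi)
    b = mu_p - a * mu_I
    return a, b
```

When `p` is set equal to `I`, the coefficient returned by this sketch agrees numerically with the closed form of Eq. (28), and `b` reduces to \((1-a_{k})\mu _{I,\zeta _1}(k)\) as in Eq. (24).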

Finally, \(q_{i}\) is expressed as

$$\begin{aligned} q_{i}= & {} \frac{1}{\vert {\omega }_{\zeta {_1}}(i)\vert }\sum _{k\in \omega _{\zeta {_1}}(i)}(a_{k}I_{i}+b_{k}), \end{aligned}$$
$$\begin{aligned} q_{i}= & {} {\overline{a}}_{i} I_{i}+{\overline{b}}_{i}. \end{aligned}$$

After obtaining the linear coefficients \( a_{k}\) and \( b_{k}\), the filtered output \(q_i\) is the refined transmission map. It can be expressed as

$$\begin{aligned} q_i = \overline{t}(x)={\overline{a}}_{i} I_{i}+{\overline{b}}_{i}, \end{aligned}$$

where \({\overline{a}}_{i}\) and \({\overline{b}}_{i}\) represent the means of \( a_{k}\) and \( b_{k}\), respectively, over all windows that contain pixel i, and they are computed as

$$\begin{aligned} \overline{a}_{i}= & {} {\frac{1}{\vert \omega _{\zeta {_1}}(i)\vert }}{\sum _{k\in \omega _{\zeta {_1}}(i)}}{a_{k}}, \end{aligned}$$
$$\begin{aligned} \overline{b}_{i}= & {} {\frac{1}{\vert \omega _{\zeta {_1}}(i)\vert }}{\sum _{k\in \omega _{\zeta {_1}}(i)}}{b_{k}}. \end{aligned}$$

In order to preserve edge information in sharp regions, \(a_{k}\) should be close to 1 and \(b_{k}\) close to 0, whereas in smooth regions \(a_{k}\) should be close to 0 and \(b_{k}\) close to the local mean \(\mu _{I, {\zeta _{1}}}(k)\), as follows from Eq. (24) [26]. Finally, the dehazed output image is calculated by the following expression:

$$\begin{aligned} J(x) = \frac{I(x)-A}{\max (\overline{t}(x), t_0)} + A. \end{aligned}$$

where the value of \({t_0}\) is set to 0.1 [13] for avoiding noise amplification.
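The recovery step of Eq. (34) is a direct inversion of the haze model and can be sketched as follows. The function name is hypothetical; `I` is assumed to be an (H, W, 3) float array, `A` a length-3 vector, and `t_refined` the (H, W) refined transmission map.

```python
import numpy as np


def recover_scene_radiance(I, A, t_refined, t0=0.1):
    """J = (I - A) / max(t, t0) + A, with t0 bounding the division from below."""
    t = np.maximum(t_refined, t0)[..., None]  # broadcast over color channels
    return (I - A) / t + A
```

Clamping at `t0` keeps the output finite where the estimated transmission is close to zero, at the cost of leaving a small amount of haze in those regions.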

Fig. 2 Comparison of refined transmission maps of different edge-preserving filters. a Input hazy images [10], b DCP [12]-based transmission map, c refined transmission map by GIF [13], WGIF [26], GGIF [23], EGIF [28] and the proposed method, d dehazed output

5 Experimental Results and Analysis

The proposed algorithm is implemented and evaluated in MATLAB R2018a on a PC with an Intel(R) Core(TM) i7-6700 CPU @ 3.40 GHz, 8 GB of RAM, and a 64-bit operating system. The performance of the proposed method is tested on natural hazy, non-hazy, and synthetic images from different datasets, viz. Fattal’s (580 images) [10], NYU2 (650 images) [37], D-HAZY (400 images) [1], Haze-RD (760 images) [46], and O-HAZE (810 images) [2], and the outcomes are compared with the existing DCP [12], GIF [13], WGIF [26], GGIF [23], EGIF [28], DehazeNet [5], and RYF-Net [9] haze removal methods.

5.1 Qualitative Analysis

The proposed method is tested on about 3200 hazy, non-hazy, and synthetic images from the Fattal [10], NYU2 [37], D-HAZY [1], Haze-RD [46], and O-HAZE [2] datasets, and the outcomes are compared with seven state-of-the-art haze removal methods: DCP [12] is a prior-based dehazing method; GIF [13], WGIF [26], GGIF [23], and EGIF [28] are four edge-preserving image dehazing filters; and DehazeNet [5] and RYF-Net [9] are two deep-learning-based image dehazing methods. The transmission maps refined by GIF [13], WGIF [26], GGIF [23], EGIF [28], and the proposed method for the input hazy building image [10] are shown in Fig. 2. It is clear from Fig. 2 that the proposed method refines the raw transmission map more accurately than the existing methods. It removes halo artifacts, over-smoothing, and color distortion strongly and preserves edge information more precisely than the existing GIF [13], WGIF [26], GGIF [23], and EGIF [28] methods. Several hazy images from the Fattal [10], NYU2 [37], D-HAZY [1], Haze-RD [46], and O-HAZE [2] datasets are also tested. The dehazed outcomes of DCP [12], GIF [13], WGIF [26], GGIF [23], EGIF [28], DehazeNet [5], RYF-Net [9], and the proposed method are computed for five benchmark hazy images and presented in Figs. 3, 4, 5, 6, and 7, respectively, for visual comparison. It is clear from Figs. 3, 4, 5, 6, and 7 that the proposed method removes halo artifacts and over-smoothing strongly and preserves edge information in both flat and sharp regions more precisely than the existing DCP [12], GIF [13], WGIF [26], GGIF [23], EGIF [28], DehazeNet [5], and RYF-Net [9] methods.

Fig. 3 Dehazed outcomes of different haze removal methods and the proposed method. a Input hazy image [10], b DCP [12], c GIF [13], d WGIF [26], e GGIF [23], f EGIF [28], g DehazeNet [5], h RYF-Net [9], i the proposed method

Fig. 4 Dehazed outcomes of different haze removal methods and the proposed method. a Input hazy image [37], b DCP [12], c GIF [13], d WGIF [26], e GGIF [23], f EGIF [28], g DehazeNet [5], h RYF-Net [9], i the proposed method

Fig. 5 Dehazed outcomes of different haze removal methods and the proposed method. a Input hazy image [1], b DCP [12], c GIF [13], d WGIF [26], e GGIF [23], f EGIF [28], g DehazeNet [5], h RYF-Net [9], i the proposed method

Fig. 6 Dehazed outcomes of different haze removal methods and the proposed method. a Input hazy image [46], b DCP [12], c GIF [13], d WGIF [26], e GGIF [23], f EGIF [28], g DehazeNet [5], h RYF-Net [9], i the proposed method

Fig. 7 Dehazed outcomes of different haze removal methods and the proposed method. a Input hazy image [2], b DCP [12], c GIF [13], d WGIF [26], e GGIF [23], f EGIF [28], g DehazeNet [5], h RYF-Net [9], i the proposed method

Table 1 Objective evaluation on images in Fig. 3 by [31]
Table 2 Performance comparison on Fattal dataset [10]
Table 3 Performance comparison on NYU2 dataset [37]
Table 4 Performance comparison on D-HAZY dataset [1]
Table 5 Performance comparison on Haze-RD dataset [46]
Table 6 Performance comparison on O-HAZE [2]
Table 7 Execution time (in sec.)-based assessment

5.2 Quantitative Analysis

The proposed filter is compared objectively with the existing GIF [13], WGIF [26], GGIF [23], and EGIF [28] methods using an effective blind objective image quality metric [31]. The scores of these filters are calculated for different values of the regularization parameter \(\varepsilon \) and are listed in Table 1. As can be seen from Table 1, the scores of GIF [13] and WGIF [26] initially increase and then decrease for large \(\varepsilon \) values. GGIF [23] and EGIF [28] both have higher scores than GIF [13] and WGIF [26], but they also produce lower scores for large \(\varepsilon \) values \((\varepsilon = 0.4^{2}, 0.8^{2})\). In contrast, the scores of the proposed method increase with \(\varepsilon \) and decrease only slightly even for large \(\varepsilon \) values. In this paper, the peak signal-to-noise ratio (PSNR) [17], structural similarity index (SSIM) [40], fog aware density evaluator (FADE) [7], and CIEDE2000 [35] performance metrics are used to assess the proposed method. The performance metrics of DCP [12], GIF [13], WGIF [26], GGIF [23], EGIF [28], DehazeNet [5], RYF-Net [9], and the proposed method are calculated for hazy images from the Fattal [10], NYU2 [37], D-HAZY [1], Haze-RD [46], and O-HAZE [2] datasets, and their outcomes are furnished in Tables 2, 3, 4, 5, and 6, respectively. The blue boldface values in Tables 2, 3, 4, 5, and 6 indicate the best value.

5.2.1 Peak Signal-to-Noise Ratio (PSNR)

A higher peak signal-to-noise ratio (PSNR) [17] indicates a better image restoration result. It is expressed as

$$\begin{aligned} \hbox {PSNR} = 10\log _{10}\left\{ \frac{f^2_{\max }}{\text {MSE}}\right\} , \end{aligned}$$

where \(f_{\max }\) is the maximum gray level (255 for an 8-bit image). The mean squared error (MSE) is measured between the original and the restored image. It is written as

$$\begin{aligned} \hbox {MSE} = \frac{1}{m\times n}\sum _{i = 0}^{m-1}\sum _{j = 0}^{n-1}\left[ f_\textrm{o}(i,j)-f_\textrm{r}(i,j)\right] ^2. \end{aligned}$$

where \(m\times n\) represents the size of the image and \(f_\textrm{o}\) and \(f_\textrm{r}\) are the original and restored images, respectively.
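Eqs. (35) and (36) translate directly into code. The sketch below uses hypothetical function names and assumes that both images have the same shape; inputs are converted to floating point first so that unsigned 8-bit differences do not wrap around.

```python
import numpy as np


def mse(f_o, f_r):
    """Mean squared error between the original and restored images, Eq. (36)."""
    diff = np.asarray(f_o, dtype=np.float64) - np.asarray(f_r, dtype=np.float64)
    return float(np.mean(diff ** 2))


def psnr(f_o, f_r, f_max=255.0):
    """Peak signal-to-noise ratio in dB, Eq. (35); infinite for identical images."""
    err = mse(f_o, f_r)
    if err == 0.0:
        return float("inf")
    return 10.0 * np.log10(f_max ** 2 / err)
```

For example, two 8-bit images that differ by exactly one gray level at every pixel have an MSE of 1, so their PSNR equals \(20\log _{10}255\) dB.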

5.2.2 Structural Similarity Index (SSIM)

The structural similarity index (SSIM) [40] metric is used to quantify the structural difference between the source image and recovered image. It has a range \([-1,~1]\) and is expressed as

$$\begin{aligned} \hbox {SSIM} = F(L_c, C_c, S_c). \end{aligned}$$

where \(L_c, C_c\), and \(S_c\) represent the luminance, contrast, and structure comparison, respectively.
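As an illustration of how the three comparison terms combine, the sketch below computes a single-window (global) SSIM as the product of the luminance, contrast, and structure terms. This is a simplification and an assumption on our part: the function name is hypothetical, the stabilizing constants follow the values commonly used with SSIM (\(C_1=(0.01L)^2\), \(C_2=(0.03L)^2\), \(C_3=C_2/2\) for dynamic range L), and practical SSIM implementations average the index over local windows rather than using image-wide statistics.

```python
import numpy as np


def global_ssim(x, y, dynamic_range=255.0):
    """Single-window SSIM: product of luminance, contrast, and structure comparisons."""
    x = np.asarray(x, dtype=np.float64)
    y = np.asarray(y, dtype=np.float64)
    c1 = (0.01 * dynamic_range) ** 2
    c2 = (0.03 * dynamic_range) ** 2
    c3 = c2 / 2.0
    mu_x, mu_y = x.mean(), y.mean()
    sd_x, sd_y = x.std(), y.std()
    cov_xy = np.mean((x - mu_x) * (y - mu_y))
    luminance = (2 * mu_x * mu_y + c1) / (mu_x ** 2 + mu_y ** 2 + c1)
    contrast = (2 * sd_x * sd_y + c2) / (sd_x ** 2 + sd_y ** 2 + c2)
    structure = (cov_xy + c3) / (sd_x * sd_y + c3)
    return luminance * contrast * structure
```

An image compared with itself scores 1, while an image compared with its intensity-inverted copy yields a negative structure term, which shows how the lower part of the \([-1,~1]\) range arises.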

5.2.3 Fog Aware Density Evaluator (FADE)

The fog aware density evaluator (FADE) [7] metric evaluates the perceptual fog density in the restored image. A lower FADE value indicates a lower haze concentration.

5.2.4 CIEDE2000

The CIEDE2000 metric [35] measures the color fidelity between the source image and the dehazed image, with values in the range [0, 100]. A lower CIEDE2000 value indicates better color correction.

Fig. 8 Box plot [27]-based statistical illustration of different haze removal methods. a PSNR [17], b SSIM [40], c FADE [7], d CIEDE2000 [35]

Fig. 9 Visual outcomes in dense hazy condition

Fig. 10 Visual outcomes in nighttime hazy condition

5.3 Discussion

The performance metrics PSNR [17], SSIM [40], FADE [7], and CIEDE2000 [35] of the existing DCP [12], GIF [13], WGIF [26], GGIF [23], EGIF [28], DehazeNet [5], and RYF-Net [9] methods and of the proposed method are calculated for natural hazy, non-hazy, and synthetic images from the Fattal [10], NYU2 [37], D-HAZY [1], Haze-RD [46], and O-HAZE [2] datasets. For the best performance, PSNR [17] and SSIM [40] values should be higher, whereas FADE [7] and CIEDE2000 [35] values should be lower. The results are furnished in Tables 2, 3, 4, 5, and 6, where the blue boldface value in each row indicates the best measured value. Among the existing methods, the PSNR [17] and SSIM [40] values of EGIF [28], DehazeNet [5], and RYF-Net [9] are comparable to one another, while these values are usually lower for the DCP [12], GIF [13], WGIF [26], and GGIF [23] methods. RYF-Net [9] is the most competitive of the existing methods and retains structures more accurately than the rest. The FADE [7] and CIEDE2000 [35] values are higher for the DCP [12], GIF [13], WGIF [26], GGIF [23], and EGIF [28] methods. The deep-learning-based DehazeNet [5] and RYF-Net [9] methods usually produce more competitive FADE [7] values than the DCP [12], GIF [13], WGIF [26], GGIF [23], and EGIF [28] methods, and the same holds for the CIEDE2000 [35] scores. Tables 2, 3, 4, 5, and 6 show that, for all of these datasets, the PSNR and SSIM values of the proposed method are higher than those of the existing methods, as expected. The proposed method also attains the lowest FADE values on all datasets and the lowest CIEDE2000 values on the Fattal [10], NYU2 [37], Haze-RD [46], and O-HAZE [2] datasets; the D-HAZY [1] dataset is the exception. These results indicate that the proposed method performs better than the existing dehazing methods.
In addition to these quality metrics, the execution time of DCP [12], GIF [13], WGIF [26], GGIF [23], EGIF [28], DehazeNet [5], RYF-Net [9], and the proposed method is measured for the input hazy images in Figs. 3, 4, 5, 6, and 7, which have resolutions of \(250\times 200\), \(550\times 400\), \(850\times 600\), \(1000\times 950\), and \(1400\times 1200\), respectively, and is listed in Table 7. Table 7 shows that the proposed method runs faster than the fastest of the compared methods. The statistical analysis of the quality metrics PSNR [17], SSIM [40], FADE [7], and CIEDE2000 [35] for DCP [12], GIF [13], WGIF [26], GGIF [23], EGIF [28], DehazeNet [5], RYF-Net [9], and the proposed method is represented by the box plots [27] shown in Fig. 8a–d, respectively. The horizontal line within each box represents the median value. The box plots show that the proposed method has a higher median value for PSNR [17] and SSIM [40] and a lower median value for FADE [7] and CIEDE2000 [35] than the other existing methods. This indicates that the proposed method provides better dehazing outcomes than the existing DCP [12], GIF [13], WGIF [26], GGIF [23], EGIF [28], DehazeNet [5], and RYF-Net [9] methods. However, because of limited visibility and poor contrast, the proposed method fails in dense haze and nighttime hazy conditions. The failure cases of the existing DCP, GIF, WGIF, GGIF, EGIF, DehazeNet, and RYF-Net methods and of the proposed method on the dense haze dataset [3] and the nighttime haze dataset [45] are shown in Figs. 9 and 10, respectively. Finally, Figs. 2, 3, 4, 5, 6, 7, and 8 and Tables 1, 2, 3, 4, 5, 6, and 7 indicate that the proposed method strongly suppresses halo artifacts and over-smoothing and preserves edge information in both flat and sharp regions more accurately than the existing DCP [12], GIF [13], WGIF [26], GGIF [23], EGIF [28], DehazeNet [5], and RYF-Net [9] methods. Moreover, the proposed method is fast and preserves edge information in sharp regions more accurately than the existing haze removal methods.
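
The box-plot comparison in Fig. 8 rests on the median and quartiles of each method's metric scores. The following minimal Python sketch shows how such a summary could be computed for one method and one metric. The function name `five_number_summary` is hypothetical, the input values are placeholders rather than results from Tables 2, 3, 4, 5, and 6, and the quartile convention (the `inclusive` method of Python's `statistics.quantiles`) is an assumption, since plotting tools differ in how they interpolate quartiles.

```python
import statistics


def five_number_summary(scores):
    """Return (minimum, Q1, median, Q3, maximum) for a list of metric scores."""
    # quantiles() sorts the data internally and returns the three quartile cut points.
    q1, median, q3 = statistics.quantiles(scores, n=4, method="inclusive")
    return min(scores), q1, median, q3, max(scores)


# Placeholder scores for illustration only; these are not values from the paper.
placeholder_scores = [5.0, 1.0, 3.0, 2.0, 4.0]
summary = five_number_summary(placeholder_scores)
```

The third element of the returned tuple corresponds to the horizontal line drawn inside each box, and the second and fourth elements correspond to the lower and upper edges of the box.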

6 Conclusion

In this paper, an effective scale-aware edge-smoothing weighting constraint-based weighted guided image filter (ESAESWC-WGIF) is proposed to remove haze efficiently. In this filter, a new edge-aware weighting is incorporated into the cost function of the GIF. It refines the initial transmission map more accurately than the existing GIF, WGIF, GGIF, and EGIF methods. It strongly suppresses halo artifacts and over-smoothing and preserves edge information in both flat and sharp regions. Experimental results show that the proposed method yields better visual quality than the existing methods. About 3,200 images from the Fattal, NYU2, D-HAZY, Haze-RD, and O-HAZE datasets are used to test the performance of the existing and the proposed dehazing methods. We analyzed performance metrics such as PSNR, SSIM, FADE, and CIEDE2000 on these images, and the experimental results show that the proposed method restores the images with excellent visual quality. Moreover, the proposed method is independent of the nature of the input image and performs equally well on all datasets compared to the existing dehazing methods. It is noteworthy that the proposed method is faster than the existing methods for a given image resolution. However, the method fails in dense hazy and nighttime hazy conditions. The failure results in these conditions are shown in Figs. 9 and 10, respectively. There is therefore scope to devise a new method that can handle these conditions. We will explore the suitability of the proposed method for wider applications such as satellite, underwater, and low-light image dehazing.