1 Introduction

With the development of commercial and scientific exploration of the underwater environment, the quality of underwater images and video sequences plays an important role in many fields [38], such as object detection and classification [28, 59] and marine organism tracking [41]. Although specialized apparatus can improve imaging quality, it is costly and power-consuming, and it is inconvenient to carry when capturing underwater images during diving and snorkeling activities. Thus, improving underwater image quality through image enhancement and restoration techniques has received wide attention owing to its low cost. However, underwater image processing is challenging because of the complexity and diversity of the underwater environment. Light absorption and scattering are the two main causes of the visible degradation of underwater images: the former, wavelength-dependent attenuation, causes color distortion that increases with the distance light travels through water, while the latter refers to light being refracted and scattered by suspended particles, producing haze and blur. Therefore, it is meaningful to develop an effective preprocessing method that corrects color, enhances contrast, and improves the sharpness of underwater images for further applications.

In general, underwater image sharpening methods can be categorized into two types [25]: enhancement-based methods and restoration-based methods. Enhancement-based methods consider the quality perceived by humans or the performance of computer vision systems while ignoring the physical process of image degradation. Numerous enhancement methods have been proposed to improve underwater image quality; for example, white balancing [1, 35], histogram equalization (e.g., DHE [57], CLAHE [15]), unsharp masking [51], and color-transfer techniques [61] can be employed to remove color cast, increase contrast, and improve sharpness. In contrast, restoration-based methods take the underwater image formation model (UIFM) into account, wherein the parameters of the physical model are deduced from extra priors or scene information. Since scattering is a function of distance and the transmission varies within the image, many prior-based methods [3, 56] have been proposed to estimate the transmission map.

Dark channel prior (DCP) [21] is a widely used prior for outdoor image dehazing, based on the observation that haze density can serve as a useful depth clue for estimating the transmission map. Since the simplified UIFM is similar to the outdoor fogging model, many UIFM-based methods derived from DCP have been proposed for underwater image restoration. Benefiting from DCP, the transmission map can be acquired without estimating the depth map in advance. In [7], DCP was directly used to estimate the depth of turbid water and restore the clarity of underwater images. Chiang and Chen [8] combined DCP with wavelength compensation to remove haze and correct color distortion in underwater images. In [11], an underwater dark channel prior (UDCP) was developed that considers only the green and blue channels, since the red channel cannot provide dependable information for estimating the transmission map. Afterward, Galdran et al. [13] modified the DCP by inverting the red channel, considering that the intensity of the red channel decays rapidly as the distance increases; they also incorporated the saturation component into the red channel prior (RCP) to handle artificial illumination. In [58], Xie et al. proposed a normalized total variation method based on RCP for restoring underwater images. Likewise, Gao et al. [14] combined the red channel with the inverses of the green and blue channels to form a new degraded image and proposed a bright channel prior approach to estimate the transmission map. Recently, a generalization of DCP-based transmission estimation was proposed in [46], in which the relationship between image intensity and depth was modeled by linear regression to estimate the ambient light.

Another line of research estimates the transmission map by inferring depth information from different priors. For example, the maximum intensity prior (MIP) [5] extracts depth information by computing the intensity difference between color channels, denoted Dmip, and estimates the transmission map directly from Dmip rather than from a depth map. Peng et al. [45] proposed a method to estimate the scene depth using image blurriness and light absorption (IBLA). In [44], they further extended it to determine the distance between the closest scene point and the camera. Based on the analysis of a large number of underwater images, Song et al. [55] proposed an underwater light attenuation prior (ULAP) to estimate the scene depth. Afterward, Zhou et al. [63] introduced a color-line model to handle the degradation problem in the underwater environment and determined the local depth by non-linear optimization. In [43], the transmission map was estimated based on the observation that the scene depth is inversely proportional to the geodesic color distance from the background light.

In addition to the aforementioned methods, many other approaches have also been developed and achieved significant advances. Li et al. [32] proposed a dehazing approach based on a minimum information loss principle and a histogram distribution prior to improve the contrast and visibility of underwater images. The concept of haze-lines, which states that pixels belonging to the same color cluster are distributed along a straight line, was adopted for dehazing in [2, 40]. Moreover, a range of variational methods [23, 24, 37] have been proposed to address underwater image degradation. In recent years, since deep learning techniques have achieved great performance in natural image dehazing [49, 50], some works [47, 48] have attempted to adopt deep learning strategies to enhance and restore underwater images. For example, Cao et al. [4] adopted a multi-scale architecture to estimate the scene depth map. Ding et al. [10] presented a joint wavelength compensation and dehazing network (JWCDN) to estimate the background light, wavelength attenuation, and transmission map simultaneously. Treating depth map estimation as an image-to-image translation, Zhang et al. [62] proposed a depth generative adversarial network (DepthGAN) to obtain high-quality depth maps.

Based on this overview of underwater image restoration methods, we find that the scattering effect can be naturally removed if the scene depth is considered. However, most existing methods include a post-processing step to increase the visibility of the image, which may compromise the accuracy of the underlying scene radiance. In this paper, we argue that an accurate scene depth map is the key to successfully estimating the transmission map, and propose a novel restoration method for improving underwater image quality without any post-processing step. The main contributions of our work are as follows.

  (a) An efficient underwater image restoration method is presented based on the underwater image formation model, which can effectively remove haze, improve color rendition, and reveal more details.

  (b) Rather than directly estimating the transmission map, we first combine the oblique gradient operator (OGO) with the underwater light attenuation prior to extract the scene depth, and then recover the scene radiance depending on the UIFM.

  (c) Instead of simply picking the brightest pixel, we introduce a new scheme to determine the candidate region for estimating the background light based on quad-tree subdivision.

The rest of this paper is organized as follows. Section 2 briefly introduces the underwater image formation model and the characteristics of the underwater light attenuation prior. Section 3 describes the proposed method in detail. Section 4 presents the performance of the proposed method, including qualitative and quantitative comparisons and an application test. Finally, the conclusion is provided in Section 5.

2 Background and foundation

2.1 Underwater image formation model

The degradation model of underwater images proposed by Jaffe [30] states that the total energy ET detected by a camera consists of the direct component Ed, the forward scattering Efs, and the backscattering Ebs. Thus, the UIFM can be written as a linear superposition of these three components:

$$ {E}_T={E}_d+{E}_{fs}+{E}_{bs} $$
(1)

In Eq. (1), Ed is the light reflected by the object; Efs is similar to Ed but has been scattered at a small angle; and Ebs is the light reflected by other suspended particles. Assuming that the forward scattering can be neglected when the camera is close to the scene points, Schechner and Karpel [52] defined the direct component Ed and the backscattering Ebs as:

$$ {E}_d=J\,t $$
(2)
$$ {E}_{bs}=B\left(1-t\right) $$
(3)

where J is the scene radiance, B is a scalar that depends on the wavelength, and t is the transmission map representing the percentage of the scene radiance that reaches the camera. Based on the Schechner-Karpel model, the intensity of a degraded underwater image in Eq. (1) can be simplified as:

$$ {I}^c(x)={J}^c(x){t}_c(x)+{B}^c\left(1-{t}_c(x)\right) $$
(4)

where c ∈ {R, G, B} denotes the color channel, Ic(x) is the captured underwater image, and Jc(x) is the undistorted underwater image. From Eq. (4), we can observe that to recover Jc(x) from Ic(x), we first need to estimate the background light Bc and the transmission map tc(x). In the following, we also adopt this widely used simplified underwater image formation model [11, 13, 44] to restore underwater images.

In fact, accurate estimation of the background light and the transmission map is the basis of underwater image restoration. The background light Bc is often taken to be the intensity of the pixel with the maximum depth, assuming homogeneous lighting along the line of sight. However, it is difficult to find the farthest pixel in a single image. In an in-air image, the global airlight is often estimated as the color of the highest-intensity pixel [21], but objects brighter than the background lead to an incorrect estimation in the underwater environment.

The transmission tc can also be expressed as an exponential decay function of the scene depth:

$$ {t}_c(x)={e}^{-{\beta}_cd(x)} $$
(5)

where βc is the wavelength-dependent attenuation coefficient and d(x) is the distance from the camera to the scene point x. The scene depth d can be represented as the sum of the distance d0 from the nearest point to the camera and the infinity distance dn [56].
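For intuition, the simplified formation model of Eqs. (4) and (5) can be simulated directly. The NumPy sketch below is illustrative only and not part of the proposed method; the clean radiance J, the depth map d, the attenuation coefficients beta, and the background light B are placeholder values chosen to mimic bluish-green water.

```python
import numpy as np

def degrade(J, d, beta, B):
    """Simulate Eqs. (4)-(5): I = J * t + B * (1 - t), with t = exp(-beta * d).

    J    : clean image, H x W x 3, values in [0, 1]
    d    : scene depth map, H x W
    beta : per-channel attenuation coefficients, length-3 array
    B    : per-channel background light, length-3 array
    """
    t = np.exp(-np.asarray(beta)[None, None, :] * d[..., None])   # Eq. (5), H x W x 3
    I = J * t + np.asarray(B)[None, None, :] * (1.0 - t)          # Eq. (4)
    return I, t

# Toy example with a synthetic scene (placeholder values).
H, W = 64, 64
J = np.random.rand(H, W, 3)
d = np.tile(np.linspace(0.5, 8.0, W), (H, 1))    # depth increasing left to right
beta = [1 / 6, 1 / 12, 1 / 14]                   # red attenuates fastest
B = [0.1, 0.5, 0.7]                              # bluish-green water color
I, t = degrade(J, d, beta, B)
```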

2.2 Underwater light attenuation prior

Based on the above analysis, it can be concluded that the scene depth is a key clue for estimating the transmission map. Song et al. [55] proposed an effective scene depth estimation model based on an underwater light attenuation prior. The ULAP states that there is a strong correlation between the scene depth and the difference between the value of the red channel and the maximum value of the G-B channels.

Since the absorption of red light can be an order of magnitude greater than that of blue and green light, the intensity of the red channel attenuates faster than that of the green or blue channels as the depth increases. Specifically, in far regions the red light is attenuated severely, leading to a large difference between the maximum value of the G-B channels and the value of the red channel.

Based on this light attenuation prior, a linear model of the maximum value of the G-B channels and the value of the R channel was developed to estimate the depth map:

$$ d(x)={\theta}_0+{\theta}_1m(x)+{\theta}_2v(x) $$
(6)

where m(x) is the maximum value of the G-B channels, v(x) is the value of the R channel, and θ0, θ1, θ2 are coefficients. To obtain accurate values of θ0, θ1, θ2, the authors manually selected 100 proper depth maps obtained by [44] as training data and trained the model with supervised linear regression. Unfortunately, the ULAP method considers only light attenuation, which may fail in some cases, as shown in Fig. 1: large blue objects are often incorrectly estimated to be farther away, and if the color of the water body contains more red tones, the water is estimated to be closer than the foreground.
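For reference, Eq. (6) can be written in a few lines of NumPy. This is a minimal sketch that assumes an RGB image scaled to [0, 1]; the default coefficients are the values reported later in Section 3.2, taken from [55].

```python
import numpy as np

def ulap_depth(img, theta=(0.53214829, 0.51309827, -0.91066194)):
    """Relative depth from the underwater light attenuation prior, Eq. (6).

    img   : H x W x 3 RGB image in [0, 1]
    theta : (theta0, theta1, theta2) from supervised linear regression [55]
    """
    m = np.maximum(img[..., 1], img[..., 2])   # maximum of G and B channels
    v = img[..., 0]                            # R channel
    return theta[0] + theta[1] * m + theta[2] * v
```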

Fig. 1. Two failure examples of depth map estimation using ULAP

3 Proposed method

The proposed method is composed of three main parts involving background light estimation, scene depth estimation, and transmission map estimation, which will be explained in detail in the following subsections. The flowchart of the proposed method is shown in Fig. 2.

Fig. 2. Flowchart of the proposed method

3.1 Background light estimation

The background light refers to the color of the water body, which depends on the water type. Most existing methods assume that the color of the water can be obtained from at least one pixel in the image. Generally, the farthest region of an underwater image is regarded as the candidate region for the background light.

We assume that there exists an area that does not contain objects, in which the pixel intensities reflect the color of the water body. Since the amount of light absorption varies with wavelength, the dominant color in this area appears green or blue. At the same time, such an area has low variance. To detect the candidate area for the background light, we use an automatic search method based on quad-tree subdivision [31]. Considering both the color difference and the smoothness of the candidate region, the score of each sub-block is defined as:

$$ Score={S}_{\varDelta }+{S}_{\sigma } $$
(7)

where SΔ is the maximum difference between the larger of the G and B values and the R value:

$$ {S}_{\varDelta }=\max \left(\max \left(G(x),B(x)\right)-R(x)\right),x\in \Omega $$
(8)

and Sσ is defined as:

$$ {S}_{\sigma }=-\frac{1}{3}\sum \limits_{c\in \left\{r,g,b\right\}}{\sigma}_c $$
(9)

where σc is the standard deviation of the pixel values of channel c within the selected region Ω.

After that, the block with the highest score is further divided into smaller blocks until the block size falls below a predefined threshold. The final background light is calculated by averaging the pixel values inside the last block. The detailed procedure is described in Algorithm 1.

Algorithm 1. Background light estimation
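A compact sketch of the search in Algorithm 1, under our reading of Eqs. (7)-(9), is given below: each block is scored by the color-difference term SΔ and the smoothness term Sσ, and the best-scoring quadrant is subdivided until it becomes smaller than a threshold. The image is assumed to be an RGB array in [0, 1], and min_size is an assumed stopping parameter rather than a value stated in the paper.

```python
import numpy as np

def block_score(block):
    """Score = S_delta + S_sigma, Eqs. (7)-(9); block is an h x w x 3 RGB patch in [0, 1]."""
    s_delta = np.max(np.maximum(block[..., 1], block[..., 2]) - block[..., 0])   # Eq. (8)
    s_sigma = -np.mean([block[..., c].std() for c in range(3)])                  # Eq. (9)
    return s_delta + s_sigma

def estimate_background_light(img, min_size=16):
    """Quad-tree subdivision: repeatedly keep the best-scoring quadrant (Algorithm 1)."""
    block = img
    while min(block.shape[0], block.shape[1]) > min_size:
        h, w = block.shape[0] // 2, block.shape[1] // 2
        quads = [block[:h, :w], block[:h, w:], block[h:, :w], block[h:, w:]]
        block = max(quads, key=block_score)
    return block.reshape(-1, 3).mean(axis=0)   # average color of the final block
```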

In the following, four representative underwater images with different scenes (i.e., an image with a white object, horizontal-perspective and top-down-perspective images, and an image with a complex foreground) are selected to demonstrate the effectiveness and robustness of the quad-tree subdivision method. The results of background light estimation are presented in Fig. 3, where the finally selected block is filled with red.

Fig. 3. Examples of background light estimation

3.2 Scene depth estimation

To accurately estimate the distance from the farthest point to the closest one in an image, we also take the image gradient into consideration. The magnitude of the image gradient provides a rough estimate of depth information, based on the observation that regions containing far scene points are smoother than those containing close scene points and therefore produce smaller gradient values.

The magnitude of image gradient Gmag is computed as:

$$ {G}_{mag}=\sqrt{{G_x}^2+{G_y}^2} $$
(10)

where Gx and Gy are calculated by applying horizontal and vertical operators to patches of the image. A 3×3 patch is presented in Fig. 4a, where fc denotes the value of the central pixel and fk (k = 1, 2, …, 8) is the value of its k-th neighbor. This traditional gradient only captures changes along the x and y axes and is therefore unable to represent how the illumination changes in an arbitrary direction. Singh [54] applied a combined oblique gradient profile prior (OGPP) to hazy images and efficiently estimated their depth maps. The corresponding oblique gradient operator with a patch size of 3×3 is presented in Fig. 4b and is defined as:

$$ o\left(m,n\right)=\arctan \left(\frac{G_y}{G_x}\right)=\arctan \left(\frac{\sum \limits_{k=1}^8\left({f}_c-{f}_k\right)}{8}\right) $$
(11)
Fig. 4. (a) 3×3 image patch centered at fc; (b) oblique gradient operator
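The sketch below shows one plausible reading of Eqs. (10) and (11): Gx and Gy are obtained with standard 3×3 difference kernels, and the oblique response is the mean difference between the central pixel and its eight neighbours, passed through arctan as in Eq. (11). The exact weights of the operator in Fig. 4b are not reproduced here, so the kernels are assumptions made for illustration.

```python
import numpy as np
from scipy.ndimage import convolve

def oblique_gradient_magnitude(gray):
    """Approximate G_mag of Eqs. (10)-(11) for a grayscale image in [0, 1]."""
    kx = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], float)   # horizontal differences
    ky = kx.T                                                    # vertical differences
    gx = convolve(gray, kx, mode='nearest')
    gy = convolve(gray, ky, mode='nearest')
    # Oblique term: mean difference between the central pixel and its 8 neighbours.
    k_ob = np.full((3, 3), -1.0 / 8.0)
    k_ob[1, 1] = 1.0
    g_ob = convolve(gray, k_ob, mode='nearest')
    g_mag = np.sqrt(gx ** 2 + gy ** 2)                           # Eq. (10)
    return g_mag, np.arctan(g_ob)                                # arctan response as in Eq. (11)
```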

To better illustrate the process of estimating the depth map, an example is presented in Fig. 5. Based on Eq. (11), we first use the OGO to generate the gradient magnitude map Gmag of the degraded image, as shown in Fig. 5b. Assuming that the depth is locally constant within a small patch, we further apply a dilation operation and a hole-filling algorithm to improve Gmag (shown in Fig. 5c), which is expressed as

$$ {G}_{dilate}(x)={G}_{mag}(x)\oplus SE $$
(12)
Fig. 5. Depth map estimation. (a) Original image; (b) the gradient magnitude map; (c) the depth map after morphological dilation and hole-filling; (d) the refined depth map based on guided filter

In Eq. (12), the morphological structuring element (SE) is a square of width 7 pixels, and the dilation of an image I(x, y) by a structuring element b is defined as

$$ \left(I\oplus b\right)\left(x,y\right)=\underset{\left(s,t\right)}{\max}\left\{I\left(x-s,y-t\right)+b\left(s,t\right)\right\} $$
(13)

Here, the result is stretched to the range [0, 1] to obtain the gradient-based depth map

$$ {d}_{gmag}(x)=1- Strch\left({G}_{fill}(x)\right) $$
(14)

where Gfill is Gdilate after filling the holes generated by flat regions, and Strch(·) is the stretching function, defined as

$$ Strch(V)=\frac{V-\min (V)}{\max (V)-\min (V)} $$
(15)

Finally, we utilize the guided filter algorithm [22] to further refine the depth map, as shown in Fig. 5d.
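The gradient-based depth map of Eqs. (12)-(15) can then be assembled as in the sketch below. The hole filling is done with grayscale morphological reconstruction from scikit-image, and the optional refinement uses cv2.ximgproc.guidedFilter from opencv-contrib; both are stand-ins for the implementations actually used by the authors, and the filter radius and eps are assumed values.

```python
import numpy as np
from scipy.ndimage import grey_dilation
from skimage.morphology import reconstruction

def gradient_depth(g_mag, guide=None):
    """d_gmag from Eqs. (12)-(15): dilate, fill holes, stretch to [0, 1], invert."""
    g_dilate = grey_dilation(g_mag, size=(7, 7))                  # Eqs. (12)-(13), 7x7 square SE
    # Fill holes left by flat regions via morphological reconstruction by erosion.
    seed = g_dilate.copy()
    seed[1:-1, 1:-1] = g_dilate.max()
    g_fill = reconstruction(seed, g_dilate, method='erosion')
    strch = (g_fill - g_fill.min()) / (g_fill.max() - g_fill.min() + 1e-8)   # Eq. (15)
    d_gmag = 1.0 - strch                                          # Eq. (14)
    if guide is not None:                                         # guided-filter refinement [22]
        import cv2
        d_gmag = cv2.ximgproc.guidedFilter(guide.astype(np.float32),
                                           d_gmag.astype(np.float32), 15, 1e-3)
    return d_gmag
```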

Inspired by ULAP, we also incorporate its rapid and effective depth estimate dulap from Eq. (6) to guarantee a reliable depth map. This prior was explained in Section 2.2, and the coefficients θ0, θ1, θ2 are set to 0.53214829, 0.51309827, and −0.91066194, respectively, following the best learning results in [55].

The coarse depth map is computed by combining the depth map dgmag generated from the image gradient with the depth map dulap based on the light attenuation prior

$$ {d}_c(x)=w\,{d}_{ulap}(x)+\left(1-w\right){d}_{gmag}(x) $$
(16)

where w is the weight balancing the contributions of dulap and dgmag, which is determined from the red-channel information with the help of a sigmoid function

$$ w=\frac{1}{1+{e}^{-{\alpha}_1\left(r-{\alpha}_2\right)}} $$
(17)

where r is the average value of the red channel, α1 is the parameter controlling the slope of the curve, which is empirically set to 32, and α2 is the center of the sigmoid along the horizontal axis.

To obtain a more accurate scene depth, the distance between the nearest point and the camera also needs to be considered

$$ {d}_0=1-\underset{c\in \left\{r,g,b\right\}}{\max}\left(\frac{\max \left|{B}^c-{I}^c(x)\right|}{\max \left({B}^c,1-{B}^c\right)}\right) $$
(18)
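The combination of Eqs. (16)-(18) then reads as in the sketch below. The slope α1 = 32 comes from the text; α2 is not given a numerical value in the paper, so 0.5 is only a placeholder, and the image is again assumed to be RGB in [0, 1].

```python
import numpy as np

def coarse_depth(img, d_ulap, d_gmag, B, alpha1=32.0, alpha2=0.5):
    """Combine d_ulap and d_gmag (Eq. 16) with the sigmoid weight of Eq. (17),
    and compute the near-point offset d_0 of Eq. (18).
    alpha2 is not specified in the text; 0.5 here is a placeholder.
    """
    r = img[..., 0].mean()                                # average red-channel value
    w = 1.0 / (1.0 + np.exp(-alpha1 * (r - alpha2)))      # Eq. (17)
    d_c = w * d_ulap + (1.0 - w) * d_gmag                 # Eq. (16)
    B = np.asarray(B, float)
    d_0 = 1.0 - np.max(                                   # Eq. (18)
        np.abs(B[None, None, :] - img).max(axis=(0, 1)) / np.maximum(B, 1.0 - B))
    return d_c, d_0
```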

To demonstrate the effectiveness of the proposed scheme, five sample images and their depth maps obtained by different methods are shown in Fig. 6. The depth maps of UDCP [11] in Fig. 6b are obtained by inverting Eq. (5) with d(x) = log_{Nrer(r)}(tr(x)). The depth maps generated by IBLA [44] and our proposed method are presented in Fig. 6c and d, respectively. It can be seen that the UDCP method produces unsatisfactory depth maps since it only distinguishes the foreground from the background. The IBLA method works well in most cases but incorrectly estimates the white fish and the relative distances between objects in the foreground. In contrast, our method produces proper depth maps with much better visual quality. As shown in the first column of Fig. 6, for instance, only our method correctly assigns small depth values, i.e., closer to the camera, to the white object in the foreground.

Fig. 6. Comparison of depth estimation. (a) Original images; (b) depth maps using UDCP; (c) depth maps using IBLA; (d) depth maps using our proposed method

3.3 Transmission map estimation

The scene depth acquired by our method needs to be converted to an actual distance using a constant scaling factor D∞:

$$ d(x)={D}_{\infty}\times \left({d}_c(x)+{d}_0\right) $$
(19)

Then, the transmission map for each channel can be estimated based on Eq. (5).

In most cases, the attenuation coefficient of the red channel lies in \( {\beta}^r\in \left[\frac{1}{8},\frac{1}{5}\right] \) [44]. According to [32], the attenuation coefficients of the green and blue channels can then be calculated from the green-red and blue-red ratios:

$$ \frac{\beta^{c'}}{\beta^r}=\frac{\left(-0.00113{\lambda}_{c'}+1.62517\right){B}^r\left(\infty \right)}{\left(-0.00113{\lambda}_r+1.62517\right){B}^{c'}\left(\infty \right)},\quad {c'}\in \left\{g,b\right\} $$
(20)

where Bc is the background light and λc is the wavelength of each channel. The wavelength ranges of the different channels are 620 ~ 750 nm (red), 490 ~ 550 nm (green), and 400 ~ 490 nm (blue) [8]. Here, we set D∞ = 8, \( {\beta}^r=\frac{1}{6} \), and λc for the R, G, and B channels to 620 nm, 540 nm, and 450 nm, respectively. The transmission map estimation is described in Algorithm 2.
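A sketch of the transmission estimation in Algorithm 2, i.e., Eqs. (5), (19), and (20) with the constants stated above (D∞ = 8, βr = 1/6, λ = 620/540/450 nm), is given below. It is a re-implementation from the equations rather than the authors' code.

```python
import numpy as np

def transmissions(d_c, d_0, B, D_inf=8.0, beta_r=1.0 / 6.0,
                  wavelengths=(620.0, 540.0, 450.0)):
    """Per-channel transmission maps t_c from Eqs. (5), (19), and (20).

    d_c : coarse relative depth map in [0, 1]
    d_0 : offset for the nearest scene point (Eq. 18)
    B   : background light (R, G, B) in [0, 1]
    """
    d = D_inf * (d_c + d_0)                                # Eq. (19), actual distance
    lam_r, B = wavelengths[0], np.asarray(B, float)
    betas = [beta_r]
    for c, lam_c in zip((1, 2), wavelengths[1:]):          # green and blue via Eq. (20)
        ratio = ((-0.00113 * lam_c + 1.62517) * B[0]) / ((-0.00113 * lam_r + 1.62517) * B[c])
        betas.append(beta_r * ratio)
    return np.stack([np.exp(-b * d) for b in betas], axis=-1)   # Eq. (5), H x W x 3
```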

Once the background light B and the transmission map t are obtained, we can restore the scene radiance from Eq. (4). However, light-colored areas will be excessively restored when the transmission t approaches zero. To ensure that the restored results appear natural, we set a constant t0 = 0.1 as a lower bound of the transmission t. Finally, the restored image J is calculated using the following modified equation:

$$ {J}^c(x)=\frac{I^c(x)-{B}^c}{\max \left({t}_c(x),{t}_0\right)}+{B}^c $$
(21)
Algorithm 2. Transmission estimation
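Given B and the per-channel transmissions, Eq. (21) recovers the scene radiance. A minimal sketch with the lower bound t0 = 0.1 follows; the final clipping to [0, 1] is an added convenience for display and is not part of Eq. (21).

```python
import numpy as np

def restore(img, B, t, t0=0.1):
    """Scene radiance recovery, Eq. (21): J = (I - B) / max(t, t0) + B."""
    B = np.asarray(B, float)[None, None, :]
    J = (img - B) / np.maximum(t, t0) + B
    return np.clip(J, 0.0, 1.0)    # clip to the valid display range
```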

4 Experimental results and analysis

In this section, we first present some restored results and validate the proposed method. Then, the proposed method is compared with five other underwater restoration methods to evaluate their performance qualitatively and quantitatively. Finally, we examine the application of our approach to object segmentation.

4.1 Qualitative assessment

To verify the effectiveness of our proposed method, 20 underwater images with different degraded scenes (i.e., bluish, greenish, hazy, low-light, and turbid scenes) are selected from the UIEB dataset [33], as presented in Fig. 7a. The original images in the first row of Fig. 7a contain a large area of pure water body; as mentioned above, the background light is easy to estimate in this scenario. On the contrary, the degraded images in the bottom row of Fig. 7a contain thin mist or almost no water area, and most of them are close-up scenes of fish and coral reefs whose scene depth varies little. From Fig. 7b, it can be seen that the foreground colors of the restored images are well improved because more red is reproduced. These satisfying results demonstrate that our proposed method can effectively remove haze, correct color, and reveal more valuable information.

Fig. 7. Examples presenting the performance of our proposed method. (a) Original images; (b) the corresponding restored results of our proposed method

Furthermore, we compare our method with five recent competitive methods: the MIP method [5], the wavelength compensation and image dehazing (WCID) method [8], the UDCP method [11], the IBLA method [44], and the ULAP method [55]. Due to space limitations, Fig. 8 presents six representative images with different scene characteristics. The MIP method attempts to obtain a pseudo-depth Dmip from the difference between the maximum value of the red channel and that of the green and blue channels, and estimates the transmission through a simple shift of Dmip. As can be seen from Fig. 8b, the MIP method has little dehazing effect; in particular, when the haze is thick, colors become pale and the differences of Dmip among patches are very small, leading to an inaccurate depth estimate. Similarly, the restored images generated by the WCID method are also unsatisfactory, as shown in Fig. 8c, even though the haze is removed to some extent. The UDCP method takes the brightest pixels in the dark channel as an estimate of the background light, producing a darker scene radiance. Although UDCP performs well at dehazing, the restored results become darker overall and the color cast is even more serious; this is because it is derived from the DCP method and the estimated transmission has similar values across the whole scene. Additionally, as shown in Fig. 8d, UDCP renders the foreground of the restored images in a bluish or greenish tone due to the wrongly estimated transmission. Although the dehazing effects of the IBLA and ULAP methods are not as strong as that of UDCP, IBLA recovers more details, as shown in Fig. 8e (e.g., the top-left and bottom-left corners of the last two images), although some blur remains. Likewise, Fig. 8f shows that the thin haze is not removed by the ULAP method, and its contrast enhancement is not obvious; since the normalized residual energy ratio \( {Nrer}_c={e}^{-{\beta}_c} \) used in ULAP is fixed, it cannot adapt to various scenes and thus produces visually unnatural results. On the contrary, the restored results of our proposed method achieve superior performance in dehazing, enhancing contrast, and revealing details thanks to the more accurate depth estimation, as shown in Fig. 8g.

Fig. 8. Comparison of restored results by different methods. (a) Original images; (b)-(g) the restored results using the MIP, WCID, UDCP, IBLA, and ULAP methods and the proposed method, respectively

4.2 Quantitative assessment

In the following, we conduct quantitative evaluations of the restored results in Fig. 8. Because ground truth is unavailable, it is difficult to evaluate the quality of underwater images using full-reference metrics [18, 36]. Besides, no-reference (NR) quality assessment methods [16, 17, 19, 20, 53] designed for in-air images are also not suitable for underwater images with various degradations, including haze, low contrast, and non-uniform color distortion. Therefore, several no-reference metrics [42, 60] specially designed for evaluating underwater image quality have emerged. Here, we adopt six NR metrics, namely the underwater color image quality evaluation (UCIQE) [60], the underwater image quality measure (UIQM) [42], the no-reference quality assessment of contrast-distorted images (CDIQA) [12], the fog aware density evaluator (FADE) [9], the no-reference quality metric of contrast (NIQMC) [19], and the blind image quality measure of enhanced images (BIQME) [20], to quantitatively evaluate the performance of the compared restoration methods. The corresponding results for Fig. 8 are listed in Tables 1 and 2.

Table 1 Quantitative comparison of UCIQE and UIQM metrics. (The bold values represent the best results)
Table 2 Quantitative comparison of CDIQA, FADE, NIQMC and BIQME metrics. (The bold values represent the best results)

As shown in Table 1, our proposed method achieves the highest UCIQE values compared with MIP, WCID, UDCP, IBLA, and ULAP. Moreover, the UCIQE values obtained by our method are more stable; for example, for Image 5, the UCIQE values obtained by MIP and UDCP are even lower than that of the original image, whereas in most cases the values obtained by the proposed method are higher than 0.6, indicating a better balance among chroma, contrast, and saturation. For UIQM, Table 1 shows that our method outperforms the other methods in most cases, except for Image 3, where the UDCP result in Fig. 8c attains the highest value of 1.4701. Combined with the qualitative assessment, however, the results of UDCP suffer from underexposure, which sometimes boosts their UIQM scores. In contrast, our method achieves better visual results in increasing contrast and shows minimal performance fluctuation compared with the other methods.

Table 2 reports the quantitative measures of CDIQA, FADE, NIQMC, and BIQME. The highest CDIQA and NIQMC values obtained by our method indicate that it can significantly enhance contrast. For FADE, it is clear that both UDCP and our method outperform the other four methods; UDCP ranks first or second with respect to the ability to recover scene visibility, and WCID also performs well on some specific scenes (i.e., Image 3 and Image 5). Although our proposed method does not rank first for these two images, it still ranks in the top three among all methods, demonstrating that it can efficiently remove haze and produce a relatively clear scene. In terms of NIQMC, the proposed method scores above 4.9 on all six examples. For BIQME, WCID, UDCP, IBLA, and ULAP show unsatisfactory results and even yield lower scores than the original Image 6, while MIP achieves a particularly high BIQME score on Image 6 but performs unevenly on the other five tested images. All in all, the proposed method is more robust and achieves high scores across various metrics.

To further evaluate the effectiveness and robustness of the proposed method, we carry out experiments on the UIEB [33] and RUIE [39] datasets. Table 3 summarizes the average UCIQE, UIQM, CDIQA, FADE, NIQMC, and BIQME scores of the images restored by the compared methods. It is worth noting that the results of our proposed method are superior to those of the other five state-of-the-art methods in terms of these six objective NR quality assessment metrics. To sum up, both qualitative and quantitative experimental results demonstrate that our method achieves better performance in removing haze and improving contrast.

Table 3 Comparison of average UCIQE, UIQM, CDIQA, FADE, NIQMC, and BIQME of different restoration methods on the UIEB and RUIE datasets. (The bold values represent the best results)

4.3 Application test

To further assess the performance of the proposed method, we examine its application to image segmentation, which is a critical and essential task in many computer vision applications [26, 27, 29, 34]. In this section, we employ the original implementation of the Chan-Vese model [6], an important region-based image segmentation model, to evaluate the performance of our work.

Due to limited space, we display only two examples, as shown in Fig. 9. It can be seen from Fig. 9 that the restored results of MIP, UDCP, ULAP, and IBLA lead to large segmentation errors for Image 1, and WCID cannot even detect the fish, whereas our proposed method accurately extracts the edge information of the fish. The results of Image 2 recovered by ULAP, IBLA, and our method also show that restored versions with high visibility and contrast achieve good separation. Furthermore, the performance of the compared methods is measured using two common metrics, intersection over union (IoU) and the Dice similarity coefficient (DICE), which are among the most widely used indices for evaluating image segmentation. Both metrics measure the similarity between the segmentation result and a manually segmented reference mask, as in the sketch below. The results in Table 4 show that the proposed method achieves higher scores than the other five methods, which suggests that our method can improve the accuracy of conventional segmentation when used as a pre-processing step.
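To illustrate this evaluation pipeline, the sketch below segments a restored image with the Chan-Vese implementation in scikit-image and scores the result with IoU and DICE against a manually segmented reference mask. The scikit-image routine and its parameters are stand-ins for the original implementation of [6] used in the paper.

```python
import numpy as np
from skimage.color import rgb2gray
from skimage.segmentation import chan_vese

def iou_dice(pred, gt):
    """IoU and DICE between two binary masks."""
    pred, gt = pred.astype(bool), gt.astype(bool)
    inter = np.logical_and(pred, gt).sum()
    union = np.logical_or(pred, gt).sum()
    iou = inter / max(union, 1)
    dice = 2.0 * inter / max(pred.sum() + gt.sum(), 1)
    return iou, dice

def segment_and_score(restored_rgb, gt_mask):
    """Segment the restored image with Chan-Vese and score against the reference mask."""
    seg = chan_vese(rgb2gray(restored_rgb), mu=0.25)
    # The two Chan-Vese regions are unlabelled; keep the assignment that better matches the mask.
    return max(iou_dice(seg, gt_mask), iou_dice(~seg, gt_mask))
```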

Fig. 9. Applications in image segmentation. (a) Original images and segmentation results; (b)-(g) the restored images and their corresponding segmentation results using the MIP, WCID, UDCP, IBLA, and ULAP methods and our proposed method, respectively

Table 4 Quantitative comparison of IoU and DICE metrics. (The bold values represent the best results)

5 Conclusion

We present a novel restoration method for improving the quality of underwater images. The proposed method is based on the assumption that the magnitude of the image gradient provides a rough estimate of depth information. Initially, we utilize quad-tree subdivision to estimate the background light by considering both smoothness and color difference. Afterward, an oblique gradient operator and the underwater light attenuation prior are combined to estimate the scene depth. Subsequently, the transmission map is calculated from the acquired background light and scene depth. Finally, the scene radiance is obtained based on the UIFM without any post-processing. Experimental results demonstrate that the proposed method performs well across different degraded scenes, and the qualitative and quantitative comparisons show that it outperforms the five compared methods. Despite its good performance, our method also has some limitations; in particular, it does not satisfactorily recover images with non-uniform illumination caused by auxiliary light sources. In future work, we intend to enhance and restore underwater images under more challenging conditions.