1 Introduction

With the development of imaging technology, massive numbers of images are constantly produced by mobile phones and cameras. However, some of these images are of poor quality, such as images captured under low-light conditions, which can be inconvenient in daily life. Low visibility hinders many real applications, such as object detection [48] and tracking [38], image matting [35], and person re-identification [28], which are often used in traffic and criminal investigations. Therefore, enhancing low-light images has a significant effect on our daily work and life. Low-light images have a low dynamic range and suffer from heavy noise. For low-light image enhancement, it is a challenging task to balance the relationship among brightness, contrast and naturalness. Motivated by these problems, many advanced low-light image enhancement methods have been proposed in the past decades.

Histogram equalization (HE) [1] is the simplest and most intuitive method, which stretches the dynamic range of the observed image. Many extended HE methods have been developed [34, 39]. Nevertheless, these methods suffer from color distortion and often neglect the effect of noise, especially the noise hidden in dark areas. Inspired by the observation that an inverted low-light image looks like a hazy image [53], dehazing-based methods have been proposed to enhance low-light images [10, 26]. Although dehazing-based methods can generate relatively reasonable results, a convincing physical explanation of their basic model has not been provided. With the development of machine learning and deep learning, many low-light image enhancement networks have been proposed [42, 47, 50, 52, 55]. These methods require a large number of sample pairs: low-light images and corresponding normal-light images. Moreover, many sample pairs are artificially generated, resulting in unnatural enhanced results.

The most popular category of low-light image enhancement methods is based on the Retinex model. Retinex theory models the image in terms of the illumination and reflectance components [24]: I = R ∘ L, where I is the observed image, R represents the reflectance component of the image, and L represents the illumination component. The operator ∘ denotes pixel-wise multiplication. Retinex decomposition is a chained process: the estimation of the illumination directly affects that of the reflectance, so it is essential to estimate the illumination precisely. Variational Retinex-based methods regard the Retinex decomposition as an optimization problem by adding constraint terms on the reflectance and illumination components [27, 36, 46]. Nevertheless, these methods have high computational cost and cannot enhance the contrast well. Most of the existing Retinex-based methods regard R as the enhanced image [22, 40] and utilize the logarithmic transformation to reduce the computational complexity [7]. In the multi-scale Retinex (MSR) [21], the reflectance, obtained by multi-scale Gaussian filtering, serves as the enhanced image. However, MSR cannot preserve edge information or enhance the contrast to satisfy the human visual system (HVS). In this paper, the just-noticeable-difference (JND) model [20] is adopted as an adaptive adjustment for contrast enhancement, and the contrast is combined with JND to decide the variance of the multi-scale adaptive filter, which compensates for the drawback of MSR. In addition, the proposed method generates enhanced results by fusing the illumination and reflectance components to preserve the naturalness.

Based on the assumption that details may exist in multiple spatial frequency bands, the multi-layer model has been adopted in low-light image enhancement [14, 43]. However, these methods cause artifacts, ignore the enhancement of contrast, and cannot satisfy human visual perception. Only a few methods utilize multi-layer decomposition, and how to apply human visual perception to Retinex multi-layer decomposition remains an open problem.

In this paper, we perform perceptive low-light image enhancement via a multi-layer illumination decomposition model. The core concept is to make the illumination and reflectance components region-aware. To preserve the hue and saturation, Retinex decomposition is only applied to the value (V) channel of the input image in the HSV color space. First, the contrast is defined from local and global perspectives, and the effect of noise is integrated into the contrast. Second, the JND model and the defined contrast are combined to generate the perceptive contrast, which adaptively divides the contrast into high and low parts. Then, a multi-scale adaptive filter is proposed, which can enhance high and low contrasts, respectively. Finally, a novel multi-layer illumination decomposition model is presented to extract accurate illumination information, which also effectively yields the reflectance component. The final enhanced result is obtained by combining the decomposed multi-layer reflectance and adjusted illumination components. Figure 1 shows the general framework of the proposed method. The main contributions are summarized as follows:

  • Contrast Definition: In Section 3.1, the contrast is defined by considering the noise as well as the global and local characteristics, and it contributes to enhancing the contrast of dark areas while maintaining the contrast of bright areas.

  • Perceptive Contrast: In Section 3.2, the perceptive contrast is introduced by combining the JND model with the defined contrast. Perceptive contrast can divide the contrast into high and low parts, which require different enhancement.

  • Multi-layer Illumination Decomposition Model: In Section 3.3, the multi-layer illumination decomposition model is proposed, which can obtain the illumination accurately and preserve the naturalness by iteratively using the multi-scale adaptive filter. The perceptive contrast is utilized to decide the variance of the filter. The decomposed multi-layer spatial frequency bands are fused to obtain the final enhanced result.

Fig. 1 The general framework of the proposed method

Extensive experimental results demonstrate the effectiveness of the proposed method in terms of objective and subjective assessments. The proposed method can improve the contrast and brightness, as well as preserve the naturalness.

The rest of this paper is organized as follows: Section 2 gives a brief survey of related work. The details of the proposed method are provided in Section 3. Section 4 presents the experimental results and compares the proposed method with state-of-the-art methods in both subjective and objective assessments. In addition, a parameter study, computational time and limitations are also presented in this section. Finally, the work is concluded in Section 5. For ease of reading, the mathematical notations and acronyms used in this paper are listed in Table 1.

Table 1 Glossary of key acronyms and notations

2 Related work

2.1 Human visual perception model

The photoreceptors on the retina of the human eye perceive light and act as the sensors of the HVS [13]. There are two types of photoreceptors, namely rods and cones. Rods are sensitive to light and responsible for visual perception in low-light conditions (scotopic vision). There are three types of cones: red, green and blue. Cones are used for color vision in bright-light conditions (photopic vision), and they are less sensitive to light than rods. Neurons can only transfer signals within a narrow range, while the range that human eyes can adapt to is much wider. The HVS has been studied extensively. Karen et al. [33] proposed a multi-histogram equalization method using the HVS to segment the image into three regions: Devries-Rose, Weber and saturation regions. This method analyzes the relationship between background intensities and the contrast threshold. To balance the relationship between the HVS and luminance, Jayant [20] proposed a JND model based on the relationship between the actual luminance and the human visual perceptive brightness. The JND model can be employed to obtain the smallest brightness difference that the HVS can distinguish [51]. If the difference between the actual luminance of an image and the human visual perceptive luminance is smaller than the critical JND, these two luminance values are almost identical for human eyes, i.e., human eyes cannot perceive the difference in this case. The visibility threshold of the JND model can be expressed as follows [3]:

$$ \textit{JND}\left( t \right) = \left\{ \begin{array}{cc} 17 \times \left( 1 - \sqrt{\frac{t}{127}} \right) + 3 & t \le 127\\ \frac{3}{128} \times \left( {t - 127} \right) + 3 & otherwise \end{array} \right. $$
(1)

where t is the luminance value of the input image, ranging over [0, 255]. As shown in Fig. 2, the HVS is more sensitive to bright regions than to dark regions. The JND model plays an important role in many fields, such as perceptual image/video compression and image quality measurement.
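For concreteness, a minimal NumPy sketch of the threshold in (1) is given below; the function name jnd_threshold and the vectorized form are ours. At the endpoints it gives JND(0) = 20, JND(127) = 3 and JND(255) = 6, matching the curve in Fig. 2.

```python
import numpy as np

def jnd_threshold(t):
    """Visibility threshold of the JND model, (1); t is luminance in [0, 255]."""
    t = np.asarray(t, dtype=np.float64)
    dark = 17.0 * (1.0 - np.sqrt(t / 127.0)) + 3.0   # branch for t <= 127
    bright = (3.0 / 128.0) * (t - 127.0) + 3.0       # branch for t > 127
    return np.where(t <= 127.0, dark, bright)
```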

Fig. 2 The relationship between luminance value and visibility threshold

2.2 Low-light image enhancement methods

Histogram-based methods enhance low-light images by modifying the distributions of their histograms. Lim et al. [29] adopted an extended 2D-histogram-based image enhancement scheme to improve the contrast of structured images. Cheng et al. [2] utilized a mapping function to determine the γ value of Gamma correction based on the characteristics of the input histogram.

Dehazing-based methods enhance low-light images by inverting the input, dehazing the resulting haze-like image, and inverting it back. Dong et al. [6] dealt with inverted video frames by applying an optimized image dehazing algorithm, which utilizes the temporal correlations between subsequent video frames for parameter estimation. Li et al. [26] followed this technique to segment the observed image into superpixels and then denoise the segments adaptively using the Block-Matching 3D method (BM3D) [4]. Feng et al. [10] applied a dark channel prior K-means [37] classification method to estimate the transmission and atmospheric light.

Based on learning principles, many low-light image enhancement networks have been studied. Lore et al. [30] presented a low-light net to enhance the contrast, identify signal features and brighten images adaptively using deep autoencoders. Zhang et al. [54] proposed a novel network for kindling the darkness (KinD-Net) to achieve the decomposition of the Retinex model. By regarding the low-light image enhancement process as a residual learning problem, Wang et al. [41] proposed a deep lightening network (DLN-Net), which contains several lightening back-projection blocks to learn the residual for normal-light estimation. Different from these end-to-end mapping methods, Guo et al. [17] presented a novel deep curve estimation network (DCE-Net), which outputs high-order curves to adjust the dynamic range and preserve the contrast of neighboring pixels.

Retinex-based methods depend on the illumination and reflectance components. Wang et al. [44] split the observed image into reflectance and illumination components with a bright-pass filter, which preserves the naturalness and enhances details. Following the bright-pass filter technique, Fu et al. [12] adjusted the illumination map and then fused the derived inputs with the corresponding weights in a multi-scale fusion. The method proposed in [40] is based on a piecewise stretch of the extracted brightness component to improve the visual quality of the image. Wang et al. [43] designed a naturalness-preserved image enhancement method using prior multi-layer lightness statistics (NPIE-MLLS).

The variational Retinex method was initially introduced in [23]. However, since the logarithmic transformation in this method suppresses the variation of gradient magnitude in bright regions, the estimated reflectance component may lose some details. To overcome this disadvantage of the log-transformed domain, Fu et al. [11] proposed a maximum a posteriori method employed in the linear domain to achieve simultaneous illumination and reflectance estimation (PMSIRE). However, since the logarithmic transformation can simulate human visual perception of light intensity, Fu et al. [49] added reasonable weights to the constraint terms of illumination and reflectance to offset the weakness of the logarithmic transformation. In these methods, noise degrades the enhanced results and causes observable halo artifacts. Guo et al. [18] achieved low-light image enhancement via illumination map estimation (LIME), where BM3D is adopted for denoising after adding a structure-aware prior to the initial illumination map. Li et al. [27] proposed a structure-revealing low-light image enhancement method based on a robust Retinex model (SRLLIE), which regards the noise component as a part of the Retinex model; the noise component is obtained by solving the variational Retinex objective function. Xu et al. [46] estimated the illumination and reflectance components using a structure- and texture-aware model. Ren et al. [36] injected a low-rank regularized prior into the Retinex decomposition model (LR3M) to suppress the noise contained in the reflectance map. Hao et al. [19] estimated the illumination and reflectance components in a semi-decoupled decomposition way (SDD), denoising during the estimation of the reflectance.

3 Proposed method

This section presents the proposed method in detail. First, the definition of contrast is provided. Then, the perceptive contrast is stated by combining the JND model with the defined contrast to satisfy the human visual perception. Finally, the multi-layer illumination decomposition model is introduced.

3.1 Contrast definition

In this paper, the contrast is defined from both local and global aspects. Furthermore, the effect of noise is considered. The definition of contrast is introduced in detail below.

First, the contrast defined in [8] is adopted as the local contrast in this paper. It is estimated as the standard deviation computed within a Gaussian window:

$$ c_{l}\left( {x,y} \right) = \sqrt {\left( {{g_{\sigma} } * {V^{2}}} \right)\left( {x,y} \right) - {{\left( {{g_{\sigma} } * V} \right)}^{2}}\left( {x,y} \right)} $$
(2)

where cl(x,y) is the local contrast of the target pixel located at (x,y), ∗ is the convolution operator, V is the V channel of the observed image in the HSV color space, and gσ is a Gaussian kernel with standard deviation σ (set to 3). The matrix Cl denotes the local contrast of the entire image.
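A minimal sketch of (2) using SciPy's Gaussian filter (the function name is ours; the clip guards against tiny negative variances caused by floating-point error):

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def local_contrast(V, sigma=3.0):
    """Local contrast c_l of (2): std. deviation under a Gaussian window g_sigma."""
    V = np.asarray(V, dtype=np.float64)
    mean = gaussian_filter(V, sigma)              # g_sigma * V
    mean_sq = gaussian_filter(V ** 2, sigma)      # g_sigma * V^2
    return np.sqrt(np.clip(mean_sq - mean ** 2, 0.0, None))
```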

Second, the average and variance values are utilized to represent the global contrast, since they reflect the global information. The global contrast is expressed mathematically as follows:

$$ C_{g} = V_{var} - {\bar{V}^{2}} $$
(3)

where Vvar and \(\bar {V}\) are the variance and average values of V channel in the observed image, respectively. Cg indicates the global contrast.

Third, the effect of noise is considered. Since the HVS perceives visual information based on the grey-level differences of adjacent pixels rather than their absolute grey levels [5, 9], the relationship between adjacent pixels is analyzed and the grey-level difference is computed so as to avoid amplifying noise. The effect of noise is defined as follows:

$$ c_{n}(x,y) = \frac{1}{M}\sum\limits_{(m,n) \in {\varOmega} } {exp(- \frac{{{{(V(m,n) - V(x,y))}^{2}}}}{{2{{\tilde{\sigma}_{1}}^{2}}}}- \frac{{{{(m - x)}^{2}} + {{(n - y)}^{2}}}}{{2{{\tilde{\sigma}_{2}}^{2}}}} )} $$
(4)

where Ω represents the area within a certain distance (empirically set to 3) of the center (x,y), M is the number of pixels in Ω, and (m,n) is the location of a pixel in Ω. \({\tilde{\sigma}_{1}}\) and \({\tilde{\sigma}_{2}}\) are empirically set to 2 to control the scale. The first term represents the pixel-wise difference between the neighbouring area and the center, and the second term constrains the distance between (m,n) and (x,y); these two terms stand for grey-level similarity and positional proximity, respectively. cn(x,y) represents the effect of noise at the pixel located at (x,y). When a pixel is close to (x,y) and its grey value is close to V(x,y), the corresponding cn(x,y) is high; conversely, cn(x,y) is low. Cn denotes the effect of noise over the whole image.

Equation (4) illustrates that the nearer the pixels are, the more sensitive the HVS is to their grey-level difference. If the second term is fixed, cn(x,y) depends entirely on the first term. cn(x,y) satisfies the principle that its value is lower when the pixel contains more noise than its neighbours. That is to say, when a pixel is degraded by excessive noise, the noise has less effect on pixels that need enhancement. For example, as shown in Fig. 3, the pixel p1 lies within a flat region, which is unlikely to be noisy, so it is assigned a high cn value. In contrast, q1 is located in a noisy region and is assigned a low cn value. When the differences between the pixels of the neighbouring area and V(x,y) are no more than 2, i.e., |V(m,n) − V(x,y)| ≤ 2, these pixels are regarded as being at similar grey levels.
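A direct (unoptimized) sketch of (4) follows. The paper does not state whether Ω includes the centre pixel, so we exclude it here, and the neighbourhood shape (a square window of radius 3) is our assumption:

```python
import numpy as np

def noise_term(V, radius=3, sigma1=2.0, sigma2=2.0):
    """Noise effect c_n of (4), averaged over a square neighbourhood Omega."""
    V = np.asarray(V, dtype=np.float64)
    H, W = V.shape
    Vp = np.pad(V, radius, mode="reflect")
    acc = np.zeros((H, W))
    count = 0
    for dm in range(-radius, radius + 1):
        for dn in range(-radius, radius + 1):
            if dm == 0 and dn == 0:
                continue  # assume Omega excludes the centre pixel
            shifted = Vp[radius + dm: radius + dm + H,
                         radius + dn: radius + dn + W]
            grey = (shifted - V) ** 2 / (2.0 * sigma1 ** 2)    # grey similarity
            dist = (dm ** 2 + dn ** 2) / (2.0 * sigma2 ** 2)   # positional proximity
            acc += np.exp(-(grey + dist))
            count += 1
    return acc / count  # count plays the role of M, the number of pixels in Omega
```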

Fig. 3 An example of the effect of noise cn

Finally, the contrast \(\hat {C}\) of the target image is obtained by multiplying the matrix Cn with Cg and Cl, as follows:

$$ \hat{C} = C_{g} \cdot C_{l} \circ C_{n} $$
(5)

As shown in Fig. 3, p2 and q2 have the same cn values as p1 and q1, respectively. However, the contrast \(\hat C_{{{p_{1}}{q_{1}}}}\) is much larger than \(\hat C_{{{p_{2}}{q_{2}}}}\). This demonstrates that the defined contrast can not only enhance the contrast of dark areas, but also maintain the contrast of bright areas.
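Putting the three pieces together, (5) reduces to a scalar times two Hadamard factors; a sketch reusing the functions defined above:

```python
def combined_contrast(V, sigma=3.0):
    """Contrast C_hat of (5): scalar C_g times C_l, Hadamard-multiplied by C_n."""
    Cg = float(V.var() - V.mean() ** 2)   # global contrast, (3)
    Cl = local_contrast(V, sigma)         # local contrast, (2)
    Cn = noise_term(V)                    # noise effect, (4)
    return Cg * Cl * Cn
```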

3.2 Perceptive contrast

MSR convolves the original image with a Gaussian smoothing function to obtain the illumination layer, but it may cause halo artifacts along strong edges due to the assumption of smooth illumination. Therefore, in this paper, an adaptive filter is designed to reduce halo artifacts by adjusting the shape of the filter for high-contrast and low-contrast areas. The key is how to divide the defined contrast into high and low contrasts.

As illustrated in Section 2.1, the JND model satisfies the requirements of the human visual perception system, and it can also show the relationship between the visibility threshold and the luminance. In this paper, the defined contrast and the JND model are combined to obtain the threshold used in the contrast division, which is called perceptive contrast.

When the contrast is higher, the HVS perceives changes more sensitively under fixed luminance, i.e., the visibility threshold of the HVS is lower. Therefore, it is reasonable to assume that the contrast and the visibility threshold are inversely proportional, expressed as follows:

$$ \hat C \propto \frac{k}{{JND}} $$
(6)

where k is the proportionality coefficient, so that \(JND = \frac{k}{{\hat C}}\). Substituting this into (1) yields:

$$ \frac{k}{{\hat C}} = \left\{ {\begin{array}{*{20}{c}} {17 \times (1 - \sqrt {\frac{t}{{127}}} ) + 3}&{t \le 127}\\ {\frac{3}{{128}} \times (t - 127) + 3}&{otherwise} \end{array}} \right. $$
(7)

Equation (7) is a piecewise function. There are two situations that need to be discussed:

1) When t ≤ 127:

The mathematical expression of t is derived as follows:

$$ t = {(1 - \frac{{\frac{k}{{\hat C}} - 3}}{{17}})^{2}} \times 127 $$
(8)

When t ≤ 127, the target image is in low-light conditions. According to (8), t ≤ 127 requires \({(1 - \frac{{\frac{k}{{\hat C}} - 3}}{{17}})^{2}} \le 1\), and hence \( 0 < 1 - \frac{{\frac{k}{{\hat C}} - 3}}{{17}} \leq 1\), which further restricts the range of \(\hat C\). Based on this analysis, the range of \(\hat C\) is \(\frac{k}{{20}} \le \hat C < \frac{k}{3}\) under the condition t ≤ 127.
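As a quick endpoint check (our arithmetic), evaluating (8) at the boundary contrasts recovers the full dark half of the luminance range:

$$ \hat C = \frac{k}{20} \Rightarrow \frac{k}{\hat C} = 20 \Rightarrow t = \left(1 - \frac{20 - 3}{17}\right)^{2} \times 127 = 0, \qquad \hat C \to \frac{k}{3} \Rightarrow t \to \left(1 - 0\right)^{2} \times 127 = 127 $$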

2) When t > 127:

The formula of t is expressed as follows:

$$ t = (\frac{k}{{\hat C}} - 3) \times \frac{{128}}{3} + 127 $$
(9)

Since t is the luminance value of the input image within the range [0, 255], the remaining values lie in [128, 255] for 8-bit images. Similar to the analysis for t ≤ 127, the range of \(\hat C\) is [\(\frac{k}{{6}}\), \(\frac{k}{{3}}\)) under the condition t > 127.
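Again as an endpoint check (our arithmetic), evaluating (9) at the boundary contrasts recovers the bright half of the luminance range:

$$ \hat C = \frac{k}{6} \Rightarrow \frac{k}{\hat C} = 6 \Rightarrow t = (6 - 3) \times \frac{128}{3} + 127 = 255, \qquad \hat C \to \frac{k}{3} \Rightarrow t \to (3 - 3) \times \frac{128}{3} + 127 = 127 $$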

In summary, the perceptive contrast is a piecewise function and \(\frac{k}{{6}}\) is the demarcation value that divides the contrast into high and low parts. Pixels whose \(\hat C\) values lie in [\(\frac{k}{{20}}\), \(\frac{k}{{6}}\)) have low contrast and need more enhancement. On the contrary, pixels whose \(\hat C\) values lie in [\(\frac{k}{{6}}\), \(\frac{k}{{3}}\)) have relatively high contrast and need less enhancement.
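In code, this division is a simple mask (a sketch, with k = 1 as in Section 4.1):

```python
import numpy as np

def is_low_contrast(C_hat, k=1.0):
    """True where k/20 <= C_hat < k/6 (low contrast, needs more enhancement)."""
    return (C_hat >= k / 20.0) & (C_hat < k / 6.0)
```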

3.3 Multi-layer illumination decomposition model

3.3.1 Illumination decomposition by multi-scale adaptive filter

Different from the Gaussian filter, the bilateral filter uses not only geometric proximity but also the color difference between pixels. This is why the bilateral filter can effectively remove noise while preserving the edge information of the image. In addition, the bilateral filter is non-iterative and local. Therefore, the bilateral filter is adopted in this paper instead of the Gaussian filter used in MSR to estimate the illumination component, which reduces the effect of noise and avoids halo artifacts.

Inspired by the above, a multi-scale adaptive filter is proposed to estimate the illumination. The estimated illumination component is expressed as follows:

$$ {B_{s,l}}(x,y) = \frac{{\sum\limits_{(p,q) \in \tilde {\varOmega} } {\alpha \beta l(p,q)} }}{{\sum\limits_{(p,q) \in \tilde {\varOmega} } {\alpha \beta } }} $$
(10)

where \(\tilde{\varOmega}\) represents the neighbourhood at multiple scales around the center (x,y), and (p,q) represents the location of pixels within \(\tilde{\varOmega}\). s indexes the scale and takes the values 1, 2 and 4, representing the small, middle and large scales, respectively. Bs,l(x,y) is the result of filtering the image l at scale s. α and β are the geometric proximity and color similarity between pixels, expressed as follows:

$$ \alpha = {e^{- \frac{{{{(p - x)}^{2}} + {{(q - y)}^{2}}}}{{{\text{2}}{\sigma_{n}^{2}}}}}} $$
(11)
$$ \beta = {e^{- \frac{{{{[l(x,y) - l(p,q)]}^{2}}}}{{2{\sigma_{l}^{2}}}}}} $$
(12)

where σn and σl represent the geometric variance and color variance, respectively. In this paper, σn and σl are set to be equal, and their values are constrained by the perceptive contrast so that halo artifacts can be adaptively removed in the estimation of illumination. σn and σl are defined as follows:

$$ \sigma_{n} = \sigma_{l} = \left\{ {\begin{array}{*{20}{c}} {{\sigma_{0}}}&{\frac{k}{{20}} \le \hat c(x,y) < \frac{k}{6}}\\ {{\sigma_{1}}}&{otherwise} \end{array}} \right. $$
(13)

where \(\hat c(x,y)\) represents the contrast of the pixel located at (x,y). From (13), if the contrast is at least \(\frac{k}{20}\) and less than \(\frac{k}{6}\), σ0 is used to reduce the noise in smooth areas; otherwise, σ1 is used to maintain the edge information.
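A sketch of one scale of the adaptive filter, (10)-(13), follows. The window radius is our assumption (we tie it to the scale s), and the per-pixel variance is selected by the perceptive-contrast mask:

```python
import numpy as np

def adaptive_bilateral(l, C_hat, s, k=1.0, sigma0=3.3, sigma1=0.5):
    """One scale B_{s,l} of the multi-scale adaptive filter, (10)-(13)."""
    low = (C_hat >= k / 20.0) & (C_hat < k / 6.0)
    sigma = np.where(low, sigma0, sigma1)        # sigma_n = sigma_l, per (13)
    H, W = l.shape
    r = s                                        # assumed window radius = scale s
    lp = np.pad(l, r, mode="reflect")
    num = np.zeros((H, W))
    den = np.zeros((H, W))
    for dp in range(-r, r + 1):
        for dq in range(-r, r + 1):
            shifted = lp[r + dp: r + dp + H, r + dq: r + dq + W]
            alpha = np.exp(-(dp ** 2 + dq ** 2) / (2.0 * sigma ** 2))  # (11)
            beta = np.exp(-((l - shifted) ** 2) / (2.0 * sigma ** 2))  # (12)
            num += alpha * beta * shifted
            den += alpha * beta
    return num / den  # den >= 1 since the centre term contributes weight 1
```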

3.3.2 Multi-layer model

The multi-layer model is based on the essential assumption that details may exist in multiple spatial frequency bands. Therefore, the multi-scale adaptive filter is used to iteratively decompose the input image into multiple layers until the illumination map is uniform.

Figure 4 shows the overview of the multi-layer model. Each component Di(x,y) (starting from D0(x,y) = V(x,y)) is further decomposed into a low-frequency component Li+1(x,y) and a detail component Di+1(x,y). Li+1(x,y) is computed by filtering Di(x,y) with the multi-scale adaptive filter, as follows:

$$ {L_{i+1}}(x,y) = \sum\limits_{s} \xi_{s} {B_{s,D_{i}}}(x,y) $$
(14)

where ξs is the weight coefficient of each scale; ξ1, ξ2 and ξ4 are set to 0.33, 0.34 and 0.33, respectively. Each detail component Di+1(x,y) is obtained as Di(x,y) − Li+1(x,y), and each reflectance component as Ri+1(x,y) = Di(x,y)/Li+1(x,y). The brightness range of the low-frequency component shrinks as i increases; if the layer number is large enough, the brightness of the last layer becomes completely uniform. Each Ri(x,y) covers a unique frequency band of the original image, and all of them are integrated into the final enhanced image to preserve details.
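The layer recursion can then be sketched as below (our assumptions: D0 = V, an eps guard against division by zero, and the relative-change stopping test of Section 3.3.2):

```python
import numpy as np

def decompose(V, C_hat, w=10, tol=5e-3, eps=1e-6,
              xi=(0.33, 0.34, 0.33), scales=(1, 2, 4)):
    """Multi-layer decomposition of Fig. 4: L_{i+1} = sum_s xi_s B_{s,D_i},
    D_{i+1} = D_i - L_{i+1}, R_{i+1} = D_i / L_{i+1}, starting from D_0 = V."""
    D = V.astype(np.float64)
    Ls, Rs = [], []
    for _ in range(w):
        L = sum(x * adaptive_bilateral(D, C_hat, s) for x, s in zip(xi, scales))
        R = D / np.maximum(L, eps)   # eps guard is our addition
        if Ls and np.linalg.norm(L - Ls[-1]) / np.linalg.norm(Ls[-1]) < tol:
            break                    # illumination change below threshold
        Ls.append(L)
        Rs.append(R)
        D = D - L
    return Ls, Rs
```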

Fig. 4 The overview of the multi-layer model. Di(x,y) indicates the detail component in the ith layer; Li(x,y) and Ri(x,y) represent the illumination and reflectance components in the ith layer, respectively; \({L_{i}}^{\gamma }(x,y)\) indicates the adjusted illumination; R(x,y) and \(\hat {L}(x,y)\) stand for the integrated reflectance and illumination, respectively; \(\hat {V}(x,y)\) is the enhanced V channel image

In the proposed multi-layer model, the image V (x,y) is decomposed into multiple components, including multiple illumination and reflectance maps. After obtaining the reflectance and illumination components, the next task is to adjust the illumination to improve the visibility. In this paper, Gamma correction is applied. The enhanced V channel image \(\hat {V}\) is generated by:

$$ \hat V(x,y) = (\sum\limits_{i = 1}^{w} {{\beta_{i}}} {L_{i}}^{\gamma} (x,y)) \cdot (\sum\limits_{i = 1}^{w} {{\eta_{i}}} {R_{i}}(x,y)) $$
(15)

where w represents the number of layers and γ is empirically set to 2.2. When i = 1, L1(x,y) is obtained by directly decomposing the original V channel image V(x,y). As the layer number increases, the illumination and the corresponding reflectance components tend to be closer to the actual illumination and reflectance, respectively. Therefore, their weights βi and ηi should increase with i, and are set to \(\frac{2i-1}{w^{2}}\). In practice, the decomposition stops once the difference between Li(x,y) and Li+1(x,y) (or between Ri(x,y) and Ri+1(x,y)) is smaller than a threshold, i.e., \(\| L_{i+1}(x,y) - L_{i}(x,y) \| / \| L_{i}(x,y) \| < 5 \times 10^{-3}\) (or \(\| R_{i+1}(x,y) - R_{i}(x,y) \| / \| R_{i}(x,y) \| < 5 \times 10^{-3}\)), or once the maximal number of layers is reached. The illumination in (15) does not need to be normalized before Gamma correction since the input V channel image is already normalized to [0, 1]. Finally, the enhanced HSV image is transformed to the RGB color space, generating the final result \(\hat S\).
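A sketch of the fusion in (15); the clip before Gamma correction is our safeguard against negative values that the subtraction Di − Li+1 can introduce in deep layers:

```python
import numpy as np

def fuse(Ls, Rs, gamma=2.2):
    """Fusion of (15) with beta_i = eta_i = (2i - 1) / w^2."""
    w = len(Ls)
    wts = [(2 * i - 1) / w ** 2 for i in range(1, w + 1)]  # weights sum to 1
    L_hat = sum(b * np.clip(L, 0.0, None) ** gamma for b, L in zip(wts, Ls))
    R_hat = sum(e * R for e, R in zip(wts, Rs))
    return L_hat * R_hat  # enhanced V channel; convert HSV back to RGB afterwards
```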

4 Experimental results and analysis

First, the experiment settings are presented in this section. Second, the proposed method is evaluated by comparing it with the state-of-the-art low-light image enhancement algorithms to demonstrate the efficiency and effectiveness of our method. Third, the parameters used in the proposed method are analyzed. Fourth, the computational time of the proposed method is discussed. Finally, the limitations and future directions of our work are also pointed out.

4.1 Experiment settings

To comprehensively evaluate the proposed method, we test it on images with different illumination conditions. The test images are obtained from the NPE [44], LIME [18], MEF [31], MF [12] and DICM [25] datasets. All images in these datasets have low contrast in local areas and severe illumination variation across the global space.

Unless otherwise stated, the parameter k in (6) is 1, and the parameters σ0 and σ1 in (13) are 3.3 and 0.5, respectively. In addition, the parameter w in (15) is set to 10. These parameters are analyzed in Section 4.3.

4.2 Quality assessment

The proposed method is compared with several state-of-the-art methods, including seven traditional algorithms (MSR [21], PMSIRE [11], LIME [18], NPIE-MLLS [43], SRLLIE [27], LR3M [36] and SDD [19]) and three learning-based networks (KinD-Net [54], DLN-Net [41] and DCE-Net [17]). Experiments on traditional methods are performed in MATLAB R2016b on a computer running Windows 10 with an Intel(R) Core(TM) i5-3470 CPU and 8.00 GB of RAM. The results of these methods are generated using the codes downloaded from the authors' websites, with parameter settings identical to the original papers. Learning-based methods are trained with the datasets in [17] and run on an Nvidia RTX 2080Ti GPU with 11 GB of memory.

4.2.1 Subjective assessment

Due to space limitations, this section only presents some representative results, as shown in Figs. 5, 6, 7, 8, 9 and 10.

Fig. 5 Enhanced results of Eave with different methods

Fig. 6 Enhanced results of Street with different methods

Fig. 7 Enhanced results of Aquarium with different methods

Fig. 8 Enhanced results of Station with different methods

Fig. 9 Enhanced results of Park with different methods

Fig. 10 Enhanced results of Rock with different methods

Since MSR simply takes the reflectance as the final result, the enhanced images shown in Figs. 5(b)-10(b) do not preserve the naturalness. For example, the color of the sky is close to white in Fig. 6(b). In addition, since the Gaussian filter cannot maintain the contrast of edges, the enhanced results by MSR tend to lose some details, such as the sparse texture of rocks in Fig. 10(b). Compared with MSR, the proposed algorithm adopts the perceptive contrast to enhance the contrast adaptively, which can preserve the naturalness, maintain detail information and improve the contrast well.

PMSIRE incorporates Bayes' theorem into Retinex theory to formulate a probabilistic model, which can simultaneously estimate the reflectance and illumination components in the linear domain. Figures 5(c)-10(c) show the results of PMSIRE. The overall brightness of the enhanced images is not well improved, even though the results are impressive in places. For example, in Fig. 5(c), the area under the eave is still buried in darkness. Moreover, PMSIRE cannot enhance the contrast, as shown by the smooth structure of the aquarium in Fig. 7(c). Comparatively, the proposed algorithm achieves a good tradeoff among detail, contrast, brightness and naturalness.

LIME is an effective method to adjust the brightness of low-light images, which adds a structure-aware prior to the initial illumination map. As shown in Figs. 5(d)-10(d), the enhanced results have good brightness. Nevertheless, owing to the lack of a constraint on the reflectance, this method can easily over-enhance high-intensity regions, such as the over-exposed areas of glass in Aquarium. Furthermore, the enhanced images exhibit color distortion, such as the color of the roof in Fig. 6(d) and the grass in Fig. 9(d). The color distortion has a negative impact on human visual perception, so the enhanced image loses naturalness. LIME utilizes the BM3D method to reduce the effect of noise; however, the denoising may blur the results, as shown by the profile of pedestrians in Station. In contrast, the proposed method generates more natural results while also considering the influence of noise.

NPIE-MLLS attempts to preserve the naturalness of images. It is effective in improving the local contrast in dark areas and keeping the lightness order of the image. As shown in Figs. 5(e)-10(e), most of the results are satisfactory. However, the brightness is not improved in some regions, such as the sky in Fig. 8(e). NPIE-MLLS cannot keep spatial consistency with the original image since it does not consider human visual perception. In Fig. 8(a), the pixel values in the sky area are higher than those in other areas; in the enhanced result in Fig. 8(e), although the brightness of the areas other than the sky is increased, the brightness of the sky is reduced. In addition, there are some artifacts, such as the black areas in the balcony of the green building in Fig. 6(e). Unlike NPIE-MLLS, the proposed method preserves spatial consistency with the original image and avoids artifacts by using human visual characteristics.

SRLLIE presents a robust Retinex model by considering the effect of noise, and utilizes the gradient of the reflectance to reveal structural details. SRLLIE can estimate the reflectance, illumination and noise components effectively. From Figs. 5(f)-10(f), it is obvious that SRLLIE obtains satisfactory results in most cases. Nevertheless, the contrast is not enhanced effectively: the contrast of the cars in Fig. 6(f) is low, and the frame of the eave in Fig. 5(f) is over-smoothed and blurry. This is because SRLLIE only applies the local characteristic to the Retinex variational model, while overlooking contrast enhancement. Furthermore, the hue of the images enhanced by SRLLIE is not natural; as shown in Fig. 8(f), color distortion exists in the sky. Compared with SRLLIE, the images enhanced by the proposed method have higher contrast and more natural brightness.

LR3M injects a low-rank prior into the Retinex variational model to suppress the noise in the reflectance component. Figures 5(g)-10(g) show the results of LR3M. The illumination estimated by LR3M is too smooth, which causes the loss of detail in the enhanced results. For example, in Fig. 5(g), the texture of the eave is not distinguishable, and the structural information, such as the outline of the cars in Fig. 6(g), is not clear either. Due to the lack of a color constraint in LR3M, slight color distortion exists in the enhanced results; for instance, the sky in Fig. 8(g) tends to be light blue. Different from LR3M, our method maintains color consistency and enhances the contrast by applying the perceptive contrast. Furthermore, the naturalness is also well preserved in the enhanced results of the proposed method.

SDD estimates the illumination and reflectance components with a semi-decoupled decomposition model, which obtains the illumination by an edge-preserving image filter and suppresses the noise during the reflectance estimation. The enhanced results of SDD, shown in Figs. 5(h)-10(h), are natural but lose some details. For example, the texture of the sorghum in Fig. 5(h) is indistinct, and the outline of the cars in Fig. 6(h) is blurred, leading to low contrast in the entire image. This is because SDD over-smooths the illumination. Moreover, SDD does not balance the relationship between brightness and contrast, which causes unreasonable exposure: as shown in Figs. 5(h) and 9(h), the sky areas are over-exposed. Compared with SDD, the proposed method enhances the brightness naturally and improves the contrast reasonably by utilizing the characteristics of the HVS.

KinD-Net is a deep network designed to decompose the Retinex model, trained with paired samples captured under different light conditions. Figures 5(i)-10(i) show the images enhanced by KinD-Net. The brightness of these results is satisfactory, but they exhibit color distortion since the illumination adjustment net of KinD-Net lacks natural samples. As shown in Fig. 6(i), the floor is off-white, and the water in the aquarium in Fig. 7(i) is bluish green, which is inconsistent with the hue of the original image. Furthermore, KinD-Net does not improve detail information, as its restoration net ignores the contrast; see the low contrast of Fig. 8(i). The sky area in Fig. 9(i) also contains obvious artifacts. Different from KinD-Net, the proposed method avoids color distortion, reduces artifacts, and maintains detail information.

DLN-Net is an end-to-end structure based on convolutional neural networks, which contains several lightening blocks and feature aggregations to enhance the low-light image from global and local aspects. From Figs. 5(j)-10(j), it is obvious that DLN-Net results in over-enhancement, especially in the sky area. Over-exposure causes the loss of details, such as the blurred balustrade in Fig. 6(j). This is because the training samples are synthetic. Moreover, these synthetic images lack naturalness, leading to color distortion, such as the drain holes in Fig. 10(j). Unlike DLN-Net, the proposed method can avoid over-exposure, improve detail information, and preserve the naturalness simultaneously.

DCE-Net enhances the low-light image by an image-to-curve network, which is trained without any paired samples. It is novel in producing high-order curves to adjust the dynamic range of the input image. Nevertheless, since some over-exposed images are used in the training set, the enhanced images take on a gray tone, as shown in Figs. 5(k)-10(k). For example, the sky in Fig. 5(a) is blue, while it becomes gray-blue after enhancement by DCE-Net, as shown in Fig. 5(k). Furthermore, the detail information is not improved well: as shown in Fig. 7(k), many fish in the aquarium are too blurry to discern. Compared with DCE-Net, the results of the proposed method show more details and keep the color consistent.

These results demonstrate that the proposed method can improve the brightness, enhance the contrast, preserve the naturalness and reduce halo artifacts.

4.2.2 Objective assessment

Since the ground truth of the enhanced image is unknown for a low-light image, several blind objective assessments are employed to quantify the quality: the blind tone-mapped quality index (BTMQI) [15], the natural image quality evaluator (NIQE) [32], and the no-reference image quality metric for contrast distortion (NIQMC) [16]. These metrics are widely used to measure visual quality from different aspects: NIQMC and BTMQI evaluate the contrast, while NIQE assesses the naturalness of the enhanced results. Tables 2, 3 and 4 show the objective assessment results on the five test datasets. The best objective value is bolded, and the second best value is underlined.

Table 2 Quantitative measurement results of BTMQI
Table 3 Quantitative measurement results of NIQE
Table 4 Quantitative measurement results of NIQMC

The first assessment metric is BTMQI, which measures the average intensity, contrast and structure information of the tone-mapped image. A smaller BTMQI value represents better image quality. From Table 2, the average BTMQI value of the proposed method on five datasets is the smallest among all methods for comparison. This indicates that the proposed method can preserve the naturalness of the image in terms of illumination, contrast, structure, color and scene.

The second metric is NIQE, which assesses the image quality by measuring the distance between the model statistics extracted from natural and undistorted images. A lower NIQE represents higher image quality. As shown in Table 3, the proposed method has the lowest average value of NIQE among all algorithms on all test datasets. This means the results of the proposed method are the closest to natural images and have the least distortion.

The last metric is NIQMC, which utilizes local detail and the global histogram to assess the quality of images. For NIQMC, a larger value indicates better quality in terms of contrast. As shown in Table 4, LIME has the highest average NIQMC value, which is 0.062 higher than that of the proposed method. However, LIME exhibits over-exposure, such as the glass of the aquarium in Fig. 7(d), and it performs worse than our method on the other objective assessments.

Furthermore, NNB [45] is utilized to comprehensively evaluate the performance of these methods; it integrates BTMQI, NIQE and NIQMC as follows:

$$ NNB = 2 \times NIQMC - NIQE - BTMQI $$
(16)

For NNB, a larger value represents better image quality. The test results are shown in Fig. 11. The proposed method has the highest NNB value, which further demonstrates that it is the best among the compared algorithms from the objective perspective.

Fig. 11 NNB values of compared methods

In summary, the proposed method generates images with better visual quality both in subjective and objective assessments in most cases. The enhanced images have almost no artifacts and show improvement on the aspects of detail, brightness and naturalness.

4.3 Parameter study

Figure 12 shows 12 examples of test images, which are randomly collected from the above datasets and NASA images to demonstrate the reliability of the parameters in the proposed method.

4.3.1 Parameter k

Since the contrast varies from 0 to 1, the inverse-proportionality parameter k in (6) is varied from 1 to 3. The enhanced images with different k values are evaluated by the three objective assessment indexes. Figure 13 shows the average objective assessment results for the 12 example images shown in Fig. 12. When k is 1, the average NIQMC value is the highest, and the average BTMQI and NIQE values are the smallest. When k increases from 2 to 3, the objective assessments tend to be stable. Therefore, in this paper, k is set to 1, which achieves good enhancement of low-light images.

Fig. 12 Examples of test images

Fig. 13 Average objective assessments with different k

4.3.2 Parameters σ0 and σ1

The impact of the parameters σ0 and σ1 in (13) is shown in Fig. 14. The average objective results are obtained from the 12 example test images with different (σ0, σ1) pairs, namely (3.3, 0.5), (3.3, 0.05), (33, 0.5) and (33, 0.05). From Fig. 14, when (σ0, σ1) is (3.3, 0.5), NIQE and BTMQI have the lowest average values, while the corresponding NIQMC has the largest average value. For (3.3, 0.05), (33, 0.5) and (33, 0.05), the average NIQE and BTMQI values are large and the average NIQMC value is small. This is because the order of magnitude of σ0 is much larger than that of σ1 in these cases, which prevents the bilateral filter from playing an important role in avoiding halo effects. Therefore, σ0 and σ1 are set to 3.3 and 0.5, respectively, which generates satisfactory results in most cases.

Fig. 14 Average objective assessments with different parameter pairs (σ0, σ1)

4.3.3 Parameter w

The effect of the layer number w in (15) is analyzed using the image in Fig. 12(k), with w varying from 1 to 15. Figure 15 shows the results of the three blind objective assessments with different layer numbers. When the layer number is 10, NIQMC has the largest value, while NIQE and BTMQI have the lowest values. When the layer number increases from 10 to 15, the objective assessments are stable. This means that the proposed method achieves the best results when the layer number is 10; therefore, w is set to 10 in this paper. In addition, Fig. 15 demonstrates that the multi-layer model has an advantage over the single-layer model.

Fig. 15 Objective assessments with different layer numbers for A man

Figure 16 plots the convergence curves for the 12 example test images and gives an intuitive demonstration of the convergence speed of the proposed method. The curves show that the algorithm converges within 10 layers for all 12 images, so setting the maximum number of layers to 10 is sufficient to achieve convergence.

Fig. 16 Convergence curves of the 12 example test images. The stopping criterion is that the difference of illumination or reflectance between adjacent layers is smaller than a threshold, i.e., \(\| L_{i+1}(x,y) - L_{i}(x,y) \| / \| L_{i}(x,y) \| < 5 \times 10^{-3}\) (or \(\| R_{i+1}(x,y) - R_{i}(x,y) \| / \| R_{i}(x,y) \| < 5 \times 10^{-3}\))

4.4 Computing efficiency

Since the proposed method belongs to the family of Retinex-based algorithms, the comparison of computing efficiency is only performed against seven traditional state-of-the-art methods (MSR [21], PMSIRE [11], LIME [18], NPIE-MLLS [43], SRLLIE [27], LR3M [36] and SDD [19]). The proposed method takes approximately 186.655 s to process one image of size 1312 × 2000. MSR, PMSIRE, LIME, NPIE-MLLS and SDD require approximately 13.129 s, 17.101 s, 20.234 s, 158.126 s and 100.344 s, respectively. Due to their computational complexity, SRLLIE and LR3M need a computer with larger memory to run; to process an image of the same size, they require about 296.949 seconds and 2 hours, respectively. Even though our method needs more time than MSR, PMSIRE, LIME, NPIE-MLLS and SDD, its enhanced results are better in terms of both subjective and objective assessments.

4.5 Limitations and future work

Although the enhanced images of the proposed method are better than those of other methods, our technique still has a limitation: in some enhanced images, details originally located in extremely dark areas may be blurred, such as the red circle area in Fig. 5(l). This is because the contrast of extremely dark areas in the original image is not obvious, and the HVS cannot perceive these regions well. This points to a direction for future work. To avoid ignoring scene content, semantic understanding is required. In future work, we will explore combining deep learning techniques with the illumination decomposition, and will continue to optimize the proposed method.

5 Conclusion

In this paper, a perceptive low-light image enhancement method based on a multi-layer illumination decomposition model is proposed. The contrast is defined from local and global aspects, taking noise into consideration. The defined contrast is then combined with the JND model to design the perceptive contrast, which adapts to the HVS and maintains the naturalness of the image for the subsequent enhancement. To estimate the illumination and reflectance components, a multi-layer illumination decomposition model is presented. This model can adaptively adjust the extent of contrast enhancement in the multi-scale adaptive filter based on the perceptive contrast. The proposed method not only ensures the enhancement of brightness, but also preserves the naturalness of the image. The experimental results reveal the advantages of our method over several state-of-the-art methods in subjective assessment, and the BTMQI, NIQE and NIQMC objective assessments further demonstrate its superiority.