Abstract
This paper presents a proposed approach for the enhancement of Infrared (IR) night vision images. This approach is based on a trilateral contrast enhancement in which the IR night vision images pass through three stages: segmentation, enhancement and sharpening. In the first stage, the IR image is divided into segments based on thresholding. The second stage, which is the heart of the enhancement approach, depends on additive wavelet transform (AWT) to decompose the image into an approximation and details. Homomorphic enhancement is performed on the detail components, while plateau histogram equalization is performed on the approximation plane. Then, the image is reconstructed and subjected to a post-processing high-pass filter. Average gradient, Sobel edge magnitude and spectral entropy are used as quality metrics for evaluation of the proposed approach. The used metrics ensure good success of this proposed approach.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction
Image enhancement techniques have been widely used in many applications of image processing in which the subjective quality of images is important for human interpretation. Contrast is an important factor in any subjective evaluation of image quality. Contrast is the difference in visual properties that makes an object distinguishable from other objects and the background [1, 2, 7, 17, 20, 22, 23].
Night vision signifies the ability to see in dark (night). This ability is normally possessed by owls and cats, but with the development of science and technology, devices have been developed to enable human beings to see in the dark and in adverse atmospheric conditions such as fog, rain, and dust [3, 5, 19]. The main purpose for the development of night vision technology was military use to locate enemies at night. Night vision technology is not only used extensively for military purposes, but also for navigation, surveillance, targeting and security [4, 8, 18, 19, 26].
Few thermal IR datasets have been published in the past such as the OTCBVS Benchmark [24, 27], the LITIV Thermal-Visible Registration Dataset [6, 21, 25]. These datasets can be used for the evaluation of any image processing algorithm that can be applied for better night vision. The proposed approach is based on trilateral contrast enhancement of IR night vision images. The paper is arranged as follows. Section 2 gives the motivations and related work. Section 3 gives an explanation of the histogram equalization. Section 4 gives the bilateral histogram equalization referred to as bi-histogram equalization. Section 5 gives a discussion of the segmentation stage in the proposed approach. Section 6 gives a discussion of the plateau histogram equalization. Section 7 covers an IR image enhancement approach based on the AWT with homomorphic processing. Section 8 presents the proposed trilateral contrast enhancement approach. In section 9, performance evaluation quality metrics are given. Section 10 gives a discussion of the experimental results. Finally, section 11 gives the conclusions and the future work.
2 Motivations and related work
This paper deals with a vital topic derived from the problems addressed for IR images [1,2,3, 7, 17, 20, 22, 23]. The objective is the development of image processing technologies to enhance IR night vision images. The proposed approach is based on a hybrid implementation of three stages: segmentation, enhancement and sharpnening [3,4,5, 8, 18, 19, 24, 26, 27]. Compared to the most relevant work [2, 6], this work depends on performance evaluation with spectral entropy, average gradient and Sobel edge magnitude [16, 21, 25]. The proposed approach depends on trilateral contrast enhancement. The IR night vision images pass through three stages: segmentation, enhancement, and sharpening. It is clear that the obtained results in this paper are better than those of the previous works as shown in the Tables 1, 2, 3, 4, 5 and 6 for six cases. Enhancement of the night vision images and videos is very important for many computer vision tasks, such as visual tracking in the night [11, 13]. The use of multiple features for tracking from IR videos can be enhanced with the proposed approach since different types of variations such as illumination, occlusion and pose can be enhanced [9, 10].
To intelligently analyze and understand video content, a main step is to accurately perceive the motion of the objects of interest in videos. The task of object tracking aims to determine the position and status of the objects of interest in consecutive video frames. This field is very important, and has received great research interest in the last decade. Although numerous algorithms have been proposed for object tracking in RGB videos, the task is still limited in IR videos [12, 14, 15].
3 Histogram equalization
Histogram equalization (HE) is a specific case of the more general class of histogram remapping methods. These methods seek to adjust the image to make it easier to analyze or improve its visual quality. It can also be used on color images by applying the same method separately to the Red, Green and Blue components of the RGB color values of the image [7].
Still, it should be noted that applying the same method on the Red, Green, and Blue components of an RGB image may yield dramatic changes in the image color balance since the relative distributions of the color channels change as a result of applying the algorithm. However, if the image is first converted to another color space, Lab color space, or HSL/HSV color space in particular, then the algorithm can be applied to the luminance channel without resulting in changes in the hue and saturation of the image. The HE operation can be represented as follows [22].
where c(x,y) is an image with a poor histogram, and f is the function that transforms the image c(x,y) into an image b(x,y). The Probability Density Function (PDF) of a pixel value a in the image c is given by:
In fact, pc(a) is the probability of finding a pixel with the value a in the image c. Area is the area or number of pixels in the image, and Hc(a) is the histogram value of the image c for gray level a. The Cumulative-Density Function (CDF) for gray level a in image c is therefore given by:
The CDF is the sum of all PDFs up to the value a. Note that ideally the image b has a flat histogram such that Hb(0) = Hb(1) = .... = Hb(a) = .... = Hb(255). Therefore, the probabilities of all pixel values are now equal. They all occur similar times. So, the desired HE function f(a) simply takes the PDF for the values in the image c and multiplies its reciprocal by the CDF of the values in the same image.
Dm is the number of gray levels in the new image b. Assuming histogram uniformity in the image b, we can conclude that Dm = 1/pb(a) for all pixel values a in the image b. It is important to realize that HE reduces the number of gray levels in the image, because the equalization process is a nonlinear process, which may transform multiple gray levels in the image with a poor histogram into a single gray level in the equalized image.
4 Bi-histogram equalization
Bi-histogram equalization (BHE) divides the original image histogram into two different histograms with the reference as the mean value of the original image. Then, the sub-divided image histograms are equalized separately by histogram equalization. The following steps are performed to perform BHE.
- 1.
Mean computation: Mean value of the input image xm is computed.
- 2.
Bi-histogram formation: From the mean value the input image histogram, two sub-image histograms xa and xb are generated as [22]:
where x is the input image, xa and xb are the sub-image histograms.
- 3.
Histogram equalization of sub-images: Histogram equalization of sub-images is performed similar to that of the traditional image.
5 Segmentation stage
This stage is based on Otsu’s N thresholding method. Otsu’s method of segmentation is an optimum global thresholding method. It is a non-parametric and unsupervised method of automatic threshold selection for segmentation of images. It is a simple procedure, and it utilizes only the zeroth and the first-order cumulative moments of the gray-level histogram. It is optimum in the sense that it maximizes the between-class variance, a well-known measure used in statistical discriminant analysis [16].
where M × N is the size of the image, ni is the total number of pixels in the image with level i. Suppose we select a threshold k, and use it to threshold the image into two classes, C1 and C2. Class C1 consists of pixels with intensity values in the range [0, k]. Class C2 consists of the pixels with intensity values in the range [k + 1, L-1]. Using this threshold, the probability, P1(k), that a pixel is assigned to class C1 is given by the cumulative sum as follows:
The pixels of the input image are represented in L gray levels, and k is a selected threshold from 0 < k < L-1.
Similarly, the probability of pixels in Class C2 is,
where P1(k) is the probability of pixels in Class C1.
The mean intensity values of the pixels assigned to class C1 are
Similarly, the mean intensity values of the pixels assigned to class C2 are
The global mean is given by,
The problem is to find an optimum value for k, which maximizes the criterion defined by this equation:
where σB2(k) is the between-class variance defined as
and σG2(k) is the global variance defined as,
where the optimum threshold is the value k* that maximizes σB2(k).
6 Plateau histogram equalization
Plateau histogram equalization (PHE) modifies the shape of the input histogram by reducing or increasing the values in the histogram bins based on a threshold limit before the equalization takes place. An appropriate threshold value is selected firstly, which is represented as T. If the value of P(Xk) is greater than T, then it is forced to be equal to T. Otherwise, it is unchanged, as shown below [17]:
where nk represents the number of times that the level Xk appears in the input image and n is the total number of samples in the input image, for k = 0, 1, ...., L − 1.
where P(Xk) is the modified probability density function, and T is the selected threshold value.
Then, histogram equalization is carried out using this modified probability density function. There is one main problem associated with plateau histogram equalization. Most of the methods need the user to set manually the plateau threshold of the histogram, which makes these methods not suitable for automatic systems. Although some methods can set the plateau threshold automatically, the process for deciding one threshold is often complicated.
Selection of plateau threshold value is very important for IR image enhancement. It has an effect on the contrast of images. An appropriate plateau threshold value would greatly enhance the contrast of the image. In addition, some plateau values would be appropriate to some IR images, but not appropriate to others. As a result, the plateau threshold value would be selected adaptively according to the IR image.
The steps of this algorithm are performed as follows:
- 1.
The IR image is obtained for an object through the optical lens of a thermal imager.
- 2.
The image is considered in matrix form with different pixel values.
- 3.
All pixel values of the image are arranged in an ascending order.
- 4.
Histogram is estimated.
- 5.
The median of the image levels is estimated and used as a threshold.
- 6.
Comparison with the estimated threshold is performed to determine the required processing.
- 7.
Histogram equalization for every pixel is performed.
7 AWT with homomorphic enhancement
In this approach, we merge the benefits of the AWT and homomorphic enhancement. First, the IR image is decomposed into sub-bands using the AWT. After that, each sub-band is processed, separately, using the homomorphic enhancement to reinforce image details.
A visual image can be represented as a product of two components as folows:
where f(n1, n2) is the obtained image pixel value, i(n1, n2) is the light illumination incident on the object to be imaged and r(n1, n2) is the reflectance of that object.
It is known that illumination is approximately constant, since the light falling on all objects is approximately the same. The only change between objects is in the reflectance component.
If we apply a logarithmic process on Eq. (19), we can change the multiplication process into an addition process as follows:
The first term in the above equation has small variations, but the second term has large variations as it corresponds to the reflectivity of the object to imaged. By attenuating the first term and reinforcing the second term of Eq. (20), we can reinforce the image details. This idea can be extended to IR image enhancement by working with the image pixels as values only without considering the composition process of pixel values in IR imaging.
The steps of the AWTH approach can be summarized as follows:
- 1.
Decompose the IR image into four subbands p3, w1, w2 and w3 using the additive wavelet transform and the low-pass filter mask given by [2]:
-
2.
Apply a logarithmic operation on each sub-band to get the illumination and reflectance components of the subbands w1, w2 and w3 as they contain the details.
-
3.
Perform a reinforcement operation on the reflectance component in each sub-band and an attenuation operation on the illumination component.
-
4.
Reconstruct each sub-band from its illumination and reflectance using addition and exponentiation processes.
-
5.
Apply adaptive plateau histogram equalization on p3
-
6.
Perform an inverse additive wavelet transform on the obtained sub-bands by adding p3, w1, w2 and w3 after the homomorphic processing to get the enhanced image.
In image processing, it is often desirable to emphasize high-frequency components representing the image details without eliminating low-frequency components. The high-boost filter can be used to enhance high-frequency components. It is used for amplifying high-frequency components of images. The amplification is achieved via a procedure, which subtracts a smoothed version of the image from the original one [1].
where Whp is a high-pass filter, A is a constant, and Whb is a high-boost filter
8 The proposed trilateral contrast enhancement approach
The proposed approach is concerned with the enhancement of IR night images based on trilateral contrast enhancement. The word trilateral means three stages. The IR night images pass through three stages: segmentation, enhancement, and sharpning (Fig. 1).
The steps of the proposed approach can be summarized as follows:
- 1.
Pick IR night vision image from IR camera.
- 2.
Divide the IR image into overlapping sub-images by a segmentation stage.
- 3.
Apply the AWPH equalization on the resultant image.
- 4.
Apply the high-boost filter on the enhanced resultant image.
9 Performance evaluation metrics
This section presents the quality metrics used for the valuation of the enhancement results. These metrics include average gradient (AG), spectral entropy (Ef) and Sobel edge magnitude (∇f). These metrics are evaluated as follows [8]:
where AG is the average gradient of the IR image f, and m×n is the size of the IR image
The spectral entropy is computed in the discrete cosine transform (DCT) domain on a block-by-block basis as illustrated in Fig. 2. It is a function of the probability distribution of the local DCT coefficient values. This probability distribution function (PDF) is given as follows [15]:
where 1 ≤ i ≤ 8, 1 ≤ j ≤ 8, i, j ≠ 1, and c(i, j) represents the DCT coefficients.
The local spectral entropy is defined as [27]:
where ∇f is the Sobel edge magnitude, fx and fy are two images containing the horizontal and vertical derivative approximations, respectively.
10 Simulation results
This section presents several simulation experiments executed on IR night vision images. These results adopt a strategy of presenting the original IR images with their enhanced versions using different enhancement methods. The results of the first experiment are shown in Fig. 3. Part (a) gives the original IR night vision image. Part (b) gives the IR image after AWPH equalization. Part (c) gives the IR image after adaptive plateau histogram equalization. Part (d) gives AWT with homomorphic enhancement on three sub-bands. Part (e) gives the IR image after the bi-histogram equalization. Part (f) gives the enhanced IR image using the proposed algorithm. Comparing between Parts (b), (c), and (d), it is clear that the proposed enhancement approach enhances the visual quality of the processed image. The performance metrics results are given in Table 1. Similar experiments have been carried out on other IR images and the results are given in Figs. 4 and 5. The higher the value of the average gradient and Sobel edge magnitude, the better the image quality. It has been shown that this algorithm has succeeded in the improvement of the visual quality of the IR images with much details. From these results, it is clear that the proposed approach has succeeded in obtaining the best results in the improvement of IR night vision images from both the visual quality and performance metrics perspectives as illustrated in Tables 2 and 3.
To further confirm the effectiveness of the proposed approach experiments on images from other datasets are presented. The Dune and Otcbvs images with size 300 × 300 pixels, respectively, and the Car images with size 301 × 149 pixels were provided by Shao et al. [6, 21, 24, 25]. The proposed approach has been tested on these images and the results are shown in Figs. 6, 7 and 8. The results illustrate that the proposed approach is superior as compared with other methods. The numerical results are given in Tables 4, 5 and 6. The results of distributions of block spectral entropy for all experiments are shown in Figs. 9, 10, 11, 12, 13 and 14. These results also ensure that the proposed approach is superior as compared with other methods.
11 Conclusions and future work
This paper presented an approach for enhancement of IR night vision images. It is a trilateral contrast enhancement approach. It depends on three stages: segmentation, enhancement and sharpning. The proposed approach comprises an enhancement stage using AWTH. Simulation results revealed that the proposed approach gives superior results to the other methods from the quality metrics perspectives. For future work, deep learning models for object detection from IR images will be considered in conjunction with IR image pre-processing.
References
Alirezanejad M, Saffari V, Amirgholipour S, Sharifi AM (2014) Effect of locations of using high boost filtering on the watermark recovery in spatial domain watermarking. Indian J Sci Technol 7(4):517–524
Ashiba HI, Awadallah KH, El-Halfawy SM, El-Samie FEA (2008) Homomorphic enhancement of infrared images using the additive wavelet transform. Progress Electromagnet Res C 1:123–130
Deepa S, Bharathi VS (2013) Efficient ROI segmentation of digital mammogram images using Otsu’s N thresholding method. J Autom Artif Intell 1(2) ISSN: 2320-4001
Fan Z, Bi D, He L, Ma S (2016) Noise suppression and details enhancement for infrared image via novel prior. Infrared Phys Technol 74:44–52
Fan Z, Bi D, Ding W (2017) Infrared image enhancement with learned features. Infrared Phys Technol. https://doi.org/10.1016/j.infrared.2017.08.015
Gade R, Moeslund TB (2014) Thermal cameras and applications: a survey. Mach Vis Appl 25(1):245–262
Gonzalez RC, Woods RE (2002) Digital image processing: introduction; .
Gupta S, Mazumdar SG (2013) Sobel edge detection algorithm. Int J Comput Sci Manag Res 2(2) ISSN 2278-733X
Lan X, Ma AJ, Yuen PC (2014) Multi-cue visual tracking using robust feature-level fusion based on joint sparse representation: 1194-1201,CVPR
Lan X, Ma AJ, Yuen PC, Chellappa R (2015) Joint sparse representation and robust feature-level fusion for multi-Cue visual tracking. IEEE Trans Image Process 24(12):5826–5841. https://doi.org/10.1109/TIP.2015.2481325
Lan X, Zhang S, Yuen PC (2016) Robust joint discriminative feature learning for visual tracking. IJCAI: 3403-3410
Lan X, Yuen PC, Chellappa R (2017) Robust MIL-based feature template learning for object tracking. AAAI: 4118-4125
Lan X, Zhang S, Yuen PC, Chellappa R (2018) Learning common and feature-specific patterns: a novel multiple-sparse-representation-based tracker. IEEE Trans Image Process 27(4):2022–2037. https://doi.org/10.1109/TIP.2017.2777183
Lan X, Ye M, Zhang S, Yuen PC (2018) Robust collaborative discriminative learning for RGB-infrared tracking. AAAI:7008-7015
Lan X, Ye M, Zhang S, Zhou H, Yuen PC Modality-correlation-aware sparse representation for RGB-infrared object tracking. Pattern Recogn Lett. https://doi.org/10.1016/j.patrec.2018.10.002
Otsu N (1979) A threshold selection method from gray-level histograms. IEEE Trans Syst Man Cybernat 9(1):62–66
Pik Kong NS, Ibrahim H, Ooi CH, Juinn Chieh DC (2009) Enhancement of microscopic images using modified self-adaptive plateau histogram equalization. Int Conf Comput Technol 2:308–310
Qi Y, He R, Lin H (2016) Novel infrared image enhancement technology based on the frequency compensation approach. Infrared Phys Technol. https://doi.org/10.1016/j.infrared.2016.03.021
Song Q, Wang Y, Bai K (2016) High dynamic range infrared images detail enhancement based on local edge preserving filter. Infrared Phys Technol. https://doi.org/10.1016/j.infrared.2016.06.023
Taha M, H Hala, Zayed T, Nazmy Lalifa MK (2016) Day/night detector for vehicle tracking in traffic monitoring systems. Int J Comput Electr Auto Control Inform Eng 10(1)
Torabi A, Masse G, Bilodeau G-A (2012) An iterative integrated framework for thermal-visible image registration, sensor fusion, and people tracking for video surveillance applications. Comput Vis Image Underst 116(2):210–221
Wang Q, Ward RK (2007) Fast image/video contrast enhancement based on weighted threshold histogram equalization. IEEE Trans Consum Electron 53(2):757–764
Wang G, Xiao D, Gu J (2008) Review on vehicle detection based on video for traffic surveillance. IEEE Int Conf Auto Logist: 2961-2966
Wang J, Peng J, Feng X, He G, Fan J (2014) Fusion method for infrared and visible images by using non-negative sparse representation. Infrared Phys Technol 67:477–489
Wu Z, Fuller N, Theriault D, Betke M A thermal infrared video benchmark for visual analysis. http://www.vcipl.okstate.edu/otcbvs/bench/
Zhang Q, Maldague X (2016) An adaptive fusion approach for infrared and visible images based on NSCT and compressed sensing. Infrared Phys Technol 74:11–20
Zhang S, Li P, Xu X, Li L, Chang CC (2018) No-reference image blur assessment based on response function of singular values. Symmetry 10(304):2–15
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Ashiba, M.I., Tolba, M.S., El-Fishawy, A.S. et al. Hybrid enhancement of infrared night vision imaging system. Multimed Tools Appl 79, 6085–6108 (2020). https://doi.org/10.1007/s11042-019-7510-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-019-7510-y