Enhanced Object Detection of Abnormal Light Based on Multi-scale Retinex with Chromacity Preservation

Yin, Zhuyun; Guo, Lingyue

doi:10.1007/978-981-99-9109-9_2

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1998))

Included in the following conference series:

International Symposium on Artificial Intelligence and Robotics

459 Accesses

Abstract

In autonomous driving scenarios, the lighting conditions can often change rapidly and dramatically, leading to overly bright or dark environments. These abnormal light conditions can negatively impact the accuracy of object detection, potentially causing accidents and posing a threat to pedestrian and driver safety. To address this issue, this paper presents a method to mitigate the effect of abnormal light intensity on object detection in autonomous driving scenarios. We apply Multi-scale Retinex with Chromacity Preservation (MSRCP) processing to abnormal light images based on the YOLO model in order to recover lost information. Our method was evaluated on the publicly available BDD 100k dataset, and the results showed a significant improvement in performance under abnormal lighting conditions, Compared to images without any processing, Yolo-v3 model improve 9.59$\%$ total mAP@[.5, .95], and Yolo-v4 model improves 10.86$\%$.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Low-light DEtection TRansformer (LDETR): object detection in low-light and adverse weather conditions

Article 11 April 2024

An Improved YOLO Algorithm for Object Detection in All Day Scenarios

Real-time fast low-light vision enhancement for driver during driving at night

Article 16 February 2021

Keywords

1 Introduction

As an emerging field, automatic drive has vast potential for future development, so one of the important component in autonomous driving, image identification, get people’s attention. Compared to other methods, the identification based on visible light camera is a cheaper and more deployable way.

Other the other hand, this method also comes with its issue. For example, the result will heavily affected by lighting conditions. If there is a hash condition, which can cause blurry object shape or inaccurate color, the detection accuracy will significantly drop. We test some common object recognition algorithms, faster-RCNN [11] and Yolo [8]. They are designed for general scenario, and not are optimized for the light condition. So, when the system is operated under undesired lighting conditions, In the actual usage scenario, at noon or in the late night, these influence may result accidents.

One of the most discerning autonomous driving systems in the industry, Tesla’s Autopilot system [9] is a vision-based Autopilot system, which mainly relies on camera sensors to obtain surrounding information. But this system still experiences accidents under poor lighting conditions. In 2016, a Tesla didn’t identify a front lorry, eventually cause a rear-end collision. Other Autopilot system, like Google’s Waymo [10], Baidu’s Apollo Go [3], and so on, they all have same problem.

To address the problem of accuracy faced by autopilot in complex lighting conditions, and does not affect the recognition ability under normal brightness, we develop a algorithm. In this paper, after introducing the related works, we will present our function into two parts, Firstly, we will introduce light-selection. We use the gray processing of the image, and the anomaly image is recognized and classified. Second we will present an image restoration technique based on MSRCP [7] and how it’s applied in the autonomous driving scenario. It can enhance the performance of target detection under abnormal lighting conditions by considering the lighting conditions and preserving the color information of the image. This method adjusts the brightness and color distribution of the image to enhance the edges and contours of the target, making it easier to detect under abnormal lighting conditions. Next, in the Experiment and Result section, we test the algorithm, 3000 images with abnormal brightness were selected from bdd-100k [13] as the data set of this project. During the test, we use the same model to identify image to find out the progress of our algorithm. The results showed that our method improves the total mAP@[.5, .95] from 15.22% to 19.57%, this algorithm can also used on yolov3 or other version. It can improve the recognition results of abnormal brightness images.

2 Related Work

2.1 Image Enhancement Algorithm Based on Retinex

The image enhancement algorithm is based on Retinex theory [5, 6]. The term “Retinex” stands for “Retina with Cortex” it simulates the biological processes of human’s retina and brain cortex, and it suggests that the image I observed by human eyes is the product of light image L and object reflection image R.

$$\begin{aligned} I(x, y)=L(x, y)^* R(x, y) \end{aligned}$$

(1)

Take log for both side:

$$\begin{aligned} \log [R(x, y)]=\log [I(x, y)]-\log [L(x, y)] \end{aligned}$$

(2)

We can get a reflection property image that preserves the essence of the object from the received image I and the light image L. This method is called “Single Scale Retinex (SSR)” [7, 14], and its workflow is shown as follows:

1
Take log for I(x,y)
2
Use Gaussian blur for l(x,y), the result is $\log [L(x, y)]$
3
Calculate $\log [R(x, y)]$ by $\log [I(x, y)]$ and $\log [L(x, y)]$
4
Transfer the $\log [R(x, y)]$ to RGB image, and this is the result.

Further improved can be achieved by, for example, Muti-Scale Retinex(MSR). When calculating the value of $\log [R(x, y)]$, the algorithm performed multiple SSR operations and weighted the calculation. So compare with SSR, this algorithm has better performance on color.

The aforementioned will still face some issues, such as, color cast, which can be mitigated by applying Muti-Scale Retinex with Color Restoration(MSRCR). It takes into account multiple scales or spatial frequencies of the image for more comprehensive enhancement. The algorithm adds multiple parameters to produce color recovery factor C, adjusts the ratio of the 3 channels of the image, reduces the color cast.

At the same time, the MSRCP [1] algorithm was proposed, which carries out an MSR processing on the light intensity image of the original color image to map to each channel in the same proportion as the original image RGB channel. This algorithm can preserve the color distribution of the original image well.

2.2 Object Detection Algorithm Based on YOLOv4 Model

In this paper, YOLO-V4 [2, 12] is used as the object detection network. The YOLO network is a single-stage algorithm. Be careful about the comparison between YOLO and two-stage algorithms, such as Fast RCNN [4]. YOLO has been improved a lot over the past a few years. But Fast RCNN and some other algorithms haven’t changed since they were published. Various methods are also used to improve accuracy.

The yolo-v4 used in this paper has been optimized in many aspects, making the accuracy significantly improved. The data processing aspect includes mainly Mosaic data enhancement, which can enrich the background of the image and increase the batch size in a different way. Backbone includes the new CSPDarknet53, which replaced ReLu by Mish as the activation function, and the CSP structure prevents different layers from learning the same gradient information. In addition, the loss function has been updated, and the prediction box filtering has been improved. Through this series of optimization, it finally makes the real-time target detection algorithm in the highest accuracy.

3 Method

In this paper, brightness selection algorithm and image restoration algorithm based on MSRCP are used, and then object detection algorithm based on Yolo-v4 is adopted to identify the target. The algorithm flow in this paper is shown in the Fig. 1.

After the input image goes through the brightness selection algorithm, the color information of the image with abnormal brightness will be restored by the image recovery algorithm. Finally, the image recognition will be carried out in the model trained by yolo-v4 and the results will be obtained.

3.1 Brightness Selection Algorithm

Firstly, the brightness selection algorithm is introduced. Through this algorithm, we screen out the images with abnormal brightness. Since brightness equals gray level in gray level image, we can judge brightness by converting the image into gray level image. We first calculate the difference between the gray value pg and 128 of each pixel point:

$$\begin{aligned} g a=\sum (p g-128) / N \end{aligned}$$

(3)

Then the gray-scale histogram is used to obtain the number of pixels of each gray value between 0 and 255, and the weighted average deviation is calculated with the number of pixels as the weight.

$$\begin{aligned} M a=\sum |(p g-128)-g a|^* \textrm{G}[\textrm{i}] / \sum \textrm{G}[\textrm{i}] \end{aligned}$$

(4)

Finally, we calculated according to the average deviation to judge whether the image brightness is unusual, through access to online data, we set threshold is |ga|.

3.2 Image Enhancement Algorithm Based on MSRCP

After brightness selection, we need to process the abnormal brightness image. Here we compare several algorithms and finally choose MSRCP. Compared with MSRCR and MSR, MSRCP can increase the highest recognition rate (Fig. 2).

In the process of brightness selection, we evaluate the gray image, and then we can compute the intensity image I according to the image component of the RGB three-channel of the original image.

$$\begin{aligned} I=\left( I_R+I_G+I_B\right) \end{aligned}$$

(5)

In order to make the color of the enhanced image consistent with that of the original image, the amplification factor A is obtained through the intensity image of the single channel:

$$\begin{aligned} B={\text {Max}}\left( I_R+I_G+I_B\right) \end{aligned}$$

(6)

$$\begin{aligned} A={\text {Min}}\left( \frac{255}{B}, \frac{I_{n t}}{I_{n t i}}\right) \end{aligned}$$

(7)

In the equation, I is the pixel of the image. Color enhancement is carried out on RGB channels respectively, and the final result is obtained (Fig. 3):

$$\begin{aligned} I_E=A * I(R, G, B)(x, y) \end{aligned}$$

(8)

4 Experiment and Result

4.1 Experimental Setting

Dataset. The BDD100K dataset is used in this work to evaluate the proposed algorithm. This dataset was created by Berkeley AI Laboratory, we select 3875 random images in it, including images of different Harsh light environment. The label distribution of the data set is shown in Fig. 4:

Evaluation Step. Experiment Setup

* Operating system: Ubuntu 18.04
* Python version: Python3.8
* PyTorch version: Pytorch 1.10
* GPU model: NVIDIA GeForce RTX 2080Ti

4.2 Experiment Result

After producing the data set, we trained it for 1000 generations by yolo-v4 and yolo-v3. The learning rate was adjusted by cosine annealing with the interval of 0.01–0.0001, and SGD was used for optimization. In terms of data enhancement, mosaic enhancement is used in the first 70% generation of training, and the probability of using it in each training step is 50%. After Mosaic is used, mixup data enhancement will continue to be used, and the probability of using it is also 50%. The final training results are as follows (Fig. 5):

After training, we test for the MSRCP image recovery algorithm. We selected 746 pictures containing different elements as the test set of the experiment. And we tested them with and without the algorithm. The results are shown in the below (Fig. 6):

As the result, the recognition ability after the processed image has been significantly improved. The overall mAP increased about 10%, and traffic lights in particular have improved by 14% and 17%, respectively. Less improved object categories include people and car. We believe that the unidentified car targets are mainly distant targets and obscured targets. When these objects appear in under dark conditions, more color blocks will be generated after our algorithm is restored, which will cause greater interference, and the same is true for person. And traffic lights themselves emit light, so they’re friendly to our algorithm. Totally, the whole recognition rate of image has been improved obviously.

5 Conclusion

In this paper, to mitigate the problem of abnormal image brightness in automatic driving, brightness selection algorithm and image restoration algorithm based on MSRCP are used to classify images and carry out color restoration. Combined with yolo-v4 target recognition, common targets in automatic driving scenes are identified, which ultimately effectively improves the image recognition rate and provides guarantee for safe driving. On the other hand, there are still issues need further investigation. First, images recovered using MSRCP will blur the image and in some cases even render the object that could have been recognized unrecognizable. Second, in the test of this paper, there are still some kinds of targets that cannot be confirm to improve by the insufficient number of the test set. Later, we will increase the size of the test set and conduct further tests on trucks, bus. We will also increase the recognition of different types of objects to make it closer to real driving scenarios. In the future, we will optimize the algorithm and integrate more advanced image recognition methods, such as YOLOv7, to attempt real-time recognition in low-light conditions.

References

Barnard, K., Funt, B.: Investigations into multi-scale retinex. In: Proceedings of the Colour Imaging in Multimedia’98, pp. 9–17 (1998)
Google Scholar
Bochkovskiy, A., Wang, C.Y., Liao, H.Y.M.: YOLOv4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)
Fan, H., et al.: Baidu apollo em motion planner. arXiv preprint arXiv:1807.08048 (2018)
Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
Google Scholar
Land, E.H., McCann, J.J.: Lightness and retinex theory. Josa 61(1), 1–11 (1971)
Article Google Scholar
Li, Y., Yang, S., Zheng, Y., Lu, H.: Improved point-voxel region convolutional neural network: 3D object detectors for autonomous driving. IEEE Trans. Intell. Transp. Syst. 23(7), 9311–9317 (2021)
Article Google Scholar
Rahman, Z.U., Jobson, D.J., Woodell, G.A.: Multi-scale retinex for color image enhancement. In: Proceedings of 3rd IEEE International Conference on Image Processing, vol. 3, pp. 1003–1006. IEEE (1996)
Google Scholar
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
Google Scholar
Rzadca, K., et al.: Autopilot: workload autoscaling at google. In: Proceedings of the Fifteenth European Conference on Computer Systems, pp. 1–16 (2020)
Google Scholar
Sun, P., et al.: Scalability in perception for autonomous driving: Waymo open dataset. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2446–2454 (2020)
Google Scholar
Sun, X., Wu, P., Hoi, S.C.: Face detection using deep learning: an improved faster R-CNN approach. Neurocomputing 299, 42–50 (2018)
Article Google Scholar
Xu, F., Xu, F., Xie, J., Pun, C.M., Lu, H., Gao, H.: Action recognition framework in traffic scene for autonomous driving system. IEEE Trans. Intell. Transp. Syst. 23(11), 22301–22311 (2021)
Article Google Scholar
Yu, F., et al.: BDD100K: a diverse driving dataset for heterogeneous multitask learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2636–2645 (2020)
Google Scholar
Zheng, Y., Li, Y., Yang, S., Lu, H.: Global-PBNet: a novel point cloud registration for autonomous driving. IEEE Trans. Intell. Transp. Syst. 23(11), 22312–22319 (2022)
Article Google Scholar

Download references

Acknowledgements

Thanks for the support by open research fund of The State Key Laboratory for Management and Control of Complex Systems (20210110), without these reference material, the study cannot be so propitious.

Author information

Authors and Affiliations

Shanghaitech University, Shanghai, China
Zhuyun Yin
School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China
Lingyue Guo
Zhongke Wyse (Beijing) Technology Co. Ltd., Beijing, China
Lingyue Guo

Authors

Zhuyun Yin
View author publications
You can also search for this author in PubMed Google Scholar
Lingyue Guo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhuyun Yin .

Editor information

Editors and Affiliations

Kyushu Institute of Technology, Fukuoka, Japan
Huimin Lu
Southeast University, Nanjing, China
Jintong Cai

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yin, Z., Guo, L. (2024). Enhanced Object Detection of Abnormal Light Based on Multi-scale Retinex with Chromacity Preservation. In: Lu, H., Cai, J. (eds) Artificial Intelligence and Robotics. ISAIR 2023. Communications in Computer and Information Science, vol 1998. Springer, Singapore. https://doi.org/10.1007/978-981-99-9109-9_2

Download citation

DOI: https://doi.org/10.1007/978-981-99-9109-9_2
Published: 04 January 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-9108-2
Online ISBN: 978-981-99-9109-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Enhanced Object Detection of Abnormal Light Based on Multi-scale Retinex with Chromacity Preservation