Keywords

1 Introduction

Marine radar is divided into navigation radar and traffic management radar, its main function is to achieve target detection and target tracking. Navigation radar is mainly to achieve navigation and collision avoidance functions. For ship pilots, radar is equivalent to their “eyes”. Shore-based radars are generally installed at ports or beaches to monitor and assist safe entry of ships. Radar has all-weather fairness. According to IEC62388, X-band (9.2–9.5 GHz) and S-band (2.9–3.1 GHz) are mainly used in the field of navigation. At present, marine radar uses traditional signal processing methods to achieve target detection which requires manual adjustment of parameters such as thresholds to obtain object information. There are cases where clutter is detected as a target or small targets are filtered out as a result of the difference of the way of each person’s adjustment. It affects the accuracy of radar target detection leading to inaccurate provision of accurate and reliable target information for users [1]. Therefore, the realization of automatic detection of radar targets has important practical significance.

The function of radar automatic target detection can be implemented according to the existing engineering design requirements, without changing the equipment hardware, and ensuring that the data processing time is less than the antenna rotation period. This paper takes a method that combines pre-detection and deep learning which is described in Fig. 1 below:

Fig. 1.
figure 1

Data processing flow chart

As shown in Fig. 1, this paper first pre-detects the original radar echo data to obtain the candidate area, then uses deep learning to detect the candidate target to obtain the target, and finally extract the contour of the target.

2 Pre-detection

Maximally Stable Extremal Regions ( MSER) was proposed by J. Matas in 2004 based on the watershed concept [2]. The basic idea of MSER is to select a series of thresholds from 0 to 255 to perform binary segmentation of the image and the gray value below the set threshold is 0, but higher than or equal to the set is 255. The image will form a closed region as the threshold increases from 0 to 255. The maximum extremal region is the region with the smallest area change within a certain threshold range [3]. This paper uses method of MSER as pre-detection to obtain candidate target regions.

The mathematical definition of MSER is [4]:

$$ {\text{q}}\left( {\text{i}} \right) = {{\left| {{\text{Qi}} + \vartriangle - {\text{Qi}} - \vartriangle } \right|} \mathord{\left/ {\vphantom {{\left| {{\text{Qi}} + \vartriangle - {\text{Qi}} - \vartriangle } \right|} {{\text{Qi}}}}} \right. \kern-\nulldelimiterspace} {{\text{Qi}}}} $$
(1)

where Qi is the connected region when the threshold is i, △ is the variation of gray threshold value and q(i) is the rate of change of Qi when the threshold is i. When q(i) is the local limit value, Qi is the maximally stable extremal region.

In addition, this paper separately counts the echo sizes of S-band and X-band long, medium and short radar pulses corresponding to different ships. The radar echo size is limited to: width range [3, 200], height range [3, 60]. The pre-detection targets can be further screened by comparing the stable region with the echo size.

3 Deep Learning

Deep Learning refers to an algorithm that uses a larger number of layers of neural network structures to complete training and prediction. It has been widely used in machine vision such as pedestrian detection, face detection, remote sensing image detection, and medical image processing. This paper adopts the ResNet (Residual Network) network, which is a residual school framework proposed by K. He et al. [5] (Fig. 2).

Fig. 2.
figure 2

Residual mapping for ResNet

The residual structure can be simplified as follows [6]:

$$ {\text{x}}_{l + 1} = x_{l} + F(x_{l} ,W_{l} ) $$
(2)

The expression of the L feature of any layer unit is:

$$ {\text{x}}_{L} = x_{l} + \sum\limits_{i = l}^{L - 1} {F(x_{i} ,W_{i} )} $$
(3)

If \(\xi\) represents the loss function according to the chain rule, we can get:

$$ \frac{\partial \xi }{{\partial x_{l} }} = \frac{\partial \xi }{{\partial x_{L} }}\frac{{\partial x_{L} }}{{\partial x_{l} }} = \frac{\partial \xi }{{\partial x_{L} }}(1 + \frac{\partial }{{\partial x_{l} }}\sum\limits_{i = l}^{L - 1} {F(x_{i} ,\omega_{i} )} ) $$
(4)

Based on the above and open source related theories, the dataset used in this paper comes from the original echo data recorded by navigation radar and shore-based radar, then the sample data is labeled and trained. Among them, the sample labeling tool is labelImg, the training environment: CPU is Intel(R) Core(TM) i7 8200X, GPU is GeForce RTX2080Ti, operating system is Ubuntu18.04 LTS, Cuda10.2, and finally the network model is trained with.caffemodel file.

4 Experiments and Results

Based on the above theoretical discussion, the data processed by MSER is used as the input of deep learning ResNet, and then the output radar target is detected. Finally, the offline radar echo data and real radar sites are used to conduct experiments to statistically analyze the number of ship targets within 8nm, target detection rate, false detection rate, and missed detection rate. The method described above is used to process the original radar echo data, and select 8 different radar scene data for testing and analysis. The details are shown in Table 1 below.

Fig. 3.
figure 3

Zhuhai Dangantou radar station experiment

Figure 3 shows the results of the actual measurement at the Zhuhai clubhead radar site, which draws a rectangular frame on the original radar echo image to represent the detected target.

Table 1. Statistical table of real scene test results

As shown in Table 1, scenes 1 to 6 are the echo data scenes recorded by the navigation radar, and scenes 7 to 8 are the measured scenes at the Zhuhai Dangantou and Zhuhai Hebao Island sites. In Table 1, the number of targets is the number of ships and other targets within 8 nautical miles, the number of detected targets is the number of targets detected by the algorithm processing, the number of missed detections is the number of targets not detected by the algorithm, and the number of false detections refers to the number of false detections of clutter as the target by the algorithm. This method has a certain missed detection rate and false detection rate, but its detection rate is above 90%.

5 Target-Based Contour Extraction

For the radar target detected by MSER+ResNet, use OpcnCV to extract the contour of the radar target of which OpcnCV (Open Source Computer Vision Library) is an open source computer vision library [7]. This paper uses the findContours() function in OpenCV to extract the contours in the binary image, and then uses the drawContours() function to draw the radar target echo on the image attributes, and fill it with a custom color, as shown in Fig. 4 below.

Fig. 4.
figure 4

Radar raw echo and Contour

As shown in Fig. 4, Fig. 4(a) is the original echo image, and Fig. 4(b) is the target echo image after contour extraction. It can be seen from this that if a high detection rate is ensured, this method has clutter suppression and can improve the visualization effect.

6 Conclusion

Based on image processing related theories, this paper uses a combination of MSER and ResNet to process radar echo data, and tests on offline data and radar sites respectively. In a radar antenna rotation period, proposed method can effectively process a frame of echo data, the detection rate is above 90%, and the radar target is automatically output. And extract the contour based on the radar target echo and customize the fill color to increase the visualization effect of the radar echo. It has certain reference significance to the research and engineering application of radar target detection algorithm.