Leveraging UAVs for Rapid and Hierarchical Railway Intrusion Detection

Tong, Lei; Wang, Zhipeng; Qin, Yong; Tang, Tian; Miao, Bidong; Li, Zhaoyu

doi:10.1007/978-981-99-9315-4_14

Lei Tong⁴²,
Zhipeng Wang⁴²,
Yong Qin⁴²,
Tian Tang⁴³,
Bidong Miao⁴⁴ &
…
Zhaoyu Li⁴⁴

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 1136))

Included in the following conference series:

International Conference on Electrical and Information Technologies for Rail Transportation

258 Accesses

Abstract

The normalized and rapid intrusion detection of the railway operation environment is of great significance for real-time monitoring and early detection of foreign objects that threaten the railway’s safe operation. The existing manual patrol and inspection vehicles cannot meet the high demands for rapid detection of some occasional or accidental risks in a short period, such as the intrusion of potential or actual foreign objects. Combining railway segmentation and foreign object detection, this paper proposes a two-stage perception model to achieve hierarchical area-matched intrusion detection and advance the risk assessment of the railway operating environment under the wide field of view of unmanned aerial vehicles (UAVs). With the railway area accurately segmented, the hierarchical division of different risk areas and the foreign objects detection in those hierarchical areas is realized. Leveraging the maneuverability and flexibility of UAVs, the proposed two-stage perception model can effectively improve the efficiency and quality of rapid intrusion inspection of the railway line and surrounding environment.

Access provided by Autonomous University of Puebla. Download conference paper PDF

A Railway Intrusion Detection Framework Based on Vehicle Front Video

A Method on Recognizing Transmission Line Structure Based on Multi-level Perception

Surveillance, Sense and Avoidance Technologies of Freight UAV Based on ADS-B

Keywords

1 Introduction

Foreign objects in and around railway areas can pose serious threats to train operations, leading to derailments, damage to rolling stock, and even endangering the lives of passengers. Traditionally, railway intrusion detection relies on mostly manual patrol and inspection vehicles which is time-consuming and labor-intensive with low intelligence levels. Most importantly, they cannot meet the high demands for rapid response to some occasional or accidental emergencies on the lines in operation in a short period. Intrusion detection based on ground video surveillance can only be employed in some limited key railway sections and monitoring points, which means that the full range of the railway line cannot be covered. However, with the advancements in UAV technology, there is an opportunity to revolutionize this process by employing UAV-based solutions, which can possibly provide a hierarchical detection of those foreign objects from a completely different overhead top-down view.

Unmanned aerial vehicles (UAVs) have emerged as a promising technology for a wide range of applications, including power line monitoring [1], vehicle detection [2], structural health monitoring [3], and railway scene surveillance [4], etc. Equipped with various high-precision sensors, the UAV is empowered with a greater ability to collect data from multiple viewing angles to meet diverse inspection needs. The use of UAVs does not need to take up the normal running time of the train while providing a better choice for the automated inspection of the railway operating environment. In light of their potential for enhancing safety and efficiency, UAVs have also gained significant attention in the railway industry to investigate small object detection [4], real-time railway scene parsing [5], railway track detection [6], UAV-LiDAR-based measuring [7], point cloud segmentation [8], ground risk assessment for long-range railway lines [9], etc. A critical aspect of UAV-based railway inspection is the potential or actual foreign object detection of railway operation environment, which, however, has not been researched and discussed adequately.

Moreover, deep learning techniques supported by deep convolutional neural networks have also made prominent advancements in the tasks of object detection [10] and semantic segmentation [11], in which the lightweight and real-time algorithms have more application prospects for the online and onboard perception of the railway operation environment as a result of their excellent potential for deployment on airborne edge computing platforms. The integration of UAV flying platforms and those computer vision algorithms has created a possibility for rapid and hierarchical railway intrusion detection.

This paper investigates UAV-based hierarchical railway intrusion detection with a two-stage algorithm adapted from the lightweight and classical ERFNet [12] and YOLOv5. In the first stage, ERFNet is exploited to fulfill the hierarchical regional division with the railway area as the core area of concern, and YOLOv5 is adopted to detect all potential foreign objects in the whole image. Then in the second stage, hierarchical regions are matched with all those detected potential foreign objects, endowing them with positional information relative to the railway area. In this way, the area-matched potential foreign objects can be divided into differentiated degrees of risk to complete an assessment of the railway operating environment under the wide field of view of UAVs.

2 Methodology

Railway intrusion detection of the railway operation environment is of great significance for the realization of automated UAV-based rapid railway inspections. It is easy to agree that different distances from the railway area represent different degrees of safety risks caused to normal railway operation. This section elaborates on the methodology of distance-based differentiated and hierarchical intrusion detection of potential and actual railway foreign objects in detail.

A general architecture for UAV-based hierarchical railway intrusion detection is shown in Fig. 1. This method first realized the real-time railway segmentation and hierarchical railway area division along the railway line and then completed real-time object detection. The two steps are both performed from the UAV perspective. The original image taken by the high-resolution camera of UAV is firstly fed into the lightweight segmentation and detection algorithms to obtain an initial potential foreign object detection and railway area segmentation as presented in the detected moving target in (b) and the core area in (c), respectively. Then the core area indicating the railway area is then further processed to obtain hierarchical region divisions, i.e., core area, adjacent area, and peripheral area. After that, all the detected potential or actual foreign objects are matched positionally with the hierarchical region division results to accomplish the hierarchical intrusion detection, as depicted in (d).

2.1 Real-Time Railway Segmentation

The remote sensing images collected by UAVs have a wide field of view and rich content, to which the railway area should be paid extra attention. Therefore, in order to obtain regions of different degrees of interest in these images, it is necessary to effectively segment the original image effectively. To achieve the real-time hierarchical perception of the railway operation and surrounding environment, ERFNet is employed to complete the segmentation of the railway area. It is a popular semantic segmentation deep architecture that can make a good tradeoff between high precision and computing efficiency. The superior performance has been verified on platforms of a large-scale GPU card (83+ FPS in a single Titan X) and embedded edge computing device (7 FPS in an NVIDIA Jetson TX1) and public data set Cityscapes.

The network architecture of ERFNet is shown in Fig. 2, in which (a) gives the structure of the main building block Non-bt-1D and (b) gives the encoder-decoder structure of the network. The block in (a) has the same input and output channel numbers and is a non-bottleneck design, in which all convolutions use one-dimensional kernels with the size of 1 × 3 or 3 × 1. The encoder and the decoder are composed of layers 0–12 and 13–18, respectively. The downsampler is used to downsample feature maps by concatenating the parallel outputs of a single 3 × 3 convolution with stride 2 and a Max-Pooling module. The upsampling module is realized by simple deconvolution layers with stride 2, which are also called transposed convolutions. The network makes an output with the same resolution as the input image and C channels corresponding to C classes in the dataset.

2.2 Hierarchical Railway Area Division

The railway area parsed by the deep networks usually has relatively fuzzy and rough boundaries, which is not conducive to the realization of distance-based hierarchical railway area division. Therefore, the final railway core area needs to be smoothed. Figure 3 gives three examples that have different shapes of railway area, i.e., (a) trapezoidal railway area, (b) curved railway area, and (c) regular straight railway area. The largest connected area of the initial segmented railway area is denoted as ${\rm{\mathcal{A}}}_{core}^O$.

Then the center line of the railway area can be sampled uniformly and vertically to ${\rm{\mathcal{N}}}$ points, denoted as ${\rm{ }}\{ (x_1 , \, y_1 ), \, ...,(x_{\rm{\mathcal{N}}} , \, y_{\rm{\mathcal{N}}} )\}$. Then these ${\rm{\mathcal{N}}}$ points are used to perform linear regression to produce a straight centerline. The left and right boundaries of the railway area are determined by moving the centerline to the left and right until it passes through the leftmost and rightmost pixel points of the railway area ${\rm{\mathcal{A}}}_{core}^O$. The two boundary lines together with the upper and lower boundaries of the image form the smoothed railway area determined as the minimum circumscribed parallelogram (MCP) ${\rm{\mathcal{A}}}_{core} = \{ (i_C , \, j_C )|0 \le i_C \le w, \, 0 \le j_C \le h\}$, as given in the middle images of (a), (b), and (c) in Fig. 3. The pixel length of the upper boundary of ${\rm{\mathcal{A}}}_{core}$ is defined as $d$. When moving ${\rm{\mathcal{A}}}_{core}^O$ to the left and right by the distance of $\lambda d$ pixels respectively, the adjacent areas ${\rm{\mathcal{A}}}_{adj}$ can be obtained by superimposing them with the image itself ${\rm{\mathcal{I}}}$, as shown in formula (1), in which $w$ and $h$ are the width and the height of the image.

$$ {\rm{\mathcal{A}}}_{adj} = \{ (i_C \pm \lambda d, \, j_C )|0 \le i_C \pm \lambda d \le w, \, 0 \le j_C \le h, \, (i_C , \, j_C ) \in {\rm{\mathcal{A}}}_{core} \} $$

(1)

Here the adjacent area is defined to have the same width as the core area, i.e., $\lambda$ is taken as 1.0, which can be changed as appropriate. The peripheral areas ${\rm{\mathcal{A}}}_{per}$ can be calculated by the formula (2). The final hierarchical area division is presented in the right images of Fig. 3.

$$ {\rm{\mathcal{I}}} = {\rm{\mathcal{A}}}_{core} + {\rm{\mathcal{A}}}_{adj} + {\rm{\mathcal{A}}}_{per} $$

(2)

2.3 Area-Matched Intrusion Detection

Potential or actual foreign objects at different distances to the railway area pose different degrees of risk to the safety of railway operations. But it is necessary to first detect all potential foreign objects existing in the whole image. YOLOv5 is a very classic object detection algorithm that has been widely applied and deployed due to its excellent performance. This work also takes YOLOv5 as the detector to detect foreign objects. The locations of detected objects are then matched to the hierarchical areas.

$$ Att_{obj} = \left\{ {\begin{array}{*{20}c} {{\rm{\mathcal{A}}}_{core} , \, if \, Loc_{obj} \in {\rm{\mathcal{A}}}_{core} \, (c_x \in i_C \, {\rm{and}} \, c_y \in j_C )} \\ {{\rm{\mathcal{A}}}_{adj} , \, if \, Loc_{obj} \in {\rm{\mathcal{A}}}_{adj} \, (c_x \in i_A \, {\rm{and}} \, c_y \in j_A )} \\ {{\rm{\mathcal{A}}}_{per} , \, if \, Loc_{obj} \in {\rm{\mathcal{A}}}_{per} { (}c_x \in i_P \, {\rm{and}} \, c_y \in j_P {)}} \\ \end{array} } \right. $$

(3)

Assuming that the position of a detected object is Loc_obj = (c_x, c_y), then its distance-based area attribute Att_obj can be matched by formula (3). Area attributes to which all potential foreign objects belong are assigned. Different area attributes, i.e., core, adjacent, and peripheral area, represent different levels of risk, i.e., high, medium, and low risk, that may be caused to the railway safety operation. Objects assigned ${\rm{\mathcal{A}}}_{core}$ are categorized as high risk of actual foreign objects intruding into the railway area.

3 Experiments

3.1 Data Acquisition and Datasets

The UAV-based data acquisition, processing, and cloud communication process is shown in Fig. 4a. Two datasets are built for railway area segmentation and intrusion detection, respectively. With a fixed-wing UAV as shown in Fig. 4b, the images in the two datasets are collected from the same railway scene in Ma’anshan, Anhui Province, China. The segmentation dataset contains 1602 images in total, including 1442 images for training and 160 images for validation. The dataset has two classes representing railway and non-railway areas respectively. The intrusion detection dataset contains 1123 images in all. The sizes for training and validation are 1010 and 113, respectively. The dataset concerns one class of moving objects, i.e., person. Figure 4c presents two embedded edge computing devices that can be carried on the UAVs. They can be used to run the developed lightweight intelligent algorithms in real-time and complete mutual communication with the cloud.

3.2 Distance-Based Area Division and Object Detection

Experiment results on the distance-based area division and potential foreign object detection of the full image are presented. The trained ERFNet obtains a mIoU of 0.956 on the validation dataset. The top row of Fig. 5a shows some initial segmentation results by ERFNet, which have rough edges or truncated segments. Then the segmentation results are smoothed by their MCPs to generate a hierarchical division of risk areas, as shown in the bottom row of Fig. 5a. For the sake of picture clarity, the corresponding peripheral areas are not drawn. As the smallest one in the yolov5 model series, YOLOv5s achieves a mAP@0.5 of 0.921 on the validation dataset. The evaluation P-R curve is given in Fig. 5b. Figures (c) and (d) of Fig. 5 present some detected visual samples. Despite the high mAP, moving persons on the ground, as small as a few pixels in a 1080p image, are quite difficult to recognize, even for human eyes. This also illustrates the great challenge of small object detection for UAV aerial images with a large field of view, as shown in the FP and FN cases in Fig. 5d.

3.3 Hierarchical Railway Intrusion Detection

With the results on both area divisions of differentiated risk levels and potential foreign object detection combined, the hierarchical railway intrusion detection is completed. It’s verified the proposed MCP smoothing method has excellent effects.

As given in Fig. 6, detected potential foreign objects (persons) with different risk levels are represented by circles of different colors, which can be used to assess railway intrusion risks. Red, yellow, and green represent high, medium, and low risks, respectively. For the sake of image clarity, the peripheral areas are not drawn in color.

4 Conclusion

Rapid intrusion detection under the UAV’s wide field of view has great advantages for real-time monitoring and early detection compared with other inspection methods. This paper proposed a hierarchical area-matched intrusion detection method based on UAV remote sensing images. Experiments on deep and lightweight segmentation and detection architectures show promising results. Future work will conduct more in-depth research on a large number of small object detection challenges and more refined regional divisions.

References

Foudeh, H.: An advanced unmanned aerial vehicle (UAV) approach via learning-based control for overhead power line monitoring: a comprehensive review. IEEE Access 9, 130410–130433 (2021)
Article Google Scholar
Bouguettaya, A.: Vehicle detection from UAV imagery with deep learning: a review. IEEE Trans Neural Netw. Learn. Syst. 33(11), 6047–6067 (2022)
Article Google Scholar
Kang, D.: Autonomous UAVs for structural health monitoring using deep learning and an ultrasonic beacon system with geo-tagging. Comput-Aided Civ. Inf. 33(10), 885–902 (2018)
Article Google Scholar
Li, Y.: Multi-block SSD based on small object detection for UAV railway scene surveillance. Chin. J. Aeronaut. 33(6), 1747–1755 (2020)
Article Google Scholar
Tong, L.: Fully decoupled residual convNet for real-time railway scene parsing of UAV aerial images. IEEE Trans. Intell. Transp. Syst. 23(9), 14806–14819 (2022)
Article Google Scholar
Tong, L.: Anchor-adaptive railway track detection from unmanned aerial vehicle images. Comput-Aided Civ. Inf. (early view), 1–19 (2023)
Google Scholar
Geng, Y.: UAV-LiDAR-based measuring framework for height and stagger of high-speed railway contact wire. IEEE Trans. Intell. Transp. Syst. 23(7), 7587–7600 (2022)
Article Google Scholar
Geng, Y.: 3DGraphSeg: a unified graph representation-based point cloud segmentation framework for full-range highspeed railway environments. IEEE Trans. Ind. Inform. (early access) (2023)
Google Scholar
Bertrand, S., Raballand, N.: Ground risk assessment for long-range inspection missions of railways by UAVs. In: ICUAS 2017, pp. 1343–1351. Miami, FL, US (2017)
Google Scholar
Zhao, Z.: Object detection with deep learning: a review. IEEE Trans. Neural Netw. Learn. Syst. 30(11), 3212–3232 (2019)
Article Google Scholar
Mo, Y.: Review the state-of-the-art technologies of semantic segmentation based on deep learning. Neurocomputing 493, 626–646 (2022)
Article Google Scholar
Romera, E.: ERFNet: efficient residual factorized convNet for real-time semantic segmentation. IEEE Trans. Intell. Transp. Syst. 19(1), 263–272 (2018)
Article Google Scholar

Download references

Acknowledgment

This work is supported in part by the Fundamental Research Funds for the Central Universities 2022YJS111, in part by the National Key R&D Program of China under Grant 2022YFB4300601, and in part by the National Natural Science Foundation of China under Grant 61833002.

Author information

Authors and Affiliations

State Key Lab of Rail Traffic Control and Safety, Beijing Jiaotong University, Beijing, 100044, China
Lei Tong, Zhipeng Wang & Yong Qin
Shenzhen United Aircraft Technology Co. Ltd., Shenzhen, 518100, China
Tian Tang
China Railway Electrification Bureau Group Co., Ltd., Beijing, 100036, China
Bidong Miao & Zhaoyu Li

Authors

Lei Tong
View author publications
You can also search for this author in PubMed Google Scholar
Zhipeng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yong Qin
View author publications
You can also search for this author in PubMed Google Scholar
Tian Tang
View author publications
You can also search for this author in PubMed Google Scholar
Bidong Miao
View author publications
You can also search for this author in PubMed Google Scholar
Zhaoyu Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhipeng Wang .

Editor information

Editors and Affiliations

Beijing University of Civil Engineering and Architecture, Beijing, China
Jianwei Yang
Beijing University of Civil Engineering and Architecture, Beijing, China
Dechen Yao
Beijing Jiaotong University, Beijing, China
Limin Jia
Beijing Jiaotong University, Beijing, China
Yong Qin
Beijing Jiaotong University, Beijing, China
Zhigang Liu
Beijing Jiaotong University, Beijing, China
Lijun Diao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tong, L., Wang, Z., Qin, Y., Tang, T., Miao, B., Li, Z. (2024). Leveraging UAVs for Rapid and Hierarchical Railway Intrusion Detection. In: Yang, J., Yao, D., Jia, L., Qin, Y., Liu, Z., Diao, L. (eds) Proceedings of the 6th International Conference on Electrical Engineering and Information Technologies for Rail Transportation (EITRT) 2023. EITRT 2023. Lecture Notes in Electrical Engineering, vol 1136. Springer, Singapore. https://doi.org/10.1007/978-981-99-9315-4_14

Download citation

DOI: https://doi.org/10.1007/978-981-99-9315-4_14
Published: 09 January 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-9314-7
Online ISBN: 978-981-99-9315-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics