A Two-Stage Lidar-Based Approach for Enhanced Pedestrian and Cyclist Detection

Ma, Yue; Miao, Lei; Wang, Haosen; Li, Yan; Lu, Bo; Wang, Shifeng

doi:10.1007/s10946-023-10158-2

Published: 23 November 2023

Volume 44, pages 513–522 (2023)

Journal of Russian Laser Research

Yue Ma¹, Lei Miao¹, Haosen Wang¹, Yan Li², Bo Lu¹˒³ & Shifeng Wang¹

Abstract

In recent years, the application scope of LIDAR has expanded steadily, especially in object detection. Yet existing LIDAR-based methods focus on detecting vehicles on regular roadways, while scenarios with a higher prevalence of pedestrians and cyclists, such as university campuses and leisure centers, have received limited attention. To address this gap, we propose a detection algorithm named SecondRcnn, which builds upon the SECOND algorithm and introduces a two-stage detection method. In the first stage, it applies 3D sparse convolution to the voxelized LIDAR points to learn feature representations. In the second stage, regression is employed to refine the detection bounding boxes generated by the Region of Interest pooling network. The algorithm was evaluated on the widely used KITTI data set and demonstrated significant performance improvements in detecting pedestrians (4.61% improvement) and cyclists (6.5% improvement) compared to baseline networks. Our work highlights the potential for accurate object detection in scenarios characterized by a higher presence of pedestrians and cyclists, advancing the use of LIDAR in the field of 3D detection.
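
The two stages summarized in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `voxelize` and `refine_box`, the voxel size, the point-cloud range, and the residual parameterization (the one commonly used in VoxelNet-style detectors) are all assumptions made for illustration. The first function groups points into voxels and averages them, which is the kind of input a sparse 3D convolution backbone would consume; the second applies regressed residuals to a proposal box, as a refinement head might.

```python
import numpy as np

def voxelize(points, voxel_size, pc_range):
    """Group points into voxels and return the mean point of each voxel.

    points     : (N, 3) array of x, y, z coordinates
    voxel_size : (3,) voxel edge lengths (hypothetical values)
    pc_range   : (6,) xmin, ymin, zmin, xmax, ymax, zmax (hypothetical values)
    Returns (coords, feats): voxel indices (M, 3) and mean point per voxel (M, 3).
    """
    points = np.asarray(points, dtype=np.float64)
    voxel_size = np.asarray(voxel_size, dtype=np.float64)
    lo = np.asarray(pc_range[:3], dtype=np.float64)
    hi = np.asarray(pc_range[3:], dtype=np.float64)

    # Drop points that fall outside the detection range.
    keep = np.all((points >= lo) & (points < hi), axis=1)
    points = points[keep]

    # Integer voxel index of every remaining point.
    idx = np.floor((points - lo) / voxel_size).astype(np.int64)

    # Unique occupied voxels, plus the voxel each point belongs to.
    coords, inverse = np.unique(idx, axis=0, return_inverse=True)
    inverse = inverse.reshape(-1)

    # Average the points that share a voxel.
    sums = np.zeros((len(coords), 3))
    np.add.at(sums, inverse, points)
    counts = np.bincount(inverse, minlength=len(coords))
    feats = sums / counts[:, None]
    return coords, feats

def refine_box(box, residual):
    """Apply regressed residuals to a proposal box (x, y, z, w, l, h, theta).

    Assumed parameterization: centre offsets scaled by the box diagonal
    (x, y) or height (z), log-scale size offsets, additive angle offset.
    """
    x, y, z, w, l, h, t = box
    dx, dy, dz, dw, dl, dh, dt = residual
    diag = np.sqrt(w ** 2 + l ** 2)
    return np.array([
        x + dx * diag,
        y + dy * diag,
        z + dz * h,
        w * np.exp(dw),
        l * np.exp(dl),
        h * np.exp(dh),
        t + dt,
    ])

if __name__ == "__main__":
    pts = [[0.1, 0.1, 0.1], [0.3, 0.3, 0.3], [1.5, 0.2, 0.2], [10.0, 0.0, 0.0]]
    coords, feats = voxelize(pts, (1.0, 1.0, 1.0), (0, 0, 0, 4, 4, 4))
    print(coords)
    print(feats)
    print(refine_box([0, 0, 0, 3, 4, 2, 0], [0.1, 0, 0, 0, 0, 0, 0]))
```

In a full detector, the per-voxel features would pass through sparse convolution layers and a region proposal network before any refinement; the sketch only shows the data handling at the two ends of that pipeline.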

References

Y. Chen, P. F. Zhang, S. F. Wang, et al., “Image feature based machine learning approach for road terrain classification,” in: 2018 IEEE International Conference on Mechatronics and Automation (ICMA), IEEE Publ. (2018), pp. 2097–2102.
P. F. Zhang, X. Dai, J. P. Ding, et al., Lasers Eng., 42, 187 (2019).
B. Li, T. L. Zhang, and T. Xia, “Vehicle detection from 3D LIDAR using fully convolutional network,” arXiv:1608.07916 (2016).
B. Yang, W. J. Luo, and R. Urtasun, “PIXOR: Real-time 3D object detection from Point Clouds,” in: Proceedings of the IEEE conference on Computer Vision and Pattern Recognition (2018), pp. 7652–7660; arXiv:1902.06326 [cs.CV]
X. Z. Chen, H. Ma, J. Wan, et al., “Multi-view 3D object detection network for autonomous driving,” in: Proceedings of the IEEE conference on Computer Vision and Pattern Recognition (2017), pp. 1907–1915; arXiv:1611.07759 [cs.CV]
C. R. Qi, H. Su, K. Mo, et al., “PointNet: Deep learning on point sets for 3D classification and segmentation,” in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017), pp. 652–660.
C. R. Qi, L. Yi, H. Su, et al., “Pointnet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space,” in: I. Guyon, U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.) Advances in Neural Information Processing Systems 30 (NIPS 2017), Curran Associates, NY (2017), pp. 5105–5114; ISBN 9781510860964; arXiv:1706.02413 [cs.CV]
S. S. Shi, X. G. Wang, and H. S. Li, “PointRCNN: 3D object proposal generation and detection from point cloud,” in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2019), pp. 770–779; github.com/sshaoshuai/PointRCNN
Y. Zhou and O. Tuzel, “VoxelNet: End-to-end learning for point cloud based 3D object detection,” in: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 2018, pp. 4490–4499, https://doi.org/10.1109/CVPR.2018.00472
Y. Yan, Y. X. Mao, and B. Li, Sensors, 18, 3337 (2018).
A. Geiger, P. Lenz, C. Stiller, et al., Int. J. Rob. Res., 32, 1231 (2013).
S. Ren, K. He, R. Girshick, and J. Sun, IEEE Trans. Pattern Anal. Mach. Intell., 39, 1137 (2017); https://doi.org/10.1109/TPAMI.2016.2577031
T. Y. Lin, P. Goyal, R. Girshick, et al., “Focal loss for dense object detection,” in: Proceedings of the IEEE International Conference on Computer Vision (2017), pp. 2980–2988.
Z. L. Zhang and M. Sabuncu, in: S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett (Eds.), NIPS’18: Proceedings of the 32nd International Conference on Neural Information Processing Systems, Curran Associates, NY (2018), pp. 8792-8802; ISBN: 9781510884472; arXiv:1805.07836 [cs.LG]
A. H. Lang, S. Vora, H. Caesar, et al., “PointPillars: Fast encoders for object detection from point clouds,” in: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 2019, pp. 12689–12697; https://doi.org/10.1109/CVPR.2019.01298

Author information

Authors and Affiliations

School of Optoelectronic Engineering, Changchun University of Science and Technology, Changchun, 130022, China
Yue Ma, Lei Miao, Haosen Wang, Bo Lu & Shifeng Wang
School of Computing, Macquarie University, Sydney, 2892921, Australia
Yan Li
School of Computing, Engineering and Physical Sciences, University of the West of Scotland, Renfrewshire, PA12BE, UK
Bo Lu

Corresponding author

Correspondence to Shifeng Wang.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Ma, Y., Miao, L., Wang, H. et al. A Two-Stage Lidar-Based Approach for Enhanced Pedestrian and Cyclist Detection. J Russ Laser Res 44, 513–522 (2023). https://doi.org/10.1007/s10946-023-10158-2

Received: 26 June 2023
Published: 23 November 2023
Issue Date: September 2023
DOI: https://doi.org/10.1007/s10946-023-10158-2
