Abstract
In recent years, the application scope of LIDAR has been continuously expanding, especially in object detection. Yet existing LIDAR-based methods focus on detecting vehicles on regular roadways. Scenarios with a higher prevalence of pedestrians and cyclists, such as university campuses and leisure centers, have recently received limited attention. To solve this problem, in this paper we propose a novel detection algorithm named SecondRcnn, which is built upon the SECOND algorithm and introduces a novel two-stage detection method. In the first stage, it utilizes 3D sparse convolution on the voxel LIDAR points to learn feature representations. In the second stage, regression is employed to refine the detection bounding boxes generated by the Region Of Interest pooling network. The algorithm was evaluated on the widely used KITTI data set and demonstrated significant performance improvements in detecting pedestrians (4.61% improvement) and cyclist (6.5% improvement) compared to baseline networks. Our work highlights the potential for accurate object detection in scenarios characterized by a higher presence of pedestrians and cyclists. Advancing the use of LIDAR in the field of 3D detection.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
Y. Chen, P. F. Zhang, S. F. Wang, et al., “Image feature based machine learning approach for road terrain classification,” in: 2018 IEEE International Conference on Mechatronics and Automation (ICMA), IEEE Publ. (2018), pp. 2097–2102.
P. F. Zhang, X. Dai, J. P. Ding, et al., Lasers Eng., 42, 187 (2019).
B. Li, T. l. Zhang, and T. Xia, “Vehicle detection from 3D LIDAR using fully convolutional network,” arXiv:1608.07916 (2016).
B. Yang, W. J. Luo, and R. Urtasun, “PIXOR: Real-time 3D object detection from Point Clouds,” in: Proceedings of the IEEE conference on Computer Vision and Pattern Recognition (2018), pp. 7652–7660; arXiv:1902.06326 [cs.CV]
X. Z. Chen, H. Ma, J. Wan, et al., “Multi-view 3D object detection network for autonomous driving,” in: Proceedings of the IEEE conference on Computer Vision and Pattern Recognition (2017), pp. 1907–1915; arXiv:1611.07759 [cs.CV]
C. R. Qi, H. Su, K. Mo, et al., “PointNet: Deep learning on point sets for 3D classification and segmentation,” in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017), pp. 652–660.
C. R. Qi, L. Yi, H. Su, et al., “Pointnet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space,” in: I. Guyon, U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.) Advances in Neural Information Processing Systems 30 (NIPS 2017), Curran Associates, NY (2017), pp. 5105–5114; ISBN 9781510860964; arXiv:1706.02413 [cs.CV]
S. S. Shi, X. G. Wang, and H. S. Li, “PointRCNN: 3D object proposal generation and detection from point cloud,” in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2019), pp. 770–779; github.com/sshaoshuai/PointRCNN
Y. Zhou and O. Tuzel, “VoxelNet: End-to-end learning for point cloud based 3D object detection,” in: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 2018, pp. 4490–4499, https://doi.org/10.1109/CVPR.2018.00472
Y. Yan, Y. X. Mao, and B. Li, Sensors, 18, 3337 (2018).
A. Geiger, P. Lenz, C. Stiller, et al., Int. J. Rob. Res., 32, 1231 (2013).
S. Ren, K. He, R. Girshick, and J. Sun, IEEE Trans. Pattern Anal. Mach. Intell., 39, 1137 (2017); https://doi.org/10.1109/TPAMI.2016.2577031
T. Y. Lin, P. Goyal, R. Girshick, et al., “Focal loss for dense object detection,” in: Proceedings of the IEEE International Conference on Computer Vision (2017), pp. 2980–2988.
Z. L. Zhang and M. Sabuncu, in: S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett (Eds.), NIPS’18: Proceedings of the 32nd International Conference on Neural Information Processing Systems, Curran Associates, NY (2018), pp. 8792-8802; ISBN: 9781510884472; arXiv:1805.07836 [cs.LG]
A. H. Lang, S. Vora, H. Caesar, et al., “PointPillars: Fast encoders for object detection from point clouds,” in: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 2019, pp. 12689–12697; https://doi.org/10.1109/CVPR.2019.01298
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Ma, Y., Miao, L., Wang, H. et al. A Two-Stage Lidar-Based Approach for Enhanced Pedestrian and Cyclist Detection. J Russ Laser Res 44, 513–522 (2023). https://doi.org/10.1007/s10946-023-10158-2
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10946-023-10158-2