Abstract
Recent years, with a boundless growth in the field of autonomous driving, three-dimensional object detection technologies based on LiDAR and cameras have attracted extensively attentions. However, the LiDAR cannot obtain the complete point cloud information of the object, especially for the object that is heavily occluded. The existing multi-view detection schemes rarely process point cloud data. Thus, in this paper, on the basis of the two-stage detection network, a sampling method based on cylindrical space is proposed, combining the original point cloud and the output of the first region proposal network (RPN) to enhance the point cloud of the objects in the three-dimensional space. The second RPN will complete the accurate regression and classification of the three-dimensional bounding box by using enhanced point clouds. We solve the problem of object point cloud incomplete well. The proposed scheme is verified on the KITTI dataset. The verification results show that the method not only has good results in car three-dimensional and Bird’s Eye View (BEV) object detection, but also achieves the state of the art results in pedestrian detection, especially on heavily occluded objects.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Wen, L., Jo, K.: Fast and accurate 3D object detection for Lidar-camera-based autonomous vehicles using one shared voxel-based backbone. IEEE Access 9, 22080 (2021)
Chen, X., Ma, H., Wan, J., Li, B., Xia, T.: Multi-view 3D object detection network for autonomous driving. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016)
Qi, C.R., Su, H., Mo, K., Guibas, L.J.: PointNet: deep learning on point sets for 3D classification and segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition (2016)
Tatebe, Y., Deguchi, D., Kawanishi, Y., Ide, I., Murase, H., Sakai, U.: Pedestrian detection from sparse point-cloud using 3DCNN. In: IEEE 2018 International Workshop on Advanced Image Technology (IWAIT), vol. 1. IEEE (2018)
Wang, L., Fan, X., Chen, J., Cheng, J., Tan, J., Ma, X.: 3D object detection based on sparse convolution neural network and feature fusion for autonomous driving in smart cities. Sustain. Cities Soc. 54 102002 (2020)
Zhou, Y., Tuzel, O.: VoxelNet: end-to-end learning for point cloud based 3D object detection. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (2017)
Cui, Z., Zhang, Z.: PVF-NET: point & voxel fusion 3D object detection framework for point cloud. In: Conference on Computer and Robot Vision (CRV), vol. 125. IEEE (2020)
Ye, M., Xu, S., Cao, T.: HVNet: hybrid voxel network for LiDAR based 3D Object detection. In: the IEEE/CVF conference on Computer Vision and Pattern Recognition (CVPR) (2020)
Shi S., Guo C., Jiang L., Wang Z., Shi J., Wang X., Li H.: PV-RCNN: point-voxel feature set abstraction for 3D object detection. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2019)
Deng, J., Czarnecki, K.: MLOD: a multi-view 3D object detection based on robust feature fusion method. In: 2019 IEEE Intelligent Transportation Systems Conference, ITSC 2019. Institute of Electrical and Electronics Engineers Inc., vol. 279 (2019)
Ku, J., Mozifian, M., Lee, J., Harakeh, A., Waslander, S.: Joint 3D proposal generation and object detection from view aggregation. In: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (2017)
Dai, D., Wang, J., Chen, Z., Zhao, H.: Image guidance based 3D vehicle detection in traffic scene. Neurocomputing 428, 1–11 (2021)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings. International Conference on Learning Representations, ICLR, San Diego, CA, USA (2015)
Girshick, R.: Fast R-CNN. In: The IEEE International Conference on Computer Vision (ICCV), vol. 1440. IEEE (2015)
Geiger, A., Lenz, P., Urtasun, R.: Are we ready for autonomous driving? The KITTI vision benchmark suite. In: Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, vol. 3354. IEEE (2012)
Qi, C.R., Liu, W., Wu, C., Su, H., Guibas, L.J.: Frustum PointNets for 3D object detection from RGB-D Data. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Chen, S., Wang, W. (2022). Multi-view 3D Object Detection Based on Point Cloud Enhancement. In: Zhang, Z. (eds) 2021 6th International Conference on Intelligent Transportation Engineering (ICITE 2021). ICITE 2021. Lecture Notes in Electrical Engineering, vol 901. Springer, Singapore. https://doi.org/10.1007/978-981-19-2259-6_42
Download citation
DOI: https://doi.org/10.1007/978-981-19-2259-6_42
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-2258-9
Online ISBN: 978-981-19-2259-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)