Abstract
Monocular 3d object detection methods are promising in the field of making autonomous robots without lidar, which can reduce costs of production significantly. However monocular 3d object detection methods tend to have low precision due to inaccurate inference of distances to objects. Nevertheless, there are several ways to measure the impact of detection precision on the downstream autonomous driving task. In this work, autonomous agents which use lidar, monocular camera, and ground truth for 3d object detection are compared in the CARLA simulator. Each agent has passed a set of routes with challenging traffic situations, totaling 122.5 km driven. Quality of movement was assessed using the collisions statistics, as a result, the agent using a monocular camera performed 4.5% better than the agent using lidar. This indicates the applicability of monocular 3d object detection algorithms in certain cases.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Mao, H., Yang, X., Dally, W.J.: A delay metric for video object detection: what average precision fails to tell. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 573–582 (2019)
Philion, J., Kar, A., Fidler, S.: Learning to evaluate perception models using planner-centric metrics. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14055–14064 (2020)
Zeng, W., et al.: End-to-end interpretable neural motion planner. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8660–8669 (2019)
Guo, Y., Caesar, H., Beijbom, O., Philion, J., Fidler, S.: The efficacy of neural planning metrics: a meta-analysis of PKL on nuScenes. arXiv preprint arXiv:2010.09350 (2020)
Caesar, H., et al.: nuScenes: a multimodal dataset for autonomous driving (2019)
Dosovitskiy, A., Ros, G., Codevilla, F., Lopez, A., Koltun, V.: CARLA: an open urban driving simulator. In: Conference on Robot Learning, pp. 1–16. PMLR (2017)
Wang, Y., Chao, W.-L., Garg, D., Hariharan, B., Campbell, M., Weinberger, K.Q.: Pseudo-lidar from visual depth estimation: bridging the gap in 3D object detection for autonomous driving. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8445–8453 (2019)
Barabanau, I., Artemov, A., Burnaev, E., Murashkin, V.: Monocular 3D object detection via geometric reasoning on keypoints. arXiv preprint arXiv:1905.05618 (2019)
Simonelli, A., Bulo, S.R., Porzi, L., López-Antequera, M., Kontschieder, P.: Disentangling monocular 3D object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1991–1999 (2019)
Lin, T.-Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)
Wang, T., Zhu, X., Pang, J., Lin, D.: FCOS3D: fully convolutional one-stage monocular 3D object detection. arXiv preprint arXiv:2104.10956 (2021)
Zhou, X., Wang, D., Krähenbühl, P.: Objects as points. arXiv preprint arXiv:1904.07850 (2019)
Yin, T., Zhou, X., Krähenbühl, P.: Center-based 3D object detection and tracking. arXiv preprint arXiv:2006.11275 (2020)
Najm, W.G., Smith, J.D., Yanagisawa, M.: Pre-crash scenario typology for crash avoidance research. National Highway Traffic Safety Administration, USA (2007)
Acknowledgements
This work was done as the part of the state task of the Ministry of Education and Science of Russia No. 075-00913-21-01 “Development and study of new architectures of reconfigurable growing neural networks, methods and algorithms for their learning”.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Filatov, N., Isakov, T., Bakhshiev, A. (2022). Research on the Applicability of Monocular 3d Object Detection Using CARLA Simulator. In: Kryzhanovsky, B., Dunin-Barkowski, W., Redko, V., Tiumentsev, Y., Klimov, V.V. (eds) Advances in Neural Computation, Machine Learning, and Cognitive Research V. NEUROINFORMATICS 2021. Studies in Computational Intelligence, vol 1008. Springer, Cham. https://doi.org/10.1007/978-3-030-91581-0_30
Download citation
DOI: https://doi.org/10.1007/978-3-030-91581-0_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-91580-3
Online ISBN: 978-3-030-91581-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)