Research on the Applicability of Monocular 3d Object Detection Using CARLA Simulator

Filatov, Nikolay; Isakov, Tim; Bakhshiev, Aleksandr

doi:10.1007/978-3-030-91581-0_30

Part of the book series: Studies in Computational Intelligence (SCI, volume 1008)

Included in the following conference series: International Conference on Neuroinformatics

Abstract

Monocular 3D object detection methods are promising for building autonomous robots without lidar, which can significantly reduce production costs. However, these methods tend to have low precision because they infer distances to objects inaccurately. Nevertheless, there are several ways to measure the impact of detection precision on the downstream autonomous driving task. In this work, autonomous agents that use lidar, a monocular camera, and ground truth for 3D object detection are compared in the CARLA simulator. Each agent drove a set of routes with challenging traffic situations, totaling 122.5 km driven. Driving quality was assessed using collision statistics; by this measure, the agent using a monocular camera performed 4.5% better than the agent using lidar. This indicates that monocular 3D object detection algorithms are applicable in certain cases.

Acknowledgements

This work was done as part of the state task of the Ministry of Education and Science of Russia No. 075-00913-21-01 “Development and study of new architectures of reconfigurable growing neural networks, methods and algorithms for their learning”.

Author information

Authors and Affiliations

The Russian State Scientific Center for Robotics and Technical Cybernetics, Tikhoretsky Prospect 21, 194064, Saint-Petersburg, Russia
Nikolay Filatov & Tim Isakov
Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya, 29, 195251, Saint-Petersburg, Russia
Nikolay Filatov, Tim Isakov & Aleksandr Bakhshiev

Corresponding author

Correspondence to Nikolay Filatov.

Editor information

Editors and Affiliations

Scientific Research Institute for System Analysis, Russian Academy of Sciences, Moscow, Russia
Boris Kryzhanovsky
Scientific Research Institute for System Analysis, Russian Academy of Sciences, Moscow, Russia
Witali Dunin-Barkowski
Scientific Research Institute for System Analysis, Russian Academy of Sciences, Moscow, Russia
Vladimir Redko
Moscow Aviation Institute (National Research University), Moscow, Russia
Yury Tiumentsev
MEPhI, National Research Nuclear University, Moscow, Russia
Valentin V. Klimov

About this paper

Cite this paper

Filatov, N., Isakov, T., Bakhshiev, A. (2022). Research on the Applicability of Monocular 3d Object Detection Using CARLA Simulator. In: Kryzhanovsky, B., Dunin-Barkowski, W., Redko, V., Tiumentsev, Y., Klimov, V.V. (eds) Advances in Neural Computation, Machine Learning, and Cognitive Research V. NEUROINFORMATICS 2021. Studies in Computational Intelligence, vol 1008. Springer, Cham. https://doi.org/10.1007/978-3-030-91581-0_30

DOI: https://doi.org/10.1007/978-3-030-91581-0_30
Published: 23 November 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-91580-3
Online ISBN: 978-3-030-91581-0
eBook Packages: Intelligent Technologies and Robotics (R0)
