Skip to main content

Research on the Applicability of Monocular 3d Object Detection Using CARLA Simulator

  • Conference paper
  • First Online:
Advances in Neural Computation, Machine Learning, and Cognitive Research V (NEUROINFORMATICS 2021)

Part of the book series: Studies in Computational Intelligence ((SCI,volume 1008))

Included in the following conference series:

  • 507 Accesses

Abstract

Monocular 3d object detection methods are promising in the field of making autonomous robots without lidar, which can reduce costs of production significantly. However monocular 3d object detection methods tend to have low precision due to inaccurate inference of distances to objects. Nevertheless, there are several ways to measure the impact of detection precision on the downstream autonomous driving task. In this work, autonomous agents which use lidar, monocular camera, and ground truth for 3d object detection are compared in the CARLA simulator. Each agent has passed a set of routes with challenging traffic situations, totaling 122.5 km driven. Quality of movement was assessed using the collisions statistics, as a result, the agent using a monocular camera performed 4.5% better than the agent using lidar. This indicates the applicability of monocular 3d object detection algorithms in certain cases.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Mao, H., Yang, X., Dally, W.J.: A delay metric for video object detection: what average precision fails to tell. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 573–582 (2019)

    Google Scholar 

  2. Philion, J., Kar, A., Fidler, S.: Learning to evaluate perception models using planner-centric metrics. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14055–14064 (2020)

    Google Scholar 

  3. Zeng, W., et al.: End-to-end interpretable neural motion planner. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8660–8669 (2019)

    Google Scholar 

  4. Guo, Y., Caesar, H., Beijbom, O., Philion, J., Fidler, S.: The efficacy of neural planning metrics: a meta-analysis of PKL on nuScenes. arXiv preprint arXiv:2010.09350 (2020)

  5. Caesar, H., et al.: nuScenes: a multimodal dataset for autonomous driving (2019)

    Google Scholar 

  6. Dosovitskiy, A., Ros, G., Codevilla, F., Lopez, A., Koltun, V.: CARLA: an open urban driving simulator. In: Conference on Robot Learning, pp. 1–16. PMLR (2017)

    Google Scholar 

  7. Wang, Y., Chao, W.-L., Garg, D., Hariharan, B., Campbell, M., Weinberger, K.Q.: Pseudo-lidar from visual depth estimation: bridging the gap in 3D object detection for autonomous driving. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8445–8453 (2019)

    Google Scholar 

  8. Barabanau, I., Artemov, A., Burnaev, E., Murashkin, V.: Monocular 3D object detection via geometric reasoning on keypoints. arXiv preprint arXiv:1905.05618 (2019)

  9. Simonelli, A., Bulo, S.R., Porzi, L., López-Antequera, M., Kontschieder, P.: Disentangling monocular 3D object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1991–1999 (2019)

    Google Scholar 

  10. Lin, T.-Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)

    Google Scholar 

  11. Wang, T., Zhu, X., Pang, J., Lin, D.: FCOS3D: fully convolutional one-stage monocular 3D object detection. arXiv preprint arXiv:2104.10956 (2021)

  12. Zhou, X., Wang, D., Krähenbühl, P.: Objects as points. arXiv preprint arXiv:1904.07850 (2019)

  13. Yin, T., Zhou, X., Krähenbühl, P.: Center-based 3D object detection and tracking. arXiv preprint arXiv:2006.11275 (2020)

  14. Najm, W.G., Smith, J.D., Yanagisawa, M.: Pre-crash scenario typology for crash avoidance research. National Highway Traffic Safety Administration, USA (2007)

    Google Scholar 

Download references

Acknowledgements

This work was done as the part of the state task of the Ministry of Education and Science of Russia No. 075-00913-21-01 “Development and study of new architectures of reconfigurable growing neural networks, methods and algorithms for their learning”.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nikolay Filatov .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Filatov, N., Isakov, T., Bakhshiev, A. (2022). Research on the Applicability of Monocular 3d Object Detection Using CARLA Simulator. In: Kryzhanovsky, B., Dunin-Barkowski, W., Redko, V., Tiumentsev, Y., Klimov, V.V. (eds) Advances in Neural Computation, Machine Learning, and Cognitive Research V. NEUROINFORMATICS 2021. Studies in Computational Intelligence, vol 1008. Springer, Cham. https://doi.org/10.1007/978-3-030-91581-0_30

Download citation

Publish with us

Policies and ethics