Skip to main content

Survey on Convolutional Neural Networks-Based Object Detection Methods

  • Conference paper
  • First Online:
IOT with Smart Systems

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 312))

  • 579 Accesses

Abstract

The world is undergoing a deep and vast digital transformation and every day millions of images are produced and shared. There is an urgent need to extract valuable information from these images and use it for various applications. Object detection is at the forefront of it and there are many methods/algorithms which can be used for it. As a subfield of computer vision, object detection has been going through fast-paced changes. It is regarded as an exceptionally complex topic in the field of computer vision as it is the amalgamation of object classification as well as object localization. In this paper, we have taken an in-depth look at some of the widely varied state-of-the-art methods of object detection such as RetinaNet, ResNet, and ConvNet. These networks can be differentiated based on a variety of different features such as loss functions, data augmentation and feature extraction. We have compared them with their baseline models and analysed them to identify the most accurate methods. In our literature survey, we have studied the best in class techniques and have presented a brief overview of the current situation of object detection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Bochkovskiy, A., Wang, C.-Y., Liao, H.-Y.M.: YOLOv4: Optimal Speed and Accuracy of Object Detection (2020). arXiv:2004.10934v1. http://arxiv.org/abs/2004.10934

  2. Chen, C., Zheng, Z., Huang, Y., Ding, X., Yu, Y.: I3Net: implicit instance-invariant network for adapting one-stage object detectors. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021). https://doi.org/10.1109/CVPR46437.2021.01239

  3. Chen, L., Yang, T., Zhang, X., Zhang, W., Sun, J.: Points as queries: weakly semi-supervised object detection by points. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021).https://doi.org/10.1109/CVPR46437.2021.00871

  4. Chen, X., Xie, C., Tan, M., Zhang, L., Hsieh, C.-J., Gong, B.: Robust and accurate object detection via adversarial learning. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021).https://doi.org/10.1109/CVPR46437.2021.01635

  5. Chen, X., Li, H., Wu, Q., Ngan, K.N., Xu, L.: High-quality R-CNN object detection using multi-path detection calibration network. IEEE Trans. Circuits Syst. Video Technol. 31(2), 715–727 (2021). https://doi.org/10.1109/TCSVT.2020.2987465

    Article  Google Scholar 

  6. Dong, Z., Wang, M., Wang, Y., Zhu, Y., Zhang, Z.: Object detection in high resolution remote sensing imagery based on convolutional neural networks with suitable object scale features. IEEE Trans. Geosci. Remote Sens. 58(3), 2104–2114 (2020). https://doi.org/10.1109/TGRS.2019.2953119

    Article  Google Scholar 

  7. Fang, F., Li, L., Zhu, H., Lim, J.H.: Combining faster R-CNN and model-driven clustering for elongated object detection. IEEE Trans. Image Process. 29(1), 2052–2065 (2020). https://doi.org/10.1109/TIP.2019.2947792

    Article  Google Scholar 

  8. Farabet, C., Couprie, C., Najman, L., LeCun, Y.: Learning hierarchical features for scene labeling. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021). https://doi.org/10.1109/CVPR46437.2021.01559

  9. He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1904–1916 (2015). https://doi.org/10.1109/TPAMI.2015.2389824

    Article  Google Scholar 

  10. Hou, Q., Cheng, M.M., Hu, X., Borji, A., Tu, Z., Torr, P.H.S.: Deeply supervised salient object detection with short connections. IEEE Trans. Pattern Anal. Mach. Intell. 41(4), 815–828 (2019). https://doi.org/10.1109/TPAMI.2018.2815688

    Article  Google Scholar 

  11. Ibrahem, H., Salem, A.D.A., Kang, H.S.: Real-time weakly supervised object detection using center-of-features localization. IEEE Access 9, 38742–38756 (2021). https://doi.org/10.1109/ACCESS.2021.3064372

    Article  Google Scholar 

  12. Li, X., Wang, W., Hu, X., Li, J., Tang, J., Yang, J.: Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection (2020). https://arxiv.org/abs/2011.12885

  13. Li, Y., Zhu, H., Cheng, Y., Wang, W., Teo, C.S., Xiang, C., et al.: Few-shot object detection via classification refinement and distractor retreatment. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021). https://doi.org/10.1109/CVPR46437.2021.01514

  14. Lin, T.-Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: 2017 IEEE International Conference on Computer Vision (ICCV) (2017). https://doi.org/10.1109/ICCV.2017.324

  15. Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016). https://doi.org/10.1109/CVPR.2016.91

  16. Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: NIPS’15: Proceedings of the 28th International Conference on Neural Information Processing Systems, vol. 1, pp. 91–99 (2015). https://doi.org/10.5555/2969239.2969250

  17. Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., LeCun, Y.: OverFeat: integrated recognition, localization and detection using convolutional networks. In: 2nd International Conference on Learning Representations, ICLR 2014. http://arxiv.org/abs/1312.6229

  18. Shen, Y., Ji, R., Wang, C., Li, X., Li, X.: Weakly supervised object detection via object-specific pixel gradient. IEEE Trans. Neural Netw. Learn. Syst. 29(12), 5960–5970 (2018). https://doi.org/10.1109/TNNLS.2018.2816021

    Article  Google Scholar 

  19. Sun, B., Li, B., Cai, S., Yuan, Y., Zhang, C.: FSCE: few-shot object detection via contrastive proposal encoding. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021). https://doi.org/10.1109/CVPR46437.2021.00727

  20. Wang, J., Song, L., Li, Z., Sun, H., Sun, J., Zheng, N.: End-to-end object detection with fully convolutional network. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021). https://doi.org/10.1109/CVPR46437.2021.01559

  21. Yao, C., Kong, Y., Feng, L., Jin, B., Si, H.: Contour-aware recurrent cross constraint network for salient object detection. IEEE Access 8, 218739–218751 (2020). https://doi.org/10.1109/ACCESS.2020.3042203

    Article  Google Scholar 

  22. Zhang, H., Wang, Y., Dayoub, F., Sünderhauf, N.: VarifocalNet: an IoU-aware dense object detector. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021). https://doi.org/10.1109/CVPR46437.2021.00841

  23. Zhang, L., Zhou, S., Guan, J., Zhang, J.: Accurate few-shot object detection with support-query mutual guidance and hybrid loss. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021). https://doi.org/10.1109/CVPR46437.2021.01419

  24. Zhou, Y., Mao, A., Huo, S., Lei, J., Kung, S.Y.: Salient object detection via fuzzy theory and object-level enhancement. IEEE Trans. Multimedia 21(1), 74–85 (2019). https://doi.org/10.1109/TMM.2018.2845667

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Waiel Tinwala .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Tinwala, W., Rauniyar, S., Agrawal, S. (2023). Survey on Convolutional Neural Networks-Based Object Detection Methods. In: Choudrie, J., Mahalle, P., Perumal, T., Joshi, A. (eds) IOT with Smart Systems. Smart Innovation, Systems and Technologies, vol 312. Springer, Singapore. https://doi.org/10.1007/978-981-19-3575-6_56

Download citation

Publish with us

Policies and ethics