Abstract
The world is undergoing a deep and vast digital transformation and every day millions of images are produced and shared. There is an urgent need to extract valuable information from these images and use it for various applications. Object detection is at the forefront of it and there are many methods/algorithms which can be used for it. As a subfield of computer vision, object detection has been going through fast-paced changes. It is regarded as an exceptionally complex topic in the field of computer vision as it is the amalgamation of object classification as well as object localization. In this paper, we have taken an in-depth look at some of the widely varied state-of-the-art methods of object detection such as RetinaNet, ResNet, and ConvNet. These networks can be differentiated based on a variety of different features such as loss functions, data augmentation and feature extraction. We have compared them with their baseline models and analysed them to identify the most accurate methods. In our literature survey, we have studied the best in class techniques and have presented a brief overview of the current situation of object detection.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bochkovskiy, A., Wang, C.-Y., Liao, H.-Y.M.: YOLOv4: Optimal Speed and Accuracy of Object Detection (2020). arXiv:2004.10934v1. http://arxiv.org/abs/2004.10934
Chen, C., Zheng, Z., Huang, Y., Ding, X., Yu, Y.: I3Net: implicit instance-invariant network for adapting one-stage object detectors. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021). https://doi.org/10.1109/CVPR46437.2021.01239
Chen, L., Yang, T., Zhang, X., Zhang, W., Sun, J.: Points as queries: weakly semi-supervised object detection by points. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021).https://doi.org/10.1109/CVPR46437.2021.00871
Chen, X., Xie, C., Tan, M., Zhang, L., Hsieh, C.-J., Gong, B.: Robust and accurate object detection via adversarial learning. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021).https://doi.org/10.1109/CVPR46437.2021.01635
Chen, X., Li, H., Wu, Q., Ngan, K.N., Xu, L.: High-quality R-CNN object detection using multi-path detection calibration network. IEEE Trans. Circuits Syst. Video Technol. 31(2), 715–727 (2021). https://doi.org/10.1109/TCSVT.2020.2987465
Dong, Z., Wang, M., Wang, Y., Zhu, Y., Zhang, Z.: Object detection in high resolution remote sensing imagery based on convolutional neural networks with suitable object scale features. IEEE Trans. Geosci. Remote Sens. 58(3), 2104–2114 (2020). https://doi.org/10.1109/TGRS.2019.2953119
Fang, F., Li, L., Zhu, H., Lim, J.H.: Combining faster R-CNN and model-driven clustering for elongated object detection. IEEE Trans. Image Process. 29(1), 2052–2065 (2020). https://doi.org/10.1109/TIP.2019.2947792
Farabet, C., Couprie, C., Najman, L., LeCun, Y.: Learning hierarchical features for scene labeling. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021). https://doi.org/10.1109/CVPR46437.2021.01559
He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1904–1916 (2015). https://doi.org/10.1109/TPAMI.2015.2389824
Hou, Q., Cheng, M.M., Hu, X., Borji, A., Tu, Z., Torr, P.H.S.: Deeply supervised salient object detection with short connections. IEEE Trans. Pattern Anal. Mach. Intell. 41(4), 815–828 (2019). https://doi.org/10.1109/TPAMI.2018.2815688
Ibrahem, H., Salem, A.D.A., Kang, H.S.: Real-time weakly supervised object detection using center-of-features localization. IEEE Access 9, 38742–38756 (2021). https://doi.org/10.1109/ACCESS.2021.3064372
Li, X., Wang, W., Hu, X., Li, J., Tang, J., Yang, J.: Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection (2020). https://arxiv.org/abs/2011.12885
Li, Y., Zhu, H., Cheng, Y., Wang, W., Teo, C.S., Xiang, C., et al.: Few-shot object detection via classification refinement and distractor retreatment. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021). https://doi.org/10.1109/CVPR46437.2021.01514
Lin, T.-Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: 2017 IEEE International Conference on Computer Vision (ICCV) (2017). https://doi.org/10.1109/ICCV.2017.324
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016). https://doi.org/10.1109/CVPR.2016.91
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: NIPS’15: Proceedings of the 28th International Conference on Neural Information Processing Systems, vol. 1, pp. 91–99 (2015). https://doi.org/10.5555/2969239.2969250
Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., LeCun, Y.: OverFeat: integrated recognition, localization and detection using convolutional networks. In: 2nd International Conference on Learning Representations, ICLR 2014. http://arxiv.org/abs/1312.6229
Shen, Y., Ji, R., Wang, C., Li, X., Li, X.: Weakly supervised object detection via object-specific pixel gradient. IEEE Trans. Neural Netw. Learn. Syst. 29(12), 5960–5970 (2018). https://doi.org/10.1109/TNNLS.2018.2816021
Sun, B., Li, B., Cai, S., Yuan, Y., Zhang, C.: FSCE: few-shot object detection via contrastive proposal encoding. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021). https://doi.org/10.1109/CVPR46437.2021.00727
Wang, J., Song, L., Li, Z., Sun, H., Sun, J., Zheng, N.: End-to-end object detection with fully convolutional network. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021). https://doi.org/10.1109/CVPR46437.2021.01559
Yao, C., Kong, Y., Feng, L., Jin, B., Si, H.: Contour-aware recurrent cross constraint network for salient object detection. IEEE Access 8, 218739–218751 (2020). https://doi.org/10.1109/ACCESS.2020.3042203
Zhang, H., Wang, Y., Dayoub, F., Sünderhauf, N.: VarifocalNet: an IoU-aware dense object detector. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021). https://doi.org/10.1109/CVPR46437.2021.00841
Zhang, L., Zhou, S., Guan, J., Zhang, J.: Accurate few-shot object detection with support-query mutual guidance and hybrid loss. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021). https://doi.org/10.1109/CVPR46437.2021.01419
Zhou, Y., Mao, A., Huo, S., Lei, J., Kung, S.Y.: Salient object detection via fuzzy theory and object-level enhancement. IEEE Trans. Multimedia 21(1), 74–85 (2019). https://doi.org/10.1109/TMM.2018.2845667
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Tinwala, W., Rauniyar, S., Agrawal, S. (2023). Survey on Convolutional Neural Networks-Based Object Detection Methods. In: Choudrie, J., Mahalle, P., Perumal, T., Joshi, A. (eds) IOT with Smart Systems. Smart Innovation, Systems and Technologies, vol 312. Springer, Singapore. https://doi.org/10.1007/978-981-19-3575-6_56
Download citation
DOI: https://doi.org/10.1007/978-981-19-3575-6_56
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-3574-9
Online ISBN: 978-981-19-3575-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)