Object Detection with Convolutional Neural Networks

Patel, Sanskruti; Patel, Atul

doi:10.1007/978-981-15-7106-0_52

Part of the book series: Lecture Notes in Networks and Systems (LNNS, volume 141)

Abstract

In recent years, computer vision research has grown noticeably. In computer vision, object detection is the task of classifying and localizing the objects in an image. Widely used applications of object detection include human–computer interaction, video surveillance, satellite imagery, transport systems, and activity recognition. Within the wider family of deep learning architectures, the convolutional neural network (CNN), made up of a set of neural network layers, is used for visual imagery. Deep CNN architectures exhibit impressive results for the detection of objects in digital images. This paper presents a comprehensive review of recent developments in object detection using convolutional neural networks. It explains the types of object detection models, the benchmark datasets available, and research work carried out on applying object detection models to various applications.

Author information

Authors and Affiliations

Faculty of Computer Science and Applications, Charotar University of Science and Technology, Changa, India
Sanskruti Patel & Atul Patel

Corresponding author

Correspondence to Sanskruti Patel.

Editor information

Editors and Affiliations

Global Knowledge Research Foundation, Gujarat, India
Amit Joshi
University of Osaka, Suita, Japan
Mahdi Khosravy
School of Engineering and Computer Science, Oakland University, Rochester Hills, MI, USA
Neeraj Gupta

Cite this paper

Patel, S., Patel, A. (2021). Object Detection with Convolutional Neural Networks. In: Joshi, A., Khosravy, M., Gupta, N. (eds) Machine Learning for Predictive Analysis. Lecture Notes in Networks and Systems, vol 141. Springer, Singapore. https://doi.org/10.1007/978-981-15-7106-0_52

DOI: https://doi.org/10.1007/978-981-15-7106-0_52
Published: 23 October 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-7105-3
Online ISBN: 978-981-15-7106-0
eBook Packages: Intelligent Technologies and Robotics (R0)
