Abstract
In artificial intelligence, the field is moving rapidly towards tackling and solving problems that are intellectually challenging for human beings yet can be described by a list of formal, analytical rules, which makes them almost straightforward for machines. The computer gains experience automatically by solving the same problem repeatedly and by learning the relationships between concepts. Many architectures have been proposed to make such systems more accurate and efficient; they extract and classify multiple distinctive features from the source data across several stages. Convolutional neural network (CNN) architectures reduce a complex problem by breaking it into simpler concepts, which are then fed into the hidden layers of the network. This survey further concentrates on loss functions, structural reformulation, optimization, weight sharing, parameter regularization and generalization. In this way, the computer learns increasingly complex concepts on its own and performs more accurately and efficiently.
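To make these ideas concrete, the sketch below shows a minimal convolutional network written in Keras. It is an illustrative assumption of ours rather than code from the chapter: stacked convolution and pooling stages extract features with shared weights, a dropout layer regularizes the parameters, and a cross-entropy loss function drives the optimization.

```python
# Minimal CNN sketch (assumed Keras API; the chapter surveys architectures such as
# LeNet, AlexNet, VGG, GoogLeNet and ResNet rather than prescribing this code).
import tensorflow as tf
from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Input(shape=(28, 28, 1)),          # e.g. grayscale digit images
    layers.Conv2D(32, 3, activation="relu"),  # weight sharing via convolution
    layers.MaxPooling2D(2),                   # stage-wise feature reduction
    layers.Conv2D(64, 3, activation="relu"),
    layers.MaxPooling2D(2),
    layers.Flatten(),
    layers.Dense(128, activation="relu"),     # hidden (fully connected) layer
    layers.Dropout(0.5),                      # parameter regularization
    layers.Dense(10, activation="softmax"),   # class scores
])

# A cross-entropy loss and an optimizer close the loop between prediction and learning.
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.summary()
```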
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Bhargavi, G., Vaijayanthi, S., Arunnehru, J., Reddy, P.R.D. (2021). A Survey on Recent Deep Learning Architectures. In: Manoharan, K.G., Nehru, J.A., Balasubramanian, S. (eds) Artificial Intelligence and IoT. Studies in Big Data, vol 85. Springer, Singapore. https://doi.org/10.1007/978-981-33-6400-4_5
DOI: https://doi.org/10.1007/978-981-33-6400-4_5
Publisher Name: Springer, Singapore
Print ISBN: 978-981-33-6399-1
Online ISBN: 978-981-33-6400-4
eBook Packages: Intelligent Technologies and Robotics (R0)