Name: Neural Networks and Deep Learning
ISBN: 978-3-031-29642-0

Neural Networks and Deep Learning

book PDF

book EPUB

Overview

Authors:

Charu C. Aggarwal ⁰

Charu C. Aggarwal
1. IBM T. J. Watson Research Center, International Business Machines, Yorktown Heights, USA
View author publications

You can also search for this author in PubMed Google Scholar

Simple and intuitive discussions of neural networks and deep learning
Provides mathematical details without losing the reader in complexity
Includes exercises and examples
Discusses both traditional neural networks and recent deep learning models
Request solutions manual: sn.pub/lecturer-material

75k Accesses
36 Citations
19 Altmetric

Buy print copy

Softcover Book USD 49.99

Price excludes VAT (USA)

Hardcover Book USD 69.99

Price excludes VAT (USA)

Tax calculation will be finalised at checkout

About this book

This book covers both classical and modern models in deep learning. The primary focus is on the theory and algorithms of deep learning. The theory and algorithms of neural networks are particularly important for understanding important concepts, so that one can understand the important design concepts of neural architectures in different applications. Why do neural networks work? When do they work better than off-the-shelf machine-learning models? When is depth useful? Why is training neural networks so hard? What are the pitfalls? The book is also rich in discussing different applications in order to give the practitioner a flavor of how neural architectures are designed for different types of problems. Deep learning methods for various data domains, such as text, images, and graphs are presented in detail. The chapters of this book span three categories:

The basics of neural networks: The backpropagation algorithm is discussed in Chapter 2.

Many traditional machine learning models can be understood as special cases of neural networks. Chapter 3 explores the connections between traditional machine learning and neural networks. Support vector machines, linear/logistic regression, singular value decomposition, matrix factorization, and recommender systems are shown to be special cases of neural networks.

Fundamentals of neural networks: A detailed discussion of training and regularization is provided in Chapters 4 and 5. Chapters 6 and 7 present radial-basis function (RBF) networks and restricted Boltzmann machines.

Advanced topics in neural networks: Chapters 8, 9, and 10 discuss recurrent neural networks, convolutional neural networks, and graph neural networks. Several advanced topics like deep reinforcement learning, attention mechanisms, transformer networks, Kohonen self-organizing maps, and generative adversarial networks are introduced in Chapters 11 and 12.

The textbook is written for graduate students and upper under graduate level students. Researchers and practitioners working within this related field will want to purchase this as well.

Where possible, an application-centric view is highlighted in order to provide an understanding of the practical uses of each class of techniques.

The second edition is substantially reorganized and expanded with separate chapters on backpropagation and graph neural networks. Many chapters have been significantly revised over the first edition.

Greater focus is placed on modern deep learning ideas such as attention mechanisms, transformers, and pre-trained language models.

Keywords

Table of contents (13 chapters)

Front Matter

Pages i-xxiv

Download chapter PDF
An Introduction to Neural Networks
- Charu Aggarwal
Pages 1-27
Download chapter PDF
The Backpropagation Algorithm
- Charu Aggarwal
Pages 29-71
Download chapter PDF
Machine Learning with Shallow Neural Networks
- Charu Aggarwal
Pages 73-117
Download chapter PDF
Deep Learning: Principles and Training Algorithms
- Charu Aggarwal
Pages 119-163
Download chapter PDF
Teaching Deep Learners to Generalize
- Charu Aggarwal
Pages 165-213
Download chapter PDF
Radial Basis Function Networks
- Charu Aggarwal
Pages 215-230
Download chapter PDF
Restricted Boltzmann Machines
- Charu Aggarwal
Pages 231-264
Download chapter PDF
Recurrent Neural Networks
- Charu Aggarwal
Pages 265-304
Download chapter PDF
Convolutional Neural Networks
- Charu Aggarwal
Pages 305-360
Download chapter PDF
Graph Neural Networks
- Charu Aggarwal
Pages 361-387
Download chapter PDF
Deep Reinforcement Learning
- Charu Aggarwal
Pages 389-433
Download chapter PDF
Advanced Topics in Deep Learning
- Charu Aggarwal
Pages 435-485
Download chapter PDF
Correction to: Neural Networks and Deep Learning
- Charu Aggarwal
Pages C1-C1
Download chapter PDF
Back Matter

Pages 487-529

Download chapter PDF

Authors and Affiliations

IBM T. J. Watson Research Center, International Business Machines, Yorktown Heights, USA

Charu C. Aggarwal

About the author

Charu C. Aggarwal is a Distinguished Research Staff Member(DRSM) at the IBM T. J. Watson Research Center in Yorktown Heights, New York. He completed his undergraduate degree in Computer Science from the Indian Institute of Technology at Kanpur in 1993 and his Ph.D. from the Massachusetts Institute of Technology in 1996. He has worked extensively in the field of data mining. He has published more than 400 papers in refereed conferences and journals and authored over 80 patents. He is the author or editor of 20 books, including textbooks on data mining, recommender systems, and outlier analysis. Because of the commercial value of his patents, he has thrice been designated a Master Inventor at IBM. He is a recipient of an IBM Corporate Award (2003) for his work on bio-terrorist threat detection in data streams, a recipient of the IBM Outstanding Innovation Award (2008) for his scientific contributions to privacy technology, and a recipient of two IBM Outstanding Technical AchievementAwards (2009, 2015) for his work on data streams/high-dimensional data. He received the EDBT 2014 Test of Time Award for his work on condensation-based privacy-preserving data mining. He is a recipient of the IEEE ICDM Research Contributions Award (2015) and ACM SIGKDD Innovation Award, which are the two most prestigious awards for influential research contributions in the field of data mining. He is also a recipient of the W. Wallace McDowell Award, which is the highest award given solely by the IEEE Computer Society across the field of Computer Science.

He has served as the general co-chair of the IEEE Big Data Conference (2014) and as the program co-chair of the ACM CIKM Conference (2015), the IEEE ICDM Conference (2015), and the ACM KDD Conference (2016). He served as an associate editor of the IEEE Transactions on Knowledge and Data Engineering from 2004 to 2008. He is an associate editor of the IEEE Transactions on Big Data, an action editor of the DataMining and Knowledge Discovery Journal, and an associate editor of the Knowledge and Information System Journal. He has served or currently serves as the editor-in-chief of the ACM Transactions on Knowledge Discovery from Data as well as the ACM SIGKDD Explorations. He is also an editor-in-chief of ACM Books. He serves on the advisory board of the Lecture Notes on Social Networks, a publication by Springer. He has served as the vice-president of the SIAM Activity Group on Data Mining and is a member of the SIAM industry committee. He is a fellow of the SIAM, ACM, and the IEEE, for “contributions to knowledge discovery and data mining algorithms.

Bibliographic Information

Book Title: Neural Networks and Deep Learning
Book Subtitle: A Textbook
Authors: Charu C. Aggarwal
DOI: https://doi.org/10.1007/978-3-031-29642-0
Publisher: Springer Cham
eBook Packages: Mathematics and Statistics, Mathematics and Statistics (R0)
Copyright Information: Springer Nature Switzerland AG 2023
Hardcover ISBN: 978-3-031-29641-3Published: 30 June 2023
Softcover ISBN: 978-3-031-29644-4Published: 01 July 2024
eBook ISBN: 978-3-031-29642-0Published: 29 June 2023
Edition Number: 2
Number of Pages: XXIV, 529
Number of Illustrations: 128 b/w illustrations, 22 illustrations in colour
Topics: Machine Learning, Data Mining and Knowledge Discovery, Artificial Intelligence, Knowledge based Systems, Natural Language Processing (NLP)

Publish with us

Policies and ethics

Neural Networks and Deep Learning

Overview

Buy print copy

About this book

Keywords

Table of contents (13 chapters)

Front Matter

Back Matter

Authors and Affiliations

IBM T. J. Watson Research Center, International Business Machines, Yorktown Heights, USA

About the author

Bibliographic Information

Publish with us

Search

Navigation