Abstract
This paper presents a study on accelerating the Inertial Newton Algorithm (INNA) for neural network training. INNA, which was recently proposed and applied to neural network training, is an optimization method derived from a dynamical system: it combines the ideas of Newton's method and inertial (heavy-ball) methods and expresses them as a system of ordinary differential equations. This paper proposes a new training algorithm, Nesterov's Accelerated Dynamical InertiAl Newton method (NADIAN), which accelerates INNA by introducing Nesterov's accelerated gradient. Finally, the proposed method is applied to neural network training and its effectiveness is verified experimentally.
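For context, INNA (Castera et al., 2021) is obtained by explicitly discretizing the following first-order reformulation of a dynamical inertial Newton system, in which $f$ is the training loss, $\theta$ the network parameters, $\psi$ an auxiliary variable, and $\alpha > 0$, $\beta > 0$ damping hyperparameters:

$$\begin{cases} \dot{\theta}(t) + \beta\,\nabla f(\theta(t)) + \left(\alpha - \frac{1}{\beta}\right)\theta(t) + \frac{1}{\beta}\,\psi(t) = 0,\\[4pt] \dot{\psi}(t) + \left(\alpha - \frac{1}{\beta}\right)\theta(t) + \frac{1}{\beta}\,\psi(t) = 0. \end{cases}$$

An explicit Euler step of size $\gamma_k$ (with $\nabla f(\theta_k)$ replaced by a minibatch estimate in practice) gives INNA's update:

$$\begin{aligned} \theta_{k+1} &= \theta_k + \gamma_k\left[\left(\frac{1}{\beta} - \alpha\right)\theta_k - \frac{1}{\beta}\,\psi_k - \beta\,\nabla f(\theta_k)\right],\\ \psi_{k+1} &= \psi_k + \gamma_k\left[\left(\frac{1}{\beta} - \alpha\right)\theta_k - \frac{1}{\beta}\,\psi_k\right]. \end{aligned}$$

One plausible way to introduce Nesterov's accelerated gradient into this scheme, offered here only as a sketch consistent with the abstract (the momentum coefficient $\mu \in [0, 1)$ and the look-ahead point are assumptions, not necessarily the paper's exact NADIAN update), is to evaluate the gradient at an extrapolated point $\theta_k + \mu(\theta_k - \theta_{k-1})$ rather than at $\theta_k$.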
Acknowledgements
This work was supported by the Japan Society for the Promotion of Science (JSPS), KAKENHI Grants 20K11979 and 23K11267.
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Mahboubi, S., Yamatomi, R., Samejima, Y., Ninomiya, H. (2024). A Study on Accelerating of Inertial Newton Algorithm for Neural Network Training. In: Nagar, A.K., Jat, D.S., Mishra, D., Joshi, A. (eds) Intelligent Sustainable Systems. WorldS4 2023. Lecture Notes in Networks and Systems, vol 803. Springer, Singapore. https://doi.org/10.1007/978-981-99-7569-3_16
DOI: https://doi.org/10.1007/978-981-99-7569-3_16
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-7568-6
Online ISBN: 978-981-99-7569-3
eBook Packages: Intelligent Technologies and Robotics (R0)