Abstract
This paper presents a novel Quasi-Newton method for minimizing the error function of a feed-forward neural network. The method generalizes Battiti’s well-known OSS algorithm. The proposed approach aims at a significant improvement both in computational effort and in the capability of locating the global minimum of the error function. The technique described in this work is founded on the novel concept of a “convex algorithm”, introduced to avoid entrapment in local minima. Convergence results as well as numerical experiments are presented.
References
Al Baali, M.: Improved Hessian approximations for the limited memory BFGS method. Numer. Algorithms 22, 99–112 (1999)
Battiti, R.: First- and second-order methods for learning: between steepest descent and Newton’s method. Neural Computation 4, 141–166 (1992)
Bianchini, M., Fanelli, S., Gori, M., Protasi, M.: Non-suspiciousness: a generalization of convexity in the frame of foundations of Numerical Analysis and Learning. In: IJCNN 1998, Anchorage, vol. II, pp. 1619–1623 (1998)
Bianchini, M., Fanelli, S., Gori, M.: Optimal algorithms for well-conditioned nonlinear systems of equations. IEEE Transactions on Computers 50, 689–698 (2001)
Bortoletti, A., Di Fiore, C., Fanelli, S., Zellini, P.: A new class of quasi-Newtonian methods for optimal learning in MLP-networks. IEEE Transactions on Neural Networks 14, 263–273 (2003)
Di Fiore, C., Fanelli, S., Zellini, P.: Matrix algebras in quasi-Newtonian algorithms for optimal learning in multi-layer perceptrons. In: ICONIP Workshop and Expo, Dunedin, pp. 27–32 (1999)
Di Fiore, C., Fanelli, S., Zellini, P.: Optimisation strategies for nonconvex functions and applications to neural networks. In: ICONIP 2001, Shanghai, vol. 1, pp. 453–458 (2001)
Di Fiore, C., Fanelli, S., Zellini, P.: Computational experiences of a novel algorithm for optimal learning in MLP-networks. In: ICONIP 2002, Singapore, vol. 1, pp. 317–321 (2002)
Di Fiore, C., Fanelli, S., Lepore, F., Zellini, P.: Matrix algebras in Quasi-Newton methods for unconstrained optimization. Numerische Mathematik 94, 479–500 (2003)
Di Fiore, C., Fanelli, S., Zellini, P.: Convex algorithms for optimal learning in MLP-networks (in preparation)
Duda, R.O., Hart, P.E.: Pattern Classification and Scene Analysis. Wiley, Chichester (1973)
Frasconi, P., Fanelli, S., Gori, M., Protasi, M.: Suspiciousness of loading problems. In: IEEE Int. Conf. on Neural Networks, vol. 2, pp. 1240–1245 (1997)
Liu, D.C., Nocedal, J.: On the limited memory BFGS method for large scale optimization. Math. Programming 45, 503–528 (1989)
Nocedal, J., Wright, S.J.: Numerical Optimization. Springer, New York (1999)
http://www.mathworks.com/access/helpdesk/help/toolbox/nnet/backpr14.html
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
Cite this paper
Di Fiore, C., Fanelli, S., Zellini, P. (2004). An Efficient Generalization of Battiti-Shanno’s Quasi-Newton Algorithm for Learning in MLP-Networks. In: Pal, N.R., Kasabov, N., Mudi, R.K., Pal, S., Parui, S.K. (eds) Neural Information Processing. ICONIP 2004. Lecture Notes in Computer Science, vol 3316. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30499-9_74
DOI: https://doi.org/10.1007/978-3-540-30499-9_74
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23931-4
Online ISBN: 978-3-540-30499-9