Abstract
Several non-iterative strategies have recently been proposed for neural network training in which the weights from the input to the hidden layer are set at random, while the weights from the hidden to the output layer are determined analytically via the Moore-Penrose generalised inverse; such strategies are appealing because they allow very fast learning. The aim of this study is to investigate the variability in performance when random projections are used to set the input weights, comparing them with the state-of-the-art setting, i.e. weights drawn from a continuous uniform distribution. We compare the solutions obtained by the different methods on several UCI datasets, for both regression and classification tasks; the random-projection approach yields a significant performance improvement over the conventional method.
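As a concrete illustration of the training scheme described above, the following is a minimal NumPy sketch: the input weights are fixed at random, either uniformly (the conventional setting) or via a Gaussian or sparse random projection in the Johnson-Lindenstrauss/Achlioptas style, and the output weights are then obtained in closed form with the Moore-Penrose pseudoinverse. The function names (`input_weights`, `elm_train`) and the specific scaling constants are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def input_weights(n_in, n_hidden, scheme="uniform", rng=None):
    """Draw the fixed input-to-hidden weights.

    'uniform'  : conventional setting, entries drawn from U(-1, 1).
    'gaussian' : random projection with N(0, 1/n_hidden) entries.
    'sparse'   : Achlioptas-style projection with entries +sqrt(3), 0,
                 -sqrt(3) drawn with probabilities 1/6, 2/3, 1/6,
                 scaled by 1/sqrt(n_hidden).
    """
    if rng is None:
        rng = np.random.default_rng(0)
    if scheme == "uniform":
        return rng.uniform(-1.0, 1.0, (n_in, n_hidden))
    if scheme == "gaussian":
        return rng.normal(0.0, 1.0 / np.sqrt(n_hidden), (n_in, n_hidden))
    if scheme == "sparse":
        vals = rng.choice([np.sqrt(3.0), 0.0, -np.sqrt(3.0)],
                          size=(n_in, n_hidden), p=[1/6, 2/3, 1/6])
        return vals / np.sqrt(n_hidden)
    raise ValueError(f"unknown scheme: {scheme}")

def elm_train(X, T, n_hidden, scheme="uniform", rng=None):
    """Non-iterative training: fix random input weights W, then solve the
    hidden-to-output weights analytically as B = pinv(H) @ T."""
    W = input_weights(X.shape[1], n_hidden, scheme, rng)
    H = 1.0 / (1.0 + np.exp(-(X @ W)))   # sigmoid hidden-layer activations
    B = np.linalg.pinv(H) @ T            # least-squares output weights
    return W, B

def elm_predict(X, W, B):
    H = 1.0 / (1.0 + np.exp(-(X @ W)))
    return H @ B
```

For regression, `T` holds the target values directly; for classification, one would use one-hot target rows and take the argmax of the network output.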
Cite this paper
Rubini, L., Cancelliere, R., Gallinari, P., Grosso, A., Raiti, A. (2014). Computational Experience with Pseudoinversion-Based Training of Neural Networks Using Random Projection Matrices. In: Agre, G., Hitzler, P., Krisnadhi, A.A., Kuznetsov, S.O. (eds) Artificial Intelligence: Methodology, Systems, and Applications. AIMSA 2014. Lecture Notes in Computer Science(), vol 8722. Springer, Cham. https://doi.org/10.1007/978-3-319-10554-3_24