Abstract
Stochastic matrices are nonnegative matrices whose entries are discrete probabilities, with all or part of the entries summing to one. They are widely used in techniques such as Markov chains and probabilistic latent semantic analysis. Conventional multiplicative updates, which have been widely used for nonnegative learning, cannot accommodate the stochasticity constraint, and simply normalizing the nonnegative matrix at each step of the learning may adversely affect the convergence of the optimization algorithm. Here we discuss and compare two alternative ways of developing multiplicative update rules for stochastic matrices. The first reparameterizes the matrices before applying the multiplicative update principle; the second employs relaxation with Lagrangian multipliers so that the updates jointly optimize the objective and steer the estimate towards the constraint manifold. We compare the new methods against the conventional normalization approach on two applications: parameter estimation of hidden Markov models and information-theoretic clustering. Empirical studies on both synthetic and real-world datasets demonstrate that algorithms using the new methods perform more stably and efficiently than the conventional ones.
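To make the "conventional normalization approach" concrete, the following is a minimal NumPy sketch (not taken from the paper) of the baseline being critiqued: standard Lee–Seung multiplicative updates for least-squares NMF, with the stochasticity constraint enforced afterwards by renormalizing each column of W. The matrix sizes and iteration count are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy problem: factorize nonnegative V ≈ W H, where the columns of W
# are constrained to be stochastic (nonnegative and summing to one).
V = rng.random((8, 5))
W = rng.random((8, 3))
H = rng.random((3, 5))
W /= W.sum(axis=0, keepdims=True)  # start on the constraint manifold

for _ in range(200):
    # Standard Lee–Seung multiplicative updates for squared Euclidean NMF;
    # the small epsilon guards against division by zero.
    H *= (W.T @ V) / (W.T @ W @ H + 1e-12)
    W *= (V @ H.T) / (W @ H @ H.T + 1e-12)
    # Conventional approach: project back onto the stochasticity
    # constraint by renormalizing each column after the update.
    W /= W.sum(axis=0, keepdims=True)

# Multiplicative updates preserve nonnegativity; normalization restores
# the sum-to-one constraint, but the extra projection step is what may
# interfere with the monotone convergence of the unconstrained updates.
print(W.sum(axis=0))
```

The paper's alternatives (reparameterization and Lagrangian relaxation) replace this update-then-normalize loop with update rules whose fixed points already lie on the constraint manifold.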
© 2013 Springer-Verlag Berlin Heidelberg
Cite this paper
Zhu, Z., Yang, Z., Oja, E. (2013). Multiplicative Updates for Learning with Stochastic Matrices. In: Kämäräinen, JK., Koskela, M. (eds) Image Analysis. SCIA 2013. Lecture Notes in Computer Science, vol 7944. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38886-6_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38885-9
Online ISBN: 978-3-642-38886-6
eBook Packages: Computer Science (R0)