A Scalable Learning Algorithm for Kernel Probabilistic Classifier

Serrurier, Mathieu; Prade, Henri

doi:10.1007/978-3-642-40381-1_23

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 8078)

Included in the following conference series:

International Conference on Scalable Uncertainty Management

Abstract

In this paper, we propose a probabilistic classification algorithm that learns a set of kernel functions associating a probability distribution over classes with an input vector. The model is obtained by maximizing a measure over the probability distributions through a local optimization process. This measure focuses on the faithfulness of the whole induced probability distribution rather than considering the probabilities of the classes separately. We show that, thanks to a pre-processing computation, the complexity of evaluating this measure for a given model no longer depends on the size of the training set. This makes the local optimization of the whole set of kernel functions tractable, even for large databases. We evaluate our method on five benchmark datasets and the KDD Cup 2012 dataset.
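The pre-processing idea in the abstract can be illustrated with a small sketch. The abstract does not define the measure itself, so the code below swaps in a plain squared-error score as a stand-in. It also assumes Gaussian kernels with fixed centers and a linear, unnormalized output; all function and variable names are hypothetical. It shows only the general principle: once a few summary matrices have been computed in one pass over the data, scoring a candidate model costs the same regardless of the number of training examples.

```python
import numpy as np

def gaussian_features(X, centers, width):
    # Phi[i, j] = exp(-||x_i - c_j||^2 / (2 * width^2))
    sq_dist = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-sq_dist / (2.0 * width ** 2))

def precompute(Phi, Y):
    # Single pass over the n training rows; the results have sizes
    # that depend only on m (kernels) and C (classes).
    G = Phi.T @ Phi              # (m, m)
    B = Phi.T @ Y                # (m, C)
    const = float((Y ** 2).sum())
    return G, B, const

def loss_from_stats(W, G, B, const):
    # Equals sum_i ||Phi_i W - Y_i||^2, evaluated in O(C * m^2)
    # without touching the training set again.
    return float(np.sum(W * (G @ W)) - 2.0 * np.sum(W * B) + const)

def loss_direct(W, Phi, Y):
    # Reference evaluation in O(n * m * C), used only as a check.
    return float(((Phi @ W - Y) ** 2).sum())

# Toy data (assumed for illustration): n points, d features, C classes.
rng = np.random.default_rng(0)
n, d, m, C = 500, 3, 4, 3
X = rng.normal(size=(n, d))
Y = np.eye(C)[rng.integers(0, C, size=n)]   # one-hot class targets
centers = X[:m]                              # fixed kernel centers
Phi = gaussian_features(X, centers, width=1.0)

G, B, const = precompute(Phi, Y)
W = rng.normal(size=(m, C))                  # a candidate model
fast = loss_from_stats(W, G, B, const)
slow = loss_direct(W, Phi, Y)
```

A local optimizer would call `loss_from_stats` repeatedly on candidate weights `W`, and each call is independent of `n`. With the squared-error stand-in, this works only while the kernel centers and widths stay fixed. The paper's own measure and its pre-processing step may differ.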

Author information

Authors and Affiliations

IRIT, 118 route de Narbonne, 31062, Toulouse Cedex 9, France
Mathieu Serrurier & Henri Prade

Editor information

Editors and Affiliations

School of Electronics, Electrical Engineering and Computer Science, Queen’s University Belfast, BT9 5BN, Belfast, UK
Weiru Liu
Department of Computer Science, University of Maryland, 20742, College Park, MD, USA
V. S. Subrahmanian
Département d’Informatique, Université de Mons, 7000, Mons, Belgium
Jef Wijsen

About this paper

Cite this paper

Serrurier, M., Prade, H. (2013). A Scalable Learning Algorithm for Kernel Probabilistic Classifier. In: Liu, W., Subrahmanian, V.S., Wijsen, J. (eds) Scalable Uncertainty Management. SUM 2013. Lecture Notes in Computer Science, vol 8078. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40381-1_23

DOI: https://doi.org/10.1007/978-3-642-40381-1_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40380-4
Online ISBN: 978-3-642-40381-1
eBook Packages: Computer Science (R0)
