Abstract
The learning of probabilistic classification models can be approached from either a generative or a discriminative point of view. Generative methods attempt to maximize the unconditional log-likelihood, while the aim of discriminative methods is to maximize the conditional log-likelihood. In the case of Bayesian network classifiers, the parameters of the model are usually learned by generative methods rather than discriminative ones. However, some numerical approaches to the discriminative learning of Bayesian network classifiers have recently appeared. This paper presents a new statistical approach to the discriminative learning of these classifiers by means of an adaptation of the TM algorithm [1]. In addition, we test the TM algorithm with different Bayesian classification models, providing empirical evidence of the performance of this method.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
- Bayesian Network
- Transjugular Intrahepatic Portosystemic Shunt
- Exponential Family
- Basque Country
- Linear Search
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Edwards, D., Lauritzen, S.L.: The TM algorithm for maximising a conditional likelihood function. Biometrika 88, 961–972 (2001)
Dawid, A.P.: Properties of diagnostic data distributions. Biometrics 32, 647–658 (1976)
Rubinstein, Y.D., Hastie, T.: Discriminative vs. informative learning. In: Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining, pp. 49–53 (1997)
Ng, A.Y., Jordan, M.I.: On discriminative vs. generative classifiers: A comparison of logistic regression and naïve Bayes. In: Proceedings of the Sixteenth Advances in Neural Information Processing Systems, vol. 14 (2002)
Jebara, T.: Machine Learning: Discriminative and Generative. Kluwer Academic Publishers, Dordrecht (2003)
Fisher, R.A.: The use of multiple measurement. Annals of Eugenics 7, 179–188 (1936)
Duda, R., Hart, P.: Pattern Classification and Scene Analysis. John Wiley and Sons, Chichester (1973)
Hosmer, D., Lemeshow, S.: Applied Logistic Regression. John Wiley and Sons, Chichester (1989)
Bishop, C.: Neural Networks for Pattern Recognition. Oxford Press, Oxford (1996)
Pearl, J.: Probabilistic Reasoning in Intelligent Systems. Morgan Kaufmann, San Francisco (1988)
Neapolitan, R.E.: Learning Bayesian Networks. Prentice-Hall, Englewood Cliffs (2003)
Greiner, R., Zhou, W., Su, X., Shen, B.: Structural extension to logistic regression: Discriminative parameter learning of belief net classifiers. Machine Learning (2004) (accepted for publication)
Roos, T., Wettig, H., Grünwald, P., Myllymäki, P., Tirri, H.: On discriminative Bayesian network classifiers and logistic regression. Machine Learning (2004) (accepted for publication)
Friedman, N., Geiger, D., Goldszmidt, M.: Bayesian network classifiers. Machine Learning 29, 131–164 (1997)
Sundberg, R.: The convergence rate of the TM algorithm of Edwards and Lauritzen. Biometrika 89, 478–483 (2002)
Santafé, G., Lozano, J.A., Larran̈aga, P.: El algoritmo TM para clasificadores Bayesianos (in Spanish). Technical Report EHU-KZAA-IK-2/04, University of the Basque Country (2004)
Blake, C., Merz, C.: UCI repository of machine learning databases (1998), http://www.ics.uci.edu/~mlearn
Kohavi, R., John, G.H.: Wrappers for feature subset selection. Artificial Intelligence 97, 273–324 (1997)
Inza, I., Merino, M., Larrañaga, P., Quiroga, J., Sierra, B., Girala, M.: Feature subset selection by genetic algorithms and estimation of distribution algorithms. A case study in the survival of cirrhotic patients treated with TIPS. Artificial Intelligence in Medicine 23, 187–205 (2001)
Dougherty, J.R., Kohavi, R., Sahami, M.: Supervised and unsupervised discretization of continuous features. In: Proceedings of the Twelfth International Conference on Machine Learning, pp. 194–202 (1995)
Fayyad, U., Irani, K.: Multi-interval discretization of continuous-valued attributes for classification learning. In: Proceedings of the Thirteenth International Joint Conference on Artificial Intelligence, pp. 1022–1027 (1993)
Mann, H., Whitney, D.: On a test of whether one of two random variables is stochastically larger than the other. Annals of Mathematical Statistics 18, 50–60 (1947)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Santafé, G., Lozano, J.A., Larrañaga, P. (2005). Discriminative Learning of Bayesian Network Classifiers via the TM Algorithm. In: Godo, L. (eds) Symbolic and Quantitative Approaches to Reasoning with Uncertainty. ECSQARU 2005. Lecture Notes in Computer Science(), vol 3571. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11518655_14
Download citation
DOI: https://doi.org/10.1007/11518655_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27326-4
Online ISBN: 978-3-540-31888-0
eBook Packages: Computer ScienceComputer Science (R0)