Abstract
This paper studies splitting criteria for decision trees from three original points of view. First, we propose a unified formalization of association measures based on entropies of type β; this formalization includes popular measures such as the Gini index and Shannon entropy. Second, we generate artificial data from M-of-N concepts whose complexity and class distribution are controlled. Third, our experiments allow us to study the behavior of the measures on datasets of growing complexity. The results show that the differences in performance between measures, which are significant when there is no noise in the data, disappear as the level of noise increases.
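As a minimal sketch of the two ingredients the abstract names, the snippet below assumes the Daróczy entropy of type β (for which β = 2 recovers a scaled Gini index and the limit β → 1 gives Shannon entropy) and a standard M-of-N labeling rule; the function names and the exact normalization are illustrative assumptions, not taken from the paper itself.

```python
import math
import random


def entropy_beta(probs, beta):
    """Daróczy entropy of type beta (assumed form).

    H_beta(P) = (sum(p_i^beta) - 1) / (2^(1 - beta) - 1) for beta != 1;
    the limit beta -> 1 gives Shannon entropy in bits.
    """
    if beta == 1.0:
        return -sum(p * math.log2(p) for p in probs if p > 0)
    return (sum(p ** beta for p in probs) - 1) / (2 ** (1 - beta) - 1)


def m_of_n_label(x, relevant, m):
    """Label an instance 1 iff at least m of the n relevant boolean attributes are true."""
    return int(sum(x[i] for i in relevant) >= m)


# beta = 2 yields twice the Gini index: 2 * (1 - sum(p^2)).
p = [0.5, 0.5]
print(entropy_beta(p, 2))   # 1.0, i.e. 2 * Gini for a balanced binary split
print(entropy_beta(p, 1))   # 1.0 bit, Shannon entropy of a fair coin

# A hypothetical 3-of-5 concept over 10 boolean attributes, as in the
# controlled artificial data the abstract describes.
random.seed(0)
n_attrs, relevant, m = 10, [0, 1, 2, 3, 4], 3
data = [[random.randint(0, 1) for _ in range(n_attrs)] for _ in range(1000)]
labels = [m_of_n_label(x, relevant, m) for x in data]
```

Varying `m`, the number of relevant attributes, and the noise injected into `labels` is what lets complexity and class distribution be controlled in such a setup.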
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
Cite this paper
Rakotomalala, R., Lallich, S., Di Palma, S. (1999). Studying the Behavior of Generalized Entropy in Induction Trees Using a M-of-N Concept. In: Żytkow, J.M., Rauch, J. (eds) Principles of Data Mining and Knowledge Discovery. PKDD 1999. Lecture Notes in Computer Science(), vol 1704. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-48247-5_66