Abstract
While real-world data often comes in a mixed format of discrete and continuous attributes, many supervised induction algorithms require discrete data. Efficient discretization of continuous attributes is an important problem, since it affects the speed, accuracy, and understandability of the induced models. In this paper, we propose a new discretization method, MODL, founded on a Bayesian approach. We introduce a space of discretization models and a prior distribution defined on this model space, which yields a Bayes optimal evaluation criterion for discretizations. We then propose a new super-linear optimization algorithm that finds near-optimal discretizations. Extensive comparative experiments on both real and synthetic data demonstrate the high inductive performance of the new discretization method.
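To give a concrete feel for a criterion-driven search over discretization models, here is a minimal sketch. It is not the paper's algorithm (the paper's optimization is super-linear, while this sketch is a naive greedy bottom-up merge), and the cost function follows the three-level prior commonly associated with MODL-style criteria; the exact prior is defined in the paper, and all function names here are hypothetical.

```python
import math

def log_binom(n, k):
    # Log of the binomial coefficient C(n, k), via log-gamma for numerical stability.
    return math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)

def modl_style_cost(intervals, n_classes):
    # intervals: list of per-interval class-count vectors, e.g. [[3, 1], [0, 4]].
    # Lower cost = better model (prior cost of the discretization + likelihood cost of the data).
    n = sum(sum(iv) for iv in intervals)
    n_intervals = len(intervals)
    cost = math.log(n)                                   # choice of the number of intervals
    cost += log_binom(n + n_intervals - 1, n_intervals - 1)  # choice of the interval bounds
    for iv in intervals:
        ni = sum(iv)
        # choice of the class distribution inside the interval
        cost += log_binom(ni + n_classes - 1, n_classes - 1)
        # likelihood: log of the multinomial coefficient ni! / (ni1! ... niJ!)
        cost += math.lgamma(ni + 1) - sum(math.lgamma(c + 1) for c in iv)
    return cost

def greedy_merge(intervals, n_classes):
    # Bottom-up heuristic: repeatedly merge the adjacent pair that lowers the cost most.
    intervals = [list(iv) for iv in intervals]
    best = modl_style_cost(intervals, n_classes)
    improved = True
    while improved and len(intervals) > 1:
        improved = False
        best_i, best_cost = None, best
        for i in range(len(intervals) - 1):
            merged = (intervals[:i]
                      + [[a + b for a, b in zip(intervals[i], intervals[i + 1])]]
                      + intervals[i + 2:])
            c = modl_style_cost(merged, n_classes)
            if c < best_cost:
                best_i, best_cost = i, c
        if best_i is not None:
            intervals[best_i] = [a + b for a, b in zip(intervals[best_i], intervals[best_i + 1])]
            del intervals[best_i + 1]
            best = best_cost
            improved = True
    return intervals, best
```

On class-pure adjacent intervals such as [[5, 0], [0, 5]] the criterion keeps the split, while uninformative intervals such as [[1, 1], [1, 1], [1, 1], [1, 1]] collapse into a single interval, illustrating the built-in resistance to noise that a Bayes-style prior provides.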
Editor: Tom Fawcett
French patent No. 04 00179.
Cite this article
Boullé, M. MODL: A Bayes optimal discretization method for continuous attributes. Mach Learn 65, 131–165 (2006). https://doi.org/10.1007/s10994-006-8364-x