A Comparison of Six Approaches to Discretization—A Rough Set Perspective

Blajdo, Piotr; Grzymala-Busse, Jerzy W.; Hippe, Zdzislaw S.; Knap, Maksymilian; Mroczek, Teresa; Piatek, Lukasz

doi:10.1007/978-3-540-79721-0_10

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 5009)

Included in the following conference series:

International Conference on Rough Sets and Knowledge Technology

Abstract

We present results of extensive experiments performed on nine data sets with numerical attributes using six promising discretization methods. For every method and every data set, 30 experiments of ten-fold cross validation were conducted, and then means and sample standard deviations were computed. Our results show that for a specific data set it is essential to choose an appropriate discretization method, since the performance of discretization methods differs significantly. However, in general, among all of these discretization methods there is no statistically significant worst or best method. Thus, in practice, the best discretization method should be selected individually for a given data set.
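
The evaluation protocol described in the abstract (30 repetitions of ten-fold cross validation, summarized by a mean and a sample standard deviation) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the equal-width discretization, the majority-class-per-interval classifier standing in for a rule learner, the synthetic one-attribute data set, and all function names.

```python
import random
import statistics
from collections import Counter, defaultdict


def equal_width_bin(value, low, high, k):
    """Map a numeric value to one of k equal-width intervals over [low, high]."""
    if high == low:
        return 0
    idx = int((value - low) / (high - low) * k)
    return min(max(idx, 0), k - 1)  # clamp values outside the training range


def error_rate(train, test, k_bins):
    """Learn interval bounds on train only, then classify each test case by the
    majority class of its interval (a simple stand-in for a rule learner)."""
    values = [x for x, _ in train]
    low, high = min(values), max(values)
    by_bin = defaultdict(Counter)
    for x, y in train:
        by_bin[equal_width_bin(x, low, high, k_bins)][y] += 1
    default = Counter(y for _, y in train).most_common(1)[0][0]
    errors = 0
    for x, y in test:
        counts = by_bin.get(equal_width_bin(x, low, high, k_bins))
        predicted = counts.most_common(1)[0][0] if counts else default
        errors += predicted != y
    return errors / len(test)


def repeated_cv(data, k_bins, repeats=30, folds=10, seed=0):
    """Return (mean, sample standard deviation) of the error rate
    over `repeats` independent runs of `folds`-fold cross validation."""
    rng = random.Random(seed)
    rates = []
    for _ in range(repeats):
        shuffled = data[:]
        rng.shuffle(shuffled)
        total_errors = 0.0
        for f in range(folds):
            test = shuffled[f::folds]
            train = [c for i, c in enumerate(shuffled) if i % folds != f]
            total_errors += error_rate(train, test, k_bins) * len(test)
        rates.append(total_errors / len(data))
    return statistics.mean(rates), statistics.stdev(rates)


# Synthetic two-class data with one numerical attribute (hypothetical).
_rng = random.Random(1)
data = [(_rng.gauss(0, 1), "a") for _ in range(100)]
data += [(_rng.gauss(2, 1), "b") for _ in range(100)]

mean_err, sd_err = repeated_cv(data, k_bins=4)
print(f"error rate: mean={mean_err:.3f}, sample sd={sd_err:.3f}")
```

Comparing methods on one data set would then amount to calling `repeated_cv` once per discretization method and comparing the resulting means in light of their standard deviations; `statistics.stdev` is used because it computes the sample (n - 1) form.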

Author information

Authors and Affiliations

Department of Expert Systems and Artificial Intelligence, University of Information Technology and Management, 35-225, Rzeszow, Poland
Piotr Blajdo, Zdzislaw S. Hippe & Maksymilian Knap
Department of Electrical Engineering and Computer Science, University of Kansas, Lawrence, KS 66045, USA and Institute of Computer Science, Polish Academy of Sciences, 01-237, Warsaw, Poland
Jerzy W. Grzymala-Busse
Department of Distributed Systems, University of Information Technology and Management, 35-225, Rzeszow, Poland
Teresa Mroczek & Lukasz Piatek

Editor information

Guoyin Wang, Tianrui Li, Jerzy W. Grzymala-Busse, Duoqian Miao, Andrzej Skowron, Yiyu Yao

Cite this paper

Blajdo, P., Grzymala-Busse, J.W., Hippe, Z.S., Knap, M., Mroczek, T., Piatek, L. (2008). A Comparison of Six Approaches to Discretization—A Rough Set Perspective. In: Wang, G., Li, T., Grzymala-Busse, J.W., Miao, D., Skowron, A., Yao, Y. (eds) Rough Sets and Knowledge Technology. RSKT 2008. Lecture Notes in Computer Science, vol 5009. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-79721-0_10

DOI: https://doi.org/10.1007/978-3-540-79721-0_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-79720-3
Online ISBN: 978-3-540-79721-0
eBook Packages: Computer Science, Computer Science (R0)
