Negative Encoding Length as a Subjective Interestingness Measure for Groups of Rules

Suzuki, Einoshin

doi:10.1007/978-3-642-01307-2_22

Einoshin Suzuki²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5476))

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

3281 Accesses
3 Citations

Abstract

We propose an interestingness measure for groups of classification rules which are mutually related based on the Minimum Description Length Principle. Unlike conventional methods, our interestingness measure is based on a theoretical background, has no parameter, is applicable to a group of any number of rules, and can exploit an initial hypothesis. We have integrated the interestingness measure with practical heuristic search and built a rule-group discovery method CLARDEM (Classification Rule Discovery method based on an Extended-Mdlp). Extensive experiments using both real and artificial data confirm that CLARDEM can discover the correct concept from a small noisy data set and an approximate initial concept with high “discovery accuracy”.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

Finding Probabilistic Rule Lists using the Minimum Description Length Principle

Sets of Robust Rules, and How to Find Them

Classification Rule Mining with Iterated Greedy

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Baram, Y.: Partial Classification: The Benefit of Deferred Decision. IEEE Trans. Pattern Analysis and Machine Intelligence 20(8), 769–776 (1998)
Article MathSciNet Google Scholar
Blake, C., Merz, C.J., Keogh, E.: UCI Repository of Machine Learning Databases, http://www.ics.uci.edu/~mlearn/MLRepository.html
Jaroszewicz, S., Simovici, D.A.: Interestingness of Frequent Itemsets Using Bayesian Networks as Background Knowledge. In: Proc. Tenth ACM SIGKDD Int’l Conf. on Knowledge Discovery and Data Mining (KDD), pp. 178–186 (2004)
Google Scholar
Padmanabhan, B., Tuzhilin, A.: Small is Beautiful: Discovering the Minimal Set of Unexpected Patterns. In: Proc. KDD, pp. 54–63 (2000)
Google Scholar
Grünwald, P.D.: The Minimum Description Length Principle. MIT Press, Cambridge (2007)
Google Scholar
Quinlan, J.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)
Google Scholar
Quinlan, J.R.: Learning Logical Definitions from Relations. Machine Learning 5(3), 239–266 (1990)
Google Scholar
Quinlan, J.R., Rivest, R.L.: Inferring Decision Trees Using the Minimum Description Length Principle. Information and Computation 80(3), 227–248 (1989)
Article MathSciNet MATH Google Scholar
Rissanen, J.: Stochastic Complexity in Statistical Inquiry. World Scientific, Singapore (1989)
MATH Google Scholar
Shannon, C.: A Mathematical Theory of Communication. Bell System Technical Journal 27, 379–423, 623–656 (1948)
Article MathSciNet MATH Google Scholar
Siebes, A., Vreeken, J., van Leeuwen, M.: Item Sets that Compress. In: 2006 SIAM Conference on Data Mining (SDM), pp. 393–404 (2006)
Google Scholar
Smyth, P., Goodman, R.M.: An Information Theoretic Approach to Rule Induction from Databases. IEEE TKDE 4(4), 301–316 (1992)
Google Scholar
Tan, P.-N., Kumar, V., Srivastava, J.: Selecting the Right Interestingness Measure for Association Patterns. In: Proc. KDD, pp. 32–41 (2002)
Google Scholar
Tangkitvanich, S., Shimura, M.: Learning from an Approximate Theory and Noisy Examples. In: Proc. AAAI, pp. 466–471 (1993)
Google Scholar
Wallace, C.S., Patrick, J.D.: Coding Decision Trees. Machine Learning 11(1), 7–22 (1993)
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Informatics, ISEE, Kyushu University, Japan
Einoshin Suzuki

Authors

Einoshin Suzuki
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Sirindhorn International Institute of Technology, Thammasat University, 131 Moo 5 Tiwanont Road, 12000, Bangkadi, Muang, Pathumthani, Thailand
Thanaruk Theeramunkong
Dept. of Computer Engineering, Faculty of Engineering, Chulalongkorn University, 10330, Bangkok, Thailand
Boonserm Kijsirikul
Faculty of Science & Engineering, York University, 355 Lumbers Building, 4700 Keele Street, M3J 1P3, Toronto, Ontario, Canada
Nick Cercone
School of Knowledge Science, Japan Advanced Institute of Science and Technology, 1-1 Asahidai, Nomi, 923-1292, Ishikawa, Japan
Tu-Bao Ho

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Suzuki, E. (2009). Negative Encoding Length as a Subjective Interestingness Measure for Groups of Rules. In: Theeramunkong, T., Kijsirikul, B., Cercone, N., Ho, TB. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2009. Lecture Notes in Computer Science(), vol 5476. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01307-2_22

Download citation

DOI: https://doi.org/10.1007/978-3-642-01307-2_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01306-5
Online ISBN: 978-3-642-01307-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Negative Encoding Length as a Subjective Interestingness Measure for Groups of Rules

Abstract

Chapter PDF

Similar content being viewed by others

Finding Probabilistic Rule Lists using the Minimum Description Length Principle

Sets of Robust Rules, and How to Find Them

Classification Rule Mining with Iterated Greedy

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Negative Encoding Length as a Subjective Interestingness Measure for Groups of Rules

Abstract

Chapter PDF

Similar content being viewed by others

Finding Probabilistic Rule Lists using the Minimum Description Length Principle

Sets of Robust Rules, and How to Find Them

Classification Rule Mining with Iterated Greedy

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation