Abstract
In this paper we address the problem of multi-objective attribute selection in data mining. We propose a multi-objective genetic algorithm (GA) based on the wrapper approach to discover the best subset of attributes for a given classification algorithm, namely C4.5, a well-known decision-tree algorithm. The two objectives to be minimized are the error rate and the size of the tree produced by C4.5. The proposed GA is a multi-objective method in the sense that it discovers a set of non-dominated solutions (attribute subsets), according to the concept of Pareto dominance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bhattacharyya, S., Evolutionary Algorithms in Data mining: Multi-Objective Performance Modeling for Direct Marketing. In: Proc KDD-2000, ACM Press (2000)465–471
Deb, K, Multi-Objective Evolutionary Algorithms: Introducing Bias Among Pareto-Optimal Solutions. Kanpur Genetic Algorithms Laboratory Report n° 99002, India (1999)
Deb, K., Multi-Objective Optimization using Evolutionary Algorithms, John Wiley & Sons, England (2001)
Fidelis, M.V., Lopes, H.S., Freitas, A.A., Discovering Comprehensible Classification Rules with a Genetic Algorithm. In: Proc. Congress on Evolutionary Computation (2000)
Goldberg, D. E., Genetic Algorithms in Search, Optimization, and Machine Learning. Addison-Wesley Publishing Company (1989)
Freitas, A. A.; Understanding the Crucial Role of Attribute Interaction in Data Mining. In: Artificial Intelligence Review 16, Kluwer Academic Publishers (2001) 177–199
Freitas, A.A., Data Mining and Knowledge Discovery with Evolutionary Algorithms (forthcoming book). Springer-Verlag (2002)
Holsheimer, M., Siebes, A., Data Mining — The Search for Knowledge in Databases. Report CS-R9406, Amsterdam: CWI (1991)
Ishibuchi, H., Nakashima, T., Multi-objective Pattern and Feature Selection by a Genetic Algorithm. In: Proc. Genetic and Evolutionary Computation Conf. (GECCO-2000), Morgan Kaufmann (2000) 1069–1076
Liu, H.; Motoda, H., Feature Selection for Knowledge Discovery and Data Mining. Kluwer Academic Publishers (1998)
Martin-Bautista, M. J., Vila, M. A., A Survey of Genetic Feature Selection in Mining Issues. In: Proc. IEEE Conference on Evolutionary Computation, Washington (1999) 1314–1321.
Michalewicz, Z., Genetic Algorithms + Data Structures = Evolution Programs. 3rd edn. Springer-Verlag (1996)
Murphy, P.M., Aha, D.W., UCI Repository of Machine Learning databases. [http://www.ics.uci.edu/~mlearn/MLRepository.html]. Irvine, CA: University of California, Departament of information and Computer Science (1994)
Rozspyal, A., Kubat, M., Using Genetic Algorithm to Reduce the Size of a Nearest-Neighbor Classifier and Select Relevant Attributes. Proc. Int. Conf. Machine Learning (ICML-2001), Morgan Kauf. (2001)
Quinlan, J.R., C4.5: Programs for Machine Learning. Morgan Kaufmann (1993)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pappa, G.L., Freitas, A.A., Kaestner, C.A.A. (2002). Attribute Selection with a Multi-objective Genetic Algorithm. In: Bittencourt, G., Ramalho, G.L. (eds) Advances in Artificial Intelligence. SBIA 2002. Lecture Notes in Computer Science(), vol 2507. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36127-8_27
Download citation
DOI: https://doi.org/10.1007/3-540-36127-8_27
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00124-9
Online ISBN: 978-3-540-36127-5
eBook Packages: Springer Book Archive