Abstract
We present a new hybrid algorithm for data clustering. This new proposal uses one of the well known evolutionary algorithms called Scatter Search. Scatter Search operates on a small set of solutions and makes only a limited use of randomization for diversification when searching for globally optimal solutions. The proposed method discovers automatically cluster number and cluster centres without prior knowledge of a possible number of class, and without any initial partition. We have applied this algorithm on standard and real world databases and we have obtained good results compared to the K-means algorithm and an artificial ant based algorithm, the Antclass algorithm.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Blake, C.L., Merz, C.J.: UCI Repository of Machine Learning Databases, University of California, Irvine, Dept. of Information and Computer Sciences (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html
Corne, D., Dorigo, M., Glover, F. (eds.): New Ideas in Optimisation. McGraw-Hill, London (1999)
Glover, F., Laguna, M., Martí, R.: Scatter search. In: Ghosh, A., Tsutsui, S. (eds.) Advances in Evolutionary Computation: Theory and Applications, pp. 519–537. Springer, New York (2003)
Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Academic press, London (2001)
Laguna, M., Martí, R.: Experimental Testing of Advanced Scatter Search Designs for Global Optimization of Multimodal Functions. In: Global Optimization (1997)
Lozano, J.A., Larrañaga, P.: Using Genetic Algorithms to Get the Classes and their Number in a Partitional Cluster Analysis of Large Data Sets, http://citeseer.ist.psu.edu/457425.html
Martí, R., Laguna, M., Campos, V.: Scatter Search vs. Genetic Algorithms: An Experimental Evaluation with Permutation Problems. In: Rego, C., Alidaee, B. (eds.) Adaptive Memory and Evolution: Tabu Search and Scatter Search. Kluwer Academic Publishers, Dordrecht (1997)
Monmarché, N.: Algorithmes de fourmis artificielles: applications à la classification et à l’optimisation. Thèse de doctorat, Laboratoire d’Informatique, Université de Tours (December 2000)
Monmarché, N.: Artificial datasets. Handicap et Nouvelles Technologies(HaNT), Laboratoire d’Informatique, http://www.hant.li.univ-tours.fr/webhant/index.php?pageid=55
Pacheco, J., Valencia, O.: Design of Hybrids for Minimum Sum-of-Squares Clustering Problem. Computational Statistic and Data Analysis 43(2), 235–248 (2003)
Sarkar, M., Yegnanafayana, B., Khemani, D.: A Clustering Algorithm using an Evolutionary Programming-based Approach. Pattern Recognition letters 18, 975–986 (1997)
Zhang, B., Kleyner, G., Hsu, M.: A Local Search Approach to K-Clustering. HP-Lab Tech. Rep. (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Abdule-Wahab, R.S., Monmarché, N., Slimane, M., Fahdil, M.A., Saleh, H.H. (2006). A Scatter Search Algorithm for the Automatic Clustering Problem. In: Perner, P. (eds) Advances in Data Mining. Applications in Medicine, Web Mining, Marketing, Image and Signal Mining. ICDM 2006. Lecture Notes in Computer Science(), vol 4065. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11790853_28
Download citation
DOI: https://doi.org/10.1007/11790853_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36036-0
Online ISBN: 978-3-540-36037-7
eBook Packages: Computer ScienceComputer Science (R0)