Abstract
Genetic algorithms (GAs) are randomized search and optimization techniques that have proven robust and effective in large-scale problems. In this work, we propose a new GA approach for solving the automatic clustering problem: ACGA, the Automatic Clustering Genetic Algorithm. It is capable of finding the optimal number of clusters in a dataset and correctly assigning each data point to a cluster without any prior knowledge about the data. An encoding scheme that had not previously been tested with GAs is adopted, and new genetic operators are developed. The algorithm can use any cluster validity function as its fitness function. Experimental validation shows that this new approach outperforms the classical clustering methods K-means and fuzzy c-means (FCM). The method provides good results and requires a small number of iterations to converge.
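To illustrate what using a cluster validity function as the fitness function can look like, the following is a minimal sketch and not the ACGA implementation. It assumes the Calinski-Harabasz index as one example of a validity function, and it assumes a plain label-vector encoding (one cluster label per data point) purely for illustration; the abstract does not describe the paper's own encoding or operators. The names `calinski_harabasz` and `best_individual` are hypothetical.

```python
from typing import Callable, Dict, List, Sequence

Point = Sequence[float]
Labels = List[int]


def _centroid(points: Sequence[Point]) -> List[float]:
    """Component-wise mean of a non-empty list of points."""
    n = len(points)
    return [sum(p[d] for p in points) / n for d in range(len(points[0]))]


def _sq_dist(a: Point, b: Point) -> float:
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def calinski_harabasz(data: Sequence[Point], labels: Labels) -> float:
    """Ratio of between-cluster to within-cluster dispersion (higher is better).

    Assumption: used here only as an example validity function.
    """
    clusters: Dict[int, List[Point]] = {}
    for point, label in zip(data, labels):
        clusters.setdefault(label, []).append(point)

    k, n = len(clusters), len(data)
    if k < 2 or k >= n:
        # The index is undefined for one cluster or for all-singleton clusters.
        return float("-inf")

    overall = _centroid(data)
    between = 0.0
    within = 0.0
    for members in clusters.values():
        center = _centroid(members)
        between += len(members) * _sq_dist(center, overall)
        within += sum(_sq_dist(p, center) for p in members)

    if within == 0.0:
        return float("inf")
    return (between / (k - 1)) / (within / (n - k))


def best_individual(
    data: Sequence[Point],
    population: Sequence[Labels],
    fitness: Callable[[Sequence[Point], Labels], float] = calinski_harabasz,
) -> Labels:
    """Return the candidate partition with the highest fitness.

    The fitness argument is pluggable, so any validity function with the
    same signature could be swapped in.
    """
    return max(population, key=lambda labels: fitness(data, labels))


if __name__ == "__main__":
    data = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0)]
    population = [[0, 1, 0, 1], [0, 0, 1, 1], [0, 0, 0, 0]]
    print(best_individual(data, population))
```

Because the number of distinct labels in each candidate is free to vary, ranking candidates by a validity index lets the search compare partitions with different numbers of clusters, which is the property an automatic clustering method relies on.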
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Raposo, C., Antunes, C.H., Barreto, J.P. (2014). Automatic Clustering Using a Genetic Algorithm with New Solution Encoding and Operators. In: Murgante, B., et al. Computational Science and Its Applications – ICCSA 2014. ICCSA 2014. Lecture Notes in Computer Science, vol 8580. Springer, Cham. https://doi.org/10.1007/978-3-319-09129-7_7
DOI: https://doi.org/10.1007/978-3-319-09129-7_7
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09128-0
Online ISBN: 978-3-319-09129-7
eBook Packages: Computer Science, Computer Science (R0)