Abstract
Multivariate dataset is clustered and compared for improvement for various clustering techniques using four well-known algorithms used in clustering techniques such as like k-Means, Fuzzy K-Means, Rough K-Means, and Fanny taking help of statistical programming environment R. Well-known dataset IRIS is used with 150 instances and 3 classes. Simulated finding speaks in favor of Rough K-Means LW algorithm with Funny as it can allocate all data with zero standard deviation. Dataset are also plotted for all the algorithms used here separately. Result can be extended for analysis of clustered data for any arbitrary dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Fisher, R.A.: The use of multiple measurements in taxonomic problems, Ann. Eugenics 7(II), 179–188 (1936)
Duda, R.O., Hart, P.E.: Pattern Classification and Scene Analysis. Wiley, New York, p. 218 (1973)
Dasarathy, B.V.: Nosing around the neighborhood: a new system structure and classification rule for recognition in partially exposed environments. IEEE Trans. Pattern Anal. Mach. Intell. PAMI-2(1), 67–71 (1980)
Gates, G.W.: The reduced nearest neighbor rule. IEEE Trans. Inf. Theory 18(3), 431–433 (1972)
Aeberhard, S., Coomans, D., deVel, O.: The performance of statistical pattern recognition methods. In: High Dimensional Settings. IEEE Signal Processing Workshop on Higher Order Statistics (1994)
Hershberger, D.E., Kargupta, H.: Distributed multivariate regression using wavelet-based collective data mining. J. Parallel Distrib. Comput. 61(3), 372–400 (2001)
Demiriz, A., Bennett, K.P., Embrechts, M.J.: A genetic algorithm approach for semi-supervised clustering. Int. J. Smart Eng. Syst. Design 4(1), 21–30 (2002)
Vlachos, M., Domeniconi, C., Gunopulos, D., Kollios, G., Koudas, N.: Non-linear dimensionality reduction techniques for classification and visualization. In: Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 645–651 (2002)
Likas, A., Vlassis, N., Verbeek, J.J.: The global k-means clustering algorithm. Pattern Recogn. 36(2), 451–461 (2003)
Eggermont, J., Kok, J.N., Kosters, W.A.: Genetic Programming for data classification: partitioning the search space. In: Proceedings of the ACM Symposium on Applied Computing, pp. 1001–1005 (2004)
Brodley, C.E.: Feature selection for unsupervised learning. J. Mach. Learn. Res. 5, 845–889 (2004)
Kim, Y.S., Street, W.N., Menczer, F.: Optimal ensemble construction via meta-evolutionary ensembles. Expert Syst. Appl. 30(4), 705–714 (2006)
Hua, J., Tembe, W.D., Koudas, N.: Performance of feature-selection methods in the classification of high-dimension data. Pattern Recogn. 42(3), 409–424 (2009)
Maji, P., Pal, S.K.: Fuzzy-rough sets for information measures and selection of relevant genes from microarray data. In: IEEE Trans. Syst. Man Cybern. 40(3), 741–752 (2010)
Maji, P., Paul, S.: Rough-fuzzy clustering for grouping functionally similar genes from microarray data. In: IEEE/ACM Trans. Comput. Biol. Bioinform. 10(2), 286–299 (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Bhattacharjee, A.K., Dey, M., Dutta, D., Sett, S., Mukherjee, S., Deyasi, A. (2019). Comparative Study and Improvement of Various Clustering Techniques in Statistical Programming Environment. In: Mandal, J., Sinha, D., Bandopadhyay, J. (eds) Contemporary Advances in Innovative and Applicable Information Technology. Advances in Intelligent Systems and Computing, vol 812. Springer, Singapore. https://doi.org/10.1007/978-981-13-1540-4_15
Download citation
DOI: https://doi.org/10.1007/978-981-13-1540-4_15
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1539-8
Online ISBN: 978-981-13-1540-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)