Hybridization of K-means Clustering Using Different Distance Function to Find the Distance Among Dataset

Yadav, Kusum; Gupta, Sunil; Gupta, Neetu; Gupta, Sohan Lal; Khandelwal, Girraj

doi:10.1007/978-981-15-7078-0_29

Kusum Yadav⁷,
Sunil Gupta⁷,
Neetu Gupta⁷,
Sohan Lal Gupta⁷ &
…
Girraj Khandelwal⁷

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 195))

Included in the following conference series:

International Conference on Information and Communication Technology for Intelligent Systems

769 Accesses
3 Citations

Abstract

Clustering using K-means algorithm is very common way to understand and analyze the obtained output data. This clustering is done by making a sample collection of similar objects in one slot and non-similar objects in another slot. The slot can have say let N number of data objects and say let K number of clusters (slot) are formed to make a bunch of common data where k < n. Each cluster has its own centroid, but we cannot predict how exactly cluster shall be based on data. Cluster formation has random occurence, i.e., if one object is dissimilar from rest of all other cluster objects, then the dissimilar object must have its own cluster. This is the main drawback of this approach, and we cannot get optimal result from this clustering approach. To overcome the drawback challenge, we are proposing a formula to find the clusters at the run time, so this approach can give us optimal results. The proposed approach uses Euclidian distance formula as well Mehlanobis to find the minimum distance between slots as technically we called as clusters, and we have also applied the same approach to ant colony optimization (ACO) algorithm which results in the production of two- and multi-dimensional matrix.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Data clustering using K-Means based on Crow Search Algorithm

Article 22 October 2018

Maxmin distance sort heuristic-based initial centroid method of partitional clustering for big data mining

Article 24 January 2022

The Clustering Algorithm Based on Improved Antlion Optimization Algorithm with K-Means Concepts

References

Wang, X., Jiao, Y., Fei, S.: Estimation of clusters number and initial centers of K-means algorithm using watershed method. In: 2015 14th International Symposium on Distributed Computing and Applications for Business Engineering and Science
Google Scholar
Blum, C., Sampels, M.: Ant colony optimization for FOP shop scheduling: a case study on different pheromone representations. In: Proceedings of the 2002 Congress on Evolutionary Computation (CEC’02), vol. 2, pp. 1558–1563. IEEE Computer Society Press, Los Alamitos, CA (2002)
Google Scholar
Zhang, Y., Xu, X., Liu, Y., Li, X., Ye, Y.: An agglomerative fuzzy K-means approach to building decision cluster classifiers. In: 2011 Second International Conference on Innovations in Bio-inspired Computing and Applications
Google Scholar
Qi, J., Yu, Y., Wang, L., Liu, J.: K-means: an effective and efficient K-means clustering algorithm. In: 2016 IEEE International Conferences on Big Data and Cloud Computing (BDCloud), Social Computing and Networking, (SocialCom), Sustainable Computing and Communications (SustainCom)
Google Scholar
Kane, A., Determining the number of clusters for a K-means clustering algortihm. Indian J. Comput. Sci. Eng. (IJCSE)
Google Scholar
Dorigo, M., Birattari, M., Stitzle, T.: Ant colony optimization: artificial ants as a computational intelligence technique. IEEE Comput. Intell. Mag. (2006)
Google Scholar
Tang, R., Fong, S., Yang, X.-S., Deb, S.: Integrating nature-inspired optimization algorithms to K-means clustering
Google Scholar
Dubey, A., Choubey, A.: A systematic review on K-means clustering techniques. Int. J. Sci. Res. Eng. Technol. (IJSRET) 6(6) (2017). ISSN 2278-0882
Google Scholar
Handhayani, T., Wasito, I.: Fully unsupervised clustering in nonlinearly separable data using intelligent kernel K-means. In: ICACSIS 2014. ©2014 IEEE, 978-1-4799-8075-8/14/$31.00
Google Scholar
Sun, Y., Liu, G., Xu, K., A k-means-based projected clustering algorithm. In: 2010 Third International Joint Conference on Computational Science and Optimization
Google Scholar
Tang, R., Fong, S., Yang, X.-S., Deb, S.: Integrating nature-inspired optimization algorithms to K-means clustering. © 2012 IEEE, 978-1-4673-2430-4/12/$31.00
Google Scholar
Blum, C., Sampels, M.: An ant colony optimization algorithm for shop scheduling problems. J. Math. Model. Alg. 3(3), 285–308 (2004)
MathSciNet MATH Google Scholar
Blum, C., Sampels, M.: When model bias is stronger than selection pressure. In: Merelo Guervos, J.J., et al. (eds.) Proceedings of the PPSN-VII, Seventh International Conference on Parallel Problem Solving from Nature. Lecture Notes in Computer Science, vol. 2439, pp. 893–902. Springer, Berlin, Germany (2002)
Google Scholar
Khadem, A., Nezhad, E.F., Sharifi, M.: Data mining: methods and utilities. Researcher 5(12), 47–59 (2013). ISSN: 1553-9865
Google Scholar
Ghosh, S., Dubey, S.K.: Comparative analysis of K-means and fuzzy C-means algorithms. Int. J. Adv. Comput. Sci. Appl. 4(4) (2013)
Google Scholar
Sun, H., Wang, S., Jiang, Q.: Fcm-based model selection algorithms for determining the number of clusters. Pattern Recognit. 37(10), 2027–2037 (2004)
MATH Google Scholar
Li, M., Ng, M., Cheung, Y., Huang, J.: Agglomerative fuzzy k-means clustering algorithm with selection of number of clusters. TKDE 20(11), 1519–1534 (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Global Institute of Technology, Jaipur, Rajasthan, India
Kusum Yadav, Sunil Gupta, Neetu Gupta, Sohan Lal Gupta & Girraj Khandelwal

Authors

Kusum Yadav
View author publications
You can also search for this author in PubMed Google Scholar
Sunil Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Neetu Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Sohan Lal Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Girraj Khandelwal
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kusum Yadav .

Editor information

Editors and Affiliations

Department of Electrical and Electronics Engineering, University of the Ryukyus, Nishihara, Japan
Tomonobu Senjyu
Sinhgad Technical Education Society, SKNCOE, Pune, India
Parikshit N. Mahalle
Universiti Putra Malaysia, Serdang, Malaysia
Thinagaran Perumal
Global Knowledge Research Foundation, Ahmedabad, India
Amit Joshi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yadav, K., Gupta, S., Gupta, N., Gupta, S.L., Khandelwal, G. (2021). Hybridization of K-means Clustering Using Different Distance Function to Find the Distance Among Dataset. In: Senjyu, T., Mahalle, P.N., Perumal, T., Joshi, A. (eds) Information and Communication Technology for Intelligent Systems. ICTIS 2020. Smart Innovation, Systems and Technologies, vol 195. Springer, Singapore. https://doi.org/10.1007/978-981-15-7078-0_29

Download citation

DOI: https://doi.org/10.1007/978-981-15-7078-0_29
Published: 22 October 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-7077-3
Online ISBN: 978-981-15-7078-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Hybridization of K-means Clustering Using Different Distance Function to Find the Distance Among Dataset

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Data clustering using K-Means based on Crow Search Algorithm

Maxmin distance sort heuristic-based initial centroid method of partitional clustering for big data mining

The Clustering Algorithm Based on Improved Antlion Optimization Algorithm with K-Means Concepts

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Hybridization of K-means Clustering Using Different Distance Function to Find the Distance Among Dataset

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Data clustering using K-Means based on Crow Search Algorithm

Maxmin distance sort heuristic-based initial centroid method of partitional clustering for big data mining

The Clustering Algorithm Based on Improved Antlion Optimization Algorithm with K-Means Concepts

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation