Skip to main content

Hybridization of K-means Clustering Using Different Distance Function to Find the Distance Among Dataset

  • Conference paper
  • First Online:
Information and Communication Technology for Intelligent Systems ( ICTIS 2020)

Abstract

Clustering using K-means algorithm is very common way to understand and analyze the obtained output data. This clustering is done by making a sample collection of similar objects in one slot and non-similar objects in another slot. The slot can have say let N number of data objects and say let K number of clusters (slot) are formed to make a bunch of common data where k < n. Each cluster has its own centroid, but we cannot predict how exactly cluster shall be based on data. Cluster formation has random occurence, i.e., if one object is dissimilar from rest of all other cluster objects, then the dissimilar object must have its own cluster. This is the main drawback of this approach, and we cannot get optimal result from this clustering approach. To overcome the drawback challenge, we are proposing a formula to find the clusters at the run time, so this approach can give us optimal results. The proposed approach uses Euclidian distance formula as well Mehlanobis to find the minimum distance between slots as technically we called as clusters, and we have also applied the same approach to ant colony optimization (ACO) algorithm which results in the production of two- and multi-dimensional matrix.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Wang, X., Jiao, Y., Fei, S.: Estimation of clusters number and initial centers of K-means algorithm using watershed method. In: 2015 14th International Symposium on Distributed Computing and Applications for Business Engineering and Science

    Google Scholar 

  2. Blum, C., Sampels, M.: Ant colony optimization for FOP shop scheduling: a case study on different pheromone representations. In: Proceedings of the 2002 Congress on Evolutionary Computation (CEC’02), vol. 2, pp. 1558–1563. IEEE Computer Society Press, Los Alamitos, CA (2002)

    Google Scholar 

  3. Zhang, Y., Xu, X., Liu, Y., Li, X., Ye, Y.: An agglomerative fuzzy K-means approach to building decision cluster classifiers. In: 2011 Second International Conference on Innovations in Bio-inspired Computing and Applications

    Google Scholar 

  4. Qi, J., Yu, Y., Wang, L., Liu, J.: K-means: an effective and efficient K-means clustering algorithm. In: 2016 IEEE International Conferences on Big Data and Cloud Computing (BDCloud), Social Computing and Networking, (SocialCom), Sustainable Computing and Communications (SustainCom)

    Google Scholar 

  5. Kane, A., Determining the number of clusters for a K-means clustering algortihm. Indian J. Comput. Sci. Eng. (IJCSE)

    Google Scholar 

  6. Dorigo, M., Birattari, M., Stitzle, T.: Ant colony optimization: artificial ants as a computational intelligence technique. IEEE Comput. Intell. Mag. (2006)

    Google Scholar 

  7. Tang, R., Fong, S., Yang, X.-S., Deb, S.: Integrating nature-inspired optimization algorithms to K-means clustering

    Google Scholar 

  8. Dubey, A., Choubey, A.: A systematic review on K-means clustering techniques. Int. J. Sci. Res. Eng. Technol. (IJSRET) 6(6) (2017). ISSN 2278-0882

    Google Scholar 

  9. Handhayani, T., Wasito, I.: Fully unsupervised clustering in nonlinearly separable data using intelligent kernel K-means. In: ICACSIS 2014. ©2014 IEEE, 978-1-4799-8075-8/14/$31.00

    Google Scholar 

  10. Sun, Y., Liu, G., Xu, K., A k-means-based projected clustering algorithm. In: 2010 Third International Joint Conference on Computational Science and Optimization

    Google Scholar 

  11. Tang, R., Fong, S., Yang, X.-S., Deb, S.: Integrating nature-inspired optimization algorithms to K-means clustering. © 2012 IEEE, 978-1-4673-2430-4/12/$31.00

    Google Scholar 

  12. Blum, C., Sampels, M.: An ant colony optimization algorithm for shop scheduling problems. J. Math. Model. Alg. 3(3), 285–308 (2004)

    MathSciNet  MATH  Google Scholar 

  13. Blum, C., Sampels, M.: When model bias is stronger than selection pressure. In: Merelo Guervos, J.J., et al. (eds.) Proceedings of the PPSN-VII, Seventh International Conference on Parallel Problem Solving from Nature. Lecture Notes in Computer Science, vol. 2439, pp. 893–902. Springer, Berlin, Germany (2002)

    Google Scholar 

  14. Khadem, A., Nezhad, E.F., Sharifi, M.: Data mining: methods and utilities. Researcher 5(12), 47–59 (2013). ISSN: 1553-9865

    Google Scholar 

  15. Ghosh, S., Dubey, S.K.: Comparative analysis of K-means and fuzzy C-means algorithms. Int. J. Adv. Comput. Sci. Appl. 4(4) (2013)

    Google Scholar 

  16. Sun, H., Wang, S., Jiang, Q.: Fcm-based model selection algorithms for determining the number of clusters. Pattern Recognit. 37(10), 2027–2037 (2004)

    MATH  Google Scholar 

  17. Li, M., Ng, M., Cheung, Y., Huang, J.: Agglomerative fuzzy k-means clustering algorithm with selection of number of clusters. TKDE 20(11), 1519–1534 (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kusum Yadav .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Yadav, K., Gupta, S., Gupta, N., Gupta, S.L., Khandelwal, G. (2021). Hybridization of K-means Clustering Using Different Distance Function to Find the Distance Among Dataset. In: Senjyu, T., Mahalle, P.N., Perumal, T., Joshi, A. (eds) Information and Communication Technology for Intelligent Systems. ICTIS 2020. Smart Innovation, Systems and Technologies, vol 195. Springer, Singapore. https://doi.org/10.1007/978-981-15-7078-0_29

Download citation

Publish with us

Policies and ethics