Skip to main content

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 618))

  • 209 Accesses

Abstract

Although there are many clustering algorithms are introduced, the K-means clustering algorithm is the most universal algorithm because of its simplicity. Several modifications are also introduced to overcome the drawbacks of the K-means clustering algorithm. But still, it suffers from centroid initialization of clusters primarily which may help to elevate the efficiency of clustering. Hence this paper put forward a way of initializing cluster centroids smartly so that drawbacks of the previous algorithm may be eliminated. The proposed algorithm divides the dataspace into K (number of clusters) equally by calculating the center point of each partition set as initial cluster centers. Compared to the original and some other modified versions of the K-means grouping algorithm made by researchers, the proposed model of writing paper provides better results in terms of time complexity, space complexity, inter-cluster distance, etc.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 219.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 279.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Na S, Xumin L, Yong G (2010) Research on k-means clustering algorithm: an improved k-means clustering algorithm. Third Int Symp Intell Inf Technol Secur Inform 2010:63–67. https://doi.org/10.1109/IITSI.2010.74

    Article  Google Scholar 

  2. Xu H, Yao S, Li Q, Ye Z (2020) An improved K-means clustering algorithm. In: 2020 IEEE 5th international symposium on smart and wireless systems within the conferences on intelligent data acquisition and advanced computing systems (IDAACS-SWS), pp 1–5. https://doi.org/10.1109/IDAACS-SWS50031.2020.9297060.

  3. Huang Z (1998) Extensions to the k-means algorithm for clustering large data sets with categorical values. Data Min Knowl Disc 2:283–304

    Article  Google Scholar 

  4. Tian Z, Ramakrishnan R, Miron L (1996) BIRCH: an efficient data clustering method for very large databases. ACM SIGMOD Rec 25(2):103–114

    Google Scholar 

  5. Guha S, Rastogi R, Shim K (1998) CURE: an efficient clustering algorithm for large databases. SIGMOD'98, Seattle, Washington, pp 73–84

    Google Scholar 

  6. Ester BM, Kriegel HP, Sander J, Xu X (1996) A density based algorithm for discovering clusters in large spatial databases. In: Proceeding of 1996 international conference on knowledge discovery and data mining, pp 226–231

    Google Scholar 

  7. Ankerst M, Breunig MM, Kriegel HP, Sander J (1999) OPTICS: ordering points to identify the clustering structure. ACM SIGMOD Record 28(2):49–60

    Google Scholar 

  8. Rodriguez A, Laio A (2014) Clustering by fast search and find of density peaks. Science 344(6191):1492–1496

    Article  Google Scholar 

  9. Sheikholeslami G, Chatterjee S, Zhang A (1998) WaveCluster: a multi-resolution clustering approach for very large spatial databases. In: Proceeding of the 24th international conference on very large data bases (VLDB '98), pp 428–439

    Google Scholar 

  10. Begum M, Das BC, Hossain MZ, Saha A, Papry KA (2021) An improved Kohonen self-organizing map clustering algorithm for high-dimensional data sets. Indones J Electr Eng Comput Sci 24(1):600–610. ISSN: 2502-4752. https://doi.org/10.11591/ijeecs.v24.i1.pp600-610

  11. Sun H, Chen Y, Lai J, Wang Y, Liu X, Identifying tourists and locals by K-means clustering method from mobile phone signaling data. J Transport Eng Part A: Syst 147(10):04021070

    Google Scholar 

  12. Hutagalung J, Ginantra NLSR, Bhawika GW, Parwita WGS, Wanto A, Panjaitan PD (2020) COVID-19 cases and deaths in Southeast Asia clustering using k-means algorithm. In: Annual Conference on Science and Technology Research (ACOSTER) 2020, 20–21 June 2020, Medan, Indonesia

    Google Scholar 

  13. Khorshidi N, Parsa M, Lentz DR, Sobhanverdi J (2021) Identification of heavy metal pollution sources and its associated risk assessment in an industrial town using the K-means clustering technique. Appl Geochem 135:105113, ISSN 0883-2927. https://doi.org/10.1016/j.apgeochem.2021.105113

  14. Mardi M, Keyvanpour MR (2021) GBKM: a new genetic based k-means clustering algorithm. In: 2021 7th international conference on web research (ICWR), pp 222–226. https://doi.org/10.1109/ICWR51868.2021.9443113

  15. Capó M, Pérez A, Lozano JA (2022) An efficient split-merge re-start for the K-means algorithm. IEEE Trans Knowl Data Eng 34(4):1618–1627. https://doi.org/10.1109/TKDE.2020.3002926.

  16. Rahman Z, Hossain MS, Hasan M, Imteaj A (2021) An enhanced method of initial cluster center selection for K-means algorithm. Innov Intell Syst Appl Conf (ASYU) 2021:1–6. https://doi.org/10.1109/ASYU52992.2021.9599017

    Article  Google Scholar 

  17. Sen A, Pandey M, Chakravarty K (2020) Random centroid selection for K-means clustering: a proposed algorithm for improving clustering results. In: 2020 international conference on computer science, engineering and applications (ICCSEA), pp 1–4. https://doi.org/10.1109/ICCSEA49143.2020.9132921.

  18. Begum M, Akthar MN (2013) KSOMKM: an efficient approach for high dimensional dataset clustering. Int J Electr Energy 1(2):102–107. https://doi.org/10.12720/ijoee.1.2.102-107

    Article  Google Scholar 

  19. Singh RV, Bhatia MPS (2011) Data clustering with modified K-means algorithm. Int Conf Recent Trends Inf Technol (ICRTIT) 2011:717–721. https://doi.org/10.1109/ICRTIT.2011.5972376

    Article  Google Scholar 

  20. Tajunisha S, Saravanan V (2010) Performance analysis of k-means with different initialization methods for high dimensional data. Int J Artif Intell Appl (IJAIA) 1(4):44–52

    Google Scholar 

  21. Yuan, Yang H (2019) Research on K-value selection method of K-means clustering algorithm. J 2(2):226–235. https://doi.org/10.3390/j2020016

  22. Kandali K, Bennis L, Bennis H (2021) A New hybrid routing protocol using a modified K-means clustering algorithm and continuous hopfield network for VANET. IEEE Access 9:47169–47183. https://doi.org/10.1109/ACCESS.2021.3068074

    Article  Google Scholar 

  23. Motwani M, Arora N, Gupta A (2019) A study on initial centroids selection for partitional clustering algorithms. In: Software engineering. Springer, pp 211–220

    Google Scholar 

  24. http://ijcsit.com/docs/Volume%205/vol5issue06/ijcsit2014050688.pdf.

  25. He Z, Yu C (2019) Clustering stability-based evolutionary K-Means. Soft Comput 23:305–321. https://doi.org/10.1007/s00500-018-3280-0

    Article  MATH  Google Scholar 

  26. Zhao Y, Zhou X (2021) K-means clustering algorithm and its improvement research. In: Journal of Physics: Conference Series, Volume 1873, 2021 2nd International Workshop on Electronic communication and Artificial Intelligence (IWECAI 2021), 12–14 March 2021, Nanjing, China

    Google Scholar 

  27. Ghazal TM, Hussain MZ, Said RA, Nadeem A, Hasan MK, Ahmad M, Khan MA, Naseem MT, Intelligent automation and soft computing. 30(2):735–742. https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/81931, https://doi.org/10.32604/iasc.2021.019067

  28. Patra GK, sahu KK, Normalization: a preprocessing stage. https://doi.org/10.48550/arXiv.1503.06462

  29. Newman D, Hettich S, Blake C, Merz C (1998) UCI repository of machine learning databases. http://www.ics.uci.edu/~mlearn/MLRepository.html

  30. https://en.wikipedia.org/wiki/Maxima_and_minima

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Md. Hamidur Rahman .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Hamidur Rahman, M., Begum, M. (2023). Partitional Technique for Searching Initial Cluster Centers in K-means Algorithm. In: Kaiser, M.S., Waheed, S., Bandyopadhyay, A., Mahmud, M., Ray, K. (eds) Proceedings of the Fourth International Conference on Trends in Computational and Cognitive Engineering. Lecture Notes in Networks and Systems, vol 618. Springer, Singapore. https://doi.org/10.1007/978-981-19-9483-8_22

Download citation

Publish with us

Policies and ethics