Skip to main content

Clustering Data Using Techniques of Image Processing Erode and Dilate to Avoid the Use of Euclidean Distance

  • Chapter
  • First Online:
Applications of Hybrid Metaheuristic Algorithms for Image Processing

Part of the book series: Studies in Computational Intelligence ((SCI,volume 890))

  • 601 Accesses

Abstract

Clustering is one of the most popular methods of machine learning. The process of clustering involves the division of a set of abstract objects into a certain number of groups which integrated with objects of similar characteristics. Therefore, a cluster integrates objects which are similar to them, but dissimilar to the elements that belong to the rest of the clusters. Several clustering methods have proposed in the literature with different performance levels. All these techniques use as similarity criterion the Euclidean distance among cluster elements. However, there exist diverse scenarios where the Euclidean distance cannot be utilized appropriately to separate the elements in groups. Under such conditions, traditional cluster methods cannot directly apply. On the other hand, the operations of dilate and erode are a set of non-linear operators that modify the shape of a data group in the feature space, to obtain a monolithic object. Although morphological operations have demonstrated its importance in several engineering fields as image processing, its use as a clustering technique has been practically overlooked. In this work, an alternative clustering algorithm is proposed to group elements without considering the distance as a similarity criterion. In our approach, the data were separated into different groups by considering morphological operations. Under this scheme, the procedure allows the integration of data points, which present a spatial connection. Since the proposed algorithm does not use the distance in its functioning, it solves complex clustering problems which traditional clustering algorithms cannot.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 179.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. K. Bailey, Numerical taxonomy and cluster analysis, in Typologies and Taxonomies (1994)

    Google Scholar 

  2. V. Cohen-Addad, V. Kanade, F. Mallmann-Trenn, C. Mathieu, Hierarchical clustering: objective functions and algorithms. J. ACM 66(4), 26 (2019)

    Article  MathSciNet  Google Scholar 

  3. Y. Tarabalka, J.A. Benediktsson, J. Chanussot, Spectral–spatial classification of hyperspectral imagery based on partitional clustering techniques. IEEE Trans. Geosci. Remote Sens. 47(8), 2973–2987 (2009)

    Article  Google Scholar 

  4. M. Girolami, C. He, Probability density estimation from optimally condensed data samples. IEEE Trans. pattern Anal. (2003)

    Google Scholar 

  5. T. Liu, C. Rosenberg, H.A. Rowley, Clustering billions of images with large scale nearest neighbor search, in Proceedings—IEEE Workshop on Applications of Computer Vision, WACV 2007 (2007)

    Google Scholar 

  6. M. Ester, H. Kriegel, J. Sander, X. Xu, A density-based algorithm for discovering clusters in large spatial databases with noise, in KDD (1996)

    Google Scholar 

  7. C.C. Aggarwal, Data Mining (Springer, New York, 2015)

    MATH  Google Scholar 

  8. X. Wu et al., Top 10 algorithms in data mining. Knowl. Inf. Syst. 14(1), 1–37 (2008)

    Article  Google Scholar 

  9. J.E. Gentle, L. Kaufman, P.J. Rousseuw, Finding groups in data: an introduction to cluster analysis. Biometrics 47(2), 788 (1991)

    Google Scholar 

  10. S. Harikumar, P.V. Surya, K-medoid clustering for heterogeneous datasets. Procedia Comput. Sci. 70, 226–237 (2015)

    Google Scholar 

  11. U. Maulik, S. Bandyopadhyay, Genetic algorithm-based clustering technique. Pattern Recognit. 33(9), 1455–1465 (2000)

    Article  Google Scholar 

  12. G.H. Ball, D.J. Hall, ISODATA, a novel Method of data analysis and pattern classification (1965)

    Google Scholar 

  13. S. Lloyd, Least squares quantization in PCM. IEEE Trans. Inf. Theory 28(2), 129–137 (1982)

    Article  MathSciNet  Google Scholar 

  14. J. MacQueen, Some methods for classification and analysis of multivariate observations, in Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability (1967)

    Google Scholar 

  15. R.M. Gray, D.L. Neuhoff, Quantization. IEEE Trans. Inf. Theory 44(6) (1998)

    Google Scholar 

  16. S.Z. Selim, M.A. Ismail, K-means-type algorithms: a Generalized convergence theorem and characterization of local optimality. IEEE Trans. Pattern Anal. Mach. Intell. PAMI-6(1), 81–87 (1984)

    Google Scholar 

  17. H. Xiong, J. Wu, J. Chen, K-means clustering versus validation measures: a data-distribution perspective. IEEE Trans. Syst. Man, Cybern. Part B 39(2), 318–331 (2009)

    Google Scholar 

  18. S. Bandyopadhyay, S.K. Pal, B. Aruna, Multiobjective GAs, quantitative indices, and pattern classification. IEEE Trans. Syst. Man Cybern. Part B 34(5), 2088–2099 (2004)

    Article  Google Scholar 

  19. S. Bandyopadhyay, U. Maulik, Nonparametric genetic clustering: comparison of validity indices. IEEE Trans. Syst. Man Cybern. Part C (Applications Rev.) 31(1), 120–125 (2001)

    Google Scholar 

  20. D.E. Goldberg, J.H. Holland, Genetic algorithms and machine learning. Mach. Learn. 3(2/3), 95–99 (1988)

    Article  Google Scholar 

  21. L. Davis, Handbook of genetic algorithms (1991)

    Google Scholar 

  22. Z. Michalewicz, Genetic Algorithms + Data Structures = Evolution Programs (Springer, Berlin, Heidelberg, 1996)

    Book  Google Scholar 

  23. W. Song, S.C. Park, Genetic Algorithm-Based Text Clustering Technique (Springer, Berlin, Heidelberg, 2006), pp. 779–782

    Google Scholar 

  24. W. Song, J.Z. Liang, S.C. Park, Fuzzy control GA with a novel hybrid semantic similarity strategy for text clustering. Inf. Sci. (Ny) 273, 156–170 (2014)

    Article  Google Scholar 

  25. J.C. Dunn†, Well-separated clusters and optimal fuzzy partitions. J. Cybern. 4(1), 95–104 (1974)

    Google Scholar 

  26. D.L. Davies, D.W. Bouldin, A cluster separation measure. IEEE Trans. Pattern Anal. Mach. Intell. PAMI-1(2), 224–227 (1979)

    Google Scholar 

  27. A. Banerjee, R.N. Dave, Validating clusters using the Hopkins statistic, in 2004 IEEE International Conference on Fuzzy Systems (IEEE Cat. No. 04CH37542), vol. 1, pp. 149–153

    Google Scholar 

  28. W.S. Sarle, A.K. Jain, R.C. Dubes, Algorithms for clustering data. Technometrics 32(2), 227 (1990)

    Article  Google Scholar 

  29. A.F. Famili, G. Liu, Z. Liu, Evaluation and optimization of clustering in gene expression data analysis. Bioinformatics 20(10), 1535–1545 (2004)

    Article  Google Scholar 

  30. M. Halkidi, Y. Batistakis, M. Vazirgiannis, Cluster validity methods : part I

    Google Scholar 

  31. M. Halkidi, Y. Batistakis, M. Vazirgiannis, Clustering validity checking methods: part II

    Google Scholar 

  32. M. Kim, R.S. Ramakrishna, New indices for cluster validity assessment (2005)

    Google Scholar 

  33. R.M. Haralick, L.G. Shapiro, Computer and robot vision, vol. 1 (Addison-Wesley Reading, 1992)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Noé Ortega-Sánchez .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Ortega-Sánchez, N., Cuevas, E., Pérez, M.A., Osuna-Enciso, V. (2020). Clustering Data Using Techniques of Image Processing Erode and Dilate to Avoid the Use of Euclidean Distance. In: Oliva, D., Hinojosa, S. (eds) Applications of Hybrid Metaheuristic Algorithms for Image Processing. Studies in Computational Intelligence, vol 890. Springer, Cham. https://doi.org/10.1007/978-3-030-40977-7_9

Download citation

Publish with us

Policies and ethics