Abstract
Fuzzy clustering, like the known fuzzy k-means method, allows to incorporate imprecision when classifying multivariate observations into clusters. In contrast to hard clustering, when the data are divided into distinct clusters and each data point belongs to exactly one cluster, in fuzzy clustering the observations can belong to more than one cluster. The strength of the association to each cluster is measured by a vector of membership coefficients. Usually, an observation is assigned to a cluster with the highest membership coefficient. On the other hand, the refinement of the hard membership coefficients enables to consider also the possibility of assigning to another cluster according to prior knowledge or specific data structure of the membership coefficients. The aim of the paper is to introduce a methodology to reveal the real data structure of multivariate membership coefficient vectors, based on the logratio approach to compositional data, and show how to display them in presence of outlying observations using loadings and scores of robust principal component analysis.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Aitchison, J.: The statistical analysis of compositional data. Chapman & Hall, London (1986)
Aitchison, J., Greenacre, M.: Biplots of compositional data. Applied Statistics 51, 375–392 (2002)
Bezdek, J.C.: Cluster validity with fuzzy sets. J. Cybernetics 3, 58–73 (1973)
Dunn, J.C.: A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters. J. Cybernetics 3, 32–57 (1973)
Egozcue, J.J., Pawlowsky-Glahn, V., Mateu-Figueras, G., Barceló-Vidal, C.: Isometric logratio transformations for compositional data analysis. Mathematical Geology 35, 279–300 (2003)
Egozcue, J.J., Pawlowsky-Glahn, V.: Simplicial geometry for compositional data. In: Buccianti, A., Mateu-Figueras, G., Pawlowsky-Glahn, V. (eds.) Compositional Data in the Geosciences: From Theory to Practice, Geological Society, London (2006)
Filzmoser, P., Hron, K., Reimann, C.: Principal component analysis for compositional data with outliers. Environmetrics 20, 621–632 (2009)
Fisher, R.A.: The use of multiple measurements in axonomic problems. Annals of Eugenics 7, 179–188 (1936)
Gabriel, K.R.: The biplot graphic display of matrices with application to principal component analysis. Biometrika 58, 453–467 (1971)
Kaufman, L., Rousseeuw, P.J.: Finding groups in data. John Wiley & Sons, New York (1990)
Maronna, R., Martin, R.D., Yohai, V.J.: Robust statistics: theory and methods. John Wiley & Sons, New York (2006)
Mocz, G.: Fuzzy cluster analysis of simple physicochemical properties of amino acids for recognizing secondary structure in proteins. Protein Science 4, 1178–1187 (1995)
Rousseeuw, P., Van Driessen, K.: A fast algorithm for the minimum covariance determinant estimator. Technometrics 41, 212–223 (1999)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hron, K., Filzmoser, P. (2013). Robust Diagnostics of Fuzzy Clustering Results Using the Compositional Approach. In: Kruse, R., Berthold, M., Moewes, C., Gil, M., Grzegorzewski, P., Hryniewicz, O. (eds) Synergies of Soft Computing and Statistics for Intelligent Data Analysis. Advances in Intelligent Systems and Computing, vol 190. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33042-1_27
Download citation
DOI: https://doi.org/10.1007/978-3-642-33042-1_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33041-4
Online ISBN: 978-3-642-33042-1
eBook Packages: EngineeringEngineering (R0)