Standardizing Variables in K-means Clustering

Steinley, Douglas

doi:10.1007/978-3-642-17103-1_6

Douglas Steinley²³

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organisation ((STUDIES CLASS))

1768 Accesses
31 Citations
1 Altmetric

Abstract

Several standardization methods are investigated in conjunction with the K-means algorithm under various conditions. We find that traditional standardization methods (i.e., z-scores) are inferior to alternative standardization methods. Future suggestions concerning the combination of standardization and variable selection are considered.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Least Square Consensus Clustering: Criteria, Methods, Experiments

K-Way Spectral Clustering

An extension of the K-means algorithm to clustering skewed data

Article 04 July 2018

References

Brusco, M. J., Cradit, J. D. (2001). “A Variable-Selection Heuristic for If-means Clustering,” Psychometrika, 66, 249–270.
Article MathSciNet Google Scholar
Dillon, W. R., Mulani, N., Frederick, D. G. (1989). “On the Use of Component Scores in the Presence of Group Structure,” Journal of Consumer Research, 16, 106–112.
Article Google Scholar
Hubert, L., Arabie, P. (1985). “Comparing partitions,” Journal of Classification, 2, 193–218.
Article Google Scholar
MacQueen, J. (1967). “Some Methods of Classification and Analysis of Multivariate Observations,” in Proceedings of the 5th Berkeley Symposium on Statistics and Probability, eds. L. Le Cam and J. Neyman, Berkeley, CA: University of California Press, pp. 281–297.
Google Scholar
Milligan, G. W. (1980). “An Examination of the Effect of Six Types of Error Perturbation on Fifteen Clustering Algorithms,” Psychometrika, 45, 325–342.
Article Google Scholar
Milligan, G. W. (1985). “An Algorithm for Generating Artificial Test Clusters,” Psychometrika, 50, 123–127.
Article Google Scholar
Milligan, G. W., Cooper, M. C. (1988). “A Study of Standardization of Variables in Cluster Analysis,” Journal of Classification, 5, 181–204.
Article MathSciNet Google Scholar
Schaffer, C. M., Green, P. E. (1996). “An Empirical Comparison of Variable Standardization Methods in Cluster Analysis,” Multivariate Behavioral Research, 31, 149–167.
Article Google Scholar
Späth, H. (1985). Cluster Dissection and Analysis-Theory, FORTRAN Programs, Examples. Wiley, New York.
MATH Google Scholar
Steinley, D. (2003a). “K-means Clustering: What You Don’t Know May Hurt You,” Psychometric Methods, 8, 294–304.
Article Google Scholar
Steinley, D. (2003b). “Properties of the Hubert-Arabie Adjusted Rand Index,” Manuscript submitted for publication.
Google Scholar
Steinley, D., Henson, R. (2003). “OCLUS-An Analytic Method to Generate Clusters with Known Overlap,” Manuscript submitted for publication.
Google Scholar
Stoddard, A. M. (1979). “Standardization of Measures Prior to Cluster Analysis,” Biometrics, 35, 765–773.
Article Google Scholar
Vesanto, J. (2001). “Importance of Individual Variables in the K-means Algorithm,” in Proceedings of the Pacific-Asia Conference in Knowledge Discovery and Data Mining, eds. D. Cheung, G. J. Willimas, and J. Li, New York: Springer, pp. 513–518.
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

University of Illinois Urbana-Champaign, USA
Douglas Steinley

Authors

Douglas Steinley
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Leanna House Institute of Statistics and Decision Sciences, Duke University, 27708, Durham, NC, USA
David Banks
Department of Mathematics, Illinois Institute of Technology, 10 West 32nd Street, 60616-3793, Chicago, IL, USA
Frederick R. McMorris
Faculty of Management, Rutgers University, 180 University Avenue, 07102-1895, Newark, NJ, USA
Phipps Arabie
Institute of Decision Theory, University of Karlsruhe, Kaiserstr. 12, 76128, Karlsruhe, Germany
Wolfgang Gaul

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Steinley, D. (2004). Standardizing Variables in K-means Clustering. In: Banks, D., McMorris, F.R., Arabie, P., Gaul, W. (eds) Classification, Clustering, and Data Mining Applications. Studies in Classification, Data Analysis, and Knowledge Organisation. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17103-1_6

Download citation

DOI: https://doi.org/10.1007/978-3-642-17103-1_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22014-5
Online ISBN: 978-3-642-17103-1
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Standardizing Variables in K-means Clustering

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Least Square Consensus Clustering: Criteria, Methods, Experiments

K-Way Spectral Clustering

An extension of the K-means algorithm to clustering skewed data

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Standardizing Variables in K-means Clustering

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Least Square Consensus Clustering: Criteria, Methods, Experiments

K-Way Spectral Clustering

An extension of the K-means algorithm to clustering skewed data

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation