Abstract
Iterative, EM-type algorithms for data clustering and data visualization are derived on the basis of the maximum entropy principle. These algorithms allow the data analyst to detect structure in vectorial or relational data. Conceptually, the clustering and visualization procedures are formulated as combinatorial or continuous optimization problems which are solved by stochastic optimization.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
J. Buhmann and H. Kiihnel. Vector quantization with complexity costs. IEEE Transactions on Information Theory, 39 (4): 1133–1145, July 1993.
J. Buhmann and H. Kiihnel. Vector quantization with complexity costs. IEEE Transactions on Information Theory, 39 (4): 1133–1145, July 1993.
T. F. Cox and M.A.A. Cox. Multidimensional Scaling. Number 59 in Monographs on Statistics and Applied Probability. Chapman & Hall, London, 1994.
Т. М. Cover and J. A. Thomas Elements of Information Theory. John Wiley & Sons, New York, 1991.
R. O. Duda and P. E. Hart. Pattern Classification and Scene Analysis. Wiley, New York, 1973. Sect. 3.2.
A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. J. Royal Statist. Soc. Ser. B, 39: 1–38, 1977.
Thomas Hofmann and Joachim M. Buhmann. Hierarchical pairwise data clustering by mean—field annealing. In Proceedings of ICANN’95, NEURONÎMES’95, volume II, pages 197–202. EC2 & Cie, 1995.
Thomas Hofmann and Joachim M. Buhmann. An annealed “neural gas” network for robust vector quantization. In Proceedings of ICANN’96, pages 151–156, Berlin, Heidelberg, New York, 1996. Springer.
Thomas Hofmann and Joachim M. Buhmann Infering hierarchical clustering structures by deterministic annealing. In Proceedings of the Knowledge Discovery and Data Mining Conference 1996, Portland, Redwood City, CA, USA, 1996. AAAI Press. (in press).
Thomas Hofmann and Joachim M. Buhmann Pairwise data clustering by deterministic annealing. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19 (1): 1–14, 1997.
J. Hertz, A. Krogh, and R. G. Palmer. Introduction to the Theory of Neural Computation. Addison Wesley, New York, 1991.
Thomas Hofmann. Data Clustering and Beyond: A Deterministic Annealing Framework for Exploratory Data Analysis. PhD thesis, Mathematisch-Naturwissenschaftliche Fakultät, Rheinische Friedrich-Wilhelms-Universität Bonn, D-53117 Bonn, Fed. Rep. Germany, 1997.
A. K. Jain and R. C. Dubes. Algorithms for Clustering Data. Prentice Hall, Englewood Cliffs, NJ 07632, 1988.
Hansjörg Klock and Joachim M. Buhmann. Multidimensional scaling by deterministic annealing. In M.Pellilo and E.R.Hancock, editors, Proceedings EMMCVPR’97, Lecture Notes In Computer Science. Springer Verlag, 1997.
T. Kohonen. Self-organization and Associative Memory. Springer, Berlin, 1984.
S.P. Luttrell. Hierarchical vector quantizations. IEE Proceedings, 136: 405–413, 1989.
J. MacQueen. Some methods for classification and analysis of multivariate observations. In Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, pages 281–297, 1967.
G. J. McLachlan and K. E. Basford. Mixture Models. Marcel Dekker, INC, New York, Basel, 1988.
K. Rose, E. Gurewitz, and G. Fox. A deterministic annealing approach to clustering. Pattern Recognition Letters, 11 (11): 589–594, 1990.
H. Ritter, T. Martinetz, and K. Schulten. Neural Computation and Self-organizing Maps. Addison Wesley, New York, 1992.
J. W Sammon Jr. A non-linear mapping for data structure analysis. IEEE Transactions on Computers, 18: 401–409, 1969.
Yoshio Takane and Forest W. Young. Nonmetric individul differences multidimensional scaling: An alternating least squares method with optimal scaling features. Psychometrika, 42 (1): 7–67, March 1977.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1998 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Buhmann, J.M. (1998). Stochastic Algorithms for Exploratory Data Analysis: Data Clustering and Data Visualization. In: Jordan, M.I. (eds) Learning in Graphical Models. NATO ASI Series, vol 89. Springer, Dordrecht. https://doi.org/10.1007/978-94-011-5014-9_14
Download citation
DOI: https://doi.org/10.1007/978-94-011-5014-9_14
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-010-6104-9
Online ISBN: 978-94-011-5014-9
eBook Packages: Springer Book Archive