On the Initialization of Two-Stage Clustering with Class-GTM

Cruz-Barbosa, Raúl; Vellido, Alfredo

doi:10.1007/978-3-540-75271-4_6

Raúl Cruz-Barbosa¹,² & Alfredo Vellido¹

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4788)

Included in the following conference series:

Conference of the Spanish Association for Artificial Intelligence

Abstract

Generative Topographic Mapping (GTM) is a probabilistic model for data clustering and visualization. It maps points, considered as prototype representatives of data clusters, from a low-dimensional latent space onto the observed data space. In semi-supervised settings, class information can be added, resulting in a model variation called class-GTM. The number of class-GTM latent points used is usually large for visualization purposes and does not necessarily reflect the class structure of the data. It is therefore convenient to group the clusters further in a two-stage procedure. In this paper, class-GTM is first used to obtain the basic cluster prototypes. Two novel methods are proposed to use this information as prior knowledge for the K-means-based second stage. We evaluate, using an entropy measure, whether these methods retain the class separability capabilities of class-GTM in the two-stage process, and whether the two-stage procedure improves on the direct clustering of the data using K-means.
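The abstract does not spell out the two proposed initialization methods or the exact form of the entropy measure, so the following is only an illustrative sketch of the general two-stage idea, not the authors' method. It assumes that first-stage prototypes are already available as plain coordinate tuples (here hard-coded, standing in for class-GTM output), that the second stage is ordinary Lloyd's K-means seeded with caller-chosen prototypes, that each data point inherits the cluster of its nearest prototype, and that cluster quality is scored by a size-weighted average of per-cluster class-label entropy. All function and variable names are hypothetical.

```python
import math
from collections import Counter

def sq_dist(x, y):
    """Squared Euclidean distance between two coordinate sequences."""
    return sum((a - b) ** 2 for a, b in zip(x, y))

def kmeans(points, init_centroids, n_iter=50):
    """Plain Lloyd's K-means started from caller-supplied centroids."""
    centroids = [list(c) for c in init_centroids]
    assign = None
    for _ in range(n_iter):
        # Assignment step: index of the nearest centroid for every point.
        new_assign = [
            min(range(len(centroids)), key=lambda k: sq_dist(x, centroids[k]))
            for x in points
        ]
        if new_assign == assign:
            break  # assignments stopped changing
        assign = new_assign
        # Update step: each centroid becomes the mean of its members.
        for k in range(len(centroids)):
            members = [x for x, a in zip(points, assign) if a == k]
            if members:
                centroids[k] = [sum(col) / len(members) for col in zip(*members)]
    return assign, centroids

def cluster_entropy(assign, labels):
    """Size-weighted average of the class-label entropy (in bits) per cluster."""
    n = len(labels)
    total = 0.0
    for k in set(assign):
        members = [y for a, y in zip(assign, labels) if a == k]
        counts = Counter(members)
        h = -sum((c / len(members)) * math.log2(c / len(members))
                 for c in counts.values())
        total += len(members) / n * h
    return total

# Stage 1 stand-in (assumption): prototypes that a class-GTM fit might yield.
prototypes = [(0.0, 0.0), (0.2, 0.1), (5.0, 5.0), (5.1, 4.9)]

# Stage 2: K-means over the prototypes, seeded with two of them.
# The seeding rule here is arbitrary; the paper's methods are not reproduced.
proto_cluster, _ = kmeans(prototypes, [prototypes[0], prototypes[2]])

# Each data point inherits the cluster of its nearest prototype (assumption).
data = [(0.1, 0.0), (0.3, 0.2), (4.9, 5.2), (5.2, 4.8)]
labels = ["a", "a", "b", "b"]
data_cluster = [
    proto_cluster[min(range(len(prototypes)),
                      key=lambda j: sq_dist(x, prototypes[j]))]
    for x in data
]
score = cluster_entropy(data_cluster, labels)
```

Under this scoring, lower values mean that clusters are purer with respect to the class labels, so the same function could be used to compare a two-stage result against K-means run directly on the data.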

Author information

Authors and Affiliations

Universitat Politècnica de Catalunya, Jordi Girona, 08034, Barcelona, Spain
Raúl Cruz-Barbosa & Alfredo Vellido
Universidad Tecnológica de la Mixteca, Car. Acatlima km. 2.5, 69000, Huajuapan, Oaxaca, México
Raúl Cruz-Barbosa

Editor information

Daniel Borrajo, Luis Castillo, Juan Manuel Corchado

About this paper

Cite this paper

Cruz-Barbosa, R., Vellido, A. (2007). On the Initialization of Two-Stage Clustering with Class-GTM. In: Borrajo, D., Castillo, L., Corchado, J.M. (eds) Current Topics in Artificial Intelligence. CAEPIA 2007. Lecture Notes in Computer Science, vol 4788. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75271-4_6

DOI: https://doi.org/10.1007/978-3-540-75271-4_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75270-7
Online ISBN: 978-3-540-75271-4
eBook Packages: Computer Science, Computer Science (R0)
