Abstract
The increasing availability of streaming data is a consequence of the continuing advancement of data acquisition technology. Such data provides new challenges to the various data analysis communities. Clustering has long been a fundamental procedure for acquiring knowledge from data, and new tools are emerging that allow the clustering of data streams. However the dynamic, temporal components of streaming data provide extra challenges to the development of stream clustering and associated visualisation techniques. In this work we combine a streaming clustering framework with an extension of a static cluster visualisation method, in order to construct a surface that graphically represents the clustering structure of the data stream. The proposed method, OpticsStream, provides intuitive representations of the clustering structure as well as the manner in which this structure changes through time.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Aggarwal, C.: A framework for diagnosing changes in evolving data streams. In: ACM SIGMOD Conference, ACM Press, New York (2003)
Aggarwal, C., Han, J., Wang, J., Yu, P.: A framework for projected clustering high dimensional data streams. In: 10th VLDB conference (2004)
Ankerst, M., Breunig, M.M., Kriegel, H.-P., Sander, J.: OPTICS: Ordering points to identify the clustering structure. In: Proceedings of ACM-SIGMOD International Conference on Management of Data (1999)
Berthold, M., Hand, D.J.: Intelligent Data Analysis: an introduction. Springer, Heidelberg (2003)
Cao, F., Ester, M., Qian, W., Zhou, A.: Density-based clustering over an evolving data stream with noise. In: 2006 SIAM Conference on Data Mining (2006)
Dhillon, I.S., Modha, D.S.: Concept decompositions for large sparse text data using clustering. Machine Learning 42(1/2), 143–175 (2001)
Hand, D.J., Heard, N.A.: Finding groups in gene expression data. J Biomed Biotechnol. 2, 215–225 (2005)
Hinneburg, A., Keim, D.A., Wawryniuk, M.: HD-Eye: visual mining of high-dimensional data. Computer Graphics and Applications, IEEE 19(5), 22–31 (1999)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Computing Surveys 31(3), 264–323 (1999)
Kriegel, H.-P., Kroger, P., Gotlibovich, I.: Incremental OPTICS: Efficient computation of updates in a hierarchical cluster ordering. In: 5th Int. Conf. on Data Warehousing and Knowledge Discovery (2003)
Newman, D.J., Hettich, S., Blake, C.L., Merz, C.J.: UCI repository of machine learning databases (1998)
Sander, J., Ester, M., Kriegel, H.-P., Xu, X.: Density-based clustering in spatial databases: The algorithm GDBSCAN and its applications. Data Mining and Knowledge Discovery 2(2), 169–194 (1998)
Tasoulis, D.K., Adams, N.M., Hand, D.J.: Unsupervised clustering in streaming data. In: IEEE International Conference on Data Mining (2006)
Tryon, C.: Cluster Analysis. Edward Brothers, Ann Arbor, MI (1939)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tasoulis, D.K., Ross, G., Adams, N.M. (2007). Visualising the Cluster Structure of Data Streams. In: R. Berthold, M., Shawe-Taylor, J., Lavrač, N. (eds) Advances in Intelligent Data Analysis VII. IDA 2007. Lecture Notes in Computer Science, vol 4723. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74825-0_8
Download citation
DOI: https://doi.org/10.1007/978-3-540-74825-0_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74824-3
Online ISBN: 978-3-540-74825-0
eBook Packages: Computer ScienceComputer Science (R0)