Abstract
The growing use of information visualization tools and data mining algorithms stems from two separate lines of research. Information visualization researchers believe in the importance of giving users an overview and insight into the data distributions, while data mining researchers believe that statistical algorithms and machine learning can be relied on to find the interesting patterns. This paper discusses two issues that influence design of discovery tools: statistical algorithms vs. visual data presentation, and hypothesis testing vs. exploratory data analysis. I claim that a combined approach could lead to novel discovery tools that preserve user control, enable more effective exploration, and promote responsibility.
Keynote for Discovery Science 2001 Conference, November 25–28, 2001, Washington, DC. Also to appear in Information Visualization, new journal by Palgrave/MacMillan.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Ahlberg, C. and Shneiderman, B., Visual Information Seeking: Tight coupling of dynamic query filters with starfield displays, Proc. of ACM CHI’ 94 Human Factors in Computing Systems, ACM Press, NewYork (April 1994), 313–317 + color plates.
Ankerst, M., Ester, M., and Kriegel, H.-P., Towards an effective cooperation of the user and the computer for classification, Proc. 6th ACM SIGKDD International Conf. on Knowledge Discovery and Data Mining, ACM, NewYork (2000), 179–188.
Barnett, Vic, and Lewis, Toby, Outliers in Statistical Data, JohnWiley & Son Ltd; 3rd edition (April) 1994).
Bradley, E., Time-series analysis, In Berthold, M. and Hand, E. (Editors), Intelligent Data Analysis: An Introduction, Springer (1999).
Card, S., Mackinlay, J, and Shneiderman, B. (Editors), Readings in InformationVisualization: Using Vision to Think, Morgan Kaufmann Publishers, San Francisco, CA (1999).
Fayyad, U., Piatetsky-Shapiro, G., Smyth, P., and Uthurusamy, R., (Editors), Advances in Knowledge Discovery and Data Mining. MIT Press, Cambridge, MA (1996).
Fisher, R.A., The Design of Experiments, Oliver and Boyd, Edinburgh (1935). 9th edition, Macmillan, NewYork (1971).
Han, Jiawei and Kamber, Micheline, Data Mining: Concepts and Techniques, Morgan Kaufmann Publishers, San Francisco (2000).
Hochheiser, H. and Shneiderman, B., Interactive exploration of time-series data, In Proc. Discovery Science, Springer (2001).
Hinneburg, A., Keim, D., and Wawryniuk, M., HD-Eye: Visual mining of high-dimensional data, IEEE Computer Graphics and Applications 19, 5 (Sept/Oct 1999), 22–31.
Koenemann, J. and Belkin, N., A case for interaction: A study of interactive information retrieval behavior and effectiveness, Proc. CHI’ 96 Human Factors in Computing Systems, ACM Press, NewYork (1996), 205–212.
Montgomery, D., Design and Analysis of Experiments, 3rd ed,Wiley, NewYork (1991).
Spence, Robert, Information Visualization, Addison-Wesley, Essex, England (2001).
Tukey, John, The technical tools of statistics,American Statistician 19 (1965), 23–28.Available at: http://stat.bell-labs.com/who/tukey/memo/techtools.html
Ware, M., Frank, E., Homes, F., Hall, M., and Witten, I. H., Interactive machine learning: Letting users build classifiers, International Journal of Human-Computer Studies (2001, in press).
Weizenbaum, Joseph, Computer Power and Human Reason: From Judgment to Calculation, W. H. Freeman and Co., San Francisco, CA, (1976).
Westphal, Christopher and Blaxton, Teresa, Data Mining Solutions: Methods and Tools for Solving Real-World Problems, JohnWiley & Sons (1999).
Williamson, Christopher, and Shneiderman, Ben, The Dynamic HomeFinder: Evaluating dynamic queries in a real-estate information exploration system, Proc.ACM SIGIR’92 Conference, ACM Press (1992), 338–346.
Witten, Ian, and Frank, Eibe, Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations, Morgan Kaufmann Publishers, San Francisco (2000).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Shneiderman, B. (2001). Inventing Discovery Tools: Combining Information Visualization with Data Mining. In: Jantke, K.P., Shinohara, A. (eds) Discovery Science. DS 2001. Lecture Notes in Computer Science(), vol 2226. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45650-3_4
Download citation
DOI: https://doi.org/10.1007/3-540-45650-3_4
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42956-2
Online ISBN: 978-3-540-45650-6
eBook Packages: Springer Book Archive