Abstract
The demand for more efficient browsing of video data is expected to increase as greater access to this type of data becomes available. This experiment looked at one technique for displaying video data using key frame surrogates that are presented as a “slide show”. Subjects viewed key frames for between one and four video clips simultaneously. Following this presentation, the subjects performed object recognition and gist comprehension tasks in order to determine human thresholds for divided attention between these multiple displays. It was our belief that subject performance would degrade as the number of slide shows shown simultaneously increased. For object recognition and gist comprehension tasks, a decrease in performance between the one slide show display and the two, three or four slide show displays was found. In the case of two or three video presentations, performance is about the same, and there remains adequate object recognition abilities and comprehension of the video clips. Performance drops off to unacceptable levels when four slide shows are displayed at once.
Preview
Unable to display preview. Download preview PDF.
References
Christel, M.G., Winkler, D.B., and Taylor, C.R., (1997). Multimedia Abstractions for a Digital Library. In Proceedings of the ACM Digital Libraries '97 Conference. Philadelphia, PA, July 1997.
Ding, (1996). A Study on Video Browsing Strategies. Technical Report CS-TR-3790 UMIACS-TR-97-40 CLIS-TR-97-06, University of Maryland at College Park
Ellis, H.C. and Hunt, R.R., (1989). Fundamentals of Human Memory and Cognition. Wm. C. Brown Publishers. Dubuque, Iowa.
Elliot, E. (1993). Watch, grab, arrange, see: Thinking with motion images via streams and collages. MSVS Thesis Document. Cambridge, MA: MIT Media Lab.
Kahneman, D. (1973). Attention and effort. Englewood Cliffs, NJ: Prentice Hall.
Kobla, V., Doermann, D., & Rosenfeld, A. (1996). Compressed domain video segmentation. Technical Report CAR-TR-839 CS-TR-3688, University of Maryland at College Park
LeDoux, J.E. and Hirst, W., eds. (1986). Mind and brain: Dialogues in cognitive neuroscience. Cambridge: Cambridge University Press.
Marchionini, (1995). Information seeking in electronic environments. Cambridge Series on Human Computer Interaction 9. Cambridge University Press. New York
O'Connor, B. C. (1991). Selecting Key Frames of Moving Image Documents: a Digital Environment for Analysis and Navigation. Microcomputers for Information Management. 8(2),119–133.
Shneiderman, B. (1998). Designing the User Interface, Strategies for Effective Human Computer Interaction, Third Edition. Addison-Wesley, Reading, MA.
Wickens, C.D. (1992). Engineering psychology and human performance. Second Edition. New York: HarperCollins.
Wickens, C.D., Sandry, D., & Vidulich, M. (1983). Compatability and resource competition between modalities of input, output, central processing. Human Factors, 25, 227–248.
Zhang, H.J., Wu, J.H., Low, C.Y. and Smoliar, S.W., (1995). A Video Parsing, Retrieval and Browsing System In Proc. of ACM Multimedia '95 San Francisco, Nov.7–9, 1995.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Slaughters, L., Shneiderman, B., Marchionini, G. (1997). Comprehension and object recognition capabilities for presentations of simultaneous video key frame surrogates. In: Peters, C., Thanos, C. (eds) Research and Advanced Technology for Digital Libraries. ECDL 1997. Lecture Notes in Computer Science, vol 1324. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0026720
Download citation
DOI: https://doi.org/10.1007/BFb0026720
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63554-3
Online ISBN: 978-3-540-69597-4
eBook Packages: Springer Book Archive