Abstract
Parsing video content is an important first step in the video indexing process. This paper presents algorithms to automate the video parsing task, including partitioning a source video into clips and classifying those clips according to camera operations, using compressed video data. We have developed two algorithms and a hybrid approach to partitioning video data compressed according to the JPEG and MPEG standards. The algorithms utilize both the video content encoded in DCT (Discrete Cosine Transform) coefficients and the motion vectors between frames. The hybrid approach integrates the two algorithms and incorporates multi-pass strategies and motion analyses to improve both accuracy and processing speed. Also, we present content-based video browsing tools which utilize the information, particularly about the shot boundaries and key frames, obtained from parsing.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
F. Arman, A. Hsu, and M.-Y. Chiu, “Image Processing on Compressed Data for Large Video Databases,” Proc. ACM Multimedia 93, Anaheim, CA, pp. 267–272, 1993.
D. Le Gall, “MPEG: A Video Compression Standard for Multimedia Applications,” Communications of the ACM, 34 (4), pp. 46–58, 1991.
R.M. Haralick and L.G. Shapiro, “Computer and Robot Vision,” Vol. 2, Addison-Wesley, Reading, MA, 1993.
M. Mills, J. Cohen, and Y.Y. Wong, “A Magnifier Tool for Video Data,” Proc. CHI '92, Monterey, CA, pp. 93–98, 1992.
A. Nagasaka and Y. Tanaka, “Automatic Video Indexing and Full-Video Search for Object Appearances,” Visual Database Systems, II, E. Knuth and L.M. Wegner, editors, North-Holland, pp. 119–133, 1991.
B.C. O'Connor, “Selecting Key Frames of Moving Image Documents: A Digital Environment for Analysis and Navigation,” Microcomputers for Information Management, 8(2), pp. 119–133, 1991.
R. Steinmetz, “Data Compression in Multimedia Computing—Standards and Systems,” Multimedia Systems, 1(4), pp. 187–204, 1994.
D. Swanberg, C.-F. Shu, and R. Jain, “Knowledge Guided Parsing in Video Databases,” Proc. IS&T/SPIE Conf. on Storage and Retrieval for Image and Video Databases, San Jose, CA, 1993.
L. Teodosio and W. Bender, “Salient Video Stills: Content and Context Preserved,” Proc. ACM Multimedia 93, Anaheim, CA, pp. 39–46, 1993.
Y.T. Tse and R.L. Baker, “Camera Zoom/Pan Estimation and Compensation for Video Compression,” Proc. SPIE Conf. on Image Processing Algorithms and Techniques II, Boston, MA, pp. 468–79, 1991.
H. Ueda, T. Miyatake, and S. Yoshizawa, “IMPACT: An Interactive Natural-Motion-Picture Multimedia Authoring System,” Proc. CHI'91, New Orleans, LA, pp. 343–350, 1991.
G.K. Wallace, “The JPEG Still Picture Compression Standard,” Communications of the ACM, 34(4), pp. 30–44, 1991.
H.J. Zhang, A. Kankanhalli, and S.W. Smoliar, “Automatic Partitioning of Full-motion Video,” Multimedia Systems, 1(1), pp. 10–28, 1993.
H.J. Zhang, C.Y. Low, Y. Gong, and S.W. Smoliar, “Video Parsing Using Compressed Data,” Proc. IS&T/SPIE Conf. on Image and Video Processing II, San Jose, CA, pp. 142–149, 1994.
H.J. Zhang and S.W. Smoliar, “Developing Power Tools for Video Indexing and Retrieval,” Proc. IS&T/SPIE Conf. on Storage and Retrieval for Image and Video Databases II, San Jose, CA, pp. 140–149, 1994.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Zhang, H., Low, C.Y. & Smoliar, S.W. Video parsing and browsing using compressed data. Multimed Tools Appl 1, 89–111 (1995). https://doi.org/10.1007/BF01261227
Issue Date:
DOI: https://doi.org/10.1007/BF01261227