Video parsing and browsing using compressed data

Zhang, Hongjiang; Low, Chien Yong; Smoliar, Stephen W.

doi:10.1007/BF01261227

Video parsing and browsing using compressed data

Published: March 1995

Volume 1, pages 89–111, (1995)
Cite this article

Download PDF

Access provided by CONRICYT-eBooks

Multimedia Tools and Applications Aims and scope Submit manuscript

Video parsing and browsing using compressed data

Download PDF

Hongjiang Zhang¹,
Chien Yong Low¹ &
Stephen W. Smoliar¹

268 Accesses
200 Citations
9 Altmetric
Explore all metrics

Abstract

Parsing video content is an important first step in the video indexing process. This paper presents algorithms to automate the video parsing task, including partitioning a source video into clips and classifying those clips according to camera operations, using compressed video data. We have developed two algorithms and a hybrid approach to partitioning video data compressed according to the JPEG and MPEG standards. The algorithms utilize both the video content encoded in DCT (Discrete Cosine Transform) coefficients and the motion vectors between frames. The hybrid approach integrates the two algorithms and incorporates multi-pass strategies and motion analyses to improve both accuracy and processing speed. Also, we present content-based video browsing tools which utilize the information, particularly about the shot boundaries and key frames, obtained from parsing.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

References

F. Arman, A. Hsu, and M.-Y. Chiu, “Image Processing on Compressed Data for Large Video Databases,” Proc. ACM Multimedia 93, Anaheim, CA, pp. 267–272, 1993.
D. Le Gall, “MPEG: A Video Compression Standard for Multimedia Applications,” Communications of the ACM, 34 (4), pp. 46–58, 1991.
Google Scholar
R.M. Haralick and L.G. Shapiro, “Computer and Robot Vision,” Vol. 2, Addison-Wesley, Reading, MA, 1993.
Google Scholar
M. Mills, J. Cohen, and Y.Y. Wong, “A Magnifier Tool for Video Data,” Proc. CHI '92, Monterey, CA, pp. 93–98, 1992.
A. Nagasaka and Y. Tanaka, “Automatic Video Indexing and Full-Video Search for Object Appearances,” Visual Database Systems, II, E. Knuth and L.M. Wegner, editors, North-Holland, pp. 119–133, 1991.
B.C. O'Connor, “Selecting Key Frames of Moving Image Documents: A Digital Environment for Analysis and Navigation,” Microcomputers for Information Management, 8(2), pp. 119–133, 1991.
Google Scholar
R. Steinmetz, “Data Compression in Multimedia Computing—Standards and Systems,” Multimedia Systems, 1(4), pp. 187–204, 1994.
Google Scholar
D. Swanberg, C.-F. Shu, and R. Jain, “Knowledge Guided Parsing in Video Databases,” Proc. IS&T/SPIE Conf. on Storage and Retrieval for Image and Video Databases, San Jose, CA, 1993.
L. Teodosio and W. Bender, “Salient Video Stills: Content and Context Preserved,” Proc. ACM Multimedia 93, Anaheim, CA, pp. 39–46, 1993.
Y.T. Tse and R.L. Baker, “Camera Zoom/Pan Estimation and Compensation for Video Compression,” Proc. SPIE Conf. on Image Processing Algorithms and Techniques II, Boston, MA, pp. 468–79, 1991.
H. Ueda, T. Miyatake, and S. Yoshizawa, “IMPACT: An Interactive Natural-Motion-Picture Multimedia Authoring System,” Proc. CHI'91, New Orleans, LA, pp. 343–350, 1991.
G.K. Wallace, “The JPEG Still Picture Compression Standard,” Communications of the ACM, 34(4), pp. 30–44, 1991.
Google Scholar
H.J. Zhang, A. Kankanhalli, and S.W. Smoliar, “Automatic Partitioning of Full-motion Video,” Multimedia Systems, 1(1), pp. 10–28, 1993.
Google Scholar
H.J. Zhang, C.Y. Low, Y. Gong, and S.W. Smoliar, “Video Parsing Using Compressed Data,” Proc. IS&T/SPIE Conf. on Image and Video Processing II, San Jose, CA, pp. 142–149, 1994.
H.J. Zhang and S.W. Smoliar, “Developing Power Tools for Video Indexing and Retrieval,” Proc. IS&T/SPIE Conf. on Storage and Retrieval for Image and Video Databases II, San Jose, CA, pp. 140–149, 1994.

Download references

Author information

Authors and Affiliations

Institute of Systems Science, National University of Singapore, Heng Mui Keng Terrace, Kent Ridge, 0511, Singapore, Republic of Singapore
Hongjiang Zhang, Chien Yong Low & Stephen W. Smoliar

Authors

Hongjiang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Chien Yong Low
View author publications
You can also search for this author in PubMed Google Scholar
Stephen W. Smoliar
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhang, H., Low, C.Y. & Smoliar, S.W. Video parsing and browsing using compressed data. Multimed Tools Appl 1, 89–111 (1995). https://doi.org/10.1007/BF01261227

Download citation

Issue Date: March 1995
DOI: https://doi.org/10.1007/BF01261227

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Video parsing and browsing using compressed data

Abstract

Article PDF

Similar content being viewed by others

A survey on compressed domain video analysis techniques

VideoZoom: An Interactive System for Video Summarization, Browsing and Retrieval

Indexing Video Database for a CBVCD System

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Video parsing and browsing using compressed data

Abstract

Article PDF

Similar content being viewed by others

A survey on compressed domain video analysis techniques

VideoZoom: An Interactive System for Video Summarization, Browsing and Retrieval

Indexing Video Database for a CBVCD System

Explore related subjects

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation