Abstract
This paper presents a novel approach to processing encoded video sequences prior to complete decoding. Scene changes are easily detected using DCT coefficients in JPEG and MPEG encoded video sequences. In addition, by analyzing the DCT coefficients, regions of interest may be isolated prior to decompression, increasing the efficiency of any subsequent image processing steps, such as edge detection. The results are currently used in a video browser and are part of an ongoing research project in creating large video databases. The procedure is detailed with several examples presented and studied in depth.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Arman F, Hsu A, Chiu M-Y (1993) Feature management for large video databases. In: Niblack W (ed) Storage and retrieval for image and video databases, SPIE, San Jose, pp 2–12, SPIE 1908
Arman F, Hsu A, Chiu M-Y (1993) Image processing on compressed data for large video databases. In: Proceeding of first ACM International Conference on Multimedia. SPIE, Anaheim, pp 267–272
Chang S-K, Hsu A (1992) Image information systems: where do we go from here? IEEE Trans Knowl Data Eng 4:431–442
Fathima ST (1992) Data and model-driven selection using color regions. In: Proceedings of the Image Understanding Workshop. Morgan Kaufmann, San Diego, CA, pp 705–716
Fu KS (1968) Sequential methods in pattern recognition and machine learning. Academic Press, New York
Le Gall D, (1991) MPEG: a video compression standard for multimedia applications. Commun ACM 34:46–58
Hsu YS, Prum S, Kagel JH, Anrews HC (1983) Pattern recognition experiments in mandala/cosine domain. IEEE Trans Patt Anal Mach Intell 5:512–520
Kender JR (1977) Instabilities in color transformations. In: Proceedings of the Conference on Pattern Reognition and Image Processing. IEEE, RPI, Troy, NY
Ledley RS, Baus M, Golab TJ (1980) Fundamentals of true color image processing. In: Proceedings of the International Conference on Pattern Reognition. IEEE, Miami Beach, FL, pp 791–795
Liou M (1991) Overview of thep × 64 kbits/s video coding standard. Commun ACM 34:59–63
Little TDC, Venkatesh D (1994) Video scene decomposition with the motion picture parser. SPIE Conf. on Digital Video Compression and Processing on PCs: Algorithms and Technologies. SPIE, San Jose, CA (to appear)
Nagasaka A, Tanaka Y (1991) Automatic video indexing and full-video search for object appearences. In: Knuth E, Wegner LM (eds) Proceeding of the IFIP TC2/WG2.6 Second Working Conference on Visual Database Systems. North-Holland, Amsterdam, pp 113–127
Otsuji K, Tonomura Y (1993) Projection detecting filter for video cut detection. In: Proceedings of First ACM International Conference on Multimedia. ACM Press, Anaheim, pp 251–257
Rao KR, Yip P (1990) Discrete cosine transform — algorithms, advantages, applications. Academic Press, New York
Rosenfeld A, Kak AC (1987) Digital picture processing. Academic Press, Orlando
Smith BC, Rowe A (1993) Algorithms for manipulating compressed images. IEEE Comput Graphics Applic 13, 34–42
Strickland RN, Kim C-S, McDonnell WF (1986) Luminance, hue, and saturation processing of digital color images. In: SPIE Conference on Applications of Digital Image Processing, vol 697. SPIE, San Diego, CA, pp 286–292
Suetens P, Fua P, Hanson AJ (1992) Computational strategies for object recognition. ACM Comput Surv 24:5–61
Tonomura Y, Abe S (1990) Content oriented visual interface using video icons for visual database systems. J Vis Lang Comput 1:183–198
Ueda H, Miyatake T, Yoshizawa S (1991) Impact: an interactive natural-motion-picture dedicated multimedia authoring system. In: Robertson SP, Olson GM, Olson JS (eds) Proceedings of Human Factors in Computing Systems (CHI 91). ACM, New Orleans, LA, pp 343–350
Wallace GK (1991) The JPEG still picture compression standard. Commun ACM 34:30–44
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Arman, F., Hsu, A. & Chiu, MY. Image processing on encoded video sequences. Multimedia Systems 1, 211–219 (1994). https://doi.org/10.1007/BF01268945
Issue Date:
DOI: https://doi.org/10.1007/BF01268945