Abstract
An efficient method to recognize caption area in MPEG compressed video was presented, by making use of the contrast of I-frame to distinguish caption area with background. We define texture energy, intensity of boundary, distance of background, and texture correlation to recognize caption area and caption frame. The benefit of only analyzing I-frame is that we can make use of DCT coefficients directly without losing information. We have experimented with our algorithm, and the result of experiment indicates that the performance of the algorithm is efficient.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Li H, Doermann D, Kia O. Automatic Text Detection and Tracking in Digital Video.IEEE Trans on Image Processing, 2000,9(1): 117–156.
Jain A K, Yu B. Automatic Text Location in Images and Video Frames.Proc of International Conf on Pattern Recognition, 1998,1: 1497–1499.
Li H, Doermann D. Automatic Identification of Text in Digital Key Frames.Proc of the 14 th International Conf on Pattern Recognition, 2000,1: 618–620.
Lienhart R, Stuber F. Automatic Text Recognition in Digital Video.Proc of SPIE in Image and Video Processing, 1997,2666: 180–188.
Lienhart R. Comparison of Automatic Shot Boundary Detection Algorithms.Proc of SPIE in Image and Video Processing, 1999,3656: 29.
Mariano V Y, Kasturi R. Locating Uniform-Colored Text in Video Frames.Proc of the 15th International Conf on Pattern Recognition, 2000,4: 539–542.
Wernicke A, Lienhart R. On the Segmentation of Text in Videos.IEEE International Conf on Multimedia and Expo, 2000,3: 1511–1514.
Lim Y K, Choi S H, Lee S W. Text Extraction in MPEG Compressed Video for Content Based Indexing.Proc of the 15th International Conf on Pattern Recognition, 2000,4: 109–112.
Zhang Y, Chua T S. Detection of Text Captions in Compressed Domain Video.Proc of ACM International Conf on Multimedia, 2000,1: 201–204.
Zhong Y, Zhang H J, Jain A K. Automatic Caption Localization in Compressed Video.IEEE Trans on Pattern Analysis and Machine Intelligence, 2000,22(4): 385–392.
Sahoo P K, Soltani S, Won A K C. A Survey of Thresholding Techniques.Computer Vision. Graphics. and Image Processing, 1988,41: 233–260.
Author information
Authors and Affiliations
Additional information
Foundation item: Supported by the Natural Science Foundation of Hubei Province (2004AB174)
Biography: ZHENG Peng (1965-), male, Associate professor, research direction: scientific visualization and video information processing.
Rights and permissions
About this article
Cite this article
Peng, Z., Xiao-ping, L. & Dong-ru, Z. A method to recognize caption area in MPEG compressed video. Wuhan Univ. J. Nat. Sci. 10, 363–367 (2005). https://doi.org/10.1007/BF02830667
Received:
Issue Date:
DOI: https://doi.org/10.1007/BF02830667