Abstract
Video data consists of a sequence of shots. Over the past several years, substantial progress has been made in automatically detecting shot boundaries from changes in visual and/or audio characteristics. There has also been considerable progress in indexing such video shots by automatically extracting keywords with techniques such as speech and text recognition. Shots detected by these techniques, however, are highly fragmentary. A single shot is rarely self-contained and therefore may not carry enough information to be a meaningful unit. A meaningful interval that interests typical users generally spans several consecutive shots, yet no reliable technique exists for identifying all such meaningful intervals in advance so that every possible query can be answered.
In this paper, rather than identifying meaningful intervals beforehand, we shift our focus to computing them dynamically from fragmentarily indexed shots at query time. We achieve this goal with two techniques: glues and filters. Glues are algebraic operations that compose, from a set of shorter indexed shots, all the longer intervals that can be meaningful answers to a given query. Glue operations place no limit on the length of the resulting intervals; consequently, lengthy intervals containing several irrelevant shots may also be composed as candidate answers. We therefore provide filter functions that exclude such lengthy intervals from the answer set, so that only a few relevant intervals are returned to the user. Both glues and filters possess algebraic properties that are useful for efficient query processing.
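The glue-and-filter idea can be sketched in a few lines of Python. This is a hypothetical illustration, not the paper's actual algebra: the names `glue` and `answers` are assumptions, the glue shown is simple temporal concatenation of adjacent shots, and the filter is a plain length bound.

```python
# Illustrative sketch of glue-and-filter query answering (hypothetical
# names; the paper defines richer glue operations and filter functions).
# A shot is a tuple (start, end, keywords) on one video's timeline.
shots = [
    (0, 10, {"goal"}),
    (10, 25, {"replay"}),
    (25, 40, {"interview"}),
    (40, 55, {"goal", "celebration"}),
]

def glue(a, b):
    """Compose two temporally adjacent intervals into one longer
    interval, merging their keyword sets (a concatenation glue)."""
    if a[1] == b[0]:  # a ends exactly where b begins
        return (a[0], b[1], a[2] | b[2])
    return None

def answers(shots, query, max_len):
    """Glue every contiguous run of shots, then filter: keep only
    intervals that cover all query keywords and do not exceed the
    length bound, excluding overly long candidate answers."""
    results = []
    for i in range(len(shots)):
        interval = shots[i]
        for j in range(i, len(shots)):
            if j > i:
                interval = glue(interval, shots[j])
                if interval is None:
                    break  # not adjacent; this run cannot be extended
            start, end, kws = interval
            if query <= kws and (end - start) <= max_len:
                results.append((start, end))
    return results
```

With the sample shots, `answers(shots, {"goal", "interview"}, max_len=50)` glues runs of shots until both keywords are covered, while tightening `max_len` to 30 filters the answer set down to the single shortest covering interval.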
This work is supported partly by the Japanese Ministry of Education under Grant-in-Aid for Scientific Research on Priority Area: “Advanced Databases”, No. 08244103 and partly by Research for the Future Program of JSPS under the Project “Researches on Advanced Multimedia Contents Processing”.
© 2000 Springer Science+Business Media New York
Pradhan, S., Sogo, T., Tajima, K., Tanaka, K. (2000). A New Algebraic Approach to Retrieve Meaningful Video Intervals from Fragmentarily Indexed Video Shots. In: Arisawa, H., Catarci, T. (eds) Advances in Visual Information Management. VDB 2000. IFIP — The International Federation for Information Processing, vol 40. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-35504-7_2
DOI: https://doi.org/10.1007/978-0-387-35504-7_2
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4757-4457-6
Online ISBN: 978-0-387-35504-7