Skip to main content

Finding Person X: Correlating Names with Visual Appearances

  • Conference paper
Image and Video Retrieval (CIVR 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3115))

Included in the following conference series:

Abstract

People as news subjects carry rich semantics in broadcast news video and therefore finding a named person in the video is a major challenge for video retrieval. This task can be achieved by exploiting the multi-modal information in videos, including transcript, video structure, and visual features. We propose a comprehensive approach for finding specific persons in broadcast news videos by exploring various clues such as names occurred in the transcript, face information, anchor scenes, and most importantly, the timing pattern between names and people. Experiments on the TRECVID 2003 dataset show that our approach achieves high performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Smeulders, et al.: Content-Based Image Retrieval at the End of the Early Years. IEEE Trans. Pattern Analysis and Machine Intelligence 22(12), 1349–1379 (2000)

    Article  Google Scholar 

  2. Zhang, H.J., Kankanhalli, A., Smoliar, S.W.: Automatic partitioning of full-motion video. ACM Multimedia Systems 1(1) (1993)

    Google Scholar 

  3. Hauptmann, A., et al.: Informedia at TRECVID 2003: Analyzing and Searching Broadcast News Video. In: Proceedings of TREC 2003 (2003)

    Google Scholar 

  4. Satoh, S., Kanade, K.: NAME-IT: Association of Face and Name in Video. In: IEEE Computer Society Conf. on Computer Vision and Pattern Recognition, pp. 775–781 (1997)

    Google Scholar 

  5. The NIST TREC Video Retrieval Evaluation, http://www-nlpir.nist.gov/projects/trecvid/

  6. Salton, G., McGill, M.: Introduction to Modern Information Retrieval. McGraw-Hill, New York (1983)

    MATH  Google Scholar 

  7. Baeza-Yates, R., Ribeiro-Neto, N.: Modern Information Retrieval. Addison Wesley, Essex (1999)

    Google Scholar 

  8. Zhai, C., Lafferty, J.: A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval. In: Proc. 24th Int’l ACM SIGIR Conf, pp. 334–342 (2001)

    Google Scholar 

  9. Pentland, A., Moghaddam, B.: Starne,r T.: View-Based and Modular Eigenspaces for Face Recognition IEEE Conference on Computer Vision & Pattern Recognition (1994)

    Google Scholar 

  10. Schneiderman, H., Kanade, T.: Object Detection Using the Statistics of Parts. International Journal of Computer Vision (2003)

    Google Scholar 

  11. Chen, M.Y., Hauptmann, A.: Searching for a Specific Person in Broadcast News Video. In: Int’l Conf. on Acoustics, Speech, and Signal Processing (May 2004) (to appear)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yang, J., Chen, My., Hauptmann, A. (2004). Finding Person X: Correlating Names with Visual Appearances. In: Enser, P., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A.F., Smeulders, A.W.M. (eds) Image and Video Retrieval. CIVR 2004. Lecture Notes in Computer Science, vol 3115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27814-6_34

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-27814-6_34

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22539-3

  • Online ISBN: 978-3-540-27814-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics