Abstract
We propose a novel approach classify different sports videos given their groups. First, the SURF descriptors in each key frames are extracted. Then they are used to form the visual word vocabulary (codebook) by using K-Means clustering algorithm. After that, the histogram of these visual words are computed and considered as a feature vector. Finally, we use SVM to train each classifier for each category. The classification result of the video is the production of the scores output from all of the key frames. An extensive experiment is performed on a diverse and challenging dataset of 600 sports video clips downloaded from Youtube with a total of more than 6000 minutes in length for 10 different kinds of sports.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60, 91–110 (2004)
Bay, H., Ess, A., Tuytelaars, T., Gool, L.V.: Speeded-Up Robust Features (SURF). Computer Vision and Image Understanding 110(3), 346–359
Bosch, A., Zisserman, A., Muñoz, X.: Scene Classification via pLSA. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 517–530. Springer, Heidelberg (2006)
Arthur, D., Vassilvitskii, S.: k-means++: the advantages of careful seeding. In: Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 1027–1035 (2007)
Hsu, C.-W., Chang, C.-C., Lin, C.-J.: A Practical Guide To Support Vector Classification. Technical Report, Department of Computer Science and Information Engineering, National Taiwan University, Taipei (2003)
Csurka, G., Dance, C.R., Fan, L., Willamowski, J., Bray, C.: Visual Categorization with Bags of Keypoints. In: ECCV 2004 (2004)
Sivic, J., Russell, B.C., Efros, A.A., Zisserman, A., Freeman, W.T.: Discovering object categories in image collections. In: ICCV 2005 (2005)
Grauman, K., Darrel, T.: The pyramid match kernel: discriminative classification with sets of image features. In: ICCV 2005 (2005)
Fei-Fei, L., Perona, P.: A Bayesian Hierarchical Model for Learning Natural Scene Categories. In: CVPR 2005 (2005)
Moncrieff, S., Venkatesh, S., Dorai, C.: Horror film genre typing and scene labeling via audio analysis. In: ICME 2003 (2003)
Takagi, S., Hattori, S., Yokoyama, K., Kodate, A., Tominaga, H.: Sports Video Categorizing Method Using Camera Motion Parameters. In: 2003 International Conference on Multimedia and Expo. (ICME 2003), vol. 2 (2003)
Duan, L.-Y., Xu, M., Tian, Q.: Semantic Shot Classification in Sports Video. In: Proc. of SPIE Storage and Retrieval for Media Database 2003, pp. 300–313 (2003)
Nam, J., Alghoniemy, M., Tewfik, A.H.: Audio-visual content-based violent scene characterization. In: ICIP 1988 (1998)
Lang, C., Xu, D., Jiang, Y.: Shot Type Classification in Sports Video Based on Visual Attention. In: ICCINC 2009 (2009)
Zhang, N., Guan, L.: An efficient framework on large-scale video genre classification. In: MMSP 2010 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Duong, D., Dinh, T.B., Dinh, T., Duong, D. (2012). Sports Video Classification Using Bag of Words Model. In: Pan, JS., Chen, SM., Nguyen, N.T. (eds) Intelligent Information and Database Systems. ACIIDS 2012. Lecture Notes in Computer Science(), vol 7198. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28493-9_34
Download citation
DOI: https://doi.org/10.1007/978-3-642-28493-9_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28492-2
Online ISBN: 978-3-642-28493-9
eBook Packages: Computer ScienceComputer Science (R0)