Abstract
The most important aspects in computer and mobile robotics are both visual object and place recognition; they have been used to tackle numerous applications via different techniques as established previously in the literature, however, combining the machine learning techniques for learning objects to obtain best possible recognition and as well as to obtain its image descriptors for describing the content of the image fully is considered as another vital way which can be used in computer vision. Thus, the ability of the system is to learn and describe the structural features of objects or places more effectively, which in turn; it leads to a correct recognition of objects. This paper introduces a method that uses Naive Base to combine the Kernel Principle Component (KPCA) features with HOG features from the visual scene. According to this approach, a set of SURF features and Histogram of Gradient (HOG) are extracted from a given image. The minimum Euclidean Distance between all SURF features is computed from the visual codebook which was constructed by K-means previously to be combined with HOG features. A classification method such as Support Vector Machine (SVM) was used for data analysis and the results indicate that KPCA with HOG method significantly outperforms bag of visual keyword (BOW) approach on Caltech-101 object dataset and IDOL visual place dataset.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Wenyu, C., Wenzhi, X., Ru, Z.: Method of item recognition based on SIFT and SURF, Mathematical Structures in Computer Science 24(5) (2014)
Suaib, N.M., Marhaban, M.H., Saripan, M.I., Ahmad, S.A.: Performance evaluation of feature detection and feature matching for stereo visual odometry using SIFT and SURF. In: 2014 IEEE Region 10 Symposium, pp. 200–203 (2014)
Sivic, J., Zisserman, A.: Video google: a text retrieval approach to object matching in videos. In: ICCV 2003: Proceedings of the Ninth IEEE International Conference on Computer Vision, p. 1470 (2003)
Jiang, Y.-G., Ngo, C.-W., Yang, J.: Towards optimal bag-of-features for object categorization and semantic video retrieval. In: CIVR 2007: Proceedings of the 6th ACM International Conference on Image and Video Retrieval, pp. 494–501 (2007)
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Lost in quantization: improving particular object retrieval in large scale image databases. In: Proceedings of the IEEE Conference on Computer Visionand Pattern Recognition (2008)
Huang, J., Kumar, S.R., Mitra, M., Zhu, W.J., Zabih, R.: Image indexingusing color correlograms. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, p. 762 (1997)
Gandhali, P.S., Debasis, M.: Correlogram Method for Comparing Bio-Sequences. Technical Report FIT-CS-2006-01, Master’s Thesis, Florida Institute of Technology (2006)
Csurka, G., Dance, C., Fan, L., Bray, C.: Visual categorization with bag of keypoints. In: The 8th European Conference on Computer Vision, pp. 513–516 (2004)
Perronnin, F., Dance, C., Csurka, G., Bressan, M.: Adapted vocabulariesfor generic visual categorization. In: European Conference on Computer Vision (ECCV 2006), pp. 464–475 (2006)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Computing Surveys 31(3), 264–323 (1999)
Forstner, W., Moonen, B.: A metric for covariance matrices. Technical report, Dept. of Geodesy and Geoinformatics, Stuttgart University (1999)
Tian, J., Qiuxia, H., Xiaoyi, M., Mingyu, H.: An Improved KPCA/GA-SVM Classification Model for Plant Leaf Disease Recognition. Journal of Computational Information Systems 8(18), 7737–7745 (2012)
Schölkopf, B., Smola, A.J., Müller, K.-R.: Nonlinear component analysis as a kernel eigenvalue problem. Neural Computation 5, 1299–1319 (1998)
Baudat, G., Anouar, F.: Generalized Discriminant Analysis Using a Kernel Approach. Neural Computation 12(10), 2385–2404 (2000)
Artač, M., Jogan, M., Leonardis, A.: Mobile robot localization using an incremental eigenspace model. In: IEEE International Conferenceon Robotics and Automation, Washington, D.C., pp. 1025–1030 (2002)
Dzati, A.R., Salwani, I., Haryati, J.: Robust palm print verification system based on evolution kernel principal component analysis. In: IEEE International Conference on Control System, Computing and Engineering 2014 (ICCSCE 2014) (2014)
Jogan, M., Leonardis, A., Wildenauer, H., Bischof, H.: Mobile robot localization under varying illumination. In: The 16th International on Pattern Recognition, pp. 2385–2404 (2000)
Kröse, B., Bunschoten, R.: Probabilistic localization by appearance models and active vision. In: Proceedings of the IEEE International Conference on Robotics and Automation, pp. 2255–2260 (1999)
Hong, M.L., Dong, M.Z., Ren, C.N., Xiang, L., Hai, Y.D.: Face Recognition Using KPCA and KFDA. AMM, pp. 380–384:3850–3853 (2013)
Sim, R., Dudek, G.: Learning landmarks for robot localization. In: Proceedings of the National Conference on Artificial Intelligence SIGART/AAAI Doctoral Consortium, Austin, TX, SIGART/AAAI, pp. 1110–1111. AAAI Press (2000)
Phiwmal, N., Sanguansat, P.: An Improved Feature Extraction and Combination of Multiple Classifiers for Query-by-Humming. The International Arab Journal of Information and Technology 11(1) 103–110 (2014)
Bay, H., Tuytelaars, T., Van Gool, L.: Speeded up robust features. ETH Zurich, Katholieke Universiteit Leuven, vol. 3951, pp 404–417. Springer, Heidelberg (2006)
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proc. of CVPR 2006 (2006)
Suvi, T., Kai, N., Mikko, T., Antti, K., Tapio, S.: ECG-derived respiration methods: Adapted ICA and PCA. Medical Engineering & Physics (2015)
Vipsita, S., Shee, B.K., Rath, S.K.: Protein superfamily classification using kernel principal component analysis and probabilistic neural networks. In: 2011 Annual IEEE India Conference (INDICON) (2011)
Pronobis, A., Caputo, B., Jensfelt, P., Christensen, I.: A realistic benchmark for visual indoor place recognition. Robotics and Autonomous System 58(1), 81–96 (2009)
Lu, L., Jianhua, Y., Evrim, T., Ronald, M.S.: Multilevel Image Recognition using Discriminative Patches and Kernel Covariance, SPIE Medical Imaging (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Ali, A.M., Rashid, T.A. (2016). Kernel Visual Keyword Description for Object and Place Recognition. In: Thampi, S., Bandyopadhyay, S., Krishnan, S., Li, KC., Mosin, S., Ma, M. (eds) Advances in Signal Processing and Intelligent Recognition Systems. Advances in Intelligent Systems and Computing, vol 425. Springer, Cham. https://doi.org/10.1007/978-3-319-28658-7_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-28658-7_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28656-3
Online ISBN: 978-3-319-28658-7
eBook Packages: EngineeringEngineering (R0)