Abstract
Automated document classification process extracts information with a systematic analysis of the content of documents.
This is an active research field of growing importance due to the large amount of electronic documents produced in the world wide web and available thanks to diffused technologies including mobile ones.
Several application areas benefit from automated document classification, including document archiving, invoice processing in business environments, press releases and research engines.
Current tools classify or βtagβ either text or images separately.In this paper we show how, by linking image and text-based contents together, a technology improves fundamental document management tasks like retrieving information from a database or automated documents.
We present an investigation of a model of conceptual spaces for investigation using joint information sources from the text and the images forming complex documents. We present a formal model and the computable algorithms and the dataset from which we took a subset to make experiments and relative tests and results.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
- Latent Semantic Indexing
- Nuclear Norm
- Proximal Gradient
- Dimensionality Reduction Algorithm
- Authorship Attribution
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Ye, Q., Huang, Q., Gao, W., Zhao, D.: Fast and robust text detection in images and video frames. Image and Vision ComputingΒ 23, 565β576 (2005)
Ah-Pine, J., Bressan, M., Clinchant, S., Csurka, G., Hoppenot, Y., Renders, J.-M.: Crossing textual and visual content in different application scenarios. Multimedia Tools and ApplicationsΒ 42, 31β56 (2009)
Qi, C., Aggarwal, G., Tian, Q., Ji, H., Huang, T.: Exploring context and content links in social media: A latent space method. IEEE Transactions Pattern Analysis and Machine Intelligence (August 2011)
Kesorn, S., Poslad, K.: An enhanced bag of visual word vector space model to represent visual content in athletics images. IEEE Transactions on Multimedia (October 2011)
Denoyer, L., Gallinari, P.: Bayesian network model for semi-structured document classification. Information Processing and ManagementΒ 40, 807β827 (2004)
Bouguila, N., ElGuebaly, W.: Discrete data clustering using finite mixture models. Pattern RecognitionΒ 42, 33β42 (2009)
Mikhailov, D.V., Emelyanov, G.M.: Semantic clustering and affinity measure of subject-oriented language texts. Pattern Recognition and Image AnalysisΒ 20, 376β385 (2010)
Yang, L., Geng, Y., Cai, B., Hanjalic, A.: Object retrieval using visual query context. IEEE Transactions on Multimedia (July 2011)
Qin, J., Yung, N.H.C.: Scene categorization via contextual visual words. Pattern RecognitionΒ 43, 1874β1888 (2010)
Park, G., Baek, Y., Lee, H.-K.: Web image retrieval using majority-based ranking approach. Multimedia Tools and ApplicationsΒ 31, 195β219 (2006)
Chan, W., Coghill, G.: Text analysis using local energy. Pattern RecognitionΒ 34, 2523β2532 (2001)
Aronovich, L., Spiegler, I.: Cm-tree: A dynamic clustered index for similarity search in metric databases. Data & Knowledge EngineeringΒ 63, 919β946 (2007)
Sable, C.L., Hatzivassiloglou, V.: Text-based approaches for non-topical image categorization. International Journal on Digital LibrariesΒ 3, 261β275 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
Β© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Cristani, M., Tomazzoli, C. (2014). A Multimodal Approach to Exploit Similarity in Documents. In: Ali, M., Pan, JS., Chen, SM., Horng, MF. (eds) Modern Advances in Applied Intelligence. IEA/AIE 2014. Lecture Notes in Computer Science(), vol 8481. Springer, Cham. https://doi.org/10.1007/978-3-319-07455-9_51
Download citation
DOI: https://doi.org/10.1007/978-3-319-07455-9_51
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07454-2
Online ISBN: 978-3-319-07455-9
eBook Packages: Computer ScienceComputer Science (R0)