Abstract
Retrieval models form the theoretical basis for computing the answer to a query. They differ not only in the syntax and expressiveness of the query language, but also in the representation of the documents. Following van Rijsbergen’s approach of regarding IR as uncertain inference, we can distinguish models according to the expressiveness of the underlying logic and the way uncertainty is handled. Classical retrieval models are based on propositional logic. In the vector space model, documents and queries are represented as vectors in a vector space spanned by the index terms, and uncertainty is modelled by considering geometric similarity. Probabilistic models make assumptions about the distribution of terms in relevant and nonrelevant documents in order to estimate the probability of relevance of a document for a query. Language models compute the probability that the query is generated from a document. All these models can be interpreted within a framework that is based on a probabilistic concept space. For IR applications dealing not only with texts, but also with multimedia or factual data, propositional logic is not sufficient. Therefore, advanced IR models use restricted forms of predicate logic as their basis. Terminological (description) logics are rooted in semantic networks and terminological languages such as KL-ONE. Datalog uses function-free Horn clauses. Probabilistic versions of both approaches are able to cope with the intrinsic uncertainty of IR.
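To make two of the models named in the abstract concrete, the following is a minimal Python sketch, not the chapter's own formulation. It scores documents against a query once by cosine similarity (a common choice of geometric similarity in the vector space model) and once by the probability that a unigram language model of the document generates the query. The function names, the whitespace tokenisation, the raw term-frequency weights, the add-one smoothing, and the toy documents are all assumptions made for illustration.

```python
import math
from collections import Counter


def term_vector(text):
    """Bag-of-words term frequencies (whitespace tokenisation is an assumption)."""
    return Counter(text.lower().split())


def cosine_similarity(doc, query):
    """Vector space model: cosine of the angle between two term vectors."""
    dot = sum(doc[term] * weight for term, weight in query.items())
    doc_norm = math.sqrt(sum(v * v for v in doc.values()))
    query_norm = math.sqrt(sum(v * v for v in query.values()))
    norm = doc_norm * query_norm
    return dot / norm if norm else 0.0


def query_likelihood(doc, query, vocabulary_size):
    """Language model: P(query | document) under a unigram model.

    Add-one smoothing is an illustrative assumption; it keeps unseen
    query terms from forcing the probability to zero.
    """
    doc_length = sum(doc.values())
    probability = 1.0
    for term, count in query.items():
        term_probability = (doc[term] + 1) / (doc_length + vocabulary_size)
        probability *= term_probability ** count
    return probability


# Toy collection (hypothetical documents, for illustration only).
docs = {
    "d1": term_vector("probabilistic retrieval models estimate relevance"),
    "d2": term_vector("datalog uses horn clauses"),
}
query = term_vector("probabilistic retrieval")
vocabulary = set().union(*docs.values())

for name, doc in docs.items():
    print(name,
          round(cosine_similarity(doc, query), 3),
          query_likelihood(doc, query, len(vocabulary)))
```

Both scores rank d1 above d2 for this query, but they handle uncertainty differently: the cosine score is exactly zero for a document that shares no term with the query, whereas the smoothed language model still assigns it a small non-zero probability.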
References
M. J. Bates. Where should the person stop and the information search interface start? Information Processing & Management, 26(5): 575–591, 1990.
S. Ceri, G. Gottlob, and L. Tanca. Logic Programming and Databases. Springer, Berlin et al., 1990.
W.S. Cooper. Some inconsistencies and misidentified modeling assumptions in probabilistic information retrieval. ACM Transactions on Information Systems, 13(1): 100–111, Jan 1995.
Fabio Crestani, Mounia Lalmas, Cornelis J. van Rijsbergen, and Iain Campbell. “Is this document relevant? ... probably”: a survey of probabilistic models in information retrieval. ACM Computing Surveys, 30(4): 528–552, 1998.
N. Fuhr. Models for retrieval with probabilistic indexing. Information Processing & Management, 25(1): 55–72, 1989.
N. Fuhr. Probabilistic models in information retrieval. The Computer Journal, 35(3):243–255, 1992.
N. Fuhr. Information retrieval methods for multimedia objects. To appear in: Proceedings Dagstuhl WS Content-Based Image and Video Retrieval, 2000.
N. Fuhr and C. Buckley. A probabilistic learning approach for document indexing. ACM Transactions on Information Systems, 9(3):223–248, 1991.
Norbert Fuhr. Probabilistic Datalog: Implementing logical information retrieval for advanced applications. Journal of the American Society for Information Science, 51(2):95–110, 2000.
Djoerd Hiemstra. A linguistically motivated probabilistic model of information retrieval. In C. Nikolaou and C. Stephanidis, editors, Research and Advanced Technology for Digital Libraries: Proceedings of the Second European Conference on Research and Advanced Technology for Digital Libraries (ECDL’98), Lecture Notes in Computer Science, pages 569–584. Springer Verlag, 1998.
M.E. Maron and J.L. Kuhns. On relevance, probabilistic indexing, and information retrieval. Journal of the ACM, 7:216–244, 1960.
C. Meghini, F. Sebastiani, U. Straccia, and C. Thanos. A model of information retrieval based on a terminological logic. In Proceedings of the Sixteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 298–308, New York, 1993. ACM.
S. R. Newcomb, N. A. Kipp, and V. T. Newcomb. The “HyTime” hypermedia/time-based document structuring language. Communications of the ACM, 34(11):67–83, November 1991.
Jianyun Nie. An information retrieval model based on modal logic. Information Processing & Management, 25(5):477–491, 1989.
J. Pearl. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann, San Mateo, California, 1988.
J.M. Ponte and W.B. Croft. A language modeling approach to information retrieval. In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 275–281, New York, 1998. ACM.
S.E. Robertson. The probability ranking principle in IR. Journal of Documentation, 33: 294–304, 1977.
S.E. Robertson and K. Sparck Jones. Relevance weighting of search terms. Journal of the American Society for Information Science, 27:129–146, 1976.
G. Salton, editor. The SMART Retrieval System: Experiments in Automatic Document Processing. Prentice Hall, Englewood Cliffs, New Jersey, 1971.
Jeffrey D. Ullman. Principles of Database and Knowledge-Base Systems, volume I. Computer Science Press, Rockville (Md.), 1988.
C. J. van Rijsbergen. A non-classical logic for information retrieval. The Computer Journal, 29(6): 481–485, 1986.
S.K.M. Wong and Y.Y. Yao. On modeling information retrieval with probabilistic inference. ACM Transactions on Information Systems, 13(1):38–68, 1995.
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Fuhr, N. (2000). Models in Information Retrieval. In: Agosti, M., Crestani, F., Pasi, G. (eds) Lectures on Information Retrieval. ESSIR 2000. Lecture Notes in Computer Science, vol 1980. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45368-7_2
DOI: https://doi.org/10.1007/3-540-45368-7_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41933-4
Online ISBN: 978-3-540-45368-0
eBook Packages: Springer Book Archive