Abstract
We present a system based on a Bayesian network formalism for structured document retrieval. The parameters of this model are learned from the document collection (documents, queries and assessments). The paper focuses on an algebra designed for interpreting structured information queries, which can be used within our Bayesian network framework. With this algebra, the representation of the information need is independent of the structured query language. It allows us to answer both vague and strict structured queries.
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vittaut, JN., Piwowarski, B., Gallinari, P. (2005). An Algebra for Structured Queries in Bayesian Networks. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds) Advances in XML Information Retrieval. INEX 2004. Lecture Notes in Computer Science, vol 3493. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11424550_9
DOI: https://doi.org/10.1007/11424550_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26166-7
Online ISBN: 978-3-540-32053-1
eBook Packages: Computer Science (R0)