Black Box Evaluation for Operational Information Retrieval Applications

Braschler, Martin; Imhof, Melanie; Rietberger, Stefan

doi:10.1007/978-3-642-54798-0_9

Martin Braschler¹⁶,
Melanie Imhof¹⁶ &
Stefan Rietberger¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8173))

Included in the following conference series:

PROMISE Winter School

912 Accesses

Abstract

The black box application evaluation methodology described in this tutorial is applicable to a broad range of operational information retrieval (IR) applications. Contrary to popular, traditional IR evaluation approaches that are limited to measure the IR system performance on a test collection, the black box evaluation methodology considers an IR application in its entirety: the underlying system, the corresponding document collection, and its configuration/application layer. A comprehensive set of quality criteria is used to estimate the user’s perception of the application. Scores are assigned as a weighted average of results from tests that evaluate individual aspects. The methodology was validated in a small evaluation campaign. An analysis of this campaign shows a correlation between the testers’ perception of the applications and the evaluation scores. Moreover, functional weaknesses of the tested IR applications can be identified and then systematically targeted.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

An Intrinsic Framework of Information Retrieval Evaluation Measures

The symbiotic relationship between information retrieval and informetrics

Article 25 November 2014

Building a Common Framework for IIR Evaluation

Keywords

References

Rietberger, S., Imhof, M., Braschler, M., Berendsen, R., Järvelin, A., Hansen, P., García Seco de Herrera, A., Tsikrika, T., Lupu, M., Petras, V., Gäde, M., Kleineberg, M., Choukri, K.: PROMISE deliverable 4.2: Tutorial on Evaluation in the Wild (2012)
Google Scholar
Robertson, S.E., Maron, M.E., Cooper, W.S.: Probability of relevance: a unification of two competing models for document retrieval. Info. Tech: R. and.D 1, 1–21 (1982)
Google Scholar
Cleverdon, C.W.: The Cranfield tests on index language devices (1967)
Google Scholar
Voorhees, E.M.: The philosophy of information retrieval evaluation. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. LNCS, vol. 2406, pp. 355–370. Springer, Heidelberg (2002)
Chapter Google Scholar
Jansen, B.J.: Search log analysis: What it is, what’s been done, how to do it (2006)
Google Scholar
Blecic, D., Bangalore, N., Dorsch, J., Henderson, C., Koenig, M., Weller, A.: Using transaction log analysis to improve OPAC retrieval results (1998)
Google Scholar
Kohavi, R., Henne, R., Sommerfield, D.: Practical Guide to Controlled Experiments on the Web: Listen to Your Customers not to the HiPPO (2007)
Google Scholar
Radlinski, F., Kurup, M., Joachims, T.: How Does Clickthrough Data Reflect Retrieval Quality? (2008)
Google Scholar
Dunlop, M.: Reflections on Mira: Interactive evaluation in information retrieval. J. Am. Soc. Inf. Sci. 51, 1269–1274 (2000)
Article Google Scholar
Borlund, P.: User-centered evaluation of information retrieval systems. In: Information Retrieval: Searching in the 21st Century, pp. 21–37 (2009)
Google Scholar
Braschler, M., Rietberger, S., Imhof, M., Järvelin, A., Hansen, P., Lupu, M., Gäde, M., Berendsen, R., García Seco de Herrera, A.: PROMISE deliverable 2.3: Best Practices Report (2012)
Google Scholar
Braschler, M., Herget, J., Pfister, J., Schäuble, P., Steinbach, M., Stuker, J.: Evaluation der Suchfunktion von Schweizer Unternehmens-Websites (2006)
Google Scholar
Braschler, M., Heuwing, B., Mandel, T., Womser-Hacker, C., Herget, J., Schäuble, P., Stuker, J.: Evaluation der Suchfunktion deutscher Unternehmens-Websites (2009)
Google Scholar
Peters, C., Braschler, M., Clough, P.: Multilingual Information Retrieval: From Research to Practice. Springer (2012) ISBN 3642230075
Google Scholar

Download references

Author information

Authors and Affiliations

Zurich University of Applied Sciences, Winterthur, Switzerland
Martin Braschler, Melanie Imhof & Stefan Rietberger

Authors

Martin Braschler
View author publications
You can also search for this author in PubMed Google Scholar
Melanie Imhof
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Rietberger
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Padua, Italy
Nicola Ferro

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Braschler, M., Imhof, M., Rietberger, S. (2014). Black Box Evaluation for Operational Information Retrieval Applications. In: Ferro, N. (eds) Bridging Between Information Retrieval and Databases. PROMISE 2013. Lecture Notes in Computer Science, vol 8173. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54798-0_9

Download citation

DOI: https://doi.org/10.1007/978-3-642-54798-0_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54797-3
Online ISBN: 978-3-642-54798-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Black Box Evaluation for Operational Information Retrieval Applications

Abstract

Chapter PDF

Similar content being viewed by others

An Intrinsic Framework of Information Retrieval Evaluation Measures

The symbiotic relationship between information retrieval and informetrics

Building a Common Framework for IIR Evaluation

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Black Box Evaluation for Operational Information Retrieval Applications

Abstract

Chapter PDF

Similar content being viewed by others

An Intrinsic Framework of Information Retrieval Evaluation Measures

The symbiotic relationship between information retrieval and informetrics

Building a Common Framework for IIR Evaluation

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation