Score Normalization Using Logistic Regression with Expected Parameters

Aly, Robin

doi:10.1007/978-3-319-06028-6_60

Robin Aly²²

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8416))

Included in the following conference series:

European Conference on Information Retrieval

2975 Accesses

Abstract

State-of-the-art score normalization methods use generative models that rely on sometimes unrealistic assumptions. We propose a novel parameter estimation method for score normalization based on logistic regression, using the expected parameters from past queries. Experiments on the Gov2 and CluewebA collection indicate that our method is consistently more precise in predicting the number of relevant documents in the top-n ranks compared to a state-of-the-art generative approach and another parameter estimate for logistic regression.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

A Left-to-Right Algorithm for Likelihood Estimation in Gamma-Poisson Factor Analysis

A parameterisation-invariant modification of the score test

Article Open access 07 March 2023

Learning to Rank

References

Aly, R., Demeester, T., Robertson, S.: Probabilistic models in ir and their relationships. Information Retrieval, 1386–4564 (2013) ISSN 1386-4564, http://dx.doi.org/10.1007/s10791-013-9226-3 , doi:10.1007/s10791-013-9226-3
Arampatzis, A., Kamps, J.: A signal-to-noise approach to score normalization. In: CIKM 2009, USA. ACM (2009) ISBN 978-1-60558-512-3
Google Scholar
Arampatzis, A., Robertson, S.E.: Modeling score distributions in information retrieval. Information Retrieval 14, 1–21 (2010) ISSN 1386-4564
Google Scholar
Arampatzis, A., Kamps, J., Robertson, S.: Where to stop reading a ranked list?: threshold optimization using truncated score distributions. In: SIGIR 2009, USA, pp. 524–531. ACM (2009) ISBN 978-1-60558-483-6
Google Scholar
Callan, J.: Distributed information retrieval. In: Croft, W. (ed.) Advances in Information Retrieval. The Information Retrieval Series, vol. 7, pp. 127–150. Springer US (2000) ISBN 978-0-7923-7812-9, http://dx.doi.org/10.1007/0-306-47019-5_5 , doi:10.1007/0-306-47019-5_5
Cooper, W., Chen, A., Gey, F.C.: Experiments in the probabilistic retrieval based on staged logistic regression. In: TREC 1994. NIST (1994)
Google Scholar
Cormack, G.V., Smucker, M.D., Clarke, C.L.A.: Efficient and effective spam filtering and re-ranking for large web datasets. CoRR, abs/1004.5168 (2010)
Google Scholar
Cormack, G.V., Grossman, M.R., Hedin, B., Oard, D.W.: Overview of the trec 2011 legal track. In: TREC 2011, p. 1 (2011)
Google Scholar
Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., Lin, C.-J.: Liblinear: A library for large linear classification. Journal of Machine Learning Research 9, 1871–1874 (2008)
MATH Google Scholar
Fernández, M., Vallet, D., Castells, P.: Using historical data to enhance rank aggregation. In: SIGIR 2006, USA. ACM (2006) ISBN 1-59593-369-7
Google Scholar
Kanoulas, E., Dai, K., Pavlu, V., Aslam, J.A.: Score distribution models: assumptions, intuition, and robustness to score manipulation. In: SIGIR 2010, USA, pp. 242–249. ACM (2010) ISBN 978-1-4503-0153-4
Google Scholar
Manmatha, R., Rath, T., Feng, F.: Modeling score distributions for combining the outputs of search engines. In: SIGIR 2001, USA, pp. 267–275. ACM (2001) ISBN 1-58113-331-6
Google Scholar
Nottelmann, H., Fuhr, N.: From uncertain inference to probability of relevance for advanced ir applications. In: Sebastiani, F. (ed.) ECIR 2003. LNCS, vol. 2633, pp. 235–250. Springer, Heidelberg (2003)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Human Media Interaction, University Twente, 7522AE, Enschede, The Netherlands
Robin Aly

Authors

Robin Aly
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Amsterdam, Amsterdam, The Netherlands
Maarten de Rijke & Tom Kenter &
Centrum Wiskunde en Informatica, Amsterdam, The Netherlands and Delft University of Technology, Delft, The Netherlands
Arjen P. de Vries
University of Illinois at Urbana-Champaign, Urbana, IL, USA
ChengXiang Zhai
University of Twente, Twente, The Netheralnds and Erasmus University Rotterdam, Rotterdam, The Netherlands
Franciska de Jong
SalesPredict, Haifa, Israel
Kira Radinsky
Microsoft Research, Cambridge, UK
Katja Hofmann

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Aly, R. (2014). Score Normalization Using Logistic Regression with Expected Parameters. In: de Rijke, M., et al. Advances in Information Retrieval. ECIR 2014. Lecture Notes in Computer Science, vol 8416. Springer, Cham. https://doi.org/10.1007/978-3-319-06028-6_60

Download citation

DOI: https://doi.org/10.1007/978-3-319-06028-6_60
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-06027-9
Online ISBN: 978-3-319-06028-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Score Normalization Using Logistic Regression with Expected Parameters

Abstract

Chapter PDF

Similar content being viewed by others

A Left-to-Right Algorithm for Likelihood Estimation in Gamma-Poisson Factor Analysis

A parameterisation-invariant modification of the score test

Learning to Rank

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Score Normalization Using Logistic Regression with Expected Parameters

Abstract

Chapter PDF

Similar content being viewed by others

A Left-to-Right Algorithm for Likelihood Estimation in Gamma-Poisson Factor Analysis

A parameterisation-invariant modification of the score test

Learning to Rank

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation