Abstract
Whole-History Rating (WHR) is a new method to estimate the time-varying strengths of players involved in paired comparisons. Like many variations of the Elo rating system, the whole-history approach is based on the dynamic Bradley-Terry model. But, instead of using incremental approximations, WHR directly computes the exact maximum a posteriori over the whole rating history of all players. This additional accuracy comes at a higher computational cost than traditional methods, but computation is still fast enough to be easily applied in real time to large-scale game servers (a new game is added in less than 0.001 second). Experiments demonstrate that, in comparison to Elo, Glicko, TrueSkill, and decayed-history algorithms, WHR produces better predictions.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Coulom, R.: Bayeselo (2005), http://remi.coulom.free.fr/Bayesian-Elo/
Dangauthier, P., Herbrich, R., Minka, T., Graepel, T.: TrueSkill through time: Revisiting the history of chess. In: Platt, J.C., Koller, D., Singer, Y., Roweis, S. (eds.) Advances in Neural Information Processing Systems 20, Vancouver, Canada, MIT Press, Cambridge (2007)
Edwards, R.: Edo historical chess ratings (2004), http://members.shaw.ca/edo1/
Elo, A.E.: The Rating of Chessplayers, Past and Present. Arco Publishing, New York (1978)
Fahrmeir, L., Tutz, G.: Dynamic stochastic models for time-dependent ordered paired comparison systems. Journal of the American Statistical Association 89(428), 1438–1449 (1994)
Glickman, M.E.: Paired Comparison Model with Time-Varying Parameters. PhD thesis, Harvard University, Cambridge, Massachusetts (1993)
Glickman, M.E.: Parameter estimation in large dynamic paired comparison experiments. Applied Statistics 48(33), 377–394 (1999)
Herbrich, R., Graepel, T.: TrueSkillTM: A Bayesian skill rating system. Technical Report MSR-TR-2006-80, Microsoft Research (2006)
Hunter, D.R.: MM algorithms for generalized Bradley-Terry models. The Annals of Statistics 32(1), 384–406 (2004)
Rybicki, G.B., Hummer, D.G.: An accelerated lambda iteration method for multilevel radiative transfer. Astronomy and Astrophysics 245(1), 171–181 (1991)
Rybicki, G.B., Press, W.H.: Interpolation, realization, and reconstruction of noisy, irregularly sampled data. The Astrophysical Journal 398(1), 169–176 (1992)
Knorr-Held, L.: Dynamic rating of sports teams. The Statistician 49(2), 261–276 (2000)
Shubert, W.M.: Details of the KGS rank system (2007), http://www.gokgs.com/help/rmath.html
Sonas, J.: Chessmetrics (2005), http://db.chessmetrics.com/CM2/Formulas.asp
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Coulom, R. (2008). Whole-History Rating: A Bayesian Rating System for Players of Time-Varying Strength. In: van den Herik, H.J., Xu, X., Ma, Z., Winands, M.H.M. (eds) Computers and Games. CG 2008. Lecture Notes in Computer Science, vol 5131. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87608-3_11
Download citation
DOI: https://doi.org/10.1007/978-3-540-87608-3_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87607-6
Online ISBN: 978-3-540-87608-3
eBook Packages: Computer ScienceComputer Science (R0)