Abstract
Online news articles, as a new format of press releases, have sprung up on the Internet. With its convenience and recency, more and more people prefer to read news online instead of reading the paper-format press releases. However, a gigantic amount of news events might be released at a rate of hundreds, even thousands per hour. A challenging problem is how to effciently select specific news articles from a large corpus of newly-published press releases to recommend to individual readers, where the selected news items should match the reader's reading preference as much as possible. This issue refers to personalized news recommendation. Recently, personalized news recommendation has become a promising research direction as the Internet provides fast access to real-time information from multiple sources around the world. Existing personalized news recommendation systems strive to adapt their services to individual users by virtue of both user and news content information. A variety of techniques have been proposed to tackle personalized news recommendation, including content-based, collaborative filtering systems and hybrid versions of these two. In this paper, we provide a comprehensive investigation of existing personalized news recommenders. We discuss several essential issues underlying the problem of personalized news recommendation, and explore possible solutions for performance improvement. Further, we provide an empirical study on a collection of news articles obtained from various news websites, and evaluate the effect of different factors for personalized news recommendation. We hope our discussion and exploration would provide insights for researchers who are interested in personalized news recommendation.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
Liu J, Dolan P, Pedersen E R. Personalized news recommendation based on click behavior. In Proc. the 14th International Conference on Intelligent User Interfaces, Hong Kong, China, Feb. 7–10, 2010, pp.31-40.
Burke R. Hybrid systems for personalized recommendations. In Proc. Workshop on Intelligent Techniques for Web Personalization, Acapulco, Mexico, Aug. 11, 2005, pp.133-152.
Billsus D, Pazzani M J. User modeling for adaptive news access. User Modeling and User-Adapted Interaction, 2000, 10(2): 147–180.
Carreira R, Crato J M, Gonçalves D, Jorge J A. Evaluating adaptive user profiles for news classification. In Proc. the 9th International Conference on Intelligent User Interfaces, Funchal, Brtngal, Jan. 13–16, 2004, pp.206-212.
Kim H R, Chan P K. Learning implicit user interest hierarchy for context in personalization. Applied Intelligence, 2008, 28(2): 153–166.
Liang T P, Lai H J. Discovering user interests from web browsing behavior: An application to internet news services. In Proc. HICSS, Hawaii, USA, Jan. 7–10, 2002, pp.2718-2727.
Tan A H, Teo C. Learning user profiles for personalized information dissemination. In Proc. IEEE International Joint Conference on Computational Intelligence, Horolulu, USA, May 12–17, 2002, pp.183-188.
Jurafsky D, Martin J H, Kehler A, Vander Linden K, Ward N. Speech and Language Processing. Prentice Hall, 2000.
Hofmann T. Probabilistic latent semantic indexing. In Proc. the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Berkeley, USA, Aug. 15–19, 1999, pp.50-57.
Blei D M, Ng A Y, Jordan M I. Latent dirichlet allocation. The Journal of Machine Learning Research, 2003, 3: 993–1022
Billsus D, Pazzani M J. A personal news agent that talks, learns and explains. In Proc. the 3 rd Annual Conference on Autonomous Agents, Seattle, USA, May 1–5, 1999, pp.268-275.
Ahn J, Brusilovsky P, Grady J, He D, Syn S Y. Open user profiles for adaptive news systems: Help or harm? In Proc. the 16th International Conference on World Wide Web, Banff, Canada, May 8–12, 2007, pp.11-20.
Das A S, Datar M, Garg A, Rajaram S. Google news personalization: Scalable online collaborative filtering. In Proc. the 16th International Conference on World Wide Web, Banff, Canada, May 8–12, 2007, pp.271-280.
Resnick P, Iacovou N, Suchak M, Bergstrom P, Riedl J. GroupLens: An open architecture for collaborative filtering of net-news. In Proc. the 1994 ACM Conference on Computer Supported Cooperative Work, Chapel Hill, USA, Oct. 22–26, 1994 pp.175-186.
Sarwar B, Karypis G, Konstan J, Reidl J. Item-based collaborative filtering recommendation algorithms. In Proc. the 10th International Conference on World Wide Web, Hong Kong, China, May 1–5, 2001, pp.285-295.
Yu K, Xu X, Tao J, Ester M, Kriegel H P. Instance selection techniques for memory-based collaborative filtering. In Proc. the 2nd SIAM International Conference on Data Mining, Arlington, USA, Apr. 11–13, 2002, pp.59-74.
Breese J S, Heckerman D, Kadie C et al. Empirical analysis of predictive algorithms for collaborative filtering. In Proc. the 14th Conference on Uncertainty in Artificial Intelligence, Madison, USA, Jul. 24–26, 1998, pp.43-52.
Hofmann T. Latent semantic models for collaborative filtering. ACM Transactions on Information Systems, 2004, 22(1): 89–115.
Shani G, Heckerman D, Brafman R I. An MDP-based recommender system. Journal of Machine Learning Research, 2006, 6(2): 1265.
Schafer J B, Konstan J, Riedi J. Recommender systems in e-commerce. In Proc. the 1st ACM Conference on Electronic Commerce, Denver, USA, Nov. 3–5, 1999, pp.158-166.
Li L, Chu W, Langford J, Schapire R E. A contextual-bandit approach to personalized news article recommendation. In Proc. the 19th International Conference on World Wide Web, Raleigh, USA, Apr. 26–30, 2010, pp.661-670.
Schein A I, Popescul A, Ungar L H, Pennock D M. Methods and metrics for cold-start recommendations. In Proc. the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Tenpere, Finland, Aug. 11–15, 2002, pp.253-260.
Chu W, Park S T. Personalized recommendation on dynamic content using predictive bilinear models. In Proc. the 18th International Conference on World Wide Web, Madrid, Spain, Apr. 20–24, 2009, pp.691-700.
Li L, Wang D, Li T, Knox D, Padmanabhan B. SCENE: A scalable two-stage personalized news recommendation system. In Proc. the 34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Beijing, China, Jul. 25–29, 2011, pp.124-134.
Gionis A, Indyk P, Motwani R. Similarity search in high dimensions via hashing. In Proc. the 25th International Conference on Very Large Data Bases, Edinberg, UK, Sept. 7–10, 1999, pp.518-529.
Dean J, Ghemawat S. MapReduce: Simplified data processing on large clusters. Communications of the ACM, 2008, 51(1): 107–113.
Chu C T, Kim S K, Lin Y A, Yu Y Y, Bradski G, Ng A Y, Olukotun K. Map-reduce for machine learning on multicore. In Proc. the 2006 Conference on Neural Information Processing Systems, Vancouver, Canada, Dec. 4–7, 2006, pp.281-288.
Kang U, Tsourakakis C E, Faloutsos C. PEGASUS: A peta-scale graph mining system implementation and observations. In Proc. the 9th IEEE International Conference on Data Mining, Miami, USA, Dec. 6–9, 2009, pp.229-238.
Papadimitriou S, Sun J. Disco: Distributed co-clustering with map-reduce: A case study towards petabyte-scale end-to-end mining. In Proc. the 8th IEEE International Conference on Data Mining, Pisa, Italy, Dec. 15–19, 2008, pp.512-521.
Wang D, Zhu S, Li T, Gong Y. Comparative document summarization via discriminative sentence selection. In Proc. the 18th ACM Conference on Information and Knowledge Management, Hong Kong, China, Nov. 2–6, 2009, pp.1963-1966.
Gauch S, Speretta M, Chandramouli A, Micarelli A. User Profiles for Personalized Information Access. The Adaptive Web, 2007, pp.54-89.
Tan P N, Steinbach M, Kumar V et al. Introduction to Data Mining. Boston: Pearson Addison Wesley, 2006.
IJntema W, Goossen F, Frasincar F, Hogenboom F. Ontology-based news recommendation. In Proc. the 2010 EDBT Workshops, Laussane, Switzerland, Mar. 22–26, 2010, pp.1-6.
Cunningham D H, Maynard D D, Bontcheva D K, Tablan M V. GATE: A framework and graphical development environment for robust NLP tools and applications. In Proc. the 40th Anniversary Meeting of the Association for Computational Linguistics, Philadelphia, USA, Jul. 6–12, 2002, pp.168-175.
Nemhauser G L, Wolsey L A, Fisher M L. An analysis of approximations for maximizing submodular set functions. Mathematical Programming, 1978, 14(1): 265–294.
Khuller S, Moss A, Naor J S. The budgeted maximum coverage problem. Information Processing Letters, 1999, 70(1): 39–45.
Girolami M, Kabán A. On an equivalence between PLSI and LDA. In Proc. the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Toronto, Canada, Jul. 28-Aug. 1, 2003, pp.433-434.
Chang C C, Lin C J. LIBSVM: A library for support vector machines. ACM Trans. Intelligent Systems and Technology, 2001, 2(3): Article No.27.
Author information
Authors and Affiliations
Corresponding author
Additional information
This work is partially supported by the National Science Foundation of US under Grant Nos. IIS-0546280 and CCF-0830659 and the National Natural Science Foundation of China under Grant No. 61070151.
Electronic Supplementary Material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Li, L., Wang, DD., Zhu, SZ. et al. Personalized News Recommendation: A Review and an Experimental Investigation. J. Comput. Sci. Technol. 26, 754–766 (2011). https://doi.org/10.1007/s11390-011-0175-2
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11390-011-0175-2