Abstract
The growth of Internet commerce has provoked the use of Recommender Systems (RS). Adequate datasets of users and products have always been demanding to better evaluate RS algorithms. Yet, the amount of public data, especially data containing content information (attributes) is limited. In addition, the performance of RS is highly dependent on various characteristics of the datasets. Thus, few others have conducted studies on synthetically generated datasets to mimic the user-product relationship. Evaluating algorithms based on only one or two datasets is often not sufficient. A more thorough analysis can be conducted by applying systematic changes to data, which cannot be done with real data. However, synthetic datasets that include attributes are rarely investigated. In this paper, we review synthetic datasets applied in RS and present our synthetic data generation methodology that considers attributes. Furthermore, we conduct empirical evaluations on existing hybrid recommendation algorithms and other state-of-the-art algorithms using these variable synthetic data and observe their behavior as the characteristic of data varies. In addition, we also introduce the use of entropy to control the randomness of the generated data.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Aggarwal, C.C., Wolf, J.L., Wu, K.-L., Yu, P.S.: Horting hatches an egg: A new graph-theoretic approach to collaborative filtering. In: Proceedings of ACMSIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York (1999)
Agrawl, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of the 20th International Conference on Very Large Data Bases (VLDB), pp. 487–499. Morgan Kaufmann, San Francisco (1994)
Basilico, J., Hofmann, T.: Unifying collaborative and content-based filtering. In: Proceedings of the 21st International Conference on Machine Learning, Banff, Canada (2004)
Basu, C., Hirsh, H., Cohen, W.: Recommendation as classification: Using social and content-based information in recommendation. In: Proceedings of the 1998 Workshop on Recommender Systems, pp. 11–15. AAAI Press, Reston (1998)
Claypool, M., Gokhale, A., Miranda, T.: Combining content-based and collaborative filters in an online newspaper. In: Proceedings of the SIGIR-99 Workshop on Recommender Systems: Algorithms and Evaluation (1999)
Deshpande, M., Karypis, G.: Item-based top-N recommendation algorithms. ACM Transactions on Information Systems 22/1, 143–177 (2004)
Goldberg, D., Nichols, D., Oki, B.M., Terry, D.: Using collaborative filtering to weave an information tapestry. Commun. ACM 35, 61–70 (1992)
Good, N., Schafer, J.B., Konstan, J., Borchers, A., Sarwar, B., Herlocker, J., Riedl, J.: Combining Collaborative Filtering with Personal Agents for Better Recommendations. In: Proceedings of the 1999 Conference of the American Association of Artificial Intelligence (AAAI), pp. 439–446 (1999)
Herlocker, J., Konstan, J., Borchers, A., Riedl, J.: An Algorithmic Framework for Performing Collaborative Filtering. In: Proceedings of ACM SIGIR 1999, ACM Press, New York (1999)
Li, Q., Kim, M.: An Approach for Combining Content-based and Collaborative Filters. In: Proceedings of the Sixth International Workshop on Information Retrieval with Asian Languages (ACL), pp. 17–24 (2003)
Konstan, J.A., Miller, B.N., Maltz, D., Herlocker, J.L., Gordon, L.R., Riedl, J.: Group-Lens: Applying collaborative filtering to usenet news. ACM Commun. 40, 77–87 (1997)
Marlin, B., Roweis, S., Zemel, R.: Unsupervised Learning with Non-ignorable Missing Data. In: Proceedings of the 10th International Workshop on Artificial Intelligence and Statistics (AISTATS), pp. 222–229 (2005)
Melville, P., Mooney, R.J., Nagarajan, R.: Content-Boosted Collaborative Filtering for Improved Recommendations. In: Proceedings of the Eighth National Conference on Artificial Intelligence(AAAI-2002), Edmonton, Canada, pp. 187–192 (2002)
Miller, B.N., Riedl, J., Konstan, J.A.: Experiences with GroupLens: Making Usenet useful again. In: Proceedings of the 1997 USENIX Technical Conference (1997)
MovieLens (2003), Available at, http://www.grouplens.org/data
Pazzani, M.J.: A framework for collaborative, content-based and demographic filtering. Artificial Intelligence Review 13(5-6), 393–408 (1999)
Popescul, A., Ungar, L.H., Pennock, D.M., Lawrence, S.: Probabilistic models for unified collaborative and content-based recommendation in sparse-data environments. In: Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence, pp. 437–444 (2001)
Sarwar, B.M., Karypis, G., Konstan, J.A., Riedl, J.: Analysis of recommendation algorithms for E-commerce. In: Proceedings of the 2nd ACM Conference on Electronic Commerce (EC), pp. 285–295. ACM, New York (2000)
Schmidt-Thieme, L.: Compound Classification Models for Recommender Systems. In: Proceedings of the IEEE International Conference on Data Mining (ICDM), New Orleans, USA, pp. 559–570 (2005)
Traupman, J., Wilensky, R.: Collaborative Quality Filtering: Establishing Consensus or Recovering Ground Truth? In: Proceedings of WebKDD 2004: KDD Workshop on Web Mining and Web Usage Analysis, in conjunction with the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2004), August 22-25 (2004), Seattle, WA (2004)
Tso, H.L.K., Schmidt-Thieme, L.: Attribute-Aware Collaborative Filtering. In: Proceedings of the 29th Annual Conference of the German Classification Society 2005, Magdeburg, Germany (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tso, K.H.L., Schmidt-Thieme, L. (2006). Evaluation of Attribute-Aware Recommender System Algorithms on Data with Varying Characteristics. In: Ng, WK., Kitsuregawa, M., Li, J., Chang, K. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2006. Lecture Notes in Computer Science(), vol 3918. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11731139_97
Download citation
DOI: https://doi.org/10.1007/11731139_97
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33206-0
Online ISBN: 978-3-540-33207-7
eBook Packages: Computer ScienceComputer Science (R0)