Abstract
Nowadays, the recommender system plays an important role in the real world by which we can recommend the most useful and perfect movies to the users from a large set of movies list and their ratings based on different users. Since the number of users and the movies are increasing day by day, computing the recommended movies list in a single node machine takes a very large time. Hence to reduce the computation time, we are using Hadoop framework to work in a distributed manner. Further, we have proposed a hybrid approach to recommend movies to the users by combining both the filtering techniques, i.e., user-based collaborative filtering and content-based filtering to overcome the problems of these techniques. In content-based filtering, we recommend items that are similar to the previous items which are highly rated by that user. Whereas in case of user-based collaborative filtering technique, we find out the most similar users with respect to the current user based on their cosine similarity and centered cosine similarity, and based on best similarity values, top N movies are recommended to the user by predicting the ratings of the movies. Further, to reduce the computation complexity, Hive database for Hadoop framework is used for developing SQL type scripts to perform MapReduce operations.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Resnick, P., Varian, H.R.: Recommender systems. Commun. ACM 40, 56–58 (1997)
Mooney, R.J., Roy, L.: Content-based book recommendation using learning for text categorization. In: Workshop Recommendation System: Algorithm and Evaluation (1999)
Ricci, Francesco, Rokach, Lior, Shapira, Bracha: Introduction to Recommender Systems Handbook, pp. 1–35. Recommender Systems Handbook, Springer (2011)
Breese, J.S., Heckerman, D., Kadie, C.: Empirical analysis of predictive algorithms for collaborative filtering. In: Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence (1998)
Jafarkarimi, H., Sim, A.T.H., Saadatdoost, R.: A naïve recommendation model for large databases. Int. J. Inf. Educ. Technol., June (2012)
“Welcome to Apache Hadoop!”. hadoop.apache.org. Retrieved 25 August 2016
“What is the Hadoop Distributed File System (HDFS)?”. ibm.com. IBM. Retrieved 30 Oct 2014
“Google Research Publication: MapReduce”. Retrieved 9 March 2016
“Apache Hive TM”. Retrieved 9 March 2016
De Pessemier, T., Vanhecke, K., Dooms, S., Martens, L.: Content-based recommendation algorithms on the Hadoop map reduce framework. In: 7th International Conference on Web Information Systems and Technologies, pp. 237–240 (2011)
Leskovec, J., Rajaraman, A., Ullman, J.D.: Mining of Massive Datasets, pp. 322–331 (2014)
Deshpande, M., Karypis, G.: Item-based top-N recommendation algorithms. ACM Transac. Inf. Syst. 22(1), 143–177 (2004)
Kadam, S.D., Dilip M., Siddhesh A.V.: Big data analytics—recommendation system with Hadoop framework. In: International Conference on Inventive Computation Technologies (ICICT) August (2016)
Fernández, G., et al.: Let’s go to the cinema! A movie recommender system for ephemeral groups of users. In: Proceedings of the 2014 Latin American Computing Conference, CLEI 2014, Institute of Electrical and Electronics Engineers Inc. (2014)
Ponnam, L.T.: Movie recommender system using item based collaborative filtering technique. In: International Conference on Emerging Trends in Engineering, Technology and Science (ICETETS). 24–26 Feb 2016, https://doi.org/10.1109/icetets.2016.7602983 (2016)
Saravanan, S.: Design of large-scale content-based recommender system using Hadoop map reduce framework. In: 8th International Conference on Contemporary Computing, IC3 2015, Institute of Electrical and Electronics Engineers Inc., pp. 302–307 (2015)
Movielens Dataset http://grouplens.org/datasets/movielens/latest/
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Bharti, R., Gupta, D. (2019). Recommending Top N Movies Using Content-Based Filtering and Collaborative Filtering with Hadoop and Hive Framework. In: Kalita, J., Balas, V., Borah, S., Pradhan, R. (eds) Recent Developments in Machine Learning and Data Analytics. Advances in Intelligent Systems and Computing, vol 740. Springer, Singapore. https://doi.org/10.1007/978-981-13-1280-9_10
Download citation
DOI: https://doi.org/10.1007/978-981-13-1280-9_10
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1279-3
Online ISBN: 978-981-13-1280-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)