Skip to main content

Recommending Top N Movies Using Content-Based Filtering and Collaborative Filtering with Hadoop and Hive Framework

  • Conference paper
  • First Online:
Recent Developments in Machine Learning and Data Analytics

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 740))

Abstract

Nowadays, the recommender system plays an important role in the real world by which we can recommend the most useful and perfect movies to the users from a large set of movies list and their ratings based on different users. Since the number of users and the movies are increasing day by day, computing the recommended movies list in a single node machine takes a very large time. Hence to reduce the computation time, we are using Hadoop framework to work in a distributed manner. Further, we have proposed a hybrid approach to recommend movies to the users by combining both the filtering techniques, i.e., user-based collaborative filtering and content-based filtering to overcome the problems of these techniques. In content-based filtering, we recommend items that are similar to the previous items which are highly rated by that user. Whereas in case of user-based collaborative filtering technique, we find out the most similar users with respect to the current user based on their cosine similarity and centered cosine similarity, and based on best similarity values, top N movies are recommended to the user by predicting the ratings of the movies. Further, to reduce the computation complexity, Hive database for Hadoop framework is used for developing SQL type scripts to perform MapReduce operations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Resnick, P., Varian, H.R.: Recommender systems. Commun. ACM 40, 56–58 (1997)

    Article  Google Scholar 

  2. Mooney, R.J., Roy, L.: Content-based book recommendation using learning for text categorization. In: Workshop Recommendation System: Algorithm and Evaluation (1999)

    Google Scholar 

  3. Ricci, Francesco, Rokach, Lior, Shapira, Bracha: Introduction to Recommender Systems Handbook, pp. 1–35. Recommender Systems Handbook, Springer (2011)

    MATH  Google Scholar 

  4. Breese, J.S., Heckerman, D., Kadie, C.: Empirical analysis of predictive algorithms for collaborative filtering. In: Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence (1998)

    Google Scholar 

  5. Jafarkarimi, H., Sim, A.T.H., Saadatdoost, R.: A naïve recommendation model for large databases. Int. J. Inf. Educ. Technol., June (2012)

    Google Scholar 

  6. “Welcome to Apache Hadoop!”. hadoop.apache.org. Retrieved 25 August 2016

    Google Scholar 

  7. “What is the Hadoop Distributed File System (HDFS)?”. ibm.com. IBM. Retrieved 30 Oct 2014

    Google Scholar 

  8. “Google Research Publication: MapReduce”. Retrieved 9 March 2016

    Google Scholar 

  9. “Apache Hive TM”. Retrieved 9 March 2016

    Google Scholar 

  10. De Pessemier, T., Vanhecke, K., Dooms, S., Martens, L.: Content-based recommendation algorithms on the Hadoop map reduce framework. In: 7th International Conference on Web Information Systems and Technologies, pp. 237–240 (2011)

    Google Scholar 

  11. Leskovec, J., Rajaraman, A., Ullman, J.D.: Mining of Massive Datasets, pp. 322–331 (2014)

    Google Scholar 

  12. Deshpande, M., Karypis, G.: Item-based top-N recommendation algorithms. ACM Transac. Inf. Syst. 22(1), 143–177 (2004)

    Article  Google Scholar 

  13. Kadam, S.D., Dilip M., Siddhesh A.V.: Big data analytics—recommendation system with Hadoop framework. In: International Conference on Inventive Computation Technologies (ICICT) August (2016)

    Google Scholar 

  14. Fernández, G., et al.: Let’s go to the cinema! A movie recommender system for ephemeral groups of users. In: Proceedings of the 2014 Latin American Computing Conference, CLEI 2014, Institute of Electrical and Electronics Engineers Inc. (2014)

    Google Scholar 

  15. Ponnam, L.T.: Movie recommender system using item based collaborative filtering technique. In: International Conference on Emerging Trends in Engineering, Technology and Science (ICETETS). 24–26 Feb 2016, https://doi.org/10.1109/icetets.2016.7602983 (2016)

  16. Saravanan, S.: Design of large-scale content-based recommender system using Hadoop map reduce framework. In: 8th International Conference on Contemporary Computing, IC3 2015, Institute of Electrical and Electronics Engineers Inc., pp. 302–307 (2015)

    Google Scholar 

  17. Movielens Dataset http://grouplens.org/datasets/movielens/latest/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Deepak Gupta .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Bharti, R., Gupta, D. (2019). Recommending Top N Movies Using Content-Based Filtering and Collaborative Filtering with Hadoop and Hive Framework. In: Kalita, J., Balas, V., Borah, S., Pradhan, R. (eds) Recent Developments in Machine Learning and Data Analytics. Advances in Intelligent Systems and Computing, vol 740. Springer, Singapore. https://doi.org/10.1007/978-981-13-1280-9_10

Download citation

Publish with us

Policies and ethics