Skip to main content

Rating of Movie via Movie Recommendation System Based on Apache Spark Using Big Data and Machine Learning Techniques

  • Conference paper
  • First Online:
Computational Intelligence for Engineering and Management Applications

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 984))

  • 399 Accesses

Abstract

Recommendation engines are very useful for businesses to increase their revenue. They are regarded as one of the best types of machine learning models. They are responsible for predicting the choices of people and uncovering all the relationships between items so that discovery of the right choices becomes easy. They help in presenting users with items that they might not even have searched or have known about. Movies are considered as one of the most popular sources of entertainment. It is a very tedious task to search for movies according to the user’s taste from a large pool of available movies. The proposed system builds a movie recommendation engine that uses the user’s profile to find movies of similar taste as the user. The system recommends the most relevant movies to the user. Apache Spark framework is used for implementing the proposed system via Scala language. The Apache Spark machine learning library (MLlib) is used to ease the implementation. The proposed system provides analysis on various measures. The measures include the total number of ratings by a user, top ten recommended movie ids and names with predicted ratings for a particular user. At last, the performance of the system is evaluated using Root Mean Square Error (RMSE). The value of RMSE gives the accuracy of the model. The results are shown in tabular form. The results show that the model well and after some number of iterations, the value of RMSE is constant.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 299.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Ahuja S (2018) Big data–introduction, applications and future scope. Int J Emerg Technol Innov Res (IJETIR) vol 5, pp 751–754

    Google Scholar 

  2. Acharjya DP, Ahmed KP (2016) A survey on big data analytics: challenges, open research issues and tools. Int J Adv Comput Sci Appl (IJACSA), vol 7, pp 511–518.

    Google Scholar 

  3. Elgendy N, Elragal A (2014) Big data analytics: a literature review paper. In: Industrial conference on data mining (ICDM), pp 214–227

    Google Scholar 

  4. Vijayarani S, Sharmila S (2016) Research in big data—an overview. Inf Eng Int J (IEIJ) vol 4, pp 1–20

    Google Scholar 

  5. Alam JR, Asma Sajid, Ramzan Talib, & Muneeb Niaz, (2014) A review on the role of big data in business. Int J Comput Sci Mobile Comput vol 3, pp 446–453

    Google Scholar 

  6. Bokde D, Girase S, Mukhopadhyay D (2015) Matrix factorization model in collaborative filtering algorithms: a survey. In: International conference on advances in computing, communication and control (ICAC3’15), pp 136–146

    Google Scholar 

  7. Almohsen KA, Al-Jobori H (2015) Recommender systems in light of big data. Int J Electr Comput Eng (IJECE), pp 1553–1563

    Google Scholar 

  8. Panigrahi S, Lenka RK, Stitipragyan A (2016) A hybrid distributed collaborative filtering recommender engine using apache spark. In: International workshop on big data and data mining challenges on iot and pervasive systems (BigD2M 2016), pp 1000–1006

    Google Scholar 

  9. Miryala G, Gomes R, Dayananda KR (2017) Comparative analysis of movie recommendation system using collaborative filtering in spark engine. J Global Res in Comput Sci 8:10–14

    Google Scholar 

  10. Sadanand H, Vrushali D, Rohan N, Avadhut M, Rushikesh V, Harshada R (2017) Movie recommender engine using collaborative filtering. Int J Adv Res Innov, pp 599–608

    Google Scholar 

  11. FadhelAljunid M, Manjaiah DH (2017) A survey on recommendation systems for social media using big data analytics. Int J Latest Trends Eng Technol, pp 048–058

    Google Scholar 

  12. Mali KS, Shaikh SI, Shaikh SA, Mohole GP (2017) Generic recommendation engine using spark. Int J Sci Res Dev 5:471–474

    Google Scholar 

  13. Lenka RK, Barik RK, Panigrahi S, Panda SS (2018) An improved hybrid distributed collaborative filtering model for recommender engine using apache spark. Int J Int Sys Appl, pp 74–81

    Google Scholar 

  14. Sri N, Abhilash P, Avinash K, Rakesh S, Prakash CS (2018) Movie recommender system using item based collaborative filtering technique. Int J Eng Technol Sci Res (IJETSR), vol 5, pp 64–69

    Google Scholar 

  15. Shoro AG, Soomro TR (2015) Big data analysis: Ap spark perspective. Global J Comput Sci Technol (GJCST), vol 15, pp 7–14

    Google Scholar 

  16. Bhattacharya A, Bhatnagar S (2016) Big data and apache spark: a review. Int J Eng Res Sci (IJOER) vol 2, pp 206–210

    Google Scholar 

  17. Jonnalagadda VS, Srikanth P, Thumati K, Nallamala SH (2016) A review study of apache spark in big data processing. Int J Comput Sci Trends Technol (IJCST) vol 4, pp 93–98

    Google Scholar 

  18. Zaharia M, Chowdhury M, Das T, Dave A, Ma J, McCauley M, Franklin MJ, Shenker S, Stoica I (2012) Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing. In: Proceedings of the 9th USENIX conference on networked systems design and implementation, pp 15–28

    Google Scholar 

  19. Dahiya P, Chaitra B, Kumari U (2017) Survey on big data using apache Hadoop and spark. Int J Comput Eng Res Trends (IJCERT), pp 195–201

    Google Scholar 

  20. Pathrikar KK, Dudhgaonkar AA (2017) Review on apache spark technology. Int Res J Eng Technol (IRJET), pp 1386–1388

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ayasha Malik .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Malik, A., Gupta, H., Kumar, G., Sharma, R.K. (2023). Rating of Movie via Movie Recommendation System Based on Apache Spark Using Big Data and Machine Learning Techniques. In: Chatterjee, P., Pamucar, D., Yazdani, M., Panchal, D. (eds) Computational Intelligence for Engineering and Management Applications. Lecture Notes in Electrical Engineering, vol 984. Springer, Singapore. https://doi.org/10.1007/978-981-19-8493-8_55

Download citation

Publish with us

Policies and ethics