Keywords

1 Introduction

We live in the era of flood information, with data significantly increasing from various sites. With the massive data information, recommendation systems are the well-known technique to filter the knowledge data (e.g., news, movies, songs, point-of-interests, books, journals, products, etc.) information and then return suitable lists to the user. From the traditional data mining approach to the deep learning model, the model of the recommendation is a promising technique at the root of state-of-art technology [1]. In the realm of eCommerce, users have several options in finding items of interest, where recommendation systems assist people in locating and analyzing the preferred items. In the Healthcare area, recommendations have been used to suggest suitable items (e.g., food, drug, physical activity, healthcare professional, hospital place). Moreover, location-based social networks (LBSNs), including Foursquare, Facebook, Instagram, LinkedIn, and Yelp are the centric sites where users can share their physical location, user tag, comment/feedback, like/dislike, rating, real-time check-ins, and user friend connections. In addition, users are closely connected and certainly know about their real-time locations.

The existing studies have created recommendations in various domains by applying simple recommendation techniques to complex models. At first, the recommendation models are categorized into two methods. These two models are collaborative filtering (CF), and content-based filtering (CBF). According to the demand for state-of-art applications, more models are introduced including knowledge-based, demographic-based, and hybrid method, etc.

Due to the popularity of the recommendation model, researchers are exploring the recommendation model to enhance its performance and the model’s complexity.

To the best knowledge of our study, Sect. 2 introduces the detail of modern recommendation approaches. Section 3 provides simple explanations about recommendation-specific domains. Finally, Sect. 4 is described conclusion of the work.

2 Modern Recommendation Systems in the Big Data Era

2.1 How Do We Filter the Related Journals?

Within the scope of this investigation, we spot the journals from various well-known sites. Google Scholar [2] is used as a primary search engine, and in addition to this, we have also embraced Web of Science [3], which we refer to as an essential tool to discover related published papers.

Moreover, several keywords are used to filter the journal papers. These keywords include recommendation system, collaborative filtering, content-based filtering, hybrid model, traditional recommendation system, modern recommendation system, product recommendation, movie recommendation, product recommendation, news recommendation, job recommendation, location recommendation, point-of-interest recommendation, healthcare recommendation system, and recommendation challenges.

2.2 Overview of Recommendation Systems

With the massive amount of data, the recommendation system aims to predict the close items to the user. Recommendation systems collect information on their users’ preferences for items (e.g., books, journals, songs, movies, products, location, travel, etc.), which then return the personalized items to the active/target users. Data is a necessary part of the model of the recommendation. Refer to Fig. 1, there are two types of the data, namely explicit data, and implicit data. Explicit data refer to the data that users are actively provide to the system through their interaction such as like/dislike, rating and especially it’s directed collection. This type of data is powerful and beneficial to the recommendation model since the data is given the clear purpose of the user’s mind in terms of analysis. The disadvantage of this data is difficult to obtain. Another type is Implicit data which is collected indirectly indicates preference and taste. The data is mainly stored in the session system like browsing history, landing page, purchase history, and even mouse movement.

Fig. 1
A flow diagram of data is divided into explicit data and implicit data.

An overview of the basic the data

Figures 2 and 3 are shown the recommendation model from collaborative filtering, content-based filtering, knowledge-based, demographic-based, and finally, the hybrid model. More precisely, the recommendation system’s general process is displayed in Fig. 2.

Fig. 2
A flow diagram of how the users send their personal data to the traditional R Ss and receive recommended items.

Basic recommendation system model

Fig. 3
A chart of the recommendation system is divided into collaborative filtering, content-based filtering, knowledge-based, demographic-based, and hybrid models. Collaborative is further divided into memory and model-based.

Recommendation system types

2.3 Modern Recommendation Systems

Various recommendation models have been deployed in the study area’s wide range. However, there are two main categories in the recommendation systems, including collaborative filtering and content-based filtering. In our current work, we are not focusing on only two models but aim to provide more studies with other models with the interaction of the big data.

Collaborative Filtering. Collaborative filtering is the most successful technique in the recommendation system and is widely used in the recommendation area. This technique uses the behavior of the group of users to suggest to other users [4]. Simply, the recommendation is based on nearest neighbor or similar users. There are two mainly types in the collaborative filtering including user-based and item-based.

User-based: In the mechanism, the system will suggest the items based on user’s favorite and find the most similar item to the one user like [5]. Several similar metrics can be utilized to calculate the similarity between users/items, such as cosine similarity, adjusted cosine similarity, Pearson correlation coefficient (PCC).

Item-based: In collaborative filtering, item-based predicts user rating for the item similarity. For this model, their performance is better than the user-based since it suffers from several issues, such as sparsity and scalability [6]. Moreover, both user- and item-based suffer from cold-start problems (Item cold-start and User cold-start).

Content-based Filtering. The fundamental principle underlying content-based recommenders is to recommend products based on their content items’ similarity [7]. By examining the descriptions of those items, the algorithms discover and separate the primary common attributes of a particular user’s personal favorites. These choices are then saved in this user’s profile. The system then recommends more similar things to the user’s profile.

Furthermore, content-based recommendation systems can capture the user’s individual preferences and propose products that would be of little interest to other people. However, because item feature representations are built manually to some extent, this technique necessitates a great deal of domain knowledge. Furthermore, content-based recommendation systems may only propose based on the user’s existing interests, and in addition, it limits their capacity to broaden users’ existing interests.

Knowledge-based model. The primary purpose of the knowledge-based recommendation model is to suggest to the users according to basic knowledge of the users, items, and the relationship between items or users [8]. More importantly, this model has no cold-start problem because the model doesn’t depend on the user/item rating or purchase history. In addition, knowledge-based is particularly useful for complex domains where items are not frequently purchased [9]. However, this recommendation mechanism can run into trouble if people don’t have the proper domain knowledge [10].

Demographic-based model. Demographic-based is the recommendation system type where the user’s demographic data is utilized as an alternative input for making suggestions [11]. The data can be user age, gender, location, etc.

Hybrid model. Hybrid model is the method that merge multiple recommendation models to over-come the issue in the existing recommendation models.

3 Specific Domain of Recommendation Systems

Recommendation systems are utilized in various application perspectives. It is a state-of-art application where people use techniques because it assists the user to save time to filter the valuable item/information to reduce their finding times. Recommendation systems have been applied in many eCommerce companies. For instance, amazon deployed the product recommendation in their web sites to suggest the most suitable item to their target customers. Existing works [12,13,14,15] designed a recommendation model to suggest the products to the target customers.

Other applications in movie domains proposed by Ref. [16,17,18,19] created a model to recommend movie entertainment based on various machine learning mechanisms. Not only in the range of entertainment or eCommerce, but recommendation systems are also widely used in healthcare. Ozsoy et al. [20] proposed a recommendation model to identify new targets for known drugs. This model aims to over-come the problem related to traditional drug discovery by using the existing drugs to treat newly discover diseases. Ref. [21] deploys the model of recommendation to predict a patient’s health condition by analyzing the patient’s lifestyle, physical health record, and social activities.

4 Conclusion

The simple survey focuses on modern recommendation systems in the big data era. In the study, we provide a simple explanation and survey of current recommendation systems such as collaborative, content-based, knowledge-based, demographic-based, and hybrid approaches. Furthermore, we illustrate the several issues which existed in the traditional recommendation models. Due to the demand of the big data era, recommendation models are built and researched more critically to adapt to real-world application and to find valuable knowledge filtering to figure out the pattern in the massive amount of data (big data).