A Personalized Recommender System Using Conceptual Dynamics

Sammulal, P.; Venu Gopalachari, M.

doi:10.1007/978-981-10-2471-9_21

P. Sammulal & M. Venu Gopalachari

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 507)


Abstract

E-commerce applications have become a popular way to meet the growing demand for information and are now a common choice for seeking information and expressing opinions through reviews. Recommender systems play a key role in serving users with the best Web services by suggesting items or pages they are likely to prefer, which keeps users clear of the information overload problem. Past research on recommenders mostly focused on improving the quality of suggestions using the user’s historical navigational patterns, but little emphasis has been given to the concept drift of the user in the current session. In this paper, a new recommender model is proposed that not only identifies the access sequence of the user according to the domain knowledge, but also identifies the concept drift of the user and recommends accordingly. The proposed approach is evaluated by comparing it with existing algorithms, and the comparison suggests that it does not sacrifice the accuracy of the recommendations.

1 Introduction

The Internet has left a significant mark on all fields, such as e-commerce, science and technology, education and research, and telecommunication. Over the past couple of decades, research and development of Web services has grown exponentially, accelerated by cutting-edge technologies such as big data and cloud computing. Popular service providers on the Web such as Netflix, Last.fm and Amazon try to satisfy their customers by predicting their interests in the domain by means of recommender systems.

Recommender systems provide personalized recommendations and come in various types according to the strategy used. The first is content based (CB), in which the recommender analyzes the user’s access sequence; the second is collaborative filtering (CF), which aggregates the interests of a user’s neighbors. Over time, hybrid recommenders evolved that combine the features of CB and CF to improve suggestions. Recommenders generally focus on the patterns in the customer’s navigation sequence, drawn from the user’s past history. The log file on the server can be the source for finding the access patterns of a user under various criteria.

Broadly, two issues can be identified in this scenario. The first is that if these patterns do not consider the true semantics behind the access sequence, the outcome will limit the quality of prediction. That means the recommender must have domain knowledge to provide meaningful suggestions, so that the user can be satisfied. For instance, if the user is accessing a movie portal such as Netflix, then the access patterns should be generalized to the genre of the movie rather than the title of the movie, assuming that the genre conveys the semantics of a movie page. To achieve this, one has to construct and incorporate knowledge using the ontology of that particular domain, and the real challenge is constructing this knowledge with reasonable efficiency. The second issue is recommending according to the dynamics of the user’s interest, which drifts from one concept to another.

The patterns identified for a customer, even with domain knowledge, give only a historical view of the user profile. If the user moves to another concept that is not in the pattern, the suggestions will leave the user dissatisfied. For example, assume that the recommender has stored the access pattern concept for a user as “romantic movies” and suggests accordingly, and that the user is currently accessing a set of “action movies”; the suggestions then dissatisfy the user, revealing a lag in predictive accuracy.

In this paper, a recommender system is proposed that is focused on resolving the two above-mentioned issues. To accomplish this, the proposed system includes the following tasks:

Develop a methodology to construct the domain knowledge to identify the concepts.
Develop a model to find the sequence patterns by integrating the knowledge.
Propose a recommendation strategy that also identifies the concept drift in the access pattern and suggests accordingly.

Experiments carried out on benchmark data sets show an improvement in the performance of the proposed framework when compared with a popular existing model, and evaluation with appropriate measures also indicates the importance of analyzing the interest drift of the user.

The rest of the paper is organized as follows. In Sect. 2, related work is presented. In Sect. 3, the architecture and design details of the proposed recommender strategy are described. Section 4 analyzes the implementation and experimentation part. Finally, Sect. 5 gives the conclusion followed by references.

2 Related Work

Adomavicius and Tuzhilin, in [1], discussed the classification of recommenders as content based [2, 3], collaborative filtering [4] and hybrid methods [5, 6]. Although many researchers concentrated their efforts on improving the accuracy of the recommender through the technique used [7], some focused on metrics such as “diversity” (average dissimilarity among all recommendation pairs) [8], individual diversity (average dissimilarity of recommendation pairs limited to a user) [9] and aggregate diversity (average dissimilarity across all users) [10], treating them as being as important as accuracy for satisfying the user. Some works [11] proposed recommenders that consider a new kind of metric, “novelty” (the user’s surprise w.r.t. the time spent searching for a page or item), in the evaluation of recommendations and act accordingly. However, none of these studies considered the concept of the item or the user, and none tried to identify the drift of concept.

In general, access sequence patterns can be learned by probabilistic algorithms and association analysis [12]. Ezeife and Lu proposed a sequence pattern mining algorithm using a tree structure, called PLWAP-Mine [13], which showed better results than other pattern mining techniques. In [14], Nguyen showed that PLWAP-Mine integrated with the Markov model can enhance the performance of mining. However, these algorithms consider only the usage history and ignore the semantics, which limits the quality of recommendations.

L. Wei and S. Lei built a model that integrates the ontology with usage mining, so that the patterns are subject oriented rather than item or page oriented, to improve the performance of the recommender [15]. Several ontologies have been constructed for different domains, such as personalized e-learning and software, to generate recommendations from an ontology built on the significant terms used in the Web site [3]. S. Salin and P. Senkul proposed applying these domain concepts to access sequence instances as well and then tried to make accurate suggestions [16]. These studies did not consider the dynamics of the ontology instance and did not address the efficiency of constructing the ontology.

3 Recommender Model with Conceptual Semantics

The proposed recommender model, based on concepts and their dynamics, is defined in three layers as shown in Fig. 1. In the first layer, the ontology for the domain is constructed by manual, automatic or semi-automatic approaches. Though several techniques are available for constructing an ontology, there is still demand for ontologies customized to particular domains and purposes. The obvious source for this task is the Web content itself, but processing such a large volume of content is tedious. Instead, one can use the informal information available in or associated with a Web page, such as the title, tags or URL of the page, to construct the domain knowledge. The second layer of the model finds the patterns in the user’s access sequence. Generally, sequence pattern modeling uses the Web usage log, after some traditional preprocessing techniques, to find patterns. The proposed model extends the preprocessing to produce a semantic log, so that concept-oriented patterns can be extracted. The recommendation strategy is defined in the third layer, which uses not only the patterns provided by the second layer to assess the user’s navigation, but also the domain ontology constructed in the first layer to identify concept drift, if any. Finally, the recommendation strategy in this layer suggests items or Web pages to the user.

3.1 Conceptual Knowledge Construction

Here, the title of the page and the tags provided for a Web page are used as sources from which concepts are derived to generate the ontology. Generally, the tags and title of a Web page contain key terms that represent the content of the page. The idea is to define the concepts from these key terms depending on the number of occurrences and the combinations of the terms. This task is accomplished in steps that define the concepts and the relationships among them.

3.1.1 Defining Concepts

Let {T₁, T₂…T_n} be the titles and tags of the n pages or items in the domain.

Step 1: The stop word removal technique is applied to the titles and tags of the pages, so that only a set of raw terms {w₁, w₂…w_k} is derived from each page title.
Step 2: Find frequent terms by means of the association analysis technique, which gives the significance of a set of terms that can be assumed to be a concept C_i.
Step 3: From the derived concept set C = {C₁, C₂…C_m}, identify the most generalized and specialized concepts.
Step 4: Identify a relationship from each concept to at least one of the generalized concepts.
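
The four steps above can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the authors’ implementation: the stop word list, the sample titles, the `min_support` threshold and the function names are all hypothetical, frequent term sets are found by plain co-occurrence counting rather than a full association analysis, and single-term concepts are treated as the generalized ones.

```python
from collections import Counter
from itertools import combinations

# Illustrative stop word list (assumption; a real system would use a fuller list).
STOP_WORDS = {"the", "are", "is", "a", "an", "of", "and", "in"}


def raw_terms(text):
    """Step 1: lowercase, split on whitespace and drop stop words."""
    return {w for w in text.lower().split() if w not in STOP_WORDS}


def frequent_term_sets(texts, min_support=2, max_size=2):
    """Step 2: count term sets that co-occur in a title/tag string and
    keep those seen in at least `min_support` strings as candidate concepts."""
    counts = Counter()
    for text in texts:
        terms = sorted(raw_terms(text))
        for size in range(1, max_size + 1):
            counts.update(combinations(terms, size))
    return {ts: c for ts, c in counts.items() if c >= min_support}


def generalization_links(concepts):
    """Steps 3-4: treat single-term concepts as generalized (assumption) and
    link every multi-term concept to the generalized concepts it contains."""
    general = {c for c in concepts if len(c) == 1}
    return {
        c: sorted(g for g in general if g[0] in c)
        for c in concepts
        if len(c) > 1
    }


# Hypothetical page titles.
titles = [
    "the romantic comedy movie",
    "romantic drama movie",
    "action movie",
    "the action thriller",
]
concepts = frequent_term_sets(titles)
links = generalization_links(concepts)
print(concepts)
print(links)
```

With these sample titles, the specialized concept ("movie", "romantic") ends up linked to the generalized concepts ("movie",) and ("romantic",).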

3.2 Sequence Pattern Mining with Concepts

This layer of the model yields the access patterns of the user according to the domain ontology, which is constructed and stored with concepts and the relationships among them. The task has two parts: preprocessing and pattern mining.

3.2.1 Preprocessing

The literature provides many data preprocessing techniques, grouped into four classes (data cleaning, reduction, integration and transformation), to improve the accuracy of mining. Here, the Web log contains the access records of all users across all sessions for a particular period. This log information acts as the raw input for extracting the navigational sequences of the user. Generally, a record in the log contains the IP address, time stamp of access, title of the page, URL of the page, protocol used, session id and typically the tags of the page. Data reduction is the first step, applied to remove unnecessary fields such as the protocol name from the log table. Information such as time stamps or session ids must be transformed into a format that can be processed. The log then undergoes data selection, so that only the records of the particular user are extracted along with their session ids (Fig. 2).

After the traditional preprocessing, advanced techniques are applied to transform the log into a semantic log. In this model, stop word removal is applied to the page titles to drop terms such as ‘the’, ‘are’ and ‘is’. Thereafter, term extraction considers the raw terms and their combinations by means of the frequency measure. Then, annotation uses the domain ontology to attach the relevant concept label to each record in the user’s input log. Finally, the aggregation step presents the access records with conceptual information.
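
The preprocessing pipeline described in this subsection might look roughly as follows. This is a minimal sketch under assumptions: the `LogRecord` fields mirror the log fields listed in the text, but the term-to-concept `ONTOLOGY` dictionary, the sample records, the use of the IP address as the user identifier and the ISO-formatted time stamps are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass


@dataclass
class LogRecord:
    ip: str
    timestamp: str  # assumed ISO 8601, so string order equals time order
    title: str
    url: str
    protocol: str
    session_id: str


# Hypothetical ontology fragment: key term -> concept label.
ONTOLOGY = {"romantic": "Romance", "love": "Romance",
            "action": "Action", "war": "Action"}
STOP_WORDS = {"the", "are", "is"}


def annotate(record):
    """Stop word removal on the title, then label the record with the
    concept of the first key term found in the ontology (None if no match)."""
    terms = [w for w in record.title.lower().split() if w not in STOP_WORDS]
    for term in terms:
        if term in ONTOLOGY:
            return ONTOLOGY[term]
    return None


def semantic_log(records, ip):
    """Data selection for one user, then annotation and aggregation of the
    concept labels per session, in access order. Fields such as the protocol
    are simply not carried over (data reduction)."""
    sessions = {}
    selected = sorted((r for r in records if r.ip == ip),
                      key=lambda r: r.timestamp)
    for record in selected:
        concept = annotate(record)
        if concept is not None:
            sessions.setdefault(record.session_id, []).append(concept)
    return sessions


records = [
    LogRecord("10.0.0.1", "2016-01-01T10:00:00", "The Romantic Story", "/m/1", "HTTP/1.1", "s1"),
    LogRecord("10.0.0.1", "2016-01-01T10:05:00", "Love Is in the Air", "/m/2", "HTTP/1.1", "s1"),
    LogRecord("10.0.0.2", "2016-01-01T10:06:00", "Action Heroes", "/m/3", "HTTP/1.1", "s9"),
    LogRecord("10.0.0.1", "2016-01-02T09:00:00", "The War Front", "/m/4", "HTTP/1.1", "s2"),
]
user_log = semantic_log(records, "10.0.0.1")
print(user_log)
```

The result maps each session id of the selected user to its ordered list of concept labels, which is the form the pattern mining step consumes.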

3.2.2 Mining Patterns

Once the semantic log is constructed, a sequential learning method is applied to extract patterns. For this, the proposed model follows the basic idea of association pattern extraction used by the TITANIC algorithm [14], which performs well in constructing the concept lattice for a user.

The algorithm yields the access sequences of a particular user as a set of order-preserving combinations, which results in the personalized lattice of concepts. The personalized concept lattice is the hierarchy of concepts in a tree structure constructed using the support and confidence measures.
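
To make the role of the support and confidence measures concrete, the sketch below counts order-preserving concept pairs over a user’s sessions. It is deliberately much simpler than TITANIC and does not build a lattice; it only illustrates how the two measures can filter ordered concept combinations. The function name, thresholds and sample sessions are assumptions.

```python
from collections import Counter


def concept_rules(sessions, min_support=0.5, min_confidence=0.5):
    """Return {(a, b): (support, confidence)} for 'concept a is followed
    later in the same session by concept b'."""
    n = len(sessions)
    single = Counter()  # number of sessions containing each concept
    pair = Counter()    # number of sessions containing a before b
    for seq in sessions:
        single.update(set(seq))
        ordered = {(a, b)
                   for i, a in enumerate(seq)
                   for b in seq[i + 1:]
                   if a != b}
        pair.update(ordered)
    rules = {}
    for (a, b), count in pair.items():
        support = count / n
        confidence = count / single[a]
        if support >= min_support and confidence >= min_confidence:
            rules[(a, b)] = (support, confidence)
    return rules


# Hypothetical concept sequences, one list per session of the same user.
sessions = [
    ["Romance", "Comedy", "Drama"],
    ["Romance", "Comedy"],
    ["Action", "Thriller"],
    ["Romance", "Drama"],
]
rules = concept_rules(sessions)
print(rules)
```

Here only Romance followed by Comedy and Romance followed by Drama survive both thresholds, each with support 0.5 and confidence 2/3.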

Algorithm to find conceptual patterns:

3.3 Recommendation Strategy

The final layer in the architecture deals with the recommendation strategy, which suggests pages by means of the knowledge gained in the first layer and the patterns from the second layer. The primary task in this step is to identify the recent concept pattern of the current session. Thereafter, if the current concept matches any part of any of the existing patterns saved for the user, then suggestions are made accordingly. If the session matches part of an existing pattern but the current concept does not, then the suggestions switch dynamically from the traditional recommendation path to pages of the current concept. Thus, the concept drift in the user’s interest is identified and the suggestions are changed dynamically by the proposed method.
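
One possible reading of this strategy is sketched below. It is an interpretation, not the authors’ algorithm: drift is declared when the most recent concept of the session appears in none of the stored patterns, and the names `recommend`, `stored_patterns` and `pages_by_concept`, as well as the sample data, are hypothetical.

```python
def recommend(current_session, stored_patterns, pages_by_concept, top_n=3):
    """Return (drifted, suggestions) for the current session.

    current_session: concepts accessed so far, in order.
    stored_patterns: concept sequences mined for this user.
    pages_by_concept: concept -> candidate pages, best first.
    """
    current = current_session[-1]
    for pattern in stored_patterns:
        if current in pattern:
            # The current concept is part of a known pattern: follow it and
            # suggest pages of the concepts that come next in the pattern.
            idx = pattern.index(current)
            targets = pattern[idx + 1:] or [current]
            pages = [p for c in targets for p in pages_by_concept.get(c, [])]
            return False, pages[:top_n]
    # The current concept is in no stored pattern: treat this as concept
    # drift and suggest pages of the current concept instead.
    return True, pages_by_concept.get(current, [])[:top_n]


patterns = [["Romance", "Comedy", "Drama"]]
pages = {
    "Comedy": ["c1", "c2"],
    "Drama": ["d1"],
    "Action": ["a1", "a2", "a3", "a4"],
}
no_drift = recommend(["Romance"], patterns, pages)
drift = recommend(["Romance", "Action"], patterns, pages)
print(no_drift)
print(drift)
```

In the second call the user has moved from “Romance” to “Action”, which no stored pattern contains, so the suggestions switch to pages of the current concept.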

4 Experimentation Results

The experimental setup to evaluate the proposed model uses two variants of the benchmark MovieLens dataset, with 100 K and 1 M ratings. The variant with 100 K ratings covers 1682 movies rated by 943 users, whereas the other covers 3900 movies rated by 6040 users. The ratings are on a scale from 1 to 5, with 1 denoting low quality and 5 denoting high quality. To evaluate the methodology, the mean absolute error (MAE) measure is used, and to evaluate the efficiency of the recommendations, the hit ratio [6] measure is used:

$$ MAE = \frac{\sum_{i=1}^{n} \left| ar_{i} - pr_{i} \right|}{n}, \tag{1} $$

$$ Hit\,ratio = \frac{A_{r}}{R_{e}} \tag{2} $$

where {ar₁, ar₂…ar_n} are the actual ratings, {pr₁, pr₂…pr_n} are the predicted ratings, ‘A_r’ is the total number of recommendations accessed by the user and ‘R_e’ is the total number of recommendations. The experiments compare the proposed model with a popular existing usage-based model, PLWAP, on the two variants of the data set for the top ten recommendations.
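
Equations (1) and (2) translate directly into code. The sketch below is illustrative only; the function names and the sample ratings and page identifiers are assumptions and are not taken from the experiments.

```python
def mean_absolute_error(actual, predicted):
    """Eq. (1): mean of |ar_i - pr_i| over the n rated items."""
    if not actual or len(actual) != len(predicted):
        raise ValueError("need two non-empty rating lists of equal length")
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def hit_ratio(recommended, accessed):
    """Eq. (2): A_r / R_e, the share of recommendations the user accessed."""
    if not recommended:
        raise ValueError("need at least one recommendation")
    hits = sum(1 for item in recommended if item in accessed)
    return hits / len(recommended)


# Hypothetical ratings on the 1-5 scale.
mae = mean_absolute_error([4, 3, 5, 2], [3.5, 3, 4, 3])

# Hypothetical top-ten list and the set of pages the user actually opened.
top_ten = [f"p{i}" for i in range(10)]
hr = hit_ratio(top_ten, {"p1", "p4", "p7", "p42"})
print(mae, hr)
```

For these sample values the absolute errors are 0.5, 0, 1 and 1, giving an MAE of 0.625, and three of the ten recommendations were accessed, giving a hit ratio of 0.3.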

Table 1 summarizes the hit ratio and MAE of the proposed model and the existing model. It indicates that the proposed model outperforms the PLWAP model in terms of the number of suggestions accessed by the user, that is, suggestions relevant to the user’s interest.

Table 1 Experimentation values for the two recommenders


5 Conclusion

Typical recommenders based on usage history cannot use semantics and do not consider the concept drift in user interest. This paper studied recommenders that work with concepts and, on top of those concepts, detect the interest drift of the user in order to keep the user satisfied. The proposed model constructs the ontology, mines the patterns and applies them to the current Web access sequence of the user. The proposed model was evaluated by comparison with a popular existing method, and the results showed that it performed better in terms of hit ratio and MAE.

References

[1] G. Adomavicius and A. Tuzhilin, “Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions,” IEEE Trans. Knowledge Data Eng., vol. 17, no. 6, pp. 734–749, June 2005.
[2] M. Pazzani and D. Billsus, “Content-based recommendation systems,” in The Adaptive Web, P. Brusilovsky, A. Kobsa, and W. Nejdl, Eds. Berlin Heidelberg, Germany: Springer-Verlag, 2007, pp. 325–341.
[3] M. Venu Gopalachari, P. Sammulal, “Personalized Web Page Recommender System using integrated Usage and Content Knowledge”, in the proceedings of 2014 IEEE ICACCCT, 2014, pp. 1066–1071.
[4] J. Schafer, D. Frankowski, J. Herlocker, and S. Sen, “Collaborative filtering recommender systems,” in The Adaptive Web, P. Brusilovsky, A. Kobsa, and W. Nejdl, Eds. Berlin Heidelberg, Germany: Springer-Verlag, 2007, pp. 291–324.
[5] E. Amolochitis, I. T. Christou, and Z. H. Tan, “Implementing a commercial-strength parallel hybrid movie recommendation engine,” IEEE Intell. Syst., vol. 29, no. 2, pp. 92–96, Mar. 2014.
[6] M. Venu Gopalachari, P. Sammulal, “Hybrid Recommender System with Conceptualization and Temporal Preferences”, Proceedings of the Second International Conference on Computer and Communication Technologies, AISC 380, pp. 811–819, Springer, 2015.
[7] M. Jahrer, A. Töscher, and R. Legenstein, “Combining predictions for accurate recommender systems,” in Proc. 16th ACM SIGKDD Int. Conf. Knowledge Discovery Data Mining, New York, 2010, pp. 693–702.
[8] S. Vargas and P. Castells, “Rank and relevance in novelty and diversity metrics for recommender systems,” in Proc. 5th ACM Conf. Recommender System, New York, 2011, pp. 109–116.
[9] M. Zhang and N. Hurley, “Avoiding monotony: Improving the diversity of recommendation lists,” in Proc. ACM Conf. Recommender Systems, New York, 2008, pp. 123–130.
[10] G. Adomavicius and Y. Kwon, “Improving aggregate recommendation diversity using ranking-based techniques,” IEEE Trans. Knowledge Data Eng., vol. 24, no. 5, pp. 896–911, May 2012.
[11] F. Fouss and M. Saerens, “Evaluating performance of recommender systems: An experimental comparison,” in Proc. IEEE/WIC/ACM Int. Conf. Web Intelligent Agent Technology, Washington, D.C.: IEEE Computer Society, 2008, pp. 735–738.
[12] B. Mobasher, “Data Mining for Web Personalization,” in The Adaptive Web, vol. 4321, P. Brusilovsky, A. Kobsa, and W. Nejdl, Eds.: Springer-Verlag Berlin, Heidelberg, 2007, pp. 90–135.
[13] C. I. Ezeife and Y. Lu, “Mining Web Log Sequential Patterns with Position Coded Pre-Order Linked WAP-Tree,” Data Mining and Knowledge Discovery, vol. 10, pp. 5–38, 2005.
[14] S. T. T. Nguyen, “Efficient Web Usage Mining Process for Sequential Patterns,” in Proceedings of the 11th International Conference on Information Integration and Web-based Applications and Services, Kuala Lumpur, Malaysia, 2009, pp. 465–469.
[15] L. Wei and S. Lei, “Integrated Recommender Systems Based on Ontology and Usage Mining,” in Active Media Technology, vol. 5820, J. Liu, J. Wu, Y. Yao, and T. Nishida, Eds.: Springer-Verlag Berlin Heidelberg, 2009, pp. 114–125.
[16] S. Salin and P. Senkul, “Using Semantic Information for Web Usage Mining based Recommendation,” in 24th International Symposium on Computer and Information Sciences, 2009, pp. 236–241.


Author information

Authors and Affiliations

Jawaharlal Nehru Technological University, Hyderabad College of Engineering, Jagtial, Telangana, India
P. Sammulal
Chaitanya Bharathi Institute of Technology, Hyderabad, India
M. Venu Gopalachari

Corresponding author

Correspondence to P. Sammulal.

Editor information

Editors and Affiliations

ANITS, Prof., Comp. Sci. & Engg. Dept. ANITS, Visakhapatnam, Andhra Pradesh, India
Suresh Chandra Satapathy
JNTUH College of Engg. HYD (Autonomous), Prof. & Head, Comp. Sci. & Engg. Dept. JNTUH College of Engg. HYD (Autonomous), Hyderabad, Telangana, India
V. Kamakshi Prasad
JNTUH College of Engg. HYD (Autonomous), Pro., Dept. Computer Science & Engg. JNTUH College of Engg. HYD (Autonomous), Hyderabad, Telangana, India
B. Padmaja Rani
SCIS, University of Hyderabad, Hyderabad, India
Siba K. Udgata
CMR Technical Campus, Hyderabad, India
K. Srujan Raju

About this paper

Cite this paper

Sammulal, P., Venu Gopalachari, M. (2017). A Personalized Recommender System Using Conceptual Dynamics. In: Satapathy, S., Prasad, V., Rani, B., Udgata, S., Raju, K. (eds) Proceedings of the First International Conference on Computational Intelligence and Informatics. Advances in Intelligent Systems and Computing, vol 507. Springer, Singapore. https://doi.org/10.1007/978-981-10-2471-9_21


DOI: https://doi.org/10.1007/978-981-10-2471-9_21
Published: 01 December 2016
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-2470-2
Online ISBN: 978-981-10-2471-9
eBook Packages: Engineering, Engineering (R0)
