
1 Introduction

Context-Aware Recommender Systems (CARS) have attracted considerable attention from experts and researchers because of the important role contextual information plays in item recommendation [1, 2]. Dey (2001) defined context as "a piece of information that describes the circumstances of an entity". The choice of items usually differs across contextual situations. To quote a few examples: (a) one would choose to listen to different music when the weather is pleasant and/or the road is clear than on a hot summer day and/or in a traffic jam; (b) the choice of restaurant would differ between a quick business lunch and dining out with a girlfriend. Therefore, to improve accuracy and user satisfaction, CARS have been studied in various domains encompassing information retrieval, mobile applications, e-commerce, e-learning, management, and marketing.

The crux of CARS is the incorporation of contextual information into various mathematical [1, 2, 5], probabilistic [1, 2], soft computing [1, 2, 16] and particle swarm optimization techniques [3, 7]. Among existing recommender systems (RS), those based on the collaborative filtering (CF) technique are the most effective and widely used. The quality and accuracy of such RS can be increased by using new and improved similarity measures along with effective utilization of context.

Major Challenges in CARS.

Many challenges remain in CARS, such as utilization of contextual information, data sparsity, scalability and the cold start problem. Selecting and applying contextual factors when making recommendations is still a blunt instrument. Moreover, user-item rating matrices are sparse, since users generally do not evaluate all items. The sparsity problem becomes more severe when these matrices are diluted with contextual factors: using too many contextual factors in an algorithm increases data sparsity, while using too few fails to bring contextual effects into the recommendations. In our previous research [10], we addressed this issue by forming context communities that are utilized in different components of the algorithm (CAWP). Although that algorithm incorporated context and increased accuracy, it is domain dependent. Optimization is at the core of this research: we find an optimal set of weights for the whole context vector using particle swarm optimization (PSO) to alleviate the data sparsity problem.

Another major issue is finding the similarity between two users and/or items, especially in context-aware datasets: the better the similarity measure, the better the recommendations. Conventional similarity measures such as the Pearson Correlation Coefficient (PCC), Cosine (COS) and Mean Squared Difference (MSD) can typically be calculated only over items commonly rated by two users, or over users who have rated common items. They ignore global rating information, and they do not consider contextual information when computing the similarity between two users or items. Some studies have employed newly emerging similarity measures such as NHSM [18], the Bhattacharyya coefficient with correlation [17] and PSS-based similarity measures [18, 20], which overcome one or another of the problems mentioned above but still do not take the contextual situation into account.

Our Contribution.

Motivated by the issues mentioned above, the primary contributions of this work are as follows:

  • A context weighting vector is computed using PSO to weight the contribution of all contextual features, instead of performing context selection or relaxation. Weighting the contextual features overcomes the data sparsity problem and optimizes the contextual effects. This weighting vector is then applied in each component of the user and item neighborhood based algorithms.

  • An effective approach is proposed to combine local and global similarities based on a sparsity measure. Several variations of the sparsity measure are used to weight the contributions of the global and local similarities, catering to both sparse and dense data. Moreover, the global similarity can be computed on non-co-rated items and captures the global preferences of user behaviour, while the local similarity is obtained using co-rated items.

  • Extending our research, the proposed approach is evaluated for two different types of user groups. Three different group recommendation techniques are compared to analyze the efficacy of the proposed framework. The two datasets used are contextually rich and especially designed for contextual personalization research.

The rest of the paper is organized as follows. CARS-related reviews and similarity measures are discussed in Sect. 2. The detailed construction of the proposed framework and the approach used are presented in Sect. 3. Section 4 presents the experimental results and their analysis. Section 5 states the conclusions, followed by future research work.

2 Related Reviews

2.1 Context-Aware Recommendation

A Context-Aware Recommender System includes context features while making a prediction. The rating estimation function is given by \( R: user \times item \times context \to rating \). Context-aware recommendation algorithms fall into three paradigms: (a) contextual pre-filtering, where the dataset filtered using contextual information is utilized by the rating prediction algorithm, (b) contextual post-filtering, where the final set of recommendations is filtered using contextual information, and (c) contextual modeling, where contextual information is used directly to predict the rating [1, 2]. Valid and influential contexts also need to be identified before being applied in recommendation algorithms [24, 26]. To identify relevant and influential context features of the LDOS-CoMoDa dataset, [15] summarized assessments obtained from a user survey and statistical testing. Another method is proposed in [22] to select optimal contexts, including demographic, item and contextual features. The relevance value of each context feature set under a specific genre of the IncarMusic dataset is found in [4]. A prediction aggregation model combining predictions obtained using demographic, semantic and social contexts is demonstrated in [9]. An approach is presented in [25] to analyze several direct context prediction algorithms based on multilabel classification.

2.2 Sparsity Problem

Researchers have used many techniques to handle the data sparsity issue, which is one of the major challenges in this field. Especially in context-aware datasets, where the rating data are filtered with contextual information, the matrices become even sparser. The DCR algorithm is proposed in [21] to handle the data sparsity problem, where relaxed context constraints are used in the prediction algorithm. Another approach, CAWP, formed context communities and included a weighted percentile method [10]. DCW for CARS [23] finds weights for the contextual features, and only those users whose context similarity exceeds a threshold are used by the algorithm.

2.3 Similarity Measures

Traditional similarity measures such as the Pearson Correlation Coefficient (PCC), Mean Squared Difference (MSD), Cosine (COS) and Jaccard are mostly used by recommender systems to compute the similarity between a pair of users or items. These measures have several drawbacks, such as the few co-rated items problem, utilization of only local rating information and non-inclusion of global ratings [1, 2, 14, 18, 20]. Hence, new similarity measures have been proposed to overcome these drawbacks. A similarity measure based on the Bhattacharyya coefficient is evaluated in [14] to handle sparse data without depending on co-rated items. A heuristic similarity model which considers both the local and global context of user behaviour is experimentally demonstrated in [18]. A model based on the mean measure of divergence is defined in [13, 20] that takes the rating habits of a user into account. These measures do not consider contextual information and suffer from one problem or another, so we attempt to form a combination of local, global and contextual similarity measures to improve accuracy.

3 The Proposal

This section presents the construction and details of the proposed framework depicted in Fig. 1. The framework consists of a Sparsity Based Weighted Context Recommendation Unit (SWCRU) and a Group Recommendation Unit (GRU). The SWCRU predicts ratings via user neighborhood based and item neighborhood based algorithms. To achieve this, the optimum weights of the different context features and genres are first obtained using particle swarm optimization (PSO). The Overlap metric and the context feature weights are used to find a contextual similarity value, which is utilized by different parts of both the user and item neighborhood based algorithms. Local and global similarities are also computed so that the prediction algorithms can exploit their respective strengths. A sparsity measure is employed to balance local and global similarity, since the two perform differently under different data sparsity scenarios. The GRU provides three group recommendation techniques, Merging, Multiplicative and Merging-Multiplicative, for analyzing the performance of the proposed algorithms for a group of users. Random groups are used for this purpose.

Fig. 1. The proposed framework.

Fig. 2. Comparison of different algorithms w.r.t. MAE and RMSE values using: (a) IncarMusic dataset (b) LDOS-CoMoDa dataset.

3.1 PSO to Learn Optimal Weighting Vector \( \varvec{w} \)

We assume that ratings given under contexts more similar to the target context are more valuable in making predictions. To handle the data sparsity problem, we learn an optimal weight \( w \) for each context feature using PSO instead of filtering out context features. Each weight in \( w \) is a real value in the range [0, 1]. These weights control the contribution of each context feature in the recommendation algorithms.

Particle Representation and Initial Population

Each context feature weight is represented by 8 binary digits, i.e. an integer in the range [0, 255]. If there are \( n \) features, then \( 8n \) bits represent a particle. After the termination criterion is reached, the binary value of each weight is converted to its decimal equivalent [3, 7]. Each weight is then divided by the total weight to obtain a normalized value.
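As an illustration of this encoding, the following sketch decodes a particle into a normalized weighting vector. It assumes the 8-bit-per-feature representation described above; the function and variable names are hypothetical, not the authors' implementation.

```python
# Minimal sketch of particle decoding under the 8-bit-per-feature encoding.

def decode_particle(bits, n_features):
    """Convert an 8*n_features bit string into normalized context weights."""
    assert len(bits) == 8 * n_features
    raw = []
    for f in range(n_features):
        chunk = bits[8 * f: 8 * (f + 1)]               # 8 bits for feature f
        raw.append(int("".join(map(str, chunk)), 2))   # integer in [0, 255]
    total = sum(raw) or 1                              # avoid division by zero
    return [v / total for v in raw]                    # normalized weights

# Example: a particle encoding 3 context features (128, 64, 32)
particle = [1,0,0,0,0,0,0,0,  0,1,0,0,0,0,0,0,  0,0,1,0,0,0,0,0]
print(decode_particle(particle, 3))  # [0.571..., 0.285..., 0.142...]
```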

Particle Dynamics

PSO maintains a collection of candidate solutions called a swarm, where each candidate solution is a particle. These particles continuously move through the search space with some velocity. The velocity and position of each particle in every dimension are updated at each time step. The following rules are used to update the swarm [7].

$$ vel_{i} = rw \cdot vel_{i} + cons_{1} \cdot r_{1} \left( {pos_{PBest,i} - pos_{i} } \right) + cons_{2} \cdot r_{2} \left( {pos_{GBest} - pos_{i} } \right) $$
$$ {\text{If}}\;\left| {vel_{i} } \right| > vel_{max},\; vel_{i} = \left( {vel_{max} /\left| {vel_{i} } \right|} \right) \cdot vel_{i}\,;\quad pos_{i} = pos_{i} + vel_{i} $$

where \( pos_{i} \) is the position of the current particle \( i \), \( pos_{PBest,i} \) is the best position attained by particle \( i \), \( pos_{GBest} \) is the swarm's global best, \( vel_{i} \) is the velocity of particle \( i \), \( rw \) is a random inertia weight lying between 0.4 and 0.9, \( cons_{1} \) and \( cons_{2} \) are spring constants whose values are set to 2.0 following empirical suggestions [22], and \( r_{1} \) and \( r_{2} \) are random numbers between 0 and 1. \( vel_{min} \) and \( vel_{max} \) are \( (ub - lb)/2 \); in our case, \( vel_{max} \) is tuned to 2.0 after experimental analysis. The swarm size is 10 and the number of iterations performed before termination is 20.
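A minimal sketch of this update rule is shown below, using the parameter choices stated above (random inertia weight in [0.4, 0.9], \( cons_{1} = cons_{2} = 2.0 \), velocity clamping at \( vel_{max} \)). Names are illustrative, not the authors' code.

```python
import random

# One swarm-update step: inertia + pull toward personal best + pull toward
# global best, with velocity clamping, followed by the position update.

def update_particle(pos, vel, pbest, gbest, vel_max=2.0,
                    cons1=2.0, cons2=2.0):
    rw = random.uniform(0.4, 0.9)                   # random inertia weight
    new_pos, new_vel = [], []
    for i in range(len(pos)):
        r1, r2 = random.random(), random.random()
        v = (rw * vel[i]
             + cons1 * r1 * (pbest[i] - pos[i])     # personal-best attraction
             + cons2 * r2 * (gbest[i] - pos[i]))    # global-best attraction
        if abs(v) > vel_max:                        # velocity clamping
            v = (vel_max / abs(v)) * v
        new_vel.append(v)
        new_pos.append(pos[i] + v)                  # move the particle
    return new_pos, new_vel
```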

The Fitness Function

The fitness value of the \( i \)-th particle in the swarm is computed using the following fitness function [3, 7].

$$ Fitness_{i} = \frac{1}{{S_{R} }}\sum\nolimits_{t = 1}^{{S_{R} }} {\left| {ar_{t} - pr_{t} } \right|} $$

\( S_{R} \) is the cardinality of the training set of the active user, and \( ar_{t} \) and \( pr_{t} \) are the actual and predicted ratings of item \( t \), respectively.
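In code, this fitness is simply the mean absolute error over the active user's training ratings; a small sketch (hypothetical names) follows.

```python
# Fitness of a particle: MAE between actual and predicted training ratings.

def fitness(actual, predicted):
    assert len(actual) == len(predicted) and actual
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

print(fitness([4, 3, 5], [3.5, 3.0, 4.0]))  # 0.5
```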

Termination Criteria

We take a specified number of iterations as the termination criterion, i.e. the PSO algorithm terminates after executing the specified number of iterations.

3.2 Overlap Metric to Find Contextual Similarity

The Overlap metric finds the similarity between two objects \( obj_{i} \) and \( obj_{j} \) and is widely used for categorical attributes [19]. It is simple and effective, and is defined by

$$ S\left( {obj_{i} , obj_{j} } \right) = \frac{{\mathop \sum \nolimits_{a = 1}^{m} S_{a} \left( {obj_{ia} , obj_{ja} } \right)}}{m} $$

where \( a \) denotes a particular attribute and \( m \) is the total number of attributes of the object. \( obj_{ia} \) denotes the \( a \)-th attribute of object \( obj_{i} \). If \( obj_{ia} = obj_{ja} \), then \( S_{a} \left( {obj_{ia} , obj_{ja} } \right) = 1 \); otherwise it is 0.

Illustrative Example

Using Table 1, the contextual similarity between \( user1 \) and \( user4 \) via Overlap metric is computed as \( S\left( {user1,user4} \right) = \frac{3}{5} = 0.6 \).

Table 1. Rating matrix for an item i by different users under different contextual conditions.
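A small sketch of this computation is given below. Since Table 1 is not reproduced here, the context values are hypothetical stand-ins chosen so that three of the five attributes match, giving the same result of 0.6.

```python
# Overlap metric for categorical context attributes: fraction of matches.

def overlap(obj_i, obj_j):
    assert len(obj_i) == len(obj_j)
    matches = sum(1 for a, b in zip(obj_i, obj_j) if a == b)
    return matches / len(obj_i)

user1 = ["sunny", "weekend", "home", "happy", "alone"]    # illustrative contexts
user4 = ["sunny", "weekday", "home", "happy", "friends"]
print(overlap(user1, user4))  # 3/5 = 0.6
```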

3.3 Weighted Overlap Metric

This metric assesses how much weight should be given to a rating \( r_{a,i,c2} \), using the weighting vector \( w \) obtained via PSO. The weighted overlap metric for the context similarity between the target context \( c1 \) and another user's context \( c2 \) is given by:

$$ O_{w} \left( {c1,c2,w} \right) = \frac{{\mathop \sum \nolimits_{func \in c1 \cap c2} w_{func} }}{{\mathop \sum \nolimits_{func \in c1 \cup c2} w_{func} }} $$
(1)

i.e. the total context similarity between the target user and another user is obtained by adding the weights of those context features whose values match, normalized by the sum of the weights of all features appearing in either context.
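A minimal sketch of Eq. (1) is shown below. It assumes both contexts specify every feature, so the denominator reduces to the total weight; the feature values and weights are illustrative.

```python
# Weighted Overlap metric of Eq. (1): matching features contribute their
# PSO-learned weight; the result is normalized by the total feature weight.

def weighted_overlap(c1, c2, w):
    """c1, c2: context vectors of equal length; w: per-feature weights."""
    matched = sum(w[f] for f in range(len(c1)) if c1[f] == c2[f])
    total = sum(w)
    return matched / total if total else 0.0

c1 = ["sunny", "weekend", "home"]
c2 = ["sunny", "weekday", "home"]
w  = [0.5, 0.3, 0.2]                 # weights learned via PSO (illustrative)
print(weighted_overlap(c1, c2, w))   # (0.5 + 0.2) / 1.0 = 0.7
```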

3.4 Weighted Local Similarity

To calculate the local similarity between two users or two items, we use the Pearson Correlation Coefficient, which relies on commonly rated items. \( c1 \) represents the context vector of \( u_{x} \) or \( i_{x} \), and \( c2 \) represents the context vector of \( u_{y} \) or \( i_{y} \).

The weighted variant for the similarity between two users, \( simu - local_{w} \), is given by

$$ simu - local_{w} \left( {u_{x} ,u_{y} ,w} \right) = \frac{{\mathop \sum \nolimits_{t = 1}^{{n^{{\prime }} }} \left( {r_{{u_{x} ,i_{t} }} - \overline{{r_{{u_{x} }} }} } \right)\left( {r_{{u_{y} ,i_{t} }} - \overline{{r_{{u_{y} }} }} } \right)O_{w} \left( {c1,c2,w} \right)}}{{\sqrt {\mathop \sum \nolimits_{t = 1}^{{n^{{\prime }} }} \left( {r_{{u_{x} ,i_{t} }} - \overline{{r_{{u_{x} }} }} } \right)^{2} O_{w} \left( {c1,c2,w} \right)} \sqrt {\mathop \sum \nolimits_{t = 1}^{{n^{{\prime }} }} \left( {r_{{u_{y} ,i_{t} }} - \overline{{r_{{u_{y} }} }} } \right)^{2} O_{w} \left( {c1,c2,w} \right)} }} $$
(2)

where \( \left\{ {i_{t} : t = 1,2, \ldots ,n^{{\prime }} \wedge n^{{\prime }} \le n} \right\} \) represents the set of items which both \( u_{x} \) and \( u_{y} \) have rated, and \( n \) is the total number of available items.

The weighted variant for the similarity between two items, \( simi - local_{w} \), is given by

$$ simi - local_{w} \left( {i_{x} ,i_{y} ,w} \right) = \frac{{\mathop \sum \nolimits_{t = 1}^{{m^{{\prime }} }} \left( {r_{{u_{t} ,i_{x} }} - \overline{{r_{{i_{x} }} }} } \right)\left( {r_{{u_{t} ,i_{y} }} - \overline{{r_{{i_{y} }} }} } \right)O_{w} \left( {c1,c2,w} \right)}}{{\sqrt {\mathop \sum \nolimits_{t = 1}^{{m^{{\prime }} }} \left( {r_{{u_{t} ,i_{x} }} - \overline{{r_{{i_{x} }} }} } \right)^{2} O_{w} \left( {c1,c2,w} \right)} \sqrt {\mathop \sum \nolimits_{t = 1}^{{m^{{\prime }} }} \left( {r_{{u_{t} ,i_{y} }} - \overline{{r_{{i_{y} }} }} } \right)^{2} O_{w} \left( {c1,c2,w} \right)} }} $$
(3)

where \( \left\{ {u_{t} : t = 1,2, \ldots ,m^{{\prime }} \wedge m^{{\prime }} \le m} \right\} \) represents the set of users who have rated both \( i_{x} \) and \( i_{y} \), and \( m \) is the total number of available users.
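A minimal sketch of the weighted local similarity of Eq. (2) follows. For illustration it assumes one weighted-overlap value per co-rated item (the pairing of rating contexts is an assumption made here); names are hypothetical.

```python
import math

# Weighted local (Pearson-style) similarity over co-rated items, with each
# term weighted by the contextual overlap of the two ratings' contexts.

def weighted_local_sim(rx, ry, mean_x, mean_y, ow):
    """rx, ry: ratings of the co-rated items; ow: per-item context overlap."""
    num = sum((a - mean_x) * (b - mean_y) * o for a, b, o in zip(rx, ry, ow))
    den_x = math.sqrt(sum((a - mean_x) ** 2 * o for a, o in zip(rx, ow)))
    den_y = math.sqrt(sum((b - mean_y) ** 2 * o for b, o in zip(ry, ow)))
    return num / (den_x * den_y) if den_x and den_y else 0.0

# Illustrative call: two users with three co-rated items
print(weighted_local_sim([5, 3, 4], [4, 2, 5], 4.0, 3.7, [0.7, 0.5, 0.9]))
```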

3.5 Weighted Global Similarity

Bhattacharyya Coefficient

The Bhattacharyya coefficient measures the similarity between two statistical samples [14]. If \( pq \) and \( pr \) are discrete probability distributions over the same domain \( D \), the Bhattacharyya coefficient (BC) between \( pq \) and \( pr \) is given by

$$ BC\left( { pq, pr} \right) = \mathop \sum \limits_{x \in D} \sqrt {pq\left( {\text{x}} \right) pr\left( x \right)} $$

Following it, the similarity between two users \( u1\, and\, u2 \) is given as:

$$ BC\left( { u1,u2} \right) = \mathop \sum \limits_{k = 1}^{m} \sqrt {\left( {\widehat{{D_{u1k} }}} \right)\left( {\widehat{{D_{u2k} }}} \right)} $$
(4)

where \( \left( {\widehat{{D_{u1k} }}} \right)\,{\text{and}}\,\left( {\widehat{{D_{u2k} }}} \right) \) represent the users' rating distributions over domain \( D \), with \( \left( {\widehat{{D_{uk} }}} \right) = \frac{\# k}{\# u} \), where \( \# k \) is the number of items rated with value \( k \) and \( \# u \) is the total number of items rated by user \( u \).

Illustrative Example

Consider a rating scale with values in {1, 2, 3}, and users \( u1 \) and \( u2 \) who have rated items from a set of five different items (Table 2).

Table 2. User-item rating matrix.
Table 3. Description of notations used.

The BC coefficient is calculated as:

$$ BC\left( {u1,u2} \right) = \mathop \sum \limits_{t = 1}^{3} \sqrt {\left( {\widehat{{u1_{t} }}} \right)\left( {\widehat{{u2_{t} }}} \right)} = \sqrt {\frac{2}{3} *\frac{1}{2}} + \sqrt {\frac{1}{3} *\frac{1}{2}} + \sqrt {\frac{0}{3} *\frac{0}{2}} = 0.9855 $$
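A small sketch reproducing this computation is shown below. Since Table 2 is not reproduced here, the rating lists are chosen to match the ratio terms in the calculation above (u1 has rated three items, u2 two); names are illustrative.

```python
import math
from collections import Counter

# Bhattacharyya coefficient of Eq. (4) from the two users' rating-value
# distributions over the rating scale.

def bhattacharyya(ratings_u1, ratings_u2, rating_scale):
    c1, c2 = Counter(ratings_u1), Counter(ratings_u2)
    n1, n2 = len(ratings_u1), len(ratings_u2)
    return sum(math.sqrt((c1[k] / n1) * (c2[k] / n2)) for k in rating_scale)

# u1 rated three items, u2 rated two items, on the {1, 2, 3} scale
print(bhattacharyya([1, 1, 2], [1, 2], [1, 2, 3]))  # ~0.9856 (0.9855 above)
```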

Proximity-Significance-Singularity (PSS)

PSS similarity is used as a local measure [11, 18]; it penalizes bad similarity and rewards good similarity, and is defined as follows:

$$ sim\left( {u1,u2} \right)^{PSS} = \sum\nolimits_{i\epsilon I} {PSS(r_{u1,i} ,r_{u2,i} )} $$
(5)

where \( I \) represents the items rated by both users \( u1 \) and \( u2 \), and \( r_{u1,i} \) is the rating assigned by user \( u1 \) to item \( i \).

$$ PSS(r_{u1,i} , r_{u2,i} ) = {\text{Proximity}}(r_{u1,i} , r_{u2,i} )\,\times\,{\text{Significance}}(r_{u1,i} , r_{u2,i} )\,\times\,{\text{Singularity}}(r_{u1,i} , r_{u2,i} ). $$

Proximity considers the absolute difference between two ratings and penalizes disagreement.

Significance assumes that ratings farther from the median are more significant.

Singularity uses the difference between the average of the two ratings and the mean rating of the item.

$$ {\text{Proximity}}\,\left( {r_{u1,i} , r_{u2,i} } \right) = 1 - \frac{1}{{1 + { \exp }\left( { - \left| {r_{u1,i} - r_{u2,i} } \right|} \right)}} $$
$$ {\text{Significance}}\,\left( {r_{u1,i} , r_{u2,i} } \right) = 1 - \frac{1}{{1 + { \exp }\left( { - \left| {r_{u1,i} - r_{med} } \right| *\left| {r_{u2,i} - r_{med} } \right|} \right)}} $$
$$ {\text{Singularity}}\,\left( {r_{u1,i} , r_{u2,i} } \right) = 1 - \frac{1}{{1 + { \exp }\left( { - \left| {\left( {\frac{{r_{u1,i} + r_{u2,i} }}{2}} \right) - \mu_{i} } \right|} \right)}} $$

where \( \mu_{i} \) is the mean rating of item \( i \).

Hybrid Similarity Metric

Combining the strengths of the Weighted Overlap metric (for context similarity) stated in Eq. (1), the Bhattacharyya coefficient (for global similarity) described in Eq. (4) and PSS (for the local component) defined in Eq. (5), the hybrid similarity measure is given by Eqs. (6) and (7).

$$ simu - global_{w} \left( {u_{x} ,u_{y} ,w} \right) = \mathop \sum \limits_{{{\text{a}} \in I_{{u_{x} }} }} \mathop \sum \limits_{{b \in I_{{u_{y} }} }} BC\left( {u_{x} ,u_{y} } \right) *O_{w} \left( {c1,c2,w} \right) *s\left( {u_{x} ,u_{y} } \right)^{PSS} $$
(6)
$$ simi - global_{w} \left( {i_{x} ,i_{y} ,w} \right) = \mathop \sum \limits_{{a \in U_{{i_{x} }} }} \mathop \sum \limits_{{b \in V_{{i_{y} }} }} BC\left( {i_{x} ,i_{y} } \right) *O_{w} \left( {c1,c2,w} \right) *s\left( {i_{x} ,i_{y} } \right)^{PSS} $$
(7)
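To make the combination concrete, the sketch below implements the three PSS components and multiplies them with precomputed Bhattacharyya and weighted-overlap values, in the spirit of Eqs. (6) and (7); the outer double sum over item pairs in those equations is omitted for brevity, and all names and numbers are illustrative.

```python
import math

# PSS components (proximity, significance, singularity) and a simplified
# hybrid global similarity: BC * O_w * PSS, with BC and O_w passed in.

def proximity(r1, r2):
    return 1 - 1 / (1 + math.exp(-abs(r1 - r2)))

def significance(r1, r2, r_med):
    return 1 - 1 / (1 + math.exp(-abs(r1 - r_med) * abs(r2 - r_med)))

def singularity(r1, r2, mu_i):
    return 1 - 1 / (1 + math.exp(-abs((r1 + r2) / 2 - mu_i)))

def pss_sim(pairs, r_med, item_means):
    """pairs: (r_u1_i, r_u2_i, item) tuples over items rated by both users."""
    return sum(proximity(a, b) * significance(a, b, r_med)
               * singularity(a, b, item_means[i]) for a, b, i in pairs)

def hybrid_global_sim(bc, ow, pss):
    """Product of the global (BC), contextual (O_w) and PSS components."""
    return bc * ow * pss

pairs = [(4, 5, "i1"), (3, 2, "i2")]
item_means = {"i1": 3.8, "i2": 2.5}
pss = pss_sim(pairs, r_med=3, item_means=item_means)
print(hybrid_global_sim(bc=0.9855, ow=0.6, pss=pss))
```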

3.6 Sparsity Measure \( \left(\varvec{\vartheta}\right) \)

Previous research has verified that when data sparsity is high, global similarity yields more accurate predictions, whereas for low sparsity, local similarity performs better. Various sparsity measures are proposed in [3] to ensure that locally similar neighbors and globally similar neighbors are weighted differently in different data scenarios. We propose to use these sparsity measures to obtain the correct proportions of local and global similarities. The sparsity measures utilized are as follows.

Overall Sparsity Measure \( \left( {\vartheta_{{\mathbf{1}}} } \right) \)

It is uniform for all users and considers the sparsity of the entire matrix. It is computed as

$$ \vartheta_{1} = 1 - \frac{{m_{R} }}{{m_{U} \times m_{I} }} $$

where \( m_{R} \), \( m_{U} \) and \( m_{I} \) represent the total number of ratings in the entire matrix, the total number of unique users and the total number of unique items in the matrix, respectively.

User Dependent Sparsity Measure \( \left( {\vartheta_{{\mathbf{2}}} } \right) \)

The intuition behind this measure is that users who have rated fewer items will not have a very reliable local neighborhood. It is user specific and remains constant for all items of the active user. The value of \( \vartheta_{2} \) is defined as follows.

$$ \vartheta_{2} = 1 - \frac{{m_{u} }}{{\mathop {\max }\nolimits_{u \in U} m_{u} }} $$

where \( m_{u} \) represents the number of items that user \( u \) has rated.

The next measure addresses sparsity at the user-item level, since globally similar neighbors are sometimes superior depending on the items rated by the users.

Local Global Ratio \( \left( {\vartheta_{{\mathbf{3}}} } \right) \)

The value of \( \vartheta_{3} \) is computed as: \( \vartheta_{3} = 1 - \left( {\frac{{|L_{Neigh} \left( {a,i} \right)|}}{{|G_{Neigh} \left( {a,j} \right)|}}} \right) \)

where \( L_{Neigh} \left( {a,i} \right) \) represents the set of locally similar neighbors of user \( a \) who have rated item \( i \) and \( G_{Neigh} \left( {a,j} \right) \) represents the set of globally similar neighbors of user \( a \).

\( \vartheta_{4} \) is defined as the average of \( \vartheta_{1} ,\vartheta_{2} \,{\text{and}}\,\vartheta_{3} . \)
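These measures translate directly into code; the sketch below computes \( \vartheta_{1} \)-\( \vartheta_{4} \) from summary counts and neighbor sets assumed to be available, with illustrative names and example numbers.

```python
# Sparsity measures: overall, user-dependent, local-global ratio, and average.

def theta1(n_ratings, n_users, n_items):
    """Overall sparsity of the rating matrix."""
    return 1 - n_ratings / (n_users * n_items)

def theta2(n_rated_by_u, max_rated_by_any_user):
    """User-dependent sparsity."""
    return 1 - n_rated_by_u / max_rated_by_any_user

def theta3(local_neighbors_of_a_for_i, global_neighbors_of_a):
    """Local-global ratio for active user a and item i."""
    return 1 - len(local_neighbors_of_a_for_i) / len(global_neighbors_of_a)

def theta4(t1, t2, t3):
    """Average of the three measures above."""
    return (t1 + t2 + t3) / 3

t1 = theta1(n_ratings=1000, n_users=100, n_items=50)      # 0.8
t2 = theta2(n_rated_by_u=5, max_rated_by_any_user=40)     # 0.875
t3 = theta3({"u2", "u7"}, {"u2", "u5", "u7", "u9"})        # 0.5
print(theta4(t1, t2, t3))                                  # ~0.725
```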

Aggregated Similarity

The correct proportion of local and global similarities can achieve better quality predictions in both sparse and dense data scenarios. Thus, the aggregated similarity is a linear combination of the local and global similarities and is defined by Eq. (8):

$$ sim - aggr\left( {i,j} \right) = \vartheta \times sim - global_{w} \left( {i,j,w} \right) + \left( {1 - \vartheta } \right) \times sim - local_{w} \left( {i,j,w} \right) $$
(8)

where \( i \) represents \( u_{x} \) or \( i_{x} \) and \( j \) represents \( u_{y} \) or \( i_{y} \), depending on whether the user or item neighborhood algorithm is used. The value of \( \vartheta \) can be calculated with any of the sparsity measures \( \vartheta_{i} \), where \( i = 1, 2, 3 \) or 4.

The value of \( sim - local_{w} \left( {i,j,w} \right) \) is computed by Eq. (2) or (3), and \( sim - global_{w} \left( {i,j,w} \right) \) by Eq. (6) or (7), depending on the user or item neighborhood algorithm.
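A one-line sketch of Eq. (8) with illustrative numbers:

```python
# Sparsity-driven blend of global and local similarity (Eq. 8).

def aggregated_sim(theta, sim_global, sim_local):
    return theta * sim_global + (1 - theta) * sim_local

# Sparse data (theta close to 1) leans on the global similarity
print(aggregated_sim(0.8, sim_global=0.9, sim_local=0.4))  # 0.8
```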

3.7 Predictions and Recommendations

The following neighborhood based algorithms are used to predict the rating of an unrated item for the active user \( a \):

Weighted Context User Based Using Sparsity Measure \( \left( {WCUB_{\vartheta - sim}} \right) \)

$$ P_{a,i,c} = \overline{\omega } \left( {{\text{a}},O_{w} \left( {c1,c2,w} \right)} \right) + \frac{{\mathop \sum \nolimits_{{t \in N_{a} }} sim - aggr\left( {a,t} \right)\left( {\upomega\left( {{\text{t}},O_{w} \left( {c1,c2,w} \right)} \right) - \overline{\upomega} \left( {{\text{t}},O_{w} \left( {c1,c2,w} \right)} \right)} \right)}}{{\mathop \sum \nolimits_{{t \in N_{{a,{\text{k}}}} }} sim - aggr\left( {a,t} \right)}} $$
(9)

Weighted Context Item Based Using Sparsity Measure \( \left( {WCIB_{\vartheta - sim}} \right) \)

$$ P_{a,i,c} = \overline{\omega } \left( {{\text{i}},O_{w} \left( {c1,c2,w} \right)} \right) + \frac{{\mathop \sum \nolimits_{{t \in N_{i} }} sim - aggr\left( {i,t} \right)\left( {\upomega\left( {{\text{t}},O_{w} \left( {c1,c2,w} \right)} \right) - \overline{\upomega} \left( {{\text{t}},O_{w} \left( {c1,c2,w} \right)} \right)} \right)}}{{\mathop \sum \nolimits_{{t \in N_{{i,{\text{k}}}} }} sim - aggr\left( {i,t} \right)}} $$
(10)
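A simplified sketch of the prediction step of Eqs. (9) and (10) is given below: a mean-centered, similarity-weighted average over the neighbors. Plain means are used in place of the contextually weighted means \( \overline{\omega } \), so this is only an approximation of the equations above; all names are illustrative.

```python
# Neighborhood-based prediction: active mean plus the similarity-weighted
# average of the neighbors' mean-centered ratings for the target item.

def predict(active_mean, neighbors):
    """neighbors: list of (aggregated_similarity, neighbor_rating, neighbor_mean)."""
    num = sum(s * (r - m) for s, r, m in neighbors)
    den = sum(s for s, _, _ in neighbors)
    return (active_mean + num / den) if den else active_mean

neighbors = [(0.82, 5, 3.9), (0.61, 4, 3.2), (0.40, 3, 3.5)]
print(predict(active_mean=3.4, neighbors=neighbors))  # ~4.05
```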

3.8 Group Recommendation Unit

This unit provides three different group recommendation techniques to evaluate the algorithms for groups of users.

Merging.

In merging, the top-n recommended items of each group member are merged into a single list, and then the top-n items of the merged list are recommended to the group [6, 8].

Multiplicative.

In multiplicative, an aggregated value is calculated by multiplying the predicted ratings obtained for each group member. Then the top-n items with the highest aggregated value are recommended to the group as a whole [8].

Merging-Multiplicative.

In this method, the top-n recommended items of each group member are first merged together and the top-n items are extracted from the merged list. Then a new aggregated value is calculated by multiplication and the items are re-ranked [8].
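A minimal sketch of the three techniques is shown below, assuming each member's individual predictions are already available. How merged items are ranked before the final cut is not specified above, so the sketch uses the maximum predicted rating as an illustrative choice; all names are hypothetical.

```python
from math import prod

# member_preds maps each group member to {item: predicted_rating}.

def top_n(scores, n):
    return [i for i, _ in sorted(scores.items(), key=lambda kv: -kv[1])[:n]]

def merging(member_preds, n):
    """Merge each member's top-n list, then take the top-n of the union."""
    pooled = {}
    for preds in member_preds.values():
        for item in top_n(preds, n):
            pooled[item] = max(pooled.get(item, 0), preds[item])
    return top_n(pooled, n)

def multiplicative(member_preds, n):
    """Rank items by the product of the members' predicted ratings."""
    items = set.intersection(*(set(p) for p in member_preds.values()))
    scores = {i: prod(p[i] for p in member_preds.values()) for i in items}
    return top_n(scores, n)

def merging_multiplicative(member_preds, n):
    """Merge first, then re-rank the merged items by the product."""
    merged = merging(member_preds, n)
    scores = {i: prod(p.get(i, 1) for p in member_preds.values()) for i in merged}
    return top_n(scores, n)

member_preds = {"u1": {"a": 4.5, "b": 3.9, "c": 2.1},
                "u2": {"a": 3.8, "b": 4.6, "c": 4.0}}
print(merging(member_preds, 2), multiplicative(member_preds, 2))
```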

4 Experimental Evaluation

We performed several experiments to analyze the performance of the proposed framework. The following issues are addressed:

  • How does the utilization of weighted contexts via PSO perform in the user neighborhood and item neighborhood models?

  • What are the effects of the sparsity measure variants that control the contributions of the local and global similarities?

  • Are the proposed algorithms reliable for groups of users?

4.1 Description, Parameter Setup and Evaluation Metrics

The experiments are conducted on two global datasets enriched with context features and especially designed for context-aware personalization research. The LDOS-CoMoDa dataset is from the movie domain, contains 30 features, and was collected through surveys [12]. The IncarMusic dataset was collected from https://github.com/irecsys/CARSKit/tree/master/context-ware_data_sets [4]. The summarized statistics of these datasets are given in Table 4.

Table 4. The statistics of datasets.

For implementation, only users who have rated at least three items are retained for experimentation. The filtered dataset is divided into three folds; one fold is used as the test set and the remaining two as the training set. The average of five runs is presented for all measures in the results. To measure predictive accuracy, the mean absolute error (MAE) and root mean square error (RMSE) are used. In addition, the recommended ranked list of the top 10 items is evaluated using Precision, Recall and F1-score. For both the IncarMusic and LDOS-CoMoDa datasets, an item is considered relevant (a hit) only if it is assigned a rating greater than or equal to 4 (on a scale of 1–5) by the active user. Each group recommendation technique is evaluated on five random groups. Moreover, the experiments are performed on two group sizes, i.e. small groups (SG) consisting of 3–5 users and large groups (LG) consisting of 6–8 users. Group recommendations are measured over five runs and the average F1-score is presented as the result.

4.2 Compared Methods

The experimental results shown below are compared with three other approaches to analyze the performance of the proposed algorithms presented in Sect. 3. We choose one context-aware recommendation approach, CAWP, from our previous research [10], and two approaches from the same research domain: DCR via BPSO [21, 23] and DCW via PSO [23].

CAWP.

In our previous research work, we addressed the dilemma of context selection by forming context communities and used a weighted percentile method to increase accuracy [10]. We implemented the concept in a user neighborhood model \( CAWP_{UB - ER} \) and an item neighborhood model \( CAWP_{IB - ER} \). We use the best cases, i.e. the 90th percentile for the movie dataset and the 70th percentile for the music dataset, for comparison.

DCR via BPSO.

Binary particle swarm optimization (BPSO) uses vectors of binary values, instead of real-valued vectors, to represent particle positions. BPSO has been successfully demonstrated as an efficient nonlinear optimizer for feature selection and is available in open source libraries [23]. Moreover, DCR via BPSO is described in [23] as the best technique for filtering context features.

DCW via PSO.

Instead of selecting a few features, DCW weights the contribution of each contextual feature. PSO optimizes the particle positions, which represent a weighting vector for the context features. PSO-based algorithms have also been found to outperform genetic algorithms used for the same purpose; hence PSO is used in the CF technique [23].

4.3 Results and Analysis

This section presents and discusses the experimental results of the proposed framework using LDOS-CoMoDa and IncarMusic datasets.

Method Comparisons

Table 5 presents the results of the proposed sparsity based weighted context recommendation technique and of the other context-aware implementations. Table 5 also shows that the proposed algorithms, whether user or item neighborhood based, outperform the other techniques in this area. The reason could be the use of optimum weights for the context features together with the sparsity dependent contribution of local and global neighbors. It is worth noting that the proposed algorithms also consider global similarities (i.e. neighbors who have not rated common items), which the other compared techniques do not.

Table 5. The computed values of MAE, RMSE, Precision, Recall and F1-score for different algorithms using two datasets.

Further, Fig. 3 compares the predictive accuracy of the recommendation system using only the local similarity versus the combination of local and global similarities with the best case of ϑ. On both datasets, the combination of local and global similarities performs better than the local similarity alone. This also verifies the assumption that two users can be similar even if they have not rated common items.

Fig. 3. Prediction accuracy (F1-score) using local similarity vs. local + global similarity on: (a) IncarMusic dataset (b) LDOS-CoMoDa dataset.

Figure 2(a) and (b) depict that the two variants of the proposed method (i.e. user neighbourhood based and item neighbourhood based) show a significant difference in terms of MAE and RMSE. A similar trend is seen in the F1-score (Table 5). The proposed algorithms reduce the MAE and RMSE values remarkably compared to the other baselines. The search space for BPSO is limited, since each value in the particle position can only switch between 0 and 1, i.e. a context feature is either included or excluded. In PSO, the search space is much larger, since each value can lie anywhere in the range [0, 1], i.e. every context feature takes some value in that range.

Hence, it can be concluded that the proposed algorithms, which find optimum weights for the context features using PSO and utilize the combination of local and global similarities, are the best performing ones. Also, the item neighbourhood based algorithm performs better than the user neighbourhood based algorithm.

Sensitivity Analysis of Sparsity Measure \( {\varvec{\upvartheta}} \)

Figure 4 compares the results w.r.t. the F1-score using the different sparsity measures \( \vartheta_{i} \), where \( i = 1, 2, 3 \) or 4. On the movie dataset, the prediction accuracy of \( \vartheta_{2} \) in terms of F1-score is better than the others. The reason might be that only a small local neighborhood is formed, since it is a user specific sparsity measure. A similar trend can be observed for both the user and item neighborhood algorithms. It is worth noting that on the IncarMusic dataset \( \vartheta_{1} \) performs better, as this dataset is comparatively less sparse and a rich set of local neighbors is obtained.

Fig. 4. Sensitivity of the sparsity measure \( \vartheta \) on F1-score for the datasets: (a) IncarMusic in user neighborhood (b) IncarMusic in item neighborhood (c) LDOS-CoMoDa in user neighborhood (d) LDOS-CoMoDa in item neighborhood.

Hence, we claim that the sparsity measure \( \vartheta \) can balance the local and global similarities more effectively, and that the best suited sparsity measure is dataset dependent.

Performance for Group Recommendations

Figure 5(a) and (b) illustrate that the Merging-Multiplicative technique is slightly better than the other grouping techniques, particularly with the user neighborhood algorithms on the music dataset. The reason could be that it is able to correct more errors. The F1-score values in Fig. 5 and Table 3 also reveal the effectiveness of the proposed algorithms for group recommendations.

Fig. 5. Performance of different group recommendation techniques in terms of F1-score. SG represents small groups and LG represents large groups. User neighborhood and item neighborhood algorithms are compared on two datasets: (a) IncarMusic (b) LDOS-CoMoDa.

5 Conclusions and Future Work

In this paper, we have attempted to alleviate the data sparsity problem and the issue of finding similarity in the absence of co-rated items for context-aware recommender systems. To attain these goals, we have proposed a novel framework that utilizes contextually weighted collaborative filtering techniques based on user and item neighborhood models. The aggregated similarity measure used by these algorithms takes a sparsity based contribution of local and global similarities to produce better quality predictions: the global similarity measure provides better predictions when data sparsity is high and can capture user rating behaviour, whereas the local similarity measure performs well when data sparsity is low. The algorithms are evaluated under different levels of sparsity. Also, the PSO technique is used to find the weights of the different context features that are utilized by different components of the algorithms. Assigning weights to contextual features, rather than selecting or matching them, also alleviates the data sparsity problem. The experimental results show that the balanced contribution of local and global similarities produces better accuracy than considering only the local similarity, and that PSO is an efficient optimizer for context weighting. Hence, the proposed algorithms increase predictive accuracy. Furthermore, the best suited variant of the sparsity measure is dataset dependent. The proposed algorithms are also reliable for group recommendations.

In the future, we aim to utilize fuzzy logic to better understand the results.