1 Introduction

The rapid increase in the amount of information over the World Wide Web has made it difficult to search and find objects that may be of interest to users. One solution to this information overload problem is the use of Recommender Systems (RSs). RSs intend to provide users with recommendations of products, services, and information they might like, taking into account their needs or preferences. In recent years, RSs have become increasingly popular and have been applied to diverse domains [1].

Two basic entities in all recommender systems are the user and the item. A user who utilizes the recommender system is called an active user. An active user provides her opinion about a variety of items, usually expressed in the form of ratings. The recommender system applies a filtering algorithm to the input ratings and generates suggestions about new items (i.e., target items) for the active user [2]. Collaborative Filtering (CF) is one of the most popular techniques in recommender systems. CF is the process of filtering items using the known preferences of a group of users. CF has been used fairly successfully in various domains. However, it has major limitations such as the data sparsity [3–5] and cold-start [6–9] problems. Data sparsity arises when the number of ratings obtained from users is very small compared to the number of ratings that must be predicted, so it becomes difficult to find a significant overlap between the items rated by two users [10]. Cold-start users are new users who have provided no ratings, or only a small number of them. CF fails to generate reliable recommendations for cold users due to the lack of sufficient initial ratings.

Besides these widely discussed problems, CF has another limitation: it is not adaptive to environments in which users have many different interests and, at the same time, items have completely different content. In such cases, CF provides poor recommendations because the target item for the active user may not be consistent with the common interests of her neighbors. This problem is called Multiple-Interests and Multiple-Content (denoted as MIMC) [11]. More specifically, in traditional CF methods, the similarities between users are computed based on all co-rated items, even those items that are not related to the target items. In this way, the neighbors of an active user will be identical for all target items. However, a reasonable assumption is that, for different predicted items, the neighbors of the same active user are likely to be different [12].

Due to the inherent problems with CF approaches, many researchers have shifted their attention to hybrid approaches that incorporate additional external information, such as demographic information [2], semantic information [10, 13], and trust information [14, 15]. More recently, semantic-based CF systems have been used successfully in different domains. Such systems can take advantage of semantic reasoning to provide much more reasonable recommendations for newly added items or in very sparse data sets [16]. In recent years, semantic-based recommender systems have been applied to several different domains such as health [17], tourism/leisure [13, 18], news [19], sound/movie/music [20–22], etc. Furthermore, as a new direction, several studies have suggested that incorporating social trust information into the traditional CF method can resolve the data sparsity and cold-start problems and improve the quality of recommendations [14, 23–29].

In the context of recommender systems, trust can be defined as “one’s belief toward others in providing accurate ratings relative to the preferences of the active user” [30]. In general, trust has a number of distinct properties [31]:

  i. Asymmetry: trust is personal and subjective. More specifically, if user u trusts user v, v does not necessarily trust u.

  ii. Transitivity: an important property of trust which says that if user u trusts v, and v trusts p, it can be inferred that user u trusts p to some extent. This property helps to identify new neighbors for an active user by propagating trust in the network.

  iii. Dynamicity: trust between two persons often changes over time. Trust can increase with positive experiences and decrease with negative experiences.

  iv. Context-dependency: trust is context-dependent, which means trust relations should be determined with respect to a particular situation. For example, a user who provides satisfying recommendations in the movie domain may not be an expert in the domain of digital cameras. The context can refer to the type of items that users rate or the conditions under which ratings are issued, such as the location of users or items. In this paper, contextual information refers to the content descriptions of items.

Trust-based recommender systems employ trust relationships between users in a social network, known as a trust network, to produce recommendations for users based on the people they trust. Within the trust network, trust can be modeled locally or globally. Global trust models compute the reputation of a user within the whole network. In other words, such models estimate how the community as a whole considers a certain user. In contrast, local trust models compute a user’s trustworthiness with respect to every other user [32]. Actually, local trust models derive trust values between users based on their ratings on co-rated items. In general, while local trust models can be more precise and personalized than global models, they are computationally more expensive [23]. Also, local trust models suffer from the cold-start problem [33] since it is difficult to identify trustworthy users from an insufficient number of co-rated items.

Two main trust-based filtering approaches have been adopted in the current literature: implicit trust and explicit trust. In the former approach, trust is inferred from user behaviors such as the provided ratings, whereas in the latter, trust is directly specified by users [30]. Although explicit trust-based systems tend to be more accurate than implicit ones, they require additional user effort, and thus, explicit trust statements may not always be available [24]. Therefore, the approach proposed in this paper focuses on implicit trust.

There are many trust inference approaches (in other words, trust metrics) proposed to calculate implicit trust from user ratings. Among the proposed approaches, some representative and popular trust metrics [14, 23, 25–29] have been studied by Guo et al. [31] in terms of the trust properties. Their study reveals that these metrics are not asymmetric since they derive trust values based on similarity or error measures which are symmetric in nature. Although all these metrics [14, 23, 25–29] are transitive, none of them explicitly consider the dynamicity and context-dependency properties of trust. In summary, the proposed approaches only partially cover the trust properties, and new metrics are needed to better satisfy the semantics of trust [31]. Among the proposed approaches, the work of Shambour and Lu [14] is relatively close to ours since it fuses the trust and semantic information of users and items within the CF framework. They proposed an innovative Trust–Semantic Fusion (TSF)-based recommendation approach, which merges two hybrid recommendation approaches: the user-based trust-enhanced CF and the item-based semantic-enhanced CF. The former approach utilizes trust information to alleviate the data sparsity and cold-start problems, whereas the latter approach employs the semantic features of items to address these problems. Therefore, in the TSF approach, two separate modules use the trust information and semantic information, and thus, trust values are computed independently of the semantic features of items. In contrast, in our proposed approach, trust inference relies on the items’ semantic descriptions in order to handle the context-dependency of trust. Notably, TSF satisfies only the transitivity property [31]. As will be shown in our experiments, ignoring the main properties of trust in TSF results in predictions with less accuracy and recommendations with lower quality compared to our approach.

It has been recently shown that by applying Ant Colony Optimization (ACO) algorithms [34, 35] to trust-based recommender systems, it is possible to handle the dynamicity of trust [36]. Ant colony algorithms are a family of meta-heuristic optimization algorithms inspired by the way ants seek food in nature. These algorithms, which combine randomized procedures with reinforcement learning, perform very well in dynamic environments. In nature, real ants aim to find the optimal path between a food source and their nest without direct communication, adapting to changes in the environment. One mechanism the ants benefit from is pheromone deposition. Ants are attracted by pheromones deposited by fellow ants of the same colony. As time passes, paths with higher pheromone levels are chosen with a higher probability than those with weaker pheromone deposits. This collaborative behavior among ants of the same colony resembles collaborative filtering, where people mostly collect opinions from their like-minded friends (or neighbors) [36].

In the context of trust-based CF recommender systems, ACO can be applied to a directed trust graph in order to search for optimal trust paths. Actually, ACO selects the paths with the maximal propagated trust values as the most trustworthy paths. These trustworthy paths help to identify the best neighbors of an active user because a chain of users with high propagated trust values can provide more precise opinions for the active user. To be more specific, the active user is considered as the ants’ nest and her trusted neighbors as the food sources. The artificial ants are dispatched from the active user node into the trust network to imitate the foraging behavior of real ants in search of a valuable food source. Dynamicity of trust, an important property for improving the quality of recommendations, can be effectively handled using the pheromone updating strategy of ants to analyze the trust intensity among users over time. Based on this strategy, the pheromone value associated with trustworthy neighbors increases, while this value for the other users decreases. The application of ACO to implicit trust-based filtering approaches has been studied in Bedi and Sharma’s work [36], which addressed time-based trust computation using the pheromone updating strategy. They proposed the Trust based Ant Recommender System (TARS), which produces valuable recommendations by incorporating the notion of dynamic trust between users and selecting the best neighborhood based on the biological metaphor of ant colonies. As time passes, TARS produces recommendations by continuously updating the dynamic trust between users. TARS satisfies the asymmetry, transitivity and dynamicity properties but not the context-dependency property. It also has some other limitations that will be briefly explained in Section 2.4.

In conclusion, the literature on trust-based CF systems reveals the absence of a recommender system that takes into account all the properties of trust, especially context-dependency, which has not yet been investigated empirically by any of the previous implicit trust-based filtering approaches. In order to fill this gap, we extend TARS, developed by Bedi and Sharma [36], and propose STARS (Semantic-enhanced Trust based Ant Recommender System), which satisfies all the properties of trust. Using semantic information and clustering items based on their semantic similarities, STARS incorporates context information in trust computation. More specifically, for each item cluster c, STARS creates a directed Implicit Trust Graph (ITG). An ITGc is a directed graph whose nodes are users (so we use the words “node” and “user” interchangeably in this paper). The edges are weighted according to the degree of trust from one user to another in the context of item cluster c. In order to find the best neighbors of an active user for a target item x, STARS applies ACO on ITGε, where ε refers to the cluster that item x belongs to. Thus, implicit trust values between users depend on the semantic features of the target item. In other words, for different target items, the trusted neighbors of the same active user may be different.

STARS works in two phases: offline and online. Starting at time t = t_0, it creates a set of initial ITGs, one for each item cluster, in the offline mode. In order to create the initial ITGs, STARS uses a global trust model. In the online phase, STARS first determines which semantic cluster the target item belongs to, and then uses the ITG associated with that cluster to run ACO for selecting the best neighbors of the active user. In order to control the maximum search depth, STARS uses a tunable trust propagation limit. After neighborhood formation, STARS predicts the active user’s ratings for target items and produces a list of recommendations. Finally, using the ACO pheromone updating strategy, STARS updates each ITG such that the pheromone increases for the trustworthy neighbors and decreases for the other users. It should be noted that this updating step is also performed offline.

In comparison with the research efforts found in the literature, our work makes the following contributions:

  • A novel implicit trust-based filtering approach, called STARS, which satisfies all the distinct properties of trust, especially, context-dependency. To the best of our knowledge, this is the first trust-based CF system that is asymmetric, transitive, dynamic and is able to leverage context information in trust computation.

  • A novel ant-inspired search algorithm for finding the best neighbors of an active user with respect to a specific target item x at each time step. This algorithm utilizes both trust and similarity information in the context of cluster ε that item x belongs to. Trust and similarity information is represented by pheromone levels on the edges and heuristic values of the nodes in ITGε, respectively.

  • A novel ant-inspired algorithm for updating the dynamic trust (through pheromone evaporation and deposition) between the active user and other users in ITGε at each time step. In this algorithm, the amount of pheromone to be deposited depends on: (1) the inferred trust values through the best trust paths in ITGε, and (2) the amount of confidence which is directly related to the number of co-rated items between two users.

  • A new method for considering both global and local trust in a recommender system. More specifically, STARS uses global trust at the initial stage (time t = t_0), and as time passes (time t > t_0), it locally updates the dynamic trust values between the active user and other users. Using global trust at the initial stage, the system can provide reasonable recommendations for cold users with a few or even no ratings. As time passes and the active user provides more information about her preferences (e.g., rates an item x), STARS locally adjusts the weights of her outgoing links in ITGε so that her most trusted neighbors have a higher probability of selection in the future.

Incorporating both global and local trust into CF, along with trust computation based on the semantic features of items, contributes to the success of STARS in alleviating the data sparsity, cold-start and MIMC problems of CF. An exhaustive set of temporal experiments on the MovieLens data sets empirically evaluated the performance of the proposed approach over time and demonstrated its advantages over benchmark algorithms. In order to incorporate time into our experiments, we used the rating timestamps available in the MovieLens data sets.

The rest of this paper is organized as follows. The related work is reviewed in the following section. In Section 3, the STARS approach and its components are elaborated. Section 4 demonstrates the experiments and their results. Finally, we present our conclusions and outline the future lines of research in Section 5.

2 Background

In this section, the subjects related to the proposed recommender system are explained. First, a short introduction to ACO is presented. Then, we review the literature on CF recommender systems and hybrid systems. The studied hybrid systems exploit additional sources of knowledge (such as trust relations between users and/or semantic features of items) to improve the performance of CF.

2.1 Ant colony optimization

Swarm intelligence is a computational and behavioral metaphor for problem solving that takes inspiration from the social behavior of insects or other animals. ACO is one of the most powerful optimization methods inspired by the foraging behavior of real ants [37]. In nature, ants deposit pheromone on the ground in order to communicate with each other. The deposited pheromone helps the ants find the shortest path between the nest and the food. More specifically, in searching for a food source, ants smell the pheromone left by previous ants of the same colony and tend to follow the paths marked by strong pheromone concentrations. In other words, ants choose their path by a probabilistic decision guided by the amount of pheromone: the larger the amount of pheromone on a trail, the higher the probability that ants follow that trail when choosing their path.

During the return trip, the amount of pheromone deposited on the trail depends on the quantity and quality of the food source. This indirect communication among ants helps them find the shortest path. Since shorter paths take less time to traverse, they are reinforced with a greater amount of pheromone. Therefore, the shorter paths become more favored over time. It should be noted that the pheromone gradually evaporates. Thus, longer paths lose their pheromone intensity and become less attractive over time. The final result is that the majority of the ants will quickly trace the shortest path between the nest and a food source [38–40].

Various ACO algorithms exploit a similar mechanism for solving optimization problems. Ant colony problems are usually modeled with a decision graph [37]. In an ACO algorithm, each artificial ant constructs a candidate solution by a sequence of probabilistic decisions. The decisions are biased by the amount of pheromone deposited on the edges, and available heuristic information. The sequence of decisions for constructing a solution can be viewed as a path through the corresponding decision graph. Finding good solutions for the problem is done in an iterative process. In each iteration, the solutions found by the ants guide the process of solution construction in the following iterations. More specifically, in each iteration, the pheromone trails are updated to guide the ants towards paths that are more likely to result in good solutions. This process continues until some stopping criterion is met. For example, stopping criteria could include reaching a maximum number of iterations or finding a solution of a given quality [41].
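To make the decision mechanism concrete, the following minimal Python sketch implements the standard ACO transition rule described above: the probability of moving to a candidate node is proportional to the pheromone on the connecting edge raised to α, times the node's heuristic value raised to β. All names and values are illustrative and not tied to any specific algorithm in this paper.

```python
import random

def choose_next_node(current, candidates, pheromone, heuristic, alpha=1.0, beta=2.0):
    """Standard ACO transition rule: pick the next node with probability
    proportional to pheromone[(current, n)]**alpha * heuristic[n]**beta."""
    weights = [(pheromone[(current, n)] ** alpha) * (heuristic[n] ** beta)
               for n in candidates]
    total = sum(weights)
    if total == 0:                      # no guidance yet: choose uniformly at random
        return random.choice(candidates)
    r, acc = random.uniform(0, total), 0.0
    for node, w in zip(candidates, weights):
        acc += w
        if r <= acc:
            return node
    return candidates[-1]               # guard against floating-point round-off

# Two candidate nodes reachable from node 'u'
pheromone = {('u', 'a'): 0.8, ('u', 'b'): 0.2}
heuristic = {'a': 0.5, 'b': 0.9}
print(choose_next_node('u', ['a', 'b'], pheromone, heuristic))
```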

2.2 CF recommender systems

CF is one of the most popular techniques in recommender systems [14]. CF approaches are divided into two categories: memory-based approaches and model-based approaches. In memory-based approaches, the entire rating matrix is used to make recommendations. In model-based approaches, a model is first derived from the previous ratings; the derived model is then used to make predictions [42]. Memory-based approaches can be further classified into two main classes: User-based CF (UCF) [43] and Item-based CF (ICF) [44]. In UCF approaches, a subset of users is chosen based on their similarity to the active user (this subset is called the neighborhood). Then, a weighted combination of the neighbors’ ratings is used to predict the ratings for the active user. ICF approaches are similar to UCF approaches, except that they employ the similarity between items instead of users [45]. Despite the popularity of CF approaches, they suffer from the data sparsity, cold-start and MIMC problems [11, 14, 30]. Among these problems, MIMC has received less attention in the existing works. This problem occurs when users are interested in a variety of items that have different content. In such cases, CF cannot provide accurate recommendations because the target item for the active user may not be consistent with the common interests of her neighbors [11]. The MIMC problem can be alleviated by considering the similarity between items when finding the neighbors of an active user. For example, Li et al. [11] proposed a hybrid approach integrating both UCF and ICF methods. This approach is able to filter out items dissimilar to the target item and select the neighbors of the active user based on items similar to the target. In [12], a new similarity function has been proposed to select neighbors who are more appropriate for each specific target item. In this function, the rating of a user on an item is weighted based on the similarity between that item and the target item.
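As a concrete illustration of the UCF prediction step described above, the following Python sketch computes a mean-centered, similarity-weighted prediction from a small neighborhood; the data structures and values are illustrative only.

```python
def predict_rating(active_user, item, neighbors, ratings, similarity):
    """UCF prediction: the active user's mean rating plus a similarity-weighted
    average of the neighbors' mean-centered ratings for the target item."""
    user_mean = sum(ratings[active_user].values()) / len(ratings[active_user])
    num = den = 0.0
    for v in neighbors:
        if item not in ratings[v]:
            continue                     # neighbor has not rated the target item
        v_mean = sum(ratings[v].values()) / len(ratings[v])
        num += similarity[(active_user, v)] * (ratings[v][item] - v_mean)
        den += abs(similarity[(active_user, v)])
    return user_mean if den == 0 else user_mean + num / den

ratings = {'u': {'i1': 4, 'i2': 5}, 'v': {'i1': 5, 'i3': 3}, 'w': {'i2': 2, 'i3': 4}}
similarity = {('u', 'v'): 0.9, ('u', 'w'): 0.4}
print(predict_rating('u', 'i3', ['v', 'w'], ratings, similarity))  # ≈ 4.12
```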

Many successful approaches have been developed over the past few years to alleviate the data sparsity and cold-start problems. These approaches, explained in the subsequent sections, usually rely on additional sources of knowledge, such as items’ semantic descriptions [10, 13] and/or users’ trust information [14, 15].

2.3 Semantic-based CF recommender systems

In recent years, reasoning techniques borrowed from the Semantic Web have been adopted in the context of recommender systems in order to overcome the data sparsity and cold-start problems [10]. Traditional syntactic-based recommender systems miss a lot of useful knowledge during the recommendation process. Therefore, their recommendations only include items very similar to those the user already knows. Semantic-based recommender systems can overcome this problem by inferring implicit semantic relationships between items [46]. The cornerstone of the Semantic Web is the use of taxonomies or ontologies to classify and describe concepts in a particular domain. Using product taxonomies and ontologies allows the system to reason about the semantics of items and to discover the hidden semantic associations between them [10]. Semantic-based CF approaches [10, 13, 20, 47, 48] provide two primary advantages over traditional CF approaches. First, the semantic attributes of items allow the system to make inferences based on the underlying reasons for which a user may or may not be interested in a particular item. Second, in the case of a new item or in very sparse data sets, the system can still use the semantic information to provide reasonable recommendations for users [16].

2.4 Trust-based CF recommender systems

As another way to alleviate the problems of traditional CF approaches (such as data sparsity and cold-start), trust information has been widely used in CF methods. The resulting hybrid systems typically explore the trust network, find a neighborhood of users trusted (directly or indirectly) by a user, and generate recommendations by aggregating their ratings [15]. As mentioned earlier, there are two main trust-based filtering methods: explicit trust and implicit trust. Since in explicit approaches trust values are obtained from pre-existing social links between users, the asymmetry of trust is always satisfied. In contrast, implicit approaches are often symmetric since they are based on similarity or error measures, which are symmetric in general [31]. According to [49], the performance of using implicit trust information is slightly worse than that of applying explicit trust information. Nevertheless, explicit trust-based filtering approaches encounter two major limitations [14]: (1) they require extra user effort to specify the trust statements, and accordingly, explicit trust statements may not always be available; (2) they suffer from the cold-start problem since new users must explicitly specify whom they trust before the filtering becomes effective. These limitations make the implicit trust-based filtering approaches more feasible to use [14]. Hence, the present work focuses on implicit trust.

Most trust-based recommendation approaches use the transitivity of trust and propagate trust to indirect neighbors in the social network. However, the dynamicity and context-dependency of trust have often been ignored in the proposed approaches. Here, we review some major and popular implicit trust-based filtering approaches. Due to space restrictions, we only present related works on implicit trust. To the best of our knowledge, there is no explicit trust-based recommender system that takes all the distinct properties of trust into account.

Implicit trust-based filtering approaches derive trust values based on users’ ratings on items. For instance, O’Donovan and Smyth [28] define the “profile-level” and “item-level” trust as the percentage of correct predictions that a profile has made “in general” or “with respect to a particular item”, respectively. Pitsilis and Marshall [29] proposed a model that implicitly derives a user’s trust from evidence describing her rating behavior. In the proposed model, trust is expressed in the form of an opinion, which is always subjective and uncertain. Papagelis et al. [26] developed a computational trust model that captures the subjective notion of trust associations by applying confidence and uncertainty properties. Lathia et al. [25] proposed a trusted k-nearest recommenders algorithm allowing users to learn how much to trust one another by evaluating the utility of the rating information they receive. Hwang and Chen [23] proposed an implicit trust metric deriving trust scores directly from the rating data based on the users’ prediction accuracy in the past. Yuan et al. confirmed the small-world property of the trust network [50] and developed an implicit Trust-Aware Recommender System (iTARS) [27] based on this property, which indicates that the trust propagation distance between any two randomly selected users of the trust network is short. Shambour and Lu [14] proposed the TSF recommendation approach, which provides more effective results, in terms of prediction accuracy and coverage, compared to user-based and item-based benchmark algorithms. As mentioned before, this approach fuses the “user-based trust-enhanced CF” and the “item-based semantic-enhanced CF” approaches. The previously mentioned implicit trust metrics do not handle dynamicity, context-dependency and asymmetry [31]. The recent work conducted by Fang et al. [51] takes into account the context-dependency of trust. The authors focused on predicting implicit trust and distrust values according to interpersonal and impersonal aspects of trust and distrust. In their proposed model, interpersonal aspects are computationally modeled based on users’ historical ratings, while impersonal aspects are computed on the basis of users’ explicit trust and distrust network. In this model, competence, as one of the interpersonal trust aspects, is computed under a specific context. In other words, a user receiving a high competence belief from the trustor is capable of providing satisfactory recommendations to the trustor in a specific context. However, the authors did not take the context information into account in their experiments. Also, the dynamic aspect of trust was not considered in their model.

It has been recently shown that by applying ACO to trust-based recommender systems, it is possible to handle the dynamicity of trust [36, 52, 53]. This property can be viewed as the trust intensity between users. The trust intensity between an active user and each of her neighbors may change depending on the recommendations generated by the neighbor. The pheromone updating strategy in ACO algorithms can be used to analyze the trust intensity among users over time. Based on this strategy, the pheromone value associated with trustworthy neighbors increases, while this value for the other users decreases [36]. The first successful application of ACO in the context of trust-enhanced recommender systems is T-BAR (Trust-Based Ant Recommender) [52]. T-BAR is a dynamic algorithm based on the probabilistic model of ACO algorithms, whose ability to increase the accuracy and coverage of predictions has been demonstrated. In T-BAR, explicitly expressed trust values are used as heuristic information. In addition, using the trust information in the network, the pheromone level of each edge is locally initialized before an ant encounters it. However, this algorithm is unable to deal with cold users. To overcome this problem, the authors proposed DT-BAR (Dynamic T-BAR) [53], a dynamic trust-based recommender that solves the new-user problem by allowing the ants to share information about the traversed edges.

The application of ACO in the context of implicit trust-based filtering approaches has been studied in TARS [36]. TARS produces valuable recommendations by incorporating a notion of dynamic trust between users and selecting the best neighborhood based on the biological metaphor of ant colonies. In TARS, the pheromone level on the edges represents the strength of connectedness, i.e., the trust intensity between the two recommendation partners, while the heuristic values depend on the level of connectedness from the active user node to another node chosen by the ants. The initial pheromone level is computed by combining similarity and confidence measures. The combination of similarity with confidence reduces data sparsity and creates asymmetric trust values. As time passes, TARS produces recommendations by continuously updating the dynamic trust between users.

Nevertheless, the major limitation of TARS stems from the fact that it performs a modified breadth-first search in the trust network to generate recommendations. More specifically, TARS initially selects the best neighborhood only among the direct neighbors (i.e., users at distance 1 from the active user). The active user’s rating for an item is generated by aggregating the ratings of the selected neighbors for that item. If some items are not rated by any of the direct neighbors, then TARS moves to the next level and explores the child nodes of the most trustworthy user. This process is repeated until the ratings of all unrated items are predicted. Because the search proceeds in a breadth-first manner, if direct neighbors have rated an item, TARS is unable to use potentially valuable ratings of users without a direct trust link to the active user. Another limitation of this approach is that the use of a distance metric in computing heuristic values is not effective in distinguishing between trusted and untrusted friends in a breadth-first search process. Also, TARS does not take into account changes in user interests over time. Actually, it considers user interest profiles only during the initial stage (i.e., at time t = t_0), in which it uses the similarity between profiles for computing the initial trust values. After that, at time t > t_0, it does not consider new or changed interests of the active user; it updates the trust information only based on the involvement or non-involvement of other users as recommenders for the active user over a period of time. As the final shortcoming, TARS does not take context information into account in its trust model.

As discussed, the literature on trust-based CF systems reveals the absence of a recommender system that takes all the distinct properties of trust into account. In order to fill this gap, we propose a novel implicit trust-based filtering approach, called STARS, which is asymmetric, transitive, dynamic and able to leverage context information in trust computation. Using ant colony optimization, STARS performs a depth-first search for the optimal trust paths in the trust network and selects the best neighbors of the active user. In this approach, trust can be passed from one member to another in the trust network, creating trust chains, based on the propagative and transitive nature of trust. STARS considers contextual information by inferring trust values based on the semantic descriptions of items. This approach also handles the asymmetry and dynamicity properties using the pheromone updating strategy. Based on this strategy, at each time step, the pheromone value associated with the best neighbors of the active user in a specific context is increased, while this value for the other users is decreased. As will be shown in the following sections, STARS adapts itself to dynamically changing user interests and mitigates the data sparsity, cold-start and MIMC issues.

3 STARS: semantic-enhanced trust based ant recommender system

STARS is a dynamic recommender system which considers contextual (i.e., content descriptions of items) and temporal information for selecting the most trustworthy neighbors of the active user according to her current interests in specific types of items. To infer context-dependent trust values, STARS uses the semantic descriptions of items. Actually, STARS clusters items based on their semantic similarities and finds trusted neighbors of an active user with respect to a specific cluster. Let I = {i_1, i_2, ..., i_m}, m > 1, be a given set of items and U = {u_1, u_2, ..., u_n}, n > 1, be a given set of users. By clustering items based on their semantic similarities, we have a set of z item clusters C = {c_1, c_2, ..., c_z}, z > 1, such that each cluster c_j contains at least one item (i.e., |c_j| ≥ 1, where |c_j| denotes the number of items in cluster c_j). Each cluster represents a specific context. In the rest of the paper, wherever we mention a context c, it refers to cluster c.

Temporal information is incorporated into STARS by exploiting the timestamps of ratings. In other words, STARS observes the ratings over time. Let R(t) be an n×m user-item rating matrix which stores the ratings made on and before time slot t. Each element r_{u,i} of this matrix represents the rating of item i by user u. Besides, each r_{u,i} is associated with a timestamp. Let matrix S(t), with the same size as R(t), contain the corresponding timestamps of the ratings. Each element t_{u,i} of matrix S(t) is the timestamp of the rating made by user u on item i. So, as a dynamic recommender system, STARS continuously collects users’ feedback over a long period of time. In this paper, we use time values with day granularity. Starting at time t_0 = x (i.e., x days from the first rating), STARS is iteratively updated every μ days. So, the length of the first time slot (i.e., time slot t_0) is equal to x days, while the length of the other time slots is equal to μ days. At each time step t, the task of STARS is to predict the ratings of the active user for target items at time slot t+1, produce a list of top-N recommendations, and update the dynamic trust values for the next time slot.
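Under our reading of this time-slot scheme, the following small Python sketch maps a rating's day offset (measured from the first rating in the data set) to a slot index; the concrete values of x and μ used in the example are hypothetical.

```python
def time_slot(day_offset, x, mu):
    """Map a rating's age in days (counted from the first rating in the data
    set) to a time-slot index: slot t0 covers the first x days, and every
    subsequent slot covers mu days."""
    if day_offset < x:
        return 0
    return 1 + (day_offset - x) // mu

# Hypothetical example: a 90-day initial slot followed by 30-day slots
print(time_slot(10, x=90, mu=30))    # -> 0 (initial slot t0)
print(time_slot(95, x=90, mu=30))    # -> 1
print(time_slot(160, x=90, mu=30))   # -> 3
```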

As mentioned before, STARS leverages context information in neighborhood formation at time slot t by considering users’ interest in each item cluster. In other words, STARS infers multiple trust relationships between two users, each of which corresponds to a specific context. For this purpose, STARS splits the rating matrix R and the timestamp matrix S into z sub-matrices, each of which corresponds to a cluster of items. Considering cluster c_j, STARS produces an n×|c_j| sub-matrix, called R^{c_j}(t), which contains user ratings for all items in cluster c_j up to time t. It also produces a sub-matrix S^{c_j}(t), with the same size as R^{c_j}(t), which contains the corresponding timestamps of the ratings. At the initial time slot t_0, STARS uses R^{c_j}(t_0) in order to compute the initial global trust values between users in the context of cluster c_j. Using a global trust model at the initial stage, users can benefit from the opinions of globally trusted users. This is especially useful for cold users who have rated only a small number of items. As time passes and users provide information about their preferences for items belonging to cluster c_j, STARS uses R^{c_j}(t) and S^{c_j}(t) to locally update the dynamic trust values between users in this context.

Using the ACO algorithm, STARS uses the following information for selecting the most trustworthy neighbors of the active user in a specific context c at each time slot:

  1) The previously learned trust knowledge based on the global reputation and involvement of users in generating recommendations in context c in the past. This knowledge is memorized in the form of pheromone trails. Based on this knowledge, users who have been selected more frequently as the trusted neighbors of an active user have a higher probability of selection compared to other users.

  2) The similarity between users according to their current interests in items belonging to context c. This knowledge is represented as the heuristic value of nodes (users) in a trust network during the neighborhood selection process. In order to compute the similarities between users, STARS exploits the timestamps of ratings. The reason is that STARS is a dynamic recommender system, and thus, the existing users’ preferences are not static and may change over time. Since implicit trust is positively correlated with user interests, changes in users’ preferences affect their tendencies to trust or not to trust others. To be able to adapt to such changes, when recommendations are requested, STARS computes the similarity between users using a temporal relevance measure [54]. This measure gives more importance to recent observations and reduces the effect of old ratings, since recent ratings better reflect the user’s current interest [54]. The utilization of the time-based similarity information (as the heuristic values) helps STARS to select more accurate neighbors and better update the trust pheromone values according to the users’ current preferences. So, STARS is able to adapt to dynamically changing user interests and select the most trustworthy neighbors of the active user according to her current interests in specific types of items.

In the following sections, the architecture of STARS is described, and each of its components is discussed in detail. Then, the computational complexity of STARS is analyzed.

3.1 The architecture of STARS

As shown in Fig. 1, STARS contains two main components: the database and the recommendation engine. The first component involves the development and storage of the item ontology and data structures. The item ontology helps to classify, describe and interrelate the universe of relevant concepts in a specific domain [10]. The second component generates a list of top-N recommendations for users. It enables the system to reason about the semantic descriptions of the available items and to infer hidden semantic relationships between them [46]. The recommendation engine has three modules, namely the preprocessor, the recommender and the trust network updater. Initially, at time t = t_0, the preprocessor module, which works in the offline mode, computes the similarity between every two items based on their semantic descriptions, as given in the item ontology. Then, the preprocessor module clusters the items according to their semantic similarities using the k-medoids algorithm [55]. Finally, this module creates a set of initial ITGs based on the global trust values between users in each item cluster. In the online mode, at each time slot t, the recommender module first retrieves context-specific data (i.e., trust and rating data) and runs ACO to select the best neighbors of the active user in each context. After neighborhood formation, it predicts the unknown ratings of the active user and produces a list of recommendations. Finally, the third module uses the ACO pheromone updating strategy to update the ITG related to each context; this module works offline. As a result of this update, the pheromone increases for the trustworthy neighbors and decreases for the other users in that context. In the following sections, we describe each module of the recommendation engine in detail.

Fig. 1 The architecture of the STARS approach

3.1.1 Preprocessor module

The aim of this module is to incorporate context information into STARS using the semantic profiles of items. For this purpose, this module first generates item clusters based on their semantic similarities and then creates an ITG for each cluster. The main intuition behind this idea is that the trust value between two users depends on the content of the items involved. For example, a user who provides valuable recommendations for purchasing cars may not be an expert in the movie domain. In other words, a trustworthy user is capable of providing satisfactory recommendations to the trustor only in specific contexts [51].

Calculating implicit trust values based on the semantic features of items makes STARS adaptive to multiple-interests environments. Actually, the implicit trust is positively correlated with user interests. Thus, when users have many different interests, and items have completely different content, multiple trust relationships may exist between users. Each of these trust relationships corresponds to a different context. By applying context-dependent trust relationships, selected neighbors of a specific active user may be different with respect to different target items, which results in alleviating the MIMC problem.

In order to create initial ITGs, the preprocessor module uses a global trust model. Global trust models predict a global “reputation” score that estimates how the community as a whole considers a certain user [32]. The application of the global trust model at the initial stage helps to provide more neighbors for every single user. This method is useful when there is not enough rating information (such as in the case of new users or very sparse data sets).

Semantic similarity calculation

In order to utilize the semantic information of items, we first have to represent item characteristics with a domain ontology. The item ontology implemented in our system includes the typical concepts and relationships of the movie domain. Although our approach can potentially be used for other domains with different item ontologies, it has been implemented in the movie domain because it is a well-known domain with a large number of relationships between the concepts involved (such as movies, actors, directors, writers, genres, etc.). We use the Movie Ontology, which has been developed according to the Web Ontology Language (OWL) standard by the University of Zurich. The Movie Ontology provides a controlled vocabulary to semantically describe movie-related concepts (such as Actor, Director, Genre, etc.) and their associated individuals. Through this ontology, it is possible to link, hierarchically and non-hierarchically, elements belonging to the domain of movies. The main class of the ontology is the class “Movie”; all movies are instances of this class. To instantiate the Movie Ontology, we use the Internet Movie Database and gather the required data using a web crawler.

The similarity between two items a and b is computed based on their semantic descriptions, as given in the item ontology. For this purpose, we use the semantic similarity formula presented by Carrer-Neto et al. [56]:

$$\begin{array}{@{}rcl@{}} Semsim(a,b)&=&\sum\limits_{i=1}^{\left| {\mathrm{P}} \right|} {\left({\frac{\text{common}(a,b,P[i])}{\max(\text{deg}(a,P[i]),\text{deg}(b,P[i]))}} \right)}\\ &&\times Weight(P[i]) \end{array} $$
(1)

where P is a vector that contains a set of datatype properties and object properties of the Movie class representing the “target” of the recommender engine; deg(a,p) represents the number of instances associated with item a through property p; common(a,b,p) denotes the number of common instances associated with items a and b through property p; and Weight(p) indicates the importance of property p. The weights of the properties should be determined subjectively according to the given domain. For example, in the movie domain, the Genre of a movie is more important than its filming locations.

The main datatype and object properties that are used in our system are as follows: belongsToGenre, hasActor, hasDirector, hasProducer, isAwardedWith, nominatedFor, hasFilmLocation, isProducedBy, and isFromDecade. For detailed information about these properties, refer to Carrer-Neto et al. [56] (Section 3.1.1).
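To illustrate Eq. 1, the following Python sketch computes the semantic similarity of two toy movies represented as mappings from properties to sets of instances; the property weights and movie descriptions are invented for the example.

```python
def semsim(a, b, properties, weights):
    """Eq. 1: for each ontology property, the fraction of instances the two
    items share (relative to the larger of their instance counts), weighted by
    the property's importance.  Items are dicts: property -> set of instances."""
    score = 0.0
    for p in properties:
        inst_a, inst_b = a.get(p, set()), b.get(p, set())
        if not inst_a and not inst_b:
            continue                          # property unused by both items
        common = len(inst_a & inst_b)
        score += (common / max(len(inst_a), len(inst_b))) * weights[p]
    return score

movie1 = {'belongsToGenre': {'Drama', 'Crime'}, 'hasActor': {'Al Pacino'}}
movie2 = {'belongsToGenre': {'Drama'}, 'hasActor': {'Al Pacino', 'Robert De Niro'}}
weights = {'belongsToGenre': 0.6, 'hasActor': 0.4}
print(semsim(movie1, movie2, ['belongsToGenre', 'hasActor'], weights))  # -> 0.5
```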

Item clustering

For the semantic clustering of items, we have adopted the k-medoids algorithm [55] due to its simplicity and high accuracy. Similar to the k-means algorithm [57], k-medoids is a partition-based clustering algorithm. However, in contrast to k-means, k-medoids chooses actual objects as cluster centers (medoids) instead of taking the mean value of the objects, and it can work with an arbitrary matrix of distances between objects. Since the k-medoids algorithm minimizes the summation of pairwise distances within a cluster, it is more robust to noise and outliers than the k-means algorithm. Moreover, k-medoids is generally not influenced by the presentation order of the objects. In this paper, we have adopted the k-medoids algorithm in order to preserve the items’ semantic information during the clustering process, described as follows.

First, the k-medoids algorithm randomly selects k items as the initial medoids. Then, it proceeds by alternating between two steps. In the first step, each item is assigned to the cluster associated with the nearest medoid. In particular, the semantic similarity is used as the distance metric to measure the closeness of two items: the distance between two items a and b, denoted by dist(a,b), is computed as dist(a,b) = 1 − Semsim(a,b). In the second step, within each cluster, each non-medoid item is tentatively swapped with the medoid; if the sum of within-cluster distances decreases when a non-medoid item serves as the medoid, that item is chosen as the new medoid. The algorithm repeats these steps until the medoids become fixed.
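For illustration, the following Python sketch implements this plain k-medoids procedure over a precomputed distance dist(a, b) = 1 − Semsim(a, b); it is a simplified illustration, not the exact clustering configuration used in STARS.

```python
import random

def k_medoids(items, dist, k, max_iter=100, seed=0):
    """Plain k-medoids over an item distance dist(a, b) = 1 - Semsim(a, b)."""
    rng = random.Random(seed)
    medoids = rng.sample(items, k)
    for _ in range(max_iter):
        # Assignment step: each item joins the cluster of its nearest medoid.
        clusters = {m: [] for m in medoids}
        for it in items:
            clusters[min(medoids, key=lambda m: dist(it, m))].append(it)
        # Update step: within each cluster, the member minimizing the sum of
        # within-cluster distances becomes the new medoid.
        new_medoids = []
        for m, members in clusters.items():
            if not members:
                new_medoids.append(m)
                continue
            new_medoids.append(min(members, key=lambda c: sum(dist(c, o) for o in members)))
        if set(new_medoids) == set(medoids):
            break                              # medoids are fixed: converged
        medoids = new_medoids
    return clusters

# Toy usage: item ids 0..5 with a stand-in distance in place of 1 - Semsim(a, b)
items = list(range(6))
print(k_medoids(items, lambda a, b: abs(a - b) / 5.0, k=2))
```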

Generating initial trust graphs

After generating the item clusters, the preprocessor module creates a directed implicit trust graph for each cluster. The implicit trust graph ITGc(t) is a directed graph where the nodes are users and the edges are implicit trust relationships. In this graph, the edges are weighted according to the degree of trust between each pair of users at time slot t, considering only the items belonging to cluster c. In order to create the initial ITGs, a global trust model is used. A user’s global trust can be computed as the average of the local trust scores given by the direct neighbors of the user [23].

As described before, STARS derives the implicit trust values from context-related data. At time t = t_0, the initial ITGc_j(t_0) is created based on the available ratings in sub-matrix R^{c_j}(t_0). Each ITGc_j(t_0), j ∈ {1, 2, ..., z}, is created through the following steps:

  • Step 1: Normalize rating data

Because ratings are determined not only by user interests but also by the rating habits of users, it is important to normalize the ratings of different users to the same scale [58]. In STARS, user ratings are normalized into the range [0,1] using the Min–Max Normalization method. This method linearly transforms an original rating value r of data set X to a new value r′ in the range [0,1] as follows:

$$ {r}^{\prime}=\frac{r-\min(\mathrm{X})}{\max(\mathrm{X})-\min(\mathrm{X})} $$
(2)

where min(X) and max(X) denote the minimum and the maximum value of ratings in data set X, respectively.
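As a direct transcription of Eq. 2, the following snippet rescales a toy list of ratings into [0, 1]:

```python
def min_max_normalize(ratings):
    """Eq. 2: linearly rescale ratings into [0, 1]."""
    lo, hi = min(ratings), max(ratings)
    if hi == lo:
        return [0.0 for _ in ratings]   # degenerate case: all ratings identical
    return [(r - lo) / (hi - lo) for r in ratings]

print(min_max_normalize([1, 3, 5, 4]))  # -> [0.0, 0.5, 1.0, 0.75]
```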

  • Step 2: Compute the initial local trust

Local trust models consider the personal and subjective views of the users and predict personalized trust scores from each single user’s point of view [32]. In this step, STARS uses the rating matrix and calculates the direct implicit trust score of every pair of users. In particular, STARS derives the local trust score by averaging the prediction error on co-rated items. Therefore, if two users have no co-rated item, then there is no direct trust relationship between them. STARS modifies the metric proposed by Shambour and Lu [14] for computing the implicit trust values. This metric [14] uses the Mean Squared Differences (MSD) method [59] to measure the degree of trust between two users by averaging the prediction error on co-rated items. Also, in this metric, the proportion between the commonly rated items and the total rated items is taken into consideration to derive the implicit trust values [14].

Let I_u^{c_j}(t_0) be the set of items in cluster c_j rated by user u until time t_0, and let |I_u^{c_j}(t_0)| denote the number of elements in this set. According to the available normalized ratings in sub-matrix R^{c_j}(t_0), the direct trust score assigned by user u to user v in the context of cluster c_j at time t = t_0, trust_{u→v}^{c_j}(t_0) ∈ [0,1], is computed as:

$$\begin{array}{@{}rcl@{}} &&trust_{u\to v}^{c_{j}} (t_{0} )=\left({1-\frac{\sum\limits_{i\in D\cap E} {(p_{u,i}^{v} -r_{u,i} )^{2}}} {\left| {D\cap E} \right|}} \right)\\ &&\quad\times \frac{\left| {D\cap E} \right|}{\left| D \right|+\left| E \right|-\left| {D\cap E} \right|},\,\,\,\,D=I_{u}^{c_{j}} (t_{0} ),\,\,\,E=I_{v}^{c_{j}} (t_{0})\\ \end{array} $$
(3)

where D∩E is the set of items in cluster c_j that have been rated by both users u and v, and p_{u,i}^v is the predicted rating of item i for user u obtained by considering only the neighbor user v; p_{u,i}^v is calculated by:

$$ p_{u,i}^{v} =\overline {r_{u}} +(r_{v,i} -\overline {r_{v}} ) $$
(4)

where \(\overline {r_{u}}\) and \(\overline {r_{v}}\) are the mean ratings of users u and v for items belonging to cluster c_j, respectively.
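Putting Eqs. 3 and 4 together, the following Python sketch computes the direct trust score that one user assigns to another from their normalized ratings within a cluster; the toy rating profiles are illustrative only.

```python
def local_trust(ratings_u, ratings_v):
    """Eqs. 3-4 on normalized ratings in [0, 1]: one minus the mean squared
    prediction error on co-rated items, scaled by the Jaccard overlap of the
    two users' rated-item sets.  Arguments map item -> normalized rating."""
    common = set(ratings_u) & set(ratings_v)
    if not common:
        return 0.0                      # no co-rated items: no direct trust link
    mean_u = sum(ratings_u.values()) / len(ratings_u)
    mean_v = sum(ratings_v.values()) / len(ratings_v)
    msd = sum((mean_u + (ratings_v[i] - mean_v) - ratings_u[i]) ** 2
              for i in common) / len(common)
    jaccard = len(common) / (len(ratings_u) + len(ratings_v) - len(common))
    return (1 - msd) * jaccard

u = {'i1': 0.75, 'i2': 1.0, 'i3': 0.5}
v = {'i1': 1.0, 'i2': 0.75, 'i4': 0.25}
print(local_trust(u, v))   # trust assigned by u to v in this cluster, ≈ 0.465
```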

  • Step 3: Compute the initial global trust

In this step, STARS computes the global trust score of each user v as the average of the local trust scores given by the direct neighbors of this user in the trust network [23]. So, in the context of cluster c_j, the global trust score of user v at time t = t_0, denoted \(Gtrust_{v}^{c_{j}}(t_{0})\in [0,1]\), is:

$$ Gtrust_{v}^{c_{j}} (t_{0} )=\frac{1}{\left| {NB_{v}^{c_{j}} } \right|}\sum\limits_{u\in NB_{v}^{c_{j}} } {trust_{u\to v}^{c_{j}} (t_{0} )} $$
(5)

where \(NB_{v}^{c_{j}}\) is the set of direct neighbors of user v in context c_j.

  • Step 4: Create the initial directed ITG

In this step, the initial ITGc_j = (V, E) is created, where V is the set of vertices corresponding to the users and E is the set of edges connecting them. The weight of each incoming link to a node v is equal to the global trust score of that node:

$$ \forall u\in U,\,\,u\ne v\,\,\,W_{uv}^{c_{j}} (t_{0} )=Gtrust_{v}^{c_{j}} (t_{0} ) $$
(6)

where \(W_{uv}^{c_{j}}(t_{0})\) denotes the weight of the link from node u to node v in ITGc_j(t_0). Therefore, at the beginning, STARS assigns the same trustworthiness value to user v from the perspective of every other user. As time passes and more data is collected about the users’ preferences for items belonging to cluster c_j, STARS locally adjusts the weights of the corresponding links in this graph.
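Combining Eqs. 5 and 6, the following sketch (reusing the local_trust function from the previous sketch) computes each user's global trust score as the average local trust received from users who share at least one co-rated item, and weights every incoming edge of the initial graph with that score; the toy data are illustrative.

```python
def initial_trust_graph(users, cluster_ratings):
    """Eqs. 5-6: a user's global trust score is the average of the local trust
    scores assigned by her direct neighbors (users sharing at least one
    co-rated item in the cluster); every incoming edge of the initial ITG is
    then weighted with that global score."""
    gtrust = {}
    for v in users:
        scores = [local_trust(cluster_ratings[u], cluster_ratings[v])
                  for u in users
                  if u != v and set(cluster_ratings[u]) & set(cluster_ratings[v])]
        gtrust[v] = sum(scores) / len(scores) if scores else 0.0
    # Directed edge u -> v initially carries v's global trust score (Eq. 6).
    return {(u, v): gtrust[v] for u in users for v in users if u != v}

cluster_ratings = {'u': {'i1': 0.75, 'i2': 1.0}, 'v': {'i1': 1.0, 'i3': 0.5},
                   'w': {'i2': 0.5, 'i3': 0.25}}
edges = initial_trust_graph(['u', 'v', 'w'], cluster_ratings)
print(edges[('u', 'v')], edges[('w', 'v')])   # identical initial weights (Eq. 6)
```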

3.1.2 Recommender module

This module is in charge of retrieving context-specific data, analyzing the retrieved data to select the best neighborhoods using the ant colony metaphor, and suggesting matching items that users might like. In the following subsections, the process of this module is described in detail.

Retrieving context-specific data

In the online phase, in order to create a list of top-N recommendations for active user u, the appropriate trust and rating data are first retrieved from the database. Let \(\tilde {{I}}_{u} (t)=\{i\left | {\,i\in I\,\text {and}\,r_{u,i} =null} \right .\}\) be the set of target items that user u has not rated up to time slot t, and \(\tilde {{C}}_{u} (t)=\{c\left | {\,c\in C\,\text {and}\,(\tilde {{I}}_{u} (t)\cap c)\ne \emptyset } \right .\}\) be the set of target clusters, each of which contains at least one target item \(i\in \tilde {{I}}_{u} (t)\). The context-related data for user u at time slot t are as follows:

  (1) \(T_{u} (t)=\{\text {ITG}^{c}(t)\left | {\,c\in \tilde {{C}}_{u} (t)\}} \right .\), a set of trust graphs required for generating recommendations for active user u at time slot t, such that each graph corresponds to a target cluster c.

  (2) \(R_{u} (t)=\{\text {\textbf {R}}^{c}(t)\left | {\,c\in \tilde {{C}}_{u} (t)\}} \right .\), a set of rating sub-matrices required for generating recommendations for active user u at time slot t, such that a sub-matrix R^c(t) contains the available normalized user ratings, up to time t, for items in a target cluster c.

  (3) \(S_{u} (t)=\{\text {\textbf {S}}^{c}(t)\left | {\,c\in \tilde {{C}}_{u} (t)\}} \right .\), a set of timestamp sub-matrices required for generating recommendations for active user u at time slot t, such that a sub-matrix S^c(t) contains the corresponding timestamps of the ratings in R^c(t).

This contextual data is the key input to our ant-inspired neighborhood selection algorithm as detailed in the next subsection.

Selecting the best neighborhood using ACO

In this step, the recommender module selects the best neighborhood of active user u for each target item \(i\in \tilde {{I}}_{u} (t)\). For this purpose, STARS uses a novel ant-inspired search algorithm which performs a depth-first search in the trust network to select the best neighbors of the active user. In the proposed algorithm, trust can be passed from one member to another in a trust network, creating trust chains, based on its propagative and transitive nature. In each context \(c\in \tilde {{C}}_{u} (t)\), the trust value between active user u and each of the other users is iteratively learned and memorized as the pheromone value of the edge connecting their nodes in ITGc(t). During the search process, the heuristic value of a node depends on the similarity between this node and the active user node. This similarity is calculated with respect to users’ current interests in context c. To place more emphasis on the recent ratings, STARS incorporates the temporal relevance of the ratings into the similarity computation. Using the available data in sub-matrices R c(t) and S c(t), STARS measures time-weighted similarities in context c. Therefore, STARS relies on both trust and similarity information to form the neighborhood of active user u in context c at time t. Actually, in each node a∈ITGc(t), the probability of choosing the next node b∈ITGc(t) depends on both “the trust value assigned by node a to node b” and “similarity between node b and the active user node in this context”.

The utilization of the similarity information, as the heuristic knowledge about the nodes, along with the learned trust knowledge, memorized in the form of pheromone trails, results in some advantages:

  (1) In a sparse rating matrix, the propagation of trust over the network helps to alleviate the data sparsity problem by providing extra information for neighborhood selection.

  (2) The utilization of global trust values allows STARS to provide more trusted neighbors for cold users with a few or even no ratings.

  (3) The utilization of the time-weighted user similarities helps STARS to adapt itself to dynamically changing user interests. Actually, as long as the active user’s rating data is insufficient, she benefits from the opinions of globally trusted users. As time passes and the active user provides more information about her preferences, the probability of selecting her locally similar neighbors increases, and consequently, the pheromone value associated with the selected neighbors increases. Whenever the active user’s interests change, the heuristic knowledge reflects this change and helps to select more accurate neighbors.

In the proposed ant-inspired neighborhood search algorithm, selection of the best solutions is accomplished by computing the “path trust” [53] for each constructed solution. The path trust is a function of the number of co-rated items and the trust value between two adjacent nodes in ITGc(t). The detailed steps of this algorithm are shown in Algorithm 1.

Algorithm 1 (pseudocode figure)

The input parameters of Algorithm 1 are as follows: the set of users U, target items \(\tilde {{I}}_{u} (t)\), target clusters \(\tilde {{C}}_{u} (t)\), the contextual trust data T u (t), the rating data R u (t), and the timestamp data S u (t) for active user u, the number of ants (γ), the time decay weight (𝜃), the maximum trust propagation distance (φ), the maximal number of the best trust paths (bpn), the relative importance of the pheromone trail (α) and heuristic value (β). As will be detailed later, parameter 𝜃 is a decay factor that emphasizes users’ recent preferences. φ is a tunable parameter that controls the maximum distance from the active user over which trust is propagated. Parameter bpn is used for identifying the best solutions (i.e., solutions with a high path trust value). Finally, α and β are two parameters that control the relative importance of trust (i.e., the pheromone value) versus similarity (i.e., the heuristic information) during neighborhood formation. Proper values of these parameters are selected by performing sensitivity analysis (see Section 4.1).

Given these inputs, the algorithm finds the best neighbors of active user u at time t for each target item \(i\in \tilde {{I}}_{u} (t)\). For this purpose, in each context \(c\in \tilde {{C}}_{u} (t)\), it first identifies the best trust paths that start from node u in ITGc(t) and creates a sub-graph from the identified best paths, called \(\text {TG\_Best}_{u}^{c} (t)\) (lines 2–27). Then, for each target item \(i\in c\), those nodes of graph \(\text {TG\_Best}_{u}^{c} (t)\) which have a rating for item i are selected as the neighbors of user u for item i at time t; this set of selected neighbors is represented by \(Neigh_{u,i}(t)\) (lines 29–35). This whole procedure is repeated for each target cluster.
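To make this two-stage structure concrete, the following Python sketch outlines the control flow of Algorithm 1. It is an illustrative reconstruction rather than the authors' implementation (the paper's code is in Matlab); the helpers `find_best_trust_paths` and `build_subgraph`, passed in as arguments, are hypothetical stand-ins for the per-cluster ant search and sub-graph construction described in the following paragraphs.

```python
def select_neighborhoods(u, target_items, target_clusters, T_u, R_u, S_u,
                         params, find_best_trust_paths, build_subgraph):
    """Illustrative outline of Algorithm 1 (not the authors' Matlab code).

    target_clusters: {cluster_id: set of item ids in that cluster}
    T_u:             {cluster_id: context trust graph ITG^c(t)}
    R_u, S_u:        per-cluster rating / timestamp data; R_u[c] is assumed
                     to be a dict keyed by (user, item)
    params:          dict holding gamma, theta, phi, bpn, alpha, beta
    Returns NS_u = {item: set of neighbors} and BGS_u = {cluster: TG_Best}.
    """
    NS_u, BGS_u = {}, {}
    for c, items_in_c in target_clusters.items():       # one search per target cluster
        itg = T_u[c]                                     # ITG^c(t)
        best_paths = find_best_trust_paths(itg, u, R_u[c], S_u[c], params)  # lines 2-25
        tg_best = build_subgraph(itg, u, best_paths)     # line 26: nodes of TG_Best_u^c(t)
        BGS_u[c] = tg_best                               # line 27: kept for Algorithm 2
        for i in set(target_items) & items_in_c:         # lines 28-35
            NS_u[i] = {v for v in tg_best                # a neighbor must have rated i
                       if (v, i) in R_u[c]}
    return NS_u, BGS_u
```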

Considering a target cluster \(c\in \tilde {{C}}_{u} (t)\), Algorithm 1 starts with the initialization step (lines 2–6). In this step, ITGc(t) is taken as the graph on which the ant colony operates (line 2). In this graph, the weight of edge ab represents the pheromone level on this edge at time t:

$$ \tau_{ab}^{c} (t)=W_{ab}^{c} (t) $$
(7)

where \(\tau _{ab}^{c} (t)\) denotes the pheromone level on edge ab in ITGc(t). As mentioned in Section 3.1.1, at time t=t 0, for each node a∈ITGc(t), \(W_{ab}^{c} (t)\) is equal to the global trust score of user b in the context of cluster c.

In the initialization step, after determining the underlying trust graph, the active user node, i.e., node u, is taken as the ants’ nest (line 3), a heuristic value is assigned to each node (line 4), and γ artificial ants are dispatched from node u (lines 5–6). In order to determine the heuristic value of a node a, we need to compute the rating similarity between user a and active user u. In the context of cluster c, this similarity is computed using the user rating profiles available in sub-matrix R c(t). To be able to adapt to changes in users’ preferences, STARS incorporates temporal information into the similarity computation by weighting each rating with its temporal relevance. For this purpose, it uses a temporal relevance function which assigns a weight to each rating according to its age (time distance) with respect to the current time. In the context of cluster c, the timestamp of each rating observed up to time t is available in sub-matrix S c(t). At the current time slot t, the temporal relevance f u,i (t) of the observed rating r u,i is computed as follows [54]:

$$ f_{u,i} (t)=e^{-\theta (t-t_{u,i} )} $$
(8)

where 𝜃∈[0,1] controls the decaying rate. Under the assumption that older ratings are generally less correlated with the users’ current tastes and interests, the above function decreases the relevance of rating r u,i with the amount of time that has passed since the rating date.

To compute user similarities by incorporating time-based weights, STARS uses a modified cosine similarity measure as follows [54]:

$$\begin{array}{@{}rcl@{}} TWC_{ua}^{c} (t)&=&\frac{\sum\limits_{i\in D\cap E} {(f_{u,i} (t)\cdot r_{u,i} )(f_{a,i} (t)\cdot r_{a,i} )}} {\sqrt {\sum\limits_{i\in D} {(f_{u,i} (t)\cdot r_{u,i} )^{2}} \sum\limits_{i\in E} {(f_{a,i} (t)\cdot r_{a,i} )^{2}}} } ,\,\,\,\,\\D&=&{I_{u}^{c}} (t),\,\,\,E={I_{a}^{c}} (t) \end{array} $$
(9)

where \(TWC_{ua}^{c} (t)\in [0,1]\) represents the time-weighted cosine similarity between users u and a in context c. \(TWC_{ua}^{c} (t)\) is computed using the available data in R c(t) and S c(t). The cosine similarity metric only considers the values of the ratings and disregards the proportion of commonly rated items. Consequently, even when the number of items commonly rated by two users is very small, their similarity value can be misleadingly high. To address this issue, we need to take into account the number of common ratings between two users. For this purpose, we use the user-based Jaccard similarity metric [14] as a weighting factor to adjust \(TWC_{ua}^{c} (t)\). The Jaccard metric measures the similarity as the proportion of the number of common ratings to the total number of items rated by either user up to time t, as given by:

$$\begin{array}{@{}rcl@{}} Jaccard_{ua}^{c} (t)&=&\frac{\left| {D\cap E} \right|}{\left| D \right|+\left| E \right|-\left| {D\cap E} \right|},\,\,\,\,\\ D&=&{I_{u}^{c}} (t),\,\,\,E={I_{a}^{c}} (t) \end{array} $$
(10)

\(Jaccard_{ua}^{c} (t),\) which represents the user-based Jaccard similarity between users u and a in context c, is computed using the available data in R c(t). The combination of \(TWC_{ua}^{c} (t)\) and \(Jaccard_{ua}^{c} (t)\) is used to calculate the similarity \(sim_{ua}^{c} (t)\) between users a and u in context c at time t. So, the heuristic value of node a in ITGc(t), denoted as \({\eta _{a}^{c}} (t)\), is computed as follows:

$$ {\eta_{a}^{c}} (t)=sim_{ua}^{c} (t)=Jaccard_{ua}^{c} (t)\times TWC_{ua}^{c} (t) $$
(11)
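For illustration, the following Python sketch computes the heuristic value of a node by combining the temporal relevance (8), the time-weighted cosine similarity (9), and the Jaccard factor (10) as in (11). It is not the authors' implementation; the per-user profile layout (a dict mapping item ids to rating/timestamp pairs, restricted to cluster c and to ratings observed up to the current time) is an assumption made for the example.

```python
import math

def heuristic_value(profile_u, profile_a, t_now, theta):
    """eta_a^c(t) = Jaccard * time-weighted cosine, following eqs. (8)-(11).

    profile_x: {item_id: (rating, timestamp)} for cluster c, up to t_now.
    """
    D, E = set(profile_u), set(profile_a)
    common = D & E
    if not D or not E:
        return 0.0                       # undefined similarity -> zero heuristic

    def w(profile, i):                   # temporally weighted rating, eq. (8)
        r, ts = profile[i]
        return math.exp(-theta * (t_now - ts)) * r

    num = sum(w(profile_u, i) * w(profile_a, i) for i in common)
    den = math.sqrt(sum(w(profile_u, i) ** 2 for i in D)) * \
          math.sqrt(sum(w(profile_a, i) ** 2 for i in E))
    twc = num / den if den > 0 else 0.0                       # eq. (9)

    jaccard = len(common) / (len(D) + len(E) - len(common))   # eq. (10)
    return jaccard * twc                                      # eq. (11)
```

For example, with θ = 0.3 a rating issued on the current day keeps its full weight, while one issued ten days earlier is damped by a factor of e^(−3) ≈ 0.05.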

After the initialization step, Algorithm 1 continues with the creation of solutions (Step 2). In this step, each ant performs a depth-first search in graph ITGc(t) to select the best neighbors of the active user (lines 7–18). According to parameter φ, which represents the maximum trust propagation distance, the iterative process of solution creation continues until each ant has explored a solution of depth φ. In other words, each ant creates a chain of trust relationships with a maximum size of φ. The constructed solutions are stored in a γ×(φ+1) matrix, namely Solution, where each row contains the solution created by one ant (line 7). More specifically, considering an ant k, Solution k,∗ (i.e., the kth row of matrix Solution) consists of the ordered sequence of nodes selected by ant k. The starting node of each ant (here, node u) is placed in the first cell of each row of matrix Solution (line 8). To construct a solution, STARS combines the trust information with the similarity information in a probabilistic transition rule. Considering an ant k located at node a∈ITGc(t), the following probabilistic transition rule [35] is used for selecting the next movement (line 12):

$$ prob_{ab}^{k} (t)=\left\{ {\begin{array}{l} \frac{(1+\tau_{ab}^{c} (t))^{\alpha} (1+{\eta_{b}^{c}} (t))^{\beta} }{\sum\limits_{f\in F_{k}} {(1+\tau_{af}^{c} (t))^{\alpha} (1+\eta_{f}^{c} (t))^{\beta} }} \,\,\,\,\,\,\,\,\,\,\text{if}\,b\in F_{k} \\ 0\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\text{otherwise} \\ \end{array}} \right. $$
(12)

where \(prob_{ab}^{k} (t)\) represents the probability of selecting node b∈ITGc(t), and F k is the set of nodes that have not yet been visited by ant k. In (12), we use 1+η instead of η for the following reason: when the active user has no rating, all nodes have a heuristic value equal to zero. If we used η in (12), the probability of selecting any node would be zero, and the ants would be unable to choose their next movement; in this case, the system could not provide any recommendation for the active user. Similarly, when the active user has only a few ratings, most nodes have a zero heuristic value, so the number of possible choices for the next movement is significantly reduced, which degrades the recommendation quality; in such cases, a cold user cannot benefit from the opinions of globally trusted users. To prevent these problems, we use 1+η instead of η in (12), and, to keep both terms in the same range, τ is incremented accordingly. In this way, when the similarity between two users cannot be defined, only the trust knowledge memorized in the form of pheromone trails is considered.

After computing the transition probabilities, each ant k chooses the next node to move to (line 14). To implement a probabilistic selection, we use the classical roulette-wheel procedure [60], in which nodes with higher probability have a higher chance of being chosen as the next node. With a simple linear search, the complexity of this selection is O(n), where n is the number of users. After selecting the next node (line 14), ant k stores this new node in one of the cells of the kth row of matrix Solution (line 15) and moves to the selected node (line 16).
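A minimal sketch of the transition rule (12) combined with roulette-wheel selection is shown below. The dict-based arguments (`tau` for the pheromone on the edges leaving the current node, `eta` for the heuristic values of the candidate nodes) are an assumed layout for illustration, not the authors' data structures.

```python
import random

def choose_next_node(tau, eta, unvisited, alpha, beta, rng=random):
    """Pick the next node from `unvisited` according to eq. (12).

    tau[b]: pheromone on the edge from the current node a to node b in ITG^c(t)
    eta[b]: heuristic value of node b (similarity to the active user)
    """
    if not unvisited:
        return None
    # Unnormalized selection weights; using 1+tau and 1+eta keeps nodes with
    # zero similarity (e.g., when the active user is cold) selectable.
    weights = {b: (1.0 + tau[b]) ** alpha * (1.0 + eta[b]) ** beta
               for b in unvisited}
    total = sum(weights.values())

    # Classical roulette-wheel selection with a linear scan, O(n) per choice.
    pick = rng.uniform(0.0, total)
    acc = 0.0
    for b, w in weights.items():
        acc += w
        if pick <= acc:
            return b
    return b  # guard against floating-point round-off at the upper end
```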

The next step of Algorithm 1 is the solution evaluation (lines 19–23) in which each constructed solution is evaluated by computing the corresponding path trust [53]. The path trust of solutions constructed by ants is stored in an array of length γ, namely PT (line 19). As mentioned above, each constructed solution is a sequence of nodes starting from the active user node. For each constructed solution, the path trust is a function with two parameters:

  1. 1-

    the number of co-rated items between every two adjacent nodes x and y, and

  2. 2-

    the trust value issued by node x towards node y.

Paths with a high value of propagated trust are considered the best solutions, since a chain of users with a high path trust value can provide more precise opinions for the active user. Considering solution Solution k,∗ constructed by the kth ant at time t, the path trust is computed as follows (line 21):

$$ ptrust_{k} =\frac{\sum\nolimits_{\text{pair}(x,y)\text{ of adjacent elements in } \text{ \textbf{Solution}}\,_{k,\ast} } {\left({\left| {{I_{x}^{c}} (t)\cap {I_{y}^{c}} (t)} \right|\times \tau_{xy}^{c} (t)} \right)}} {\sum\nolimits_{\text{pair}(x,y)\text{ of adjacent elements in } \text{ \textbf{Solution}}\,_{k,\ast} } {\left({\left| {{I_{x}^{c}} (t)\cap {I_{y}^{c}} (t)} \right|} \right)}} $$
(13)

\(ptrust_{k}\) represents the path trust of ant k’s solution; x and y refer to two adjacent nodes in the constructed solution. The computed path trust of ant k’s solution is stored in PT[k] (line 22).
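Equation (13) is simply a weighted average of the edge trust values along the path, weighted by the number of co-rated items of each adjacent pair. The following sketch (with illustrative argument types, not the authors' code) makes this explicit; as a check, it reproduces the value PT[2] = 0.4164 worked out in Example 1 below.

```python
def path_trust(solution, tau, items_rated):
    """Path trust of one ant's solution, following eq. (13).

    solution:    ordered list of nodes, e.g. ["u10", "u6", "u1"]
    tau:         {(x, y): trust/pheromone value on edge x -> y in ITG^c(t)}
    items_rated: {user: set of items rated in cluster c up to time t}
    """
    num = den = 0.0
    for x, y in zip(solution, solution[1:]):      # adjacent pairs along the path
        co_rated = len(items_rated[x] & items_rated[y])
        num += co_rated * tau[(x, y)]
        den += co_rated
    return num / den if den > 0 else 0.0
```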

The last step of Algorithm 1 is the selection of the best solutions (lines 24–35). In this step, the bpn solutions with the highest path trust values are added to a set, called Best_paths, which stores the best solutions (lines 24–25). Here, we use the median-of-medians algorithm [60], which selects the m largest elements of a list with linear time complexity in the worst case. Afterward, Algorithm 1 must identify the best neighbors of user u for each target item belonging to the current cluster. For this purpose, it first creates a sub-graph from the best paths in ITGc(t) starting from node u, according to the selected solutions in Best_paths (line 26). Since this graph, called \(\text {TG\_Best}_{u}^{c} (t)\), is used as a key input to our ant-inspired trust updating algorithm (Section 3.1.3), it is saved to an output set, called \(BGS_{u}(t)\) (line 27), for later use. In the next step, for each target item \(i\in c\), the neighbors \(Neigh_{u,i}(t)\) of user u are determined as follows: each node of graph \(\text {TG\_Best}_{u}^{c} (t)\) which has a rating for item i is selected as a neighbor of user u for item i at time t (lines 29–31). All identified neighbors for the different target items are stored in an output set, called \(NS_{u}(t)\) (line 34). Finally, \(NS_{u}(t)\) and \(BGS_{u}(t)\) are returned as outputs (line 37).

After the execution of Algorithm 1 in the online mode, the pheromone values must be updated according to the selected neighbors. More specifically, the pheromone update process occurs after the neighborhood formation process at each time step. Using this process, at each time step, the pheromone value associated with the best neighbors of the active user in a specific context is increased, while this value for the other users is decreased. The update process, which is performed offline, is detailed in Algorithm 2 of the following section.

In the following, a simple example is given to illustrate how Algorithm 1 works.

Example 1

Suppose that there are 13 items (i 1 to i 13, m=13) and 10 users (u 1 to u 10, n=10). Suppose that t 0=20 (i.e., 20 days after the first rating). A sample user-item rating matrix at time t 0=20, R 10×13(t 0), is depicted in Table 1. The ratings are integers ranging from 1 to 5. We use the symbol “?” to indicate that a user has not yet rated an item. A sample timestamp matrix of the observed ratings, S 10×13(t 0), is also depicted in Table 2. Time is measured at day granularity, and timestamps range from 1 (the first known rating time) to 20 (the current time). We assume that the item clusters obtained from the semantic similarities are c 1={i 1,i 2}, c 2={i 3,i 4,i 5,i 6,i 7}, c 3={i 8,i 9,i 10,i 11} and c 4={i 12,i 13}. So, there are four rating sub-matrices at time t=t 0: \(\mathbf{R}_{10\times 2}^{c_{1}} (t_{0} )\), \(\mathbf{R}_{10\times 5}^{c_{2}} (t_{0} )\), \(\mathbf{R}_{10\times 4}^{c_{3}} (t_{0} )\) and \(\mathbf{R}_{10\times 2}^{c_{4}} (t_{0} )\). Also, we have four timestamp sub-matrices at time t=t 0: \(\mathbf{S}_{10\times 2}^{c_{1}} (t_{0} )\), \(\mathbf{S}_{10\times 5}^{c_{2}} (t_{0} )\), \(\mathbf{S}_{10\times 4}^{c_{3}} (t_{0} )\) and \(\mathbf{S}_{10\times 2}^{c_{4}} (t_{0})\).

Table 1 The user–item rating matrix at time t=t 0
Table 2 The timestamp matrix at time t=t 0

Based on the available ratings in a rating sub-matrix, STARS computes the global trust score of each user in the related context using (5). As shown in Table 3, users u 4, u 7, u 8 and u 5 are the most globally trusted users in the context of clusters c 1, c 2, c 3 and c 4, respectively. After that, STARS creates four initial ITGs, namely ITGc 1(t 0),\(\text {ITG}^{c_{2}} (t_{0} ), \text {ITG}^{c_{3}} (t_{0})\) and ITGc 4(t 0). Each ITG is a fully connected directed graph that consists of 10 nodes and 90 (i.e., 10×(10−1)) edges. Based on the calculated trust scores (Table 3), the initial weight of each edge in an ITG is determined using (6). As mentioned before, in the initial ITGs, the estimated trust in a certain user is the same for every user. As the time passes and users provide more information about their preferences, STARS locally adjusts the weight of corresponding links in the ITGs.

Table 3 The initial global trust score of each user according to different item clusters

Now, we assume that user u 10 is the active user. Based on Table 1, target items and target clusters for this user at t=t 0 are \(\tilde {{I}}_{u_{10}} (t_{0} )=\{i_{1} ,i_{2} ,i_{3} ,i_{6} ,i_{8} \}\)and \(\tilde {{C}}_{u_{10}} (t_{0} )=\{c_{1} ,c_{2} ,c_{3} \}\), respectively. In the context of cluster c 1, user u 10 is an extreme cold user since she has not provided any rating. The context-related data for this user at time t=t 0 consists of \(T_{u_{10}} (t_{0} )=\{\text {ITG}^{c_{1}} (t_{0} ),\,\,\text {ITG}^{c_{2}} (t_{0} ),\,\,\text {ITG}^{c_{3}} (t_{0} )\}\), \(R_{u_{10}} (t_{0} )=\{\text {\textbf {R}}^{c_{1}} (t_{0} ),\,\,\text {\textbf { R}}^{c_{2}} (t_{0} ),\,\,\text {\textbf {R}}^{c_{3}} (t_{0} )\}\), and \(S_{u_{10}} (t_{0} )=\{\text {\textbf {S}}^{c_{1}} (t_{0} ),\,\,\text {\textbf {S}}^{c_{2}} (t_{0} ),\,\,\text {\textbf {S}}^{c_{3}} (t_{0} )\}\). We apply Algorithm 1 in order to determine the best neighbors of user u 10 for each target item at time t=t 0.

Let γ=10, 𝜃=0.3, φ=2, bpn=6, α=2, and β=1. In the initialization step (lines 2–6), Algorithm 1 determines the underlying trust graph on which the ACO algorithm is applied, and initializes the heuristic values using (11). The result is shown in Table 4 as the similarity between u 10 and the other users according to the different target clusters.

Table 4 Similarity between active user u 10 and another user a in different contexts

In Step 2 of Algorithm 1, each ant selects its next hop using (12) and constructs a trust path of length φ=2 (lines 7–18). Table 5 shows the probabilistic solutions constructed by the ants in each trust graph. To evaluate each solution (lines 19–23), Algorithm 1 computes its path trust using (13); the results are given in Table 6. As an example related to graph ITGc 2(t 0), consider the solution Solution 2,∗=[u 10,u 6,u 1] constructed by the second ant. Recall that the kth row of matrix Solution contains the solution constructed by ant k. The path trust of Solution 2,∗ is computed as follows: PT[2]=(0.3818×2+0.4510×2)/(2+2)=0.4164. In the next step, the bpn = 6 solutions with the highest path trust values are added to set Best_paths (line 25), as shown in Table 7.

Table 5 The constructed probabilistic solutions for active user u 10 in each trust graph
Table 6 The path trusts of constructed solutions for active user u 10 in each trust graph
Table 7 The best trust paths starting from node u 10 in each trust graph

For instance, the results presented in Table 6 show that in the context of cluster c 2, the solutions constructed by ants 1, 3, 4, 7, 9 and 10 have the highest path trust values, so they are selected as the best solutions in this context. According to the selected solutions in Best_paths, Algorithm 1 creates the sub-graphs consisting of the best trust paths that start from node u 10 in each trust graph (line 26). For example, \(\text {TG\_Best}_{u_{10}}^{c_{2}} (t_{0} )\) contains the best trust paths constructed by ants 1, 3, 4, 7, 9 and 10. The graphs resulting from this step are shown in Fig. 2.

Fig. 2 Sub-graphs consisting of the best trust paths that start from node u 10 in each context-related trust graph

Finally, based on the selected paths, the best neighbors of active user u 10 for different target items at time t=t 0 are determined; these neighbors are given in Table 8. For example, consider target item i 1c 1. Among the nodes of graph \(\text {TG\_Best}_{u_{10}}^{c_{1}} (t_{0} )\), nodes u 2, u 4 and u 8 have a rating for item i 1 (refer to Table 1), and thus, they are selected as the neighbors of user u 10 for item i 1 at time t=t 0.

Table 8 The best neighbors of user u 10 for each target item \(i\in \tilde {{I}}_{u_{10}}~\)at time t=t 0

In this example, the outputs of Algorithm 1 are as follows:

$$1)\,\,NS_{u_{10}} (t_{0} )=\{Neigh_{u_{10} ,i} (t_{0} )\vert i\in \tilde{{I}}_{u_{10}} (t_{0} )\} $$
$$2)\,\,BGS_{u_{10}} (t_{0} )=\{\text{TG\_Best}_{u_{10}}^{c} (t_{0} )\vert c\in \tilde{{C}}_{u_{10}} (t_{0} )\} $$

Generating recommendations

After neighborhood formation, STARS predicts the ratings of all items that have not been rated by the active user u up to time t. The predicted rating of active user u on target item \(i\in \tilde {{I}}_{u} (t)\) is calculated as the weighted average of deviations from the neighbors’ mean ratings. Assume that target item i belongs to cluster c. Based on the available ratings in R c(t), the predicted rating of active user u for target item i, \(\hat {{r}}_{u,i} \), is computed as follows:

$$ \hat{{r}}_{u,i} =\overline {r_{u}} +\frac{\sum\limits_{v\in Neigh_{u,i} (t)} {(r_{v,i} -\overline {r_{v}} )\times IW_{uv}^{c} (t)}} {\sum\limits_{v\in Neigh_{u,i} (t)} {IW_{uv}^{c} (t)}} $$
(14)

where \(\overline {r_{u}} \) and \(\overline {r_{v}}\) are the mean ratings of active user u and neighbor v for items belonging to cluster c, respectively; \(IW_{uv}^{c} (t)\) denotes the importance weight of user v’s ratings relative to user u’s ratings in context c at time t. The importance weight consists of two parts: the trust value \(\tau _{uv}^{c} (t)\) that node u holds about node v in the context of cluster c at time t, and the similarity \(sim_{uv}^{c} (t)\) between users u and v in context c at time t. \(IW_{uv}^{c} (t)\) is computed using the harmonic mean to combine the trust and similarity values as follows:

$$ IW_{uv}^{c} (t)=\frac{2\times (1+\tau_{uv}^{c} (t))\times (1+sim_{uv}^{c} (t))}{\tau_{uv}^{c} (t)+sim_{uv}^{c} (t)+2} $$
(15)

When an active user has no rating, the similarity between her and all other users is equal to zero. Therefore, we use 1+sim instead of sim in (15), and τ is incremented accordingly so that both terms lie in the same range. Because the harmonic mean is dominated by the smaller of its two inputs, a high importance weight can only be obtained when both the similarity and the trust value are high.
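The prediction step in (14)–(15) can be sketched as follows; the dict-based arguments are an illustrative layout, not the authors' implementation.

```python
def predict_rating(u_mean, neighbors, ratings, means, tau, sim):
    """Predicted rating of the active user for one target item, eqs. (14)-(15).

    neighbors: the set Neigh_{u,i}(t)
    ratings:   {v: r_{v,i}}         means: {v: mean rating of v in cluster c}
    tau:       {v: tau_uv^c(t)}     sim:   {v: sim_uv^c(t)}
    """
    num = den = 0.0
    for v in neighbors:
        # Importance weight: harmonic mean of (1 + trust) and (1 + similarity), eq. (15)
        iw = 2.0 * (1.0 + tau[v]) * (1.0 + sim[v]) / (tau[v] + sim[v] + 2.0)
        num += (ratings[v] - means[v]) * iw
        den += iw
    if den == 0.0:                 # no usable neighbors: fall back to the user's mean
        return u_mean
    return u_mean + num / den      # eq. (14)
```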

Finally, STARS ranks the predicted ratings and recommends those N items that have the highest predicted rating.

3.1.3 Trust network updater module

The aim of this module is to update each trust graph ITGc(t)∈T u (t) such that the pheromone value associated with the best neighbors of user u in context c is increased, while this value for the other users is decreased through pheromone evaporation.

For this purpose, STARS uses a novel ant-inspired trust updating algorithm. In this algorithm, the amount of pheromone deposited on the edge connecting active user u and a trustworthy neighbor v depends on: (1) the inferred trust value from user u to user v through the trust paths selected in Algorithm 1, and (2) the amount of confidence in user u’s opinion about user v. Confidence is directly related to the number of co-rated items between two users. The detailed steps of this algorithm are shown in Algorithm 2.

Algorithm 2 (pseudocode figure)

In Algorithm 2, the set of users U, the set of target items \(\tilde {{I}}_{u} (t)\), the corresponding target clusters \(\tilde {{C}}_{u} (t)\), the contextual trust data T u (t) and the rating data R u (t) for active user u, the maximum trust propagation distance (φ), the pheromone decay value (ρ), as well as the outputs of Algorithm 1, are taken as inputs. Parameter ρ regulates the pheromone evaporation between times t and t+1 to avoid unlimited accumulation of pheromone. The goal of this algorithm is to update each trust graph ITGc(t)∈T u (t) and store the corresponding updated graph, ITGc(t+1), in the database. Considering ITGc(t), in lines 2–8 this algorithm decays the trust intensity on all edges connecting active user u to other users at the evaporation rate ρ (refer to (16) and (21)). In addition, the trust intensity is increased (refer to (16)) by a small quantity Δτ c(t) on the edges connecting active user u to users who have been selected as her best neighbors for at least one item \(i\in c\).

Considering a target cluster c, for each user v in the user set U, if user v is a best neighbor of active user u for at least one item \(i\in c\) (line 3), then the pheromone laid on edge uv of ITGc(t) is updated as follows (line 4):

$$ \tau_{uv}^{c} (t+1)=W_{uv}^{c} (t+1)=(1-\rho )\times \tau_{uv}^{c} (t)+{\Delta} \tau_{uv}^{c} (t) $$
(16)

where \(\tau _{uv}^{c} (t+1)\) represents the pheromone level on edge uv at time t+1, which is equal to the weight \(W_{uv}^{c} (t+1)\) of this edge at time t+1; similarly, \(\tau _{uv}^{c} (t)\) represents the pheromone level at time t. \({\Delta } \tau _{uv}^{c} (t)\) is the amount of pheromone deposited on edge uv of ITGc(t) at time t. It is computed using sub-graph \(\text {TG\_Best}_{u}^{c} (t)\in BGS_{u} (t)\), which contains the best trust paths in ITGc(t) that start from node u. In order to compute \({\Delta } \tau _{uv}^{c} (t)\), Algorithm 2 must calculate the following parameters using \(\text {TG\_Best}_{u}^{c} (t)\):

  1. 1)

    \(Inf\_Trust_{uv}^{c} (t)\): the inferred trust value between users u and v through the existing trust paths in \(\text {TG\_Best}_{u}^{c} (t)\). Let P={p 1,p 2,...,p ψ }, ψ≥1, be the set of paths between nodes u and v in graph \(\text {TG\_Best}_{u}^{c} (t)\), where p j =[a 0,a 1,...,a q−1,a q ], p j ∈P, is a path of length 1≤q≤φ from a 0=u to a q =v. We note that for each path p j ∈P, there exists an ant k such that p j is a sub-path of the solution Solution k,∗ traversed by the kth ant. Then, \(Inf\_Trust_{uv}^{c} (t)\) is recursively computed as follows:

    $$ Inf\_Trust_{a_{0} a_{q}}^{c} (t)=\left\{ {\begin{array}{l} \tau_{a_{0} a_{q}}^{c} (t)\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\text{if there exists a path}\text{ } p_{j} \in P\text{ } \text{such that } a_{0} \text{ is adjacent to } a_{q} \\ \frac{\sum\limits_{p_{j} \in P} {Inf\_Trust_{a_{0} a_{q-1}}^{c} (t)\times \left({\tau_{a_{q-1} a_{q}} ^{c} (t)\times \delta} \right)}} {\sum\limits_{p_{j} \in P} {\tau_{a_{0} a_{1}}^{c} (t)}} \,,\delta =\frac{\varphi -d_{a_{0} a_{q}} +1}{\varphi} \,\,\,\,\,\,\,\,\text{otherwise} \\ \end{array}} \right. $$
    (17)

    where δ is a weighting parameter, and \(d_{a_{0} a_{q}} \in [2,\varphi ]\) represents the trust propagation distance from source node a 0 to destination node a q . The value of \(Inf\_Trust_{uv}^{c} (t)\) varies within the range (0,1]. It does not take zero value since the probability of selecting edges with no pheromone is equal to zero (see (12)). As shown in (17), when there is a direct trust link between users u and v in \(\text {TG\_Best}_{u}^{c} (t)\), the value of \(Inf\_Trust_{uv}^{c} (t)\) is equal to the weight (or pheromone) of that link. Otherwise, trust propagation is needed to compute the indirect trust value between users u and v. Given path p j =[a 0,a 1,...,a q−1,a q ], the indirect trust value between two nodes a 0 and a q depends on: (1) the direct trust value between two adjacent nodes a q−1 and a q ; and (2) the propagated trust value between nodes a 0 and a q−1, which is calculated recursively through intermediate nodes a 1 to a q−2. It is possible that there are multiple paths between two users. In such cases, a trust aggregation method is needed to combine the different trust beliefs that user u has received about v. To compute the propagated implicit trust between users u and v based on the available trust paths in P, Algorithm 2 uses a weighted mean aggregation method due to its robust performance [14]. We adopt the method presented by Shambour and Lu [14] which estimates a trust value for a particular user on distance d>1 based on the estimated trust values for users on distance d−1. The weighting parameter δ is used in order to ensure that trust decreases along the propagation. δ will ensure that trust scores from the closer trusted neighbors will have more influence on the trust propagation process.

  2. 2)

    \(Conf_{uv}^{c} (t)\): the amount of confidence in the user u’s opinion about user v in the context of cluster c at time t. This parameter expresses the reliability of the association between users u and v in \(\text {TG\_Best}_{u}^{c} (t)\). To compute \(Conf_{uv}^{c} (t)\), we adopt the model proposed by Papagelis et al. [26]. In this model, confidence is directly related to the number of co-rated items between two users. In other words, the more items two users have co-rated, the higher the degree of confidence their association would have. In order to compute the confidence of all direct associations of a user, this model first identifies the most confident direct association for this user. Then, confidence values of the remaining direct associations are calculated with respect to the identified most confident association.

    At time t, STARS computes the confidence between two users in the context of cluster c according to the available ratings in R c(t). Based on the existing trust paths in P (recall that P is the set of paths between two nodes u and v in graph \(\text {TG\_Best}_{u}^{c} (t))\), \(Conf_{uv}^{c} (t)\in [0,1]\) is computed as follows:

    $$ Conf_{a_{0} a_{q}}^{c} (t)=\left\{ {\begin{array}{l} \frac{\left| {I_{a_{0}}^{c} (t)\cap I_{a_{q}}^{c} (t)} \right|}{\left| {I_{a_{0}}^{c} (t)\cap I_{_{Max\_Conf\_Node}}^{c} (t)} \right|}\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\text{if there exists a path}\text{ } p_{j} \in P\text{ } \text{such that}~ a_{0} \text{ is adjacent to } a_{q} \\ \frac{\sum\limits_{p_{j} \in P} {Conf_{a_{0} a_{1}}^{c} (t)\times Conf_{a_{1} a_{q}}^{c} (t)}} {\left| P \right|}\,\,\,\,\,\,\,\,\,\,\,\,\,\,\mathrm{o}\text{therwise} \\ \end{array}} \right. $$
    (18)

    where Max_Conf_Node denotes the most confident association in the trust network of source user a 0; that is, among the direct associations of source node a 0 in graph \(\text {TG\_Best}_{u}^{c} (t)\), Max_Conf_Node is the node that has the largest number of co-rated items with a 0. As shown in (18), when there is a direct trust link between two users in \(\text {TG\_Best}_{u}^{c} (t)\), the confidence value is calculated directly from the number of co-rated items between the two users. Otherwise, it is calculated through a set of intermediate nodes [26].

    After computing \(Inf\_Trust_{uv}^{c} (t)\) and \(Conf_{uv}^{c} (t)\), \({\Delta } \tau _{uv}^{c} (t)\) is calculated using the harmonic mean to integrate these two parameters. This is because high values of \({\Delta } \tau _{uv}^{c} (t)\) can only be obtained when both confidence and trust values are high. The following formula is used to compute \({\Delta } \tau _{uv}^{c} (t)\in (0,1]\):

    $$ {\Delta} \tau_{uv}^{c} (t)=\frac{1}{2}\,H\left([1+Inf\_Trust_{uv}^{c} (t)],\,[1+Conf_{uv}^{c} (t)]\right)\times \lambda_{uv}^{c} (t) $$
    (19)

    where H denotes the harmonic mean of its arguments. In addition to the confidence and trust parameters, \({\Delta } \tau _{uv}^{c} (t)\) is directly related to the parameter \(\lambda _{uv}^{c} (t)\), which reflects how often user v has been selected as a neighbor of user u in context c at time t. This allows more pheromone to be deposited on the edges leading to those neighbors who are selected more frequently. According to the available ratings in \(\mathbf{R}^{c}(t)\), \(\lambda _{uv}^{c} (t)\) is calculated as follows:

    $$ \lambda_{uv}^{c} (t)=\frac{\left| {{I_{v}^{c}} (t)\cap {I}^{\prime c}_{u} (t)}\right|}{\left| {{I}^{\prime c}_{u} (t)}\right|} $$
    (20)

    where \({I}^{\prime c}_{u}(t)\) refers to the set of items in cluster c that have not been rated by user u until time t.

    As shown in (19), \(Conf_{uv}^{c} (t)\) and \(Inf\_Trust_{uv}^{c} (t)\) are incremented by one. The reason is that \({\Delta } \tau _{uv}^{c} (t)\) must be greater than zero for the best neighbors, whereas the value of \(Conf_{uv}^{c} (t)\) may be equal to zero. Since STARS is initialized with global trust values, it is possible that user u trusts user v even if they have no co-rated items. In this case, if we used \(Conf_{uv}^{c} (t)\) in (19), then \({\Delta } \tau _{uv}^{c} (t)\) would be equal to zero, and therefore only pheromone decay would occur on edge uv. To prevent this, \(Conf_{uv}^{c} (t)\) is incremented by one. To keep both terms in the same range, \(Inf\_Trust_{uv}^{c} (t)\) is also incremented accordingly. This allows more pheromone to be deposited for trusted neighbors who can be recognized as more confident than others. By incrementing the arguments of H, the maximum value of \({\Delta } \tau _{uv}^{c} (t)\) would become equal to 2, which may cause the algorithm to over-reinforce the initially selected solutions and ignore the others. To prevent this, we divide the harmonic mean by 2. Therefore, \({\Delta } \tau _{uv}^{c} (t)\) is a small quantity whose maximum value is equal to 1.

    As mentioned before, if user v has appeared at least once as a neighbor of active user u, then the corresponding pheromone value will be updated using (16); otherwise, it will be updated using (21). According to (21), only pheromone decay occurs for such users:

    $$ \tau_{uv}^{c} (t+1)=W_{uv}^{c} (t+1)=(1-\rho )\times \tau_{uv}^{c} (t) $$
    (21)
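Taken together, (16) and (19)–(21) define a per-edge update that can be sketched as follows. This is an illustrative Python fragment, not the authors' implementation; it assumes \(Inf\_Trust_{uv}^{c}(t)\) and \(Conf_{uv}^{c}(t)\) have already been computed from \(\text{TG\_Best}_{u}^{c}(t)\) via (17) and (18).

```python
def update_edge_pheromone(tau_uv, is_best_neighbor, rho,
                          inf_trust=0.0, conf=0.0, lam=0.0):
    """One-edge pheromone update, taking ITG^c(t) to ITG^c(t+1).

    tau_uv:           current pheromone tau_uv^c(t)
    is_best_neighbor: True if v was selected as a neighbor of u for at least
                      one target item in cluster c
    rho:              evaporation rate
    inf_trust, conf:  Inf_Trust_uv^c(t) and Conf_uv^c(t), from eqs. (17)-(18)
    lam:              lambda_uv^c(t), the fraction defined in eq. (20)
    """
    if not is_best_neighbor:
        return (1.0 - rho) * tau_uv                     # eq. (21): evaporation only

    # eq. (19): half the harmonic mean of (1 + Inf_Trust) and (1 + Conf),
    # scaled by lambda; the deposited amount therefore never exceeds 1.
    a, b = 1.0 + inf_trust, 1.0 + conf
    delta = 0.5 * (2.0 * a * b / (a + b)) * lam
    return (1.0 - rho) * tau_uv + delta                 # eq. (16): evaporation + deposit
```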

    In the following, a simple example illustrates how Algorithm 2 can be used to update trust graphs.

Example 2

From our previous example, the output sets \(NS_{u_{10}} (t_{0} )\) and \(BGS_{u_{10}} (t_{0} )\) were obtained. Now, we apply Algorithm 2 in order to update the weight of each outgoing link from node u 10 in ITGc(t 0), where \(c\in \tilde {{C}}_{u_{10}} (t_{0} )\). Let us consider the updating process for the trust graph corresponding to target cluster c 2. According to \(NS_{u_{10}} (t_{0} )\), users u 1, u 2, u 4, u 6, u 7 and u 9 have been selected as the best neighbors of user u 10 with respect to those items belonging to cluster c 2 (refer to Table 8). Table 9 shows the best trust paths between user u 10 and her neighbors according to graph \(\text {TG\_Best}_{u_{10}}^{c_{2}} (t_{0} )\). According to these paths, Algorithm 2 applies (16) to update the weight of each outgoing link from node u 10 to a neighbor node v in ITGc 2(t 0) (line 4). For this purpose, it computes the inferred trust (refer to (17)) and confidence value (refer to (18)) between user u 10 and the mentioned neighbors, as shown in Tables 10 and 11, respectively. Suppose that the evaporation rate ρ is equal to 0.1. On the basis of Tables 10 and 11, Algorithm 2 applies (19) to calculate \({\Delta } \tau _{u_{10} v}^{c_{2}} (t_{0} )\); the results are given in Table 12. Now, using (16), the pheromone value associated with the selected neighbors is increased. For those users who have not been selected as a neighbor, Algorithm 2 updates their corresponding pheromone value using (21). At last, the obtained ITGc 2(t 0+1) (refer to Table 13) is stored in the database for later use.

Table 9 The best trust paths between user u 10 and her neighbors according to graph \(\text {TG\_Best}_{u_{10}}^{c_{2}} (t_{0} )\)
Table 10 The inferred trust value between user u 10 and her best neighbors in the context of cluster c 2 at time t=t 0
Table 11 The confidence value between user u 10 and her best neighbors in the context of cluster c 2 at time t=t 0
Table 12 Computation of \({\Delta } \tau _{u_{10} v}^{c_{2}} (t_{0} )\)
Table 13 Pheromone on the edges linking user u 10 with other users on ITGc 2(t 0+1)

3.2 Computational complexity analysis

In this section, we analyze the computational complexity of the online phase of STARS. The preprocessor and the trust network updater modules work in the offline mode, while the recommender module is the online component of STARS. The computational complexity of the recommender module, which is responsible for generating a list of top-N recommendations for users, is the combination of the computational complexities of its components. Considering an active user, first, O(m) is required to retrieve the context-specific data for m items. Then, as will be detailed in the next paragraph, O(m n+2z φ γ n+m γ φ) is the upper bound on the complexity required to select the best neighborhood using Algorithm 1. Finally, the time required to produce a list of recommendations for the active user is O(m N), because we need to predict and rank all unrated items and recommend those N items that have the highest predicted ratings. We note that for a typical top-N recommender system, N is small and usually takes values between 10 and 50. Therefore, the overall computational complexity of this module becomes O(m)+O(m n+2z φ γ n+m γ φ)+O(m N)≈O(m n+2z φ γ n+m γ φ). Hence, in the online phase, the computational effort is mainly spent by Algorithm 1 on selecting the best neighborhood. In the following, the computational complexity analysis of this algorithm is detailed.

Considering an active user, the upper-bound complexity of Algorithm 1 can be analyzed in four steps. First, O(m n) is required for the initialization step: the time required to compute the similarity between the active user and the other users in a specific context c is \(\mathrm {O}(\left | c \right |n)\), so O(m n) is required for computing the similarities between the profile of the active user and the profiles of other users across the different contexts. In the second step, the main computation is the process of constructing solutions in the different contexts (i.e., lines 9–18); the upper bound on the complexity of this step is O(2z φ γ n). In the third step (lines 20–23), O(z γ φ) is required to evaluate the solutions constructed in the different contexts. Finally, in the last step, the most time-consuming part is selecting the best neighbors of the active user for each target item (i.e., lines 28–35). Specifically, the internal loop (lines 29–33) is repeated up to γ φ times, since the maximum number of nodes in graph \(\text {TG\_Best}_{u}^{c} (t)\) is γ φ (each ant constructs a path of length φ). In a specific context c, the upper-bound time complexity of the last step is \(\mathrm {O}(\left | c \right |\gamma \varphi )\). Thus, O(m γ φ) is required for selecting the best neighbors of the active user in the different contexts.

To sum up, the overall complexity of Algorithm 1 is O(m n+2z φ γ n+m γ φ). However, the actual complexity is smaller because the algorithm iterates over the target clusters instead of all clusters. In addition, it should be noted that z is a small constant factor: a large number of clusters results in fewer items within each cluster, and when the clusters are too small, there is not enough information to infer users’ preferences in each context, so the system becomes unable to accurately predict the implicit trust values due to data sparsity (refer to Section 4.1). φ is also a small constant factor, since longer propagation levels result in less accurate trust predictions (refer to Section 4.1). Therefore, if γ≪n, the complexity of Algorithm 1 is O(m n+n+m)≈O(m n). In the worst case, when γ=n, the upper-bound complexity of Algorithm 1 is O(m n+n 2+m n)≈O(m n+n 2). Hence, the number of ants poses a tradeoff: increasing γ is expected to increase the accuracy of predictions, but it decreases the efficiency (in terms of execution time) of the algorithm. The proper value of γ is selected by performing sensitivity analysis (see Section 4.1).

In summary, with m=O(n), and similar to classical user-based CF, STARS requires a time complexity of O(n 2) to generate a list of recommendations for an active user at each time step of the online phase. Bearing in mind that STARS considers both dynamic trust and similarity information to adapt itself to dynamically changing user interests, and noting that there is always a trade-off between higher accuracy and better runtime, this time complexity seems reasonable. Our experimental results support this claim. As will be shown in the following sections, STARS significantly improves the quality of recommendations and mitigates the data sparsity, cold-start and MIMC issues, while its runtime is only slightly higher than that of its counterparts.

4 Experimental results and discussions

In this section, we compare the performance of STARS against some baseline CF algorithms, including UCF [43] and ICF [61]. In recent years, many researchers have exploited these algorithms as benchmarks to evaluate their proposed recommendation approaches. To further evaluate the performance of STARS, we also compare its results with those of TARS [36] and TSF [14], which are the closest approaches to ours. In each baseline algorithm, after predicting the ratings for the test user, we produce a top-N recommendation list composed of those N items having the highest predicted ratings. All approaches are implemented in Matlab and their parameters are tuned to achieve the best results. In particular, to calculate semantic similarities in STARS, we use the OWL API, a high-level Java Application Programming Interface (API) for working with OWL ontologies, which is available as open source under the LGPL license [62]. We perform our experiments on a 2.53 GHz Intel Core i5 processor with 4.00 GB of RAM, running Windows 7.

We perform experiments with two real-life data sets: 1) the MovieLens 100K data set, which consists of 100,000 ratings from 943 users on 1,682 movies, denoted as ML-100K, and 2) the MovieLens 1M data set, with about 1 million ratings for 3,952 movies by 6,040 users, denoted as ML-1M. In both data sets, ratings vary from 1 to 5. In our experiments, the empty cells (i.e., unrated items) in the rating matrix are filled with zeros. The sparsity level of the ML-100K and ML-1M data sets is about 0.937 and 0.958, respectively. To incorporate temporal information in our experiments, we use the timestamps of the ratings, which are represented in seconds in both data sets. For ML-100K and ML-1M, ratings have been issued over periods of 215 days and 1039 days, respectively. We round the timestamp of each rating to the corresponding day number, indexed from 1 to 215 for ML-100K and from 1 to 1039 for ML-1M. In the first data set, starting at time t 0=60 (i.e., 60 days after the first rating), the system is updated every μ=23 days. For ML-1M, the starting time is set to t 0=110 and the system is updated every μ=49 days. Therefore, these data sets allow for 7 and 19 temporal updates, respectively.

At each time t, based on all ratings made prior to t (i.e., training data), we aim to predict the ratings of a test user for target items rated by her before the next update. A top-N ranked list of items is recommended to the test user at time t. Time t is then incremented by μ and the process is repeated. Therefore, at each time step, ratings issued in the previous time step are incorporated into the training set. This setup is consistent with the actual operation of a recommender system, in which the training set is incrementally augmented with ratings in the order that users issue them.
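This rolling protocol can be sketched as a simple generator over time slots; the (user, item, rating, day) tuple layout is an assumption made for the example, not the format used in the paper.

```python
def time_slots(ratings, t0, mu, t_end):
    """Yield (t, train, test) splits for the rolling evaluation protocol.

    ratings: list of (user, item, rating, day) tuples; the training data are all
    ratings issued before t, and the test data those issued in [t, t + mu).
    """
    t = t0
    while t < t_end:
        train = [r for r in ratings if r[3] < t]
        test = [r for r in ratings if t <= r[3] < t + mu]
        yield t, train, test
        t += mu  # ratings of the finished slot join the training set at the next update
```

With t0 = 60, μ = 23 and a 215-day span this yields the 7 updates reported for ML-100K, and with t0 = 110, μ = 49 and 1039 days the 19 updates for ML-1M.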

To measure the prediction accuracy of the proposed approach in comparison with others, we employ the most well-known related metric, i.e., the standard Mean Absolute Error (MAE) [63]. For each test user u, MAE measures the average absolute deviation between her actual rating r u,i and the predicted rating \(\hat {{r}}_{u,i} \):

$$ \text{MAE}=\frac{\sum\nolimits_{u,i} {\left| {r_{u,i} -\hat{{r}}_{u,i}} \right|}} {\left| B \right|} $$
(22)

where B denotes the set of ratings to be predicted before the next update. Lower MAE values indicate better prediction accuracy.
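A direct transcription of (22) is given below; the `pairs` argument is assumed to hold the actual and predicted ratings of the (non-empty) set B.

```python
def mean_absolute_error(pairs):
    """MAE over the set B of (actual, predicted) rating pairs, eq. (22)."""
    return sum(abs(r - r_hat) for r, r_hat in pairs) / len(pairs)
```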

In order to evaluate the recommendation quality, we adopt the testing methodology presented in [64]. Let Y be the set of relevant items for test users at time t (i.e., all items that have been rated 5-stars by test users between time t and t+μ). As mentioned before, the training set TR contains all ratings made prior to t. According to the adopted methodology [64], we first train the system over the ratings in TR. Then, for each target item i rated 5-stars by user u in Y, we randomly select 1000 additional items unrated by user u before time t+μ. Then, the ratings for the test item i and for the additional items are predicted and a top-N ranked list of items is recommended. If the target item i is among the N recommended items, then we have a hit. The recommendation quality is measured in terms of the recall measure. For any single test case, recall can take either value 0 (in case of miss) or 1 (in case of hit). The overall recall is computed by averaging over all test cases [64]:

$$ \text{recall}=\frac{\# hits}{\left| Y \right|} $$
(23)

where #hits is the number of test cases in which the algorithm successfully recommends the test item. A drawback of recall, which is also called Hit-Rate (HR) [44], is that all hits are weighted equally regardless of where they occur in the top-N list. This limitation is addressed by the Average Reciprocal Hit-Rank (ARHR) measure [44], which is defined as:

$$ ARHR=\frac{1}{\left| Y \right|}\sum\limits_{h=1}^{\# hits} {\frac{1}{pos_{h}}} $$
(24)

where \(pos_{h}\in[1,N]\) is the position of the test item in the ranked recommendation list for the h-th hit. Therefore, in our experiments, we measure the quality of recommendations by examining the number of hits and their positions within the top-N lists generated by the algorithms under comparison. Higher HR and ARHR values indicate better qualitative performance.
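For illustration, the two measures in (23)–(24) can be computed from the per-test-case top-N lists as follows; the list-based layout is assumed for the example and is not the authors' evaluation code.

```python
def hit_rate_and_arhr(test_cases, top_n_lists):
    """Recall (hit-rate) and ARHR over all test cases, eqs. (23)-(24).

    test_cases:  list of target items, one per test case (the set Y)
    top_n_lists: list of ranked top-N recommendation lists, aligned with test_cases
    """
    hits, rank_sum = 0, 0.0
    for item, ranked in zip(test_cases, top_n_lists):
        if item in ranked:
            hits += 1
            rank_sum += 1.0 / (ranked.index(item) + 1)   # pos_h in [1, N]
    n = len(test_cases)
    return hits / n, rank_sum / n
```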

The following parameters have been used at the different stages in our proposed approach: the number of clusters (z, default value 10), the number of ants (γ, default value 100), the relative weight of pheromone trail (α, default value 2), the relative weight of heuristic value (β, default value 1), the maximum trust propagation distance (φ, default value 2), the maximal number of the best trust paths (bpn, default value 50), the pheromone decay value (ρ, default value 0.1), and the time decay weight (𝜃, default value 0.2). Since STARS relies on a random number generator for the probabilistic selection of nodes during solution construction, each experiment is repeated 10 times and results are averaged over all independent runs to reduce the random variation of the proposed approach. In our experiments, the size of the recommendation list is set to N=10.

4.1 Sensitivity analysis for STARS

In this section, we test the effects of different parameters on the performance of STARS. These parameters include the number of item clusters (z), the number of ants (γ), the relative weight of pheromone trail (α), the relative weight of heuristic value (β), the maximum trust propagation distance (φ), the maximal number of the best trust paths (bpn), the pheromone decay value (ρ), and the time decay weight (𝜃).

For each data set, proper values of the parameters are selected by performing sensitivity analysis on that data set: the parameters are examined individually, i.e., by changing the value of a given parameter while keeping the default values for the rest. Considering the prediction accuracy, the appropriate value for the given parameter is selected according to the average of the results obtained over all time steps. The examined parameter is then fixed at this value and the procedure is repeated for the next parameter, until all parameters have been assigned their best values.
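This one-at-a-time tuning procedure can be sketched as a simple sweep; `evaluate` is a hypothetical callable that returns the average MAE over all time steps for a given parameter setting.

```python
def tune_parameters(defaults, candidate_values, evaluate):
    """One-at-a-time sensitivity analysis: sweep each parameter while the
    others keep their current values, then fix it at its best value.

    defaults:         {name: default value}
    candidate_values: {name: list of values to try}
    evaluate:         callable(params_dict) -> average MAE (lower is better)
    """
    params = dict(defaults)
    for name, values in candidate_values.items():
        scores = {}
        for v in values:
            trial = dict(params, **{name: v})
            scores[v] = evaluate(trial)              # averaged over all time steps
        params[name] = min(scores, key=scores.get)   # keep the best value found
    return params
```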

Effect of parameter z

We investigate the impact of different numbers of clusters on the prediction performance of STARS. Figure 3a and b illustrate the MAE under different values of z for the ML-100K and ML-1M data sets, respectively. For the ML-100K data set (Fig. 3a), the average MAE improves as the value of z increases up to a certain point; afterwards, the prediction accuracy degrades as z keeps increasing. A similar trend can also be seen for the ML-1M data set (Fig. 3b). This is because a small number of clusters makes the cluster information too general, so differences among dissimilar items cannot be represented properly. As the number of clusters increases, the number of co-rated items between two users in a cluster decreases. So, for large values of z, there are not enough items in each cluster to derive the implicit trust values between users, and thus the accuracy of predictions decreases. This observation demonstrates the importance of selecting an appropriate number of item clusters. As shown, for the ML-100K and ML-1M data sets, the best performance is attained for z=8 and z=18, respectively. In the remaining experiments, we keep these values as the defaults.

Fig. 3 The effect of parameter z on the prediction performance for (a) ML-100K and (b) ML-1M

Effect of parameter γ

To select a proper value for the number of ants in the ML-100K data set, we vary γ from 50 to 943 in steps of 50. The maximum number of ants is equal to the number of nodes (i.e., users) in the trust network. For the ML-1M data set, γ varies within a range of 100 to 6040 in steps of 200. Figure 4 illustrates the effect of the number of ants on the prediction performance of STARS. As shown in Fig. 4a, the prediction quality improves as we increase γ. In fact, for small values of γ, there are not enough ants to explore the search space. However, for γ>250, the rate of improvement decreases. The reason is that when γ is high, the degree of overlap between the solutions constructed by different ants increases. Thus, for high values of γ, the resulting neighborhoods will be similar, and consequently the predictions are nearly identical. On the other hand, increasing the number of ants harms the efficiency (in terms of execution time) of STARS in its online phase. Thus, to obtain both effectiveness and efficiency, the number of ants must be fine-tuned. Regarding efficiency, we measure the run-time of STARS in the online phase while varying the number of ants. Figure 5 shows the average time required to generate the top-10 recommendations for a test user under different values of γ. As the number of ants increases, STARS needs more execution time because more search is done. For the ML-100K data set, the best combination of effectiveness and efficiency is attained by selecting γ between 250 and 350, where the resulting MAE is about 0.69 (very close to the result with 943 ants) and the run-time is under 0.15 seconds. In fact, for γ>350, the prediction error decreases only slightly while the execution time increases significantly. In the later experiments, we fix γ=250 for ML-100K. Similar results are observed for the ML-1M data set, where the best value for γ is about 1300.

Fig. 4 The effect of parameter γ on the prediction performance for (a) ML-100K and (b) ML-1M

Fig. 5 The effect of parameter γ on the efficiency of STARS for (a) ML-100K and (b) ML-1M

Effect of parameters α and β

An important step of the STARS approach is to compute the probability of selecting a node based on the trust value and rating similarity (see (12)). Two exponential parameters in (12), namely α and β, have a significant impact on the effectiveness of the STARS approach. When α≥1 and β=0, the probability of selecting a node completely depends on the trust value. When α=0 and β≥1, the trust value is not considered and the transition probability is totally dependent on the rating similarity. To find reasonable values for α and β, we analyze how the combination of these parameters affects the accuracy of predictions in STARS.

We conduct an experiment in which the values of α and β vary within the range of 0 to 10 with step 1. The setting resulting in the best performance will be adopted for the latter experiments of this research. Figure 6 illustrates the effect of parameters α and β on the prediction performance of STARS in both data sets. The results show that considering both trust information and rating similarities can significantly improve the performance compared to the cases that either α or β is equal to zero. When β=0, STARS selects neighbors only based on their reputation obtained from previous time steps, and therefore, it cannot adapt itself to dynamically changing user interests. When α=0, STARS only utilizes the observed user-item ratings, leading to poorer performance, especially when dealing with cold users. So, we conclude that both trust information and rating similarities should be combined to improve the recommendation performance.

Fig. 6 The effect of parameters α and β on the prediction performance for (a) ML-100K and (b) ML-1M

Figure 6 shows that the best prediction accuracy for the ML-100K data set is achieved by setting α=1 and β=2, where the optimal MAE is equal to 0.6852. For the ML-1M data set, these parameters should be set as α=1 and β=3, resulting in MAE = 0.7686. According to Fig. 6, in most cases, the lower MAE values are obtained when α≤β. More specifically, for ML-100K (ML-1M), MAE is less than 0.687 (0.770) when α≤β in approximately 80 % (67 %) of cases. Based on these results, we may conclude that similarity is more important than trust information in determining users’ neighborhoods in the proposed system. This is because the similarity information reflects the current preferences of users, while the trust information reflects previously learned knowledge based on the global reputation of users and their past involvement in generating recommendations. By giving more weight to the rating similarity, STARS is better able to adjust the dynamic trust values according to changes in users’ preferences.

Effect of parameter φ

In this experiment, we investigate the effect of different maximum trust propagation distances on the prediction performance of STARS. We vary φ from 1 (i.e., no propagation) up to 10, since the further away a user is from the source user, the less reliable the inferred trust value becomes. The effect of different values of φ on the prediction performance is illustrated in Fig. 7. By increasing φ, the accuracy of predictions initially increases, as expected. The reason is that increasing the propagation distance allows STARS to overcome data sparsity and provide more trusted neighbors for every single user. Nevertheless, for large values of φ, accuracy decreases: with longer propagation levels we move further away from the active user, and consequently the inferred trust value is less reliable. The involvement of less trustworthy users in predicting the missing ratings thus has a negative impact on the accuracy of predictions. As shown in Fig. 7, the best performance is obtained when φ=3 for ML-100K and when φ=5 for ML-1M. In the remaining experiments, we keep these values as the defaults.

Fig. 7 The effect of different values of φ on the prediction performance for (a) ML-100K and (b) ML-1M

Effect of parameter bpn

STARS needs to select a set of best trust paths to identify the active user’s neighborhood. In this experiment, we determine the best value for parameter bpn (i.e., the maximal number of the best trust paths). We vary bpn from 10 to 590 in steps of 20 for ML-100K, and from 10 to 3170 in steps of 40 for ML-1M. The experimental results for both data sets are displayed in Fig. 8. As shown, small values of bpn result in low prediction accuracy, since the identified neighbors of the active user are not sufficient to make accurate predictions. Large values of bpn also have a negative impact on the accuracy, as solutions with a low path trust value may be considered for neighborhood formation. The best prediction accuracy for the ML-100K data set is achieved by setting bpn = 230, where the resulting MAE is 0.6685. For ML-1M, bpn should be set to 970, which results in the best prediction performance (i.e., MAE = 0.7469) for this data set. These settings are adopted for the later experiments of this paper.

Fig. 8 The effect of different values of bpn on the prediction performance for (a) ML-100K and (b) ML-1M

Effect of parameter ρ

Here, we investigate the influence that the pheromone decay value has on the performance of STARS. We run experiments with different settings for the evaporation rate. In these experiments, the values of ρ vary from 0 to 0.8 with step 0.01 for both data sets. The effect of parameter ρ on the prediction performance of STARS is illustrated in Fig. 9. The results on both data sets show that when the value of ρ is too small (e.g., ρ<0.05) or too large (e.g., ρ>0.5), the prediction accuracy is low. A very low evaporation rate prevents STARS from being exploratory enough, and thus, results in a fast convergence towards selected neighborhoods in the initial iterations. This makes STARS adapt itself to dynamically changing user interests more slowly since the probability of selecting new neighbors is low and STARS cannot quickly direct its search toward new directions. On the other hand, a very high rate of evaporation decreases the capability of the ants in exploiting the knowledge learned in the previous iterations. As shown in Fig. 9, for ML-100K, the performance of STARS will be more stable when 0.09≤ρ≤0.16. For the other data set, the best results are obtained when 0.11≤ρ≤0.19. In the following experiments, parameter ρ is set to 0.11 and 0.15 for the ML-100K and ML-1M data sets, respectively.

Fig. 9 The effect of different values of ρ on the prediction performance for (a) ML-100K and (b) ML-1M

Effect of parameter 𝜃

Finally, we study the impact of the time decay weight on the prediction quality of STARS. This parameter controls the decay rate of the temporal relevance function used in the similarity computation (9). In this experiment, we investigate the performance of our approach for different values of 𝜃, varying 𝜃 from 0 to 0.4 in steps of 0.02. As shown in Fig. 10a, increasing 𝜃 to place more emphasis on recent ratings improves the prediction quality for the ML-100K data set. However, when 𝜃≥0.2, the prediction error increases as 𝜃 increases. For large values of 𝜃, the system almost forgets the historical information in favor of more recent data; in other words, placing too much emphasis on recent ratings and de-emphasizing the history of past behavior leads to lower prediction accuracy. Based on the results presented in Fig. 10a, the optimal value of parameter 𝜃 for ML-100K lies between 0.1 and 0.18. For the ML-1M data set, similar findings are obtained, with optimal values of 𝜃 in the interval 0.08 to 0.14. We set 𝜃=0.12 for ML-100K and 𝜃=0.08 for the other data set in the remaining experiments.

Fig. 10 The effect of different values of θ on the prediction performance for (a) ML-100K and (b) ML-1M

4.2 The effectiveness of STARS

In this section, we conduct a number of experiments to examine the effectiveness of the proposed approach over the benchmark algorithms, in terms of the prediction, classification and rank accuracy. The experiments also investigate the ability of STARS to resolve the main limitations of CF.

4.2.1 Comparing the accuracy of STARS with baseline algorithms

In this experiment, we compare the accuracy of STARS against all benchmark algorithms at different time slots on both data sets. For this purpose, we select those users who have rated more than 5 items at time t = t_0 as the test users and measure the accuracy of the different algorithms over a period of time. For the ML-100K and ML-1M data sets, the number of users who rated more than 5 items at time t = t_0 is 363 and 2330, respectively (almost 40 % of the users). It should be noted that the system performance for cold users with fewer than 5 ratings is demonstrated in the next section, where we evaluate the effectiveness of STARS in mitigating the cold-start problem. As mentioned before, we use MAE to evaluate the prediction accuracy; also, recall and ARHR are used to evaluate the quality of the top-N recommendations. The average MAE, recall and ARHR for each algorithm over the different time slots are presented in Fig. 11a–f.
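The test-user selection described above amounts to a simple filter over the ratings available at t = t_0; a minimal sketch, assuming the ratings are held as a mapping from user id to the set of items rated at t_0, is:

def select_test_users(ratings_at_t0: dict, min_ratings: int = 5) -> list:
    """Return the users who rated more than min_ratings items at t = t_0."""
    return [u for u, items in ratings_at_t0.items() if len(items) > min_ratings]

ratings_at_t0 = {"u1": {1, 2, 3, 4, 5, 6}, "u2": {7, 8}}
test_users = select_test_users(ratings_at_t0)   # -> ["u1"]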

Fig. 11 The accuracy of STARS against baseline algorithms for ML-100K and ML-1M

It is not surprising that the TSF and TARS algorithms, which exploit additional sources of knowledge, perform better than the traditional user-based and item-based CF algorithms. However, STARS outperforms all other baseline algorithms in all cases, indicating that it is effective in improving both the prediction accuracy and the quality of the recommendations. To get a better view of the overall performance of STARS, we further compute the average percentage of improvement obtained by STARS over the other algorithms across the different time steps. Considering an accuracy measure M (such as MAE, recall, etc.), we compute the percentage of improvement in each time slot as follows (see Footnote 3):

$$ \text{Improvement}=\frac{\text{STARS}\cdot M-\text{Baseline algorithm}\cdot M}{\text{Baseline algorithm}\cdot M}\times 100~\% $$
(25)

where STARS⋅M refers to the accuracy of STARS in terms of the measure M, and “Baseline algorithm” refers to any of the algorithms against which we compare the proposed approach. Table 14 shows the results in terms of the MAE, recall and ARHR measures. These results indicate the positive effect of considering all the main properties of trust on the performance of a trust-based recommender system. STARS performs better than its trust-based competitors since it traverses context-specific trust networks in a depth-first manner, using both the previously learned trust knowledge and the information about users’ current preferences, to select the best neighbors of the active user at each time slot.
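Equation (25) translates directly into code; note that for error measures such as MAE, where lower values are better, the sign of (25) has to be read the other way around. The helper below simply reports the raw percentage:

def improvement(stars_m: float, baseline_m: float) -> float:
    """Percentage improvement of STARS over a baseline for a measure M, as in (25).

    For measures where higher is better (recall, ARHR) a positive value means
    STARS is better; for error measures such as MAE the sign reads the other way.
    """
    return (stars_m - baseline_m) / baseline_m * 100.0

# Example with recall (higher is better): STARS at 0.42 vs. a baseline at 0.35
print(f"{improvement(0.42, 0.35):.1f} %")   # 20.0 %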

Table 14 The average improvement of STARS over other algorithms in terms of the prediction, classification and rank accuracy

4.2.2 Handling the cold-start problem

In this section, we investigate the effectiveness of STARS in dealing with cold users. For this purpose, we conduct two classes of experiments. In the first set of experiments, we choose users who rated no items at time t = t_0 as the test users and measure the accuracy of the generated recommendations for each test user over the different time slots. As time goes by, cold users provide more information to the system, and therefore their preferences can be extracted more precisely. For the ML-100K and ML-1M data sets, the number of users who provided no ratings at time t = t_0 is 574 and 3698, respectively (almost 61 % of the users).

In the second set of experiments, the users who have rated fewer than 5 items (excluding users who rated no items) at time t = t_0 are chosen as the test users. Since neither data set contains an adequate number of users with fewer than 5 ratings, we need to simulate such a cold-start situation. For this purpose, we partition the users into test users (cold users) and training users using 5-fold cross-validation. Then, we randomly select fewer than 5 ratings from each test user at time t = t_0 as the known ratings for this user in the first time slot. Finally, as in the first set of experiments, we measure the performance of each algorithm in generating valuable recommendations for each test user over the different time slots.
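A minimal sketch of this simulation step, assuming the ratings of each prospective cold user are held as a set of item ids (the fold construction of the 5-fold cross-validation is omitted), is:

import random

def simulate_cold_users(user_ratings: dict, max_known: int = 4, seed: int = 0) -> dict:
    """For each simulated cold user, keep fewer than 5 randomly chosen ratings as
    the known ratings at t = t_0; the remaining ratings are hidden for evaluation."""
    rng = random.Random(seed)
    known = {}
    for user, items in user_ratings.items():
        k = min(max_known, len(items))
        known[user] = rng.sample(sorted(items), k)
    return known

known_at_t0 = simulate_cold_users({"u1": {1, 2, 3, 4, 5, 6, 7}, "u2": {8, 9}})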

Figure 12 shows the results of the experiments conducted to demonstrate the effectiveness of STARS in dealing with extreme cold users (i.e., users who rated no items) on both data sets. In the first time slot, since there is no information about the test users’ preferences, the UCF, ICF and TSF approaches are unable to predict missing ratings and make any recommendation. As expected, the overall performance of these algorithms tends to improve as the number of items rated by the test users increases. As is clear from Fig. 12, TARS performs poorly for extreme cold users and its performance does not change significantly over time. The reason is that when a user has not provided any ratings, TARS assigns a zero weight to all outgoing links of that user in the trust network. Consequently, the probability of selecting any node is equal to zero, according to the probability equation used in TARS [36]. Therefore, the direct neighbors of an extreme cold user are selected randomly, since TARS chooses neighbors in the descending order of their associated probabilities. This leads to the poor performance of TARS under extreme cold-start settings. Furthermore, in such extreme cases, the weights of the edges connecting a cold user and her neighbors remain zero across all time slots, according to the update rule used in TARS [36]. Therefore, this approach cannot utilize the ratings which are gradually provided by the extreme cold users, and the neighbors of such users are always selected randomly in all time slots.

Fig. 12 The performance of different algorithms in handling extreme cold users at time t = t_0 on ML-100K and ML-1M

As shown in Fig. 12, STARS consistently achieves significantly better performance than its competitors in terms of both prediction accuracy and recommendation quality. In the first time slot, where there is no rating information about the test users’ preferences, STARS can leverage the global trust information to generate valuable recommendations. As time passes, it tends to generate better recommendations by continuously updating the local trust values between users. We now compare STARS with TSF, which is the second best algorithm for both data sets: for the ML-100K data set, the average improvements in terms of HR, ARHR and MAE are about 93 %, 82 % and 9 %, respectively; for the ML-1M data set, the average improvements in terms of HR, ARHR and MAE are about 56 %, 78 % and 13 %, respectively.

Figure 13 shows the performance of the previously mentioned algorithms in dealing with cold users who rated at least one but fewer than 5 items at time t = t_0. From Fig. 13, we observe that STARS outperforms all other baseline algorithms over all time slots, indicating the effectiveness of STARS in cold conditions. For both data sets, the improvements achieved by STARS over the other algorithms tend to be higher in the initial time slots. The reason is that as time goes by, data sparsity decreases, and thus the difference between the performance of STARS and that of the other benchmarks decreases as well. Table 15 reports the average improvement of STARS over its trust-based competitors, TSF and TARS, according to the results presented in Fig. 13.

Fig. 13 The performance of different algorithms in handling cold users who rated at least one but fewer than 5 items at time t = t_0 on ML-100K and ML-1M

Table 15 The average improvement of STARS over its trust-based competitors in cold conditions where only a few ratings are available

4.2.3 Handling the data sparsity problem

This section examines the effectiveness of STARS in alleviating the data sparsity problem. For this purpose, we design a set of experiments in which the performance of each baseline algorithm is investigated in the last time slot. More specifically, the ratings in the last time step are used as the test data in these experiments for the following reason. When we evaluate the performance of an algorithm over different time slots, the training and test sets change over time and the sparsity of the training data decreases. To better isolate the impact of the data sparsity problem, we fix the test set and report the performance of each algorithm only in the last time slot. With a fixed test set, we evaluate the performance of each algorithm at different sparsity levels by randomly choosing different subsets of the rating data from the training set.

For the ML-100K and ML-1M data sets, the test set consists of 4582 and 2272 ratings, respectively. The density of the training data TR (i.e., the ratings issued in the remaining time steps) is 6.02 % and 4.18 %, respectively. The sparsity level is defined as (1 − density), where \(\text{density} = \#\text{Ratings} / (\#\text{Users} \times \#\text{Items})\). Thus, the sparsity level of the training data for ML-100K and ML-1M is 93.98 % and 95.82 %, respectively. In order to evaluate the performance of each algorithm at higher sparsity levels, we randomly choose different subsets of the rating data from TR for training. Assume that π is the maximum number of observed ratings for each user in the training set. In order to decrease the density of the training data, for each user who has rated more than π items, we randomly pick π observed ratings from TR for training. For example, in ML-100K, if we set π=182, then the density of the training data decreases from 6.02 % to 5.02 %. To achieve lower densities, we set π=113, 69 and 39, leading to density levels of 4.02 %, 3.01 % and 2.03 %, respectively. Therefore, with a fixed test set, we vary the sparsity of the training set from 93.98 % to 97.97 % with a step increment of about 1 %. Similarly, for ML-1M, we set π=435, 275, 187 and 127, leading to density levels of 3.68 %, 3.18 %, 2.68 % and 2.18 %, respectively. Therefore, the sparsity of the training set is varied from 95.82 % to 97.82 % with a step increment of 0.5 %.
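The subsampling procedure can be sketched as follows: every user with more than π training ratings is capped at π randomly chosen ones, and the density is then recomputed from the definition above. The data layout (a mapping from user id to rated items) is our own assumption:

import random

def cap_user_ratings(train_ratings: dict, pi: int, seed: int = 0) -> dict:
    """Cap each user's training ratings at pi randomly chosen ones; users with
    at most pi ratings are kept unchanged."""
    rng = random.Random(seed)
    capped = {}
    for user, items in train_ratings.items():
        items = sorted(items)
        capped[user] = items if len(items) <= pi else rng.sample(items, pi)
    return capped

def density(ratings: dict, n_users: int, n_items: int) -> float:
    """density = #Ratings / (#Users x #Items), as defined above."""
    return sum(len(v) for v in ratings.values()) / (n_users * n_items)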

Figure 14 shows the performance of each algorithm at different sparsity levels for both data sets. As expected, by incorporating an additional source of information, the trust-based algorithms outperform the traditional user-based and item-based CF algorithms. The transitive property of trust helps in alleviating the data sparsity problem. Among the trust-based algorithms, STARS performs significantly better than TSF and TARS at all sparsity levels. The reason is that the application of the global trust model at the initial stage produces a robust system capable of finding more trusted neighbors for each user even on a very sparse data set. As time passes, more accurate neighbors are selected through the local update of the dynamic trust pheromone values according to the users’ current preferences. To better illustrate the improvement achieved by STARS over the other baseline algorithms as the sparsity increases, Fig. 15 shows the percentage of improvement obtained by STARS over the second best performing algorithm for both data sets across the five sparsity levels. As Fig. 15 shows, the relative improvement increases as the training set becomes sparser. At the sparsest level, STARS outperforms the second best algorithm by at least 26.22 %, 31.79 % and 6.38 % in terms of HR, ARHR and MAE, respectively.

Fig. 14 Comparison of STARS with other benchmarks against different sparsity levels on ML-100K and ML-1M

Fig. 15 The improvement of STARS over the second best algorithm across the five sparsity levels

4.2.4 Handling the MIMC problem

We now turn to investigating the impact of the proposed approach on the MIMC problem. For simplicity, the content of a movie is described only via its associated genres in the experiments of this section. Therefore, we only consider the belongsToGenre object property (refer to Section 3.1.1) to compute semantic similarities and cluster items. Both the ML-100K and ML-1M data sets provide the same genre set GS, which consists of 18 different genres, including Action, Children’s, Horror, Musical, etc. Now, we need to simulate an environment where items have different content. To this end, we gradually increase the number of available genres in the data sets.
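Because only the belongsToGenre property is used here, item-to-item semantic similarity effectively reduces to a comparison of genre sets. As one simple, hedged illustration (the actual similarity measure (1) used by STARS may be defined differently), a Jaccard overlap over genre sets would look like:

def genre_similarity(genres_a: set, genres_b: set) -> float:
    """Jaccard overlap of the genre sets of two movies (illustrative only; the
    semantic similarity (1) used by STARS may be defined differently)."""
    if not genres_a and not genres_b:
        return 0.0
    return len(genres_a & genres_b) / len(genres_a | genres_b)

print(genre_similarity({"Horror", "Thriller"}, {"Thriller", "Action"}))  # 1/3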

Suppose that π is the number of available genres in the data sets. We vary the value of π in the range from 1 to 18 with step 1. At each step, we select π genres from GS and only keep the movies associated with the selected genres in the data set. For each value of π, we repeat the experiment for all subsets of size π from GS and then average the results over all subsets. It is worth noting that the number of clusters (i.e., parameter z) is fine-tuned for each experiment separately.
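This protocol amounts to enumerating all genre subsets of size π, restricting the data set to the movies carrying at least one selected genre, running the experiment on each restricted set, and averaging. A sketch under an assumed data layout (movie id mapped to its genre set) is:

from itertools import combinations

GS = ["Action", "Children's", "Horror", "Musical"]   # in the experiments |GS| = 18

def restricted_data_sets(movie_genres: dict, pi: int):
    """Yield, for every genre subset of size pi, the movies whose genres
    intersect the selected subset."""
    for subset in combinations(GS, pi):
        selected = set(subset)
        yield selected, {m for m, g in movie_genres.items() if g & selected}

movie_genres = {"m1": {"Action"}, "m2": {"Horror", "Musical"}, "m3": {"Children's"}}
for subset, movies in restricted_data_sets(movie_genres, pi=2):
    pass  # run the experiment on `movies` and average the results over all subsets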

Figure 16 shows the average performance (over all time slots) of each algorithm against different values of π for both data sets. As shown, the overall prediction accuracy and recommendation quality of STARS tend to improve as the value of π increases. For both data sets, STARS shows similar or slightly worse performance than the TARS and TSF algorithms for small values of π, while with an increasing number of available genres it outperforms both competitors. The reason is that when the value of π is small (e.g., π<5), there may not be enough data in each cluster to compute the implicit trust values between users. However, as π increases, the advantage of STARS in utilizing context-dependent trust relations becomes more pronounced and its performance tends to improve. By utilizing context-dependent trust relations, the trusted neighbors of an active user for a horror movie, for example, may differ from her trustworthy neighbors for a comedy movie. As mentioned in Section 3.1.1, STARS calculates implicit trust values between users based on the semantic features of items. This allows STARS to infer multiple trust relationships between users, each of which may be in a different context. Therefore, the proposed approach is capable of generating proper recommendations in environments where users have different interests and items have different content. Based on the results presented in Fig. 16, Table 16 reports the average improvement of STARS over the second best performing algorithm for both data sets when π>10.

Fig. 16 The performance of different algorithms in handling the MIMC problem for ML-100K and ML-1M

Table 16 The average improvement of STARS over the second best algorithm in dealing with the MIMC problem when π>10

4.3 Run-time performance of STARS

In this section, we compare the execution time required for the online phase of the different algorithms. Figure 17 shows the average online execution time needed to generate the top-10 recommendations for a test user on both data sets. All times in Fig. 17 are in milliseconds. The results show that STARS is the slowest of the compared algorithms. ICF is faster than any other algorithm since it completes the whole task of generating the item-neighborhood matrix in the offline mode. TARS presents the second best performance here. The reason is that it performs a modified breadth-first search in the trust network to generate recommendations, so it does not need to search the whole database to identify the best neighbors of an active user. TSF needs more time than UCF to provide recommendations because it combines both user-based trust-enhanced CF and item-based semantic-enhanced CF into a single framework, which makes it computationally more expensive. From Fig. 17, we can observe that the execution time of STARS is slightly larger than that of the TSF algorithm. This is because STARS traverses context-specific trust networks in a depth-first manner, using both the previously learned trust knowledge and time-based user-to-user similarities, to select the best neighbors of the active user. Such an algorithm naturally requires more time to generate recommendations, reflecting the usual trade-off between higher accuracy and lower execution time. Although STARS takes more running time than the other baseline algorithms, it achieves significantly better results in terms of accuracy, especially when dealing with the cold-start, data sparsity and MIMC problems.
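The reported online times are per-user averages of the recommendation latency; measuring them is straightforward, for instance with a timing helper such as the one below, where recommend(user, n) stands for the online phase of whichever algorithm is being measured (a hypothetical placeholder, not an interface defined in the paper):

import time

def average_online_time_ms(recommend, test_users, n: int = 10) -> float:
    """Average wall-clock time (ms) to produce the top-n recommendations per test user."""
    start = time.perf_counter()
    for user in test_users:
        recommend(user, n)
    return (time.perf_counter() - start) * 1000.0 / len(test_users)

# Dummy usage with a stand-in recommender:
average_online_time_ms(lambda user, n: list(range(n)), ["u1", "u2"])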

Fig. 17 Comparison of the run-time performance of STARS with other benchmarks for (a) ML-100K and (b) ML-1M

5 Conclusions and future work

In this paper, we have made progress towards a more effective trust-based recommender system that takes all distinct properties of trust into account. We have proposed a new dynamic recommender system, STARS, which considers contextual and temporal information to select the most trustworthy neighbors of the active user according to her current interest in a specific type of item. Hence, it is able to adapt itself to dynamically changing user interests. STARS has been designed in such a way that it satisfies the asymmetry, transitivity, dynamicity and context-dependency properties of trust. This is achieved by applying the ant colony optimization algorithm to the semantically enhanced trust relations. In the proposed system, the weights of the trust relationships are initialized with the global trust values in a specific context and, as time goes by, are updated locally to reflect the dynamic trust values assigned by one user to another in that context.

The incorporation of both global and local trust into CF, along with the trust computation based on the semantic features of items, allows STARS to alleviate the data sparsity, cold-start and MIMC problems. Time-based experiments on two real-world data sets showed that the proposed approach:

(1) has a better overall performance compared to other benchmarks in terms of the prediction, classification and rank accuracy;

(2) achieves significant improvements over the other algorithms when dealing with cold users, especially in the case where the active users have not provided any ratings;

(3) can effectively handle the MIMC problem when there are enough items with different content in the data set; and

(4) has a slightly higher execution time than the other competitors.

In the present work, we have adopted a relatively simple method to compute semantic similarities between items (see (1)). For future research, we intend to consider a more sophisticated semantic inference approach in order to improve the accuracy of the semantic similarities used by STARS. Furthermore, one potential limitation of the present work is the use of k-medoids, a hard clustering method that assigns each item to exactly one cluster. However, since an item may have different characteristics, it can belong to more than one cluster. Therefore, a further direction for future work is to apply a soft (fuzzy) clustering method that allows items to be members of two or more discovered clusters. In this case, our proposed system must be redesigned such that the process of neighborhood formation considers multiple clusters instead of exactly one. Another direction for future work is to further improve the accuracy of the proposed approach by incorporating different types of contextual information that may affect trust inference, such as the user’s current mood, location or activity. Finally, in order to improve the neighborhood formation in STARS, future studies will focus on incorporating more advanced modeling techniques for the analysis of the temporal dynamics of user feedback.