Leveraging contextual influence and user preferences for point-of-interest recommendation

Yu, Dongjin; Wanyan, Wenbo; Wang, Dongjing

doi:10.1007/s11042-020-09746-0

Leveraging contextual influence and user preferences for point-of-interest recommendation

Published: 08 September 2020

Volume 80, pages 1487–1501, (2021)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Multimedia Tools and Applications Aims and scope Submit manuscript

Leveraging contextual influence and user preferences for point-of-interest recommendation

Download PDF

866 Accesses
26 Citations
Explore all metrics

Abstract

The effective Point-of-Interest (POI) recommendation can significantly assist users to find their preferred POIs and help POI owners to attract more customers. As a result, a variety of methods have been proposed to tackle the issue of POI recommendation recently. However, it is still very difficult to precisely model the strong correlations between the POIs visited by the user and the POIs to be visited next, which leads to the poor performance of POI recommendation. In this paper, we propose a context- and preference- aware model (CPAM) to incorporate both contextual influence and user preferences into POI recommendation. Firstly, we design a Skip-Gram based POI Embedding Model (SG-PEM) to capture the contextual influence of POIs and learn the vector representation (embedding) of POIs from visiting sequences. The users’ preferences for the target POIs are obtained from the learned embeddings via similarity metric. Secondly, for the implicit feedback information contained in the check-in data, we use the Logistic Matrix Factorization (LMF) algorithm to model the users’ personalized preferences for POI. Finally, we unify SG-PEM and LMF as the CPAM model to perform personalized recommendation by leveraging contextual influence and user preferences. The experimental results on two real-world datasets of Foursquare and Gowalla show that the proposed model outperforms the state-of-the-art baselines.

Graph-Based Metric Embedding for Next POI Recommendation

On successive point-of-interest recommendation

Article 23 July 2018

HRec: Heterogeneous Graph Embedding-Based Personalized Point-of-Interest Recommendation

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

With the development of GPS-enabled smartphones and mobile internet, users can easily obtain real-time geographic location information, and record/share their activities and daily lives via check-in on Location Based Social Networks (LBSN), such as Yelp, Instagram, Foursquare and Twitter. Specifically, as the digital mirror to human trajectories in physical world, these POI check-in sequences on LBSN indicate human mobility and behavioral information, including sequential context or periodic transition patterns [16, 44]. For example, people may regularly stop by coffee shop to grab a cup of coffee on their way to work in the morning, and some users may prefer to have high-protein dinner after taking part in sports and physical activity, which can be explained as sequential transition patterns. The massive amount of check-in data can not only help understand users’ preferences for POIs and provide the possibility of personalized POI recommendation, but also benefit for business to acquire more potential customers. As a fact, the effective POI recommendation can significantly assist users to find their preferred POIs and help POI owners to attract more customers.

Recommender systems have been applied in a variety of areas, such as music recommendation [5, 35], content recommendation [21, 22], business process recommendation [11], and social recommendation [6]. POI recommendation has become a popular research direction and have attracted much attention [2, 4, 39]. As we know, LBSN is a complex heterogeneous network that includes the relationship between users and POIs. Compared to traditional recommender systems, POI recommendation faces three new challenges: 1) Correlations between consecutive POIs. Most existing POI recommendation methods consider all the users’ check-in data as a whole. Specifically, these works only utilize the check-in relationship between the users and the POIs, and ignore the correlations between consecutive POIs visited by a user. In fact, there are strong correlations between the POIs visited by the user and the POIs to be visited next. For example, as shown in Fig. 1, when User 1 left office at night after work, he/she would go to eat instead of going outdoors. 2) Implicit feedback properties for check-in data. In traditional recommender systems, users express their explicit feedbacks through ratings. However, the users’ check-in behaviors at POI are implicit feedbacks, where positive feedback is the POIs that user have visited. The POIs that are not visited include the POIs that users are not interested in and the POIs that users have not yet discovered but may be interested in. 3) The users’ personalized preferences and patterns. Two users with similar interests may have different behavior patterns. Therefore, a POI recommender system should recommend POIs for users based on their personalized preferences and patterns.

In this paper, we propose a unified Context- and Preference- Aware Model (CPAM) for POI recommendation. Specifically, since users’ check-in behaviors at the target POI are influenced by users’ previously visited POIs and influence the POIs visited by the user after the target POI, we design a Skip-Gram based POI Embedding Model (SG-PEM) to capture the contextual influence of POIs, and learn the vector representation (embedding) of POIs from visiting sequences. Then we calculate the use rs’ preferences for the target POI based on the learned embeddings and similarity metric. For the implicit feedback information contained in the check-in data, we adopt the Logistic Matrix Factorization (LMF) algorithm [20] to model the users’ preferences for POIs from implicit feedbacks. Finally, we unify SG-PEM and LMF as the CPAM model to perform personalized recommendation by leveraging contextual influence and user preferences.

In summary, the main contributions of this paper are summarized as follows:

We design a Skip-Gram based POI Embedding Model called SG-PEM, which is able to learn the embedding of POIs and capture the contextual influence of POIs from users’ check-in sequences effectively.
We propose a unified model CPAM which combines the SG-PEM model and LMF model to capture both the contextual influence of POIs and users’ preferences, respectively, for better recommendation.
Experimental results on two real-world datasets show that the effectiveness of the proposed model CPAM compared with the state-of-the-art baselines.

The rest of the paper is structured as follows: After Section 2 introduces some related works, Section 3 details the proposed POI recommendation model. Then we compare its performance with some baselines in Section 4. Finally, Section 5 concludes the paper and outlines the future work.

2 Related works

The POI recommender system utilizes a user’s historical check-in data to model the behavioral patterns of the users and recommends a sequence of POIs based on the users’ preferences [27, 40]. Currently, researchers attempt to further improve the performance of the POI recommender system by integrating various information in the LBSN that affects the users’ check-in behavior. According to different types of fusion information, POI recommender systems can be divided into the following four categories.

POI recommendation incorporating temporal information

The users’ check-in behavior always shows the periodic characteristics change over time [16, 42]. For example, Li et al. [23] proposed a time-aware personalized POI recommendation by analyzing the trend of user behavior patterns over time and considering the long-term and short-term preferences for users.

POI recommendation incorporating geographical information

Since the check-in behavior is a physical interaction between users and POIs, users prefer to visit the POIs that close to them. Many researches have incorporated geographic information into POI recommendation [8, 26, 43]. For example, Rahmani et al. [30] proposed a POI recommendation model LGLMF by fusing the geographical model into the logistic matrix factorization approach. LGLMF improves the performance of recommendation by considering both users’ and locations’ point of view of geographical information.

POI recommendation incorporating category information

The POIs in LBSN are usually divided into different categories [16]. The category of the POIs visited by users implies the behavior characteristics of users. For example, shopping people like to go to various malls, and gourmets are keen to check in at restaurants. He et al. [17] proposed a two-step approach for next POI recommendation to predict users’ preferences for the category-level first, and then derive the ranking list of POI candidates based on the predicted category preference.

POI recommendation incorporating sequential information

Factorizing personalized Markov chains (FPMC) [31] is an extension of the ordinary Markov model and can provide personalized recommendations for different users. Recurrent Neural Network (RNN) has been proved very successful on modeling sequential information, and usually outperforms FPMC in POI recommendation [24]. In addition, many researchers have also tried to employ the Word Embedding technique to solve the POI recommendation problem [8] by transforming POIs into low-dimensional space vectors, and then recommend the next POIs for the user based on the similarity between the vectors.

When a user posts a check-in, he/she may also make a short comment, which describes the users’ feelings about POIs. Many researches [4, 38] have actively explored content information to improve the performance of POI recommendation. Generally, users’ comments at POI are short. It brings a great challenge to POI recommendation because the short texts are usually sparse, noisy, and ambiguous. In addition, social relationships are the attributes that LBSN inherits from traditional social networks. Many researchers have tried to use social relationships to improve the performance of POI recommender system [18, 41]. However, some studies show that only a small number of users have similar preferences with their friends in visiting POIs, and social relationships have limited influence on users’ check-in behavior.

3 Proposed model

In this section, we will introduce the proposed Context- and Preference- Aware Model (CPAM) in details. As shown in Fig. 2, we first capture contextual influence of POIs and learn vector representation (embedding) of POIs by a Skip-Gram based POI Embedding Model (SG-PEM). Then we calculate the users’ preferences for the target POI based on the learned embeddings and similarity metric. Moreover, we adopt the Logistic Matrix Factorization (LMF) algorithm [20] to model the users’ preferences for POI from implicit feedbacks. Finally, we unify these two models as CPAM model for POI recommendation.

3.1 POI embedding model

The proposed POI Embedding Model for learning the effective embedding of POI in this paper can be seen as part of the literature on representations learning [1]. Specifically, as one of the most popular embedding techniques, Word2Vec [28] can map symbolic data, such as words, from a space with one dimension per symbolic data object (one-hot representation) to a continuous vector space with much lower dimension based on sequences in training dataset, such as sentences, and the learned low dimensional representation of the object is called its embedding. Note that the learned embeddings can effectively capture items’ important relationships and features in training dataset. Recently, the embedding has been expanded to many sequential tasks, including trajectory data mining [3, 13], sequential recommender systems[35, 43], question answering [19], graph representation [14] and so on. As for POI recommendation, a check-in at a target POI would be influenced by users’ previously visited POIs, and influence the POIs visited by user after the target POI. To capture this contextual influence of POIs, we design a Skip-Gram based POI Embedding Model (SG-PEM). Before introducing the model, we define some basic concepts as follows.

Definition 1 (Check-in record and dataset)

Check-in record c = (u,p,t) consists of POI p ∈ P, user u ∈ U and check-in time t, and the whole dataset is defined as $C=(c_{1}, c_{2}, c_{3},{\dots }, c_{\left |{C}\right |})$.

Definition 2 (Check-in sequence)

Check-in sequence is a list of POIs from a given user sorted by check-in time. The check-in sequence of the user u is defined as $S_{u}=(p_{1}, p_{2}, p_{3}, {\dots }, p_{\left |{S_{u}}\right |})$, and all users’ check-in sequences is defined as $S=(S_{u_{1}}, S_{u_{2}}, {\dots }, S_{u_{\left |{U}\right |}})$.

Definition 3 (Target and context POI)

As shown in Fig. 3, for a target poi p_i in a user sequence S_u, its context POIs {p_i−w : p_i+w}∖p_i are the POIs visited before and after p_i, where w is the context window size. Especially, each POI is firstly represented with a one-hot vector $p_{i} \in \mathcal {R}^{\left |P\right |}$, where P is the POIs set.

The SG-PEM model is described in Fig. 3. We consider a POI sequence as a “sentence”, and each POI in the sequence as a “word”. Skip-Gram model [29] can effectively capture the correlations between consecutive POIs and the contextual information before and after the target POI, and embed the one-hot vector of POI into low dimensional vector representation (embedding) by maximizing the objective function O, which is formulated as:

$$ O = \underset{S_{u}\in S}{\sum}\frac{1}{\left| S_{u} \right|} \underset{p_{i}\in S_{u}}{\sum}\underset{-w \leq k \leq w, k \not= 0}{\sum} \log (Pr(p_{i+k}{\mid}p_{i})). $$

(1)

Given the target POI p_i, its context POIs are in a sliding window from p_i−w to p_i+w. Then we can predict its context by the embedding of POI p_i according to probability Pr(p_i+k∣p_i), which is computed by using the soft-max function as follows,

$$ Pr(p_{i+k}{\mid}p_{i})= \frac{\exp{({\boldsymbol{p}^{\prime}}_{i+k} \cdot \boldsymbol{p}_{i})}}{\sum\nolimits_{l=1}^{{\mid}P{\mid}} \exp{(\boldsymbol{p}^{\prime}_{l}\cdot \boldsymbol{p}_{i})}}, $$

(2)

where $\boldsymbol {p} \in \mathcal {R}^{d}$ and $\boldsymbol {p}^{\prime } \in \mathcal {R}^{d}$ are input and output embedding of POI respectively, and ∣P∣ is the number of POIs. However, the computation complexity of the full soft-max function defined in (2) is proportion to the size of item set, which may reach millions in practice. Here, we apply the negative sampling technique [29] to calculate (2) over several negative samples instead of the whole dataset approximately and efficiently. Therefore, the training time yields linear scale to the number of negative samples and becomes independent of the item set size. Then the objective function O can be defined as:

$$ O = \underset{S_{u}\in S}{\sum} \frac{1}{\left| S_{u} \right|} \underset{p_{i}\in S_{u}}{\sum} \underset{-w \leq k \leq w, k \not= 0}{\sum} (\log \sigma (\boldsymbol{p}_{i+k}^{\prime} \cdot \boldsymbol{p}_{i}) + \frac{1}{n} \underset{\overline{p} \in P_{neg}}{\sum} \log \sigma (-\boldsymbol{\overline{p}}^{\prime} \cdot \boldsymbol{p}_{i})), $$

(3)

where P_neg is negative sample set with n negative samples, σ(⋅) is the sigmoid function, and $\boldsymbol {\overline {p}}^{\prime } \in \mathcal {R}^{d}$ means the output embedding of sample $\overline {p}$.

Then we define the preference of user u_i for target POI p_j with cosine similarity as follows:

$$ Preference\_pem_{u_{i},p_{j}}=cosine\_similarity(\boldsymbol{u}_{i}, {\boldsymbol{p}_{j}}), $$

(4)

where u_i is user u_i’s preference vector, and p_j is the embedding of target POI p_j, respectively. Since users’ preferences are reflected in their POI visiting sequences, we can infer and model the users’ preferences by aggregating the embeddings of POIs in their check-in sequences. Especially, u_i is calculated with averaging aggregation, which keeps the wholeness and smoothness of the input embeddings with linear transformation. Formally, given the check-in sequence of the user u, u_i is defined as

$$ {\boldsymbol{u}_{i}} = \frac{{\underset{{p_{k}} \in {S_{u}}}{\sum} {{\boldsymbol{p}_{k}}} }}{{\left| {{S_{u}}} \right|}}. $$

(5)

Besides, the definition of cosine similarity between two vectors v and $\boldsymbol {v}^{\prime }$ is:

$$ cosine\_similarity\left( {\boldsymbol{v},\boldsymbol{v}^{\prime}} \right) = \frac{{\boldsymbol{v} \cdot \boldsymbol{v}^{\prime}}}{{\left\| \boldsymbol{v} \right\|\left\| \boldsymbol{v}^{\prime} \right\|}} = \frac{{\sum\limits_{i = 1}^{d} {{v_{i}}{v^{\prime}_{i}}} }}{{\sqrt {\sum\limits_{i = 1}^{d} {{v_{i}}^{2}} } \sqrt {\sum\limits_{i = 1}^{d} {{v^{\prime}_{i}}^{2}} } }} $$

(6)

3.2 Unifying the logistic matrix factorization model

There are two types of data used in traditional collaborative filtering recommendation models. One is explicit feedback data, which is usually the users’ ratings of items such as Netflix movie ratings. The other is implicit feedback data such as clicks, page views, or media streaming counts. In POI recommendation, check-in data on location-based social media can also be considered as implicit feedback data. In this section, we use the Logistic Matrix Factorization (LMF) model [20] to capture users’ preferences for POIs from implicit feedbacks. Since the LMF model cannot capture the contextual influence of POIs, we unify the SG-PEM model and LMF model at the end of this section.

Generally, matrix factorization (MF) based model learns user matrix U_n×h and item matrix P_m×h by factorizing the observation matrix R, where h is the number of latent factors. The rows of U are h dimensional latent vectors that represent users’ preferences, and the rows of P are h dimensional latent vectors that represent an item’s characteristics. In this paper, we define f_i,j as the number of check-ins that user i has visited poi j, U_n×h(u_i ∈U) represents users’ latent factors, and P_m×h(p_j ∈P) represent POI’s latent factors. Given the observation matrix R, we can define our “confidence” in the entries of R as r_i,j = αf_i,j, where α is a tuning parameter. Formally, u_i’s preference for POI p_j is defined as,

$$ Pr(f_{i,j}\mid \boldsymbol{u}_{i},\boldsymbol{p}_{j},\boldsymbol{\beta}_{i},\boldsymbol{\beta}_{j} )= \frac{\exp{(\boldsymbol{u}_{i}\boldsymbol{p}_{j}^{\mathrm{T}} + \boldsymbol{\beta}_{i} + \boldsymbol{\beta}_{j})}}{1 + \exp{(\boldsymbol{u}_{i}\boldsymbol{p}_{j}^{\mathrm{T}} + \boldsymbol{\beta}_{i} + \boldsymbol{\beta}_{j})}} $$

(7)

where the β_i and β_j terms are user bias and POI bias. Then we learn U, P and β by maxing the objective function as follows:

$$ \arg \max \boldsymbol{U},\boldsymbol{P},\boldsymbol{\beta} \log Pr(\boldsymbol{U},\boldsymbol{P},\boldsymbol{\beta} \mid \boldsymbol{R}) $$

(8)

where $\log (Pr(\boldsymbol {U},\boldsymbol {P},\boldsymbol {\beta } \mid \boldsymbol {R})$ is formulated as:

$$ \underset{i,j}{\sum} r_{i,j} (\boldsymbol{u}_{i}\boldsymbol{p}_{j}^{\mathrm{T}} + \boldsymbol{\beta}_{i} + \boldsymbol{\beta}_{j})-(1+r_{i,j}) \log (1+\exp{(\boldsymbol{u}_{i}\boldsymbol{p}_{j}^{\mathrm{T}} + \boldsymbol{\beta_{i}} + \boldsymbol{\beta}_{j})}) - \frac{\lambda}{2} \left\|\boldsymbol{u}_{i}\right\|^{2} - \frac{\lambda}{2} \left\|\boldsymbol{p}_{j}\right\|^{2} $$

(9)

Finally, we construct a novel POI recommendation model CPAM, which unifies SG-PEM and LMF with weighted sum strategy to predict users’ preferences for POIs and perform recommendation. Specifically, weighted sum is a concise and effective strategy of linearly combining multiple components/objectives and it is widely used in or machine learning tasks [33, 34], including recommendation and prediction . Formally, the final preference probability function of user u_i at POI p_j is represented as follows:

$$ Preference_{u_{i},p_{j}}=Preference\_pem_{u_{i},p_{j}}+\gamma Pr(f_{i,j}\mid \boldsymbol{u}_{i},\boldsymbol{p}_{j},\boldsymbol{\beta}_{i},\boldsymbol{\beta}_{j} ) $$

(10)

where $Preference\_pem_{u_{i},p_{j}}$ is calculated by (4), and γ is the weight of LMF.

4 Experiments

We compare the proposed POI recommendation model CPAM with state-of-the-art baselines on two public real-world check-in datasets.

4.1 Datasets

We choose two typical LBSN (Foursquare and Gowalla) real-world check-in datasets^{Footnote 1} [25]. The time span of Foursquare dataset is from April 2012 to September 2013, and we remove the users who have less than 10 check-in POIs and the POIs with less than 10 checked-ins. The time span of Gowalla dataset is from February 2009 to October 2010, and remove the users who have less than 10 check-in POIs and the POIs with less than 10 checked-ins. The statistics of the two datasets are listed in Table 1. Sparsity means how sparse the user-POI interaction data is. Specifically, if there exists k interaction records between m users and n POIs, then the corresponding data sparsity is $1-\frac {k}{m \times n}$. For example, the sparsity of Foursquare dataset is $1-\frac {512,523}{7,642 \times 28,483} = 99.76\%$. Besides, we choose the first 70% of each users’ check-ins as training data, the last 20% as test data, and the remaining 10% as validation data.

Table 1 Statistics of datasets

Full size table

4.2 Evaluation metrics

In this work, we adopt three widely used metrics to compare evaluate the performance, which are Precision@N, Recall@N and F1@N score, where N is the length of recommendation list. Formally, Precision@N, Recall@N and F1@N are defined as follows:

$$ Precision@N=\frac{\left| I^{rec} \bigcap I^{test} \right|}{m} $$

(11)

$$ Recall@N=\frac{\left| I^{rec} \bigcap I^{test} \right|}{\left| I^{test}\right|} $$

(12)

$$ F1@N=\frac{2*Precision@N*Recall@N}{Precision@N+Recall@N} $$

(13)

where I^rec is the top-N recommended POI list of target users, and I^test is the visited POI list of target users in test set. For these three indicators, we set N = 10,20 to evaluate the performance of the proposed approach and baselines.

4.3 Comparison methods

We compared the proposed model CPAM with the following POI recommendation methods:

LRT [12]: A POI recommendation model with temporal influence based on observed temporal properties.
LMF [20]: A probabilistic model for matrix factorization with implicit feedback. It can model the probability that a user will prefer a specific POI.
L-WMF [15]: A location-based matrix factorization approach for POI recommendation, which captures the geographical influence from a location perspective.
LMFT [32]: A novel method to incorporate spatial, temporal, and social influence into a collaborative filtering algorithm.
LGLMF [30]: An approach is proposed by fusing the local geographical model into the logistic matrix factorization algorithm.
iGLSR^{Footnote 2} [41]: A POI recommendation method fusing user preferences, social influence, the geographical influence of users, and the personalized geographical influence of locations.
PFMMGM [7]: A geo-social recommendation method which fuses matrix factorization with social and geographical influence, it uses the Multi-center Gaussian Model (MGM) model users’ preferences for a POI to capture the geographical influence.
SG-PEM: Our designed Skip-Gram based model for POI embedding and recommendation.
CPAM: Our proposed model for POI recommendation by leveraging contextual influence and user preferences.

4.4 Parameter settings

For the comparison models, the parameters are initialized according to the settings in the corresponding paper. For LMF, PFMMGM and L-WMF, the latent factors parameter k are set to 30. We set the distance threshold to 15 and the frequency control parameter α to 0.2 for PFGMGM. For LRT, temporal state T is set to 24, and the regularization parameters α and β are set to 2.0. The parameters of the proposed model CPAM are adjusted according to the validation dataset. After optimizing the parameters, we set latent factors parameter k = 10, embedding dimension m = 150, context window size w = 3, and weight parameter γ = 0.1.

4.5 Experiment results

In this subsection, we firstly demonstrate and analyze the results of the proposed model CPAM compared with other baselines. Then we explore the effect of data sparsity, effect of embedding dimension and effect of γ on recommendation performance, respectively. The source code and results of experiments can be downloaded from http://dbsi.hdu.edu.cn/CPAM/ for reference.

4.5.1 Performance comparison

We firstly compare the proposed model CPAM with the baseline models on both Foursquare and Gowalla datasets. As shown in Figs. 4, 5 and 6, LRT gets much lower performance than other baselines for all metrics, indicating that this model is insufficient for POI recommendation. Compared with the PFMMGM and LRT, iGLSR models geographical influence based on users’ behavior, and thus obtains a better performance. Among these baselines, LMF and LGLMF perform better than PFMMGM and iGLSR. Specifically, Fig. 4a and b show that LMF outperforms PFMMGM by 26.00% and 31.40% with Precision@20 on Foursquare and Gowalla, respectively. One reason is that the LMF model makes better use of implicit feedback of check-in data and captures users’ preferences more effectively. Besides, LGLMF model is better than LMF, L-WMF and LMFT. For example, as shown in Fig. 5a and b, LGLMF outperforms LMF by 20.00% and 17.79% with Recall@20 on Foursquare and Gowalla, respectively. This is because the LMF only considers the users’ preferences for POIs, while LGLMF considers both the users’ and the locations’ points of view in modeling the geographical influence.

The proposed model SG-PEM performs much better than LGLMF model. Specifically, Fig. 5a and b show that SG-PEM outperforms LGLMF by 27.78% and 24.92% with Recall@10 on Foursquare and Gowalla, respectively. The results demonstrate that the effectiveness of our Skip-Gram based POI Embedding Model SG-PEM and indicates that contextual influence of POIs should be taken into consideration to improve performance during POI embedding learning. As shown in Fig. 6a and b, our proposed model CPAM outperforms SG-PEM (LGLMF) by 26.27% (37.96%) on Foursquare and by 20.52% (29.38%) on Gowalla with F1@20. Moreover, CPAM achieves the best performance on both datasets for all metrics, which shows that the effectiveness of our proposed CPAM model for leveraging contextual influence and users’ preferences for POI Recommendation.

4.5.2 Effect of data sparsity

In this work, the data sparsity means how sparse the user-POI interaction data is. Specifically, the sparsity of Foursquare and Gowalla datasets are 99.76% and 99.65% respectively, which influence the performance of the proposed model and baselines. Therefore, we perform experiments on datasets with different percentages of POIs that each user has visited in the training data to further evaluate the performance of the proposed model and baselines. The percentages are set to (40%, 60%, 80%, 100%) respectively, and their corresponding sparsities are (99.89%, 99.86%, 99.82%, 99.76%) for Foursquare and (99.84%, 99.78%, 99.73%, 99.65%) for Gowalla. Figure 7 demonstrates the comparison of Recall@20 with different sparsities of Foursquare and Gowalla datasets. The LGLMF model has similar performance with CAPM when sparsity is 99.89% on Foursquare and 99.84% on Gowalla respectively, even LGLMF is slightly better than CPAM on Foursquare dataset when sparsity is 99.89%. This is because the higher sparsity of dataset leads to shorter average length of users’ POI sequences, and CPAM cannot capture sufficient contextual information. Moreover, the proposed approach CPAM achieves better performance than LGLMF in most cases, and CPAM outperforms other baselines on Foursquare and Gowalla, which shows the effectiveness of CPAM in POI recommendation, especially on sparse datasets.

4.5.3 Effect of embedding dimension d and context window size w

Figure 8a shows the comparison of Recall@20 with different embedding dimension d from 10 to 210. We can see that for both datasets, the Recall@20 increases with the increasing of the dimension of embedding from 10 to 170. This is because the larger embedding dimension could capture more comprehensive contextual information. When the embedding dimension reaches about 170 for both datasets, CPAM model achieves the best performance. Then we can see that the performance of CPAM becomes stable when dimension is greater than 170, which shows that dimensions from 170 to 210 are sufficient for capturing contextual influence of POIs. Figure 8b shows the variations in performance when we change the context window size w from 1 to 5. Specifically, we can see that for both datasets, the Recall@20 increases with the increasing of w from 1 to 3. CPAM obtains the best performance when w is 3, and shows a steady performance when w is greater than 3, which demonstrates that the model can effectively capture contextual influence between POIs when w is greater than 3.

4.5.4 Effect of hyper-parameter γ

CPAM model optimizes the linearly combination of Skip-Gram based POI Embedding Model (SG-PEM, (4)) and Logistic Matrix Factorization (LMF, (10)) using a hyper-parameter γ. Specifically, γ is the weight of Logistic Matrix Factorization (LMF), and small γ means weakening LMF and enhancing the impact of the influence of SG-PEM, and vice versa. Figure 9 demonstrates the comparison results in terms of Precision and Recall with different γ from 0 to 10.0. Firstly, CPAM model obtains better performance when γ > 0 than γ = 0, which shows that both SG-PEM and LMF in CPAM are necessary for accurate POI recommendation. Secondly, CPAM model obtain the best overall performance of Precision and Recall on both Foursquare and Gowalla datasets when γ = 0.1. Moreover, with γ increasing, especially when γ > 1.0, the performance of CPAM shows a downward trend on all metrics, which demonstrates that the sequential check-in patterns and contexts captured by SG-PEM are more important in improving POI recommendation.

5 Conclusion

In this paper, we propose a CPAM model for POI recommendation. We first design a Skip-Gram based POI Embedding Model (SG-PEM) to capture the contextual influence of POIs and learn the embedding of POIs from visiting POI sequences. Then we calculate the users’ preferences for the target POI based on the learned POI embeddings and similarity metric. For the implicit feedback information contained in the check-in data, we adopt the Logistic Matrix Factorization (LMF) algorithm to model the users’ preferences for POIs from implicit feedbacks. Finally, we unify these two models as the CPAM model to perform personalized POI recommendation by leveraging contextual influence and user preferences. Experimental results on two real-world datasets, i.e., Foursquare and Gowalla, demonstrate that the effectiveness of our proposed model CPAM compared with the state-of-the-art baselines. In the future, we plan to exploit category information, social information, temporal information, metadata [36] with advanced integrating strategies, such as attention mechanism [37] and ensemble learning [10], to enhance POI recommendation model. Besides, we will also try to model users’ preferences with powerful methods, such as recurrent neural network [9], to further improve the performance.

Notes

http://spatialkeyword.sce.ntu.edu.sg/eval-vldb17/
iGLSR is evaluated only on Gowalla as we do not have access to the social data of the Foursquare.

References

Bengio Y, Courville A, Vincent P (2013) Representation learning: a review and new perspectives. IEEE Trans Pattern Anal Mach Intell 35(8):1798–1828
Article Google Scholar
Bin C, Gu T, Sun Y, Chang L (2019) A personalized poi route recommendation system based on heterogeneous tourism data and sequential pattern mining. Multimed Tools Appl 78(24):35,135–35,156
Article Google Scholar
Cao H, Xu F, Sankaranarayanan J, Li Y, Samet H (2019) Habit2vec: Trajectory semantic embedding for living pattern recognition in population. IEEE Trans Mob Comput 19(5):1096–1108
Article Google Scholar
Chang B, Park Y, Park D, Kim S, Kang J (2018) Content-aware hierarchical point-of-interest embedding model for successive poi recommendation. In: Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI), pp 3301–3307
Chen J, Ying P, Zou M (2019) Improving music recommendation by incorporating social influence. Multimed Tools Appl 78(3):2667–2687
Article Google Scholar
Chen R, Chang YS, Hua Q, Gao Q, Ji X, Wang B (2020) An enhanced social matrix factorization model for recommendation based on social networks using social interaction factors. Multimed Tools Appli 1–31
Cheng C, Yang H, King I, Lyu MR (2012) Fused matrix factorization with geographical and social influence in location-based social networks. In: Proceedings of the Twenty-Sixth AAAI conference on artificial intelligence, pp 17–23
Cheng C, Yang H, Lyu MR, King I (2013) Where you like to go next: Successive point-of-interest recommendation. In: Proceedings of the Twenty-Third international joint conference on artificial intelligence, pp 2605–2611
Cui Q, Wu S, Liu Q, Zhong W, Wang L (2020) Mv-rnn: a multi-view recurrent neural network for sequential recommendation. IEEE Trans Knowl Data Eng 32(2):317–331
Article Google Scholar
Da Costa AF, Manzato MG, Campello RJ (2019) Boosting collaborative filtering with an ensemble of co-trained recommenders. Expert Syst Appl 115:427–441
Article Google Scholar
Deng S, Wang D, Li Y, Cao B, Yin J, Wu Z, Zhou M (2016) A recommendation system to facilitate business process modeling. IEEE Trans Cybern 47(6):1380–1394
Article Google Scholar
Gao H, Tang J, Hu X, Liu H (2013) Exploring temporal effects for location recommendation on location-based social networks. In: Proceedings of the 7th ACM conference on Recommender systems, pp 93–100
Gao Q, Zhou F, Zhang K, Trajcevski G, Luo X, Zhang F (2017) Identifying human mobility via trajectory embeddings. In: Proceedings of the 26th international joint conference on artificial intelligence, pp 1689–1695
Goyal P, Ferrara E (2018) Graph embedding techniques, applications, and performance: a survey. Knowl-Based Syst 151:78–94
Article Google Scholar
Guo L, Wen Y, Liu F (2019) Location perspective-based neighborhood-aware poi recommendation in location-based social networks. Soft Comput 23 (22):11,935–11,945
Article Google Scholar
He J, Li X, Liao L, Song D, Cheung WK (2016) Inferring a personalized next point-of-interest recommendation model with latent behavior patterns. In: Proceedings of the Thirtieth AAAI conference on artificial intelligence, pp 137–143
He J, Li X, Liao L (2017) Category-aware next point-of-interest recommendation via listwise bayesian personalized ranking. In: Proceedings of the 26th international joint conference on artificial intelligence, pp 1837–1843
Hu B, Ester M (2014) Social topic modeling for point-of-interest recommendation in location-based social networks. In: 2014 IEEE International conference on data mining. IEEE, pp 845–850
Huang X, Zhang J, Li D, Li P (2019) Knowledge graph embedding based question answering. In: Proceedings of the Twelfth ACM international conference on web search and data mining, pp 105–113
Johnson CC (2014) Logistic matrix factorization for implicit feedback data. In: Advances in neural information processing systems workshop on distributed machine learning and matrix computations, p 27
Kant S, Mahara T, Jain VK, Jain DK (2019) Fuzzy logic based similarity measure for multimedia contents recommendation. Multimed Tools Appl 78(4):4107–4130
Article Google Scholar
Kim Y, Jung S, Ji S, Hwang E, Rho S (2019) Iot-based personalized nie content recommendation system. Multimed Tools Appl 78(3):3009–3043
Article Google Scholar
Li X, Jiang M, Hong H, Liao L (2017) A time-aware personalized point-of-interest recommendation via high-order tensor factorization. ACM Trans Inform Syst (TOIS) 35(4):1–23
Article Google Scholar
Liu Q, Wu S, Wang L, Tan T (2016) Predicting the next location: a recurrent model with spatial and temporal contexts. In: Proceedings of the Thirtieth AAAI conference on artificial intelligence, pp 194–200
Liu Y, Pham TAN, Cong G, Yuan Q (2017) An experimental evaluation of point-of-interest recommendation in location-based social networks. Proc VLDB Endow 10(10):1010–1021
Article Google Scholar
Liu W, Wang ZJ, Yao B, Yin J (2019) Geo-alm: poi recommendation by fusing geographical information and adversarial learning mechanism. In: Proceedings of the 28th international joint conference on artificial intelligence. AAAI Press, pp 1807–1813
Liu W, Lai H, Wang J, Ke G, Yang W, Yin J (2020) Mix geographical information into local collaborative ranking for poi recommendation. World Wide Web 23(1):131–152
Article Google Scholar
Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. arXiv:1301.3781
Mikolov T, Sutskever I, Chen K, Corrado G, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Proceedings of the 26th international conference on neural information processing systems - volume 2. Curran Associates Inc, pp 3111–3119
Rahmani HA, Aliannejadi M, Ahmadian S, Baratchi M, Afsharchi M, Crestani F (2019) Lglmf: local geographical based logistic matrix factorization model for poi recommendation. In: Asia information retrieval symposium. Springer, pp 66–78
Rendle S, Freudenthaler C, Schmidt-Thieme L (2010) Factorizing personalized markov chains for next-basket recommendation. In: Proceedings of the 19th international conference on World Wide Web. Association for computing machinery, pp 811–820
Stepan T, Morawski JM, Dick S, Miller J (2016) Incorporating spatial, temporal, and social context in recommendations for location-based social networks. IEEE Trans Comput Soc Syst 3(4):164–175
Article Google Scholar
Tu C, Liu H, Liu Z, Sun M (2017) Cane: context-aware network embedding for relation modeling. In: Proceedings of the 55th annual meeting of the association for computational linguistics, pp 1722–1731
Wang Z, Zhang Y, Li Y, Wang Q, Xia F (2017) Exploiting social influence for context-aware event recommendation in event-based social networks. In: IEEE INFOCOM 2017-IEEE conference on computer communications. IEEE, pp 1–9
Wang D, Deng S, Xu G (2018) Sequence-based context-aware music recommendation. Inform Retrieval J 21(2-3):230–252
Article Google Scholar
Wang D, Deng S, Zhang X, Xu G (2018) Learning to embed music and metadata for context-aware music recommendation. World Wide Web 21 (5):1399–1423
Article Google Scholar
Wang D, Zhang X, Yu D, Xu G, Deng S (2020) Came: content-and context-aware music embedding for recommendation. IEEE Trans Neural Netw Learn Syst 1–14
Yin H, Zhou X, Cui B, Wang H, Zheng K, Nguyen QVH (2016) Adapting to user interest drift for poi recommendation. IEEE Trans Knowl Data Eng 28(10):2566–2581
Article Google Scholar
Yu D, Xu K, Wang D, Yu T, Li W (2019) Point-of-interest recommendation based on user contextual behavior semantics. Int J Softw Eng Knowl Eng 29(11n12):1781–1799
Article Google Scholar
Yuan T, Cheng J, Zhang X, Qiu S, Lu H (2014) Recommendation by mining multiple user behaviors with group sparsity. In: Proceedings of the Twenty-Eighth AAAI conference on artificial intelligence, pp 222–228
Zhang JD, Chow CY (2013) Igslr: personalized geo-social location recommendation: a kernel density estimation approach. In: Proceedings of the 21st ACM SIGSPATIAL international conference on advances in geographic information systems, pp 334–343
Zhang JD, Chow CY (2016) Point-of-interest recommendations in location-based social networks. Sigspatial Special 7(3):26–33
Article Google Scholar
Zhao S, Zhao T, King I, Lyu MR (2017) Geo-teaser: geo-temporal sequential embedding rank for point-of-interest recommendation. In: Proceedings of the 26th international conference on world wide web companion, pp 153–162
Zhao K, Zhang Y, Yin H, Wang J, Zheng K, Zhou X, Xing C (2020) Discovering subsequence patterns for next poi recommendation. In: Proceedings of the Twenty-Ninth international joint conference on artificial intelligence, pp 3216–3222

Download references

Author information

Authors and Affiliations

School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, 310018, China
Dongjin Yu, Wenbo Wanyan & Dongjing Wang

Authors

Dongjin Yu
View author publications
You can also search for this author in PubMed Google Scholar
Wenbo Wanyan
View author publications
You can also search for this author in PubMed Google Scholar
Dongjing Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dongjing Wang.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This research is supported by Zhejiang Provincial Natural Science Foundation of China under No. LQ20F020015, and the Fundamental Research Funds for the Provincial University of Zhejiang under No. GK199900299012-017.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yu, D., Wanyan, W. & Wang, D. Leveraging contextual influence and user preferences for point-of-interest recommendation. Multimed Tools Appl 80, 1487–1501 (2021). https://doi.org/10.1007/s11042-020-09746-0

Download citation

Received: 03 June 2020
Revised: 20 August 2020
Accepted: 26 August 2020
Published: 08 September 2020
Issue Date: January 2021
DOI: https://doi.org/10.1007/s11042-020-09746-0

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Leveraging contextual influence and user preferences for point-of-interest recommendation

Abstract

Similar content being viewed by others

Graph-Based Metric Embedding for Next POI Recommendation

On successive point-of-interest recommendation

HRec: Heterogeneous Graph Embedding-Based Personalized Point-of-Interest Recommendation

Explore related subjects

1 Introduction

2 Related works

POI recommendation incorporating temporal information

POI recommendation incorporating geographical information

POI recommendation incorporating category information

POI recommendation incorporating sequential information

3 Proposed model

3.1 POI embedding model

Definition 1 (Check-in record and dataset)

Definition 2 (Check-in sequence)

Definition 3 (Target and context POI)

3.2 Unifying the logistic matrix factorization model

4 Experiments

4.1 Datasets

4.2 Evaluation metrics

4.3 Comparison methods

4.4 Parameter settings

4.5 Experiment results

4.5.1 Performance comparison

4.5.2 Effect of data sparsity

4.5.3 Effect of embedding dimension d and context window size w

4.5.4 Effect of hyper-parameter γ

5 Conclusion

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation