Points-of-interest recommendation based on convolution matrix factorization

Xing, Shuning; Liu, Fangai; Zhao, Xiaohui; Li, Tianlai

doi:10.1007/s10489-017-1103-0

Points-of-interest recommendation based on convolution matrix factorization

Published: 05 December 2017

Volume 48, pages 2458–2469, (2018)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Applied Intelligence Aims and scope Submit manuscript

Points-of-interest recommendation based on convolution matrix factorization

Download PDF

Shuning Xing¹,
Fangai Liu ORCID: orcid.org/0000-0003-4023-3979¹,
Xiaohui Zhao² &
…
Tianlai Li¹

1301 Accesses
37 Citations
Explore all metrics

Abstract

A point-of-interest(POI) recommendation aims to mine a user’s visiting history and find her/his potentially preferred places. The decision process when choosing a POI is complex and can be influenced by numerous factors, including personal preferences, geographical considerations, and user social relations. While latent factor models have been proven effective and are widely used for recommendations, adopting them to POI recommendations requires delicate consideration of the unique characteristics of location-based social networks (LBSNs). To this end, in this paper, we propose a joint convolution matrix factorization model, named the Review Geographical Social (ReGS) which strategically takes various factors into consideration. Specifically, this model captures geographical influences from a user’s check-in behaviour, and user social relations can be effectively leveraged in the recommendation model. The reviews information available on LBSNs could be related to a user’s check-in action, providing a unique opportunity for a POI recommendation. We model above three types of information under a unified POI recommendation framework based on convolution matrix factorization which integrates a convolutional neural network into a probability matrix factorization. Finally, we conduct a comprehensive performance evaluation for the ReGS using two real-world datasets collected from Foursquare. Experimental results show that the ReGS achieves significantly superior precision and recall rates to other state-of-the-art recommendation models.

Hybrid graph convolutional networks with multi-head attention for location recommendation

Article 23 June 2020

Content-aware point-of-interest recommendation based on convolutional neural network

Article 02 October 2018

GN-GCN: Combining Geographical Neighbor Concept with Graph Convolution Network for POI Recommendation

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

With the exponential growth of network information, improving the efficiency of information utilization and alleviating the problem of information overload have become popular research areas. A recommender system is an important way to solve above problems, and plays an important role in e-commerce, information retrieval, e-tourism, online advertising, and mobile applications, among others [1]. Recommender systems are defined as programmes that attempt to recommend the most suitable items to particular users by predicting a user’s interest in an item based on related information about items [2]. Recently, Yera et al. [3] analysed more than a hundred papers to improve the performance of recommender systems using fuzzy techniques.

With the rapid development of mobile devices and global position system (GPS), location-based social networks (LBSNs) have become very popular and attracted lots of attention from industry and academia. Typical location-based social networks include Foursquare, Gowalla, Facebook Place, and GeoLife, among others. Until June 2016, Foursquare has collected more than 8 billion check-ins and more than 65 million place shapes mapping businesses around the world. Over 55 million people in the world use Foursquare each month [4]. As shown in Fig. 1, users can build connections with their friends, upload photos, share their locations via check-in for points of interest (e.g., restaurants, tourist spots, and stores, etc), and publish relevant reviews. The task of recommending new, interesting places is referred to as a point-of-interest (POI) recommendation. POI recommender systems play an important role in LBSNs since they can both meet users’ personalized preferences for visiting new places and help to increase revenues by providing users with intelligent location services, such as location-aware advertisements.

At present, most POI recommendation algorithms [5,6,7,8,9,10]are based on historical check-in behaviours and contextual information (e.g., ratings, temporal, geographical, social relations, labels, and categories) mined from users’ preferences of POIs. The premise of a points-of-interest recommendation based on convolution matrix factorization methods is that the frequency of users’ check-in reflects the users’ POI preferences. Above recommended methods is that the frequency of users’ check-in reflects the users’ POI preferences. From the distribution of the number of users’ checked-in to Foursquare [11], we can know that users’ attitude to check-in POIs is not very positive, as more than 50 % of POIs were checked by the same user only once. Therefore, only using sparse users’ check-in data and contextual information as the basis for the recommended model will have a big deviation from the final results.

In reality, in addition to the analysis of users’ historical check-in data and contextual information, users’ preferences can also be analysed by users’ POIs reviews text. For example, one user evaluated one restaurant “The taste of this restaurant is good, dishes taste partial spicy!” From the above review, we know that this user’s emotional tendency is positive, while the degree of interest to this restaurant is relatively high. Therefore, in view of the sparseness of historical check-in data, this paper combines POIs review information and contextual information to improve the quality of POI recommendations. However, there is a small challenge for existing recommendation models, which is that it is difficult to find an effective way to fuse data from multiple heterogeneous data sources. The main contributions of this paper are summarized as follows:

Based on the analysis of large-scale data, POI recommendations contain a variety of data from different areas and structures. Therefore, this paper proposes using a convolution matrix factorization method to integrate heterogeneous data, that integrates heterogeneous data based on large-scale data.

We propose a novel POI recommendation model called the Review Geographical Social (ReGS). This model uses a convolution neural network to build a POI review model based on review text and puts two factors of user social relations and geographical influence into the convolution matrix factorization model to achieve improved predictions of users’ preferences.

Extensive experiments on two real-world datasets collected from Foursquare showed that the ReGS is superior to other state-of-the-art POIs recommendation models for precision and recall rates.

2 Related work

2.1 Recommendation based on contextual information

A context-aware recommendation system (CARS) [12] is both a recommendation system and a context-aware application system, which was proposed by Aavavius and Tuzhilin et al. They noted that the incorporation of contextual information into the recommendation system will improve recommendation accuracy. So, at present, most POI recommendations focus on how to use a variety of contextual information (e.g., geographical factors and social relations) to recommend POIs. Noguera et al. [13] proposed a mobile CARS that took advantage of the additional information that mobile devices provide. This system allows tourists to benefit from innovative features such as a 3D map-based interface and real-time location-sensitive recommendations. Ye et al. [14] was inspired by the view that friends could share more common interest places. By analysing datasets from Foursquare, they find a strong correlation between social relations and geographical location. Then, they proposed a recommendation model based on a naive Bayesian algorithm to integrate users’ preferences, geographical location and users’ social relations. Cheng et al. [15] integrated users’ social relations and geographical locations into a probability matrix factorization model. They built the users’ check-in probability model as the multi-centre Gaussian model to capture geographical influence. Then, they put geographical influence, social information and geographical information into a generalized matrix factorization model. Lian et al. [16] proposed using a weighted matrix factorization model for POIs recommendation. Due to the phenomenon of spatial aggregation of users’ check-in activity based on LBSNs, they described the spatial aggregation effect from the perspective of a two-dimensional kernel density estimation and integrated it into the matrix factorization model. They explained why the spatial aggregation effect model can address the problem of the user-POI sparse matrix.

2.2 Recommendation based on review texts

To alleviate the sparse effect of users’ historical check-in data for POI recommendations, researchers have began to explore review texts for POI recommendations. Cheng et al. [17] collected 22 million points of check-in data from 220 thousand users and evaluated user mobility patterns using quantitative analysis of relevant user information such as spatial, temporal, social, and textual information. They found that emotional-based review information related to check-in would provide a better understanding of users’ preferences and richer contextual information that could improve the performance of relevant recommendations.

Recently, recommendations based on text models such as Latent Dirichlet Allocation (LDA) and Stacked Denoising Auto-Encoder (SDAE) have been proposed to utilize item description text [18, 19]. However, existing models do not fully capture text information. For example, take the following two sentences, “I trust this man” and “I betray his trust finally”. Since the LDA and SDAE consider the text as a bag of distinguished words, they cannot distinguish each occurrence of the term“trust”. In [20], a convolutional matrix factorization is proposed by using a convolution neural network (CNN) for text recommendations. This model integrates the CNN into a probability matrix factorization (PMF). It uses the CNN to learn items’ latent factors and the PMF to get users’ latent factors. Assuming that this model satisfies the Gaussian distribution, the objective function consists of matrix factorization and the CNN loss function. However, this model is only used in the text recommendations and is not combined with other contextual information factors applied to POI recommendations.

In general, the above recommendation models have achieved good results, but they did not integrate reviews and contextual information into one model using deep neural networks. Therefore, this paper analyses the above various kinds of information and takes contextual and review information of users’ check-in behaviours into the process of inferring users’ POI preferences. Compared with existing recommendation models,the ReGS uses a convolution neural network to learn POIs’ latent vectors from review texts, which can be used to better meet users’ preferences.

3 POI recommendation based on matrix factorization

The problem of personalized POI recommendation is how to recommend POIs to a user given a user’s POI check-in records and other available side information. Let U = {u ₁, u ₂,..., u _M} be a set of LBSN users. Let V = {v ₁, v ₂,..., v _N} be a set of POIs, where each POI has a location l _i =[l o n _j, l a t _j]^T represented by longitude and latitude. $ \mathbf {R}\in \mathbb {R}^{M\times N}$ is a check-in matrix with each element r _{i
j} representing the number of observed check-in made by u _i at v _j. The users’ check-in frequency reflects the users’ POI preferences. The data of users and POIs are mapped into a potentially low-dimensional hidden space k ≪ m i n(m, n). The basic POIs recommendation model approximates u _i’s latent interests in an unvisited v _j by solving the following optimization problems:

$$ \min_{\mathbf{U},\mathbf{V}}\frac{1}{2}\left( \mathbf{I}\odot(\mathbf{R}-\mathbf{U}\mathbf{V}^{T})^{2}\right) $$

(1)

Where $ \mathbf {I}\in \mathbb {R}^{M\times N}$ is a check-in weight matrix with I = 1 indicating that u _i has checked in v _j, otherwise I = 0. The above recommendation model learns an optimal set of {U, V} whose product $\widehat {\mathbf {R}}=\mathbf {U}\mathbf {V}^{T}$ is a non-sparse matrix that approximates the original R. POI recommendations are then performed for each user based on the ranking among her unvisited POIs in $\widehat {\mathbf {R}}$. To avoid overfitting, two regularization terms on free matrix parameters U and V are added into (2). Hence, we have

$$ \min_{\mathbf{U},\mathbf{V}}\frac{1}{2}\left\| \mathbf{I}\odot\left( \mathbf{R}-\mathbf{U}\mathbf{V}^{T} \right) \right\|_{F}^{2}+\frac{\lambda_{u}}{2}\left\| \mathbf{U} \right\|_{F}^{2}+\frac{\lambda_{v}}{2}\left\| \mathbf{V} \right\|_{F}^{2} $$

(2)

where the regularization parameters λ _u, λ _v > 0, $\left \| \cdot \right \|_{F}^{2}$ denotes the Frobenius norm. The optimization problem in (2) minimizes the sum-of-squared-errors objective function with quadratic regularized terms. Gradient based approaches can be applied to find a local minimum.

4 POI recommendation framework based on convolution matrix factorization

On the basis of users’ historical check-in data, this paper proposes a POI recommendation model ReGS that considers semantics regarding user reviews and relevant POIs contextual information.

4.1 Modelling reviews information

Review text usually includes the reasons for users’ ratings, which is conducive to understanding users’ rating behaviours. Cold start problems can be effectively alleviated by deeply analysing reviews. Topic model techniques are often used to mine the“topic” that is hidden in documents. Currently, Latent Dirichlet Allocation (LDA) is the most widely used topic modelling technique. However, CNN can effectively capture local features of documents through modelling components such as local receptive fields [21], shared weights [22], and sub-sampling [23]. Thus, using CNN can provide deeper understanding of documents and generate better latent models than LDA and SDAE. This paper discovers “POIs topics” hidden in review text based on the CNN model. According to the method used in [20], first all words of one review are embedded into a matrix. One review can be represented as a two-dimensional matrix, where the rows of the matrix are the number of words in the review and the columns are the length of the embedded words vector. POIs latent vectors can be determined by convolution, pooling and mapping methods for this matrix. The POI latent vector is generated from three variables: 1) the internal weights W in CNN; 2)D _j representing the review of POI j; and 3) epsilon variables as Gaussian noise, which enables us to further optimize the POI latent model for the ratings. Thus, the final POI latent factor based on reviews information is obtained by the following equations.

$$ v_{j}=cnn(W,D_{j})+\varepsilon_{j} $$

(3)

Thus, information from reviews can be leveraged to learn the POI latent vector, as shown below:

$$ min\frac{1}{2}{\sum\limits_{j}^{m}}\left( v_{j}-cnn\left( W,D_{j} \right) \right)^{2} $$

(4)

4.2 Modelling geographical information

Users’ check-in records contain plentiful geographical information, and geographical distance plays an important role in POI recommendations. As is shown in Fig. 2, POIs checked by the same user are in a small range of geographical distances, which can be attributed to geographical influence. In reality, people usually visited one POI (such as a museum) and then travelled to nearby POIs (such as restaurants or a shop store). Adjacent POIs have a stronger geographical relevance than long-distance POIs. According to the Tobler’s first law of geography [24], one user’s propensity for one POI is inversely proportional to the distance between them. This is similar to the observation that the probability of purchasing an item is inversely proportional to its cost. Therefore, users’ check-in places often form a geographical cluster area. Thus, based on the geographical characteristics of users’ check-in data, we can effectively improve the performance of POI recommendations.

In this paper, we assumed that u _i’s preference for several neighbouring locations of v _j to represent u _i’s preference for v _j. A geographical weight strategy is used to compensate for missing geographical information in the classical matrix factorization models. According to (1,2), with POI recommendations based on geographical characteristics [25], the minimization problem can be expressed as the following equation:

$$ \min_{\mathbf{U},\mathbf{V}}\frac{1}{2}\left( \mathbf{I}\odot (\mathbf{R}-\mathbf{U}\mathbf{H}\mathbf{V}^{T})^{2}\right) $$

(5)

While, H = α U V ^T + (1 − α)S ^T, $\mathbf {S}\in \mathbb {R}^{n\times n}$, $S_{j,k}=\frac {sim\left (v_{j},v_{k} \right )}{Z\left (v_{j} \right )}$. α is a weight parameter used to control the influence of neighbouring locations. s i m(v _j, v _k) indicates the geography weight of the adjacent location v _k at the location v _j. Z(v _j) is a regularization item, defined as $Z\left (v_{j} \right )={\sum }_{v_{k}\in C\left (v_{j} \right )}sim\left (v_{j},v_{k} \right )$, while s i m(v _j, v _k) uses the Gaussian function to represent, as shown in (6):

$$ sim\left( v_{j},v_{k}\right)=e^{-\frac{\left\| l_{j}-l_{k} \right\|^{2}}{\alpha^{2}}},\forall v_{k}\in C\left( v_{j} \right) $$

(6)

Considering the range of geographical areas, users are unlikely to check in a place that is too far from the users’ geographical location. Therefore, this paper presents a geographical area distance variable F to distinguish geographical range. C(v _j) represents the adjacent locations to v _j. In the experiment, F is set to 10000 according to the empirical value. If the recommended POI is not in the user’s current position C(v _j), then this POI is not considered.

4.3 Modelling user social relations

In reality, users often go to places where their friends provide strong recommendations. However, if all trust relations are treated equally, it will be hard to find user characteristics and relations hidden behind trust relations. Zhang et al. [26] proposed a personalized recommendation algorithm integrating trust relationships and time series. Based on the above algorithm, they proposed a model that comprehensively considers factors such as direct and indirect trust among users, a mechanism for trust propagation and user similarity [27]. We use the method above papers mentioned to model user social relations. The details are shown as below. Firstly, we use recursive computing that calculates trust values by analysing the transmission and aggregation characteristics of trust as:

$$ t_{i,f}=\frac{{\sum}_{v\in V^{T}}t_{i,f}\cdot t_{v,f} }{{\sum}_{v\in V^{T}}t_{i,f}} $$

(7)

where i, f, and v are users; t _{i, f} represents the trust value between target user i and user f; v is a node on the shortest path from i to f; and V _T is a set of users. These users should be located on the shortest path from i to f and their trust values should be greater than the designated threshold value. The similarity between users i and f can be acquired by calculating the Pearson correlation coefficient:

$$ sim\left( i,f \right)=\frac{{\sum}_{m\in {I}_{i,f}}\left( R_{i,m}-\overline{R_{i}} \right)\left( R_{f,m}-\overline{R_{f}} \right)}{\sqrt{{\sum}_{m\in {I}_{i,f}}\left( R_{i,m}-\overline{R_{i}} \right)^{2}}\times \sqrt{\left( R_{f,m}-\overline{R_{f}} \right)^{2}}} $$

(8)

where sim(i,f) represents the similarity between user i and f. After acquiring the trust value and similarity between users i and f, a weighted average is used to combine the two features. Recommended neighbours with high similarity and trust values gain increased trust. The new trust value T _{i, f} between user i and user f can be acquired by calculating the following equation:

$$ T_{i,f}=\frac{2sim\left( i,f \right)\cdot t_{i,f}}{sim\left( i,f \right)+t_{i,f}} $$

(9)

The objective function of POI recommendations based on social relations is minimized as shown in (10):

$$ min\frac{1}{2}\sum\limits_{i = 1}^{m}\sum\limits_{f\in u}T_{i,f}\left( u_{i}- u_{f} \right)^{2} $$

(10)

4.4 ReGS model

The ReGS model integrates reviews information, geographical information and social relations to predict ratings. The graphical model of the ReGS model is shown in Fig. 3. The objective function is minimized as follows:

$$\begin{array}{@{}rcl@{}} \underset{\mathbf{U},\mathbf{H},\mathbf{V}\geq 0}{min}\delta \!&=&\!\frac{1}{2}\left\| \mathbf{I}\odot \left( \mathbf{R}\,-\,\mathbf{U}\mathbf{H}\mathbf{V}^{T} \right) \right\|_{F}^{2}\\ &&\!+\frac{\lambda_{1}}{2}\sum\limits_{i = 1}^{m}\sum\limits_{f\in u}T_{i,f}\left\| u_{i}\,-\,u_{f} \right\|_{F}^{2}\\ &&\!+\frac{\lambda_{2}}{2}{\sum\limits_{j}^{m}}\left\| v_{j}\,-\,cnn\left( W,D_{j} \right) \right\|_{F}^{2}\,+\,\frac{{\lambda }_{3}}{2}\sum\limits_{k}^{\left|{w}_{k} \right|}\left\| w_{k} \right\|_{2}\\ \end{array} $$

(11)

where λ ₁ is the weight parameter to control social relations. λ ₂ is the weight parameter to control reviews information.λ ₃ is the weight parameter to control internal weights W.

4.5 Parameter estimation

This paper uses the gradient descent method [28] and coordinate descent method [29] to solve the local optimal solution of the objective function.

$$\begin{array}{@{}rcl@{}} \frac{\partial \delta }{\partial \mathbf{U}}&=&-\left( \mathbf{I}\odot\mathbf{I}\odot\mathbf{R} \right)\mathbf{V}\mathbf{H}^{T}+\left( \mathbf{I}\odot\mathbf{I}\left( \mathbf{U}\mathbf{H}\mathbf{V}^{T} \right) \right)\mathbf{V}\mathbf{H}^{T}\\ &&+\lambda_{1}\sum\limits_{i = 1}^{m}\sum\limits_{f\in u}T_{i,f}\left( u_{i}-u_{f} \right)+\lambda_{1}\sum\limits_{i = 1}^{m}\sum\limits_{g\in u}T_{i,f}\\ &&\times\left( u_{i}-u_{g} \right) \end{array} $$

(12)

$$\begin{array}{@{}rcl@{}} \frac{\partial \delta }{\partial \mathbf{V}}&=&-\left( \mathbf{I}^{T}\odot\mathbf{I}^{T}\odot\mathbf{R} \right)\mathbf{U}\mathbf{H}+ \left( \mathbf{I}^{T}\odot\mathbf{I}^{T}\left( \mathbf{V}\mathbf{H}^{T}\mathbf{U}^{T} \right) \right)\mathbf{U}\mathbf{H}\\ &&+\lambda_{2}{\sum\limits_{j}^{m}}\left( v_{j}-cnn\left( W,D_{j} \right) \right) \end{array} $$

(13)

$$ \begin{array}{llllllllll} v_{j}\leftarrow \left( \mathbf{U}\mathbf{I}_{j}\mathbf{U}^{T}+\lambda_{2}\mathbf{I}_{k} \right)^{-1}\left( \mathbf{U}\mathbf{R}_{j}+\lambda_{2}cnn\left( W,D_{j} \right) \right) \end{array} $$

(14)

$$ \begin{array}{llllllllll} \frac{\partial \delta }{\partial \mathbf{H}}=-\mathbf{U}^{T}\left( \mathbf{I}\odot\mathbf{I}\odot \mathbf{R} \right)\mathbf{V}+ \mathbf{U}^{T}\left( \mathbf{I}\odot\mathbf{I}\odot\left( \mathbf{U}\mathbf{H}\mathbf{V}^{T} \right)\right)\mathbf{V} \end{array} $$

(15)

The goal of this paper is to optimize parameters U, V, H and W which are in the convolution neural network, whereby, U, V and H are optimized by the formulas (12)–(15). Given U and V, we can learn the weights w _l and biases of each layer using the back-propagation learning algorithm as in [30].

5 Experiments

5.1 Foursquare dataset

We choose Foursquare, one of the most popular location-based social networking websites, to analyse review text on LBSNs. We collected two experimental datasets from Foursquare, New York City(NYC) and Los Angeles(LA). Then, we obtained their corresponding check-in records with the same crawling strategy proposed in [31] and collected check-ins that occurred in NYC and LA. Based on the venue id extracted from check-in records, we obtained the POI categories through the “Venue API4”of Foursquare. In pre-processing, we split each dataset into the training set and testing set based on the check-in time rather than using a random partition method, because in practice we can only utilize past check-in data to predict future check-in events. Half of the check-in data with earlier timestamps are used as the training set, and the other half are used as the testing set that needs to be divided into different time slots for evaluation purposes. In the experiments,the training set is used to learn the recommendation models of the evaluated techniques described in Section 4.4 to predict the testing data.

The statistics of the two datasets are shown in Table 1. User-POI check-in matrix densities of these two datasets are 5.63 × 10^− 5and 2.04 × 10^− 5, respectively. Since the user-POI check-in matrix is sparse, the accuracy of the POI recommendation is generally not high. For example, when the user-POI check-in matrix density is 2.72 × 10^− 4, the maximum accuracy is only 0.06 [32]. Because user-POI check-in matrix density is relatively low, it is reasonable to get the lowest accuracy rate and recall rate. Meanwhile, the density of the LA dataset is slightly higher than for the NYC dataset. Therefore, the accuracy and recall rate based of the LA dataset are mostly higher than those of the NYC dataset.

Table 1 Statistic on the Datasets

Full size table

5.2 Evaluation index

5.2.1 Predicting ratings

Root Mean Square Error (RMSE) [33] is used to measure performance. The definitions are as follows:

$$ RMSE=\sqrt{\frac{1}{\left| T \right|}\sum\limits_{i,j}\left( R_{i,j}- \hat{R_{i,j}}\right)^{2}} $$

(16)

Where R _{i, j} and $\hat {R_{i,j}}$ denote the observed and predicted ratings, respectively, and T is the testing set. The smaller the values are for both metrics, the better their performance.

5.2.2 Recommending top-k POIs

We use two indicators to evaluate Top-k POI recommendation performance. One is accuracy rate Precision @k and the other is recall rate Recall @k, abbreviated as P@k and R@k, respectively. For a target user u _i, P@k for each user is defined as follows:

$$ P@K=\frac{number\; of \;locations \;the \;user\; likes \;in\; top\; K}{total \;number \;of\; locations} $$

(17)

And, R@k for each user is defined as follows:

$$ R@K=\frac{number\; of \;locations \;the \;user\; likes \;in\; top\; K}{total \;number \;of\; locations\;the \;user\; likes} $$

(18)

We select P@1, P@5, P@10, R@1,R@5 and R@10 as the evaluation indicators.

5.3 The method for comparison

We compared our proposed model with the following algorithms:

LCARS [34]

This method built the location-content recommendation system based on the topic model to infer the user’s individual interest and location preferences.

CoRe [32]

This recommendation model integrated user social relations and geographical influence based on robustness rules, in which the geographical influence factors are modeled based on kernal density estimations.

GeoMF [16]

GeoMF is a matrix factorization model that utilized the spatial clustering of user check-in behaviour.

DRW [35]

This recommendation model fused users’ social relations, category information and popularity information of POIs based on the dynamic random walk.

NCPD [36]

This recommendation model integrated geographical information and category information based on the NMF matrix factorization model. Geographical factors are modelled based on the influence of user’s geographical neighbours.

In this experiment, the k value is set to 1, 5, and 10 respectively. When the k value changed, each algorithm recalculated P@k and R@k. Considering the effectiveness of the experiment, the implicit space dimension is set to 200. When the geographical location weight α is set to 0.4 in (6), the recommended performance is best [25].

5.4 Analysis of experimental results

This section evaluates the ReGS model from three perspectives: 1) Comparing the ReGS model with five existing POI recommendation models, 2) Analysing the contribution of the three evaluation indexes, and 3) Discussing the impact of relevant parameters.

5.4.1 Comparison and analysis of recommended models

We used different training data to test our method and list the results in Table 2. The process of model training is performed three times at a time, and the average value listed in Table 2 is the final result to reduce errors. From Table 2, we see that our method has advantages. This is mainly because some methods only used ratings as the input of the model; thus, they have some difficulties when solving the problems of data sparseness and cold start. Other methods utilized ratings and other contextual information to predict ratings by using traditional regression methods, and these methods do not use deep neural networks to extract user and contextual features. Thus, they do not do well when learning user’s interest preferences, and the results have a large deviation.

Table 2 RMSE comparasion

Full size table

As shown in Tables 3 and 4, the ReGS model incorporates the influence of review factors, social relation factors and geographical factors. Compared with the other five recommendation models, the ReGS model shows better recommendation quality in terms of accuracy and recall rate. With the k value increased, the accuracy declines and the recall rate rises. That is because recommending more locations to users could be conducive to finding more POIs, and users are more willing to check-in at POIs.

Table 3 LA Dataset

Full size table

Table 4 NYC Dataset

Full size table

CoRe

This model uses a more robust rule rather than a simple linear weight to fuse users’ social relations and the geographical impact of POIs. Geometrical factors are modelled based on kernel density estimation. Thus, as shown in Tables 3 and 4, the performance of this model ranks third.

LCARS

This method uses the topic model (LDA) to infer the user’s personal interests and local area preferences. Local preferences or personalized interests are expressed as a mixture of topics with each topic being a distribution of POIs learned from POIs check-in data and classified information. However, LCARS ignores geographical and social characteristics of user check-in behaviour. Thus, as shown in Tables 3 and 4, the performance of this model ranks fifth.

GeoMF

GeoMF enhances the potential factors of users and POIs, as well as users’ active region vectors and region influencing vectors of POIs. This model captures spatial clustering from the two-dimensional kernel density assessment and integrates it into matrix factorization. Thus, as shown in Tables 3 and 4, the performance of this model ranks fourth.

DRW

Based on the dynamic random walk model, this model integrates users’ social relations, related categories information and popularity influence. However, it ignores the most important geographical factors of POIs. Therefore, the performance of this model is last.

NCPD

Geographical influence and popularity information are merged based on the NMF matrix factorization model. However, relative to the ReGS, the NCPD model lacks users’ social relations and review information, and the final recommendation accuracy has not improved. As shown in Tables 3 and 4, the performance of this model ranks second.

ReGS

Based on two datasets, the ReGS model performs best. Compared to the NCPD, the recommended performance of the ReGS has improved. The reasons are as follows: 1) Relative to CoRe, GeoMF, NCPD, DRW and LCARS, the ReGS is a comprehensive model that fused users’ reviews text based on their interests, users’ social relations, and geographical factors based on geographical neighbours. 2) Compared to the LCARS based on topic model(LDA) technology, the ReGS is a matrix factorization model based on a convolutional neural network to model review information. Modelling geographical factors based on geographical neighbours features is better rather than the GeoMF based on geographical location similarity.

5.4.2 Factor impact analysis

This section analyses the three factors of reviews information, geographical information and users’ social relations in the ReGS model. These three factors are named Rev, Geo and So, corresponding to (4), (5) and (10). Figure 4a,b shows comparison results of the three factors and two evaluation indexes of accuracy and recall rate on the LA dataset. Figure 4c,d shows comparison results of the three factors and two evaluation indexes of accuracy and recall rate on the NYC dataset. The following conclusions can be drawn from Fig. 4: 1) These three factors are important for POI recommendations; and 2) The ReGS model is significantly better due to the convergence of the three factors, which contributes to improved recommendation accuracy. The reasons for these conclusion is that users are influenced by many aspects of contextual information in real life, rather than one-sided from a certain aspect to predict users’ preferences. Therefore, POI recommendations should take full advantage of a variety of POI contextual information, which is an effective method to solve the cold start and data sparsity problems.

5.4.3 Parameter analysis

The ReGS model has three important parameters: 1) Social relations parameter λ ₁; 2) Review information parameter λ ₂; and 3) The geographical neighbour relations weight parameter α. The value of one parameter is changed while other parameters are fixed to analyse the sensitivity of each parameter and the impact on the final recommendation.

We first analysed the geographical neighbour relations weight parameter α, and set k = 5, λ ₂ = 0.04, and λ ₁ = 0.01. The effect on the objective function based on two datasets are shown in Fig. 5. As seen from Fig. 5: setting the range of α from 0.4 to 0.5 obtains better results. This indicates that α has a significant impact on the measurement of users’ preferences and geographical neighbour characteristics. α = 0 or α = 1 will result in decreased recommendation accuracy. In particular, when α = 0, since the geographical neighbour characteristics are not taken into account, the recommendation accuracy is the worst. When k = 5, λ ₂ = 0.04, the influence of social relations parameter λ ₁ to the whole model is shown in Fig. 6. From Fig. 6, the following conclusions can be drawn: 1) λ ₁ = 0.01 gets the best recommendation result, but the recommendation accuracy will drop when λ ₁ = 0 ; 2) When λ ₁ > 0.05, the ReGS model is stable, and will not become sensitive when λ ₁ changes. Therefore, the ReGS model is not very sensitive regarding λ ₁. Choosing λ ₁ = 0.01 as the default value is reasonable. When k = 5, λ ₁ = 0.01, the influence of the review text parameter λ ₂ to the whole model is shown in Fig. 7. As shown in Fig. 7, when the λ ₂ = 0.04, the ReGS model’s accuracy and recall rate improve. However, when λ ₂ > 0.04, the ReGS model is stable and not sensitive to λ ₂ changes. Therefore, the ReGS model is not very sensitive about λ ₂. Choosing λ ₂= 0.04 as the default value is reasonable. This is mainly because users may only mention some potential factors rather than all factors in one review.

6 Conclusions

In this paper, we presented an integrated analysis of the joint effects of multiple factors that influence one user to choose a POI. We proposed a general framework ReGS that integrates CNN into PMF to learn users’ preferences for POI recommendation in LBSNs. There are several advantages of the proposed recommendation method. First, this model captured the geographical influence on a user’s check-in behaviour by taking into consideration the geographical factors in LBSNs, such as Tobler’s first law of geography. Second, this method effectively models user social rel ations, which are important for location-based services. Last but not least, the proposed approach extended the latent factors from explicit rating recommendation to implicit feedback recommendation settings by considering the review data characteristic based on CNN. Finally, extensive experimental results on real-world LBSN data validated the performance of the proposed method. In the future, we will investigate other deep learning models to replace CNN for further boosting performance including recurrent neural networks and long short-term memory. Furthermore, it would be interesting to investigate the recommendation effect of reviews information compared to other information, such as temporal effects, POIs orders or popularity impacts.

References

Lu J, Wu D, Mao M et al (2015) Recommender system application developments: a survey. Decis Support Syst 74(C):12–32
Article Google Scholar
Bobadilla J, Ortega F, Hernando A (2013) Recommender systems survey. Knowl-Based Syst 46 (1):109–132
Article Google Scholar
Toledo RY, Martinez L (2017) Fuzzy tools in recommender systems: a survey. Int J Comput Intell Syst 10(1):776–803
Article Google Scholar
Foursquare Available online: http://foursquare.com. Accessed 15 May 2017
Liu X, Liu Y, Aberer K et al (2013) Personalized point-of-interest recommendation by mining users’ preference transition. In: Proceedings of the 22st ACM International conference on conference on information and knowledge management. San Francisco, pp 733–738
Wu L, Chen E H, Liu Q et al (2012) Leveraging tagging for neighborhood-aware probabilistic matrix factorization. In: Proceedings of the 21st ACM international conference on information and knowledge management. Maui, pp 1854–1858
Li XT, Cong G (2015) Rank-GeoFM: a ranking based geographical factorization method for point of interest recommendation. In: Proceedings of the 38th International ACM Sigir conference on research and development in information retrieval. Santiago, pp 433–442
Yuan Q, Cong G, Ma Z et al (2013) Time-aware point-of-interest recommendation. In: Proceedings of the 36th International ACM Sigir conference on research and development in information retrieval. Dublin, pp 363–372
Liu B, Xiong H (2015) A general geographical probabilistic factor model for point of interest recommendation. IEEE Trans Knowl Data Eng 27(5):1167–1179
Article Google Scholar
Jamali M, Ester M (2010) A matrix factorization technique with trust propagation recommendation in social networks. In: Proceedings of the 4th ACM conference on recommender systems. Barcelona, pp 135–142
Gao HJ, Tang JL, Hu X (2015) Content-aware point of interest recommendation on location-based social networks. In: Proceedings of the 29th AAAI conference on artificial intelligence. Austin, pp 336–350
Adomavicius G, Tuzhilin A (2011) Context-aware recommender systems. Int J Inf Technol Web Eng 16 (3):2175–2178
Google Scholar
Noguera JM, Barranco MJ, Segura RJ et al (2012) A mobile 3D-GIS hybrid recommender system for tourism. Inform Sci 215(18):37–52
Article Google Scholar
Ference G, Ye M (2013) Location recommendation for out-of-town users in location-based social networks. In: Proceedings of the 22st ACM international conference on conference on information and knowledge management. San Francisco, pp 721–726
Cheng C, Yang HQ, King I et al (2012) Fused matrix factorization with geographical and social influence in location-based social networks. In: Proceedings of the 26th AAAI conference on artificial intelligence. Toronto, pp 211–276
Lian DF, Zhao C, Xie X et al (2014) GeoMF: joint geographical modeling and matrix factorization for point-of-interest recommendation. In: The 20th ACM SIGKDD international conference on knowledge discovery and data mining. New York, pp 831– 840
Cheng ZY, Caverlee J, Lee K et al (2011) Exploring millions of footprints in location sharing services. In: Proceedings of the 5st International AAAI conference on Weblogs and social media. Barcelona, pp 221–226
Ling G, Lyu MR, King I (2014) Ratings meet reviews, a combined approach to recommend. In: Proceedings of the 8th ACM conference on recommender systems. Foster City, pp 105–112
Wang H, Wang N, Yeung DY (2015) Collaborative deep learning for recommender systems. In: Proceedings of the 21st ACM SIGKDD conference on knowledge discovery and data mining. Sydney, pp 1235–1244
Kim D, Park C, Oh J et al (2016) Convolutional matrix factorization for document context-aware recommendation. In: Proceedings of the 10th ACM conference on recommender systems. Boston, pp 233–240
Shen Y, He X, Gao J, Deng L, Mesnil G (2014) A latent semantic model with convolutional-pooling structure for information retrieval. In: Proceedings of the 23rd ACM international conference on conference on information and knowledge management. Shanghai, pp 101–110
van den Oord A, Dieleman S, Schrauwen B (2013) Deep content-based music recommendation. In: Proceedings of the 27th annual conference on neural information processing systems. Lake Tahoe, pp 2643–2651
Li S, Kawale J, Fu Y (2015) Deep collaborative filtering via marginalized denoising auto-encoder. In: Proceedings of the 24th ACM International conference on information and knowledge management. Melbourne, pp 811–820
Tobler WR (1970) A computer movie simulating urban growth in the detroit region. Econ Geogr 46:234–240
Article Google Scholar
Liu Y, Wei W, Sun A et al (2014) Exploiting geographical neighborhood characteristics for location recommendation. In: Proceedings of the 23st ACM international conference on information and knowledge management. Kuala Lumpur, pp 739–748
Zhang Z, Liu H (2015) Social recommendation model combining trust propagation and sequential behaviors. Appl Intell 43(3):695–706
Article Google Scholar
Zhang Z, Xu G, Zhang P et al (2017) Personalized recommendation algorithm for social networks based on comprehensive trust. Appl Intell 1–11
Koren Y (2008) Factorization meets the neighborhood: a multifaceted collaborative filtering model. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Las Vegas, pp 426–434
Wang C, Blei DM (2011) Collaborative topic modeling for recommending scientific articles. In: Proceedings of the 17th ACM SIGKDD international conference on knowledge discovery and data mining. San Diego, pp 448–456
Lecun Y, Bottou L, Bengio Y et al (1998) Gradient-based learning applied to document recognition. IEEE 86:2278–2324
Article Google Scholar
Gao H, Tang J, Liu H (2012) Exploring social-historical ties on location-based social networks. In: Proceedings of the twenty-sixth AAAI conference on artificial intelligence. Toronto, pp 114–121
Zhang JD, Chow CY (2015) CoRe: exploiting the personalized influence of two-dimensional geographic coordinates for location recommendations. Inform Sci 293:163–181
Article Google Scholar
Gunawardana A, Shani G (2009) A survey of accuracy evaluation metrics of recommendation tasks. J Mach Learn Res 10(10):2935–2962
MathSciNet MATH Google Scholar
Yin H, Cui B, Sun Y et al (2014) LCARS: a spatial item recommender system. Acm Trans Inf Syst 32(3):11–11
Article Google Scholar
Ying JC, Kuo WN, Tseng VS et al (2014) Mining user check-in behavior with a random walk for urban point-of-interest recommendations. Acm Trans Intell Sys Technol 5(3):1–26
Article Google Scholar
Hu L, Sun A, Liu Y (2014) Your neighbors affect your ratings: on geographical neighborhood influence to rating prediction. In: The 37th International ACM SIGIR conference on research and development in information retrieval. Gold Coast, pp 345– 354

Download references

Acknowledgements

This work was supported by the following grants: National Natural Science Foundation of China (No.61772321, No.61572301, No.61602282, No.90612003), Natural Science Foundation of Shandong Province (No. ZR2013FM008, No. ZR2016FP07), the Open Research Fund from Shandong Provincial Key Laboratory of Computer Network (No. SDKLCN-2016-01), Innovation Fundation of Science and Technology Development Center of Ministry of Education and New H3C Group (2017A15047).

Author information

Authors and Affiliations

School of Information Science and Engineering, Shandong Normal University, No. 88 East Wenhua Road, Jinan, 250014, China
Shuning Xing, Fangai Liu & Tianlai Li
School of Mathematical Science, Shandong Normal University, No. 88 East Wenhua Road, Jinan, 250014, China
Xiaohui Zhao

Authors

Shuning Xing
View author publications
You can also search for this author in PubMed Google Scholar
Fangai Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xiaohui Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Tianlai Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fangai Liu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Xing, S., Liu, F., Zhao, X. et al. Points-of-interest recommendation based on convolution matrix factorization. Appl Intell 48, 2458–2469 (2018). https://doi.org/10.1007/s10489-017-1103-0

Download citation

Published: 05 December 2017
Issue Date: August 2018
DOI: https://doi.org/10.1007/s10489-017-1103-0

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Points-of-interest recommendation based on convolution matrix factorization

Abstract

Similar content being viewed by others

Hybrid graph convolutional networks with multi-head attention for location recommendation

Content-aware point-of-interest recommendation based on convolutional neural network

GN-GCN: Combining Geographical Neighbor Concept with Graph Convolution Network for POI Recommendation

Explore related subjects

1 Introduction

2 Related work

2.1 Recommendation based on contextual information

2.2 Recommendation based on review texts

3 POI recommendation based on matrix factorization

4 POI recommendation framework based on convolution matrix factorization

4.1 Modelling reviews information

4.2 Modelling geographical information

4.3 Modelling user social relations

4.4 ReGS model

4.5 Parameter estimation

5 Experiments

5.1 Foursquare dataset

5.2 Evaluation index

5.2.1 Predicting ratings

5.2.2 Recommending top-k POIs

5.3 The method for comparison

LCARS [34]

CoRe [32]

GeoMF [16]

DRW [35]

NCPD [36]

5.4 Analysis of experimental results

5.4.1 Comparison and analysis of recommended models

CoRe

LCARS

GeoMF

DRW

NCPD

ReGS

5.4.2 Factor impact analysis

5.4.3 Parameter analysis

6 Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation