1 Introduction

Event-based social networks (EBSNs) have become popular in the last couple of years, e.g., MeetupFootnote 1 and DoubanFootnote 2, which provide platforms for those who interact with each other in both online and offline activities, or who can create a user group with similar interests for professional networking. In March of 2019, Meetup, one of the largest EBSNs, had over 40 million users; more than 10 thousand events were created every day in more than 180 countries through the Meetup network.

Liu et al. [22] studied different types of recommendation problems in EBSNs, such as event recommendations and group recommendations. These problems have gained a significant increase in research interests [1, 4, 14,15,16, 21, 22, 24, 25, 29,30,31,32, 36,37,38,39]. Several techniques were proposed to solve the recommendation problems, such as matrix factorization [30, 31, 38] and exploiting content information [10, 14, 16, 23,24,25]. The similarities between users and between events were also studied in [9, 35], in which user interests and event contents were chosen to compute the similarities. Besides, the problem of friend recommendation in location-based social networks has been studied in [2, 6, 13, 33], but this problem has been ignored in EBSNs.

Understanding users’ behavior and predicting activities of them are important to design effective recommendation systems. Whether a user takes part in an event or not depends mainly on the topic of the event, and his/her own interests or the interests of his/her active friends. We define that active-friends of a user are the people who are interested in the topic of an event that the user is interested in and the people who are likely to attend the event based on their previous attending histories. In this paper, we study the problem of event recommendation for active-friends in the context of EBSNs. We formulate this problem as follows: Given an event ef and a key user u, the goal is to recommend event ef to the top N active-friends of user u based on a list of previous events.

To tackle the problem, we crawled datasets of events in a time window and of users who attended these events from the Meetup network, and conducted data analysis to investigate major factors affecting the decisions of users to take part in the events. From our analysis, we propose a content-based model that combines user features and events contents to recommend an upcoming event to the friends of a key user.

Our proposed model is constructed with the following procedure: Firstly, we propose a new approach to calculating the similarity of events based on their contents and locations. In this approach, we model the contents of events as the Latent Dirichlet Allocation (LDA) topic model [3] and use Gibbs Sampling method [7] to generate topic distributions of the events. Then, we use the Jensen Shannon divergence [20] to compute the similarity between the topic distributions of two events. Secondly, we collect users who took part in the chosen events which are similar to the given event ef, and propose a new content-based method to compute possibility scores of these users who are closely related to the key user u. In this method, we use multiple factors to compute the possibility scores, including the similarity between previously chosen events and the event ef, locations and users’ interests, and event times. Finally, we have a list of users with their possibility scores with respect to the key user and the event ef. The event ef will be recommended to the top N active-friends of the key user u who have the top possibility scores.

Experiments were conducted on real world datasets crawled from four cities. The experimental results have shown that the performance of our content-based event recommendation model outperformed three baseline methods.

The remainder of this paper is organized as follows. Section 2 reviews the related work. EBSNs data characteristics are analyzed, and the proposed model is presented in Section 3. Section 4 presents the Gibbs Sampling method to compute the topic vectors of events. Experiment results are presented in Section 5. Section 6 gives conclusions and new research directions of this work.

2 Related work

Event-based social networks (EBSNs) were initially studied by Liu et al. [22] and some unique and exciting features of EBSNs were discovered, such as locality structural networks and information flows. Different types of recommendation problems for EBSNs were also defined, such as event recommendation and group recommendation problems. These problems have been studied by many researchers [4, 9, 11, 14,15,16, 22, 24, 25, 29,30,31, 37,38,39,40].

Matrix factorization methods are famous for rating recommender systems [34]. Qiao et al [31] combined online and offline social relationships between users with locations to propose a standard matrix factorization. In another work, Qiao et al. [30] proposed a Bayesian matrix factorization model based on social interaction features and users rating data. However, these two methods ignored the content information of events.

To exploit the content information, several works studied the content-based event recommendation. Zhang and Wang [37] considered event contents, user relations, locations and organizers, and proposed a collective Bayesian Poisson factorization method to solve the cold-start local event recommendation problem. Based on analyzing users’ interests and attended seminar events, Minkov et al. [25] proposed a hybrid content-based and collaborative method to recommend academic events to users. The problem of event detection was studied in [14, 17] on the Twitter network. The LDA topic model was adopted in [14, 39] to solve the event recommendation problem. Li et al. [19] analyzed the rich information of DoubanEvent network, and proposed a hybrid model with integration of the LDA topic model to predict social influence. In a series of recent works [26,27,28], Ogundele et al. studied the problem of personalized event recommendation in EBSNs, and proposed a framework, called SoCaST. They listed the four types of influences on users, i.e., temporal, geographical, categorical and social types, which were modeled in SoCaST. The multi-criteria decision making technique was adopted to make recommendations of the most suitable events to users.

Lu et al. [23] developed a cross-media event extraction system which effectively recommends events to users’ ongoing search based on their interests. Jhamb and Fang [16] analyzed rich information of groups and presented two oriented items, user-oriented and event-oriented, to solve the event recommendation problem. They proposed a dual-perspective latent model that uses Logistic and Probit sigmoid functions to model the two oriented items. Another work in Macedo et al. [24] exploited a set of contextual information including description of events and interaction of users derived from their RSVPs. Then, they further proposed a hybrid context-aware model to recommend upcoming events to users in EBSNs. Chen and Su [4] also exploited the social interaction of users and collaborative friendships, and proposed a new social event recommendation model. Horowitz et al. [15] modeled the agenda of users and tags to form a mobile recommender system for events. To investigate how relevant human personality is to different types of places people visit, Chorley et al. [5] studied human behaviors on the Foursquare location-based social network. The conclusion was that the personality traits determine the sorts of places that users checked in.

Developing a unified model that can solve multiple recommendation tasks in one framework was also carried in [9] and [29]. Dong et al. [9] designed a probabilistic model that solves the group recommendation and the event recommendation problems in one framework. A time horizon was used in the model to recommend groups to users and offline events to users iteratively. A general graph-based model [29] was proposed to address three recommendation problems, namely, group-to-user, tag-to-user and event-to-user recommendations in one framework. The model treats the three problems as a query-dependent node proximity problem with rich information included. Extraction of features from users interests and events contents was studied in [1, 21, 36]. Zhang et al. [38] studied a new type of group recommendation problems, called the personalized event-based group recommendation. Location and social features were first modeled in matrix factorization, and the information of users and groups is taken into account. They proposed a pairwise tag-enhanced and feature-based matrix factorization method that is a combination of all the elements in order to solve the group recommendation problem. Du et al. [10] and Ding et al. [8] modeled the factors of spatial and temporal contexts of events to predict event attendance. Xu et al. [35] studied the problem of event recommendation through the Douban network, and proposed a semantic-enhanced and context-aware model that was constructed from a combination of the content information analysis and the social relations.

Hannon et al. [13] studied the tweets and relationships of users from Twitter and defined the friend recommendation problem, i.e., recommending those who should be followed by new users of the network. They proposed a model that combined collaborative filtering strategies with the contents to solve the problem. Several works [6, 18, 33] studied further extensions of friend recommendation. Chu et al. [6] proposed a new friend recommendation problem for location-based social networks. Based on the similar interests of the users, they employed the real-life location and dwell time in friend recommendation. After gathering these data, the proposed method analyzes the data using a weighted Voronoi diagram and interest similarity. Tu et al. [33] proposed an approach to solving the problem with a specific location and a friend in the location-based social networks. Li et al. [18] collected social and event-based features from DoubanEvent data to study the problem of followee recommendation in EBSNs, and they proposed an improved Bayesian model that uses these features to address this problem. However, in our work, we use data crawled from Meetup network and there is no direct links between users.

3 Methodology

In this section, we first describe the datasets of four cities we crawled from Meetup network. Then, we analyze the properties of the data, and investigate what features affect the decisions of users to take part in events. Finally, we propose a new content-based event recommendation model.

3.1 Data collection

We crawled datasets from Meetup.com that is considered as the largest event-based social network (EBSN). The structure of EBSNs is illustrated in Fig. 1, which contains five main entities: groups, events, users, venues and tags. One event is created from a specific group and hosted in a particular venue. Users confirm whether to attend the event or not by sending RSVPs (YES or NO). YES means that a user will attend an event. NO means that a user will not take part in an event. Each user can be a member of several groups and send RSVPs for any events. Users can express their personal interests by a set of tags. In our work, each event has content, location, time and a list of RSVPs (YES or NO). Each user has information of a set of tags and a location.

Fig. 1
figure 1

An example of an event-based social network

For example, in Fig. 1 User 1 and User 2 join Group 1, and User 2 and User 3 join Group 3. Group 1 held the Live Music and Reading Books events in a Coffee shop, which were attended by User 1 and User 2. Both User 3 and User 2 took part in the Cycling event in a Park venue, which was held by Group 2. User 1 uses tags “Music and Learning”. Tags “Learning and Outdoors” are used by User 2, and User 3 only uses tag “Outdoors”.

3.2 User similarity

In EBSNs, there is no direct link to indicate that two users are friends. However, we model the similarities between users in two ways, i.e., interest-based and behavior-based similarities [9, 35]. We use these similarities to measure how users are related to each other.

3.2.1 Interest-based similarity

Users who are members of a particular group may have similar personal interests, which can be manifested by a set of tags (as shown in Fig. 1). To calculate the interest-based similarity between users, we utilize the tags. The interest-based similarity between users is defined:

$$ S_{i} (u,u^{*}) = \frac{ T^{u}\cap T^{u^{*}}}{ T^{u} \cup T^{u^{*}} } $$
(1)

where Tu and \(T^{u^{*}}\) are two sets of tags of users u and u respectively.

3.2.2 Behavior-based similarity

The event participation is treated as a social behavior of users. The behavior-based similarity between two users can be defined as

$$ {S_{b}^{c}} (u,u^{*}) = \frac{ E^{u}\cdot E^{u^{*}}}{\lVert E^{u} \lVert \lVert E^{u^{*}} \lVert } $$
(2)

where Eu and Eu are two sets of events that u and u are interested and have sent RSVPs to these events.

To be more specific to the upcoming event ef, we define the behavior-based similarity between two users as

$$ S_{b}(u,u^{*}) = \sum\limits_{j=1}^{|E|} {D^{j}_{u}} \times D^{j}_{u^{*}} $$
(3)

where Dj is the similarity between the upcoming event ef and ej in E which the user attended. The similarity between events is defined in (5) in the following section.

3.3 Event similarity

Social interests of users are expressed by social events they attend in EBSNs. The social events are described in a set of documents shown on web pages as semi-structured text data, as illustrated in Fig. 2. We can observe that the event consists of the content (i.e., the title and description) and its location and time. These kinds of information have an impact on the decisions of users to confirm RSVPs (YES or NO). Hence, the content and location factors extracted from events are adopted to analyze the decisions of users.

Fig. 2
figure 2

Example of one Meetup event

3.3.1 Topics from events contents

The contents of events may include several themes or topics. Instead of using keywords, we represent an event as a vector of topics based on the Latent Dirichlet Allocation (LDA) topic model [3]. Given a set of event documents, we use the Gibbs Sampling method [7] to compute the topic vectors of these documents. Then, the set of event documents is converted to a matrix Θ where a row is a corresponding event document and a column is a topic. The process of computing Θ is described in details in Section 4.

3.3.2 Similarity between events

To measure how events are related to each other, the divergence between the topic distributions of the events is taken into account. We adopt Jensen-Shannon divergence [20] to compute the similarity between events.

Given two topic distributions 𝜃p and 𝜃q of events ep and eq respectively, Jensen Shannon divergence DJS is defined as follows:

$$ D_{JS}(\theta_{p},\theta_{q}) = \frac{1}{2} \left[D_{KL}\left( \theta_{p},\frac{\theta{p}+\theta{q}}{2}\right)+D_{KL}\left( \theta{q},\frac{\theta{p}+\theta{q}}{2}\right)\right] $$
(4)

where DJS is the Jensen Shannon divergence, DKL is the Kullback-Leibler divergence, defined as \(D_{KL} (\theta {p},\theta {q}) = {\Sigma }_{i=1}^{K}p_{i}ln\frac {p_{i}}{q_{i}}\) where pi and qi are the probabilities of topic i in event ep and event eq respectively.

The similarity of ep and eq is denoted by D, computed as

$$ D = 1 - D_{JS}(\theta_{p},\theta_{q}) $$
(5)

The value of D varies from 0 to 1. In particular, if the value is close to 1, the contents between ep and eq are very similar. Otherwise, if the contents are very different, the value is close to 0.

3.4 Location-based similarity between users and events

Obviously, users who are interested in previous events are willing to attend the upcoming event, if the new event is located closely to the previous event places and the users’ location. The location-based similarity can be used to measure how users are related to events in terms of location. It is represented as

$$ S_{l}(e,u) = e^{\frac{-d(e,u )^{2}}{2} } $$
(6)

where d(.,.) is the Euclidean distance between location of event e and user u.

3.5 Event recommendation model

Given an upcoming event ef and one key user u, we want to find a group of potential attendees who are closely related to u. We call them active-friends of u and recommend the upcoming event ef to them. We name this recommendation problem as the problem of event recommendation for active-friends.

We propose an effective content-based event recommendation model (ECA) that is formed by event-based and user-based similarities to solve the problem. Our model consists of two phases. The first phase selects previous events in the same group or the same location with ef in order to reduce the number of events and sparse data. Then, we compute the similarity between the selected previous events and ef. In the second phase, we first collect users from the selected events, and then construct an association matrix of users and events. Consequently, we score all users and recommend ef to top N active-friends of user u.

3.5.1 Phase 1: Find relevant events

This phase is used to select previous events relevant to the upcoming event ef. The process is described as follows:

  1. Step 1

    The set of events EP is modeled by the Latent Dirichlet Allocation (LDA) topic model, and Gibbs sampling method is used to generate topic distributions ΘP and topic-word distributions ΦP from the whole set of events contents.

  2. Step 2

    Given an upcoming event ef and a key user u, the content of ef and ΦP are used to infer topic distribution 𝜃f of the event ef.

  3. Step 3

    An event ei in the whole event set EP is chosen and added in the set EC, if it meets one of the following requirements: (1) ei and ef are created by the same group or (2) they have the same location.

  4. Step 4

    For each event ej in EC, the similarity Dj between ej and ef is computed by (5).

3.5.2 Phase 2: Find active-friends

From Phase 1, we obtain a set of chosen events EC. Then, we obtain a set of users U who attended these events in EC. If a user ui attended an event ej, the similarity between event ej and the upcoming event ef is computed in (5), denoted by Dj. This similarity Dj represents the association between the user ui and the event ej. The results are represented in a user-event association matrix (UE). In UE, each row is a user, and each column is an event. The elements equal to 0 mean that the users did not attended the corresponding events. An example of UE is given below.

$$ \text{UE} = \begin{array}{cccccc} &{} e^{1} &{\kern4pt} e^{2} &{\kern4pt}... &{\kern6pt} e^{j} &{\kern4pt} e^{m}\\ \begin{array}{l} u^{1}\\ u^{2}\\ ...\\ u^{*}\\ u^{i}\\ u^{n} \end{array} \left( \begin{array}{cccccc} D^{1} & 0 & 0 & 0 & D^{m} \\ 0 & D^{2} & 0 & D^{j} & 0 \\ 0 & D^{2} & D^{...} & 0 & D^{m} \\ D^{1} & 0 & D^{...} & 0 & D^{m} \\ D^{1} & 0 & D^{...} & 0 & 0 \\ 0 & D^{2} & 0 & D^{j} & 0 \end{array}\right) \end{array} $$

For the key user u, we select one event organizer of the upcoming event as u, and this user has also attended previous events. Given a UE, we can compute the similarity between the key user u and all other users in UE, using (3).

To measure how the upcoming event ef is associated with the last attended event el of a user u, we calculate the gap time Tu between them as

$$ T_{u} = \frac{1}{ T^{e^{f}} - T^{e^{l}} } $$
(7)

where the time difference of (\(T^{e^{f}}\) - \(T^{e^{l}}\)) is measured in a number of weeks. The bigger the gap time Tu is, the more associated the event ef to the user u is.

Finally, each user in UE is scored against the key user u with respect to the upcoming event ef as

$$ S (u|(u^{*},e^{f}) = S_{b}(u,u^{*}) + S_{i}(u,u^{*})+T_{u} $$
(8)

where Si(u,u) and Sb(u,u) are computed by (1) and (3) respectively.

Top N users with high scores are selected and the upcoming event ef is recommended to them. If there are some users having the same scores, these users will be chosen by distance scores that are computed by (6). The user with high distance score is chosen.

4 Gibbs sampling method for computing topic vectors

We use the LDA topic model to model a set of event documents crawled from EBSNs. Given the number of latent topics K, the generative process of LDA for a set of N event documents with a set of unique words W is described in the following steps:

  1. 1.

    For each topic zK

    1. (a)

      Draw \(\phi _{z} \sim Dirichlet(\beta )\), which is a multinomial distribution over words.

  2. 2.

    For each document dN

    1. (a)

      Draw a topic distribution \(\theta _{d} \sim Dirichlet(\alpha )\).

    2. (b)

      For each wi in d

      1. (i)

        draw a topic \(z_{i} \sim Mult(\theta _{d})\)

      2. (ii)

        draw a word \(w_{i} \sim Mult(\phi _{z_{i}})\)

Suppose that parameters α and β are given, the generative process results in the following joint distribution for document d:

$$ \begin{array}{@{}rcl@{}} p(w_{d},z_{d},\theta_{d}, \varPhi|\alpha,\beta) ={\varPi}_{i=1}^{N_{d}} p(w_{d,i}|\phi_{z_{d},i}) p(z_{d,i}|\theta_{d}) \\ p(\theta_{d}|\alpha)p(\varPhi|\beta) \end{array} $$
(9)

where Nd is the number of words in document d.

Probability distributions for the set of N documents is

$$ p(W,Z,\varTheta,\varPhi|\alpha,\beta) = {\varPi}_{d=1}^{N}p(w_{d},z_{d},\theta_{d},\varPhi|\alpha,\beta) $$
(10)

To represent the set of N documents as a set of N vectors, each vector with K topics, we need to calculate the topic distributions Θ and the topic-word distributions Φ. In this work, we use Gibbs Sampling method to calculate Θ and Φ. The pseudo code of this method is briefly described in Algorithm 1, which includes the following main three steps.

  1. Step 1

    Initial step: For each document dj in the set of N documents, randomly assign each word wi to one of K topics.

  2. Step 2

    Gibbs Sampling: For each iteration, we re-assign a topic to each wi in dj. First we need to reduce the value of \(W_{t^{c}}, N_{d_{j},t^{c}}\) and \(W_{w_{i},t^{c}}\) related to the current topic tc by 1. We select topic k in K with the probability for word wi denoted by:

    $$ P(z_{t}=k|z_{-t},w_{i},d_{j}) = \frac{W_{w_{i},k} +\beta}{W_{k} +W\beta } \times \frac{N_{d_{j},k} +\alpha}{N_{d_{j}} +K\alpha } $$
    (11)

    where P(zt = k|.) is the probability of word wi assigned to topic k.

    zt includes all topic allocations except for zt.

    \(W_{w_{i},k}\): Number of times word wi assigned for topic k

    Wk: Number of words in W assigned to topic k

    \(N_{d_{j}}\): Total number of words in document dj

    \(N_{d_{j},k}\): Number of words for topic k in document dj

    We have a set of P(z|.). Then, a new topic tn is sampled from P(z|.) and reassigned to wi. We increase the value of \(W_{t^{n}}, N_{d_{j},t^{n}}\) and \(W_{w_{i},t^{n}}\) by 1.

  3. Step 3

    Generating Φ and Θ: After a large number of iterations (e.g, 1000 loops), we obtain the stable topic-word distribution Φ and the stable topic distribution Θ as follows:

    $$ \phi_{w_{i},k} = \frac{W_{w_{i},k} +\beta}{W_{k}+W\beta} $$
    (12)

    where \(\phi _{w_{i},k}\) is the probability of word wi for topic k, and

    $$ \theta_{d_{j},k} = \frac{N_{d_{j},k} +\alpha}{N_{d_{j}}+K\alpha} $$
    (13)

    where \(\theta _{d_{j},k}\) is the proportion of topic k in document dj. Topic distribution of document \(\theta _{d_{j}}\) is a set of \(\theta _{d_{j},k}\).

figure e

5 Experiments

In this section, we present the experiment results to show the performance of the new recommendation model for active-friends. First, we describe the datasets. Then we evaluate the proposed model compared to other baseline methods.

5.1 Experimental setup

5.1.1 Datasets

The data used in our experiments were crawled from the Meetup websiteFootnote 3 in year 2016. Four cities in the world were selected, i.e., London(LD), Sydney(SN), and San Francisco (SF) and New York (NY). For each city, we collected information of events and attendees of the events. The information of each event includes title, description, time and location. Each event has a list of RSVPs (YES or NO), which present those who confirm YES or NO to take part in it. The number of users was obtained as the number of unique users. Moreover, each event must have at least 5 attendees (YESs). A list of attendees in the events is obtained. Each attendee contains the information of interests, location, and time to send RSVPs to the event.

We plot the distributions of events and users for four cities, illustrated in Fig. 3. We can see that 95% of users attended less than 15 events and 70% of events had less than 20 attendees. These datasets are summarized in Table 1. In our experiments, we consider the chronological order of events in selecting training and testing data. We first sorted the events on event time. Then, we took the early 80% events as the training data, the later 20% events as new events for the testing data for each city.

Fig. 3
figure 3

Distributions of events and users in the four cities in year 2016

Table 1 Dataset statistics

5.1.2 Evaluation metrics

To evaluate the performance of the active-friends recommendation we adopt two classic precision and recall metrics, denoted by Precision@N and Recall@N respectively.

$$ \begin{array}{@{}rcl@{}} Precision@N= \frac{\frac{|L^{rec} \bigcap L^{real}|}{N}}{M} \end{array} $$
(14)
$$ \begin{array}{@{}rcl@{}} Recall@N= \frac{\frac{|L^{rec} \bigcap L^{real}|}{|L^{real}}|}{M} \end{array} $$
(15)

where N is the number of friends of a user u,M is the number of upcoming events in the testing set. Lrec is the list of active-friends to whom the given event e will be recommended, and Lreal is the list of real attendees who say YES to the event e.

5.1.3 Parameters setting

For each dataset, we removed all punctuations and stop words from the contents of all events. Then, the cleaned contents in the training set were used as a collection of all documents P with a set of unique words W. In order to obtain the similarity between events [12], we set parameters β = 0.01,α = 50/K, where K was set to 100 topics. The iteration of Gibbs Sampling method was set to 1000. These parameters were used to generate topic distributions ΘP and topic-word distributions ΦP.

5.1.4 Comparison methods

We compared our method, ECA for an effective content-based event recommendation model for active-friends, with three baseline methods listed below:

  • Most-Popular (MP). This method is the traditional popularity-based approach that selects the users in a group who frequently attended the events in the training set for recommendation.

  • User-based collaborative filtering method using Pearson correlation (Pearson)

  • User-based collaborative filtering method using Cosine correlation (Cosine)

Platform

The experiments were conducted on a machine with 12GB main memory and 3.4GHz dual-core CPU. All algorithms were written in Java. The operating system was Window 10.

5.2 Results and analysis

In this section, we compare the results of the proposed model with the results of baseline algorithms. The experiments were conducted on four datasets. For each event e in the testing set, we assigned one event organizer as the key user u. To evaluate the performance of all methods, we set the size of the recommended friend list of u from 1 to 5.

As described before, our model consists two phases. The first phase is to compute the similarity between the chosen events EC and the upcoming event. The second phase is to compute the possibility scores of users. To be fair on the performance comparison of all methods, we used these chosen events EC to compute the scores of users u to u when making comparisons for all methods. The UE matrix is constructed from EC and a set of attendees obtained from EC. Elements of the matrix represent the similarity between users and attended events. With this matrix, we can use Cosine and Pearson methods to compute similarities between users.

The results of the experiments are given in Fig. 4, where N is changed from 1 to 5. From these figures, it is clear that our proposed method outperformed all baseline methods in precision and recall metrics, and the Pearson is the worst for the four cities with different N s. We also observe that the precision of all methods decreases and the recall increases as N increases. It indicates that the recommendations closer to the top are more accurate, and these results also reflect the real life situations that many people do not have many close friends. The two user-based methods, Pearson and Cosine, yield the worse results. Because the matrix constructed by the set events EC and the users of EC is sparse, these two methods result in low predictions. The other traditional method, MP, recommends those users who frequently attended the events. However, the users who were more active in the past may not be active in the present time, so it leads to worse results. In our method, this weakness is overcome by (7). Moreover, we also consider topics, and interests together to compute the possibility scores of users. Therefore, our method produced better predictions in the four datasets.

Fig. 4
figure 4

Performance of the four methods in event recommendation for active-friends

6 Conclusions and future work

In this paper, we have studied the problem of event recommendation for active-friends in event-based social networks. The study was carried on the extensive datasets crawled from Meetup. We identified social topics and personal interests impacting on decisions of users to take part in events. Moreover, they are likely to attend events with friends. We developed the effective content-based model that is formed by chosen events and users to solve the problem. The proposed model produces a list of users with possibility scores, and recommends the future event to the top N users. Through evaluation using the datasets from four cities, the comparison results demonstrate the effectiveness of our model. In the future work, we will study the loyalty problem in social networks to address the question why users leave their group and do not take part in events, and what factors affect their decisions.