Keywords

1 Introduction

There is a large number of the possibilities for obtaining data on mobile phone individual trajectories, e.g. tracking applications in smartphones. It appears that the localisation data of mobile phone operators are most interesting. This data is not conditioned on the possession of any advanced smartphone or intentional enabling of a proper application. Nowadays, almost everyone has a mobile phone, even the simplest one; therefore, localisation data from the providers of mobile telephony is the most common in this regard, democratic and reliable. Localisation takes place based on triangulation and trilateration methods in BTSs (Base Transceiver Stations), which guarantee high accuracy in the urban area [2, 5]. The spread of mobile devices in the recent years has contributed to the performance of many original researches concerning human mobility, and the mobile phone data collected by network operators have become the invaluable source of information about individual mobility patterns on a large scale. Whereas, mobility and individual trajectory data are still poorly available although it is significant for a smart city, which provides supporting services and solving city problems. A main hindrance here are legal aspects referring to privacy, see for example [3, 9, 10].

Our goal and contribution is to design and implement a generator of individual mobility trajectories for mobile phone users. Trajectories mirror the behaviours of tourists staying in a given urban area. Trajectories are generated at random based on the prepared profiles of tourism behaviours, considering the various points of interests (POIs) located within the urban area. The respective elements of behaviour profiles are selected randomly for each object. The following parameters are considered: the prospective interests of tourists, their variability and possible habits connected with tourists mobility, intensity in respect of the implementation of a tourism plan, changing the means of transport and other aspects.

We believe that there are numerous prospective applications for thus obtained mass data concerning tourist mobility. These are: the analysis and understanding of the tourist traffic needs, the calibration of urban models and operational procedures, recommendation systems, increasing tourists’ safety, in particular in dangerous regions or districts, solutions supporting the designing of the urban infrastructure, applications supporting the control of epidemic spread, solutions helping in emergency management and many others. Furthermore, possessing such data constitutes a perfect base for testing systems concerning the smart city which are being created or which will be created in the future.

2 Related Works

The legal regulations concerning mobile devices localisation are not too restrictive in the Asian countries, they are moderately restrictive in the USA and extremely restrictive in Europe, cf. Uhlirz et al. [12]. This last case is the main reason for the low availability of such data, despite their immense potential. A certain example here is Estonia, see Ahas et al. [1], where anonymised data are analysed, which refer to a certain group of users but at the same time they demonstrate huge advantages from possessing and analysing such data. The possibilities of the tourist trajectories analysis are also discussed in the paper [6]. Kwan et al. [7] describe an experiment of the limited data disclosure by Deutsche Telekom for a certain selected period from the past. Such data allow to analyse various hypotheses, verify them in terms of various initiatives, including start-ups, the assumed methodologies of data analysis, etc. Therefore, the objects mobility trajectories are crucial here. In the work by Gonzalez et al. [4], the following was analysed: the use of trajectory in searching for similarities, finding general patterns, the distribution of spatial probability, what is significant for the urban models calibration or understanding the spreading of e.g. an epidemic.

The aspect of trajectory generation has already been considered. Pelekis et al. [10] suggest the generator of objects which follow a specific main objective of such a generator. This differs considerably from our solution which involves object profiles; moreover, our work includes some sets of targets which are interrelated unambiguously with the pre-defined profiles. Pappalardo and Simini [9] generate trajectories taking into account some tendencies, also considering a tendency for breaking the routine. The work does not imply any basic source of obtaining mobile data. In the work by Giurlanda et al. [3], there is the trajectory simulator considered, concerning people’s habits based on the behaviour model. This work affects our approach; yet, we consider behaviour profiles for tourists in the urban area. Zhou et al. [13] suggest the system of recommending points connected with travelling on the basis of preferences and journey time estimation. The aim is to match the trajectory with the users based on the objects mobility patterns. Networks are trained and trajectories are corrected as a result of the min-max strategy. In our paper, trajectories are generated without their corrections based on random behaviour profiles. Renso et al. [11] discuss various trajectory generators, comparing real data and synthetic data. The work discussed implies that our generator may have numerous applications. According to our best knowledge, it differs from the recently known generators, and a distinguishing factor is the random generation of trajectory based on various profiles which are weighted additionally in order to obtain better realism. Our paper is based on the unpublished work [8]. Nevertheless, a significant summary was performed and new results were generated.

3 User Profiles and POIs Planner

The main task of the generator is to produce trajectories which represent human behaviours as naturally as possible. The trajectory quality is understood here as the capability of imitating human behaviours in a natural manner. In order to ensure the conformity of the visited POI with the preferences of a given tourist, a user profile was designed and formed as a structure allowing for the description of features and the inclinations of a specific tourist in an adequate and implementable manner generator algorithms.

$$\begin{aligned} Profile=(I, f(i_n)\rightarrow w_n,S, d, M, T, f(t_n)\rightarrow v_n, a, p) \end{aligned}$$

I is the set of profile interests. Function \(f(z_n)\rightarrow w_n\) allocates weights to a category from set I. A weight means here a degree of being interested in each of the elements, expressed as a number from the interval (0, 1). The sum of all the weights for the profile amounts to 1. S – the geographical coordinates of the journey starting point for a given profile. d – a suggested maximum distance in meters between each POI matched with the profile, and a starting point S. M – a material status allocated to the profile, it is determined in a five degree scale. T is the set of the preferred means of transport, and function \(f(t_n)\rightarrow v_n\) allocates a weight to each means of transport. The preferred activeness time is marked as a, it is used when determining time for trajectory. Value p is another structure containing additional tourist’s preferences used in the process of allocating the points.

The Planner of POIs task is to create a trajectory plan based on the pre-selected POIs. Each POI set for an urban area is obtained automatically by publicly available services, so it does not require any (manual) preparations. Firstly, the planner establishes the place of a trajectory starting point and the time of its starting. Then, searching for the best POI to be visited next takes place. It is searched in the supplied points, for which opening hours are known and for which it is possible to reach a given point timely or points for which we have no information about opening hours. In an extraordinary case, when it is not possible to create a plan consistent with opening hours, information about such opening hours may be ignored for given points. After determining another POI, the planner selects a planned transport mode matching the preferences for the profile, it estimates the travelling time to the next point and then it defines the planned visiting time in this point. Defining the visiting time takes place at random but the following factors are taken into consideration: an appropriate profile attribute and average visiting time in a given point. All the pre-calculated parameters are collected to the structure describing one fragment of the trajectory plan. At the end of each step, a selected point is marked as a current one and the process is re-implemented. While modifying the planner configuration, we can decide whether the trajectories are to be terminated in the last POI or the tourists are to return to the starting point after visiting all the points.

Fig. 1.
figure 1

The operating rule of the trajectory planner is presented as a simplified outline. (POD – Place of Departure, POI – Point of Interest). The maps depict the respective stages of trajectory generation. The first one presents an initial state, that is POIs and their assessments for the profile. A middle map is a stage repeated until all the points are planned. The last map is a ready-made trajectory plan; and there is also the outline of the ready-made trajectory plan below the last map. The outline provided below presents the steps taken at each stage; a dashed line joins given steps with the corresponding fragment of the map. Furthermore, the respective corresponding elements of the outline are marked with the same colours

After implementing the steps described above for all the supplied POIs, we obtain a daily trajectory plan which consists of parts ordered depending on their planned visiting sequence. Each part contains information enabling the creation of a corresponding fragment of a trajectory. The operating rule of the planner is also presented in Fig. 1.

4 Generating Results

In order to validate our system, we have carried out several generating processes. Table 1 contains details concerning the generated sets. Figure 2 presents a fragment of the visualisation made for a generated large trajectory set (set C, 4853 trajectories, 865 unique POIs).

Table 1. The generated data sets (A, B, C)
Fig. 2.
figure 2

The fragments of the visualisation made for Kraków City for 4853 generated trajectories (set C). On account of a large volume of data, trajectories are represented by unicolour lines. POIs are represented by dots whose colours denote different object categories

Table 2 presents the statistics generated based on the analysed sets. The statistics refer to transport methods and parameters connected with travelling, visiting POIs and the time in which the respective trajectories were placed.

Table 2. Table presenting statistical data obtained on the basis of the analysis of the generated sets
Table 3. 10 most popular categories of POIs on account of the number of visits for set B and set C

Table 3 presents the distribution of 10 most popular categories of POIs taken from 976-element set B and 4853-element set C.

We believe we have built an important and universal tool. Our system is characteristic in this regard as compared to the existing ones [3, 9,10,11, 13]. The results obtained prove the credibility and realism in reference to the trajectories which thus are reliable. The trajectories generated are synthetic but much effort was made so that they could look realistic and so that they could consider both various profiles of human behaviours and a randomness factor.

5 Conclusions

By using our system, it is possible to generate practically any number of tourist trajectories, depending on demand and in the real time. Further works may comprise other groups of people instead of tourists, e.g. regular citizens.