Keywords

1 Introduction

Improving air quality and reduce congestion in urban areas has been a priority in many countries. In central areas, regulatory traffic control has been introduced, usually integrated with improvement of public transport provision. These solutions are effective, but more than 20 years has passed from the introduction of those schemes and the technological context is changing. One main technological change refers to the rising of Automated Electric Vehicles (AEVs) that have the potential to affect substantially congestion, energy use and emissions in central areas. These AEVs will facilitate the sharing mobility development, with expected relevant effects on urban livability. One issue in designing future SAEVs service is the identification of the number of shared autonomous vehicles and the number and the localization of stations for the pick-up and drop-off of shared vehicles and the feasibility of the service [1]. In order to help the definition of space-temporal characteristics of the SAEVs demand in future mobility scenarios, the analyses of current characteristics of car sharing and taxi services can be of great help. For such purpose, this paper analyses the current taxi services in Rome, using FCD collected in February 2014, from a sample of 310 taxi drivers out of a total population of about 7,700 taxi drivers operating in the municipality.

The paper is composed as follows. Section 2 reports a literature review, Sect. 3 presents the study area where the taxi service is analyzed. Section 4 reports the methodology adopted in the analysis, considering the data and the approach. Section 5 contains the main obtained results. Finally, Sect. 6 reports some conclusions.

2 State of the Art

Several studies have been carried out on the analysis of taxi services, thanks to the innovative technologies. One of them concerns Berlin’s taxi services [2] and it’s based on data collected from on-board GPS devices while operating taxis. In particular, the study analyses the travel behavior and vehicle supply of the Berlin taxi market using floating car data (FCD) for one week each in 2013 and 2014. Regarding spatial analysis, the study shows that most taxi trips take place either within the city center or from/to Tegel Airport, the most important Berlin’s airport. Another study [3] concerns Washington, DC where the authors investigate the relationship between taxi pick-ups and drop-offs from GPS data considering land use and travel data for each zone in the city. Also in this case, the presence of three civil airports in the study area influences the taxi demand. The purpose is the development of a model to relate taxi demand with land use and accessibility. Harbin City [4] is a further example for taxi data analysis, in particular, the city was divided into traffic zones and the pick-up and drop-off locations are identified to build the origin-destination matrix of the trips. Other considered input data are travel distance, time and average speed in occupied and non-occupied status. The aim is to analyze travel demand distributions and the estimation of an entropy-maximizing model to estimate the traffic distribution. To reveal the travel patterns in Shangai, Liu et al. [5], analyze the trips of about 6,000 taxi. This study aims to identify spatial interactions among areas of the city and identify the structure of the travel flows.

The data collected from taxi trips are complex, contains geographical and temporal components and, in some cases, could contains other trip information. Consequently, can be hard to use queries to perform analyses (e.g., trip distance, trip distribution). Ferreira et al. [6] propose a possible solution of this computational hard problem using the trips data of New York City taxi, developing a model able to support visual exploration of big data related to origin-destination taxi trips. A similar analysis is conducted using FCD collected in Beijing and containing the taxi trips [7]. The data processed deals with the taxi stay location and the taxi operations. In order to analyze the travel length, the distribution of trip distance and the spatial distribution of the taxi, [8] use a dataset containing the trajectory data of 11,880 taxis in Beijing. The analysis is conducted considering both the travels with customer and the empty travels. The variables analyzed in [9] are the displacement of each trip the duration of each trip, the time interval between successive trips by the same taxi. The data are from five different cities and consider only the trips with customer.

3 The Study Area

The municipality of Rome extends over an area of 1,283.70 Km2, with the 22.20% destined to urban activities. The area is divided in six zones according to the PGTU (General urban traffic plan). Four of the six zones are inside the main road ring (GRA). The fifth zone is outside the GRA and includes urban perimeters of some relevance. The last zone is located in the west part of the city.

The population reaches 2.9 million of inhabitants, which daily generate 4.7 million of trips, while trips generated from outside Rome are about 800,000. Trips from outside is continuously growing (250,000 units more than 2004), confirming the population’s tendency to go live outside Rome.

4 Methodology of Work

4.1 Supporting Data

GPS data used in this case study belong to CRAWDAD dataset 2014 [10]. The GPS position of each taxi is logged every 10 s and it has been possible to build a database of historical GPS traces through around 27 thousand GPS positions recorded per day, therefore 756 thousand for the entire month of February 2014. Each entry includes:

  • ID (the taxi identifier),

  • Date (date the record is logged),

  • Timestamp (time the record is logged),

  • Coordinates (geographical location: latitude and longitude).

4.2 Approach

The possible status of taxicabs was aggregated in the following categories:

  1. 1.

    to customer (driving to or waiting for a customer),

  2. 2.

    with customer (driving with a customer on board),

  3. 3.

    at rank (standing at a taxi rank),

  4. 4.

    outside rank (idle but not at a rank, for example returning to a rank).

The analysis of taxi demand from GPS traces is composed of several phases:

  1. 1.

    computation of distance between two successive GPS positions recorded (if the distance travelled by the taxi driver in 2 min is less than 10 m, it has been considered the vehicle non-moving, otherwise the driver is traveling),

  2. 2.

    individuation of the origin o and the destination d of each trip,

  3. 3.

    computation of progressive distance (to evaluate the length of a trip).

Other calculated values are the travel time (for each trip) and the waiting time of a taxi in a position (if the waiting time in a position is greater than 2 min we suppose that the position is a destination).

The activity with customer is the taxicabs status needed for this demand study. In order to individuate this kind of status, it’s important to recognize all the times the vehicle is at a rank. A taxi cab is considered steady at a rank when the time previously calculated is more than 2 min and the coordinates are quite near one of 100 rank of the city. Hence, it’s possible to establish the number of times a vehicle is steady at a rank and instead when it’s steady because the driver is dealing with a customer (the costumer is entering or is leaving). Consequently, it can be found the trips with the activity with customer.

Once the data processing is applied, it has been possible to build a database based on the vehicles’ status with customer, for the studied period. Each entry of the database contains:

  • length of the trip,

  • travel time,

  • origin and destination of the trip,

  • waiting time.

Finally, the origin-destination matrix based on the vehicles’ status with customer is built.

5 Main Results of the Analyses

5.1 Temporal Patterns of Taxi Demand

A detailed distribution of taxi demand (the number of requests submitted per day and the number of requests submitted per hour) over the two weeks are presented in Figs. 1 and 2. For the demand side, the following happens:

Fig. 1.
figure 1

Request submissions per day.

Fig. 2.
figure 2

Request submission per hour.

  • taxi demand during weekdays (Monday–Friday) is generally higher than that of weekend (Saturday and Sunday),

  • taxi demand during weekdays follows a clear pattern: a major peak from 8 am to 9 am, smaller peaks during the afternoon (the first peak at 3 pm and the second one at 6 pm) and very low taxi demand from 12 am throughout the night,

  • Monday is demand-wise the busiest day,

  • Saturday night records a peak around 11 pm and significant taxi demand throughout the night.

Besides Saturday night, the demand for taxi services during weekends is low.

5.2 Trip Distances and Travel Time

The average trip distances, calculated adding the beeline distances between two consecutive geographical points (the points are recorded every 10 s), are at most weekday hours between five and fifteen kilometers (Fig. 3). However, the weekday morning (about 8 am), with average distances around twenty-five kilometers, form a notable exception. The day with the highest average trip distance is, on the other hand, Wednesday, with roughly 30 km. Relating to the average travel time (Fig. 4), it range from 7 to 31 min. The maximum average travel time is recorded Monday, between 8 am and 12 am, another peak is notable on Friday around 8 pm. The time analysis can be useful in defining the use of SAEV, assuming that the time that a user spends traveling with a taxi is comparable with the time that he/she would spend using the SAEV.

Fig. 3.
figure 3

Average trip distance at different times.

Fig. 4.
figure 4

Average travel time.

As to the distribution of trip distances (Fig. 5), there are few trips of less than one kilometer, whereas a trip distance between two and five kilometers is the most common. Longer distances are less and less likely: not even twenty percent of all trips are long between ten and twenty kilometers. On the other hand, trips longer than twenty kilometers are about the 13%.

Fig. 5.
figure 5

Trip distance distribution.

5.3 Location-Based Taxi Demand

Origins and destination of taxi trips are generally spread all over the city (Fig. 6a). The majority commences and ends within the inner railway ring and in the EUR district, one of the most important financial and tertiary area of the city. Moreover, the data suggest that about the 15% of trips are from the city to the outside and vice versa. Figure 6b highlights these trips, remarking that they are mainly from/to Fiumicino Airport, the biggest Rome Airport.

Fig. 6.
figure 6

Trips and origin destination relations (Map source: Google Streets).

6 Conclusions

In this paper, we presented an analysis of FCD related to the taxi trips in Rome. The data were collected in February 2014, from a sample of 310 taxi (on a total of about 7,700 taxi operating in the municipality). The data processing suggests that there is generally a demand peak on workday mornings and a several lower peaks over a longer time in the afternoon. From analyzed data, about 30% of the trips are taken in the morning from 5am to 8am while the remaining 70% are distributed over the rest of the day with two afternoon peaks, the first one at 4 pm and the second one at 6 pm. On weekends, the demand peaks shift towards the night and are generally lower than weekdays. On Saturday night, about 40% of the total number of trips occurs, with a peak at 11 pm. The remaining trips (60%) are distributed equally over the course of the day.

As far as spatial analysis is concerned, the majority taxi trips begins and ends within the Inner Railway Circle, the trips from/to the outside (mainly from/to Fiumicino airport) are about the 15%. As to the distribution of trip distances, there are few trips of less than 1 km, whereas a trip distance between 2 and 5 km is the most common. Longer distances are less and less likely: not even 20% of all trips are long between 10 and 20 km. On the other hand, trips longer than 20 km are about the 13%.

This analysis of Rome can be useful for the future SAEV services design, also using the taxi data (space-temporal distribution, number of trips, travel time, distance) as a proxy to try to define one component of the future demand for SAEV. Indeed, a first element to design the sharing service is the definition of the areas where the vehicles can be picked-up or dropped-off. To do this, can be selected as potential areas those where are registered high values of demand for taxi. Besides, the travel time evaluated with taxi data can be used to size the fleet of shared vehicles. It is noted that the actual zones of Rome covered with the vehicle sharing services are those with high taxi demand (central zones).

Further analyses are also in progress to use these data, with the aim to develop and test a model framework for forecasting taxi travel demand in relation to socio-demographic characteristic of different zones of the city.