Identifying important variables for predicting travel time of freeway with non-recurrent congestion with neural networks

Li, Chi-Sen; Chen, Mu-Chen

doi:10.1007/s00521-012-1114-z

Identifying important variables for predicting travel time of freeway with non-recurrent congestion with neural networks

Original Article
Published: 16 October 2012

Volume 23, pages 1611–1629, (2013)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Neural Computing and Applications Aims and scope Submit manuscript

Identifying important variables for predicting travel time of freeway with non-recurrent congestion with neural networks

Download PDF

Chi-Sen Li¹ &
Mu-Chen Chen¹

980 Accesses
33 Citations
Explore all metrics

Abstract

The provision of long-distance travel time information has been a major factor facilitating the intelligent transportation system to become more successful. Previous studies have pointed out that non-recurrent congestion is the major cause of freeway delay. The long travel distance complicates the characteristics of traffic flow. Hence, how to improve the prediction capability of long-distance travel time in the case of non-recurrent congestion is an important issue that must be overcome in the field of travel time prediction. This study constructs the travel time prediction model for a segment of 36.1 kms (including eight interchanges) in the National Freeway No. 1, Taiwan, by using the multilayer perceptron. To improve the prediction capability of the model in the case of non-recurrent congestion, this study collects data of average spot speed and heavy vehicle volume gathered by dual-loop vehicle detectors, in addition to rainfall and temporal feature. Furthermore, the historical travel time inferred from the original data of electronic toll collection (ETC) system is also used as the input variable, and the actual travel time inferred from ETC is used as the training target to establish a robust prediction model. As suggested by the results of 168 experimental combinations, the most appropriate prediction model established in this study is a highly accurate forecasting model with MAPE of 6.47 %.

A Neural Network Approach for Solving Traffic-Flow Forecasting Based on the Historical Voyage Datasets: A Case Study on Hai Phong Roads

Study on Subway passenger flow prediction based on deep recurrent neural network

Article 04 June 2020

Applying Recurrent Neural Network for Passenger Traffic Forecasting

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The provision of travel time information has been one of the major factors facilitating the advanced traffic management system (ATMS) and advanced traveler information system (ATIS) to become more successful [1]. Furthermore, the establishment of ATMS and ATIS can improve the performance of existing transportation systems, make more efficient use of limited resources, and reduce pollution emissions to slow down the global warming ultimately. Hence, travel time prediction has been a research topic of concern and attention. The long-distance travel time prediction can effectively provide alternative freeway route information to facilitate ATMS and ATIS to be more successful. However, the longer section of freeway contains more interchanges, leading to more complex changes in the characteristics of traffic flow and thus higher difficulty of travel time prediction. Hence, the prediction of long-distance travel time becomes a major issue that must be overcome in the area of travel time prediction. Nevertheless, the development of continuous travel time prediction model will encounter the traffic condition of non-recurrent congestion as a result of incidents.

The study of Oak Ridge National Laboratory [2] pointed out that 55 % of the delays drivers encounter in American freeways are caused by non-recurrent events, 72 % of which are freeway accidents [3]. Therefore, improving the accident prediction capability [4, 5], finding the key accident-related variables [6], and estimating the impact [7–9] are all issues that should be addressed seriously by research institutions and management units. However, due to the gap between accident reporting time and the occurrence time, the important parameters to measure traffic performance such as the accident disposal time and number of closed lanes cannot be accurately measured at the first time. As a result, how to collect important variables and develop a robust prediction model when it is unable to acquire important accident-related variables in real time to improve the real-time continuous prediction capability in the case of non-recurrent congestion becomes an interesting issue that needs to be further addressed.

As far as prediction technology is concerned, since 1970, researchers have used the autoregressive integrated moving average (ARIMA), Kalman filtering [10, 11], locally weighted regression (LWR) [12], and exponential smoothing (ES) models [13–18] to perform travel time prediction or traffic flow prediction. Furthermore, many successful studies of travel time prediction or traffic flow prediction on freeway in the past also utilized the support vector regression [19], ARIMA-like time series [20], Markov Chains [21], neural networks [22–24], and so on. Regarding the travel time prediction of freeway, studies on topics such as short distance [23, 25], general vehicle flow status (excluding non-recurrent congestion) [19, 21, 24, 26], and peak hour [22] have achieved good results. Van Lint et al. [23] pointed out that the travel time would have a larger variance when congestion occurs. Furthermore, since the freeway delay is mainly caused by non-recurrent congestion events [2], improving the capability of the prediction model in the case of non-recurrent congestion is an important issue that needs to be addressed. Fei et al. [27] presented a Bayesian inference-based dynamic linear model to predict online short-term travel time on a freeway section under both recurrent and non-recurrent traffic conditions. In recent decades, the artificial neural network (ANN) has been widely applied in the areas of traffic flow prediction, speed prediction [28–31], and travel time prediction [32]. Additionally, ANN has been successfully applied in other areas such as water quality prediction [33] and automotive price forecasting [34]. From the results of Najah et al. [33], the radial basis function neural network outperforms the linear regression model and the multilayer perceptron (MLP). In [34], Reza Peyghami and Khanduzi proposed a hybrid learning approach based on the genetic algorithm and least square method to obtain the weights of neural networks. Previous research findings (e.g., [28–32]) showed that the MLP have relatively high degree of robustness and prediction capability in the case of complex, nonlinear, and hardly predictable issues. Therefore, this study attempts to employ the MLP network as the travel time prediction tool in the case of freeway with non-recurrent congestion.

This study collects the characteristics of traffic on freeway with non-recurrent congestion and develops a travel time prediction model of long distance on freeway by using MLP. The remainder of this paper is organized as follows. Section 2 elaborates on the variable selection process. Section 3 presents the travel time prediction model in the case of freeway with non-recurrent congestion. The data distribution is illustrated in Sect. 4. Thereafter, the experimental process and results are presented in Sect. 5. Finally, conclusions of this study are drawn in Sect. 6.

2 Variable selection

The study of travel time can be done by simulation or estimation. In terms of travel time prediction by estimation, selecting the significant variables to reflect the characteristics of traffic flow is the key in improving the prediction and estimation capability of models. Chang [4] pointed out that factors affecting the characteristics of traffic flow can be divided into three categories including geometric variables, traffic characteristics, and environmental factors. The geometric variables include variables such as the degree of horizontal curve and vertical grade. The traffic characteristics include variables such as average daily traffic (ADT) per lane, trucks percentage, bus percentage, and peak hour factor (PHF). The environmental factors mainly include the number of days with precipitation. The research findings in Chang [4] indicated that rainfall and bus percentage are important variables to explain accidents. Wei et al. [8] utilized the traffic, time, space, and geometric attributes to analyze the accident lasting time and achieve good results. In Wei et al. [8], the traffic data including the speed and traffic flow were collected by dual-loop vehicle detectors (VDs). As this study predicts the travel time at every 5 min and the important variables in analyzing traffic flow characteristics such as geometric and space attributes do not vary significantly in the short-term continuous prediction model, this study collects data regarding traffic characteristics, time, and environmental factors.

Data collection methods can be divided into the spot and spatial collection methods. The main techniques of spot data collection method include the inductance loop detectors, microwave, infrared, and radar. Traffic variables such as the space-mean-speed, vehicle type, and traffic flow can be collected by the above methods. The data collection method of using inductance loop detectors is most widely employed in Taiwan. Yeon et al. [21] pointed out that using the traffic flow and average spot speed collected by dual-loop VDs to predict the travel time of general traffic status (congestion not due to weather, accidents, incidents, or work zones) can achieve good estimation results. Yuan et al. [35] indicated that variables of speed, occupancy, and volume are also the important factors to capture traffic characteristics. Additionally, Chang et al. [36] pointed out that bus percentage is an important variable for accident analysis, indicating that the flow of heavy vehicle is an important variable affecting the characteristics of traffic flow. As the freeway segment in this study is the main connection road of major economically developed areas including Hsinchu Science-Based Park, Jungli Industrial Park, Taoyuan International Airport, Songshan International Airport, Taipei Port, and Keelung Port, the characteristics of traffic flow are affected by complex economic activities. In addition to reflecting the economic activities, the larger volume of heavy vehicle has a greater impact on the moving efficiency of overall traffic flow due to different speed limits for heavy vehicles and small vehicles on the National Freeway No. 1, Taiwan, as well as the relatively poor climbing and lane-changing capability of heavy vehicles. In view of this, this study collects the heavy vehicles flow via the dual-loop VDs as the input variables of prediction model.

Furthermore, in terms of the spatial data collection methods, the travel time prediction of freeway with non-recurrent congestion can mainly be conducted by automatic vehicle identification (AVI) [37, 38] and probe vehicle technology [39]. The actual travel time of the freeway segment under study can be collected by AVI and probe systems such that the reliability of prediction model can be guaranteed. It is also because the AVI system does not need to overcome the problems such as the error resulted from positioning by using global positioning system (GPS) and time delay of data feedback in the data acquisition process, and the AVI system has advantages such as higher accuracy and timeliness as compared with the probe vehicle technology. Due to the higher establishment cost of AVI system, previous studies were limited in road segment under study and number of samples. In Taiwan, the ETC system was established in 2006; the ETC system covers the entire National Freeway No. 1. Up to the end of October 2009, the utilization rate has reached 36.48 %, and there are a total of 16,247,908 charge records in October 2009 with charge success rate being at 99.9984 %. Therefore, through the ETC system, the data of travel time can be collected on a long-road segment and the number of samples can also considerably increase to ensure the representativeness of the samples. In this study, the original data are collected and the actual travel time is calculated as the training target through the ETC system. In addition, generally speaking, traffic characteristics vary in weekdays and weekends. Hence, the different encoding schemes for the day of the week are also the important variable in mastering the characteristics of traffic flow.

To summarize, in addition to integrating important variables affecting the characteristics of traffic flow such as rainfall, the day of the week, morning and afternoon, spot speed, and heavy vehicle volume, this study further integrates the historical travel time inferred from the original ETC data and utilizes the actual travel time as the training target to establish a robust travel time prediction model for the freeway with non-recurrent congestion.

3 Travel time prediction architecture

The procedure of travel time prediction in this study is illustrated in Fig. 1. In this study, the data including rainfall, speed and heavy vehicle volume collected by VDs, historical travel time and actual travel time transformed from ETC were used to build the model of travel time prediction. With the results of data collection, the relatively stable traffic parameters detected by the dual-loop VDs are selected as the input variables of VD to avoid the deviation of prediction results from the actual traffic flow due to the over-imputation of missing data. Missing data could be a problem in the process of collecting original data of various attributes. The suitable imputation approach could reflect the actual characteristics of traffic flow and improve the application of the continuous prediction model. In addition, this study calculates the historical travel time and actual travel time based on the original ETC data. The AVI algorithms proposed by the Southwest Research Institute [40] and Transmit [41] are used to identify the consecutive trips and compute the travel time. After the steps of data collection, summarization, and computation, various experimental combinations are designed to understand the impact of different variable combinations on travel time prediction. In order to build a robust prediction model, this study integrates data of various attributes by using MLP network to improve the capability of travel time prediction model in the case of the freeway with non-recurrent congestion.

3.1 Data collection

National Freeway No. 1 is the main inter-city transportation corridor for the west coast of Taiwan. In a total length of 373 km, National Freeway No. 1 totally has 20 toll stations. In this study, data of VD, ETC, accident, and rainfall were collected from September 16 to October 16, 2009, between the Yangmei Toll Station and Taishan Toll Station of the freeway in northward direction. Figure 2 illustrates that the freeway segment in this study includes a total of six interchanges and two system interchanges with a total length of 36.1 km. Moreover, according to the statistics of September and October 2009 of the Taiwan Area National Freeway Bureau, MOTC, the ADT volumes of Yangmei Toll Station and Taishan Toll Station were, respectively, 111,938 and 224,957 vehicles, which approximately account for 23.5 % of the ADT volume of National Freeway No. 1. It thus can be seen that the freeway segment in this study covered the busiest freeway section of the National Freeway No. 1. In this study, speed and heavy vehicle volume were collected at a 5-minute interval by the dual-loop VDs (a total of 22 VDs) through the database of Traffic Control Center of Taiwan Area National Freeway Bureau, MOTC. The original toll charging time of ETC users was also collected. The rainfall data were collected from the database of Central Weather Bureau (data from three rainfall detectors). Moreover, accident data were collected from the accident database of National Freeway Police Bureau. The above databases are established by Taiwan’s governmental agencies to permanently collect the most complete and real-time data for information dissemination, management, and research use.

3.2 Data availability checking

Regarding the complex traffic environment, the more complete data for representing the traffic characteristics can better improve the prediction capability of nonlinear models. In light of this, this study collected data of all dual-loop VDs in a total number of 22 on the freeway segment in this study. However, data credibility is the most important and basic requirement for model building. Selection of VDs of high stability can further ensure data credibility and improve model applicability. Regarding the data collected by VDs in this study, the VDs with missing data for more than 2 h were regarded as unstable and were eliminated from the model building. In the end, 11 VDs of relatively high stability were selected for model building. The number of VDs for data collection and number of VDs applied in this study on various freeway sections in this study are illustrated in Fig. 3.

3.3 Missing data processing

Although automatic data collection has advantages such as long time collection, wide range investigation, smaller error and consistency, routine maintenance, construction, cable theft, weather conditions and other force majeure events may result in system failure or poor stability, leading to the unavoidable problem of missing data. The missing data may be deleted or imputed. The data imputation can be processed in the following three ways. First, the imputation is performed by using the historical data of the same time on different dates in the original spot of data collection, and the data of closer dates or data with same characteristics have a higher priority for imputation. Second, in the same spot of data collection, the data of Time t is imputed based on the data of Time t – n by using the arithmetic mean method, simple weighting method, ES method, etc. Third, the imputation is performed by using the data collected in upstream and downstream spots, and the closer spot and the spot belonging to the same group have a higher priority.

The missing data of Taiwan’s ETC system may occur at a particular Time t due to the following reasons: (1) equipment maintenance; (2) judgment of non-continuous trips when the travel time at Time t deviates from that at Time t – 1 over 40 %; or (3) no trip recorded as a result of no ETC vehicle passing through the toll station, resulting in the lack of travel time samples. In this study, the ETC-based actual travel time is used as the target for model training, validation, and test, and the historical travel time is used as an input variable. To avoid inconsistency between the result of model training and the real-world situation as a result of data imputation error, the sample at Time t with missing data of actual travel time and historical travel time is deleted. Furthermore, the missing data in the VD data collection process can be categorized into three cases and are imputed accordingly. These three cases are described as follows. Case 1: vehicle detector j $ ({\text{VD}}_{j} ) $ has a single missing data at Time t, but there are data at Time t – 1. Case 2: vehicle detector j $ ({\text{VD}}_{j} ) $ causes multiple missing data, and there are no missing data in the upstream and downstream VDs of $ ({\text{VD}}_{j} ) $, that is, there are missing data at Times t and t – 1, and there are no missing data in the upstream and downstream VDs. Case 3: vehicle detector j $ ({\text{VD}}_{j} ) $ causes multiple missing data, and there are missing data in the upstream and downstream VDs of $ ({\text{VD}}_{j} ) $, that is, there are missing data at Times t and t – 1, and there are missing data in the upstream and downstream VDs. Notice that, for the imputation of missing data of heavy vehicles, the heavy vehicle volume at the time with the speed closest to that of Time t within the previous half hour is used to impute the missing heavy vehicle volume of Time t in $ ({\text{VD}}_{j} ) $. For example, if the speed at Time t – 1 is closest to that at Time t, the missing heavy vehicle volume at Time t is imputed by that at Time t – 1. This way, the impact of factors such as different VD detection quality at various observation spots and different traffic characteristics were taken into consideration. Hence, filling the missing data with on-time data of the same observation spot is an effective method to reflect the traffic characteristics of the observation spot. For Case 1, simple weighting method, that is, $ {\text{Speed}}_{j} (t) = {\text{Speed}}_{j} (t - 1) $ and $ {\text{HVV}}_{j} (t) = {\text{HVV}}_{j} (t - 1) $, is used to impute the missing data. For Cases 2 and 3, the third data imputation method presented in Sect. 3.3 is used. In summary, for the above-mentioned three cases, the procedure of data imputation is described as follows.

Step 1: If the missing data in the data collection process of VD conform to Case 1, go to Step 2. Otherwise, go to Step 4.
Step 2: Find the speed and heavy vehicle volume of $ ({\text{VD}}_{j} ) $ at Time t – 1 from database, and they are recorded as $ {\text{Speed}}_{j} (t - 1) $ and $ {\text{HVV}}_{j} (t - 1) $, respectively.
Step 3: Set $ {\text{Speed}}_{j} (t) = {\text{Speed}}_{j} (t - 1) $ and $ {\text{HVV}}_{j} (t) = {\text{HVV}}_{j} (t - 1) $, and go to Step 17.
Step 4: If the missing data in the data collection process of VD conform to Case 2, go to Step 5. Otherwise, go to Step 12.
Step 5: Record the data of $\left\{{{\text{VD}}_{j} (t) > 0} \right\}$.
Step 6: According to the VDj grouping mark, find out the data $\left\{{{\text{VD}}_{j}^{k}>0}\right\}$.
Step 7: The $ {\text{VD}}_{j} $ that is closer to $ {\text{VD}}_{m} $ (i.e., min distance $ ({\text{VD}}_{j} , {\text{VD}}_{m} ) $) has the higher priority of data imputation.
Step 8: Impute the speed of $ {\text{VD}}_{j} $ at time t by using the LRM model and set $ {\text{speed}}_{j} (t) = a + b \times {\text{speed}}_{m} (t) $.
Step 9: Find the speed of $ {\text{VD}}_{j} $ within a half hour of Time t that is closest to $ {\text{speed}}_{j} (t) $ $ (\min \left\{ {\left| {{\text{speed}}_{j} (t) - {\text{speed}}{}_{j}(t - i)} \right|} \right\},\quad i = 1,2, \ldots ,6) $. Impute the heavy vehicle volume of Time t by setting $ {\text{HVV}}_{j} (t) = {\text{HVV}}_{j} (t - i) $.
Step 10: If the missing data in the data collection process of VD conform to Case 2, go to Step 11. Otherwise, go to Step 15.
Step 11: If the consecutive time of data imputation is more than 2 h, stop imputing the data and delete the following consecutive missing data. Otherwise, repeat Steps 5–9 until finishing the imputation of missing data and go to Step 17.
Step 12: Case 3. Record the data of $\left\{{{\text{VD}}_{j} (t) > 0} \right\}$.
Step 13: Check whether $ {\text{VD}}_{j} $ of the same group K have data at Time t, $ {\text{VD}}_{j}^{k} $. If so, select the VD with data of the same group K as the object of data imputation, $\left\{{{\text{VD}}_{j}^{k}>0}\right\}$. Otherwise, select $ {\text{VD}}_{j} $ with data of a different group η as the object of data imputation, $\left\{{{\text{VD}}_{j}^{\eta} > 0} \right\},\quad \eta\ne K$.
Step 14: Repeat Steps 7–9.
Step 15: Check whether the missing data in all VDs at Time t have been imputed. If so, go to Step 16. Otherwise, impute the missing data of next VD and repeat Steps 13 and 14.
Step 16: If the consecutive time of data imputation is more than 2 hours, stop imputing the data and delete the following consecutive missing data. Otherwise, repeat Steps 12–13 until all missing data are imputed and go to Step 17.
Step 17: Check whether all missing data have been imputed. If so, stop the data imputation process. Otherwise, go to Step 1.

If the missing data of VD do not fit into any above-mentioned case, this sample is deleted. In addition, the sample is deleted if data at Time t are regarded as abnormal. The driving speed more than 120 km/h is regarded as abnormal since the speed limit on the freeway segment in this study is 120 km/h. Moreover, from the statistics of traffic volume in September 2009 reported by the Taiwan Area National Freeway Bureau, the percentage of heavy vehicle volume was between 10.0 and 17.1 %. Accordingly, it would be regarded as abnormal if the heavy vehicle volume is more than 150 at Time t. After the above data preprocessing, a total of 7,908 samples were acquired for model building.

3.4 Computation of historical travel time and actual travel time

The ETC system collects the times of a vehicle passing through the upstream point A and the downstream point B by identifying ID, and the AVI system collects the times by identifying the vehicle license plate. Although the technologies of vehicle identification and time collection adopted by ETC and AVI are different, the logic of travel time computation are applicable in both systems. Therefore, in this study, the ETC charging times of freeway users were collected to calculate the historical travel time and actual travel time by using the algorithms developed by Southwest Research Institute [40] and Transmit [41]. Regarding the computation of travel time by AVI system, Southwest Research Institute [40] developed the TransGuide and TranStar algorithms. Both algorithms employ the concept of rolling average algorithm to automatically calculate the travel time. Equation (1) expresses the set, $ {\text{Ctt}}_{ABt} $, for computing the travel time by using the SwRI algorithm.

$$ {\text{Ctt}}_{ABt} = \left\{ {t_{Bi} - t_{Ai} \left| {t - t_{r} \le t_{Bi} \le t} \right.\;{\text{and}}\;{\text{Btt}}_{ABt} (1 - l_{\text{th}} ) \le t_{Bi} - t_{Ai} \le {\text{Btt}}_{ABt} (1 + l_{\text{th}} )} \right\} $$

(1)

Equation 1 is utilized to estimate the travel time of a vehicle passing through two AVI readers, which are the upstream point A, $ t_{Ai} $, and the downstream point B, $ t_{Bi} $. To avoid the data of abnormal travels (detour and parking) from affecting the estimation of travel time, if the travel time $ (t_{Bi} - t_{Ai} ) $ of vehicle i passing through points A and B of AVI readers is more than the link threshold parameter, $ l_{\text{th}} $, the data of this travel will be eliminated. The threshold, $ l_{th} $, in both TransGuide and TranStar is set to 0.2. That is, if the travel time of vehicle i is lower or more than 20 % of the previous average travel time,$ {\text{Btt}}_{ABt} $, this travel will be regarded as abnormal and will not be included in the travel time computation. Furthermore, regarding the observation window, $ t_{r} $, of travel time data set, $ {\text{Ctt}}_{ABt} $, the observation window is set to 2 min in the TransGuide algorithm, that is, the average travel time of all trips within 2 min is computed by using Eq. (2). It takes form as follows:

$$ {\text{Ott}}_{ABt} = \frac{{\sum\nolimits_{i = 1}^{{{\text{Ctt}}_{ABt} }} {(t_{Bi} - t_{Ai} )} }}{{{\text{Ctt}}_{ABt} }} $$

(2)

However, the TranStar algorithm differs from the TransGuide algorithm in fix window concept as it simultaneously renews the travel time data set and computes the average travel time if AVI readers obtain new travel time samples [37].

The computation logic of Transmit algorithm is very similar to those of TransGuide and TranStar. The main difference is that the Transmit algorithm does not use the concept of rolling average algorithm to calculate the travel time in line with the threshold, but it calculates the travel time within 15 min. In the Transmit algorithm, at each fix time interval, s, it collects the travel time samples of two AVI readers, ns, with an upper limit of 200 samples, and calculates the travel time, $ {\text{Ott}}_{ABS} $, by using Eq. (3) in the time interval [41]. Equation (3) takes form as follows [41]:

$$ {\text{Ott}}_{ABS} = \frac{{\sum\nolimits_{i = 1}^{{\left| {ns} \right|}} {(t_{Bi} - t_{Ai} )} }}{{\left| {ns} \right|}} $$

(3)

In addition, with the travel time database, in which the travel time is computed every 15 min, the actual travel time, $ {\text{Att}}_{ABS}^{\prime \prime } $, can be computed by using Eq. (4). It takes form as follows:

$$ {\text{Att}}_{ABS}^{\prime \prime } = \alpha \times {\text{Htt}}_{ABS} + (1 - \alpha ){\text{Att}}_{ABS - 1}^{\prime \prime } $$

(4)

In Eq. (4), with the historical travel time in period S, $ {\text{Htt}}_{ABS} $, and the actual travel time in period $ S - 1 $, $ {\text{Att}}_{ABS - 1}^{\prime \prime } $, after being adjusted by the smoothing parameter α, the actual travel time in period S, $ {\text{Att}}_{ABS}^{\prime \prime } $, can be estimated. According to [37], in the case of no incident detected, smoothing parameter (α) is set to 0.1, whereas in the case of incident detected, smoothing parameter is set to 0. To prevent the characteristic of non-recurrent congestion due to accidents from affecting the normal traffic flow represented by the historical database, this database of travel time does not include the travel time data after accidents. Therefore, Transmit algorithm is limited to the case of stable traffic flow without accidents and to the area with a linear change in actual travel time and historical data. Although the historical travel time can reflect some conditions of traffic flow, the accidents occur randomly, and it is difficult to accurately present the characteristics of complex traffic conditions by using linear equations. Hence, in this study, the historical travel time and actual travel time in the ETC database are computed by using the AVI travel time algorithm. Moreover, the MLP network is used to develop a robust model to predict the freeway travel time in the case with complex and nonlinear traffic characteristics.

3.5 The MLP-based travel time prediction model

The neural network can build nonlinear models and furthermore exclude the disadvantage of setting up several assumptions when building models by the multiple linear regression method and Auto-regressive Integrated Moving Average Model (ARIMA) [42]. According to the survey from 1992 to 1998 by Vellido et al. [43], about 78 % of studies using neural networks to business-related area employed Back-Propagation Neural Network (BPN). The architecture of BPN is a multilayer feed-forward network with supervised learning, and thus it is also termed as MLP [44]. It inputs the training samples into the network while transmitting the outputs to allow the network to learn the mapping between the input and output variables. BPN is mainly composed of input layer, hidden layer, and output layer. In this study, the input layer is mainly used to receive the input variables, specifically, such as rainfall, speed and heavy vehicle volume collected by VDs, the day of the week, historical travel time collected by ETC, and time (AM or PM). Figure 4 illustrates the structure of BPN applied in this study.

This study employs the SAS Enterprise Miner, version 5.3, to build the prediction model of freeway travel time. The number of hidden nodes is an important parameter affecting the prediction performance of MLP. Hence, in the case of common parameter settings (see Table 1) for various experimental combinations, this study evaluates the prediction performance of different numbers of hidden nodes. The root mean square error (RMSE) is used to measure the prediction performance, and the lower RMSE value represents the better prediction performance. Finally, the number of hidden nodes with the lowest RMSE is used to develop the prediction model for each experimental combination, and the prediction results of all experimental combinations are compared in this study.

Table 1 The setting of MLP parameters

Identifying important variables for predicting travel time of freeway with non-recurrent congestion with neural networks

Abstract

Similar content being viewed by others

A Neural Network Approach for Solving Traffic-Flow Forecasting Based on the Historical Voyage Datasets: A Case Study on Hai Phong Roads

Study on Subway passenger flow prediction based on deep recurrent neural network

Applying Recurrent Neural Network for Passenger Traffic Forecasting

Explore related subjects

1 Introduction

2 Variable selection

3 Travel time prediction architecture

3.1 Data collection

3.2 Data availability checking

3.3 Missing data processing

3.4 Computation of historical travel time and actual travel time

3.5 The MLP-based travel time prediction model

4 Data analysis

4.1 ETC

4.2 Rainfall

4.3 Accidents

4.4 Current travel time analysis

5 Experiments

5.1 Experimental design

5.2 Analysis of results

5.2.1 Analysis of rainfall variable

5.2.2 Analysis of speed and heavy vehicle volume collected by VDs

5.2.3 Analysis of encoding scheme of the day of the week variable

5.2.4 Analysis of historical travel time collected by ETC

5.2.5 Analysis of time variable

5.3 Discussion

6 Conclusions

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation