IntRoute: An Integer Programming Based Approach for Best Bus Route Discovery

Sung, Chang-Wei; Yang, Xinghao; Liao, Chung-Shou; Liu, Wei

doi:10.1007/978-3-030-73200-4_44

Chang-Wei Sung¹⁶,
Xinghao Yang¹⁷,
Chung-Shou Liao¹⁶ &
…
Wei Liu¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12683))

Included in the following conference series:

International Conference on Database Systems for Advanced Applications

2417 Accesses
1 Citations

Abstract

An efficient data-driven public transportation system can improve urban potency. In this research, we propose IntRoute, an Integer Programming (IP) based approach to optimize bus route planning. Specifically, IntRoute first contracts bus stops via clustering and then derives a new bus route via a mixed integer linear program (ILP). This two-phase strategy brings three major merits, i.e., a single bus route without any transfer, the minimal total time consuming, and an efficient optimization algorithm for large-scale problems. Experimental results show that our IntRoute significantly reduces the traditional commuting time in Sydney from 31.53 min down to 18.06 min on average.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Rural bus route design problem: Model development and case studies

Article 01 December 2014

Offering a New Bus Route Between Campus and Bus Terminal Using Shortest Path Algorithm

Network-level Optimization of Bus Stop Placement in Urban Areas

Article 01 April 2018

1 Introduction

In this study, we consider an important data-driven public transportation problem: finding the best bus route that minimizes passengers’ overall commuting time. Given a bus transportation system as well as the requests of specific passengers who commute from different starting locations to a fixed destination, the goal of the problem is designing a new bus route to satisfy the passengers’ demand without any transfer.

Previous studies on bus transfer problems were mostly not data-driven [2] due to data skewness problems [3]. In this work, we proposed a new integer-programming based method, which we call IntRoute, to find the route that minimizes the total time cost of the targeted passengers. Specifically, our IntRoute contains two main phases, i.e., the contraction of bus stops via K-means clustering and the derivation of new bus route via a mixed integer linear programming (ILP). The major contributions of this research are listed below.

We design a single bus route in which all passengers with an identical destination are delivered without any transfers.
We present a two-phase framework to minimize the total time expense of the specific passengers.
We develop a genetic algorithm (GA) to solve the integer linear programming (ILP) for large-scale instances.

2 Methodology

The main framework of our two-phase IntRoute method are shown in Fig. 1.

Phase 1: Contraction of Bus Stops. In the first phase of IntRoute, we contract multiple requests into a super node by using clustering approaches and determine a pickup bus stop for all the requests inside the super node. In each super node, passengers are asked to walk to the pickup bus stop and wait for buses. The walking time is counted into the total commuting time. We employ K-means clustering, as it consumes the least walking time compared with hierarchical clustering and density-peaks clustering. Then we exploit silhouette method and elbow method to determine the number of pickup stops, i.e., the cluster number n. Both methods indicate the best clustering number should be $n=20$. Therefore, this phase finds 20 pickup stations that minimizes the passengers total walking time.

Phase 2: Design of One Alternate Route. In the real-world transportation network, an arbitrary node pair (i, j) may be connected (Fig. 2 left). To solve the problem via mathematical IP model, we introduce the multi-level graph $G'$ (Fig. 2 right), where each level represents a possible sub-route from one stop to the next one. The red paths in G and $G'$ are equal. The graph $G'$ indicates the order of visiting the pick-up stops directly by the levels.

Modelling via Integer Programming. We denote the node set as $N=\{ 1,2,\dots ,n \}\cup \{ S,T \}$, where S and T represents the source and destination, respectively. The arc set is $A=\{ (i,j)|i\in N,\ j\in N, \ \text {and} \ i\ne j \}$. The travel time from node i to node j is denoted as $c_{ij}$, and $r_{i}$ is the number of passengers who want to go to the destination. $x_{ij}^l\in \{ 0,1 \}$ represents a binary variable, indicating whether the bus goes from node i to node j at level l on $G'$. $y_{ijk}^l\in \{ 0,1 \}$ represents a binary variable which denotes if request k travels through arc (i, j) at level l. $v_{k}^l\in \{ 0,1 \}$ represents a binary variable, indicating if request k is served at the level l. Formally, the IP model is formulated as:

$$\begin{aligned} \nonumber&\text{ min }\sum \limits _{l}\sum \limits _{k}\sum \limits _{(i,j)\in A}^{} y_{ijk}^{l} c_{ij} r_{k}, \quad s.t.\\&{\sum }_{(i,j)\in A} x_{ij}^{l}=1, for\ l=0,1,\dots ,n \end{aligned}$$

(1)

$$\begin{aligned}&{\sum }_{(j,i)\in A} x_{ji}^{l}={\sum }_{(i,k)\in A}x_{ik}^{l+1},\forall i;\forall l \end{aligned}$$

(2)

$$\begin{aligned}&{\sum }_{(S,i)\in A} x_{Si}^{0}={\sum }_{(i,j)\in A}x_{ij}^{1} \end{aligned}$$

(3)

$$\begin{aligned}&{\sum }_{(i,j)\in A} x_{ij}^{n-1}={\sum }_{(j,T)\in A}x_{jT}^{n}\end{aligned}$$

(4)

$$\begin{aligned}&{\sum }_{i}{\sum }_{l} x_{ij}^{l}\le 1, for\quad j=1,2,\dots ,n,T \end{aligned}$$

(5)

$$\begin{aligned}&v_k^l\le v_k^{(l+1)},\forall k, \forall l \end{aligned}$$

(6)

$$\begin{aligned}&{\sum }_{k} v_k^l= l, for\ l=1,2,\dots ,n\end{aligned}$$

(7)

$$\begin{aligned}&x_{ij}^{l} \le v_{i}^{l},for\ (i,j) \in A, l=1,2,\dots ,n \end{aligned}$$

(8)

$$\begin{aligned}&y_{ijk}^{l} \le (x_{ij}^{l} + v_{k}^{l})/2,\ \forall (i,j) \in A, \forall k, \forall l \end{aligned}$$

(9)

$$\begin{aligned}&y_{ijk}^{l} \ge (x_{ij}^{l} + v_{k}^{l})-1,\ \forall (i,j) \in A, \forall k, \forall l \end{aligned}$$

(10)

$$\begin{aligned}&x_{ij}^{l}\in \{ 0,1 \};\ v_k^l\in \{ 0,1 \};\ y_{ijk}^{l}\in \{ 0,1 \} \end{aligned}$$

(11)

where (1) ensures that each level is exactly passed once. (2)–(4) ensure that flow conservation of the graph. (5) ensures that each node can be entered at most once. (6) ensures that request k must be served at level $l+1$ if it is served at level l. (7) ensures that l requests are on the bus when the bus is serving level l. (8) ensures that if arc (i, j) is picked at level l, request i should be served at level l. (9) and (10) ensure that the request k is served at level l on the arc (i, j) if both $x_{ij}^{l}$ and $v_{k}^{l}$ are equal to one. (11) shows $x_{ij}^{l}$, $v_{k}^{l}$, and $y_{ijk}^{l}$ are all binaries.

Optimization via Genetic Algorithm. We design a genetic algorithm (GA) to solve this IP problem. Specifically, the chromosome represents possible sequence of the node set $\{1,2,\dots ,n \}$, and the population is a set of chromosomes.

As shown in Algorithm 1, a new chromosome can be generated by the crossover between two parent bus routes with a probability of crossover rate $R_c$. The mutation is defined as the position exchange between two randomly selected near-by bus stoops with a probability of $R_m$. There are two steps in our GA: (1) the randomly population initialization with a given size M, and (2) the population evolution for T generations by crossover and mutation according to a fitness function. We adopt a 2-OPT technique to avoid the cross sub-paths. Besides, we propose a decomposition technique that clusters the optimal route into three sub-route via k-means. We concatenate the clusters in a reverse order, i.e., from the destination node to the start node. This strategy greatly reduces the travel time.

3 Experiments and Analysis

Data. The experiment is performed based on publicly available real-world commuting data, retrieved from the card-based transit payment system^{Footnote 1} in Sydney, Australia, including approximately three million trips.

Table 1. Routing time before optimization

Full size table

Results. The routing time before optimization are listed in Table 1, where routes are represented by the IDs of the bus stops. Here C denotes the time cost without transfer and without optimization (also demoed in Fig. 3), and $C'$ denotes that of the original commute with transfers. The new route after our optimization is shown in Fig. 4, while the time costs of the bus routes optimized by different methods are listed in Table 2.

Conclusions. Our IntRoute method greatly reduces the time expense for passengers from 31.53 min to 18.06 min on average, saving about 43% of commute time. In future, we plan to investigate more optimization methods for further improving our solutions.

Table 2. Commute time

Full size table

Notes

1.
https://opendata.transport.nsw.gov.au/dataset/opal-tap-on-and-tap-off.

References

Li, L., Fu, Z.: The school bus routing problem: a case study. J. Oper. Res. Soc. 53(5), 552–558 (2002). https://doi.org/10.1057/palgrave.jors.2601341
Article MATH Google Scholar
Liu, J., Mao, J., Du, Y.T., Zhao, L., Zhang, Z.: Dynamic bus route adjustment based on hot bus stop pair extraction. In: Li, G., Yang, J., Gama, J., Natwichai, J., Tong, Y. (eds.) DASFAA 2019. LNCS, vol. 11448, pp. 562–566. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-18590-9_87
Chapter Google Scholar
Liu, W., Chawla, S.: A quadratic mean based supervised learning model for managing data skewness. In: Proceedings of SIAM SDM Conference (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Industrial Engineering, National Tsing Hua University, Hsinchu, Taiwan
Chang-Wei Sung & Chung-Shou Liao
School of Computer Science, University of Technology Sydney, Ultimo, Australia
Xinghao Yang & Wei Liu

Authors

Chang-Wei Sung
View author publications
You can also search for this author in PubMed Google Scholar
Xinghao Yang
View author publications
You can also search for this author in PubMed Google Scholar
Chung-Shou Liao
View author publications
You can also search for this author in PubMed Google Scholar
Wei Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xinghao Yang .

Editor information

Editors and Affiliations

Aalborg University, Aalborg, Denmark
Christian S. Jensen
Singapore Management University, Singapore, Singapore
Ee-Peng Lim
Academia Sinica, Taipei, Taiwan
De-Nian Yang
The Pennsylvania State University, University Park, PA, USA
Wang-Chien Lee
National Chiao Tung University, Hsinchu, Taiwan
Vincent S. Tseng
Athens University of Economics and Business, Athens, Greece
Vana Kalogeraki
National Cheng Kung University, Tainan City, Taiwan
Jen-Wei Huang
National Tsing Hua University, Hsinchu, Taiwan
Chih-Ya Shen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sung, CW., Yang, X., Liao, CS., Liu, W. (2021). IntRoute: An Integer Programming Based Approach for Best Bus Route Discovery. In: Jensen, C.S., et al. Database Systems for Advanced Applications. DASFAA 2021. Lecture Notes in Computer Science(), vol 12683. Springer, Cham. https://doi.org/10.1007/978-3-030-73200-4_44

Download citation

DOI: https://doi.org/10.1007/978-3-030-73200-4_44
Published: 06 April 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-73199-1
Online ISBN: 978-3-030-73200-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

IntRoute: An Integer Programming Based Approach for Best Bus Route Discovery

Abstract

Similar content being viewed by others

Rural bus route design problem: Model development and case studies

Offering a New Bus Route Between Campus and Bus Terminal Using Shortest Path Algorithm

Network-level Optimization of Bus Stop Placement in Urban Areas

1 Introduction

2 Methodology

3 Experiments and Analysis

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

IntRoute: An Integer Programming Based Approach for Best Bus Route Discovery

Abstract

Similar content being viewed by others

Rural bus route design problem: Model development and case studies

Offering a New Bus Route Between Campus and Bus Terminal Using Shortest Path Algorithm

Network-level Optimization of Bus Stop Placement in Urban Areas

1 Introduction

2 Methodology

3 Experiments and Analysis

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation