1 Introduction

The technology selection and design model of Moccia and Laporte (2016) minimizes the sum of passengers and operator costs of a transit line. The passengers’ cost is the sum of access, waiting, riding, and egress time values. The operator cost is the sum of fixed and variable costs of the transit line. We expand this model in five directions with the goal of improving the representation of semi-rapid transit (i.e. with higher quality of service attributes than local service). First, we improve the multi-period representation of the demand profile. Second, we propose a crowding penalty function that reduces the underestimation caused by Jensen’s inequality when using the average vehicle occupancy rate. Third, we consider frequency-dependent travel time delays and platooning. Fourth, we define the maximum frequency as dependent on the longest dwell time. Fifth, we present a refined representation of the operator cost by differentiating between operating and commercial cycle times, and by modeling station construction and maintenance costs as variable with peak-period train length.

On the practical side, we provide a deep analysis of techno-economical parameters of two semi-rapid modes, namely bus rapid transit (BRT), and light rail transit (LRT). We use “mode” here to represent a relatively large set of specific implementations that we indicate as “technologies”. We denote as “semi-rapid” the modes that require a right-of-way (RoW) with partial separation from other traffic. According to the classification scheme of Vuchic (2005), semi-rapid transit occurs in RoWs of category B, whereas properly defined rapid transit requires fully separated RoWs of category A, and local services take place in mixed traffic, RoW of category C (Vuchic et al. 2012).

We examine two scenarios offering comparable performance for each technology in terms of service, rather than assuming that service quality is strongly associated with technology. Two scenarios representing different levels of investment and performance are assessed by the proposed model. Scenario 1 represents operation in an urban roadway in RoW obtained by a simple change in designation of an existing lane, with relatively simple curb stops. Scenario 2 operates in an exclusive at grade RoW where a new laterally separated alignment is developed with more substantial stations and preferential signal priority. These two scenarios differ in performance levels, and, as a result, in productive capacity (Vuchic 2007). They have different operating and maintenance (O&M) and investment costs reflecting their design standards and technology assumptions but have identical values of time (VoTs) for the users.

The remainder of this paper is structured as follows. Section 2 reviews the relevant literature and outlines our approach. The model is presented in Sect. 3. Section 4 discusses the two scenarios and Sect. 5 describes the numerical results. Finally, we present some conclusions in Sect. 6.

2 Literature review

Moccia et al. (2017) establish an equivalence in the objective functions of fixed and elastic demand models for the optimization of a transit line when the elastic demand can be approximated by a linear function of travel time components and fare. This finding reinforces the general result of Daganzo (2012) about the equivalence in the optimality conditions of fixed and elastic demand models for the optimization of public services. These results allow a decoupling between system design and demand forecast. The system modeling effort can therefore be directed toward a more refined representation of the operational details while the demand can be assumed as given. The fixed-demand transit line models of Moccia and Laporte (2016) integrate variable stop spacing and train length, a crowding penalty, and a multi-period generalization in the base model of Tirachini et al. (2010). The research question that we are dealing with is what functional extensions can increase the model’s realism. In Sects. 2.1, 2.2 and 2.3 we discuss some literature on service variability, in-vehicle passenger crowding, and stop spacing, respectively. Our approach with respect to this literature is outlined in Sect. 2.4.

2.1 Travel time, speed, platooning and service variability

Fernandez and Planzer (2002) discuss the relevance of stops on transit capacity analysis given that stops can be more of a constraint on capacity than intersections or links. The authors show that simulation models of stop operations provide a better, and generally larger, estimation of stop capacity than analytical formulae. Tirachini and Hensher (2011) propose a total cost minimization model of a segregated bus line to choose the fare collection system and the infrastructure investment level that determines the bus running speed. To this latter aim, a linear relationship between infrastructure investment and running speed is assumed. High bus frequency induces delays at stops and at intersections. The delay at signaled intersections is estimated by the Akçelik and Rouphail (1993) formula and stops are assumed to be isolated from traffic lights. In the frequency range limited by stop capacity the overflow delay at signaled intersections is negligible. Queuing delay and dwell time at stops are modeled by regressions of simulations where frequency, types of fare payment, and boarding demand are key variables. Simulation results indicate rapidly increasing queuing delays when a threshold boarding demand and bus frequency are reached. These thresholds are higher with more efficient types of fare payment. The authors find that off-board fare payment is the most convenient option under many circumstances and that the optimal running speed grows with the logarithm of the demand.

Quality of service may be affected by bus bunching or platooning, the underlying process for which is well understood. Whenever a bus loses time or schedule it is likely to experience more passenger boardings than the trailing bus. This increases its dwell time at stops, and thus the process is self-reinforcing (Daganzo 2009). Negative consequences of platooning for the passengers are an increase of the average and the variance of the waiting times, crowding on the platforms and inside the vehicles, and discomfort when passengers are not able to board the first incoming bus and they have to relocate along the platform. Real-time information systems such as automatic vehicle location have allowed ambitious strategies to counteract platooning such as, for example, the self-equalizing headways of Bartholdi and Eisenstein (2012). In spite of these advancements, Verbich et al. (2016) report that in a state-of-the-art bus corridor in Portland, Oregon, platooning is still a frequent occurrence.

Service reliability, through its effects on waiting times, can have a significant effect on how users perceive the quality of service. Van Oort (2016) presents a framework to monetize service reliability, and estimates that 2/3 of the total benefits for a tram project in Utrecht, the Netherlands, can be attributed to improvements in service reliability.

2.2 Passenger in-vehicle crowding

Li and Hensher (2013) compare crowding levels as measured by some rail operators in Australia with results of passenger surveys. A significant gap exists between measured and expressed crowding levels. The authors observe that this gap may be caused by excessive aggregation of crowding measures with respect to the time and to the number of stations. Tirachini et al. (2016) estimate sitting and standing crowding multipliers by inferring passenger preferences from the smart card transaction database of the Singapore metro. Some passengers are willing to take a train in the opposite direction of their destination to secure a seat. Analyzing this behavior the authors find that the standing multiplier can be as much as 1.55 with a density of three standees per square meter. Klumpenhouwer and Wirasinghe (2016) define a single-period model of a LRT line where the frequency, the stop spacing, and the demand are given. The demand is centripetal, the passenger cost is related to the in-vehicle crowding disutility, and the operator cost is a function of the fleet size and of the platform length. The optimized variable is the train length. The model shows that crowding can significantly affect the optimal train length. Numerical experiments based on data from Calgary’s C-Train yield train lengths similar to those in current and planned operation in the LRT network. Hörcher et al. (2017) analyze large scale automated demand and train location data from Hong Kong to estimate the user cost of crowding in a revealed preference route choice framework. A linear crowding multiplier model is validated, and this study found that at six passengers per square meter and no chance to find a seat, the value of time circa doubles. Haywood et al. (2017) report results of a survey on stated satisfaction of subway passengers in Paris. Crowding disutility grows linearly with in-vehicle density and it is sensitive to the passenger income. Three causes of crowding disutility are identified: standing probability, poorer use of time, and closeness to other passengers.

2.3 Stop spacing

Stop spacing in passenger transportation studies performed between 1915 and 1930 are reviewed in Vuchic and Newell (1968). These early works were the first to highlight the fact that an optimal stop spacing should be a function of the number of passengers on the vehicle and of those wishing to board it. Vuchic and Newell (1968) generalize their analysis to variable population densities, different access speeds, train kinematics, and competition for a single line where passengers commute to a terminus serving a central business district and the minimization of aggregate passenger travel time is sought. The proposed analytic model can be solved in closed form for the special case of a uniform population density. The optimal stop spacing then follows an arithmetic progression and increases in the direction of cumulation. Optimal stop spacing is found to be highly sensitive and increasing with the ratio between passenger access speed and maximum train speed. Vuchic (1969) considers the maximization of patronage for a rapid transit line in an area with uniform population density, as well as other assumptions as in Vuchic and Newell (1968). This study allows an analytical comparison of the two objectives, namely maximum patronage vs minimum total travel time. In both cases, stop spacings increase in the direction of cumulation, but maximum patronage requires a considerably higher stop density. Kikuchi and Vuchic (1982) study the optimal number of stops and vehicle stopping policy of a transit line where boarding and alighting densities are constant along the line, an assumption that makes this study relevant to transit in central areas, as opposed to the cumulative boarding assumption more relevant to commuting lines, as in the studies previously reviewed. The authors compare two objective functions: minimization of passenger travel time, and minimization of total cost, defined as the sum of passenger and operator costs. For the large passenger volume case, the two objective functions converge to a similar number of optimal number of stops. In the context of transit technology selection, Moccia and Laporte (2016) analytically find that a faster technology requires longer stop spacing, but the ratio of the optimal stop spacings between two technologies follows a square root formula of their speed ratio, i.e. a sublinear relationship.

2.4 Proposed approach

Regarding travel time variability we chose to model a transit system where only a moderate level of this variability occurs at less than the maximum allowed frequency. Semi-rapid bus and rail systems can carry higher demand levels by using higher frequencies than those that we allow, but the resulting travel time variability is not likely to be acceptable for a new design. Regarding travel time speed we propose a scenario analysis because the alignment choice is often severely constrained and few options are available. Regarding platooning, we observe that this issue is tangential to our model’s scope. However, since we consider frequencies that may generate some occurrences of these conditions, we conservatively model their impacts on the passengers by a reduction of the waiting time advantage of those frequencies, and on both the operator and the passengers by a travel time component as an increasing function of the frequency.

Regarding in-vehicle passenger crowding, both the recent survey of Haywood et al. (2017) and the revealed preference studies of Tirachini et al. (2016) and Hörcher et al. (2017) reaffirm the finding of the meta-analysis of Wardman and Whelan (2010) that the crowding multiplier of the base in-vehicle time with respect to the load factor is well approximated by a piecewise linear function which is equal to one up to the load factor corresponding to the seated capacity, and then starts to increase. The optimization model of Klumpenhouwer and Wirasinghe (2016) highlights the relevance of in-vehicle passenger crowding for station design and fleet sizing. Our contribution to this literature thread is that we overcome the inherent underestimation of the crowding disutility when applying directly a piecewise linear function to the average vehicle load along a bidirectional line (we note that this issue does not pertain to the model of Klumpenhouwer and Wirasinghe (2016)). This is accomplished by a new crowding disutility function presented in the next section and derived in “Appendix 2”.

Regarding stop spacing, in the computational experiments we discern between the effects of demand and trip length variations.

3 Model

This section is structured as follows. We first present the principal assumptions and the modeling of the cycle time in Sect. 3.1. The passengers’ time value and the operator cost, which are treated separately and then combined into a total cost, are described in Sects. 3.2 and 3.3, respectively. The side constraints are described in Sect. 3.4, and the resulting optimization model is presented in Sect. 3.5.

Table 1 summarizes the main symbols. Dimensionless parameters are indicated by Greek letters. Additional symbols are derived as explained below. The subscripts min and max specify bounds of a parameter or of a variable. The subscript p refers to a specific period or portion of an operating day.

Table 1 List of primary symbols, and units of measure used in the formulae

3.1 Main assumptions and cycle time

We assume that a bidirectional transit line operates on a route, where by “route” we indicate the physical alignment and the infrastructure, and by “line” we refer to the service operated. The route length is L and the line length is 2L. The transit line serves passengers for \(\hat{p}\) periods. The average bidirectional hourly demand (boardings) \(q_{p}\) in a period is equal to \(q \gamma _{p}\), where p is the period index, \(\gamma _{p}\) is a positive parameter not larger than one, and q is the average hourly demand at the peak period. Without loss of generality we index the periods in decreasing order of demand, the peak period is indicated by \(p = 1\) and thus \(\gamma _{1} = 1\). For each period the maximum demand is \(q_{p} \tau _{p}\), where \(\tau _{p}\) is a parameter larger than one. The ratio of a period’s hours to the total service hours is denoted by \(\chi _{p}\). These demand parameters must reflect the total service hours in a year and they are usually derived by a procedure outlined in “Appendix 1”. The average stop spacing is an optimized variable and is indicated by d. The frequency, f, is expressed as the number of transit units (TUs) per hour. The concept of transit unit, see Vuchic (2005), refers to a set of n physically linked vehicles traveling together. This number is usually called the “consist”. For buses, n is equal to one, whereas for rail technology n can be an integer larger than one. TU is therefore a common concept for both single vehicles and trains used on a transit line. We posit a frequency for each period, namely \(f_{p}\). The TU length for each period is indicated by \(n_{p}\).

We distinguish between operating and commercial cycle times. The operating cycle time is the sum of the running time between stations, including acceleration and deceleration, of the time lost at intersections, and of the dwell time for boarding and alighting. The commercial cycle time is determined as a function of the operating cycle time plus some time components related to providing of variability in running times and operations at terminals.

More formally, we model the operating cycle time as the sum of four terms described in the following. We assume that a TU accelerates up to and decelerates from a speed \(S_{max}\) which is the maximum allowable speed. The TU loses an average \(t_u\) minutes per km. This time loss occurs mainly at intersections and diminishes with higher investments in runningway improvements and traffic signal priority (TSP) systems. By “runningway” we intend the continuous infrastructure occupied by the TUs, including the lanes or tracks, any major structures required to support or accommodate the lanes or tracks, and any continuous systems such as electric traction power supply that is specifically required by a technology.

The average speed excluding user service at stops, \(S_{run}\), is

$$\begin{aligned} S_{run} = \frac{1}{\frac{1}{S_{max}} + \frac{t_u}{60}} && S_{max}[\text {km/h}], t_{u}[\text {min/km}]. \end{aligned}$$
(1)

The resulting running time, denoted R, is equal to \(2L/S_{run}\) and is the first term of the operating cycle time. We assume that on average a TU leaves a stop accelerating up to \(S_{run}\), travels at this speed, and then decelerates to halt at the next stop. Let \(\bar{a}\) and \(\bar{b}\) be the average acceleration and deceleration rates of a TU. The incremental time loss caused by the acceleration and deceleration phases is denoted by \(T_{a}\), and is equal to

$$\begin{aligned} T_{a} = \dfrac{S_{run}}{25920} \left( \dfrac{ 1 }{\bar{a}} + \dfrac{ 1 }{\bar{b}} \right)&&S_{run}[\text {km/h}], \bar{a}, \bar{b} [\text {m/s}^{2}], \end{aligned}$$
(2)

(see e.g. Vuchic and Newell 1968). We add to the standing time a fixed component \(t_{d}\), which accounts for the opening and closing of doors, and we denote by \(T_{l}\) the lost time for acceleration, deceleration, and door opening and closing:

$$\begin{aligned} T_{l} = T_{a} + \frac{t_d}{3600}&&T_{a}[\text {h}], t_{d}[\text {s}]. \end{aligned}$$
(3)

Because the number of stops is equal to 2L/d, the second term of the operating cycle time is \(2LT_l\)/d. The third term of the operating cycle time expresses the load-dependent dwell time which is related to \(T_{b}\), the boarding and alighting time per user of a TU, and the number of passengers using a TU, given by q/f. The boarding and alighting time of a TU depends on the number n of vehicles per TU and their door configuration. We assume a boarding and alighting service time per user of a vehicle as equal to \(t_{b}\), thus \(T_{b} = t_{b}/ n\). We introduce a fourth term of the operating cycle time accounting for extra delays at intersections, links, and stops under high frequencies, where for high frequencies we intend those exceeding a threshold frequency \(\dot{f}\), for example equal to 25 TU/h. As this frequency is reached the design TSP may underperform, the interactions between signals and stops, as well as disturbances induced by trespassing, may become significant. These phenomena warrant a simulation approach for design purposes; here we propose a synthetic representation by a specific term. This term depends on the ratio of the frequency to this threshold frequency elevated to the power of \(\omega _2\), and is proportional by a coefficient \(\omega _1\) to the base operating cycle time loss at intersections, which is equal to \(2t_u L\). The operating cycle time \(t_{oc,p}\) for each period is

$$\begin{aligned}&t_{oc,p}(f_{p},d,n_{p}) = R + \dfrac{2L}{d} T_{l} + \dfrac{t_{b}}{3600} \dfrac{q_{p} }{ n_{p} f_{p}} + \omega _1 \frac{2t_u L}{60}\left( \frac{f_{p}}{\dot{f}}\right) ^{\omega _2} \nonumber \\&\quad t_{b}[\text {s-pax/veh}], q_{p} [\text {pax/h}], T_{l}, R[\text {h}], t_u[\text {min/km}], d, L[\text {km}], \nonumber \\&\quad n_{p}[\text {veh/TU}], f_{p}, \dot{f} [\text {TU/h}]. \end{aligned}$$
(4)

The commercial cycle time is obtained by multiplying the operating cycle time by \(\beta\), a parameter larger than one that accounts for the schedule time recovery at the terminals, and by adding the following two terms. First, a fixed terminal time \(t_{tf}\) that accounts for minimum crew rest, securing vehicle keys, and in some cases navigating a terminal loop. Second, a terminal time proportional to \(t_{tv}\) and variable with the TU length that accounts for the crew walking distance between the TU ends, and safety checks. The commercial cycle time, in the following referred to as cycle time for brevity, is

$$\begin{aligned} & t_{c,p}(f_{p},d,n_{p}) = \beta t_{oc,p}(f_{p},d,n_{p})+ \dfrac{t_{tf}+ t_{tv}n_{p}}{3600} \nonumber \\&\quad t_{oc,p}[\text {h}], n_{p}[\text {veh/TU}], t_{tf}[\text {s}], t_{tv}[\text {s/veh}]. \end{aligned}$$
(5)

We note that the accuracy of \(\beta\) as a linear factor goes down with shorter frequencies because of the rounding requirements within real-world schedules, but the accuracy of \(\beta\) is usually satisfactory at higher frequencies.

3.2 Passengers’ time value

The passenger time value is a monetized value of time composed of three parts: access and egress, waiting, and in-vehicle time values.

Users access to and egress from the nearest stop at speed s. The average distance is d / 4 at the origin and at the destination, the average total access and egress length is d / 2, and the average access and egress time \(t_{a}\) of a user is

$$\begin{aligned} t_{a}(d) = \dfrac{d}{2 s}&&d[\text {km}], s[\text {km/h}]. \end{aligned}$$
(6)

The value of one unit of access and egress time is expressed by the parameter \(V_{a}\), and the average access and egress value \(C_{a}\) is

$$\begin{aligned} C_{a}(d) = V_{a} \dfrac{d}{2 s} q \sum _{p}\gamma _{p} \chi _{p}. \end{aligned}$$
(7)

Waiting time depends on the frequency and we distinguish between low, medium and high frequencies. In the case of high frequencies, defined as those above a threshold frequency \(f_{m}\), users arrive at stops at a constant rate but TU platooning starts to occur. We note that we use the term “platooning” to indicate the unorganized formation of clumps of vehicles due to the discrete red-green operation of traffic signals, not the organized “platoons” of vehicles dispatched together. Because of platooning, the additional TU capacity provided by frequencies larger than \(f_{m}\) do not yield a further reduction of the average headway with respect to \(1/f_{m}\). Thus, the average waiting time \(t_{w}\) can be modeled as a fraction \(\epsilon \ge 1/2\) of the expected headway. Values of \(\epsilon\) strictly larger than 1 / 2 can model cases where the headways have a large variance. In the case of medium frequencies, users arrive at stops at a constant rate and the average waiting time is \(\epsilon /f\). In the case of low frequencies, users follow timetables and arrive at stops w minutes before the expected time of service. The waiting time saved by this behavior still has a cost for the user but is discounted by a factor \(\mu\) less than one, for example \(\mu = 1/3\) as in Tirachini et al. (2010). The threshold frequency for these two former behavior regimes is defined by \(f_{l}\), for example six TU per hour, which results in a headway of 10 minutes. The average waiting time \(t_w\) of a user is

$$\begin{aligned} t_{w}(f) = {\left\{ \begin{array}{ll} \dfrac{w}{60} + \mu \dfrac{\epsilon }{f} &\quad \text {if} \,\,\,\, f < f_{l}\\ \dfrac{\epsilon }{f} &\quad\text {if} \,\,\,\, f_{l}\le f \le f_{m},\\ \dfrac{\epsilon }{f_{m}} &\quad \text {if} \,\,\,\, f > f_{m} \, \,\,\,\, f[\text {TU/h}], w[\text {min}] \end{array}\right. }. \end{aligned}$$
(8)

Figure 1 illustrates this new waiting time function and the difference with respect to a previous study.

Fig. 1
figure 1

An example of the new waiting time function and that of Moccia and Laporte (2016) with \(f_{l}=6, f_{m}=40, \epsilon = 0.5, \mu = 0.33,\) and \(w = 5\)

The average value of waiting \(c_{w}(f)\) borne by q users at the frequency f is

$$\begin{aligned} c_{w}(f) = V_{w} t_{w}(f) q && V_{w}[\$/\text{pax-h}], t_{w}[\text {h}], q[\text {pax/h}], \end{aligned}$$
(9)

where \(V_{w}\) is the value of one waiting time unit. The average value of waiting for the \(\hat{p}\) periods, \(C_{w}\), is

$$\begin{aligned} C_{w}(\mathbf {f}) = \sum _{p}c_{w}(f_{p}) \gamma _{p} \chi _{p}&& c_{w}[\$/\text{h}], \end{aligned}$$
(10)

where we indicate by \(\mathbf {f}\) the vector of \(\hat{p}\) frequencies.

The average in-vehicle time \(t_{v}\) of a user is modeled as a fraction of the operating cycle time \(t_{oc}\). This fraction is equal to the ratio of the average trip length l to the total distance 2L covered by a TU in a cycle. For notational compactness we indicate by \(\lambda\) the ratio of the average trip length to the route length, \(\lambda = l/L\), and thus:

$$\begin{aligned} t_{v} = \dfrac{\lambda }{2} t_{oc}\quad t_{oc}[\text {h}]. \end{aligned}$$
(11)

The value of the in-vehicle travel time is multiplied by the crowding penalty function \(\Delta\) introduced in “Appendix 2”. This penalty function aims at reducing underestimation of crowding when it is synthetically applied to the average of the vehicle occupancy rate \(\bar{\theta }\). As explained in “Appendix 2”, this improvement is accomplished by the introduction of a parameter \(\phi\) belonging to the interval [1, 2] and such that \(\phi \bar{\theta }\) is the maximum value of the vehicle occupancy rate. Figure 2 illustrates this new penalty function and the difference with respect to previous studies.

Fig. 2
figure 2

An example of the crowding penalty function. With \(\phi = 1\) the occupancy rate is constant and the crowding penalty is the same as that of Moccia and Laporte (2016). With \(\phi = 2\) a maximum correction is applied

At each period, the average occupancy rate \(\bar{\theta }\) depends on the frequency and on the TU length. For notational compactness in the following two equations we have dropped the period subscript. The average occupancy rate is

$$\begin{aligned} \bar{\theta }(f,n) = \dfrac{lq}{2Lknf},&\end{aligned}$$
(12)

where we denote by k the passenger capacity of a vehicle, and therefore the capacity K of a multi-unit TU is equal to \(k \times n\). The penalty function \(\Delta\) is here expressed as a function of the frequency and of the TU length:

$$\begin{aligned} \Delta (f,n) = {\left\{ \begin{array}{ll} 1 + \rho \left( \dfrac{lq}{2Lknf} - \theta _{min}\right) &\quad nf \le \dfrac{lq}{2Lk\theta _{min}} (2 - \phi )\\ 1 + \rho \left( \phi \dfrac{lq}{2Lknf} - \theta _{min}\right) ^{2} \dfrac{Lknf}{2lq(\phi - 1)} &\quad \dfrac{lq}{2Lk\theta _{min}} (2 - \phi )< nf < \phi \dfrac{lq}{2Lk\theta _{min}} \\ 1 &\quad nf \ge \phi \dfrac{lq}{2Lk\theta _{min}} \end{array}\right. }. \end{aligned}$$
(13)

We denote by \(\Delta _{p}\) the resulting \(\hat{p}\) penalty functions.

Let \(V_{v}\) be the value of one unit of in-vehicle time, then the value of in-vehicle time, \(C_{v}\), is

$$\begin{aligned} C_{v}(\mathbf {f}, d, \mathbf {n}) = V_{v}\dfrac{\lambda }{2}q\sum _{p} \chi _{p}\gamma _{p}\Delta _{p}(f_{p}, n_{p}) t_{oc,p}(f_{p},d,n_{p}), \end{aligned}$$
(14)

where we indicate by \(\mathbf {n}\) the vector of \(\hat{p}\) TU lengths. The total passengers’ time value \(C_{u}\) is then

$$\begin{aligned} C_{u} = C_{a} + C_{w} + C_{v}&&C_{a},C_{w},C_{v}[\$/\text{h}]. \end{aligned}$$
(15)

3.3 Operator cost

The operator cost has five components. The first is the construction and maintenance of the route and is indicated by \(c_{0l}\). The second includes the construction and maintenance costs of the stations. For each station this term has a fixed part \(c_{0s}\), and a variable part, \(c_{0sv}\), which depends on the TU length at peak hours. The third depends on the fleet size and reflects vehicle capital and administrative costs. Let \(c_{1v}\) be the unit operator cost per vehicle-hour which accounts for the capital and administrative costs. The deployed fleet size B of TUs is the product of frequency and cycle time: \(B = f t_{c}\). The vehicle fleet size is equal to \(\zeta nB\), where \(\zeta > 1\) provides for O&M spares. The fourth expresses the crew costs and depends on \(c_{1t}\), the unit operator cost per TU-hour. The fifth accounts for running costs such as energy, tires, lubricants, etc. Let \(c_{2v}\) be the unit operator cost per vehicle-km. The amount of vehicle-km is the product of the commercial speed S and the fleet size. The commercial speed is obtained by dividing the total length 2L by the cycle time. Thus, the amount of vehicle-km is \(S\times nB = 2L/t_{c} \times n f t_{c} = 2Lnf\). The operator cost \(C_{o}\) is then

$$\begin{aligned} C_{o}(\mathbf {f}, d, \mathbf {n})= & {} c_{0l} + (c_{0s} + c_{0sv}(n_{1} - 1))\dfrac{2L}{d} + c_{1v}\zeta n_{1} f_{1} t_{c,1}(f_{1},d,n_{1}) \nonumber \\&+\, c_{1t}\sum _{p}\chi _{p}f_{p} t_{c,p}(f_{p},d,n_{p}) + 2c_{2v}L \sum _{p}\chi _{p}n_{p}f_{p}. \end{aligned}$$
(16)

3.4 Side constraints

The frequency is constrained to be equal to or larger than \(f_{min}\), and less than or equal to \(f_{max}\). The value \(f_{min}\) can be set by a “policy headway” rationale, i.e. there is a minimum guaranteed frequency \(f_{pol}\), or can account for capacity as follows. Let \(\alpha q \tau\) be the largest load served by the line in a generic period, where \(\alpha \le 1\), and \(\nu\) be a spare capacity design factor. For example, a value of \(\nu\) smaller than one accounts for random demand fluctuations and represents a safety margin, whereas \(\nu\) larger than one allows crush loading. Thus, \(f_{min}\) is

$$\begin{aligned} f_{min} = \max \left( f_{pol}, \dfrac{\alpha q \tau }{\nu k n}\right)&&f_{pol}[\text {TU/h}],q[\text {pax/h}], k[\text {pax/veh}], n[\text {veh/TU}].&\end{aligned}$$
(17)

The \(f_{max}\) is defined according to general principles from the Transit Capacity and Quality of Service Manual (TCQSM 2013). We set a cap \(\check{f}_{max}\) on the value of \(f_{max}\) to reflect rail vehicle operation with drivers responsible for maintaining safe separation. For buses \(\check{f}_{max}\) can attain a larger value, twice as high as the one for rail. TCQSM (2013) effectively defines \(f_{max}\) as the maximum number of transit vehicles that can pass a given location in a given time period, and is inversely proportional to a minimum achievable headway under which acceptable service can be maintained. For BRT and LRT operations within or parallel to urban arterial roadways the minimum achievable headway depends on the longest dwell time. We assume that the longest dwell time is \(\psi\) times the average dwell time, where \(\psi\) is a parameter larger than one. The average dwell time \(t_e\) is a function of frequency, stop spacing, and TU length

$$\begin{aligned} t_e(f, d, n) = t_d + t_{b} \frac{q d}{2 n f L}&&t_d[\text {s}], t_{b}[\text {s-pax/veh}], q[\text {pax/h}], \nonumber \\&&d, L[\text {km}], n[\text {veh/TU}], f[\text {TU/h}]. \end{aligned}$$
(18)

We simplify the \(f_{max}\) formula for the bus and light rail modes to share the following common form:

$$\begin{aligned} f_{max}(f, d, n) = \min \left( \check{f}_{max}, \frac{3600}{t_{e0} +t_{ev}(n-1)+ \eta \psi t_e}\right)&&\check{f}_{max}[\text {TU/h}], \nonumber \\&&t_{e0},t_{ev}, t_e[\text {s}]. \end{aligned}$$
(19)

The values of \(t_{e0}\), the fixed stop clearance time, \(t_{ev}\), the stop clearance time for an extra vehicle length, and \(\eta\), the fraction of the longest dwell time, are set in accordance with the vehicle characteristics we define for each scenario, and with the assumed station and runningway characteristics for which capital cost estimates are established.

Finally, we define side constraints on the average stop spacing. Reaching the speed \(S_{max}\) requires a stop spacing larger than a threshold value \(d_{min}\), which depends on acceleration and deceleration rates

$$\begin{aligned} d_{min} = \dfrac{S_{max}^{2}}{25920} \left( \dfrac{ 1 }{\bar{a}} + \dfrac{ 1 }{\bar{b}} \right)&&S_{max}[\text {km/h}], \bar{a}, \bar{b} [\text {m/s}^{2}], \end{aligned}$$
(20)

(see e.g. Vuchic and Newell 1968). Values of d less than this minimum distance will not be allowed. We observe that \(d_{min}\) will be different for types of vehicles and speed limits, and is a constraint that is rarely binding. An upper bound \(d_{max}\) is defined as well. We remark that the optimal d is an average stop spacing that does not impede specific implementations where the stop spacing varies along the route.

3.5 Optimization model

The total cost \(C_{tot}\), sum of passengers’ time value and operator cost, is a function of frequencies, stop spacing, and consists. The model follows:

$$\begin{aligned} \text {minimize} \, C_{tot}(\mathbf {f},d,\mathbf {n}) \end{aligned}$$
(21)

subject to

$$\begin{aligned} d_{min}\le d \le d_{max}\end{aligned}$$
(22)
$$\begin{aligned} \max \left( f_{pol,p},\dfrac{\alpha q_{p} \tau _{p}}{\nu n_{p} k}\right)\le f_{p} \le f_{max,p}(f_{p}, d, n_{p}), \quad \forall p \in \{1,\ldots ,\hat{p}\} \end{aligned}$$
(23)
$$\begin{aligned} n_{min}\le n_{p} \le n_{max}, \, n_{p} \in \mathbb {N}, \quad \forall p \in \{1,\ldots ,\hat{p}\} \end{aligned}$$
(24)
$$\begin{aligned} n_{p} f_{p} t_{c, p}(f_{p},d,n_{p})\le n_{1} f_{1} t_{c,1}(f_{1},d,n_{1}), \quad \forall p \in \{2,\ldots ,\hat{p}\}. \end{aligned}$$
(25)

Constraints (22) set minimum and maximum values for the stop spacing. Constraints (23) enforce minimum and maximum values for the frequencies. Constraints (24) specify the feasible range of TU lengths, and constraint (25) ensures that the maximum fleet is deployed at peak times, where for “maximum deployed fleet” we refer to the vehicles needed for the scheduled service (the reserve that bears a capital cost is accounted in the operator cost function). The model is solved by an updated version of the algorithm presented in Moccia and Laporte (2016) where the new non-linear constraint, the right side of (23), is imposed iteratively, and the lower convex approximation scheme, reported in “Appendix 3”, accounts for the new definitions (4), (5), (13), and (16).

4 Scenarios

We consider two scenarios that capture operational aspects dependent upon the interaction between service frequency and crowding, with infrastructure for each designed to offer comparable performance for both bus and rail until capacity is approached. These scenarios are based on an assessment of the literature (Vuchic 2005; Vuchic et al. 2012; Casello et al. 2014; Bruun et al. 2018), and on IBI Group’s internal database of transit projects. These scenarios represent design values that planners of new systems in developed countries with high labor costs are likely to use and thus represent best practices. A preliminary version of these scenarios for a two-period case was presented in Moccia et al. (2016). Because some introduced model features were not present in Moccia et al. (2016) the following results are new.

Scenario 1 consists of a dedicated arterial RoW obtained by a simple change in use of an existing lane. We assume an average of six intersections per km, four of which are signalized. TSP is assumed to be present in the relatively modest form prevalent in North America. Stations are part of the curb or sidewalk environment, with only modest passenger shelters and amenities, but allow level boarding. Fares are collected on board, so no fare vending occurs at stations. Most stations are not multi-modal transfer points; users access and egress stations by walking at a speed of \(s = 4\) km/h. Rail vehicles are single-unit low-floor trams of three different lengths ranging from 34 to 56 m. A 24 m tram was also considered in Moccia et al. (2016) but we have omitted it because it is indicated to be dominated in both the current and previous models. For buses we consider a single-articulated bus of 18 m, and a double-articulated bus of 24 m, the latter of which would generally require special authorization in mixed rights of way. Platooning is assumed to occur for buses at frequencies in the range between 30 and 80 TU/h, whereas it does not occur for trams because their frequency is capped at 40 TU/h. For each technology, the two terminals have a loop that is not part of the service route length and vehicles are in a single-ended configuration. The maximum authorized speed is assumed to be the same for each technology (50 km/h). In the case of rail technology, this might commonly be referred to as an enhanced streetcar or tram. In the case of bus technology, the labels “enhanced bus service”, or “BRT lite” might apply.

Scenario 2 represents an exclusive at grade RoW when a new laterally separate alignment is developed. Intersections are assumed to be identical to the above scenario, but TSP is provided in a more robust form prevalent in many European cities. Stations provide for off-vehicle fare collection, ample passenger amenities including weather protection and lighting, off-board fare collection, level boarding, a distinct architectural treatment and branding appropriate to the streetscape, and, in some cases, revisions to the streetscape as may be warranted to accommodate the stations without interfering with sidewalks. Stations are multi-modal transfer points so that access and egress occur at an average speed \(s= 12\) km/h, i.e. a bike speed is representative of the system average between the walking speed and the speed of motorized transport. Rail TUs may operate as multiple units with a consist ranging from one to four cars, and are served by stub terminals, i.e. they are in a double-ended configuration. Capital and O&M cost figures for rail in this scenario consider the higher complexity of variable consists. Bus vehicles are the same as for Scenario 1 and change directions at the terminals via a loop that is not part of the service route length. Platooning is assumed to occur for buses at frequencies in the range between 40 and 80 TU/h, whereas it does not occur for rail TUs because their frequency is capped at 40 TU/h. The maximum authorized speed is assumed to be the same for each technology (75 km/h). These might commonly be referred to as LRT and BRT, in the case of rail and bus technology, respectively.

In both scenarios, stops and stations are configured to allow in-line double-berthing by buses, and offer all-door level boarding. In both cases, higher frequencies cause increased delays at stops and stations due to queuing and higher demand causing longer dwell times.

We consider a route of 20 km with users traveling on average a fraction \(\lambda = 0.45\) of this length, i.e. 9 km per trip. Scenario variants with different values of \(\lambda\) are discussed in Sect. 5.4.

The flow at the peak direction and at the most loaded section is equal to 40% of the total bidirectional demand. This assumption represents a moderate centripetal demand where, for example, 60% of the bidirectional demand moves in the heavy or peak direction, and 67% of these users traverse the most loaded section. Scenario variants of this assumption are presented in Sect. 5.4.

The passenger time values were derived as follows. The in-vehicle value of time is assumed to be the base VoT. A wide defendable range for this parameter can be assumed as discussed in Litman (2017). We decided for the US-DOT (2011) estimate of the average surface transportation VoT equal to 12.5 in year 2009 US dollars, updated for inflation to 13.37 in year 2012 US dollars. The waiting and the access VoTs were obtained by multiplying the base VoT for 1.25 and 1.5, respectively, as in Tirachini et al. (2010) and Moccia and Laporte (2016). A scenario variation analysis on the VoT assumptions is carried out in Sect. 5.4.

A technology is indicated by three letters, “BRT” or “LRT”, to distinguish between bus and rail, respectively, followed by a number, one or two, referring to the scenario, and, in the case of a single-unit technology, a number specifying the length of the vehicle in meters. For example, “BRT1_18” refers to the BRT of Scenario 1 with a bus of 18 m, and “LRT2” indicates the multi-unit rail technology of Scenario 2. The set of parameters related to the users and to the transit line that are common to all scenarios is listed in Table 2. Table 3 reports parameters that are scenario-specific and Table 4 lists those that are mode-specific. Parameters that are technology-specific are reported for Scenario 1 and 2 in Tables 5 and 6, respectively. Monetary figures are expressed in year 2012 US dollars.

The demand profile is derived from turnstile data of the Boston subway network presented in “Appendix 1”. The parameters of the demand profile are listed in Table 7.

Capital costs are annualized assuming a discount rate of 3% over a one-stage technical life for the vehicles and the infrastructure. The discount rate is indicated by the symbol \(\iota\), and the one-stage technical life by y. The one-stage technical life is lower than a typical service lifetime because it expresses the equivalent years including cost of a mid-life rebuild at the prevalent discount rate. For example, a bus may have a useful service life of 15 years with a mid-life rebuild, but because that overhaul costs 25% of the initial vehicle cost, the equivalent one-stage bus technical life is assumed to be 12 years, which fits the one-stage capital accounting in the relevant range of discount rates (1–10%). Similarly, runningway and station infrastructure were annualized on the basis of a 40-year one-stage technical life, determined from the inclusion of the periodic re-investments of different elements on specific timeframes. A 5% residual value is factored into the annuity computation for the rolling stock by setting a parameter \(\xi\) to 0.05. No residual value is assumed for the infrastructure, \(\xi = 0\). In general, for infrastructure and rolling stock, the capital amortization per hour of service is computed as

$$\begin{aligned} \frac{ P(1 - \xi ) \iota }{H (1 - (1+\iota )^{-y})} \end{aligned}$$
(26)

where P is the purchase price and H is the number of service hours in a year. Land capital cost is annualized by multiplying it by the discount rate, i.e. assuming an infinite service life.

Vehicle capacities are determined such that all vehicles for both technologies have the same ratio of seats to total users. This ratio is equal to 0.3, equal to \(\theta _{min}\), the threshold load ratio of the \(\delta\) penalty function up to which there is no penalty. The total user capacities for rail are computed assuming an industry-standard 4 standees per available square meter, whereas for bus we use 3.6 standees per square meter to account for the higher vertical accelerations and less controlled lateral accelerations experienced on buses which necessitates that passengers use more space to brace themselves.

Table 2 Parameters related to the users and to the transit system that are common to both scenarios and to all technologies
Table 3 Scenario-specific parameters
Table 4 Mode-specific parameters
Table 5 Scenario 1, technology-specific parameters
Table 6 Scenario 2, technology-specific parameters
Table 7 Parameters related to the demand profile common to both scenarios and to all technologies

5 Results

Optimal results of the decision variables and of other relevant indices will be presented as functions of the density of demand, a key input variable for any system designer. This latter index, referred to as passenger travel density (PTD), expresses the amount of traveled distance by passengers per unit of route length. More formally, PTD is

$$\begin{aligned} \dfrac{H q l\sum _{p}\chi _{p}\gamma _{p}}{L} = H q \lambda \sum _{p}\chi _{p}\gamma _{p}. \end{aligned}$$
(27)

We indicate the PTD unit of measure as “pax-km/year-km” because the “km” at the numerator has a different meaning of that to the denominator—the former refers to the unitary distance traveled by a passenger while the latter expresses the unitary route length. The rationale for PTD as the index for the amount of service provided is that it incorporates the ratio between the average trip length and the route length, and the number of service hours in a year. This index allows a degree of abstraction from some of our parameter choices as we illustrate in the following by a scenario variation analysis. The PTDs are large numbers and therefore we use a logarithm scale with the exception of the frequency and consist figures where a linear scale is preferred to highlight the proportionality of these variables to the demand. For the frequency figures we also use the following secondary index: the users per peak hour in the peak direction at the maximum load segment (MLS)—an index often used in practice.

The remaining of this section is structured as follows. Sections 5.1 and 5.2 report numerical results for the Scenarios 1 and 2, respectively. Section 5.3 presents a model’s validation by comparing computed O&M costs with those of light rail systems in North America. Sensitivity analysis and scenario variants are discussed in Sect. 5.4.

5.1 Results for Scenario 1

Figure 3 illustrates the average total cost of the studied technologies in Scenario 1. The breakeven points between the technologies occur at PTDs between 6.3 and \(8.6\times 10^{6}\). However, we note that there is a wider range where cost differences are small, and therefore other factors beside passengers’ time value and operator cost could be decisive.

Fig. 3
figure 3

Scenario 1, average total cost, \(C_{tot}/(ql \sum _{p}\chi _{p} \gamma _{p})\)

Figure 4 compares the BRT1_18 optimal and minimum frequencies for all periods. At the peak period, \(p=1\), the optimal frequency aligns with the minimum feasible value. This means that even from a total cost perspective providing extra frequent service (i.e. a frequency higher than the minimum) at the peak hour is suboptimal. At the medium demand period, \(p=2\), extra-service is provided with a reduced difference at high frequencies when platooning starts to occur. At the low demand period, \(p=3\), the minimum frequency again becomes the dominating factor. These observations also apply to the BRT1_24 optimal frequencies, and hence such a figure is omitted for brevity. Figure 5 compares the LRT1_34 optimal and minimum frequencies for all periods. At the peak period, the tram optimal frequency behavior is the same as that of buses, i.e the minimum frequency rules. In contrast, at the medium and low demand periods significantly more frequent service is provided with respect to it being strictly necessary. This results from the lower operating expenses of rail vs. bus when the fleet acquisition cost is considered as a sunk cost. Similar observations hold for the remaining Scenario 1 rail modes, and hence figures are omitted for brevity.

Fig. 4
figure 4

Scenario 1, mode BRT1_18, optimal and minimum frequencies for all periods

Fig. 5
figure 5

Scenario 1, technology LRT1_34, optimal and minimum frequencies for all periods

Optimal stop spacings, illustrated in Fig. 6, vary in a small range across the studied PTDs. Differences between the technologies are small and follow the approximate formula (34) of Moccia and Laporte (2016) which holds as well for the model of this paper. We further observe that PTD is proportional to \(\lambda\), the ratio of the average trip length to the route length, but the reported stop spacing figures are computed for a fixed \(\lambda\). In those figures, PTD varies because q, the average demand at the peak period, varies. Formula (42), the unconstrained optimal stop spacing of the approximation, explains the observed decrease of the stop spacing with an increasing q: the component related to the stop fixed costs is divided by q. This formula also indicates that, ceteris paribus, the stop spacing would increase with \(\lambda\) as we will show in Sect. 5.4.

Fig. 6
figure 6

Scenario 1, optimal stop spacings

At the high frequencies required by higher PTDs, user inconvenience occurs both because of crowding and of longer operating cycle times. The degradation of cycle time can be observed in Fig. 7 which depicts the average commercial speeds. This phenomenon can be further appreciated by disaggregating the total cost into average passenger time value (for brevity in the following indicated by “user cost”, see Fig. 8) and average operator cost (Fig. 9). The former increases with demand density when the optimal frequency deviates from that guaranteeing uncrowded or moderately crowded conditions. For buses at high PTDs the decrease in average operator cost is not sufficient to compensate for the increase in user cost which rail technologies can avoid.

Fig. 7
figure 7

Scenario 1, average commercial speed for all periods

Fig. 8
figure 8

Scenario 1, average passenger time value, \(C_{u}/(ql \sum _{p}\chi _{p} \gamma _{p})\)

Fig. 9
figure 9

Scenario 1, average operator cost, \(C_{o}/(ql \sum _{p}\chi _{p} \gamma _{p})\)

We further observe in this scenario that high frequency penalty does not impact breakeven points, which occur at frequencies where this penalty is not significant. The same holds for the platooning effect. The combination of the high frequency penalty and of the self-adjusting maximum frequency constraint are significant in limiting the attainable capacities of the BRTs and increasing the average passenger time value for large demand levels.

This dynamic highlights the crucial role of the high frequency penalty and of the maximum frequency constraint which are two of the main extensions proposed in this paper. These combine to limit operations to capacity in the sense defined by the TCQSM, e.g. “without unreasonable delay and with reasonable certainty”. Many semi-rapid bus and rail systems carry throughputs on their maximum load sections that are higher than capacity in this sense, at low operating speeds, severe overcrowding, or other less-than-ideal circumstances which are not likely to be acceptable for a new design. This model does not attempt to represent such overcapacity operations.

We have also evaluated the operational impact of an exclusive operator cost minimization versus a total cost minimization. In this former case the stop distance is set to the optimal value as in total cost minimization. Figures are not presented for brevity, and the results can be synthesized as follows. The resulting optimal frequencies align to the \(f_{min}\) policy. The breakeven points between bus and rail recede toward lower PTDs because the waiting time advantage of buses is not included in the single objective of operator cost minimization.

5.2 Results for Scenario 2

Figure 10 illustrates the average total cost of the studied technologies in Scenario 2. The breakeven points between the technologies occur at PTDs between 7.7 and 11.2 \(\times 10^{6}\). Comparing total cost between scenarios, Figs. 3 and 10, it may be concluded that the faster and more capital-intensive technologies of Scenario 2 dominate those of Scenario 1 with the only exception of the LRT2 vs BRT1 at the lowest demand density studied.

For buses, the optimal frequencies compared to the minimum frequencies behave similarly to those of Scenario 1 (see Fig. 11 for BRT2_18). Figure 12 shows how the optimal number of rail vehicles per Transit Unit varies between peak and off-peak periods. Figure 13 compares the LRT2 optimal and minimum frequencies for all periods. As in Scenario 1, rail provides more extra frequent service at non-peak periods with respect to buses. The sawtooth pattern of the frequencies highlights how flexible capacity can be offered by multiple-unit operation light rail. Stop spacings are illustrated in Fig. 14. We illustrate the degradation of the cycle times in Figure 15, its effect on the average passenger time value, Fig. 16, and the average operator cost, Fig. 17.

As for Scenario 1, we have also evaluated the exclusive operator cost minimization. The breakeven points are similar to those of total cost minimization. The optimal frequencies under operator cost minimization align to a \(f_{min}\) policy. However, there is a relevant difference for LRT2 under the two objective functions, total and operator cost minimizations. Operator cost minimization leads to minimum frequencies with longer consists and thus less frequent service than under total cost minimization.

Fig. 10
figure 10

Scenario 2, average total cost, \(C_{tot}/(ql \sum _{p}\chi _{p} \gamma _{p})\)

Fig. 11
figure 11

Scenario 2, technology BRT2_18, optimal and minimum frequencies for all periods

Fig. 12
figure 12

Scenario 2, technology LRT2, optimal number of vehicles per Transit Unit for all periods

Fig. 13
figure 13

Scenario 2, technology LRT2, optimal and minimum frequencies for all periods

Fig. 14
figure 14

Scenario 2, optimal stop spacings

Fig. 15
figure 15

Scenario 2, average commercial speed for all periods

Fig. 16
figure 16

Scenario 2, average passenger time value, \(C_{u}/(ql \sum _{p}\chi _{p} \gamma _{p})\)

Fig. 17
figure 17

Scenario 2, average operator cost, \(C_{o}/(ql \sum _{p}\chi _{p} \gamma _{p})\)

5.3 Model validation

The previously presented figures of computed stop spacings, commercial speeds, and consists align with observed best practices in Europe, North America and Australia. Because of space limits, we do not include additional figures of other performance indices, but we report that computed passenger in-vehicle occupancy, labor percentage of O&M cost, fleet size, service vehicle hours and kilometres per year are in good accord with the above mentioned best practices. As a synthetic illustration of this, Fig. 18 compares the computed O&M costs of LRT2 with those of light rail systems in North America. The data points of the systems that are widely recognized as some of the best-in-class are well enveloped by the computed values.

Fig. 18
figure 18

O&M costs of North America light rail systems versus those of the computed LRT2

5.4 Sensitivity analysis and scenario variants

This section discusses the model’s sensitivity and the dependence of our results to VoT assumptions, parameters influencing demand concentration and density, discount rate, and maximum vehicle loading.

The presented model is robust in the strict sense of optimization theory where sensitivity analysis deals with very small input changes. This robustness is a consequence of the fact that the optimal solution is well approximated by square root formulae (“Appendix 3”). However, significantly different assumptions, hence significantly different parameters, may yield different results as discussed in the following.

Table 8 Breakeven PTDs between bus and rail technologies in the base case and in eleven scenario variants

Table 8 summarizes the breakeven points between the bus and rail technologies in the two base case scenarios and in eleven variants of them. We list breakeven points for the user, operator and total cost curves, but, unless stated otherwise, in the following discussion we refer to the total cost curves when we use the term “breakeven points”. Nine scenario variants are indicated by an alphabetical letter from “A” to “I”. Two scenario variants are obtained by combining some of the previous nine variants and are labelled by the corresponding set of letters.

Variant A halves all VoTs with respect to the base case. Because of the large VoT reduction, the total cost curves are significantly lower than those of our base scenarios. However, the total cost breakeven points between bus and rail technologies do not change dramatically, see the second data row of Table 8, although there are differences in the user and operator cost breakeven points. This allows us to conclude that our results are robust to uniform small and medium VoT variations.

Variant B sets access and waiting VoTs equal to the base in-vehicle VoT. This experiment assesses the influence of the access and waiting time penalties. In fact, because of the wide adoption of info-mobility in recent years and the station quality features embedded in our scenarios’ assumptions, it is unlikely that passengers will experience additional discomfort of waiting. This should also be true for the access time given the current emphasis and the popular uptake of active transport (Khreis et al. 2016). We further note that our access value of time formula (7), by assuming a uniform spatial demand distribution around stations, provides an overestimation of access time when the origins and the destinations are instead clustered and centered at stations—as often is the case. We observe that VoT equality among the three passenger time components penalizes BRT, because at the medium and low demand densities this assumption reduces the advantage of a higher frequency and a reduced access distance provided by bus vs. rail technologies (with the waiting component playing the major role). Consequently, the breakeven points shift toward lower demand levels.

Variant C tests the impact of the crowding penalty by setting \(\rho = 0\). Removing the crowding penalty does not recognize the capacity advantage of rail TU, and therefore the breakeven points significantly move toward higher demand levels. A similar effect can be obtained by disabling the newly introduced correction for the average vehicle occupancy by setting \(\phi = 1\) as done in Variant D.

Variant E reduces by 5% the in-vehicle VoT for rail modes with respect to the base case while it keeps the in-vehicle VoT for bus modes as in the base case. These VoT variations attempt to represent a rail ride quality advantage in terms of both lateral and vertical accelerations. Even this modest in-vehicle VoT reduction for rail yields significantly lower breakeven points.

We now present results obtained by varying \(\alpha\), the parameter referring to the degree of demand concentration along the line. Variant F sets a low value for this parameter, \(\alpha = 0.2\), which expresses a very low demand concentration. Breakeven points do not change significantly with respect to our base case, but we note that bus technologies can serve higher PTDs than the base case before reaching maximum capacity. Variant G posits a high value of \(\alpha\), equal to 0.6, which indicates an unusual concentration of traffic in a relatively short part of the line. Breakeven points shift toward lower demand levels with respect to our base case because the large capacity of rail TU show its usefulness under this variant.

We explored the dependence to a project discount rate, which had a default value of 3%. The defensible range for this assumption can be considerable, ranging from a “fiscal” discount rate reflecting only the cost for the project sponsor of borrowing money to a “social” discount rate reflecting the urgency of the problem or competing demands for funds, more typical of developing countries. Table 8 reports two variants, H and I, with a discount rate of 6 and 9%, respectively. Further experiments on the discount rate for Scenario 2 are synthesized in Fig. 22. These results indicate that the breakeven points are strongly related to the discount rate.

Variant C+G presents a best case for bus technologies, where crowding is not considered, \(\rho = 0\), and the demand concentration along the line is low, \(\alpha =0.2\). An opposite combined scenario variant is B+F where there is VoT equality among the three passenger time components and a high demand concentration along the line, \(\alpha =0.6\). These opposite scenario variants illustrate how particular and legitimate assumptions can yield very different results.

We have tested a wide interval for the parameter \(\lambda\). Breakeven points presented in terms of PTD do not change significantly for \(\lambda \in [0.35,0.6]\), whereas the breakeven points expressed as average hourly peak demand would vary misleadingly. This highlights the advantage of the PTD index for the demand level when comparing technologies. As discussed in Sect. 5.1, \(\lambda\) does have an impact on the stop spacings. As an example, we report in Fig. 19 the stop spacings in Scenario 2 at \(q = 5000\) pax/h.

Fig. 19
figure 19

Scenario 2, stop spacing as a function of \(\lambda\) at \(q = 5000\) pax/h

We now explore the dependence of the model results to maximum vehicle loading. Such loading can be represented by setting the spare capacity factor \(\nu\) larger than one. In the base case we have \(\nu = 0.95\) indicating a safety margin of \(5\%\). In our vehicle capacity parameter set, \(\nu = 1.28\) allows loads up to six standees per square meter, commonly referred to as a “crush” load. Allowing this crowding level does not affect the breakeven points in our two base scenarios. This crowding level increases capacities but by less than the \(1.28/0.95 = 1.35\) factor that one would expect. This occurs because the higher load ratios induce longer dwell times, and hence smaller maximum frequencies due lo longer operating times. For example, compare the maximum and minimum frequencies in the Scenario 2 with \(\nu = 0.95\), Fig. 20, and those with \(\nu = 1.28\), Fig. 21.

Fig. 20
figure 20

Scenario 2, maximum and minimum frequencies with \(\nu = 0.95\)

Fig. 21
figure 21

Scenario 2, maximum and minimum frequencies with \(\nu = 1.28\)

Fig. 22
figure 22

Breakeven demand as a function of the discount rate for Scenario 2

6 Conclusions

We believe that the results of the model to date support the following conclusions. The model is validated by a good alignment of its results with performance indices of best practices in North America, and to a lesser extent Europe and Australia. For a particular corridor and specific cost conditions, a breakeven point between rail and bus technologies can be identified from both an operator perspective and a total (operator and user) perspective. However, we note that there is a wide range where the cost differences between the technologies are small, and therefore other factors beside passengers’ time value and operator cost, both capital and O&M, could be decisive. It may be concluded that faster, although more capital-intensive, best commonly available technologies, as represented in our choice of coefficients and parameters, dominate the slower ones with the only exception of the multiple-unit rail vs. BRT at the lowest demand density studied. Planning for a faster technology is then more relevant than the choice between bus and rail per se, except at very low demand density. At high demand density multiple-unit rail as in Scenario 2 offers the most cost-effective way to achieve high capacities (in the TCQSM sense) under many conditions. A scenario variation analysis shows that differences between waiting and in-vehicle VoTs are more relevant to the technology choice than the exact estimate of the VoT. Assessing the crowding disutility by reducing the bias of averaging vehicle load ratios is found to be significant as well. Even a small reduction of the rail in-vehicle VoT with respect to that of bus, reflecting a difference in riding comfort, yields lower breakeven points than in the conservative scenarios were in-vehicle VoTs are equal for all modes. The choice of a density demand index allows a degree of abstraction with respect to a straightforward comparison by average peak demand level. Allowing crush loading does not modify breakeven points and extends capacities in a sub-linear way. Breakeven points are very sensitive to the project discount rate. Thus, developing countries will tend to have a higher breakeven point for choosing rail, as the added capital costs of rail do not offset labor costs as quickly.

With respect to the impact of the new model’s features, the computational experiments show the following. The reduction of the crowding disutility underestimation has a significant effect on the breakeven points between bus and rail technologies that otherwise would move toward higher demand levels, because the full comfort advantage of higher capacity rail TU would not be recognized. The high frequency penalty, the reduction of the waiting advantage in case of platooning, and the maximum frequency constraint do not impact the breakeven points, hence the technology selection issue, but are instrumental in aligning computed attainable capacities, and TU speeds to those that can be empirically observed in good quality semi-rapid transit lines. These results are relevant from a design perspective, and attain our research objective of functional extensions that increase the model’s realism. We also have developed and offer techno-economic parameters that reflect the types of construction characteristic of the two scenarios we present. As discussed in Bruun et al. (2018) these costs can exhibit a significant range even within a class of construction because of local circumstances including terrain and availability of right-of-way. These parameters represent an empirical mid-point of the observed range of basic infrastructure elements within the applicable classes of construction. With respect to innovation management in transit, our model can be used to sharpen the value proposition for the users and the operator in broader assessment frameworks as that of Newman et al. (2018) without limiting the technology spectrum to rail.

We observe that we have compared technologies that may have significantly different pollution profiles both in terms of local air quality and greenhouse gas emissions, noise levels, visual aesthetics, and degree of local manufacturing content, and that the model does not represent these aspects. Specific planning requirements, for example gradients or a transit section in a pedestrian street, can tilt the choice between modes. See Vuchic et al. (2012) for an extensive discussion of several transit planning issues, and Vuchic (1984) on the limits of total cost appraisal in transportation. Moreover, total cost minimization does not address equity concerns (Perugia et al. 2011; Gutiérrez-Jarpa et al. 2017). Future work may address these particulars.