Keywords

1 Introduction

The economy of any country is the backbone of its peoples’ livelihood. It is essential to make the cash flow in the economy for the survival and growth of any country. This cash flow happens through the movement of people and goods from one place to the other because of the spatial distribution of the resources in the country. For all such valuable economic activities, there is an indispensable and induced need for a transportation system. To boost economic growth, it is important to provide sufficient facilities for the smooth conduct of activities. While focusing on supplying the goods and passengers the transport facilities, it is required to know the demand beforehand. Since the activities take place in the physical world, to cater to its demand it is favourable to take decisions for the supply of the transportation system for the future years. In this process, there is a need of modelling the transport demand so that decision makers can be assisted in forming transportation planning decisions. This demand can be generated within the city (Intracity) as well as between the cities (Intercity).

The urban and regional contexts are totally different in the case of transportation systems. Urban travel generally demands a single mode of transportation, mostly road transport, whereas regional travel demands a different pattern with multiple modes of transportation. This further comes with more complexions with a number of modes available for access, egress, and trunk route journey. To analyse and alleviate the complexion of travel demand, it is required to perform the modelling which will support the decisions taken by the decision makers. Again, there is a different modelling approach for urban and regional contexts.

Multimodal inter-regional travel demand models can be made through various approaches. If the behavioural responses are considered at the aggregate level, then a direct demand model and elasticity analysis can be developed. If not considered, then only origin–destination estimation without behaviour theory is made. Furthermore, if disaggregated individual behaviour is considered, then a four-stage model is used. From therein, for analysing trips, the trip-based four-step models and for analysing tours/activity chains, activity-based modelling is done. At the regional level, developing an aggregate model is more sensible considering the complexities in the behavioural responses.

Urban travel demand can be approached by disaggregate modelling where each behavioural measure is analysed. Conventionally, the urban travel demand is modelled through the four-step modelling or sequential modelling which are trip generation, trip distribution, modal choice, and trip assignment. While doing so for regional context, it requires a huge dataset and at such a macrolevel, the homogeneity can be perceived and hence, for regional travel demand aggregate models are used.

In the conventional modelling approach, the estimation of relatively well-defined submodels is necessary which requires disaggregated data of the entire study area. This makes the task complicated at the regional level. Also, there is a challenge of calibrating the gravity model which creates problems with the errors in trip end totals and for those generated poorly from intrazonal trips. This drawback is eliminated in the case of direct demand model where it is calibrated simultaneously for all the three submodels, i.e., trip generation, trip assignment, and mode choice.

2 Literature Survey

Wirasinghe and Kumarage attempted to formulate an aggregate total demand model for estimating the inter-district passenger travel by public transport in Sri Lanka. Limitations of the model proposed were large calibration error, correlation between socio-economic variables, correlation between impedance variables, inappropriate functional forms, and spatially non-transferable [1]. Ibe modelled an intercity travel demand model for Nigeria. The study found that passenger traffic is closely related to the frequency of vehicle trips, available bus fare, vehicular capacity, distance between origin and destination, and journey time performance of vehicles [2]. A log-linear relationship between travel demand and the explanatory variables was developed.

Woldeamanuel compared the intercity bus service with other competing modes. It concluded that intercity buses are environment-friendly, economically viable, and socially inclusive modes of long-distance travel. Intercity bus provides services to many rural areas which are far apart from the other intercity transit services [3]. Peers and Bevilacqua stated that the separate formulation of models by mode provides the opportunity to achieve the maximum flexibility in the model specification in direct demand modelling [4]. One of the initial formulations for the direct demand models was the Kraft-SARC model which had later on been modified by Mc-Lynn. Filippini and Deb studied 22 Indian states to estimate the log-linear travel demand function using panel data over the period of 1990–2001 [5]. It stated that the population has a positive and significant impact on travel demand. Fouquet studied the behaviour of real income and the price elasticity of demand for aggregate transport [6].

The report of “Transit and Quality Service Manual, third edition” provides the quality of service measure for the transit mode for comfort and convenience as passenger load, reliability, and transit auto travel time. Other factors mentioned were safety, security, and employee interactions with customers. Sinha identified that the key quality of service parameters of public transport is i) Cost, ii) Time, and iii) Quality of attributes of onboard comfort, ease of transfer, and information availability [7]. The report on “Understanding Transport Demands and Elasticities” mentions the elasticity of travel with respect to travel time for various modes and time periods, based on Portland, Oregon, data. It indicates that each 1% increase in AM peak drive-alone travel time reduces vehicle travel by 0.225% and increases demand for shared ride travel by 0.037% and transit by 0.036%. Frank found that transit riders are more sensitive to the change in travel time or waiting time compared to transit fare. A 10% increase in the in-vehicle travel time will reduce the transit demand by 2.3%, whereas a 10% increase in the transit fare will reduce the transit demand by 0.8% only [8].

Gaudry and Wills investigated the different forms of travel demand models that have been part of the different studies. It found that the sequential urban travel demand model is linear whereas the intercity travel demand model is generally of the log-linear form. This states that incorrect form of modelling can lead to erroneous results of the important parameters in the model [9]. Semeida developed the travel demand forecasting models for low population areas using two different methods, multiple linear regression, and generalised linear modelling (GLM) and then compared them which suggested that the GLM procedure offers a more suitable and accurate approach than linear regression for developing number of trips [10].

Filippini and Deb report that the aggregate transit demand can be estimated from a linear function model, semi-log linear model, log-linear model, and generalised log-linear model. The most common functional form used is the log-linear model because it reduces the number of coefficients to be estimated and the coefficient of the log-linear model directly estimates the elasticity [5].

Yahya carried out the study on modelling the variables when the data shows multi-collinearity and overfit to the testing data. For the stable and efficient estimates of the parameters in the presence of multi-collinearity in the data, it is suggested that the variables should be scaled at unit length or have zero means and unit standard deviation. Studies suggested that it is necessary and desirable to scale the predictor variables in ridge regression modelling when the presence of multi-collinearity is suspected [11].

3 Study Area

The economy of Gujarat state is the third largest in the country with a per capita GDP of 1,57,000. Also, Gujarat has been in the top five contributors to Indian GDP.

Moreover, in this state the corridor of three districts in the central and south regions, Ahmedabad, Vadodara, and Surat, have been taken for this study considering the heavily populated and industrialised corridor. The municipal areas have been delineated based on TAZ in the municipal area. Surat, Ahmedabad, and Vadodara have been divided into 7, 6, and 4 TAZs, respectively, as shown in Fig. 1.

Fig. 1
Three maps of three cities in Gujrat with T A Z s. a. Map of Ahmedabad with 6 parts of T A Z. b. Map of Surat with 7 parts of T A Z. c. Map of Vadodara with 4 parts of T A Z.

a Ahmedabad TAZ, b Surat TAZ, and c Vadodara TAZ

4 Demography

The demographic profile of all three cities is mentioned in Table 1.

Table 1 Demographic profile of all cities

4.1 Linkages Between Cities

All three cities are well connected through Rail, Road, and Air Transport. Rail transport is considered for the study and study corridors are mentioned below.

  1. (1)

    ST-ADI Rail Corridor

  2. (2)

    ST-BRC Rail Corridor

  3. (3)

    ADI-BRC Rail Corridor.

5 Data Collection

The following sections are provided for the secondary data collected for the present study.

5.1 Railways Secondary Data

For the current study, secondary data were collected such as for railways; historical passenger travel demand has been collected from the Ministry of Indian Railways (Railway Board), New Delhi, for the past 11 years from January 2009 to December 2019. Railway passenger demand data has been collected for all types of trains (passenger, Mail, Express, superfast, Shatabdi, Special trains, etc.) with 1A, 2A, 3A, CC, EC, FC, SL, and 2S. Later, for ease of analysis and evaluation, the data is segregated into Non-AC and AC classes.

5.2 Population

Historical population data was collected from the official website of Surat Municipal Corporation. The population of municipal areas of all three cities has been listed on a decadal basis from 1901 to 2011. For the construction of Direct Demand Models, the population is estimated from 2011 to 2019.

5.3 Annual Income Per Capita

Annual Income Per Capita has been collected from the national high-speed rail feasibility report. The report has published the annual income per capita of the city for the year 2006.

5.4 Travel Time

Access and Egress Travel time has been collected from the centroid of every zone to the intercity terminal such as railway stations and bus stands with the help of Google Maps. Travel time for the trunk route journey has been taken as per the published schedule of the services.

5.5 Travel Cost

Travel cost has been estimated based on fuel prices. The fuel prices have been collected from the petrol pump.

5.6 Service Frequency

The frequency of the railways has been collected as per the timetable provided by the railway board.

6 Data Analysis

After the data collection, irrespective of the type of data (qualitative or quantitative), analysis may consist of following steps:

  1. 1.

    Describe and summarise the data

  2. 2.

    Identify relationships between variables

  3. 3.

    Compare variables

  4. 4.

    Identify the difference between variables

  5. 5.

    Forecast outcomes.

6.1 Ridge Regression

To identify the relationships between the variables, the Pearson correlation method was used. But the results showed that it is an overfit model which is shown in Fig. 2 and cannot be used reliably over the forecasting.

Fig. 2
A 6 cross 6 matrix dataset. It presents the Pearson product moment correlations within the range between minus 1 and 1. The six parts are LOG S A gap D, LOG P S A, L O G I S A, LOG P T P gap T T, LOG P T P gap T C, LOG Y Fre.

Multi-collinearity in dataset

Hence, to alleviate the variance in the model, another statistical method called Ridge Regression was used. In this method, it changes the line of least squared fit to ridge regression line that will compromise on the bias but reduces the variance that is needed for the model. It adds an additional term in the least squared fit equation as the product of the lambda value and square of the slope of the line. The lambda value can be obtained by ten-fold cross validation.

6.2 Passenger Demand Analysis

The secondary data obtained from the Ministry of Railway Board gives passenger demand for the last 11 years. The railway passenger demand (RPD) variation over the years is shown in the following figures. Figure 3 shows the overall demand variation, whereas Figs. 4 and 5 are showing demand variation for AC and Non-AC classes, respectively. It can be observed that Surat-Ahmedabad is having the highest demand compared to Surat-Vadodara and Vadodara-Ahmedabad sections.

Fig. 3
A line graph of annual intercity passenger demand versus year. Three lines for S T A D I, S T B R C, and A D I B R C present the increased demand in 6 specified sections of Western railways between 2009 and 2019.

Railway passenger demand variation from 2009 to 2019

Fig. 4
A line graph of passenger demand for A C class versus year. Three lines for S T A D I, S T B R C, and A D I B R C present the increased demand in 6 specified sections between 2008 and 2020. For the S T A D I section, the demand increases most rapidly.

RPD AC variation from 2009 to 2019

Fig. 5
A line graph of passenger demand for A C class versus year. Three lines for S T A D I, S T B R C, and A D I B R C present the increased demand for A C class for all 6 specified sections between 2008 and 2020. For the S T A D I section, the demand increases most rapidly.

RPD non-ac variation from 2009 to 2019

6.3 AC Class Passenger Demand

6.4 Non-AC Passenger Demand

As shown in the above Fig. 5, Non-AC class passenger demand is highest in all sections. The relative demand of Non-AC class passenger demand with respect to AC class passenger demand is shown below:-

  1. i.

    ST-ADI—1.71

  2. ii.

    ADI-ST—1.86

  3. iii.

    ST-BRC—1.70

  4. iv.

    BRC-ST—1.81

  5. v.

    ADI-BRC—2.03

  6. vi.

    BRC-ADI—1.89.

6.5 Input Variables for Direct Demand Model

For the direct demand model, the intercity demand has been correlated with the product of the population of two cities, product of annual income per capita of two cities, travel time between two cities, travel cost for the chosen mode, and annual frequency of the services for the chosen between two cities. Because of the difficulty in predicting the population of a specific age group who make the intercity trips, the total population is considered for the forecasting.

There can be many measures of income to include in the model: Income distribution, median income, or mean income. Using income groups for the model will be quite unpredictable for forecasting. To forecast median income would require either a projection of the total income distribution or a model relating the median income to some other, more readily forecast, variable such as aggregate personal income. This model will definitely lead to a more complex model than using a mean income. Income can be measured at different scales such as the income of a person, household, or city. The population and annual income per capita are taken in the log scale for the model.

To develop the model, the total travel time is considered for one way. In this study, total travel time considers access time, transfer time, waiting time, trunk route journey time, and egress time. The access and egress travel times for the three cities are mentioned in Table 2.

Table 2 Access and egress travel time in minutes

7 Minutes of Walking Time Has Been Considered Based on the 1.2 m/s Pedestrian Speed

Waiting time at different terminal stations has been taken from the feasibility report of national high-speed rail. A waiting time of 24 min is considered for the railway stations. A transfer time of 5 min is considered for the study.

The variation in trunk route journey travel time in minutes is shown in Fig. 6 for the past 11 years. It shows that over the years the travel time is decreased for Surat to Ahmedabad more than the other sections.

Fig. 6
A line graph of travel time versus year. Three lines for S T A D I, S T B R C, and A D I B R C present the variation of travel time in minutes of Railways between 2008 and 2020. For the S T A D I section, the travel time decreases most rapidly.

Variation of travel time (minutes) for rail corridors

In this study, the travel cost is considered as access travel cost, trunk route travel, and egress travel cost. Access and egress travel cost has been estimated from the fuel prices in the city. The trunk route travel cost is available through secondary data, and it is given in terms of average travel cost per passenger. It was found that the travel cost has increased drastically for the railways over the years.

As per the transit capacity quality of the service manual, the service frequency was consistently reported as the top factor influencing overall trip satisfaction. It is also taken from the secondary data provided. This shows that over the years, the frequency has increased with increased demand.

8 Passenger Demand Modelling

Direct Demand Models have been constructed for the different corridors by using the five variables as explanatory variables and the passenger demand as the response variable. The following types of direct demand models have been constructed:

  1. 1.

    Railway models without access and egress consideration for all three sections (RPDM—XEA)

  2. 2.

    Railway AC models without access and egress consideration for all three sections (RPDM AC—XEA)

  3. 3.

    Railway Non-AC models without access and egress consideration for all three sections (RPDM NAC—XEA)

  4. 4.

    Railway AC models with access and egress consideration for all three sections (RPDM AC—WEA)

  5. 5.

    Railway Non-AC models with access and egress consideration for all three sections (RPDM NAC—WEA).

8.1 RPDM—XEA

Table 3 shows the coefficients for every explanatory variable used in the model. The coefficients for the product of population, product of income per capita, average travel cost, and annual frequency are positive because of having a positive correlation with annual passenger demand, whereas the coefficient of travel time is negative, because of the negative correlation with annual passenger demand. The model has been constructed based on the log-linear model hence the coefficient itself shows the elasticity value of that variable.

Table 3 RPDM—XEA

For intercity travel, the elasticity value of travel cost is positive. Mathematically, it represents that by increasing the travel cost the travelling demand will get increased, which never justifies the behaviour of transit users. Generally, the user responds to the travel cost when it exceeds his travel expenditure limit. In the case of railways, there are 30 types of trains which are running with multiple classes with different travel costs. Hence, If the travel cost increases for the railways, then the passengers will shift to the other classes which will not show a decrease in demand.

Table 4 shows the residual analysis results for the above model.

Table 4 RPDM—XEA performance statistics

Likewise, the models for the AC class and Non-AC class were developed without considering access and egress time. And similar results were obtained.

8.2 Ridge Parameter

It was found that with the increase of the ridge parameter, there was a higher error observed in the model in terms of R2 value. The relationship between the R2 value and the ridge parameter is shown in Fig. 7. Hence, it is necessary to find the appropriate value of the ridge parameter for the optimum results of the model.

Fig. 7
A line graph of coefficient of determinations versus Ridge parameter. A line presents the decrease of the R square value with the Ridge parameter.

Variation of R2 value with the ridge parameter

8.3 RPDM AC—WEA

This section describes the direct demand model of railway passenger demand considering access and egress modes. Access and egress modes have been kept in 4 categories. Private car and 2W have been mentioned as Private mode, BRTS and City Bus are considered as Buses, Cab and Auto are categorised as Auto, and Walk has been taken as Non-motorised Transport. A total of 16 possible mode chains have been taken in the model as PTP, PTB, PTA, PTN, BTP, BTB, BTA, BTN, ATP, ATB, ATA, ATN, NTP, NTB, NTA, and NTN where P = Private Mode, T = Train, B = Bus, A = Auto, and N = Non-motorised Transport.

Hence, PTP indicates a mode chain for a passenger as follows:

First Alphabet: It denotes the access mode in the origin city.

Second Alphabet: It denotes the trunk route mode between the cities.

Third Alphabet: It denotes the egress mode in the destination city.

8.3.1 ST-ADI Rail Corridor

This model considers the access and egress modes with AC class in the trip and the regression coefficient for all explanatory variables used in the model which are shown in Table 5. These coefficients show the elasticity value in that particular variable.

Table 5 RPDM AC—WEA ST-ADI

The performance measures of this model are shown in Table 6.

Table 6 RPDM AC—WEA ST-ADI performance statistics

In this way, models for other corridors ST-BRC and ADI-BRC were developed. And similar results were found.

8.4 RPDM NAC—WEA

This model considers Non-AC class of railway and includes access and egress modes of travel. The results and performance measures are shown for each corridor considered in this study in the subsequent tables below.

8.4.1 ST-ADI Rail Corridor

Tables 7 and 8 are showing the model results in terms of coefficients of ridge regression and the performance measures.

Table 7 RPDM NAC—WEA ST-ADI
Table 8 RPDM NAC—WEA ST-ADI performance statistics

In a similar manner, models for other corridors were developed. And satisfactory results were obtained.

9 Sensitivity Analysis

The sensitivity analysis based on the elasticity values of the level of service parameters used in the model was carried out. The following Table 9 shows the percentage change in the passenger demand for railways with respect to the unit change of LOS parameters while keeping other factors constant.

Table 9 Railway passenger demand sensitivity analysis

The results show that the reduction in travel time shows higher passenger demand compared to a unit change in travel cost and annual frequency. Hence, passengers are more sensitive to travel time during the travel compared to the other two parameters.

10 Scenario Analysis

After the construction of the above models, attempts were made to check the future demand with the three scenarios with the different combinations of three parameters, i.e., travel time, travel cost, and annual frequency.

Three scenarios related to all LOS parameters were generated as Business As Usual (BAU), Scenario-1 (S-1), and Scenario-2 (S-2).

The different combinations of these parameters with different scenario analysis combinations are shown below.

  1. (1)

    (BAU_TT) and (BAU_TC) and (BAU_Fre)

  2. (2)

    (S1_TT) and (S1_TC)

  3. (3)

    (S1_TT) and (S1_Fre)

  4. (4)

    (S1_TC) and (S1_Fre)

  5. (5)

    (S1_TT) and (S1_TC) and (S1_Fre)

  6. (6)

    (S2_TT) and (S2_TC)

  7. 7)

    (S2_TT) and (S2_Fre)

  8. (8)

    (S2_TC) and (S2_Fre)

  9. (9)

    (S2_TT) and (S2_TC) and (S2_Fre)

where

BAU_TT = Business as Usual for Travel Time.

BAU_TC = Business as Usual for Travel Cost.

BAU_Fre = Business as Usual for Annual Frequency.

S1_TT = Scenario—1 for Travel Time.

S1_TC = Scenario—1 for Travel Cost.

S1_Fre = Scenario—1 for Annual Frequency.

S2_TT = Scenario—2 for Travel Time.

S2_TC = Scenario—2 for Travel Cost.

S2_Fre = Scenario—2 for Annual Frequency.

Business As Usual: This scenario shows a do-nothing case where only the historical trends are observed.

S1_TT: In this scenario, the speed of trains increases by 15% and 25% for the years 2025 and 2030, respectively.

S2_TT: Speed has been changed by 20% and 30% for the years 2025 and 2030, respectively.

S1_TC: (BAU-10%): This scenario considers there will not be any investment towards the new trains with new technology, and the travel cost will change due to the change in the source of energy with renewable energy so the cost might increase with decreased rate than the BAU.

S2_TC: (BAU + 10%): It considers new investment towards new technology trains with better comfort and convenience, with renewable energy sources hence increasing the cost of travel at a greater rate than BAU, considered for the years 2025 and 2030.

S1_Fre: (BAU-10%): Since the semi-high-speed train has been planned in the same corridor, it will attract passengers from railways as well as from buses hence these may experience an increase in frequency with a lower rate than BAU, considered for the years 2025 and 2030.

S2_Fre: (BAU + 10%): If some new technology train has been operated through Indian railways with an affordable price than semi-high-speed rail, then it might see some increase in demand, and hence increase in frequency with a greater rate than BAU, considered for the years 2025 and 2030.

Table 10 shows the different scenarios of LOS parameters for the years 2025 and 2030.

Table 10 Scenario analysis for RPD for the year 2025

Table 11 suggests that scenarios S1_TT and S1_TC and S1_TT and S1_Fre have resulted in almost the same percentage change in demand for all three study corridors. Similarly, S2_TT and S2_TC, S2_TT and S2_Fre, and S2_TT and S2_TC and S2_Fre have also resulted in almost equal percentage increase in demand.

Table 11 Change in passenger demand for the year 2025 based on scenario analysis

The similar analysis was carried out for the year 2030 and results are shown in Tables 12 and 13.

Table 12 Scenario analysis for RPD for the year 2030
Table 13 Change in passenger demand for the year 2030 based on scenario analysis

The above table suggests that scenario S1_TT and S1_TC and S1_TT and S1_Fre have resulted in almost the same percentage change in demand for all three study corridors. Similarly, S2_TT and S2_TC, S2_TT and S2_Fre, and S2_TT and S2_TC and S2_Fre have also resulted in almost equal percentage increase in demand. As most of the scenarios resulted in around a 100% increment, this indicates that in the next 10 years the passenger demand may get doubled.

11 Conclusion

Direct Demand Models have been constructed for the 5 different combinations with and without consideration of access and egress modes. Four different modes have been considered in the cities for access and egress to and from the intercity terminal in the cities. The elasticity values are found less than the unity for most of the cases which shows the inelastic nature of the railway passenger demand. As mentioned in previous studies that travel time and travel cost generally show negative elasticity values for the change in demand but in this study, it is found that travel cost is having positive elastic value. This indicates that there is no decrease in passenger demand with travel cost increased. This is justified by the thought that whenever the travel cost exceeds the travel expenditure of an individual, he will shift the class (AC to Non-AC) of the train journey rather than shifting to other modes.

Scenario Analysis has shown that in 2025, the passenger demand will get increased by around 40% whereas by the year 2030, this demand will be nearly doubled. It can be observed that scenarios S1_TT and S1_TC and S1_TT and S1_Fre have resulted in almost the same percentage change in demand for all three study corridors. Similarly, S2_TT and S2_TC, S2_TT and S2_Fre, and S2_TT and S2_TC and S2_Fre have also resulted in almost equal percentage increase in demand.