Abstract
In the international literature, several studies have analyzed the impact of HSR on tourists' behavior with qualitative and quantitative approaches. However, they have not been able to solve the problem of capturing the spatial and temporal variation by fitting a regression model at a local point. The spatial heterogeneity within local models, such as Geographically Weighted Regression (GWR) models, provides a better platform allowing exploring the different spatial relationships between HSR and tourism. In this chapter, a spatio-temporal analysis has been proposed to evaluate the variables affecting tourists ‘choices, specifically the impact of HSR on both Chinese and Foreign tourists. Two advanced methods were adopted: firstly, we used the Weighted Regression with Poisson distribution (GWPR) modelling approach, which considers the problem of the temporal and spatial autocorrelation differently with respect to the Generalized Estimating Equations method. The results of this study support the use of the GWPR as a promising tool for tourism planning, especially because it makes it possible to model non-stationary spatially counting data. As far as the authors know, this methodology has never been applied in the international literature to this context. Secondly, we combined both temporal autocorrelation and spatial autocorrelation by applying models of Geographical and Temporal Weighted Regression (GTWR) types to take into account the local effects from the temporal point of view.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
The “second railway age” is brought about by significant advancements in transportation technology and the ongoing building of High-Speed Rail (HSR) [1, 2]. China's HSR network has expanded significantly during the past ten years, affecting the geographical organization of cities within the transportation system.
The longest HSR in the world is, in fact, in China (see Fig. 1). The total operating mileage of the national railroad in 2021 was higher than 150,000 km, including 40,000 km of HSR. Additionally, 2168 km of new mileage was built and put into service in 2021. From 28% in 2012 to 93% in 2021, the HSR network's coverage of cities with a population of more than 500,000 people increased. Except for Lhasa on the mainland, all province capital cities have been connected to the HSR. In August 2020, China National Railway Group Co., Ltd. issued and released the Outline of Railway Development Plan for A Powerful Transportation Country, defining the development blueprint of China Railway in the next 15 years and 30 years [3]. According to the plan, in 2035, China will have built about 200,000 km of the national railway network, including about 70,000 km of HSR. The HSRs mileage will be double-sized w.r.t. the year 2019, and HSR will serve all provincial cities and cities with a population of more than 500,000. The 3-h HSR circle is basically realized between the provincial capitals in adjacent regions (http://www.xinhuanet.com/fortune/2020-08/14/c_1126366741.htm).
The main focus of this study is the investigation of the elements influencing tourists’ decisions for the case study of thirty Chinese provinces, where the influence of HSR has been investigated. There are contributions in the literature that primarily focus on the accessibility and mobility impacts of HSR in China [4,5,6], as well as the influence of HSR on the growth of regional tourism [7] and foreign visitors [8]. Although extensive studies have examed the impact of HSR on tourism in China, the findings are not always consisistent, as they discovered significant effects in some instances [9, 10] and insignificant effects in others [11]. For example, Chen and Haynes [8] demonstrated that the Chinese provinces served by HSR experienced an increase in the number of foreign tourists of 20% and an increase in tourism revenue of almost 25%. In addition, Chen et al. [12] evaluated the spatial impacts of HSR on domestic tourism demand in China using spatial econometric analysis for the period 1999–2016. Their study confirmed that HSR has diverse spatial impacts on tourism output, with a particularly strong effect in the less developed west regions, moderate impact in the central region, and less significant in the developed east regions.
The effect of HSR on tourism in the Yangtze River Delta was analyzed by Taotao et al. [13]. The Yangtze River Delta's development in regional tourism demonstrated a “HSR effect,” and the demand and supply of tourism-related goods significantly improved. Yuhua and Jun [14] demonstrated how the HSR's introduction impacted the growth of tourism in the cities it served.
After studying Huangshan City's tourist spatial structure before, immediately after, and two years after the HSR's inauguration, Lei et al. [15] concluded that the HSR's inauguration had little to no effect on Huangshan City's tourism.
An additional investigation revealed that the expansion of the HSR network in the HSR also affected tourist flows and spatial relationships of the two cities, Beijing and Tianjin [16, 17].
Ziyang et al. [18] used Xiamen City as a case study and confirmed a relationship between HSR and tourism. According to Yongze et al. [19], the inauguration of the HSR had a substantial impact on encouraging regional tourism. Still, as the country moved from the east to the west, this influence gradually diminished.
The impact of high-speed rail on tourism development can be also predicted through random utility models (RUM). A study on their predictive capability in terms of market share was conducted by [20]. Travellers’ and transport users’ preferences can be inferred from different data sources, such as trip diaries or trajectories (e.g. [21]).
In this chapter, a spatio-temporal analysis has been adopted to evaluate the variables affecting tourists ‘choices and, specifically, the impact of HSR on both Chinese domestic and foreign tourists.
The two methods were adopted. The GWPR modelling approach was first chosen. The latter considers the problem of temporal and spatial autocorrelation in a different way with respect to the Generalized Estimating Equations method. Specifically, the results of this study support the use of the Geographically Weighted Regression with Poisson distribution (GWPR) as a useful tool for tourism planning, since it makes possible to model non-stationary spatially counting data. This methodology, as far as the authors know, has never been applied in the international literature to this context.
Secondly it takes into account a further analysis which combines both the temporal autocorrelation and spatial-autocorrelation by the application of models of Geographical and Temporal Weighted Regression (GTWR) types in order to take into account also the local effects from the temporal point of view.
The chapter is organized as follows. Section 2 deals with the description of the data set and the methodology. In Sect. 3, the results are reported. Conclusions and further perspectives are reported in Sect. 4.
2 Description of the Data Set and the Methodology
The dataset collected for this study contains information concerning thirty-four Chinese provinces. Hong Kong, Macao, Taiwan and Tibet are excluded from the dataset due to a data limitation. In total, the data covers the period from 2001–2019 (see Fig. 2).
As shown in Table 1, eight variables were adopted in this evaluation.
The impacts of HSR projects on tourism can be quantified in different ways.
In this study, the dependent variables take only non-negative integer values, the statistical treatment differs from that of the normally distributed one, which can assume any real value, positive or negative, integer or fractional. Count data can be modeled using different methods, the most popular is the Poisson distribution, which is applied to a wide range of transportation count data contexts. In a Poisson regression model, the probability of city i having \({y}_{it}\) number of tourist per year is given by [22,23,24]:
where \(P\left({y}_{i}\right)\) is the probability of city i having \({y}_{i}\) tourist per year and \({\lambda }_{i}\) is the Poisson parameter for city i, which is equal to the expected number of tourists per year at city i, E[\({y}_{i}\)]. The mean and the variance are given by E[\({y}_{i}\)]= \({\lambda }_{i}\) and V[\({y}_{i}\)] = \({\lambda }_{i}\). . Generalized Linear Models (GLMs) are considered the most suitable to determine the relationship between count data and the dependent variables. GLMs aim to extend ordinary regression models to non-normal response distributions [25, 26]. Furthermore, the data considered involve measurements over time for the same cities, to avoid the serial correlation seriously affecting the estimated parameters, leading to inappropriate statistical inferences. Therefore, the panel data regression models have been considered. Panel model analysis provides a general, flexible approach in these contexts, since it allows the modeling various correlation patterns. To consider these possible unknown correlations, an extension of GLMs, namely Generalized Estimating Equations (GEEs), has been considered. The relationship between the explanatory variables and the Poisson parameter is given by [27]:
where \({\beta }_{0}\) is the intercept, the \({\beta }_{p}\), i = 0,1,…,p, are the regression coefficients, ϕ is the parameter for the autoregressive component and \({u}_{it}\) is the error component model for the disturbances. The main problem is that the \({u}_{it}\) is auto-correlated with \({y}_{it-1}\). In order to fix this, the model is fitted by using population-averaged
Poisson models and by imposing an AR(1) process in the error term. These models are suitable when the random effects and their variances are not of inherent interest, as they allow for the correlation without explaining its origin. The aim is to estimate the average response over the population rather than the regression parameters that would enable the prediction of the effect of changing one or more components of the predictor variable on a given individual.
The parameters of this model are estimated by a backward elimination procedure, which considers all the variables in the model. At each step of the backward process, a variable is removed. The latter is the one assuming the largest p-value. The process ends when all the variables in the model have a p-value less than 0.05 or until there is no variable remaining [25].
The significance of each variable has been tested with the t-student statistic, therefore, a coefficient is significant when t is greater than 1.96.
Then, the Geographically Weighted Generalised Linear model (GWGL) was developed by integrating the GLM and the GWR ones and extending the concept of the Geographical Weighted Regression (GWR) models in the context of the Generalized Linear Models (GLM). Given that the dependent variables are count data with discrete and non-negative integer values, GWR models have been performed using the Poisson distribution error [28].
The Geographically Weighted Poisson Regression (GWPR) approach has been adopted [29] to capture the heterogeneity of the independent variables concerning each province. These models capture the spatial variation by fitting a regression model at each sample point. The result of this process is a set of local spatial parameters, described in Eq. (3).
where \(\left({u}_{j};{v}_{j}\right)\) are the coordinates of the different areas, \({\beta }_{jp}\) represents the regression coefficient for the independent variable \(p\) and \({x}_{jp}\) is the independent variable with \(p=1,\dots ,P\). The basic idea of the GWR is that the observed data next to point \(i\) has a higher influence on the estimation of \({\beta }_{j}({u}_{i})\)’s than the data located further away. A weighting function describes this influence. GWR tries to capture the spatial variation by adjusting a regression model to each point individually and using a distance function denominated kernel spatial function. Models have been estimated yearly to capture the variability over time and space.
Lastly, an extension of geographically weighted regression (GWR), geographical and temporal weighted regression (GTWR), is developed to account for local effects in both space and time [30]. The result of this process is a set of local spatial parameters, described in Eq. (4).
where similar to the GWPR model, \(\left({u}_{j};{v}_{j}\right)\) are the coordinates of the different areas with the addition of the term t to indicate the dependence on the dimension time, \({\beta }_{jp}\) represents the regression coefficient for the independent variable \(p\) and \({x}_{jp}\) is the variable value.
3 Results
The estimation results of GLM are reported in Tables 2 and 3. The independent variable, that is not statistically significant at the 0.5 level of significance were removed from the models.
The GLM models’ results show that both the number of domestic and foreign tourists are influenced by the presence of HSR stations, the presence of IntAirport, GDP, and the Number of Passengers.
The results of the GEE models, reported in Tables 4 and 5, confirm the results obtained by the GLM models. However, GEE models are more conservative than GLMs. A higher standard error of the GLM can be observed, moreover, also the value of the log-likelihood is lower, indicating a greater ability to estimate the models.
Starting from the statistically significant variables obtained from the GLM and GEE models, the GWPR models have been estimated for every year. An example of the results obtained, for the years 2001, 2010 and 2019 for Domestic tourists and Foreign tourists, is shown in Tables 1 and 2, respectively, where the minimum, the maximum, the first quartile, the median, the third quartile and the global values are reported (Tables 6 and 7).
In particular, for each year, it is possible to observe a variability of coefficients, indicating that the impact of this variable is not the same for each province and in the time. It generally refers to a diversified mixture of spatial events, which relates to the intensity of a spatial phenomenon.
Looking at the global value of the HSR coefficient, it is interesting to notice, in general, an increase from 2001 to 2019 for both Domestic and Foreign tourists. Observing the values of the HSR coefficient for the provinces of Beijing, Hainan, Hebei, Heilongjiang, Hubei, Qinghai, and Shandong, an increase is observed starting from 2013 (see Figs. 3 and 4).
The results of the three years have been presented, i.e. the 2013, 2015 and 2019 for the Chinese tourists. The results in the years before 2013 are reported since the coefficients are not significant.
Indeed, in Figs. 5 and 6, the weight that the coefficient of the variable HSR has on each province is reported, i.e. the objective is to demonstrate the effect of HSR of the neighboring provinces on their tourism.
It appears that central provinces, such as Hubei, Chongqing, Jiangxi and Hunan, have experienced more substantial impacts from HSR on domestic tourism, while the HSR's impact on international tourism is relatively evenly distributed among all the provinces.
The results of the GWTR essentially confirm the results of the GWPR models (Tables 8 and 9).
However, being more conservative than the GWPR models, a lower spatial and temporal variability is observed (Figs. 7, 8, 9 and 10).
4 Conclusion
In this chapter, we provided a spatio-temporal analysis using two advanced methods, the Weighted Regression with Poisson distribution (GWPR) and the Geographical and Temporal Weighted Regression (GTWR) model, to evaluate the spatial impact of HSR on tourist behavior. Using the Chinese HSR system as an example, our study confirms that the impact of HSR varies both spatially and temporally. The results suggest that the impacts of HSR on tourism in China have increased constantly since 2013. In terms of the spatial impacts, central provinces, such as Hubei, Chongqing, Jiangxi and Hunan, have experienced more substantial impacts from HSR on domestic tourism, while the HSR's impact on international tourism is quite relatively evenly distributed among all the provinces.
Overall, the study reveals some consistent patterns as Chen et al. [12], which adopted spatial econometric models. For instance, the impact of HSR on domestic tourism is found to be relatively strong in central provinces. Such a result suggests that HSR system tends to enhance the attractiveness of central regions and promote tourism due to improved regional accessibility and connectivity.
Future transport infrastructure project evaluation should consider adopting more advanced spatial modeling techniques, such as spatial weighted regression models and spatial econometric models, to capture both the spatial–temporal variation of impacts as well as the spatial dependence of impacts. Only a full understanding of the spatial and temporal impacts of the system may provide sound implications to guide future planning and policy decision-making.
References
Pagliara, F., Mauriello, F., Garofalo, A.: Exploring the interdependences between High Speed Rail systems and tourism: some evidence from Italy. Transp. Res. Part A Policy Pract. 106, 300–308 (2017)
Pagliara, F., Mauriello, F.: Modelling the impact of high speed rail on tourists with geographically weighted poisson regression. Transp. Res. Part A Policy Pract. 132, 780–790 (2020)
Yin, P., Pagliara, F., Wilson, A.: How does high-speed rail affect tourism? a case study of the capital region of China. Sustainability 11(2), 472 (2019)
Cao, J., Liu, X.C., Wang, Y., Li, Q.: Accessibility impacts of China’s high-speed rail network. J. Transp. Geogr. 28, 12–21 (2013)
Jiao, J., Wang, J., Jin, F.: Impacts of high-speed rail lines on the city network in China. J. Transp. Geogr. 60, 257–266 (2017)
Shaw, S.L., Fang, Z., Lu, S., Tao, R.: Impacts of high speed rail on railroad network accessibility in China. J. Transp. Geogr. 40, 112–122 (2014)
Wang, W.C., Chang, Y.J., Wang, H.C.: An application of the spatial autocorrelation method on the change of real estate prices in Taitung City. ISPRS Int. J. Geo Inf. 8(6), 249 (2019)
Chen, Z., Haynes, K.E.: Impact of high-speed rail on international tourism demand in China. Appl. Econ. Lett. 22(1), 57–60 (2015)
Degen, W., Jia, Q., Yu, N.: Spatial pattern and evolution of urban tourism field strength in China under high speed railway network. Acta Geogr. Sin. 71(10), 1784–1800 (2016)
Xin, W., Tongqiao, Z.: The influence of high speed railway network on the development and layout of China’s regional tourism industry. Econ. Geogr. 30(7), 1189–1194 (2010)
Feng, F., Linhao, C.: The opening of high-speed railway and the development of tourism in station cities: engine or corridor? Econ. Manag. 42(2), 175–191 (2020)
Chen, Z., Haynes, K.E., Zhou, Y., Dai, Z.: High speed rail and China’s new economic geography: Impact assessment from the regional science perspective. Edward Elgar Publishing, Chapter 6, pp. 148–168 (2019)
Taotao, D., Lei, Z., Mulan, M.: Research on the impact of Yangtze River delta high speed railway network on urban tourism development. Econ. Manag. 38(1), 137–146 (2016)
Yuhua, Z., Jun, C.: Heterogeneous impact of high speed rail on tourism development of station cities: a study based on double difference method. Tour. Sci. 32(6), 79–92 (2018)
Lei, L., Lin, L., Chenglin, M., Xiaolong, S.: Evolution of spatial structure of tourism flow in typical tourism cities in the era of high speed railway network: a case study of Huangshan City. Econ. Geogr. 39(5), 207–216 (2019)
Yin, P., Lin, Z., Prideaux, B.: The impact of high-speed railway on tourism spatial structures between two adjoining metropolitan cities in China: Beijing and Tianjin. J. Transp. Geogr. 80, 102495 (2019)
Yin, P., Tonghao, Z., Hanyan, Y.: Study on the change characteristics of urban tourism spatial function of Beijing Tianjin Hebei under the influence of high speed railway. Tour. Guide 4(1), 30–44 (2020)
Ziyang, W., Dan, L., Yuting, Z.: Research on the impact of high speed railway on tourism development based on grey correlation analysis: a case study of Xiamen City. Logist. Eng. Manage. 42(8), 163–165 (2020)
Yongze, Y., Yu, F., Haitao, Z.: The impact of the opening of high speed railway on the development of regional tourism. Res. Financ. 1, 31–38 (2020)
Tinessa, F., Papola, A., Marzano, V.: The importance of choosing appropriate random utility models in complex choice contexts. In: 5th IEEE International Conference on Models and Technologies for Intelligent Transportation Systems (MT-ITS), pp. 884–888, IEEE, Naples, Italy (2017)
Tinessa, F., Marzano, V., Papola, A., Montanino, M., Simonelli, F.: CoNL route choice model: numerical assessment on a real dataset of trajectories. In: 6th International Conference on Models and Technologies for Intelligent Transportation Systems (MT-ITS), pp. 1–10, IEEE, Cracow, Poland (2019)
Washington, S.P., Karlaftis, M.G., Mannering, F.: Statistical and Econometric Methods for Transportation Data Analysis. CRC Press, Boca Raton (2010)
Montella, A., Marzano, V., Mauriello, F., Vitillo, R., Fasanelli, R., Pernetti, M., Galante, F.: Development of macro-level safety performance functions in the city of Naples. Sustainability 11(7), 1871 (2019)
Cafiso, S., Montella, A., D’Agostino, C., Mauriello, F., Galante, F.: Crash modification functions for pavement surface condition and geometric design indicators. Accid. Anal. Prev. 149, 105887 (2021)
Jobson, J.D.: Applied Multivariate Data Analysis: Regression and Experimental Design. Springer, New York (1991)
Agresti, A.: Categorical Data Analysis. John Wiley, New Jersey (2002)
Fitzmaurice, G.M., Laird, N.M., Ware, J.H.: Applied Longitudinal Analysis, vol. 998. John Wiley & Sons (2012)
Fotheringham, A.S., Brunsdon, C., Charlton, M.: Geographically Weighted Regression: The Analysis of Spatially Varying Relationships. Wiley, Chichester (2002)
Nakaya, T., Fotheringham, A.S., Brunsdon, C., Charlton, M.: Geographically weighted Poisson regression for disease association mapping. Stat. Med. 24, 2695–2717 (2005)
Kopczewska, K.: Applied Spatial Statistics and Econometrics Data Analysis in R. Routledge (2021)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Mauriello, F., Chen, Z., Pagliara, F. (2023). A Geographically Weighted Poisson Regression Approach for Analyzing the Effect of High-Speed Rail on Tourism in China. In: Pagliara, F. (eds) Socioeconomic Impacts of High-Speed Rail Systems. IW-HSR 2022. Springer Proceedings in Business and Economics. Springer, Cham. https://doi.org/10.1007/978-3-031-26340-8_17
Download citation
DOI: https://doi.org/10.1007/978-3-031-26340-8_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-26339-2
Online ISBN: 978-3-031-26340-8
eBook Packages: Economics and FinanceEconomics and Finance (R0)