Introduction

Climate warming is a global issue, which has become a serious threat to human health and the environment of the world. In addition to the natural factors, global climate warming is closely related to the increase of carbon emissions produced by human activities. According to the report from the Intergovernmental Panel on Climate Change (IPCC), human activities and fuel burning, especially in cities, are the major sources of global carbon emissions (IPCC 2007). Previous studies showed that urban areas were estimated to consume 67 % of global energy and emit 71 % of carbon dioxide (CO2) worldwide (Agency (IEA) 2008). It should be a major task to reduce carbon emission from cities. Many scholars and policy makers have paid great attention on dealing with the impact of global climate change by investigating carbon emission mitigation strategies, including transforming the pattern of economic development, optimizing the energy structure, promoting technological progress, and developing a low-carbon economy (Guo et al. 2010). For example, Hou et al. (2011) suggested a long-term effective mechanism for China to boost energy-saving technologies and products, and to promote the economic development toward a low-carbon mode. Such strategies could contribute substantially to the reduction of carbon emissions.

In addition, during the last few decades, scholars have learned about the interactions between carbon emissions and urban forms. Urban forms can be defined as the spatial configuration of human activities (including the spatial pattern and density of land uses, and spatial design of transport and communication infrastructure), which can reflect economic, environmental, technological, and social processes at a certain time (Tsai 2005). The influence of urban forms on carbon emissions is inevitable and profound. Some researchers believe that smaller, more compact, and less disperse urban forms are associated with lower levels of carbon emissions (Anderson et al. 1996; Banister 1996; Dhakal 2009). Christen et al. (2011) found that as much as 50 % of CO2 emissions in cities are attributable to urban forms, land-use mix, building types, transportation networks, and vegetation. Kennedy et al. (2009) also analyzed the relationship between carbon emissions and land use, and found that, when constraints on land use were stricter, the level of carbon emissions of the residents living was lower.

The above studies indicate that carbon abatement must be achieved, not only through more efficient use of fuel and transformation of the pattern of economic development, but also through urban planning and spatial optimization. Thus, research on the relationships between urban forms and carbon emissions has become increasingly important. The demand for quantifying the carbon emissions in the atmosphere with urban morphology has likewise been raised by the scientific and policymaking communities. However, few studies have systematically examined the spatial–temporal interplay of urban forms with the specific aim of quantifying the carbon impacts of expanding urban areas (Alberti and Hutyra 2009; Heath et al. 2011; Lucy et al. 2011). For example, Glaeser and Kahn (2010) attempt to quantify carbon dioxide emissions associated with new construction in different locations across the country (Glaeser and Kahn 2010). However, they merely viewed emissions from the perspectives of driving, public transit, home heating, and household electricity usage, and ignore the impact of urban spatial structure. In another study on carbon emissions in the State of Louisiana, Shu and Lam (2011) presented a new method based on a multiple linear regression model to disaggregate traffic-related CO2 emission estimates from the parish-level scale to a 1 × 1 km2 grid scale (Shu and Lam 2011). However, the factors involved did not consider the patterns of land use, and uncertainty regarding the emission estimates at grid cell level remains because of the error and gaps in the original data. These studies on carbon emissions don’t well provide the explicit evidence of how urban land and transportation work together to affect carbon emissions.

Therefore, in order to deal with the existing problems, this paper attempts to quantify the relationships between carbon emissions and urban forms using panel data analysis. Panel data analysis is a regression method of studying observations from multiple entities over multiple periods, which has several advantages over conventional statistical analysis using only cross-sectional or time series data (Chen et al. 2011). For instance, panel data usually contain more degrees of freedom and more sample variabilities than cross-sectional data, which can improve the efficiency of estimates. In addition, the panel data model can reduce the effects of multicollinearity and estimate error. Moreover, panel data analysis can verify and measure some factors that cannot be recognized in pure cross-sectional and time series data models (Hsiao 2003; Baltagi 2005). In this study, four fastest-growing cities (i.e., Beijing, Shanghai, Tianjin, and Guangzhou) are selected as study areas. As the most important cities in China, these four cities suffer from a series of environmental problems, and their carbon emissions continue to grow due to the rapid development of their economies. Therefore, this study attempts to reveal the relationship between urban forms and carbon emissions for the panel of these four cities using the time series data from 1990 to 2010. Since urban form is primarily affected by urban land use and transportation (Anderson et al. 1996), we used the land use classification of remote sensing images and the digitization of transportation maps to obtain and analyze the spatial distribution of urban land use and transportation network. Then, a set of spatial metrics for quantifying urban land use and transportation network data is utilized. After considering the allocation factors together, we further implemented the statistical method of panel data analysis to estimate whether and to what extent the spatial patterns of urban forms are associated with carbon emissions. The purpose of this study is to create a better understanding on the issues that we are currently facing and to provide a reference for decision makers to prepare appropriate policies on developing a low carbon economy.

Study areas

Four cities (i.e., Beijing, Shanghai, Tianjin, and Guangzhou), located in eastern coastal areas of China (Fig. 1), are selected as the study area in this study. These four cities are the most developed metropolis areas in China, with a total population of 40.67 million and a total gross domestic product (GDP) of 5125.23 billion RMB in 2010. Beijing, the political, cultural, and educational capital of China, is located in the northern part of the North China Plain. It covers an area of 16,410 km2, with 14 urban and suburban districts and 2 rural counties. In 2010, it had a population of 19.62 million and a GDP of 1411.36 billion RMB. Shanghai is the largest city by population (over 23 million in 2010) in China. Located in the Yangtze Delta area, Shanghai is also a global city, serving as the most influential economic, financial, international trade, cultural, science and technology centre. Tianjin, which borders Beijing, is a metropolis in Northern China. It covers an area of 1,191 km2, with a population of approximately 13 million and a GDP of 922.45 billion RMB in 2010. Guangzhou, located in the Pearl River Delta, is Southern China’s largest city, with an area of 7434.40 km2, a population of 8.06 million, and a GDP of 1,074 billion RMB in 2010.

Fig. 1
figure 1

The location of research areas (Beijing, Tianjin, Shanghai, and Guangzhou)

These four cities consume a vast volume of natural resources in order to sustain economic growth. For example, the total area of urban land use in four cities was approximately 1954.40 km2 in 1990, but increased to 5714.84 km2 in 2010. The rapid urbanization process has not only led to the conversion of natural ecosystems, farmland, and water into urban areas, but also given rise to a series of environmental problems, especially the greenhouse effect and air pollution (Li and Liu 2008). Thus, to protect natural resources and improve environmental quality, it is necessary to identify the factors that influence the carbon emissions in these fast-growing regions. In this study, these four cities are chosen as the study area to determine whether there is a important relationship between urban forms and carbon emissions.

Data and method

Carbon emissions

In this study, the carbon emissions are described by carbon dioxide equivalent emissions, which referring to the carbon content of greenhouse gases that would have the same global warming potential. The carbon dioxide equivalent emissions are normally used when attributing aggregate emissions from the particular sources over a specified timeframe. Currently, it is very difficult to acquire precise data of carbon emissions. However, because carbon emissions are mainly released from the fossil energy consumption, it is a useful method to approximate the carbon emissions through energy-related statistical data (Bi et al. 2011; Chun et al. 2011). In this paper, we calculated the carbon emissions only from fossil energy consumption using a unified standard method recommended by the IPCC Guidelines (IPCC 2006):

$$ C = \sum\limits_{w = 1}^{W} {\alpha_{w} \beta_{w} E_{w} } $$
(1)

where w is the different categories of energy sources; W is the total number of energy sources; C represents the amount of carbon emissions; \( \alpha_{w} \) and \( \beta {}_{w} \) are the appropriate calorific value and carbon emission factor for each fuel type respectively, as shown in Table 1 (Dhakal 2009); E w is the amount of fossil energy w obtained from total primary energy supply of the original energy balance tables expressed in the physical units, and these tables are derived from the China Energy Statistical Yearbook and some statistics yearbooks of each city (Table 2). Specially, total primary energy supply contain some fossil energy consumption used to produce thermal power and heating. So the total carbon emissions we obtained would include the part of emissions from the electricity and heating consumption. In addition, the consumption of electricity and heating bought from other regions could also lead to some of carbon emissions. But this part of the emissions only account for a small proportion of total carbon emissions, so was ignored in this study.

Table 1 Calorific value and emission factors of fuel types
Table 2 the consumption of each energy obtained from the original energy balance tables of each city

As shown in Fig. 2, the estimated results of carbon emissions of each city are obtained from Eq. (1). In Fig. 2, we find that the carbon emissions of these cities from 1990 to 2010 show an increasing trend. Shanghai is the largest emitter, with its emission increasing from 12.85 million tons in 1990 to 62.87 million tons in 2010. In addition, the carbon emission of Guangzhou was lower than those of other three cities before 2005, but more than that of Beijing in 2010.

Fig. 2
figure 2

Carbon emissions in four cities for selected years. Notes because the statistics yearbooks don’t record any energy data of Guangzhou in 1990, here we estimate a value based on its economic and population for the data consistency in panel analytic

Urban land use and transportation data

In this study, a spatial distribution data of urban land use and transportation network is used to develop spatial metrics for quantifying urban forms.

The spatial distribution of urban land use is acquired from the land use classification of remote sensing images. Remote sensing is the acquisition of information on an object or phenomenon that can support detailed and accurate urban land mapping at different spatiotemporal scales. These data can be useful in acquiring the spatial distribution of urban land use through the land use classification (Herold et al. 2002). First, land use and cover of four cities are mapped by the use of cloud-free Landsat Thematic Mapper data with corresponding 30 m resolution for five time points: 1990, 1995, 2000, 2005, and 2010. After all images are geometrically corrected to Universal Transverse Mercator map projection system, land use classification of these images is then implemented using the object-oriented classification software Definiens Developer 7.0. The classification procedure involves the following steps: image segmentation, sample selection, feature optimization, and objects classification. In the process of image segmentation, the image is subdivided into objects consisting of similar pixels. Then, the image objects are selected manually as samples for each land use class. To maximize the distance between land use class and another, the next step is to determine a set of features using the tool of Feature Optimization in the software. Finally, according to the selected samples and features, nearest-neighbor classification is performed to achieve the classification results. Once the land use classification completed, the output images are further converted into two land categories (urban and non-urban) based on the differences in land use properties of each city and the research objective in urban forms analysis. As shown in Figs. 3, 4, 5 and 6, the final classification results provide an overview of urban land use for each city in 5 years.

Fig. 3
figure 3

The change of urban land use (a) and transportation network (b) in Beijing for 1990–2010

Fig. 4
figure 4

The change of urban land use (a) and transportation network (b) in Shanghai for 1990–2010

Fig. 5
figure 5

The change of urban land use (a) and transportation network (b) in Tianjin for 1990–2010

Fig. 6
figure 6

The change of urban land use (a) and transportation network (b) in Guangzhou for 1990–2010

To evaluate the reliability of the classification results, an accuracy assessment for the urban category is then performed using a method proposed by Pontius and Millones (2011). This method divides the disagreements between classification and reference into quantity disagreement and allocation disagreement, which is more helpful than Kappa indices just using a single ratio to represent the classification accuracy. The quantity disagreement and allocation disagreement can be derived from the following equations (Pontius and Millones 2011):

$$ p_{uv} = \left( {\frac{{n_{uv} }}{{\sum\limits_{v = 1}^{V} {n_{uv} } }}} \right)\left( {\frac{{N_{u} }}{{\sum\limits_{u = 1}^{V} {N_{u} } }}} \right) $$
(2)
$$ Q = \frac{1}{2}\sum\limits_{g = 1}^{V} {q_{g} } = \frac{1}{2}\sum\limits_{g = 1}^{V} {\left| {\left( {\sum\limits_{v = 1}^{V} {p_{ug} } } \right) - \left( {\sum\limits_{v = 1}^{V} {p_{gv} } } \right)} \right|} $$
(3)
$$ A = \frac{1}{2}\sum\limits_{g = 1}^{V} {a_{g} } = \frac{1}{2}\sum\limits_{g = 1}^{V} {2\hbox{min} \left[ {\left( {\sum\limits_{u = 1}^{V} {p_{ug} } } \right) - p_{gg} ,\left( {\sum\limits_{v = 1}^{V} {p_{gv} } } \right) - p_{gg} } \right]} $$
(4)

where V is the number of land use classes; n uv is the number of sample classified as u and referenced as v; N u is the population of land use class u; q g and a g are the quantity disagreement and the allocation disagreement of land use class g; p uv is the estimated proportion of the study area classified as u and referenced as v; Q and A are the overall quantity disagreement and the allocation disagreement, respectively.

We calculated the quantity and allocation disagreements for the accuracy of the urban category. In each image we constructed several sampling data and collected the samples’ reference information based on visual inspection of the raw images. Through comparing the classification result and reference information in each point, each sampling data was evaluated and summarized in an estimated population matrix. According to Eqs. (2), (3) and (4), the results of the overall quantity and allocation disagreements of the urban category in each image were listed in Table 3. From the table, the minimum of quantity disagreement is 1.17 %, while the maximum one is 5.01 %. The values of allocation disagreement range from 4.56 to 9.14 %. On the whole, the total disagreements are approximately 12 %.

Table 3 Results of the overall quantity disagreement (Q) and allocation disagreements (A) of the urban category in each image (%)

In addition, transportation network data with vector format are obtained from the digitization of the transportation maps of each city. However, it is difficult to acquire the complete transportation maps for each year due to the absence of establishment or update of traffic road database. Thus, we integrate the transportation maps and remote sensing images to extract main city roads. Firstly, we obtained the latest transportation maps from the transportation bureau of each city. After the overlaying of the latest transportation maps and Landsat TM images, we then modified digitally the transportation maps based on the visual inspection of the Landsat TM images. Such that, we could get the main roads of each city in the five time points. The digital results shown in Figs. 3, 4, 5 and 6 present the spatial distribution of the main roads of each city.

Spatial metrics for quantifying urban forms

Urban forms can affect the social, economic conditions and environment of the city (Camagni et al. 2002). Particularly, urban forms directly affect the travel behavior, which, in turn, affect carbon emissions (Cervero 1998). Therefore, in order to deal with the relationships between urban forms and carbon emissions, city planners need a deep understanding on the characteristics and rules of urban forms. Currently, the use of spatial metrics is very useful for representing urban forms (Galster et al. 2001; Holden and Norland 2005). Based on previous studies, spatial metrics can constitute critical independent measures of the urban socioeconomic landscape and can also be used for an improved representation of a variety of urban spatial characteristics (Geoghegan et al. 1997; Parker and Meretsky 2004). Thus, in this study, several spatial metrics for quantifying urban forms in different dimensions(i.e. size, shape, density) are chosen based on the published literatures about this theme (Seto and Fragkias 2005; Dietzel et al. 2005) and the characteristics of the study area. These spatial metrics include Total (urban) Class Area (CA), Number of Patches (NP), Edge Density (ED), Mean Perimeter-Area Ratio (PARA_MN), Percentage of Like Adjacencies (PLADJ), Patch Cohesion Index (COHESION), Largest Patch Index (LPI), Urban Road Density (RD), and Traffic Coupling Factor (CF).

CA represents the total area of all patches of the corresponding patch type, and here is an important measure in representing the expansion of urban land use. NP is equal to the number of urban patches, and is used to measure the extent of subdivision or fragmentation of urban land. ED is equal to the sum of the lengths of all edge segments involving the urban patch, divided by the total landscape area (i.e., a measure of the sprawl and shape of urban land use). PARA_MN is the average perimeter-to-area ratio of urban patches in the landscape, which can quantify urban landscape configuration in terms of the complexity of patch shape. PLADJ is a contagion index equal to the percentage of cell adjacencies involving urban patch that are similar adjacencies (Turner 1989). PLADJ can be used to represent the degree of aggregation of the urban patch. Regardless of the amount of landscape comprising urban land use, this index will be at a minimum if the urban patch is maximally dispersed and will be at a maximum if the urban patch is maximally contagious. COHESION is proportional to the area-weighted mean perimeter-to-area ratio divided by the area-weighted mean patch shape index. This index can be a measure of the physical connectedness of urban land, which would increase as the urban patch becomes more clumped or aggregated in its distribution. LPI quantifies the percentage of total landscape area comprised by the largest patch. In urban landscape, it can be used to describe the extent to which an urban area is characterized by a mononuclear pattern of development (Gustafson 1998). Table 4 shows a more detailed description including the specific mathematical equations of the metrics (McGarigal et al. 2002) used in this research.

Table 4 Spatial metrics of urban land use

The two metrics, RD and CF, are constructed to describe the interrelationship between urban spatial structure and traffic organization. Because of rapid economic growth, fast urbanization, and quick motorization, the urban transportation sector has currently become one of the sectors with the highest amount of carbon (Svensson et al. 2004). In addition, traffic organization change can play an important role in the development of urban spatial structure. But at the same time it is also affected by urbanization, which can provide the development space and objective necessity of urban traffic organization (Han and Liu 2009). Thus, the analysis of the influence of traffic organization and urban spatial structure on carbon emissions is very important. In this research, we used urban road density and traffic coupling factor to represent the interrelationship between traffic organization and urban spatial structure. Urban road density (RD) refers to the total length of the regional road network divided by the area of urban land use (not the total area of a city). Higher road density should generally imply a higher availability and connectivity of short alternative routes (Jenelius 2009). Traffic coupling factor (CF) is defined as a proportion of the total area of road effect buffer zones and the area of urban land use, indicating the extent of interaction in urban spatial structure and urban traffic organization. The higher the value of CF is, the more harmonious the development of urban forms is (Kenworthy and Laube 1996). The equations of RD and CF are presented as follows:

$$ RD = \frac{L}{{A_{urban} }} $$
(5)
$$ CF = \frac{{A_{buffer} }}{{A_{urban} }} $$
(6)

where L is the total length of the regional road network, A buffer represents the total area of road effect buffer zones, and A urban represents the area of urban land use. Specifically, because no criteria for determining buffer distance when conducting road effect buffer zones have been identified in previous studies (Li et al. 2004), we set the buffer distance as 500 m, which does not account for road width.

With the support of the data we need, some spatial metrics, such as CA, NP, ED, PARA_MN, PLADJ, COHESOIN and LPI, are computed individually for each city using the public domain spatial metrics program FRAGSTATS (McGarigal et al. 2002). And the value of the metrics RD and CF are obtained from the use of the spatial analyst and buffer analyst in ArcGIS. Finally, the specific calculation results, further used as the explanatory variables for carbon emissions in panel data analysis, are shown in Fig. 7.

Fig. 7
figure 7

Values of spatial metrics of four cities

Panel data analysis

Panel data analysis employed in this paper is to examine the relationships between carbon emissions and urban forms. Firstly, panel unit root tests and panel cointegration tests of the variables should be conducted to confirm the validity of the panel data model estimation before establishing the panel regression model. Next, all data must undergo natural logarithm transformation to avoid non-stationarity and heteroskedasticity phenomena in the time series variables (Hsiao 2003). Finally, the parameters of interacting variables are estimated by using the panel regression model.

Panel unit root tests

On account of the non-stationary nature of time series data, their stationary nature must be examined before panel data models are established. Panel unit root test has recently attracted attention because it is more powerful than the normal time series unit root. In general, the panel unit root test is based on the following autoregressive model:

$$ y_{it} = \rho_{i} y_{it - 1} + \delta_{i} X_{it} + \varepsilon_{it} $$
(7)

where i = 1, 2,…, N indicates entities observed in the time points t = 1, 2,…, T; X it represents exogenous variables in the model including any fixed effects or individual trends; δ i is the vector of regression parameters; ρ i is the autoregressive coefficients, and ε it is a stationary process. If ρ i  < 1, y i is said to be weakly trend-stationary. Conversely, if ρ i  = 1, then y i contains a unit root (Mahadevan and Asafu-Adjaye 2007).

One of the most widely applied tests within this field of research is the Levin et al. (2002) (LLC) test, which is improved on Eq. (7). The test is designed to examine the null hypothesis of a common unit root in the panel versus the alternative of stationarity when the cross-sectional units are independent of one another (Levin et al. 2002). In this paper, the LLC panel unit root test is applied to examine whether the data are difference-stationary or trend-stationary and to determine the number of unit roots at their level. We likewise check if any of the variables are integrated from the same order to satisfy the premise of cointegration test.

Panel cointegration tests

From the result of the unit root tests, if the variables are integrated of order one, then the next step is to utilize cointegration tests to analyze whether a long-run relationship exists among them. The analysis is conducted by applying Pedroni’s heterogeneous panel cointegration tests, which allow for cross-section interdependence with heterogeneous slope coefficients, fixed effects, and individual specific deterministic trends. Pedroni’s framework provides cointegration tests for both heterogeneous and homogenous panels with seven repressors based on seven residual-based statistics (Pedroni 1999).

In these seven statistics, the panel v-statistic, panel r-statistic, panel PP-statistic, and panel ADF t-statistic are based on pooling the residuals of the regression along the within-dimension of the panel. The other three (Group rho-Statistic, Group PP-Statistic and Group ADF-Statistic) are based on pooling the residuals of the regression along the between-dimension of the panel (Al-mulali and Binti Che Sab 2012). In both cases, the basic approach is to first estimate the hypothesized cointegrating relationship separately for each panel member, and then to pool the resulting residuals for conducting the panel tests. See Pedroni (1999) for details on these tests and the relevant critical values.

Panel data model

Generally, panel data analytic models can be categorized into three types: pooled regression model, variable intercepts and constant coefficients model, and variable intercepts and variable coefficients model (Baltagi 2005). The pooled regression model has constant coefficients, which are referred to as both intercepts and slopes. The form of the pooled regression model can be expressed as Eq. (8), in which a and b are constant coefficients. Equation (9) is the form of variable intercepts and constant coefficients model, which has constant slopes but intercepts that differ based on entity and/or time. The third model has differential intercepts and slopes that vary based on both entity and/or time shown as Eq. (10).

$$ y_{it} = a + bx_{it} + \varepsilon_{it} $$
(8)
$$ y_{it} = a_{i} + bx_{it} + \varepsilon_{it} $$
(9)
$$ y_{it} = a_{i} + b_{i} x_{it} + \varepsilon_{it} $$
(10)

where i and t represent the entities and time points; y it and x it are indices for the dependent variable and independent variable, respectively; and a i is specified as fixed effects or random effects. Similar to the specification of a i , b i can also be expressed as a fixed or random effect. Here, ε it is the error term.

Two main hypotheses are used to determine which specific model should be selected. Whether the hypothesis is accepted based on the result of F-test through comparing the residual sum of squares (RSS) of Eqs. (8), (9), and (10).

$$ H_{1} :b_{1} = b_{2} = \cdots = b_{N} $$
$$ F_{1} = \frac{{(S_{2} - S_{1} )/[(N - 1)k]}}{{{{S_{1} } \mathord{\left/ {\vphantom {{S_{1} } {(NT - N(k + 1))}}} \right. \kern-0pt} {(NT - N(k + 1))}}}}\sim F[(N - 1)k, \quad N(T - k - 1)] $$
(11)
$$ H_{2} :a_{1} = a_{2} = \cdots = a_{N} \quad b_{1} = b_{2} = \cdots = b_{N} $$
$$ F_{2} = \frac{{(S_{3} - S_{1} )/[(N - 1)(k + 1)]}}{{{{S_{1} } \mathord{\left/ {\vphantom {{S_{1} } {(NT - N(k + 1))}}} \right. \kern-0pt} {(NT - N(k + 1))}}}}\sim F[(N - 1)(k + 1), \quad N(T - k - 1)] $$
(12)

where F 1 is the statistic for H 1 that intercepts and coefficients are held constant over entities and time; F 2 is the statistic for H 2 that intercepts are variable and coefficients are constant; S 1, S 2, and S 3 are RSS for Eqs. (10), (9), and (8), respectively; and N, T, and k denote the number of entities, the number of time points, and the number of explanatory variables, respectively. The entities are specified as the cities in this study.

Given the confidence level and condition that T > k + 1, if F 2 is larger than or equal to the critical value, the hypothesis H 2 is accepted and Eq. (8) is selected; otherwise, the hypothesis H 1 must be tested. If F 1 is larger than or equal to the critical value, the hypothesis H 1 is then accepted, and Eq. (9) is selected; otherwise Eq. (10) is selected. Next, we need to decide if the fixed effects or random effects model should be used, which is determined by the Hausman test (Baltagi 1996). The Hausman test mainly asks whether the covariance estimator and GLS estimates of b (the common parameter) are obviously different (Hausman 1978). This statistic analysis is distributed asymptotically as a central χ 2 under the null hypothesis (Hsiao 2003).

Results and discussion

In this study, the statistical method of panel data analysis was implemented through the statistical software EViews. Originally developed and distributed by Quantitative Micro Software (QMS), EViews offers innovative solutions for econometric analysis, forecasting, and simulation. Therefore, we estimated the relationships between carbon emission and urban forms with the EViews testing. The analysis procedure and experiment results are described below.

Panel unit root analysis

Table 5 reports the results for the LLC panel unit tests. As the table shows, with exception of urban road density and traffic coupling factor, the null hypothesis of non-stationary is rejected at the 10 % significance for the levels of other variables. When the first differences are taken, the null hypothesis of non-stationary is rejected for all the variables. Therefore, we can conclude that all the variables are non-stationary and integrated at an order of one. Based on these results, carbon emissions and other variables are tested to find whether there is a long-run relationship between them.

Table 5 Results of LLC panel unit root tests

Results of panel cointegration

According to the above results, seven cointegration statistics are calculated to test the long-run relationship among these variables. The results of panel cointegration tests between carbon emissions and other variables are displayed in Table 6. As Table 6 shows, the panel rho-statistic and group rho-statistic of these variables accept the null hypothesis of no cointegration; however, the other five statistics reject this hypothesis. In addition, the significant levels are different among these variables. Considering the Orsal’s findings that the panel ADF-statistic performs better than the other three within-dimension-based statistics and three group-mean statistics through a comparison of the relative performance of Pedroni’s the test statistic (Orsal 2007), we based our conclusions primarily on the panel ADF-statistics, which shows that the nullity of non-cointegration of the variables is rejected at the 5 % significance level (see the bold numbers in Table 6). This indicates that a long-term equilibrium relationship exists between carbon emissions and other variables.

Table 6 Testing for bivariate cointegration between carbon emissions and other variables

Parameter estimations of the panel model

Since there is a relationship between carbon emissions and other variables, we establish panel regression models for these variables to estimate influences of patterns of urban land use and transportation on carbon emissions. Given the condition that T > k + 1 and T = 5, the maximum value of k is 3, which implies that a regression model have, at most, three explanatory variables. The explanatory variables are separated into three regression models in order to analyze the relationship of carbon emissions and urban forms properly. The combinations of explanatory variables are organized as (1) CA, PLADJ, and COHESION; (2) NP, CF, and LPI; and (3) RD, ED, and PARA_MN.

Next, F-tests are performed to determine which specific regression form should be used for these three models. The results of the F-tests are listed in Table 7. For Model 1, given the significance level of 5 %, F 2 is equal to 6.0077, which is greater than F(12,4). Thus, hypothesis H 2 is rejected. In addition, F 1 is greater than F(9,4), given the same significance level; thus, hypothesis H 1 is accepted, meaning that Model 1 should adopt Eq. (9). The F-test results for the other two models are similar to that of Model 1; thus, Eq. (9) is also used for Models 2 and 3. Table 8 presents the results of the Hausman test for the three models. The probability P values of the three models are less than the critical value at the 5 % level of significance, indicating that it should use the fixed-effect model, rather than the random effects model. Then, generalized least squares (GLS) regression is likewise employed for the above three models.

Table 7 F-test results for Model 1, 2 and 3
Table 8 Hausman test results

Table 9 displays the coefficients estimated from panel data analysis, which demonstrate a clear relationship between carbon emissions and urban forms. As expected, Table 9 shows that CA is positively correlated with carbon emissions. Thanks to the market-oriented reform and fast economic development, the urbanization in these four cities is accelerating during the study period. The expansion of urban areas has led, not only to changes in the structure and composition of agriculture and forest, but also to decreases in vegetation carbon storage. Rapid urbanization in these four cities has also caused the fast growth of population, such that daily living, traveling, and working of the population create a great demand for energy and lead to large quantities of carbon emissions. Moreover, as an important industry in these regions, manufacturing has to consume huge land resource and increase the amount of carbon emissions for the industrial production. Thus, unsurprisingly, the rapid growth of urban areas has brought about a corresponding increase in carbon emissions.

Table 9 Coefficients estimated from panel data analysis

The estimation results show that the variables NP, ED, and PRAR_MN also have significant positive impacts on carbon emissions. NP is used to reflect the scattering of the spatial pattern of urban land use. The higher the value of NP is, the more scattered the spatial pattern of urban land use is. The variables ED and PRAR_MN measure the regularity of the shape of urban patches. The shape of urban land use becomes more irregular if the values of ED and PARA_MN increase under the restriction of the same amount of area for urban patch. Accordingly, the estimation results on the variables NP, ED, and PRAR_MN indicate that a more scattered or irregular pattern of urban land use will give rise to more carbon emissions. The main reason for this phenomenon may be the increase in potential transportation requirements when activities are distributed in many different urban patches. For example, in Beijing or Shanghai, many newly built residential areas are distributed in distant suburbs where the living environment is better than that in the central urban area. Thus, the scattered or irregular pattern of residential areas lead to recurrent long movements of people from residences to their places of work (Muller 2004). As a consequence, a higher level of person transport is incurred to meet the need of economic activities, which results in higher consumption of energy and increases more carbon emissions.

We further find that urban sprawl with an aggregated and continuous pattern is conducive to the reduction of carbon emissions based on the estimation results that the variables PLADJ and COHESION are negatively correlated with carbon emissions. As discussed above, the variables PLADJ and COHESION are measures of the aggregation and connectedness of urban land, respectively. The lower the values both PLADJ and COHESION are, the more compact the development pattern of urban land is. Currently, a number of studies have stated that a compact urban structure is highly beneficial for sustainable development because, for instance, of the lesser need for car transport, increased accessibility, reuse of existing infrastructure, preservation of green areas outside the cities, and regeneration of urban areas (Gordon and Richardson 1997; Van Der Waals 2000; Thinh et al. 2002). These benefits contribute substantially to the decrease in carbon emissions. Thus, the estimation results suggest that a compact urban structure would help reduce carbon emissions.

Notably, the variable LPI has significant positive impact on carbon emissions. Contrary to previous research conclusions by Chen et al. (2011), this estimation result indicates that increasing the percentage of urban landscape accounted by the urban core (largest urban patch) can lead to an increase in carbon emissions. The difference of urbanization level between the different study areas could be a reason why this study reached a different conclusion from Chen. But most importantly, the research of Chen ignored the core influence of traffic congestion in a city. Certainly, we cannot deny that a larger urban core can provide more functions of economic activities. However, this can also lead to an increase in traffic. For example, according to the statistical yearbooks, the total number of vehicles of the four cities is about 4.49 million in 2000, but increased to 11.57 million in 2010. A large number of people driving to the urban center for work, study, or shopping could easily result in traffic congestion due to insufficient road resources. Traffic congestion, characterized by slower speeds, longer trip times, and increased vehicular queuing, not only raises fuel consumption, but also increases exhaust emissions (Ang 1990; Baranovskii et al. 1995). Besides, the research of Makidoa et al. (2012) has endorsed the view that too dense settlements in mono-centric form may lead to greater per capita CO2 emissions. Therefore, to some degree, the urban form development with a mononuclear pattern can increase the amount of carbon emissions.

The estimation results of urban road density and traffic coupling factor for carbon emissions support the above analysis. The variables RD and CF both have significant negative effects on carbon emissions, indicating that higher RD and CF values can result in lower carbon emissions. Obviously, improvements in urban road density can effectively raise the flow speed of vehicles and reduce traffic congestion, which could improve the efficiency of fuel consumption of vehicles and reduce the carbon emissions. However, if a new road is constructed far from the functional sites of economic activities, the value of RD metric may be increase. But this construction could also result in a waste of energy and resources, instead of the reduction of carbon emissions, because of the increase in travel time. Thus, the coupling degree of urban spatial structure and traffic organization is a very important factor in carbon emissions. Specifically, the coupling development of urban spatial structure and traffic organization can enable traffic to flow freely and improve accessibility to a transport node or agglomeration of economic activities (Kenworthy and Laube 1996). Thus, improvements in urban road density and the coupling degree of urban spatial structure and traffic organization could be a particularly effective method of reducing traffic congestion, as well as carbon emissions.

At present, there have already existed some researches on the energy consumption or carbon emissions from cities (Huang et al. 2013). But researches on quantifying the relationship between carbon emissions and urban forms using panel data analysis, especially for Chinese cities, have been still little reported so far. One recent study used somewhat similar approach to estimate the impacts of urban forms on energy consumption in five cities of Pearl River Delta (Chen et al. 2011). Its analysis showed the same views that the growth of the urban size, fragmentation, and irregularity of urban land use patterns can lead to increased energy consumption. However, one conclusion that increasing the percentage of urban landscape accounted by the urban core can help reduce the energy consumption in their study is different from ours. In the results of our study, the urban form development with a mononuclear pattern may increase the amount of carbon emissions because of the core influence of traffic congestion in the four large cities. Besides, we also confirmed this view by the estimation results of urban road density and traffic coupling factor for carbon emissions, and suggested urban forms in China should transform from the pattern of disperse, single-nuclei development to the pattern of compact, multiple-nuclei development. Thus, these results of our study are credible and representative, which is not presented in the previous researches.

In fact, the study has certain limitations that remain to be fixed. For example, industrial activities are usually the major sources of carbon emissions in Chinese cities, but less affected by the urban forms. So it is a limitation when analyzing the relationship between city’s total emissions and urban forms. Considering the difficulty in classifying accurately the industrial land through Landsat TM images with a 30 m resolution and collecting land use data on earlier years, in this study we could also not figure out the impact of spatial distribution of industrial activities on carbon emissions. Furthermore, because of the difference of urbanization in different regions, the results of the empirical results do not represent the common mechanism and need thorough examination. Thus, in the future study, we will anticipate the collection of more finer land use data and focus on building models in order to provide general understandings about the relationship between urban forms and carbon emissions.

Conclusions and policy implications

In response to global climate change, carbon emission mitigation strategies have been studied and formulated from social and economic perspectives. Increasing attention has been given to the effects on carbon emissions caused by different urban forms. However, quantifying the carbon impacts of urban forms accurately and systematically remains relatively unexplored. Therefore, this paper attempts to quantify the relationships between urban forms and carbon emissions empirically for the panel of four cities (i.e., Beijing, Tianjin, Shanghai, and Guangzhou) using the time series data for four time intervals.

In this study, panel data analysis was implemented to estimate the relationships between urban forms and carbon emissions after several selected spatial metrics for quantifying urban forms were obtained from the spatial distribution data of urban land use and transportation network. Panel unit root test and panel cointegration analysis of the spatial metrics variables were conducted before panel data models were established. These test results support the view that all panel variables are non-stationary and integrated of order one. The results further demonstrate that a long-term equilibrium relationship exists between these spatial metrics and carbon emissions.

Additionally parameter estimations of the panel data model reveal that the individual variable coefficients have important but different impacts on carbon emissions. Urban expansion (high CA) inevitably leads to increased carbon emissions because of the consumption of resources and the accretion of population in rapid urbanization. Fragmented (high NP) or irregularly shaped (high ED and PARA_MN) patterns of urban land use likewise result in more carbon emissions. Conversely, we found an urban sprawl that has an aggregated and continuous pattern (high PLADJ and COHESION) would be conducive to the reduction of carbon emissions. Increases in both urban road density (RD) and traffic coupling factor (CF) also promote the reduction of carbon emissions. However, because it can easily cause traffic congestion, urban form development with a mononuclear pattern (high LPI) may accelerate carbon emissions. Therefore, in the foundation of compact development, the urban forms should transform from the pattern of single-nuclei development to that of multiple-nuclei development.

From the analytical results, we can conclude that planning for future development should consider the effects of different urban forms to reduce carbon emissions. Currently, China is facing many serious issues on ecology and environment resulting from its rapid economic growth and urbanization process. Maintaining a balance between sustainable development and cutting down total carbon emissions remain important challenges for policy makers. One direct solution to reduce carbon emissions is the reduction of energy consumption. However, such a measure could have a negative impact on economic growth. As shown in this paper, constructing an ideal urban form through urban planning and spatial optimization is a critically important method of handling the carbon emissions problem. Thus, the planning of future development in China should take into account the effects of urban forms, and make the urban pattern more compact rather than disperse. Meanwhile, it suggest that urban forms should be a spatial pattern of the spool thread with poly-centers. From the above, the findings obtained in this study could provide important decision support in building China’s low-carbon society.