Introduction

To promote energy revolution, reform energy production, and utilization modes, and improve the energy utilization rate, China proposed to build a clean, low-carbon, safe, and highly effective modern energy system that focuses on the adjustment of the energy structure, construction of modern energy storage, and transportation networks, and an intelligent energy system in its Thirteenth Five-Year Plan. However, given the actual conditions within its vast territory, economic development and energy efficiency remain unbalanced across various provinces, and thus, it is essential to consider regional discrepancy when formulating energy conservation policies (Bian et al. 2017). To this effect, analyzing the effects of factors, spatial conditions, and the dynamic distribution of discrepancies among various provinces on energy efficiency can help reduce the energy efficiency gap, and accomplish the task of building a new modern energy system under the Thirteenth Five-Year Plan.

Since its accession to the WTO in 2001, China opened itself up to foreign investments. By 2014, it became the top destination for FDI, reaching an amount of US$1.28 trillion. However, at the same time, the country was faced with the disadvantages of a fast-growing economy, including increased consumption of resources and environmental deterioration; lowering of environmental regulations by local governments to attract pollutive enterprises; market segmentation derived from an officer promotion mechanism; and restricted flow of labor force, capital, and energy elements into markets. There is growing public demand for environment protection given the increasing pollution levels because of energy consumption, strength, and efficiency problems. Despite this, research on the effects of factor flow at the regional spatial level to other provinces remains rare.

The following research work is carried out in this paper. First, by adopting the super-efficiency slack-based model–data envelopment analysis (SBM–DEA) method (Tone 2002) that accounts for output, this study estimates the total factor energy efficiency of every Chinese province. Second, considering geographical spatial interactions, a measuring model for the spatial interaction strength of total factor energy efficiency is constructed. Third, against a possible spatial autocorrelation, the space Durbin model is used to quantitatively analyze the effects of industrial structure, governmental interference, and energy strength on the spatial interaction strength of energy efficiency. Third, based on spatial correlation, the factors of influence of industrial structure, government intervention, and energy intensity on spatial interaction of energy efficiency are analyzed. The main innovation of this paper is to construct a spatial dynamic model based on Wilson’s maximum entropy model to analyze the direct and indirect effects on the factors that affect energy efficiency.

Literature reviews

Following a century of development, there are three main types of energy efficiency definitions—energy production efficiency, energy strength, and total factor energy efficiency. Energy production efficiency is expressed by the ratio of gross domestic product (GDP) to one input factor derived from the productivity of a single factor in a department’s production function. Energy productivity is a single-factor productivity, which is usually expressed as a ratio of output (usually GDP) to an energy input factor. Energy strength is generally used to reflect the level of energy utilization efficiency; it is expressed by the ratio of energy input to output. However, these two definitions do not account for other input factors related to production. In fact, total factor energy efficiency considers not only energy input factors, but also other relevant factors. Hu and Wang (2006) proposed an estimation method for total energy efficiency that obtains maximum output under a restricted input factor level, and relatively sequences energy efficiency by comparing distances from the samples to the production frontier.

As for China’s regional energy efficiency, scholars have conducted extensive research, and derived conclusions with practical significance. They have studied energy efficiency and intensity improvements to explore the relationship between energy and economy (Alam et al. 2015). Schleich et al. (2009) stated that the EU emissions trading scheme could improve energy efficiency and reduce carbon emission through environmental regulations, such as restrictions on energy price. Many scholars highlight the close relationship between improved economic development and energy efficiency and strength (Hang and Tu 2007; Stern 2015). Technical efficiency in the industrial sector (Wang et al. 2012), technological progress, governmental intervention, and change in energy consumption structure (Lv et al. 2015; Yuxiang and Chen 2010; Ai et al. 2015) have a certain impact on energy efficiency changes.

Research on factors that influence regional energy efficiency with consideration for spatial interactions dates back to as early as the end of the nineteenth century. For instance, Ravenstein (1976) examined population migration on the basis of spatial feature rules. In the article Commodity Circulation in America, geographer Ullman (1957) explored spatial interactions using multidisciplinary theory models by referring to Ohlin (1934) and Stouffer (1940). In the context of China, some scholars demonstrated the spatial effect or dependency in provincial energy efficiency (Guan and Xu 2016; Ying-Zhi and Guan 2011). Lv et al. (2017) showed a significant spatial spillover effect in the east and west regions of China, suggesting that spatial factors should be introduced in research on variations in energy efficiency.

While numerous achievements have been made in research on China’s energy efficiency and factors that influence it, studies that consider spatial interaction strength and employ a spatial econometric model to analyze regional energy efficiency and influencing factors still have scope for improvement (Cheng et al. 2013; Guan and Xu 2016; Wang et al. 2014). Moreover, considering rapid information exchanges among provinces, and how such communication could result in a spatial spillover effect, energy efficiency studies that measure the strength of spatial interactions among provinces and influencing factors remain insufficient.

In recent years, scholars have also considered reducing carbon emissions in order to improve eco-efficiency from the perspective of the industry’s overall supply chain and region (Gunasekaran 2014; Gul et al. 2015; Govindan and Sivakumar 2016; Carvalho et al. 2017). Considering industrial agglomeration analysis, some scholars researched industrial energy efficiency from the perspective of a space Markov chain test of China’s energy efficiency convergence and the “club effect” (Costantini et al. 2017; Pan et al. 2015), and others focused on regional research, usually conducted by regions east, west, and center (Song et al. 2017; Chen et al. 2016; Lv et al. 2017), or traditionally, six major geographical locations, in order to explore variations in energy efficiency and its relationship with economic growth. Li and Wu (2016) divided regions on the basis of their political attributes.

The data envelopment analysis (DEA) model becomes an important technical means in energy efficiency research. It is based on a nonlinear mathematical programming method, different from the econometrics method. Rojas-Cardenas et al. (2017) compared and analyzed the difference in energy intensity between the graphite industry in Mexico and that in China and the USA. Lin and Liu (2017) analyzed the energy efficiency of China’s transportation sector using DEA, and truncated regression and its influencing factors. All of the explorations mentioned above have broadened the academic scope for other scholars.

Model

Estimation method for energy efficiency

DEA is a nonparametric method derived through solving a linear programming problem. It is widely used because it does not require many forms of data and production frontiers, evaluates every decision-making unit (DMU), and compares input and output factors. Other methods that evaluate total factor energy efficiency include the stochastic frontier analysis (SFA) and Solow residual method (SRM). The accuracy of total factor energy efficiency values is affected by that of the selection of a production function. The basic approach of a DEA is to estimate the nonparametric envelopment frontier of effective production to identify effective and ineffective points on the basis of whether these points are on the frontier line. Suppose there are DMUs; here, each province is treated as a DMU, and every province has K types of input factors to produce M output. The following linear programming is used to determine the efficiency value of the ith DMU:

$$ {\displaystyle \begin{array}{c}\operatorname{Min}\sum \limits_{\theta, \lambda}\theta \\ {}s.t.-{y}_i+ Y\lambda \ge 0,\\ {}\begin{array}{c}\theta {x}_i- X\lambda \ge 0\\ {}\lambda \ge 0\end{array}\end{array}} $$
(1)

where constant vector λ is the N × 1 order and scalar θ is the energy efficiency value. The evaluation rule is θ = 1, indicating that the point is on the production frontier, and technically effective. θ ≤ 1 denotes that the ineffective point is not on the production frontier and generally under the frontier.

Although the traditional DEA models that are based on constant returns to scale (such as CCR model) and variable returns to scale (such as BCC model) consider multiple inputs and outputs, they generally ignore the problem of slack variables in the inefficiency calculation because they adopt a viewpoint that rests on radial distance function. The super-efficiency SBM–DEA, which is based on Tone (2001, 2002), resolves this problem by conducting an evaluation from a nonradial viewpoint; moreover, it addresses the undesirable output problem in which “bad output” in economic activities and the slack variable problem are avoided, and sequences multiple DMUs when they are all effective. An increasing number of scholars have adopted the SBM model for performance evaluation considering the environmental pollution resulting from economic production activities (Guo and Sun 2013; Qian and Liu 2013; Zhang et al. 2015; Zuo and Yang 2011):

$$ {\displaystyle \begin{array}{l}{\rho}^{\ast }=\min \frac{1-\frac{1}{N}\sum \limits_{n=1}^N\frac{s_n^x}{x_{kn}^t}}{1+\frac{1}{M+I}\left(\sum \limits_{m=1}^M\frac{s_m^y}{y_{km}^t}+\sum \limits_{i=1}^L\frac{s_i^b}{b_{kt}^t}\right)}\\ {}s.t.\left\{\begin{array}{l}\sum \limits_{k=1,k\ne j}^K{z}_k^t{x}_{kn}^t+{s}_n^x={x}_{kn}^t,n=1,\cdots, N\\ {}\sum \limits_{k=1,k\ne j}^K{z}_k^t{x}_{kn}^t-{s}_m^y={y}_{km}^t,m=1,\cdots, M\\ {}\sum \limits_{k=1,k\ne j}^K{z}_k^t{b}_{kt}^t+{s}_t^b={b}_{ki}^t,i=1,\cdots, I\\ {}{z}_k^t\ge 0,{s}_n^x\ge 0,{s}_m^y\ge 0,{s}_i^b\ge 0,k=1,\cdots, K\end{array}\right.\end{array}}, $$
(2)

where \( \left({x}_{kn}^t,{y}_{kn}^t,{b}_{kn}^t\right) \) refers to input, desirable output, and undesirable output of a DMU at time t, and \( \left({s}_n^x,{s}_m^y,{b}_i^b\right) \) is their slack variable. This study uses the super-efficiency SBM–DEA model to calculate the total factor energy efficiency of Chinese provinces.

Spatial interaction model for regional energy efficiency

Spatial interaction is defined as population mobility, commodity circulation, industrial transfer, capital inflow, and material and information exchange among different provinces and regions (Long et al. 2016). According to its form and connotation, spatial interaction is divided into three types—convection, radiation, and transmission. Improvements in regional energy efficiency not only depend on economic level, geographic location, and technical and institutional factors, but also involve population, commodity, industrial, and technical innovation.

Research on Ravenstein’s gravity model finds that distance and spatial interaction are inseparable (Ravenstein 1976). Moreover, spatial interaction is related to regions’ sizes, degree to which economic activities are related, and social development level. Combined with universal gravitation theory, the gravity model is used to calculate the strength of regional spatial interaction, that is, \( {I}_{ij}={p}_i{p}_j/{D}_{ij}^b \), where p i and p j denote size of region, \( {D}_{ij}^b \) is the distance between regions i and j, and b is the friction coefficient used to calculate distance. Accordingly, a potential model is developed to examine the strength of interaction among multiple regions, and obtain the summation of I ij : \( M={\sum}_{j=1}^n{I}_{ij}=\sum {p}_i{p}_j/{D}_{ij}^b \); this process is called “potential.” These two models have similar limitations—they lack rigorous mathematical formula derivation, the friction coefficient is not fixed, and its selection is subjective, all resulting in errors. Therefore, this study adopts Wilson’s (1967) maximum entropy model to estimate the spatial interaction strength of energy efficiency for each province in China.

First, it assumes that the regional system is closed and satisfies the physics law of energy conservation. Then, when the entropy of the subsystem reaches maximum value, the macroscopic quantity will be stable, and the total system entropy will peak. In the long run, the instant flux will remain stable, and such a flux is called the strength of spatial interaction:

$$ {\displaystyle \begin{array}{c}\sum \limits_j{T}_{ij}={O}_i\kern12.30001em \\ {}\sum \limits_i{T}_{ij}={D}_j\kern7.399995em i,j=1,\cdots, n\kern0.5em \\ {}\sum \limits_{ij}{c}_{ij}{T}_{ij}=C\kern11.70001em \end{array}}\kern3.699999em , $$
(3)

where T ij is the strength of regional interaction. The total supply of region i is set to O i , and the total demand of region j is D j . Current cost is denoted by C. We obtain the extreme value using

$$ W\left({T}_{ij}\right)=\frac{T!}{\prod \limits_{ij}{T}_{ij}!}. $$
(4)

This mode is set up by using probability, as well as permutation and combination knowledge, which is different from the aforementioned gravity and potential models.

Prasad (2005) attempted to measure regional interaction influence using a spatial interaction model. Some other scholars combined Wilson’s model with commodity inflow and outflow in logistics theory to examine logistics spatial interaction. This study introduces a total factor energy efficiency index on the basis of a maximum entropy model. Regions with low energy efficiency often strive to bring in experiences, scientific technologies, and policy institutions of regions with high efficiency, in which case an appealing force tends to emerge. Thus, we put forward our first proposition.

Proposition 1

The appealing force between a supply region and a demand region denotes the total factor energy efficiency of each region.

Spatial damping factors, such as distance, communication, information, and degree of mutual exchange among different regions, also tend to differ. First, suppose 30 Chinese provinces are in a closed system. T ij is used to express spatial interaction strength based on total factor energy efficiency when the macroscopic quantity is maximum. The model is constructed as follows:

$$ {\displaystyle \begin{array}{c}\sum \limits_{i=1}^n{T}_{ij}={E}_{1i}\kern4.099998em i=1,\cdots n\\ {}\sum \limits_{j=1}^m{T}_{ij}={E}_{2j}\kern3.799999em j=1,\cdots m\kern0.1em \\ {}\sum \limits_{i=1}^n\sum \limits_{j=1}^m{c}_{ij}{T}_{ij}=C\kern6.399996em \end{array}}, $$
(5)

where E1i is the total supply of the ith region and E2j is the total demand of the jth region. The entropy of the subsystem in thermodynamics is \( {S}_j=-\sum \limits_{i=1}^N\frac{T_{ij}}{E_{1i}}\log \frac{T_{ij}}{E_{1i}} \). The Lagrange function is used to obtain the extreme value of this formula, and when using the point of this extreme value, the subsystem’s entropy and the macroscopic quantity of the closed system become stable. At this time, T ij  = K i E1i exp(−βc ij ) = K j E2j exp(−βc ij ), where β denotes the spatial damping function. Thus, we put forward our second proposition.

Proposition 2

When analyzing population diffusion and spatial interaction, the diffusion coefficient is \( D=\frac{h^2}{2T} \) and \( \beta =\sqrt{\frac{2T}{t_{\mathrm{max}}D}} \). Here, h is the region diameter, and T is the residence time of persons in a given region.

The following equation is used to estimate spatial interaction strength for multiple regions while accounting for energy efficiency factors under the maximum entropy model:

$$ {T}_i=\sum \limits_{j=1}^n{T}_{ij}={E}_{1i}{E}_{2j}\exp \left(-\sqrt{\frac{2T}{t_{\mathrm{max}}D}}{\gamma}_{ij}\right)\kern1.1em \left(i\ne j\right), $$
(6)

where D is the administrative area of 30 provinces; T is the quantity of regions with interaction; tmax is the quantity of regions with high energy efficiency value; E1i and E2j are the energy efficiency values of a region with supply i, and a region with demand j; and T i is the sum of spatial interactions among each province, and termed the potential value of total factor energy efficiency.

Spatial econometric model

Model introduction

The traditional econometric framework has been reformed with the development of spatial econometrics. In fact, the spatial spillover effect can be quantitatively analyzed on the basis of space, space and time, and geographical location considerations. In general, a data point in a spatial data sample expresses observed values related to regions, in which case the concept of “spatial dependence” is derived. This indicates that spatial correlation and time sequence correlation concurrently exist among regions.

Spatial autocorrelation analysis

When the values of variables for adjacent regions are similar, spatial autocorrelation may exist, which is also known as spatial dependence (Long et al. 2015). Introduced in 1950, Moran’s index I is a commonly used method to test for spatial autocorrelation. This study employs Moran’s index to explore the clustering effect of spatial interaction for each Chinese province, and then test the spatial correlation, the spatial interaction strength of the total factor energy efficiency for each province during 2000–2014. Stata 14.2 is used to calculate the value of Moran’s I.

$$ Mora{n}^{\prime }s\kern0.1em I=\sum \limits_{i=1}^n\sum \limits_{j=1}^n{\boldsymbol{W}}_{ij}\left({Y}_i-\overline{Y}\right)\kern0.2em \left({Y}_j-\overline{Y}\right)/{S}^2\sum \limits_{i=1}^n\sum \limits_{j=1}^n{\boldsymbol{W}}_{ij}, $$
(7)

where S2 is the sample variance of an observed value (total factor energy efficiency value); Y i , for the ith region; and \( {S}^2=\sum \limits_{i=1}^n\left({Y}_i-\overline{Y}\right)/n \), \( \overline{Y}=\sum \limits_{i=1}^n{Y}_i/n \). The value range for Moran’s I is [‐1, 1], where the negative value denotes a negative autocorrelation among regions, while a positive value refers to a positive autocorrelation. I > 0 means that high values are close to high values, and low values are near low values, and I < 0 denotes that high values are next to low values. The geographical distance spatial weight matrix is the [0, 1] matrix under the adjacent criterion, and is standardized as follows:

$$ W=\left[\begin{array}{cccc}{w}_{11}& {w}_{12}& \cdots & {w}_{1n}\\ {}{w}_{21}& {w}_{22}& \cdots & {w}_{2n}\\ {}\vdots & \vdots & \vdots & \vdots \\ {}{w}_{i1}& {w}_{i2}& \cdots & w{}_{ij}\end{array}\right], $$
(8)

i, j refer to regions i, j. When the two regions are adjacent and have a common boundary, w ij  = 1, and when there is no common boundary, w ij  = 0. Finally, statistical magnitude Z is used to perform a significance test of Moran’s I.

$$ Z(d)=\frac{Mora\kern0.2em {n}^{\prime }s\kern0.2em I-E(I)}{\sqrt{VAR(I)}}. $$
(9)

Spatial econometric analysis of factors influencing potential values of total factor energy efficiency

First, it is certain that a change in the potential value of total factor energy efficiency is caused by the combined action of multiple factors. Since it is difficult to list all influencing factors, drawing in the literature, this study focuses on industrial structure, FDI, transportation infrastructure, technical progress, and governmental intervention. To avoid multicollinearity and heteroscedasticity, this study takes the logarithm of each variable.

The spatial econometric model compensates for the deficiency in the traditional measurement model, that is, the latter does not account for spatial correlation, and includes a spatial function for each region. When setting up the model and analyzing the spatial effect, if the explained variable T i for province i is related to every explanatory variable of the province and independent variables for adjacent provinces, then

$$ T= X\beta + WX\delta +\varepsilon, $$
(10)

where WXδ is the influence of an independent variable for an adjacent province; δ is the influence coefficient; and T is the explained variable, the total factor energy efficiency potential value. When combined with a spatial autoregression model, we derive the space Durbin model:

$$ T=\rho \kern0.1em WT+ X\beta + WX\delta +\varepsilon . $$
(11)

According to the above theoretical basis, we set up the dynamic panel space Durbin model

$$ {T}_{it}=\tau {T}_{i,t-1}+\rho \kern0.1em {w}_i{T}_{it}+{X}_{it}\beta +{w}_i{X}_{it}\delta +{\mu}_i+{\gamma}_t+{\varepsilon}_{it}, $$
(12)

where Ti, t − 1 is the one-order lag for an explained variable, w i is row i in spatial weight matrix W, the spatial lag term of the energy efficiency potential value is w i X t δ, and the time effect is γ t .

The study follows the spatial dependence model (SDM) to observe the spatial effect of an adjacent province (LeSage and Pace 2008):

$$ {\displaystyle \begin{array}{l}{T}_{it}=\frac{X_{it}\beta +{w}_i{X}_{it}\delta }{\left({I}_n-\rho {w}_i\right)}+\frac{\mu_i+{\gamma}_t}{\left({I}_n-\rho {w}_i\right)}+\frac{\varepsilon_{it}}{\left({I}_n-\rho {w}_i\right)}\\ {}\kern1.4em =\sum \limits_{k=1}^5{H}_k(w){X}_{it}+L(w)\kern0.2em \left({\mu}_i+{\gamma}_t\right)+L(w){\varepsilon}_{it}\\ {}{H}_k(w)=L(w)\kern0.2em \left({I}_n\beta +{w}_i\delta \right)\\ {}L(w)={\left({I}_n-\rho {w}_i\right)}^{-1}={I}_n+\rho {w}_i+{\rho}^2{w_i}^2+{\rho}^3{w_i}^3+\cdots .\end{array}} $$
(13)

Changes in an independent variable for one province will indirectly affect those in dependent variables for other provinces. w i T it fits the difference in the potential values for total factor energy efficiency among provinces by capturing those for total factor energy efficiency of adjacent provinces, and given the independent variable feature level, w i X it . In other words, the spatial effect of each province is influenced by adjacent provinces. To measure the influence of change in an independent variable on T it , the partial derivative of period t, which corresponds to the kth explained variable for one region at a certain period point, is solved:

$$ \frac{\partial {T}_{it}}{\partial {X}_{it}}{H}_k{(w)}_{ij}. $$
(14)

Such an effect will simultaneously influence province i. The degree of reaction depends on the location of every province, the spatial weight matrix that reflects relationships among each province, and the spatial autocorrelation coefficient ρ and parameters μ i ,  δ,   γ t . The elements on the diagonals of matrix H k (w) represent a direct effect, while those not on the diagonals and mean values denote an indirect effect. Pace and LeSage (2006) presented a measurement method by defining the mean value of the ith row of matrix H k (w) as the accepted average total effect of this region. A simple method to measure the average direct effect is to obtain the mean value of elements on the diagonal. In this case, the difference in subtracting the average direct effect from the average total effect will be the average indirect effect.

$$ \mathrm{Average}\ \mathrm{direct}\ \mathrm{effect}:\overline{M}{(k)}_{direct}={H}_k{(w)}_{ii}/n={n}^{-1}{tkH}_k(w). $$
(15)
$$ \mathrm{Average}\ \mathrm{total}\ \mathrm{effect}:\overline{M}{(k)}_{total}={H}_k{(w)}_{ijsum}\left({\mu}_i+{\gamma}_t\right)/n\kern3.599999em i\ne j. $$
(16)
$$ \mathrm{Average}\ \mathrm{indirect}\ \mathrm{effect}:\overline{M}{(k)}_{indirect}=\overline{M}{(k)}_{total}-\overline{M}{(k)}_{direct} $$
(17)

In Eq. (16), ijsum is the row sum average value of elements that are not on the diagonals in matrix H k (w).

Empirical analysis

Data selection and variables

The Rawski (2001) study showed that, since the mid-1990s, China’s economic growth trend and energy consumption growth trend have been in contrast with each other. Sinton and Fridley (2000), Sinton (2001) explored the relationship between China’s economic growth, energy production, and actual output in this period, and stated that relevant statistical indicators were less credible. Thus, this study selects 2000 as the base year. Energy consumption data are released every other year, and relevant energy data are available till 2015 on the National Statistics Bureau’s official website. Considering data availability and completeness, this study uses relevant energy data for 30 provinces for 2000–2014. The following regions are excluded owing to missing data: Tibet, Hong Kong, Macau, and Taiwan. Missing data are fixed using the mean value, ratio, and interpolation methods. Data for Tianjin city’s transportation infrastructure are expressed in the total length of operational lines, as stated on the National Statistics Bureau’s official website.

For data on GDP output, the gross regional domestic product for each province and a GDP deflator are used. Relevant data for the constant price of each year are obtained using 2000 as the base year. For the labor force index, it is preferable to employ average labor time and labor productivity for each region. However, considering data availability, this study uses the number of employees at the end of each year, data for which are taken from the relevant year’s statistical yearbook for every province, which is half of the sum of last year’s and current year’s year-end employee number. For energy data, the total energy consumption for each province is chosen and converted into a unified unit, that is, 10,000 t of standard coal. Using relevant capital stock data updated to 2014, the perpetual inventory method is employed, that is, K it  = Kit − 1(1 − δ it ) + I it , where K it is the capital stock of the ith province in the tth year. Given space constraints, this study does not conduct a detailed estimation of the economic depreciation rate δ it , but assumes it to be 9.6%, drawing on Zhang et al. (2004). As for the selection of data for investment I it , the current system of national accounts provides a fixed asset price index and data on fixed asset investments for each province. The annual constant price for each fixed asset investment is calculated with 2000 as the base year. Accordingly, each year’s capital stock is estimated by incorporating a fixed asset price index in the formula. Undesirable output data include industrial solid waste emissions, industrial emissions, and industrial waste water discharge, and other major pollutant emission targets for each province during the last 15 years. Table 1 presents the statistical description indexes used to calculate energy efficiency from 2000 to 2014.

Table 1 Statistical description of input and output indexes for 30 provinces

Geographically, 30 provinces are divided into three locations according to traditions, respectively, for the eastern, central, and western regions (Table 2).

Table 2 Territory classification

In calculating the spatial interaction strength of regional total factor energy efficiency, T = 30 denotes the number of research objects, and tmax is the number of provinces whose mean value for the Malmquist–Luenberger (ML) index is greater than 1. The adjacent relationship is defined on the basis of whether a common boundary or interregional distance exists, and if great-circle/economic distance or transportation cost reflects travel time from one place to another. When deciding interregional distance, this study considers the significant development of railway and road systems, for example, G-series high-speed train lines, in cities with a population higher than 500,000, which will definitely reduce the time and cost of interregional exchanges of institutions, information, and industrial transfer, and increase spatial interaction strength. Considering data availability, the present analysis used minimum train arrival time for 30 provinces till May 30, 2017.

This study highlights five factors that influence the spatial interaction strength of total energy efficiency. First is industrial structure (IS). The related literature largely adopts the GDP of the secondary or tertiary industry. However, the energy consumption level tends to differ by industry; in the recent 2 years, the proportion of China’s tertiary industry has exceeded 50%. Thus, it would be one-sided to examine the industrial structure level for each province using the proportion of one industry. Employing an industrial upgrading coefficient method, which has been increasingly used to measure industrial structure in recent years, this study assumes the upgrade level of provincial industrial structure to be 1 × provincial proportion of secondary industry + 2 × provincial proportion of secondary industry +3 × provincial proportion of tertiary industry. The second influencing factor is transportation infrastructure (TI), expressed by the railway operating mileage of each province; the third is technical progress (TECH), which is replaced by transaction volume for each province’s technical market. The fourth is the degree of government interference (GI), denoted by the ratio of local general budget expenditure to gross regional domestic product; and finally, FDI in each province during the study period was collected and converted into the constant value of 2000. Data for these factors are obtained from the China Statistical Yearbook, China Statistical Yearbook on Energy, China City Statistical Yearbook, China Statistical Yearbook on Environment, and statistical yearbooks of every province from 2000 to 2015.

Total factor energy efficiency for each province

The values of total factor energy efficiency for the 30 Chinese provinces during 2000 and 2014 are obtained using MaxDEA 6.0. Table 3 presents the results for the efficiency change (EC) index, technology change (TC) index, and Malmquist–Luenberger index, as well as the mean values.

Table 3 Evaluation results for total factor energy efficiencies for 30 Chinese provinces (2000–2014)

Table 3 indicates the following trends for average total factor energy efficiency in China during 2000 and 2014. Ningxia, Inner Mongolia, and Qinghai report the highest total factor energy efficiency, of which Ningxia has the highest mean value of 1.196, with an increase of 19.6%. All three provinces are in northwest inland of China, where economic development lags, there are few industrial enterprises, and the environmental quality and energy consumption are better than those in developed areas. The ML index is bounded by 1, and its results indicate that total factor energy efficiencies for 19 provinces during the period examined are on the production frontier. The gaps in the ML index for mid-level provinces (i.e., those ranking 9–19) are small, and the maximum is 2.3%. In addition, Beijing, Liaoning, Anhui, Hainan, and Guangxi appear to be falling behind, of which Hainan’s factor energy efficiency is 78% and descended by 22%, which is significantly lower than those of other provinces (Table 4).

Table 4 ML index mean values for each Chinese region

Figure 1 illustrates that the growth in the average total factor energy efficiency of the western region is perennially higher than those of other regions and the entire nation, which is also consistent with the results in Table 4. However, prior to 2009, changes in total factor energy efficiency were not stable. The energy efficiency in the middle region fluctuated most closely to the national average, and the ML index was the most stable with a mean of 0.998. In the eastern region, the total factor productivity fluctuated greatly. It declined rapidly from 2000 to 2002, and reached a low level from 2002 to 2005, after which it rose to a stable state, with the exception of 2010–2011, when the total factor productivity retained stable status.

Fig. 1
figure 1

Change trend for total factor energy efficiency

Spatial interaction strength

Since the data for spatial interactions constantly change, whereas the present index is a static result, first, this study calculates each year’s value, and then compares them. This discussion is limited to the potential values of each province’s energy efficiency for specific years owing to space constraints. Table 5 presents the results.

Table 5 Potential energy efficiency values for each Chinese province (select years)

The table shows that the top ranking of Gansu, Inner Mongolia, Qinghai, and Ningxia did not fluctuate, and for mid-level provinces like Henan, Heilongjiang, Hubei, and Shandong, the fluctuation trends are the same. Provinces with a significant increasing trend in recent years are Tianjin (from 28th to 5th), Chongqing (from 17th to 8th), and Qinghai (from 14th to 4th). It is noteworthy that, in 2012, the decline in spatial interaction strength in Jiangsu Province (from 16th to 25th) and Anhui Province (from 6th to 27th) coincided with that in total factor energy efficiency change in the eastern region. In addition, the rankings for Heilongjiang, Liaoning, and Jilin marginally decreased, while the spatial interaction strength for Inner Mongolia increased by 141.3% to 113.36 in 2003 than 46.98 in 2002.

Spatial econometric model results

Spatial autocorrelation test

In this section, Stata 14.2 is used to calculate Moran’s index for the potential values of the total factor energy efficiencies in 30 provinces. Table 6 presents the results.

Table 6 Moran’s I for potential values of China’s provinces, 2000–2014

The results show that the values for Moran’s I in each year are greater than 0, indicating that the potential values of total factor energy efficiencies for each province have a spatial positive correlation. The statistical values for Moran’s I during the 12-year research period ranged between 0.058 and 0.267, and were significant at the 10% level. Thus, we can conclude that the spatial interaction of total factor energy efficiency for all provinces is not completely random, but is in a positively related state of spatial dependence. In addition to local economic development, industrial restructuring, resource consumption, technological development, and increased environmental regulation strength in adjacent regions could affect the strength of total factor energy efficiency for all provinces, and intensify the spatial correlation.

Scatter diagram for local autocorrelation

On the basis of the aforementioned discussion, Fig. 2 presents a four-quadrant Moran’s I scatter diagram for the potential values of total factor energy efficiency for 30 provinces in select years.

Fig. 2
figure 2

Moran’s I scatter diagram for potential value of energy efficiency (select years)

The scatter diagram reveals that, in the selected 4 years, two thirds of the provinces are in the first and third quadrants, thus proving the existence of spatial positive correlation or the spatial dependence of the potential value for total factor energy efficiency among the provinces. Most provinces demonstrate a clustering effect on adjacent provinces, and the potential values for the remaining provinces are atypical.

In 2011 and 2014, Inner Mongolia, Gansu, and Ningxia are in the first quadrant, of which Inner Mongolia and Qinghai show an increasing trend to the first quadrant in 2008 and 2011. Inner Mongolia jumps from the fourth quadrant to the first quadrant in 2009. These four provinces also rank among the top four in terms of spatial interaction strength of total factor energy efficiency, indicating a high-high (H-H; high strength of energy efficiency integration and high spatial lag) positive autocorrelation clustering mode. Provinces with spatial interaction of high strength are surrounded by provinces with high spatial interaction strength.

Hebei, Shanxi, Liaoning, Jilin, Heilongjiang, Sichuan, Shanxi, and Xinjiang are located in the second quadrant, and report a clustering low-high (L-H; low strength of energy efficiency integration and high spatial lag) negative autocorrelation, of which Shanxi, Liaoning, and Xinjiang have low spatial interaction strength in terms of energy efficiency, and are surrounded by provinces with higher strength. Beijing, Shanghai, Jiangsu, Anhui, Zhejiang, Fujian, Jiangxi, Shandong, Henan, Hubei, Hunan, Guangdong, Guangxi, Hainan, Chongqing, and Yunnan lie within the third quadrant, and the potential values of total factor energy efficiency rank low. Most of these regions are located in the low-low quadrant and East Chinese region; they meet the condition that provinces with low spatial interaction strength are surrounded by provinces with higher strength. Tianjin is in the fourth quadrant, while Guizhou stretches across both the third and fourth quadrants.

Referencing Rey’s (2001) space time translation method, 17 provinces can be classified as HH→HH and LL→LL, and their transition is at the same level. The transition of Beijing Tianjin (LL→HH), Jilin and Shanxi (HH→HL), and Ningxia (HL→HH) is geographically of adjacent type. Inner Mongolia (LH→HH) and Hainan’s (LL→HL) transitions can be termed as relative displacement. Guizhou stretches across the two quadrants and thus belongs to an atypical type. Qinghai is in the second quadrant in 2005 and 2014, stretches across the first and fourth quadrants in 2011, and reaches the fourth quadrant in 2012 and 2014, rendering it an unstable region.

Solution of spatial econometric model

According to the above analysis, the explained variables in this study may have time inertia, that is, one region may become the “supply region” of demand in the early phase, and exert an appealing force on adjacent regions. Such an appealing force may persist throughout the current period. When using the model with fixed time and individual effects, to avoid endogeneity resulting from the first-order lag term of the dependent variable being the explained variable and the spatial spillover effect, this study adopts the space Durbin model for parameter estimation. Tables 7 and 8 present the results for the entire nation, as well as the east, west, and middle regions, calculated using Stata 14.2.

Table 7 Regression results for spatial dynamic panel Durbin model
Table 8 Breakdown table of spatial effect

From above regression results, we can draw the following conclusions. First, the one-order-lagged potential value for total factor energy efficiency is significant at the 1% level. The regression coefficient is positive, indicating that the previous year’s economic development level, environmental improvement, and policymaking among provinces had a significant positive effect on the current year’s energy efficiency potential value. Regression coefficient lncyjg for the spatial lag term of the explained variable is not significant as per the regression model. The study attempted to set an economic distance matrix (the value of every factor in the matrix is expressed by the product of the reciprocal of the square of the geographical distance between two regions and the ratio of local average GDP to national total GDP) to perform a regression on the space Durbin model, and derived a regression coefficient lnjtss of 0.039, and a p value of 0. This indicates that the spillover effect of the potential value for total factor energy efficiency for all provinces is significant when provinces have similar economic development levels.

Second, the industrial structure variable has a significant negative effect on the potential value of the regional and provincial total factor energy efficiency; it is significant at the 1% level. However, when testing the eastern, western, and mid-regions, the industrial structure level positively impacts the potential values of total factor energy efficiency across all regions, especially in the west. This further highlights China’s development strategies, industrial transfers in the mid- and western regions, and the advantages of the “One Belt One Road” initiative in these regions. The industrial structure coefficient in the eastern region is the lowest, indicating that the blind pursuit for industrial upgrading may not improve the potential value for energy efficiency because of the high economic development levels and complete industrial structure. This also explains the negative coefficient when considering industrial structure within the nationwide range.

Third, the effects of transport infrastructure and technical progress on the potential values of total factor energy efficiency were significant at the 1 and 10% levels. The construction of highways, railways, and civil aviation route networks will enable provinces to reach the maximum effect of interconnection. The costs of such links as technology exchange, population migration, commodity circulation, industrial transfer, and policy learning will also decrease in these provinces, directly enhancing the spatial interaction intensity of total factor energy efficiency. During the industrial transfer, eight provinces in the mid-region underwent an upgrading process from the primary to the secondary industry. Given a large proportion of industrial enterprises, industrial enterprise clustering can boost local economic development and potential value in the short term. However, in the long run, industrial enterprises are mostly pollutive. Thus, improvements in scientific and technological levels could improve productivity and reduce energy consumption. However, at the same time, they can reduce enterprises’ treatment cost, which is favorable to enterprises’ operating activities, and counters governmental environmental regulations. The west has the highest technical progress coefficient with a high significance level and p value of 0. This is because the west has been in a state of lag, and technological development in the region seems to have an instant effect.

Fourth, of the five independent variables, FDI and degree of government interference negatively affected the potential value of China’s total factor energy efficiency; they pass the 10% significant test. On the other hand, the mid- and western regions passed the significance test. At present, in addition to FDI, governmental fiscal expenditure in the western region also promotes GDP growth. This is also reflected in the values of independent variable coefficients. The coefficient of government interference is the second largest independent variable after industrial restructuring to affect the potential value of total factor energy efficiency in the west.

The regression coefficients for the space Durbin model are not the elastic coefficients of explained variables. In Eq. (14), H k (w) ij is used to measure the influence of change in independent variables on the observed value of the explained variable and that of the observed value on itself in the feedback circuit. The effects are divided into short- and long-term influences. The effect of explained variable T i caused by a change in the ith observed value for the kth explained variable is called the average direct effect, and the effects on other regions are called spatial spillover effect (average indirect effect). See Table 8 for details.

Fifth, judging from the nationwide range, the direct effects of other independent variables, except FDI, have contrary results in the short and long term. For example, the economic environmental benefits from industrial regulations may not have direct short-term effects; even areas with strict environmental regulations may compel industrial enterprises to exit the regions, resulting in the slowing down of economic development. However, in the long run, the optimization of proportions of various industries and restrictions on high energy-consuming and highly pollutive enterprises could positively stimulate the potential value of total factor energy efficiency in China. Another example is the degree of government interference. An increase in the general budge expenditure for local finance would increase local economic governance in the short term, promote technical progress, and improve the production efficiency of industrial efficiencies. However, long-term dependence of the local government on government expenditure will reduce productivity and economic growth, which, in turn, will hinder the potential value of total factor energy efficiency in the corresponding area.

Sixth, the spatial effect produced by the increase of FDI in the western region all passed the significance test at the 1% level, indicating high investment potential in western China. However, examining the spatial effect of FDI in the whole country shows results that are contrary to those of the western region, which needs further research (Table 8).

Finally, technical progress has the best spatial effect across all provinces. Irrespective of the direct or spillover effect, it has a positive influence on adjacent regions. Technical development, R&D input, and clean energy technology have a direct bearing on the environmental performance of each province. The value of the long-term average indirect effect, 0.128, is considerably higher than that of the short-term average indirect effect, 0.031. This indicates that, in addition to increased technological levels significantly improving the potential value of total factor energy efficiency in the short run, such an influence persists in both the relevant and adjacent areas.

Conclusions and implications

Conclusions

This study adopts a nonradial and slack variable perspective, and employs a super-efficiency SBM–DEA model to calculate the total factor energy efficiency of 30 Chinese provinces during 2000–2014. Since total factor energy efficiency has a spatial spillover effect, by referencing Wilson’s maximum entropy model, this analysis attempted to set up a model to calculate the strength of spatial interactions that are based on energy efficiency, which is also known as the potential value for total factor energy efficiency. After testing for the existence of the spatial positive autocorrelation of the potential value for total factor energy efficiency, a space Durbin model was set up to examine influencing factors, including industrial structure, energy strength, and technical progress.

For the period of 2000–2014, 16 provinces reported mean values for the ML index that were greater than 1. Provinces with front rankings are located in the Northwest China, where there are few industrial enterprises, and environmental pollution is low. According to the calculation results for the potential value of total factor energy efficiency, the abnormal fluctuation in the ML index during 2003 and 2012 tallied with those of rankings of the potential value for each province’s energy efficiency in terms of time and location. This study also adopted Moran’s I scatter diagram to observe the clustering effect of the potential value of all provinces’ total factor energy efficiency. It shows that the transitions of 22 provinces were of the same type. However, the industrial structure of the three major eastern, central, and western regions has a positive effect on the total factor energy efficiency potential of all regions.

Implications

Based on the empirical results in the article, the following implications can be drawn:

  1. 1.

    Optimize economic structure and improve energy efficiency. Currently, China still has low energy efficiency. In some areas, it is far away from the production front of DEA, and there is still substantial room for improvement in energy efficiency. According to the research in this article, the industrial structure and technological level have different impacts on energy efficiency. Therefore, accelerating the structural adjustment is conducive to China’s economic restructuring and energy efficiency.

  2. 2.

    Improve the technical level and promote energy conservation. One of the main reasons for the low energy efficiency in China is the excessive emission of pollutants. According to this research, the emission of unwanted pollutants from energy sources limits the energy efficiency improvement. Therefore, energy conservation and environmental governance need to be strengthened to enhance energy efficiency and achieve effective energy efficiency improvements.

  3. 3.

    Strengthen regional coordination and promote common development. According to the analysis in this paper, the energy efficiency in the western region is higher than that in the middle and eastern parts of the country, and there is a direct impact and spillover effect between regions. Therefore, it is necessary to strengthen the coordination of regional energy issues on environmental governance and give full play to the exemplary role of energy-efficient regions for energy efficiency improvement.