1 Introduction

In environmental economics study area, it is becoming a conventional view to treat industrial emissions as sorts of undesirable outputs (Färe et al. 2004; Chung et al. 1997). Such treatment leads to some analyses on multilateral industrial productive efficiency and the social well-being changes from “goods” of improvement and “bads” of distortion. Since the undesirable outputs generation depends on the level of input usage, the environmental efficiency index based on the standard axioms production theory can usually be applied to measure the environmental performance.Footnote 1

There are plenty of literatures have studied emission generating technologies over several decades.Footnote 2 However, these papers generalize the property of pollution generating technology on the positive relationship between emission and desirable output production. They also advocate even if the disposability of emissions is not free, at least these undesirable outputs should be disposed off in a proportionate to the desirable outputs (usually called “weak-disposability”) and there is no bad output generated if no desirable output produced (“null-jointness”).Footnote 3 The pollution generating technology with these properties can be referred as “weak-disposability approach”. In many empirical works, nonparametric or parametric specifications of weak-disposability approaches are widely employed for measuring the production efficiency, environmental efficiency, and shadow prices of pollutants generated et al.

Recently, Murty et al. (2012) indicate that the weak-disposability approach treats freely disposable input and weekly disposable and null-jointness outputs might lead to counterintuitive implications for trade-offs between inputs, desirable outputs and undesirable outputs.Footnote 4 To resolving the concerns raised on weak-disposability approach, Murty et al. (2012) propose a set of disposability properties for emission-generating technologies, which reflect the simultaneous play of the emission-generating mechanism of nature and the firm’s desirable production activities. They call this new approach the by-production (BP) approach to modelling emission generating technologies. In this approach, they distinguish explicitly between emission-causing inputs (like fossil fuels) and non-emission causing inputs (such as capital, labour, etc) and decompose the technology into two separated technology sets (1) a standard neo-classical (engineering) desirable production technology which satisfies free disposability with respect to all inputs and desirable outputs and (2) a nature’s emission generating mechanism that violates free disposability of emissions and emission-causing inputs. Rather, BP approach proposes that the latter set should satisfy costly disposability of emissions and emission-causing inputs. However, the paper to propose BP approach mainly focuses on the theoretical modeling and the capability test for the new theory is only according to the artificial data. Therefore, the empirical analyses for testifying applicability of BP approach based on the real data set are still necessary.

As a fast economic developing country, China’s environmental issues have caused the extensive concern as well as the Chinese government. Hence, the joint production and environmental efficiency measures identifying technologies and regional locations can provide benchmark purposes within limited resources, and have important implication for designing development and emission policies to promote economic and environmental performance in China. Recently, several literatures contributed to relevant China studies. Hu and Wang (2006) provided a total-factor energy efficiency measurement to analyse the efficiency of China’s regional energy usage, but their model only treats the energy as one of multiple inputs and regional GDP as desirable output without considering any emission as undesirable output. Some recent studies proposed the Data Envelopment Analysis (DEA) models to measure the China’s regional integrated economic and environmental efficiency, which attempt to decrease the undesirable outputs and increase desirable outputs simultaneously (Bian and Yang 2010; Wang et al. 2012, 2013). However, these models are all based on the “weak-disposability” technologies, which assume there exists a proportionally relation between desirable and undesirable outputs. Since some counterintuitive concerns arose in the “weak-disposability” assumption, the DEA efficiency models in this paper are based on the new BP approach and treat the desirable and undesirable outputs generation as two separated technology sets and measure each technical efficiency, respectively. In terms of efficiency models constructing, there are wide variations between two orientations: “input, desirable and undesirable orientation” and “undesirable orientation” (Nakano and Managi 2012). Weighted Russell Directional Distance (WRDD) models based on the former orientation are recently used in China’s environmental efficiency studies (Fujii et al. 2015; Cao et al. 2015). One of the strong points of this efficiency measurement is easily decomposing and detecting the each inefficiency level of input, desirable output and undesirable output individually. The main difference between our study and WRDD method is we separate the total input as non-emission causing inputs and emission causing inputs; the production efficiency measurement for desirable outputs with respect to all inputs and the environmental efficiency measurement for undesirable outputs with only respect to emission causing inputs can be conducted separately, corresponding to the two separate technology sets decomposition under BP emission generating technologies. Hence, the efficiency methods proposed in this paper would be an extension to the previous two orientations.

This paper could fill in the blank of China’s regional efficiency studies under BP approach with first applying the actual data of China’s 30 province-level regions from 2006 to 2010 to construct the DEA models to measure the provincial technical efficiency. In particular, given the fixed level of capital, labour and energy inputs, the efficiency of China’s provincial GDP output with consideration of two main air pollutants (\({\rm CO}_2\) and \({\rm SO}_2\)) will be addressed by employing the new BP technology.

The rest of this paper is organised as follows: Section 2 introduces the characteristics of BP pollution generating technology and how to construct the efficiency measures under BP approach. Section 3 employs the data to evaluate the regional production and environmental efficiency scores and deeply analyses the relevant reasons based on empirical results. According to empirical results presenting, some policy implementations are discussed in Sect. 4.

2 Methodology

2.1 By-production approach

We will assume that there are N inputs, M desirable outputs, and K emission types (undesirable outputs). Input vector is denoted by \(x=(x_{1},\ldots x_{N})\in \mathbb {R}_{+}^{N}\), desirable output vector is denoted by \(y=(y_{1},\ldots y_{M})\in \mathbb {R}_{+}^{M}\), and undesirable outputs vector is \(b=(b_{1},\ldots b_{K})\in \mathbb {R}_{+}^{K}\).Footnote 5

In BP approach, N inputs will be classified into non-emission causing and emission-causing inputs. The first \(N_{1}\) inputs are non-emission causing, while the last \(N_2\) inputs are emission-causing. Hence, \(N=N_{1}+N_{2}\). The input quantity vector \(x=\langle x_{1},\ldots,x_{N}\rangle \in \mathbb {R}_{+}^{N}\) can be partitioned into a vector of non-emission causing inputs, denoted by \(x^{1}=\langle x_{1},\ldots,x_{N_{1}}\rangle \in \mathbb {R}_{+}^{N_{1}}\) and a vector of emission-causing inputs, denoted by \(x^{2}=\langle x_{N_{1}+1},\ldots,x_{N}\rangle \in \mathbb {R}_{+}^{N_{2}}\). Hence, \(x=\langle x^1,x^2\rangle \in \mathbb {R}_{+}^{N}\). When producers use pollution causing inputs, the production of desirable outputs would set a nature’s residual mechanism in motion, which will lead to the generation of undesirable outputs (Murty et al. 2012). Therefore, the emission-generating technologies can be separated to two technology sets: \(T_1\) is the conventional production technology set, which reflects the transformation of all inputs into desirable outputs; and \(T_2\) denotes nature’s residual generating technology, which shows how emission-causing goods used in \(T_1\) generate emissions in nature.Footnote 6 Hence, the parametric formulation of a BP emission generating technology is given as

$$\begin{aligned} T_{\rm BP}&={} T_{1}\cap T_{2},\quad \hbox {\rm where}\nonumber \\ T_{1}&={} \{\langle x^{1},x^{2},y,b\rangle \in \mathbb {R}_{+}^{N+M+K}|f(x^{1},x^{2},y)\le 0\}, \nonumber \\ T_{2}&={} \{\langle x^{1},x^{2},y,b\rangle \in \mathbb {R}_{+}^{N+M+K}|g(x^{2}, b) \ge 0\}. \end{aligned}$$
(1)

Functions \(f:\mathbb {R}_{+}^{N+M+K}\longrightarrow \mathbb {R}\) and \(g: \mathbb {R}_{+}^{N+M+K}\longrightarrow \mathbb {R}\) are the parametric representations of sets \(T_1\) and \(T_2\), respectively. We assume that both functions are continuously differential and the properties we will impose on functions f and g below will ensure that \(T_{\rm BP}\) is non-empty.

We would like to use function f to represent the conventional neo-classical technology set \(T_{1}\). Hence, we will assume the following signs for the derivatives of function f:

$$\begin{aligned} f_{x_{n}}(x^{1},x^{2},y)\le & {} 0, \quad \forall \ {n}={1, \ldots , N_1},\nonumber \\ f_{x_{n}}(x^{1},x^{2},y)\le & {} 0, \quad \forall \ {n}={N_1+1, \ldots , N},\nonumber \\ f_{y_m}(x^{1},x^{2},y)\ge & {} 0.\quad \forall \ m={1, \ldots , M} \end{aligned}$$
(2)

The signs of these derivatives imply that all inputs satisfy standard free disposability and all desirable outputs are also freely disposable. In particular, along the frontier of technology \(T_1\), there is a positive relationship between any input and any desirable output. In addition, the technology set \(T_1\) is independent of the level of emissions, which means emissions do not affect desirable output production.Footnote 7

Set \(T_2\) in (1) reflects the physical and chemical mechanism of pollution generation in nature. In nature, the more the emission-causing goods are used the more are the emissions generated. The function g should capture this. We assume the following signs for the derivatives of function g.

$$\begin{aligned} g_{x_{n}}(x^{2},b)\le & {} 0, \quad \forall \ n={N_1+1, \ldots , N},\nonumber \\ g_{b_k}(x^{2},b)\ge & {} 0. \quad \forall \ b_k={1, \ldots , K}. \end{aligned}$$
(3)

Under these sign conventions, the production vectors \(\langle x^1, x^2, y, b\rangle \in \mathbb {R}_{+}^{N+M+K}\) that satisfy \(g(x^2, b)=0\) form the lower frontier of technology \(T_2\). For every vector of emission-causing inputs, this frontier gives the minimal levels of emissions generated in nature. This property has been called costly disposability of emissions and it captures our intuition that emissions are not freely disposable as outputs. Usage of emission-causing inputs definitely produces some minimal emissions. Employing the implicit function theorem, it can be shown that these sign conventions imply that the trade-off between any emission-causing input and any emission type along the lower frontier of technology \(T_2\) is −\(\frac{g_{b_k}}{g_{x_n}}\), which is non-negative, provided \(g_{x_n}\ne 0\) for \(n=N_1+1, \ldots , N\). Thus, this captures the positive relation between emission-causing goods such as fossil fuels and emissions such as \({\rm CO}_2\) and \({\rm SO}_2\).

In Fig. 1, we assume \(M=N=K=1\) and \(N_1=0\) (that is the single input is emission-causing). For every level of the input, parts (a) and part (b) in Fig. 1 illustrate the maximal level of the desirable output produced in \(T_{1}\) and minimal level of undesirable outputs generated in \(T_{2}\), respectively. Part (c) reflects the output possibility set under BP approach. This shows that given input level x, there is only one combination of the good and the bad output that is efficient, namely, the point A. A indicates the maximal amount of the good output and the minimal amount of the bad output that input level x can produce under \(T_1\) and \(T_2\), respectively.

Fig. 1
figure 1

By-production pollution generating technologies

2.2 Efficiency measurements under by-production approach

Murty et al. (2012) employ a non-parametric formulation of their BP technology for measuring technical efficiency. The notation that we will employ for a DEA construction of the non-parametric version of the BP technology is as follows: let the matrix of observations on non-pollution causing inputs be denoted by \(X^1_{D\times {N_1}}\) and the pollution causing inputs be denoted by \(X^2_{D\times {N_2}}\). Let the matrices of observations on desirable and undesirable outputs be denoted as before by \(Y_{D\times M}\) and \(B_{D\times K}\), respectively. Then the standard DEA non-parametric representation of BP can be specified as

$$\begin{aligned} T_{1}&= {} \bigg \{\langle x,y,b\rangle \in \mathbb {R}_{+}^{N+M+K}|\lambda [X^{1},X^{2}] \le \langle x^{1},x^{2}\rangle \wedge \lambda Y\ge y, \quad \lambda \in \mathbb {R}_{+}^{D}.\bigg \}\nonumber \\ T_{2}&= {} \bigg \{\langle x^{1},x^{2},y,b\rangle \in \mathbb {R}_{+}^{N+M+K}|\mu X^{2}\ge x^{2}\wedge \mu B\le b, \quad \lambda \in \mathbb {R}_{+}^{D}.\bigg \} \end{aligned}$$
(4)

The overall BP technology is the intersection of \(T_{1}\) and \(T_{2}\). Hence, it is derived under DEA as

$$\begin{aligned} T_{\rm BP} = {T_1}\cap {T_2}&= {} \bigg \{\langle x^{1},x^{2},y,b\rangle \in \mathbb {R}_{+}^{N+M+K}|\lambda [X^{1},X^{2}] \le \langle x^{1},x^{2}\rangle \nonumber \\&\wedge\lambda Y\ge y\wedge \mu X^{2} \ge x^{2}\wedge \mu B\le b, \langle \lambda ,\mu \rangle \in \mathbb {R}_{+}^{2D}\bigg \}. \end{aligned}$$
(5)

Here, \(\lambda \in \mathbb {R}_{+}^D\) and \(\mu \in \mathbb {R}_{+}^D\) here represent the intensity vectors, which are the weights assigned to each decision making unit (DMU) to construct the technically efficient frontiers of \(T_1\) and \(T_2\) under DEA.

Following by the concept of non-parametric technical efficiency measurement under the BP approach, in this paper, we will focus on output-based measures of efficiency and consider two types of efficiency indexes: the hyperbolic (HYP) efficiency index and the modified Färe-Grosskopf-Lovell (FGL) efficiency index.Footnote 8

Since BP approach distinguishes between desirable production technology \(T_1\) and nature’s emission-generating technology \(T_2\), a technical efficiency index defined under the BP approach can be implicitly or explicitly decomposed into two components: index of desirable output (production) efficiency and an index of undesirable output (environmental) efficiency. In the case of the HYP measure of efficiency in a BP technology, this decomposition is explicit, while in the case of the FGL measure, the decomposition is implicit.

2.2.1 HYP measurement under BP approach

The HYP measure of efficiency decomposes efficiency explicitly into desirable production efficiency, which is defined relative to set \(T_1\), and environmental efficiency, which is defined relative to \(T_2\). The former is denoted by \(D_{\rm HYP(1)}\) and the latter is denoted by \(D_{\rm HYP(2)}\). Intuitively, holding all inputs fixed, \(\frac{1}{D_{\rm HYP(1)}}\) measures the maximal factor by which the given desirable output vector can be scaled-up and yet be technologically feasible, while \(\frac{1}{D_{\rm HYP(2)}}\) captures the maximal factor by which the bad output vector can be scaled-down and yet be technologically feasible. The overall index of efficiency, denoted by \(D_{\rm HYP}\) is obtained by taking the maximum of \(D_{\rm HYP(1)}\) and \(D_{\rm HYP(2)}\). This implies that \(\frac{1}{D_{\rm HYP}}\) is the maximal extent to which the good output vector and the bad output vector can be simultaneously scaled-up and scaled-down, respectively, and yet be technologically feasible.

The mathematical programme to measure hyperbolic efficiency under the BP approach is:

$$\begin{aligned}&D_{\rm HYP}(x,y,b;T_{\rm BP}) \nonumber \\&=\underset{\beta>0}{\inf }\{\beta |\langle x,y/\beta ,b\beta \rangle \in T_{\rm BP}\} \nonumber \\&=\underset{\beta>0}{\inf }\{\beta |\langle x,y/\beta ,b\beta \rangle \in T_{1}\wedge \langle x,y/\beta ,b\beta \rangle \in T_{2}\}\nonumber \\&=\max \{\beta _{1},\beta _{2}\}. \\ D_{\rm HYP(1)}&=\underset{\beta _1>0}{\inf }\{\beta |\langle x,y/\beta _{1} ,b\rangle \in T_{1}\}\nonumber \\ D_{\rm HYP(2)}&=\underset{\beta _2 >0}{\inf }\{\beta |\langle x,y,b\beta _{2} \rangle \in T_{2}\}\nonumber \end{aligned}$$
(6)

where, the last two equalities follow from the fact that, in the BP approach, given a vector of inputs, the output possibility sets corresponding to \(T_{1}\) and \(T_{2}\) are independent. When \({D_{\rm HYP(1)}}=1\), the observed point is on the weakly efficient frontier of \(T_1\) and when \({D_{\rm HYP(2)}}=1\), the observed point is on the weakly efficient lower frontier of \(T_2\). An observation is inefficient when \(D_{\rm HYP}\) is strictly less than one. There might be an observation, for which \(D_{\rm HYP}\) equals to one, while \(D_{\rm HYP(1)}\) and \(D_{\rm HYP(2)}\) might not both equal to one. This implies that the hyperbolic measure will judge this observation as efficient, even when it is inefficient in desirable output production or undesirable output production.

Below, we present the DEA programme for measuring hyperbolic efficiency: for each DMU \(d^{\prime}\) in each different year t, HYP efficiency is measured as

$$\begin{aligned} D_{\rm HYP}\left( {x_{d^{\prime}}^t},{y_{d^{\prime}}^t},{b_{d^{\prime}}^t};{T_{\rm BP}} \right)&={} \max \left\{ {{\beta _1},{\beta _2}} \right\} , \nonumber \\ {D _{\rm HYP(1)}}&={} \mathop {\min }\limits _{\lambda ,\beta _1 } \beta _1 \nonumber \\ {\rm s.t.} \sum \limits _{t = 1}^T \sum \limits _{d = 1}^D {{\lambda _{d}^t}{y_{d,m}^t}}&\ge {} {{y_{d^{\prime},m}^t}/\beta _1} , \quad\forall m = 1,\ldots,M \nonumber \\ \sum \limits _{t = 1}^T \sum \limits _{d = 1}^D {{\lambda _{d}^t}x_{d,n}^t }&\le {} x_{d^{\prime},n}^t, \quad\forall n = 1 \ldots N \nonumber \\ \lambda _{d}^t & \ge {} 0 \quad \forall \ d=1, \ldots , D. \nonumber \\ \nonumber \\ {D_{\rm HYP(2)}}&= {} \mathop {\min }\limits _{\mu ,\beta _2 } \beta _2 \nonumber \\ {\rm s.t.} \sum \limits _{t = 1}^T {\sum \limits _{d = 1}^D {{\mu _{d}^t}{b_{d,k}^t}} } & \le {} {b_{d^{\prime},k}^t}\beta _2 ,\quad\forall k = 1,\ldots,K \nonumber \\ \sum \limits _{t = 1}^T \sum \limits _{d = 1}^D {{\mu _{d}^t}x_{d,n}^t} & \ge {} {x_{d^{\prime}, n}^t} ,\quad\forall n = {N_1} + 1,\ldots,N \nonumber \\ \mu _{d}^t & \ge {} 0\quad \forall \ d=1, \ldots , D. \end{aligned}$$
(7)

2.2.2 Modified FGL measurement under BP approach

Murty et al. (2012) consider the output-based version of the FGL approach to construct a modified FGL efficiency index with respect to the BP approach. This index is based on the coordinate-wise expansions of desirable outputs and coordinate-wise contractions of undesirable outputs.Footnote 9 The FGL index decomposes under the BP approach into production and environmental efficiency measures as follows:Footnote 10

$$\begin{aligned}&D_{\rm FGL}(x,y,b;T_{\rm BP})\nonumber \\&=:\frac{1}{2}\underset{\theta ,\gamma }{\min }\left\{\frac{\sum _{m}\theta _{m}}{M}+\frac{\sum _{k}\gamma _{k}}{K}|\langle x, y \,\oslash \,\theta ,b\otimes \gamma \rangle \in T_{\rm BP}\right\}\nonumber \\&=\frac{1}{2}\underset{\theta }{\min }\left\{\frac{\sum _{m}\theta _{m}}{M}|\langle x, y\,\oslash \,\theta ,b\rangle \in T_{1}\right\}\\&\quad+\;\frac{1}{2}\underset{\gamma }{\min }\left\{\frac{\sum _{k}\gamma _{k}}{K}|\langle x,y,b\otimes \gamma \in T_{2}\rangle \right\}\nonumber \\&=\frac{1}{2}[D_{\rm FGL(1)}(x,y,b;T_{1})+D_{\rm FGL(2)}(x,y,b;T_{2})].\nonumber \end{aligned}$$
(8)

Here, \(D_{\rm FGL(1)}\) measures the production efficiency of the DMU in desirable production, while \(D_{\rm FGL(2)}\) measures its environmental efficiency. The FGL efficiency index takes a simple average of the production efficiency and environmental efficiency to compute the overall efficiency of DMUs. The key feature of this index is that a DMU is judged as efficient if and only if it is efficient in both desirable outputs and environmental directions, i.e., if and only if \(D_{\rm FGL(1)}=D_{\rm FGL(2)}=1\). Compare this with the hyperbolic measure of efficiency, where a DMU can be judged efficient even when it is not efficient in the direction of desirable outputs or in the environmental direction.

The DEA algorithm for computing FGL index is given as follows. To compute efficiency of each DMU \(d^{\prime}\) in each year t, we solve the following optimisation problem:

$$\begin{aligned} D_{\rm FGL}(x_d^t,y_d^t,b_d^t;\ T_{\rm BP})&= {} \underset{\lambda ,\theta ,\mu ,\gamma }{\min }\frac{1}{2}\left[\frac{\sum \nolimits _{m=1}^{M}\theta _{m}}{M}+\frac{\sum \nolimits _{k=1}^{K}\gamma _{k}}{K}\right] \nonumber \\ {\rm s.t.}\sum \nolimits _{t=1}^{T}\sum \limits _{d=1}^{D}{\lambda _{d}^t}{y_{d,m}^t}& \ge {} y_{d^{\prime},m}^{t}/\theta _{m}\quad \forall m=1,\ldots M \nonumber \\ \sum \nolimits _{t=1}^{T}\sum \limits _{d=1}^{D}{\lambda _{d}^t}{x_{d,n}^t}& \le {} {x_{d^{\prime},n}^t}\ \quad \forall n=1,\ldots N \nonumber \\ \sum \limits _{t=1}^{T}\sum \limits _{d=1}^{D}{\mu _{d}^t}{b_{d,k}^t}& \le {} {b_{d^{\prime},k}^t}\gamma _{k}\quad \forall k=1,\ldots K \nonumber \\ \sum \limits _{t=1}^{T}\sum \limits _{d=1}^{D}{\mu _{d}^t}{x_{d,n}^t}& \ge {} {x_{d^{\prime},n}^t}\quad \forall n=N_{1}+1,\ldots N \nonumber \\ \lambda _{d}^t \ge 0,\ \mu _{d}^t &\ge {} 0,\quad \forall \ d=1, \ldots , D \end{aligned}$$
(9)

Since the \(T_1\) and \(T_2\) are independent from each other, \(D_{\rm FGL}\) could be calculated separately as following:

$$\begin{aligned} D_{\rm FGL(1)}&= {} \underset{\lambda , \theta }{\min }\frac{\sum \nolimits _{m=1}^{M}\theta _{m}}{M} \nonumber \\ {\rm s.t.}\sum \nolimits _{t=1}^{T}\sum \limits _{d=1}^{D}{\lambda _{d}^t}{y_{d,m}^t} & \ge {} y_{d^{\prime},m}^{t}/\theta _{m}\quad \forall m=1,\ldots M \nonumber \\ \sum \limits _{t=1}^{T}\sum \limits _{d=1}^{D}{\lambda _{d}^t}{x_{d,n}^t}& \le {} {x_{d^{\prime},n}^t}\ \quad \forall n=1,\ldots N \nonumber \\ \lambda _{d}^t & \ge {} 0,\quad \forall \ d=1, \ldots , D \end{aligned}$$
(10)
$$\begin{aligned} D_{\rm FGL(2)} &= {} \underset{\mu , \gamma }{\min }\frac{\sum \nolimits _{k=1}^{K}\gamma _{k}}{K} \nonumber \\ \sum \nolimits _{t=1}^{T}\sum \limits _{d=1}^{D}{\mu _{d}^t}{b_{d,k}^t}& \le {} {b_{d^{\prime},k}^t}\gamma _{k}\quad \forall k=1,\ldots K \nonumber \\ \sum \limits _{t=1}^{T}\sum \limits _{d=1}^{D}{\mu _{d}^t}{x_{d,n}^t} &\ge {} {x_{d^{\prime},n}^t}\quad \forall n=N_{1}+1,\ldots N \nonumber \\ \mu _{d}^t & \ge {} 0,\quad \forall \ d=1, \ldots , D \end{aligned}$$
(11)

When the equal weights are given to measurements in \(T_1\) and \(T_2\), the coordinate-wise FGL efficiency index could be calculated as \(\frac{1}{2}(D_{\rm FGL(1)}+D_{\rm FGL(2)})\).

3 Empirical analysis

In this section, the empirical analysis will be carried out by using China’s province-level data. The HYP and modified FGL indexes under BP technologies will be implemented to measure the production and environmental efficiency for different provincial regions. We will analyse our results in the context of current Chinese environmental protection regulation reviews and provide some explanations for our empirical results.

3.1 Data

In this study, we consider Chinese 30 provincial administrative divisions as DMUs from 2006 to 2010.Footnote 11 We also divide these 30 regions into four major parts: Eastcoast, Central, Northeast and West areas from the perspective of China’s economic development. The details are shown in Fig. 2.

Fig. 2
figure 2

Map of China’s four economic parts

The annual GDP for each province is considered as one desirable output (y). The data on labour and capital stock are selected for the non-polluting cause inputs (\(x^1\)). To eliminate the inflation effect, GDP data is deflated to the price of 2000 and measured in hundreds of million CNY.Footnote 12 The GDP and labour data could be obtained from the “China Statistical Yearbook”. However, the capital stock could not be gathered directly from the official released data resources. In this paper, we adopt the Perpetual Inventory Method (PIM) to estimate the annual capital stocks for each province during the 2006 to 2010.Footnote 13

The annual total amount of coal, oil and natural gas consumed by each region are selected for polluting cause inputs (\(x^2\)). The information of three energy inputs are all from the “China Energy Statistical Yearbook”. The annual net volume of sulfur dioxide (\({\rm SO}_2\)) and gross volume of carbon dioxide (\({\rm CO}_2\)) are two undesirable outputs (b). The data on \({\rm SO}_2\) could be obtained from the “China Environment Yearbook”.Footnote 14 But the data on \({\rm CO}_2\) could not be obtained directly. In this paper, we calculate the gross volume of \({\rm CO}_2\) emission from the algorithm based on the fossil fuels combustion, which is provided by IPCC (2006).Footnote 15 The descriptive statistics of all variables computed from 2006 to 2010 are shown in Table 1.

Table 1 Summary statistics of inputs and outputs variables

3.2 Efficiency results under BP approach

Through conducting the optimization problems, HYP and FGL efficiency indexes under BP approach for each province from 2006 to 2010 could be simulated. The 5-year average efficiency scores and descriptive statistics table shows in Tables 2 and 3. As we discussed in theoretical part, given fixed levels of inputs, the output-oriental HYP is defined by the desirable outputs expansions and undesirable outputs contractions by a maximum single scalar, and the FGL is defined by the coordinate-wise expansion of all desirable outputs and contractions of all undesirable outputs. Therefore, the integrated FGL results for each DMU would less or equal to HYP results, which means the HYP efficiency measurement might overestimate the technical efficiency level for each DUM. This can also be observed and proved in Fig. 3. Hence, in this paper, we recommend to focus coordinate-wise FGL index results to analyse the production and environmental efficiency levels for each region in China.Footnote 16

Fig. 3
figure 3

Five-year average integrated HYP and FGL efficiency scores under BP approach

Table 2 Five-year average calculated HYP and FGL efficiency index under BP approach
Table 3 Summary statistics of calculated HYP and FGL efficiency index under BP approach

From the four-part areas respective (Fig. 4), we can observe that the Eastcoast area gets the highest overall technical efficiency level, the Central and Northeast areas are following, and the West area is lowest. Moreover, only the West area is lower than the nation average level, the other three are all higher than the nation average efficiency level. Figure 4 also reveals the regional production and environmental efficiencies under FGL. From the production efficiency, we could see the Eastcoast area can get the highest efficiency level, the Central and Northeast areas rank the second and third high position, respectively, and they all above the average nation efficiency level during these 5 years period. Furthermore, the production efficiency level of the West area is lowest, which is the only region lower than the national level during the 5-year study period. On the other hand, the environmental efficiency measurement shows differently. In 2006, the Eastcoast was the highest one, but from 2007 to 2010, it has been exceeded by the Central area. Moreover, the Central area has been exceeded by the Northeast since 2008. Only the West area has the relatively poor performance and even below the national levels during these 5 years.

Fig. 4
figure 4

Average four-part regional FGL efficiency measurements under BP from 2006 to 2010

When we target on each province’s efficiency results of four parts in 2010 (Fig. 5), we could find that production efficiency scores for all provinces in Eastcoast are always higher than the environmental efficiency levels. For the Central area, most of provinces are also have the similar characteristics as Eastcoast, except Shanxi, which has the higher environmental efficiency than production efficiency. In Northeast, Liaoning is the only one who gets the higher production efficiency compared to environmental efficiency. But in West, only Gunaxi, Sichuan and Gansu three provinces have higher production efficiency levels than environmental efficiency levels, the other provinces all have higher scores in environmental efficiency than production efficiency. Even though the most West provinces can achieve weakly environmental efficient, the overall environmental efficiency level is still lower than the other three regions and below the national average level.

Fig. 5
figure 5

Five-year average FGL production and environmental efficiency scores of four-part regions

Therefore, it might be concluded that for most provinces in Eastcoast and Central areas, the production efficiency level is higher than the environmental efficiency level, and in Northeast and West, the most provinces would achieve more environmental efficient than production efficiency measures. Furthermore, from the view of whole country, there are significant gaps on the overall production efficiency levels across all different regions, but in environmental efficiency measurements, the gaps of efficiency levels among four regions are not such obvious as which in production efficiency measures. Finally, we also can find that the West area is always the least one no matter in production or environmental measurements.

3.3 Reasons discussion

As we discussed above, China’s regional development disparity is partially reflected in our BP-FGL results. Eastcoast area can always exhibit a strong advantage in production efficiency. The West area reveals the worst performances in both production and environmental efficiency measurement. From the comprehensive technical efficiency respective, it can be concluded that the Eastcoast is the best, following by Central and Northeast, and the West is the worst. The gaps of efficiency levels between West to other three regions are always significant.

In fact, the imbalance problem on regional development in China has been pointed out by many relevant researches. Hu and Wang (2006) firstly use total-factor energy efficiency index to measure the China’s regional efficiency of energy inputs utilization, and find there exists a variety of the technology levels among the different areas, the East is highest and West and Central are worse. Lu and Lu (2007) use a cross-efficiency measure to the overall technical efficiency in 31 provinces in China, and find that the coastal regions perform on average better than the inland regions for both economic and environmental considerations. Therefore, our empirical results about production efficiency measurement can correctly characterize this regional disparity issue in China and be in correspondence to relevant studies. Some reasons on regional gaps on production efficiency measurements could also be summarized as different regional industrial structures, unequal economic development stages, and government reform policies implemented preferential to the Eastcoast area, etc. (Fleisher and Chen 1997; Kanbur and Zhang 1999, 2005).

Due to the BP approach separates the pollution-generating technologies as two independent parts, our production and environmental efficiency measurements can be conducted to capture each DMU’s efficiency based on the two technology parts, respectively. Therefore, from the theoretical view, each province’s production technology would not direct influence its environmental efficiency level. The environmental efficiency mainly depends on the different levels of emission generating, given the fixed amounts of fossil fuels usage and pollution reductions. Furthermore, different types of energy resources utilisation and the effectiveness of abatement activities for each province or region should be reflected by the our environmental efficiency measurement results. Table 4 also verifies the the production efficiency (\(D_{\rm HYP(1)}\) or \(D_{\rm FGL(1)}\)) is not correlated to the environmental efficiency (\(D_{\rm HYP(2)}\) or \(D_{\rm FGL(2)}\)).

Table 4 Spearman’s correlation tests on 5-year average HYP and FGL efficiency index under BP appraoch

Therefore, through analysing the deep reasons for various environmental efficiency levels in different regions, this study could make recommendations on the appropriate environmental policy adjustments for Chinese government to improve the regional environmental efficiency levels without harming their production efficiencies.

In terms of fossil fuels usage, many researches indicate that coal consumption always occupies the major percentages of China’s energy usage structure. From 1980s to 2014, coal has accounted for around 70 % of the production and consumption of Chinese domestic primary energy sources.Footnote 17 The heavily relying on coal usage in China has led to the serious environmental problems. In 2012, China’s 79 % total \({\rm SO}_2\) emission stems from the direct combustion of coal.Footnote 18 Similarly, \({\rm CO}_2\) produced by the coal consumption in the long term \({\rm CO}_2\) emission generated from energy activities remains about 80 % (IEA 2013). Therefore, various qualities of coal utilisation in different areas may partially lead to the different degrees of emission and environmental efficiency levels. Wang and Li (2001) state that in China, the sulphur content of power coal varies with different regions, the coverage range is between 0.14 and 5.3 %. From the sampling and analysing, they also point out the sulphur content of coal conserving in Northeast is lowest, average to be 0.45 %. Those in Beijing, Jilin, Yunnan etc. average to be 0.5 %. However, the highest sulphur coals (average to be 2.79 %, with individual regions as high as 5.0 %) are mainly observed in Southwest area, like Sichuan Guangxi and Guizhou. The similar results are also found in other literatures (Hong et al. 1993; Xiao and Liu 2011). Due to the different types of coal exploiting and using between the East and West areas, the emission levels from fossil fuel combustion will be influenced. This characteristic can also be reflected in and consistent with our regional environmental efficiency measurements.

In reality, even though the influences on atmospheric environment from coal resource distribution and consumption structure in different areas are inevitable, the strict regulations or policies for controlling low quality coal using and setting up emission standards can also play important roles in forcing the producers to reduce the pollution-causing inputs using and make more efforts on abatement activities to improving the regional efficiency of energy utilisation and environmental quality. To illustrate the reasonability of our environmental efficiency results, we further construct the econometric model to investigate whether our FGL environmental efficiency scores (\(D_{\rm FGL(2)}\)) will be influenced by such factors as the \({\rm SO}_2\) abatement ratio, which is defined by the ratio of annual volume of industrial \({\rm SO}_2\) removed to total \({\rm SO}_2\) emission (\(X_1\)); energy intensity, defined by the annual energy consumption per GDP (\(X_2\));the annual investment in anti-pollution projects as the percentage of GDP (\(X_3\)); the ratio of annual pollution discharges levied to GDP (\(X_4\)); and the ratio of annual expenditures for indraught of technology to GDP (\(X_5\)). All the data of independent variables are all from 2006 to 2010 and collected from “China Statistic Yearbook”, “China Environment Yearbook”, and further calculated by author. The descriptive statistics of data is shown in Table 5 and the specific model form can be given following

$$\begin{aligned} {{D_{\rm FGL(2)}}_{it}} = {\beta _0} + {\beta _1}{X_{1,it}} + {\beta _2}{X_{2,it}} + {\beta _3}{X_{3,it}} + {\beta _4}{X_{4,it}}+ {\beta _5}{X_{5,it}} + {u_{it}} \end{aligned}$$
(12)

where \(\beta _0\) is constant term; \(\beta _1\), \(\beta _2\), \(\beta _3\), \(\beta _4\) and \(\beta _5\) are coefficient parameters of independent variables, respectively and u is the error term.

Table 5 Descriptive statistics of data for reasons discussion

The regression results are listed in Table 6. We begin our analysis by estimating the coefficients of such influence factors using simple OLS, fixed effect (FE) and random effect (RE) models. Since our data set is panel data, the OLS may ignore the variations between different regions and lead to estimation bias. Hence, OLS result is only taken as reference. Based on the Hausman test (Hausman 1978), we decide to refer fixed effect model results to analyse which factors would effect our environmental efficiency scores. It can be observed that the \({\rm SO}_2\) abatement ratio has a positive relationship with environmental efficiency level, which can be explained as the more efforts conducted by province will lead to a higher environmental efficiency level. Besides, the energy intensity shows a significant negative relation with environmental efficiency, which indicates that if one unit of GDP produced requires more energy consumptions, less environmental efficiency level will be reached. Therefore, readjusting the industrial structure for reducing the proportions of high energy consuming industrials would be an effective way to improve the regional environmental efficiency, especially for some undeveloped western areas. However, our regression results also shows that expenditures for indraught of technology even has a significant negative effect on environmental efficiency. This result consistent with the previous literature.Footnote 19 One possible explanation can be proposed as the variable of expenditure for technology indraught might not concretely reflect the expense on cleaning-technology or eco-friendly technology import and innovation. Furthermore, only expenditures on technology import can not capture the real details of new technology application. The internal technology development and absorptive capacity seem more important (Liao et al. 2012; Fisher-Vanden et al. 2006). When we add region dummies in Panel B and year dummies in Panel C in Table 6, the fixed effects estimation can show consistent results.

Table 6 Regression analysis of FGL environmental efficiency on selected impact factors from 2006 to 2010

Meanwhile, our regression results also reveal the China’s current environment regulations do not seem very effective to improve the regional environmental efficiency levels. The total investment in the treatment of environmental pollution and pollution discharge fees neither have significant effects on our environmental efficiency scores.

The reasons on the ineffectiveness of pollution treatment investments could be given as: first, the proportion of total investment to GDP in China is still very low (only 1.2 % of GDP in 2005), which can not play a proper role in the environmental efficiency improvement in a short term (Liu and Diamond 2005). Second, the total investment in pollution treatment shows extremely unbalanced distribution across different regions. In 2010, the total investment on East regions reached to 369.94 billion CNY, which was even greater than the aggregation of other three areas 256.87 billion.Footnote 20 The unreasonable funding allocation might lead to the insufficiency in some areas but waste in other areas. Third, due to lacking of number of professional and investing management experience, regional pollution control investment might not be planed and implemented reasonably. In addition, China even though established a pollution discharge system, in particular, the discharge fees are still significantly lower than the abatement costs (OECD 2006). According to statistics, the minimal discharge rates for \({\rm SO}_2\) and \({{\rm NO}_x}\) are 0.63 CNY/kg and 0.60 CNY/kg in 2005, respectively.Footnote 21 Besides, the charges are only incurred on excess emissions, and no charge to the enterprises whose emissions are below the waste standards. Hence, it is difficult to stimulate enterprises maximize emission reduction. Zhang et al. (2001) also indicate that due to the environmental tax has not been levied in China, the current environmental regulatory instruments will neither punish the enterprises which abided by the emission standards nor encourage them seeking for the low-cost pollution control technologies. Once environmental tax could be implemented, every technology innovation means paying less taxes, which might be a new solution to improve the environmental efficiency of energy utilization.

In a word, according to our production and environmental efficiency measurement results, the reasons discussion has been carried out thoroughly. Through constructing the regression model, we concentrate to explain which factors would influence the regional environmental efficiency and find some shortages of China’s current environment policy. Besides, some other effective measures to improve environmental efficiency could also be analysed further, such as decomposition the effect factors of elimination policy on different air pollutants from the regional or industrial aspect (Fujii et al. 2013). Furthermore, it is also required to examine the relationship between income or economic growth and environmental quality (Stern et al. 1996; Harbaugh et al. 2002; Rezek and Rogers 2008); and decompose determinants of environmental performance or consider whether environmental policy could be more stringent when the technique effect could not sufficiently reduce emissions (Panayotou 1977; Tsurumi and Managi 2010).

4 Conclusions

This study introduces the characteristics of new by-production approach technologies. Under BP approach, the decomposition of pollution generating technologies based on the multiple production relations could better capture the phenomenon of by production generating than the traditional weak-disposability approach with a single production relation. Under by-production approach, the efficiency indexes are first applied to measure the production efficiency and environmental efficiency by using the DEA algorithm in this study.

In the empirical part, this study employs the hyperbolic index and modified coordinate-wise FGL index under BP technologies to investigate the technical efficiency with emission consideration in China’s province-level regions from 2006 to 2010. By classifying four economic areas, the Eastcoast area has the most effective performance in desirable output generation, but the West performs worst in both desirable output and undesirable outputs efficiency measurements. Furthermore, the production efficiency gaps between four areas are very significant, but in environmental efficiency this characteristics is not such obvious. Through conducting reason discussions, we also find that the environmental efficiency levels are significantly affected by \({\rm SO}_2\) abatement activities and energy intensity, but the environment policy factors show less robust. Besides, this paper also testifies China’s regional technical efficiency levels are consistent with the regional disparity development pattern and ineffectiveness of current environmental regulations implementations.

According to our findings in this paper, we can provide some policy implications. First, due to such differences between regional development, Chinese central government should release more rights of decision making to sub-national local governments and support them to design more suitable policies for local own development. Since the uniform planing or standards are widely used in Chinas environmental regulation system, some drawbacks of relying on uniform standards might inevitably impact the effectiveness of environmental governance. Therefore, it requires policy makers to pay more concerns on key regions with lower efficiency levels and ensure the environmental regulations in different regions being more purposefully and pertinently. Second, there exists the negative effects between the energy intensity and environmental efficiency level. Adjusting and changing the industrial structure from the high energy consumption industries to high value-added service or technology industries will play a key role in emission reduction and improving the regional environmental efficiency. Third, the current environmental regulations as pollution charges and investment in anti-pollution projects have not achieved the prospective effects. All levels of government and environmental regulators need to constantly improve the level of management in pollution charge system, to ensure the discharge process is reasonable, and meets the real needs of local development. Meanwhile, policy makers should accelerate and promote the enforcement of the environmental taxation in China at appropriate time, which would be a good supplement to the environment pollution charges regulation. For achieving the effectiveness of environmental investment, local governments and relevant enterprises should be more focus on the purpose of the investment and the expected results, rather than blind to expand investment. Forth, local enterprises should remain cautious about introducing new technology from abroad and pay more attention on enhancing the research and development (R&D) strength based on their own situation of development to overcome the dependence on technology import.