1 Introduction

Over the past decades, China’s economy has experienced rapid growth through the reform and opening-up policy (Ding and Li 2014). According to the China National Bureau of Statistics, the gross domestic product (GDP) reached exceed 80 trillion Yuan, and contributes more than 30% of global economic growth in 2017. In addition, Chinese GDP has exceeded the USA in the light of purchase power parity, and will surpass the scale of the United States in the light of market rates of exchange by 2030. However, the rapid economic development has brought challenges of imbalance and insufficiency development problems, such as regional disparities, resource shortage, and environmental pollution (Zhang et al. 2016; Yang et al. 2017). To address these issues, economic environmental performance analysis has been widely studied by governments and academics.

Data envelopment analysis (DEA) has been widely applied for evaluating economic environmental efficiency since it was firstly proposed by Charnes et al. (1978). It is a well-known non-parametric method to evaluate the performance of a set of decision making units with multiple inputs and multiple outputs. Based on different empirical axioms and corresponding to different characteristics of the production possibility set and production frontiers, many DEA models, such as CCR model (Charnes et al. 1978), namely the BCC model (Banker et al. 1984), the FG model (Färe and Grosskopf 1985) and the ST model (Seiford and Thrall 1990), are developed and applied in various areas, such as educational institutions (Sagarra et al. 2014; Thanassoulis et al. 2017), hospitals (Chowdhury and Zelenyuk 2016; Toloo and Jalili 2016), financial industries (Aggelopoulos and Georgopoulos 2017; Zhou et al. 2018). Considering the multiple inputs and outputs of economic regions in China, the DEA approach is selected as our tool for performance evaluation in this paper.

In literature, DEA models have been widely applied to economic environmental performance evaluation problems. A direct approach for measuring environmental performance originates from the idea of incorporating undesirable outputs with productive efficiency measurement pioneered by Färe et al. (1989). Zhou et al. (2008) apply a DEA method to analyze the Chinese industrial eco-efficiency under the assumption of variant returns to scale (VRS). Chu et al. (2016) focuse on the eco-efficiency analysis of Chinese provincial-level regions, regarding each region as a two-stage network structure. Fei et al. (2016) integrate the goal of maximizing the desirable outputs and that of disposing the undesirable outputs to evaluate the performance of industrial systems for Chinese administrative regions. Masuda (2016) measures the eco-efficiency of wheat production in Japan at a regional level by using a combined methodology of DEA and life cycle assessment. Zhu et al. (2016) propose a SBM–DEA model based on natural resource input orientation to evaluate the efficiency of natural resource utilization for 26 provincial regions in mainland China from 2005 to 2012. Sueyoshi et al. (2017) define social sustainability as the simultaneous achievement of economic prosperity and environmental protection, and evaluate the degree of social sustainability across provinces in China. Beltrán-Esteve and Picazo-Tadeo (2017) assess environmental performance in the European Union (EU) using Luenberger productivity indicators, directional distance functions and Data Envelopment Analysis techniques. Song et al. (2017) evaluate China’s provincial environmental efficiency by using Ray slack-based model. Other relative researches on DEA based economic environmental performance evaluation can be seen in Mardani et al. (2016) which provide a quite comprehensive literature review.

The above economic environmental analysis approaches mainly develop radial measures to deal with performance evaluation problems. However, there are some shortages in using radial efficiency measures. For example, it often leads to the case where a lot of DMUs have the same efficiency score of 1 and hence difficulty in ranking the environmental performance of these DMUs only based on their efficiency scores. Non-radial DEA models seem to be more efficient in measuring environmental performance, as they have a higher discriminating power in evaluating the efficiencies of DMUs. In addition, if more information, for example, the preference of decision makers, is available, radial DEA models are not easy to incorporate the information by assigning different weights to different undesirable outputs.

There are also some non-radial DEA models measuring economic efficiency have been developed in environmental performance measurement, for example, Meng et al. (2013), Sueyoshi and Wang (2014), Huang et al. (2014) and Krivonozhko et al. (2014). However, rarely study considers their applicability in comprehensively considering environmental and economic performance measurement. It is therefore worthwhile to extend the traditional DEA models into the case comprehensively considering inputs, desirable outputs and undesirable outputs. Due to the increasing depletion of non-renewable resources, the resources needed by social economic development are more and more difficult to obtain. While protecting the environment, we also pay more attention to the resources invested and try to maintain economic growth. Hence, this paper aims to introduce a non-radial and multi-objective generalized DEA model measuring pure environmental performance and productive efficiency.

In this paper, we develop a non-radial and multi-objective generalized DEA model to evaluate the economic environmental performance of 30 Chinese provincial regions in 2016. The contribution of this paper is in four aspects. First, the proposed DEA model outlines some particular DEA models including (CCR) multi-objective DEA model, (BCC) multi-objective DEA model, (FG) multi-objective DEA model, and (ST) multi-objective DEA model. Second, we analyze the relationship between DEA efficiency and Pareto Optimality. Third, we define the definition on the projection of DMUs, and obtain adjustable volumes of inputs and outputs by using the projection of DMUs. Finally, a empirical study of measuring economic-environmental performance of Chinese provincial regions, which indicates that this new model promotes the accuracy of economic and environmental efficiency evaluation.

The rest of the paper is organized as follows. The methodology is presented in Sect. 2. Section 3 analyzes the empirical study and discusses the results. Conclusions and suggestions for future research are given in Sect. 4.

2 Non-radial and Multi-objective Generalized DEA Model

2.1 Production Technology

When desirable outputs and undesirable outputs are jointly produced, we first study the concept of production technology. Assume that \( x = (x_{1} ,x_{2} , \ldots ,x_{m} ) \), \( y = (y_{1} ,y_{2} , \ldots ,y_{s} ) \) and \( z = (z_{1} ,z_{2} , \ldots ,z_{k} ) \) denote the vectors of inputs, desirable outputs and undesirable outputs respectively. namely Production possibility set is defined as T = {(x, y, z): x can produce (y, z)}

The production technology T has been well-defined conceptually, but it cannot be directly used to the environmental DEA technology. When undesirable outputs are considered, Färe et al. (2004) introduced the production possibility set exhibiting constant returns to scale by the piecewise linear combination of the observed data. Now suppose undesirable outputs can be also changed the same as input and desirable output and we extend the production possibility set. There are n DMUs and for DMUj(j = 1,2,…,n) the observed input, desirable output and undesirable output vectors are respectively xi = (x1j, x2j, …,xmj), yr = (y1j,y2j, …,ysj) and zt = (z1j,z2j, …,zkj), T can be concretely formulated as follows:

$$ \begin{array}{*{20}l} {{\text{T}} = } \hfill & {\left\{ {(x,y,z):\sum\limits_{j = 1}^{n} {\lambda_{j} } x_{ij} \le x_{i} ,\;\; i = 1, \ldots m;} \right.} \hfill \\ {} \hfill &\quad {\sum\limits_{j = 1}^{n} {\lambda_{j} y_{rj} } \ge y_{r} ,\;\; r = 1, \ldots s;} \hfill \\ {} \hfill &\quad {\left. {\sum\limits_{j = 1}^{n} {\lambda_{j} z_{tj} } \le z_{t} ,\;\; t = 1, \ldots k;\,\lambda_{j} \ge 0,\;\; j = 1, \ldots n} \right\}} \hfill \\ \end{array} $$
(1)

where \( \lambda_{\text{j}} \) are intensity variables. Multi-objective programming is composed of T:

$$ \hbox{min} \;{\text{F(X,Y,Z) }}s.t.{\text{ (X,Y,Z)}} \in {\text{T}} $$
(2)

where \( F(X,Y,Z) = (X, - Y,Z)^{T} \).

2.2 Multi-objective Performance Measure

There are many radial DEA-based models for measuring environmental performance in the process of measuring environmental efficiency. However, these models have some limitations. One of limitations is that they adjust all inputs, desirable outputs or undesirable outputs by the same proportion to the efficient targets. In addition, their discriminating power is so weak that many DMUs cannot be directly compared and ranked. Out of realistic or economic considerations, however, decision makers or government officials may prefer different efficient targets. Therefore, it is meaningful and practical to extend radial DEA model to non-radial one measuring environmental performance.

In the framework of DEA environmental performance evaluation, non-radial DEA models have been well developed in the past. Despite the abundance of non-radial DEA models, rarely of them comprehensively consider inputs, desirable outputs and undesirable outputs simultaneously. In the real production process, people always expect to put in the less and get more desirable outputs and fewer undesirable outputs. The analytic structure of environmental efficiency evaluation considering undesirable outputs model is described in Fig. 1. In this figure We can see desirable and undesirable outputs will increase with the increase of inputs during production. If we blindly focus on economic growth, then this will bring increasing pollution that can be hard for us to take. Moreover, raw materials are limited and then the increase in inputs is limited. If we are obsessed with controlling pollution by reducing undesirable outputs, then it would reduce the desirable outputs and hold back economic development. So we must seek a balance point between them to seek the efficiency optimization.

Fig. 1
figure 1

The structural relationship of inputs, desirable outputs and undesirable outputs

For a given \( DMU_{{j_{0} }} \) under evaluation \( (0 \le j_{0} \le n) \), then based on the idea, we introduce a new non-radial and multi-objective DEA model for measuring environmental performance as follows:

$$ \begin{array}{*{20}l} {\hbox{min} \, a = (a_{1} ,a_{2} , \ldots a_{m} ) ;} \hfill & {{ \hbox{max} }b = (b_{1} ,b_{2} , \ldots ,b_{s} ) ; {\text{ min }}c = (c_{1} ,c_{2} , \ldots ,c_{k} )} \hfill \\ {s.t.} \hfill & {\sum\limits_{{j{ = }1}}^{n} {\lambda_{j} x_{ij} } \le a_{i} x_{{ij_{0} }} ,\quad i = 1,2, \ldots ,m} \hfill \\ {} \hfill & {\sum\limits_{j = 1}^{n} {\lambda_{j} } y_{rj} \ge b_{r} y_{{rj_{0} }} ,\quad r = 1,2, \ldots ,s} \hfill \\ {} \hfill & {\sum\limits_{j = 1}^{n} {\lambda_{j} } z_{tj} \le c_{t} z_{{tj_{0} }} ,\quad t = 1,2, \ldots ,k} \hfill \\ {} \hfill & {a_{i} \le 1,\quad b_{r} \ge 1,\quad c_{t} \le 1,\forall i,r,t} \hfill \\ {} \hfill & {\lambda_{j} \ge 0,\quad j = 1,2, \ldots n} \hfill \\ \end{array} $$
(3)

where ai (i = 1, 2,…,m) indicates the efficiency value of ith input; br(r = 1, 2,…,s) indicates the efficiency value of rth desirable output; ct(t = 1, 2,…,k) indicates the efficiency value of tth undesirable output.

There is very obvious economic significance of model (3). If the optimum value \( a_{i}^{*} < 1 \), \( b_{r}^{*} > 1 \) and \( c_{t}^{*} < 1 \), there exists a decision making unit by which we can get more desirable outputs and fewer undesirable outputs, but its inputs are not greater than decision making unit evaluated. Thus the decision making units \( DMU_{{j_{0} }} \) evaluated is not DEA efficient. Therefore, only if ai = 1 (i = 1, 2,…,m), br = 1 (r = 1, 2,…,s) and ct = 1 (t = 1, 2,…,k), \( {\text{DMU}}_{{j_{0} }} \) is DEA efficient.

However, the multi-objective programming (3) is not solved, and its DEA efficiency is difficult for us to discuss. In addition, the duality of model (3) is hard to study. To this end, we extend the generalized DEA (GDEA) model given in Zhu (2014) into a Non-radial generalized tri-DEA model measuring economic-environmental performance.

Let \( w_{i}^{1} \left( {\text{i} = 1,2, \ldots ,\text{m}} \right) \) indicates the degree of ith production input and \( w_{r}^{2} \left( {\text{r} = 1,2, \ldots ,\text{s}} \right) \) indicates the importance of rth desirable output; \( w_{t}^{3} \left( {\text{t} = 1,2, \ldots ,\text{k}} \right) \) is normalized user-specified weights for adjusting the tth pollutant which reflects the desirability degree of decision makers in adjusting the current level of this pollutant. The model is proposed as follows.

$$ \begin{array}{*{20}l} {\hbox{min} {\text{ h}}_{0} = } \hfill & {\sum\limits_{i = 1}^{m} {w_{i}^{1} a_{i} } - \sum\limits_{r = 1}^{s} {w_{r}^{2} b_{r} } + \sum\limits_{t = 1}^{k} {w_{t}^{3} c_{t} } \, } \hfill \\ {s.t.} \hfill & {\sum\limits_{{j{ = }1}}^{n} {\lambda_{j} x_{ij} } \le a_{i} x_{{ij_{0} }} ,\quad i = 1,2, \ldots ,m} \hfill \\ {} \hfill & {\sum\limits_{j = 1}^{n} {\lambda_{j} } y_{rj} \ge b_{r} y_{{rj_{0} }} ,\quad r = 1,2, \ldots ,s} \hfill \\ {} \hfill & {\sum\limits_{j = 1}^{n} {\lambda_{j} } z_{tj} \le c_{t} z_{{tj_{0} }} ,\quad t = 1,2, \ldots ,k} \hfill \\ {} \hfill & {\delta_{1} \left( {\sum\limits_{j = 1}^{n} {\lambda_{j} } + \delta_{2} ( - 1)^{{\delta_{3} }} \lambda_{n + 1} } \right) = \delta_{1} } \hfill \\ {} \hfill & {a_{i} \le 1,b_{r} \ge 1,c_{t} \le 1,\forall i,r,t} \hfill \\ {} \hfill & {\lambda_{j} \ge 0,j = 1,2, \ldots n,\lambda_{n + 1} \ge 0} \hfill \\ \end{array} $$
(4)

where \( h_{0} \) indicates efficiency value of \( DMU_{{j_{0} }} \) comprehensively considering inputs, desirable outputs and undesirable outputs.

In addition, \( \sum\nolimits_{i = 1}^{m} {w_{i}^{1} } + \sum\nolimits_{r = 1}^{s} {w_{r}^{2} } + \sum\nolimits_{t = 1}^{k} {w_{t}^{3} } = 1 \), \( 0 \le w_{i}^{1} \le 1 \), \( 0 \le w_{r}^{2} \le 1 \) and \( 0 \le w_{t}^{3} \le 1 \). The greater \( w_{i}^{1} \left( {\text{i} = 1,2, \ldots ,\text{m}} \right) \), the more decision makers’ input; The greater \( w_{r}^{2} \left( {\text{r} = 1,2, \ldots ,\text{s}} \right) \) is, the more the desirable output decision makers want to get; The greater \( w_{t}^{3} \left( {\text{t} = 1,2, \ldots ,\text{k}} \right) \) is, the more to priority reduce the emissions of pollutants decision makers take. Moreover, not all the variables in real-life word are discretionary (or controllable), while the discretion of variables should be controlled by set the value of weights (Tsai et al. 2011). For example, we can set \( w_{1}^{1} { = }0 \) if the first input variables is non-discretionary (or uncontrollable). The decision makers try various devices to reduce the inputs and undesirable outputs and to increase desirable outputs as much as possible. If the optimum value \( a_{i}^{*} = 1 \, \left( {\text{i} = 1,2, \ldots ,\text{m}} \right) \), \( b_{r}^{*} = 1 \, \left( {\text{r} = 1,2, \ldots ,\text{s}} \right) \) and \( c_{t}^{*} = 1 \, \left( {\text{t} = 1,2, \ldots ,\text{k}} \right) \), \( {\text{DMU}}_{{j_{0} }} \) is DEA efficient. Otherwise, \( DMU_{{j_{0} }} \) is not DEA efficient.

For convenience of study DEA efficiency, Pareto efficient solution and the projection of decision making units, slack variables \( S_{i}^{ - } \left( {i = 1,2, \ldots ,m} \right) \), \( S_{r}^{ + } \left( {r = 1,2, \ldots ,s} \right) \) and \( S_{t}^{ - } \left( {t = 1,2, \ldots ,k} \right) \) are introduced into model (4), The model (5) is formulated as follows:

$$ \begin{array}{*{20}l} {\hbox{min} {\text{ h}}_{0} = } \hfill & {\sum\limits_{i = 1}^{m} {w_{i}^{1} a_{i} } - \sum\limits_{r = 1}^{s} {w_{r}^{2} b_{r} } + \sum\limits_{t = 1}^{k} {w_{t}^{3} c_{t} } } \hfill \\ {s.t.} \hfill & {\sum\limits_{{j{ = }1}}^{n} {\lambda_{j} x_{ij} } {\text{ + S}}_{i}^{ - } = a_{i} x_{{ij_{0} }} ,\quad i = 1,2, \ldots ,m} \hfill \\ {} \hfill & {\sum\limits_{j = 1}^{n} {\lambda_{j} } y_{rj} - {\text{ S}}_{r}^{ + } = b_{r} y_{{rj_{0} }} ,\quad r = 1,2, \ldots ,s} \hfill \\ {} \hfill & {\sum\limits_{j = 1}^{n} {\lambda_{j} } z_{tj} + S_{t}^{ - } = c_{t} z_{{tj_{0} }} ,\quad t = 1,2, \ldots ,k} \hfill \\ {} \hfill & {\delta_{1} \left( {\sum\limits_{j = 1}^{n} {\lambda_{j} } + \delta_{2} ( - 1)^{{\delta_{3} }} \lambda_{n + 1} } \right) = \delta_{1} } \hfill \\ {} \hfill & {a_{i} \le 1,b_{r} \ge 1,c_{t} \le 1,\forall i,r,t} \hfill \\ {} \hfill & {\lambda_{j} \ge 0,j = 1,2, \ldots n,\lambda_{n + 1} \ge 0.} \hfill \\ \end{array} $$
(5)

The duality of model (5):

$$ \begin{array}{*{20}l} {\hbox{max} } \hfill & { - \delta_{1} \mu_{0} - \beta_{1} + \beta_{2} - \beta_{3} } \hfill \\ {s.t.} \hfill & {\omega^{T} X_{j} - \mu^{T} Y_{j} + \gamma^{T} Z_{j} + \delta_{1} \mu_{0} e^{T} \ge 0, \quad j = 1,2, \ldots n} \hfill \\ {} \hfill & {\omega^{T} X_{0} + \beta_{1} = e_{m}^{T} K^{1} } \hfill \\ {} \hfill & {\mu^{T} Y_{0} - \beta_{2} = e_{s}^{T} K^{2} } \hfill \\ {} \hfill & {\gamma^{T} Z_{0} + \beta_{3} = e_{k}^{T} K^{3} } \hfill \\ {} \hfill & {\delta_{1} \delta_{2} ( - 1)^{{\delta_{3} }} \mu_{0} \ge 0} \hfill \\ {} \hfill & {\beta_{1} \ge 0,\beta_{2} \ge 0,\beta_{3} \ge 0} \hfill \\ {} \hfill & {\omega^{T} \ge 0,\mu^{T} \ge 0,\gamma^{T} \ge 0} \hfill \\ \end{array} $$
(6)

where Xj = (x1j, x2j, …, xmj)T is the input vector for DMUj, j = 1, …, n, Yj = (y1j, y2j, …, ysj)T is the desirable output vector for DMUj, j = 1, …, n, Zj = (z1j, z2j, …, zkj)T is the undesirable output vector for DMUj, j = 1, …, n; \( \text{K}^{1} = (w_{1}^{1} ,w_{2}^{1} , \ldots ,w_{m}^{1} ) \), \( \text{K}^{2} = (w_{1}^{2} ,w_{2}^{2} , \ldots ,w_{s}^{2} ) \), \( \text{K}^{3} = (w_{1}^{3} ,w_{2}^{3} , \ldots ,w_{k}^{3} ) \); e = (1, 1, …, 1)T ∈ En; β1, β2 and β3 are slack variables. \( \delta_{1} \), \( \delta_{2} \) and \( \delta_{3} \) are 0–1 binary parameters. Using Wei et al. (2008) technique, different values of parameters \( \delta_{1} \), \( \delta_{2} \) and \( \delta_{3} \) lead to the different generalized tri-DEA models measuring economic-environmental performance (where‘*’indicates either 0 or 1):

Case 1 When \( (\delta_{1} ,\delta_{2} ,\delta_{3} ) = (0,*,*) \), the generalized tri-DEA model is reduced to the (CCR) generalized tri-DEA model:

$$ \begin{array}{*{20}l} {\hbox{min} {\text{ h}}_{0} = } \hfill & {\sum\limits_{i = 1}^{m} {w_{i}^{1} a_{i} } - \sum\limits_{r = 1}^{s} {w_{r}^{2} b_{r} } + \sum\limits_{t = 1}^{k} {w_{t}^{3} c_{t} } } \hfill \\ {s.t.} \hfill & {\sum\limits_{{j{ = }1}}^{n} {\lambda_{j} x_{ij} } \le a_{i} x_{{ij_{0} }} ,\quad i = 1,2, \ldots ,m} \hfill \\ {} \hfill & {\sum\limits_{j = 1}^{n} {\lambda_{j} } y_{rj} \ge b_{r} y_{{rj_{0} }} ,\quad r = 1,2, \ldots ,s} \hfill \\ {} \hfill & {\sum\limits_{j = 1}^{n} {\lambda_{j} } z_{tj} \le c_{t} z_{{tj_{0} }} ,\quad t = 1,2, \ldots ,k} \hfill \\ {} \hfill & {a_{i} \le 1,b_{r} \ge 1,c_{t} \le 1,\quad \forall i,r,t} \hfill \\ {} \hfill & {\lambda_{j} \ge 0,\quad j = 1,2, \ldots n,} \hfill \\ \end{array} $$
(7)

Case 2 When \( (\delta_{1} ,\delta_{2} ,\delta_{3} ) = (1,0,*) \), the generalized tri-DEA model is reduced to the (BCC) generalized tri-DEA model:

$$ \begin{array}{*{20}l} {\hbox{min} {\text{ h}}_{0} = } \hfill & {\sum\limits_{i = 1}^{m} {w_{i}^{1} a_{i} } - \sum\limits_{r = 1}^{s} {w_{r}^{2} b_{r} } + \sum\limits_{t = 1}^{k} {w_{t}^{3} c_{t} } } \hfill \\ {s.t.} \hfill & {\sum\limits_{{j{ = }1}}^{n} {\lambda_{j} x_{ij} } \le a_{i} x_{{ij_{0} }} ,\quad i = 1,2, \ldots ,m} \hfill \\ {} \hfill & {\sum\limits_{j = 1}^{n} {\lambda_{j} } y_{rj} \ge b_{r} y_{{rj_{0} }} ,\quad r = 1,2, \ldots ,s} \hfill \\ {} \hfill & {\sum\limits_{j = 1}^{n} {\lambda_{j} } z_{tj} \le c_{t} z_{{tj_{0} }} ,\quad t = 1,2, \ldots ,k} \hfill \\ {} \hfill & {a_{i} \le 1,\quad b_{r} \ge 1,\quad c_{t} \le 1,\quad \forall i,r,t} \hfill \\ {} \hfill & {\sum\limits_{j = 1}^{n} {\lambda_{j} } = 1, \, \lambda_{j} \ge 0,\quad j = 1,2, \ldots n,} \hfill \\ \end{array} $$
(8)

Case 3 When \( (\delta_{1} ,\delta_{2} ,\delta_{3} ) = (1,1,0) \), the generalized tri-DEA model is reduced to the (FG) generalized tri-DEA model:

$$ \begin{array}{*{20}l} {\hbox{min} {\text{ h}}_{0} = } \hfill & {\sum\limits_{i = 1}^{m} {w_{i}^{1} a_{i} } - \sum\limits_{r = 1}^{s} {w_{r}^{2} b_{r} } + \sum\limits_{t = 1}^{k} {w_{t}^{3} c_{t} } } \hfill \\ {s.t.} \hfill & {\sum\limits_{{j{ = }1}}^{n} {\lambda_{j} x_{ij} } \le a_{i} x_{{ij_{0} }} ,\quad i = 1,2, \ldots ,m} \hfill \\ {} \hfill & {\sum\limits_{j = 1}^{n} {\lambda_{j} } y_{rj} \ge b_{r} y_{{rj_{0} }} ,\quad r = 1,2, \ldots ,s} \hfill \\ {} \hfill & {\sum\limits_{j = 1}^{n} {\lambda_{j} } z_{tj} \le c_{t} z_{{tj_{0} }} ,\quad t = 1,2, \ldots ,k} \hfill \\ {} \hfill & {a_{i} \le 1,\quad b_{r} \ge 1,\quad c_{t} \le 1,\quad \forall i,r,t} \hfill \\ {} \hfill & {\sum\limits_{j = 1}^{n} {\lambda_{j} } \le 1, \, \lambda_{j} \ge 0,\quad j = 1,2, \ldots n,} \hfill \\ \end{array} $$
(9)

Case 4 When \( (\delta_{1} ,\delta_{2} ,\delta_{3} ) = (1,1,1) \), the generalized tri-DEA model is reduced to the (ST) generalized tri-DEA model:

$$ \begin{array}{*{20}l} {\hbox{min} {\text{ h}}_{0} = } \hfill & {\sum\limits_{i = 1}^{m} {w_{i}^{1} a_{i} } - \sum\limits_{r = 1}^{s} {w_{r}^{2} b_{r} } + \sum\limits_{t = 1}^{k} {w_{t}^{3} c_{t} } } \hfill \\ {s.t.} \hfill & {\sum\limits_{{j{ = }1}}^{n} {\lambda_{j} x_{ij} } \le a_{i} x_{{ij_{0} }} ,\quad i = 1,2, \ldots ,m} \hfill \\ {} \hfill & {\sum\limits_{j = 1}^{n} {\lambda_{j} } y_{rj} \ge b_{r} y_{{rj_{0} }} ,\quad r = 1,2, \ldots ,s} \hfill \\ {} \hfill & {\sum\limits_{j = 1}^{n} {\lambda_{j} } z_{tj} \le c_{t} z_{{tj_{0} }} ,\quad t = 1,2, \ldots ,k} \hfill \\ {} \hfill & {a_{i} \le 1,\quad b_{r} \ge 1,\quad c_{t} \le 1,\quad \forall i,r,t} \hfill \\ {} \hfill & {\sum\limits_{j = 1}^{n} {\lambda_{j} } \ge 1,\;\lambda_{j} \ge 0,\quad j = 1,2, \ldots n.} \hfill \\ \end{array} $$
(10)

According to decision maker’s preferences, they may choose the relative importance of different input, desirable output and undesirable output categories and different DMUs (see Chen 2003).

Definition 1

Let \( \omega^{0} ,\mu^{0} , \, \gamma^{0} ,\mu_{0}^{0} , \, \beta_{1}^{0} , \, \beta_{2}^{0} , \, \beta_{3}^{0} \) be the optimal solution of model (6). If \( \omega^{0} > 0,\mu^{0} > 0,\gamma^{0} > 0 \), \( \beta_{1} = \beta_{2} = \beta_{3} = 0 \) and the optimum \( - \delta_{1} \mu_{0}^{0} = e_{m}^{T} K^{1} - e_{s}^{T} K^{2} + e_{k}^{T} K^{3} \), then \( DMU_{{j_{0} }} \) is called DEA efficiency.

Definition 2

Let \( (\tilde{X},\tilde{Y},\tilde{Z}) \in T \). If there no existence of \( (X,Y,Z) \), satisfying \( F(X,Y,Z) < F(\tilde{X},\tilde{Y},\tilde{Z}) \), \( (X,Y,Z) \in T \), then \( (\tilde{X},\tilde{Y},\tilde{Z}) \) is called weak Pareto solution of multi-objective programming.

Definition 3

Let \( (\tilde{X},\tilde{Y},\tilde{Z}) \in T \). If there no existence of \( (X,Y,Z) \), satisfying \( F(X,Y,Z) \le F(\tilde{X},\tilde{Y},\tilde{Z}) \), \( (X,Y,Z) \in T \), then \( (\tilde{X},\tilde{Y},\tilde{Z}) \) is called Pareto solution of multi-objective programming.

Lemma 1

(weak dual theorem) Let\( (\lambda_{j} ,\lambda_{n + 1} ,a_{i} ,b_{r} ,c_{t} ) \)be a feasible solution of model (4), \( (\omega ,\mu ,\gamma ,\mu_{0} ,\beta_{1} ,\beta_{2} ,\beta_{3} ) \)be a feasible solution of model (6), then

$$ \sum\limits_{i = 1}^{m} {w_{i}^{1} a_{i} } - \sum\limits_{r = 1}^{s} {w_{r}^{2} b_{r} } + \sum\limits_{t = 1}^{k} {w_{t}^{3} c_{t} } \ge - \delta_{1} \mu_{0} - \beta_{1} + \beta_{2} - \beta_{3} $$

Proof

Let \( (\lambda_{j} ,\lambda_{n + 1} ,a_{i} ,b_{r} ,c_{t} ) \) and \( (\omega ,\mu ,\gamma ,\mu_{0} ,\beta_{1} ,\beta_{2} ,\beta_{3} ) \) be feasible solutions of model (4) and (6), respectively.Since

$$ \sum\limits_{j - 1}^{n} {\lambda_{j} x_{ij} } - a_{i} x_{{ij_{0} }} \le 0,i = 1,2, \ldots ,m, - \sum\limits_{j = 1}^{n} {\lambda_{j} } y_{rj} { + }b_{r} y_{{rj_{0} }} \le 0,\;{\text{r}} = 1 , 2 ,\ldots , {\text{s, }}\sum\limits_{j = 1}^{n} {\lambda_{j} } z_{tj} - c_{t} z_{{tj_{0} }} \le 0,t = 1,2, \ldots ,k, $$

and

$$ \omega^{T} \ge 0,\mu^{T} \ge 0,\gamma^{T} \ge 0, $$

then

$$ \omega^{T} \sum\limits_{i = 1}^{m} {\left( {\sum\limits_{j = 1}^{n} {\lambda_{j} x_{ij} } - a_{i} x_{{ij_{0} }} } \right)} + \mu^{T} \sum\limits_{r = 1}^{s} {\left( { - \sum\limits_{j = 1}^{n} {\lambda_{j} } y_{rj} { + }b_{r} y_{{rj_{0} }} } \right)} + \gamma^{T} \sum\limits_{t = 1}^{k} {\left( {\sum\limits_{j = 1}^{n} {\lambda_{j} } z_{tj} - c_{t} z_{{tj_{0} }} } \right)} \le 0, $$

that is

$$ \omega^{T} \sum\limits_{i = 1}^{m} {\sum\limits_{{j{ = }1}}^{n} {\lambda_{j} x_{ij} } } - \mu^{T} \sum\limits_{r = 1}^{s} {\sum\limits_{j = 1}^{n} {\lambda_{j} } y_{rj} } { + }\gamma^{T} \sum\limits_{t = 1}^{k} {\sum\limits_{j = 1}^{n} {\lambda_{j} } z_{tj} } \le \omega^{T} \sum\limits_{i = 1}^{m} {a_{i} x_{{ij_{0} }} } - \mu^{T} \sum\limits_{r = 1}^{s} {b_{r} y_{{rj_{0} }} } + \gamma^{T} \sum\limits_{t = 1}^{k} {c_{t} z_{{tj_{0} }} } . $$

Note that

$$ \omega^{T} X_{j} - \mu^{T} Y_{j} + \gamma^{T} Z_{j} + \delta_{1} \mu_{0} e^{T} \ge 0, $$

we have

$$ \omega^{T} \sum\limits_{j - 1}^{n} {\lambda_{j} X_{j} } - \mu^{T} \sum\limits_{j = 1}^{n} {\lambda_{j} } Y_{j} { + }\gamma^{T} \sum\limits_{j = 1}^{n} {\lambda_{j} } Z_{j} \ge - \delta_{1} \mu_{0} e^{T} \sum\limits_{j = 1}^{n} {\lambda_{j} } . $$

Since \( \lambda_{n + 1} \ge 0 \) and \( \delta_{1} \delta_{2} ( - 1)^{{\delta_{3} }} \mu_{0} \ge 0 \), then \( \lambda_{n + 1} \delta_{1} \delta_{2} ( - 1)^{{\delta_{3} }} \mu_{0} \ge 0 \).From the above analysis, we have

$$ \begin{aligned} \sum\limits_{i = 1}^{m} {w_{i}^{1} a_{i} } - \sum\limits_{r = 1}^{s} {w_{r}^{2} b_{r} } + \sum\limits_{t = 1}^{k} {w_{t}^{3} c_{t} } & = \left( {\sum\limits_{i = 1}^{m} {a_{i} } \omega^{T} X_{0} - \sum\limits_{r = 1}^{s} {b_{r} } \mu^{T} Y_{0} + \sum\limits_{t = 1}^{k} {c_{t} } \gamma^{T} Z_{0} } \right) - \left( {\beta_{1} \sum\limits_{i = 1}^{m} {a_{i} } - \beta_{2} \sum\limits_{r = 1}^{s} {b_{r} } + \beta_{3} \sum\limits_{t = 1}^{k} {c_{t} } } \right) \\ & \ge \omega^{T} \sum\limits_{i = 1}^{m} {\sum\limits_{{j{ = }1}}^{n} {\lambda_{j} x_{ij} } } - \mu^{T} \sum\limits_{r = 1}^{s} {\sum\limits_{j = 1}^{n} {\lambda_{j} } y_{rj} } { + }\gamma^{T} \sum\limits_{t = 1}^{k} {\sum\limits_{j = 1}^{n} {\lambda_{j} } z_{tj} } - \beta_{1} + \beta_{2} - \beta_{3} \\ & \ge \omega^{T} \sum\limits_{j - 1}^{n} {\lambda_{j} X_{j} } - \mu^{T} \sum\limits_{j = 1}^{n} {\lambda_{j} } Y_{j} { + }\gamma^{T} \sum\limits_{j = 1}^{n} {\lambda_{j} } Z_{j} - \beta_{1} + \beta_{2} - \beta_{3} \\ & \ge - \delta_{1} \mu_{0} e^{T} \sum\limits_{j = 1}^{n} {\lambda_{j} } - \beta_{1} + \beta_{2} - \beta_{3} \\ & { = } - \delta_{1} \mu_{0} (1 - \delta_{2} ( - 1)^{{\delta_{3} }} \lambda_{n + 1} ) - \beta_{1} + \beta_{2} - \beta_{3} \\ & { = } - \delta_{1} \mu_{0} - \beta_{1} + \beta_{2} - \beta_{3} . \\ \end{aligned} $$

From Lemma 1, it is not hard to have the following corollary.□

Corollary 1

If\( \omega^{0} ,\mu^{0} , \, \gamma^{0} ,\mu_{0}^{0} , \, \beta_{1}^{0} , \, \beta_{2}^{0} , \, \beta_{3}^{0} \)is a feasible solution of model (6) such that

$$ - \delta_{1} \mu_{0}^{0} - \beta_{1}^{0} + \beta_{{_{2} }}^{0} - \beta_{{_{3} }}^{0} { = }e_{m}^{T} K^{1} - e_{s}^{T} K^{2} + e_{k}^{T} K^{3} , $$

then\( \omega^{0} ,\mu^{0} , \, \gamma^{0} ,\mu_{0}^{0} , \, \beta_{1}^{0} , \, \beta_{2}^{0} , \, \beta_{3}^{0} \)is an optimal solution of model (6).

Theorem 1

If \( (X_{0} ,Y_{0} ,Z_{0} ) \) is Pareto solution of multi-objective programming, then the corresponding \( DMU_{{j_{0} }} \) is DEA efficient.

Proof

(Proof by contradiction) Assume \( DMU_{{j_{0} }} \) is not DEA efficient, by Lemma 1 and Corollary 1, then the optimum \( - \delta_{1} \mu_{0}^{0} < e_{m}^{T} K^{1} - e_{s}^{T} K^{2} + e_{k}^{T} K^{3} \), \( h_{0} > e_{m}^{T} K^{1} - e_{s}^{T} K^{2} + e_{k}^{T} K^{3} \). It suggests that decision makers can make further efforts to reduce inputs or undesirable outputs and increase desirable outputs such that the optimum declines to \( e_{m}^{T} K^{1} - e_{s}^{T} K^{2} + e_{k}^{T} K^{3} \).That is, the \( DMU_{{j_{0} }} \) relatively efficient.

Assume \( (X_{*} ,Y_{*} ,Z_{*} ) \) are input–output volumes adjusted, \( X_{*} < X_{0} ,Y_{*} > Y_{0} ,Z_{*} < Z_{0} \) and \( (X_{*} ,Y_{*} ,Z_{*} ) \) are input–output volumes of \( DMU_{{j_{0} }} \). Based on the above analysis, \( DMU_{{j_{*} }} \) known is DEA efficient and ai = 1 (i = 1, 2,…, m), br = 1 (r = 1, 2,…, s) and ct = 1 (t = 1, 2,…, k). Therefore, a conclusion can be drawn: \( (X_{*} ,Y_{*} ,Z_{*} ) \) meets the following conditions:

$$ \sum\limits_{j = 1}^{n} {\lambda_{j} X_{j} } \le X_{*} ,\sum\limits_{j = 1}^{n} {\lambda_{j} Y_{j} } \ge Y_{*} ,\sum\limits_{j = 1}^{n} {\lambda_{j} Z_{j} } \le Z_{*} ,\lambda_{j} \ge 0,j = 1,2, \ldots ,n, $$

Then \( (X_{*} ,Y_{*} ,Z_{*} ) \in T \). That is, there exists \( (X_{*} ,Y_{*} ,Z_{*} ) \) satisfying \( F(X_{*} ,Y_{*} ,Z_{*} ) < F(X_{0} ,Y_{0} ,Z_{0} ) \), \( (X_{*} ,Y_{*} ,Z_{*} ) \in T \).By Definition 2, \( (X_{0} ,Y_{0} ,Z_{0} ) \) is not weak Pareto solution of multi-objective programming, nor is it Pareto solution. Obviously, it contradicts with the known condition of \( (X_{0} ,Y_{0} ,Z_{0} ) \) being Pareto solution of multi-objective programming. Therefore, \( DMU_{{j_{0} }} \) is DEA efficient.□

Theorem 2

If \( DMU_{{j_{0} }} \) is DEA efficient, then the corresponding \( (X_{0} ,Y_{0} ,Z_{0} ) \) is Pareto solution of multi-objective programming.

Proof

(Reduction to absurdity) Assume \( (X_{0} ,Y_{0} ,Z_{0} ) \) is not Pareto solution of multi-objective programming, then there exists \( (X,Y,Z) \) satisfying \( F(X,Y,Z) \le F(X_{0} ,Y_{0} ,Z_{0} ) \), \( (X,Y,Z) \in G \). By \( (X,Y,Z) \in T \) and model (4), then \( \lambda_{j} \ge 0(j = 1,2, \ldots ,n) \), such that \( \sum\nolimits_{j - 1}^{n} {\lambda_{j} x_{ij} } \le a_{i} x_{{ij_{0} }} ,\;i = 1,2, \ldots ,m \), \( \sum\nolimits_{j = 1}^{n} {\lambda_{j} } y_{rj} \ge b_{r} y_{{rj_{0} }} ,r = 1,2, \ldots ,s,\;\sum\nolimits_{j = 1}^{n} {\lambda_{j} } z_{tj} \le c_{t} z_{{tj_{0} }} ,t = 1,2, \ldots ,k. \) By model (5), we have \( S_{i}^{ - } = a_{i} x_{{ij_{0} }} - \sum\nolimits_{j = 1}^{n} {\lambda_{j} x_{ij} } \), \( S_{r}^{ + } = - b_{r} y_{{rj_{0} }} - \sum\nolimits_{j = 1}^{n} {\lambda_{j} y_{ij} } \), \( S_{t}^{ - } = c_{t} z_{{tj_{0} }} - \sum\nolimits_{j = 1}^{n} {\lambda_{j} z_{tj} } \), Then \( \lambda_{j} \ge 0(j = 1,2, \ldots ,n) \), \( S_{i}^{ - } (i = 1,2, \ldots ,m) \), \( S_{r}^{ + } (r = 1,2, \ldots ,s) \) and \( S_{t}^{ - } (t = 1,2, \ldots ,k) \) is a feasible solution of model (5) and \( (S_{i}^{ - 0} ,S_{r}^{ + 0} ,S_{t}^{ - 0} ) \ge 0 \). Therefore, \( DMU_{{j_{0} }} \) is DEA efficient. This contradicts with the known conditions. The assumption does not hold.□

2.3 Frontier Projection

In this section, we will focus on the projection of decision making units. If \( DMU_{{j_{0} }} \) is not efficient, \( DMU_{{j_{0} }} \) may be efficient through adjusting the inputs and outputs. The adjustable inputs and outputs are called the projections of the efficient production frontier. From the perspective of multi-objective programming, efficient production frontier is surface composed of Pareto solutions. DEA efficient production frontier is defined as follows:

Definition 4

If \( \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{\omega } > 0,\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{\mu } > 0,\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{\gamma } > 0 \) and hyperplane \( L = \{ (X,Y,Z)\left| {\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{\omega }^{T} X - \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{\mu }^{T} Y + \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{\gamma }^{T} Z = 0} \right.\} \) satisfies \( T \subset \{ (X,Y,Z)\left| {\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{\omega }^{T} X - \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{\mu }^{T} Y + \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{\gamma }^{T} Z \ge 0} \right.\} \) and \( L \cap T \ne \emptyset \), then L is called the efficient surface of production possibility set T and \( L \cap T \) is called the production frontier of production possibility set T.

Definition 5

Let \( \lambda_{j}^{0} ,\lambda_{n + 1}^{0} , \, a_{i}^{0} , \, b_{r}^{0} ,c_{t}^{0} ,S_{i}^{ - 0} ,S_{r}^{ + 0} ,S_{t}^{ - 0} \) be an optimal solution of model (5) and \( \hat{x}_{{ij_{0} }} = a_{i}^{0} x_{{ij_{0} }} - S_{i}^{ - 0} ,\hat{y}_{{rj_{0} }} = b_{r}^{0} y_{{rj_{0} }} + S_{r}^{ + 0} ,\hat{z}_{{tj_{0} }} = c_{t}^{0} z_{{tj_{0} }} - S_{t}^{ - 0} \), \( (\hat{X}_{0} ,\hat{Y}_{0} ,\hat{Z}_{0} ) \) is called projection on the production frontier of production possibility set T.

Obviously, \( \hat{x}_{{ij_{0} }} = a_{i}^{0} x_{{ij_{0} }} - S_{i}^{ - 0} = \sum\limits_{j = 1}^{n} {\lambda_{j}^{0} } x_{ij} ,\hat{y}_{{rj_{0} }} = b_{r}^{0} y_{{rj_{0} }} + S_{r}^{ + 0} = \sum\limits_{j = 1}^{n} {\lambda_{j}^{0} } y_{rj} ,\hat{z}_{{tj_{0} }} = c_{t}^{0} z_{{tj_{0} }} - S_{t}^{ - 0} = \sum\limits_{j = 1}^{n} {\lambda_{j}^{0} } z_{tj} \).

If \( DMU_{{j_{0} }} \) is weak DEA efficient, then \( \hat{x}_{{ij_{0} }} = x_{{ij_{0} }} - S_{i}^{ - 0} \), \( \hat{y}_{{rj_{0} }} = y_{{rj_{0} }} + S_{r}^{ + 0} \) and \( \hat{z}_{{tj_{0} }} = z_{{tj_{0} }} - S_{t}^{ - 0} \); If \( DMU_{{j_{0} }} \) is DEA efficient, then \( \hat{x}_{{ij_{0} }} = x_{{ij_{0} }} \), \( \hat{y}_{{rj_{0} }} = y_{{rj_{0} }} \) and \( \hat{z}_{{tj_{0} }} = z_{{tj_{0} }} \). Further, we can draw the conclusion that the corresponding projection \( (\hat{X}_{0} ,\hat{Y}_{0} ,\hat{Z}_{0} ) \) of \( DMU_{{j_{0} }} \) constitutes a new decision making unit which is DEA efficient. That is, the new decision making unit \( (\hat{X}_{0} ,\hat{Y}_{0} ,\hat{Z}_{0} ) \) locates in the production frontier of convex polyhedral cone \( C(\hat{T}) \), where, \( \hat{T} = \{ (X_{1} ,Y_{1} ,Z_{1} ),(X_{2} ,Y_{2} ,Z_{2} ), \ldots ,(X_{n} ,Y_{n} ,Z_{n} )\} \) is the corresponding reference set of input and output data. Convex cone \( C(\hat{T}) = \left\{ {\sum\limits_{i = 1}^{n} {\lambda_{j} } (X_{j} ,Y_{j} ,Z_{j} )\left| {\lambda_{j} \ge 0,j = 1,2, \ldots ,n} \right.} \right\} \) generated by set \( \hat{T} \) is data envelopment of n points in reference set (Yu et al. 1996). Therefore, DEA is helpful to estimate the unknown economic production function. If \( DMU_{{j_{0} }} \) is not DEA efficient, then \( a_{i}^{0} , \, b_{r}^{0} , \, c_{t}^{0} \) are not all 1 and \( S_{i}^{ - 0} ,S_{r}^{ + 0} ,S_{t}^{ - 0} \) are not all 0.

According to the complementary slackness theorem of linear programming, the optimal solution vector of model (5) is greater than 0, \( \omega^{0} > 0,\mu^{0} > 0,\gamma^{0} > 0 \).

Theorem 3

Let \( (\hat{X}_{0} ,\hat{Y}_{0} ,\hat{Z}_{0} ) \) be projection on the production frontier of \( DMU_{{j_{0} }} \) , then \( (\hat{X}_{0} ,\hat{Y}_{0} ,\hat{Z}_{0} ) \) is DEA efficient.

Proof

By Definition 5, \( \hat{x}_{{ij_{0} }} = a_{i}^{0} x_{{ij_{0} }} - S_{i}^{ - 0} ,\; \, \hat{y}_{{rj_{0} }} = b_{r}^{0} y_{{rj_{0} }} + S_{r}^{ + 0} ,\;\hat{z}_{{tj_{0} }} = c_{t}^{0} z_{{tj_{0} }} - S_{t}^{ - 0} \). For the first condition of model (6), applying complementary slackness condition of Karush–Kuhn–Tucker theorem in mathematical programming, we have

$$ \lambda_{j}^{0} \left( {\omega^{0T} X_{j} - \mu^{0T} Y_{j} + \gamma^{0T} Z_{j} } \right) = 0,\;j = 1,2, \ldots ,n, $$

then

$$ \omega^{0T} \hat{X}_{0} - \mu^{0T} \hat{Y}_{0} + \gamma^{0T} \hat{Z}_{0} = 0. $$

For \( \forall (X,Y,Z) \in G \), we have

$$ \sum\limits_{j = 1}^{n} {X_{j} \lambda_{j} \le X} ,\;\sum\limits_{j = 1}^{n} {Y_{j} \lambda_{j} \ge Y} ,\;\sum\limits_{j = 1}^{n} {Z_{j} \lambda_{j} \le Z} ,\;\lambda_{j} \ge 0,\;j = 1,2, \ldots ,n. $$

Note that \( \omega^{0T} X_{j} - \mu^{0T} Y_{j} + \gamma^{0T} Z_{j} \ge 0 \), then

$$ \begin{aligned} \omega^{0T} X - \mu^{0T} Y + \gamma^{0T} Z & \ge \omega^{0T} \left( {\sum\limits_{j = 1}^{n} {X_{j} \lambda_{j} } } \right) - \mu^{0T} \left( {\sum\limits_{j = 1}^{n} {Y_{j} \lambda_{j} } } \right) + \gamma^{0T} \left( {\sum\limits_{j = 1}^{n} {Z_{j} \lambda_{j} } } \right) \\ & = \sum\limits_{j = 1}^{n} {\left( {\omega^{0T} X_{j} - \mu^{0T} Y_{j} + \gamma^{0T} Z_{j} } \right)\lambda_{j} } \\ & = \omega^{0T} \hat{X}_{0} - \mu^{0T} \hat{Y}_{0} + \gamma^{0T} \hat{Z}_{0} = 0. \\ \end{aligned} $$

Therefore, \( (\hat{X}_{0} ,\hat{Y}_{0} ,\hat{Z}_{0} ) \) is a Pareto solution of multi-objective programming.Assume \( (\hat{X}_{0} ,\hat{Y}_{0} ,\hat{Z}_{0} ) \) is not a Pareto solution. By Definition 3, there exists \( (X^{{\prime }} ,Y^{{\prime }} ,Z^{{\prime }} ) \in G \) and \( X^{{\prime }} \le \hat{X}_{0} ,Y^{{\prime }} \ge \hat{Y}_{0} ,Z^{{\prime }} \le \hat{Z}_{0} \). Since \( \omega^{0} > 0,\mu^{0} > 0,\gamma^{0} > 0 \), then \( \omega^{0T} X^{{\prime }} - \mu^{0T} Y^{{\prime }} + \gamma^{0T} Z^{{\prime }} \; < \omega^{0T} \hat{X}_{0} - \mu^{0T} \hat{Y}_{0} + \gamma^{0T} \hat{Z}_{0} \). It contradicts with the above results. Therefore, \( (\hat{X}_{0} ,\hat{Y}_{0} ,\hat{Z}_{0} ) \) is a Pareto solution of multi-objective programming. From Theorem 1, \( (\hat{X}_{0} ,\hat{Y}_{0} ,\hat{Z}_{0} ) \) is DEA efficient.□

From Theorem 3, \( DMU_{{j_{0} }} \) is not DEA efficient, but its projection may be DEA efficient. The projection on relatively DEA efficient surface of \( DMU_{{j_{0} }} \), in fact, points out the non-efficient reasons and provides a feasible scheme to improve the efficiency of \( DMU_{{j_{0} }} \) simultaneously. In the actual production process, therefore, people can take advantage of the projection to achieve the relative DEA efficiency. Expressions of projection can be rewritten as follows:

$$ \hat{x}_{{ij_{0} }} - x_{{ij_{0} }} = - ((1 - a_{i}^{0} )x_{ij} + S_{i}^{ - 0} ),\quad \, \hat{y}_{{rj_{0} }} - y_{{rj_{0} }} = (b_{r}^{0} - 1)y_{{rj_{0} }} + S_{r}^{ + 0} ,\quad \hat{z}_{{tj_{0} }} - z_{{tj_{0} }} = - ((1 - c_{t}^{0} )z_{tj} + S_{t}^{ - 0} ), $$

then we can get the adjustable volume of input, desirable output and undesirable output. For ith input, the amount reduced is \( ((1 - a_{i}^{0} )x_{ij} + S_{i}^{ - 0} ), \, i = 1,2, \ldots ,m \); for rth desirable output, the amount increased is \( (\alpha_{r}^{0} - 1)y_{{rj_{0} }} + S_{r}^{ + 0} , \, r = 1,2, \ldots ,s \); for tth undesirable output, the amount reduced is \( ((1 - c_{t}^{0} )z_{tj} + S_{t}^{ - 0} ), \, t = 1,2, \ldots ,k \).

3 Application

3.1 Variables and Data

In this section we conduct the application of economic-environmental performance of thirty Chinese provinces in 2016 to evaluate the non-radial and multi-objective generalized DEA approaches in assessing the impact of contextual variables on inputs, desirable outputs and undesirable outputs. A lot of researches have been done to measure economic-environmental efficiency of Chinese regions, such as Zhang and Chen (2017) and Song et al. (2018). Following their researches, population, capital stock and power consumption are viewed as three inputs to produce one desirable output gross domestic production (GDP). As byproducts, the two undesirable outputs are wastewater emissions and SO2 emissions, respectively. The corresponding input–output measures are listed in the following Table 1.

Table 1 Variables of inputs and outputs

The data on these variables were collected from the 2017 China Statistics Yearbook, the 2017 China Energy Database, and the 2017 China Environment Database. Due to data unavailability, we exclude the Taiwan, Hong Kong, Macao and Tibet provinces from the economic-environment performance analysis. The raw data is shown in Table 2.

Table 2 Raw data of inputs and outputs

3.2 Performance Evaluation

The non-radial and multi-objective generalized DEA model (7) (based on CRS assumption) is applied to calculate the economic-environmental efficiency score of thirty Chinese provinces in 2016. According to the model (7), the weight \( w_{i}^{1} \), \( w_{r}^{2} \) and \( w_{t}^{3} \) affect only the value of the objective function, having no influence on the constraint conditions. Following the research of Zhang and Choi (2013), we set the weight of input, desirable output and undesirable output as (1/3, 1/3, 1/3). Further, we set the weight of all the variable population (X1), capital stock (X2), power consumption (X3), GDP (Y), wastewater emissions \( ({\text{Y}}_{1}^{\text{b}} ) \) and SO2 emission \( ({\text{Y}}_{2}^{\text{b}} ) \) as (1/9, 1/9, 1/9, 1/3, 1/6, 1/6). Then, our DEA approach should be similar with the non-radial DDF proposed in Zhang and Choi (2013). Table 3 shows the obtained efficiency scores of the thirty provinces in 2016.

Table 3 Efficiency scores of 30 Chinese provinces in 2016 based on all variables

It can be seen from Table 3 that the non-radial and multi-objective generalized DEA model has a higher discriminating power. There are significant difference between the efficiency scores for different kind of variables, in which the efficiency scores of desirable outputs should be no smaller than 1 while that of inputs and undesirable outputs should be no larger than 1. In 30 regions, only Beijing, Tianjin and Shanghai are DEA efficient and the rest are not DEA efficient. In 2016, 24 regions have the same GDP performance index of ‘‘1’’, and only Shanxi, Yunnan, Gansu, Qinghai, Ningxia and Xinjiang need to further improve GDP. For the input performance indexes, however, only 3 regions (Beijing, Tianjin and Shanghai) have always been equal to “1”, and the rest of the regions need to pay attention to the distribution of inputs, especially, the conservation of power consumption. From Table 3 we can also find that the majority of regions have smaller environmental performance indexes, only 10 regions (Beijing, Tianjin, Shanxi, Inner Mongolia, Shanghai, Yunnan, Gansu, Qinghai, Ningxia and Xinjiang) have wastewater emissions performance indexes of ‘‘1’’ and 3 regions (Beijing, Tianjin and Shanghai) have So2 emissions performance indexes of ‘‘1’’.

However, the inputs in regional economic activities are different from that of microeconomic production, in which the inputs are not discretionary or need not to decrease. In this application, some of the input variables (population and capital stock) are not discretionary. It is unreasonable to reduce these inputs for attaining higher environmental efficiency. Then, we can set weight of all the variable population (X1), capital stock (X2), power consumption (X3), GDP (Y), wastewater emissions \( ({\text{Y}}_{1}^{\text{b}} ) \) and SO2 emission \( ({\text{Y}}_{2}^{\text{b}} ) \) as (0, 0, 1/3, 1/3, 1/6, 1/6) for showing the different discretionary on variables. Then, we can calculate the efficiency score as shown in Table 4.

Table 4 Efficiency scores of 30 Chinese provinces in 2016 based on discretionary variables

Table 4 shows the efficiency result with considering variables’ discretionary in which power consumption (X3) is the only disposable input. Then, we can obtain more reasonable results from Table 4. By comparison, we can find that the efficiency scores of inputs and outputs are different from that shown in Table 3 for some DMUs. Some of the DMUs need to decrease the power consumption and pollution emission while keep the present level of GDP, such as Jiangsu, Fujian and Chongqing. Some DMUs such as Sichuan and Jiangxi need to use the given level of input to produce more desirable outputs and emit less pollution. While the other DMUs need to do some improvements from all the sides of input, desirable output and undesirable output.

3.3 Frontier Projection

On the basis of the expressions: \( \hat{x}_{{ij_{0} }} - x_{{ij_{0} }} = - ((1 - a_{i}^{0} )x_{ij} + S_{i}^{ - 0} ),\quad \, \hat{y}_{{rj_{0} }} - y_{{rj_{0} }} = (b_{r}^{0} - 1)y_{{rj_{0} }} + S_{r}^{ + 0} ,\quad \, \hat{z}_{{tj_{0} }} - z_{{tj_{0} }} = - ((1 - c_{t}^{0} )z_{tj} + S_{t}^{ - 0} ) \), we calculate the adjustable volumes of inputs and outputs. We should point out that the adjustable volumes of the input population and total capital are not listed in our calculation because they can not be adjusted through a short term of operation. The results are shown in Table 5.

Table 5 Adjustable volumes of discretionary inputs and outputs

From Table 5, we can find that only three regions, namely Beijing, Tianjin and Shanghai, should keep the present level of production while all the poor performance DMUs should find out the adjustable volumes for power consumption, GDP, waste water and SO2 emission to become efficient. We can see that the GDP of Shanxi, Yunnan Gansu Qinghai Ningxia and Xinjiang should be increased. Adjustable volumes of inputs are relatively large and the deductible volumes of power consumption are more noteworthy. The projection is consistent with the corresponding results in Table 3. For the environmental performance indexes, most countries need to reduce the corresponding volumes of waste water emissions and SO2 emissions exclusive of Beijing, Tianjin and Shanghai. Through the adjustments, we can improve the level of economic-environmental efficiency and achieve DEA efficiency.

Based on the efficiency results, we analysis the type of return to scale for all the DMUs and the benchmarks for all the inefficient DMUs. The results of return to scale measured by using the new proposed models are listed in Table 6. Only three efficient DMUs, namely Beijing, Tianjin and Shanghai are Constant Return to Scale (CRS). Nine provinces are Decrease Return to Scale (DRS), most of which locate at the west area of China such as Gansu, Qinghai, Ningxia and Xingjiang or at the northeast of China such as Heilongjiang, Jining and Liaoning. While all the rest inefficient DMUs are Increase Return to Scale (IRS). Moreover, we can find that the inefficient DMUs select different efficient DMUs as benchmark. Most of them set Beijing (17 DMUs) or Beijing and Tianjin (7 DMUs) as benchmarks, they need to study their mode on economic development. While only one DMU (Liaoning) use Shanghai as benchmark, which means that Shanghai’s mode might not be suitable for most of the regions.

Table 6 Return to scale and benchmarks

4 Conclusions

During the past three decades, China’s economy has achieved significant development. However, the rapid economic growth is causing China to pay heavily due to the increasing environmental pollution. Therefore, it is worth to achieve more accurate results of economic environmental efficiency evaluation to provide some policy suggestions. During the past years, non-radial DEA models integrating with undesirable outputs were frequently used in environmental performance measurements because they have a higher discriminating power in environmental performance comparisons. In this paper, we extend previous studies and present a non-radial and multi-objective generalized DEA approach to measuring environmental performance, which synthetically takes account of decreasing inputs, increasing desirable outputs and decreasing undesirable outputs simultaneously in DEA models and consists of a non-radial DEA-based models for multilateral environmental performance comparisons. It thus allows the decision maker to have the kinds of preferable projections and allows that the projection would not be in Pareto-inefficient portions of the production frontier which may occur in the radial projection.

The proposed non-radial and multi-objective generalized DEA approach has been applied to for modeling economic-environmental performance of 30 Chinese regions in 2016. It is found that multi-objective generalized DEA model has a higher discriminating power. Only 3 regions are DEA efficient while the rest are not DEA efficient in which some of them are Decrease Return to Scale. This suggests that we need to pay more attention on the allocation and rational use of resources together with the control of environment pollution, and it’s more important to increase the quality of economic development but not the scale. Moreover, most of the inefficient DMUs select Beijing and Tianjin as benchmarks to improve their performance that means it’s more valuable and feasible to extend “Beijing mode” and “Tianjin mode” to poor performance regions in China rather than “Shanghai mode”.

It should be point out that there exists some shortages of the multiple-objective although it is a feasible approach to calculate the environmental efficiency. One shortage is that the DMs are not able to assign any objective weights sometimes due to the limited knowledge and perception capability of human beings. Another shortage is that it may lead to nonlinear problem in multiple-objective programming which is not easy to obtain the global optimal solution. Therefore, we will aim to develop our approach with uncertain objective weights and study the solving algorithms for nonlinear programs in the next step.

Future study should also be considered in the following two ways. Firstly, generalize the non-radial and multi-objective DEA model for calculating the efficiency on resource allocation and pollution production Simultaneously, in which we should confront that how to allocate production resources and how to handle the loss caused by undesirable outputs. Besides, the approach proposed in this paper should also be extended into Malmquist index for calculating dynamic efficiency scores among difference years.