Introduction

In the past three decades, China’s economy has grown aggressively. The rapid economic development causes substantially increase of its energy consumption. Though nowadays China’s economy steps into a “new normal,” which means it is growing in a relatively balanced manner, the energy consumption still maintains a high level. According to the latest Key World Energy Statistics (IEA 2017), China’s total final energy consumption reached 1914 Mtoe in 2015, accounting for 20.4% of the world’s total final energy consumption (see Fig. 1).

Fig. 1
figure 1

Shares of world total final energy consumption by region, 2015

Energy efficiency is regarded as the world’s “first energy” in emission reduction and energy security improvement (IEA 2014). In order to combat the energy crisis and air pollution caused by excess energy consumption, China’s government makes great efforts in improving its energy efficiency. With large regional disparities, it is necessary and meaningful to measure its energy efficiency from regional perspectives in China. In previous studies, Hong Kong, Macao, and Taiwan are usually excluded due to data availability. However, along with the “Belt and Road” strategy initiated by the Chinese government in 2014, Hong Kong, Macao, and Taiwan are having more economic connections with Mainland China, and data is getting complete gradually. Therefore, it is important and useful to include Hong Kong, Macao, and Taiwan when assessing China’s regional energy efficiency.

The measurement of energy efficiency is often in the form of an aggregated energy efficiency index. Among the existing approaches to develop energy efficiency index, data envelopment analysis (DEA) models have been widely used as shown in the review by Zhou et al. (2008) and Meng et al. (2016).Footnote 1 DEA developed by Charnes et al. (1978) is a well-established methodology for evaluating the relative efficiency of a set of comparable entities often called decision-making units (DMUs) with multiple inputs/outputs (Meng et al. 2013). Energy efficiency values can be obtained by solving above DEA models.

Based on the energy efficiency values and ranking information of DMUs obtained by DEA models, a lot of policy recommendations in energy efficiency improvement are provided. Considering the efficiency values of a certain DMU in different DMU sets are not comparable, which results from the relativity of DEA efficiency values, rankings of DMUs will provide more information sometimes. In fact, the most widely use of the ranks is to find the benchmarks. In our study, benchmark provinces and other provinces have different energy development paths, and affect the improvement of energy efficiency across the country. It is noticed that no matter what type of DEA model is adopted, there is a possible change of regional rankings in Mainland China after adding Hong Kong, Macao, and Taiwan. This phenomenon is called “rank reversal,” which will be discussed in this paper due to its effect on the stability and rationality of policy recommendation.

This paper focuses on rank reversal phenomenon in China’s regional energy efficiency measurement, and contributes to the literature in the following aspects: (1) explore China’s energy efficiency from the regional perspective, including Hong Kong, Macao, and Taiwan; (2) illustrate the rank reversal phenomenon of DEA models in theory; (3) investigate the degree of rank reversal in five different and widely used DEA models for energy efficiency assessment in China; and (4) analyze the impact of rank reversal in policy making.

The remaining parts are organized as follows: “Literature review” section reviews previous studies. “Models and rank reversal issue” section introduces five different DEA models for energy efficiency assessment and proposes the rank reversal index. “China’s regional energy efficiency analysis” section applies the proposed approaches to model the regional energy efficiency in China using different datasets from 2000 to 2014. In “Energy efficiency rank reversal” section, we show the impacts of rank reversal on regional energy efficiency assessment and study the degree of rank reversal in different DEA models. The final section concludes this paper.

Literature review

Among the existing approaches to develop energy efficiency index, DEA has been widely used due to its practicability (Huang et al. 1995; Zhou et al. 2008). Since the early 1980s, it has been widely investigated and gain popularity in many application areas, especially the energy efficiency measures. Following the literature survey by Zhou et al. (2008) on issues relating to DEA for energy and environment analysis, a systematic review of studies on China’s regional energy and carbon emission efficiency (EE&CE) assessments using different DEA models is reported in Meng et al. (2016). It is found that five DEA-type models have been widely used in China’s regional EE&EC assessment, i.e., radial model, modified radial model (M-Radial), Tone’s slack-based model (SBMT), range adjusted model (RAM), and directional distance function model (DDF).

Among these five DEA models, radial model is probably the most widely used model for measuring energy efficiency. It adjusts inputs and outputs proportionally. The most famous Radial models are the CCR model (Charnes et al. 1978) and BCC model (Banker et al. 1984). Following their seminal work, many studies apply the radial model to estimate China’s regional energy efficiency, for example, Wei et al. (2007), Shi et al. (2010), Xue et al. (2014), and Al-Refaie et al. (2016). While M-Radial model attempts to measure energy efficiency by constructing an index which utilizes radial indicators and slacks together. Hu and Wang (2006) firstly develop an index called total factor energy efficiency by using M-Radial model. It has been widely adopted, such as Wang et al. (2012), Zhou et al. (2014), Pan et al. (2015), and Fernández et al. (2018).

SBMT (Tone 2001) and RAM (Cooper et al. 1999) models both belong to slack-based measure which constructs energy efficiency index with slacks of all the inputs and outputs. DDF model (Chung et al. 1997) allows adjusting the inputs and desirable/undesirable outputs at different rates on the basis of different direction vectors for input-output variables. SBMT, RAM, and DDF models, known collectively as non-radial models, have become increasingly popular in recent years due to their relatively strong discriminating power and capability to expand desirable outputs and reduce undesirable outputs simultaneously. For instance, Choi et al. (2012), Wang et al. (2013), Hu et al. (2017), Zaim and Gazel (2018), Khoshroo et al. (2018), Iftikhar et al. (2018), Zhang and Chen (2018), and so on.

Based on the energy efficiency values and rankings of different regions obtained from various DEA models, a lot of policy recommendations for China’s energy efficiency improvement are provided. If we focus on the ranking, it is noticed that no matter what type of DEA model is adopted, there is a possible change of regional rankings in Mainland China after adding Hong Kong, Macao, and Taiwan. This phenomenon is called rank reversal, which is firstly observed in multiple criteria decision-making (MCDM) approach by Belton and Gear (1983). It is found that rank reversal can also occur in DEA ranking models. Green et al. (1996) firstly notice that the relative rankings of two DMUs can be reversed when a DMU is added in the DEA cross-efficiency evaluation method. Recently, Wang and Luo (2009) and Soltanifar and Shahghobadi (2014) also study the rank reversal in DEA ranking approaches. Moreover, different DEA models may have the varying degree of rank reversal. This is an area deserving further investigation.

Models and rank reversal issue

DEA models for energy efficiency assessment

Following the neoclassical production framework, we consider a production process which use non-energy inputs (x) and energy inputs (e) to product desirable outputs (y) and undesirable outputs (b) jointly. The reference technology T can be described as follows:

$$ T=\left\{\left(x,e,y,b\right):\left(x,e\right)\ \mathrm{can}\ \mathrm{produce}\ \left(y,b\right)\right\}, $$

where T is a closed and bounded set. In the traditional framework of reference technology, the inputs and desirable outputs are all strongly disposable, i.e., if (x, e, y) ∈ T and (x', e') ≥ (x, e) (or y' ≤ y), then (x', e', y) ∈ T (or (x, e, y') ∈ T).

As for the disposability of undesirable output, one way is to treat undesirable output as input and utilize the traditional model (Seiford & Zhu, 2002). In this case, suppose that there are K DMUs (refer to regions in this study), we can describe the reference technology under strong disposability on undesirable output (T-SD) as:

$$ T- SD=\left\{\left(x,e,y,b\right):\sum \limits_{k=1}^K{\lambda}_k{x}_{nk}\le {x}_n;\sum \limits_{k=1}^K{\lambda}_k{e}_{mk}\le {e}_m;\sum \limits_{k=1}^K{\lambda}_k{y}_{ik}\ge {y}_i;\sum \limits_{k=1}^K{\lambda}_k{b}_{jk}\le {b}_j;{\lambda}_k\ge 0\Big(k=1,\cdots, K\left)\right\}.\right\} $$
(1)

Another way to involve undesirable output into the reference technology is to satisfy the following assumptions suggested by Färe et al. (1989):

(i) Weak disposability on undesirable outputs, i.e., if (x, e, y, b) ∈ T and 0 ≤ θ ≤ 1, then (x, e, θy, θb) ∈ T.

(ii) Desirable and undesirable outputs are null-joint, i.e., if (x, e, y, b) ∈ T and b = 0, then y = 0.

The first property means that the proportional reduction in desirable and undesirable outputs is possible, and the second implies that the only way to remove all the undesirable outputs is to cease the production process. According to above assumptions, the reference technology under weak disposability on undesirable output T-WD can be characterized as:

$$ T- WD=\left\{\left(x,e,y,b\right):\sum \limits_{k=1}^K{\lambda}_k{x}_{nk}\le {x}_n;\sum \limits_{k=1}^K{\lambda}_k{e}_{mk}\le {e}_m;\sum \limits_{k=1}^K{\lambda}_k{y}_{ik}\ge {y}_i;\sum \limits_{k=1}^K{\lambda}_k{b}_{jk}={b}_j;{\lambda}_k\ge 0\Big(k=1,\cdots, K\left)\right\}.\right\} $$
(2)

Based on the production framework under strong and weak disposability, previous studies have developed many alternative nonparametric models for energy efficiency measurement. Table 1 summarizes the most popular five DEA models applied in this field, namely the Radial, M-Radial model, SBMT model, DDF model, and RAM.

Table 1 DEA models for energy efficiency measurement

Rank reversal in DEA models

In this section, we intend to discuss the occurrence of rank reversal without change in the production possibility set. Figure 2 illustrates how this phenomenon occurs in theory, considering a production progress that transforms only one input X to one output Y. According to the production theory, efficiency can be defined by the distance between each DMU and its production frontier. The DMUs that lie on the production frontier are called efficient DMUs with efficiency values equal to unit. DMUs that lie under the production frontier are deemed to be relatively inefficient. And the DMUs which are near to production frontier have higher efficiency values than others.

Fig. 2
figure 2

Illustration of rank reversal in DEA

Assume that A, B, C, D, and E are five DMUs in group I who perform under production frontier I. Thus, the rank of efficiency for five DMUs is A = B = 1 > C > E > D. When some DMUs in group I are added, the production frontier of the remaining DMUs changes to frontier II. In this case, the rank of efficiency for five DMUs is B = E = 1 > C > A > D. Obviously, the phenomenon called rank reversal occurs. More specifically, for instance, E has lower efficiency than C under frontier I. However, under frontier II, the efficiency of E is equal to unit and higher than C. Different from E, A is efficient DMU which lies on frontier I, and the changing of frontier makes its efficiency even lower than C under frontier II. There are some other DMUs (like B) which always lie on production frontier in both cases. Based on above analysis, generally speaking, the increase of DMUs is likely to result in rank reversal to some DMUs by affecting the production frontier.

In order to investigate the rank reversal phenomenon in various DEA models, we define the difference between the rank of DMUk in group I and group II as rank reversal index (RRI), i.e.,

$$ RR{I}_k={R}_k\left(\mathrm{I}\right)-{R}_k\left(\mathrm{I}\mathrm{I}\right) $$
(3)

where k represents the kth DMU; Rk(I) and Rk(II) are the ranks of the kth DMU in group I and group II, respectively. The larger ∣RRIk∣ means the greater difference of rank for DMUk between group I and group II, and vice versa.

China’s regional energy efficiency analysis

In this section, we apply five different DEA models discussed in “Models and rank reversal issue” section to measure China’s regional energy efficiency. Along with the rapid development of the cross-strait trade scale and economic integration, putting Mainland China, Hong Kong, Macao, and Taiwan together when studying China’s regional energy efficiency issues becomes possible. It can help constructing a production frontier matching reality more. As mentioned above, previous studies usually adopt 30 provinces in Mainland China as DMU set, and in this case, some provinces like Beijing stay in first place for a long time. So no policy suggestions can be made for Beijing’s energy efficiency improvement, which is not helpful to the overall improvement of energy efficiency level of China. Inclusion of Hong Kong, Macao, and Taiwan into the DMU set may provide new benchmarks to these provinces and provide different policy recommendations.

To compare the efficiency results in different DMU sets, China’s regional data from 2000 to 2014 are adopted to construct the comparison groups I and II. Group I includes Hong Kong, Macao, Taiwan, and 30 regions in Mainland China, while group II only includes 30 regions in Mainland China as most previous studies did.

Data source

As for inputs, we take labor force and capital stock as non-energy inputs and total final energy consumption (TFEC) as the only energy input. For outputs, we choose gross domestic product (GDP) and CO2 emissions as desirable and undesirable outputs, respectively. The data on TFEC and CO2 emissions are collected from IEA, the data on labor force and GDP using PPP come from Word Bank, and the data on capital stock using PPP are from the Penn World Table (Version 9.0). Particularly, the data on labor force and capital stock of Taiwan are obtained from Chinese Taiwan Statistics; the data on CO2 emissions and TFEC of Macao come from Word Bank and China Energy Statistical Yearbook. The data of 30 regions in Mainland China are calculated according to their ratio of China’s total amount on the basis of data published by IEA or World Bank. Table 2 presents the summary statistics of all inputs/outputs in this study.

Table 2 Summary statistics of inputs/outputs for China’s regional dataset, 2000–2014

Energy efficiency results

Firstly, energy efficiency (EE) values of all regions in groups I and II using different DEA models in 2014 are shown in Table 3. From the perspective of models, it is found that EE values obtained from SBMT and DDF models are lower than those calculated by Radial, M-Radial, and RAM models. Besides, SBMT model has higher discrimination power since there are less efficient DMUs in this case. So we further explore China’s regional energy efficiency from 2000 to 2014 by applying SBMT model.

Table 3 EE of China’s regions in groups I/II with different DEA models, 2014

Figure 3 shows the change of energy efficiency over time in group I and group II, respectively. We find that the regional average EE values in group II are larger than those in group I during the observation period. It indicates that the addition of Hong Kong, Macao, and Taiwan leads to the shift of production frontier, and the regions in Mainland China in group I are farther away from the frontier than group II. In other words, although the energy efficiency of Mainland China seems barely satisfactory, it can be improved greatly compared with Hong Kong and Macao. Focusing on the variation of EE, it is found that the variation trend of group I and group II is basically the same. China’s energy efficiency decreased during the 10th Five Years Plan phase (2001–2005), and then increased from the beginning of the 11th Five Years Plan (2006) until to 2009. Difference appears after the year 2011. It can be interpreted that energy efficiency of Mainland China was going up in 2012 and 2013, but that of Hong Kong and Macao increased more. It suggests China’s government can make more effort to improve its regional energy efficiency in Mainland China to catching up Hong Kong and Macao.

Fig. 3
figure 3

Regional average energy efficiency values in China, 2000–2014

Shifting our interest to the regional differences, we divide the whole studying area into nine regions: Hong Kong, Macao, and Taiwan, northeast region, north coast region, east coast region, south coast region, Mid-Yellow River region, Mid-Yangtze River region, southwest region, and northwest region. Figure 4 presents the average EE values from 2000 to 2014 in nine regions mentioned above. It shows that energy efficiencies of Hong Kong, Macao, and Taiwan are much higher than those of other regions. In Mainland China, Mid-Yangtze River region performs the best while the northwest and Mid-Yellow River region are the worst. The northwest region includes Gansu, Qinghai, Ningxia, and Xinjiang; their discrepancy in the level of economic and technology development may be the main reasons of low energy efficiency. The Mid-Yellow River region includes Shannxi, Shanxi, Henan, and Inner Mongolia, and their developments all depend on the coal industry. It implies that China can pay more attention to northwest and Mid-Yellow River region to improve the regional energy efficiency throughout the country.

Fig. 4
figure 4

Average energy efficiency values in China’s nine economic regions, 2000–2014

Energy efficiency rank reversal

In this section, we demonstrate the phenomenon of rank reversal in DEA and further investigate the degree of rank reversal on five DEA models discussed in “Models and rank reversal issue” section. We first analyze rank reversal phenomenon in the year of 2014 in detail as it is the most recent year with complete data. After that, we choose a 15-year period (2000–2014) to get more information.

We sort 30 regions in Mainland China according to their EE values in group I and group II, respectively, where the region with the largest EE value ranks first. Table 4 shows their ranking and RRI values calculated using Eq. (3). The ranks of most regions change after adding Hong Kong, Macao, and Taiwan. Beijing has the highest |RRI| value by using Radial and M-Radial models, respectively. For DDF and RAM models, Tianjin and Jiangsu have the highest |RRI| value, respectively.

Table 4 Rankings of 30 regions in Mainland China in groups I/II, 2014

On the other hand, the highest |RRI| value for SBMT model is 3, which is much smaller than those for other DEA models. It indicates that addition of DMUs in SBMT model has less impact on the ranks of DMUs than in other DEA models. We also find that there are 25 regions with RRI values equal to 0 in SBMT model. It denotes that rank reversal does not occur in these regions after adding these three DMUs (Hong Kong, Macao, and Taiwan). Different from SBMT model, RAM model only have two regions whose RRI values equal to 0.

In order to investigate the degree of rank reversal in different models, Figs. 5 and 6 present the average |RRI| values of 30 regions in Mainland China by different types of models. From Fig. 5, SBMT model has the least rank reversal, followed by M-Radial and Radial model. The ranks of 30 regions in RAM model change most after adding Hong Kong, Macao, and Taiwan. Thus, we suggest using the SBMT model to get more stable policy recommendations on energy efficiency improvement.

Fig. 5
figure 5

Average |RRI| values of 30 regions in Mainland China by different DEA models, 2014

Fig. 6
figure 6

Bubble chart for RRI values of 30 regions in Mainland China by different DEA models, 2014. Note: AH (Anhui), BJ (Beijing), CQ (Chongqing), FJ (Fujian), GS (Gansu), GD (Guandong), GX (Guangxi), GZ (Guizhou), HN (Hainan), HeB (Hebei), HLJ (Heilongjiang), HeN (Henan), HuB (Hubei), HuN (Nunan), IM (Inner Mongolia), JS (Jiangsu), JX (Jiangxi), JL (Jilin), LN (Liaoning), NX (Ningxia), QH (Qinghai), SaX (Shaanxi), SD (Shandong), SH (Shanghai), SX (Shanxi), SC (Sichuan), TJ (Tianjin), XJ (Xinjiang), and ZJ (Zhejiang)

Bubble charts for RRI values of 30 regions by DEA models are shown in Fig. 6 to illustrate the phenomenon of rank reversal visually. The size of bubbles represents |RRI| value—larger bubbles imply larger |RRI| values. Bubbles in color denote positive RRI values and those in white denote negative RRI values.

Firstly, from the viewpoint of horizontal axis, it is apparent that |RRI| values in RAM model and DDF model are larger than those in other DEA models as a whole. It implies that the ranks of 30 regions change most in RAM model and DDF model. SBMT is proved to show the least rank reversal according to its smaller |RRI| values for all the 30 regions. As for Radial model and M-Radial model, significant differences for ranks between groups I and II are mainly observed in four regions, i.e., Beijing, Guangxi, Heilongjiang, and Shanghai.

Secondly, from the viewpoint of vertical axis, we observe that some regions always have positive changes by applying five DEA models, such as Heilongjiang and Shanghai. On the contrary, some regions always present negative changes in any types of DEA models, for example, Guizhou, Inner Mongolia, Qinghai, and Xinjiang.

To verify the stability of SBMT model on rank reversal, we show the regional RRI values of SBMT model from 2000 to 2014 in Table 5. We find that the degree of rank reversal has been small during the 15 years period excluding Zhejiang in 2000 and Yunnan in 2009. Even though, rank reversal occurs in some regions. Take Beijing as an example, its RRI values of 2007 to 2014 are all positive, which means that the rankings of Beijing decline after adding Hong Kong, Macao, and Taiwan. It implies that Beijing has more energy efficiency improvement potential when including Hong Kong, Macao, and Taiwan into the DMUs.

Table 5 RRI values of 30 regions in Mainland China by SBMT model, 2000–2014

Discussions and conclusion

Along with the Belt and Road strategy initiated by the Chinese government, taking Hong Kong, Macao, and Taiwan into consideration is reasonable and necessary when exploring China’s regional energy efficiency. However, it is noticed that no matter what type of DEA model is adopted, there is a possible change of regional rankings in Mainland China after adding Hong Kong, Macao, and Taiwan. This phenomenon is called rank reversal which means the relative rankings of two DMUs can be reversed when one or more units are added. This paper focuses on the rank reversal in China’s regional energy efficiency ranking after adding Hong Kong, Macao and Taiwan into the DMUs set.

Based on the principle of rank reversal in DEA in theory illustrated in this paper, rank reversal in DEA is caused by the change of the position for production frontier when some DMUs are added. The most popular five DEA models in efficiency measurement, including Radial model, M-Radial model, SBMT model, DDF model, and RAM model, are applied to assess China’s regional (30 regions and Hong Kong, Macao, and Taiwan) energy efficiency. The degrees of rank reversal in five different DEA models are investigated using China’s regional data. Among these models, SBMT model shows the least rank reversal when measuring energy efficiency for 30 regions in Mainland China.

According to our study, rank reversal is an important factor to be considered for energy efficiency evaluation using DEA models. The policy suggestions based on the ranking of DMUs are subject to their pre-defined production frontier. When some regions without data before (or are excluded for some other reasons) are added, policy suggestions may change due to rank reversal. Taking Beijing as an example, according to the results obtained from Radial model, Beijing’s EE ranking drops from no. 1 to no. 27 among the 30 regions in Mainland China after taking Hong Kong, Macao, and Taiwan into consideration in year 2014. Obviously, rank reversal can result in different policy implications for Beijing’s future energy development.

The influence of rank reversal on policy makings can be further expanded from China’s regional study to other countries and regions’ study. For instance, EU as an important economy which is worth studying, its members have changed over time. The policy implications proposed based on DEA model which has less rank reversal will be more robust. Besides, the regions in China can be further compared with some other developing countries along the Belt and Road initiative, such as the Southeast Asian Countries.

Rank reversal which reflects the robustness of a DEA model should be gained more attention in the future studies. Besides energy efficiency assessment, rank reversal issues can also happen in other efficiency assessment using DEA, such as the CO2 emission and pollution efficiency. This is an important aspect that should be considered when selecting DEA models for efficiency assessment. Theoretically, the ideal DEA ranking model shall not change the ranks of other DMUs after the addition or removal of one or more DMUs. The development of such DEA model with rank preservation would be a meaningful topic for future research.