1 Introduction

With the continuing threats of global climate change, resource exhaustion and ecological disruption, sustainable development (SD) has received unprecedented attention from the international community. The concept of SD emerged in 1987 with the publication of the World Commission on Environment and Development (WCED), Our Common Future (commonly referred to as the Brundtland Report) (WCED 1987), which became a primary driver for the United Nations Conference on Environment and Development (UNCED) (also known as the Rio Earth Summit) in 1992 and the United Nations Conference on Sustainable Development (UNCSD) (commonly called Rio + 20 Earth Summit) in 2012. To date, SD has been recognized as an overarching principle for long-term global development, and many countries have also placed their focuses on planning concrete policies to embody the core of SD and thereby create a better society (Rondinelli and Berry 2000).

SD consists of three essential dimensions, namely, economic, environmental and social, which are the so-called “three pillars of sustainability” or the “triple bottom line” (Elkington 1998; Holling 2001). As an instrument that harmonises economic and environmental dimensions, eco-efficiency has been launched with the aim of creating more value with less environmental (ecological) impact via the implementation of SD (WBCSD 2005). Specifically, eco-efficiency can be formulised as a ratio of economic output to environmental impact in which the indicators gross domestic product (GDP), value of products/services, sales and value added are general proxies for economic output while the indicators energy, material and water consumption; greenhouse gas (GHG) emissions; and waste pollution are general proxies for environmental impact (Schaltegger and Sturm 1990; Michelsen et al. 2006; Wursthorn et al. 2011; Wang et al. 2015). Although eco-efficiency might guide a practical solution for SD, eco-efficiency analyses do not guarantee sustainability because they cover only economic and environmental dimensions (Bruni et al. 2011; Charmondusit et al. 2014). A broadened eco-efficiency analysis that incorporates the social dimension was recently recommended to thoroughly achieve SD core concepts via benchmarking (Tatari et al. 2016; Caiado et al. 2017; Nissi and Sarra 2018).

Developing a robust SD measure composed of multiple indicators related to all three dimensions is of vital importance for achieving improvements towards sustainability (Jollands et al. 2004; Strezov et al. 2017). In order to address this issue, data envelopment analysis (DEA), introduced by Charnes et al. (1978) based on the earlier work of Farrell (1957), has received wide recognition. DEA is a widely used non-parametric approach to assessing the relative efficiencies of a set of homogenous decision-making units (DMUs) that consume multiple inputs to produce multiple outputs. The important feature of DEA is that it not only objectively provides a composite indicator to ascertain the efficiency of each DMU but also offers benchmarking information to elucidate strategies for improving the efficiency of inefficient DMUs. Research in the SD fields has witnessed a great acceptance of the DEA (Zhou et al. 2018). For example, Bruni et al. (2011) employed DEA to benchmark the SD performance of 20 Italian regions in which several indicators involving energy consumption, GDP, carbon dioxide (CO2) emissions and the poverty rate were considered. Lee and Farzipoor Saen (2012) advanced the measurement of corporate sustainability management performance in the Korean electronics industry using DEA. Yin et al. (2014) focused on the topic of urban sustainability and applied DEA to investigate the SD performance of 30 provincial capital cities in China. As for the electricity industry, Tajbakhsh and Hassini (2018) developed a comprehensive measure for sustainability performance of fossil-fuel power plants in the U.S.

Generally, DEA has two fundamental types of models, namely, radial and non-radial. Classic DEA models, such as CCR (Charnes–Cooper–Rhodes) (Charnes et al. 1978) and BCC (Barker–Charnes–Cooper) (Banker et al. 1984), are referred to as radial models and mainly deal with changes in either inputs or outputs in a proportional way, whereas the additive DEA (Charnes et al. 1985), the enhanced Russell measure (Pastor et al. 1999) and the slacks-based measure (Tone 2001) are well-known non-radial models that allow changes in both inputs and outputs in different proportions. Moreover, some extensions of DEA, which allow for both input contraction and output expansion in the direction of a given input–output vector, have been developed in accordance with the directional distance function (DDF) presented by Chambers et al. (1996). From this perspective, existing DEA variants can be clearly partitioned in the direct approach (using DDF) and the indirect approach (not using DDF) (Zanella et al. 2015). Here, I build on the additive DEA which is considered an indirect approach, and devote into its possible reinforcement to regional SD applications. The additive DEA model has a greater discriminatory power and excels in identifying the sources of inefficiency and yielding efficiency scores that are coherent with the Pareto–Koopmans optimality notion, which may be appropriate for the current study (Zhou et al. 2007; Du et al. 2010). However, the additive DEA model may generate an unconvincing projection (or benchmarking target) that is far away from the DMU under evaluation. This imperfection stems from the fact that the model attempts to maximize the potential contraction of inputs and expansion of outputs. Additionally, the other notable defect of the additive DEA model lies in the fact that it does not consider undesirable outputs.

To address the aforementioned gap of additive DEA and focus on the holistic assessment of regional SD, this study has two main objectives. First, this study proposes a minimum distance-based additive DEA model with window analysis that accommodates the undesirable outputs and provides the closest benchmarking target for each considered DMU. The second objective is to evaluate the longitudinal SD efficiency across 19 administrative regions of Taiwan over a period from 2011 to 2016, aiming to provide local governments with directions of improvement towards a more sustainable society. Taiwan is a mountainous island in East Asia and holds a population of approximately 23 million spread across a total land area of nearly 36,000 km2; it is one of the most densely populated places in the world and has very limited natural resources. Since the 1960s, Taiwan has experienced rapid economic growth along with substantial progress in industrialisation. This trend has consequently resulted in high energy consumption and impacts to the diversified ecology, natural environment and quality of human life. The Executive Yuan, which is the executive branch of the central government of Taiwan, organised the National Council for Sustainable Development (NCSD) in 1997 to combat these obstacles to SD and spearhead national SD by formulating regulations for the Sustainable Development Action Plan (NCSD 2009). Currently, a primary task of the NCSD in Taiwan is to achieve international SD standards, especially the 17 Sustainable Development Goals (SDGs) of the 2030 Agenda announced by the United Nations (UN 2015). However, the success of SD promotion is highly dependent on the conjunction of SD implementation in cities and counties since these subnational scales act as key pioneers in localizing sustainability (Gibbs 1998; Herrera-Ulloa et al. 2003; Lebel et al. 2006). Thus, developing a method of properly assessing regional SD performance to promote sustainability among cities and counties has become one of the most crucial topics in Taiwan. The benchmarking results obtained in the present research framework not only enable local governments in cities and counties to clearly understand the current state of SD promotion from an efficiency perspective but also offer potential directions to guide the development of appropriate policies for improvements that contribute to localize and realize the goal of SD.

The rest of this paper is arranged as follows. The next section reviews the radial and additive DEA models. Section 3 introduces the proposed additive model and explains how it can be adopted herein to carry out longitudinal SD evaluation for the subsequent study. The research sample and data collection are presented in Sect. 4. Section 5 elaborates the empirical results and discussions with respect to regional SD efficiency in Taiwan. Finally, concluding remarks are drawn in the last section.

2 Data Envelopment Analysis

The primary feature of DEA is the proverbial capability to compose an objective indicator for each DMU having multiple inputs and multiple outputs. This indicator, termed as efficiency, represents the DMU’s distance to an efficient frontier and can be assessed at industrial (Belu 2009; Sun and Stuebs 2013; Mahdiloo et al. 2015; Wang et al. 2016), regional (Liu et al. 2010; González et al. 2011; Yin et al. 2014; Carboni and Russu 2015), and national levels (Zhou et al. 2010; Wu et al. 2014a, b; Rashidi and Farzipoor Saen 2015; Gómez-Calvet et al. 2016). The efficient frontier acts as the boundary of the Production Possibility Set (PPS) spanned by all observed DMUs. This section briefly reviews radial and additive DEA models and explains how they perform the task of performance evaluation.

2.1 Radial DEA Model

Suppose that there are n homogenous DMUs, each of which consumes m inputs to produce s outputs. For the jth DMU, j = 1,…,n, let xij and yrj denote the ith input, i = 1,…,m, and rth output, r = 1,…,s, respectively. The radial CCR model, which was originally proposed by Charnes et al. (1978), for evaluating the input-oriented efficiency of DMUo under constant returns to scale (CRS) is given as follows.

$$\begin{aligned} & \hbox{min} \quad \theta_{o} \\ & {\text{s}} . {\text{t}} . \\ & \mathop \sum \limits_{j = 1}^{n} \lambda_{j} x_{ij} \leq x_{io} \theta_{o} \quad i = 1, \ldots ,m \\ & \mathop \sum \limits_{j = 1}^{n} \lambda_{j} y_{rj} \geq y_{ro} \quad\quad r = 1, \ldots ,s \\ & \lambda_{j} \ge 0\qquad\quad\qquad j = 1, \ldots ,n \\ \end{aligned}$$
(1)

where λj is a non-negative variable representing the intensity of the jth DMU in forming the efficient frontier and θo represents a proportional scale for the inputs contraction of a DMUo. Denote the optimality of model (1) by superscript “*”, and the efficiency of DMUo can be defined as \(\theta^{*}_{o}\) which ranges from 0 to 1. Obviously, if \(\theta^{*}_{o}\) = 1, then DMUo is efficient and lies on the efficient frontier; otherwise, it is inefficient and is at a distance from the efficient frontier. Model (1) can be extended to meet with variable returns to scale (VRS) assumption (known as the BCC model) by imposing the constraint \(\sum \limits_{j = 1}^{n} \lambda_{j} = 1\)(Banker et al. 1984).

One should note that a DMUo(xioyro) with \(\theta^{*}_{o} = 1\) may be weakly efficient, i.e., there exists \((- x_{i}^{{\prime }} ,y_{r}^{{\prime }} ) \in {\text{PPS}} \ge (- x_{io}, y_{ro})\, \text {and} \,(x_{i}^{\prime }, y_{r}^{\prime }) \neq (x_{io}, y_{ro})\). Moreover, for inefficient DMUs, the suggested improvements to reach the efficient frontier, i.e., contraction on all inputs, rely on a unified scale \(1 - \theta^{*}_{o}\). In addition to input-oriented model (1), there is also output-oriented model that mainly deals with the expansion of outputs [for more details please refer to Charnes et al. (1978)]. The efficiency with such radial measures, however, can be assessed only from either input-oriented or output-oriented sides.

2.2 Additive DEA model

These problems can be solved by taking non-radial slacks into account. Charnes et al. (1985) developed the following additive DEA model to simultaneously tackle both input and output slacks, such that the inputs and outputs of a DMUo can be contracted and expanded in different proportions.

$$\begin{aligned} & \hbox{max} \quad \mathop \sum \limits_{i = 1}^{m} s_{io}^{ - } + \mathop \sum \limits_{r = 1}^{s} s_{ro}^{ + } \\ & {\text{s}} . {\text{t}} .\\ & \mathop \sum \limits_{j = 1}^{n} \lambda_{j} x_{ij} = x_{io} - s_{io}^{ - } \qquad i = 1, \ldots ,m \\ & \mathop \sum \limits_{j = 1}^{n} \lambda_{j} y_{rj} = y_{ro} + s_{ro}^{ + } \qquad r = 1, \ldots ,s \\ & \lambda_{j} \ge 0,s_{io}^{ - } \ge 0,s_{ro}^{ + } \ge 0\quad j = 1, \ldots ,n \\ \end{aligned}$$
(2)

where \(s^{-}_{io}\) and \(s^{+}_{ro}\) are the non-radial slacks for DMUo’s ith input and rth output, which represent the excess amounts of the ith input and the shortage amounts of the rth output compared with the efficient frontier, respectively. If in optimal solution of model (2) for DMUo one has that \(s^{{-}^{*}}_{io}=0\) and \(s^{{+}^{*}}_{ro}=0\) for all i and r, then DMUo is efficient; otherwise, it is inefficient and has the room on \(s^{{-}^{*}}_{io}\neq0\) and/or \(s^{{-}^{*}}_{ro}\neq0\) should be adjusted. The benchmarking target (\(\bar{x}_{io} , \bar{y}_{ro}\)) of DMUo on the efficient frontier can be determined by (\(x_{io} + s_{io}^{ - *},y_{ro} + s_{ro}^{ + *}\)) or \(\left( { \sum\nolimits_{j = 1}^{n} {\lambda_{j}^{*} x_{ij} } ,\sum\nolimits_{j = 1}^{n} {\lambda_{j}^{*} y_{rj} } } \right).\) The reference set is given as \(B_{o} = \left\{ {{\text{DMU}}_{j} |\lambda_{j}^{*} > 0,\quad j = 1,2, \ldots ,n} \right\}\), indicating which DMUj serves as a benchmark for inefficient DMUs.

The optimal objective value of model (2) is a sum of input and output slacks rather than an explicit efficiency measure. For DMUo, the additive efficiency \(\rho_{o}^{*}\) can be defined as follows (Tone 2001).

$$\rho_{o}^{*} = \frac{{1 - \frac{1}{m}\mathop \sum \nolimits_{i = 1}^{m} s_{io}^{ - *} /x_{io} }}{{1 + \frac{1}{r}\mathop \sum \nolimits_{r = 1}^{s} s_{ro}^{ + *} /y_{ro} }}$$
(3)

Notably, the \(\rho_{o}^{*}\) ranges between 0 and 1 and is unit-invariant. Moreover, \(\rho_{o}^{*}\) is monotonically decreasing with respect to \(s^{{-}^{*}}_{io}=0\) and \(s^{{-}^{*}}_{ro}=0\); that is, a larger value of \(\rho_{o}^{*}\) indicates a better performance. An efficient DMUo with \(\rho_{o}^{*}=1\) is strongly Pareto-efficient, which is achieved if and only if \(s^{{-}^{*}}_{io}=0\) and \(s^{{-}^{*}}_{ro}=0\).

3 A minimum distance-based additive model with undesirable outputs

The general efficiency criterion of production processes associated with the aforementioned models is a matter of how DMUs consume as less amounts of each input as possible to produce as more amounts of each output as possible. However, in many real-world situations, undesirable outputs would inescapably appear as by-products accompanied with desirable outputs (Scheel 2001; Seiford and Zhu 2002; Zhou et al. 2007). Therefore, measuring efficiency in the presence of undesirable outputs has attracted considerable attention in the literature, especially in the fields of energy, ecology and environment (Tyteca 1996; Chung et al. 1997; Lozano and Gutierrez 2008). In this section, the additive model (2) was first extended to incorporate undesirable outputs, and then a novel minimum distance-based additive model under the window analysis procedure was proposed to find the closest benchmarking targets and perform a longitudinal efficiency assessment.

3.1 Incorporating undesirable outputs into additive DEA

Different models have been proposed to handle the undesirable outputs under the strongly disposable assumption. One method applies a non-linear monotonic decreasing transformation that takes the multiplicative inverse of the undesirable outputs, i.e., f(U) = 1/U (Golany and Roll 1989; Lovell et al. 1995). Another is built on linear monotonic decreasing transformation, which adds absolute constants to the opposite of undesirable outputs, i.e., f(U) = − U + B (Ali and Seiford 1990; Scheel 2001; Seiford and Zhu 2002). The other one, suggested by Reinhard et al. (1999), is to treat undesirable outputs as inputs, and it has received considerable acceptance in recent studies (e.g., Hailu and Veeman 2001; Yang and Pollitt 2009; Mahlberg and Sahoo 2011; Sahoo et al. 2011; Wu et al. 2014a, b). It should be noted that these methods are regarded as indirect approaches (Zanella et al. 2015). As for direct approaches, Chung et al. (1997) employed DDF to simultaneously deal with the joint production of desirable and undesirable outputs. Despite the increased popularity for the adoption of Chung et al.’s (1997) method, it remains difficult to explicitly assign the justified direction vector (Wang et al. 2017).

This study adopts the method of considering undesirable output as inputs since it is readily understood and does not require actual data transformation. Assume that each DMUj, j = 1,2,…,n, consumes m inputs xij, i = 1,…,m, to produce q1 desirable outputs \(y_{{r_{1} j}}^{g}\), r1 = 1,…,q1, along with q2 undesirable outputs \(y_{{r_{2} j}}^{b}\), r2 = 1,…,q2. The additive model that can deal with undesirable outputs is given as follows.

$$\begin{aligned} & { \hbox{max} }\quad \mathop \sum \limits_{i = 1}^{m} s_{io}^{ - } + \mathop \sum \limits_{{r_{1} = 1}}^{{q_{1} }} s_{{r_{1} o}}^{ + g} + \mathop \sum \limits_{{r_{2} = 1}}^{{q_{2} }} s_{{r_{2} o}}^{ + b} \\ & {\text{s}}.{\text{t}}.\\ & \mathop \sum \limits_{j = 1}^{n} \lambda_{j} x_{ij} = x_{io} - s_{io}^{ - } \qquad\qquad\qquad i = 1, \ldots ,m \\ & \mathop \sum \limits_{j = 1}^{n} \lambda_{j} y_{{r_{1} j}}^{g} = y_{{r_{1} o}}^{g} + s_{{r_{1} o}}^{ + g} \qquad\qquad\quad r_{1} = 1, \ldots ,q_{1} \\ & \mathop \sum \limits_{j = 1}^{n} \lambda_{j} y_{{r_{2} j}}^{b} = y_{{r_{2} o}}^{b} - s_{{r_{2} o}}^{ + b} \qquad\qquad\quad r_{2} = 1, \ldots ,q_{2} \\ & \lambda_{j} \ge 0,s_{io}^{ - } \ge 0,s_{{r_{1} o}}^{ + g} \ge 0,s_{{r_{2} o}}^{ + b} \ge 0\quad j = 1, \ldots ,n \\ \end{aligned}$$
(4)

where \(s^{-}_{io}\)\(s_{{r_{1} o}}^{ + g}\) and \(s_{{r_{2} o}}^{ + b}\) represent the excess for ith input, shortage for r1th desirable output and overproduction for r2th undesirable output of the DMUo, respectively. By adding the convexity constraint ∑  n j=1 λj = 1, the additive model under the VRS assumption is achieved. As in model (4), the undesirable outputs hold the same property as the inputs, i.e., the less the better, and both s io and \(s_{{r_{2} o}}^{ + b}\) represent how much of the adjusted amount should be decreased to improve DMUo’s performance status. Thus, for DMUo, the benchmarking target \(\left( {\bar{x}_{io} , \bar{y}_{{r_{1} o}}^{g} , \bar{y}_{{r_{2} o}}^{b} } \right)\) can be defined as (\(x_{{io}} - s_{{io}}^{-*} ,y_{{r_{1} o}}^{g} + s_{{r_{1} o}}^{ + g*} , y_{{r_{2} o}}^{b} - s_{{r_{2} o}}^{ + b*}\)).

3.2 Closest benchmarking target and minimum distance-based measure

As model (4) maximizes the sum of s io , \(s_{{r_{1} o}}^{ + g}\) and \(s_{{r_{2} o}}^{ + b}\), the obtained benchmarking target \(\left( {\bar{x}_{io} , \bar{y}_{{r_{1} o}}^{g} , \bar{y}_{{r_{2} o}}^{b} } \right)\) is the farthest one and may not be sufficiently convincing to serve as a reference for inefficient DMUs. To process a more practical assessment by finding the closest benchmarking target, the maximization may be replaced by a minimization for the objective function; however, the resultant model is unbounded if one does so.

Previous work conducted by Aparicio et al. (2007) found that the closest benchmarking target on the efficient frontier for DMUo can be sought by embedding the linear combination of extremely efficient DMUs from the envelopment form and the weights regarding inputs and outputs from the multiplier form into constraints. Geometrically, extremely efficient DMUs are vertices of the facets of the efficient frontier, and each is not a linear combination of the other DMUs (Sueyoshi and Sekitani 2007). As reported in Aparicio et al. (2007), the idea of similarity between evaluated DMUo and benchmarking target can be achieved by different criteria, including distance and efficiency measure, thereby conceiving several possible variants. For example, to generate closest targets, the L1-distance, L2-distance and L-distance can be adopted. Otherwise, if the efficiency measure criterion is used, models with the maximization of efficiency measure at the objective function such as the range adjusted measure (Cooper et al. 1999), the enhanced Russell measure (Pastor et al. 1999), and the slacks-based measure (Tone 2001) can be also formulated.

Let \(E = \left\{ {{\text{DMU}}_{k} \in J|s_{io}^{ - *} ,\;s_{{r_{1} o}}^{ + g*} ,\;s_{{r_{2} o}}^{ + b*} = 0,\;\lambda_{k}^{*} = 1,\lambda_{j}^{*} = 0, \forall j \ne k} \right\}\) denote the set of extremely efficient DMUs as a subset of a whole DMU set J ∊ {1, …, n} using the optimal solution for the model (4) (Charnes et al. 1986, 1991). Based on the mDD model of Aparicio et al. (2007), this study proposes the minimum distance-based additive DEA model as a form of 0–1 mixed integer linear programming (MILP) to find the closest benchmarking targets in the presence of undesirable outputs, expressed as follows:

$$\begin{aligned} & { \hbox{min} }\quad \mathop \sum \limits_{i = 1}^{m} s_{io}^{ - } + \mathop \sum \limits_{{r_{1} = 1}}^{{q_{1} }} s_{{r_{1} o}}^{ + g} + \mathop \sum \limits_{{r_{2} = 1}}^{{q_{2} }} s_{{r_{2} o}}^{ + b} \\ & {\text{s}}.{\text{t}}. \\ & \mathop \sum \limits_{j \in E} \lambda_{j} x_{ij} = x_{io} - s_{io}^{ - } \qquad\qquad\qquad i = 1, \ldots ,m \\ & \mathop \sum \limits_{j \in E} \lambda_{j} y_{{r_{1} j}}^{g} = y_{{r_{1} o}}^{g} + s_{{r_{1} o}}^{ + g} \quad\qquad\qquad r_{1} = 1, \ldots ,q_{1} \\ & \mathop \sum \limits_{j \in E} \lambda_{j} y_{{r_{2} j}}^{b} = y_{{r_{2} o}}^{b} - s_{{r_{2} o}}^{ + b} \quad\qquad\qquad r_{2} = 1, \ldots ,q_{2} \\ & - \;\mathop \sum \limits_{i = 1}^{m} v_{i} x_{ij} + \mathop \sum \limits_{{r_{1} = 1}}^{{q_{1} }} u_{{r_{1} }} y_{{r_{1} j}}^{g} - \mathop \sum \limits_{{r_{2} = 1}}^{{q_{2} }} z_{{r_{2} }} y_{{r_{2} o}}^{b} + d_{j} = 0 \\ & v_{i} \ge 1,u_{{r_{1} }} \ge 1,z_{{r_{2} }} \ge 1\quad\qquad\qquad j \in E \\ & d_{j} \le {\text{M}}b_{j} \\ & \lambda_{j} \le {\text{M}}\left( {1 - b_{j} } \right) \\ & b_{j} \in \left\{ {0,1} \right\} \\ & \lambda_{j} \ge 0,\;d_{j} \ge 0,\;s_{io}^{ - } \ge 0,\;s_{{r_{1} o}}^{ + g} \ge 0,\;s_{{r_{2} o}}^{ + b} \ge 0 \\ \end{aligned}$$
(5)

where s io , \(s_{{r_{1} o}}^{ + g}\) and \(s_{{r_{2} o}}^{ + b}\) are the non-radial slacks; vi, \(u_{{r_{1} }}\) and \(z_{{r_{2} }}\) are the multipliers, corresponding to the ith input, the r1th desirable output and the r2th undesirable output, respectively; M is a large enough positive number; and bj is the binary integer variable. As shown in model (5), the objective function inherits the concept of model (4) but attempts to minimize rather than maximize the sum of non-radial slacks with the L1 norm. To ensure unit invariance, one possibility is to assign 1/xio, \(1/y_{{r_{1} o}}^{g} ,1/y_{{r_{2} o}}^{b}\) as the weights for the corresponding slacks in the objective function, i.e., \(\sum\nolimits_{i = 1}^{m} {s_{io}^{ - } /x_{io} } + \sum\nolimits_{{r_{1} = 1}}^{{q_{2} }} {s_{{r_{1} o}}^{ + g} /y_{{r_{1} o}}^{g} } + \sum\nolimits_{{r_{2} = 1}}^{{q_{2} }} {s_{{r_{2} o}}^{ + b} /y_{{r_{2} o}}^{b} }\) (Lovell and Pastor 1995; Pastor et al. 1999; Tone 2001). On the constraint side, \(b_{j}^{*}=0\) is embedded as a switch to determine whether DMUj acts as a benchmark for DMUo. If \(b_{j}^{*}=0\), then \(d_{j}^{*}=0\) and \(lambda_{j}^{*}>0\), which implies that DMUj belongs to hyperplane \(- \mathop \sum \nolimits_{i = 1}^{m} v_{i} x_{ij} + \mathop \sum \nolimits_{{r_{1} = 1}}^{{q_{1} }} u_{{r_{1} }} y_{{r_{1} j}}^{g} - \mathop \sum \nolimits_{{r_{2} = 1}}^{{q_{2} }} z_{{r_{2} }} y_{{r_{2} o}}^{b} = 0\) and is the benchmark for DMUo; conversely, if \(b_{j}^{*}=1\), then \(\lambda_{j}^{*}=0\) and \(d_{j}^{*}\geq0\), which implies that DMUj does not act as a benchmark for DMUo. Moreover, by adding the constraint \(\sum \nolimits_{{j = 1}}^{{n }}\lambda_{j}=1\) and converting the fourth constraint into \(- \mathop \sum \nolimits_{i = 1}^{m} v_{i} x_{ij} + \mathop \sum \nolimits_{{r_{1} = 1}}^{{q_{1} }} u_{{r_{1} }} y_{{r_{1} j}}^{g} - \mathop \sum \nolimits_{{r_{2} = 1}}^{{q_{2} }} z_{{r_{2} }} y_{{r_{2} o}}^{b} - \alpha + d_{j} = 0\), model (5) can be adapted to the case of VRS.

According to the optimal solution \(\left( {s_{io}^{ - *} ,s_{{r_{1} o}}^{ + g*} ,s_{{r_{2} o}}^{ + b*} } \right)\) to model (5), the minimum distance-based efficiency \(\delta_{o}^{*}\) for DMUo can be formulated as follows (Cooper et al. 2007):

$$\delta_{o}^{*} = \frac{{1 - \frac{1}{m}\mathop \sum \nolimits_{i = 1}^{m} s_{io}^{ - *} /x_{io} }}{{1 + \frac{1}{{q_{1} + q_{2} }}\left( {\mathop \sum \nolimits_{{r_{1} = 1}}^{{q_{1} }} s_{{r_{1} o}}^{ + g*} /y_{{r_{1} o}}^{g} + \mathop \sum \nolimits_{{r_{2} = 1}}^{{q_{2} }} s_{{r_{2} o}}^{ + b*} /y_{{r_{2} o}}^{b} } \right)}}$$
(6)

Here \(\delta_{o}^{*}\) is considered as a composite measure ranging from 0 to 1 that indicates how each administrative region of the sampled data performs in terms of SD efficiency. An administrative region is SD efficient if and only if it holds \(\delta_{o}^{*}=1\), equivalently \(\left( {s_{io}^{ - *} ,s_{{r_{1} o}}^{ + g*} ,s_{{r_{2} o}}^{ + b*} } \right) = 0.\) By contrast, an administrative region performs inefficiently when δ * o  < 1 and \(\left( {s_{io}^{ - *} ,s_{{r_{1} o}}^{ + g*} ,s_{{r_{2} o}}^{ + b*} } \right) \ne 0.\)

In what follows, a simple example covering five DMUs, i.e., A, B, C, D and E, is applied to visualize the favorable specialty of the model (5). Each DMU uses one input x to produce two outputs, desirable yg and undesirable yb, where the amount of input x is unitized to 1 for simplicity. Figure 1 depicts the PPS spanned by these DMUs and shows the relevant results of models (4) and (5). It can be seen that DMUs A, B, C lie on the frontier; thus, they are efficient with optimal slacks \(s_{{r_{1} o}}^{ + g*} ,s_{{r_{2} o}}^{ + b*} = 0\), whereas the remaining two DMUs D and E are enveloped by the frontier, indicating that they are inefficient and have non-zero optimal slacks on \(s_{{r_{2} o}}^{ + b*}\) or \(s_{{r_{1} o}}^{ + g*}\) and should be further improved. If model (4) is used, for DMU D, the optimal slacks are \(s_{{r_{2} o}}^{ + b*} = 4\) and \(s_{{r_{1} o}}^{ + g*} = 2\); for DMU E, the optimal slacks are \(s_{{r_{2} o}}^{ + b*} = 0\) and \(s_{{r_{1} o}}^{ + g*} = 5\); DMU B (4,7) is determined as the benchmarking target. However, DMU B (4,7) is actually placed far from where the inefficient DMUs lie. Moreover, the proposed model (5) is competent to provide inefficient DMUs with more realistic benchmarking targets that are closer than those yielded by model (4). By using model (5), DMU D obtains optimal slacks \(s_{{r_{2} o}}^{ + b*} = 0\) and \(s_{{r_{1} o}}^{ + g*} = 3.6\), and DMU E obtains optimal slacks \(s_{{r_{2} o}}^{ + b*} = 3\) and \(s_{{r_{1} o}}^{ + g*} = 1\). The identified benchmarking targets of DMUs D and E are (8, 8.6) and (1, 3), respectively.

Fig. 1
figure 1

The differences on benchmarking targets between models (4) and (5)

It should be noted that, in a similar fashion as explained above, incorporating undesirable outputs into other variants with different criteria is also feasible for generating the closest benchmarking targets. Based on the slacks-based measure, Wang et al. (2013a, b) put forward a linear bilevel programming problem (called mSBM) that takes the undesirable outputs into account. Moreover, An et al. (2015) incorporated the undesirable outputs into the conventional enhanced Russell measure, and then proposed its extension to obtain the minimum distance of a DMUo to the efficient frontier. A similar study built on the additive measure has been made by Xiong et al. (2017), who determined the closest benchmarking targets in the presence of undesirable outputs by developing a closest target model that uses the weighted operator 1/(m + q2) for input and undesirable output slacks and the weighted operator 1/q1 for desirable output slacks in the objective function. This model can be regarded as an extension under the framework of the additive measure, but its initial form is not consistent with the standard additive model of Charnes et al. (1985) and it would not inherit the optimality of the mDD model of Aparicio et al. (2007) if only inputs and desirable outputs are considered. Regarding computation, such a way of embedding weighted operators in the objective function may also affect the optimality of slack variables.

Although the above models provide slightly different results, they are all conducive to globally lessening the differences between the values of the input, the desirable output and the undesirable output of the DMUo and those of the benchmarking target. Despite the merit in generating the closest benchmarking targets, it should be noted that the aforementioned models perform the efficiency analysis in static situations only and can be further extended to accommodate time-varying data.

3.3 Window analysis for time-varying data

As efficiency varies with time, a general static analysis may be insufficient for time-dependent situations. The window analysis put forward by Charnes et al. (1984) is one of the most prevalent approaches for addressing cross sectional and longitudinal data (Wang et al. 2013a, b). It is not only useful for exploring the changes in efficiency of each DMU over time, but also helpful in enhancing the discriminatory power when cases with small sample sizes and a large number of variables occur (Ross and Droge 2002; Asmild et al. 2004; Sueyoshi et al. 2013).

This study combines the proposed model (5) with the window analysis procedure. Built on the notion of the moving average, window analysis treats each DMU in different periods as an independent unit, and divides time periods into a sequence of overlapping windows. Let us consider n DMUs over T periods, t = 1,…,T, with widow width w (1 ≤ w ≤ T). A total of T − w + 1 windows exist, and each window has n × w DMUs under evaluation of the model (5). The first window covers DMUs from periods 1,…,w, the second window covers DMUs from periods 2,…,w + 1, and so on. In this manner, each DMU in a period of a window can compare its own performance in other periods as well as the performance of other DMUs in the same and other periods.

4 Research sample and data collection

A total of 19 administrative regions in Taiwan, including 9 cities and 10 counties, over the period from 2011 to 2016 were sampled in this study. According to the Comprehensive Development Planning reported by the Council for Economic Planning and Development (CEPD), Executive Yuan, Taiwan, these cities and counties can be classified into four areas, namely, the northern, central, southern, and eastern areas. Figure 2 illustrates detailed information on the spatial distribution of these areas. The northern area consists of five cities, i.e., New Taipei, Taipei, Taoyuan, Hsinchu, and Keelung, and two counties, i.e., Yilan and Hsinchu. Taichung City and four counties, including Miaoli, Changhua, Yunlin, and Nantou, constitute the central area. The southern area is composed of Chiayi, Tainan and Kaohsiung cities as well as Chiayi and Pingtung counties. Hualien and Taitung counties group the eastern area.

Fig. 2
figure 2

Study area

The economic development and rapid industrialization of Taiwan heavily depends on the consumption of natural resources. However, Taiwan contains scarce indigenous natural resources. More than 98% of its energy resources, which are mostly in the form of fossil fuels including petroleum, coal, and natural gas, have long relied on foreign imports to supply the vast majority of energy needs. In addition, the shortage of water resources is another problem. Although Taiwan receives an average of approximately 2500 mm in annual rainfall, which is 2.5 times more than the world average, the average disposable water resource per capita from rainfall is approximately 4000 m3, which is less than one-fifth the world average. This scenario occurs because of the uneven spatiotemporal rainfall distribution and the dearth of storage capacity in short and steep river terrains. Therefore, in pursuit of economic development, Taiwan must effectively mitigate resource consumption and emphasise environmental protection and social welfare (CEPD 2004; NCSD 2009).

Each administrative region in Taiwan plays a crucial role in realizing sustainability at the regional scale. To explicitly assess SD efficiency across these administrative regions, three inputs, two desirable outputs and four undesirable outputs were adequately considered in accordance with previous literature on SD issues. Figure 3 provides a clear view of the research framework and the input–output data selected in the present analysis. On the side of inputs, the annual amount of water consumption x1 in million m3, the power consumption x2 in Gigawatt hours (GWh) and the gasoline consumption x3 in kiloliters (kL) are considered to directly reflect energy resources investment (Bruni et al. 2011; Yin et al. 2014; Yang et al. 2016). Regarding outputs, desirable outputs include per capita income y g1 in thousands of NTD and the number of employed labor y g2 per thousand persons, which represent the prosperity of regions under the economic dimension (Murias et al. 2006; Yin et al. 2014; Chen et al. 2017), whereas undesirable outputs are measured by the number of labor disputes y b1 , the number of disabling injuries y b2 , the amount of Municipal Solid Waste (MSW) generation y b3 in tonnes and the amount of sulfur dioxide (SO2) emissions y b4 in parts per billion (ppb); among which the former two, y b1 and y b2 , reflect the burden on society (Shen et al. 2003; Miles and Munilla 2004; Iribarren et al. 2016), and the latter two, y b3 and y b4 , represent the degrees of environmental pollution in the waste and in the air, respectively (Hu 2006; Yin et al. 2014; Yu et al. 2017). The proposed model (5) is conducted with window analysis to perform a longitudinal efficiency evaluation for 19 administrative regions, n = 19, with time period of 6 years, T = 6. Given the consideration of the best balance of informativeness and stability of the efficiencies, a 3-year window width, w = 3, is assumed (Charnes et al. 1994; Wang et al. 2013a, b). Thus, in the present analysis, there are four (T − w + 1 = 4) overlapping windows proceeded for each administrative region, where each window covers a number of 76 (n × w = 76) observations.

Fig. 3
figure 3

Input-output data and research framework

The data for water consumption x1, power consumption x2 and gasoline consumption x3 were gathered from annual statistics documented by the Water Resources Agency, Taipower Company and Bureau of Energy; the per capita income y g1 was collected from the Report on The Survey of Family Income and Expenditure of the Directorate-General of Budget, Accounting and Statistics (DGBAS); the employed labor y g2 , labor disputes y b1 and disabling injuries y b2 were extracted from the Yearly Bulletin of Ministry of Labour; and the MSW generation y b3 and the SO2 emissions y b4 were compiled from the Environment Resource Database of the Environmental Protection Administration. The descriptive statistics of the input–output data over 2011–2016 are summarized in Table 1.

Table 1 Descriptive statistics of the input–output data

5 Empirical Results and Discussions for Regional SD in Taiwan

The collected input–output dataset was analysed under the presented research framework, and the resulting SD efficiency for 19 administrative regions of Taiwan from 2011 to 2016 are reported in Table 2, where the annual SD efficiency is the average of the four overlapping 3-year windows using the window analysis procedure: 2011–2013, 2012–2014, 2013–2015, and 2014–2016. The results show that during all time periods, four administrative regions, New Taipei City, Yunlin County, Taitung County, and Chiayi City, were consistently efficient and received a SD efficiency of δ * o  = 1. These four regions could serve as benchmarks for other regions with an SD efficiency of δ * o  < 1 because they perform efficiently in consuming input resources of water x1, power x2, and gasoline x3 to produce economical outputs of per capita income y g1 and employed labor y g2 while restraining the social burden of labor disputes y b1 and disabling injuries y b2 as well as the environmental pollution of MSW generation y b3 and SO2 emissions y b4 . For most of the periods, Taipei City, Nantou County, Chiayi County, and Keelung City were deemed as SD efficient, and they all gained an average SD efficiency over 0.990. It can be also observed that three administrative regions, Taoyuan City, Tainan City, and Hualien County, presented significant improvements in SD efficiency from 2011 to 2013, whereas their SD efficiency dropped after 2014. Kaohsiung City and Changhua County show a fluctuating SD efficiency over the research periods.

Table 2 Annual score of SD efficiency for 19 administrative regions

The average annual SD efficiencies of 19 administrative regions within four areas are depicted in Fig. 4 which provides an overview of the spatial distribution of Taiwanese SD efficiency. As is shown, among the northern area regions, Taoyuan City and Hsinchu County fall in the bottom two places with an average SD efficiency lower than 0.850. In the central area, four out of five regions perform efficiently and present average SD efficiencies above 0.950, whereas Taichung City is the least efficient with an average SD efficiency that is far lower than the others. In the southern area, Chiayi City takes first place with the top SD efficiency of 1.000, and it is followed by Chiayi County with average SD efficiency of 0.991. Kaohsiung City comes last in this area and obtains an average SD efficiency of only 0.755. Moreover, in the eastern area, Hualien County is not efficient enough and its average SD efficiency is less than 0.900. Notably, the efficient state for the regions implies that the well balanced development among the three pillars of sustainability was achieved, while the inefficient state indicates that some improvements must be made by less efficient regions, from the efficiency perspective. Table 3 presents the average optimal slacks for each input and output using the proposed model (5). These non-zero optimal slacks clearly manifest the sources resulting in inefficiency and indicate the proper decreases of the corresponding inputs and undesirable outputs as well as the increases of the corresponding desirable outputs based on the minimum distance notion. That is to say, adjusting these slacks would be conductive to a better SD performance and less efficient regions can use this information as a reference to draw up effective policies and strategies for SD improvement. Taking Taoyuan City (δ * o  = 0.7208) as an example, it has the non-zero optimal slacks of s −*1 , s −*2 , s −*3 , s + g*2 , s + b*1 , s + b*2 , s + b*3 , and s + b*4 . This finding indicates that to become efficient, Taoyuan City could decrease water, power, and gasoline consumption by 13.577 million m3, 8134.941 GWh, and 252,934.106 kL, respectively; expand the number of employed labor by 253.482 thousand persons; and reduce the number of labor disputes, disabling injuries, MSW generation, and SO2 emissions by 540, 843, 35,014.565 tonnes, and 0.292 ppb, respectively. Furthermore, regarding Taichung City, which achieves the lowest average SD efficiency (δ * o  = 0.5196), the optimal slacks information reveals that it has resource input excesses for water (s −*1  = 1103.693), power (s −*2  = 10,107.139) and gasoline (s −*3  = 632,903.197); output shortages for employed labor (s + g*2  = 81.557); and output overproduction for labor disputes (s + b*1  = 102), disabling injuries (s + b*2  = 324), MSW generation (s + b*3  = 20,732.731) and SO2 emissions (s + b*4  = 0.07). Therefore, more specific policies and effective activities should be particularly devised by Taichung government to manage those inefficient sources.

Fig. 4
figure 4

Spatial distribution of the SD efficiency for 19 administrative regions

Table 3 Average optimal slacks of inputs and outputs for 19 administrative regions

Figure 4 also shows that among the four areas, the eastern area is ahead of the others in terms of average SD efficiency. In fact, due to the large coverage of mountain terrain, this area is less developed and has a very low population density. Although the eastern area is not impressive economically, it has cut a figure in resources saving and ecological conservation. With regard to areas in the western coastal plain where the majority of the population resides, the northern area achieves the highest average SD efficiency (δ * o  = 0.919), followed by the southern area (δ * o  = 0.906) and the central area (δ * o  = 0.889). Figure 5 illustrates the average potential improvement of each input and output among the four areas, which is expressed as the ratio of optimal slack to original value. As shown, for the northern area, disabling injuries \(y_{2}^{b}\) and gasoline consumption x3 are primary loadings affecting SD and must be reduced on average by 10.82% and 7.56%, respectively. For the southern area, power consumption x2 (12.09%), labor disputes y b1 (7.16%) and SO2 emissions y b4 (6.64%) are the top three priorities for improving SD efficiency. Furthermore, the central area has the highest room for improvement in water consumption x1, which should be decreased by 12.30% on average. These results may indicate that inefficient sources vary from area to area, most likely due to differences in industry structures. The northern area has a mature development of business services and is vigorously high-tech concentrated. The integrated circuits (IC) industry, computer and peripherals industry, optoelectronics industry, biotechnology industry, and telecommunications industry present industrial centres in this area, and they are mostly incubated at Hsinchu Science Park and Nankang Software Park. The labor intensive productions of these industries involve numerous underlying hazards at the workplace that therefore cause occupational injuries and illnesses. Local governments in the northern area could pressure the industries to comply with labour safety and health standards or guidelines documented in the Electronic Industry Code of Conduct (EICC), Occupational Health and Safety Assessment Series 18,001 (OHSAS 18,001), and Social Accountability 8000 (SA 8000), thereby lessening the social burden (Lo et al. 2014; Santos et al. 2018; Tuczek et al. 2018). The central area hosts the precision machinery industry and traditional industries of textiles, clothing, footwear and floriculture. With the completion of the Taichung Science Park and the rapid growth of required industrial water, the problem of water resource scarcity is getting serious in central Taiwan. The Water Resources Agency also reports that the central area might have a shortage of water supply of approximately 170,000 tonnes per day over the next decade. To eliminate the dependency on water resources, technologies associated with waste water recovery and reuse as well as effective water management practices must be incorporated into operational strategies of water-intensive industries (Angelakis et al. 2003; Chu et al. 2004; Vajnhandl and Valh 2014). Meanwhile, a series of policies and actions should be continuously promoted by local governments to deepen the awareness of water resources conservation. In the southern area, there is a great concentration of heavy industries including the petrochemical industry, steel industry, electric power industry, and naval architecture industry, which could explain why the southern area suffers the most pressure from energy consumption and air pollution. Therefore, there is a rising need for these power-hungry industries to optimize the energy usage of their operations. Appropriate policies, such as electricity pricing reform and regulatory requirement on certain share of renewable energy, including solar, wind, and hydraulic, could be further implemented to not only force energy-saving activities but also accelerate the development of renewable energy (Liming et al. 2008; Kohler 2014; Lin et al. 2017).

Fig. 5
figure 5

Average potential improvement for the four areas in Taiwan

Figure 6 illustrates the annual SD efficiencies for the four areas individually and for Taiwan as a whole from 2011 to 2016 to demonstrate the changes in efficiency during the research periods from a dynamic perspective. As shown, the SD efficiency of the southern area rose dramatically from 2011 to 2014 and it even surpasses than that of the eastern area in 2014. This uptrend is contributed by the progress on SD efficiency in Tainan City, Kaohsiung City, and Pingtung County. A similar but slight change appears in the efficiency of the northern area from 2011–2013. Three out of seven administrative regions, i.e., Taipei City, Hsinchu City, and Taoyuan City, achieved improvement in their SD efficiencies over the first 3 years, which resulted in the slow upward efficiency trend for the northern area. However, all the areas show had an evident downswing in efficiency over the last 3 years. The SD efficiency of Taiwan (grey bar in Fig. 6) has declined since 2014 and even dropped to a performance level worse than in 2011 for the 2015–2016 period. This finding implies that although Taiwan has been attempting to meet SD goals, it still has a long way to go before it can realize ideal sustainability. To remedy the worsening trend of SD efficiency, Taiwan must accelerate SD improvement and intensify the balance of resource consumption, economic prosperity, environmental protection, and social welfare by dispelling the sources that deteriorate SD efficiency. The annual potential improvement of inputs and outputs of Taiwan is shown in Fig. 7. This figure provides policymakers with holistic insights on changes in the magnitudes of inputs and outputs, which could be adjusted during varying periods to enable them to draw up precisely relevant policies for SD improvement. The required improvement of per capita income y g1 has gradually decreased and its magnitude is much less than that of others, indicating that pressure to enrich inhabitants has become weaker in general. Furthermore, there appears to be an evident magnitude of potential improvements for power consumption x2, disabling injuries y b2 , and SO2 emissions y b4 , and a rising trend in required improvements is observed for water consumption x1, gasoline consumption x3 and the labor disputes y b1 . These findings imply that these variables serve critical roles in promoting SD, and striving to eliminate the inefficiencies of which would be more conducive for improving Taiwanese SD performance. Therefore, Taiwan must place greater emphasis on resource conservation, environment protection and social welfare instead of merely pursuing economic prosperity.

Fig. 6
figure 6

Average SD efficiency trend for four areas and whole Taiwan

Fig. 7
figure 7

The trend of average improvement of inputs and outputs for whole Taiwan

6 Concluding Remarks

Along with the wave of sustainability sweeping the world in the past few decades, progressing towards regional SD has become a matter of concern for improving societal wellbeing and quality of life. Benchmarking and performance evaluation has gained widespread acceptance as a scientific process to follow up and accelerate regional SD. In this study, a novel additive DEA model with the consideration of undesirable outputs is proposed based on the minimum distance measure, which can not only construct a composite efficiency indicator but also provide holistic insights into improved strategies according to closest benchmarking target notion. The window analysis was then employed to successfully perform the work of cross-sectional and longitudinal efficiency evaluations.

Using an empirical dataset encompassing three inputs, two desirable outputs, and four undesirable outputs, such an assessment framework is applied to benchmark SD efficiency across 19 administrative regions of Taiwan for the period from 2011 to 2016. The study also elaborates how each administrative region performs in SD, explores which major sources lead to inefficiency and discusses some potential policies that contribute towards sustainability. The empirical results reveal that the sources of inefficiency may vary with industrial structure. In general, the administrative regions in the northern, central and southern areas faced relatively higher room for improvement to disabling injuries, water consumption and power consumption, respectively. The study also found that for the recent 3 years, Taiwan as a whole experienced deterioration in terms of average SD efficiency, implying that the pressure for SD improvement has become more intense. These findings provide local governments with valuable information for devising corresponding policies and regulations that facilitate a more sustainable society.

This study brings a two-fold contribution, both methodological and empirical, to the existing literature. First, this study presents a novel extension in non-radial DEA in which the minimum distance-based additive model is extended to accommodate undesirable outputs and conjoined with window analysis. The newly developed model preserves the desirable feature of standard additive model in meeting Pareto–Koopmans optimality for efficient DMUs but owns the merit of determining convincing benchmarking targets and a more feasible direction for improving inefficient DMUs. Remarkably, the proposed model is promising for a wide range of practical applications, especially to those with undesirable outputs. In order to accelerate progress in achieving SD, other regions or countries can implement this model to examine their SD performance with a dynamic view and thereby draft effective policies for SD improvement. Furthermore, this study conducts a holistic investigation into sustainability and responds to the need for compositing a reliable indicator of SD efficiency at the regional level, whereas previous studies mainly dealt with economic and environmental problems and rarely focused on social impacts. This topic conforms to the contemporary trend of sustainability, and the findings of this study could contribute to the literature on regional SD promotion.

There remain a few limitations that should be clarified. Regarding time-dependent benchmarking, the use of window analysis fails to offer precise benchmarking information that indicates the change in efficiency of an evaluated region between two adjacent periods of time by examining the pre- and post-values of input, desirable and undesirable outputs. The Malmquist index analysis presented by Malmquist (1953) is an effective approach for addressing this limitation. Future researches could attempt to extend the proposed model to include the Malmquist index to take a closer look at the pairwise changes in SD performance. Moreover, in the empirical application to Taiwan SD, only one desirable output of the observed variables is expressed in per capita term, implying that conceptual asymmetry might appear in the input–output data set. Using GDP in place of per capita income would be desirable but difficult due to data availability issue in Taiwan (DGBAS 2012). The other alternative could be using the product of the per capita income and the number of inhabitants of a region. Additionally, this study does not allow for the environmental variables (also called non-controllable variables), such as population and land area, which are exogenously fixed but can affect the benchmarking results to a certain extent (Banker and Morey 1986; Zhu 1996). A noteworthy topic for future research would be to extend the proposed model by considering this kind of variable so as to bring more insights into SD improvements. Finally, one should note that the Benefit of Doubt model, presented by Melyn and Moesen (1991) and introduced at length by Cherchye et al. (2007), is also a useful and credible DEA-based approach for creating composite indicators. However, it mainly deals with the flexibility issue of weighting sub-indicators and does not consider the input and output slacks. Thus, the proposed model still remains advantageous and applicable to benchmarking issues due to its capacity to provide insights into the sources of inefficiency and the strategies of performance improvement.