Keywords

14.1 Introduction

We consider the estimation of the efficiency and returns to scale (RTS) for a production system which can be modeled as having multi-components based on data envelopment analysis (DEA) . There are many production systems bearing this situation. For example, Beasley (1995) studied the performances of universities, each of which had two components: research and teaching . Cook et al. (2000) modeled a banking production system as having two components: service and sales. We are mainly concerned with radial measurements, and the theoretical connection with the existing black-box approach .

DEA is a nonparametric technique for measuring the relative efficiencies of a set of peer decision-making units (DMUs) involving multiple inputs and outputs. Charnes et al. (1978) first introduced it. In this pioneer paper, the authors constructed a nonlinear programming model to evaluate the efficiency of activity conducted by a non-profit organization. The model is known as the CCR model in the literature. As is known, the CCR model captures both technical and scale inefficiencies. Banker et al. (1984) proposed a new model (BCC model) which extended the CCR model by separating technical efficiency and scale efficiency. Recently, DEA has been extended to many areas in management science and operational research field.

At the early stage of development, DEA treats a DMU under evaluation as a black box. Thus, it cannot provide users with specific information concerning the sources of inefficiency within an organization. Färe and Grosskopf (2000) introduced a network DEA technique, which opened the black box, and explicitly modeled the internal mechanism of a DMU . Lewis and Sexton (2004) also published a research paper in this direction. Färe and Grosskopf (2000) and Lewis and Sexton (2004) proposed radial measurements of efficiency in network DEA literature. By contrast, Tone and Tsutsui (2009) extended radial measurements in network DEA to non-radial measurements of efficiency by introducing slack-based network DEA model. Kao and Hwang (2008) and Kao (2009a, b) proposed models for evaluating DMUs with serial network structure, parallel network structure and the mixture of the above two structures. DMUs with a two-stage production process have been extensively studied both from a theoretical and from a practical perspective. Included among these studies are Liang et al. (2008) and Chen et al. (2006, 2009a, b, 2010). We refer the reader to review papers, such as Cook et al. (2010) and Castelli et al. (2010) for more references.

The value of returns to scale (RTS) measures the percentage change in output from a given percentage change in inputs in economic theory. Unlike main researches in economic literature, which are concerned about production processes with a single output, extensions to the situations of multiple outputs are spurred by Banker et al. (1984). Since then, RTS has been studied extensively. Banker et al. (2004) published an excellent review on different methods used to handle RTS. According to the paper, there are two approaches followed in the literature to study RTS. The first approach is proposed by Färe et al. (1985, 1994) and the other one is devised by Banker et al. (1984). In this paper, we follow the first approach, which has the advantage of being unaffected by the possible existence of multiple optima.

The existing papers concerning RTS are mainly based on the black-box assumption. However, very few of these papers deal with RTS, when the black-box assumption is dropped. Research papers with RTS consideration include Chen et al. (2009a), Tsai and Molinero (2002). Those two papers both follow the framework proposed by Banker et al. (1984), and could suffer from the existence of multiple optima.

Our current paper studies a production process with a multi-component structure. Before moving on, we firstly differentiate two cases of production processes having a multi-component according to data availability. The first case has the data on how the shared inputs and shared outputs are split among sub-decision making units (SDMUs) . The second case does not have data on how the shared inputs/outputs are split among SDMUs. Beasley (1995) and Cook et al. (2000) investigated models for evaluating performance in the second case, but did not study the RTS of the productions. In addition, how to extend their models to treat RTS is not clear. The difficulties are twofold in multi-component setting: (1) the nonlinearity of the proposed models and (2) the impact of potential multiple optima on testing RTS by following Banker’s approach. Our work focuses on production processes with multi-components of (1). In doing so, we avoid the problem of nonlinearity, to center on investigating RTS.

The contributions of our work mainly lie in three aspects. Firstly, we propose radial measurements for efficiency evaluation and a procedure to determine the RTS of a DMU that is unaffected by possible multiple optima. Secondly, we establish theoretical connection between the black-box approach by Färe et al. (1985, 1994) and our multi-component approach, which helps to connect the black-box approach with the network approach, and ensures consistency between both approaches in dealing with RTS. In addition, two homogeneity conditions are proposed and are important for evaluating performance in multi-component setting. They are not pointed out before in the literature. Thirdly, in this work, we use the proposed method to study the efficiency and RTS of pollution treatment processes in China based on real data. We model the processes as having two components, which is different from the traditional approach, and secure various insightful findings related to the operations of the pollution treatment processes in China.

The paper unfolds as follows: Section 14.2 proposes a radial evaluation model under variable returns to scale assumption (14.2.1), and establishes the theoretical connection of the proposed model to the black-box model (14.2.2). Section 14.3 provides a procedure for determining the RTS of a DMU. Section 14.4 establishes the theoretical connection of the proposed approach for estimating RTS to Färe et al. (1985, 1994). In Sect. 14.5, we apply the prospective method to study the performance of pollution treatment processes in China. Section 14.6 concludes the paper.

14.2 Radial Performance Measurement for a Multi-component System

A production unit (denoted as a DMU) with multi-component structure studied in this paper is depicted in Fig. 14.1. The DMU consists of two sub-decision-making units (SDMUs) without loss of generality. It is assumed that some inputs of DMU are shared by SDMU1 and SDMU2, and some outputs are the results of SDMU1 and SDMU2. In addition to shared inputs and outputs, there are inputs and outputs of the DMU dedicated to, or are the results of, SDMU1 or SDMU2 exclusively. We assume to deal with n DMUs in this paper. In the sequel, when referring to a specific DMU, we denote it by a subscript j, that is, DMUj, SDMU1j, and SDMU2j ( j = 1,…,n).

Fig. 14.1
figure 1

Structure of multi-component system

The variables in Fig. 14.1 are defined as follows: \( {X}_1=\left({x}_1^1,\dots, {x}_m^1\right) \) indicates m inputs dedicated to SDMU1; \( {X}_2=\left({x}_1^2,\dots, {x}_h^2\right) \) indicates h inputs dedicated to SDMU2; \( {X}_s=\left({x}_1^s,\dots, {x}_l^s\right) \) indicates l inputs shared by SDMU1 and SDMU2; \( {Y}_1=\left({y}_1^1,\dots, {y}_s^1\right) \) indicates s outputs produced exclusively by SDMU1; \( {Y}_2=\left({y}_1^2,\dots, {y}_q^2\right) \) indicates q outputs produced exclusively by SDMU2; \( {Y}_s=\left({y}_1^s,\dots, {y}_u^s\right) \) indicates u outputs produced together by SDMU1 and SDMU2. When referring to the specific data of DMUj, we shall use a secondary index j. For instance, the m inputs dedicated to SDMU1j, the SDMU1 of DMUj, are denoted as \( {X}_{1j}=\left({x}_{1j}^1,\dots, {x}_{mj}^1\right) \).

We differentiate two cases of production processes with multi-component structure according to data availability. In the first case, the data on how the shared inputs and shared outputs are split between SDMU1 and SDMU2 are available. In this case, we use \( {X}_{s1}=\left({x}_1^{s1},\dots, {x}_l^{s1}\right) \), \( {\mathrm{X}}_{s2}=\left({x}_1^{s2},\dots, {x}_l^{s2}\right) \), and \( {Y}_{s1}=\left({y}_1^{s1},\dots, {y}_u^{s1}\right) \), \( {Y}_{s2}=\left({y}_1^{s2},\dots, {y}_u^{s2}\right) \) to denote the observational data fulfilling \( {X}_s={X}_{s1}+{X}_{s2} \) and \( {Y}_s={Y}_{s1}+{Y}_{s2} \). Note that these are component wise additions indicating X s(i ) = X s1(i ) + X s2(i ), i,…,l, and Y s( j ) = Y s1( j ) + Y s2( j ), j = 1,…,u. In the second case, it is not known how the shared inputs/outputs are split. We deal with the former case in this paper.

To be specific, we take pollution treatment processes in China as an example. If we are going to investigate the performances of pollution treatment processes in all provinces, provinces are naturally modeled as DMUs. When the black box of a DMU is opened, it can be found that cities can be further classified into two SDMUs: capital city and non-capital cities. The capital city is the political, economic and cultural center of a province. Thus, the environment beyond the control of the management of the pollution treatment process in capital city and non-capital cities is arguably different. This makes sense: For example, a capital city often consumes more inputs such as capital inputs: pollution treatment facilities. As will be shown in this paper, the average capital city consumes approximately more than one fifth of the total inputs, but produces less than one fifth of the total outputs. In this case, we might reasonably claim that the capital city consumes more inputs as compared with noncapital cities.

14.2.1 Basic Model

Let us begin with the construction of production possibility set (PPS) of each SDMU. Based on the PPS of SDMUs, the PPS of a DMU is derived. We assume first variable returns of scale for all SDMUs. Note that the PPS considered is similar to that in Tsai and Molinero (2002).

The PPS of SDMU1:

$$ {T}_1^{VRS}=\left\{\left({X}^1,{Y}^1\right)\left|\begin{array}{l}{\displaystyle \sum_{j=1}^n{\lambda}_j^1}{x}_{ij}^1\le {x}_i^1,i=1,\dots, m,{\displaystyle \sum_{j=1}^n{\lambda}_j^1}{y}_{rj}^1\ge {y}_r^1,r=1,\dots, s\\ {}{\displaystyle \sum_{j=1}^n{\lambda}_j^1}{x}_{ij}^{s1}\le {x}_i^{s1},i=1,\dots, l,{\displaystyle \sum_{j=1}^n{\lambda}_j^1}{y}_{rj}^{s1}\ge {y}_r^{s1},r=1,\dots, u\\ {}{\displaystyle \sum_{j=1}^n{\lambda}_j^1}=1,{\lambda}_j^1\ge 0\end{array}\right.\right\} $$
(14.1)

where \( \left({X}^1,{Y}^1\right)=\left({x}_1^1,\dots, {x}_m^1,{x}_1^{s1},\dots, {x}_l^{s1},{y}_1^1,\dots, {y}_s^1,{y}_1^{s1},\dots, {y}_u^{s1}\right) \).

The PPS of SDMU2:

$$ {T}_2^{VRS}=\left\{\left({X}^2,{Y}^2\right)\left|\begin{array}{l}{\displaystyle \sum_{j=1}^n{\lambda}_j^2}{x}_{ij}^2\le {x}_i^2,i=1,\dots, h,{\displaystyle \sum_{j=1}^n{\lambda}_j^2}{y}_{rj}^2\ge {y}_r^2,r=1,\dots, q\\ {}{\displaystyle \sum_{j=1}^n{\lambda}_j^2}{x}_{ij}^{s2}\le {x}_i^{s2},i=1,\dots, l,{\displaystyle \sum_{j=1}^n{\lambda}_j^2}{y}_{rj}^{s2}\ge {y}_r^{s2},r=1,\dots, u\\ {}{\displaystyle \sum_{j=1}^n{\lambda}_j^2}=1,{\lambda}_j^2\ge 0\end{array}\right.\right\} $$
(14.2)

where \( \left({X}^2,{Y}^2\right)=\left({x}_1^2,\dots, {x}_h^2,{x}_1^{s2},\dots, {x}_l^{s2},{y}_1^2,\dots, {y}_q^2,{y}_1^{s2},\dots, {y}_u^{s2}\right) \).

The PPS of DMU:

$$ {T}^{VRS}=\left\{\left(X,Y\right)\left|\begin{array}{l}{\displaystyle \sum_{j=1}^n{\lambda}_j^1}{x}_{ij}^1\le {x}_i^1,i=1,\dots, m,{\displaystyle \sum_{j=1}^n{\lambda}_j^1}{x}_{ij}^{s1}+{\displaystyle \sum_{j=1}^n{\lambda}_j^2}{x}_{ij}^{s2}\le {x}_i^s,i=1,\dots, l\\ {}{\displaystyle \sum_{j=1}^n{\lambda}_j^2}{x}_{ij}^2\le {x}_i^2,i=1,\dots, h,{\displaystyle \sum_{j=1}^n{\lambda}_j^1}{y}_{rj}^1\ge {y}_r^1,r=1\dots s\\ {}{\displaystyle \sum_{j=1}^n{\lambda}_j^1{y}_{rj}^{s1}}+{\displaystyle \sum_{j=1}^n{\lambda}_j^2{y}_{rj}^{s2}}\ge {y}_r^s,r=1,\dots, u,{\displaystyle \sum_{j=1}^n{\lambda}_j^2}{y}_{rj}^2\ge {y}_r^2,r=1,\dots, q\\ {}{\displaystyle \sum_{j=1}^n{\lambda}_j^1}=1,{\displaystyle \sum_{j=1}^n{\lambda}_j^2}=1,{\lambda}_j^1,{\lambda}_j^2\ge 0\end{array}\right.\right\} $$
(14.3)

where \( \left(X,Y\right)=\left({x}_1^1,\dots, {x}_m^1,{x}_1^s,\dots, {x}_l^s,{x}_1^2,\dots, {x}_h^2,{y}_1^1,\dots, {y}_s^1,{y}_1^s,\dots, {y}_u^s,{y}_1^2,\dots, {y}_q^2\right) \).

It should be noted that the PPS of DMU is the addition of the PPS’s of SDMU1 and SDMU2. We assume that if SDMU1 (X 1, Y 1) and SDMU2 (X 2, Y 2) are possible, then one can set up a DMU consisting of a SDMU1 and a SDMU2. Most importantly, the two SDMUs do not interfere with each other and carry out (X 1, Y 1) and (X 2, Y 2) independently. The result is then that DMU built in this way consumes (X 1 + X 2), and produces (Y 1 + Y 2).

The performance of a DMU can be measured under two different situations: first, price information is given, and second, prices are not available. In the latter situation, Shephard’s input distance function is a frequently used measurement (Shephard's 1970). Suppose L(Y ) is the input requirement set derived from T VRS. Shephard’s input distance function is given below.

$$ D\left(X,Y\right)= \max \left\{\lambda :X/\lambda \in L(Y),\lambda \in R\right\} $$
(14.4)

Clearly, D(X, Y ) is greater than or equal to 1, if \( X\in L(Y) \), with D(X,Y) = 1, if and only if it is impossible to improve input vector X proportionately without worsening the output vector. Let \( \theta =1/\lambda \). It follows that

$$ {\left[D\left(X,Y\right)\right]}^{-1}= \min \left\{\theta :\theta X\in L(Y)\right\} $$
(14.5)

According to (14.3) and (14.5), the performance of DMU0 with multi-components can be estimated by the following linear programming model .

$$ \begin{array}{l}{\theta}_T^{*}= \min \kern0.36em \theta \\ {}\mathrm{s}.\mathrm{t}.\kern0.6em {\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\lambda}_j^k}}{x}_{ij}^{sk}\le \theta {x}_{io}^s\kern2em i=1,\dots, l\ \left(\mathrm{shared}\ \mathrm{inputs}\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n{\lambda}_j^1}{x}_{ij}^1\le \theta {x}_{io}^1\kern3.5em i=1,\dots, m\ \left(\mathrm{inputs}\ \mathrm{dedicated}\ \mathrm{t}\mathrm{o}\ {\mathrm{SDMU}}_1\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n{\lambda}_j^2}{x}_{ij}^2\le \theta {x}_{io}^2\kern3.5em i=1,\dots, h\ \left(\mathrm{inputs}\ \mathrm{dedicated}\ \mathrm{t}\mathrm{o}\ {\mathrm{SDMU}}_2\right)\\ {}\kern2em {\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\lambda}_j^k}}{y}_{rj}^{sk}\ge {y}_{ro}^s\kern2em r=1,\dots, u\ \left(\mathrm{shared}\ \mathrm{o}\mathrm{utputs}\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n{\lambda}_j^1}{y}_{rj}^1\ge {y}_{ro}^1\kern3.7em r=1,\dots, s\ \left(\mathrm{outputs}\ \mathrm{produced}\ \mathrm{b}\mathrm{y}\ {\mathrm{SDMU}}_1\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n{\lambda}_j^2}{y}_{rj}^2\ge {y}_{ro}^2\kern3.5em r=1,\dots, q\ \left(\mathrm{outputs}\ \mathrm{produced}\ \mathrm{b}\mathrm{y}\ {\mathrm{SDMU}}_2\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n{\lambda}_j^k}=1\kern5.2em k=1,2\\ {}\kern2em {\lambda}_j^k\ge 0,k=1,2,\kern2.7em j=1,\dots, n.\end{array} $$
(14.6)

where the decision variables are \( {\lambda}_j^k\ \left(j=1,\dots, n;\ k=1,2\right) \) and θ. It should be noted that x k ij , x sk ij , y k rj and y sk rj are observational data that correspond to the types of inputs and outputs labeled in (14.6).

14.2.2 Theoretical Connection with Black-Box Approach

In this section, we formally derive the black-box equivalent PPS that corresponds to T VRS, which can give an insight into model (14.6). Before moving on, we assume that the structure depicted in Fig. 14.1 consumes all inputs shared by SDMU1 and SDMU2, and all the outputs of DMU are the results of SDMU1 and SDMU2. We adopt the convention that DMU consumes m inputs \( {X}_j=\left({x}_{1j},\dots, {x}_{mj}\right) \) and produces s outputs \( {Y}_j=\left({y}_{1j},\dots, {y}_{sj}\right) \). Thus, based on the notations provided above for DMUs with multi-component structure, the assumption here implies that \( {X}_j^{sk}=\left({x}_{1j}^{sk},\dots, {x}_{mj}^{sk}\right) \) and \( {Y}_j^{sk}=\left({y}_{1j}^{sk},\dots, {y}_{sj}^{sk}\right) \) with \( {X}_j^{s1}+{X}_j^{s2}={X}_j^s={X}_j \), and \( {Y}_j^{s1}+{Y}_j^{s2}={Y}_j^s={Y}_j \). Later in the paper, the s in the superscript is deleted for simplicity. In cases where inputs or outputs are not entirely shared by SDMU1 and SDMU2 (See Fig. 14.1), the values of those inputs/outputs dedicated to SDMU1 (SDMU2) are zeros for SDMU2 (SDMU1). Therefore, the structure of the DMU in Fig. 14.1 reduces to structure provided in Fig. 14.2.

Fig. 14.2
figure 2

Structure of DMU with all shared inputs and outputs

In light of the structure depicted in Fig. 14.2, T VRS1 T VRS2  and T VRS in the previous section are rewritten as follows:

$$ {T}_1^{VRS}=\left\{\left({X}^1,{Y}^1\right)\Big|{\displaystyle \sum_{j=1}^n{\lambda}_j^1}{x}_{ij}^1\le {x}_i^1,i=1,\dots, m,\right.\left.{\displaystyle \sum_{j=1}^n{\lambda}_j^1}{y}_{rj}^1\ge {y}_r^1,r=1,\dots, s,{\displaystyle \sum_{j=1}^n{\lambda}_j^1}=1,{\lambda}_j^1\ge 0\right\} $$
(14.7)

where \( \left({X}^1,{Y}^1\right)=\left({x}_1^1,\dots, {x}_m^1,{y}_1^1,\dots, {y}_s^1\right) \).

$$ {T}_2^{VRS}=\left\{\left({X}^2,{Y}^2\right)\Big|{\displaystyle \sum_{j=1}^n{\lambda}_j^2}{x}_{ij}^2\le {x}_i^2,i=1,\dots, m,\right.\left.{\displaystyle \sum_{j=1}^n{\lambda}_j^2}{y}_{rj}^2\ge {y}_r^2,r=1,\dots, s,{\displaystyle \sum_{j=1}^n{\lambda}_j^2}=1,{\lambda}_j^2\ge 0\right\} $$
(14.8)

where \( \left({X}^2,{Y}^2\right)=\left({x}_1^2,\dots, {x}_m^2,{y}_1^2,\dots, {y}_s^2\right) \).

$$ {T}^{VRS}=\left\{\left(X,Y\right)\left|\begin{array}{l}{\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\lambda}_j^k}}{x}_{ij}^k\le {x}_i,i=1,\dots, m,{\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\lambda}_j^k}}{y}_{rj}^k\ge {y}_r,r=1,\dots, s\\ {}{\displaystyle \sum_{j=1}^n{\lambda}_j^k}=1,{\lambda}_j^k\ge 0,k=1,2\end{array}\right.\right\} $$
(14.9)

where \( \left(X,Y\right)=\left({x}_1,\dots, {x}_m,{y}_1,\dots, {y}_s\right) \).

We proceed to give a result on the convexity of T VRS that is necessary for the exposition of this paper.

Property 1

T VRS is convex set .

Proof

Suppose (X 1, Y 1) and (X 2, Y 2) belong to T VRS. By definition, there are sets of nonnegative multipliers λ k1 * j , λ k2 * j with \( {\displaystyle \sum_{j=1}^n{\lambda}_j^{k1*}}=1 \) and \( {\displaystyle \sum_{j=1}^n{\lambda}_j^{k2*}}=1 \) such that

$$ \begin{array}{l}{\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\lambda}_j^{k1*}}}{y}_{rj}^k\ge {y}_r^1,r=1,\dots, s,{\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\lambda}_j^{k1*}}}{x}_{ij}^k\le {x}_i^1,i=1,\dots, m,\\ {}{\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\lambda}_j^{k2*}}}{y}_{rj}^k\ge {y}_r^2,r=1,\dots, s,{\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\lambda}_j^{k2*}}}{x}_{ij}^k\le {x}_i^2,i=1,\dots, m.\end{array} $$

For any convex pair α, β, we have \( {\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n\Big(}}\alpha {\lambda}_j^{k1*}+\beta {\lambda}_j^{k2*}\Big){y}_{rj}^k\ge \alpha {y}_r^1+\beta {y}_r^2,r=1,\dots, s, \) \( {\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n\Big(}}\alpha {\lambda}_j^{k1*}+\beta {\lambda}_j^{k2*}\Big){x}_{ij}^k\le \alpha {x}_i^1+\beta {x}_i^2,i=1,\dots, m, \) and \( {\displaystyle \sum_{j=1}^n\Big(}\alpha {\lambda}_j^{k1*}+\beta {\lambda}_j^{k2*}\Big)=1 \). This ensures that \( \alpha \left({X}_1,{Y}_1\right)+\beta \left({X}_2,{Y}_2\right)=\left(\alpha {X}_1+\beta {X}_2,\alpha {Y}_1+\beta {Y}_2\right)\in {T}^{VRS} \). □

Assumption 1

Assume there are n DMUs, each of which consists of two production units SDMU1j, SDMU2j, j = 1,…,n using the production technology characterized by T VRS1 and T VRS2 respectively. Let there be an extended data set ( EDS) of n 2 distinct DMUs, each of which comprises SDMU1j and SDMU2k with \( j,k\in \left\{1,\dots, n\right\} \).

Let (x ij , y rj ) denote the input and output bundle of DMUj in EDS. Define T VRS b , T CRS b , and T NIRS b as below, where the superscripts CRS and NIRS, respectively, stand for constant returns to scale and non-increasing returns to scale :

$$ \begin{array}{l}{T}_b^{VRS}=\left\{\left(X,Y\right)\Big|{\displaystyle \sum_{j=1}^{n^2}{\lambda}_j}{x}_{ij}\le {x}_i,i=1,\dots, m,{\displaystyle \sum_{k=1}^{n^2}{\lambda}_j}{y}_{rj}\ge {y}_r,r=1,\dots, s,{\displaystyle \sum_{j=1}^{n^2}{\lambda}_j}=1,{\lambda}_j\ge 0\right\}\\ {}{T}_b^{CRS}=\left\{\left(X,Y\right)\Big|{\displaystyle \sum_{j=1}^{n^2}{\lambda}_j}{x}_{ij}\le {x}_i,i=1,\dots, m,{\displaystyle \sum_{k=1}^{n^2}{\lambda}_j}{y}_{rj}\ge {y}_r,r=1,\dots, s,{\lambda}_j\ge 0\right\}\end{array} $$
$$ {T}_b^{NIRS}=\left\{\left(X,Y\right)\Big|{\displaystyle \sum_{j=1}^{n^2}{\lambda}_j}{x}_{ij}\le {x}_i,i=1,\dots, m,{\displaystyle \sum_{k=1}^{n^2}{\lambda}_j}{y}_{rj}\ge {y}_r,r=1,\dots, s,\right.\left.{\displaystyle \sum_{j=1}^{n^2}{\lambda}_j}\le 1,{\lambda}_j\ge 0\right\} $$

where \( \left(X,Y\right)=\left({x}_1,\dots, {x}_m,{y}_1,\dots, {y}_s\right) \).

We now establish that the PPS of the general multi-component system with two different SDMUs can be recovered by DMUs in EDS through the black-box approach. The connections between the multi-component PPS’s and the above mentioned black-box PPS’s are summarized in Theorem 1.

Theorem 1

\( {T}_b^{VRS}={T}^{VRS},\ {T}_b^{CRS}={T}^{CRS},\ and\kern0.5em {T}_b^{NIRS}={T}^{NIRS} \), where

$$ {T}^{CRS}=\left\{\left(X,Y\right)\left|\begin{array}{l}{\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\lambda}_j^k}}{x}_{ij}^k\le {x}_i,i=1,\dots, m,{\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\lambda}_j^k}}{y}_{rj}^k\ge {y}_r,r=1,\dots, s\\ {}{\displaystyle \sum_{j=1}^n{\lambda}_j^1}={\displaystyle \sum_{j=1}^n{\lambda}_j^2},{\lambda}_j^k\ge 0,k=1,2\end{array}\right.\right\} $$

and

$$ {T}^{NIRS}=\left\{\left(X,Y\right)\left|\begin{array}{l}{\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\lambda}_j^k}}{x}_{ij}^k\le {x}_i,i=1,\dots, m,{\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\lambda}_j^k}}{y}_{rj}^k\ge {y}_r,r=1,\dots, s\\ {}{\displaystyle \sum_{j=1}^n{\lambda}_j^1}={\displaystyle \sum_{j=1}^n{\lambda}_j^2}\le 1,{\lambda}_j^k\ge 0,k=1,2\end{array}\right.\right\} $$

Proof

See Appendix. □

Let us close this section by pointing out the difference between T CRS and \( {\overline{T}}^{CRS} \), which is defined by

\( {\overline{T}}^{CRS}=\left\{\left(X,Y\right)\Big|{\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\lambda}_j^k}}{x}_{ij}^k\le {x}_i,i=1,\dots, m,{\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\lambda}_j^k}}{y}_{rj}^k\ge {y}_r,r=1,\dots, s,{\lambda}_j^k\ge 0,k=1,2\right\} \).

Researchers in the literature tend to define \( {\overline{T}}^{CRS} \) as the CRS PPS for the production system in Fig. 14.1. Tsai and Molinero (2002) is a case in point. Obviously, the production frontier determined by T CRS is dominated by the one defined by \( {\overline{T}}^{CRS} \). In Fig. 14.3, we use a set of two DMUs with one input and one output for illustration.

Fig. 14.3
figure 3

Graphical illustration of \( {T}^{CRS}\ \mathrm{and}\ {\overline{T}}^{CRS} \)

Here, DMU1 and DMU2 comprise of (SDMU11, SDMU21) and (SDMU12, SDMU22) respectively. DMUA and DMUB are generated by combining respectively SDMU11 and SDMU22, SDMU21 and SDMU22. In light of Theorem 1, T CRS is the conic hull constructed by DMU1, DMU2, DMUA and DMUB. This is the region to the right of frontier F 2. PPS provided by \( {\overline{T}}^{CRS} \) is the region to the right of frontier F 1.

Figure 14.3 shows that the production frontier of \( {\overline{T}}^{CRS} \) is determined by SDMU11. Apparently, the production process of SDMU is arguably different from that of DMU. Therefore, the use of SDMU as a benchmarking point for DMU is not appropriate. To highlight the difference between SDMU and DMU , criteria for homogeneity are essential. The homogeneity in this context refers to the characteristic of the efficient frontier that a benchmarking point on the frontier constructed for evaluating the performance of a DMU should be comparable to the DMU in terms of the internal production process. Two homogeneity conditions for the construction of a virtual DMU, i.e., weak condition and strong condition , are introduced below:

  1. (1)

    Weak homogeneity condition: \( {t}_1=0 \) if and only if \( {t}_2=0 \).

  2. (2)

    Strong homogeneity condition: \( {t}_1={t}_2 \).

Clearly, if a virtual DMU built by SDMU1 and SDMU2 satisfies the strong homogeneity condition , the weak homogeneity condition is automatically satisfied. However, the opposite is not true. Comparing the definition of T CRS with that of \( {\overline{T}}^{CRS} \), the difference is the distinct requirements of the sum of the levels of elementary activities involved (i.e., \( {t}_1={\displaystyle {\sum}_{j=1}^n{\lambda}_j^1},{t}_2={\displaystyle {\sum}_{j=1}^n{\lambda}_j^2} \)). Specifically, T CRS requires \( {t}_1={t}_2 \), while \( {\overline{T}}^{CRS} \) does not. \( {\overline{T}}^{CRS} \) is claimed to violate the strong homogeneity condition.

This small example shows that SDMU11 is chosen as a benchmarking point , as can be seen from Fig. 14.3 where F 1 is completely specified by SDMU11. If we do not set conditions for choosing a benchmarking point, the frontier is arguably too ideal. The main consequence is the potential under estimation of the efficiency of a DMU, since an improper benchmarking point is chosen. This specification of conditions is comparable to the modeling consideration in the evaluation considering environment constraints. One might expect that the performance of a DMU be evaluated by comparing it to the DMUs possessing similar environment characteristics (See, for example, Ruggiero (1998)).

14.3 Procedure for Estimating the Returns to Scale

In economic theory, the value of RTS measures the percentage change in output from a given percentage change in inputs. Let \( y=f(x) \) denote a production function for a single-output technology . The production function is said to have IRS if \( f(ax)>af(x) \), for any \( a>1 \). The production function exhibits DRS if \( f(ax)>af(x) \), for any \( a\in \left[0,1\right) \). If \( f(ax)=af(x) \) for all scalars \( a\ge 0 \), the production function exhibits CRS. Banker et al. (1984), who introduced the concept of Most Productive Scale Size (MPSS) into the DEA literature, spurred extensions to the situations of multiple inputs and outputs. For a technically efficient DMU0 with input and output bundle (X 0, Y 0) to be MPSS, the following optimization model should achieve a value of one. Note that the subscript 0 is usually used to indicate the DMU under evaluation in the literature. In the sequel, we shall frequently refer to DMU0 when a specific DMU is discussed.

$$ \begin{array}{l} \max \kern0.36em \frac{\beta }{\alpha}\\ {}\mathrm{s}.\mathrm{t}.\kern0.74em \left(\alpha {X}_0,\beta {Y}_0\right)\in T\\ {}\kern2em \alpha, \beta \ge 0.\end{array} $$
(14.10)

where T is the empirical production possibility set. If the optimal value is larger than 1, it means that either the current input level can be reduced with a less percentage of losses in outputs, or it can be increased with a larger percentage of gains in outputs. Therefore, DMU0 can benefit from the adjustment of input levels. By analogy, the following model is proposed for testing whether DMU0 with multi-component structure is MPSS , where T in (14.10) is substituted by T VRS.

$$ \begin{array}{l} \max \kern0.36em \frac{\phi }{\theta}\\ {}\mathrm{s}.\mathrm{t}.\kern0.48em {\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\lambda}_j^k}}{x}_{ij}^{sk}+{s}^{s-}=\theta {x}_{io}^s\kern2em i=1,\dots, l\ \left(\mathrm{shared}\ \mathrm{inputs}\right)\\ {}\kern1.75em {\displaystyle \sum_{j=1}^n{\lambda}_j^1}{x}_{ij}^1+{s}_i^{1-}=\theta {x}_{io}^1\kern2em i=1,\dots, m\ \left(\mathrm{inputs}\ \mathrm{dedicated}\ \mathrm{t}\mathrm{o}\ {\mathrm{SDMU}}_1\right)\\ {}\kern1.75em {\displaystyle \sum_{j=1}^n{\lambda}_j^2}{x}_{ij}^2+{s}_i^{2-}=\theta {x}_{io}^2\kern2em i=1,\dots, h\ \left(\mathrm{inputs}\ \mathrm{dedicated}\ \mathrm{t}\mathrm{o}\ {\mathrm{SDMU}}_2\right)\\ {}\kern1.75em {\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\lambda}_j^k}}{y}_{rj}^{sk}-{s}_r^{s+}=\phi {y}_{ro}^s\kern2em r=1,\dots, u\ \left(\mathrm{shared}\ \mathrm{o}\mathrm{utputs}\right)\\ {}\kern1.75em {\displaystyle \sum_{j=1}^n{\lambda}_j^1}{y}_{rj}^1-{s}_r^{1+}=\phi {y}_{ro}^1\kern2em r=1,\dots, s\ \left(\mathrm{outputs}\ \mathrm{produced}\ \mathrm{b}\mathrm{y}\ {\mathrm{SDMU}}_1\right)\\ {}\kern1.75em {\displaystyle \sum_{j=1}^n{\lambda}_j^2}{y}_{rj}^2-{s}_r^{2+}=\phi {y}_{ro}^2\kern2em r=1,\dots, q\ \left(\mathrm{outputs}\ \mathrm{produced}\ \mathrm{b}\mathrm{y}\ {\mathrm{SDMU}}_2\right)\\ {}\kern1.75em {\displaystyle \sum_{j=1}^n{\lambda}_j^k}=1\kern2em k=1,2,\\ {}\kern1.75em {\lambda}_j^k\ge 0,k=1,2,j=1,\dots, n.\end{array} $$
(14.11)

Cooper et al. (1996) proposed an approach to transform the above non-linear model to an equivalent linear model. Firstly, let us divide both sides of the constraints by ϕ. The resulting model is given in (14.12). Secondly, by letting \( \theta /\phi =t \), \( {s}_r^{s+}/\phi ={\overline{s}}_r^{s+} \), \( {s}_r^{1+}/\phi ={\overline{s}}_r^{1+} \), \( {s}_r^{2+}/\phi ={\overline{s}}_r^{2+} \), \( {s}_i^{s-}/\phi ={\overline{s}}_i^{s-} \), \( {s}_i^{1-}/\phi ={\overline{s}}_i^{1-} \), \( {s}_i^{2-}/\phi ={\overline{s}}_i^{2-} \) and \( {\lambda}_j^k/\phi ={\overline{\lambda}}_j^k \), we can obtain model (14.13). Since ϕ in (14.13) is a free variable, it is safe to delete it. Finally, model (14.13) can be further reduced to an equivalent model (14.14). Note that we call two optimization problems equivalent if from a solution of one, a solution of the other is readily found, and vice versa.

$$ \begin{array}{l} \max \kern0.36em \frac{\phi }{\theta}\\ {}\mathrm{s}.\mathrm{t}.\kern0.73em {\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n\frac{\lambda_j^k}{\phi }}}{x}_{ij}^{sk}+\frac{s_i^{s-}}{\phi }=\frac{\theta }{\phi }{x}_{io}^s\ i=1,\dots, l\ \left(\mathrm{shared}\ \mathrm{inputs}\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n\frac{\lambda_j^1}{\phi }}{x}_{ij}^1+\frac{s_i^{1-}}{\phi }=\frac{\theta }{\phi }{x}_{io}^1\kern2em i=1,\dots, m\ \left(\mathrm{inputs}\ \mathrm{dedicated}\ \mathrm{t}\mathrm{o}\ {\mathrm{SDMU}}_1\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n\frac{\lambda_j^2}{\phi }}{x}_{ij}^2+\frac{s_i^{2-}}{\phi }=\frac{\theta }{\phi }{x}_{io}^2\kern2em i=1,\dots, h\ \left(\mathrm{inputs}\ \mathrm{dedicated}\ \mathrm{t}\mathrm{o}\ {\mathrm{SDMU}}_2\right)\\ {}\kern1.87em {\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n\frac{\lambda_j^k}{\phi }}}{y}_{rj}^{sk}-\frac{s_r^{s+}}{\phi }={y}_{ro}^s\kern2em r=1,\dots, u\ \left(\mathrm{shared}\ \mathrm{o}\mathrm{utputs}\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n\frac{\lambda_j^1}{\phi }}{y}_{rj}^1-\frac{s_r^{1+}}{\phi }={y}_{ro}^1\kern2em r=1,\dots, s\ \left(\mathrm{outputs}\ \mathrm{produced}\ \mathrm{b}\mathrm{y}\ {\mathrm{SDMU}}_1\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n\frac{\lambda_j^2}{\phi }}{y}_{rj}^2-\frac{s_r^{2+}}{\phi }={y}_{ro}^2\kern2em r=1,\dots, q\ \left(\mathrm{outputs}\ \mathrm{produced}\ \mathrm{b}\mathrm{y}\ {\mathrm{SDMU}}_2\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n\frac{\lambda_j^k}{\phi }}=\frac{1}{\phi}\kern2em k=1,2\\ {}\kern2em {\lambda}_j^k,\phi \ge 0,k=1,2,j=1,\dots, n.\end{array} $$
(14.12)
$$ \begin{array}{l} \max \kern0.36em \frac{1}{t}\\ {}\mathrm{s}.\mathrm{t}.\kern0.6em {\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^k}}{x}_{ij}^{sk}+{\overline{s}}_i^{s-}=t{x}_{io}^s\kern2em i=1,\dots, l\ \left(\mathrm{shared}\ \mathrm{inputs}\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^1}{x}_{ij}^1+{\overline{s}}_i^{1-}=t{x}_{io}^1\kern2em i=1,\dots, m\ \left(\mathrm{inputs}\ \mathrm{dedicated}\ \mathrm{t}\mathrm{o}\ {\mathrm{SDMU}}_1\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^2}{x}_{ij}^2+{\overline{s}}_i^{2-}=t{x}_{io}^2\kern2em i=1,\dots, h\ \left(\mathrm{inputs}\ \mathrm{dedicated}\ \mathrm{t}\mathrm{o}\ {\mathrm{SDMU}}_2\right)\\ {}\kern2em {\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^k}}{y}_{rj}^{sk}-{\overline{s}}_r^{s+}={y}_{ro}^s\kern2em r=1,\dots, u\ \left(\mathrm{shared}\ \mathrm{o}\mathrm{utputs}\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^1}{y}_{rj}^1-{\overline{s}}_r^{1+}={y}_{ro}^1\kern2em r=1,\dots, s\ \left(\mathrm{outputs}\ \mathrm{produced}\ \mathrm{b}\mathrm{y}\ {\mathrm{SDMU}}_1\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^2}{y}_{rj}^2-{\overline{s}}_r^{2+}={y}_{ro}^2\kern2em r=1,\dots, q\ \left(\mathrm{outputs}\ \mathrm{produced}\ \mathrm{b}\mathrm{y}\ {\mathrm{SDMU}}_2\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^1}={\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^2}=\frac{1}{\phi}\kern2em \\ {}\kern2em {\lambda}_j^k\ge 0,k=1,2,j=1,\dots, n.\end{array} $$
(14.13)
$$ \begin{array}{l}{t}^{*}= \min \kern0.36em t\\ {}\mathrm{s}.\mathrm{t}.\kern0.48em {\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^k}}{x}_{ij}^{sk}+{\overline{s}}_i^{s-}=t{x}_{io}^s\kern2em i=1,\dots, l\ \left(\mathrm{shared}\ \mathrm{inputs}\right)\\ {}\kern1.75em {\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^1}{x}_{ij}^1+{\overline{s}}_i^{1-}=t{x}_{io}^1\kern2em i=1,\dots, m\ \left(\mathrm{inputs}\ \mathrm{dedicated}\ \mathrm{t}\mathrm{o}\ {\mathrm{SDMU}}_1\right)\\ {}\kern1.75em {\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^2}{x}_{ij}^2+{\overline{s}}_i^{2-}=t{x}_{io}^2\kern2em i=1,\dots, h\ \left(\mathrm{inputs}\ \mathrm{dedicated}\ \mathrm{t}\mathrm{o}\ {\mathrm{SDMU}}_2\right)\\ {}\kern1.75em {\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^k}}{y}_{rj}^{sk}-{\overline{s}}_r^{s+}={y}_{ro}^s\kern2em r=1,\dots, u\ \left(\mathrm{shared}\ \mathrm{o}\mathrm{utputs}\right)\\ {}\kern1.75em {\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^1}{y}_{rj}^1-{\overline{s}}_r^{1+}={y}_{ro}^1\kern2em r=1,\dots, s\ \left(\mathrm{outputs}\ \mathrm{produced}\ \mathrm{b}\mathrm{y}\ {\mathrm{SDMU}}_1\right)\\ {}\kern1.75em {\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^2}{y}_{rj}^2-{\overline{s}}_r^{2+}={y}_{ro}^2\kern2em r=1,\dots, q\ \left(\mathrm{outputs}\ \mathrm{produced}\ \mathrm{b}\mathrm{y}\ {\mathrm{SDMU}}_2\right)\\ {}\kern1.75em {\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^1}={\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^2}\kern2em \\ {}\kern1.75em {\lambda}_j^k\ge 0,k=1,2,j=1,\dots, n.\end{array} $$
(14.14)

Assume that \( {t}^{*},{\overline{\lambda}}_j^{k*} \) are the optimal solution to model (14.14). It follows that \( {\phi}^{*}=1/{\displaystyle {\sum}_{j=1}^n{\lambda}_j^{2*}} \) and \( {\theta}^{*}={t}^{*}{\phi}^{*}={t}^{*}/{\displaystyle {\sum}_{j=1}^n{\overline{\lambda}}_j^{1*}}={t}^{*}/{\displaystyle {\sum}_{j=1}^n{\overline{\lambda}}_j^{2*}} \). Apparently, Proposition 1 holds.

Proposition 1

If \( {t}^{*}=1 \), then DMU is MPSS, and constant returns to scale prevails at DMU; Otherwise, the unit is not MPSS.

RTS generally has an unambiguous meaning only if DMU0 is on the efficiency frontier. For any inefficient DMU0 to become efficient, based on the optimal solutions of model (14.6), it can be projected onto the efficient frontier by formulas as follows:

  1. (1)

    \( {\overline{y}}_{ro}^s={y}_{ro}^s+{s}_r^{+s*},{\overline{y}}_{ro}^1={y}_{ro}^1+{s}_r^{+1*},{\overline{y}}_{ro}^2={y}_{ro}^2+{s}_r^{+2*} \).

  2. (2)

    \( {\overline{x}}_{io}^s={t}^{*}{x}_{io}^s-{s}_i^{-s*},{\overline{x}}_{io}^1={t}^{*}{x}_{io}^1-{s}_i^{-1*},{\overline{x}}_{io}^2={t}^{*}{x}_{io}^2-{s}_i^{-2*} \).

For those who are interested in the projection operation and the concept of efficient frontier, we recommend Cooper et al. (2004). A full treatment of the topics is beyond the scope of this paper. Before proceeding to discuss how to determine RTS of a DMU, we now introduce the scale efficiency of a production unit in Definition 1.

Definition 1

Scale efficiency: \( {\theta}_S^{*}={t}^{*}/{\theta}_T^{*} \).

Scale efficiency reflects the RTS characteristic of DMU0. It should be noted that if DMU0 is not an efficient unit, the scale efficiency actually reflects the RTS characteristic of the corresponding projection on the efficient frontier by formulas (14.1) and (14.2). Let us denote it as DMU *o for the convenience of reference.

Obviously, it can be seen that \( {\theta}_S^{*}\le 1 \), since the feasible set of model (14.6) is a subset of the feasible set of model (14.14). If \( {\theta}_S^{*}=1 \), DMU *o should achieve an efficiency rating of 1 by model (14.14). If not, it contradicts that \( {\theta}_S^{*}=1 \), i.e., \( {t}^{*}={\theta}_T^{*} \). Therefore, by Proposition 1, DMU *0 is MPSS. In other words, DMU0 exhibits or is projected onto a region of the efficient frontier exhibits constant returns to scale.

If \( {\theta}_S^{*}<1 \), or equivalently, the optimal objective function (ϕ/θ) of model (14.11) is larger than 1, the current input–output data of DMU *0 can be improved in productivity by adjusting the scale of it. This is because the percentage by which the outputs gain equiproportionate increase due to the adjustment of the scale will outweigh the percentage by which the inputs increase equiproportionate, or the input equiproportionate reduction will outweigh the output equiproportionate reduction. To sum up, if \( {\theta}_S^{*}<1 \), DMU0 is currently not located in CRS region of the frontier or not projected onto a region of the frontier that exhibits CRS.

Below we provide Proposition 2 to shed light on how to determine whether IRS or DRS prevail at DMU0 with the aid of model (14.15).

Proposition 2. (Conditions for the Determination of RTS (Multi-component ))

  1. (1)

    If \( {\theta}_S^{*}=1 \), then DMU0 exhibits or is projected onto a region of the efficient frontier exhibits constant returns to scale.

  2. (2)

    If \( {\theta}_S^{*}<1 \) and the optimal values of models ( 14.14 ) and ( 14.15 ) below coincide, then DMU 0 exhibits or is projected onto a region of the efficient frontier that exhibits increasing returns to scale.

  3. (3)

    If \( {\theta}_S^{*}<1 \) and the optimal values of models ( 14.6 ) and ( 14.15 ) below coincide, DMU 0 exhibits or is projected onto a region of the efficient frontier that exhibits decreasing returns to scale.

A short proof of the proposition is in order. We consider the condition (2): \( {\theta}_S^{*}<1 \) and the optimal values of models (14.14) and (14.15) coincide. The condition (3) can be established similarly.

Let \( {\overline{\lambda}}_j^{1*} \) and \( {\overline{\lambda}}_j^{2*} \) be the optimal solutions of models (14.14) and (14.15). It is clear that \( {\displaystyle {\sum}_{j=1}^n{\overline{\lambda}}_j^{1*}}={\displaystyle {\sum}_{j=1}^n{\overline{\lambda}}_j^{2*}}<1 \). DMU *0 can make improvement through output augmentation since \( {\phi}^{*}=1/{\displaystyle {\sum}_{j=1}^n{\overline{\lambda}}_j^{1*}}>1 \). As DMU *0 is technically efficient, the only way that it can increase the output level is by increasing the level of inputs. As the percentage by which the outputs increase outweighs the percentage by which the inputs increase, DMU0 is currently located in the region that shows increasing returns to scale.

We have to show now it is impossible to lower its output level, and at the same time improve the productivity, i.e., achieve MPSS , since we have not checked if model (14.15) can achieve an value less than that of model (14.6) (i.e., θ * T ) if \( {\displaystyle {\sum}_{j=1}^n{\lambda}_j^1}={\displaystyle {\sum}_{j=1}^n{\lambda}_j^2}\le 1 \) is replaced by \( {\displaystyle {\sum}_{j=1}^n{\lambda}_j^1}={\displaystyle {\sum}_{j=1}^n{\lambda}_j^2} \) \( \ge 1 \). It should be noted that an optimal value less than θ * T in this context indicates DMU *0 can gain benefits by lowering its input level. If this were true, the RTS of DMU *0 will have an ambiguous meaning, since it can gain positive change in productivity by either lowering or augmenting its input level.

We claim impossibility by contradiction. Suppose \( {\overline{\lambda}}_{1j}^{1*} \), \( {\overline{\lambda}}_{1j}^{2*} \), t *1 and \( {\overline{\lambda}}_{2j}^{1*} \), \( {\overline{\lambda}}_{2j}^{2*} \), t *2 are the respective optimal solutions of model (14.15) and the model similar to model (14.15) except that \( {\displaystyle {\sum}_{j=1}^n{\lambda}_{1j}^{1*}}={\displaystyle {\sum}_{j=1}^n{\lambda}_{1j}^{2*}}<1 \) is replaced by \( {\displaystyle {\sum}_{j=1}^n{\lambda}_{2j}^{1*}}={\displaystyle {\sum}_{j=1}^n{\lambda}_{2j}^{2*}}>1 \). In addition, \( {t}_1^{*}={t}^{*}\le {t}_2^{*}<{\theta}_T^{*} \) (i.e., \( {\theta}_S^{*}<1 \)). Thus, there exists a convex combination of the two solutions with \( {t}^{*}=a{t}_1^{*}+\left(1-a\right){t}_2^{*}<{\theta}_T^{*} \), and \( {\displaystyle {\sum}_{j=1}^n\left(a{\lambda}_{1j}^{1*}+\left(1-a\right){\lambda}_{2j}^{1*}\right)}= \) \( {\displaystyle {\sum}_{j=1}^n\left(a{\lambda}_{1j}^{2*}+\left(1-a\right){\lambda}_{2j}^{2*}\right)}=1 \), which contradicts the premise that θ * T is the optimal value of model (14.6). Thus, impossibility holds and condition (2) has an unambiguous meaning.

$$ \begin{array}{l}{t}_{nirs}^{*}= \min \kern0.36em t\\ {}\mathrm{s}.\mathrm{t}.\kern0.6em {\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^k}}{x}_{ij}^{sk}+{\overline{s}}_i^{s-}=t{x}_{io}^s\kern2em i=1,\dots, l\ \left(\mathrm{shared}\ \mathrm{inputs}\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^1}{x}_{ij}^1+{\overline{s}}_i^{1-}=t{x}_{io}^1\kern2em i=1,\dots, m\ \left(\mathrm{inputs}\ \mathrm{dedicated}\ \mathrm{t}\mathrm{o}\ {\mathrm{SDMU}}_1\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^2}{x}_{ij}^2+{\overline{s}}_i^{2-}=t{x}_{io}^2\kern2em i=1,\dots, h\ \left(\mathrm{inputs}\ \mathrm{dedicated}\ \mathrm{t}\mathrm{o}\ {\mathrm{SDMU}}_2\right)\\ {}\kern2em {\displaystyle \sum_{k=1}^2{\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^k}}{y}_{rj}^{sk}-{\overline{s}}_r^{s+}={y}_{ro}^s\kern2em r=1,\dots, u\ \left(\mathrm{shared}\ \mathrm{o}\mathrm{utputs}\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^1}{y}_{rj}^1-{\overline{s}}_r^{1+}={y}_{ro}^1\kern2em r=1,\dots, s\ \left(\mathrm{outputs}\ \mathrm{produced}\ \mathrm{b}\mathrm{y}\ {\mathrm{SDMU}}_1\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^2}{y}_{rj}^2-{\overline{s}}_r^{2+}={y}_{ro}^2\kern2em r=1,\dots, q\ \left(\mathrm{outputs}\ \mathrm{produced}\ \mathrm{b}\mathrm{y}\ {\mathrm{SDMU}}_2\right)\\ {}\kern2em {\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^1}={\displaystyle \sum_{j=1}^n{\overline{\lambda}}_j^2}\le 1\kern2em \\ {}\kern2em {\lambda}_j^k\ge 0,k=1,2,j=1,\dots, n.\end{array} $$
(14.15)

14.4 Theoretical Connection Between Black Box Approach and Multi-component Approach

In this section, we establish the equivalence between the method proposed in the previous section and the traditional black approach provided by Färe et al. (1985, 1994). This further ensures consistency in transition from black box to multi-component setting.

The efficiency measurements based on CRS, VRS, and NIRS respectively are provided as follows:

  1. 1.

    Efficiency index based on CRS;

    $$ \begin{array}{l}{\theta}_b^{crs}= \min \kern0.36em \theta \\ {}\mathrm{s}.\mathrm{t}.\kern0.6em {\displaystyle \sum_{j=1}^{n^2}{\lambda}_j}{y}_{rj}\ge {y}_{ro}\kern2em r=1,\dots, s.\\ {}\kern1.75em {\displaystyle \sum_{j=1}^{n^2}{\lambda}_j}{x}_{ij}\le \theta {x}_{io}\kern2em i=1,\dots, m.\\ {}\kern1.75em {\lambda}_j\ge 0,j=1,\dots, {n}^2.\end{array} $$
    (14.16)
  2. 2.

    Efficiency index based on VRS;

    $$ \begin{array}{l}{\theta}_b^{vrs}= \min \kern0.36em \theta \\ {}\mathrm{s}.\mathrm{t}.\kern0.6em {\displaystyle \sum_{j=1}^{n^2}{\lambda}_j}{y}_{rj}\ge {y}_{ro}\kern2em r=1,\dots, s.\\ {}\kern1.75em {\displaystyle \sum_{j=1}^{n^2}{\lambda}_j}{x}_{ij}\le \theta {x}_{io}\kern2em i=1,\dots, m.\\ {}\kern1.75em {\displaystyle \sum_{j=1}^{n^2}{\lambda}_j}=1\\ {}\kern1.75em {\lambda}_j\ge 0,j=1,\dots, {n}^2.\end{array} $$
    (14.17)
  3. 3.

    Efficiency index based on NIRS;

    $$ \begin{array}{l}{\theta}_b^{nirs}= \min \kern0.36em \theta \\ {}\mathrm{s}.\mathrm{t}.\kern0.6em {\displaystyle \sum_{j=1}^{n^2}{\lambda}_j}{y}_{rj}\ge {y}_{ro}\kern2em r=1,\dots, s.\\ {}\kern2em {\displaystyle \sum_{j=1}^{n^2}{\lambda}_j}{x}_{ij}\le \theta {x}_{io}\kern1.25em i=1,\dots, m.\\ {}\kern2em {\displaystyle \sum_{j=1}^{n^2}{\lambda}_j}\le 1\\ {}\kern2em {\lambda}_j\ge 0,j=1,\dots, {n}^2.\end{array} $$
    (14.18)

Färe et al. (1985, 1994) provided the following proposition for determining RTS.

Proposition 3 (Conditions for Determination of RTS (Black Box))

  1. (1)

    DMU 0 exhibits or is projected onto a region of the efficient frontier that exhibits constant returns to scale, if \( {\theta}_b^{crs}={\theta}_b^{vrs}={\theta}_b^{nirs} \).

  2. (2)

    DMU 0 exhibits or is projected onto a region of the efficient frontier that exhibits increasing returns to scale, if \( {\theta}_b^{crs}={\theta}_b^{nirs}<{\theta}_b^{vrs} \).

  3. (3)

    DMU 0 exhibits or is projected onto a region of the efficient frontier that exhibits decreasing returns to scale, if \( {\theta}_b^{crs}<{\theta}_b^{nirs}={\theta}_b^{vrs} \).

Formally, the following theorem holds.

Theorem 2

Proposition 2 is equivalent to Proposition 3.

Proof

In light of Theorem 1, we can derive that \( {\theta}_b^{vrs}={\theta}_T^{*}, \) \( {\theta}_b^{crs}={t}^{*} \) and \( {\theta}_b^{crs}={t}_{nirs}^{*} \), since the corresponding PPS’s are equal. Since \( {\theta}_s^{*}=1 \) indicates \( {t}^{*}={\theta}_T^{*}={t}_{nirs}^{*} \), it follows that the first condition of Proposition 3 is equivalent to the first condition of Proposition 2. By the same reasoning, condition 2 of the propositions is equivalent as well as their conditions 3. Thus, Proposition 3 is equivalent to Proposition 2. □

14.5 Application

In this section, data extracted from Environmental Statistics 2009 are used for illustration. We analyze the performances (efficiency and RTS) of the pollution treatment processes for waste water and waste air in China. Provinces are deemed as DMUs, each of which consists of two SDMUs, namely, capital city and non-capital cities. The pollution treatment process is depicted in Fig. 14.4.

Fig. 14.4
figure 4

Treatment process for wastewater and gas

The inputs involved in this application are three indicators : (1) number of facilities for treatment of wastewater in set (X 1); (2) number of facilities for treatment of waste gas in set (X 2); (3) annual expenditures in 10,000 Yuan (X 3). The outputs include (1) the industrial wastewater meeting discharge standards in 10,000 t (Y 1), (2) industrial sulphur dioxide removed in 10,000 t (Y 2), (3) industrial soot removed in 10,000 t (Y 3), and (4) industrial dust removed in 10,000 t (Y 4).

The inputs (X 1, X 2, X 3) are shared by capital city (SDMU1) and non-capital cities (SDMU2), and the outputs are the results of SDMU1 and SDMU2 fulfilling \( {X}_i={\displaystyle {\sum}_{k=1}^2{x}_{ik}} \) and \( {Y}_r={\displaystyle {\sum}_{k=1}^2{y}_{rk}} \).

Table 14.1 provides the input/output data by DMU (province ), and Table 14.2 provides data on inputs/outputs by SDMU1 (capital city ). Table 14.3 presents the descriptive statistics of the data on inputs/outputs . In light of Table 14.3, capital city consumes relative more inputs and produces comparatively less outputs. An average capital city consumes inputs 19 %, 21 % and 26 % of the means of X 1 , X 2 and X 3 respectively. However, the amounts produced account for 20 %, 17 %, 17 % and 14 % respectively of the means of Y 1 , Y 2 , Y 3 and Y 4 by an average capital city. Thus, roughly speaking, the average capital city consumes approximately more than one fifth of the total inputs, but produces less than one fifth of the total outputs. In this case, we might reasonably claim that the capital city consumes more inputs as compared with the noncapital cities. In the sequel, we will present the computational results associated with efficiency and returns to scale.

Table 14.1 Number of facilities, annual expenditures for operation and treatment of wastewater and waste gas by province
Table 14.2 Number of facilities, annual expenditures for operation and treatment of wastewater and waste gas by capital city
Table 14.3 Descriptive statistics on input and output variables

14.5.1 Efficiency

The efficiencies of DMUs by using the black-box approach and the proposed multi-component approach are presented in Table 14.4. From the black-box approach, the results of \( {\overline{\theta}}_o^{crs} \) (CCR model), \( {\overline{\theta}}_o^{nirs} \) and \( {\overline{\theta}}_o^{vrs} \) (BCC model) are reported in columns 2–4. Column 5 presents results by Kao’s parallel model which, in fact, are based on the \( {\overline{T}}^{CRS} \) (see Kao (2009 b)). Using the multi-component approach the results of t*, θ * nirs , θ * T by models (14.14), (14.15) and (14.6) are presented in columns 6–8.

Table 14.4 Results of various models

Now we focus on the results of \( {\overline{\theta}}_o^{vrs} \) and θ * T , both of which are based on the VRS assumption. Note that \( {\overline{\theta}}_o^{vrs} \) is the result of the black-box approach without considering the internal mechanism of a DMU, and θ * T is the result of multi-component approach. The difference between the two efficiency indexes can be attributed to the level of information requirements. Obviously, if more information is available, we are able to refine the results from the black-box approach. Overall, notice from Table 14.4 that the mean of θ * T is approximately 87.6 % of the mean of \( {\overline{\theta}}_o^{vrs} \), with a standard deviation of 0.103. Their distributions are provided in Fig. 14.5. The distribution of θ * T is more bell-shaped, while the distribution of \( {\overline{\theta}}_o^{vrs} \) is obviously skewed to the left.

Fig. 14.5
figure 5

Distribution of efficiency scores

Furthermore, according to Fig. 14.5, 15 provinces are classified as efficient by the BCC. It can be seen the discrimination power of BCC model in this application is too weak. By contrast, 12 of them are degraded in efficiencies by the multi-component approach. They are Hebei, Liaoning, Zhejiang, Jiangxi, Shandong, Henan, Guangdong, Guangxi, Tibet, Gansu, Qinghai, and Ningxia. Seven of them are given efficiency scores lower than 0.9.

Finally, we point out that the efficiency scores based on T CRS are almost the same as those based on \( {\overline{T}}^{CRS} \). Though the differences of θ * Kao and t* are negligible, we can find that the efficiencies of some DMUs such as Zhejiang and Hunan are adjusted slightly.

14.5.2 Returns to Scale

The RTS of provinces can be determined by Proposition 3 (black box), and Proposition 2 (multi-component). The results are presented in Table 14.5.

Table 14.5 Results of various models

Table 14.5 shows that approximately half of the provinces which are classified by the black-box approach as CRS and IRS are reclassified as DRS or CRS by the multi-component approach. Those classified as DRS by the black-box approach remain the same by the both approaches. We concentrate here on the results of the multi-component approach. In summary, six provinces show IRS, five provinces show CRS and the rest show DRS. Among those that show CRS, Inner Mongolia and Jilin have the MPSS because the optimal value in Model (14.10) that corresponds to t* in Table 14.4 equals one. We proceed to rearrange the results by the multi-component approach according to the administrative regions of China. The results are provided in Table 14.6.

Table 14.6 RTS by administrative regions

From Table 14.6, the developed provinces are more likely to show DRS. In particular, East China shows DRS entirely. Another obvious finding is that the provinces that show IRS are mainly located in the west of China, which is less developed area of China.

14.6 Summary and Conclusion

This paper studies the efficiency evaluation and RTS estimation in the situation where a DMU has multi-component structure. Radial measurements for efficiency evaluation and a procedure to determine the RTS of a DMU that is unaffected by possible multiple optima are provided. In doing so, we emphasize the theoretical connections between the black-box approach, which has been extensively studied in the literature, and the proposed methods. The strong relationship as is given by theorem 1 ensures a consistent transition from the black-box approach to the multi-component approach.

In the application section, we use the proposed method to study the efficiencies and RTS of pollution treatment processes in China. The results show that the multi-component approach has strong discrimination power : the efficiency scores obtained are distributed in a bell-shaped manner, contrast this to the weak discrimination power as evidenced by the black-box approach with the distribution of efficiency scores skewed to the left. It is also found that six provinces show IRS, five provinces show CRS, and the rest show DRS. Among those that show CRS, Inner Mongolia and Jilin have the MPSS. Furthermore, the developed provinces are more likely to show DRS. In particular, East China shows DRS entirely. In contrast, the provinces that show IRS are mainly located in the west, which is a less developed area of China.