Abstract
Applications of data envelopment analysis (DEA) models show that inadequate results may arise in some cases, two of these inadequacies being: (1) too many efficient units may appear in some DEA models; (2) a DEA model may show an inefficient unit from the point of view of experts as an efficient one. The purpose of this paper is to identify units that may unduly become efficient. The concept of a terminal unit is introduced for such units. It is shown by establishing theorems how units can be identified as terminal units. An approach for improving the adequacy of DEA models based on terminal units is suggested, and an example shown based on a real-life data set for Russian banks.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The seminal paper of Farrell (1957) defined efficiency measures and suggested both non-parametric and parametric estimation methods, and Farrell and Fieldhouse (1962) used linear programming for the first time. Charnes et al. (1978) (CCR) coined the term data envelopment analysis (DEA) and generalised and put DEA into the linear programming format we use today.
After a decade of applications of DEA following the publication of CCR, it was recognised that results both concerning efficiency scores and shape of the frontier production function, on which Farrell efficiency measures are based, were not always adequate when confronted with expert knowledge of the units to which DEA was applied. Types of inadequacies discussed in the literature have been that too many efficient units may appear in some DEA models, a DEA model may show an inefficient unit from the point of view of experts as an efficient one, too many zeros appear as solutions for the multipliers (or shadow prices or weights) (variables appearing in the dual solution if the primal model is the envelopment model formulated in the space of inputs and outputs), and units are not ‘properly’ enveloped.
Already Farrell (1957, p. 256–257) restricted the estimation of the non-parametric frontier function and consequently some of the efficiency scores by introducing artificial observations of the zero-infinity type in order to secure convex isoquants. CCR criticised this approach and introduced a non-Archimedean number weighting the slacks in the objective function and restricting the shadow prices (multipliers) to be strictly positive “because all resources and outputs are assumed to have ‘some’ positive value” (Charnes et al. 1979).
Thompson et al. (1986) generalised the restriction on dual variables introduced by CCR restricting the multipliers (weights) in a more general way than using the non-Archimedean number as a lower constraint on the weights. The problem of Thompson et al. (1986) was that the number of units under investigation was so small (only six) that all but one of the units was rated efficient using conventional DEA. In order to increase the discrimination restrictions on the multipliers or shadow prices or weights were enforced.
Based on Thompson et al. (1986) an elegant and subtle approach was developed in the DEA literature to deal with the problems of inadequacies of the DEA models. This approach was based on incorporating domination cones (Yu 1974) in DEA models. A number of outstanding papers were devoted to substantiation, development and applications of domination cones to DEA models (Brockett et al. 1997; Thompson et al. 1990, 1997; Wei et al. 2008; Yu et al. 1996; Charnes et al. 1989, 1990). The two latter papers introduced the cone-ratio approach of basing the shape of the frontier on a few efficient units selected by experts by restricting the weights (multipliers) to be within cones in the dual space.
Building on an approach in Thanassoulis and Allen (1998) and Allen and Thanassoulis (2004) followed the idea of Farrell (without a reference) of dealing with the inadequacies of the DEA solution by introducing artificial observation termed anchor units in the case of constant returns to scale and a single input. This definition was generalised in Bougnol and Dulá (2009) to multiple inputs and outputs and variable returns to scale. An elaborate algorithm for finding anchor points was introduced. The empirical applications gave the somewhat surprising result that almost all extreme efficient units are in fact anchor points. The situation of some zeros for weights seems to be the normal situation for DEA applications. However, their algorithms may produce units that are just usual efficient units (vertices) in DEA models.
Thanassoulis et al. (2012) elaborated further the super-efficiency approach for finding anchor units in a general model exhibiting variable returns to scale. However, their approach does not reveal all efficient units that may be the point of departure for improving envelopment in the Banker et al. (1984) model of variable returns to scale.
Edvardsen et al. (2008) suggested an empirical witty method for discovering “suspicious” units, which is units that may cause inadequate results in the DEA models; they call them exterior units. However, their method cannot discover all suspicious units.
The purpose of this paper is to identify units that may unduly become efficient by making use of a new concept; a terminal unit. This concept is better suited as points of departure for introducing artificial observations than the definitions of anchor units or exterior units. The definition of terminal units is linked both to the Farrell approach of introducing artificial observations and to the approach of incorporating domination cones in DEA models.
Cones are usually determined in the dual space of multipliers. It is rather difficult, however, for a manager (the decision-maker) to determine cones in the multipliers space that is dual to the space of inputs and outputs where a production possibility set is constructed (Cooper et al. 2000). For this very reason only two particular DEA models with cones are widely used in practice at present: the assurance region model and the cone-ratio model (Cooper et al. 2000).
The plan of the paper is to go into the background in Sect. 2, using key elements from the cone-ratio approach developed in Charnes et al. (1990), based on the Banker et al. (1984) model of variable returns to scale, and establish necessary definitions. The main results are presented in Sect. 3, including the definition of a terminal unit and illustrating its difference from the term anchor unit and using domination cones to establish that terminal production units exist if some production units become inefficient if cones are inserted in the model. Some numerical experiments on data for Russian banks are carried out in Sect. 4, showing how to find a terminal unit and how to use experts to indicate an artificial efficient unit using a visual interactive graphical technique. Section 5 concludes and offer ideas for further research.
2 Background
It was shown in the DEA scientific literature (see Krivonozhko et al. 2009) that the model in Banker et al. (1984) exhibiting variable returns to scale (hereafter termed the BCC model) can approximate any DEA model from a large family of DEA models. For this reason, we consider the BCC model as a basic model in our exposition.
Consider a set of n observations of actual production units (X j , Y j ), j = 1, …, n, where the vector of outputs Y j = (y 1j , …, y rj ) ≥ 0, j = 1, …, n, is produced from the vector of inputs X j = (x 1j , …, x mj ) ≥ 0. The production possibility set T is the set {(X, Y) | the outputs Y ≥ 0 can be produced from the inputs X ≥ 0}. The primal input-oriented BCC model can be written in the form
where X j = (x 1j , …, x mj ) and Y j = (y 1j , …, y rj ) represent the observed inputs and outputs of production units j = 1, …, n, \(S^{ - } = (s_{1}^{ - } , \ldots ,s_{m}^{ - } )\) and S + = (s +1 , …, s + r ) are vectors of slack variables. In this primal model the efficiency score θ of production unit (X o , Y o ) is found; (X o , Y o ) is any unit from the set of production units (X j , Y j ), j = 1, …, n.
Notice that we do not use an infinitesimal constant ɛ (a non-Archimedean quantity) explicitly in the DEA models, since we suppose that each model is solved in two stages in order to separate efficient and weakly efficient units.
The dual multiplier form of the BCC model (1a) is expressed as
where (v, u, u 0) is a vector of dual variables, v ∈ E m, u ∈ E r, u 0 is an unconstrained scalar variable associated with the convexity constraint.
The BCC primal output-oriented model can be written in the following form
The dual multiplier form of the BCC output-oriented model (1c) is written in the form
where (v, u, u 0) is a vector of dual variables, v ∈ E m, u ∈ E r, u 0 is a scalar variable associated with the convex constraint [the same symbols for dual variables are used as for models (1b)].
Definition 1
(Cooper et al. 2000). Unit (X o , Y o ) ∈ T is called efficient with respect to the input-oriented BCC model if and only if any optimal solution of (1a) satisfies: (a) θ * = 1, (b) all slacks s − k , s + i , k = 1, …, m, i = 1, …, r are zero.
If the first condition (a) in Definition 1 is satisfied, then unit (X o , Y o ) is called input weakly efficient with respect to the BCC input-oriented model. We denote the set of these weakly efficient points by WEff I T. In the DEA literature (Banker and Thrall 1992; Seiford and Thrall 1990) this set is also called the input boundary.
Definition 2
(Cooper et al. 2000). Unit (X o , Y o ) ∈ T is called efficient with respect to the output-oriented BCC model if and only if any optimal solution of (1c) satisfies: (a) η * = 1, (b) all slacks s − k , s + i , k = 1, …, m, i = 1, …, r are zero.
If the first condition in Definition 2 is satisfied, then unit (X o , Y o ) is called output weakly efficient with respect to the BCC model. We denote the set of these weakly efficient points by WEff O T. In the DEA literature (Banker and Thrall 1992; Seiford and Thrall 1990), this set is also called the output boundary.
Definition 3
(Cooper et al. 2000). Activity (X′, Y′) ∈ T is weakly Pareto efficient if and only if there is no (X, Y) ∈ T such that X < X′ and Y > Y′. We denote the set of weakly Pareto efficient activities by WEff P T.
We denote the set of efficient points of T with respect to the BCC model (1a–1d) by EffT. Krivonozhko et al. (2005) have proved that the following relations hold:
where the boundary of T is designated as BoundT.
The production possibility set T B for the BCC model can be written in the form (Banker et al. 1984)
In this paper we will mainly consider production possibility sets of this type.
3 Main results
The use of cones will play a crucial role in our search for terminal units. The main idea of incorporating domination cones in DEA models is to reduce the domain of multipliers. For this purpose, additional constraints on multipliers are incorporated in the DEA models.
In the assurance region method, constraints on the multipliers are added to the CCR model in the following manner; see Charnes et al. (1990),
where l 1i , k 1i , L 1s , K 1s are given low and upper bounds on the ratios of multipliers.
Assertion 1
There exist polyhedral cones in multidimensional space E m+r that cannot be described by relations (3).
Thus, formulas (3) describe only some subset of possible polyhedral cones in multidimensional space E m+r. The next model enables one to use more general form of domination cones in the DEA models.
The dual multiplier form of the cone-ratio model is expressed as (Charnes et al. 1989, 1990; Yu et al. 1996)
where variables v ∈ E m, u ∈ E r, u o ∈ E 1 and U ⊆ E r+ , V ⊆ E m+ are given polyhedral cones.
The primal problem of (4a) is written as
where V * and U * are negative polar cones of sets V and U, respectively.
In practice (Charnes et al. 1990; Cooper et al. 2000), polyhedral cones U and V are constructed as follows: (a) some excellent units are chosen from the point of view of experts; (b) averages of the optimal multipliers u * i , v * i are computed for every excellent unit i ∈ Ex, where Ex denotes the set of excellent units (a subset of Eff T). Vectors u * i , v * i , i ∈ Ex form polyhedral cones U and V.
Cones U and V reduce the feasible domain of multipliers, while the feasible domains of inputs, see Fig. 1, and outputs, see Fig. 2, are expanding.
Now, we make an attempt to reveal the causes of inadequacies in DEA models.
Assumption
Let the cone-ratio method allows one to reduce the number of “suspicious” production units, i.e. the units that are efficient, but should be inefficient from the point of view of experts.
Production possibility set T B (2) is a convex polyhedral set. According to the classical theorems of Goldman (1956) and Motzkin (1936) any convex polyhedral set can be represented as a vector sum of convex combination of vertices and the non-negative linear combination of vectors (rays).
Before going further, let us recall some notions from convex analysis. Faces are formed by an intersection of the supporting hyperplane and the polyhedral set. In the DEA models, the dimension of face may vary from 0 up to (m + r − 1), the maximal dimension. Faces of maximal dimension are called facets. Faces of 0-dimension are known as vertices, 1-dimension as edges.
Definition 4
We call an efficient (vertex) unit terminal unit if an infinite edge is going out from this unit.
According to the classification of Charnes et al. (1991) terminal units belong to the class E (extreme efficient).
We denote the set of terminal units with respect to the production possibility set (2) by T term .
Definition 5
We call a face Γ of set T B a terminal face if this face contains an infinite edge.
Then the following assertion can be proved.
Theorem 1
If some efficient production units in model (1a–1d) become inefficient in model (4a, 4b) as a result of inserting cones in the BCC model (1a–1d), then it is necessary that there exist terminal production units among such inefficient units.
Proof
See the proof of Theorem 12 in “Appendix 1” for a more general case.
Observe that the results of Theorem 1 was proved for the cone-ratio model at first, since this model was thoroughly elaborated from theoretical and practical points of view in the scientific literature on the DEA models, see Charnes et al. (1990), Brockett et al. (1997) and Cooper et al. (2000). However, the cone-ratio model cannot cover all possible cases where inadequate results may appear in the DEA models. Therefore, theorem 12 is proven in this paper for the generalized DEA model with domination cones.
Thus, Theorem 1 shows that terminal points are the first “suspicious” units which may cause inadequate results in the DEA models.
The following optimization models enable us to find terminal units or units belonging to terminal faces of the production possibility set. For this purpose two types of models are solved for every efficient unit (vertex) q that belongs to the class E according to the classification of Charnes et al. (1991).
Problem P k (q) (k = 1, …, m):
where d k = (0, …, 1, …, 0) ∈ E m, the unity is in kth position.
Variable τ provides that ray (X q + τd k , Y q ) going out from unit (X q , Y q ) belongs to the feasible set of the model (5).
Theorem 2
If in problem (5) the optimal value J *1k = 1, then unit (X q , Y q ) is a terminal one or belongs to a terminal face Γ ⊂ WEff O T.
Proof
Under any τ ≥ 0 ray (X q + τd k , Y q ) is a feasible subset for production possibility set T B (2) due to monotonicity of T B . As it follows from model (5), if J *1k = 1, then this ray belongs to the set WEff O T B . Hence this ray is an unbounded edge or belongs to an unbounded face of set T B .
This completes the proof.
The following models determine infinite edges emanating along direction g i , where g i = (0, …, 1, …, 0) ∈ E r (the unity is in ith position).
Problem R i (q) (i = 1, …, r)
Theorem 3
If in problem (6) the optimal value J *2i = 1, then unit (X q , Y q ) is a terminal one or belongs to a terminal face Γ ⊂ WEff I T.
The proof of Theorem 3 is very similar to the proof of the previous theorem and is therefore skipped.
Thus, models (5) and (6) enable us to reveal terminal units or efficient units belonging to unbounded faces and also directions of infinite edges going out from efficient units.
The following problems enable one to discover only terminal units among all units belonging to the class E.
Problem \(\bar{P}_{k} (q)\) (k = 1, …, m):
Theorem 4
Unit (X q , Y q ) from the class E is a terminal one if the optimal value of problem (7) \(\bar{J}_{1i}^{*} = \lambda_{q}^{*} = 1\).
Proof
Observe that solution λ j = 0, j = 1, …, n, j ≠ q, λ q = 1, τ = 0 is a feasible solution of problem (7). So, if λ * q = 1, this implies that any interior point belonging to the ray (X q + τd k , Y q ) cannot be represented as a convex combination of some other points of production possibility set T B . Hence this ray is an infinite edge and point (X q , Y q ) is a terminal unit.
However, if λ * q < 1, this implies that some interior points of the ray (X q + τd k , Y q ) can be represented as a convex combination of some other points of set T B .
This completes the proof.
The following problems allow one to find only terminal units and infinite edges emanating along directions g i , i = 1, …, r.
Problem \(\bar{R}_{i} (q)\) (i = 1, …, r)
Theorem 5
Unit (X q , Y q ) from the class E is a terminal one if the optimal value of problem (8) \(\bar{J}_{2i}^{*} = \lambda_{q}^{*} = 1\).
The proof of Theorem 5 is very similar to the proof of Theorem 4 and is therefore skipped.
Let us introduce the following designations \(\bar{d}_{k} = (d_{k},0) \in E^{m + r}\), k = 1, …, m, \(\bar{g}_{i} = (0,g_{i}) \in E^{m + r}\), i = 1, …, r.
Proposition
Only vectors of the following forms \(\bar{d}_{k} = (d_{k},0) \in E^{m + r}\), k = 1, …, m, \(\bar{g}_{i} = (0,g_{i}) \in E^{m + r}\), i = 1, …, r can be the direction vectors of infinite edges of set T B .
The assertion of the proposition follows from the structure of the polyhedral set T B (Goldman 1956; Nikaido 1968; Rockafellar 1970).
Based on assertions formulated above we can prove the following result.
Theorem 6
Unit (X q , Y q ) from the class E is a terminal one if and only if it turns out to be terminal at least in one of the problems \(\bar{P}_{k} (q)\), k = 1, …, m in (7) or \(\bar{R}_{i} (q)\), i = 1, …, r in (8).
Proof
Let unit (X q , Y q ) be a terminal one. Then an infinite edge with direction vector of the form \(\bar{d}_{k} = (d_{k},0) \in E^{m + r}\), k = 1, …, m, \(\bar{g}_{i} = (0,g_{i} ) \in E^{m + r}\), i = 1, …, r is going out from this unit according to Theorem 6. Let it be vector \(\bar{d}_{k}\), without loss of generality. Consider problem of type \(\bar{P}_{k} (q)\) (7), where direction vector d k is used. Since d k determines an infinite edge, then any interior point belonging to the ray (X q + τd k , Y q ) cannot be represented as a convex combination of some other points of set T B . Hence \(\lambda_{q}^{*} = 1\), \(\lambda_{j}^{*} = 0\), j = 1, …, n, j ≠ q, τ * ≥ 0 is an optimal solution of problem (7).
Conversely, let unit (X q , Y q ) be a terminal one in one of the problems \(\bar{P}_{k} (q)\), k = 1, …, m in (7) or \(\bar{R}_{i} (q)\), i = 1, …, r in (8). Then according to Theorems 4 and 5, this unit is a terminal one.
This completes the proof.
Bougnol and Dulá (2009) determined an anchor point as an efficient vertex belonging to an unbounded face of set T B . They proposed algorithms for discovering anchor points.
Let us denote the set of anchor units in the BCC model (1a–1d) with respect to the definition of Bougnol and Dulá (2009) by T 1 anc .
Theorem 7
Unit (X q , Y q ) belongs to the set T 1 anc in the BCC model (1a–1d) if and only if the optimal value J *1k = 1 and/or J *2i = 1 at least in one of the problems P k (q), k = 1, …, m in (5) and R i (q), i = 1, …, r in (6).
Proof
The result follows from Theorems 2 and 3. Indeed, if J *1k = 1 and/or J *2i = 1 at least in one of the problems P k (q), k = 1, …, m (5) and R i (q), i = 1, …, r (6), then unit (X q , Y q ) belongs to an edge or to a terminal face of the production possibility set T B (2). Remember from the convex analysis that an edge of the polyhedral set represents also a face of 1-dimension. Hence unit (X q , Y q ) is an anchor point with respect to the definition of Bougnol and Dulá (2009).
Conversely, let vertex (X q , Y q ) belong to T 1 anc . Hence unit (X q , Y q ) belongs to an unbounded face. All unbounded directions of this face are determined by some of the vectors \(\bar{d}_{k}\), k = 1, …, m, \(\bar{g}_{i}\), i = 1, …, r. Take some of these vectors. Let it be vector \(\bar{d}_{k}\), without loss of generality. Consider problem of type P k (q) (5), where direction vector d k is used. The ray \((X_{q} + \tau {\kern 1pt} d_{k},Y_{q})\) belongs to the set \(WEff_{O} {\kern 1pt} T_{B}\), since (X q , Y q ) is an efficient unit (vertex) and vector (0, Y q ) is perpendicular to \(\bar{d}_{k}\). Hence solving (5), we obtain J *1k = 1.
This completes the proof.
Thus Theorem 7 gives us a constructive way to reveal whether unit (X q , Y q ) from the class E belongs to the set T 1 anc or not. For this purpose one has to solve problems P k (q), k = 1, …, m (5) and/or R i (q), i = 1, …, r (6).
Now we can formulate the following result.
Corollary 1
The set of terminal units T term of the BCC model (1a–1d) belongs to the set of anchor points T 1 anc with respect to the definition of Bougnol and Dulá, i.e. T term ⊆ T 1 anc .
This result immediately follows from Theorems 2, 3 and 7.
Edvardsen et al. (2008) suggested an empirical method for discovering “suspicious” units, they call them “exterior units”. Let T ext denote the set of exterior units in the BCC model (1a–1d). However, their method cannot discover all suspicious units. Indeed, consider the following illustrative example, Fig. 3, panel (a). The three dimensional BCC model is determined by units A, B, C, D. Inputs and outputs of these units are presented in Table 2, see “Appendix 2”. Again consider the BCC model with the same units, but now y is an input variable, x 1 and x 2 are output variables, see Fig. 3, panel (b), points E and F are projections of points B and C onto planes x 1 Oy and x 2 Oy, respectively. Point D is not efficient in this case. Hence this point is not exterior. At the same time this point is a “suspicious” unit, since it may cause inadequacy in the DEA model. Units, like point D, belonging to unbounded faces and not being terminal units are also “suspicious” points. These units may also cause inadequate results in the DEA models. However, our computational experience shows that the number of such units in real-life data sets is very small in comparison with the number of terminal units.
The next theorem establishes that the set of terminal units includes the set of exterior units.
Theorem 8
The set of terminal units T term of the BCC model contains the set of exterior units T ext , i.e. T ext ⊆ T term .
Proof
See “Appendix 1”.
However, some terminal units may not belong to the set of exterior units. Indeed, consider the following illustrative example. Figure 4 depicts a 3-dimensional BCC model, points A–F (efficient units) determine the production possibility set T B . Inputs and outputs of these units are indicated in Table 3, see “Appendix 2”.
Units D and E are terminal ones since these units belong to unbounded edges. However, these units are not exterior ones since these units will be inefficient after “reversing the inputs and outputs”.
The following assertion summarizes the results of Corollary 1 and Theorem 8.
Theorem 9
For BCC model (1a–1d) the following relations hold T ext ⊆ T term ⊆ T 1 anc .
Its proof is based on previously stated results.
Thanassoulis et al. (2012, p. 178) proposed a new definition of anchor units for efficient units, “which with reference to the extreme-efficient DMUs [vertices] but excluding the evaluated DMU itself, can be rendered class F by contracting radially their output levels, while keeping their input level constant OR by increasing inputs and keeping their output levels constant”, where class F according to Charnes et al. (1991) contains inefficient DMUs that are on the PPS boundary. However, the set of anchor units with respect to definition of Thanassoulis et al. (2012, p. 178) does not coincide with the set of anchor units according to Bougnol and Dulá (2009). Indeed, consider the following illustrative example. In Fig. 5, a two-inputs/one-output BCC model is depicted. Units A, B, C, D, E are the observed efficient production units that determines set T B . Inputs and outputs of these units are indicated in Table 4, see “Appendix 2”.
Points M,A, B, C, L form the face of set T B . This face belongs to the orthant X 1 OY. However, unit B is just a common vertex of the BCC model. Let “unit B be rendered class F by increasing inputs and keeping their output levels constant”. As a result point B 1 will be obtained. This point belongs to segment AC, an efficient part of the frontier. Again, let “unit B be rendered class F by contracting radially their output levels, while keeping their input level constant”. As a result point B 2 will be obtained. Point B 2 belongs to an efficient part of the frontier, segment AC. Hence, unit B is not anchor unit according to Thanassoulis et al. (2012). However, unit B is an anchor unit according to Bougnol and Dulá (2009), since this unit belongs to an unbounded face.
Let us denote the set of anchor units in the BCC model (1a–1d) with respect to the definition of Thanassoulis et al. (2012, p. 178) by T 2 anc . Then the following assertion can be proved.
Theorem 10
For BCC model (1a–1d) the following relation holds T 2 anc ⊆ T term .
Proof
See “Appendix 1”.
Theorem 10 shows that approach of Thanassoulis et al. (2012) does not reveal all efficient units that may be used for improving envelopment in BCC models.
Consider again an illustrative example in Fig. 4. Units D and E are not anchor units with respect to the definition of Thanassoulis et al. (2012). However, units D and E are terminal ones, since infinite edges start from this units. Figure 6 depicts an input isoquant for unit D. Unit G is inefficient. Its projection will be point G′ on some slack face. However, their approach cannot improve this part of the boundary, unit D is not identified as an anchor unit, since unit D can be moved to the efficient part of the frontier by contracting radially its output level while keeping its input levels constant or by increasing inputs and keeping its output levels constant. Figure 7 illustrates this fact. So, their approach is incomplete.
Summarizing all mentioned above, the following assertion can be formulated.
Theorem 11
For BCC model (1a–1d) the following relations hold T 2 anc ⊆ T term ⊆ T 1 anc .
So, the approach of Bougnol and Dulá (2009) generates excess number of anchor units and approach of Thanassoulis et al. (2012) may not generate sufficient number of units in order to improve the frontier.
The cone-ratio model (3) cannot help in every case where suspicious units appear in the DEA models. In Fig. 4, point B is a terminal unit. However, it is impossible to transform the frontier with the help of cones U and V in such a way that terminal point B would be inefficient, see Fig. 6.
Only simultaneous transformation of the frontier in the space of inputs and outputs enables one to make suspicious unit B inefficient, see Fig. 8.
Yu et al. (1996) proposed the following generalized DEA (GDEA) model that unifies and extends most the well-known DEA models based on using domination cones (see, e.g. Yu 1974) in their constraint sets.
The optimization dual problem to (9a) is written in the form (Yu et al. 1996):
where \(\overline{X} = (X_{1}, \ldots, X_{n})\) is an m × n matrix, X j = (x 1j , …, x mj ) ≥ 0 is the input vector for the jth production unit j = 1, …, n; \(\overline{Y} = (Y_{1} , \ldots ,Y_{n} )\) is an r × n matrix, Y j = (y 1j , …, y rj ) ≥ 0 is the output vector for the jth production unit j = 1, …, n. Parameters δ 1, δ 2, δ 3 are binary ones assuming only the values 0 and 1. Vector e is determined as e = (1, …, 1) ∈ E n. Sets W ⊆ E m+r and K ⊆ E n are the closed convex cones, where E m+r and E n are Euclidean spaces of the dimensions (m + r) and n, respectively. W * and K * are the negative polar cones (Charnes et al. 1989; Yu et al. 1996) of sets W and K, respectively. It is usually assumed in the DEA models that the polyhedral cones W ⊆ E m+r+ and K ⊆ E n+ and int W ≠ Ø and int K ≠ Ø, then we get W * ≠ Ø and K * ≠ Ø.
Theorem 12
If some efficient units in model (1a–1d) become inefficient in model (9a, 9b) as a result of inserting cones in the BCC model, then it is necessary that there exist terminal production units among such inefficient units.
Proof
See “Appendix 1”.
It is rather difficult for a manager (expert) to determine cones in the multipliers space that is dual to the space of inputs and outputs where a production possibility set is constructed.
For this very reason it is difficult to use the GDEA model in practice.
Krivonozhko et al. (2009) proposed a model that is more general than the GDEA model, on the one hand, as it covers situations that the GDEA model cannot describe. On the other hand, this model enables one to construct step-by-step any model from a large family of the DEA models by incorporating artificial units and rays in the space of inputs and outputs in the BCC model, which makes the process of model construction visible and more understandable.
The production possibility set of this model is written in the form
where (D i , G i ), i ∈ I, I is a set of artificial production units, (A k , B k ), k ∈ J, J is a set of vectors (rays) added to the model.
Figure 9 shows the transformation of the frontier of the 2-dimensional BCC model with the help of artificial units and rays. In the figure, cone Q is formed by artificial rays, point B′ is an artificial unit.
In addition to problem (7) and (8), we can also discover terminal (suspicious) production units with the help of constructions of 2-dimensional and 3-dimensional sections of the frontier.
Define 3-dimensional affine subspace in space E m+r as
where (X o , Y o ) ∈ T B , α, β and γ are any real numbers, directions d 1, d 2, d 3 ∈ E m+r are not parallel to each other.
Next, define intersections of the frontier with 3-dimensional affine subspace
where WEff P T is a set of weakly Pareto-efficient points. Krivonozhko et al. (2005) have proved that set WEff P T coincides with the boundary of T B (2).
By choosing different directions d 1, d 2 and d 3 we can construct various 2-dimensional and 3-dimensional sections going through point (X o , Y o ) and cutting the frontier. Parametric optimization algorithms for construction of sections of the type (12) are described in detail by Krivonozhko et al. (2004) and Volodin et al. (2004).
Moreover, thanks to our package FrontierVision, one can add to the DEA model any artificial units and rays on the computer screen interactively.
Assertion 2
There always exists a section (12) that reveals any terminal unit and/or efficient units belonging to an unbounded face.
However, the specific section may not reveal some terminal units. In the 3-dimensional BCC model, see Fig. 4, unit B is a terminal one. In Fig. 6, unit B does not look like a terminal one. The section in Fig. 7 reveals this unit as a terminal point.
Generally speaking, a 2-dimensional section of the type (12) consists mainly of a number of segments and two rays. The first and the last vertices in the chain of segments are usually terminal units.
Next, a user (expert) can control the changes of efficiency scores of inefficient units as a result of inserting artificial rays and units with the help of our package FrontierVision. Indeed, the following assertion is valid.
Assertion 3
There always exists a vicinity for any terminal unit such that any change of this unit’s position within this vicinity will change only efficiency scores of some inefficient units within some small range.
So, if some changes in efficiency scores are unacceptable according to expert’s opinion, he/she can move artificial units closer to some small vicinity of a terminal unit just on the screen of the computer.
Now, we are ready to present our general Procedure for improving the adequacy of DEA models.
-
1.
Solve BCC models (1a–1d) for all units DMU j , j = 1, …, n.
If experts (decision making persons) agree with computational results, then STOP, else go to the next step.
-
2.
Experts are requested to indicate units whose efficiency scores do not correspond to their opinions. For every such units several sections of the type (12) are constructed. Set T term of terminal units is determined. Experts are requested to indicate on the 2-dimensional or 3-dimensional sections artificial efficient units or rays in the some vicinity of every terminal unit.
-
3.
Add the set of artificial units and rays to the current set of units.
Solve BCC models (1a–1d) with modified set of production units.
If computational results are acceptable, then STOP, else change vicinities of terminal units or/and positions of artificial units within vicinities of terminal units.
Go to the beginning of step 3.
Observe that in the Procedure the process of frontier transformation is controllable from the very beginning up to the end. Our computational experiments showed that it is sufficient to make two or three steps of Procedure in order to adjust his/her model and obtain reliable computational results.
Notice that we present here only general Procedure for improving of the adequacy of the DEA models. This Procedure may be specified in dependence of the real-life problems. This is an interesting problem for future development.
4 Computational results
In order to illustrate consequences of letting experts choose one unit from the terminal units and change data to more reasonable values we used a dataset for 920 Russia banks’ financial accounts for January 1 of 2009. The following inputs and outputs for the BCC output-oriented model were used:
-
Inputs: working assets; time liabilities; demand liabilities.
-
Outputs: equity capital; liquid assets; fixed assets.
The data were financial accounts of Russian banks for the year 2008. Remember that this year was the first year of the world crisis. It was important at that time for financial experts to have reliable tools for forecasting the behavior of financial institutes and for warning about possible bankruptcies.
The number of efficient banks is very low; 42 units out of 920. The majority of banks have efficiency scores <50 %. This situation is different from the situation reported in Charnes et al. (1990).
However, financial experts expressed doubt about the result of full efficiency of some banks. For example, Fig. 10 presents a cut of the frontier in a 6-dimensional space by the 2-dimensional plane for bank A; certainly we use legends instead of real names of banks. The directions of the plane are determined by two inputs: demand liabilities and working assets. Small circles denote the projections of the actual observations of banks onto the 2-dimensional plane. The colors of circles are associated with the efficiency scores of appropriate units. The red color means that the corresponding unit is efficient. The green color designates units with low efficiency score; the yellow color denotes that a unit has an intermediate value of efficiency score.
The scale is such that point (1, 1) in the figure corresponds to bank A. According to the BCC model bank A is 100 % efficient. However, experts did not agree with this evaluation, and they were right, since bank A was bankrupted in 6 months. In fact point A is a typical terminal unit, since unbounded edges go out from this unit. However, Fig. 10 cannot help us to improve the frontier.
For this purpose we should use another section. Figure 11 shows a cut of the frontier in a 6-dimensional space by the 2-dimensional plane for bank A. The horizontal axis in the figure is determined by input vector X o and the vertical axis corresponds to output vector Y o of bank A, respectively. The scale is such that point (1, 1) corresponds to bank A. The solid line outlines the production function (the cut of the production possibility set) of the model. The balls in the figure denote projections of some other banks on the 2-dimensional plane. Again, according to the model, bank A is efficient, which contradicts experts’ opinion. However, Fig. 11 can help us to improve the frontier. Experts were asked to insert an artificial efficient unit on the screen by phrasing the question how much outputs should be expected from an efficient unit using the observed inputs of unit A (implicitly assuming a proportional increase of the outputs). This artificial unit is denoted by B. In the figure, the dotted line together with the solid line after it shows the frontier of the modified model.
After the frontier transformation the efficiency score of bank A became 48.3 %. Some other banks also changed their efficiency scores after inserting artificial unit in the model. Table 1 shows efficiency scores of some banks, which were bankrupted during 6 months, in the BCC model before and after frontier transformation.
After the second run of the model, the experts recognized the modelling results to be adequate and reliable.
We have presented an investigation for only one terminal unit, but demonstrated that the choice of terminal units as units that should be investigated using expert information worked out satisfactorily; reducing the efficient unit to an inefficient one and also reducing several other units’ scores and improving the realism of the results. However, the working out of a more formal procedure for eliciting expert help in providing more realistic efficient units based on terminal units is still to be done.
5 Conclusions
In this paper, we proposed tools for discovering units which may cause inadequate results in the DEA models. It was shown that terminal units constitute “suspicious” points in the first place. If the graph of intersection of the frontier with a 2-dimensional plane is constructed, then the first and the last vertices of the graph are usually terminal units. However, it is not necessarily the case that terminal units may cause inadequate results in the DEA models, such units may be quite normal efficient points. Only experts in the specific area can evaluate the adequacy of efficiency scores of terminal units.
Terminal units arise because a non-countable (continuous) production possibility set T is determined on the basis of a finite number of production units; some of these units turn out to be terminal ones. A gap between derivatives may take place at these points. For example, the left-hand side scale elasticity takes infinite value, and the right-hand side scale elasticity takes zero value at some terminal points, see Førsund et al. (2007).
Let us remember that Farrell (1957) introduced artificial units at infinity in order to smooth his model, see also Førsund et al. (2009).
We also propose how to deal with inadequacies in the DEA models with the help of incorporating artificial units and rays interactively on the screen of the computer by experts into some BCC model. This makes the DEA models more adequate and adjustable.
Only one case of eliciting information from experts suggesting artificial units that should be efficient based on terminal units has been shown. However, several other experiments were carried out. Carrying out systematic experiments on new real-life datasets will be attempted in a further research, together with developing more formal procedures for eliciting information from experts on the activities under investigation constructing artificial efficient units based on terminal units. The use of interactive graphical representation in a space of sufficiently reduced dimensions to be readily understandable and recognizable by experts seems a promising approach.
References
Allen R, Thanassoulis E (2004) Improving envelopment in data envelopment analysis. Eur J Oper Res 154(2):363–379
Banker RD, Charnes A, Cooper WW (1984) Some models for estimating technical and scale efficiency in data envelopment analysis. Manage Sci 30:1078–1092
Banker RD, Thrall RM (1992) Estimation of returns to scale using data envelopment analysis. Eur J Oper Res 62(1):74–84
Bougnol M-L, Dulá JH (2009) Anchor points in DEA. Eur J Oper Res 192(2):668–676
Brockett PL, Charnes A, Cooper WW, Huang ZM, Sun DB (1997) Data transformations in DEA cone ratio envelopment approaches for monitoring bank performance. Eur J Oper Res 98:250–268
Charnes A, Cooper WW, Rhodes E (1978) Measuring the efficiency of decision making units. Eur J Oper Res 2:429–444
Charnes A, Cooper WW, Rhodes E (1979) Short communication. Measuring the efficiency of decision making units. Eur J Oper Res 3(4):339
Charnes A, Cooper WW, Wei QL, Huang ZM (1989) Cone ratio data envelopment analysis and multi-objective programming. Int J Syst Sci 20:1099–1118
Charnes A, Cooper WW, Huang ZM, Sun DB (1990) Polyhedral cone-ratio DEA models with an illustrative application to large commercial banks. J Econ 46:73–91
Charnes A, Cooper WW, Thrall RM (1991) A structure for classifying and characterizing efficiency and inefficiency in data envelopment analysis. J Prod Anal 2(3):197–237
Cooper WW, Seiford LM, Tone K (2000) Data envelopment analysis. Kluwer, Boston
Dantzig GB, Thapa MN (1997) Linear programming 1: introduction. Springer, New York
Dantzig GB, Thapa MN (2003) Linear programming 2: theory and extensions. Springer, New York
Edvardsen DJ, Førsund FR, Kittelsen SAC (2008) Far out or alone in the crowd: a taxonomy of peers in DEA. J Prod Anal 29:201–210
Farrell MJ (1957) The measurement of productive efficiency. J R Stat Soc Ser A (General) 120(III): 253–281, 290
Farrell MJ, Fieldhouse M (1962) Estimating efficient production functions under increasing returns to scale. J R Stat Soc Ser A (General) 125(2):252–267
Førsund FR, Hjalmarsson L, Krivonozh-ko VE, Utkin OB (2007) Calculation of scale elasticities in DEA models: direct and indirect approaches. J Prod Anal 28:45–56
Førsund FR, Kittelsen SAC, Krivo-nozhko VE (2009) Farrell revisited—visualizing properties of DEA production frontiers. J Oper Res Soc 60:1535–1545
Goldman AJ (1956) Resolution and separation theorems for polyhedral convex sets. In: Kuhn HW, Tucker AW (eds) Linear inequalities and related systems. Princeton University Press, New Jersey, pp 41–51
Krivonozhko VE, Utkin OB, Volodin AV, Sablin IA, Patrin M (2004) Constructions of economic functions and calculations of marginal rates in DEA using parametric optimization methods. J Oper Res Soc 55:1049–1058
Krivonozhko VE, Utkin OB, Volodin AV, Sablin IA (2005) About the structure of boundary points in DEA. J Oper Res Soc 56:1373–1378
Krivonozhko VE, Utkin OB, Safin MM, Lychev AV (2009) On some generalization of the DEA models. J Oper Res Soc 60:1518–1527
Motzkin TS (1936) Beiträge zur Theorie der linearen Ungleichungen. Dissertation, Basel, Jerusalem
Nikaido H (1968) Convex structures and economic theory. Academic Press, New York
Rockafellar RT (1970) Convex analysis. Princeton University Press, New Jersey
Seiford LM, Thrall RM (1990) Recent developments in DEA: the mathematical programming approach to frontier analysis. J Econometrics 46(1–2):7–38.
Thanassoulis E, Allen R (1998) Simulating weight restrictions in data envelopment analysis by means of unobserved DMUs. Manage Sci 44(4):586–594
Thanassoulis E, Kortelainen M, Allen R (2012) Improving envelopment in data envelopment analysis under variable returns to scale. Eur J Oper Res 218(1):175–185
Thompson RG, Singleton FR Jr, Thrall RM, Smith BA (1986) Comparative site evaluation for locating a high-energy physics labs in Texas. Interfaces 16(6):35–49
Thompson RG, Langemeier LN, Lee C-H, Lee E, Thrall RM (1990) The role of multiplier bounds in efficiency analysis with application to Kansas farming. J Econ 46:93–108
Thompson RG, Brinkman ES, Dharma-pala PS, Gonzalez-Lima MD, Thrall RM (1997) DEA/AR profit rations and sensitivity of 100 large U.S. banks. Eur J Oper Res 98:213–229
Volodin AV, Krivonozhko VE, Ryzhikh DA, Utkin OB (2004) Construction of three-dimensional sections in data envelopment analysis by using parametric optimization algorithms. Comput Math Math Phys 44:589–603
Wei Q, Yan H, Xiong L (2008) A bi-objective generalized data envelopment analysis model and point-to-set mapping projection. Eur J Oper Res 190:855–876
Yu PL (1974) Cone convexity, cone extreme points, and nondominated solutions in decision problems with multiobjectives. J Optim Theory Appl 14:319–377
Yu G, Wei Q, Brockett P (1996) A generalized data envelopment analysis model: a unification and extension of existing methods for efficiency analysis of decision making units. Ann Oper Res 66:47–89
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix 1
Proof of Theorem 8
Take any point Z q = (X q , Y q ) ∈ T ext . Assume that Z q is not a terminal unit. Consider the following ray
where α k > 0, k = 1, …, m, and β i > 0, i = 1, …, r are constants, \(\bar{d}_{k} = (d_{k},0) \in E^{m + r}\) (k = 1, …, m), \(\bar{g}_{i} = (0,g_{i}) \in E^{m + r}\) (i = 1, …, r).
Since Z q is not a terminal unit, some part of ray (12) goes through the interior of the set
Consider some point Z * that belongs to set (12) and to set (13).
After “reversing the inputs and outputs” according to the procedure of Edvardsen et al. (2008) point Z * will dominate point Z q . Hence point Z q cannot belong to the set of exterior points, contradicting the assumption.
This completes the proof.
Proof of Theorem 10
Take any unit Z o = (X o , Y o ) ∈ T 2 anc from the class E. Consider the case when this unit “can be rendered class F by increasing inputs and keeping their output levels constant”, where class F contains inefficient DMUs that are on the PPS boundary. Let us denote the face where unit Z 1 can be rendered by Γ1. Since face Γ1 is infinite, it can be represented in the form
where J is a subset of vertices of polyhedral set T, I 1 and I 2 are index subsets of vector Z = (X, Y) ∈ E m+r associated with vectors X and Y, respectively, and |I 1| ≤ m, |I 2| ≤ r, where |I| denotes the power of set I, e i ∈ E m+r is an identity vector with a one in position i.
Let the projection of unit Z o onto the face Γ1 be \(\tilde{Z}_{o}\). Point \(\tilde{Z}_{o}\) belongs to riΓ1, where riΓ stands for relative interior of face Γ, since μ k and ρ i are not zero.
Unit Z o does not belong to Γ1, and point \(\tilde{Z}_{o}\) belongs to riΓ1, hence according to convex analysis (Goldman 1956; Nikaido 1968; Rockafellar 1970) half-segment \([Z_{o},\tilde{Z}_{o})\) does not belongs to face Γ1.
Let us denote production possibility set of problem (1a–1d) without unit Z o by \(\tilde{T}\). Construct a supporting hyperplane H for set \(\tilde{T}\), which contains face Γ1.
If face Γ1 has full dimension, then hyperplane separates unit Z o and set \(\tilde{T}\), since half-segment does not belong to face Γ1.
If the dimension of face Γ1 is less then (m + r − 1), then there is some degree of freedom in building supporting hyper-plane H. Let us build hyper-plane H in such a way that it separates unit Z o and set \(\tilde{T}\) and hyper-plane H does not contain unit Z o .
Consider the affine set S of the following form:
Hyperplane H separates affine set S and set \(\tilde{T}\), since direction vectors e k , k ∈ I 1, e i , i ∈ I 2 are parallel to hyperplane H.
So, unit Z o emanates rays along directions e k , k ∈ I 1, and e i , i ∈ I 2. However, these rays are infinite edges since interior points of every ray cannot be represented as convex combination of points from set \(\tilde{T}\) due to existence of separating hyperplane H. Hence unit Z o is a terminal unit.
The case when unit Z o “can be rendered class F by contracting radially their output levels, while keeping their input level constant” is proved in a similar way.
This completes the proof.
Proof of Theorem 12
Consider efficient unit (X 1, Y 1) in model (1a–1d) under evaluation, that remains efficient in model (9a, 9b) after inserting domination cones, and efficient unit (X 2, Y 2) in model (1a–1d) that becomes inefficient in model (9a, 9b) after inserting domination cones. Assume, at first, that K = E n+ , this implies that cone K coincides with the first non-negative orthant.
If unit (X 2, Y 2) is a terminal point, then the theorem is proved. Assume that unit (X 2, Y 2) is not a terminal unit.
Let (v 1, u 1) be optimal dual solution in model (9a, 9b) and (v 2, u 2) be optimal dual solution in model (1a–1d) for units (X 1, Y 1) and (X 2, Y 2), respectively. It is known that dual optimal solution (v 1, u 1) is an orthogonal vector to some face of the frontier at point (X 1, Y 1) (see Cooper et al. 2000, p. 120, theorem 5.1). Notice that we do not write here the third part u 0 of the dual vector, because we need only an orthogonal vector for our purpose.
Dual optimal solution (v 1, u 1) for problem (9a, 9b) is also optimal solution for problem (1a–1d) since the inclusion of cones in model (1a–1d) may only decrease the feasible set of dual variables (Yu et al. 1996).
Denote vector of dual variables by w = (v, u) ∈ E m+r. Next, let ‖w‖2 be the quadratic norm of vector w.
Consider the following linear programming problem with a parameter α in the objective function (see Dantzig and Thapa 1997, 2003)
here
where w 1 = (v 1, u 1) and w 2 = (v 2, u 2).
According to the theory of linear programming with a parameter in the objective function when α is increasing, the optimal solution of problem (14) moves along the frontier from one face to another face.
It follows from (15), that some components of \(\overline{w}_{1} = (\overline{v}_{1},\overline{u}_{1})\) will be greater than the corresponding components of \(\overline{w}_{2} = (\overline{v}_{2},\overline{u}_{2})\) and vice versa. Some components of vector \(\overline{w}_{3} = \left({\overline{v}_{1} + \alpha^{*} (\overline{v}_{2} - \overline{v}_{1}),\overline{u}_{1} + \alpha^{*} (\overline{u}_{2} - \overline{u}_{1})} \right)\) will be equal to zero under some α * > 1.
Since vector \(\overline{w}_{3}\) is orthogonal to some face Γ ⊂ T B , this face contains at least one unbounded edge and one terminal point.
Points of face Γ are not efficient in model (9a, 9b) with domination cones. Let us dwell on this in detail.
According to the assumption, unit (X 1, Y 1) is efficient in model (1a–1d) and in model (9a, 9b). Optimal dual variables w 1 = (v 1, u 1) are associated with efficient unit (X 1, Y 1) in both model (1a–1d) and model (9a, 9b). Hence vectors w 1, \(\overline{w}_{1}\) belongs to some cone W 1 that is included in problem (1a–1d) in order to get problem (9a). Unit (X 2, Y 2) is efficient in problem (1a–1d) and inefficient in problem (9a, 9b). Hence vector w 2 = (v 2, u 2) associated with unit (X 2, Y 2) does not belong to cone W 1.
Since vectors w 2, \(\overline{w}_{2}\) and cone \(W_{1}\) are convex sets, we can construct a hyper-plane
where β ∈ E m+r, w ∈ E m+r, b is a scalar, that separates these two vectors and cone W 1, or, in other words,
It follows from (17), that
under α * > 1.
Hence vectors w 3 and \(\overline{w}_{3} = {{w_{3}} \mathord{\left/{\vphantom {{w_{3}} {\left\| {w_{3}} \right\|_{2}}}} \right. \kern-0pt} {\left\| {w_{3}} \right\|_{2}}}\) do not belong to cone W 1.
Thus, points of face Γ associated with dual vector w 3 are inefficient in model (9a, 9b), and among units of face Γ there exist terminal points.
Now, let K ⊆ E n+ . The inclusion of cone K in model (9a, 9b) may only expand the production possibility set T B , therefore the number of efficient units in model (1a–1d) that become inefficient in model (9a, 9b) may only increase.
This completes the proof.
Appendix 2
Rights and permissions
About this article
Cite this article
Krivonozhko, V.E., Førsund, F.R. & Lychev, A.V. Terminal units in DEA: definition and determination. J Prod Anal 43, 151–164 (2015). https://doi.org/10.1007/s11123-013-0375-6
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11123-013-0375-6