Far out or alone in the crowd: a taxonomy of peers in DEA

Edvardsen, Dag Fjeld; Førsund, Finn R.; Kittelsen, Sverre A. C.

doi:10.1007/s11123-007-0066-2

Far out or alone in the crowd: a taxonomy of peers in DEA

Published: 29 December 2007

Volume 29, pages 201–210, (2008)
Cite this article

Download PDF

Access provided by CONRICYT-eBooks

Journal of Productivity Analysis Aims and scope Submit manuscript

Far out or alone in the crowd: a taxonomy of peers in DEA

Download PDF

Dag Fjeld Edvardsen¹,
Finn R. Førsund^2,3 &
Sverre A. C. Kittelsen³

568 Accesses
10 Citations
Explore all metrics

Abstract

A method is presented for classifying strongly efficient units in DEA as interior or exterior, and as self-evaluators or active peers. The exterior strongly efficient units are found by running the enveloping procedure “from below”. There is no firm production-function evidence of the efficiency of exterior self-evaluators. Interior self-evaluators are more likely to have active peers as neighbours in more directions and may therefore represent technology. When performing a second stage regression analysis of efficiency scores, exterior self-evaluators should be removed. The proportion of exterior active peers also provides information on whether the variable specification is supported by the data.

Red Queen Among Organizations, the

What hampers innovation? External stakeholders, the organization, groups and individuals: a systematic review of empirical barrier research

Article 30 January 2015

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

The calculation of efficiency scores for production units based on a non-parametric piecewise linear frontier production function is well established within the last two decades. Originally introduced by Farrell (1957) the method was further developed in Charnes et al. (1978), where the term the Data envelopment analysis (DEA) model was coined.

Given the variable specification, the assumed scale properties (e.g. variable or constant returns to scale), and the orientation of the efficiency measures (e.g. input-saving or output-increasing), the DEA method calculates an efficiency measure of 1.0 for a number of units. The motivation for the classification that we propose below is that this number does not always convey much information about the actual performance of the unit in question. Among the units with an efficiency score of 1.0, the strongly efficient units, termed peers, span the frontier. The classification of some of these units as efficient may not be based on the occurrence of some similar observations, but may be due to the method only. We are referring to units which are classified as being self-evaluators in the literature, a concept introduced by Charnes et al. (1985a, p. 110), which we formalise in the input and output direction below. ^{Footnote 1} Our main contribution is to suggest a second partitioning of the peers of a model-run into interior and exterior peers. We argue that an efficiency score of 1.0 conveys very little information on the performance of the group of peers that are both exterior and self-evaluators, given the variable, scale and orientation specification of the model under which these efficiency measures are calculated.

Our approach is one of several that are interested in classifying the observations that are measured as efficient by the method, i.e. the peers. ^{Footnote 2} In the literature there are approaches to detect those peers that are extreme in some sense. One approach is concerned with exploring whether efficient observations have undue influence. Andersen and Petersen (1993) proposed to rank efficient units by calculating what later was called super-efficiency scores in order to discriminate between efficient units and deem some peers as more efficient than others. Wilson (1995) picked up this idea and proposed to detect outliers that may unduly influence the efficiency by weighing the super-efficiency score with the average change in efficiency scores for the remaining units when removing the efficient unit being investigated from the data on which the frontier is based. Torgersen et al. (1996) proposed a measure of influence of units that takes into account the amount of potential output increase or input saving that an efficient unit is referencing. The taxonomy we define below is not concerned with influence, but rather with classifying peers as to the information the status implies about their own efficiency.

This perspective is shared with the concept of “efficient by default” suggested by Tulkens (1993), but our method has a different classification of efficient units. The free disposal hull (FDH)-approach (Deprins et al. 1984) is based on the basic notion of dominance in the input–output space. Among those that are on the FDH—frontier, i.e. those that are undominated, a unit is called “efficient by default” if it dominates no other units. In DEA, the input or output orientation of the efficiency measures may influence the classification we will propose when the location of the inefficient units behind the frontier matters, while in FDH some measures will be invariant since dominance is based on both the output- and input-dimension simultaneously. We will return to the differences in Sect. 2 below.

Self-evaluators may most naturally appear at the “edges” of the technology, but it is also possible that self-evaluators appear in the interior. One contribution of this article is to distinguish between those self-evaluators that are exterior and those that are interior. ^{Footnote 3} The motivation for this dichotomy is that much less is known about the performance of the exterior self-evaluators than the interior units. An approach often followed in actual investigations ^{Footnote 4} is to find the influence of some variables on the level of efficiency by running regressions of efficiency scores on a set of potential explanatory variables. Using exterior self-evaluators with efficiency score of 1.0 may then distort the results, because to assign the value of 1 to these self-evaluators is arbitrary. In the case of interior self-evaluators, on the other hand, neighbouring peers may be fairly similar. Interior self evaluators need not therefore be dropped when applying the second stage regression approach.

In addition to classifying self evaluators, the remaining peers can also be characterised in a similar way that may be instructive for interpretation of results within DEA. The pattern of location of peers as being interior or exterior gives information about which parts of the frontier are relatively best supported by data, and in the case of variable returns to scale will illuminate the empirical support for scale variation.

The plan of the paper is to review the role of peers in DEA models in Sect. 1 and establish the new taxonomy for peers. In Sect. 2 the method for classifying the peers is introduced. Actual data are presented in Sect. 3 and the method for classifying peers is applied. Section 4 concludes.

1 Self-evaluators and active peers

1.1 DEA models

Consider a set, J, of production units transforming multiple inputs into multiple outputs. Let y _mj be an output and x _nj an input $(m\,{\in}\,M,n\,{\in}\,N,j\,{\in}\,J).$ As the reference for the units in efficiency analyses we want to calculate a piecewise linear frontier based on observations, fitting as closely as possible and obeying some fundamental assumptions, like free disposal, and the technology set being convex and closed, as usually entertained (Banker et al. 1984; Färe and Primont 1995). This frontier can be found by solving the following LP problem, termed the additive model in the DEA literature (Charnes et al. 1985b), for each unit i:

$$ Sl_i =\hbox{Max}\left\{ \begin{array}{l} \mathop{\sum}\limits_{m \in M} s_{mi}^+ +\mathop{\sum}\limits_{n \in N} s_{ni}^-,\,\, \hbox{s.t.}\\ \mathop{\sum}\limits_{j \in J} \omega_{ij} y_{mj} -y_{mi} -s_{mi}^+ =0, \\ x_{ni} -\mathop{\sum}\limits_{j \in J} \omega_{ij} x_{nj} -s_{ni}^- =0, \\ \mathop{\sum}\limits_{j \in J} \omega_{ij} =1 \\ s_{mi}^+ ,s_{ni}^- , \omega_{ij} \geq 0,n \in N,m \in M \\ \end{array} \right\} $$

(1)

The frontier is found by maximising the sum of the slacks, Sl _i, on the output constraints, $s_{mi}^+,$ and input constraints, $s_{ni}^-.$ The last equality constraint in Eq. 1 imposes variable returns to scale (VRS) on the frontier, while dropping this constraint imposes constant returns to scale (CRS). Our analysis will be valid for both scale assumptions. The strongly efficient units (using the terminology of Charnes et al. 1985b) are identified by the optimal sum of the slacks being zero and therefore all the slack variables being zero for these units.^{Footnote 5} The set of strongly efficient units, P, and the set of inefficient units, Q, are then defined as:

$$ \begin{aligned} P&=\left\{ i \in J:Sl_i =0 \right\} \\ Q&=\left\{ i \in J: Sl_i > 0 \right\} \\ \end{aligned} $$

(2)

Since the maximal value in Eq. 1 is unique, this is a partitioning of the set of units J, by which we mean that P and Q are disjoint sets $P\cap Q={\O},$ whose union is the original set of units $P\cup Q=J.$

So far we only have slacks as measures of inefficiency. If we want only one efficiency measure for each unit, and a measure that is independent of units of measurement, the radial Farrell (1957) measure of technical inefficiency is the natural choice. The standard DEA model on primal (enveloping) form, is set up as a problem of determining the Farrell technical efficiency score, either in the input direction, E _1i, or the output direction, E _2i, for an observation, i. The following LP model is formulated for each observation in the case of input-orientation:

$$ E_{1i} =\hbox{Min} \left\{ \theta_i \hbox{ s.t. } \sum_{j \in P} \lambda_{ij} y_{mj} \geq y_{mi} , \theta_i x_{ni} \geq \sum_{j \in P} \lambda_{ij} x_{nj} ,\sum_{j \in P} \lambda_{ij} =1, \lambda_{ij} \geq 0,n \in N,m \in M \right\} $$

(3)

In the case of output orientation we have the following LP program:

$$ E_{2i} =1/\hbox{Max} \left\{ \phi_i \hbox{ s.t. } \sum_{j \in P} \mu_{ij} y_{mj} \geq \phi_i y_{mi} , x_{ni} \geq \sum_{j \in P} \mu_{ij} x_{nj} ,\sum_{j \in P} \mu_{ij} =1, \mu_{ij} \geq 0,n \in N,m \in M \right\} $$

(4)

The proportionality factors, θ_i or ϕ_i, and the weights, λ_ij or μ_ij, are the endogenous variables in these LP-problems. For compactness Eqs. 3 and 4 use inequalities instead of explicit slacks, but the implied production possibility set is the same as in Eq. 1.

The point defined by the weighted sum of inputs and outputs in Eqs. 3 and 4 serves as the frontier reference point for unit i, and this point is a convex combination of strongly efficient units from the set P, which in this setting can be termed peers.^{Footnote 6} The peers that participate in the reference point of an inefficient unit can be identified by a positive weight, but these weights are not always unique. To ensure that any strongly efficient unit that could participate is counted as a peer, we define the maximal weights consistent with Eqs. 3 and 4 by

$$ \tilde{\lambda}_{ij} =\hbox{Max} \left\{ \lambda_{ij} \hbox{ s.t. } \sum_{j \in P} \lambda_{ij} y_{mj} \geq y_{mi} , E_{1i} x_{ni} \geq \sum_{j \in P} \lambda_{ij} x_{nj} ,\sum_{j \in P} \lambda_{ij} =1, \lambda_{ij} \geq 0,n \in N,m \in M \right\} $$

(5)

$$ \tilde{\mu}_{ij} =\hbox{Max}\left\{ \mu_{ij} \hbox{ s.t. } \sum_{j \in P} \mu_{ij} y_{mj} \geq \frac{1}{E_{2i}}y_{mi} , x_{ni} \geq \sum_{j \in P} \mu_{ij} x_{nj} ,\sum_{j \in P} \mu_{ij} =1, \mu_{ij} \geq 0,n \in N,m \in M \right\} $$

(6)

where the radial factors have been replaced by their optimal values (i.e. their efficiencies) from Eqs. 3 and 4.

Following Edvardsen and Førsund (2003), for each peer, p, we can define the referencing set in input and output direction,^{Footnote 7} respectively, as:

$$ \begin{aligned} J_1^p &=\left\{ i \in J:i\neq p,\tilde{\lambda}_{ip} > 0 \right\},\quad p \in P\\ J_2^p &=\left\{ i \in J:i\neq p,\tilde{\mu}_{ip} > 0 \right\},\quad p \in P\\ \end{aligned} $$

(7)

1.2 The self-evaluators

Each of the referencing sets in Eq. 7 may be empty, in which case the peer unit p is called a self-evaluator in the respective orientation:

Definition 1

A strongly efficient unit p ∈ P, is a self-evaluator in the input direction if $J_1^P ={\O},$ and in the output direction if $J_2^P ={\O}.$ ^{Footnote 8}

The set of peers may thus in each orientation be partitioned into a set of self-evaluators and a set of active peers, i.e. peers with non-empty referencing sets. In order to simplify the notation the four partitioned sets will be denoted $S_{1},{\,\,}S_{2}$ and $A_{1},\,\, A_{2}\,\, (P=S_1 \cup A_1 =S_2 \cup A_2)$ for self-evaluators and active peers, respectively:

$$ \begin{array}{ll} \hbox{Input partition:} &S_1 =\left\{ p \in P:J_1^p ={\O} \right\},\,\, A_1 =\left\{ p \in P:J_1^p \neq{\O} \right\}\\ \hbox{Output partition:} &S_2 =\left\{ p \in P:J_2^p ={\O} \right\},\,\, A_2 =\left\{ p \in P:J_2^p \neq{\O} \right\} \end{array} $$

(8)

The self-evaluators are so called not just because they reference themselves, but because they only reference themselves. The vertex points for the facets that make up the DEA frontier are strongly efficient units, and each such unit may be a vertex point for many facets. Our definition of a self-evaluator implies that there are no reference points for other units on any of its facets.

2 Exterior and interior peers

2.1 Enveloping from below

The purpose of this article is to develop a method for classification of strongly efficient units into exterior or interior active peers, and into exterior or interior self-evaluators using only the standard DEA format. The production set is by construction convex. If all inefficient units are removed from the data set, and a new run is done with only the strongly efficient units, we will find the exterior peers by reversing the enveloping of the data from “above” to be from “below”. All that needs to be done is to reverse the inequalities in the LP program Eq. 1 by adding the slack variables instead of subtracting, thus for each strongly efficient unit i ∈ P:

$$ \widetilde{Sl}_i =\hbox{Max}\left\{ \begin{array}{l} \mathop{\sum}\limits_{m \in M} s_{mi}^+ +\mathop{\sum}\limits_{n \in N} s_{ni}^- ,\,\, \hbox{s.t.} \\ \mathop{\sum}\limits_{j \in P} \omega_{ij} y_{mj} -y_{mi} +s_{mi}^+ =0, \\ x_{ni} -\mathop{\sum}\limits_{j \in P} \omega_{ij} x_{nj} +s_{ni}^- =0, \\ \mathop{\sum}\limits_{j \in P} \omega_{ij} =1 \\ s_{mi}^+ ,s_{ni}^- , \omega_{ij} \geq 0,m \in M,n \in N \\ \end{array} \right\} $$

(9)

The units that turn out as “efficient” in the solution to Eq. 9, in the sense that all slacks are zero, must be units belonging to the exterior facets^{Footnote 9} in the solution to the original model Eq. 1 due to convexity of the production possibility set. We will use this result to define exterior and interior strongly efficient units:

Definition 2

A strongly efficient unit i ∈ P is exterior iff $\widetilde{Sl}_i =0$ .

This forms the basis of the partitioning of the set P into exterior and interior strongly efficient units:

$$ \begin{aligned} E&=\left\{ i \in P:\widetilde{Sl}_i =0 \right\} \\ I&=\left\{ i \in P: \widetilde{Sl}_i > 0 \right\} \end{aligned} $$

(10)

This partition does not depend on the orientation of the model. Combining the partition into exterior and interior strongly efficient units with the partition into self-evaluators and active peers we get several possible partitions that do depend on orientation. Often the researcher is working with a model that for some reason has a natural orientation, i.e. input-saving efficiency for cost-minimising firms, or output-increasing efficiency for a budget-constrained public service provider. In such cases one could partition the strongly efficient units into four sets for the orientation in question, i.e. the exterior or interior self-evaluators (superscript E and I, respectively), and the exterior or interior active peers (superscript E and I, respectively):

$$ \begin{array}{ll} \hbox{Input partition: } &S_1^E =E\cap S_1 ,\,\, S_1^I =I\cap S_1 ,\,\, A_1^E =E\cap A_1 ,\,\, A_1^I =I\cap A_1. \\ \hbox{Output partition: } &S_2^E =E\cap S_2,\,\, S_2^I =I\cap S_2,\,\, A_2^E =E\cap A_2 ,\,\, A_2^I =I\cap A_2 . \end{array} $$

(11)

It is the exterior self-evaluators in $S_1^E$ or $S_2^E$ in input or output orientation, respectively, that are “far out”, since they neither are peers to any other units, nor are in the interior of the input-output mixes. Each of these criteria reduces the information content,^{Footnote 10} but in combination there is very little we can say about performance.

An exterior self-evaluator in the input direction can sometimes be an active peer in the output direction, or visa versa, though still exterior. A full partition would list the eight sets of strongly efficient units that result from using all possible combinations of the three partitionings in Eqs. 8 and 10, and this would be particularly interesting in the cases where there is no natural orientation of the efficiency model. Among these, the set $S_1^E \cap S_2^E$ contains the units we know the least about. An empirical example is discussed in subsection 3.2 below.

Table 1 Impact of orientation on classification

Full size table

Figure 1 shows the two different cases of exterior and interior peers and the subdivision into active peers and self evaluators in the simplest case of two dimensions. The observations represented by points A, B, C, D, F and G are efficient, while H is inefficient. Considering output-orientation, the peers are D and F, and the reference point is d. To illustrate the referencing set of a peer, the shaded area in Fig. 1 shows the referencing zone for the efficient unit D in the case of output orientation. All the inefficient units being in unit D’s referencing set must be located here (such inefficient units may also appear in referencing sets of other peers; e.g. unit F’s). If no observations are located in the referencing zone of a peer then this peer is a self-evaluator. Removal of such a self-evaluator will not change the efficiency scores for any other units. We would expect the self-evaluators to be extreme points in one or more of the mix or scale dimensions, but if the referencing zone is narrow a self-evaluator may also be interior, i.e. centrally placed within the set of observations. A narrow zone means that other peers are close to the self-evaluator.

We see that the classification of peers is dependent on the orientation of the efficiency measure. In the output direction, we have that both units B and C are interior self-evaluators, while units A and G are exterior self-evaluators, and units D and F are interior active peers.

Considering input orientation, the radial reference or projection point for unit H is a. The reference point defined by Eq. 5 coincides with the peer A that is the only active peer, and in addition it is exterior. We have that the units B, C, D and F are interior self-evaluators, while G is an exterior one. In both cases of orientation the unit G could have been observed anywhere between the line g’ (the continuation of the line DF) and the line g” (referenced by F), without any unit changing its estimated efficiency or its status as peer. The efficiency score of 1 assigned to unit G therefore contains little information. In, e.g., the output oriented case we see that there is a considerable scope for output variation for a given input yielding the efficiency score of 1.

Illustrating the enveloping from below using Fig. 1, we have that the new “from below frontier” will be the line from A to G, thus these units are the only ones on the “from below frontier” and therefore exterior points in E. This classification is independent of orientation, and they are both located on exterior facets in the original problem Eq. 1. In the case of output orientation, the self-evaluators B and C, according to the solution to problem Eq. 4, will not appear on the new frontier, and they are therefore interior according to Definition 3. The self-evaluators A and G appear on the new frontier and are therefore exterior. In the case of input orientation solving problem Eq. 4 gives B, C, D, F and G as self-evaluators, and we have that B, C, D, and F are interior self-evaluators and G an exterior one. While A is an exterior peer in input orientation, it is not a self-evaluator. In the case of CRS, facets without any reference points may also be found in the interior of the frontier surface with respect to mixes, while for VRS interior also means interior regarding scale.

Figure 2 provides another type of illustration. In a two-dimensional input space an isoquant is shown formed by the efficient units A, B, C and D. Consider input orientation and CRS. Assuming inefficient units are only located northeast of the isoquant segment AB in the cone delimited by the rays going through the points A and B, we have that C and D are self-evaluators, and A and B are active peers. Running the “reverse” program Eq. 9 we will envelope the four peers from “behind” by the broken line from A to D. We then know that units A and D are exterior peers, and using the information from running the DEA model Eq. 1 we then have that unit C is an interior self evaluator, and unit D is an exterior self evaluator.

The difference between the Tulkens (1993) concept of efficient by default within the FDH-approach and the self-evaluator concept within DEA can be illustrated by considering unit C on the frontier. This unit is dominating the inefficient unit F, and thus is not efficient by default, but it is an interior self-evaluator within our DEA approach.^{Footnote 11}

Olesen and Petersen (1996) provide indicators of ill-conditioned data sets concerning using the shadow prices on output- and input constraints in the linear programming problems calculating the efficiency scores in Eqs. 3 or 4 to measure substitution characteristics. For exterior facets one or more of these shadow prices will be zero, thus reducing the possibility of measuring substitution. Data sets are termed ill conditioned if a relatively large number of units are located in an area where exterior facets are used as references in the efficiency evaluation. An exterior peer must be a vertex of at least one exterior facet. The proportion of exterior peers can therefore be used as an indicator of the data set in this respect; the higher the proportion the more ill-conditioned is the data set.

2.2 Identifying the exterior peers

The program Eq. 9 is not the standard DEA additive formulation, since the sign of the slacks in the restrictions on inputs and outputs have been changed. However, by negating these equalities, Eq. 9 can be rewritten as:

$$ \widetilde{Sl}_i =\hbox{Max}\left\{ \begin{array}{l} \mathop{\sum}\limits_{m \in M} s_{mi}^+ +\mathop{\sum}\limits_{n \in N} s_{ni}^- ,\,\, \hbox{s.t.} \\ \mathop{\sum}\limits_{j \in P} \omega_{ij} x_{nj} -x_{ni} -s_{ni}^- =0, \\ y_{mi} -\mathop{\sum}\limits_{j \in P} \omega_{ij} y_{mj} -s_{mi}^+ =0, \\ \mathop{\sum}\limits_{j \in P} \omega_{ij} =1 \\ s_{mi}^+ ,s_{ni}^- , \omega_{ij} \geq 0,m \in M,n \in N \\ \end{array} \right\} $$

(12)

Comparing Eqs. 1 and 12 we see that these are identical except that inputs and outputs are exchanged. Since existing DEA software often will solve the additive model Eq. 1, we may as well for convenience find the set of exterior peers E by exchanging inputs and outputs and running Eq. 12 on the strongly efficient units, rather than running Eq. 9 on these units.

3 An empirical application

3.1 The data

We apply the method for determining interior and exterior peers and self-evaluators on a cross section data set of the nursing and home care sector of Norwegian municipalities with 469 municipalities as units. The data is found in Edvardsen et al. (2000). The model was first formulated in Erlandsen and Førsund (2002) based on expert advice on how to define outputs and inputs using available statistics. Resource usage is measured by financial data and number of man-years of different categories. Production data contains mainly the number of clients dealt with by institutionalised nursing, home based nursing, and practical assistance. Quality information is lacking, but the clients are split into some age groups that may be of significance for resource use.^{Footnote 12}

There are three inputs in the model; Trained nurses, Other Employees (both measured in man-years), and Other expenses (measured in 1000 NOK (Norwegian crowns)). There are 10 outputs: Institutions age 0–66 and Institutions age 67+ are the number of institutionalised clients in the age groups 0–66 and above 67, respectively. Short-term stay shows how many visits the institutions in the municipality have gotten from clients who are not residents, while Closed ward shows how many of the residents are in a special ward for dementia clients. Single person room is the number of rooms with single occupancy, and represents a quality variable. Mentally disabled shows how many of the clients are mentally disabled (almost all of these clients get home care). Practical assistance 0–66 and Practical assistance 67+ counts how many clients get practical assistance (such as cleaning and making food) in the indicated age groups, while Home based nursing 0–66 and Home based nursing 67+ count the same for clients getting nursing services in their own homes.

The number of variables is rather extensive compared with other DEA studies of the sector, and may contribute to the relative high incidence of self-evaluators found.

3.2 The taxonomy of peers

Running the output-oriented LP model (4) assuming variable returns to scale resulted in 129 of the 469 municipalities being classified as strongly efficient. Both large and small units are found among efficient units. The average efficiency is 86%, while the efficiency of the (arithmetic) average unit is 67%.

An overview of the taxonomy developed in Sects. 2 and 3 for classification of units is given in Fig. 3, together with the actual decomposition for the data set at hand. The figure will be read from left to right. In view of the relatively large number of observations (469) it may be surprising that as many as 28% of the units are efficient (set P). This may be due to the unusually high number of dimensions, 13 variables in all. Since the efficient units span out the frontier technology it is to be expected that the number of exterior ones (set E) is higher than the interior ones (set I), 75 and 25%, respectively. Turning to the Farrell efficiency model (4) for output-orientation at the right-hand top of the figure, the self-evaluators (set S ₂) represent 22% of the efficient units.^{Footnote 13} As expected the relative share of exterior peers (set $S_{2}^{E}$) is larger in the group of self-evaluators than in the group of active peers, 86 versus 72%. The number of interior self-evaluators (set $S_{2}^{I}$) is rather small. Among the active peers the share of interior units (set $ A_{2}^{I}$) is much higher, 28%, while the share of active exterior peers (set $A_{2}^{E}$) is still as high as 72%. These distributions are of importance for the empirical support of the frontier and the associated efficiency distribution. A rather similar distribution of the different categories is found for input-orientation in the lower right-hand corner of the figure.

In order to see the details of the impact of the two orientations on our classifications a detailed break-down is offered in Table 1. (The grey cells cannot contain any numbers by definition.) The numbers in the column- and row sums in the table correspond to the right-hand column of boxes in Fig. 3. We see that elements on the main diagonal dominates, i.e. if a unit is classified as an exterior or interior self-evaluator within one orientation, or an active exterior or interior peer, it tends to be so also within the other orientation. However, we have some numbers in the off-diagonal boxes, e.g. six units that are classified as exterior self-evaluators within input orientation are classified as active exterior peers within output orientation.

3.3 Far out or alone in the crowd

The location of the interior and exterior self evaluators can be measured by the relative distance from the total sample average unit. ^{Footnote 14} The interior self-evaluators in our application are on both sides of the average, and one of the four units is quite close to the sample average. While the exterior self-evaluators are mostly either very large or very small, the interior self-evaluators are quite different and tend to be mid-sized on most variables. It seems appropriate to use the expression “alone in the crowd.”

The 25 exterior self-evaluators measured in output orientation are distributed with half above and half below the sample average. One unit has maximal sample values for two of the variables. There are several output variables with zero as the lower limit. The variable Institutions age 0–66 has seven exterior units with the minimum value of zero; while for Closed ward there are eight exterior units with the minimum value of zero. So given that “far out” means both small and large units the exterior units deserve well this classification. The influence of extreme mixes may also be investigated, but due to all the possible comparison we leave this exercise out.

4 Conclusions

The units found strongly efficient in DEA studies can be divided into self-evaluators and active peers, depending on whether the peers are referencing any inefficient units or not. The contribution of the paper starts with subdividing the peers into interior and exterior ones. The exterior self-evaluators are efficient “by default”; there is no firm evidence from observations for the classification. If the set of efficient units is used for identifying “winners”, or efficiency scores are used within a yardstick competition scheme, then exterior self evaluators should be excluded. Self-evaluators may most naturally appear at the “edges” of the technology, but it is also possible that self-evaluators appear in the interior. Finding the influence of some variables on the level of efficiency by running regressions of efficiency scores on a set of potential explanatory variables is an approach often followed in actual investigations. Using exterior self-evaluators with efficiency score of 1 in such a “two-stage” procedure may then distort the results, because to assign the value of 1 to these self-evaluators is arbitrary. Interior self-evaluators, on the other hand, are more likely to be fairly similar to active peers since they have neighbours in more directions. They should then not be dropped when applying the two- stage approach. ^{Footnote 15}

A method for classifying self-evaluators based on the additive DEA model, either CRS or VRS, has been developed. The exterior strongly efficient units are found by running the enveloping procedure “from below”, i.e. reversing the signs of the slack variables in the additive model Eq. 1, after removing all the inefficient units from the data set. Which of the strongly efficient units from the additive model Eq. 1 that turn out to be self-evaluators or active peers, will depend on the orientation of the efficiency analysis, i.e. whether input-or output orientation is adopted. The classification into exterior and interior peers is determined by the strongly efficient units zero or positive slacks in the “reversed” additive model Eq. 9 or the equivalent model Eq. 12.

The empirical application showed that the majority of self evaluators were exterior ones, implying that one should be careful using that part of the DEA frontier as representing the estimate of technology. But also the majority of active peers were exterior, giving rise to some concern about the quality of the estimated frontier since exterior peers are associated with at least one exterior facet that have at least one shadow price on an input-or output constraint that is not non-zero and finite. Scale- and substitution properties will then not exists for all variable combinations. As pointed out in Olesen and Petersen (1996, p. 206); “insufficient variation in data may imply estimation of a frontier actually located in subspaces of lower dimension than the input output space.” The result may be taken as an indication of the necessity of investigating the frontier estimate further by, e.g. applying bootstrap techniques (Simar and Wilson 1998). It will be an interesting task for future research if the proposed peer taxonomy could be related to bootstrap results for the size of confidence intervals for efficiency scores. Preliminary bootstrap runs on the dataset indicate that the efficiency estimates of exterior peers tend to have larger standard errors than the estimates for the interior peers.

Notes

As far as we know such a formalisation is not found in the literature. Charnes et al. (1985a, p. 110) defines a self-evaluator as “A DMU which appears only in its own evaluation facet...”.
In Charnes et al. (1986) the set of all DMUs is partitioned into six sets, but neither active peers and self-evaluators, nor external and internal self-evaluators are covered.
Charnes et al. (1985a, p. 110) seemed to have only exterior self-evaluators in mind since the reason offered for a DMU being a self-evaluator was that it is “...being characterized as efficient only because of specialized properties which set it off from other DMUs.”
The approach was originally introduced in Seitz (1967, 1971), inspired by Nerlove (1965); see Førsund and Sarafoglou (2002). Simar and Wilson (2007) review the approach and find it at fault in general due to serial correlation between the efficiency scores, and provide a new statistically sound procedure based on specifying explicitly the data generating process and bootstrapping to obtain confidence intervals.
These units are strongly rather than weekly efficient since it is not possible to decrease (increase) any input (output) without changing the level of at least one other variable. This parallels the concept of Pareto efficiency, although Pareto efficiency in economics is used for comparisons of the utilities of individuals when evaluating an allocation of a good.
Because our interest is in the partitioning of the set of strongly efficient units P, the summations of Eqs. 3 and 4 use this set, instead of using J which is more common in the literature. This also improves the computational efficiency, since the membership of P is already determined in Eq. 2.
As for the efficiency measures, we follow the Farrell tradition of using subscript 1 to denote the input direction and subscript 2 to denote the output direction.
An alternate definition could be in terms of the reference shares defined in Torgersen et al. (1996), where a self-evaluator has a reference share of zero.
By an exterior facet we mean a facet with at least one shadow price that is not non-zero and finite. In two dimensions this corresponds to the vertical or horizontal segments of the frontier in diagrams such as Figs. 1 and 2.
As one referee rightly points out, an exterior unit can be light-years away from other units, but still be an active peer. This should however make it eligible for an outlier analysis, i.e. a screening of units that should enter the DEA model. In our article we are concerned with how much we can say about the efficiency of units that are presumed to be observed without error, which could be viewed as a screening of units before any use of the DEA-estimated efficiencies, e.g. in a second stage regression analysis.
One could extend the concepts of self-evaluators to the FDH setting. In the minimax approach of FDH, each inefficient unit such as F is referenced by one (though not necessarily unique) FDH peer, which in Fig. 2 would be unit B. The FDH referencing set corresponding to Eq. 7 would still be empty for unit C, making it a self-evaluator also in the FDH method, even though it is not efficient by default. Since we in this article are interested in the information content of the efficiency measures under a convexity assumption, it is the DEA classification which is relevant here.
To ensure that the data quality was good enough extensive quality control was performed. We strongly feel that one should not automatically remove outliers, but if possible contact the municipality in question and ask if the data is correct. This is especially important if the methodology is frontier based (such as DEA) because the units defining the frontier are outliers by definition. This led to many changes in the dataset and required quite a lot of work, but as a result we could be much more confident in the quality of the data (see Aas (2000) for details).
In the empirical application the the multipliers in Eqs. 3 and 4 are unlikely to be non-unique, and if they are this will have only a minor influence on the results presented. We have therefore used these original weights rather than the maximal weights from Eqs. 5 and 6 in computing the partitions into self-evaluators and active peers.
A detailed analysis is found in Edvardsen et al. (2003).
A simple regression analysis carried out in Edvardsen et al. (2003) shows that removing the 25 exterior self evaluators increases the multiple correlation coefficient and the significance level of some key explanatory variables.

References

Andersen P, Petersen NC (1993) A procedure for ranking efficient units in data envelopment analysis. Manage Sci 39(10):1261–1264
Google Scholar
Banker RD, Charnes A, Cooper WW (1984) Some models for estimating technical and scale inefficiencies. Manage Sci 30:1078–1092
Google Scholar
Charnes A, Cooper WW, Rhodes E (1978) Measuring the efficiency of decision making units. Eur J Oper Res 2:429–444
Article Google Scholar
Charnes A, Cooper WW, Thrall RM (1986) Classifying and characterizing efficiencies and inefficiencies in data envelopment analysis. Oper Res Lett 5(3):105–110
Article Google Scholar
Charnes A, Clark CT, Cooper WW, Golany B (1985a) A developmental study of data envelopment analysis in measuring the efficiency of maintenance units in the U.S. Air Forces. Ann Oper Res 2:95–112
Article Google Scholar
Charnes A, Cooper WW, Golany B, Seiford L, Stutz J (1985b) Foundations of data envelopment analysis for Pareto–Koopmans efficient empirical production functions. J Econom 30:91–107
Article Google Scholar
Deprins D, Simar L, Tulkens H (1984) Measuring labor efficiency in post offices. In: Marchand M, Pestieau P, Tulkens H (eds) The performance of public enterprises: concepts and measurements. North-Holland, Amsterdam, pp 243–267
Google Scholar
Edvardsen DF, Førsund FR (2003) International benchmarking of electricity distribution utilities. Resour Energy Econ 25:353–371
Article Google Scholar
Edvardsen DF, Førsund FR, Kittelsen SAC (2003) Far out or alone in the crowd: classification of self-evaluators in DEA. HERO Working Paper 7/2003. Health Economics Research Programme at the University of Oslo
Edvardsen DF, Førsund FR, Aas E (2000) Effektivitet i pleie- og omsorgssektoren [Efficiency in the nursing- and home care sector]. Rapport 2/2000, Oslo: Frischsenteret
Erlandsen E, Førsund FR (2002) Efficiency in the provision of municipal nursing-and home care services: the Norwegian experience. In: Fox KJ (eds) Efficiency in the public sector, chapter 10. Kluwer, Boston/Dordrecht/London, pp 273–300
Google Scholar
Färe R, Primont D (1995) Multi output production and duality: theory and applications. Kluwer, Boston/Dordrecht/London
Google Scholar
Farrell MJ (1957) The measurement of productive efficiency. J R Stat Soc Ser A 120(III):253–281
Google Scholar
Førsund FR, Sarafoglou N (2002) On the origins of data envelopment analysis. J Prod Anal 17:23–40
Article Google Scholar
Nerlove M (1965) Estimation and identification of Cobb–Douglas production functions. North-Holland Publishing Company, Amsterdam
Google Scholar
Olesen OB, Petersen NC (1996) Indicators of ill-conditioned data sets and model misspecification in data envelopment analysis: an extended facet approach. Manage Sci 42(2):205–219
Google Scholar
Seitz WD (1967) Efficiency measures for steam-electric generating plants. Western farm economic association, proceedings 1966, Pullman, Washington, pp 143–151
Seitz WD (1971) Productive efficiency in the steam-electric generating industry. J Polit Econ 79:878–886
Article Google Scholar
Simar L, Wilson PW (1998) Sensitivity analysis of efficiency scores: how to bootstrap in nonparametric Frontier models. Manage Sci 44:49–61
Article Google Scholar
Simar L, Wilson PW (2007) Estimation and inference in two-stage, semi-parametric models of production processes. J Econom 136(1):31–64
Article Google Scholar
Torgersen AM, Førsund FR, Kittelsen SAC (1996) Slack-adjusted efficiency measures and ranking of efficient units. J Prod Anal 7:379–39
Article Google Scholar
Tulkens H (1993) On FDH efficiency analysis: some methodological issues and applications to retail banking, courts and urban transit. J Prod Anal 4(1–2):183–210
Google Scholar
Wilson PW (1995) Detecting influential observations in data envelopment analysis. J Prod Anal 6:27–45
Article Google Scholar
Aas E (2000) På leting etter målefeil – en studie av pleie- og omsorgssektoren [In search of measurement error—a study of the home care and nursing sector]. Notater 2000:10, Statistics Norway, Oslo

Download references

Author information

Authors and Affiliations

SINTEF Building and Infrastructure, Oslo, Norway
Dag Fjeld Edvardsen
Department of Economics, University of Oslo, Oslo, Norway
Finn R. Førsund
The Frisch Centre, Gaustadalléen 21, 0349, Oslo, Norway
Finn R. Førsund & Sverre A. C. Kittelsen

Authors

Dag Fjeld Edvardsen
View author publications
You can also search for this author in PubMed Google Scholar
Finn R. Førsund
View author publications
You can also search for this author in PubMed Google Scholar
Sverre A. C. Kittelsen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sverre A. C. Kittelsen.

Additional information

The paper is based on results from the Frisch Centre project “Better and Cheaper?” and written within the project “Efficiency analyses of the nursing and home care sector of Norway” at the Health Economics Research Programme at the University of Oslo (HERO) and the Frisch Centre. Both projects are financed by the Norwegian research council (NFR). We thank the editor and two anonymous referees for valuable comments and very constructive suggestions.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Edvardsen, D.F., Førsund, F.R. & Kittelsen, S.A.C. Far out or alone in the crowd: a taxonomy of peers in DEA. J Prod Anal 29, 201–210 (2008). https://doi.org/10.1007/s11123-007-0066-2

Download citation

Published: 29 December 2007
Issue Date: June 2008
DOI: https://doi.org/10.1007/s11123-007-0066-2

Keywords

JEL Classifications

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Far out or alone in the crowd: a taxonomy of peers in DEA

Abstract