The calculation of efficiency scores for production units based on a non-parametric piecewise linear frontier production function is well established within the last two decades. Originally introduced by Farrell (1957) the method was further developed in Charnes et al. (1978), where the term the Data envelopment analysis (DEA) model was coined.

Given the variable specification, the assumed scale properties (e.g. variable or constant returns to scale), and the orientation of the efficiency measures (e.g. input-saving or output-increasing), the DEA method calculates an efficiency measure of 1.0 for a number of units. The motivation for the classification that we propose below is that this number does not always convey much information about the actual performance of the unit in question. Among the units with an efficiency score of 1.0, the strongly efficient units, termed peers, span the frontier. The classification of some of these units as efficient may not be based on the occurrence of some similar observations, but may be due to the method only. We are referring to units which are classified as being self-evaluators in the literature, a concept introduced by Charnes et al. (1985a, p. 110), which we formalise in the input and output direction below. Footnote 1 Our main contribution is to suggest a second partitioning of the peers of a model-run into interior and exterior peers. We argue that an efficiency score of 1.0 conveys very little information on the performance of the group of peers that are both exterior and self-evaluators, given the variable, scale and orientation specification of the model under which these efficiency measures are calculated.

Our approach is one of several that are interested in classifying the observations that are measured as efficient by the method, i.e. the peers. Footnote 2 In the literature there are approaches to detect those peers that are extreme in some sense. One approach is concerned with exploring whether efficient observations have undue influence. Andersen and Petersen (1993) proposed to rank efficient units by calculating what later was called super-efficiency scores in order to discriminate between efficient units and deem some peers as more efficient than others. Wilson (1995) picked up this idea and proposed to detect outliers that may unduly influence the efficiency by weighing the super-efficiency score with the average change in efficiency scores for the remaining units when removing the efficient unit being investigated from the data on which the frontier is based. Torgersen et al. (1996) proposed a measure of influence of units that takes into account the amount of potential output increase or input saving that an efficient unit is referencing. The taxonomy we define below is not concerned with influence, but rather with classifying peers as to the information the status implies about their own efficiency.

This perspective is shared with the concept of “efficient by default” suggested by Tulkens (1993), but our method has a different classification of efficient units. The free disposal hull (FDH)-approach (Deprins et al. 1984) is based on the basic notion of dominance in the input–output space. Among those that are on the FDH—frontier, i.e. those that are undominated, a unit is called “efficient by default” if it dominates no other units. In DEA, the input or output orientation of the efficiency measures may influence the classification we will propose when the location of the inefficient units behind the frontier matters, while in FDH some measures will be invariant since dominance is based on both the output- and input-dimension simultaneously. We will return to the differences in Sect. 2 below.

Self-evaluators may most naturally appear at the “edges” of the technology, but it is also possible that self-evaluators appear in the interior. One contribution of this article is to distinguish between those self-evaluators that are exterior and those that are interior. Footnote 3 The motivation for this dichotomy is that much less is known about the performance of the exterior self-evaluators than the interior units. An approach often followed in actual investigations Footnote 4 is to find the influence of some variables on the level of efficiency by running regressions of efficiency scores on a set of potential explanatory variables. Using exterior self-evaluators with efficiency score of 1.0 may then distort the results, because to assign the value of 1 to these self-evaluators is arbitrary. In the case of interior self-evaluators, on the other hand, neighbouring peers may be fairly similar. Interior self evaluators need not therefore be dropped when applying the second stage regression approach.

In addition to classifying self evaluators, the remaining peers can also be characterised in a similar way that may be instructive for interpretation of results within DEA. The pattern of location of peers as being interior or exterior gives information about which parts of the frontier are relatively best supported by data, and in the case of variable returns to scale will illuminate the empirical support for scale variation.

The plan of the paper is to review the role of peers in DEA models in Sect. 1 and establish the new taxonomy for peers. In Sect. 2 the method for classifying the peers is introduced. Actual data are presented in Sect. 3 and the method for classifying peers is applied. Section 4 concludes.

1 Self-evaluators and active peers

1.1 DEA models

Consider a set, J, of production units transforming multiple inputs into multiple outputs. Let y mj be an output and x nj an input \((m\,{\in}\,M,n\,{\in}\,N,j\,{\in}\,J).\) As the reference for the units in efficiency analyses we want to calculate a piecewise linear frontier based on observations, fitting as closely as possible and obeying some fundamental assumptions, like free disposal, and the technology set being convex and closed, as usually entertained (Banker et al. 1984; Färe and Primont 1995). This frontier can be found by solving the following LP problem, termed the additive model in the DEA literature (Charnes et al. 1985b), for each unit i:

$$ Sl_i =\hbox{Max}\left\{ \begin{array}{l} \mathop{\sum}\limits_{m \in M} s_{mi}^+ +\mathop{\sum}\limits_{n \in N} s_{ni}^-,\,\, \hbox{s.t.}\\ \mathop{\sum}\limits_{j \in J} \omega_{ij} y_{mj} -y_{mi} -s_{mi}^+ =0, \\ x_{ni} -\mathop{\sum}\limits_{j \in J} \omega_{ij} x_{nj} -s_{ni}^- =0, \\ \mathop{\sum}\limits_{j \in J} \omega_{ij} =1 \\ s_{mi}^+ ,s_{ni}^- , \omega_{ij} \geq 0,n \in N,m \in M \\ \end{array} \right\} $$
(1)

The frontier is found by maximising the sum of the slacks, Sl i , on the output constraints, \(s_{mi}^+,\) and input constraints, \(s_{ni}^-.\) The last equality constraint in Eq. 1 imposes variable returns to scale (VRS) on the frontier, while dropping this constraint imposes constant returns to scale (CRS). Our analysis will be valid for both scale assumptions. The strongly efficient units (using the terminology of Charnes et al. 1985b) are identified by the optimal sum of the slacks being zero and therefore all the slack variables being zero for these units.Footnote 5 The set of strongly efficient units, P, and the set of inefficient units, Q, are then defined as:

$$ \begin{aligned} P&=\left\{ i \in J:Sl_i =0 \right\} \\ Q&=\left\{ i \in J: Sl_i > 0 \right\} \\ \end{aligned} $$
(2)

Since the maximal value in Eq. 1 is unique, this is a partitioning of the set of units J, by which we mean that P and Q are disjoint sets \(P\cap Q={\O},\) whose union is the original set of units \(P\cup Q=J.\)

So far we only have slacks as measures of inefficiency. If we want only one efficiency measure for each unit, and a measure that is independent of units of measurement, the radial Farrell (1957) measure of technical inefficiency is the natural choice. The standard DEA model on primal (enveloping) form, is set up as a problem of determining the Farrell technical efficiency score, either in the input direction, E 1i , or the output direction, E 2i , for an observation, i. The following LP model is formulated for each observation in the case of input-orientation:

$$ E_{1i} =\hbox{Min} \left\{ \theta_i \hbox{ s.t. } \sum_{j \in P} \lambda_{ij} y_{mj} \geq y_{mi} , \theta_i x_{ni} \geq \sum_{j \in P} \lambda_{ij} x_{nj} ,\sum_{j \in P} \lambda_{ij} =1, \lambda_{ij} \geq 0,n \in N,m \in M \right\} $$
(3)

In the case of output orientation we have the following LP program:

$$ E_{2i} =1/\hbox{Max} \left\{ \phi_i \hbox{ s.t. } \sum_{j \in P} \mu_{ij} y_{mj} \geq \phi_i y_{mi} , x_{ni} \geq \sum_{j \in P} \mu_{ij} x_{nj} ,\sum_{j \in P} \mu_{ij} =1, \mu_{ij} \geq 0,n \in N,m \in M \right\} $$
(4)

The proportionality factors, θ i or ϕ i , and the weights, λ ij or μ ij , are the endogenous variables in these LP-problems. For compactness Eqs. 3 and 4 use inequalities instead of explicit slacks, but the implied production possibility set is the same as in Eq. 1.

The point defined by the weighted sum of inputs and outputs in Eqs. 3 and 4 serves as the frontier reference point for unit i, and this point is a convex combination of strongly efficient units from the set P, which in this setting can be termed peers.Footnote 6 The peers that participate in the reference point of an inefficient unit can be identified by a positive weight, but these weights are not always unique. To ensure that any strongly efficient unit that could participate is counted as a peer, we define the maximal weights consistent with Eqs. 3 and 4 by

$$ \tilde{\lambda}_{ij} =\hbox{Max} \left\{ \lambda_{ij} \hbox{ s.t. } \sum_{j \in P} \lambda_{ij} y_{mj} \geq y_{mi} , E_{1i} x_{ni} \geq \sum_{j \in P} \lambda_{ij} x_{nj} ,\sum_{j \in P} \lambda_{ij} =1, \lambda_{ij} \geq 0,n \in N,m \in M \right\} $$
(5)
$$ \tilde{\mu}_{ij} =\hbox{Max}\left\{ \mu_{ij} \hbox{ s.t. } \sum_{j \in P} \mu_{ij} y_{mj} \geq \frac{1}{E_{2i}}y_{mi} , x_{ni} \geq \sum_{j \in P} \mu_{ij} x_{nj} ,\sum_{j \in P} \mu_{ij} =1, \mu_{ij} \geq 0,n \in N,m \in M \right\} $$
(6)

where the radial factors have been replaced by their optimal values (i.e. their efficiencies) from Eqs. 3 and 4.

Following Edvardsen and Førsund (2003), for each peer, p, we can define the referencing set in input and output direction,Footnote 7 respectively, as:

$$ \begin{aligned} J_1^p &=\left\{ i \in J:i\neq p,\tilde{\lambda}_{ip} > 0 \right\},\quad p \in P\\ J_2^p &=\left\{ i \in J:i\neq p,\tilde{\mu}_{ip} > 0 \right\},\quad p \in P\\ \end{aligned} $$
(7)

1.2 The self-evaluators

Each of the referencing sets in Eq. 7 may be empty, in which case the peer unit p is called a self-evaluator in the respective orientation:

Definition 1

A strongly efficient unit p ∈ P, is a self-evaluator in the input direction if \(J_1^P ={\O},\) and in the output direction if \(J_2^P ={\O}.\) Footnote 8

The set of peers may thus in each orientation be partitioned into a set of self-evaluators and a set of active peers, i.e. peers with non-empty referencing sets. In order to simplify the notation the four partitioned sets will be denoted \(S_{1},{\,\,}S_{2}\) and \(A_{1},\,\, A_{2}\,\, (P=S_1 \cup A_1 =S_2 \cup A_2)\) for self-evaluators and active peers, respectively:

$$ \begin{array}{ll} \hbox{Input partition:} &S_1 =\left\{ p \in P:J_1^p ={\O} \right\},\,\, A_1 =\left\{ p \in P:J_1^p \neq{\O} \right\}\\ \hbox{Output partition:} &S_2 =\left\{ p \in P:J_2^p ={\O} \right\},\,\, A_2 =\left\{ p \in P:J_2^p \neq{\O} \right\} \end{array} $$
(8)

The self-evaluators are so called not just because they reference themselves, but because they only reference themselves. The vertex points for the facets that make up the DEA frontier are strongly efficient units, and each such unit may be a vertex point for many facets. Our definition of a self-evaluator implies that there are no reference points for other units on any of its facets.

2 Exterior and interior peers

2.1 Enveloping from below

The purpose of this article is to develop a method for classification of strongly efficient units into exterior or interior active peers, and into exterior or interior self-evaluators using only the standard DEA format. The production set is by construction convex. If all inefficient units are removed from the data set, and a new run is done with only the strongly efficient units, we will find the exterior peers by reversing the enveloping of the data from “above” to be from “below”. All that needs to be done is to reverse the inequalities in the LP program Eq. 1 by adding the slack variables instead of subtracting, thus for each strongly efficient unit i ∈ P:

$$ \widetilde{Sl}_i =\hbox{Max}\left\{ \begin{array}{l} \mathop{\sum}\limits_{m \in M} s_{mi}^+ +\mathop{\sum}\limits_{n \in N} s_{ni}^- ,\,\, \hbox{s.t.} \\ \mathop{\sum}\limits_{j \in P} \omega_{ij} y_{mj} -y_{mi} +s_{mi}^+ =0, \\ x_{ni} -\mathop{\sum}\limits_{j \in P} \omega_{ij} x_{nj} +s_{ni}^- =0, \\ \mathop{\sum}\limits_{j \in P} \omega_{ij} =1 \\ s_{mi}^+ ,s_{ni}^- , \omega_{ij} \geq 0,m \in M,n \in N \\ \end{array} \right\} $$
(9)

The units that turn out as “efficient” in the solution to Eq. 9, in the sense that all slacks are zero, must be units belonging to the exterior facetsFootnote 9 in the solution to the original model Eq. 1 due to convexity of the production possibility set. We will use this result to define exterior and interior strongly efficient units:

Fig. 1
figure 1

DEA taxonomy

Fig. 2
figure 2

Reversing the DEA program

Definition 2

A strongly efficient unit i ∈ P is exterior iff \(\widetilde{Sl}_i =0\) .

This forms the basis of the partitioning of the set P into exterior and interior strongly efficient units:

$$ \begin{aligned} E&=\left\{ i \in P:\widetilde{Sl}_i =0 \right\} \\ I&=\left\{ i \in P: \widetilde{Sl}_i > 0 \right\} \end{aligned} $$
(10)

This partition does not depend on the orientation of the model. Combining the partition into exterior and interior strongly efficient units with the partition into self-evaluators and active peers we get several possible partitions that do depend on orientation. Often the researcher is working with a model that for some reason has a natural orientation, i.e. input-saving efficiency for cost-minimising firms, or output-increasing efficiency for a budget-constrained public service provider. In such cases one could partition the strongly efficient units into four sets for the orientation in question, i.e. the exterior or interior self-evaluators (superscript E and I, respectively), and the exterior or interior active peers (superscript E and I, respectively):

$$ \begin{array}{ll} \hbox{Input partition: } &S_1^E =E\cap S_1 ,\,\, S_1^I =I\cap S_1 ,\,\, A_1^E =E\cap A_1 ,\,\, A_1^I =I\cap A_1. \\ \hbox{Output partition: } &S_2^E =E\cap S_2,\,\, S_2^I =I\cap S_2,\,\, A_2^E =E\cap A_2 ,\,\, A_2^I =I\cap A_2 . \end{array} $$
(11)

It is the exterior self-evaluators in \(S_1^E\) or \(S_2^E\) in input or output orientation, respectively, that are “far out”, since they neither are peers to any other units, nor are in the interior of the input-output mixes. Each of these criteria reduces the information content,Footnote 10 but in combination there is very little we can say about performance.

An exterior self-evaluator in the input direction can sometimes be an active peer in the output direction, or visa versa, though still exterior. A full partition would list the eight sets of strongly efficient units that result from using all possible combinations of the three partitionings in Eqs. 8 and 10, and this would be particularly interesting in the cases where there is no natural orientation of the efficiency model. Among these, the set \(S_1^E \cap S_2^E\) contains the units we know the least about. An empirical example is discussed in subsection 3.2 below.

Table 1 Impact of orientation on classification

Figure 1 shows the two different cases of exterior and interior peers and the subdivision into active peers and self evaluators in the simplest case of two dimensions. The observations represented by points A, B, C, D, F and G are efficient, while H is inefficient. Considering output-orientation, the peers are D and F, and the reference point is d. To illustrate the referencing set of a peer, the shaded area in Fig. 1 shows the referencing zone for the efficient unit D in the case of output orientation. All the inefficient units being in unit D’s referencing set must be located here (such inefficient units may also appear in referencing sets of other peers; e.g. unit F’s). If no observations are located in the referencing zone of a peer then this peer is a self-evaluator. Removal of such a self-evaluator will not change the efficiency scores for any other units. We would expect the self-evaluators to be extreme points in one or more of the mix or scale dimensions, but if the referencing zone is narrow a self-evaluator may also be interior, i.e. centrally placed within the set of observations. A narrow zone means that other peers are close to the self-evaluator.

We see that the classification of peers is dependent on the orientation of the efficiency measure. In the output direction, we have that both units B and C are interior self-evaluators, while units A and G are exterior self-evaluators, and units D and F are interior active peers.

Considering input orientation, the radial reference or projection point for unit H is a. The reference point defined by Eq. 5 coincides with the peer A that is the only active peer, and in addition it is exterior. We have that the units B, C, D and F are interior self-evaluators, while G is an exterior one. In both cases of orientation the unit G could have been observed anywhere between the line g’ (the continuation of the line DF) and the line g” (referenced by F), without any unit changing its estimated efficiency or its status as peer. The efficiency score of 1 assigned to unit G therefore contains little information. In, e.g., the output oriented case we see that there is a considerable scope for output variation for a given input yielding the efficiency score of 1.

Illustrating the enveloping from below using Fig. 1, we have that the new “from below frontier” will be the line from A to G, thus these units are the only ones on the “from below frontier” and therefore exterior points in E. This classification is independent of orientation, and they are both located on exterior facets in the original problem Eq. 1. In the case of output orientation, the self-evaluators B and C, according to the solution to problem Eq. 4, will not appear on the new frontier, and they are therefore interior according to Definition 3. The self-evaluators A and G appear on the new frontier and are therefore exterior. In the case of input orientation solving problem Eq. 4 gives B, C, D, F and G as self-evaluators, and we have that B, C, D, and F are interior self-evaluators and G an exterior one. While A is an exterior peer in input orientation, it is not a self-evaluator. In the case of CRS, facets without any reference points may also be found in the interior of the frontier surface with respect to mixes, while for VRS interior also means interior regarding scale.

Figure 2 provides another type of illustration. In a two-dimensional input space an isoquant is shown formed by the efficient units A, B, C and D. Consider input orientation and CRS. Assuming inefficient units are only located northeast of the isoquant segment AB in the cone delimited by the rays going through the points A and B, we have that C and D are self-evaluators, and A and B are active peers. Running the “reverse” program Eq. 9 we will envelope the four peers from “behind” by the broken line from A to D. We then know that units A and D are exterior peers, and using the information from running the DEA model Eq. 1 we then have that unit C is an interior self evaluator, and unit D is an exterior self evaluator.

The difference between the Tulkens (1993) concept of efficient by default within the FDH-approach and the self-evaluator concept within DEA can be illustrated by considering unit C on the frontier. This unit is dominating the inefficient unit F, and thus is not efficient by default, but it is an interior self-evaluator within our DEA approach.Footnote 11

Olesen and Petersen (1996) provide indicators of ill-conditioned data sets concerning using the shadow prices on output- and input constraints in the linear programming problems calculating the efficiency scores in Eqs. 3 or 4 to measure substitution characteristics. For exterior facets one or more of these shadow prices will be zero, thus reducing the possibility of measuring substitution. Data sets are termed ill conditioned if a relatively large number of units are located in an area where exterior facets are used as references in the efficiency evaluation. An exterior peer must be a vertex of at least one exterior facet. The proportion of exterior peers can therefore be used as an indicator of the data set in this respect; the higher the proportion the more ill-conditioned is the data set.

2.2 Identifying the exterior peers

The program Eq. 9 is not the standard DEA additive formulation, since the sign of the slacks in the restrictions on inputs and outputs have been changed. However, by negating these equalities, Eq. 9 can be rewritten as:

$$ \widetilde{Sl}_i =\hbox{Max}\left\{ \begin{array}{l} \mathop{\sum}\limits_{m \in M} s_{mi}^+ +\mathop{\sum}\limits_{n \in N} s_{ni}^- ,\,\, \hbox{s.t.} \\ \mathop{\sum}\limits_{j \in P} \omega_{ij} x_{nj} -x_{ni} -s_{ni}^- =0, \\ y_{mi} -\mathop{\sum}\limits_{j \in P} \omega_{ij} y_{mj} -s_{mi}^+ =0, \\ \mathop{\sum}\limits_{j \in P} \omega_{ij} =1 \\ s_{mi}^+ ,s_{ni}^- , \omega_{ij} \geq 0,m \in M,n \in N \\ \end{array} \right\} $$
(12)

Comparing Eqs. 1 and 12 we see that these are identical except that inputs and outputs are exchanged. Since existing DEA software often will solve the additive model Eq. 1, we may as well for convenience find the set of exterior peers E by exchanging inputs and outputs and running Eq. 12 on the strongly efficient units, rather than running Eq. 9 on these units.

3 An empirical application

3.1 The data

We apply the method for determining interior and exterior peers and self-evaluators on a cross section data set of the nursing and home care sector of Norwegian municipalities with 469 municipalities as units. The data is found in Edvardsen et al. (2000). The model was first formulated in Erlandsen and Førsund (2002) based on expert advice on how to define outputs and inputs using available statistics. Resource usage is measured by financial data and number of man-years of different categories. Production data contains mainly the number of clients dealt with by institutionalised nursing, home based nursing, and practical assistance. Quality information is lacking, but the clients are split into some age groups that may be of significance for resource use.Footnote 12

There are three inputs in the model; Trained nurses, Other Employees (both measured in man-years), and Other expenses (measured in 1000 NOK (Norwegian crowns)). There are 10 outputs: Institutions age 0–66 and Institutions age 67+ are the number of institutionalised clients in the age groups 0–66 and above 67, respectively. Short-term stay shows how many visits the institutions in the municipality have gotten from clients who are not residents, while Closed ward shows how many of the residents are in a special ward for dementia clients. Single person room is the number of rooms with single occupancy, and represents a quality variable. Mentally disabled shows how many of the clients are mentally disabled (almost all of these clients get home care). Practical assistance 0–66 and Practical assistance 67+ counts how many clients get practical assistance (such as cleaning and making food) in the indicated age groups, while Home based nursing 0–66 and Home based nursing 67+ count the same for clients getting nursing services in their own homes.

The number of variables is rather extensive compared with other DEA studies of the sector, and may contribute to the relative high incidence of self-evaluators found.

3.2 The taxonomy of peers

Running the output-oriented LP model (4) assuming variable returns to scale resulted in 129 of the 469 municipalities being classified as strongly efficient. Both large and small units are found among efficient units. The average efficiency is 86%, while the efficiency of the (arithmetic) average unit is 67%.

An overview of the taxonomy developed in Sects. 2 and 3 for classification of units is given in Fig. 3, together with the actual decomposition for the data set at hand. The figure will be read from left to right. In view of the relatively large number of observations (469) it may be surprising that as many as 28% of the units are efficient (set P). This may be due to the unusually high number of dimensions, 13 variables in all. Since the efficient units span out the frontier technology it is to be expected that the number of exterior ones (set E) is higher than the interior ones (set I), 75 and 25%, respectively. Turning to the Farrell efficiency model (4) for output-orientation at the right-hand top of the figure, the self-evaluators (set S 2) represent 22% of the efficient units.Footnote 13 As expected the relative share of exterior peers (set \(S_{2}^{E}\)) is larger in the group of self-evaluators than in the group of active peers, 86 versus 72%. The number of interior self-evaluators (set \(S_{2}^{I}\)) is rather small. Among the active peers the share of interior units (set \( A_{2}^{I}\)) is much higher, 28%, while the share of active exterior peers (set \(A_{2}^{E}\)) is still as high as 72%. These distributions are of importance for the empirical support of the frontier and the associated efficiency distribution. A rather similar distribution of the different categories is found for input-orientation in the lower right-hand corner of the figure.

Fig. 3
figure 3

The taxonomy of units in DEA efficiency analyses

In order to see the details of the impact of the two orientations on our classifications a detailed break-down is offered in Table 1. (The grey cells cannot contain any numbers by definition.) The numbers in the column- and row sums in the table correspond to the right-hand column of boxes in Fig. 3. We see that elements on the main diagonal dominates, i.e. if a unit is classified as an exterior or interior self-evaluator within one orientation, or an active exterior or interior peer, it tends to be so also within the other orientation. However, we have some numbers in the off-diagonal boxes, e.g. six units that are classified as exterior self-evaluators within input orientation are classified as active exterior peers within output orientation.

3.3 Far out or alone in the crowd

The location of the interior and exterior self evaluators can be measured by the relative distance from the total sample average unit. Footnote 14 The interior self-evaluators in our application are on both sides of the average, and one of the four units is quite close to the sample average. While the exterior self-evaluators are mostly either very large or very small, the interior self-evaluators are quite different and tend to be mid-sized on most variables. It seems appropriate to use the expression “alone in the crowd.”

The 25 exterior self-evaluators measured in output orientation are distributed with half above and half below the sample average. One unit has maximal sample values for two of the variables. There are several output variables with zero as the lower limit. The variable Institutions age 0–66 has seven exterior units with the minimum value of zero; while for Closed ward there are eight exterior units with the minimum value of zero. So given that “far out” means both small and large units the exterior units deserve well this classification. The influence of extreme mixes may also be investigated, but due to all the possible comparison we leave this exercise out.

4 Conclusions

The units found strongly efficient in DEA studies can be divided into self-evaluators and active peers, depending on whether the peers are referencing any inefficient units or not. The contribution of the paper starts with subdividing the peers into interior and exterior ones. The exterior self-evaluators are efficient “by default”; there is no firm evidence from observations for the classification. If the set of efficient units is used for identifying “winners”, or efficiency scores are used within a yardstick competition scheme, then exterior self evaluators should be excluded. Self-evaluators may most naturally appear at the “edges” of the technology, but it is also possible that self-evaluators appear in the interior. Finding the influence of some variables on the level of efficiency by running regressions of efficiency scores on a set of potential explanatory variables is an approach often followed in actual investigations. Using exterior self-evaluators with efficiency score of 1 in such a “two-stage” procedure may then distort the results, because to assign the value of 1 to these self-evaluators is arbitrary. Interior self-evaluators, on the other hand, are more likely to be fairly similar to active peers since they have neighbours in more directions. They should then not be dropped when applying the two- stage approach. Footnote 15

A method for classifying self-evaluators based on the additive DEA model, either CRS or VRS, has been developed. The exterior strongly efficient units are found by running the enveloping procedure “from below”, i.e. reversing the signs of the slack variables in the additive model Eq. 1, after removing all the inefficient units from the data set. Which of the strongly efficient units from the additive model Eq. 1 that turn out to be self-evaluators or active peers, will depend on the orientation of the efficiency analysis, i.e. whether input-or output orientation is adopted. The classification into exterior and interior peers is determined by the strongly efficient units zero or positive slacks in the “reversed” additive model Eq. 9 or the equivalent model Eq. 12.

The empirical application showed that the majority of self evaluators were exterior ones, implying that one should be careful using that part of the DEA frontier as representing the estimate of technology. But also the majority of active peers were exterior, giving rise to some concern about the quality of the estimated frontier since exterior peers are associated with at least one exterior facet that have at least one shadow price on an input-or output constraint that is not non-zero and finite. Scale- and substitution properties will then not exists for all variable combinations. As pointed out in Olesen and Petersen (1996, p. 206); “insufficient variation in data may imply estimation of a frontier actually located in subspaces of lower dimension than the input output space.” The result may be taken as an indication of the necessity of investigating the frontier estimate further by, e.g. applying bootstrap techniques (Simar and Wilson 1998). It will be an interesting task for future research if the proposed peer taxonomy could be related to bootstrap results for the size of confidence intervals for efficiency scores. Preliminary bootstrap runs on the dataset indicate that the efficiency estimates of exterior peers tend to have larger standard errors than the estimates for the interior peers.