Keywords

14.1 Introduction

It is commonly observed that the DEA model proposed by Charnes et al. (1978) is a “black box” that receives inputs and produces outputs, but the transformation process by which this occurs is opaque to the analyst. As Tone and Tsutsui (2014) remark, “One of the drawbacks of these models is the omission of the internal structure of the DMUs.” Färe et al. (2007a) built on Shephard and Färe (1975) with a sequence of models where the interior (“black box”) of the CCR model could be analyzed. The primary device for achieving this was the use of a network. The insight they had was that when multiple DEA models are connected in a network, the network itself is a DEA model, and can be calculated using linear programming.

In this chapter we extend and update our paper (Färe et al. 2007a) with additional discussion of DEA sub-technologies, objective functions, and static and dynamic DEA models. We start with a discussion of the CCR model from an axiomatic perspective. Then we turn to objective functions, which can be either price dependent or not. In the case where objective functions are price independent, we discuss both distance functions and slack-based functions. The static network model is introduced next, starting with a generic model and extending it to three common cases. We end the chapter with dynamic DEA. The models discussed in this chapter are relatively simple, but provide the tools for construction of models describing arbitrarily complex processes.

14.2 The Black Box and Sub-technologies

Production models are frequently modeled as a black box, where inputs x = (x 1, …,x N ) ∈ ℝ N+ are transformed into outputs y = (y 1, …,y M ) ∈ ℝ M+ (Fig. 14.1).

Fig. 14.1
figure 1

The black box

In this chapter we go inside the black box and define sub-technologies as its smallest building blocks. First, we establish an axiomatic structure for the models, then discuss the connection of sub-technologies through a directed network.

A technology or sub-technology may modeled as

$$ T=\left\{\left(x,y\right):x\kern0.24em can\kern0.24em produce\kern0.24em y\right\} $$

or by its output sets,

$$ P(x)=\left\{y:\left(x,y\right)\in T\right\}\kern0.36em x\in {\mathbb{R}}_{+}^N $$

or by its input sets,

$$ L(y)=\left\{x:\left(x,y\right)\in T\right\}, $$

and it holds that

$$ y\in P(x)\iff \left(x,y\right)\in T\iff x\in L(y). $$

Suppose that we have k = k, …, K observations of inputs and good outputs (x k,y k), then we can formulate the activity analysis or data envelopment analysis model as

$$ \begin{array}{l}T=\left\{\left(x,y\right):{\displaystyle \sum_{k=1}^K{z}_k{x}_{kn}\le {x}_n},\kern0.36em n=1,\dots, N,\right.\\ {}\kern5.6em {\displaystyle \sum_{k=1}^K{z}_n{y}_{kn}\ge {y}_m,\kern0.36em m=1,\dots, M,}\\ {}\left.\kern8.25em {z}_k\ge 0,\kern0.36em k=1,\dots, K\right\}\end{array} $$

or equivalently,

$$ \begin{array}{l}P(x)=\left\{y:{\displaystyle \sum_{k=1}^K{z}_n{x}_{kn}\le {x}_n},\kern0.36em n=1,\dots, N,\right.\\ {}\kern5em {\displaystyle \sum_{k=1}^K{z}_m{y}_{km}\ge {y}_m,\kern0.36em m=1,\dots, M,}\\ {}\left.\kern8em {z}_k\ge 0,\kern0.36em k=1,\dots, K\right\}\end{array} $$

i.e.,

$$ \begin{array}{l}L(y)=\left\{x:{\displaystyle \sum_{k=1}^K{z}_k{x}_{kn}\le {x}_n},\kern0.36em n=1,\dots, N,\right.\\ {}\kern5em {\displaystyle \sum_{k=1}^K{z}_k{y}_{km}\ge {y}_m,\kern0.36em m=1,\dots, M,}\\ {}\left.\kern7.7em {z}_k\ge 0,\kern0.36em k=1,\dots, K\right\}.\end{array} $$

If the data meet the Kemeny et al. (1956) conditions, then the technology satisfies the following conditions;

$$ (i)\kern0.36em P(0)=\left\{0\right\};\kern0.36em \mathrm{no}\ \mathrm{free}\ \mathrm{lunch}, $$
$$ (ii)\kern0.36em P(x)\kern0.36em \mathrm{is}\ \mathrm{bounded}\ \mathrm{for}\ \mathrm{each}\ \mathrm{x};\mathrm{scarcity}, $$
$$ (iii)\kern0.24em T\ \mathrm{is}\ \mathrm{closed}. $$

Additional properties, from the definitions are;

$$ (iv)\kern0.36em T\ \mathrm{is}\ \mathrm{convex}\ \left(\Rightarrow L(y)\ \mathrm{and}\ P(x)\ \mathrm{are}\ \mathrm{convex}\right), $$
$$ (v)\kern0.36em x^{\prime}\ge x\in L(y)\Rightarrow x^{\prime}\in L(y);\mathrm{free}\ \mathrm{disposability}\ \mathrm{of}\ \mathrm{inputs}, $$
$$ (vi)\kern0.36em y^{\prime}\le y\in P(x)\Rightarrow y^{\prime}\in P(x);\mathrm{free}\ \mathrm{disposability}\ \mathrm{of}\ \mathrm{outputs}, $$
$$ (vii)\kern0.36em T\ \mathrm{is}\ \mathrm{a}\ \mathrm{cone};\mathrm{constant}\ \mathrm{returns}\ \mathrm{to}\ \mathrm{scale}\ \left(\mathrm{CRS}\right). $$

The last condition holds, since the intensity variables z i are only non-negative. If, in addition to non-negativity, \( {\displaystyle {\sum}_{k=1}^K} \) z k  = 1, variable returns to scale are modeled. Conditions (v) and (vi) follow from the inequalities of the inputs and outputs expressions.

$$ \begin{array}{l}(a)\kern5.2em {\displaystyle \sum_{k=1}^K{y}_{km}>0\kern3em m=1,\dots, M}\\ {}(b)\kern5em {\displaystyle \sum_{m=1}^M{y}_{km}>0\kern3.12em k=1,\dots, K}\\ {}(c)\kern5em {\displaystyle \sum_{k=1}^K{x}_{kn}>0\kern3.12em n=1,\dots, N}\\ {}(d)\kern5em {\displaystyle \sum_{n=1}^N{x}_{kn}>0\kern3.12em k=1,\dots, K}\end{array} $$

Output condition (a) states that each output is produced by some k (DMU), and (b) requires that each activity produce some output. The input conditions (c) and (d) say that each input is used by some k and that each k uses at least one input. If z k  ≥ 0, k = 1, …, K and \( {\displaystyle {\sum}_{k=1}^K} \) z k  ≤ 1, the technology exhibits non-increasing returns to scale.

Before we link technologies (and/or sub-technologies) together to model processes within the black box, we study optimization problems on a technology. These problems include profit and revenue maximization, cost minimization and distance function measures.

14.3 Objective Functions

Network models are frequently applied in performance measurement, including optimal resource allocations. An objective function that specifies the evaluation of measurement or allocation is required for optimization. Two types of objective functions are used; (i) those that require prices, and (ii) those that require only measures of inputs and outputs (quantity functions). These quantity functions can be either slack based or distance functions.

The distance functions have their duals among the objective functions involving prices. The most common are; the profit function dual to the directional technology function, the revenue function dual to Shephard’s output distance function, and the cost function dual to Shephard’s input distance function. The slack based objective functions do not have natural duals (Färe et al. 2007b).

Let g = (g x ,g y ) ∈ ℝ N + M+ be a directional vector for inputs (g x ) and outputs (g y ). This vector provides the direction in which a given input/output observation is projected onto the frontier of T. The optimization problem that defines the directional technology distance function is

$$ {\overrightarrow{D}}_T\left(x,y;g\right)= \max \left\{\beta :\left(x-\beta {g}_x,y+\beta {g}_y\right)\in T\right\} $$

Inputs are contracted while outputs are expanded.Footnote 1 Under “g-disposability” this function characterizes T, i.e.,

$$ {\overrightarrow{D}}_T\left(x,y;g\right)\ge 0\iff \left(x,y\right)\in T, $$

and it can be used as a measure of technical efficiency.

If we choose the directional vector to equal g = (0,g y ), then we have a directional output distance function

$$ {\overrightarrow{D}}_0\left(x,y;{g}_y\right)= \max \left\{\beta :\left(y+\beta {g}_y\right)\in P(x)\right\}, $$

which may also be a measure of technical efficiency, now output based.

It is interesting to note how \( {\overrightarrow{D}}_0\left(x,y;{g}_y\right) \) is related to Shephard’s output distance function. The latter is defined as

$$ {D}_i\left(x,y\right)= \min \left\{\lambda :y/\lambda \in P(x)\right\}. $$

Shephard’s output distance function or its reciprocal is the Farrell output oriented measure of technical efficiency.

To relate the two output distance function to each other, choose g y  = y, then

$$ {\overrightarrow{D}}_0\left(x,y;y\right)=1-1/{D}_0\left(x,y\right) $$

Thus, we have shown how the directional output distance function generalizes Shephard’s (radial) output distance function. Similarly, we may choose g = (g x ,0) to obtain a directional input distance function

$$ {\overrightarrow{D}}_i\left(x,y;{g}_x\right)= \max \left\{\beta :\left(x-\beta {g}_x\right)\in L(y)\right\}, $$

and relate it to Shephard’s input distance function defined as

$$ {D}_i\left(y,x\right)= \max \left\{\lambda :x/\lambda \in L(y)\right\}. $$

Choose g x  = x and we have

$$ {\overrightarrow{D}}_i\left(y,x;x\right)=-1+1/{D}_i\left(y,x\right). $$

Our derivations above show that the directional output distance function is the origin of the other four distance functions. This can be illustrated as (Fig. 14.2).

Fig. 14.2
figure 2

Relation of directional distance functions

Finally we have (Färe and Lovell 1978)

$$ {D}_0\left(x,y\right)=1/{D}_i\left(y,x\right)\iff \mathrm{CRS}. $$

The above distance functions are associated with measurement of technical efficiency. Each projects the observation in question onto a corresponding isoquant.Footnote 2

Turning to the duals, assume input prices w ∈ ℝ N+ and output prices p ∈ ℝ M+ are known. Then we can define the profit function as

$$ \begin{array}{l}\Pi \left(p,w\right)= \max \left\{ py- wx:\left(x,y\right)\in\,T\right\}\\ {}\kern3.1em = \max \left\{ py- wx:{\overrightarrow{D}}_T\left(x,y;g\right)\ge 0\right\}\end{array} $$

The last inequality holds, since the directional technology distance function characterizes T. From this it follows that (Färe and Grosskopf 2004)

$$ \frac{\Pi \left(p,w\right)-\left( py- wx\right)}{p{g}_y+w{g}_x}\ge {\overrightarrow{D}}_T\left(x,y;g\right) $$

where the LHS is the Nerlovian profit indicator, which is the normalized difference between maximal profit Π(p,w) and observed profit (py − wx). This indicator is larger than the corresponding directional distance function (a measure of technical efficiency). By adding a measure of allocative efficiency, the inequality becomes an equality, and the Nerlovian indicator is decomposed into technical and allocative efficiency (Chambers et al. 1998).

In order to derive results similar to the Nerlovian indicator, we first define the revenue and cost functions

$$ \begin{array}{l}R\left(x,p\right)= \max \left\{ py:y\in P(x)\right\}\\ {}\kern2.75em = \max \left\{ py:{\overrightarrow{D}}_0\left(x,y;{g}_y\right)\ge 0\right\}\end{array} $$

and

$$ \begin{array}{l}C\left(y,w\right)= \min \left\{ wx:x\in L(y)\right\}\\ {}\kern2.95em = \min \left\{ wx:{\overrightarrow{D}}_i\left(x,y;{g}_x\right)\ge 0\right\}.\end{array} $$

From these expressions we get

$$ \frac{R\left(x,p\right)- py}{p{g}_y}\ge {\overrightarrow{D}}_0\left(x,y;{g}_y\right) $$

and

$$ \frac{ wx-C\left(y,w\right)}{w{g}_x}\ge {\overrightarrow{D}}_i\left(x,y;{g}_x\right) $$

respectively.

Each of these inequalities can be closed by adding, as above, an allocative inefficiency component. Hence arriving at a revenue and a cost indicator with their corresponding decompositions.

Prior to a discussion of the radial distance function (Shephard 1953, 1970), we provide activity analysis/DEA models for calculating profit and the directional technology distance function. Since, under constant returns to scale, profit is zero, we choose the variable returns to scale formulation allowing for losses and profit. Maximal profit is then estimated as

$$ \begin{array}{l}\Pi \left(p,w\right)={ \max}_{\left(x,y,z\right)}\left( py- wx\right)\\ {}s.t.\kern0.36em {\displaystyle \sum_{k=1}^K{z}_k{x}_{kn}\le {x}_n},\kern0.36em n=1,\dots, N\\ {}\kern1.5em {\displaystyle \sum_{k=1}^K{z}_k{y}_{km}\ge {y}_m},\kern0.36em m=1,\dots, M\\ {}\kern1.5em {\displaystyle \sum_{k=1}^K{z}_k=1},\kern0.36em {z}_k\ge 0,\kern0.36em k=1,\dots, K\end{array} $$

One may, of course, allow the price vectors p and w to vary with the observation.

To estimate the technology distance function, the researcher may choose the directional vectors or endogenize them.Footnote 3 Here we consider the case of g = (g x , g y ) without any specific choice. Thus, for observation k , we have

$$ \begin{array}{l}{\overrightarrow{D}}_T\left({x}^{k^{\prime }},{y}^{k^{\prime }};{g}_x,{g}_y\right)= \max \beta \\ {}s.t.\kern0.36em {\displaystyle \sum_{k=1}^K{z}_k{x}_{kn}\le {x}_{k^{\prime }n}-\beta {g}_{x_n}},\kern0.36em n=1,\dots, N\\ {}\kern1.5em {\displaystyle \sum_{k=1}^K{z}_k{y}_{km}\ge {y}_{k^{\prime }m}}+\beta {g}_{y_m},\kern0.36em m=1,\dots, M\\ {}\kern1.45em {\displaystyle \sum_{k=1}^K{z}_k=1},\kern0.36em {z}_k\ge 0,\kern0.36em k=1,\dots, K.\end{array} $$

From our two calculations and the data on \( \left({x}^{k^{\prime }},{y}^{k^{\prime }}\right),\left(p,w\right) \) we may calculate the Nerlovian profit indicator. Similar problems can be formulated for the revenue and cost indicators.Footnote 4

In the discussion above, profit, revenue and cost indicators took an additive structure. The traditional Farrell (1957) decomposition of cost and revenue efficiency is multiplicative, not additive, and is based on Shephard’s (1953, 1970) distance function.

To derive the revenue measure, note that

$$ \begin{array}{l}R\left(x,p\right)= \max \left\{ py:y\in\,P(x)\right\}\\ {}\kern2.8em = \max \left\{ py:{D}_0\left(x,y\right)\le 1\right\},\end{array} $$

where the last equality holds because

$$ {D}_0\left(x,y\right)\le 1\iff y\in\,P(x). $$

From this definition of revenue, it follows (Färe and Grosskopf 2004) that

$$ \frac{R\left(x,p\right)}{ py}\ge 1/{D}_0\left(x,y\right), $$

i.e., the ratio of maximal revenue to observed revenue is larger than the reciprocal of the output distance function.Footnote 5 By multiplying the RHS with an allocative efficiency component, the Farrell decomposition of revenue efficiency is obtained. It is the product, not sum, of technical and allocative efficiencies.Footnote 6

To set the stage for the slack-based measure of technical efficiency consider the Leontief production function

$$ y= \min \left\{{x}_1,{x}_2\right\} $$

The isoquant for y = 1, is

$$ \left\{\left({x}_1,{x}_2\right): \min \left\{{x}_1,{x}_2\right\}=1\right\} $$

and the Koopmans efficient input set is

$$ \left\{{x}_1,{x}_2\right\}=\left\{1,1\right\} $$

Thus the isoquant and the efficient subset do not coincide. The following figure illustrates the relation between the isoquant and efficient subset (Fig. 14.3).

Fig. 14.3
figure 3

Leontief isoquant (I,I) and efficient set (E)

The isoquant consists of the points along II while E is the only efficient point.

The input distance functions introduced above have the property that they project an input vector onto the isoquant, and not necessarily onto the efficient point(s). Measures that project an input vector onto efficient points are the multiplicative Russell measure (Färe and Lovell 1978) and the additive slack-based measures (Färe and Grosskopf 2010; Tone 2001).

The input oriented Russell measure (RM) has the following DEA formulation

$$ \begin{array}{l}R{M}_i\left({x}^{k^{\prime }},{y}^{k^{\prime }}\right)=\frac{1}{N} \min {\displaystyle \sum_{n=1}^N{\lambda}_n}\\ {}s.t.\kern0.72em {\displaystyle \sum_{k=1}^K{z}_k{x}_{kn}}\le {\lambda}_n{x}_{k^{\prime }n},\kern0.48em n=1,\dots, N\\ {}\kern2em {\displaystyle \sum_{k=1}^K{z}_m{y}_{km}}\ge {y}_{k^{\prime }m},\kern0.48em m=1,\dots, M\\ {}\kern2em {z}_k\ge 0,\kern0.48em k=1,\dots, K\kern0.36em.\end{array} $$

If an input \( {x}_{k^{\prime }n} \) is zero, we modify the measure and set λ n  = 1. This measure is one if and only if \( {x}^{k^{\prime }} \) belongs to the efficient subset of \( L\left({y}^{k^{\prime }}\right) \), where

$$ Eff\;L(y)=\left\{x:x\in L(y),{x}^{\prime}\le x\Rightarrow {x}^{\prime}\notin L(y)\right\} $$

The input oriented slack-based measure (SB) by Färe and Grosskopf (2010) is based on the input oriented directional distance function, and has the following DEA formulation

$$ \begin{array}{l}S{B}_i\left({x}^{k^{\prime }},{y}^{k^{\prime }}\right)= \max {\displaystyle \sum_{n=1}^N{\beta}_n}\\ {}\kern1.5em {\displaystyle \sum_{k=1}^K{z}_n{y}_{kn}}\le {x}_{k^{\prime }n}-{\beta}_n\bullet {1}_n,\kern0.48em n=1,\dots, N\\ {}\kern1.5em {\displaystyle \sum_{k=1}^K{z}_m{y}_{km}}\ge {y}_{k^{\prime }m},\kern0.48em m=1,\dots, M\\ {}\kern1.5em {z}_k\ge 0,\kern0.36em k=1,\dots, K.\end{array} $$

This measure is zero if and only if \( {x}^{k^{\prime }} \) belongs to the efficient subset of \( L\left({y}^{k^{\prime }}\right) \).Footnote 7 Note that β n is independent of the unit of measurement, since the direction g n  = 1 n is in the same units as x n , thus \( {\beta}_{n_i}\forall i \) can be added.

14.4 Static Network Models

In this section we go inside the black box and model it as a network of sub-technologies. This approach has its origin in Shephard and Färe (1975), who wrote that “many production systems (technologies) may be conceptualized as the joint interaction of a finite number of production sub-technologies called activities.” The static model is useful for analyzing the allocation of intermediate products and also provides the basic structure of dynamic DEA models.

We restrict our presentation to three sub-technologies P 1, P 2, and P 3. These three sub-technologies are connected by the directed network shown in Fig. 14.4.

Fig. 14.4
figure 4

Representation of sub-technologies

To complete a network of these three sub-technologies, we add a distribution process and a sink, or collection of final outputs.Footnote 8 Inputs are denoted by \( x=\left({x}_{1,}\dots, {x}_N\right)\in {\Re}_{{}^{+}}^N \), the network exogenous vector, i.e., total availability is attached to the three sub-technologies, i0 x, i = 1, 2, 3 … where 0 denotes source and i denotes use (Fig. 14.5). For example, 10 x is the input vector from source 0 used in activity 1. The total amounts used in the three activities cannot exceed the total amount available, i.e.,

Fig. 14.5
figure 5

A network technology

$$ x\ge {\displaystyle \sum_{i=1}^3{}_0{}^ix}\kern0.5em. $$

In Fig. 14.5 activity P 1 uses 10 x as an exogenous input and produces

$$ {}_1{}^3y+{}_1{}^4y $$

as outputs. The 31 y is an input into activity 3, while 41 y is the final output from P 1. Activity P 3 uses 30 x as an exogenous input and 31 y, 32 y as intermediate inputs. The final product from activity P 3 is output vector 43 y. The total network output is the sum of the final output of the three activities,

$$ {}_1{}^4y+{}_2{}^4y+{}_3{}^4y. $$

Note that even if an activity does not produce one of the listed outputs, that output is set at zero and the network structure remains the same. It is also noteworthy that some outputs may be both final and intermediate outputs, e.g. spare parts.

Based on the description above, a generic network model takes the form

$$ \begin{array}{l}P(x)=\left\{\left({}_1{}^4y+{}_2{}^4y+{}_3{}^4y\right)\right.:\\ {}\kern3em \left({}_1{}^4y+{}_1{}^3y\right)\in {P}^1\left({}_0{}^1x\right)\\ {}\kern3em \left({}_2{}^4y+{}_2{}^3y\right)\in {P}^2\left({}_0{}^2x\right)\\ {}\kern3em {}_3{}^4y\in {P}^3\left({}_0{}^3x,{}_1{}^3y+{}_2{}^3y\right)\\ {}\kern3em \left.{}_0{}^1x+{}_0{}^2x+{}_0{}^3x\le x\right\}\kern0.36em,\end{array} $$

where P i(•) i = 1, 2, 3 are output sets, i.e., P(x) = {y : x can produce y}. Thus the network model P(x) is formed by the individual sub-technologies.

Next we formulate the network technology as a DEA model, i.e.,

$$ \begin{array}{l}P(x)=\left\{y=\left({}_1{}^4y+{}_2{}^4y+{}_3{}^4y\right)\right.:\\ {}\left(\mathrm{a}\right)\kern0.75em {}_3{}^4{y}_m\le {\displaystyle \sum_{k=1}^K{z}_k^3\kern0.24em {}_3{}^4{y}_{km},\kern0.48em m=1,\dots, M,}\\ {}\left(\mathrm{b}\right)\kern0.6em {\displaystyle \sum_{k=1}^K{z}_k^3\kern0.24em {}_0{}^3{x}_{kn}\le {}_0{}^3x_n,\kern0.48em n=1,\dots, N,}\\ {}\left(\mathrm{c}\right)\kern0.6em {\displaystyle \sum_{k=1}^K{z}_k^3\kern0.24em {}_1{}^3y_{km}\le {}_1{}^3y_m,m=1,\dots, M,}\\ {}\left(\mathrm{d}\right)\kern0.6em {\displaystyle \sum_{k=1}^K{z}_k^3\kern0.24em {}_2{}^3y_{km}\le {}_2{}^3y_m,m=1,\dots, M,}\\ {}\left(\mathrm{e}\right)\kern0.6em {z}_k^3\ge 0,k=1,\dots, K,\\ {}\left(\mathrm{f}\right)\kern0.5em \left({}_1{}^3y_m+{}_1{}^4y_m\right)\le {\displaystyle \sum_{k=1}^K{z}_k^1\;\left({}_1{}^3y_{km}+{}_1{}^4y_{km}\right)},m=1,\dots, M,\\ {}\left(\mathrm{g}\right)\kern0.6em {\displaystyle \sum_{k=1}^K{z}_k^1\kern0.24em {}_0{}^1x_{kn}}\le {}_0{}^1x_n,n=1,\dots, N,\\ {}\left(\mathrm{h}\right)\kern0.6em {z}_k^1\ge 0,k=1,\dots, K,\\ {}\left(\mathrm{i}\right)\kern0.5em \left({}_2{}^3y_m+{}_2{}^4y_m\right)\le {\displaystyle \sum_{k=1}^K{z}_k^2\;\left({}_2{}^3y_{km}+{}_2{}^4y_{km}\right)},m=1,\dots, M,\\ {}\left(\mathrm{j}\right)\kern0.6em {\displaystyle \sum_{k=1}^K{z}_k^2\kern0.24em {}_0{}^2x_{kn}}\le {}_0{}^2x_n,n=1,\dots, N,\\ {}\left(\mathrm{k}\right)\kern0.6em {z}_k^2\ge 0,k=1,\dots, K,\\ {}\left(\mathrm{l}\right)\kern0.6em {}_0{}^1x_n+{}_0{}^2x_n+{}_0{}^3x_n\le {x}_n,\kern0.48em n=1,\dots, \left.N\right\}.\end{array} $$

where i0 y km and i0 x kn , k = 1, …, K, m = 1, …, M and n = 1, …, N are observed data. In the network DEA model the first sub-technology is specified by (f)–(h), the second by (i)–(k) and the third by (a)–(e). Note that each sub-technology has a vector of intensity variables (e), (h) and (k). Here they all satisfy constant returns to scale.

Färe and Grosskopf (1996b) have shown that if each sub-technology satisfies standard axioms such as free disposability of inputs and outputs, compactness of the output set and other axioms, then the network technology has the same properties as the sub-technologies from which it is constructed.

If we wish to compute, for example, output efficiency of the network DEA model, we may use the standard Farrell (1957) output measure using linear programming;

$$ {F}_0\left({x}^{k^{\prime }},{y}^{k^{\prime }}\right)= \max \left\{\theta :\theta {y}^{h^{\prime }}\in P\left({x}^{k^{\prime }}\right)\right\}, $$

where \( P\left({x}^{k^{\prime }}\right) \) is the network for observation k . Note that in this model the ratio of outputs from P 1 and P 2 to observed output may vary. If this effect is not appropriate, we may modify the LHS of (f) and (i) to read

$$ \left({\mathrm{f}}^{\prime}\;\right)\kern0.5em \theta \left({}_1{}^3y_{k^{\prime }m}+{}_1{}^4y_{k^{\prime }m}\right)\le $$

and

$$ \left({\mathrm{i}}^{\prime}\;\right)\kern0.5em \mu \left({}_2{}^3y_{k^{\prime }m}+{}_2{}^4y_{k^{\prime }m}\right)\le $$

where k is the observed data and θ, μ are non-negative scalars. When \( {F}_0\left({x}^{k^{\prime }},{y}^{k^{\prime }}\right) \) is computed under these more restrictive conditions, the optimal θ and μ are the sub-vector Farrell efficiency scores.

From standard DEA modeling practice (Cooper et al. 2004), one knows that the primal envelopment model has a dual multiplier formulation. Hence, when we use the “fixed” mix formulation, i.e., when (f ) and (i ) are used, we may write the network in its dual form, as in Kao (2009). To see how the network model may be specified as a traditional neoclassic production model, suppose for simplicity that 41 y =  42 y = 0 and that 31 y,  32 y, and  43 y ∈ ℝ+, so that each sub-technology produces a single output and no final products are produced by P 1 and P 2. If we define a production function by

$$ F(x)= \max \left\{y:y\in P(x)\right\}, $$

then the network model can be written as followsFootnote 9;

$$ \begin{array}{l}{}_1{}^4y={F}^3\left({x}_0^3,{F}^1\left({}_0{}^1x\right),{F}^2\left({}_0{}^2x\right),\kern-0.2em \right)\\ {}\kern1em ={F}^3\left(x-{}_0{}^1x-{}_0{}^2x,{F}^1\left({}_0{}^1x\right),{F}^2\left({}_0{}^2x\right),\kern-0.2em \right),\end{array} $$

i.e., as function of functions. This example illustrates the creation of “parametric” neoclassical models of network technologies.

Next, we formulate three commonly used network models; (i) the Johansen (1972) model, (ii) the two stage model and (iii) the externality model. These three models are set up here generically, and like the network models above can be extended to any finite set of sub-technologies. We start with the Johansen model as formulated by Färe et al. (1992). Here we express it and the other models by means of diagrams (Fig. 14.6).

Fig. 14.6
figure 6

The Johansen model

We have two technologies P 1 and P 2 producing two vectors of outputs y 1 and y 2. They use fixed inputs (non-allocable) x 1 f and x 2 f respectively, and variable allocable inputs x 1 v and x 2 v . The sum of these input vectors cannot exceed a given vector \( {\overline{x}}_v \). Thus the total can be allocated between the two technologies P 1 and P 2. Such an allocation can be implemented by choosing one of the objective functions discussed in Sect. 14.2. For example, if output prices are known, one may maximize total revenue for the network, viz,

$$ \begin{array}{l} \max \kern3pt {p}^1{y}^1+{p}^2{y}^2\\ {}s.t.\kern0.48em {y}^1\in {P}^1,{y}^2\in {P}^2\end{array} $$

where p 1 and p 2 are two output price vectors. The solution to this problem yields the optimal output vector for each technology and the optimal allocation of the variable inputs.Footnote 10 This model may be used in the discussion of merger and coalition formation (Bogetoft and Wang 2005; Färe et al. 2011).

While the Johansen model organizes the technologies in parallel, our second model organizes the technologies in a sequence (Chen et al. 2009) (Fig. 14.7).Footnote 11

Fig. 14.7
figure 7

A two stage model of production

Again we have two technologies P 1 and P 2, now ordered sequentially with the output from P 1 being an intermediate input into P 2 to produce the final output y 2. Each technology has its own input x 1, x 2. It is not hard to adapt the concept of allocation from the Johansen model to introduce savings and borrowing into the model (Färe and Grosskopf 1996a).

Suppose the externality model consists of one polluter and one receptor, with the technologies P 1 and P 2, respectively. Each technology uses an input, x 1 or x 2, to produce desirable outputs y 1 and y 2, and the polluter also produces an undesirable output u. The undesirable output u is an input into the receptor technology, as illustrated below (Fig. 14.8).

Fig. 14.8
figure 8

The network model with externalities

The upstream firm (technology P 1) may be, for example, a paper mill, and the downstream firm P 2 a fishery. The upstream firm produces y 1 and u jointly and in the terminology of Shephard and Färe (1974) we say that y 1 is nulljoint with u if

$$ \left({y}^1,u\right)\in {P}^1\left({x}^1\right)\ \mathrm{and}\ u=0\Rightarrow {y}^1=0, $$

i.e., if no undesirable outputs are produced, no desirable outputs can be produced. The downstream technology takes u as an additional input with negative implications, i.e.,

$$ {u}^1\ge u\Rightarrow {P}^2\left({x}^2,{u}^1\right)\le {P}^2\left({x}^2,u\right). $$

This means that more desirable input does not increase production, but may decrease it. This type of network models have also been applied in the studies of property rights (see Färe and Grosskopf 2004, for a DEA formulation of such a problem).

14.5 Dynamic Network Models

This section is to a large extent adapted from Färe, Grosskopf and Whittaker (2007) For a recent survey of nonparametric dynamic efficiency see (Fallah-Fini et al. 2013).

“Late in 1974, Dr Thomas Varley, Office of Naval Research, asked whether I could formulate a production function for shipbuilding. In thinking about this matter it became apparent that the usualsteady state (static) production function could at best provide a faint model of this production technology. Imagine that you visit a shipyard. Day by day a tremendous amount of production activity of great variety is carried on, yet no ships are turned out. This goes on for a long time. Eventually a ship emerges. What was being produced day by day all during this time? It is clear that the daily, weekly, monthly outputs of the system were intermediate products. The shipbuilding production system, like construction, is a dynamically evolving process.”(Shephard and Färe 1980, page V)

With this description of a production process as a motivation for the dynamics, let us start by formulating a dynamic model as a network. Assume there are three time periods t1, t, t+1, and that each has its production technology P τ,  τ = t−1,  t, t+1. A dynamic model has the property that a decision in one time period impacts on later time periods. For example, if I save now, then my possible consumption may increase later. Therefore we introduce intermediate products, i.e., those products that are held over between time periods, τ + 1 τ y ∈  M+ . If τ = 1, then t + 1 t y is the intermediate vector of outputs produced at time t and entering the production process at t+1, i.e., an input at P t+1. Figure 14.9 illustrates this setup.

Fig. 14.9
figure 9

Dynamic sub-technologies

Each of the production sub-technologies P τ uses exogenous inputs x τ to produce the final output τ τ − 1 y and intermediate inputs i τ − 1 y. To complete the network model (see the figure below) we add initial conditions (distribution process in the static model) and transversality conditions (sink in the static model).

The initial condition is given by \( {}^i\overline{y} \) and may be thought of as the stocks of “capital” initially available. The transversality condition could include the number of periods, say t+1 = T, the state of the system at \( T,{}_{t+1}{}^i\overline{y} \), and the final output vector from the last period, \( {}_{t+1}{}^f\overline{y} \). The chosen conditions are specific to the research problem to be analyzed.

Thus far we have not introduced discounting. However, if outputs are given in value terms, a discount factor of δ τ, 0 ≤ δ τ ≤ 1, may be introduced to account for the lesser of future income compared to present income. For example, δ t f t y are the discounted values of the final output in period t (Fig. 14.10).

Fig. 14.10
figure 10

A dynamic network model

The dynamic network DEA model consists of the interaction of a finite number of static models. Therefore let us start by studying the sub-technology P t from above. This technology uses inputs x t and intermediate inputs i t − 1 y to produce outputs f t y +  i t y, where f t y is the final output and i t y is the intermediate output that is used as an input in the next period. Thus we may write

$$ \begin{array}{c}{P}^t\left({x}^t,{}_{t-1}{}^ty\right)=\left\{\left({}_t{}^fy+{}_t{}^iy\right)\right.:\\ {}\left({}_t{}^f{y}_m+{}_t{}^i{y}_m\right)\le {\displaystyle \sum_{k=1}^{K^t}{z}_k^t\left({}_t{}^f{y}_{km}+{}_t{}^i{y}_{km}\right)}\kern3em m=1,\dots, M,\\ {}{\displaystyle \sum_{k=1}^{K^t}{z}_k^t}{}_i{}^{t-1}{y}_{km}\le {}_i{}^{t-1}{y}_{k^{\prime }m},\kern1.7em m=1,\dots, M,\\ {}{\displaystyle \sum_{k=1}^{K^t}{z}_k^t}{x}_{km}^t\le {x}_{k^{\prime }n}^t,\kern4.5em n=1,\dots, N,\\ {}{z}_k^t\ge 0,\kern8.2em k=1,\dots, {K}^t\Big\},\end{array} $$

where f t y km , i t y km , \( {}_{t-1}{}^i{y}_{km} \), and x kn are observed inputs and outputs. We allow the number of observations to differ between periods, hence the notation K t. The output vector ( f t y +  i t y) is specified so that one can include M t for all t, i.e., M = M t−1 + M t + … by including appropriate zeroes.

Recall from the static model, that if the sub-technologies have the properties (i)–(vii), then the whole network model also has those properties. This observation also applies to the dynamic model below:

$$\begin{array}{c}{P}^t\left({x}^{t-1},{x}^t,{x}^{t+1},{}^i\overline{y}\right)=\left\{\left({}_{t-1}{}^fy,\kern2pt,{}_t{}^fy,\left({}_{t+1}{}^fy+{}_{t+1}{}^iy\right)\right)\right.:\\ {}\left({}_{t-1}{}^fy_m+{}_{t-1}{}^iy_m\right)\le {\displaystyle \sum_{k=1}^{K^{t-1}}{z}_k^{t-1}\left({}_{t-1}{}^fy_{km}+{}_{t-1}{}^iy_{km}\right)}\kern1.3em m=1,\dots, M,\\ {}{\displaystyle \sum_{k=1}^{K^{t-1}}{z}_k^{t-1}}{}^i{y}_{km}\le {}^i\overline{y}_{km},\kern3.7em m=1,\dots, M,\\ {}{\displaystyle \sum_{k=1}^{K^{t-1}}{z}_k^{t-1}}{x}_{kn}^{t-1}\le {x}_n^{t-1},\kern3.95em n=1,\dots, N,\\ {}{z}_k^{t-1}\ge 0,\kern8.1em k=1,\dots, {K}^{t-1},\end{array} $$
$$ \begin{array}{c}\left({}_t{}^fy_m+{}_t{}^iy_m\right)\le {\displaystyle \sum_{k=1}^{K^t}{z}_k^t\left({}_t{}^fy_{km}+{}_t{}^iy_{km}\right)}\kern3.1em m=1,\dots, M,\\ {}{\displaystyle \sum_{k=1}^{K^t}{z}_k^t}\kern0.24em {}_{t-1}{}^iy_{km}\le {}_{t-1}{}^i\overline{y}_m,\kern2.2em m=1,\dots, M,\\ {}{\displaystyle \sum_{k=1}^{K^t}{z}_k^t}{x}_{kn}^t\le {x}_n^t,\kern5.2em n=1,\dots, N,\\ {}{z}_k^t\ge 0,\kern8.2em k=1,\dots, {K}^t,\\ {}\left({}_{t+1}{}^fy_m+{}_{t+1}{}^iy_m\right)\le {\displaystyle \sum_{k=1}^{K^{t+1}}{z}_k^{t+1}\left({}_{t+1}{}^fy_{km}+{}_{t+1}{}^iy_{km}\right)}\kern.7em m=1,\dots, M,\\ {}{\displaystyle \sum_{k=1}^{K^{t+1}}{z}_k^{t+1}}\kern0.24em {}_t{}^iy_{km}\le {}_t{}^i\overline{y}_{km},\kern2.9em m=1,\dots, M,\\ {}{\displaystyle \sum_{k=1}^{K^{t+1}}{z}_k^{t+1}}{x}_{kn}^{t+1}\le {x}_n^{t+1},\kern3.3em n=1,\dots, N,\\ {}{z}_k^{t+1}\ge 0,\kern7.5em \left.k=1,\dots, {K}^{t+1}\right\},\end{array} $$

Note that each sub-technology has its own intensity vector z τ, τ = t − 1, t, t + 1, and that the interaction between time periods comes through the intermediate outputs.

Färe and Grosskopf (1997) use this model to study the inefficiency of APEC countries due to dynamic misallocation of resources. They used the sum of Shephard (1970) sub-technology distance functions as their optimization criterion. Nemota and Goto (2003) applied the dynamic network model to study Japanese electricity production over time. They used cost minimization for the optimization criterion. Jaenicke (2000) applied the dynamic model in the analysis of the yield effects of crop rotation. Nemota and Goto (1999) applied the dual linear programming problem formulation to the cost minimization and derived the fundamental equation (Hamilton-Jacobi-Bellman) of dynamic programming. We also refer the reader to the work of J.K. Sengupta. A search for “J.K. Sengupta dynamic models” using any one of the major internet search engines produces a large number of dynamic non-parametric models that he has analyzed.