Nonparametric analysis of technology and productivity under non-convexity: a neighborhood-based approach

Chavas, Jean-Paul; Kim, Kwansoo

doi:10.1007/s11123-014-0383-1

Published: 02 February 2014

Volume 43, pages 59–74, (2015)

Journal of Productivity Analysis


Jean-Paul Chavas¹ &
Kwansoo Kim²


Abstract

This paper investigates the nonparametric analysis of technology under non-convexity. The analysis extends two approaches now commonly used in efficiency and productivity analysis: data envelopment analysis where convexity is imposed; and free disposal hull (FDH) models. We argue that, while the FDH model allows for non-convexity, its representation of non-convexity is too extreme. We propose a new nonparametric model that relies on a neighborhood-based technology assessment which allows for less extreme forms of non-convexity. The distinctive feature of our approach is that it allows for non-convexity to arise in any part of the feasible set. We show how it can be implemented empirically by solving simple linear programming problems. And we illustrate the usefulness of the approach in an empirical application to the analysis of technical and scale efficiency on Korean farms.


1 Introduction

Nonparametric analysis of technology and productivity has been the subject of much interest (e.g., Afriat 1972; Färe et al. 1994; Varian 1984). It has provided the basis for data envelopment analysis (DEA) now commonly used in the investigation of productivity and firm efficiency (e.g., Banker 1984; Banker et al. 1984; Ray 2004; Cook and Seiford 2009). DEA has been seen as an attractive approach for three reasons: it allows for a flexible representation of multi-input multi-output technology; it involves solving simple linear programming problems; and it can provide firm-specific estimates of productivity and efficiency. Yet, it has one significant limitation: it assumes that the feasible set is always convex (where diminishing marginal productivity applies everywhere). As such, DEA is not appropriate in the investigation of non-convex technologies. How important are non-convexity issues in the analysis of productivity and firm efficiency? There are situations where non-convexity has significant implications for economics and management. For example, it is an important issue in the analysis of multi-product firms: non-convexity contributes to generating productivity benefits from specialization (e.g., Bogetoft 1996; Chavas and Kim 2007). This implies a need to develop empirical methods that can support the analysis of non-convex technology. Such methods are needed to examine empirically when and where non-convexity may arise.

The objective of this paper is to propose a refined nonparametric method for the analysis of technology under non-convexity. Note that nonparametric representations of technology under non-convexity are not new. Relaxing convexity assumptions in DEA has been explored by Deprins et al. (1984), Petersen (1990a, b), Bogetoft (1996), Chang (1999), Kerstens and Vanden Eeckaut (1999), Bogetoft et al. (2000), Briec et al. (2004), Podinovski (2005), Leleu (2006, 2009), De Witte and Marques (2011), Briec and Liang (2011) and others. The most common approach is the “free disposal hull” (FDH) representation investigated by Deprins et al. (1984), Tulkens (1993), Kerstens and Vanden Eeckaut (1999) and Agrell and Tind (2001). But while the FDH model allows for non-convexity, we argue that its representation is too extreme: it tends to find evidence of non-convexity “too often”. Note that other approaches have also been used to relax the convexity assumption in nonparametric analyses. They include Petersen (1990a, b), Bogetoft (1996), Agrell et al. (2005) and Podinovski (2005). Petersen (1990a, b) and Bogetoft (1996) have proposed to restrict convexity only to the input space or the output space. Agrell et al. (2005) have considered technology represented by unions of pairs of convex input and output sets. And Podinovski (2005) has put forward an approach where convexity is evaluated individually for each input or output.

In this paper, we propose a new nonparametric model that relies on a neighborhood-based assessment of technology. Our approach allows for non-convexity to arise in any part of the feasible set. It differs from the approaches proposed by Petersen (1990a, b), Bogetoft (1996), Agrell et al. (2005) or Podinovski (2005), who explored departures from convexity based on specific inputs and/or outputs. Our approach has three useful characteristics: it provides a flexible representation of non-convexity; it nests as (restrictive) special cases both the DEA model and the FDH model; and it is easy to implement empirically. As such, our new nonparametric approach extends the related literature both theoretically and empirically. Its usefulness is illustrated in an application to the analysis of technical and scale efficiency on Korean farms. The empirical results show how allowing for non-convexity reduces the extent of technical inefficiency. They report evidence that non-convexity is more common on large farms. Finally, they document how non-convexity matters in the analysis of scale effects.

The new model and its neighborhood-based assessment of technology are presented in Sect. 2. Its use in the evaluation of non-convex technologies is discussed in Sect. 3. Using a directional distance function, Sect. 4 presents productivity analysis under non-convexity and proposes a new measure to evaluate the extent of non-convexity. Section 5 examines the evaluation of returns to scale and scale efficiency under non-convexity. In Sect. 6, we show how our approach can be implemented easily by solving simple optimization problems. The usefulness of the method is illustrated in an application presented in Sect. 7. Finally, Sect. 8 concludes.

2 The model

Consider the observation of production activities on a set of N firms in an industry. Each firm produces m netputs z ∈ R^m and faces a production technology represented by the feasible set T ⊂ R^m. We use the netput notation where inputs are negative and outputs are positive. Let z_i ≡ (z_1i, …, z_mi) ∈ R^m be the netput vector produced by the i-th firm, where z_ji is the j-th netput used/produced by the i-th firm, and z_i ∈ T means that netputs z_i are feasible, i ∈ N ≡ {1, …, N}. The technology T may exhibit different scale properties. It is said to exhibit $\left\{ {\begin{array}{l} {\text{non-decreasing returns to scale (NDRS)}} \\ {\text{constant returns to scale (CRS)}} \\ {\text{non-increasing returns to scale (NIRS)}} \\ \end{array} } \right\}$ if T $\left\{ {\begin{array}{*{20}c} \supset \\ { = } \\ \subset \\ \end{array} } \right\}$ δ T for any scalar δ > 1. And the technology is said to exhibit variable returns to scale (VRS) if no a priori restriction is imposed on returns to scale. Throughout the paper, we assume that the technology T satisfies free disposal, where free disposal means that T = T − ${\text{R}}_{ + }^{\text{m}}$.

First, consider the case where T is convex.^{Footnote 1} Then, under free disposal, a nonparametric representation of the technology is given by

$${\text{T}}_{\text{v}} = \, \{ {\text{z}}:{\text{ z}} \le \sum\limits_{{{\text{i}} \in {\rm N}}} {\lambda_{\text{i}} {\text{z}}_{\text{i}} } ;\lambda_{\text{i}} \in {\text{R}}_{ + } ,{\text{i}} \in {\rm N};\sum\limits_{{{\text{i}} \in {\rm N}}} {\lambda_{\text{i}} = { 1}} \}$$

(1)

T_v in (1) is the smallest convex set containing all data points {z_i: i ∈ N} under free disposal and VRS (e.g., Afriat 1972; Varian 1984). It is the representation commonly used in DEA (e.g., Banker 1984; Banker et al. 1984; Ray 2004; Cook and Seiford 2009).

Alternative representations have been proposed depending on the scale properties of the technology. Following Färe et al. (1994) and Banker et al. (2004), they are

$${\text{T}}_{\text{s}} = \, \{ {\text{z}}:{\text{ z}} \le \sum\limits_{{{\text{i}} \in {\rm N}}} {\lambda_{\text{i}} {\text{z}}_{\text{i}} ;\lambda_{\text{i}} } \in {\text{R}}_{ + } ,{\text{i}} \in {\rm N},\sum\limits_{{{\text{i}} \in {\rm N}}} {\lambda_{\text{i}} \in {\text{S}}_{\text{s}} } \} ,$$

(2)

where s ∈ {v, c, ni, nd}, with S_v = 1 under VRS, S_c = [0, ∞] under constant returns to scale (CRS), S_ni = [0, 1] under non-increasing returns to scale (NIRS), and S_nd = [1, ∞] under non-decreasing returns to scale (NDRS). Indeed, when S_v = 1, T_v in (2) reduces to Eq. (1) under VRS. Alternatively, when S_c = [0, ∞], T_c in (2) provides a representation of a convex technology under CRS. T_c is the smallest convex cone containing all data points {z_i: i ∈ N}. When S_ni = [0, 1], T_ni in (2) provides a representation of a convex technology under NIRS. Finally, when S_nd = [1, ∞], T_nd in (2) represents a convex technology under NDRS. Since S_v ⊂ S_ni ⊂ S_c and S_v ⊂ S_nd ⊂ S_c, it follows from (2) that T_v ⊂ T_ni ⊂ T_c and T_v ⊂ T_nd ⊂ T_c. Also, S_c = S_ni ∪ S_nd implies that T_c = T_ni ∪ T_nd. Note that the sets T_v, T_ni, T_nd and T_c are all convex.

Next, we want to introduce non-convexity in the analysis. For that purpose, consider the following nonparametric representation of technology

$${\text{T}}_{\text{FDHv}} = \, \{ {\text{z}}:{\text{ z}} \le \sum\limits_{{{\text{i}} \in {\rm N}}} {\lambda_{\text{i}} {\text{z}}_{\text{i}} } ;\lambda_{\text{i}} \in \left\{ {0,{ 1}} \right\},{\text{i}} \in {\rm N};\sum\limits_{{{\text{i}} \in {\rm N}}} {\lambda_{\text{i}} = { 1}} \} ,$$

(3)

where FDH stands for “free disposal hull” (Deprins et al. 1984; Tulkens 1993; Kerstens and Vanden Eeckaut 1999; Agrell and Tind 2001). Under free disposal, T_FDHv is the smallest set containing all data points {z_i: i ∈ N} under VRS. It provides a non-convex representation of the technology under VRS.
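As a minimal sketch of what (3) implies in practice: because exactly one λ_i equals 1, a netput vector z belongs to T_FDHv whenever at least one observed z_i dominates it componentwise. The function name `in_fdh_vrs` and the three data points are hypothetical and serve only as an illustration.

```python
def in_fdh_vrs(z, data):
    """Return True if z <= z_i componentwise for at least one observed z_i.

    Netput convention: inputs are negative, outputs are positive.
    """
    return any(all(zj <= zij for zj, zij in zip(z, zi)) for zi in data)


# Hypothetical data: one input (negative) and one output (positive).
data = [(-1.0, 1.0), (-2.0, 3.0), (-4.0, 4.0)]

# (-2, 2.5) is dominated by the observed point (-2, 3).
print(in_fdh_vrs((-2.0, 2.5), data))

# (-1.5, 2) is the midpoint of the first two firms: it lies in their convex
# hull but is not dominated by any single observation.
print(in_fdh_vrs((-1.5, 2.0), data))
```

The second point is the kind of convex combination that the convex set T_v in (1) admits and T_FDHv excludes.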

Alternative non-convex representations have been proposed depending on the scale properties of the technology. Following Kerstens and Vanden Eeckaut (1999), they include

$${\text{T}}_{\text{FDHs}} = \, \{ {\text{z}}:{\text{ z}} \le \sum\limits_{{{\text{i}} \in {\rm N}}} {\lambda_{\text{i}} {\text{z}}_{\text{i}} } ;\lambda_{\text{i}} \in \{ 0,\delta \} ,{\text{i}} \in {\rm N};\sum\limits_{{{\text{i}} \in {\rm N}}} {\lambda_{\text{i}} = \delta ;\delta \in {\text{S}}_{\text{s}} } \} .$$

(4)

where s ∈ {v, c, ni, nd}, and the S_s’s are as defined above. When S_v = 1, T_FDHv in (4) reduces to Eq. (3) under VRS. Alternatively, when S_c = [0, ∞], T_FDHc in (4) provides a representation of a FDH technology under CRS. T_FDHc is the smallest cone containing all data points {z_i: i ∈ N}. When S_ni = [0, 1], T_FDHni in (4) provides a representation of a FDH technology under NIRS. Finally, when S_nd = [1, ∞], T_FDHnd in (4) represents a FDH technology under NDRS.^{Footnote 2} Since S_v ⊂ S_ni ⊂ S_c and S_v ⊂ S_nd ⊂ S_c, it follows from (4) that T_FDHv ⊂ T_FDHni ⊂ T_FDHc and T_FDHv ⊂ T_FDHnd ⊂ T_FDHc. Also, S_c = S_ni ∪ S_nd implies that T_FDHc = T_FDHni ∪ T_FDHnd. Note that each of the sets T_FDHv, T_FDHni, T_FDHnd and T_FDHc is in general non-convex. Finally, note that the λ’s are restricted to take discrete values in (4) but not in (2). It follows that T_FDHs ⊂ T_s, i.e., that T_FDHs is a subset of T_s, for s ∈ {v, c, ni, nd}.

The sets T_v, T_c and T_FDHv are illustrated in Fig. 1. Figure 1 shows that these sets satisfy T_FDHv ⊂ T_v ⊂ T_c. Note that the sets T_v and T_c are convex, but that the set T_FDHv is non-convex. This indicates that DEA is clearly inappropriate in the analysis of non-convexity. Indeed, since T_v is always convex, DEA offers no prospect to uncover any evidence of non-convexity and produces biased estimates of technical efficiency under a non-convex technology. In contrast, FDH can provide a basis to represent a non-convex technology. Yet, it has a rather undesirable characteristic: it has a tendency to find non-convexity at many places. This can be seen in Fig. 1, where the frontier technology is given by the line ABDHJ under T_v and by ABCDEFGHJ under T_FDHv. While the frontier line ABDHJ is concave, the frontier line ABCDEFGHJ is not. The two lines coincide only along the segments AB and HJ, where marginal products are either zero or infinite under T_v. At all other points, the two lines differ. It means that, under FDH, the frontier technology would basically exhibit non-convexity at all points where marginal products are positive and bounded under T_v. Yet, we are usually interested in situations where marginal products are positive and bounded. The fact that FDH would always reveal non-convexity in these situations seems undesirable. In other words, while T_FDHv can provide a representation of non-convexity, it may reveal it “too often”.^{Footnote 3} This indicates a need to develop alternative representations of technology that can capture non-convexity in a more useful and credible way. Below, we explore alternative formulations that allow for flexible representations of the technology T under non-convexity.

Define a neighborhood of z ≡ (z₁, …, z_m) ∈ R^m as B_r(z, σ) = {z′: D_p(z, z′) ≤ r; z′ ∈ R^m} ⊂ R^m, where r > 0 and D_p(z, z′) ≡ $[\sum\nolimits_{{\text{j}} = 1}^{\text{m}} (|{\text{z}}_{\text{j}} - {\text{z}}_{\text{j}}^{\prime } |/\sigma_{\text{j}} )^{\text{p}} ]^{{1/{\text{p}}}}$ is a weighted Minkowski distance between z and z′, with weights σ = (σ₁, …, σ_m) ∈ ${\text{R}}_{ + + }^{\text{m}}$ and based on a p-norm 1 ≤ p < ∞.^{Footnote 4} Let I(z, r) = {i: z_i ∈ B_r(z, σ), i ∈ N} ⊂ N, where I(z, r) is the set of firms in N that are located in the neighborhood B_r(z, σ) of z.^{Footnote 5} Define a local representation of the technology T in the neighborhood of point z as:

$${\text{T}}_{\text{rv}} \left( {\text{z}} \right) \, = \, \{ {\text{z}}:{\text{ z}} \le \sum\limits_{{{\text{i}} \in {\rm I}\left( {{\text{z}},{\text{r}}} \right)}} {\lambda_{\text{i}} {\text{z}}_{\text{i}} ;\lambda_{\text{i}} \in {\text{R}}_{ + } ,{\text{i}} \in {\rm I}\left( {{\text{z}},{\text{ r}}} \right);} \sum\limits_{{{\text{i}} \in {\rm I}\left( {{\text{z}},{\text{r}}} \right)}} {\lambda_{\text{i}} = { 1}\} } .$$

(5)

Equation (5) corresponds to Eq. (1) except that it applies locally using information limited to points in the neighborhood B_r(z, σ) of z under VRS. Using (2), alternative local representations of the technology can be obtained depending on its scale properties. They are

$${\text{T}}_{\text{rs}} \left( {\text{z}} \right) \, = \{ {\text{z}}:{\text{ z}} \le \sum\limits_{{{\text{i}} \in {\rm I}\left( {{\text{z}},{\text{r}}} \right)}} {\lambda_{\text{i}} {\text{z}}_{\text{i}} ;\lambda_{\text{i}} \in {\text{R}}_{ + } ,{\text{i}} \in {\rm I}\left( {{\text{z}},{\text{ r}}} \right);} \sum\limits_{{{\text{i}} \in {\rm I}\left( {{\text{z}},{\text{r}}} \right)}} {\lambda_{\text{i}} \in {\text{S}}_{\text{s}} \} } .$$

(6)

where s ∈ {v, c, ni, nd}, and the S_s’s are as defined above. When S_v = 1, T_rv(z) in (6) reduces to Eq. (5) under VRS. Alternatively, when S_c = [0, ∞], T_rc(z) in (6) provides a local representation of the technology under CRS. When S_ni = [0, 1], T_rni(z) in (6) is a local representation of the technology under NIRS. Finally, when S_nd = [1, ∞], T_rnd(z) in (6) gives a local representation of the technology under NDRS. Since S_v ⊂ S_ni ⊂ S_c and S_v ⊂ S_nd ⊂ S_c, it follows from (6) that T_rv(z) ⊂ T_rni(z) ⊂ T_rc(z) and T_rv(z) ⊂ T_rnd(z) ⊂ T_rc(z). Also, S_c = S_ni ∪ S_nd implies that T_rc(z) = T_rni(z) ∪ T_rnd(z). Finally, note that, for a given z, the sets T_rv(z), T_rni(z), T_rnd(z) and T_rc(z) are all convex.
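The index set I(z, r) that drives the local representations (5) and (6) can be sketched in a few lines. This is an illustrative sketch only: the function names `minkowski` and `neighbors`, the choice p = 2, the unit weights σ and the data are all assumptions, and firm indices are zero-based.

```python
def minkowski(z, zp, sigma, p=2.0):
    """Weighted Minkowski distance D_p(z, z') with weights sigma_j > 0."""
    return sum((abs(a - b) / s) ** p for a, b, s in zip(z, zp, sigma)) ** (1.0 / p)


def neighbors(z, data, r, sigma, p=2.0):
    """Index set I(z, r): firms whose netputs lie within distance r of z."""
    return [i for i, zi in enumerate(data) if minkowski(z, zi, sigma, p) <= r]


# Hypothetical data: one input (negative) and one output (positive).
data = [(-1.0, 1.0), (-2.0, 3.0), (-4.0, 4.0)]
sigma = (1.0, 1.0)

# A moderate radius picks up the nearest firm as well as the firm itself.
print(neighbors(data[0], data, r=2.5, sigma=sigma))

# A very small radius leaves only the firm itself.
print(neighbors(data[0], data, r=0.1, sigma=sigma))
```

Raising σ_j for a netput shrinks its contribution to the distance, so more firms fall inside the neighborhood along that dimension.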

Definition 1

Consider the following neighborhood-based representation of the technology T:

$$T_{\text{rs}}^{*} = \cup_{{{\text{i}} \in {\text{N}}}} T_{\text{rs}} (z_{\text{i}} ),\quad {\text{for}}\;{\text{s}}\; \in \;\left\{ {{\text{v}},\,{\text{c}},\,{\text{ni}},\,{\text{nd}}} \right\}.$$

(7)

Equation (7) defines the set T ^*_rs as the union of the sets T_rs(z_i), i ∈ N. In the neighborhood of point z_i, the set T_rs(z_i) is convex and provides a local representation of the technology T under free disposal and returns to scale characterized by s ∈ {v, c, ni, nd}. Since the union of convex sets is not necessarily convex, it follows that T ^*_rs defined in (7) is not necessarily convex for each s ∈ {v, c, ni, nd}. Since the sets T_rs(z_i) in (6) are convex, it means that any non-convexity in T ^*_rs necessarily comes from the union of the neighborhood-based sets T_rs(z_i). As discussed below, this provides useful flexibility in investigating a non-convex technology.

Equation (7) is our proposed neighborhood-based representation of technology. It extends previous literature by allowing for non-convexity to arise in any part of the feasible set. Our approach has two points in common with Agrell et al. (2005): 1/we both rely on the fact that unions of convex sets are not necessarily convex; and 2/like Agrell et al.’s approach, our approach can nest FDH as a special case (as shown below). But the convex pair approach proposed by Agrell et al. (2005) did not rely on the neighborhood-based measures used here. As such, the neighborhood-based sets T_rs(z_i) in (7) are specific to our approach. As argued below, our neighborhood-based characterization provides useful flexibility in the characterization of a non-convex technology.

Equation (7) differs from the approaches proposed by Petersen (1990a, b), Bogetoft (1996), or Podinovski (2005), who explored departures from convexity based on inputs and/or outputs. Petersen (1990a, b) and Bogetoft (1996) assume full convexity in the output set or the input set. The selective convexity approach proposed by Podinovski (2005) is more general in the sense that it allows for non-convexity to arise for specific inputs or outputs. By defining non-convexity for all values of selected sets of inputs or outputs, the approaches proposed by Petersen (1990a, b), Bogetoft (1996) or Podinovski (2005) focus on a global characterization of non-convexity. It means that they cannot examine the possible presence of non-convexity in particular subsets of feasible inputs/outputs. As such, they do not allow for a local specification of convexity (Podinovski 2005, p. 556). Our approach does. Indeed, our neighborhood-based approach is flexible enough to allow for non-convexity to arise in any region of the feasible set. As noted above, the non-convexity of T ^*_rs in (7) comes from the union of the neighborhood-based convex sets T_rs(z_i). This provides useful guidance in the choice of neighborhoods: choose a neighborhood to be “large” in parts of the feasible region that are thought to be convex, but “small” in parts that are thought to be non-convex (see Sect. 6.3 below). Our proposed approach offers a flexible representation of parts of the feasible set that exhibit non-convexity. This local flexibility can apply to specific ranges of values taken by given inputs or outputs (as discussed in Sect. 6). Importantly, this useful property is not shared with the global approaches proposed by Petersen (1990a, b), Bogetoft (1996) or Podinovski (2005). The flexibility can also apply to all values taken by specific netputs (in a way similar to the approach proposed by Podinovski (2005)). To see that, given σ = (σ₁, …, σ_m), choosing σ_j determines how large (or small) a neighborhood B_r(z, σ) is for the j-th netput. In this context, the choice of σ = (σ₁, …, σ_m) implies that convexity would apply for the inputs/outputs that have a “large” neighborhood while non-convexity can arise for inputs/outputs that have a “small” neighborhood.

As shown below, T ^*_rs has three useful characteristics: 1/it provides a flexible representation of non-convexity; 2/it nests as (restrictive) special cases both the DEA model and the FDH model; and 3/it is easy to implement empirically.

3 Evaluating non-convexity

Our evaluation of non-convexity of the technology relies on the properties of the representations T_s and T ^*_rs . The following properties will prove useful.

Lemma 1

For s ∈ {v, c, ni, nd}, the set T ^*_rs satisfies

$${ \lim }_{{{\text{r}} \to \infty }} {\text{T}}_{\text{rs}}^{*} = {\text{T}}_{\text{s}} .$$

(8)

Proof

Note that lim_r→∞ I(z, r) = N for any finite z ∈ R^m. Using Eqs. (2), (6) and (7), it follows that T_s = lim_r→∞ T_rs(z_i) = lim_r→∞ T ^*_rs for any i ∈ N and s ∈ {v, c, ni, nd}.

Lemma 2

For s ∈ {v, c, ni, nd}, the set T ^*_rs satisfies

$${ \lim }_{{{\text{r}} \to 0}} {\text{T}}_{\text{rs}}^{*} = {\text{ T}}_{\text{FDHs}} .$$

(9)

Proof

Note that lim_r→0 B_r(z_i, σ) = {z_i} and lim_r→0 I(z_i, r) = {i} for any i ∈ N. Using Eq. (6), we have lim_r→0 T_rs(z_i) = {z: z ≤ γ z_i, γ ∈ S_s}. Eq. (7) can be alternatively written as T ^*_rs = {Σ_i∈Ν α_i T_rs(z_i): α_i ∈ {0, 1}, i ∈ Ν; Σ_i∈Ν α_i = 1}. Letting η_i = α_i γ, this implies that lim_r→0 T ^*_rs = {z: z ≤ Σ_i∈Ν η_i z_i; η_i ∈ {0, γ}, i ∈ Ν; Σ_i∈Ν η_i = γ, γ ∈ S_s}. Using Eq. (4), this gives (9).

Given s ∈ {v, c, ni, nd}, Eqs. (8) and (9) show that T ^*_rs includes two important special cases. From Eq. (8), the set T ^*_rs reduces to the set T_s when r → ∞, i.e., when the neighborhood B_r(z, σ) of any z becomes “very large”. And from Eq. (9), the set T ^*_rs reduces to the set T_FDHs when r → 0, i.e., when the neighborhood B_r(z_i, σ) becomes “very small” for any i ∈ N.

Proposition 1

For s ∈ {v, c, ni, nd}, the sets satisfy

$$T_{\text{FDHs}} \subset {\text{T}}_{\text{rs}}^{*} \subset {\text{T}}_{\text{r's}}^{*} \subset {\text{T}}_{\text{s}} ,\quad {\text{for any r'}} > {\text{r}} > 0.$$

(10)

Proof

Note that lim_r→0 B_r(z_i, σ) ⊂ B_r(z_i, σ) ⊂ B_r′(z_i, σ) ⊂ lim_r→∞ B_r(z_i, σ) for any r′ > r > 0. Thus, for any r′ > r > 0, lim_r→0 I(z_i, r) ⊂ I(z_i, r) ⊂ I(z_i, r′) ⊂ lim_r→∞ I(z_i, r) = N. Then, Eq. (6) implies that lim_r→0 T_rs(z_i) ⊂ T_rs(z_i) ⊂ T_r’s(z_i) ⊂ lim_r→∞ T_rs(z_i) for any r′ > r > 0 and any i ∈ N. Using Eqs. (7), (8) and (9), this proves (10).

Proposition 1 states that T_FDHs is in general a subset of T_s: T_FDHs ⊂ T_s, for s ∈ {v, c, ni, nd}. It also establishes that the set T ^*_rs , our neighborhood-based representation of technology, is bounded between T_FDHs and T_s, with T_FDHs as lower bound and T_s as upper bound. Noting that the set T_s is convex, and the set T_FDHs is in general non-convex, it means that T ^*_rs provides a generic way of introducing non-convexity in production analysis. The sets T_v, T_FDHv and T ^*_rv are illustrated in Fig. 2 under VRS. Figure 2 shows that these sets satisfy T_FDHv ⊂ T ^*_rv ⊂ T_v. Note that the set T_v is convex, but that the sets T ^*_rv and T_FDHv are non-convex. These representations apply under alternative scale properties: under VRS when s = v (with S_v = 1), under CRS when s = c (with S_c = [0, ∞]), under NIRS when s = ni (with S_ni = [0, 1]), as well as under NDRS when s = nd (with S_nd = [1, ∞]). Finally, Eq. (10) states that the set T ^*_rs becomes larger when r increases, i.e., when the neighborhoods used to evaluate T ^*_rs become larger. As further discussed below, this provides some flexibility in the empirical analysis of non-convexity issues.

4 Productivity under non-convexity

Let g ∈ ${\text{R}}_{ + }^{\text{m}}$ be a reference bundle satisfying g ≠ 0. Following Chambers et al. (1996), consider the directional distance function^{Footnote 6}

$$\begin{aligned} {\text{D}}\left( {{\text{z}},{\text{ T}}} \right) & = { \sup }_{\beta } \{ \beta :\;({\text{z}} + \beta {\text{g}}) \in {\text{T}}\} \quad {\text{if there is a scalar }}\beta {\text{ satisfying }}({\text{z}} + \beta {\text{g}}) \in {\text{T}}, \\ & = - \infty \quad {\text{otherwise}}. \\ \end{aligned}$$

(11)

The directional distance function is the distance between point z and the upper bound of the technology T, measured in number of units of the reference bundle g. It provides a general measure of productivity. In general, D(z, T) = 0 means that point z is on the frontier of the technology T. Alternatively, D(z, T) > 0 implies that z is technically inefficient (as it is below the frontier),^{Footnote 7} while D(z, T) < 0 identifies z as being infeasible (as it is located above the frontier). Luenberger (1995) and Chambers et al. (1996) provide a detailed analysis of the properties of D(z, T). First, by definition in (11), z ∈ T implies that D(z, T) ≥ 0 (since β = 0 would then be feasible in (11)), meaning that T ⊂ {z: D(z, T) ≥ 0}. Second, D(z, T) ≥ 0 in (11) implies that (z + D(z, T) g) ∈ T. When the technology T exhibits free disposal, it follows that D(z, T) ≥ 0 implies that z ∈ T, meaning that T ⊃ {z: D(z, T) ≥ 0}. Combining these two properties, we obtain the following result: under free disposal, T = {z: D(z, T) ≥ 0} and D(z, T) provides a complete representation of the technology T. Importantly, besides being convenient, this result is general: it allows for an arbitrary multi-input multi-output technology; and it applies with or without convexity.

Using (10) and (11), we obtain the following key result.

Proposition 2

For any point z ∈ R^m where D(z, T_s) > −∞, the directional distance function satisfies

$${\text{D}}({\text{z}},{\text{T}}_{\text{FDHs}} ) \le {\text{D}}\left( {{\text{z}},{\text{T}}_{\text{rs}}^{*} } \right) \le {\text{D}}\left( {{\text{z}},{\text{T}}_{\text{r's}}^{*} } \right) \le {\text{D}}\left( {{\text{z}},{\text{T}}_{\text{s}} } \right),\quad {\text{for any r}}' > r > 0,\quad {\text{for s}} \in \{ {\text{v, c, ni, nd}}\}$$

(12)

Proposition 2 shows that D(z, T ^*_rs ) is bounded between D(z, T_FDHs) and D(z, T_s), with D(z, T_FDHs) as lower bound and D(z, T_s) as upper bound. When s = v, Eq. (12) implies that DEA (relying on T_v) is more likely to find evidence of technical inefficiency than FDH. This is illustrated in Fig. 1, which shows that the production frontier tends to be higher under DEA compared to FDH. With s ∈ {v, c, ni, nd}, Eq. (12) shows that this result applies under alternative characterizations of returns to scale. It also shows that D(z, T ^*_rs ) tends to increase with r, where T ^*_rs is our neighborhood-based representation of technology given in (7). Finally, as discussed next, Proposition 2 provides a basis to evaluate the role of non-convexity in productivity analysis.

Definition 2

At point z, define the following measure of non-convexity

$${\text{C}}_{\text{rs}} \left( {\text{z}} \right) \equiv {\text{D}}\left( {{\text{z}},{\text{ T}}_{\text{s}} } \right) \, - {\text{ D}}\left( {{\text{z}},{\text{ T}}_{\text{rs}}^{*} } \right),\quad {\text{for s}} \in \left\{ {{\text{v}},{\text{ c}},{\text{ ni}},{\text{ nd}}} \right\}.$$

(13)

Proposition 3

At point z where D(z, T_v) > −∞,

$$\lim_{{{\text{r}} \to 0}} {\text{C}}_{\text{rs}} ({\text{z}}) \ge {\text{C}}_{\text{rs}} ({\text{z}}) \ge {\text{C}}_{\text{r's}} ({\text{z}}) \ge \lim_{{{\text{r}} \to \infty }} {\text{C}}_{\text{rs}} ({\text{z}}) = 0,{\text{for any r'}} > {\text{r}} > 0,\quad {\text{for s}} \in \{ {\text{v, c, ni, nd}}\} .$$

(14)

Proof

The inequalities in (14) are obtained from combining (12) and (13), and using Eqs. (8) and (9).

Proposition 3 applies under alternative characterizations of returns to scale: under VRS (when s = v), CRS (when s = c), NIRS (when s = ni), as well as NDRS (when s = nd). Equation (13) defines C_rs(z) as a measure of non-convexity, evaluated in number of units of the reference bundle g. From Eq. (14), this measure is always non-negative: C_rs(z) ≥ 0. Equation (14) states that lim_r→∞ C_rs(z) = 0. This is intuitive: DEA assumes convexity and does not provide any opportunity to uncover the presence of non-convexity. It means that the search for non-convexity must rely on the case where r < ∞. Then, for a given r < ∞, finding C_rs(z) > 0 at some point z implies that the set T ^*_rs is non-convex. In addition, (14) states that lim_r→0 C_rs(z) is an upper bound measure for C_rs(z). This reflects the fact that, under free disposal, FDH offers the greatest prospects to uncover non-convexity. Finally, Eq. (14) shows that C_rs(z) tends to decrease with r, indicating that the opportunity to uncover non-convexity declines with the size of the neighborhoods used to evaluate T ^*_rs . The effects of r on the evaluation of non-convexity are further discussed below.
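One possible way to compute the non-convexity measure (13) under VRS is sketched below, assuming NumPy and SciPy are available. Because T ^*_rv in (7) is a union, the supremum in (11) over T ^*_rv equals the largest of the suprema over the local sets T_rv(z_i), so the sketch simply solves one small LP per firm and keeps the maximum. The function names, the choice p = 2, the weights σ and the three data points are all hypothetical; this enumeration is an illustration, not the authors' implementation.

```python
import numpy as np
from scipy.optimize import linprog


def dist_vrs(z, Z, g):
    """D(z, T) for the convex VRS technology spanned by the rows of Z.

    Solves max beta s.t. z + beta*g <= sum_i lam_i z_i, lam >= 0, sum lam = 1.
    Returns -inf when the LP is infeasible.
    """
    n = Z.shape[0]
    c = np.concatenate(([-1.0], np.zeros(n)))            # linprog minimizes -beta
    A_ub = np.column_stack([g, -Z.T])                    # beta*g - sum lam_i z_i <= -z
    A_eq = np.concatenate(([0.0], np.ones(n)))[None, :]  # sum lam_i = 1
    bounds = [(None, None)] + [(0, None)] * n            # beta free, lam_i >= 0
    res = linprog(c, A_ub=A_ub, b_ub=-z, A_eq=A_eq, b_eq=[1.0], bounds=bounds)
    return -res.fun if res.status == 0 else -np.inf


def dist_star_vrs(z, Z, g, r, sigma, p=2.0):
    """D(z, T*_rv): largest value over the local sets T_rv(z_i), i in N."""
    best = -np.inf
    for zi in Z:
        dist = (((np.abs(Z - zi) / sigma) ** p).sum(axis=1)) ** (1.0 / p)
        best = max(best, dist_vrs(z, Z[dist <= r], g))
    return best


def nonconvexity(z, Z, g, r, sigma):
    """C_rv(z) = D(z, T_v) - D(z, T*_rv), as in (13) with s = v."""
    return dist_vrs(z, Z, g) - dist_star_vrs(z, Z, g, r, sigma)


# Hypothetical data: one input (negative) and one output (positive).
Z = np.array([[-1.0, 1.0], [-2.0, 1.2], [-3.0, 3.0]])
g = np.array([0.0, 1.0])          # measure distance in units of output
sigma = np.array([1.0, 1.0])
z = Z[1]                          # evaluate the second firm

c_small = nonconvexity(z, Z, g, r=1.5, sigma=sigma)  # firms 1 and 3 not neighbors
c_large = nonconvexity(z, Z, g, r=3.0, sigma=sigma)  # every firm is a neighbor
print(round(c_small, 6), round(c_large, 6))
```

In this example the small radius keeps the first and third firms in separate neighborhoods, so their convex combination is not available and the measure is positive; with the larger radius the local and global sets coincide and the measure vanishes, in line with (14).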

5 Evaluating returns to scale

Since our analysis applies under alternative scale characterization, it can also be used to investigate returns to scale. While evaluating scale efficiency is well known under convexity (e.g., Färe et al. 1994; Banker et al. 2004), this section explores how this can be done under non-convexity.

Proposition 4

The sets satisfy

$${\text{T}}_{\text{rv}}^{*} \subset {\text{T}}_{\text{rni}}^{*} \subset {\text{T}}_{\text{rc}}^{*} ,$$

(15a)

$${\text{T}}_{\text{rv}}^{*} \subset {\text{T}}_{\text{rnd}}^{*} \subset {\text{T}}_{\text{rc}}^{*} .$$

(15b)

Proof

We have seen that T_rv(z) ⊂ T_rni(z) ⊂ T_rc(z) and T_rv(z) ⊂ T_rnd(z) ⊂ T_rc(z). Then, (15a) and (15b) follow from (7).

Definition 3

At point z, define the following measure of scale efficiency

$${\text{SE}}_{\text{rs}} \left( {\text{z}} \right) \equiv {\text{D}}\left( {{\text{z}},{\text{ T}}_{\text{rc}}^{*} } \right) \, - {\text{ D}}\left( {{\text{z}},{\text{ T}}_{\text{rs}}^{*} } \right),\quad {\text{for s}} \in \left\{ {{\text{v}},{\text{ c}},{\text{ ni}},{\text{ nd}}} \right\}.$$

(16)

Proposition 5

At point z where D(z, T_v) > −∞, the scale efficiency measures SE_rs(z) satisfy

$${\text{SE}}_{\text{rv}} \left( {\text{z}} \right) \, \ge {\text{ SE}}_{\text{rni}} \left( {\text{z}} \right) \, \ge \, 0,$$

(17a)

$${\text{SE}}_{\text{rv}} \left( {\text{z}} \right) \, \ge {\text{ SE}}_{\text{rnd}} \left( {\text{z}} \right) \, \ge \, 0.$$

(17b)

Proof

Equations (11), (15a) and (15b) imply that ${\text{D}}\left( {{\text{z}},{\text{ T}}_{\text{rc}}^{*} } \right) \, \ge {\text{ D}}\left( {{\text{z}},{\text{ T}}_{\text{rni}}^{*} } \right) \, \ge {\text{ D}}\left( {{\text{z}},{\text{ T}}_{\text{rv}}^{*} } \right),{\text{ and D}}\left( {{\text{z}},{\text{ T}}_{\text{rc}}^{*} } \right) \, \ge {\text{ D}}\left( {{\text{z}},{\text{ T}}_{\text{rnd}}^{*} } \right) \, \ge {\text{ D}}\left( {{\text{z}},{\text{ T}}_{\text{rv}}^{*} } \right)$. Using (16), this gives (17a) and (17b).

Equation (16) defines SE_rs(z) as a measure of departure from CRS, evaluated in number of units of the reference bundle g. From Eqs. (17a)–(17b), evaluated under VRS (with s = v), the measure is always non-negative: SE_rv(z) ≥ 0. This is intuitive: it follows from the fact that the set T ^*_rc is always at least as large as T ^*_rv , as stated in (15a)–(15b). In addition, (17a) states that, under NIRS (with s = ni), SE_rni(z) is also non-negative but has SE_rv(z) as an upper bound. This follows from the fact that the set T ^*_rni is always at least as large as T ^*_rv but never larger than T ^*_rc , as stated in (15a). And (17b) establishes a similar result under NDRS (with s = nd): SE_rnd(z) is non-negative but has SE_rv(z) as an upper bound. This shows how SE_rs(z) in equation (16) provides a basis to measure scale efficiency under non-convexity. Indeed, finding SE_rs(z) > 0 at point z implies that the set T ^*_rs exhibits a departure from CRS and that point z is scale inefficient. The effects of r on the evaluation of scale efficiency will be evaluated below.
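The scale efficiency measure (16) can be sketched along the same lines, again assuming NumPy and SciPy. Each scale assumption s is encoded as lower and upper bounds on Σ_i λ_i, and D(z, T ^*_rs) is obtained as the largest value over the local LPs for T_rs(z_i). The names `SCALE`, `dist_local`, `dist_star` and `scale_efficiency`, the data, the radius and the weights are hypothetical choices made for illustration only.

```python
import numpy as np
from scipy.optimize import linprog

# Bounds (lo, hi) on sum_i lam_i for each scale assumption; None means no upper bound.
SCALE = {"v": (1.0, 1.0), "c": (0.0, None), "ni": (0.0, 1.0), "nd": (1.0, None)}


def dist_local(z, Z, g, s):
    """D(z, T) for the convex technology spanned by the rows of Z under scale s."""
    n = Z.shape[0]
    lo, hi = SCALE[s]
    c = np.concatenate(([-1.0], np.zeros(n)))            # linprog minimizes -beta
    ones = np.concatenate(([0.0], np.ones(n)))[None, :]
    A = [np.column_stack([g, -Z.T]), -ones]              # technology rows; sum lam >= lo
    b = [-z, [-lo]]
    if hi is not None:
        A.append(ones)                                   # sum lam <= hi
        b.append([hi])
    res = linprog(c, A_ub=np.vstack(A), b_ub=np.concatenate(b),
                  bounds=[(None, None)] + [(0, None)] * n)
    return -res.fun if res.status == 0 else -np.inf      # -inf if no optimum is found


def dist_star(z, Z, g, s, r, sigma, p=2.0):
    """D(z, T*_rs): largest value over the local sets T_rs(z_i), i in N."""
    best = -np.inf
    for zi in Z:
        dist = (((np.abs(Z - zi) / sigma) ** p).sum(axis=1)) ** (1.0 / p)
        best = max(best, dist_local(z, Z[dist <= r], g, s))
    return best


def scale_efficiency(z, Z, g, s, r, sigma):
    """SE_rs(z) = D(z, T*_rc) - D(z, T*_rs), as in (16)."""
    return dist_star(z, Z, g, "c", r, sigma) - dist_star(z, Z, g, s, r, sigma)


# Hypothetical data: one input (negative) and one output (positive).
Z = np.array([[-1.0, 1.0], [-2.0, 3.0], [-4.0, 4.0]])
g = np.array([0.0, 1.0])
sigma = np.array([1.0, 1.0])
z = Z[2]                                                 # the largest firm

se = {s: scale_efficiency(z, Z, g, s, r=1.0, sigma=sigma) for s in ("v", "ni", "nd")}
print({s: round(v, 6) for s, v in se.items()})
```

For the largest firm in this example the measure is positive under VRS and NIRS but zero under NDRS, which is consistent with the ordering in (17a) and (17b).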

6 Empirical assessment

Consider a data set involving observations of m netputs chosen by N firms: {z_i = (z_1i, …, z_mi): i ∈ N}, where z_ji is the j-th netput used by the i-th firm. As suggested in propositions 2–5, we want to find some convenient way to solve for the directional distance function D(z, T) under alternative representations of the technology T.

6.1 Empirical evaluation of directional distance functions

This section examines empirical applications using the data {z_i = (z_1i, …, z_mi): i ∈ N}. First consider the optimization problem (11) under T_s in (2), where s ∈{v, c, ni, nd}, S_v = 1, S_c = [0, ∞], S_ni = [0, 1] and S_nd = [1, ∞]. For each s ∈{v, c, ni, nd} and assuming that a solution exists, this gives the standard linear programming (LP) problems: D(z, T_s) = max_β {β: z + β g ≤ Σ_i∈Ν λ_i z_i; λ_i ∈ R₊, i ∈ Ν, Σ_i∈Ν λ_i ∈ S_s}. In all these cases, convexity is imposed. Second, consider the optimization problem (11) under T_FDHs in (4) for s ∈{v, c, ni, nd}. Assuming that a solution exists, this gives D(z, T_FDHs) = max_β {β: z + β g ≤ Σ_i∈Ν λ_i z_i; λ_i ∈ {0, δ}, i ∈ Ν; Σ_i∈Ν λ_i = δ; δ ∈ S_s}, which is a mixed integer linear programming (MILP) problem for s = v (where S_v = 1), but a mixed integer nonlinear programming (MINLP) problem for s ∈{c, ni, nd}.^{Footnote 8}
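Under VRS (where δ = 1), the FDH problem selects a single observation, so it can also be evaluated by direct enumeration over the data instead of calling a MILP solver. The following Python sketch illustrates this under stated assumptions: the function name `fdh_distance` and the toy data are hypothetical, the reference bundle is taken to satisfy g ≥ 0 with at least one positive element, and inputs are coded as negative netputs.

```python
import math

def fdh_distance(z, g, data):
    """D(z, T_FDHv) by enumeration (VRS, delta = 1).

    For each observation z_i, find the largest beta such that
    z + beta * g <= z_i, then take the maximum over observations.
    Returns -inf when no observation is feasible.
    """
    best = -math.inf
    for zi in data:
        beta = math.inf
        feasible = True
        for zj, gj, zij in zip(z, g, zi):
            if gj > 0:
                beta = min(beta, (zij - zj) / gj)
            elif zij < zj:            # g_j = 0 requires z_j <= z_ij
                feasible = False
                break
        if feasible and beta < math.inf:
            best = max(best, beta)
    return best

# Hypothetical toy data: netputs (-input, output) for four firms.
data = [(-1.0, 1.0), (-2.0, 1.2), (-3.0, 3.0), (-4.0, 3.2)]
g = (0.0, 1.0)                                   # expand the output only
d_observed = fdh_distance(data[1], g, data)      # an observed firm
d_interior = fdh_distance((-3.5, 2.0), g, data)  # a dominated point
```

In this toy example the observed firm lies on the FDH frontier, while the dominated point is one unit of g below it.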

Below, we explore how to solve (11) under T ^*_rs , the neighborhood-based representation of technology given in (7). Note that Eq. (7) can be alternatively written as

$${\text{T}}_{\text{rs}}^{*} = \{ \sum\limits_{{{\text{j}} \in N}} {\upalpha_{\text{j}} {\text{T}}_{\text{rs}} \left( {{\text{z}}_{\text{j}} } \right);\upalpha_{\text{j}} \in \left\{ {0,{ 1}} \right\},{\text{j}} \in N;} \sum\limits_{{{\text{j}} \in N}} {\upalpha_{\text{j}} = { 1}\} } ,$$

(18)

for s ∈{v, c, ni, nd}. Let λ_ij be the weight λ_i associated with z = z_j in (7). Letting η_ij = α_j λ_ij, it follows from (6), (11) and (18) that

$$\begin{aligned} {\text{D}}\left( {{\text{z}},{\text{ T}}_{\text{rs}}^{*} } \right) \, & = {\text{Max}}_{{\upbeta,\uplambda,\upeta,\upalpha}} \{\upbeta: \, ({\text{z}} +\upbeta{\text{g}}) \le \sum\limits_{{{\text{j}} \in N}} {\sum\limits_{{{\text{i}} \in I({\text{z}}_{\text{j}},{\text{r}})}} {\upeta_{\text{ij}} {\text{z}}_{\text{i}} :\upeta_{\text{ij}} =\upalpha_{\text{j}}\uplambda_{\text{ij}} ,\uplambda_{\text{ij}} \in {\text{R}}_{ + } ,} } \\ &\quad \sum\limits_{{{\text{i}} \in I({\text{z}}_{\text{j}},{\text{r}})}} {\uplambda_{\text{ij}} \in {\text{S}}_{\text{s}} ,\upalpha_{\text{j}} \in \left\{ {0,{ 1}} \right\},} \sum\limits_{{{\text{j}} \in N}} { \upalpha_{\text{j}} = { 1},{\text{i}} \in I\left( {{\text{z}}_{\text{j}} ,{\text{ r}}} \right),{\text{j}} \in N\} {\text{ if a solution exists}},} \\ & = \, - \infty \;{\text{otherwise}}, \\ \end{aligned}$$

(19)

for s ∈{v, c, ni, nd}. Equation (19) is a MINLP problem. Solving it numerically can provide a way to assess the directional distance functions D(z, T ^*_rs ) for s ∈{v, c, ni, nd}.

Yet, dealing with non-linear constraints in (19) can be empirically challenging. In this context, alternative formulations that avoid non-linear constraints are of interest. One such formulation is the following optimization problem

$$\begin{aligned} {\text{D}}^{ + } \left( {{\text{z}},{\text{ T}}_{\text{rs}}^{*} } \right) \, & = {\text{Max}}_{{\upbeta,\upeta,\upalpha}} \{\upbeta: \, ({\text{z}} +\upbeta{\text{g}}) \le \sum\limits_{{{\text{j}} \in N}} {\sum\limits_{{{\text{i}} \in I({\text{z}}_{\text{j}},{\text{r}})}} {\upeta_{\text{ij}} {\text{z}}_{\text{i}} :\upeta_{\text{ij}} \in {\text{R}}_{ + } ,} } \sum\limits_{{{\text{i}} \in I({\text{z}}_{\text{j}},{\text{r}})}} {\upeta_{\text{ij}} \in\upalpha_{\text{j}} {\text{S}}_{\text{s}} ,} \\ &\quad \upalpha_{\text{j}} \in \left\{ {0,{ 1}} \right\},\sum\limits_{{{\text{j}} \in N}} {\upalpha_{\text{j}} = { 1},{\text{i}} \in I\left( {{\text{z}}_{\text{j}} ,{\text{ r}}} \right),{\text{j}} \in N\} {\text{ if a solution exists}},} \\ & = \, - \infty \;{\text{otherwise}}. \\ \end{aligned}$$

(20)

for s ∈{v, c, ni, nd}. Equation (20) is a MILP problem. Because it does not include the nonlinear restrictions η_ij = α_j λ_ij, solving (20) is simpler than solving (19). But the absence of the restrictions η_ij = α_j λ_ij in (20) implies that D⁺(z, T ^*_rs ) is in general an upper bound to D(z, T ^*_rs ): D⁺(z, T ^*_rs ) ≥ D(z, T ^*_rs ). When would the two objective functions coincide? They would coincide (with D⁺(z, T ^*_rs ) = D(z, T ^*_rs )) when the solution to (20), (η^*, α^*), satisfies η ^*_ij = 0 for all i when α ^*_j = 0, j ∈ N. Otherwise, they would differ, and D⁺(z, T ^*_rs ) would be strictly larger than D(z, T ^*_rs ): D⁺(z, T ^*_rs ) > D(z, T ^*_rs ). In this latter case, solving the simpler problem (20) would provide upward biased estimates of D(z, T ^*_rs ).

6.2 Linear programming formulation

Given the potential empirical difficulties in solving the nonlinear optimization problem (19), we now explore a simpler way to evaluate D(z, T ^*_rs ) in (19). From (7), note that T ^*_rs is defined from T_rs(z_j), j ∈ N. This suggests obtaining D(z, T ^*_rs ) using the following two-step approach.

In step one, solve (11) under T_rs(z′) in (6). For s ∈{v, c, ni, nd}, this corresponds to the (primal) linear programming (LP) problem

$$\begin{aligned} {\text{D}}({\text{z}},{\text{T}}_{\text{rs}} ({\text{z}}')) & = {\text{Max}}_{{\upbeta,\uplambda}} \{\upbeta:({\text{z}} +\upbeta{\text{g}}) \le \sum\limits_{{{\text{i}} \in I\left( {{\text{z}}',{\text{r}}} \right)}} {\uplambda_{\text{i}} {\text{z}}_{\text{i}} ;\uplambda_{\text{i}} \in {\text{R}}_{ + } ,{\text{i}} \in I({\text{z}}',{\text{r}});} \sum\limits_{{{\text{i}} \in I\left( {{\text{z}}',{\text{r}}} \right)}} {\uplambda_{\text{i}} \in {\text{S}}_{\text{s}} } \} \quad {\text{if a solution exists,}} \\ & = - \infty \,\;{\text{otherwise}} \\ \end{aligned}$$

(21)

or its dual LP formulation

$$\begin{aligned} {\text{D}}({\text{z}},{\text{T}}_{\text{rs}} ({\text{z}}')) & = {\text{Min}}_{\text{u,v}} \{ {\text{v}} - {\text{z}}^{\text{T}} {\text{u}}:{\text{z}}_{\text{j}}^{\text{T}} {\text{u}} \le {\text{v}},{\text{j}} \in I({\text{z}}',{\text{r}});{\text{g}}^{\text{T}} {\text{u}} = 1;{\text{u}} \in {\text{R}}_{ + }^{\text{m}} ;{\text{v}} \in V_{\text{s}} \} ,\quad {\text{if a solution exists}}, \\ & = - \infty \;{\text{otherwise,}} \\ \end{aligned}$$

(21’)

where u and v are the Lagrange multipliers associated with the constraints [(z + β g) ≤ Σ_i∈I(z′,r) λ_i z_i] and [Σ_i∈I(z′,r) λ_i ∈ S_s] in (21), with V_v = [−∞, ∞], V_c = 0, V_ni = [0, ∞] and V_nd = [−∞, 0].
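For s = v, the link between the primal and dual problems can be made explicit through the Lagrangian of (21). The following is a sketch under the standard assumptions that u ≥ 0 (the multiplier on the inequality constraint) and that v is unrestricted in sign (the multiplier on Σ_i λ_i = 1):

$$\begin{aligned} L(\upbeta,\uplambda,{\text{u}},{\text{v}}) & = \upbeta + {\text{u}}^{\text{T}} \Big( \sum\limits_{{{\text{i}} \in I({\text{z}}',{\text{r}})}} \uplambda_{\text{i}} {\text{z}}_{\text{i}} - {\text{z}} - \upbeta {\text{g}} \Big) + {\text{v}}\Big( 1 - \sum\limits_{{{\text{i}} \in I({\text{z}}',{\text{r}})}} \uplambda_{\text{i}} \Big) \\ & = ({\text{v}} - {\text{z}}^{\text{T}} {\text{u}}) + \upbeta \,(1 - {\text{g}}^{\text{T}} {\text{u}}) + \sum\limits_{{{\text{i}} \in I({\text{z}}',{\text{r}})}} \uplambda_{\text{i}} \,({\text{z}}_{\text{i}}^{\text{T}} {\text{u}} - {\text{v}}). \\ \end{aligned}$$

Maximizing L over β (free) and λ_i ≥ 0 gives a finite value only if g^T u = 1 and z_i^T u ≤ v for all i ∈ I(z′, r), in which case the maximum equals v − z^T u. Minimizing this value over (u, v) gives the dual problem under VRS.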

Then, in step two, assuming that D(z, T_rs(z_i)) > −∞ for some i ∈ N, and using (18), D(z, T ^*_rs ) can be obtained as

$${\text{D}}\left( {{\text{z}},{\text{ T}}_{\text{rs}}^{*} } \right) \, = {\text{Max}}_{\text{i}} \{ {\text{D}}\left( {{\text{z}},{\text{ T}}_{\text{rs}} \left( {{\text{z}}_{\text{i}} } \right)} \right):{\text{ i}} \in N\} .$$

(22)

In this two-step approach, step one involves solving linear programming (LP) problems in (21) or (21’). And step two, stated in (22), is a simple maximization problem. This shows how (21)–(22) can be used to obtain D(z, T ^*_rs ) by solving simple linear programming problems. This provides a convenient way to solve (11) under T ^*_rs , our neighborhood-based representation of technology given in (7).
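The two-step logic can be illustrated with a small pure-Python sketch. It is deliberately restricted to a special case and is not a general implementation: one input and one output, VRS (s = v), netputs coded as z = (−x, y), and g = (0, 1). In that case an optimal basic solution of the step-one LP weights at most two observations, so a simple enumeration of points and pairs stands in for an LP solver. All function names, the box-shaped neighborhoods and the toy data are hypothetical.

```python
import math

def local_distance(z, members):
    """Step one, (21) with s = v: D(z, T_rv(z')) for z = (-x, y), g = (0, 1).

    Checks single observations and pairs whose inputs bracket x; this
    enumeration replaces an LP solver in this special case only.
    """
    x, y = -z[0], z[1]
    pts = [(-zi[0], zi[1]) for zi in members]
    frontier = -math.inf
    for xi, yi in pts:
        if xi > x:
            continue                      # uses more input than z allows
        frontier = max(frontier, yi)      # single observation
        for xk, yk in pts:
            if xk > x:                    # pair (i, k) brackets x
                lam = (xk - x) / (xk - xi)
                frontier = max(frontier, lam * yi + (1 - lam) * yk)
    return frontier - y                   # -inf if no feasible solution

def neighborhood(center, data, half_widths):
    """Observations z_i with |z_j - z_ji| <= half_widths[j] for every j."""
    return [zi for zi in data
            if all(abs(c - v) <= h
                   for c, v, h in zip(center, zi, half_widths))]

def neighborhood_distance(z, data, half_widths):
    """Step two, (22): maximum of the local distances over all centers z_j."""
    return max(local_distance(z, neighborhood(zj, data, half_widths))
               for zj in data)

# Hypothetical toy data: netputs (-input, output) for five firms.
data = [(-1.0, 1.0), (-2.0, 1.2), (-3.0, 2.0), (-4.0, 4.0), (-5.0, 4.2)]
z = data[1]
d_dea = neighborhood_distance(z, data, (10.0, 10.0))  # neighborhoods hold all data
d_mid = neighborhood_distance(z, data, (1.0, 1.0))    # intermediate neighborhoods
d_fdh = neighborhood_distance(z, data, (0.0, 0.0))    # one point per neighborhood
```

In this toy example the estimate for the second firm rises from 0 (one point per neighborhood) to about 0.3 (intermediate neighborhoods) and about 0.8 (neighborhoods covering all data) as the neighborhoods grow.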

6.3 Defining the neighborhood B_r(z, σ)

As discussed in Sect. 2, our analysis relies on the definition of a neighborhood B_r(z, σ) = {z′: D_p(z, z′) ≤ r; z′ ∈ R^m} ⊂ R^m, where D_p(z, z′) is a weighted Minkowski distance with 1 ≤ p < ∞. Below, it will be convenient to rely on a weighted Chebyshev distance defined as lim_p→∞ D_p(z, z′) = Max_j {|z_j − z_j′|/σ_j: j = 1, …, m}. In this context, B_r(z, σ) can be written as B_r(z, σ) = {z′: −r σ_j ≤ z_j − z_j′ ≤ r σ_j; j = 1, …, m; z′ ∈ R^m} and I(z, r) can be written as I(z, r) = {i: −r σ_j ≤ z_j − z_ji ≤ r σ_j; j = 1, …, m; i ∈ N}.
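As a minimal illustration of these definitions, the following Python sketch computes the weighted Chebyshev distance and the index set I(z, r). The function names and the toy data are hypothetical, and indices are 0-based as is usual in Python.

```python
def chebyshev_distance(z, z_prime, sigma):
    """Weighted Chebyshev distance: max_j |z_j - z'_j| / sigma_j."""
    return max(abs(a - b) / s for a, b, s in zip(z, z_prime, sigma))

def index_set(z, data, r, sigma):
    """I(z, r): indices of observations inside the neighborhood B_r(z, sigma)."""
    return [i for i, zi in enumerate(data)
            if chebyshev_distance(z, zi, sigma) <= r]

# Hypothetical toy data: netputs (-input, output) for four firms.
data = [(-1.0, 1.0), (-2.0, 1.2), (-3.0, 2.0), (-4.0, 4.0)]
sigma = (1.0, 2.0)
dist = chebyshev_distance(data[1], data[2], sigma)
members = index_set(data[1], data, r=1.0, sigma=sigma)
```

Here the fourth firm falls outside the neighborhood of the second firm because its input differs by more than r σ_1.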

Below, we discuss general rules that can be used in choosing this neighborhood. Sometimes, we may have a priori information about the regions where non-convexity is likely to arise. Assume that one of these regions is region A(z) around point z. In general we want to choose the neighborhood B_r(z, σ) to be no larger than A(z). Indeed, choosing B_r(z, σ) ⊃ A(z) may just “hide” the non-convexity in A(z) within the larger region B_r(z, σ). This generates the following rule:

Rule R1

Around point z, choose a neighborhood B_r(z, σ) that is no larger than the region A(z) where non-convexity is suspected: B_r(z, σ) ⊂ A(z).

Rule R1 assumes that we do have a priori information about the presence of non-convexity. This a priori information can come from theoretical considerations. For example, the presence of fixed cost is a well-known source of non-convexity. It means that non-convexity can be expected in any region of the feasible set where “fixed resources” are being used. This can include labor or management (e.g., “fixed” labor or management wasted in the process of switching between tasks) as well as capital (e.g., “fixed” machinery, equipment or infrastructure used in the production process). This could also include “resource fixity” on the output side (e.g., for perishable products).

What if we do not have the a priori information stipulated in rule R1? Then we need to find other ways to identify the neighborhood B_r(z, σ). In this context, we can use the data to help choose these neighborhoods. To see that, let M_j ≡ [Max_i∈N {z_ji} − Min_i∈N {z_ji}] be the range of observations for z_j, j = 1, …, m. For the j-th netput, consider partitioning the line [Min_i∈N {z_ji}, Max_i∈N {z_ji}] into k intervals, j = 1, …, m, where k is an integer satisfying 1 ≤ k ≤ N. One way is to choose these intervals to be equally spaced.^{Footnote 9} Then, for the j-th netput, the width of an interval is M_j/k. Given B_r(z, σ) = {z′: −r σ_j ≤ z_j − z_j′ ≤ r σ_j; j = 1, …, m; z′ ∈ R^m}, associate these intervals with a neighborhood of point z by letting r σ_j = M_j/k, j = 1, …, m. For a given k, it follows that the neighborhood of z can be written as B_r(z, ·) = {z′: − M_j/k ≤ z_j − z_j′ ≤ M_j/k; j = 1, …, m; z′ ∈ R^m}. When z and z′ are points within the range of the data, then choosing k = 1 implies that B_r(z, ·) is a “large neighborhood” of z which includes all data points. And choosing k > 1 means that we partition the range of each netput into k equally spaced intervals, the neighborhood B_r(z, ·) of z becoming smaller as k becomes larger.

Next, we propose the following rule to guide us in the choice of neighborhoods.

Rule R2

Around point z, choose a neighborhood B_r(z, σ) that includes more than one data point.

R2 has important implications. First, it implies that point z cannot be outside the range of the data. That is intuitive: in any analysis, we should always try to avoid extrapolating beyond the data. Second, Rule R2 requires that there are sufficient data points to support the analysis. It hints that the number of observations N should be “large enough” to provide credible evidence on non-convexity in the neighborhood of point z. Third, R2 rules out FDH. Indeed, from Eq. (9) in Lemma 2, FDH is obtained when r → 0, implying that the neighborhood of any point z_j would include just the point z_j. This would be inconsistent with R2. As discussed in Sect. 2, the FDH approach seems undesirable as it can find evidence of non-convexity “too often”. Intuitively, R2 stresses the importance of having a minimal number of observations (more than 1) to evaluate the characteristics of technology in any neighborhood within the data. As such, R2 can help improve the credibility of finding evidence that a technology is non-convex. Fourth, Rule R2 puts some upper bound on the number of intervals k discussed above. Indeed, increasing k would also reduce the number of observations in each interval. Again, to be credible, evidence of non-convexity in the neighborhood of point z should rely on a sufficient number of data points. Overall, Rule R2 implies that the number of observations N should be “large enough” while the number of intervals should “not be too large”. As such, it provides useful guidance to support productivity analysis under non-convexity.
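The interplay between the number of intervals k and Rule R2 can be checked directly on a data set. The Python sketch below is one possible reading of the rule, under the assumption that R2 is applied to the neighborhood of every observed point with r σ_j = M_j/k; the function names and the toy data are hypothetical.

```python
def half_widths(data, k):
    """r * sigma_j = M_j / k, where M_j is the observed range of netput j."""
    m = len(data[0])
    return [(max(z[j] for z in data) - min(z[j] for z in data)) / k
            for j in range(m)]

def neighbor_counts(data, k):
    """Number of observations in the neighborhood of each data point."""
    h = half_widths(data, k)
    return [sum(all(abs(a - b) <= hj for a, b, hj in zip(zc, zi, h))
                for zi in data)
            for zc in data]

def satisfies_r2(data, k):
    """Rule R2: every neighborhood holds more than one data point."""
    return all(c > 1 for c in neighbor_counts(data, k))

# Hypothetical toy data: netputs (-input, output) for five firms.
data = [(-1.0, 1.0), (-2.0, 1.5), (-3.0, 2.0), (-4.0, 4.0), (-5.0, 4.5)]
counts_k1 = neighbor_counts(data, 1)
r2_by_k = {k: satisfies_r2(data, k) for k in (1, 2, 4, 8)}
```

With k = 1 every neighborhood contains all five observations, while at k = 8 each neighborhood shrinks to a single point and R2 fails, which illustrates the upper bound that R2 places on k.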

7 Empirical illustration

To illustrate the usefulness of our proposed approach, we apply it to a data set on production activities from a sample of Korean farm households.

7.1 Data

The data were collected in 2007 in a Farm Household Economy Survey conducted by the Korean National Statistical Office. Our analysis focuses on a sample of farms classified as paddy rice farms located in the Jeon-Nam province, a rice-producing province in the southern part of Korea. Being in the same region, all farms face similar agro-climatic conditions. The sample includes 122 rice farms. It provides data on ten outputs: rice, vegetable, soybean, fruit, potato, barley, miscellaneous, specialty, livestock, and others; and four inputs: labor, size of paddy land, size of upland, and other inputs. Labor input is measured in hours, and land inputs are measured in hectares (ha). Other netputs are measured in value, assuming that all farmers face the same prices.

Descriptive statistics on the variables used in our analysis are presented in Table 1. The average revenue from rice production is 15,398.81 (measured in 1,000 won^{Footnote 10}), accounting for 62.7 % of total farm revenue. The second largest source of revenue is vegetable production: 3,608.15 (measured in 1,000 won), accounting for 14.7 % of total farm revenue. The average size of a farm is 1.31 ha (including both paddy land and upland).

Table 1 Descriptive statistics


7.2 Results

Our analysis uses data on production activities from our sample of 122 Korean farms. It covers 14 netputs: 10 outputs treated as positive, and 4 inputs treated as negative. For the i-th farm, the netputs are z_i = (z_ji: j = 1, …, 14), i ∈ N ≡ {1, 2, …, 122}.

The estimation of the directional distance function in (11), (19) or (21)–(22) produces a nonparametric estimate of the distance between point z and the boundary of the feasible set, as measured by the number of units of the reference bundle g. When z is the netput vector for the i-th farm, then the distance function D(z_i, T) ≥ 0 provides a measure of technical inefficiency for the i-th farm, with D(z_i, T) > 0 when the i-th farm is technically inefficient. The reference bundle g = (g₁, …, g₁₄) is chosen as follows. We let g_j = 0 when j is an input, and g_j = sample mean for the j-th output when j is an output. Thus, our reference bundle g = (g₁, …, g₁₄) is the typical bundle associated with the outputs of an average farm. This choice leads to a simple interpretation of our directional distance estimates. For example, for a given T, finding that D(z_i, T) = 0.2 would mean that the i-th farm is technically inefficient: it could move to the production frontier and increase its outputs by a maximum of 20 percent of the average outputs in our sample by becoming technically efficient. Note that this interpretation remains valid under alternative characterizations of the technology T.
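The construction of the reference bundle and the interpretation of a distance estimate can be sketched in Python as follows. The function names, the three-netput toy data and the value D = 0.2 are hypothetical and serve only to illustrate the arithmetic.

```python
def reference_bundle(data, is_output):
    """g_j = 0 for inputs; g_j = sample mean of netput j for outputs."""
    n = len(data)
    return [sum(z[j] for z in data) / n if out else 0.0
            for j, out in enumerate(is_output)]

def project(z, g, d):
    """Netput vector z + d * g reached when inefficiency d is removed."""
    return [zj + d * gj for zj, gj in zip(z, g)]

# Hypothetical data: one input (negative netput) and two outputs.
data = [(-2.0, 10.0, 4.0), (-4.0, 20.0, 0.0), (-3.0, 30.0, 8.0)]
g = reference_bundle(data, is_output=(False, True, True))
target = project(data[0], g, d=0.2)
```

With D = 0.2, each output of the first farm rises by 20 percent of the corresponding sample mean, while the input is left unchanged.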

We evaluate the directional distance function D(z_j, T) in (11) for each farm under alternative representations of the technology. First, we start with DEA analysis and solve for D(z_j, T) under technologies T_v under VRS and T_c under CRS (as given in Eqs. (1) and (2)). Second, using T_FDHv in (3), we obtain FDH measures D(z_j, T_FDHv) under VRS technology by solving the corresponding mixed integer programming problems. The results are reported in the “Appendix” for each farm. Since our neighborhood-based representation of technology allows for non-convexity to arise in any part of the feasible set, it can provide a basis to evaluate productivity and non-convexity for different firm types. We investigate this issue for three categories of farms: small farms, medium farms, and large farms.^{Footnote 11} The results are summarized in Table 2. Table 2 presents the average technical inefficiency estimates D(z_j, T) for each group of farms under alternative representations of the technology. It shows that DEA finds evidence of technical inefficiency across all farm sizes. The mean value of D(z_j, T_v) is 0.063 for small farms, 0.159 for medium farms, and 0.119 for large farms. Table 2 also reports that FDH finds that all farms are technically efficient, with D(z_j, T_FDHv) = 0 for all j = 1, …, 122. Note that this is consistent with Proposition 2, which showed that DEA (relying on T_v) is more likely to find evidence of technical inefficiency than FDH (as the production frontier tends to be higher under DEA compared to FDH). But in this case, allowing for non-convexity under FDH eliminates all evidence of technical inefficiency. This has two implications. First, there can be a large difference between the DEA measure of technical inefficiency D(z_j, T_v), and its FDH counterpart D(z_j, T_FDHv). Second, this difference is due entirely to relaxing the convexity assumption. One must wonder whether this difference is “credible”. As discussed in Sect. 2, this raises the question: Does the FDH approach find non-convexity “too often”? We believe that it does (as further discussed below).

Table 2 Average technical inefficiency D(z, T) and non-convexity C(z) under alternative representations of the technology, by farm size


Next, using the neighborhood-based representation of technology T ^*_rs in (7) or (18), we obtain estimates of the directional distance D(z_j, T ^*_rs ) by solving the linear programming problems in (21)–(22). In the absence of strong a priori information about where non-convexity may arise, we define the neighborhoods B_r(z, σ) as follows. Assuming equally spaced intervals, we let r σ_j = M_j/k, and define B_r(z, ·) = {z′: − M_j/k ≤ z_j − z_j′ ≤ M_j/k; j = 1, …, m; z′ ∈ R^m} as the neighborhood of z, where M_j ≡ [Max_i∈N {z_ji} − Min_i∈N {z_ji}] and k denotes the number of intervals within the data range. The set T ^*_rv in (7) is then defined accordingly. The analysis is repeated for alternative numbers of intervals k: k = 1, 2, 4, 6, 8, 10, 12. The distances D(z_j, T ^*_rs ) are estimated under VRS (with s = v) for each farm. The results are reported in the “Appendix” for each farm. Summary measures are presented in Table 2 for our three farm sizes: small farms, medium farms, and large farms. The results are consistent with Proposition 2. First, as expected, D(z, T ^*_rv ) is bounded between D(z, T_FDHv) and D(z, T_v), with D(z, T_FDHv) as lower bound and D(z, T_v) as upper bound. Second, D(z, T ^*_rv ) tends to increase with the size of the neighborhood r, or equivalently decrease with the number of intervals k (given r σ_j = M_j/k). Third, Table 2 shows that our estimates D(z, T ^*_rv ) nest DEA estimates and FDH estimates as special cases. Indeed, D(z, T ^*_rv ) becomes equal to D(z, T_v) when neighborhoods become “large” (in our case, when k = 1), and it becomes equal to D(z, T_FDHv) when neighborhoods become “small” (in our case, when k = 12). Yet, neither case seems realistic. Indeed, choosing k = 1 imposes a convex technology and prevents any possibility of uncovering evidence of non-convexity. Alternatively, choosing k = 12 likely finds non-convexity “too often”. As noted above, FDH does not satisfy our Rule R2. In this case, 12 intervals are “too many” as there are not enough points in each neighborhood to obtain a reliable estimate of marginal productivity around each data point. And this has adverse effects on the ability to find evidence of technical inefficiency. Indeed, in this case FDH or k = 12 fails to find any evidence of technical inefficiency.^{Footnote 12} These results help document why FDH does not provide a reasonable approach in the analysis of non-convexity.

One advantage of our approach is that it allows us to choose neighborhoods that satisfy our Rules R1 and R2. These rules seek a balance between finding evidence of technical inefficiency versus finding evidence of non-convexity. In our application, we believe that choosing k = 4 is a good choice: it is between k = 1 (corresponding to DEA) and k = 12 (corresponding to FDH). It identifies neighborhoods that are “not too large” to allow us to uncover evidence of non-convexity, and “not too small” to generate a more reliable estimate of the production technology around any data point. Interestingly, when k = 4, we still find evidence of technical inefficiency. Indeed, Table 2 reports mean estimates of technical inefficiency of 0.025 for small farms (with 62.2 % of small farms being technically efficient), 0.035 for medium farms (with 75.5 % of medium farms being technically efficient), and 0.003 for large farms (with most large farms being technically efficient).

In addition, Table 2 reports estimates of the non-convexity measure C_rv(z) given in Eq. (13). When k = 4, the mean estimates of C_rv(z) are 0.039 for small farms, 0.123 for medium farms, and 0.116 for large farms. For example, it means that, for medium farms, the effects of non-convexity amount to a 12.3 percent change in average outputs. These estimates indicate that the technology facing Korean farmers exhibits significant non-convexity. They also show that the extent of non-convexity is larger on medium and large farms (compared to small farms). As analyzed by Chavas and Kim (2007), non-convexity contributes to increasing the productivity benefits of specialization. This would indicate that large farms have stronger incentives to specialize than smaller farms. To our knowledge, this is the first evidence that non-convexity appears to vary with firm size.
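As a rough consistency check on the reported group means, assume that (13) defines the non-convexity measure as the gap C_rv(z) = D(z, T_v) − D(z, T ^*_rv ) (an assumption here, since (13) is not restated in this section). Using the DEA means and the k = 4 means reported above:

$$\begin{aligned} {\text{small farms:}}\quad & 0.063 - 0.025 = 0.038 \approx 0.039, \\ {\text{medium farms:}}\quad & 0.159 - 0.035 = 0.124 \approx 0.123, \\ {\text{large farms:}}\quad & 0.119 - 0.003 = 0.116. \\ \end{aligned}$$

The differences in the third decimal are of the size one would expect from rounding the reported means.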

Finally, we evaluate returns to scale under non-convexity. Using (16), we use our neighborhood-based representation T ^*_rv under VRS to evaluate scale efficiency SE_rv(z). The results are summarized in Table 3 for our three farm sizes. Recall that SE_rv(z) = 0 when point z is scale efficient, and SE_rv(z) > 0 implies a departure from CRS and measures the magnitude of scale inefficiency. The evidence against CRS is in general modest. Under DEA (obtained when r is large and k = 1), the average SE is 0.026 for small farms, 0.024 for medium farms, and 0.13 for large farms. Alternatively, under FDH (obtained when r is small and k = 12), all farms are found to be scale efficient (with all SE = 0). Using our neighborhood-based representation of technology with k = 4, the average SE is 0.02 for small farms, 0.041 for medium farms, and 0.030 for large farms.

Table 3 Scale efficiency SE_rs(z) under alternative representations of the technology, by farm size


These results have several implications. First, Korean farms exhibit a high level of scale efficiency. This is consistent with the dominant small-scale rice farming system commonly found in Korea. Second, introducing non-convexity affects the estimate of scale effects. Table 3 shows that the relationship between SE and k is not always monotonic. For example, in the case of medium farms, the average SE first rises then declines with k. This indicates that there is no general relationship between non-convexity and returns to scale. Yet, our results indicate that non-convexity matters in the analysis of scale effects. Indeed, Table 3 suggests that neglecting non-convexity (by using DEA) would generate “upward-biased” estimates of SE, while relying on FDH would likely generate “downward-biased” estimates of SE.^{Footnote 13} Finally, Table 3 indicates that these biases vary with farm size. In particular, the estimate of SE is found to be more sensitive to the choice of k for large farms. This is likely due to the fact that non-convexity effects are more important on large farms. This stresses the need to account for non-convexity in the evaluation of returns to scale. This also illustrates the usefulness of our approach in understanding and evaluating the technical and scale efficiency of firms under non-convexity.

8 Concluding remarks

This paper has presented a new nonparametric approach to the analysis of technology and productivity under non-convexity. Our approach relies on a neighborhood-based representation of technology. We investigate the general properties of our model and its use in the evaluation of technology and productivity under non-convexity. Our approach nests two well-known approaches as special cases: DEA, and FDH models. Yet either of these two approaches is overly restrictive: DEA because it does not allow for any non-convexity; and FDH because it allows for “too much” non-convexity. We argue that our new nonparametric model allows for non-convexity in a more flexible way. Its neighborhood-based representation of technology allows for non-convexity to arise in any part of the feasible set. In this context, we propose a measure capturing the extent of non-convexity. We also use our approach to evaluate scale efficiency under non-convexity. We show how our approach can be applied by solving simple optimization problems. Finally, we illustrate its usefulness through an empirical application to Korean farms. The empirical analysis shows how non-convexity can reduce the extent of technical inefficiency. It finds evidence that non-convexity is more common on large farms. Finally, it documents how non-convexity matters in the analysis of scale effects.

Note that our analysis could be extended in a number of directions. First, while our neighborhood-based approach provides a flexible way to investigate the presence of non-convex technology, there is a need for additional research exploring the implications of neighborhood choice for productivity and efficiency analysis. Second, exploring the statistical properties of our proposed efficiency estimator and investigating linkages with stochastic frontier analysis (e.g., Kumbhakar et al. 2007; Simar and Zelenyuk 2011) are good topics for further investigation. Third, the economics and management implications of non-convexity need to be examined in more detail. For example, evaluating the productivity effects of firm specialization is a good topic for further research. Fourth, there is a need for additional studies of the economic implications of non-convex technologies in a market equilibrium context (e.g., Chavas and Briec 2012). Finally, empirical applications to different industries are needed to uncover evidence of situations where non-convexity may be important.

Notes

The technology T is convex if, for any z and z' ∈ T, then (θ z + (1-θ) z') ∈ T for any scalar θ ∈ [0, 1].
Note that Boussemart et al. (2009) analyze returns to scale under general conditions, allowing the technology to be either convex or non-convex.
In the context of a statistical model, since fewer data points are used to evaluate the FDH frontier, Park et al. (2000) have shown that the rate of convergence of the efficiency estimator is slower under FDH than under DEA. This has led Jeong and Simar (2006) to propose a linearized version of FDH with better convergence properties.
For example, when p = 2, this corresponds to the Euclidean distance: D₂(z, z′) ≡ $\left[ \sum\nolimits_{j = 1}^{m} (|z_j - z_j'|/\sigma_j)^2 \right]^{1/2}$. And when p → ∞, this corresponds to the Chebyshev distance: lim_p→∞ D_p(z, z′) = Max_j {|z_j − z_j′|/σ_j: j = 1, …, m}.
The choice and evaluation of the neighborhood B_r(z, σ) will be further discussed in Sect. 6.3 below.
The directional distance function D(z, T) in (11) is the negative of Luenberger’s shortage function (see Luenberger 1995).
Note that D(z, T) includes as special cases many measures of technical inefficiency that have appeared in the literature. Relationships with Shephard’s distance functions (Shephard 1953) or Farrell’s measure of technical efficiency (Farrell 1957) are discussed in Chambers et al. (1996) and Färe and Grosskopf (2000).
Since dealing with non-linear constraints can be empirically challenging, note that alternative formulations have been proposed avoiding non-linear constraints in productivity analysis (e.g., Podinovski 2004; Leleu 2006; Soleimani-damaneh and Reshadi 2007; De Witte and Marques 2011). In this context, Briec et al. (2004) developed a much simpler enumeration approach to efficiency analysis under FDH.
An alternative way to choose the intervals would be to rely on the empirical distribution of netputs. In this context, one option would be to choose the intervals such that, for each netput, each interval includes the same number of sample observations.
Note that 1,000 won (the Korean currency) = 0.89 US dollars.
Farm size is measured by the total amount of land (in ha). Small farms are defined as farms in the 0–30 percentile range of the sample distribution of farm size, medium farms are between the 30th and 70th percentiles, and large farms are in the 70–100 percentile range. The average farm sizes of small, medium and large farms are 0.574, 1.624, and 5.965 ha, respectively.
Note that Wheelock and Wilson (2009) found a similar result in their analysis of bank efficiency. This does suggest that estimates of technical inefficiency reported in the literature are driven in part by the assumption of convexity.
Note that, since our model is not presented as a statistical model, our use of the term “bias” does not have a statistical meaning. Exploring the statistical properties of our proposed efficiency estimator appears to be a good topic for further research.

References

Afriat SN (1972) Efficiency estimation of production functions. Int Econ Rev 13:568–598
Agrell PJ, Tind J (2001) A dual approach to nonconvex frontier models. J Prod Anal 16:129–146
Agrell PJ, Bogetoft P, Tind J (2005) Efficiency evaluation with convex pairs. Adv Model Optim 7:211–237
Banker RD (1984) Estimating most productive scale size using data envelopment analysis. Eur J Oper Res 17:35–44
Banker RD, Charnes A, Cooper WW (1984) Some models for estimating technical and scale efficiencies in data envelopment analysis. Manage Sci 30:1078–1092
Banker RD, Cooper WW, Seiford LM, Thrall RM, Zhu J (2004) Returns to scale in different DEA models. Eur J Oper Res 154:345–362
Bogetoft P (1996) DEA on relaxed convexity assumptions. Manage Sci 42:457–465
Bogetoft P, Tama JM, Tind J (2000) Convex input and output projections of nonconvex production possibility sets. Manage Sci 46:858–869
Boussemart JP, Briec W, Peypoch N, Tavéra C (2009) α-Returns to scale and multi-output production technologies. Eur J Oper Res 197:332–339
Briec W, Liang QL (2011) On semilattice structures for production technologies. Eur J Oper Res 215:740–749
Briec W, Kerstens K, Vanden Eeckaut P (2004) Non-convex technologies and cost functions: definitions, duality and nonparametric tests of convexity. J Econ 181:155–192
Chambers RG, Chung Y, Färe R (1996) Benefit and distance functions. J Econ Theor 70:407–419
Chang KP (1999) Measuring efficiency with quasiconcave production functions. Eur J Oper Res 115:497–506
Chavas JP, Briec W (2012) On economic efficiency under non-convexity. Econ Theor 50:671–701
Chavas JP, Kim K (2007) Measurement and sources of economies of scope. J Inst Theor Econ 163:411–427
Cook WD, Seiford LM (2009) Data envelopment analysis (DEA): thirty years on. Eur J Oper Res 192:1–17
De Witte K, Marques RC (2011) Big and beautiful? On non-parametrically measuring scale economies in non-convex technologies. J Prod Anal 35:213–226
Deprins D, Simar L, Tulkens H (1984) Measuring labor efficiency in post offices. In: Marchand M, Pestieau P, Tulkens H (eds) The performance of public enterprises: concepts and measurements. North-Holland, Amsterdam, pp 243–267
Färe R, Grosskopf S (2000) Theory and application of directional distance functions. J Prod Anal 13:93–103
Färe R, Grosskopf S, Lovell CAK (1994) Production frontiers. Cambridge University Press, Cambridge
Farrell M (1957) The measurement of productive efficiency. J R Stat Soc 120:253–281
Jeong SO, Simar L (2006) Linearly interpolated FDH efficiency score for nonconvex frontiers. J Multivar Anal 97:2141–2161
Kerstens K, Vanden Eeckaut P (1999) Estimating returns to scale using non-parametric deterministic technologies: a new method based on goodness-of-fit. Eur J Oper Res 113:206–214
Kumbhakar SC, Park BU, Simar L, Tsionas EG (2007) Nonparametric stochastic frontiers: a local likelihood approach. J Econ 137:1–27
Leleu H (2006) A linear programming framework for free disposal hull technologies and cost functions: primal and dual models. Eur J Oper Res 168:340–344
Leleu H (2009) Mixing DEA and FDH models together. J Oper Res Soc 60:1730–1737
Luenberger D (1995) Microeconomic theory. McGraw-Hill Inc, New York
Park BU, Simar L, Weiner C (2000) The FDH estimator for productivity efficiency scores: asymptotic properties. Econ Theor 16:855–877
Petersen NC (1990a) Data envelopment analysis on a relaxed set of assumptions. Manage Sci 36:214–305
Petersen NC (1990b) Data envelopment analysis on a relaxed set of assumptions. Manage Sci 36:305–314
Podinovski VV (2004) On the linearization of reference technologies for testing returns to scale in FDH models. Eur J Oper Res 152:800–802
Podinovski VV (2005) Selective convexity in DEA models. Eur J Oper Res 161:552–563
Ray SC (2004) Data envelopment analysis: theory and techniques for economics and operations research. Cambridge University Press, Cambridge
Shephard RW (1953) Cost and production functions. Princeton University Press, Princeton
Simar L, Zelenyuk V (2011) Stochastic FDH/DEA estimators for frontier analysis. J Prod Anal 36:1–20
Soleimani-damaneh M, Reshadi M (2007) A polynomial-time algorithm to estimate returns to scale in FDH models. Comput Oper Res 34:2168–2176
Tulkens H (1993) On FDH efficiency analysis: some methodological issues and applications to retail banking, courts and urban transit. J Prod Anal 4:183–210
Varian HR (1984) The nonparametric approach to production analysis. Econometrica 52:579–597
Wheelock DC, Wilson PW (2009) Robust nonparametric quantile estimation of efficiency and productivity change in US commercial banking, 1985–2004. J Bus Econ Stat 27:354–368

Acknowledgments

We would like to thank two anonymous reviewers and the Editor for useful comments on an earlier draft of the paper. This research was supported by a grant from the National Foundation of Korea (Government of Korea, grant NRF-2013S1A5A2A01018948) and by a USDA Hatch grant, College of Agricultural and Life Sciences, University of Wisconsin, Madison.

Author information

Authors and Affiliations

University of Wisconsin, Taylor Hall, Madison, WI, 53706, USA
Jean-Paul Chavas
Seoul National University, Seoul, Korea
Kwansoo Kim

Corresponding author

Correspondence to Jean-Paul Chavas.

Appendix

See Table 4.

Table 4 Technical inefficiency D(z, T) for each farm under T_FDHv, T_v and T^*_rv

About this article

Cite this article

Chavas, JP., Kim, K. Nonparametric analysis of technology and productivity under non-convexity: a neighborhood-based approach. J Prod Anal 43, 59–74 (2015). https://doi.org/10.1007/s11123-014-0383-1

Published: 02 February 2014
Issue Date: February 2015
DOI: https://doi.org/10.1007/s11123-014-0383-1
