1 Introduction

Since the pioneering work of Aigner et al. (1977), stochastic frontier (SF) analysis has been widely used in productivity and efficiency studies to describe and estimate production, cost, and profit frontier models. Such analysis typically assumes that a decision-making unit (DMU) employs a single production process or technology in a production of an output using multi-inputs. Most of the econometric techniques have focused on the estimation and interpretation of a single stochastic frontier regression with the composite error on inefficiency and statistical noise. There has been, however, very little discussion in the stochastic frontier literature to extend the single SF regression to multiple SF regressions settings, which would allow for the possibility of correlation among composite errors across SF regressions. The motivation behind the extension to system of SF regressions is analogous to the classical generalization to system of seemingly unrelated regressions (Zellner 1962).

Organizations or DMUs often operate multiple production divisions or sub-DMUs, with each division supported by its own set of resource inputs. A tourist hotel, for example, may provide services of accommodation and restaurants under separate divisions within the same DMU hotel management. Given that these divisions would still belong to the same DMU, they may be subject to the same random shocks as the parent DMU; and given that these divisions would share some commonly observed or unobserved characteristics of the parent DMU, the divisions’ technical efficiencies may well be correlated. In this circumstance, a system of multiple stochastic frontier regressions on the sub-DMUs is a more appropriate representation of a DMU’s operation and performance.

In this paper, the stochastic frontier model under consideration focuses on the correlation among a set of individual SF regressions, and still assumes independence between the error components: the statistical noise and the inefficiency component. The correlation may come from the correlated statistical noises or the correlated inefficiency components, or both. This pattern would therefore seem to suggest that the estimation of these stochastic frontier regressions should take into account the level of mutual dependency among them, as opposed to the regression-by-regression estimation.

Lately, several studies have used the copula method in studying of stochastic frontier models. Amsler et al. (2011) use copulas to model time dependence in stochastic frontier models. Shi and Zhang (2011) consider a copula regression model for dependence in long-tail distributions. Carta and Steel (2010) suggest using copulas in modeling multi-output stochastic frontiers. Various types of dependence of SF models are investigated with utilizing the copula method. Smith (2008), on the other hand, proposes a copula function to link the marginal distributions of correlated components v and u in order to construct the distribution of the composite error ɛ.

Copulas, originally introduced by Sklar (1959), provide a useful method of deriving joint distribution of a set of random variables given the univariate margins, especially in the case of nonnormal margins of the SF composite errors. However, the copula-based likelihood function of the joint SF regressions requires the computation of the cumulative distribution function (CDF) of the composite error. Such computation typically requires numerical integration procedures, which proves to be onerous. To facilitate the computation of the copula-based maximum likelihood estimation, we follow Tsay et al. (2012) in deriving an easy to implement and accurate closed-form formula for computing the CDF of the composite error.

A demonstration of our approach is provided through the analysis of a set of two SF regressions. The consequences of ignoring the dependence between the two SF regressions are examined via a Monte Carlo simulation, where both efficiency and bias in estimation are investigated under various degrees of correlation. The findings of the simulated experiment suggest that the efficiency gain in estimation from the system of SF regressions over the regression-by-regression increases significantly as the correlated composite errors become stronger. The efficiency gain is particularly significant in estimating the variances of the symmetric and one-sided errors, which is crucial in predicting the measures of the technical efficiency.

Finally, a two-SF regressions model is applied to an empirical study of the hotel industry in Taiwan, focusing on the production efficiency in accommodation and restaurant divisions. Our results show that both the estimated production frontiers and technical efficiencies differ significantly between the system estimation and the regression-by-regression estimations. They suggest that ignoring the interdependence in operations of the two divisions may cause a downward biased estimator of the technical efficiency.

While we apply a set of SF regressions to the divisions or sub-DMUs within a DMU, the accommodation and restaurant divisions of 50 international grand hotels, we recognize that the procedure is more generally applicable. For example, it can be applied to the SF analysis of temporal cross-section data where the “sub-DMUs” are the departments of a bank, or an efficiency study of the airline industry where the “sub-DMUs” are firms with correlated composite errors. The application may extend to macroeconomic settings where the “sub-DMUs” are countries within the same economic region or block.

The plan of the paper is as follows. In Sect. 2 we describe a multiple stochastic frontier regressions model with correlated composite errors. Section 3 proposes a copula-based joint maximum likelihood estimation of the frontier regressions. We then turn to the computation issue of evaluating the Gaussian copula-based likelihood function in Sect. 4. The comparison of the proposed joint multiple regressions estimation versus the separated regression-by-regression is studied via a Monte Carlo simulation in Sect. 5. An empirical study of the hotel industry in Taiwan is given in Sect. 6. Lastly, a summary conclusion is given in Sect. 7.

2 A multiple stochastic frontier regressions model

Suppose the production operations of a DMU consist of J sub-DMUs (divisions), each of which produces a single output under the Cobb-Douglas type of production technology. The production frontiers of the J sub-DMUs are represented by a set of SF regressions,

$$ \begin{gathered} y_{1i} = x_{1i}^{\text{T}} \beta_{1} \, + v_{1i} - u_{1i} , \hfill \\ y_{2i} = x_{2i}^{\text{T}} \beta_{2} + v_{2i} - u_{2i} ,\quad i = 1, 2, \ldots ,N \hfill \\ \vdots \hfill \\ y_{Ji} = x_{Ji}^{\text{T}} \beta_{J} + v_{Ji} - u_{Ji} , \hfill \\ \end{gathered} $$
(1)

where y ji and x ji respectively denote the log output and the log inputs of the jth sub-DMU (j = 1, 2, …, J) of the DMU i; \( v_{ji} \sim N(0,\sigma_{{v_{j} }}^{2} ) \) and \( u_{ji} \sim N^{ + } (0,\sigma_{{u_{j} }}^{2} ) \) respectively represent the noise component and the non-negative inefficiency component. For a given j, the sub-DMU’s v ji and u ji are assumed to be mutually independent. Thus, for any given SF regression, the noise component is uncorrelated with the inefficiency component, which is a standard assumption in a single stochastic frontier modeling.Footnote 1

When the composite error is defined as \( \varepsilon_{ji} = v_{ji} - u_{ji} \), the correlations among the composite errors ɛ ji are the consequence of the correlation in v ji and the correlation in u ji . Across the SF regressions in (1), u ji and u si are correlated due to the sharing of the same common characteristics among the J sub-DMUs within a DMU. Similarly, across the SF regressions, v ji and v si are allowed to be correlated for j ≠ s, possibly due to common stochastic shocks to the DMU and its sub-DMUs. Conversely, however, in our model, the correlation of the composite errors is not decomposable to the correlation in v ji or to the correlation in u ji . Nevertheless, we should be able to show that the SF regression-by-regression estimation of (1) is less efficient than the system estimation, especially for \( \sigma_{{v_{j}}}^{2}\) and \( \sigma_{{u_{j} }}^{2} \).

In estimating a standard single sector’s stochastic frontier (SF) regression model using the maximum likelihood (ML) approach, it is often assumed that the noise component v ji follows a normal distribution, and that the inefficiency component u ji follows a half-normal (or truncated-normal) distribution. In this case, the derivations of the probability density function (PDF) of the composite error ɛ ji and the ML estimator of a single SF regression are relatively straightforward. However, computation of the likelihood function in the joint estimation of (1) with correlated composite errors is sometimes tedious in the analytic form if no further simplified assumptions are imposed. For instance, Wang and Ho (2010) assume the scaling property in the distribution of u to simplify the model. Pitt and Lee (1981) use a multivariate truncated normal distribution but the computation of likelihood function involves T-dimensional integrals, which requires numerical approximation and the precision of the approximation may be an issue. We therefore propose the copula approach as the means of constructing the joint probability of \( \varepsilon_{1i} ,\ldots,\varepsilon_{Ji} \) in (1). In Sect. 3 we begin with some basics on the copula, and then construct the likelihood function of the system of SF model by the copula approach.

3 Copula and the ML estimator

Let \( \theta_{j} = (\beta_{j}^{\text{T}} ,\sigma_{vj} ,\sigma_{uj} )^{\text{T}} \) be a vector of parameters in the jth SF regression, and \( F_{j} (\varepsilon_{ji} ) = F(\varepsilon_{ji} ;\theta_{j} ) \) be the cumulative distribution function (CDF) of the composite error of the jth SF regression with the vector of parameters θ j . In order to simplify the notation in our following analysis, we suppress θ j in the marginal CDF \( F(\varepsilon_{ji} ;\theta_{j} ) \) and use the subscript j to indicate the jth SF regression, so \( F_{j} (\varepsilon_{ji} ) \) will be used instead in our following analysis.

In deriving the joint CDF of J random variables, we rely on the results of Sklar’s theorem (1959), and Schweizer and Sklar (1983), in which the joint CDF can be expressed as a function of its own one-dimensional margins. The function is the copula. More specifically, the joint CDF of the composite errors \( \varepsilon_{i} = \left( {\varepsilon_{1i} ,\ldots,\varepsilon_{Ji} } \right) \) from the J SF regressions can be represented by a J-copula function C(·),

$$ F\left( {\varepsilon_{1i} ,\ldots,\varepsilon_{Ji} } \right) = C\left( {F_{1} (\varepsilon_{1i} ), \ldots ,F_{J} (\varepsilon_{Ji} );\rho } \right) $$
(2)

where ρ is a vector of parameters of the copula called the dependence parameter, which measures dependence between the marginal CDFs. Thus, the joint distribution of the composite errors \( \varepsilon_{i} = \left( {\varepsilon_{1i} ,\ldots,\varepsilon_{Ji} } \right) \) is expressed in terms of the marginal distributions, and the copula function binds them together. The advantage of copula function representation in (2) is that, while each SF regression in (1) corresponds to a separate \( F_{j} (\varepsilon_{ji} ) \), it allows us to identify the dependence among the multiple SF regressions through the dependence in their marginal CDFs. Therefore, the copula approach is an appropriate technique of studying the correlated stochastic frontier regressions. It has been shown that the copula function C(·) is unique if \( F_{1} ( \cdot ),\ldots,F_{J} ( \cdot ) \) are all continuous.Footnote 2 Furthermore, since \( 0 \le F_{j} (\varepsilon_{ji} ) \le 1 \), the copula function can be viewed as multivariate distribution of uniform U[0, 1] variables with the dependence parameter ρ.

In general, different copula functions capture different dependence structures. One special case is the product copula where \( C\left( {F_{1} (\varepsilon_{1i} ), \ldots ,F_{J} (\varepsilon_{Ji} );\rho } \right) = \prod\nolimits_{j = 1}^{J} {F_{j} (\varepsilon_{ji} )} \). It follows then, from (2), that \( F\left( {\varepsilon_{1i} ,\ldots,\varepsilon_{Ji} } \right) = \prod\nolimits_{j = 1}^{J} {F_{j} } \left( {\varepsilon_{ji} } \right) \). Hence, the product copula implies the mutual independence among ɛ 1i , …, ɛ Ji , or the independence among the J SF regressions.

For the maximum likelihood estimation, the joint PDF, f(ɛ 1i , …, ɛ Ji ), instead of the joint CDF, F(ɛ 1i , …, ɛ Ji ), is required. By taking the derivatives of (2) with respect to ɛ 1i , …, ɛ Ji , the corresponding joint PDF is obtained,

$$ f\left( {\varepsilon_{1i} ,\ldots,\varepsilon_{Ji} } \right) = c\left( {F_{1} \left( {\varepsilon_{1i} } \right), \ldots,F_{J} \left( {\varepsilon_{Ji} } \right);\rho } \right) \times \mathop \Uppi \limits_{j = 1}^{J} f_{j} \left( {\varepsilon_{ji} } \right), $$
(3)

where \( c\left( {F_{1} \left( {\varepsilon_{1i} } \right), \ldots ,F_{J} \left( {\varepsilon_{Ji} } \right);\rho } \right) = \tfrac{{\partial^{J} C\left( {F_{1} \left( {\varepsilon_{1i} } \right), \ldots ,F_{J} \left( {\varepsilon_{Ji} } \right);\rho } \right)}}{{\partial F_{1} \left( {\varepsilon_{1i} } \right), \ldots ,\partial F_{J} \left( {\varepsilon_{Ji} } \right)}} \) is the copula density and f j (ɛ ji ) is the marginal PDF. In the case of product copula, we have c(·) = 1, and it follows from (3) that the joint PDF is the product of the marginal PDFs.

Several multivariate copulas can be used in (2) under this framework. Further extensions of the proposed approach to other copula functions should be able to follow the same procedure with a similar calculation. For instance, the multivariate Student’s t copula, Archimedean copula, Gumble n-coplua, and Clayton n-copula, see Cherubini et al. (2004) for a systematic review of the copula functions.

For the joint estimation of the multiple SF regressions in (1), we use the Gaussian copula to derive the J-dimensional distribution function F(ɛ 1i , …, ɛ Ji ) in (2). The Gaussian copula takes the formFootnote 3

$$ \begin{aligned} C\left( {\gamma_{1i} , \ldots ,\gamma_{Ji} ;\Upomega } \right) = & \Upphi_{J} \left( {\Upphi^{ - 1} (\gamma_{1i} ), \ldots ,\Upphi^{ - 1} (\gamma_{Ji} );\Upomega } \right), \\ = & \int\limits_{ - \infty }^{{\Upphi^{ - 1} (\gamma_{1i} )}} \ldots \int\limits_{ - \infty }^{{\Upphi^{ - 1} (\gamma_{Ji} )}} {\frac{1}{{\left( {2\pi } \right)^{J/2} |\Upomega |^{1/2} }}e^{{ - \frac{1}{2}z^{\text{T}} \left( {\Upomega^{ - 1} } \right)z}} dz_{1} \ldots dz_{J} } \\ \end{aligned} , $$
(4)

where Φ(·) is the CDF of the standard normal distribution, and Φ J (·) is the CDF of a standard J-variate normal distribution of the random variables with the J × J correlation matrix Ω = [Ω js ], i.e., the diagonal elements are Ω jj  = 1, and the off-diagonal elements Ω js are the correlation coefficients between two variables, Φ −1(γ ji ) and Φ −1(γ si ). The corresponding Gaussian copula density of (4) is

$$ c\left( {\gamma_{1i} , \ldots ,\gamma_{Ji} ;\Upomega } \right) = \frac{1}{{|\Upomega |^{1/2} }}e^{{ - \frac{1}{2}\zeta_{i}^{\text{T}} \left( {\Upomega^{ - 1} - I_{J} } \right)\zeta_{i} }} , $$
(5)

where \( \zeta_{i} = \left( {\Upphi^{ - 1} (\gamma_{1i} ),\ldots,\Upphi^{ - 1} (\gamma_{Ji} )} \right)^{\text{T}} \) and I J is a J × J identity matrix.

Replacing γ ji  = F j (ɛ ji ) in (4), the joint CDF of the composite errors in (2) becomes,

$$ F\left( {\varepsilon_{1i} , \ldots ,\varepsilon_{Ji} } \right) = \Upphi_{J} \left( {\Upphi^{ - 1} \left( {F_{1} (\varepsilon_{1i} )} \right), \ldots ,\Upphi^{ - 1} \left( {F_{J} (\varepsilon_{Ji} )} \right);\Upomega } \right). $$
(6)

The corresponding joint PDF of the composite errors in (3) becomes,

$$ \begin{aligned} f\left( {\varepsilon_{1i} , \ldots ,\varepsilon_{Ji} } \right) = & c\left( {F_{1} \left( {\varepsilon_{1i} } \right), \ldots ,F_{J} \left( {\varepsilon_{Ji} } \right);\Upomega } \right) \times \mathop \Uppi \limits_{j = 1}^{J} f_{j} \left( {\varepsilon_{ji} } \right) \\ = & \frac{1}{{|\Upomega |^{1/2} }}e^{{ - \frac{1}{2}\zeta_{i}^{\text{T}} \left( {\Upomega^{ - 1} - I_{J} } \right)\zeta_{i} }} \times \mathop \Uppi \limits_{j = 1}^{J} f_{j} \left( {\varepsilon_{ji} } \right) \\ \end{aligned} $$
(7)

where \( \zeta_{i} = \left({\Upphi^{- 1} \left({F_{1} (\varepsilon_{1i})} \right),\ldots,\Upphi^{- 1} \left({F_{J} (\varepsilon_{Ji})} \right)} \right)^{\text{T}}\). Note that the off-diagonal elements of Ω measure the correlation coefficients between two variables, \( \Upphi^{ - 1} \left( {F_{j} (\varepsilon_{ji} )} \right) \) and \( \Upphi^{ - 1} \left( {F_{s} (\varepsilon_{si} )} \right) \). Thus, if the off-diagonal elements of Ω are all zeros, Ω becomes a J × J identity matrix, i.e., Ω = I J . In this case, the Gaussian copula density c(·) = 1 and \( f\left( {\varepsilon_{1i} ,\ldots,\varepsilon_{Ji} } \right) = \mathop \Uppi \limits_{j = 1}^{J} f_{j} \left( {\varepsilon_{ji} } \right) \), which implies the mutual independence of the composite errors ɛ i  = (ɛ 1i , …, ɛ Ji ), and hence the mutual independence of the SF regressions in (1).

Here, the dependence parameter ρ in (2) is defined as the column vector obtained by stacking the column vectors of the off-diagonal lower triangular matrix of Ω in (4). Therefore, based on (7) we may write the log-likelihood function of the J multiple SF regressions of (1) as

$$ \begin{aligned} \ln L(\theta ) = & \sum\limits_{i = 1}^{N} {\ln f\left( {\varepsilon_{1i} , \ldots ,\varepsilon_{Ji} } \right)} , \\ = & \sum\limits_{i = 1}^{N} {\ln c\left( {F_{1} \left( {\varepsilon_{1i} } \right), \ldots ,F_{J} \left( {\varepsilon_{Ji} } \right);\Upomega } \right)} + \sum\limits_{i = 1}^{N} {\sum\limits_{j = 1}^{J} {\ln f_{j} \left( {\varepsilon_{ji} } \right)} } \\ \end{aligned} $$
(8a)
$$ \quad = - \frac{N}{2}\ln |\Upomega | - \frac{1}{2}\sum\limits_{i = 1}^{N} {\zeta_{i}^{\text{T}} \left( {\Upomega^{ - 1} - I_{J} } \right)\zeta_{i} } + \sum\limits_{i = 1}^{N} {\sum\limits_{j = 1}^{J} {\ln f_{j} \left( {\varepsilon_{ji} } \right)} } , , $$
(8b)

where \( \theta = \left( {\theta_{1}^{\text{T}} ,\ldots,\theta_{J}^{\text{T}} ,\rho^{\text{T}} } \right)^{\text{T}} \) and θ j ’s are vectors of parameters of the jth SF regression. Therefore, the ML estimator of θ is defined as

$$ \hat{\theta } = \arg \mathop {\max }\limits_{\theta \in \Uptheta } \ln L(\theta ), $$

where Θ denotes the parameter space of θ. Under the regularity conditions for the asymptotic maximum likelihood theory, the ML estimator can be shown to be consistent, asymptotic efficient and asymptotic normal,Footnote 4 that is,

$$ \sqrt N \left( {\hat{\theta } - \theta_{0} } \right) \to N\left( {0,I^{ - 1} (\theta_{0} )} \right), $$

where I(θ 0) is the usual Fisher’s information matrix and \( \theta_{0} = \left( {\theta_{1,0} ,\ldots,\theta_{J,0} ,\rho } \right) \) denotes the vector of true parameters.

The objective function in (8a) and (8b) give us further insight on the difference in the estimators by separate estimation and joint estimation. The first term in (8a) captures the correlations between the equations; while the second term is the log-likelihood function obtained by separate regression. Although the separate regression may still give consistent estimates and also valid standard errors under correctly specified marginal densities; however, the standard errors are inefficient. Given the marginal specification, maximizing (8a) gives consistent and more efficient ML estimator if the correct copula density is included.

Empirically, we never know if the copula density is true; therefore, the log-likelihood function may be referred as the log quasi-likelihood function. The derived quasi-maximum likelihood (QML) estimator, which is still consistent (in the sense of the best approximation) but has an invalid standard error. Compared with the separate regression, we still have efficiency gain. However, in this case it is necessary to compute the “sandwich” estimator in order to obtain the correct standard errors.

Some existing tools are helpful in choosing a correctly specified copula. For instance, the goodness of fit tests reviewed in Genest et al. (2009) or the moment test suggested by Amsler et al. (2011). Alternatively, if there are many candidate copula functions, the commonly used model selection criteria, such the Akaike information criterion (AIC) or Bayesian information criterion (BIC) can be also used in deciding the proper copula. See Trivedi and Zimmer (2005) for more discussion on the model selection criteria. For a further consideration, it seems that we are able to implement the Hausman test by comparing the difference in the estimators of separate and joint estimation.Footnote 5 If the copula function is misspecified, the two estimators will have different probability limits.

The model specification test devised by Hausman (1987) can be applied to test for the specification of copulas. The essential idea of Hausman (1987) is that the covariance of an efficient estimator with its difference from an inefficient estimator is zero, i.e., \( Cov\left( {\hat{\theta }_{S} - \hat{\theta }_{J} ,\hat{\theta }_{J} } \right) = 0 \), which suggests that

$$ Var\left( {\hat{\theta }_{S} - \hat{\theta }_{J} } \right) = Var\left( {\hat{\theta }_{S} } \right) - Var\left( {\hat{\theta }_{J} } \right), $$

where the subscript S and J denote separate and joint estimation, respectively. Therefore, the Hausman’s Chi-squared test statistics is

$$ W = \left( {\hat{\theta }_{S} - \hat{\theta }_{J} } \right)^{\text{T}} Var\left( {\hat{\theta }_{S} - \hat{\theta }_{J} } \right)^{ - 1} \left( {\hat{\theta }_{S} - \hat{\theta }_{J} } \right)\sim \chi^{2} (k), $$

where k is the number of common parameters in the separate and joint estimation.

Finally, some measures of the dependence structure between the two SF regressions can be obtained by the transformations from the Gaussian copula parameter matrix Ω = [Ω js ]. Taking the transformation according to the distribution function,Footnote 6 one can show that the linear correlation between F j (ɛ ji ) and F s (ɛ si ) is

$$ \rho_{{F_{j} ,F_{s} }} = \frac{6}{\pi }\arcsin \left( {\frac{{\Upomega_{js} }}{2}} \right), $$

which measures the correlation of the two SF regressions (j and s) in terms of the CDF of \( \varepsilon_{ji} \;{\text{and}}\;\varepsilon_{si} \) and is also called the Spearman’s rank correlation coefficient of \( \varepsilon_{ji}\,{\text{and}}\,\varepsilon_{si} \).

Alternatively, another dependence measure is concordance. For example, two observations \( (\varepsilon_{{1i_{1} }} ,\varepsilon_{{2i_{1} }} ) \) and \( (\varepsilon_{{1i_{2} }} ,\varepsilon_{{2i_{2} }} ) \) of a pair \( (\varepsilon_{1i} ,\varepsilon_{2i} ) \) of composite errors are concordant if both values of one pair are greater than the corresponding values of the other pair, that is if \( \varepsilon_{{1i_{1} }} > \varepsilon_{{1i_{2} }} \) and \( \varepsilon_{{2i_{1}}} > \varepsilon_{{2i_{2}}} \), or \( \varepsilon_{{1i_{1} }} < \varepsilon_{{1i_{2} }} \) and \( \varepsilon_{{2i_{1} }} < \varepsilon_{{2i_{2} }} \). The discordance is defined in the opposite way; in other words, ɛ 1i and ɛ 2i are said to be discordant if \( \varepsilon_{{1i_{1}}} > \varepsilon_{{1i_{2}}} \) and \( \varepsilon_{{2i_{1} }} < \varepsilon_{{2i_{2} }} \), or \( \varepsilon_{{1i_{1} }} < \varepsilon_{{1i_{2} }} \) and \( \varepsilon_{{2i_{1} }} > \varepsilon_{{2i_{2} }} \). The measure of the concordance is also called the Kendall’s coefficient, which is defined as

$$ \tau_{12} = \Pr \left( {\left( {\varepsilon_{{1i_{1} }} - \varepsilon_{{1i_{2} }} } \right)\left( {\varepsilon_{{2i_{1} }} - \varepsilon_{{2i_{2} }} } \right) > 0} \right) - \Pr \left( {\left( {\varepsilon_{{1i_{1} }} - \varepsilon_{{1i_{2} }} } \right)\left( {\varepsilon_{{2i_{1} }} - \varepsilon_{{2i_{2} }} } \right) < 0} \right). $$

Intuitively, Kendall τ measures the difference between the probability of concordance and that of discordance for two composite errors. It can be shown that Kendall τ can also be transformed from the Gaussian copula parameter Ω. The concordance between ɛ ji and ɛ si of the two SF regressions (j and s) is

$$ \tau_{js} = \frac{2}{\pi }\arcsin \left( {\Upomega_{js} } \right). $$

Although we may obtain certain dependence measures of the SF regressions through the direct transformation of the copula parameter Ω, the interpretation of the measure is limited to the composite errors. How much correlation between ɛ ji and ɛ si is attributed to (v ji , v si ) or (u ji , u si ) cannot be further identified.

4 Computation issue

The maximum likelihood estimation in (8) requires the evaluation of the PDF \( f_{j} \left( {\varepsilon_{ji} } \right) \) and the inverse (quantile) of the distribution function F j (ɛ ji ) in the vector \( \zeta_{i} = \left( {\Upphi^{ - 1} \left( {F_{1} (\varepsilon_{1i} )} \right),\dots,\Upphi^{ - 1} \left( {F_{J} (\varepsilon_{Ji} )} \right)} \right)^{\text{T}} \). As shown by Aigner et al. (1977), the PDF of the composite error ɛ ji is

$$ f_{j} \left( {\varepsilon_{ji} } \right) = \frac{2}{{\sigma_{j} }}\phi \left( {\frac{{\varepsilon_{ji} }}{{\sigma_{j} }}} \right)\Upphi \left( { - \frac{{\varepsilon_{ji} \lambda_{j} }}{{\sigma_{j} }}} \right), $$
(9)

where \( \sigma_{j} = \sqrt {\sigma_{vj}^{2} + \sigma_{uj}^{2} } \), \( \lambda_{j} = \frac{{\sigma_{uj} }}{{\sigma_{vj} }} \), and ϕ(.) and Φ(.) are the standard normal PDF and CDF, respectively. The computation of the PDF f j (ɛ ji ) is relatively easy and routine. However, the computation of CDF F j (ɛ ji ) is not trivial. Since the closed form evaluation of CDF does not exist, it typically requires some numerical integration procedures or the use of simulation to approximate the integration (Greene 2003, 2010) in computing F j (ɛ ji ). Recently Tsay et al. (2012) have suggested a method utilizing some mathematical approximation functions to obtain the closed form of the CDF of ɛ ji  = v ji  + u ji , and shown that the approximation error is less than 10−5. Here, we follow a similar computation procedure and derive the closed form for the case of ɛ ji  = v ji  − u ji .

For simplicity of presentation, the subscripts of variables and function are omitted. Given the PDF f(ɛ) in (9), the corresponding CDF F(Q) of ɛ at point Q is

$$ \Pr \left( {\varepsilon \le Q} \right) = F(Q) = \frac{2}{\sigma }I(Q), $$
(10)

where

$$ I(Q) = \int\limits_{ - \infty }^{{^{Q} }} \left( {\int\limits_{ - \infty }^{{ - \tfrac{\varepsilon \lambda }{\sigma }}} \phi \left( \xi \right)d\xi } \right)\phi \left( {\frac{\varepsilon }{\sigma }} \right)d\varepsilon . $$

It follows that the evaluation of the CDF of ɛ requires the computation of I(Q).Define a sign function

$$ {\text{sign}}(\xi ) = \left\{ {\begin{array}{*{20}c} {1,} & {{\text{if}}\;\xi > 0;} \\ {0,} & {{\text{if}}\;\xi = 0;} \\ { - 1,} & {{\text{if}}\;\xi < 0.} \\ \end{array} } \right. $$

and an error function

$$ {\text{erf}}\left( \xi \right) = 2\int\limits_{0}^{{\sqrt {2\pi } }} {\phi (t)dt = 2\Upphi (\sqrt 2 \xi ) - 1.} $$

Let \( a = - \tfrac{\lambda }{\sigma }, \), \( b = \tfrac{1}{\sigma } \). Following the approach of Tsay et al. (2009), we may show that the function I(Q) can be approximated by

$$ \begin{aligned} I(Q) & & \approx \frac{1}{2b}\left[ {1 + \frac{{{\text{erf}}\left( {\tfrac{bQ}{\sqrt 2 }} \right)}}{2b} \cdot \left( {\frac{{1 - {\text{sign}}\left( Q \right)}}{2}} \right)} \right] \\ & \quad - \frac{1}{{4\sqrt {b^{2} - a^{2} c_{2} } }}{ \exp }\left( {\frac{{a^{2} c_{1}^{2} }}{{4\left( {b^{2} - a^{2} c_{2} } \right)}}} \right)\left[ {1 + {\text{erf}}\left( {\frac{{ - ac_{1} - \sqrt 2 Q\left( {b^{2} - a^{2} c_{2} } \right){\text{sign}}(Q)}}{{2\sqrt {b^{2} - a^{2} c_{2} } }}} \right)} \right], \\ \end{aligned} $$
(11)

where c 1 and c 2 are known constants.Footnote 7 Further details about the derivation of (11) can be found in Tsay et al. (2012).

5 A Monte Carlo simulation

In this section we conduct a Monte Carlo simulation based on a two SF regressions model. The data-generating process (DGP) is set as the following

$$ \begin{gathered} {\text{Regression}}\, 1\, (F_{1} ) :\quad y_{1} = \beta_{10} + \beta_{11} x_{1} + \varepsilon_{1} , \hfill \\ {\text{Regression}}\, 2\, (F_{2} ) :\quad y_{2} = \beta_{20} + \beta_{21} x_{2} + \varepsilon_{2} , \hfill \\ \end{gathered} $$
(12)

where \( \varepsilon_{j} = v_{j} - u_{j},j = 1,2 \). The parameters are set as \( \beta_{10} = 1, \) β 11 = 1, β 20 = 0.5, and β 21 = 1.5. The distributions of the symmetric stochastic errors are assumed to be normal with unequal variances, i.e., \( v_{j} \sim N(0,\sigma_{vj}^{2} ) \), the one-sided errors are assumed to follow the half normal distributions, i.e., \( u_{j} \sim N^{ + } (0,\sigma_{uj}^{2} ) \), and the corresponding transformed parameters specified in (12) are \( \sigma_{j} = \sqrt {\sigma_{vj}^{2} + \sigma_{uj}^{2} } \) and \( \lambda_{j} = \frac{{\sigma_{uj} }}{{\sigma_{vj} }} \). In our following simulations, three different settings of the parameters about \( \sigma_{uj}^{2} {\text{ and }}\sigma_{vj}^{2} \) are specified and summarized below:

Case 1

\( \sigma_{v1}^{2} = 1.6, \) \( \sigma_{u1}^{2} = 0.4 \),\( \sigma_{v2}^{2} = 5.55 \) and\( \sigma_{u2}^{2} = 0.5 \). The transformed parameters are σ 1 = 1.4142, σ 2 = 2.4597, and λ 1 = 0.5, λ 2 = 0.3.

Case 2

σ 2 v1  = 0.15, σ 2 u1  = 0.4,σ 2 v2  = 0.2 and σ 2 u2  = 0.5. The transformed parameters are σ 1 = 0.7416, σ 2 = 0.8367, and λ 1 = 1.63, λ 2 = 1.58.

Case 3

σ 2 v1  = 0.025, σ 2 u1  = 0.4, σ 2 v2  = 0.056 and σ 2 u2  = 0.5. The transformed parameters are σ 1 = 0.6519, σ 2 = 0.7457, and λ 1 = 4, λ 2 = 2.99.

Among these cases, we have set, λ j , the ratios of \( \sigma_{uj} {\text{and}}\sigma_{vj} \), increasing from Case 1 to Case 3 in order to demonstrate the effect of λ on the estimates of separate and joint estimation. With known σ 1, σ 2, λ 1, and λ 2, we are able to find F 1(ɛ 1) and F 2(ɛ 2) for any given \( \varepsilon_{1}\,{\text{and}}\,\varepsilon_{2} \).

We assume that the dependence between the composite errors, ɛ 1 and ɛ 2, and hence the dependence between the dependent variables, y 1 and y 2, is captured by the Gaussian copula with the copula parameter ρ. Thus the Gaussian copula representation of the joint CDF of ɛ 1 and ɛ 2 is

$$ \begin{aligned} F\left( {\varepsilon_{1} ,\varepsilon_{2} } \right) = & C\left( {F_{1} (\varepsilon_{1} ),F_{2} (\varepsilon_{2} );\rho } \right) \\ = & \Upphi_{2} (\zeta_{1} ,\zeta_{2} ;\rho ) \\ \end{aligned} $$

where ζ j  = Φ −1(F j (ɛ j )). As shown in (4), ζ 1 and ζ 2 follow a standard bivariate normal distribution with the correlation coefficient ρ. With the copula representation in place, the steps of generating the composite errors ɛ 1 and ɛ 2 for the simulation are stated as follows:

Step 1

Draw two independent random variables, z 1 and z 2, from N(0, 1).

Step 2

Compute ζ 1 = z 1 and \( \zeta_{2} = z_{1} \rho + z_{2} \sqrt {1 - \rho^{2}} \) such that ζ 1 and ζ 2 follow a standard bivariate normal distribution with correlation coefficient ρ.

Step 3

Compute γ 1 = Φ(ζ 1) and \( \gamma_{2} = \Upphi (\zeta_{2}) \) from normal distribution CDF. γ 1 and γ 2 are the probability integral transformations of ζ 1 and ζ 2.

Step 4

Finally, the composite errors, ɛ 1 and ɛ 2, are generated by the inverse CDF, \( \varepsilon_{1} = F_{1}^{ - 1} (\gamma_{1} ){\text{ and }}\varepsilon_{2} = F_{2}^{ - 1} (\gamma_{2} ) \).Footnote 8

Step 5

Given the parameters β 10, β 11, β 20, and β 21 and also the exogenous variables \( x_{1}\,{\text{and}}\,x_{2} \), the dependent variables y 1 and y 2 are generated from the SF regressions in (12).

It is worth emphasizing that the composite errors ɛ 1 and ɛ 2 generated in Step 4 are correlated, and the dependence may implicitly come from either the correlation between \( v_{1} {\text\,{and}}\,v_{2} \), or between \( u_{1}\, {\text{and}}\,u_{2} \), or both.

In our simulation, the number of replications is set to be 1,000 and the small sample performance is investigated with the sample sizes N = 200, 500. Various degrees of dependence structure are tried experimentally with the dependence parameter ρ = 0.1, 0.3, 0.5, 0.7, 0.9 considered. Both of the exogenous variables \( x_{1} {\text\,{and}}\,x_{2} \) are independently drawn from uniform distributions over the range [1, 2] and have been kept fixed over the replications, only ɛ 1 and ɛ 2 are redrawn in each replication.

The performance comparisons of the separate and joint estimators are based on the relative mean squared errors (RMSE). Two sets of estimators are evaluated: the coefficients of the frontier part, (β 10, β 11, β 20, β 21), and the variances of error components, (σ 2 v1 , σ 2 u1 , σ 2 v2 , σ 2 u2 ). The relative performance of the estimators are investigated by the RMSEs, which are defined as

$$ {\text{RMSE(}}k ) = \frac{{{\text{MSE}}_{S} (k )}}{{{\text{MSE}}_{J} (k)}}, $$

where k stands for the regression coefficients or the variances of error components. The subscript ‘S’ denotes separate estimation and ‘J’ denote joint estimation, respectively. Therefore, RMSE(k) > 1 is equivalent to that MSE S (k) > MSE J (k). Similarly, we also use the relative efficiency (RE), defined as the relative variance of the estimators,

$$ {\text{RE(}}k) = \frac{{{\text{Var}}_{S} (k )}}{{{\text{Var}}_{J} (k)}}, $$

to investigate the efficiency gain. The simulated results of RMSE and RE under Cases 1–3 are summarized in Tables 1 and 2, respectively.

Table 1 RMSEs of Cases 1–3 when the sample sizes are 200 and 500
Table 2 REs of Cases 1–3 when the sample sizes are 200 and 500

A few findings from Table 1 are worthy of mention. First, Table 1 shows that almost all RMSEs are greater than 1, except for few cases having RMSEs equal to 0.99 when ρ = 0.1 and N = 200. The joint estimation shows significantly better in the mean squared errors performance than the separate estimation when there is some dependence between the two regressions. The superiority of the joint estimation over the separate estimation shows even more clearly in the estimation of variances of the SF error components, σ 2 vj and σ 2 uj because of the relatively high values of RMSEs. This finding suggests that incorporating the correlation between the SF regressions may significantly increase the efficiency in estimating the variances, and thus the efficiency in predicting the technical efficiency.

Second, for a fixed n and λ, the RMSEs tend to increase as the dependence of the two regressions gets stronger, which suggests that the loss of estimation efficiency will worsen when the dependence between the two sectors increases in strength.

Third, for each fixed ρ and λ, the RMSEs for the β coefficients as well as σ 2 vj and σ 2 uj increase when the sample size increases from 200 to 500, which is consistent with the implications of the large sample theory. Therefore, we may conclude that the overall performance of the estimators under joint estimation is in general better than that under separate estimation. In other words, if the dependence structure is ignored in the estimation process, the loss of estimation efficiency will, as expected, worsen as the dependence between the two sectors becomes stronger or as the sample size increases. Cases 2 and 3 also suggest similar findings as λ, the ratio of \( \sigma_{u} \,{\text{and}}\,\sigma_{v} \), becomes larger in the two regressions.

Finally, by comparing Cases 1–3 we also find that the RMSEs of \( \sigma_{u}\,{\text{and}}\,\sigma_{v} \) for all ρ's significantly increase, which indicates that as λ increases the joint estimation also become significantly better in the mean squared errors performance than the separate estimation even for fixed N and ρ.

Similar findings are also found in the REs. Most REs are greater than 1, except for σ 2 u1 , σ 2 u2 and few cases when n = 200 and ρ = 0.1. The joint estimation indeed shows higher efficiency gain than the separate estimation, especially when the dependence of the two regressions gets stronger or the sample size increases.

In addition to RMSEs and REs, we also investigate the bias of the estimators obtained by separate and joint estimations. The estimated biases of the parameters, \( \beta_{10},\beta_{11},\sigma_{v1}^{2},{\text{and}}\,\sigma_{u1}^{2} \) in Regression 1 and \( \beta_{20},\beta_{21},\sigma_{v2}^{2},\,{\text{and}}\,\sigma_{u2}^{2} \) in Regression 2, under Cases 1–3 with sample size N = 200, 500 are summarized in Tables 3 and 4.

Table 3 Bias of Regression 1 of Cases 1–3 when the sample sizes are 200 and 500
Table 4 Bias of Regression 2 of Cases 1–3 when the sample sizes are 200 and 500

Tables 3 and 4 do not provide significant evidence showing that estimators of the frontier coefficients under joint estimation are definitely better or worse than those under separate estimation. However, the estimated biases of the variance estimators, σ 2 vj and σ 2 uj , under the joint estimation are much smaller than those under separate estimation for all ρ’s. It seems that estimation without considering the correlation between the SF regressions cause a lot of loss in efficiency in the variance estimators and thus in the prediction of technical efficiencies, but not necessarily for the prediction of the SF frontiers. Furthermore, by definition λ represents the relative contribution of u and v to ɛ. A larger λ implies that the statistical noise plays a relative small role in the composite error. By comparing Cases 1–3 in Tables 3 and 4, both tables suggest that the degree of estimation bias for σ 2 vj and σ 2 uj is alleviated as λ increases in both separate and joint estimation. In summary, it is of fundamental importance to take into account the interdependence of the SF regressions from the point of view in reducing estimation bias and improving the estimation efficiency.

6 An empirical example

An empirical application of a model with two related stochastic frontier regressions is applied to a study of the hotel industry in Taiwan, focusing on production efficiency in the accommodation and restaurant divisions. The data come from the annual report of the Taiwan Tourism Bureau at the Ministry of Transportations and Communications. Pooled observations of 50 international grand hotels from 2001 to 2005 has been obtained, providing a total of 250 sample points for the empirical study. The two SF regressions, with each representing the operation of the sub-DMU in the accommodation and in the restaurant divisions of an international hotel, are specified. Although the data under consideration is balanced panel, at the current stage we do not take into account the firm-specific effect, such as the fixed effect, in our following analysis due to the complexity of the panel likelihood function in this current SF model under consideration. We leave it to our future study

The output for the accommodation division is measured in total revenue (y 1), while the inputs include the total number of workers (x 11), the total number of rooms (x 12), and other expenses (x 13), which includes utilities, materials, maintenance fees, and so on. The output and inputs are allocable within the accommodation division. The output for the restaurant division is also measured by the total revenue (y 2) while the corresponding inputs are the total number of workers (x 21), the floor area of the restaurant (x 22), and other expenses (x 23), including utilities, materials, and so on. Again, these output and inputs are attributable and accountable within the restaurant division. All revenues and other expenses are measured in New Taiwan dollars (NT$). Since these two divisions of a hotel share certain common characteristics, such as the same DMU, brand, location… etc.; the outputs y 1 and y 2 should be correlated and so are the composite errors. The stochastic frontier regressions in accommodation and restaurant of a hotel operation are specified as:

$$ \begin{gathered} {\text{Accommodation: }}y_{1i} = \beta_{10} + \beta_{11} x_{11i} + \beta_{12} x_{12i} + \beta_{13} x_{13i} + \varepsilon_{1i} , \hfill \\ {\text{Restaurant: }}y_{2i} = \beta_{20} + \beta_{21} x_{21i} + \beta_{22} x_{22i} + \beta_{23} x_{23i} + \varepsilon_{2i} , \hfill \\ \end{gathered} $$

where \( \varepsilon_{ji} = v_{ji} - u_{ji} \) with the assumption of normal-half normal distribution specification, \( v_{ji} \sim N(0,\sigma_{vj}^{2} ) \) and \( u_{ji} \sim N^{ + } (0,\sigma_{uj}^{2} ) \).

The ML estimates of the two SF regression models, using both separate and joint estimations, are summarized in Table 5. The separate estimation results are obtained by estimating the frontier regression in isolation without consideration of the potential dependence between the two divisions. The joint ML estimation is conducted with the dependence of the two regressions specified as the Gaussian copula with the PDF \( f\left( {\varepsilon_{1i} ,\varepsilon_{2i} } \right) \) shown in (4).

Table 5 The separate and joint ML estimation results

The estimated model, either by the separate or the joint ML estimation shown in Table 5, gives no surprising results. All estimates have expected signs and, with the exception of the coefficient of the floor area of restaurant (x 22), are statistically significant at the 5 % level or better. It is observed in Table 5 that the point estimates yielded by the separate and joint methods differ. This outcome is to be expected since different likelihood functions in (8) are maximized in two methods, unless the dependence parameter ρ = 0. The estimated dependence in the Gaussian copula \( \hat{\rho} = 0.6318 \) is statistically significant at the 1 % level. The corresponding Spearman’s rank correlation coefficient is 0.7284, which provides evidence of a significant positive rank correlation between ε 1 and ε 2. The estimated Kendall’s τ is about 0.5345.

For the current case, the separate estimation is equivalent to joint estimation with the restriction ρ = 0 or using the product copula. Therefore, we adapt the likelihood ratio (LR) statistics for testing the hypothesis H 0: ρ = 0. The value of the LR statistics is 103.83 so the Gaussian copula is suggested. Moreover, both model selection criteria AIC and BIC consistently suggest the Gaussian copula.

It is also seen from the results in Table 5 that the application of the joint estimation has resulted in a significant reduction (about 12 %) in the standard errors of the estimated frontier coefficients \( \hat{\beta} \) as compared with those of a separate regression-by-regression estimation. Even if the separate ML estimation provides a consistent estimator of the parameter, it does not fully incorporate all the information relating to the correlation between the two divisions; and thus, the estimates are not as efficient as those obtained under the joint estimation method. More evidence is provided through a comparison of the estimated coefficient of x 22, which is insignificant in the separate estimation, and yet significant in the joint estimation at less than the 1 percent significance level.

For rigorousness, we further conduct the Hausman test to test for the specification of copula function. Unfortunately, the test statistics is 11.518787, the Gaussian copula is rejected. Together with the previous LR test result, it seems that the two divisions have significant dependence; and the Gaussian copula can give a better approximation of the true joint density than the product copula (or separate estimation) but may not represent the true one. Therefore, the QML standard errors should be considered for further statistical inferences.

With the ML estimates of the stochastic frontier regressions, we further compare the estimates of the individual hotel’s technical efficiency based on the separated and joint methods. Following Battese and Coelli (1988), we define the technical efficiency of the jth division of ith hotel as the conditional expectation, \( {\text{TE}}_{ji} = {\text{E}}\left( {e^{{ - u_{ji} }} |\varepsilon_{ji} } \right) \). Under the normal-half normal distribution assumption, the estimated technical efficiency is

$$ \widehat{\text{TE}}_{ji} = \frac{{1 - \Upphi \left( {\hat{\sigma }_{*j} - \hat{\mu }_{*ji} /\hat{\sigma }_{*j} } \right)}}{{1 - \Upphi \left( { - \hat{\mu }_{*ji} /\hat{\sigma }_{*j} } \right)}}e^{{ - \hat{\mu }_{*ji} + \frac{1}{2}\hat{\sigma }_{*j}^{2} }} , $$

where \( \hat{\mu }_{*ji} = - \hat{\varepsilon }_{ji} \hat{\sigma }_{uj}^{2} /\hat{\sigma }_{j}^{2} \) and \( \hat{\sigma }_{*j}^{2} = \hat{\sigma }_{uj}^{2} \hat{\sigma }_{vj}^{2} /\hat{\sigma }_{j}^{2} \). We emphasize that, in the joint estimation, the prediction \( \hat{\varepsilon }_{ji} = y_{ji} - x_{ji}^{\text{T}} \hat{\beta }_{j} \) should represent the predicted composite errors calculated from the joint estimates of β; and in the separated estimation, the predicted composite errors should be those calculated from the regression-by-regression estimates of β. Obviously, these two predicted composite errors calculated from the joint and separated estimations are not the same unless the estimated dependence \( \hat{\rho} \) is zero.

The summary statistics of the technical efficiency levels of the two divisions are given in Table 5, which shows that, under the separate estimation method, the sample mean predicted technical efficiencies in both divisions are consistently smaller than those obtained under the joint estimation method. This result seems to suggest that the efficiency levels are underestimated in the separate estimation method as a result of the failure to consider the potential dependency between the two divisions within the same hotel management.

Table 6 shows the sample correlation coefficients to technical efficiency between the two divisions from the joint and separated methods. The joint method shows a much higher correlation coefficient between the two divisions than the counterpart under the separate method, 0.6158 versus 0.5417. Although the magnitude of technical efficiency measures varies by method, the ordering of the estimated individual technical efficiency, as indicated by the correlation between two methods, is fairly high, 0.9751 in the accommodation division and 0.9810 in the restaurant division. However, it seems that the separate regression-by-regression method is likely to underestimate the joint-interrelated hotel operations. What it is important to realize is that it makes good sense to use the joint method in estimating the seemingly unrelated stochastic frontier regressions.

Table 6 The correlation coefficients of the predicted technical efficiencies under separate and joint estimations

7 Conclusions

In this paper, we have presented a copula-based maximum likelihood approach to estimate the multiple stochastic frontier regressions with correlated composite errors. The joint estimation of the multiple SF regressions is more efficient than the separate regression-by-regression estimation, as the joint approach takes into consideration the correlation among composite errors. The rationale behind the joint estimation of SF regressions is analogous to the classical generalization to system of seemingly unrelated regressions.

Our simulation of a model with two related stochastic frontier regressions demonstrates that joint estimation provides relatively more efficient estimators for parameters than those in the separate estimation. Moreover, we also found that ignoring the dependence between regressions will also cause severe bias in estimating the technical efficiencies; however, it may not have similar effects on the frontier prediction or parameters in the frontier part. In those cases where there are correlations between the different stochastic frontier regressions, it is of fundamental importance for the dependence structure to be taken into consideration in the model estimation. Finally, we conduct an empirical study of the hotel management in accommodation and in restaurant division in Taiwan. Our results show that both the frontier estimates and predicted technical efficiencies differ significantly if the dependence structure is ignored. In particular, the predicted individual technical efficiencies may be underestimated for both regressions in separate estimation.

Following a similar procedure, the multiple stochastic frontier regressions of microeconomic temporal cross-section data can be extended to macroeconomic settings where the individual regressions are countries within the same economic region or block. The Gaussian copula-based approach of estimating a system of stochastic frontier regressions proposed in this paper can also be further modified to various measures of dependence structures among the regressions by incorporating different copula functions. We leave such analysis to future study.