1 Introduction

Stochastic frontier analysis originated with two seminal papers, Meeusen and van den Broeck (1977) and Aigner et al. (1977). Jondrow et al. (1982) provided a way to estimate firm specific technical efficiency. These contributions were framed in a cross sectional data framework. Panel data potentially can give more reliable information about the efficiencies of the firm. Pitt and Lee (1981) and Schmidt and Sickles (1984) applied random effects and fixed effects models to estimate firm specific efficiencies. In these models the efficiencies are assumed to be time-invariant. For long panel data this assumption might be questionable. The time-invariance assumption was relaxed by Cornwell et al. (1990) (CSS), Kumbhakar (1990), Battese and Coelli (1992) (BC), and Lee and Schmidt (1993). The time-varying inefficiency models were followed by dynamic efficiency models such as Ahn et al. (2000), Desli et al. (2003), Tsionas (2006), Huang and Chen (2009), and Assaf et al. (2014).Footnote 1 Work on time varying effects models and their use in productivity and efficiency studies have accelerated in the last decade and we view our current contribution as following in this tradition. Many of these advances are summarized in the recent chapter by Sickles et al. (2015).Footnote 2

In this paper we consider the use of the Kalman (1960) filter by treating the inefficiency term as an unobserved state. In contrast to the classical Box–Jenkins approach, one also can explicitly model non-stationary stochastic processes in the Kalman filter setting. This gives significant flexibility to the econometrician when specifying the inefficiency portion of the model. We use the Kalman filter estimator (KFE) to model the efficiency component of the stochastic frontier composed error. For this purpose we use a panel data generalization of the local level model. For long panel data, relatively inflexible stochastic frontier models [e.g., BC, CSS, and Kumbhakar (1990)] are more likely to fail to capture potentially complex time-varying patterns of the effects terms. We examine this claim by conducting a series of Monte Carlo simulations. Results of these simulations indicate that some of the widely used estimators can perform poorly in terms of capturing the efficiencies of firms when we have long panel data with fluctuating efficiencies. For example, if the efficiencies of firms are affected by macro factors that tend to have cycles, then it is likely that these relatively inflexible approximations will fail to capture the efficiency patterns. While some of the factors that lead to variation in efficiency can be controlled for by including exogenous variables in the modeling of the inefficiency term, the unobserved factors leading to such variations are generally left out in the conventional stochastic frontier methods. That is, the pattern of time-variation in efficiency is restricted to follow a known function of exogenous parameters. Hence, one of the main goals of this study is to point out the importance of capturing these time-varying unobserved factors in the efficiency analysis, especially for longer panel data, and the relative ease with which such time-varying unobserved factors can be addressed using the Kalman filter. The results of our Monte Carlo simulations serve well for this purpose. Our model is not unduly complicated and can be applied relatively easily in many applications. Thus the KFE is proposed as a simple and effective (as shown in the simulations) solution to the problem at hand. The KFE can be viewed as an alternative to the factor model approach addressed in Kneip et al. (2012) and Ahn et al. (2013) and recent generalizations utilizing Bayesian alternatives.

An early application of the Kalman filter in the productivity setting is Slade (1989) where she uses the local level model with trend to model total factor productivity. However, Ueda and Hoshino (2005) appear to have been the first to apply the Kalman filter to the estimation of efficiency in a data envelopment analysis (DEA) framework. Ueda and Hoshino (2005) examine the case where the inputs and outputs are not deterministic. Kutlu (2010a), Emvalomatis et al. (2011) and our study appear to be the first to use the Kalman filter to estimate efficiency in the framework of stochastic frontier analysis (SFA).Footnote 3 Emvalomatis et al. (2011) modeled the logarithm of ratio of inefficiency and efficiency by a generalized version of an AR(1) process. Their method, however, does not use the traditional Kalman filter since the state variable is not linearly incorporated in their model, which is a necessary assumption for the traditional version of the Kalman filter. Hence, they use a non-linear version of the Kalman filter. In contrast, we model the effects term as in the local level model and calculate the efficiency scores utilizing the approach adopted by Schmidt and Sickles (1984). Moreover, for our model the traditional Kalman filter method is sufficient for our estimation purposes, although extensions of the Kalman filter, for example, to handle endogenous regressors, recently have been developed and used in a production setting.Footnote 4 We apply the KFE to estimate the average (and individual) efficiencies of the U.S. airlines during the period 1999–2009. Over our 11 years of study period, the average efficiency of the airlines do not show a tendency to increase. Indeed, for the first few years of the study it seems that the efficiencies of the airlines decreased. As efficiency change and technical (innovation) change are the two main components of productivity growth our empirical findings are broadly consistent with the findings of others (see, for example, Färe et al. 2007) who report declining service quality as problems with delays and congestion at US major airports accelerated during our sample period.

In the next section we describe the KFE and propose several ways in which it can be implemented to model productive efficiency. In Section 3 we discuss our Monte Carlo simulation results. Section 4 provides the data description and results of an analysis of productivity trends in the US commercial airline industry during the period 1999–2009. Section 5 concludes. Additional estimation results for other functional specifications as a check of the robustness of our overall findings are provided in the Appendix.

2 Description of the Kalman filter estimator

Consider a panel of n i firms observed over n t periods. A general stochastic frontier model is given as follows:

$$\begin{array}{ccccc} & \kern0.7pc{y_{it}} = {X_{it}}\beta + {\mu _{it}} + {\varepsilon _{it}}\\ &{\mu _{i,t + 1}} = \rho {\mu _{i,t}} + {\tau _{it}} + {e_{1it}}\\ &\kern-2.2pc {\tau _{i,t + 1}} = {\tau _{i,t}} + {e_{2it}}\\ \end{array}$$
(1)

where y it is the logarithm of output, \({\varepsilon _{it}} \sim {\bf NID}\left( 0,{\sigma _\varepsilon ^2} \right)\) and \(e_{it} = \left[ e_{1it} \ e_{2it} \right]\prime \sim {\bf NID}\,\left( 0,Q \right)\) are independently distributed error terms. The initial values of the state variables μ it and τ it are assumed to be jointly normally distributed with zero mean and they are independent from ε it and e it . Estimation details are provided in Appendix 1. The component μ it is the random heterogeneity specific to ith individual which is interpreted as efficiency. In the spirit of Ahn et al. (2000) we allow the firm to sluggishly change its inefficiency by modeling efficiency as an AR(1) process with trend τ it . We also allow the firm to adjust quickly. Efficiency may be a random walk (or a random walk with trend), for example (cf, Kneip et al. 2012) and thus the model allows for non-stationarity. In our empirical illustration of the KFE that we explore in Section 5, we estimate production efficiency using a restricted version of the translog (RTRANS) production function. The restricted version of the translog that we use provides us with an empirical vehicle that suits our purpose in this introduction of a new estimator and is statistically supported over the full translog model.Footnote 5 As a check of the robustness of results based on the restricted translog model we also present estimation results from the full translog model in the Appendix 2.

We calculate the time-varying production frontier intercept common to all producers in period t as \({\hat \mu _t} = \mathop {{\max}}_i {\hat \mu _{it}}\) (Cornwell et al. 1990). Relative technical efficiency is estimated as \(T{E_{it}} = \exp( - {\hat u_{it}})\), where \({\hat u_{it}} = {\hat \mu _t} - {\hat \mu _{it}}\). Equation system 1 can be rewritten as:

$$\begin{array}{ccccc}& \kern1.4pc {y_{it}} = {X_{it}}\beta + Z{B_{it}} + {\varepsilon _{it}},\quad {\varepsilon _{it}} \sim {\bf{NID}}\left( {0,\sigma _\varepsilon ^2} \right)\\ & \kern-2pc {B_{i,t + 1}} = T{B_{i,t}} + {e_{it}},\quad {e_{it}} \sim {\bf{NID}}\left( {0,Q} \right)\\ \end{array}$$
(2)

where

$${B_{it}} = {\displaystyle{\left[\begin{array}{c}{\mu _{it}}\\ {\tau _{it}}\end{array}\right]}},\,{e_{it}} = { \displaystyle\left[\begin{array}{c}{e_{1it}} \\ {e_{2it}} \end{array}\right]},\,T = {\displaystyle\left[\begin{array}{cc}\rho & 1 \\ 0 & 1\end{array} \right]},\,{\rm and}\ Z = \Big[ 1 \quad 0 \Big] $$

For the initialization of the Kalman filter, one can use the initial values that are implied by stationarity. In the case of non-stationary states, diffuse priors can be used. One practical choice is setting the mean squared error matrix of the initial states to be a constant multiple of the identity matrix. The constant is chosen by the econometrician and should be a large number. Alternatively, one can utilize an exact diffuse initialization.Footnote 6 For the sake of simplicity we prefer using the former diffuse initialization method. The traditional Kalman filter estimation may be numerically unstable due to rounding errors which might cause variances to be non-positive definite during the update process. One solution to this issue is using the square-root Kalman filter. Hence, we further implement the square-root Kalman filter.Footnote 7

A simpler and yet flexible model we will use is:

$$\begin{array}{ccccc}&\kern1pc {y_{it}} = {X_{it}}\beta + {\mu _{it}} + {\varepsilon _{it}},\quad {\varepsilon _{it}} \sim {\bf{NID}}\left( {0,\sigma _\varepsilon ^2} \right)\\ & \kern-2pc {\mu _{i,t + 1}} = {\mu _{i,t}} + {e_{it}},\quad {e_{it}} \sim {\bf{NID}}\left( {0,Q} \right)\end{array}$$
(3)

This model generalizes the panel data models where the effects term is time-invariant by using time-specific local approximations, i.e., Schmidt and Sickles (1984). When Q = 0, μ it is a deterministic function of initial values, i.e., μ it  = μ i0. When choosing this model we follow a commonly used modeling of time-varying parameter models (i.e., random walk parameters); and we do not claim that this model is preferred over the more general model presented above. However, the simple model may perform better for relatively shorter panel data applications as diffuse initialization of state variables eats up smaller number of observations.Footnote 8 For example, in our empirical model, which uses an unbalanced panel data set with 11 time periods, the full model was not suitable for estimation.Footnote 9

KFE is a random effects-type estimator, in the sense that E[X it μ it ] = 0 is needed for consistency, and is considerably flexible in terms of capturing latent cross-sectional variations that can change over time and which we consider herein unobservable productivity effects. If the ε it or μ it (effects) terms are correlated with the regressors, then the parameter estimates are inconsistent. The KFE can be modified in line with the control function approach used by Kim and Kim (2011) in order to allow for endogenous regressors that are correlated with the ε it term.Footnote 10 Kim (2008) provides a solution to a similar endogeneity problem in the context of Markov-switching models when the state variable and regression disturbance are correlated. If the regressors are correlated with the effects term, then we can estimate the first differenced model:

$$\begin{array}{ccccc}& \kern1pc\Delta {y_{it}} = \Delta {X_{it}}\beta + {e_{it}} + \Delta {\varepsilon _{it}}\\ &\kern0.5pc = \Delta {X_{it}}\beta + {w_{it}}\end{array}$$
(4)

by instrumental variables and standard Kalman filter estimation methods can be applied to the consistent residuals, \({y_{it}} - {X_{it}}\hat \beta \), in order to obtain the consistent hyperparameter estimates.Footnote 11

3 Monte Carlo experiments

In this section we implement a set of Monte Carlo simulations to examine the finite sample performance of the KFE. For expositional simplicity we consider a production model. The data generating process is given by:Footnote 12 , Footnote 13

$$\begin{array}{ccccc}& \kern1pc {y_{it}} = {x_{it}}\beta + {\varepsilon _{it}} - {\mu _{it}},\quad {\varepsilon _{it}} \sim {\bf{NID}}\left( {0,\sigma _\varepsilon ^2} \right)\\ & \kern-1.7pc {x_{i,t + 1}} = R{x_{i,t}} + {\xi _{it}},\quad {\xi _{it}} \sim {\bf{NID}}\left( {0,{I_2}} \right)\end{array}$$
(5)

where \(x_{it} = [x_{1it} \quad {x_{2it}} ] \sim {\bf NID}( 0,( {I_2} - {R^2} )^{ - 1} ), \) \(\beta \, = \,[ {\beta _1}\ \ {\beta _2}]\prime \,= \,\left[ {0.5}\ \ {0.5} \right]\prime \), \(\sigma _\varepsilon ^2 = 1\), and

$$R = \displaystyle\left[ {0.4 \quad 0.05 }\atop {0.05 \quad 0.4}\right].$$

The generated values for x are shifted around three different means to obtain three balanced groups of firms. We chose m 1 = (5, 5)′, m 2 = (7.5, 7.5)′, and m 3 = (10, 10)′ as the group means. We simulate a sample of size (n i , n t ) = (50, 60). Each simulation is carried out 1000 times. We consider five different data generating processes for the μ it term:

$$\begin{array}{ll}DGP\,1\,:& {\mu _{it}} = {\mu _i}\kern14pc\\ DGP\,2\,:&\,{\mu _{it}} = {a_{0i}} + {a_{1i}}\left( {\frac{t}{{{n_t}}}} \right) + {a_{2i}}{\left( {\frac{t}{{{n_t}}}} \right)^2}\kern7pc\\ DGP\,3\,:&\,{\mu _{it}} = {b_{0i}} + \mathop {\sum}\limits_{r = 1}^2 {\left\{ {{b_{1ri}}\sin\left( {\frac{{2rt\pi }}{{{n_t}}}} \right) + {b_{2ri}}\cos\left( {\frac{{2rt\pi }}{{{n_t}}}} \right)} \right\}}\kern1.5pc \\ DGP\,4\,:&\,{\mu _{it}} = {\eta _t}{u_i}\kern14pc\\ DGP\,5\,:&\,{\mu _{it}} = {r_{it}}\kern14.5pc\end{array}$$
(6)

where \({\mu _i} \sim {\bf NID}\left( {0,\,\,1} \right)\); η t  = exp(−h(tn t )), h = (0.5)/(n t ), and \({u_i} \sim {\bf{NI}}{{\bf{D}}^ + }\left( {0,\,1} \right)\); \({a_{li}} \sim {\bf{N}}\left( {0,1} \right)\); \({b_{lri}} \sim {\bf{NID}}\left( {0,1} \right)\); r i,t+1 = r it  + v it and \({r_{i1}} \sim {\bf{NID}}\left( {0,1} \right)\); and \({v_{it}} \sim {\bf{NID}}\left( {0,1} \right)\).

We consider five estimators in our simulations. Each of these estimators correspond to one of the DGPs. The estimators are: Fixed effects (FE) estimator, CSS within estimator (CSSW), Fourier estimator (FOE), Battese–Coelli estimator (BC), and KFE. FE, CSSW, and FOE are described as follows:

$$\hat {\beta} = \left( X'{M_Q}X \right)^{ - 1}X'{M_Q}y$$
(7)

where M Q  = IQ(QQ)−1 Q′, Q = diag(W i ) is a block diagonal matrix with W i matrices on the diagonal, and W i is a matrix with rows W it . For example, we have W it  = 1 for the FE estimator W it  = [1, (t/n t ), (t/n t )2] for the CSSW estimatorFootnote 14, and W it= [1, sin(2tπ/n t ), sin(4/n t ), cos(2/n t ), cos(4/n t )] for the FOE.

Excepting the BC estimator, technical efficiencies are estimated as \(T{E_{it}} = \exp( - {\hat u_{it}})\), where \({\hat u_{it}} = \mathop {\max}\nolimits_i {\hat \mu _{it}} - {\hat \mu _{it}}\). The BC estimator assumes that u it  = η t u i where \({u_i} \sim {\bf NI}{\bf D}^ {+} \left( {m,\sigma _u^2} \right)\) and η t  = exp(−h(tn t )). Let e it  = ε it μ it . For the BC estimator the efficiency is estimated by:

$$\begin{array}{lllll}T{E_{it}} &= E\left[ {\exp\left( { - {u_{it}}} \right)|{e_{it}}} \right]\kern6.7pc \\ &= \frac{{1 - \Phi \left( {{\eta _t}{\sigma ^ * } - \frac{{m_i^ * }}{{{\sigma ^ * }}}} \right)}}{{1 - \Phi \left( { - \frac{{m_i^ * }}{{{\sigma ^ * }}}} \right)}}\exp\left( { - {\eta _t}m_i^ * + \frac{1}{2}\eta _t^2{\sigma ^{ * 2}}} \right)\end{array}$$
(8)

where η = (η 1, η 2,…,η n )′, Φ represents the distribution function for the normal random variable and

$$\begin{array}{rcl}m_i^* &=& \displaystyle\frac{{m\sigma _\varepsilon ^2 - \eta \prime {e_i}\sigma _u^2}}{{\sigma _\varepsilon ^2 + \eta \prime \eta \sigma _u^2}}\\ \sigma^{*2} &=& \displaystyle\frac{{\sigma _u^2\sigma _\varepsilon ^2}}{{\sigma _\varepsilon ^2 + \eta \prime \eta \sigma _u^2}}\end{array}$$

For the KFE we assume the following model:

$$\begin{array}{l} \kern2pc {y_{it}} = {X_{it}}\beta + {\mu _{it}} + {\varepsilon _{it}},\quad {\varepsilon _{it}} \sim {\bf{NID}}\left( {0,\,\sigma _\varepsilon ^2} \right)\\ {\mu _{i,t + 1}} = {\mu _{i,t}} + {e_{it}},\quad {e_{it}} \sim {\bf{NID}}\left( {0,\,Q} \right)\kern2.45pc\end{array}$$
(9)

Hence, for the KFE the effects term is modeled as a random walk, which is consistent with the local level model of univariate time series. We provide the bias, the variance, the mean squared error (MSE) of the coefficients, the (normalized) MSE of the efficiency estimates as well as the Pearson and Spearman correlations of efficiency estimates with the true efficiency levels. The MSE of the efficiencies are calculated as follows:

$${{MSEeff}}\left( {{{T}}{{{E}}_{0it}},{{\widehat {{{TE}}}}_{it}}} \right) = \frac{{\mathop {\sum}_{i,t} {{{\left( {{{T}}{{{E}}_{0it}} - {{\widehat {{{TE}}}}_{it}}} \right)}^2}} }}{{ {\sum}_{i,t} {{{TE}}_{0it}^2} }}$$
(10)

where TE0it is the true technical efficiency level and \(\widehat{TE}_{it}\) is the estimated efficiency level. The results for the Monte Carlo experiments are given in Tables 1, 2, 3, 4 and 5.

Table 1 Monte Carlo results for DGP1 (FE)
Table 2 Monte Carlo results for DGP2 (CSSW)
Table 3 Monte Carlo results for DGP3 (FOE)
Table 4 Monte Carlo results for DGP4 (BC)
Table 5 Monte Carlo results for DGP5 (KFE)

For the β estimates, the estimators generally show similar performances. For both the β estimates and the efficiency estimates we find that whenever there is a high variation in the efficiency term the less flexible estimators, such as FE and BC, perform worse than the others. KFE performs particularly well in terms of correlations between the true efficiency and the estimated efficiency. It is worth noting that all estimators other than the FOE and the KFE performed very poorly for DGP3. Indeed, the FE and BC estimators show almost no correlation between the true efficiency and the estimated efficiency.Footnote 15 This is because these estimators are not flexible enough to capture the time-varying pattern of the efficiency. Hence, this simulation study shows that when the efficiencies of the firms fluctuate the performance of non-flexible efficiency estimators can be arbitrarily misleading in capturing the performances of firms.

Finally, we present simulation results for smaller sample sizes in Tables 6 and 7: (n i , n t ) = (50, 10) and (n i , n t )= (10, 60). As in our earlier simulations the estimators performed more or less the same in terms of β estimates. Hence, we only summarize their performance for efficiency estimation. The last two rows are the averages of MSE values and Pearson correlations which may serve as an aggregate measure of performance. These tables also confirm that the KFE estimator performs quite well in terms of capturing the unobserved efficiency. A striking observation is that KFE performs well even for relatively shorter panels.

Table 6 Monte Carlo results for n i  = 50, n t  = 10
Table 7 Monte Carlo results for n i  = 10, n t  = 60

4 The U.S. Airline Industry 1999–2009

4.1 The data

In order to illustrate our estimator and its usefulness in applied settings, we utilize annual data from the U.S. airline industry during the period 1999–2009. The third author has written extensively on commercial airline efficiency issues in the U.S., Europe, and in Asia. We view the example below as informative in regard to the usefulness of our estimator in modeling efficiencies in the airline industry and how it may inform researchers in more extensive industry studies as to potential limitations in their modeling approaches and alternative approaches they may wish to consider, such as ours. The time period we choose is one during which the U.S. airlines faced serious financial troubles. The financial losses for domestic passenger airline operations were more than three times the losses between 1979–1999. Some of the exogenous cost shocks during the sample period were due to increased taxes and jet fuel prices. At the same time fares fell and remained relatively low. Real jet fuel prices were about 20 % lower in 2009 than in 2000. Since 1979 demand grew steadily. However, we observe sharp demand drops during the recession of 2001–2002 and 2008–2009. Due to capital costs and sticky labor prices such unanticipated decreases in demand brought additional complications to an industry which had been experiencing relatively stable and steady demand growth. Another feature of the sample time period is the increase in load factors. Average load factors increased from 71 to 81 % between 2000 and 2009 due in part to improved yield management techniques and reduced flight frequency but which also lead to reduced levels of service quality.Footnote 16

The unbalanced data is mainly obtained from the International Civil Aviation Organization (ICAO). The data set that we use has 35 airlines and 298 observations.Footnote 17 Input and output variables are constructed following the approaches of Sickles (1985) and Sickles et al. (1986). Inputs are flight capital (K, quantity of planes), labor (L, quantity of pilots, cabin crew, mechanics, passenger and aircraft handlers, and other labor), fuel (F, quantity of barrels of fuel), and materials (M, quantity index of supplies, outside services, and non-flight equipment’s). We focus on value added from capital and labor in our empirical illustration of the KFE by netting out from revenue output (RTK, revenue ton kilometers) the value of the intermediate energy and materials. Thus our technology is rather simple and uses capital and labor to produce value added revenue ton kilometers.

In addition to the above, we include two sets of control variables into our model to account for the heterogeneity of output and the capital input. The first set of control variables is concerned with service characteristics: (i) aircraft stage length (SL) and (ii) load factor (LF). SL is the average length of a route segment, obtained by dividing the miles flown by the number of departures. The shorter (low value) the stage length the shorter the period an airlines’ aircraft spends in each flight segment. LF reflects the average occupancy of an airline’s aircraft seats, is considered a measure of service quality, and is often used as a proxy for service competition. A lower load factor often implies that the airline assigns a relatively larger number of planes to a particular route and reflects higher service quality by the airline. The second set of control variables is concerned with capital stock characteristics. The first is the average size of the airline’s aircraft (SIZE). The larger the size of the aircraft the more services can be provided without a proportionate increase in factors such as flight crew, passenger and aircraft handlers, and landing slots. The second is the percentage of each airline fleet that is a (JET) aircraft to total number of aircraft. JET is considered as a proxy for the aircraft speed. The jet aircraft tends to fly around three times as fast as turboprops aircraft and in addition the jet aircraft requires a relatively lower number of flight crew resources. A brief description of the variables is given in Table 8.

Table 8 Description of variables

4.2 Analysis using the KFE

In this section we examine the technical efficiency trends in the U.S. airline industry during the period 1999–2009 using our new KFE and compare our findings to those from the Battese–Coelli (BC) and the Cornwell, Schmidt, and Sickles within (CSSW) estimator with efficiency modeled as depending only on deterministic time period proxies that vary over time. We utilize the quadratic specification used in the U.S. airline empirical illustration of Cornwell et al. (1990). The BC estimator is probably the most widely used of the panel estimators and is a random effects type estimator of efficiency change that also utilizes a deterministic time trend. The CSSW has somewhat more flexibility and provides a fixed effects treatment. We estimate the value-added production function of the U.S. airlines (revenue ton kilometers less a value weighted average of materials and energy). The production function is specified as linear in logs as:

$$\begin{array}{rcl} {y_{it}} & = & {\sum}_j {\beta _j}{x_{jit}} + {\mu _{it}} + {\varepsilon _{it}}\\ {\mu _{i,t + 1}} & = & {\mu _{i,t}} + {e_{it}}\end{array}$$
(11)

where \({\varepsilon _{it}} \sim {\bf{NID}}(0,\sigma _\varepsilon ^2)\) and \({e_{it}} \sim {\bf{NID}}(0,\sigma _e^2)\) are independently distributed error terms.Footnote 18 The estimates for the production function parameters and average efficiencies for the KFE and the BC and CSSW are given in Table 9 and Fig. 1, respectively.Footnote 19 The overall average efficiencies for the KFE and the CSSW and BC estimators are 0.577, 0.438, and 0.632, respectively.

Table 9 Restricted translog production model estimates
Fig. 1
figure 1

Efficiency estimates for restricted translog model

The median of the returns to scale values for the KFE and the BC and CSSW estimators are 0.883, 0.94, and 1.034, respectively. A common finding for the airline industry is that the airlines operate in a constant returns to scale environment. In a single-output production setting, Basu and Fernald (1997) provide a theoretical proof that the value added estimate of returns to scale is smaller (greater) than the corresponding gross output model when there is decreasing (increasing) returns to scale. Hence, there is a magnification effect for returns to scale estimates when a value added production function is used.Footnote 20 Therefore, the returns to scale estimate for the KFE might have been driven by this fact. Nevertheless, the constant returns to scale value of one lies within one sample standard deviation away from the mean value of returns to scale estimates from the KFE. In terms of regularity conditions, KFE outperforms other two estimators. More precisely, while the KFE satisfies curvature regularity condition at each time period, the BC and CSSW violate curvature regularity condition at each time period. At the median values of the regressors, all three estimators satisfy monotonicity conditions at each time period. According to KFE estimates, the average efficiency of the U.S. airlines is relatively stable for the second half of the study period. However, there is some evidence of a decrease in efficiency for the first half of the study period.

One potential empirical concern would be whether the effects term is correlated with the regressors or not. If the effects term is correlated with the regressors, then the coefficient estimates would be inconsistent. One advantage of the CSSW estimator over the random effects-type estimator is that even when the regressors are correlated with the effects term, the parameter estimates are consistent. Hence, the parameter estimates from the CSSW model can be used to test the consistency of parameter estimates from the KFE. We test the consistency of parameter estimates from KFE using a Wu-Hausman test and cannot reject the KFE estimates at the 5 % significance level.

We also check the robustness of our results by estimating a full version of the translog model. A common problem with the translog production function is that by increasing the number of variables by adding second-order ln terms to the Cob–Douglas functional form the second order terms tend to exhibit considerable multicollinearity. The full translog model estimates are given in the Appendix. For the full translog model, few of the parameters were significant at the 5 % level. We choose our final model specification based on the BIC for the Kalman filter. This criterion is:

$$BIC = \frac{-2\,\ln\,L + \ln\,(s)\,(p + d)}{s}$$

where L is the likelihood value, s is the sample size, p is the number of hyperparameters, and d is the number of diffuse priors (Durbin and Koopman 2001). The BIC values for the full translog and restricted translog forms are 1.805 and 1.714, respectively. Based on the BIC and the fact that almost all the parameters of the full translog model are insignificant, the restricted translog functional form is preferred on statistical grounds.

5 Conclusions

In this study we have proposed a way to measure technical efficiency via the Kalman filter. Our new Kalman Filter estimator (KFE) provides a local approximation to general time and cross sectionally varying effects terms in a standard panel model. We examine the new estimator in a series of limited Monte Carlo experiments. Our simulation results indicate that while the performance of the KFE is similar to the performances of the other estimators for the coefficient estimates, the KFE outperforms the less flexible estimators in terms of the correlation of the effects with true effects. A result of our simulations is that the widely used BC estimator performed very poorly whenever there is substantial variation in the effects, or for our canonical stochastic frontier efficiency model, the efficiency term. If the sample data contains events that can cause jumps in the productivity of firms, then the KFE estimator appears able to improve on other standard panel treatments that are less flexible in specifying the temporal variation in the effects. We then used the KFE in order to estimate the average efficiency of the U.S. airlines. Point estimates for the KFE indicate that average efficiency of the U.S. airlines fell by more than 10 % during earlier years of time period, but these trends are not stable. What does appear to be the case is that there is no strong or even weak evidence that airlines experienced improved efficiencies over the sample period. Given that there were no particularly important new technical innovations during the sample period, the sizeable losses incurred by the industry as fares continued to be held down by competitive pressures were not surprising. Moreover, many of the efficiency gains from reorganized networks, code-sharing arrangements, and other best business practices apparently had already been realized by the beginning of the sample period.