Abstract
In the Kalman filter setting, one can model the inefficiency term of the standard stochastic frontier composed error as an unobserved state. In this study a panel data version of the local level model is used for estimating time-varying efficiencies of firms. We apply the Kalman filter to estimate average efficiencies of U.S. airlines and find that the technical efficiency of these carriers did not improve during the period 1999–2009. During this period the industry incurred substantial losses, and the efficiency gains from reorganized networks, code-sharing arrangements, and other best business practices apparently had already been realized.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Stochastic frontier analysis originated with two seminal papers, Meeusen and van den Broeck (1977) and Aigner et al. (1977). Jondrow et al. (1982) provided a way to estimate firm specific technical efficiency. These contributions were framed in a cross sectional data framework. Panel data potentially can give more reliable information about the efficiencies of the firm. Pitt and Lee (1981) and Schmidt and Sickles (1984) applied random effects and fixed effects models to estimate firm specific efficiencies. In these models the efficiencies are assumed to be time-invariant. For long panel data this assumption might be questionable. The time-invariance assumption was relaxed by Cornwell et al. (1990) (CSS), Kumbhakar (1990), Battese and Coelli (1992) (BC), and Lee and Schmidt (1993). The time-varying inefficiency models were followed by dynamic efficiency models such as Ahn et al. (2000), Desli et al. (2003), Tsionas (2006), Huang and Chen (2009), and Assaf et al. (2014).Footnote 1 Work on time varying effects models and their use in productivity and efficiency studies have accelerated in the last decade and we view our current contribution as following in this tradition. Many of these advances are summarized in the recent chapter by Sickles et al. (2015).Footnote 2
In this paper we consider the use of the Kalman (1960) filter by treating the inefficiency term as an unobserved state. In contrast to the classical Box–Jenkins approach, one also can explicitly model non-stationary stochastic processes in the Kalman filter setting. This gives significant flexibility to the econometrician when specifying the inefficiency portion of the model. We use the Kalman filter estimator (KFE) to model the efficiency component of the stochastic frontier composed error. For this purpose we use a panel data generalization of the local level model. For long panel data, relatively inflexible stochastic frontier models [e.g., BC, CSS, and Kumbhakar (1990)] are more likely to fail to capture potentially complex time-varying patterns of the effects terms. We examine this claim by conducting a series of Monte Carlo simulations. Results of these simulations indicate that some of the widely used estimators can perform poorly in terms of capturing the efficiencies of firms when we have long panel data with fluctuating efficiencies. For example, if the efficiencies of firms are affected by macro factors that tend to have cycles, then it is likely that these relatively inflexible approximations will fail to capture the efficiency patterns. While some of the factors that lead to variation in efficiency can be controlled for by including exogenous variables in the modeling of the inefficiency term, the unobserved factors leading to such variations are generally left out in the conventional stochastic frontier methods. That is, the pattern of time-variation in efficiency is restricted to follow a known function of exogenous parameters. Hence, one of the main goals of this study is to point out the importance of capturing these time-varying unobserved factors in the efficiency analysis, especially for longer panel data, and the relative ease with which such time-varying unobserved factors can be addressed using the Kalman filter. The results of our Monte Carlo simulations serve well for this purpose. Our model is not unduly complicated and can be applied relatively easily in many applications. Thus the KFE is proposed as a simple and effective (as shown in the simulations) solution to the problem at hand. The KFE can be viewed as an alternative to the factor model approach addressed in Kneip et al. (2012) and Ahn et al. (2013) and recent generalizations utilizing Bayesian alternatives.
An early application of the Kalman filter in the productivity setting is Slade (1989) where she uses the local level model with trend to model total factor productivity. However, Ueda and Hoshino (2005) appear to have been the first to apply the Kalman filter to the estimation of efficiency in a data envelopment analysis (DEA) framework. Ueda and Hoshino (2005) examine the case where the inputs and outputs are not deterministic. Kutlu (2010a), Emvalomatis et al. (2011) and our study appear to be the first to use the Kalman filter to estimate efficiency in the framework of stochastic frontier analysis (SFA).Footnote 3 Emvalomatis et al. (2011) modeled the logarithm of ratio of inefficiency and efficiency by a generalized version of an AR(1) process. Their method, however, does not use the traditional Kalman filter since the state variable is not linearly incorporated in their model, which is a necessary assumption for the traditional version of the Kalman filter. Hence, they use a non-linear version of the Kalman filter. In contrast, we model the effects term as in the local level model and calculate the efficiency scores utilizing the approach adopted by Schmidt and Sickles (1984). Moreover, for our model the traditional Kalman filter method is sufficient for our estimation purposes, although extensions of the Kalman filter, for example, to handle endogenous regressors, recently have been developed and used in a production setting.Footnote 4 We apply the KFE to estimate the average (and individual) efficiencies of the U.S. airlines during the period 1999–2009. Over our 11 years of study period, the average efficiency of the airlines do not show a tendency to increase. Indeed, for the first few years of the study it seems that the efficiencies of the airlines decreased. As efficiency change and technical (innovation) change are the two main components of productivity growth our empirical findings are broadly consistent with the findings of others (see, for example, Färe et al. 2007) who report declining service quality as problems with delays and congestion at US major airports accelerated during our sample period.
In the next section we describe the KFE and propose several ways in which it can be implemented to model productive efficiency. In Section 3 we discuss our Monte Carlo simulation results. Section 4 provides the data description and results of an analysis of productivity trends in the US commercial airline industry during the period 1999–2009. Section 5 concludes. Additional estimation results for other functional specifications as a check of the robustness of our overall findings are provided in the Appendix.
2 Description of the Kalman filter estimator
Consider a panel of n i firms observed over n t periods. A general stochastic frontier model is given as follows:
where y it is the logarithm of output, \({\varepsilon _{it}} \sim {\bf NID}\left( 0,{\sigma _\varepsilon ^2} \right)\) and \(e_{it} = \left[ e_{1it} \ e_{2it} \right]\prime \sim {\bf NID}\,\left( 0,Q \right)\) are independently distributed error terms. The initial values of the state variables μ it and τ it are assumed to be jointly normally distributed with zero mean and they are independent from ε it and e it . Estimation details are provided in Appendix 1. The component μ it is the random heterogeneity specific to ith individual which is interpreted as efficiency. In the spirit of Ahn et al. (2000) we allow the firm to sluggishly change its inefficiency by modeling efficiency as an AR(1) process with trend τ it . We also allow the firm to adjust quickly. Efficiency may be a random walk (or a random walk with trend), for example (cf, Kneip et al. 2012) and thus the model allows for non-stationarity. In our empirical illustration of the KFE that we explore in Section 5, we estimate production efficiency using a restricted version of the translog (RTRANS) production function. The restricted version of the translog that we use provides us with an empirical vehicle that suits our purpose in this introduction of a new estimator and is statistically supported over the full translog model.Footnote 5 As a check of the robustness of results based on the restricted translog model we also present estimation results from the full translog model in the Appendix 2.
We calculate the time-varying production frontier intercept common to all producers in period t as \({\hat \mu _t} = \mathop {{\max}}_i {\hat \mu _{it}}\) (Cornwell et al. 1990). Relative technical efficiency is estimated as \(T{E_{it}} = \exp( - {\hat u_{it}})\), where \({\hat u_{it}} = {\hat \mu _t} - {\hat \mu _{it}}\). Equation system 1 can be rewritten as:
where
For the initialization of the Kalman filter, one can use the initial values that are implied by stationarity. In the case of non-stationary states, diffuse priors can be used. One practical choice is setting the mean squared error matrix of the initial states to be a constant multiple of the identity matrix. The constant is chosen by the econometrician and should be a large number. Alternatively, one can utilize an exact diffuse initialization.Footnote 6 For the sake of simplicity we prefer using the former diffuse initialization method. The traditional Kalman filter estimation may be numerically unstable due to rounding errors which might cause variances to be non-positive definite during the update process. One solution to this issue is using the square-root Kalman filter. Hence, we further implement the square-root Kalman filter.Footnote 7
A simpler and yet flexible model we will use is:
This model generalizes the panel data models where the effects term is time-invariant by using time-specific local approximations, i.e., Schmidt and Sickles (1984). When Q = 0, μ it is a deterministic function of initial values, i.e., μ it = μ i0. When choosing this model we follow a commonly used modeling of time-varying parameter models (i.e., random walk parameters); and we do not claim that this model is preferred over the more general model presented above. However, the simple model may perform better for relatively shorter panel data applications as diffuse initialization of state variables eats up smaller number of observations.Footnote 8 For example, in our empirical model, which uses an unbalanced panel data set with 11 time periods, the full model was not suitable for estimation.Footnote 9
KFE is a random effects-type estimator, in the sense that E[X it μ it ] = 0 is needed for consistency, and is considerably flexible in terms of capturing latent cross-sectional variations that can change over time and which we consider herein unobservable productivity effects. If the ε it or μ it (effects) terms are correlated with the regressors, then the parameter estimates are inconsistent. The KFE can be modified in line with the control function approach used by Kim and Kim (2011) in order to allow for endogenous regressors that are correlated with the ε it term.Footnote 10 Kim (2008) provides a solution to a similar endogeneity problem in the context of Markov-switching models when the state variable and regression disturbance are correlated. If the regressors are correlated with the effects term, then we can estimate the first differenced model:
by instrumental variables and standard Kalman filter estimation methods can be applied to the consistent residuals, \({y_{it}} - {X_{it}}\hat \beta \), in order to obtain the consistent hyperparameter estimates.Footnote 11
3 Monte Carlo experiments
In this section we implement a set of Monte Carlo simulations to examine the finite sample performance of the KFE. For expositional simplicity we consider a production model. The data generating process is given by:Footnote 12 , Footnote 13
where \(x_{it} = [x_{1it} \quad {x_{2it}} ] \sim {\bf NID}( 0,( {I_2} - {R^2} )^{ - 1} ), \) \(\beta \, = \,[ {\beta _1}\ \ {\beta _2}]\prime \,= \,\left[ {0.5}\ \ {0.5} \right]\prime \), \(\sigma _\varepsilon ^2 = 1\), and
The generated values for x are shifted around three different means to obtain three balanced groups of firms. We chose m 1 = (5, 5)′, m 2 = (7.5, 7.5)′, and m 3 = (10, 10)′ as the group means. We simulate a sample of size (n i , n t ) = (50, 60). Each simulation is carried out 1000 times. We consider five different data generating processes for the μ it term:
where \({\mu _i} \sim {\bf NID}\left( {0,\,\,1} \right)\); η t = exp(−h(t−n t )), h = (0.5)/(n t ), and \({u_i} \sim {\bf{NI}}{{\bf{D}}^ + }\left( {0,\,1} \right)\); \({a_{li}} \sim {\bf{N}}\left( {0,1} \right)\); \({b_{lri}} \sim {\bf{NID}}\left( {0,1} \right)\); r i,t+1 = r it + v it and \({r_{i1}} \sim {\bf{NID}}\left( {0,1} \right)\); and \({v_{it}} \sim {\bf{NID}}\left( {0,1} \right)\).
We consider five estimators in our simulations. Each of these estimators correspond to one of the DGPs. The estimators are: Fixed effects (FE) estimator, CSS within estimator (CSSW), Fourier estimator (FOE), Battese–Coelli estimator (BC), and KFE. FE, CSSW, and FOE are described as follows:
where M Q = I−Q(Q′Q)−1 Q′, Q = diag(W i ) is a block diagonal matrix with W i matrices on the diagonal, and W i is a matrix with rows W it . For example, we have W it = 1 for the FE estimator W it = [1, (t/n t ), (t/n t )2] for the CSSW estimatorFootnote 14, and W it = [1, sin(2tπ/n t ), sin(4tπ/n t ), cos(2tπ/n t ), cos(4tπ/n t )] for the FOE.
Excepting the BC estimator, technical efficiencies are estimated as \(T{E_{it}} = \exp( - {\hat u_{it}})\), where \({\hat u_{it}} = \mathop {\max}\nolimits_i {\hat \mu _{it}} - {\hat \mu _{it}}\). The BC estimator assumes that u it = η t u i where \({u_i} \sim {\bf NI}{\bf D}^ {+} \left( {m,\sigma _u^2} \right)\) and η t = exp(−h(t−n t )). Let e it = ε it −μ it . For the BC estimator the efficiency is estimated by:
where η = (η 1, η 2,…,η n )′, Φ represents the distribution function for the normal random variable and
For the KFE we assume the following model:
Hence, for the KFE the effects term is modeled as a random walk, which is consistent with the local level model of univariate time series. We provide the bias, the variance, the mean squared error (MSE) of the coefficients, the (normalized) MSE of the efficiency estimates as well as the Pearson and Spearman correlations of efficiency estimates with the true efficiency levels. The MSE of the efficiencies are calculated as follows:
where TE0it is the true technical efficiency level and \(\widehat{TE}_{it}\) is the estimated efficiency level. The results for the Monte Carlo experiments are given in Tables 1, 2, 3, 4 and 5.
For the β estimates, the estimators generally show similar performances. For both the β estimates and the efficiency estimates we find that whenever there is a high variation in the efficiency term the less flexible estimators, such as FE and BC, perform worse than the others. KFE performs particularly well in terms of correlations between the true efficiency and the estimated efficiency. It is worth noting that all estimators other than the FOE and the KFE performed very poorly for DGP3. Indeed, the FE and BC estimators show almost no correlation between the true efficiency and the estimated efficiency.Footnote 15 This is because these estimators are not flexible enough to capture the time-varying pattern of the efficiency. Hence, this simulation study shows that when the efficiencies of the firms fluctuate the performance of non-flexible efficiency estimators can be arbitrarily misleading in capturing the performances of firms.
Finally, we present simulation results for smaller sample sizes in Tables 6 and 7: (n i , n t ) = (50, 10) and (n i , n t )= (10, 60). As in our earlier simulations the estimators performed more or less the same in terms of β estimates. Hence, we only summarize their performance for efficiency estimation. The last two rows are the averages of MSE values and Pearson correlations which may serve as an aggregate measure of performance. These tables also confirm that the KFE estimator performs quite well in terms of capturing the unobserved efficiency. A striking observation is that KFE performs well even for relatively shorter panels.
4 The U.S. Airline Industry 1999–2009
4.1 The data
In order to illustrate our estimator and its usefulness in applied settings, we utilize annual data from the U.S. airline industry during the period 1999–2009. The third author has written extensively on commercial airline efficiency issues in the U.S., Europe, and in Asia. We view the example below as informative in regard to the usefulness of our estimator in modeling efficiencies in the airline industry and how it may inform researchers in more extensive industry studies as to potential limitations in their modeling approaches and alternative approaches they may wish to consider, such as ours. The time period we choose is one during which the U.S. airlines faced serious financial troubles. The financial losses for domestic passenger airline operations were more than three times the losses between 1979–1999. Some of the exogenous cost shocks during the sample period were due to increased taxes and jet fuel prices. At the same time fares fell and remained relatively low. Real jet fuel prices were about 20 % lower in 2009 than in 2000. Since 1979 demand grew steadily. However, we observe sharp demand drops during the recession of 2001–2002 and 2008–2009. Due to capital costs and sticky labor prices such unanticipated decreases in demand brought additional complications to an industry which had been experiencing relatively stable and steady demand growth. Another feature of the sample time period is the increase in load factors. Average load factors increased from 71 to 81 % between 2000 and 2009 due in part to improved yield management techniques and reduced flight frequency but which also lead to reduced levels of service quality.Footnote 16
The unbalanced data is mainly obtained from the International Civil Aviation Organization (ICAO). The data set that we use has 35 airlines and 298 observations.Footnote 17 Input and output variables are constructed following the approaches of Sickles (1985) and Sickles et al. (1986). Inputs are flight capital (K, quantity of planes), labor (L, quantity of pilots, cabin crew, mechanics, passenger and aircraft handlers, and other labor), fuel (F, quantity of barrels of fuel), and materials (M, quantity index of supplies, outside services, and non-flight equipment’s). We focus on value added from capital and labor in our empirical illustration of the KFE by netting out from revenue output (RTK, revenue ton kilometers) the value of the intermediate energy and materials. Thus our technology is rather simple and uses capital and labor to produce value added revenue ton kilometers.
In addition to the above, we include two sets of control variables into our model to account for the heterogeneity of output and the capital input. The first set of control variables is concerned with service characteristics: (i) aircraft stage length (SL) and (ii) load factor (LF). SL is the average length of a route segment, obtained by dividing the miles flown by the number of departures. The shorter (low value) the stage length the shorter the period an airlines’ aircraft spends in each flight segment. LF reflects the average occupancy of an airline’s aircraft seats, is considered a measure of service quality, and is often used as a proxy for service competition. A lower load factor often implies that the airline assigns a relatively larger number of planes to a particular route and reflects higher service quality by the airline. The second set of control variables is concerned with capital stock characteristics. The first is the average size of the airline’s aircraft (SIZE). The larger the size of the aircraft the more services can be provided without a proportionate increase in factors such as flight crew, passenger and aircraft handlers, and landing slots. The second is the percentage of each airline fleet that is a (JET) aircraft to total number of aircraft. JET is considered as a proxy for the aircraft speed. The jet aircraft tends to fly around three times as fast as turboprops aircraft and in addition the jet aircraft requires a relatively lower number of flight crew resources. A brief description of the variables is given in Table 8.
4.2 Analysis using the KFE
In this section we examine the technical efficiency trends in the U.S. airline industry during the period 1999–2009 using our new KFE and compare our findings to those from the Battese–Coelli (BC) and the Cornwell, Schmidt, and Sickles within (CSSW) estimator with efficiency modeled as depending only on deterministic time period proxies that vary over time. We utilize the quadratic specification used in the U.S. airline empirical illustration of Cornwell et al. (1990). The BC estimator is probably the most widely used of the panel estimators and is a random effects type estimator of efficiency change that also utilizes a deterministic time trend. The CSSW has somewhat more flexibility and provides a fixed effects treatment. We estimate the value-added production function of the U.S. airlines (revenue ton kilometers less a value weighted average of materials and energy). The production function is specified as linear in logs as:
where \({\varepsilon _{it}} \sim {\bf{NID}}(0,\sigma _\varepsilon ^2)\) and \({e_{it}} \sim {\bf{NID}}(0,\sigma _e^2)\) are independently distributed error terms.Footnote 18 The estimates for the production function parameters and average efficiencies for the KFE and the BC and CSSW are given in Table 9 and Fig. 1, respectively.Footnote 19 The overall average efficiencies for the KFE and the CSSW and BC estimators are 0.577, 0.438, and 0.632, respectively.
The median of the returns to scale values for the KFE and the BC and CSSW estimators are 0.883, 0.94, and 1.034, respectively. A common finding for the airline industry is that the airlines operate in a constant returns to scale environment. In a single-output production setting, Basu and Fernald (1997) provide a theoretical proof that the value added estimate of returns to scale is smaller (greater) than the corresponding gross output model when there is decreasing (increasing) returns to scale. Hence, there is a magnification effect for returns to scale estimates when a value added production function is used.Footnote 20 Therefore, the returns to scale estimate for the KFE might have been driven by this fact. Nevertheless, the constant returns to scale value of one lies within one sample standard deviation away from the mean value of returns to scale estimates from the KFE. In terms of regularity conditions, KFE outperforms other two estimators. More precisely, while the KFE satisfies curvature regularity condition at each time period, the BC and CSSW violate curvature regularity condition at each time period. At the median values of the regressors, all three estimators satisfy monotonicity conditions at each time period. According to KFE estimates, the average efficiency of the U.S. airlines is relatively stable for the second half of the study period. However, there is some evidence of a decrease in efficiency for the first half of the study period.
One potential empirical concern would be whether the effects term is correlated with the regressors or not. If the effects term is correlated with the regressors, then the coefficient estimates would be inconsistent. One advantage of the CSSW estimator over the random effects-type estimator is that even when the regressors are correlated with the effects term, the parameter estimates are consistent. Hence, the parameter estimates from the CSSW model can be used to test the consistency of parameter estimates from the KFE. We test the consistency of parameter estimates from KFE using a Wu-Hausman test and cannot reject the KFE estimates at the 5 % significance level.
We also check the robustness of our results by estimating a full version of the translog model. A common problem with the translog production function is that by increasing the number of variables by adding second-order ln terms to the Cob–Douglas functional form the second order terms tend to exhibit considerable multicollinearity. The full translog model estimates are given in the Appendix. For the full translog model, few of the parameters were significant at the 5 % level. We choose our final model specification based on the BIC for the Kalman filter. This criterion is:
where L is the likelihood value, s is the sample size, p is the number of hyperparameters, and d is the number of diffuse priors (Durbin and Koopman 2001). The BIC values for the full translog and restricted translog forms are 1.805 and 1.714, respectively. Based on the BIC and the fact that almost all the parameters of the full translog model are insignificant, the restricted translog functional form is preferred on statistical grounds.
5 Conclusions
In this study we have proposed a way to measure technical efficiency via the Kalman filter. Our new Kalman Filter estimator (KFE) provides a local approximation to general time and cross sectionally varying effects terms in a standard panel model. We examine the new estimator in a series of limited Monte Carlo experiments. Our simulation results indicate that while the performance of the KFE is similar to the performances of the other estimators for the coefficient estimates, the KFE outperforms the less flexible estimators in terms of the correlation of the effects with true effects. A result of our simulations is that the widely used BC estimator performed very poorly whenever there is substantial variation in the effects, or for our canonical stochastic frontier efficiency model, the efficiency term. If the sample data contains events that can cause jumps in the productivity of firms, then the KFE estimator appears able to improve on other standard panel treatments that are less flexible in specifying the temporal variation in the effects. We then used the KFE in order to estimate the average efficiency of the U.S. airlines. Point estimates for the KFE indicate that average efficiency of the U.S. airlines fell by more than 10 % during earlier years of time period, but these trends are not stable. What does appear to be the case is that there is no strong or even weak evidence that airlines experienced improved efficiencies over the sample period. Given that there were no particularly important new technical innovations during the sample period, the sizeable losses incurred by the industry as fares continued to be held down by competitive pressures were not surprising. Moreover, many of the efficiency gains from reorganized networks, code-sharing arrangements, and other best business practices apparently had already been realized by the beginning of the sample period.
Notes
See also Galán and Pollitt (2014) for an empirical study.
See, also Sickles (2005).
Our paper is a substantially revised and extended version of Chapter 2 in Levent Kutlu’s dissertation, Market Power and Efficiency (2010a). Recently, independent from us, Peyrache and Rambaldi (2013) proposed a similar Kalman filter model for estimating efficiencies.
In the Kalman filter setting it is possible to estimate a cost function with/without input share equations. For the simultaneous equations setting we do not consider a stochastic frontier model because of so called Greene’s problem. See Kumbhakar (1997), Kumbhakar and Lovell (2003), and Kutlu (2013).
See Durbin and Koopman (2001) for more details about initialization.
See Appendix 1 for more details about required degrees of freedom. The degrees of freedom requirement may be eased by using other (yet restrictive) initialization approaches.
We failed to estimate the full model for this short panel data.
See Harvey (1989) for more details on this type of solutions to the endogeneity problem in the Kalman filter setting.
Note that for the KFE μ it may be negative or positive. Hence, as long as the u it term is predicted properly the sign of μ it is not important. However, in the simulations the production model is written in a general form so that BC production model is also nested. Hence, for this purpose the sign of μ it is negative.
The original CSSW estimator assumes W it = [1, t, t 2]. However, for the simulations we normalize t by n t . This normalization does not affect the results and is done for numerical purposes.
In some of the simulation runs we observed even negative correlations.
For more information about the financial situations of U.S. airlines see Borenstein (2011).
The full data set has 39 airlines and 321 observations. We droped 1 airline with less than 4 observations and 3 cargo airlines.
As mentioned in our theoretical section we want to concentrate on the simple local level model rather than the full model as it is easier to estimate. For example, our attempt to estimate the full model failed. The alternative estimates for random walk with deterministic trend and AR(1) effects models are provided in Appendix 2.
When calculating the efficiency estimates, we trim the effects term from the upper and lower 7.5 % percentiles, observed at least at one time period, to remove the outlier effects. See, Berger (1993), Berger and Hannan (1998), Kutlu (2012), and Kneip et al. (2012) for more details. See, also Appendix 2 for some robustness check for trimming.
For similar results see also Diewert and Fox (2008).
Note that smoothing is not needed to get the MLE estimates. The smoothing equations are calculated after the estimations. The Kalman filter uses past and current observations to predict the state variables; and thus it does not use all information when calculating the state variable predictions. Once the parameters of the model are estimated by MLE, the smoothing enables us to update our predictions using information from all time periods. This is why after smoothing procedure the predictions of state variables look smoother. Hence, the name smoothing.
The initial values for the AR(1) model are estimated as parameters.
References
Ahn SC, Good DH, Sickles RC (2000) Estimation of long-run inefficiency levels: a dynamic frontier approach. Econ Rev 19:461–492
Ahn SC, Lee YH, Schmidt P (2013) Panel data models with multiple time-varying individual effects. J Econ 174:1–14
Aigner DJ, Lovell CAK, Schmidt P (1977) Formulation and estimation of stochastic frontier production functions. J Econ 6:21–37
Assaf AG, Gillen D, Tsionas EG (2014) Understanding relative efficiency among airports: a general dynamic model for distinguishing technical and allocative efficiency. Transp Res Part B 70:18–34
Battese GE, Coelli TJ (1992) Frontier production functions, technical efficiency and panel data with application to paddy farmers in India. J Prod Anal 3:153–169
Basu S, Fernald JG (1997) Returns to scale in U.S. production: estimates and implications. J Polit Econ 105:249–83
Berger AN (1993) Distribution free estimates of efficiency in US banking industry and tests of the standard distributional assumptions. J Prod Anal 4:261–292
Berger AN, Hannan TH (1998) The efficiency cost of market power in the banking industry: a test of the “Quiet Life” and related hypotheses. Rev Econ Stat 80:454–465
Borenstein S (2011) On the persistent financial losses of U.S. Airlines: a preliminary exploration, Working paper
Cornwell C, Schmidt P, Sickles RC (1990) Production frontiers with time-series variation in efficiency levels. J Econ 46:185–200
Desli E, Ray SC, Kumbhakar SC (2003) A dynamic stochastic frontier production model with time-varying efficiency. Appl Econ Lett 10:623–626
Diewert WE, Fox KJ (2008) On the estimation of returns to scale, technical progress and monopolistic markups. J Econ 145:174–193
Durbin J, Koopman SJ (2001) Time series analysis by state space methods, oxford statistical series 24. Oxford University Press, Oxford
Emvalomatis G, Stefanou SE, Lansink AO (2011) A reduced-form model for dynamic efficiency measurement: application to dairy farms in Germany and The Netherlands. Am J Agric Econ 93:161–174
Färe R, Grosskopf S, Sickles RC (2007) Productivity? of U.S. Airlines after deregulation. J Transp Econ Policy 41:93–112
Galán JE, Pollitt MG (2014) Inefficiency persistence and heterogeneity in Colombian electricity utilities. Energy Econ 46:31–44
Harvey AC (1989) Forecasting, structural time series models and the Kalman filter. Cambridge University Press, Cambridge
Huang TH, Chen YH (2009) A study on long-run inefficiency levels of a panel dynamic cost frontier under the framework of forward-looking rational expectations. J Bank Financ 33:842–849
Jin H, Jorgenson DW (2009) Econometric modeling of technical change. J Econ 157:205–219
Jondrow J, Lovell CAK, Materov IS, Schmidt P (1982) On the estimation of technical inefficiency in the stochastic frontier production function model. J Econ 19:233–238
Kalman RE (1960) A new approach to linear filtering and prediction problems. J Basic Eng Trans ASMA Ser D 82:35–45
Karakaplan MU, Kutlu L (2015) Consolidation policies and saving reversals. Working paper (SSRN: http://ssrn.com/abstract=2607276).
Kim CJ (2006) Time-varying parameter models with endogenous regressors. Econ Lett 91:21–26
Kim CJ (2008) Dealing with endogeneity in regression models with dynamic coefficients. Found Trends Econ 3:165–266
Kim CJ, Nelson CR (2006) Estimation of a forward-looking monetary policy rule: a time-varying parameter model using ex post data. J Monet Econ 53:1949–1966
Kim Y, Kim CJ (2011) Dealing with endogeneity in a time-varying parameter model: joint estimation and two-step estimation procedures. Econ J 14:487–497
Kneip A, Sickles RC, Song W (2012) A new panel data treatment for heterogeneity in time trends. Econ Theory 28:560–628
Kumbhakar SC (1990) Production frontiers, panel data, and time-varying technical inefficiency. J Econ 46:201–211
Kumbhakar SC (1997) Modeling allocative inefficiency in a translog cost function and cost share equations: an exact relationship. J Econ 76:351–356
Kumbhakar SC, Lovell CAK (2003) Stochastic frontier analysis. Cambridge University Press, Cambridge
Kutlu L (2010a) Battese–Coelli estimator with endogenous regressors. Econ Lett 109:79–81
Kutlu L (2010b) Market power and efficiency. Unpublished PhD dissertation, Rice University
Kutlu L (2012) U.S. banking efficiency, 1984–1995. Econ Lett 117:53–56
Kutlu L (2013) Misspecification in allocative inefficiency: a simulation study. Econ Lett 118:151–154
Kutlu L, Sickles RC (2012) Estimation of market power in the presence of firm level inefficiencies. J Econ 168:141–155
Lee YH, Schmidt P (1993) A production frontier model with flexible temporal variation in technical efficiency. Meas Prod Effic Tech Appl 237–255
Meeusen W, van den Broeck J (1977) Efficiency estimation from Cobb–Douglas production functions with composed error. Int Econ Rev 2:435–444
Park BU, Sickles RC, Simar L (2003) Semiparametric efficient estimation of AR1 panel data models. J Econ 117:279–309
Park BU, Sickles RC, Simar L (2007) Semiparametric efficient estimation of dynamic panel data models. J Econ 136:281–301
Peyrache A, Rambaldi AN (2013) Incorporating temporal and country heterogeneity in growth accounting—an application to EU-KLEMS, Working paper
Pitt MM, Lee LF (1981) The measurement and sources of technical inefficiency in the indonesian weaving industry. J Dev Econ 9:43–64
Schmidt P, Sickles RC (1984) Production frontiers and panel data. J Business Econ Stat 2:367–374
Sickles RC (1985) A nonlinear multivariate error-components analysis of technology and specific factor productivity growth with an application to the U.S. Airlines. J Econ 27:61–78
Sickles RC (2005) Panel estimators and the identification of firm-specific efficiency levels in parametric, semiparametric, and nonparametric settings. J Econ 126:305–334
Sickles RC, Good DH, Johnson RL (1986) Allocative distortions and the regulatory transition of the U.S. Airline industry. J Econ 33:143–163
Sickles RC, Hao J, Sheng C (2015) Panel data and productivity measurement. In: Baltagi B (ed) Oxford handbook of panel data Chapter 17. Oxford University Press, New York, pp 517–547
Slade ME (1989) Modeling stochastic and cyclical components of technical change: an application of the Kalman filter. J Econ 41:363–383
Tran KC, Tsionas EG (2012) GMM estimation of stochastic frontier model with endogenous regressors. Econ Lett 118:233–236
Tsionas EG (2006) Inference in dynamic stochastic frontier models. J Appl Econ 21:669–676
Ueda T, Hoshino K (2005) Estimation of firms efficiencies using kalman filter and stochastic efficiency model. J Oper Res Soc Jpn. 308–317
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interests.
Appendices
Appendix 1
In this appendix we provide further details about the Kalman filter estimation. Consider two stochastic frontier models that are nested by the general setting that we described:
where
and
where
The first model assumes a random walk with deterministic trend effects term and the second model assumes a potentially non-stationary AR(1) process for the effects term. Both of these models can be estimated by using the recursive equations provided below. The estimation consists of two steps. Kalman filtering and smoothing. In the first step, the following recursive Kalman filter equations are applied:
In the second step, the smoothing is applied by using the following recursive equations:
where \({r_{i{n_t}}}\, = \,0\) and \({N_{i{n_t}}}\, = \,0\).Footnote 21 The log-likelihood is given by:
where d i is the number of diffuse states for firm i. The number of diffuse priors (per panel unit) for the first model is two. The number of diffuse priors (per panel unit) for the second model is one. If we assume that ρ = 1, the second model would still have one diffuse prior per panel unit. However, obviously, the number of parameters to be estimated would be smaller. If we assume that |ρ| < 1, the second model would not have any diffuse priors.
Let m be the number of state variables (per panel unit) and q be the number of state variables with diffuse priors (per panel unit). For diffuse initialization we assume that:
where δ is a q × 1 vector of unknown quantities and the m × q matrix A and m × (m−q) matrix S are selection matrices that consist of columns of identity matrix. Then, matrix for initialization is:
where κ→∞, P ∞ = A′A, and P * = SQ 0 S′. As it can be seen from the log-likelihood, the first d i observation(s) for panel unit i are burnt out for the sake of initialization and are not considered in the log-likelihood. Hence, for example, for the second model the first observation of each panel unit is used for initialization. The reason for this is that as long as t ≥ d i + 1, we would have P i∞t = 0. The variance matrix can be estimated using the standard maximum likelihood procedures. For the estimations we used the standard BFGS optimization method.
Appendix 2
In this appendix we present additional results based on the full translog model and our truncation scheme when calculating the efficiency estimates for KFE and CSSW estimator. We also provide estimates for alternative Kalman filter models.
The full translog estimates are given in Table 10. The parameter estimates are generally not significant even at 10 % significance level. The median of the returns to scale values for the KFE, CSSW, and BC estimators are 0.8625, 1.1478, and 1.0184, respectively. The corresponding returns to scale estimates from the restricted model were 0.883, 0.94, and 1.034, respectively. Hence, for the KFE and BC estimator the returns to scale estimates are robust to the choice of the functional form. Nevertheless, for both restricted and unrestricted translog production models the constant returns to scale value of 1 lies within one sample standard deviation away from the median value of returns to scale estimates from each of these estimates. All the estimators satisfy the monotonicity conditions at the median values of the regressors at each time period. In contrast to the restricted translog production model where only KFE satisfied the regularity conditions at the median values of the regressors, KFE and CSSW estimator satisfies the curvature conditions at each time period. BC estimator failed to satisfy the regularity conditions at four of the time periods. The estimates for the production function parameters and average efficiencies for the KFE and the BC and CSSW estimators are given in Table 10 and Fig. 2. The overall average efficiencies for the KFE, CSSW, and BC are 0.637, 0.458, and 0.605, respectively. These values are not substantially different from their restricted counterparts, i.e., 0.577, 0.438, and 0.632. The average efficiencies for the full translog model are provided in Fig. 2. In line with the restricted translog model, KFE predicts decrease in efficiency in first few years of the study period and relatively stable efficiency levels for the last couple of years.
Now, we present the efficiency estimates when the trimming for KFE and CSSW are done for top–bottom 5 % (rather than 7.5 %) of the effects term when calculating the efficiencies. The BC estimates remain the same as they are not subject to such trimming. The average efficiency estimates for 5 % trimming case are provided in Figs. 3 and 4.
Figure 5 presents the average of efficiency estimates for restricted translog model. Due to outliers the KFE and CSS model estimates are low.
Finally, we provide our estimates for alternative Kalman filter models in Table 11. As we mentioned we failed to estimate the full Kalman filter model that we presented in Eq. 1. We rather estimated the models given in Eq. 12 (random walk model with deterministic trend) and Eq. 13 (AR(1) model).Footnote 22 Based on the BIC values the random walk model with deterministic trend is not preferred. In particular, BIC values for random walk with deterministic trend and random walk models are 2.1 vs. 1.7, respectively. The second model seems to be subject to the pile up problem as one of the variance parameters is collapsed to zero. Hence, we only provide the results for the sake of completeness. Nevertheless, the estimate of ρ = 1.0315 parameter indicates that our random walk assumption for the trend term is sensible for our empirical example. Hence, these findings support our choice for using the random walk model as our benchmark model.
Rights and permissions
About this article
Cite this article
Duygun, M., Kutlu, L. & Sickles, R.C. Measuring productivity and efficiency: a Kalman filter approach. J Prod Anal 46, 155–167 (2016). https://doi.org/10.1007/s11123-016-0477-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11123-016-0477-z