1 Introduction

The Lerner index Lerner (1934) is a widely used market power measure, which is the ratio of price-marginal cost mark-up and price. One of the difficulties for calculating the Lerner index is that total cost data may not be available making the estimation of the marginal cost difficult. A solution to this problem is estimating a conduct parameter (or conjectural variations) gameFootnote 1 in which firms form conjectures about the variation in other firms’ strategies (e.g., output) in response to a change in their own strategies. For given demand and cost conditions, the conjectures corresponding to the observed price-cost margins can be estimated “as-if” the firms are playing a conduct parameter game. In this setting, the “implied marginal cost” can be estimated via a supply-demand system.

Stochastic frontier analysis (SFA) literatureFootnote 2 suffers from a similar problem. That is, the standard SFA models require total cost data in order to estimate the cost efficiency of a firm. Moreover, as we will describe in the next section, market power and efficiency are closely related concepts and ignoring inefficiency in a conduct parameter model may lead to inconsistent conduct parameter estimates. What is more, ignoring the inefficiencies of productive units may invalidate the standard deadweight loss (DWL) calculations since DWL from collusive behavior depends on inefficiency levels.Footnote 3 If the productive units exhibit inefficiency that is misinterpreted as firm heterogeneity, then the standard calculations of DWL become invalid. In such cases, Kutlu and Sickles (2012) recommend using what they call the efficient full marginal cost (EFMC) for markup calculation.Footnote 4 Hence, estimation of conduct, marginal cost, and marginal cost efficiency would be essential for a valid DWL calculation. We overcome all these issues by combining conduct parameter and SFA literatures so as to generalize a conventional conduct parameter model to allow inefficiency in marginal cost. This enables the estimation of marginal cost efficiencies and conduct parameters jointly and consistently without using total cost data. In contrast to the SFA literature, which infers cost efficiency from a cost function, we estimate the marginal cost efficiency from a supply-demand system that is derived from our conduct parameter game. This introduces a related but different measure of efficiency that is based on deadweight loss (DWL), i.e., marginal cost efficiency. Therefore, the marginal cost efficiency concept would be a valuable tool for antitrust authorities.

We would like to note that our methodology can be applied to a variety of existing conduct parameter settings. Hence, we consider our study as a guideline for estimating conduct parameter models in the presence of (marginal cost) inefficiency. Examples for how this can be accomplished are illustrated in the Appendix. Hence, our study provides a link between industrial organization and SFA literatures to enable researchers to model inefficiency in more structural settings.

In order to illustrate how our methodology can be used in an empirical framework, we apply our methodology to estimate the firm-route-quarter specific conducts and marginal cost efficiencies of U.S. airlines for routes that originate in Chicago.Footnote 5 The time period that our data set covers is 1999I–2009IV. One of the difficulties that empirical researchers face is that the availability of cost data. Aggregate airline cost data is available but not at the route level. Therefore, route level total cost data is not available. Kutlu and Sickles (2012) try to overcome this problem by incorporating a specific number of enplanements for each airline, a specific distance of each city-pair, and airline fixed effects when estimating the cost function.Footnote 6 This enables the calculation of firm-route-quarter specific marginal costs from the cost function estimation. However, their efficiency estimates are still aggregate firm-quarter specific making it difficult to understand route-specific difficulties that may increase inefficiency.Footnote 7 Moreover, Kutlu and Sickles (2012) first estimate cost efficiencies using a standard stochastic frontier model.Footnote 8 These derived efficiency estimates are used to estimate the supply relation.Footnote 9 The conduct estimates are conditional on these derived efficiency estimates. In contrast to their study, we jointly estimate the firm-route-quarter specific conducts and marginal cost efficiencies of the U.S. airlines without the use of route specific total cost data.Footnote 10 Compared to the standard stochastic frontier efficiency measures, our efficiency concept is a more relevant measure from the antitrust point of view. Moreover, our empirical results for the relationship between market concentration and efficiency can be used as a robustness check from another perspective. Our results suggest that concentration ratio (measured by CR4) and market share are negatively related to the marginal cost efficiency. In contrast to this, the concentration ratio and market share are positively related to the conduct. Finally, we find that both the conduct and DWL estimates may be biased if a conventional conduct parameter model, which ignores marginal cost inefficiency, is used.

The rest of the paper is structured as follows. In Section 2, we briefly discuss the relationship between market power and efficiency. In Section 3, we build up our theoretical model. In Section 4, we describe our data set, present our empirical model, and discuss our results. In the next section, we make our concluding remarks. Finally, in the Appendix, we present extensions of the theoretical model.

2 Market power and efficiency

“The Quiet Life Hypothesis” (QLH) by Hicks (1935) and “the Efficient Structure Hypothesis” (ESH) by Demsetz (1973) are two well-known hypotheses that relate market power to efficiency. The former claims that higher competitive pressure is likely to force management work harder, which in turn increases efficiencies of firms. The latter states that the firms with superior efficiency levels use their competitive advantages to gain larger market shares, which leads to higher market concentration and thus higher market power. The findings of Berger and Hannan (1998) and Kutlu and Sickles (2012) support the QLH for the banking and airline industries, respectively. However, Maudos and Fernandez de Guevara (2007) show evidence for ESH applicable to the banking industry. Delis and Tsionas (2009) are in favor of the QLH on average but acknowledge that the ESH may prevail in the case of highly efficient banks. The relationship between market power and efficiency has long been acknowledged by economists. However, market power and SFA literatures largely ignore this relationship. Delis and Tsionas (2009), Koetter and Poghosyan (2009), Koetter et al. (2012), and Kutlu and Sickles (2012) exemplify some studies that attempt to estimate the market powers of firms in frameworks where firms are allowed to be inefficient. Except for Delis and Tsionas (2009), the market power estimates in these studies are conditional on efficiency estimates.

In conduct parameter models, ignoring marginal cost inefficiency can potentially cause inconsistent conduct parameter estimates. In particular, the conduct parameter estimates may pick up some of the marginal cost inefficiency. Delis and Tsionas (2009) argue that if the inefficiency is not taken into account, the optimization model of firms become irrelevant, which would lead to severe bias as the level of inefficiency increases. Similarly, ignoring market power (conduct parameter) can potentially cause inconsistent cost efficiency or marginal cost efficiency estimates. The differences in market powers would lead to different firm behavior and this can be confused with the firm level inefficiency. Generally, efficiencies are measured by closeness of production units to the best-practice units observed in the market. If the firm level conducts affect the performance of the best-practice units, then the efficiency estimates which do not take this into account would not be accurate. For instance, in a market facing a Cournot competition the best practicing firm may not really be fully efficient.Footnote 11 As mentioned in the introduction, our methodology aims to overcome these difficulties by explicitly and simultaneously modeling conduct parameter and marginal cost efficiency.

3 Theoretical model

In this section, we describe the theoretical framework, used to estimate marginal cost efficiencies and conducts of firms without total cost data. The stochastic frontier literature relaxes full efficiency assumption of neoclassical production theory by allowing the firms to be inefficient. The inefficiency is treated as an unobserved component, which is captured by a one-sided error term. In the conventional stochastic frontier framework, the cost efficiencies of firms would be estimated by the following model:

$${\mathrm{ln}}{\kern 1pt} C\left( {q_{it};X_{c,it}} \right) = {\mathrm{ln}}{\kern 1pt} C^ \ast \left( {q_{it};X_{c,it}} \right) + u_{it} + v_{it}.$$
(1)

where q it is the quantity of firm i at time t; Xc,it is a vector of variables related to cost; u it  ≥ 0 is a term which is capturing the inefficiency; v it is the usual two-sided error term; and C* is the deterministic component of cost when firms achieve full efficiency. A variety of distributions are proposed for u it including the half normal (Aigner et al. 1977), exponential (Meeusen and van den Broeck 1977), truncated normal (Stevenson 1980), and gamma (Greene 1980a, 1980b, 2003) distributions. The cost efficiency of a firm, EFF it , is estimated by:

$$\begin{array}{l}EFF_{it} = {\mathrm{exp}}\left( { - \hat u_{it}} \right)\\ \hat u_{it} = E\left[ {u_{it}|u_{it} + v_{it}} \right].\end{array}$$
(2)

The stochastic frontier approach requires detailed cost data, which many times is not available. We utilize the conduct parameter approach to overcome this issue. For this purpose, instead of modelling total cost as in the conventional SFA models, we directly model marginal cost, c, as follows:

$${\mathrm{ln}}{\kern 1pt} c\left( {q_{it};X_{c,it}} \right) = {\mathrm{ln}}{\kern 1pt} c^ \ast \left( {q_{it};X_{c,it}} \right) + u_{it} + v_{it}.$$
(3)

where c* is the deterministic component of marginal cost when firms achieve full efficiency; u it  ≥ 0 is a term which is capturing the marginal cost inefficiency; and v it is a two-sided random variable, which is observed by the firm but not observed by the researcher. We call c* efficient marginal cost (EMC). Rather than estimating a cost function, we estimate a supply-demand system that enables us to calculate the marginal cost efficiency. From the antitrust point of view, which is concerned with market power and DWL estimations, the marginal cost efficiency is a more relevant efficiency concept compared to the cost efficiency concept.

Let P t  = P(Q t ; Xd,t) be the inverse demand function, Q t be the total quantity, and Xd,t is a vector of demand related variables at time t. The perceived marginal revenue (PMR) is given by:

$$\begin{array}{*{20}{l}} {PMR\left( {\theta _{it}} \right)} \hfill & = \hfill & {P_t + \frac{{\partial P_t}}{{\partial Q_t}}\frac{{\partial Q_t}}{{\partial q_{it}}}q_{it}} \hfill \\ {} \hfill & = \hfill & {P_t\left( {1 - \frac{{s_{it}}}{{E_t}}\theta _{it}} \right)} \hfill \end{array}$$
(4)

where \(s_{it} = \frac{{q_{it}}}{{Q_t}}\) is the market share of firm i at time t; \(E_t = - \frac{{\partial Q_t}}{{\partial P_t}}\frac{{P_t}}{{Q_t}}\) is the (absolute value of) elasticity of demand; \(\theta _{it} = \frac{{\partial Q_t}}{{\partial q_{it}}}\) is the conduct parameter. Three benchmark values for \(\theta _{it} = \left\{ {0,1,\frac{1}{{s_{it}}}} \right\}\) correspond to perfect competition, Cournot competition, and joint profit maximization, respectively. The supply relation is:Footnote 12

$$\begin{array}{r}P_t\left( {1 - \frac{{s_{it}}}{{E_t}}\theta _{it}} \right) = c_{it} \Leftrightarrow \\ {\mathrm{ln}}{\kern 1pt} P_t + {\mathrm{ln}}\left( {1 - \frac{{s_{it}}}{{E_t}}\theta _{it}} \right) = {\mathrm{ln}}{\kern 1pt} c_{it}\end{array}$$
(5)

where c it  = c(q it ; Xc,it). After including the econometric error terms, the supply relation becomes:Footnote 13

$${\mathrm{ln}}{\kern 1pt} P_{it} = {\mathrm{ln}}{\kern 1pt} c_{it}^ \ast + g_{it} + u_{it} + \varepsilon _{it}^s$$
(6)

where g it  = g(θ it , s it , E t ) = \( - {\mathrm{ln}}\left( {1 - \frac{{s_{it}}}{{E_t}}\theta _{it}} \right) \ge 0\) is the market power term, which is an increasing function of θ it ; u it  ≥ 0 is the inefficiency term; and \(\varepsilon _{it}^s\) is the two-sided error term.

The E t term is identified through the demand equation. Intuitively, Eq. (4) suggests that if c it and q it are highly collinear, then the conduct parameter may be identified through the variation in \(\frac{{\partial P_t}}{{\partial Q_t}}\). We assume that the demand and marginal cost functions are such that the conduct parameter and marginal cost can be separately identified.Footnote 14 In most cases, identification is a problem when the demand function is linear. For example, when the demand and marginal cost functions are linear, we do not observe a variation in \(\frac{{\partial P_t}}{{\partial Q_t}}\), and c it and q it are perfectly collinear. In this case, we cannot separately identify the conduct parameter and marginal cost. One way to achieve identification is assuming constant marginal cost.Footnote 15 Another commonly used approach that does not require constant marginal cost assumption is including the cross-product of quantity and an exogenous variable in the demand equation. When such cross-products are included in the model, the identification of conduct parameter is achieved through both parallel shifts and rotations of the demand curve. Bresnahan (1982) illustrates how identification can be achieved by such rotations for the linear demand and linear marginal cost case. He states that the logic of identification is maintained even if the curves are not linear. In general, the conduct parameter is identifiable if the inverse demand function is not separable in exogenous variables, Z, and the number of exogenous variables is enough. More precisely, we can write the inverse demand function P so that P = f(Q, r(Z)) where Z is a vector of exogenous variables but P does not take the form P = Q−1/θr(Z) + h(Q) for some functions f, r, and h if and only if the identification is impossible.

Our model is different from the standard market power models due to the additional u it term. This inefficiency term, u it , is identified by utilizing the asymmetric distribution of the composed error term, i.e., \(u_{it} + \varepsilon _{it}^s\). Intuitively, u it is identified if the signal-to-noise ratio (the variance ratio of the inefficiency component to the composed error) is not small. Hence, the identification of model parameters requires the standard conduct parameter model and stochastic frontier model identification assumptions to hold when there are endogenous variables.Footnote 16

Following Kutlu and Sickles (2012), Fig. 1 aims to illustrate the underlying mechanism of our model and consequences of ignoring inefficiency when calculating DWL. The figure includes inverse demand function, perceived marginal revenue (PMR), marginal revenue (MR) that is corresponding to monopoly scenario, marginal cost (MC), and efficient marginal cost (EMC). For illustrative purposes, we consider the same constant marginal costs, conducts, and efficiencies for each firm. P θ and Q θ are the equilibrium price and quantity at conduct level θ. Similarly, P C and Q C are price and quantity for the perfect competition scenario in which conduct equals 0. In the figure, it is assumed that under perfect competition there would be no inefficiency. If QLH holds, then as the market power, measured by θ, increases MC diverges from EMC. In our framework, the marginal cost efficiency is defined as EMC/MC. The social welfare loss at conduct level θ would be equal to the shaded area (sum of dark and light shaded regions). In Fig. 1, the efficiency is roughly 60%, which is relatively low.Footnote 17 As a result the social welfare loss due to inefficiency is substantial. The conventional DWL value, which is ignoring inefficiency, is given by the dark shaded triangular area; and is much smaller than the overall social welfare loss. When there is heterogeneity either in efficiencies or marginal cost frontiers of firms (under constant MC assumption), the calculation of DWL would be similar except that EMC would be a step function rather than a horizontal line.

Fig. 1
figure 1

Conduct, marginal cost efficiency, and social welfare

Now, we describe how this conduct parameter game would be estimated. We assume that the conduct parameter θ it is a function of variables, Xg,it, that affect firm specific market power such as market shares and concentration ratios. Modeling θ it through this function may lead to computational difficulties. In contrast, g it can be modeled directly as a function of Xg,it so that θ it is solved after getting the parameter estimates. That is, we can calculate the estimate of θ it as follows:

$$\hat \theta _{it} = \frac{{\hat E_t}}{{s_{it}}}\left( {1 - {\mathrm{exp}}\left( { - \hat g_{it}} \right)} \right)$$
(7)

where \(\hat E_t\) and \(\hat g_{it}\) are the estimates for E t and g it , respectively. The market power term, g it , is bounded by 0 and \(B_{it} = - ln\left( {1 - \frac{1}{{E_t}}} \right)\). It follows that the choice of functional form should be so that g it  ∈ [0, B it ]. In this study, we use:

$$g_{it} = \frac{{B_{it}{\kern 1pt} {\mathrm{exp}}\left( {X_{g,it}^\prime \beta _g} \right)}}{{1 + {\mathrm{exp}}\left( {X_{g,it}^\prime \beta _g} \right)}}{\mathrm{.}}$$
(8)

One of the drawbacks of the standard stochastic frontier models is that if the regressors are correlated with v it or u it , then the parameter and efficiency estimates are inconsistent. In this setting, v it and u it terms are assumed to be independent, which can be a questionable assumption in a variety of settings. Similar to Kutlu (2010), Karakaplan and Kutlu (2017a, b), and Kutlu et al. (2018), we use a limited information maximum likelihood based approach to handle the endogeneity issue that occurs when the two-sided error term is correlated with the regressors or u it . The approach solves the endogeneity issue by including a bias correction term in the model. For example, u it can be a function of regressors (e.g., market shares of firms or concentration ratios) that are correlated with the two-sided error term. Consider the following supply relation model with endogenous explanatory variables:

$$\begin{array}{*{20}{l}} {{\mathrm{ln}}{\kern 1pt} P_{it}} \hfill & = \hfill & {{\mathrm{ln}}{\kern 1pt} c_{it}^ \ast + g_{it} + u_{it} + \varepsilon _{it}^s} \hfill \\ {X_{en,it}} \hfill & = \hfill & {\zeta _{it}^\prime \delta + w_{it}} \hfill \\ {\left[ {\begin{array}{*{20}{c}} {\tilde w_{it}} \\ {\varepsilon _{it}^s} \end{array}} \right]} \hfill & \equiv \hfill & {\left[ {\begin{array}{*{20}{c}} {\Sigma _w^{ - 1/2}w_{it}} \\ {\varepsilon _{it}^s} \end{array}} \right]\sim {\bf{N}}\left( {\left[ {\begin{array}{*{20}{c}} 0 \\ 0 \end{array}} \right],\left[ {\begin{array}{*{20}{c}} {I_m} & {\rho \sigma _\varepsilon } \\ {\rho {\prime}\sigma _\varepsilon } & {\sigma _\varepsilon ^2} \end{array}} \right]} \right)} \hfill \\ {u_{it}} \hfill & = \hfill & {h_{it}\tilde u_{it}} \hfill \\ {h_{it}} \hfill & \ge \hfill & 0 \hfill \\ {\tilde u_{it}} \hfill & \sim \hfill & {{\bf{N}}^ + \left( {\mu _u,\sigma _u^2} \right)} \hfill \end{array}$$
(9)

where P it is the price; Xen,it is an m × 1 vector of all endogenous variables used in modelling \(c_{it}^ \ast \), g it , and u it ; ζ it  = I m  ⊗ Z it where Z it is a l × 1 (with l ≥ m) vector of all exogenous variables. The irregular term \(\varepsilon _{it}^s\) is correlated with the regressors but independent of \(\tilde u_{it}\) conditional on Xen,it and Z it .Footnote 18 Hence, \(\varepsilon _{it}^s\) is independent of u it conditional on Xen,it and Z it . Note that \(\varepsilon _{it}^s\) and u it may still be correlated unconditionally. By applying a Cholesky decomposition of the variance-covariance matrix of \(\left[ {\begin{array}{*{20}{c}} {\tilde w_{it}^\prime } & {\varepsilon _{it}^s} \end{array}} \right]^\prime \), we can rewrite the supply equation as follows:

$$\begin{array}{*{20}{l}} {{\mathrm{ln}}{\kern 1pt} P_{it}} \hfill & = \hfill & {{\mathrm{ln}}{\kern 1pt} c_{it}^ \ast + g_{it} + \sigma _\varepsilon \rho {\prime}\tilde w_{it} + u_{it} + \tilde \varepsilon _{it}^s} \hfill \\ {} \hfill & = \hfill & {{\mathrm{ln}}{\kern 1pt} c_{it}^ \ast + g_{it} + \eta {\prime}\left( {X_{en,it} - \zeta _{it}^\prime \delta } \right) + u_{it} + \tilde \varepsilon _{it}^s} \hfill \end{array}$$
(10)

where \(\tilde \varepsilon _{it}^s\sim {\bf{N}}\left( {0,\left( {1 - \rho {\prime}\rho } \right)\sigma _\varepsilon ^2} \right)\) and \(\eta = \sigma _\varepsilon \rho {\kern 1pt} {\mathrm{\Sigma }}_w^{ - 1/2}\). The parameters of this supply relation can be estimated in one stage with the maximum likelihood estimation method. However, sometimes it is simpler to get the consistent parameter estimates in two stages by first estimating the bias correction term \(\eta {\prime}\left( {X_{en,it} - \zeta _{it}^\prime \delta } \right)\) and then including the estimate of bias correction term in the second stage where we apply traditional SFA methods.Footnote 19 For the two-stage approach, the standard errors need to be corrected, e.g., by a bootstrap procedure. In our empirical section, we use the limited information maximum likelihood estimator that we presented in this section, i.e., the one-stage method.

Amsler et al. (2016, 2017) relax the conditional independence assumption for \(\varepsilon _{it}^s\) and \(\tilde u_{it}\) by using a copula approach. Kutlu et al. (2018) show by simulations that if the firm-specific individual effects are included in the model, even if \(\varepsilon _{it}^s\) and \(\tilde u_{it}\) are correlated conditionally, the estimates are still reasonable. When it is difficult to find instruments for endogenous variables, one may use the copula approach proposed by Tran and Tsionas (2015). Their model does not require the availability of outside information. Instead, to obtain the instruments, a flexible joint distribution of the endogenous variables and composed error is constructed. If the researcher is inclined to use Bayesian methods, the stochastic frontier model of Griffiths and Hajargasht (2016) can be applied to our framework. Traditional stochastic frontier models impose inefficient behavior on all firms. If it is believed there is a mixture of efficient and inefficient firms in the sample, it is possible to apply the model of Tran and Tsionas (2016).Footnote 20 Finally, one can disentangle firm specific heterogeneity from inefficiency by using variations of true-fixed effects (or true-random effects) model of Greene (2005a, 2005b) that allow endogeneity as in Kutlu et al. (2018). For example, in the airport and banking cost efficiency contexts Kutlu et al. (2018) and Kutlu and McCarthy (2016), respectively, illustrate that efficiency estimates can be substantially different if productive unit heterogeneity is not controlled in the estimations.

In the stochastic frontier model that we presented, we have exogenous and endogenous regressors along with some “outside instruments.” Hence, our identification assumptions are somewhat different from the standard stochastic frontier models without endogenous variables. Our main identification assumption is that the exogenous variables (including the outside instruments) are uncorrelated with \(\varepsilon _{it}^s\) and w it and that there are enough instruments. If \(\varepsilon _{it}^s\) and w it are independent of the exogenous variables (including the outside instruments), then for each endogenous variable and its functions a single control function would be enough to achieve identification. For example, if z is a valid instrument for an endogenous variable x, then the model parameters can be identified by a single control function even when the model has x and x2 as regressors.Footnote 21

In order to make our contribution clearer, we finalize this section by comparing our model with two closely related papers. Kutlu and Sickles (2012) consider a dynamic conduct parameter model in which under the full market power scenario the firms play an efficient super-game equilibrium where the firms cooperate subject to incentive compatibility constraints. They estimate a market specific aggregate model and assume that the corresponding aggregate incentive compatibility constraint is a function of efficiency. The only place that efficiency enters their model is within the incentive compatibility constraint causing the parameter estimates from the static counterpart of their model to be invariant to the presence of inefficiency. This contrasts with our setting as our static model directly includes inefficiency in the supply equation. For their static model, the presence of efficiency matters only in the calculation of DWL. Although Kutlu and Sickles (2012) introduce the marginal cost efficiency concept, they assume that marginal cost efficiency equals cost efficiency (in the SFA sense), which is a somewhat strong assumption. Hence, they simply estimate a stochastic frontier cost function to obtain the cost efficiencies of the firms. A direct implication of this is that their dynamic model requires total cost data.

In another closely related study, Delis and Tsionas (2009) estimate a supply-demand-cost system where the cost function is modeled in the SFA framework. Hence, this study requires total cost data and efficiency concept is cost efficiency in the SFA sense. Their supply equation is derived from a standard conduct parameter model, which is invariant to the presence of inefficiency, and cost inefficiency enters their model only through the stochastic frontier cost function. This contrasts with our setting as our model directly includes (marginal cost) inefficiency in the supply equation. As a final remark, supply equation of Delis and Tsionas (2009) has revenue as the dependent variable and the right-hand-side variables include cross products of (total) cost with many other variables, which are expected to be endogenous in this setting. This complication may pose some estimation related difficulties if not controlled properly.

4 The data

In order to testify our theoretical framework, we use the U.S. domestic airline data. One of the main data sources that we use is the Passenger Origin-Destination Survey of the U.S. Department of Transportation (DB1B data set). This data set is a 10% random sample of all tickets that originate in the U.S. on domestic flights. In our data set a market is defined as a directional city-pair (route). Calculation of prices and quantities are based on the tickets that have no more than three segments in each direction. Approximately 1% of tickets are eliminated during the elimination of tickets with more than 3 segments. We only focus on coach class tickets due to the differences in demand elasticities and other characteristics between coach class and high-end classes (first class and business class).

Our data set covers the time period from the first quarter of 1999 to fourth quarter of 2009. During this time period, the U.S. airlines faced serious financial problems. As pointed out by Duygun et al. (2016), the financial losses for domestic passenger airline operations during this time period were substantially higher than their losses between 1979 and 1999. Increase in taxes and jet fuel prices, relatively low fares, and sharp decrease in demand were some of the challenges facing U.S. airlines. During this time period, there were dramatic increases in load factors.

We provide the details about data construction process as follows. First, all multi-destination tickets are dropped as it is difficult to identify the ticket’s origin and destination without knowing the exact purpose of the trip. Second, any itinerary that involves international flights was eliminated. Third, the fare class for high-end carrier was adjusted. That is, for some airlines, due to marketing strategies, only first class and business class (high-end) tickets are provided to consumers on all routes. However, the quality should be taken as coach class. For such airlines, if there is no coach class tickets from a certain carrier in a given quarter, we consider all tickets as coach class tickets. Fourth, tickets that have high-end segments and unknown fare classes were dropped.

Following Borenstein (1989) and Brueckner et al. (1992), we assume that ticketing carrier is the relevant airline. After further elimination of multi-ticketing-carrier tickets, firm specific average segment numbers (SEG) and average stage length (SL) on a given route are calculated as indicators for quality and costs. Moreover, our data set includes a distance variable which is the shortest directional flight distance (DIST). A ticket is online when the one-way ticket does not involve change of airplanes. The online variable is the percentage of online tickets.

For the price variable, we use the average price of all tickets for a given airline on a given route in given quarter.Footnote 22 All tickets with incredibleFootnote 23 prices are dropped from our data set. Following Borenstein (1989) and Ito and Lee (2007), we eliminate the open-jaw tickets since it would be difficult to distribute the ticket price into outbound and inbound segment for open jaw tickets. We drop the tickets that have a price less than 25 dollars or higher than 99 percentile or more than 2.5 times standard deviation from the mean for each airline within a route. The tickets that have price less than 25 dollars are considered as frequent flyer program tickets and the tickets that have prices higher than 99 percentile are considered to be input (key punch) errors for the data set. For the round trip tickets, we divided the total price by two to get the one-way price.

The cost data set is constructed from the firm level data of DOT’s airline production data set (based on Form 41 and T100).Footnote 24 While the airline-specific total cost data are available for the whole U.S. airline industry, the route-airline-specific total cost data are not available. We control for three types of important costs: labor price (LP), energy price (EP), and capital price (KP). The salaries and benefits for five main types of personnel are provided in Form 41/P6. Annual employee number is given in Form 41, P10. We interpolated the annual employee data to get the quarterly values. For energy price, we only capture the cost based on aircraft fuel. The energy input is developed by combining information on aircraft fuel gallons used with expense data per period. Flight capital is described by the average size (measured in number of seats) of the fleet. The number of aircraft that a carrier operated from each different model of aircraft in airline’s fleet is collected from DOT Form 41. For each quarter, the average number of aircraft in service is calculated by dividing the total number of aircraft days for all aircraft types by the number of days in the quarter. This serves as an approximate measure of the size of fleet.

In order to estimate the demand, we also include the city specific demographic variables: per capita income (PCI) and population (POP). We obtained the city level per capita income and population data from Bureau of Labor Statistics. We interpolate the annual data to get the quarterly PCI and population variables for each city. For each origin-destination city-pair, we use the population weighted PCI as the route-specific PCI measure. Similarly, city-pair population is the average population of origination and destination cities. In order to get the real prices, we deflated the nominal prices by Consumer Price Index (CPI) and use the first quarter of 1999 as the base time period. Because metropolitan areas have available demographic information whereas airports located in small cities do not, the number of the city-pairs is further reduced in our final data set.

We apply our theoretical method on the routes that originate from Chicago, which is a popular choice because of its relatively large airport and wide selection of airline firms. For instance, Brander and Zhang (1990) use 33 Chicago-based routes in their studies. In our final data set, we further eliminate the small firms and small routes. On a given route, small firms with market shares less than 0.01 are eliminated. For a given quarter, any route with enplanements less than 1800, i.e., 20 passengers per day or routes with less than 30 observations are dropped from the analysis. Table 1 presents the summary statistics for Chicago-based routes.

Table 1 Summary for Chicago Originating Routes

Low cost carrier (LCC) is a dummy variable that equals 1 if the ticketing carrier is a low cost carrier, otherwise it is 0. Number of firms represents the total number of firms that operate on the route. The total number of passengers is the total number of tickets sold on a given route by all airlines together in a given quarter. Total number of passengers for other routes (OQOTH) variable is the total number of tickets that are sold on the all other routes that share the same origination city. We use the geometric market share (GEOS) variable of Gerardi and Shapiro (2009) as an instrument.Footnote 25 Another instrumental variable that we use is GEONF. This variable is the product of GEOS and \(nf^ \ast = \sqrt {n_o \ast n_d} \), where n o denotes to the mean value of number of firms for all routes that share the same origination as route of interest while the n d refers to the mean value of number of firms for all routes that share the same destination city.

The final data set contains 108 routes that originate from Chicago and 14 carriers. The low cost carriers are Frontier Airlines, JetBlue Airways, Southwest Airlines, and Spirit Airlines. The remaining carriers are: Alaska Airlines, American Airlines, Continental Airlines, Delta Airlines, Northwest Airlines, United Airlines, US Airways, America West Airlines, ATA Airlines, and Trans World Airways.

5 Empirical example

The purpose of this section is providing an empirical example for our theoretical model. In particular, we estimate time-varying firm-route-specific conducts and marginal cost efficiencies of U.S. airlines for Chicago based routes. Like Brander and Zhang (1990) and Oum et al. (1993), we only consider coach class tickets. Brander and Zhang (1990) conclude that there is Cournot type competition in the airline industry, i.e., competition is quantity based. Hence, we assume a quantity based competition.Footnote 26 Our city-pair markets consist of one-way or round-trip directional trips having up to three segments in each direction. We divide the total ticket price by 2 to get the one-way fare for round-trip tickets. The demand and supply equations are estimated separately. The market demand equation is given by:

$${\mathrm{ln}}{\kern 1pt} P_{itr} = \beta _0 + \beta _1{\kern 1pt} {\mathrm{ln}}{\kern 1pt} Q_{tr} + \beta _2{\kern 1pt} {\mathrm{ln}}{\kern 1pt} PCI_{tr}{\kern 1pt} {\mathrm{ln}}{\kern 1pt} Q_{tr} + f_d\left( {X_{d,itr}} \right) + \varepsilon _{itr}^d$$
(11)

where f d is a function of demand related variables, Q tr is the total quantity at time t for route r, Xd,itr is a vector of demand related variables, and \(\varepsilon _{itr}^d\) is the conventional two-sided error term. We assume that ln Q tr and ln PCI tr ln Q tr are endogenous. Along with the exogenous variables included in the demand model, our instrumental variables are GEOS itr , GEONF itr , lnOQOTH rt , logarithm of labor price (ln LP it ), logarithm of capital price (ln KP it ), and logarithm of energy price (ln EP it ).

The supply equation is given by:

$${\mathrm{ln}}{\kern 1pt} P_{itr} = {\mathrm{ln}}{\kern 1pt} c_{itr}^ \ast + g_{itr} + u_{itr} + \varepsilon _{itr}^s$$
(12)

where \(c_{itr}^ \ast \) is the marginal cost when firms achieve full efficiency, g itr  = \( - {\mathrm{ln}}\left( {1 - \frac{{s_{itr}}}{{E_{tr}}}\theta _{itr}} \right)\) is the market power term, u itr  ≥ 0 is the inefficiency term, and \(\varepsilon _{itr}^s\) is the conventional two sided error term. The parameters of the E tr term is identified through the demand equation. We assume that the efficient marginal cost, \(c_{itr}^ \ast \), is constant with respect to quantity, i.e., it is not a function of quantity but maybe a function of exogenous cost shifters. Hence, as we described in the theoretical model section, the theoretical values for cost and marginal cost efficiencies coincide in this model. Although constant marginal cost is a relatively strong assumption, it is commonly used in the conduct parameter models. Iwata (1974), Genesove and Mullin (1998), Corts (1999), and Puller (2007) exemplify some papers that use this assumption in a variety of conduct parameter settings. We use this simplifying assumption to illustrate our methodology. Nevertheless, we approximate the efficient marginal cost function by a fairly flexible function of input prices and other cost related exogenous variables. These cost related variables include year, quarter, and airline dummy variables, which capture time-firm-specific unobserved factors.Footnote 27 Moreover, constant marginal cost assumption is not unreasonable at least around the equilibrium as there is substantial empirical evidence supporting constant returns to scale for the airline industry. We model g itr as in the theoretical model section and assume that Xg,itr = (s itr , CR4,tr, ln DIST r , t, E tr ,1)′ where CR4,tr is the concentration ratio for largest four firms on route r at time t. We assume that \(u_{itr} = h_{itr}\tilde u_{itr}\) and \(\tilde u_{itr}\sim {\bf{N}}^ + \left( {0,\sigma _u^2} \right)\) where \(\sigma _u^2 = {\mathrm{exp}}\left( {X_{g,itr}^\prime \beta _u} \right)\); and \(\varepsilon _{itr}^s\sim {\bf{N}}\left( {0,\sigma _\varepsilon ^2} \right)\) where \(\sigma _\varepsilon ^2 = {\mathrm{exp}}\left( {\beta _\varepsilon } \right)\). For the supply side, s itr and CR4,tr are assumed to be endogenous. Our instrumental variables are GEOS itr , GEONF itr , ln POP tr , and ln PCI tr . The estimations of the supply relations are done by using the limited information maximum likelihood estimation method that we described in our theoretical model section.

6 Results

In this section, we present our estimation results. The demand parameter estimates for the routes originating from Chicago are given in Table 2. We estimated the inverse demand equation by 2SLS. Our demand model controls for year, quarter, and airline. The demand elasticities are negative at each observation, i.e., E tr  > 0.Footnote 28

Table 2 Estimation for demand function

For the supply function, as we described previously, we use the one-stage limited information maximum likelihood approach to deal with endogeneity. In order to illustrate the consequences of ignoring marginal cost inefficiency, we estimated two supply models: the first one allows inefficiency (benchmark model) and the second one assumes full efficiency so that u itr  = 0 (full-efficiency model). The full-efficiency model is a standard conduct parameter model, which helps us to compare our benchmark estimates with the standard conduct parameter models. Both models include airline, year, and quarter dummy variables.Footnote 29

Table 3 shows the estimation results. The bias correction terms (η) are jointly significant at any conventional significance level, which is an indication of endogeneity. The median of the conduct estimates from the benchmark model is 0.63, which is lower than the theoretical conduct value for Cournot competition, 1.Footnote 30 At the median, the extent of competition lies somewhere between perfect competition and Cournot competition. The median of conduct estimates from the full-efficiency model is somewhat lower, 0.22, suggesting a competitive market. In other words, when we allow inefficiency, the median conduct is closer to Cournot competition benchmark, i.e., θ = 1; and when we assume full efficiency, the median conduct is closer to perfect competition benchmark, i.e., θ = 0. Moreover, the Kolmogorov-Smirmov test rejects the equality of the distributions at any conventional significance levels. This illustrates the importance of considering marginal cost inefficiency when estimating a conduct parameter model.

Table 3 Estimation for supply function

LCCs, due to their special operating style,Footnote 31 tend to have lower marginal costs compared to other airlines, which helps them to offer lower fares. Hence, it is worthwhile to examine the decomposition of conducts based on LCCs and non-LCCs. Our conduct estimates from the benchmark model for LCC and non-LCC carriers are 0.24 and 0.74, respectively. Therefore, while the conducts of LCCs are closer to perfectly competitive values, the conducts of non-LCCs are closer to Cournot competition values. The corresponding estimates from the full-efficiency model are 0.11 and 0.25, indicating relatively competitive markets for both LCCs and non-LCCs. Hence, ignoring inefficiency leads to underestimation of conducts for both LCCs and non-LCCs. The underestimation of welfare loss is boosted by the fact that the full-efficiency model ignores the marginal cost inefficiency.

The conduct parameter estimates show that an airline with higher market share tends to have higher market power. In markets with high CR4 values, it may be easier for airlines with higher market share to cooperate. The positive coefficient of CR4 in conduct verifies this. For the time period that we examine, the U.S. airlines seem to be losing market power over time. For longer flight distances the alternative transportation means (e.g., bus or car) are likely to become less attractive to the consumers. This reduction in outside competition suggests a positive relationship between market power and distance. The positive coefficient of distance variable for the market power term is in line with this intuition.

In Figs. 2 and 3, the efficiency and conduct estimates from our benchmark model are presented. In our benchmark estimates, the median efficiency estimates for the whole sample and non-LCC carriers are 82.6 and 84.4%, respectively. Hence, the efficiencies of LCCs and non-LCCs are similar. The parameter estimates for the inefficiency term show that an airline with higher market share tends to have higher inefficiency. Moreover, higher CR4 values lead to lower efficiency and the correlation between conduct parameter and efficiency is −0.19, which is statistically significant at any conventional significance level. These are in line with the QLH which postulates that higher market power leads to lower efficiency levels.Footnote 32 Based on our benchmark model, the medians of price-marginal cost markups, price-efficient marginal cost markups, and prices are $4.65, $30.32, and $140.46, respectively. However, for the full-efficiency model, the median of price-marginal cost markups is $2.49. Historically, airlines have been challenged in their efforts to generate high profits. These markup values from our benchmark model indicate that airlines may partially be responsible for the financial difficulties that they face. Our study shows that the answer to achieving reasonable profit levels may be through improving efficiency.

Fig. 2
figure 2

Efficiency estimates

Fig. 3
figure 3

Conduct estimates

Finally, we calculate a lower bound for bias in DWL estimates when the researchers use the full-efficiency model instead of the benchmark model. This number is obtained by estimating the shaded rectangular area between MC, EMC, and Q θ (see Fig. 1) where we assume that Q θ equals total observed quantity on a given route in a given quarter. The median of the lower bound over all routes and quarters is $427,715. This area measures the DWL that is caused by efficiency loss only. Hence, the DWL due to misallocation of quantity that stems from efficiency loss is not included.Footnote 33 Nevertheless, this lower bound clearly illustrates the severe consequences of ignoring marginal cost inefficiency when evaluating welfare loss using the conventional conduct parameter models.

7 Summary and concluding remarks

In this paper, we provided a conduct parameter based framework to estimate market powers and (marginal cost) efficiencies of firms simultaneously. Our methodology enables us to relax the total cost data requirement for stochastic frontier models. Total cost data may not be available for a variety of reasons. For example, firms might not want to reveal this potentially strategic information. Even when some form of total cost data are available, the data may not reflect the total cost of the relevant unit that we want to examine. For instance, in our empirical example, for the U.S. airlines only firm specific total cost data is available for the entire U.S. airline system. In such cases, the conventional stochastic frontier models cannot estimate firm-route specific efficiencies as this would require firm-route specific total cost data.

Besides relaxing a vital data requirement, our methodology aims to overcome some estimation issues. Efficiencies are generally measured by the distance between the units of production and the best practice units observed in the market. If the performance of the best-practice units depends on their market powers, then the efficiency estimates that are not taking this into account would not be accurate. We overcome this difficulty by explicitly modeling a conduct parameter game in an environment where firms are allowed to be inefficient.

In the Appendix, we provide extensions of our model for a variety of different settings including capacity constraints, stochastic marginal costs, multi-output firms, and dynamic strategic interactions. Another potential extension of our conduct parameter model is so that the firms price discriminate. For example, the marginal cost efficiency concept can be incorporated into the conduct parameter games of Kutlu (2012a, 2017a) and Kutlu and Sickles (2017). Such an extension would enable us to understand the connection between price discrimination, market power, and efficiency better.Footnote 34 Hence, our theoretical model serves as a guideline as to how conduct parameter and efficiency can be estimated simultaneously without requiring total cost data. Our guideline can be applied to a variety of conduct parameter settings.

As for the standard DWL calculations, our efficiency measure for marginal cost does not consider the fixed costs for the short run. It is possible to consider dynamic frameworks where prior investments and R&D may affect fixed costs and marginal costs. Hence, suboptimal investment decisions may result in suboptimal marginal cost levels even when the marginal cost may seem optimal in a single time horizon for given investment and R&D levels. This type of efficiency is not controlled in our conduct parameter model or in a standard stochastic frontier model. Moreover, similar to the standard stochastic frontier models, we assume that the input market is perfectly competitive. Hence, our model ignores potential market power in the input markets. The advantage of our framework over SFA setting is that we can make such structural extensions relatively easier compared to SFA setting using the already-established industrial organization literature.

We applied our methodology to estimate the conducts and marginal cost efficiencies of the U.S. airlines for the Chicago based routes between 1999I-2009IV. We found that the market concentration and market share of airlines are negatively related to the marginal cost efficiency, which is in line with the QLH. We also found that the conduct and DWL estimations may be seriously distorted if inefficiency is ignored. A more extensive empirical study is warranted but we consider this outside the scope of this paper. For example, a future study may consider a more complete list of U.S. routes and a longer time period. Moreover, although we control for the airline specific factors by including airline and time dummy variables, a study including variables related to on time rate, food service, aircraft engine information, and aircraft seat specification may be worth exploring.