Natural monopolies are typically defined as industries in which the average cost of production decreases with output. The property of increasing returns to scale fully characterizes natural monopolies in the sense that it is necessary and sufficient for the existence of decreasing average costs.Footnote 1 This result rests on the important assumption, often omitted, that input markets are perfectly competitive.

In the simplest case when there is only one input, say labor, denoted by l, the production function f(l) exhibits increasing returns to scale if and only if

$$\begin{aligned} f(\lambda l)>\lambda f(l),\quad \text {for all }\lambda>1\text { and all }l>0, \end{aligned}$$

which can be equivalently stated as

$$\begin{aligned} c(f(\lambda l))>c(\lambda f(l)),\quad \text {for all }\lambda>1\text { and all }l>0, \end{aligned}$$
(1)

where \(c(\cdot )\) denotes the cost function. Letting \(w>0\) be the competitive wage rate, we have

$$\begin{aligned} c(f(\lambda l))=w\lambda l=\lambda c(f(l)), \end{aligned}$$

and so (1) can be restated as

$$\begin{aligned} \lambda c(f(l))>c(\lambda f(l)),\quad \text {for all }\lambda>1\text { and all }l>0, \end{aligned}$$

or, equivalently,

$$\begin{aligned} \lambda c(x)>c(\lambda x),\quad \text {for all }\lambda>1\text { and all }x>0, \end{aligned}$$

which holds if, and only if,

$$\begin{aligned} \frac{c(x)}{x}>\frac{c(\lambda x)}{\lambda x},\quad \text {for all }\lambda>1\text { and all }x>0, \end{aligned}$$

i.e., if, and only if, the average cost function is decreasing.

The purpose of this note is to understand the conditions under which production processes exhibit a decreasing average cost function in the absence of perfectly competitive input markets. The relevance of this line of inquiry rests on the observation that monopoly power tends to go hand-in-hand with monopsony power.Footnote 2

It will be shown that increasing returns to scale are necessary but not sufficient for a decreasing average cost function, and that measures of the degree of oligopsony and market power in input markets are important additional factors determining the shape of average costs. The implications of these results for regulatory policy will be discussed at the end.

The analysis is framed in terms of market power in upstream labor and capital markets. To keep the analysis as simple as possible, we first confine attention to the case of a single-input production function, treating labor and capital separately, and then show that the analysis extends to the standard two-input case.

To begin, consider a monopolist that produces a private good using labor, l, as its only input. The production function is denoted by f(l). Suppose that w(l) is an increasing function representing the firm-level inverse labor supply. This function describes how workers employed by the monopolist react to changes in wages. When \(w(\cdot )\) is flat (i.e., completely elastic), workers quit (moving to other firms, for example) when the monopolist lowers their wages. This extreme case represents the textbook case of competitive input markets, i.e., a complete absence of the monopolist’s labor market power. When \(w(\cdot )\) is very steep, the monopolist’s labor force tends to be unresponsive to even large wage declines. In this case, the monopolist holds significant labor market power.

The monopolist’s cost function, expressed in terms of hours of hired labor, l, is given by w(l)l. The elasticity of the cost function with respect to l is expressible as

$$\begin{aligned} \frac{d(w(l)l)}{dl}\cdot \frac{l}{w(l)l}=\frac{w'(l)l+w(l)}{w(l)}=1+\frac{1}{\xi _l}, \end{aligned}$$
(2)

where \(\xi _l\) denotes the wage elasticity of the labor supply facing the monopolist, i.e.,

$$\begin{aligned} \xi _l=\frac{1}{w'(l)}\cdot \frac{w(l)}{l}. \end{aligned}$$

This elasticity is called “residual labor supply elasticity” in Naidu et al. (2018). It takes values in the range \([0,\infty )\). The interval’s lower bound (resp., upper bound) represents the case of absolute labor market power (resp., the absence of labor market power).

The elasticity of the cost function with respect to l given in (2) measures the relative responsiveness of the monopolist’s cost to a one-percent increase in the quantity of hired labor.

The elasticity of scale,

$$\begin{aligned} \xi _f(l)=f'(l)\cdot \frac{l}{f(l)} \end{aligned}$$

takes values in the interval \((0,\infty )\) and measures the relative responsiveness of output to a one-percent increase in the quantity of hired labor. Note that

$$\begin{aligned} \xi _f(l)>1\text { for all }l\text { if and only if the production function} f(\cdot ) \text { exhibits}\\ \text {increasing returns to scale};\\ \xi _f(l)=1\text { for all }l\text { if and only if the production function} f(\cdot ) \text { exhibits}\\ \text {constant returns to scale};\\ \xi _f(l)<1\text { for all }l\text { if and only if the production function} f(\cdot ) \text { exhibits}\\ \text {decreasing returns to scale}. \end{aligned}$$

The ratio of the elasticity of the cost function with respect to l to the elasticity of scale,

$$\begin{aligned} \theta (l)=\frac{1+(1/\xi _l)}{\xi _f(l)}, \end{aligned}$$

measures the relative responsiveness of the total cost to the relative responsiveness of output (with respect to a one-percent increase in the quantity of hired labor). Note that

$$\begin{aligned}&\theta (l)<1\text { for all }l\text { if and only if the monopolist's average cost function is decreasing};\\&\theta (l)=1\text { for all }l\text { if and only if the monopolist's average cost function is constant};\\&\theta (l)>1\text { for all }l\text { if and only if the monopolist's average cost function is increasing}. \end{aligned}$$

Thus, when the labor market is not competitive, the shape of the average cost function is determined by the elasticity of scale and the residual labor supply elasticity. Standard textbooks consider the extreme case of a competitive labor market, i.e., the case when \(\xi _l=\infty \), which yields \(\theta (l)=1/\xi _f(l)\), and so \(\theta (l)\) becomes the inverse of the elasticity of scale. In this case, increasing returns to scale are necessary and sufficient for a decreasing average cost curve.

In general, increasing returns to scale are necessary but not sufficient for a decreasing average cost curve. This is because \(\theta (l)<1\) requires \(\xi _f(l)>1\) (i.e., increasing returns to scale). However, if the monopolist holds significant labor market power, so that the residual labor supply elasticity is relatively low and the elasticity of the cost function relatively high, \(\theta (l)\) tends to be large—in particular, greater than one. In fact, in the extreme case of absolute labor market power, i.e., the case when \(\xi _l\approx 0\) for a range of hired labor l, we have \(\theta (l)\approx \infty \), implying that the monopolist’s average cost function is increasing (for said range of l).

Let us now consider the case of capital input markets.Footnote 3 Suppose that the monopolist owns the capital used in its production process and chooses how much of its capital stock, K, is used as an input in its own production. Alternatively, the monopolist can loan capital to other agents.

The production function is denoted by f(k), where k measures ‘capital.’ Let r(k) be a decreasing function representing the inverse demand for capital facing the monopolist. The price elasticity of the firm-level demand for capital,

$$\begin{aligned} \epsilon _k=\frac{1}{r'(k)}\cdot \frac{r(k)}{k}, \end{aligned}$$

which ranges between 0 and \(-\infty \), can be taken as a measure of the monopolist’s capital market power as a supplier of capital.Footnote 4 A perfectly inelastic (resp., elastic) demand represents the case of absolute market power (resp., the absence of market power).

If the monopolist loans \(\kappa \) units of capital and uses \(K-\kappa \) units of capital in the production of its own final good, the cost of using an extra (marginal) unit of capital in the production of the final good is the opportunity cost of loaning that unit, i.e.,

$$\begin{aligned} {\textit{MR}}(\kappa )=r(\kappa )+r'(\kappa )\kappa , \end{aligned}$$

which represents the monopolist’s marginal revenue evaluated at the quantity of capital loaned, \(\kappa \). Intuitively, this opportunity cost consists of the forgone unit price of capital, \(r(\kappa )\), minus the extra revenue from the increase in the price of capital paid for the inframarginal units, \(r'(\kappa )\kappa \); the increase in the price of capital results from the marginal reduction in the supply of loanable funds.

Hence, the monopolist’s marginal economic cost of using k units of capital in the production of the final good (which gives a corresponding loan size of \(K-k\) units of capital) is given by

$$\begin{aligned} c'(k)={\textit{MR}}(K-k), \end{aligned}$$

while the monopolist’s total economic cost of using k units of capital in the production of the final good is given by

$$\begin{aligned} c(k)=\int _{K-k}^K{\textit{MR}}(\kappa )d\kappa . \end{aligned}$$
(3)

Note that the marginal cost is negative for those values of k for which \({\textit{MR}}(K-k)\) is negative. However, any loan supply of size \(K-k\), where \({\textit{MR}}(K-k)\) is negative, is not profit maximizing, since, at those levels, reducing the loan supply brings about extra revenue.

The elasticity of the cost function with respect to k, which measures the relative responsiveness of the monopolist’s cost to a one-percent increase in the quantity of capital, can be written as

$$\begin{aligned} c'(k)\cdot \frac{k}{c(k)}=\frac{{\textit{MR}}(K-k)k}{c(k)}. \end{aligned}$$
(4)

Note that if the \({\textit{MR}}(\cdot )\) function is decreasing, then this elasticity is greater than 1, since, in this case, the marginal cost exceeds the average cost: Footnote 5

$$\begin{aligned} \frac{{\textit{MR}}(K-k)k}{c(k)}>1\Leftrightarrow \frac{c'(k)}{c(k)/k}=\frac{{\textit{MR}}(K-k)}{c(k)/k} =\frac{{\textit{MR}}(K-k)}{(\int _{K-k}^K{\textit{MR}}(\kappa )d\kappa )/k}>1. \end{aligned}$$

Note also that \({\textit{MR}}(K-k)\) is expressible, in terms of the price elasticity of the firm-level demand for capital, as

$$\begin{aligned} {\textit{MR}}(K-k)=\left( 1+\frac{1}{\epsilon _{K-k}}\right) r(K-k). \end{aligned}$$
(5)

The ratio of the elasticity of the cost function with respect to k to the elasticity of scale,

$$\begin{aligned} \theta (k)=\frac{{\textit{MR}}(K-k)k}{c(k)}\bigg /\xi _f(k)=\frac{\left( 1+\frac{1}{\epsilon _{K-k}}\right) r(K-k)k}{c(k)}\bigg /\xi _f(k), \end{aligned}$$

measures the relative responsiveness of the total cost to the relative responsiveness of output (with respect to a one-percent increase in the quantity of hired labor). Note that

$$\begin{aligned}&\theta (k)<1\text { for all }k\text { if and only if the monopolist's average cost function is decreasing};\\&\theta (k)=1\text { for all }k\text { if and only if the monopolist's average cost function is constant};\\&\theta (k)>1\text { for all }k\text { if and only if the monopolist's average cost function is increasing}. \end{aligned}$$

Thus, when the labor market is not competitive, the shape of the average cost function is determined by the elasticity of scale and the firm-level price elasticity of demand.

The extreme case of a perfectly competitive capital market, i.e., the case when the price elasticity of the firm-level demand for capital is \(-\infty \), corresponds to the case of a flat inverse demand function \(r(\cdot )\) (hence a flat \({\textit{MR}}(\cdot )\) function which coincides with \(r(\cdot )\)), implying that the elasticity of the cost function with respect to k is equal to one. In this case, \(\theta (k)=1/\xi _f(k)\), implying that increasing returns to scale are necessary and sufficient for a decreasing average cost curve.

Under complete capital market power, \(\epsilon _k=0\), which gives an infinite elasticity of the cost function with respect to k, implying that the monopolist’s operating average costs are increasing.

If the elasticity of the cost function with respect to k is greater than 1 (which is true, for example, if the \({\textit{MR}}(\cdot )\) function is decreasing), then increasing returns to scale are necessary, but not sufficient, for a decreasing average cost function.

The two-input case can be handled as a simple extension of the preceding analysis.

Suppose that the production function is given by f(lk), a function of labor, l, and capital, k. We maintain the assumption that the monopolist owns the capital used in its production process and chooses how much of its capital stock, K, is used as an input in its own production. Alternatively, the monopolist can loan capital to other agents.

The monopolist’s total cost function is now

$$\begin{aligned} w(l)l+c(k), \end{aligned}$$

where \(c(\cdot )\) represents the capital cost function given in (3). The elasticity of the cost function with respect to the labor input,

$$\begin{aligned} \frac{\partial (w(l)l+c(k))}{\partial l}\cdot \frac{l}{w(l)l+c(k)}, \end{aligned}$$

measures the percentage change in the total cost resulting from a one-percent increase in the quantity of labor hired. Similarly,

$$\begin{aligned} \frac{\partial (w(l)l+c(k))}{\partial k}\cdot \frac{k}{w(l)l+c(k)}. \end{aligned}$$

measures the percentage change in the total cost resulting from a one-percent increase in the quantity of capital used in the firm’s production process.

Note that

$$\begin{aligned} \frac{\partial (w(l)l+c(k))}{\partial l}\cdot \frac{l}{w(l)l+c(k)}=\left( \frac{1}{\xi _l}+1\right) \frac{w(l)l}{w(l)l+c(k)} =\left( \frac{1}{\xi _l}+1\right) \alpha _l(l,k), \end{aligned}$$

where \(\xi _l\) is the familiar residual labor supply elasticity and \(\alpha _l(l,k)\) represents the cost share of the labor input. Similarly,

$$\begin{aligned}{} & {} \frac{\partial (w(l)l+c(k))}{\partial k}\cdot \frac{k}{w(l)l+c(k)}\\{} & {} \quad =\left( c'(k)\cdot \frac{k}{c(k)}\right) \frac{c(k)}{w(l)l+c(k)} =\left( c'(k)\cdot \frac{k}{c(k)}\right) \alpha _k(l,k)\\{} & {} \quad =\left( \frac{{\textit{MR}}(K-k)k}{c(k)}\right) \alpha _k(l,k) =\left( \frac{\left( 1+\frac{1}{\epsilon _{K-k}}\right) r(K-k)k}{c(k)}\right) \alpha _k(l,k), \end{aligned}$$

where \(\alpha _l(l,k)\) represents the cost share of capital and the last two equalities follow from (4) and (5), respectively.

Consequently, the relative responsiveness of the total cost to a one-percent increase in the quantity of all inputs is given by

$$\begin{aligned}{} & {} \frac{\partial (w(l)l+c(k))}{\partial l}\cdot \frac{l}{w(l)l+c(k)}+\frac{\partial (w(l)l+c(k))}{\partial k}\cdot \frac{k}{w(l)l+c(k)}\\{} & {} \quad =\left( \frac{1}{\xi _l}+1\right) \alpha _l(l,k)+\left( \frac{{\textit{MR}}(K-k)k}{c(k)}\right) \alpha _k(l,k)\\{} & {} \quad =\left( \frac{1}{\xi _l}+1\right) \alpha _l(l,k)+\left( \frac{\left( 1+\frac{1}{\epsilon _{K-k}}\right) r(K-k)k}{c(k)}\right) \alpha _k(l,k). \end{aligned}$$

The elasticity of scale is now given by

$$\begin{aligned} \xi _f(l,k)=\frac{\partial f(l,k)}{\partial l}\cdot \frac{l}{f(l,k)}+\frac{\partial f(l,k)}{\partial k}\cdot \frac{k}{f(l,k)}, \end{aligned}$$

and it measures the relative responsiveness of output to a one-percent increase in the quantity of all inputs.Footnote 6 Note that

$$\begin{aligned} \xi _f(l,k)>1\text { for all }(l,k)\text { if and only if the production function} f(\cdot ) \text { exhibits}\\ \text {increasing returns to scale};\\ \xi _f(l,k)=1\text { for all }(l,k)\text { if and only if the production function} f(\cdot ) \text { exhibits}\\ \text {constant returns to scale};\\ \xi _f(l,k)<1\text { for all }(l,k)\text { if and only if the production function} f(\cdot ) \text { exhibits}\\ \text {decreasing returns to scale}. \end{aligned}$$

The ratio of the elasticity of the total cost function with respect to both inputs to the elasticity of scale,

$$\begin{aligned} \theta (l,k)= & {} \frac{\left( \frac{1}{\xi _l}+1\right) \alpha _l(l,k)+\left( \frac{{\textit{MR}}(K-k)k}{c(k)}\right) \alpha _k(l,k)}{\xi _f(l,k)}\nonumber \\= & {} \frac{\left( \frac{1}{\xi _l}+1\right) \alpha _l(l,k)+\left( \frac{\left( 1+\frac{1}{\epsilon _{K-k}}\right) r(K-k)k}{c(k)}\right) \alpha _k(l,k)}{\xi _f(l,k)}, \end{aligned}$$
(6)

determines the shape of the average cost function:

$$\begin{aligned} \theta (l,k)<1\text { for all }(l,k)\text { if and only if the monopolist's average cost}\\ \text {function is decreasing};\\ \theta (l,k)=1\text { for all }(l,k)\text { if and only if the monopolist's average cost}\\ \text {function is constant};\\ \theta (l,k)>1\text { for all }(l,k)\text { if and only if the monopolist's average cost}\\ \text {function is increasing}. \end{aligned}$$

Note that, in the case of a perfectly elastic firm-level capital demand curve, the \({\textit{MR}}(\cdot )\) is flat, implying that

$$\begin{aligned} c(k)=\int _{K-k}^K{\textit{MR}}(\kappa )d\kappa ={\textit{MR}}(K-k)k,\quad \text {for all }k. \end{aligned}$$

Thus, in the extreme case of competitive input markets, we have \(\xi _l=\infty \) (a perfectly elastic firm-level labor supply curve) and \(\epsilon _{K-k}=-\infty \) (a perfectly elastic firm-level capital demand curve), and \(\theta (l,k)\) reduces to

$$\begin{aligned} \theta (l,k)=1/\xi _f(l,k), \end{aligned}$$

implying that the shape of the average cost curve is fully determined by the elasticity of scale. In this case, the average cost function is decreasing if and only if the production function exhibits increasing returns to scale.

In general, if \(\frac{{\textit{MR}}(K-k)k}{c(k)}>1\) for all k (i.e., if the marginal cost of capital exceeds the average cost of capital, which is true, for example, if the \({\textit{MR}}(\cdot )\) curve is decreasing), since \(1+\frac{1}{\xi _l}>1\) for all l, (6) implies that \(\theta (l,k)<1\) for all (lk) (i.e., the firm’s average cost function is decreasing) only if \(\xi _f(l,k)>1\) for all (lk), i.e., only if the production function exhibits increasing returns to scale. While, in this case, increasing returns to scale are necessary for a decreasing average cost function, they are not, in general, sufficient.

In the case when the firm holds significant market power in the market for inputs, so that \(\xi _l\approx 0\approx \epsilon _{K-k}\) for a range of input levels, the firm’s average cost function is increasing over said range.

We conclude this note with a brief discussion of the implications of the analysis for regulatory policy. The textbook example of a natural monopoly is an industry whose production processes exhibit declining average costs. Such industries are argued to be more efficiently served by monopolies, given their cost advantages, which also act as a barrier to entry, since an incumbent firm can always undercut the prices charged by an entrant. In the presence of decreasing average costs, standard economic theory advocates the awarding of an exclusive franchise to serve the market to a single firm. Such exclusive rights typically coexist with other regulatory policies (e.g., price regulation) aimed at restraining market power.Footnote 7

On the other hand, the modern literature on merger efficiencies, which dates back to Williamson (1968), often refers to merger efficiencies as average cost reductions resulting from economies of scale. This interpretation has been underscored by law scholars and economists alike to this day.Footnote 8

Thus, economies of scale and declining average costs play a central role in regulatory policy and are often regarded as a prominent rationale for legal promotion and consolidation of monopoly power via the awarding of exclusive franchises or by means of (vertical or horizontal) mergers. However, if legal entry restrictions and corporate amalgamation are accompanied by a significant outgrowth of market power in input markets, such regulations are likely to bring about a complete reversal of the conditions warranting their implementation, ultimately resulting in diseconomies of scale.

This prediction is consistent with evidence reported in the recent empirical literature on labor markets, which documents a negative correlation between the residual labor supply elasticity and measures of labor market concentration; and a tendency for markets with higher concentration or lower residual labor supply elasticity to have significantly lower wages.Footnote 9 In this context, our analysis puts forward a novel rationale for antitrust enforcement as a means to restraining monopsony power.