The practical use of bottom-up models for analyzing energy demand is faced with significant micro-data requirements. In order to keep such models manageable, the individual components of energy demand are usually linked to the same macroeconomic variables such as the Gross Domestic Product (GDP), per-capita income, and relative energy prices. This gives rise to the question, “Why not model energy demand directly as a function of these macro variables?”. This macro approach is presented in this chapter, exploring the role of population growth, economic growth, and in particular changes in relative prices. However, this approach raises issues of its own:

  • How does one differentiate between short-term and long-term adjustments of demand?

  • Do rising and declining prices have the same effect on energy demand?

  • How can the effects of technological change be isolated from the effects of changes in energy prices?

  • Is the relationship between energy and other production inputs, in particular capital, substitutive or complementary?

Issues not discussed in this chapter include the possible instability of estimated relationships over time and reverse causality, i.e. the fact that GDP may not be exogenous but is in turn influenced by the price of energy (as evidenced by the recessions caused by the two oil price shocks of 1973/1974 and 1979/1980).

The variables used in this chapter are:

C :

Total cost

c :

Unit cost

ΔE :

Change in demand for energy, E t+1 − E t

E t :

Energy demand in period t

GDP :

Gross Domestic Product

η E,GDP :

Income elasticity of energy demand

η E,p :

Price elasticity of energy demand

K :

Capital stock

L :

Labor

M :

Input of materials

p :

Price index

p E :

(Inflation-adjusted) energy price index

PCI :

Per-capita income

POP :

Population

Q :

Real output (quantity)

sh :

Cost share

σ :

Elasticity of substitution

5.1 Population Growth

A first approach to top-down modeling is the following tautological relationship between population POP t and the aggregate energy demand E t ,

$$ {E}_t={POP}_t\frac{E_t}{POP_t}. $$
(5.1)

Because

$$ \frac{d \log E}{dE}=1/E,\kern1em d \log E=\frac{dE}{E}\kern1em \mathrm{holds},\mathrm{and}\ \mathrm{hence} $$
(5.2)
$$ \frac{\Delta {E}_t}{E_t}\approx \frac{\Delta {POP}_t}{POP_t}+\frac{\Delta \left({E}_t/{POP}_t\right)}{\left({E}_t/{POP}_t\right)}. $$
(5.3)

Therefore, the logarithmic differentiation of Eq. (5.1) yields the percentage change in energy demand as the sum of the percentage change in population and the per-capita energy demand, at least to a first approximation. The development of the demand for energy is thus tautologically given by the sum of population growth and change of per-capita energy consumption.

Table 5.1 shows the corresponding values and their percentage changes between 2000 and 2011. In 2011, the world population of nearly 7 bn people consumed 12.7 bn toe of commercial energy. However, while the per-capita energy consumption in China grew by no less than 7.5% p.a., it declined in several developed countries. Evidently there must be other factors at work beyond population growth that explain the demand for energy and its trend. Population growth alone is a misleading indicator.

Table 5.1 Population and per-capita primary energy supply

However, population is of importance in a different context. Table 5.1 reveals substantial differences in per-capita energy consumption between countries. In the United States it is roughly twice as high as in Germany or Japan. With 0.17 toe per capita, the figure for Indonesia (as of 2011) is at the other end of the spectrum. The international disparities in energy consumption can be visualized by the so-called Lorenz curve . On the horizontal axis of Fig. 5.1, countries are ranked according to their shares of the world’s population, with e.g. China accounting for the first 20%. The vertical axis exhibits the countries’ respective shares of global energy consumption. If per-capita energy consumption were completely equal among countries, the Lorenz curve would be a diagonal running from point (0; 0) to point (1; 1). With increasing inequality, the Lorenz curve moves away from this straight line. According to the solid line, 60% of the world population accounted for only 17% of energy consumption in 2002.Footnote 1

Fig. 5.1
figure 1

Lorenz curves of the global energy consumption and income distribution. Data source: World Bank (2014)

Yet the dashed Lorenz curve of Fig. 5.1 shows that income inequality between countries is even more marked than energy inequality, with 60% of the world population disposing of about 8% of world income only. This difference is an expression of the fact that energy has to be regarded as an essential good. Poor people devote a bigger share of their income to it than the rich. This observation suggests that the demand for energy increases less than proportionally with increasing income. Therefore, the so-called income elasticity of energy demand is smaller than one in the long run (see Sect. 5.2).

The World Energy Council (1993) stipulated 1.5 toe per capita as the benchmark to ensure economic and social development. This implies that global energy consumption would have to be about 40% higher than at present. The extent to which this postulate will be realized depends decisively on investment in energy technology, which in turn is driven by returns expected by investors and ability to pay of consumers.

5.2 Economic Growth

An increasing Gross Domestic Product (GDP) and the associated creation of value usually require an increased use of energy while at the same time leading to an improved capacity to pay for it. Similar to the tautology of Sect. 5.1, GDP can be inserted in the following way, with PCI t denoting per-capita income,

$$ {E}_t={POP}_t\cdot \frac{GDP_t}{POP_t}\cdot \frac{E_t}{GDP_t}={POP}_t\cdot {PCI}_t\cdot \frac{E_t}{GDP_t}. $$
(5.4)

After taking logarithms, the relative change in energy demand can be expressed as the sum of relative population growth, per-capita income growth, and the change in so-called energy intensity E t /GDP t ,

$$ \frac{\Delta {E}_t}{E_t}\approx \frac{\Delta {POP}_t}{POP_t}+\frac{\Delta {PCI}_t}{PCI_t}+\frac{\Delta \left({E}_t/{GDP}_t\right)}{\left({E}_t/{GDP}_t\right)}. $$
(5.5)

According to Table 5.2, energy intensity has decreased substantially between 2000 and 2011 in all countries sampled. Hence, consumption of energy has been increasing less than per-capita income, implying that the income elasticity of energy demand η E,GDP is smaller than one. It is defined as follows (see Eq. 5.2),

$$ {\eta}_{E,GDP}=\frac{\mathrm{percent}\ \mathrm{change}\ \mathrm{of}\ E}{\mathrm{percent}\ \mathrm{change}\ \mathrm{of}\ GDP}=\frac{\partial \ln E}{\partial \ln GDP}=\frac{\partial E}{\partial GDP}\frac{GDP}{E}. $$
(5.6)
Table 5.2 Development of population, per-capita income, and energy intensity

Here, the partial derivative indicates the ceteris paribus condition: the other determinants of energy demand (among them, the relative price of energy in particular) are held constant (see Sect. 5.3).

In normal circumstances, the income elasticity of energy demand is positive. The following distinctions can be made.

  • 0 < η E,GDP < 1: In this case energy intensity E/GDP declines with growing income.

  • η E,GDP > 1: In this case the opposite holds. This is typical of developing countries, many of which are characterized by a backlog of demand at the going price.Footnote 2 This backlog is usually created by an artificially low price of energy imposed by the government.

  • In the case of η E,GDP = 1, energy intensity is independent of income.

When GDP data of different years are to be compared, adjustment for inflation is necessary, as explained in Sect. 3.3. For international comparisons of energy intensity, one also has to convert GDP values expressed in national currency into a common currency (e.g. USD or EUR). In view of short-term exchange rate fluctuations, the average of a year (1995 in Table 5.2) or an average over several years (as is World Bank practice) may be the appropriate choice for depicting development over time.

Currency conversion can also be based on purchasing power parity (PPP), estimates of which are published by the OECD and the World Bank. This is a virtual exchange rate between two currencies, based on the notion that tradable goods have the same price everywhere. For example, if a hamburger costs 4 USD in the United States but 3 EUR in France (say), then 1 USD is apparently worth 0.75 EUR. When the hamburger is replaced by a basket of goods and services, one obtains the PPP. For most developing countries the application of PPP results in higher GDP values and hence lower estimates of energy intensity.Footnote 3 While exchange rates between industrialized countries tend to be closer to PPP values, there are deviations even in the EUR-USD exchange rate, as shown in Fig. 5.2.

Fig. 5.2
figure 2

Sample exchange rate and purchasing power parity. Data source: OECD

5.3 The Price of Energy

The price of energy and its development over time (relative to the prices of other goods and services) are crucial determinants of the demand for energy. Considering the swath of energy prices, it is not easy to calculate a representative energy price index. Even for a given energy source, more than one price often exists. An aggregated energy price index is constructed by weighting energy sources by their market shares. Since economic theory predicts that demand mainly depends on relative prices besides income or wealth, this index must be related to a macroeconomic price index (consumer price index, producer price index, or price index of the GDP). This ratio is often called the real price of energy. An increase over time signifies that energy prices grow faster than the average price of goods and services in general.

5.3.1 Short-Term and Long-Term Price Elasticities

Similar to the income elasticity, the energy own-price elasticity (often simply called price elasticity for short) is defined by using a change in the real price of energy p E as the impulse,

$$ {\eta}_{E,{p}_E}=\frac{\mathrm{percent}\ \mathrm{change}\ \mathrm{of}\ E}{\mathrm{percent}\ \mathrm{change}\ \mathrm{of}\ {p}_E}=\frac{\partial \ln E}{\partial \ln {p}_E}=\frac{\partial E}{\partial {p}_E}\frac{p_E}{E}. $$
(5.7)

Here as well, several cases need to be distinguished.

  • −1 < η E,pE < 0: The demand for energy is inelastic (to price). If the price of energy goes up (relative to the rate of inflation), the quantity of energy demanded declines less than proportionally (e.g. price goes up 10% but quantity sold only 4%, leaving a bottom line of 6% to sellers). This implies that inelastic demand results in a strong market position for suppliers (often called a seller’s market).

  • η E,pE < −1: Demand for energy is elastic. If the (relative) price of energy increases, the quantity demanded declines more than proportionally (for instance a 10% price hike triggers a 12% fall in quantity sold, resulting in a bottom line of −2%). Elastic demand causes the market position of consumers to be strong (often called a buyer’s market).

When energy markets become tight, the relative price of energy rises and the quantity traded decreases. This situation is shown in Fig. 5.3, where the supply curve shifts to the left. The original market equilibrium A is replaced by the one at point B, where the new supply function intersects with the short-term energy demand function. At first, a marked price hike combines with a limited decrease in quantity because immediate adjustment would be very costly for consumers. They often need to undertake an investment (e.g. by buying a car with higher fuel efficiency), a decision that is made only if the price change is viewed as permanent. Once undertaken, consumer adjustment gives rise to the long-term demand function (dotted line in Fig. 5.3), which is flatter (more price elastic) than its short-term counterpart and thus lies closer to the origin in the neighborhood of point B. This indicates reduced energy consumption at a given price, at least in the neighborhood of the initial equilibrium. Compared to B, the energy price drops slightly while the quantity traded continues to decrease until the new long-term equilibrium C is reached.

Fig. 5.3
figure 3

Short-term and long-term effects of a reduction in energy supply

In normal circumstances, energy demand is a composite of demand for different energy sources E i , i = 1,…, N with their relative prices p i . This consideration gives rise to an extended definition of price elasticity,

$$ {\eta}_{i,j}=\frac{\mathrm{percent}\ \mathrm{change}\ \mathrm{of}\ {E}_i}{\mathrm{percent}\ \mathrm{change}\ \mathrm{of}\ {p}_j}=\frac{\partial \ln {E}_i}{\partial \ln {p}_j}=\frac{\partial {E}_i}{\partial {p}_j}\frac{p_j}{E_i}. $$
(5.8)

For i = j one obtains the own-price elasticity, for i ≠ j a so-called cross-price elasticity . While normally the own-price elasticity is negative, the cross-price elasticity can be of either sign. If the energy sources considered are substitutes, it is positive because a price increase dp j  > 0 leads to a response dE j  < 0, which triggers more demand for E i , thus dE j  > 0. Yet E i and E j can also be complementary, in which case the cross-price elasticity is negative. A price increase dp j  > 0 again leads to a response dE j  < 0, which now causes a reduction dE i  < 0 in the complementary input E i .

5.3.2 A Partial Energy Demand Model

A popular specification of a partial demand model readsFootnote 4

$$ E(t)=\alpha \kern0.5em \cdot GDP{(t)}^{\beta}\cdot {p}_E{(t)}^{\gamma }. $$
(5.9)

Here, E(t) symbolizes aggregate energy demand (in physical units), GDP(t), the real (inflation-adjusted) Gross Domestic Product , and p E (t) the relative price of energy.

This formulation is partial rather than of the general-equilibrium type for three reasons:

  • Consider a drop in the quantity of energy transacted. As the oil price shocks of 1973/1974 and 1979/1980 clearly demonstrated, this does not leave GDP unaffected. Therefore a reverse causality exists, running from E(t) to GDP(t).

  • The observed consumption of energy is the outcome of an interaction between supply and demand. For example, if an increase in GDP causes the demand function to shift outward, the relative price of energy is predicted to go up. This time, causality runs from GDP(t) to p E (t); superficially, it even seems to run from E(t) to p E (t), indicating reverse causation.

  • For all its importance, energy is just one factor of production both for households and firms. Therefore, a change in the relative price of energy has repercussions on the mix of factors of production. As a consequence, energy price developments may change inputs of other factors of production, which has an impact not only on the composition but also the size of GDP.

In keeping with the partial approach, economic theory indeed states that aggregate energy demand is determined by income and the relative price of energy. A mostly analogous formulation to Eq. (5.9) is

$$ \frac{E(t)}{POP(t)}=\alpha \cdot {\left(\frac{GDP(t)}{POP(t)}\right)}^{\beta}\cdot {p}_E{(t)}^{\gamma } $$
(5.10)

with POP denoting the resident population. While the parameter α is a constant determining the general level of demand, β represents the income elasticity, and γ the price elasticity, respectively. This can be shown either by taking logarithms or by partial differentiation. In the latter case, one obtains

$$ \frac{\partial E(t)}{\partial GDP(t)}=\alpha \cdot \beta \cdot GDP{(t)}^{\beta -1}\cdot {p}_E{(t)}^{\gamma }=\beta \cdot \frac{E(t)}{GDP(t)}. $$
(5.11)

Multiplication of both sides by GDP(t)/E(t) yields β as the income elasticity η E,GDP . Turning to the relative price of energy, one has

$$ \frac{\partial E(t)}{\partial {p}_E(t)}=\alpha \cdot GDP{(t)}^{\beta}\cdot \gamma \cdot {p}_E{(t)}^{\gamma -1}=\gamma \cdot \frac{E(t)}{p_E(t)}. $$
(5.12)

Multiplication by p E (t)/E(t) shows that γ = η E,Pe .

As it stands, the partial model does not permit to distinguish between short-term and long-term elasticities. This distinction is important because current energy consumption is the result of a reaction not only to current income and price, but also to past incomes and prices through the inherited stock of energy-using capital. This also implies that current energy consumption is to some extent determined by past energy consumption. The stronger this link, the longer it takes for a change in income or price to exert its full impact.

Two variants of this modified demand model are discussed in the literature. The stock adjustment hypothesis posits that consumers orient themselves to a desired (planned) energy consumption E p (t), which is a function of the desired stock of energy-using capital. In addition, the hypothesis assumes that this stock and hence planned energy consumption is determined by current income and the current relative price of energy, resulting in

$$ {E}_p(t)=\alpha \cdot GDP{(t)}^{\beta}\cdot {p}_E{(t)}^{\gamma }. $$
(5.13)

During any given period t, however, there is a discrepancy between desired and actual (inherited) stock because adjustment is partial in view of its cost. If adjustment is completed up to a portion (1−ρ) of the gap while ρ still is to be undertaken, one has

$$ E(t)={E}_p{(t)}^{1-\rho}\cdot E{\left(t-1\right)}^{\rho}\kern1em \mathrm{with}\kern0.5em 0<\rho <1. $$
(5.14)

The parameter ρ reflects the speed of adjustment. In the case of ρ = 0, adaptation to new market conditions happens without any delay, while in the case ρ = 1 no adjustment occurs at all. Note that ρ is to some degree an economic decision variable reflecting the benefits and costs of fast versus slow adjustment. This adjustment of the stock of energy-using capital is not explicitly modeled, in contrast with Eq. (4.1).

By substituting Eq. (5.14) into (5.13), one obtains according to the stock adjustment hypothesis,

$$ E(t)={\alpha}^{1-\rho}\cdot GDP{(t)}^{\beta \cdot \left(1-\rho \right)}\cdot {p}_E{(t)}^{\gamma \cdot \left(1-\rho \right)}\cdot E{\left(t-1\right)}^{\rho }. $$
(5.15)

The second approach is called habit persistence hypothesis . It states that the energy consumption E(t) of period t is a function of expected future income GDP e(t) and expected relative energy price p E e(t) rather than their current values,

$$ E(t)=\alpha \kern0.5em \cdot {GDP}^e{(t)}^{\beta}\cdot {p}_E^e{(t)}^{\gamma }. $$
(5.16)

Of course, an auxiliary hypothesis concerning the formation of expectations is needed. A popular alternative has been adaptive expectations , meaning that expectations are formed as an extrapolation from previous and current observation. If again a geometric mean is postulated, the pertinent functions read

$$ \begin{array}{l}{GDP}^e(t)={GDP}^e{\left(t-1\right)}^{\rho}\kern0.5em \cdot GDP{(t)}^{1-\rho },\\ {}{p}_E^e(t)={p}_E^e{\left(t-1\right)}^{\rho}\cdot {p}_E{(t)}^{1-\rho }.\end{array} $$
(5.17)

Here, 0 < ρ < 1 denotes the parameter of adjustment as before, which now refers to expectations rather than the stock of energy-using capital. By substituting these expressions into Eq. (5.15) and taking into account

$$ E{\left(t-1\right)}^{\rho }={\alpha}^{\rho}\cdot {GDP}^e{\left(t-1\right)}^{\beta \cdot \rho}\cdot {p}_E^e{\left(t-1\right)}^{\gamma \cdot \rho } $$
(5.18)

by Eq. (5.16), Eq. (5.15) is obtained, with expected signs α, β > 0, γ < 0, and 0 < ρ < 1.

In both approaches, energy demand E(t) of period t (the dependent variable) is thus a function of income GDP(t), relative energy price p E (t), and energy demand of the previous period E(t–1), the lagged dependent variable. In order to estimate the parameters, appropriate data must be collected and econometric methods applied. In the case of model (5.18) a testable linear specification results when taking logs and adding an error term ε(t),

$$ \begin{array}{l} \ln E(t)=\left(1-\rho \right) \ln \alpha +\beta \left(1-\rho \right) \ln GDP(t)+\\ {}\kern3.5em \gamma \left(1-\rho \right) \ln {p}_E(t)+\rho E\left(t-1\right)+\varepsilon (t).\end{array} $$
(5.19)

The short-term income elasticity is β(1–ρ) > 0, the short-term price elasticity, γ (1–ρ) < 0 . The long-term elasticities follow from considering the situation in which all impulses of a one-time income or price change have exerted their full effect, resulting in perfect adjustment (the unobserved energy-using stock of capital is constant). This means that energy demand is stationary,

$$ E(t)=E\left(t-1\right). $$
(5.20)

In a stationary situation the time index may be omitted, resulting in

$$ E={\alpha}^{1-\rho}\cdot {GDP}^{\beta \cdot \left(1-\rho \right)}\cdot {p_E}^{\gamma \cdot \left(1-\rho \right)}\cdot {E}^{\rho }=\alpha \cdot {GDP}^{\beta}\cdot {p}_E^{\gamma }. $$
(5.21)

Therefore, β represents the long-term income elasticity and γ the long-term price elasticity. They can be obtained by dividing the estimated coefficients of Eq. (5.19) by the estimated ρ pertaining to the lagged dependent variable. The mean adjustment time (number of periods) following a one-time income or price change equals 1/(1–ρ). This follows from the fact that a discrepancy between desired and inherited energy consumption is reduced at the tune of 1–ρ per period. On average the discrepancy is thus eliminated in 1/(1–ρ) periods.

Yet this model is based on assumptions that prove to be restrictive:

  • Rising and falling relative energy prices have a symmetric impact on energy demand, an assumption that is hardly plausible. So-called hysteresis is more likely, meaning that the consumption-reducing effect of a price hike continues even after price decreases again. After all, once equipment with higher energy efficiency is installed, it is not scrapped just because energy has become cheaper again. In order to model hysteresis, the price variable needs to be split in two,

$$ \begin{array}{l}{p}_E^{+}(t)={p}_E^{+}\left(t-1\right)+ \max \left(0,{p}_E(t)-{p}_E\left(t-1\right)\right)\kern1em \left(\mathrm{the}`{\mathrm{up}}^{'}\ \mathrm{component}\right),\\ {}{p}_E^{-}(t)={p}_E^{-}\left(t-1\right)+ \min \left(0,{p}_E(t)-{p}_E\left(t-1\right)\right)\kern1em \left(\mathrm{the}`{\mathrm{down}}^{'}\ \mathrm{component}\right).\end{array} $$
(5.22)
  • Taking the price p E (0) of the base period 0 and using Eq. (5.22) again and again, the first variable p + contains the sum of all price increases beyond p E (0), while the second variable p contains the sum of all price decreases. Evidently

$$ {p}_E(t)={p}_E^{+}(t)+{p}_E^{-}(t) $$
(5.23)
  • holds. The modified Eq. (5.18) then reads,

$$ E(t)={\alpha}^{1-\rho}\cdot GDP{(t)}^{\beta \cdot \left(1-\rho \right)}\cdot {p}_E^{+}{(t)}^{\gamma \cdot \left(1-\rho \right)}\cdot {p}_E^{-}{(t)}^{\delta \cdot \left(1-\rho \right)}\cdot E{\left(t-1\right)}^{\rho } $$
(5.24)
  • where γ symbolizes the long-term price elasticity in case of price increases and δ for the long-term price elasticity in case of price decreases. Unfortunately, the explanatory variables often turn out to be highly correlated (giving rise to the so-called multicollinearity problem), rendering precise estimation of the parameters difficult.

  • In the demand model presented, the mean adjustment time 1/(1–ρ) is independent of whether adjustment is triggered by change in income or energy prices. This assumption can be relaxed as well. For simplicity, consider the extreme case where the demand for energy reacts immediately to a change in income, while it reacts with a lag to a change in relative price, as before. In this case, Eq. (5.16) is modified as follows,

$$ E(t)=\alpha \kern0.5em \cdot GDP{(t)}^{\beta}\cdot {p}_E^e{(t)}^{\gamma } $$
(5.25)
  • with p E e(t) denoting the expected relative price of energy. In view of Eq. (5.17), this results in the specification

$$ \frac{E(t)}{E{\left(t-1\right)}^{\rho }}={\alpha}^{1-\rho}\frac{GDP{(t)}^{\beta }}{GDP{\left(t-1\right)}^{\beta \cdot \rho }}{p}_E{(t)}^{\gamma \cdot \left(1-\rho \right)}. $$
(5.26)
  • The parameter β > 0 represents the (short-term and long-term) elasticity, γ (1–ρ) < 0 the short-term and γ < 0, the long-term price elasticity. The mean adjustment time of energy demand to changes in price is once again 1/(1–ρ).

  • A further assumption is that both short-term and long-term elasticities are constant. In particular, they do not depend on current values of income and price of energy. However, dropping this assumption calls for a much more complex modeling for demand (e.g. using the so-called translog specification, see Sect. 5.3.3).

Table 5.3 shows the results of a regression estimate of model (5.19) using annual data for the European Union and the United States covering the period from 1980 to 2013. The dependent variable is crude oil demand per capita. The explanatory variables are inflation-adjusted per-capita income and inflation-adjusted price of Brent crude. Demand for crude oil is fairly well explained: The coefficient of determination R2 is between 0.88 and 0.91, indicating a high statistical fit . Though estimated income elasticities have the expected sign, they are statistically insignificant at the 1% level. However, the price elasticities and the lagged dependent variable are statistically significant at the 1% level.

Table 5.3 Income and price elasticities of crude oil demand

Taking the results of Table 5.3 at face value, one is led to the following interpretation. In both the European Union and in the United States, the inflation-adjusted price of crude oil has a significant impact on demand. However, the long-term price elasticities are 0.220 or below in absolute value, which means that the demand for oil is inelastic. This view is confirmed by most econometric studies. Interestingly enough, consumers in the United States react with a shorter lag to oil price changes than Europeans.

5.3.3 Substitution Between Energy and Capital

According to the process analysis discussed in Sect. 4.3, the relationship between energy demand and aggregate capital stock is substitutional rather than limitational (fixed proportions), meaning there is a choice between more and less energy-intensive modes of production and consumption. Substitution of energy is therefore possible through investment in capital goods. More generally, a given output quantity Q can be produced using more or less energy input E and commensurately modified quantities of other production factors. This is formally expressed by a production function,

$$ Q=f\left(K,L,E,M\right) $$
(5.27)

which relates output Q to inputs of capital K, labor L, energy E, and materials M (non-energy raw materials). To be precise, Q denotes the maximum output achievable given the state of technology and input quantities, reflecting best practice.

Figure 5.4 depicts the production function by means of a so-called isoquant. An isoquant shows the quantities of production factors K and E (with inputs of labor L and materials M held constant in the present case) that are needed to produce a given quantity Q. The isoquant thus summarizes the efficient production frontier for a given quantity of output, depicting uses of an available technology ranging from energy-intensive to capital-intensive. Specifically, production process II is an energy-intensive variant that in turn uses little capital, whereas process III is capital-intensive but saves on energy. Production processes I and IV can be disregarded because of their excessive use of costly inputs; indeed, only technologies II and III are efficient.

Fig. 5.4
figure 4

Efficient and inefficient technical processes

In order to decide which production alternative is minimum cost, the prices of production factors have to be considered. For K this is capital user cost p K , an annuity which reflects interest and depreciation (net of tax exemptions and subsidies); for L this is the wage rate (including non-wage labor costs) p L ; for E this is the average energy purchase price p E ; for M this is average unit cost p M . When L and M as well as their unit prices are held constant, all factor combinations that are compatible with constant total production cost lie on a straight line (the so-called isocost line ; see Fig. 5.4). Per definition all points on the isocost line are characterized by constant total production cost,

$$ C={p}_KK+{p}_EE+{p}_LL+{p}_MM. $$
(5.28)

To calculate the slope of this line and neglecting L and M (thus dL = dM = 0), this equation can be differentiated to become

$$ \begin{array}{l}dC=0={p}_KdK+{p}_EdE\kern1em \mathrm{and}\ \mathrm{therefore}\\ {}\frac{dE}{dK}=-\frac{p_K}{p_E}.\end{array} $$
(5.29)

Thus the slope of the isocost line is −p K /p E . Consider a reduction in energy use, dE < 0. If the unit price of energy p E is relatively high, the cost saving is substantial, permitting to use a lot more capital K while holding cost constant. Therefore, dE/dK takes on a low (absolute) value in this case. Conversely, if energy is cheap compared to the user cost of capital, dE < 0 generates a small cost saving which creates little room for an increased use of capital since this is relatively expensive. Accordingly, dE/dK takes on a high (absolute) value in this case.

Competitive pressure makes producers minimize cost. They therefore seek to attain the isocost line representing the lowest possible production cost. This is the one running closest to the origin in (K, E)-space, given the amount of output Q and hence the isoquant. Therefore, the isocost line needs to be tangent to the isoquant for cost minimization. This corresponds to the choice of technology II with its rather high energy intensity.

Note that the isocost line of Fig. 5.4 has a fairly steep slope, reflecting a situation where energy is cheap compared to the user cost of capital (as reflected by the annuity; see Sect. 3.1). If energy were to become more expensive relative to capital, the isocost line would exhibit a reduced slope, thus favoring a more capital-intensive mode of production. Therefore, a change in relative prices is predicted to affect the choice of production process within the technology available. This constitutes producers’ short-run response, while the choice of technology (to be discussed in Sect. 5.4) amounts to their long-run adjustment.

For a given technology, the curvature of the isoquant representing it evidently is of great importance. The more pronounced the curvature, the smaller is the adjustment in the factor mix in response to a given change in relative factor prices. In the extreme case of a limitational technology, isoquants have an angular shape, which means that there cannot be any adjustment to a change in relative prices. Producers are stuck at the corner as it were (see Fig. 5.5). With reference to a pair of inputs, one defines the elasticity of substitution σ KE as the parameter reflecting the degree of substitutability in production. It answers the question, “By how much (in percent) does the mix of inputs change when their relative price changes by 1%?”. In terms of Fig. 5.4, one has

$$ \begin{array}{l}{\sigma}_{KE}=\frac{d\left(K/E\right)}{dR}\kern0.5em \frac{R}{K/E}\kern1em \mathrm{with}\ \mathrm{the}\ \mathrm{slope}\ \mathrm{of}\ \mathrm{the}\ \mathrm{tangent}\ \mathrm{given}\ by\\ {}R=-\frac{dK}{dE}=-\frac{\partial Q}{\partial E}/\frac{\partial Q}{\partial K}.\end{array} $$
(5.30)
Fig. 5.5
figure 5

Isoquants with different elasticities of substitution

Obviously, the slope of the (tangent to an) isoquant reflects relative marginal productivities. It is known as the marginal rate of (factor) substitution . Given choice of a minimum-cost production process, the marginal rate of substitution R is just equal to the (negative) relative price of the factors (the slope of the isocost line),

$$ R=\frac{p_K}{p_E}. $$
(5.31)

Therefore, the elasticity of substitution can also be defined in terms of a change in relative factor prices,

$$ {\sigma}_{KE}=\frac{d\left(E/K\right)}{d\left({p}_K/{p}_E\right)}\frac{p_K/{p}_E}{E/K}=\frac{d \ln \left(E/K\right)}{d \ln \left({p}_K/{p}_E\right)}={\sigma}_{EK}. $$
(5.32)

The symmetry follows from dln(K/E) = −dln(E/K) and dln(p K /p E ) = −dln(p E /p K ). If capital and energy are substitutive factors of production, the elasticity of substitution must lie in the interval 0 < σ KE  < ∞. A high value of σ KE indicates that substitution between these factors is easy. With an elasticity of substitution σ KE  = 3 e.g., a 10% increase in the relative price of energy results in a 30% reduction of the E/K ratio. If σ KE  = 1.2, the E/K ratio falls by 12% only. In the extreme case of σ KE  = 0, there is no substitution possibility between K and E but a fixed input relationship between them (this amounts to a fixed-proportions (limitational) production function, also called Leontief production function; see Sect. 2.5).

The situation becomes more complex when more than two production factors are considered. The partial elasticity of substitution defined above has to be replaced by the so-called Allen elasticity (see Allen 1938, Sect. 19.4). Any two factors of production may now be complementary rather than substitutive. For instance, labor has historically been substituted by capital and energy, making capital and energy complements in production. Since the elasticity of substitution is defined to be positive in the case of substitutability, the Allen elasticity is negative in the case of complementarities.

This raises the question of whether energy and capital are complements or substitutes in the context of the four-factor production function Q = f(K, L, E, M). This is an empirical question which can only be answered by applying econometric methods. In doing so, one usually prefers not to focus on the isoquants but rather on (minimum) cost, which is a scalar measure. As shown by Fig. 5.4, the isocost line contains the same information as the isoquant in the neighborhood of the minimum cost combination of inputs. Indeed the problem, “Minimize production cost for a given output level” leads to the same solution as the so-called dual formulation, “Maximize output for a given cost budget”. Thus, the dual to maximizing output given a cost constraint reads

$$ C=C\left(Q,{p}_K,{p}_L,{p}_E,{p}_M\right)=Q\cdot c\left({p}_K,{p}_L,{p}_E,{p}_M\right). $$
(5.33)

It states that minimum total cost C depends on the amount of output Q to be achieved and the (relative) prices of inputs. Since unit cost c is given by C/Q, one can also analyze unit cost c. Strictly speaking, this is possible only if scaling up by Q does not matter, i.e. if the cost function C(Q, p K , p L , p E , p M ) is homogenous of the first degree in Q. This is the case when a change of all production factors (e.g. doubling all of them) leads to an analogous change (doubling) of output, amounting to constant returns to scale.

This leaves the choice of functional form. Preferably, the functional form should not impose a priori restrictions on crucial parameters such as the elasticity of substitution. A popular solution is the so-called translog function (see Christensen et al. 1973; Berndt and Wood 1975). It results from a second-degree Taylor approximation to an arbitrary function, with the arguments and the dependent variable in logarithms. In the case of the average cost function, the translog form becomes

$$ \begin{array}{l} \ln c={\alpha}_0+\sum_{i=1}^N\kern0.5em {\alpha}_i \ln {p}_i+\frac{1}{2}\sum_{i=1}^N\kern0.5em \sum_{j=1}^N\kern0.5em {\beta}_{ij} \ln {p}_i \ln {p}_j,\\ {}\sum_{i=1}^N\kern0.5em {\alpha}_i=1,\kern0.5em \sum_{i,j=1}^N\kern0.5em {\beta}_{ij}=\sum_{i,j=1}^N\kern0.5em {\beta}_{ji},\kern0.5em {\beta}_{ij}={\beta}_{ji}.\end{array} $$
(5.34)

Here α 0 > 0 is a constant and N is the number of production factors (N = 4 in the present context). The α i are the first-order derivatives of the unit cost function with reference to the inputs. They sum up to one because of the assumed homogeneity of the first degree. The β ij are the second-order derivatives. By Young’s theorem, the order of differentiation does not matter for continuously differentiable functions, implying β ij = β ji .

Differentiating Eq. (5.34), one obtains

$$ \frac{\partial \ln c}{\partial \ln {p}_i}=\frac{\partial c}{\partial {p}_i}\cdot \frac{p_i}{c}={\alpha}_i+\frac{1}{2}\sum_{j=1}^N{\beta}_{ij}\cdot \ln {p}_j $$
(5.35)

Shephard’s lemma states that the derivative of the minimum cost function with respect to factor price p i yields the optimal input quantity x i (see e.g. Varian 1992, Chap. 5). Therefore one obtains

$$ \frac{x_i\cdot {p}_i}{c}=:{sh}_i={\alpha}_i+\sum_{j=1}^N{\beta}_{ij} \ln {p}_j $$
(5.36)

with sh i denoting the cost share of the i-th factor of production (see Diewert 1974). Thus, the shares of K, L, E, and M can be linearly related to the logarithm of their prices p K , p L , p E , and p M , making estimation of the β ij by ordinary least-squares (OLS) possible.

Also, the Allen partial elasticities of substitution between capital and energy can be recovered from the cost shares and price elasticities as follows (see Allen 1938, Sects. 19.5 and 19.6),

$$ {\sigma}_{KE}=\frac{\eta_{K,{p}_E}+{sh}_E\cdot {\eta}_{E,{p}_E}}{sh_E}. $$
(5.37)

This is intuitive: Energy and capital are substitutes if their cross-price elasticity η K,pE is positive, resulting in a positive value of σ KE (the own-price elasticity η K,pE is always negative). Conversely, they are complements if their cross-price elasticity is strongly positive and the own-price elasticity of energy as well as its cost share sh E are small in absolute value, resulting in a negative value of σ KE .

Econometric estimation of substitution elasticities between energy and capital was motivated by the first oil price shock of 1973. Policy-makers wanted to know whether it was easy or difficult to substitute energy by other production factors, in particular capital. The first evidence exhibited in Table 5.4 was disappointing: Berndt and Wood (1975) found a complementary relation between energy and capital. Yet another estimate by Griffin and Gregory (1976) points to substitutability (σ KE  = 1.07). This triggered a lively discussion among economists (see e.g. Solow 1987). Later studies using more recent data and including technological change also show ambiguous results. However, the estimate presented in the last column of Table 5.4 confirms substitutability between energy and capital once it is assumed that producers have a choice of technology. This leads to the conclusion that companies have not exhausted the substitutional potential suggested by bottom-up process analysis to the same extent as the potential for automation, which amounts to replacing labor by both capital and energy.

Table 5.4 Elasticities of substitution between capital, labor, and energy

In conclusion, the relationship between energy and capital cannot be determined with sufficient precision even to this day. Likely reasons are the limited validity of aggregate data, difficulty in distinguishing between the short term and long term (K and E may be complementary in the short run but substitutes in the long run), and the challenges posed by isolating the effects of technological change.

5.4 Technological Change

In economics, technological change is defined in the following way. Technological change enables a larger output Q to be produced with the same input quantities of capital K, labor L, energy E, and materials M. An equivalent way of expressing the same idea is to say that a given output quantity Q can be produced using smaller quantities of production factors. An improvement in quality is a possible outcome, too.

In Fig. 5.6, technological change is depicted by a shift of the isoquant towards (and not away from) the origin of (K, E)-space. In the figure on the left, technological change does not affect the input mix as long as relative factor prices do not change, thus indicating neutral technological change with respect to energy and capital. In the figure on the right, technological change is energy-saving because the transition exhibits a lower E/K ratio at a given factor price ratio. Clearly, changing relative factor prices can also influence the choice of production technology, in addition to technological change.

Fig. 5.6
figure 6

Isoquants reflecting technological change

A mathematical formulation of factor-augmenting technological change is the following (see Stoneman 1983),

$$ Q=f\left(K,L,E,M\right)=g\left[{a}_K(t)K,\kern0.62em {a}_L(t)L,\kern0.72em {a}_E(t)E,{a}_M(t)M\right] $$
(5.38)

where t denotes time and a K (t), a L (t), a E (t), and a M (t) are functions indicating factor-augmenting changes resulting in savings of capital, labor, energy, and materials. These functions may depend on investment in research and development, education and training of the labor force, improved management, institutional reforms, and much more. But if technological change is to be advantageous, these functions must obey

$$ \frac{da_K}{dt}\ge 0;\kern0.5em \frac{da_L}{dt}\ge 0;\kern0.5em \frac{da_L}{dt}\ge 0,\kern0.5em \frac{da_M}{dt}\ge 0. $$
(5.39)

Using these functions, the direction of technological change can be defined. For example, da L (t)/dt > 0 while da K (t)/dt = da E (t)/dt = da M (t)/dt = 0 indicates labor-saving technological change because only labor input is scaled up as it were (which implies less of it is actually used at unchanged relative factor prices). But if da L (t)/dt = da K (t)/dt = da E (t)/dt = da M (t)/dt, then all factors of production benefit from technological change to the same degree, a case which is often referred to as Hicks-neutral technological change . In the past, however, technological change has not been neutral but first and foremost labor-saving. Using the expression for sh i derived from the translog unit cost function (5.36) and complementing it with γ i ·t to reflect technological change, one obtains (see Binswanger 1974; Hunt 1986)Footnote 5

$$ {sh}_i(t)={\alpha}_i+\sum_j{\beta}_{ij} \ln {p}_j(t)+{\gamma}_it. $$
(5.40)

As before, sh i (t) stands for the cost share of production factor i and p j (t) for the respective price. As the shares of the factor costs have to add up to 1, one has

$$ \sum_j{\gamma}_j=0. $$
(5.41)

Hicks-neutral technological change is equivalent to γ K  = γ L = γ E  = γ M  = 0. However, using British industry data covering the years from 1960 to 1980, Hunt (1986) finds evidence suggesting γ L  < γ E  < γ K . Therefore, in Great Britain at least, technological innovation has been above all labor-saving. It has also been energy-saving, though hardly capital-saving.

This finding gives rise to the question of which factors bring about such a bias to technological change. According to the induced bias hypothesis (see Stoneman 1983, Chap. 4), the direction of the technological change is conditioned by the market, meaning it depends on the development of relative prices. This was first formulated by Hicks (1932, 124f),

A change in the relative prices of the factors of production is itself a spur to invention, and to innovation of a particular kind—directed to economizing the use of a factor which has become relatively expensive.

If the price of the production factor labor increases compared to the cost of other production factors, then labor-saving technological change will come about in due time. In addition to the movement along the isoquant as in Fig. 5.6, a change in the isoquant itself also takes place, which leads to a further substitution of labor even when relative factor prices no longer change.

It can be argued that the oil crises of the 1970s with their twin price shocks have guided innovation efforts towards improved energy efficiency. Interestingly, these efforts continued into the 1990s when relative oil prices were lower again, possibly because of the (expected) scarcity of energy resources and governments aiming at reducing greenhouse gas emissions and supporting energy-efficient investments. Apparently, the price hikes triggered an enduring technological change which has decoupled the demand for energy from economic growth. Quite possibly, this decoupling will be enhanced by the renewed increase in the relative price of oil between 1999 and 2014. Historic case studies show that problem awareness in the energy industry has influenced the direction technological change (see Weizsäcker 1988) as entrepreneurs hope to make a profit by developing energy-saving and environment-friendly technologies and products. These hoped-for innovation gains thus may play an important role in the future demand for energy.