Abstract
We directly observed a Cobb–Douglas symmetric plane using the index of surface openness, which is used in geography, and successfully identified it. Based on this observation, we measured the capital shares (capital elasticity) and labor shares (labor elasticity) and compared them with the results of multiple regression analysis used in economics. We confirmed consistent agreement in seven countries: Japan, Germany, France, Spain, Italy, the UK, and the Netherlands. Thus, we show that the Cobb–Douglas production function can be clearly captured in empirical data as a geometric entity with a quasi-inverse symmetry of variables. Based on the above discussion, we theoretically clarified why the Cobb–Douglas production function is better fitted to empirical data in economics, because it uniquely derives the fact that its variable follows a power-law distribution.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
In economics, firms are regarded as economic entities that produce goods \(\left( Y \right)\) using capital, labor, and other resources, which are called production factors \(\left( x_1,x_2,\ldots ,x_n \right)\). The production activity of firms is modeled as a function that inputs production factors and outputs the total production: \(Y=F\left( x_1,x_2,\ldots ,x_n\right)\). This is called a production function. As a simplified model, economists have proposed various two-variable production functions in which the production factors are capital \(\left( K\right)\) and labor \(\left( L\right)\).
A typical example is the Cobb–Douglas production function: \(Y\left( K,L\right) =A K^{\alpha } L^{\beta }\) (Cobb and Douglass 1928). Here, A is the total factor productivity, which is interpreted as a firm’s production efficiency or technological capability that cannot be measured by labor or capital. \(\alpha\) and \(\beta\) are called the capital share (capital elasticity) and labor share (labor elasticity), and represent production’s capital or labor dependency. The constant elasticity of substitution (CES) type production function, \(Y\left( K,L\right) =A \left\{ \delta K^{- \rho }+(1-\delta ) L^{-\rho } \right\} ^{-\frac{1}{\rho }}\), is an extension of the Cobb–Douglas production function (Solow 1956; Arrow et al. 1961). Here, \(\delta\) and \(\rho\) are the distribution and substitution parameters. The CES production function matches the Cobb–Douglas production function at \(\alpha + \beta = 1\) in the \(\rho \rightarrow 0\) limit as well as the Leontief production function: \(Y = A \min \{K, L\}\) in the limit of \(\rho \rightarrow + \infty\). Furthermore, economists proposed a translog-type production function, \(\log Y\left( K,L\right) =\log A + \alpha \log K+ \beta \log L + \gamma _1 \log ^2 K + \gamma _2 \log ^2 L + \gamma _3 \log K \log L\), as an extended version that explicitly includes the Cobb–Douglas production function (Christensen et al. 1973).
In economics, various forms of production functions have been proposed and widely used by researchers in microeconomics and macroeconomics. Total factor productivity has been applied to industries around the world since the late 1950s as an indicator of productivity, and many empirical studies have addressed production functions. However, the conventional research is limited to comparisons of the superiority of the production function based on the regression’s good fit to the shape of several production functions described above. Note that there is only a limited argument concerning why a production function takes a functional form, such as the Cobb–Douglas type. Among these studies, Houthakker focused on the similarity between the Cobb–Douglas production function and the power-law distribution, followed by its variables from the microeconomic foundation (Houthakker 1955). Unfortunately, however, his argument fails to reach the constant returns to scale discussed below.
In this context, as in Houthakker’s work, where the Cobb–Douglas production function is a power-law function of variables \(\left( K, L, Y \right)\), we argue that the Cobb–Douglas production function can be interpreted as an inverse symmetric plane and the residual from it in the 3D space of variables \(\left( K, L, Y \right)\) (Mizuno et al. 2012; Ishikawa et al. 2013, 2014). Here, we observed the quasi-inverse symmetry in the joint probability density function (PDF) \(P_{KLY} \left( K,L,Y\right)\) under variable exchange: \(Y \leftrightarrow a K^{\alpha } L^{\beta }\), which is deeply related to the power-law distribution in the large-scale range of three variables \(\left( K,L,Y\right)\) and the log-normal distribution in the mid-scale range (Gibrat 1932; Sutton 1997). Importantly, the observed fact that each variable (K, L, Y) of the production function follows a power-law distribution is uniquely concluded from the form of the Cobb–Douglas production function. In previous studies, however, the validation of the above analytical arguments with empirical data was indirect. We have not confirmed whether our quasi-inverse symmetric plane is consistent with the Cobb–Douglas production function, which is treated in economics. The possibility that capital share \((\alpha )\) and labor share \((\beta )\) are different between the quasi-inverse symmetric plane and the multiple regression plane cannot be denied.
In this study, we directly observed a Cobb–Douglas symmetric plane using the index of surface openness, which is used in geography, and successfully identified it. Based on this observation, we measured the capital share (capital elasticity) and labor share (labor elasticity), compared our results with the results of multiple regression analysis used in economics, and confirmed consistent agreement in seven countries: Japan, Germany, France, Spain, Italy, the UK, and The Netherlands. Thus, we showed that the Cobb–Douglas production function can be clearly captured in empirical data as a geometric entity with a quasi-inverse symmetry of variables. Our previous research was limited to indirect observation of the power-law region of the variables, but this current research is characterized by direct observation of the Cobb–Douglas symmetric plane in the whole region other than the power-law one. Based on the above discussion, we theoretically clarified why the Cobb–Douglas production function is better fitted to empirical data in economics.
The rest of this paper is structured as follows. Section 2 describes the data that we used in our analysis. Section 3 briefly reviews our previous work. In Sect. 4, we explain the method to measure the Cobb–Douglas symmetric plane using an index called surface openness. In Sect. 5, we identify the Cobb–Douglas symmetric plane by identifying a ridge in Sect. 4’s analytical preparation and measuring the capital share (capital elasticity) and labor share (labor elasticity). In addition, we measure the capital share (capital elasticity) and labor share (labor elasticity) by multiple regression analysis and confirm that both shares are consistent in seven countries: Japan, Germany, France, Spain, Italy, the UK, and The Netherlands. Finally, in Sect. 6, we summarize the main points of this paper and present future perspectives.
2 Analysis data
We used the 2016 version of the ORBIS database provided by Bureau van Dijk (2020). It is the world’s largest corporate financial database with approximately 200 million listed and unlisted firms collected worldwide from more than 120 local credit bureaus and information vendors, including from Asia, the Americas, Europe, the Middle East, and Africa. This database is characterized not only by the huge amount of data but also by the fact that its data are arranged in a standardized format and can be compared internationally. It includes the corporate financial data for 1,831,481 firms in Japan (JP), 3,458,922 firms in Italy (IT), 1,953,140 firms in France (FR), 1,204,584 firms in Germany (DE), 5,070,698 firms in the UK (GB), 1, 604, 553 firms in Spain (ES), and 3,213,808 firms in The Netherlands (NL). This study focuses on firms in these countries that have sufficient data for statistical analysis. Although the number of firms collected in the US is sufficient, we did not include it in this analysis because of an unnatural bias that may be attributable to the method of data collection by local credit bureaus or information vendors. Since there are about 1–2 million active firms in Japan (Statistics Bureau 2020), the data analyzed here are highly comprehensive. This situation is probably the same in other countries.
To make the most of the features of this database, we use sales as Y, assets as K, and number of employees as L. When economists consider a production function, they generally use Y for added value, K for fixed assets in manufacturing, K for liquid assets in non-manufacturing, and L for total labor hours or wages. On average, however, only around 1/10 of the firms in our database have detailed data to calculate added value, etc. If we use the same amount as economists, we lose the completeness of the data, and have to filter out firms that have detailed data to calculate value added, etc. What is most important in this study is that K, L, and Y follow power-law distributions in the large-scale ranges. This property is common to both the (K, L, Y) used in economics and the simple (K, L, Y) we use (Aoyama et al. 2008). If the argument of directly observing a quasi-inversion symmetric plane is self-consistent, either quantity may be used. Therefore, to ensure a statistical advantage with a sufficient amount of data, this study will proceed with the analysis using the quantities mentioned above.
3 Previous research
Such variables as sales \(\left( Y \right)\), assets \(\left( K \right)\), and number of employees \(\left( L \right)\), all of which represent a firm’s size, follow a power-law distribution in the large-scale range (Pareto 1897; Newman 2005; Clauset et al. 2009):
Here, P is a PDF, and power-law exponent \(\mu\) is also called a Pareto’s index.
Fujiwara et al. (2003, 2004) showed that these power distributions (1)–(3) can be derived from the inverse symmetry observed between the variables of 1 year \(\left( x_1\right)\) and those of the next \(\left( x_2\right)\):
and Gibrat’s law, where growth rate distribution \(Q\left( R|x_1 \right)\) conditioned by first-year size \(\left( x_1\right)\) does not depend on the first-year variables (Gibrat 1932; Sutton 1997):
Here, \(x_1\) represents \(\left( K, L, Y\right)\) in 1 year, and \(x_2\) represents \(\left( K, L, Y\right)\) in the next year. \(P _ {12}\) is the joint PDF, and \(R = x_2/x_1\) is the growth rate. In inversion symmetry, note that joint PDF \(P_{12}\) has the same shape on both sides of Eq. (4) under the permutation of variables \(x_1 \leftrightarrow x_2\). The derivation by Fujiwara et al. is effective when the power-law index does not change. This derivation is described in Appendix 1.
We extended the argument by introducing an extended growth rate of \(R = x_2/{a x _ 1 ^ { \theta }}\) in quasi-inverse symmetry under the permutation of \(a x _ 1 ^ { \theta } \leftrightarrow x _ 2\) variables:
We also showed that the power-law distribution can be derived from quasi-inverse symmetry (6) and Gibrat’s law (5) even when the power-law index varies quasi-statically. Here, a and \(\theta\) are parameters of quasi-inverse symmetry and are related with power exponents \(\left( \mu _1, \mu _2 \right)\) of variable \(\left( x_1, x_2 \right)\) as follows:
This derivation is described in Appendix 2.
First, we confirmed this relation by the temporal change of power-law index \(\mu\), which can be observed in Japanese land prices (Ishikawa 2006, 2009). We then showed that this quasi-inverse symmetry can be similarly observed among such different firm size variables as assets, number of employees, and sales \(\left( K, L, Y \right)\) at the same time and confirmed the relation (7) among their power-law indices (Mizuno et al. 2012; Ishikawa et al. 2013, 2014). In this case, we consider \(\left( x _ 1, x _ 2 \right)\) to be \(\left( K, L \right) , \left( K, Y \right)\), and \(\left( L, Y \right)\).
We extended the concept of quasi-inverse symmetry and the growth rate to three simultaneous variables \(\left( K, L, Y \right)\) and introduced growth rate \(R = Y/{a K ^ { \alpha } L ^ { \beta }}\) into a system with quasi-inverse symmetry under the variable substitution of \(a K ^ { \alpha } L ^ { \beta } \leftrightarrow Y\):
The quasi-inverse symmetry (8) of the three variables and Gibrat’s law of the three variables:
lead to power-law distributions of K, L, and Y, as in the case of two variables. The derivation of this conclusion is described below.
Note here that from the definition of the three-variable growth rate, if total factor productivity A is residual aR from the following quasi-inverse symmetric plane:
then we can interpret the Cobb–Douglas production function as the quasi-inverse symmetry in the three-variable space \(\left( K, L, Y \right)\). In the course of its derivation, we discussed the following in our previous work (Ishikawa et al. 2014).
Three-variable quasi-inverse symmetry (8) is rewritten in three variables \(\left( K, L, R \right)\) as follows:
This idea can be expressed as follows:
From the definition of a conditional PDF, \(Q\left( R|K,L\right) =P_{KLR} \left( K,L,R\right) /P_{KL} \left( K,L\right)\), this equation can be modified:
In the modification from the 2nd to the 3rd equation, we used a three-variable Gibrat’s law (9). Since the right-hand side of Eq. (13) is only a function of R, if we express it as \(G \left( R \right)\), the expression (13) can be written:
Expanding the right-hand side of Eq. (14) by \(R = 1 + \epsilon ~ ~ \left( 0 < \epsilon \ll 1 \right)\), the zeroth order of \(\epsilon\) is an obvious expression, and the first order becomes the following differential equation:
Here, \(G ^ {'} \left( \cdot \right)\) represents the differential of \(G \left( \cdot \right)\) by R. Also, no more useful information is available than the 2nd-order of \(\epsilon\).
The two variables \(\left( K, L \right)\) are strongly correlated and are not independent. Therefore, the differential equation (15) cannot be solved as it is. In our previous study (Ishikawa et al. 2014), we converted variables \(\left( K, L \right)\) into normalized variables \(\left( k, l \right)\):
and rotated them as \(- \pi /4\) and finally obtained orthogonal independent variables \(\left( z _ 1, z _ 2 \right)\):
Here, \(\left( m_K,m_L \right)\) and \(\left( \sigma _K,\sigma _L \right)\) are the mean and standard deviation of \(\left( \log K, \log L\right)\) in the power-law region of variables \(\left( K,L\right)\). In Eq. (16), the distribution’s width is normalized by dividing it by \(\left( \sigma _ K, \sigma _ L \right)\) after subtracting the average \(\left( m _ K, m _ L \right)\) from \(\left( \log K, \log L \right)\). In our previous study (Ishikawa et al. 2014), we numerically showed that this operation yields orthogonal independent variables \(\left( z _ 1, z _ 2 \right)\). We rewrote the differential equation (15) with the independent variables \(\left( z _ 1, z _ 2 \right)\) and solved it to analytically show that variable \(\left( z _ 1, z _ 2 \right)\) obeys the power-law distribution and numerically confirmed the result in empirical data.
Finally, we analytically derived the distribution function followed by variable \(\left( K, L \right)\) by the above inverse transformation and confirmed it numerically. This discussion is based on the fact that the Cobb–Douglas production function is a quasi-inverse symmetric plane in \(\left( K, L, Y \right)\) space. Therefore, the above numerical consistency indirectly confirms our interpretation of the Cobb–Douglas production function.
4 Analytical preparation for direct observation of the Cobb–Douglas quasi-inverse symmetric plane
In the previous section, variables \(\left( K, L \right)\) were normalized by dividing them by logarithmic standard deviations, which are indicators of the width of the distribution in the large-scale range where the distribution follows the power law. On the other hand, the width of the distribution of the variables following the power-law distribution of exponent \(\mu\) is also given as \(1/\mu\) on a logarithmic scale. Thus, the normalization of \(\left( \log K, \log L \right)\) divided by distribution width \(\left( \sigma _ K, \sigma _ L \right)\) in Eq. (16) is equivalent to the operation of dividing by \(\left( 1/\mu _ K, 1/\mu _ L \right)\) or multiplying by \(\left( \mu _ K, \mu _ L \right)\):
Here, the origin is shifted by subtracting the logarithm of the power-law ranges’ upper limits \((\log K_{\mathrm{up}}, \log L_{\mathrm{up}})\).
We first consider the quasi-inverse symmetry of two variables \(\left( K, L \right)\). Because the normalized widths of the k and l distributions are equal, a two-variable quasi-inverse symmetric line, \(\log L = \theta \log K + \log a\) (where \(x _ 1 = K, x _ 2 = L\)), is converted to a slope 1 inverse symmetric line: \(\log l = \log k + \log a '\). This is consistent with Eq. (7) where the relationship between slope \(\theta\) of the quasi-inverse symmetric line and exponents \(\mu _K\) and \(\mu _L\) is given:
Here \(\mu _ 1 = \mu _ K\), and \(\mu _ 2 = \mu _ L\).
Extending this concept to three variables \(\left( K, L, Y \right)\), the following relationship is established between the capital and labor share (\(\alpha\), \(\beta\)) of the three-variable quasi-inverse symmetric plane (10) and the power-law exponents \(\left( \mu _ K, \mu _ L, \mu _ Y \right)\):
This is an extension of the constant returns to scale:
Using the following normalized notation:
Equation (20) is simply expressed as:
In Fig. 1, we compared the accuracy of Eqs. (21) and (23) using \({ \alpha } _ M\), \({ \beta } _ M\), \({ \alpha '} _ M\), and \({ \beta '} _ M\) evaluated by multiple regression analysis used in economics in Japanese firms from 2010 to 2014. The database that we used was created using a collection method that reduced the number of firms in the past. Therefore, Fig. 1 shows data from 2014 to the previous five years. Data for 2015 were not included, because they were still being collected when we obtained the database. From Fig. 1, we can confirm that the accuracy of the constant returns to scale, normalized by the power-law index (23), and is higher than the conventional one (21). Results similar to those of Japanese firms can be confirmed for firms in the countries that we analyzed.
More interestingly, the normalized capital share (capital elasticity) \(\alpha '\) and the labor share (labor elasticity) \(\beta '\) can be directly estimated by observing the Cobb–Douglas symmetric plane, as described below.
In the previous section’s discussion, we rewrote the quasi-inverse symmetry of the \(\left( K, L, Y \right)\) space to the quasi-inverse one of the \(\left( K, L, R \right)\) space and limited the discussion to the power-law region of \(\left( K, L \right)\) where Gibrat’s law holds, and derived a differential equation for \(\left( K, L \right)\) by expanding growth rate R around “1”. In this discussion, we only investigate the neighborhood of the quasi-inverse symmetric plane. To directly observe the Cobb–Douglas symmetric plane, the entire space of the three variables must be observed without adopting this method.
Thus, in addition to Eq. (18), we consider the standardization of Y:
For the two variables \(\left( K, L \right)\), the direction vector of the quasi-inverse symmetric axis is \(\left( 1/\mu _K, 1/\mu _L \right)\), and the normalized direction vector is \(\left( 1,1 \right)\) in two-variable space (k, l). Using Eq. (17), we rotated it to be \(\left( \sqrt{2}, 0 \right)\), resulting in independent variables \(\left( z _ 1, z _ 2 \right)\). For the three variables \(\left( K, L, Y \right)\), the densest direction vector in the quasi-inverse symmetric plane is \(\left( 1/\mu _K, 1/\mu _L, 1/\mu _Y \right)\). The rotation of normalized direction vector \(\left( 1,1,1 \right)\) in three-variable space (k, l, y) to \(\left( \sqrt{3}, 0,0 \right)\) is given by:
This rotation caused the densest directional vector in the Cobb–Douglas quasi-inverse symmetric plane to overlap the \(z _ 1\) axis, as shown in Fig. 2.
To align a Cobb–Douglas quasi-inverse symmetric plane with the \(w _ 3 = Const.\) plane, we need to rotate it further about the \(z _ 1\) axis. The rotation, expressed using parameter \(\psi\), is as follows (see Fig. 3):
Due to the above rotations in Eqs. (25)–(29), the Cobb–Douglas quasi-inverse symmetric plane overlaps the \(w _ 3 =\) Const. plane. Here, if the \(w _ 3 =\) Const. plane is expressed by the quasi-inverse symmetric plane in the \(\left( k, l, y \right)\) space by inverse transformation, it can be expressed:
From the above equation, using parameter \(\psi\) that represents the slope of the Cobb–Douglas quasi-inverse symmetric plane, normalized capital share (capital elasticity) \(\alpha '\) and labor share (labor elasticity) \(\beta '\) can be expressed:
5 Direct observation of the Cobb–Douglas quasi-inverse symmetric plane
In the previous section, we analytically discussed the following procedures to simplify observing the Cobb–Douglas quasi-inverse symmetric plane in three-variable space \(\left( K, L, Y \right)\). First, we converted three variables \(\left( K, L, Y \right)\) into variables \(\left( k, l, y \right)\), which are normalized, such as the values of the power-law exponents in the power-law region to be “1”. Next, we consider a rotation (25)–(27) that converts the densest direction vector \(\left( 1,1,1 \right)\) in a Cobb–Douglas quasi-inverse symmetric plane into a vector of only the first component and obtain variables \(\left( z _ 1, z _ 2, z _ 3 \right)\).
If 3D data are projected onto the \(z _ 1z _ 2\) plane in the \(z _ 3\) direction, then the densest direction vector should overlap the \(z _ 1\) axis. If 3D data are projected onto the \(z _ 1z _ 3\) plane in the \(z _ 2\) direction, the densest direction vector should still overlap the \(z _ 1\) axis. When 3D data are then projected onto the \(z _ 2z _ 3\) plane in the \(z _ 1\) direction, the densest direction vector is observed, and its slope is given as \(\psi\). In this section, we first confirm the above analytical discussions by directly observing empirical data. In direct observation, identifying the densest directional vector in a 2D plane is critical. We observed a dense directional vector as a ridge using the geographical index of surface openness, as in previous studies (Ishikawa et al. 2014). The method is briefly described below.
\(z_1\) and \(z_2\) data points are scattered in the \(z_1 z_2\)-plane. For example, Fig. 4 is a scatter plot of Japanese firms in 2014. To clearly comprehend the density, we divided it into logarithmically equal-sized cells and expressed the amount of data points in the cells by different shades. For example, Fig. 5 is a diagram of Japanese firms in 2014 created in this way. The logarithm of a cell’s density is its height. Then, as stated above, the ridge must be observed horizontally in the \(z_1 z_2\) plane. As the steepest-ascent line in the profit space, a ridge was previously discussed (Souma 2007; Aoyama et al. 2008). In our previous study (Ishikawa et al. 2014), we determined the cells that constitute the ridge using the surface openness defined as follows (Yokoyama et al. 1999, 2002; Prima et al. 2006).
Figure 6 depicts the grid linked by the center points of the cells. From grid point A in Fig. 6, we counterclockwisely represent each azimuth as \(D=1, 2, \ldots , 8\). As shown in Fig. 7, the minimum zenith and nadir angles at grid point A within distance L in azimuth D are represented by \(_D \phi _L\) and \(_D \psi _L\). Positive openness \(\varPhi _L\) is defined by the mean value of \(_D \phi _L\) along the eight azimuths, and negative openness \(\varPsi _L\) is the corresponding mean of \(_D \psi _L\). The surface openness is defined by the following difference:
The surface openness takes a negative value at the depressions and the valleys, zero at the level surface, the saddle point, and the uniform slope, and positive values at the ridge (see Fig. 8) and the summit (see Fig. 9). The shading in Fig. 5 shows only absolutely high and low points. However, using the surface openness, points that are relatively higher than the circumference (a ridge or a summit) or points that are relatively lower than the circumference (a valley or a depression) can be easily extracted. This is the advantage of the surface openness.
In this analysis, by setting \(L=5\), we estimate the surface openness for each cell and extract the cells of the openness that exceed 0.9, the value of which appropriately determines the ridge in this data set. In Fig. 5, the cells are expressed by black dots. As expected, the series of densest cells observed as a ridge overlap the \(z _ 1\) axis.
A scatter plot in which 3D data are projected onto the \(z _ 1z _ 3\) plane in the \(z _ 2\) direction is similarly processed to obtain Fig. 10, which also shows that, as expected, the series of densest cells observed as a ridge overlap the \(z _ 1\)-axis.
A scatter plot in which 3D data are projected onto the \(z _ 2z _ 3\) plane in the \(z _ 1\) direction is similarly processed to obtain Fig. 11, which shows that the ridge, which is a series of the densest cells, is located from the upper right to the lower left. The angle between this ridge and the \(z _ 2\) axis is \(\psi\), which we measured, assuming that a ridge is a straight line that regresses the center of cells with a surface openness greater than 0.9, which was appropriately determined to be a ridge cell. For example, for Japanese firms in 2014, \(\psi = 0.39 \pm 0.04\). In this case, using Eq. (31), we can estimate \({ \alpha '} _ R = 0.16 \pm 0.03\) and \({ \beta '} _ R = 0.84 \pm 0.16\). On the other hand, with the same data and multiple regression analysis, we also estimate \({ \alpha '} _ M = 0.16 \pm 0.0\) and \({ \beta '} _ M = 0.84 \pm 0.0\); both values are in good agreement. Figure 12 shows measurements taken by Japanese firms from 2014 to 2010 and by firms in Germany, France, Spain, Italy, and The Netherlands during the 5-year period for which data were available (from 2013 to 2009). Among the measurements, those in the year with the largest number of firms in each country are plotted in Fig. 13.
From Figs. 12 and 13, we confirmed that capital elasticity \({ \alpha _ R} '\), measured by the ridge specified by the surface openness and capital elasticity \({ \alpha _ M}'\) measured by the multiple regression analysis used in economics, are in good agreement with the 5-year firm data of the seven countries.
6 Conclusion and discussion
Our previous study argued that the Cobb–Douglas production function, which is the core concept of economics, uniquely derived power-law distributions of its variables. In this study, we show that the Cobb–Douglas production function can be directly observed using empirical data. If firms’ assets, labor, and production are expressed as a set of points in a 3D space, the Cobb–Douglas production function can be interpreted as a quasi-inverse symmetric plane in 3D space and a residual from it. From this viewpoint, we rotated the Cobb–Douglas quasi-inverse symmetric plane in 3D space to simplify observation and projected the data in 3D space onto the 2D plane. With the geographic index of surface openness, we identified the densest axis in the plane as the ridge.
Identifying this axis is difficult when the dispersion is large in the 2D plane. For example, by dividing the data on a 2D plane into equally spaced bins in a certain direction and calculating the average value or the logarithmic average value in the vertical direction for each bin, we can combine them and find the axis. However, since the average value or the logarithmic average value varies depending on the selection method in a certain direction, the axis obtained by combining the average value or the logarithmic average value is not fixed to one axis. What is not affected by such arbitrariness is the advantage of the index of surface openness adopted in this paper. Using it, we successfully and directly observed the Cobb–Douglas production function as a quasi-inverse symmetric plane.
The approach proposed by Hildenbrnad (1981) and developed by Dosi et al. (2016) shares our arguments and perspectives on firms’ productivity, but uses a very different approach to measuring it. They theoretically showed that measurement of productivity is possible without depending on the shape of the production function by the volume of a polyhedron composed of points in 3D space and the angle formed with the axis by the main diagonal, and confirmed it by actual data. Their approach to measuring productivity is very innovative. The advantage of our study, on the other hand, is that it geometrically identifies the form of the production function that they avoided. This makes it possible to discuss specifically how production relates to assets and labor as follows.
Interestingly, the capital and labor elasticities of the Cobb–Douglas production function can be expressed using the slope of a quasi-inverse symmetric plane in 3D space. By comparing the measured values of the capital elasticity observed in this way with those by the multiple regression analysis conventionally used in economics, we showed that the accuracy of these measured values agrees with the accuracy using the data for 5 years of seven countries with sufficient data quantity, including Japan.
It is not obvious that the planes of the multiple regression analysis and the quasi-inverse symmetry become identical. If all the data in the 3D space exist on a plane, they coincide. In practice, however, the data are widely dispersed. If they are spread vertically about the plane based on a log-normal distribution and are not concentrated on the symmetry plane, clearly identifying the ridge is difficult where the data are concentrated, even if with quasi-inverse symmetry. This paper clarified that such a ridge exists in empirical data using the index of surface openness in 2D distribution in which 3D distribution is projected.
The fact that these observations are consistent in the seven countries with sufficient data means that the 3D space composed of capital, labor, and production has a ridge where the data are concentrated as a distinct entity. This idea can probably be applied to countries other than the seven countries surveyed in this paper. Therefore, if sufficient data exist, the results in this paper can be reproduced. As a result, we clarified from empirical data that the Cobb–Douglas production function, which has been discussed in economics, can be interpreted as the quasi-inverse symmetry of three variables and the residual from it. This study theoretically clarified why the Cobb–Douglas production function fits the empirical data better than a simple comparison of candidate production functions due to the goodness of fit of the data.
In the process, we also carried out the following new consideration on the constant returns to scale, which is a feature of the Cobb–Douglas production function. In economics, when the variables of the production function (K and L in this case) are multiplied by \(\lambda\) and production \(\left( Y\right)\) is also multiplied by \(\lambda\) such as \(\alpha + \beta =1\), the production function is called the constant returns to scale. The increasing returns if it exceeds \(\lambda\) times such as \(\alpha + \beta >1\), and the diminishing returns if it is less than \(\lambda\) times, such as \(\alpha + \beta <1\). These three cases of returns to scale are easy to understand by considering the following simple examples. When the number of factories is multiplied by \(\lambda\), the assets and labor of a firm are each multiplied by \(\lambda\) if the increased factories are the same size. The case where there is no change in the production efficiency and the production quantity becomes \(\lambda\) times is called the constant returns to scale; the case where the production efficiency improves and the production quantity becomes larger than \(\lambda\) times is called the increasing returns; the case where the production efficiency deteriorates and the production quantity becomes smaller than \(\lambda\) times is called the diminishing returns. Economic arguments often assume constant returns to scale in the production function, and in fact, this is often observed approximately in various empirical data analyses.
These observations indicate that, on average, a constant return to scale is achieved in firm production. In this study, we showed by direct observation that the Cobb–Douglas production function can be understood as a quasi-inverse symmetric plane in the space created by three variables \(\left( K, L, Y \right)\) and the residual from it. We concluded that the constant returns to scale is strictly satisfied with \(\alpha '\) and \(\beta '\) normalized by the power-law indices. Since the values of power-law exponents \(\mu _ K\), \(\mu _ L\), and \(\mu _ Y\) are all close to “1”, the constant returns to scale was approximately observed even in the non-standardized capital share (capital elasticity) \(\alpha\) and labor share (labor elasticity) \(\beta\).
In this study, as in our previous study, we did not limit the data area in which the Cobb–Douglas production function was observed to the large-scale region where the power law holds. As our previous study showed, there is also quasi-inverse symmetry in the mid-scale region. The collapse of the power-law distribution observed in the large-scale region reflects that Gibrat’s law does not hold in the mid-scale region. This concept is independent of quasi-inverse symmetry. On the other hand, serious questions remain about the completeness of the data in the small scale. The current study did not exclude small data. If it can be implemented in a reliable way, the numerical consistency between our proposed method and multiple regression analysis may be higher. This is a future issue.
In our method, a firm’s assets (K), number of employees (L), and production (Y) are regarded as one point in three-variable space (K, L, Y), and we focus on how those several million groups are distributed. This is an approach to microfoundations that attempts to derive the macroscopic properties of their distribution from the microscopic objects of individual points. This is an important issue that has been studied in economics for many years. Finally, we describe the relationship between our discussion and Aoki et al., a pioneering study as a physical approach to this problem.
Aoki and Yoshikawa hypothesized a multisystem in which the total number of employees and the total amount of production of a company are fixed, and found that the randomness (entropy) is greatest in a society in equilibrium. In that case, they used physics methodologies to claim that the average number of employees in firms follows the Boltzmann distribution (Aoki and Yoshikawa 2007). Aoyama et al. noticed that there is actually a large fluctuation in the total amount of production and incorporated the effect by superimposing the Boltzmann distribution. As a result, it was theoretically shown that labor productivity of (Y/L) follows the power-law distribution in the large-scale range and the Boltzmann distribution with negative temperature in the middle- and low-scale ranges, and this was confirmed by the empirical data (Aoyama et al. 2009, 2009, 2010, 2015, 2017; Souma et al. 2009; Iyetomi 2012).
This can be described geometrically as follows. First, consider the 2D plane of the logarithmic axes of labor productivity (Y/L) on the horizontal axis and number of employees (L) on the vertical axis. The horizontal axis is divided into logarithmically equal-sized bins, and the vertical average of the data in the bins is plotted. The distribution followed by these points can be described by superposition of the Boltzmann distribution.
In this paper, we show the following. Total factor productivity \((Y/{K ^ { \alpha } L ^ { \beta }})\) is divided into logarithmically equal-sized bins, and the distribution of the number of data in the bins is observed. We can determine \(( \alpha , \beta , a)\), so that the distribution is symmetric with respect to a constant a. Thus, our observations differ from those of Aoki et al. To unite the two, we need to extend the discussion of Aoyama et al. to total factor productivity, including capital (K) . If this can be carried out, it will be possible to discuss the distribution of productivity that firms’ capital as well as labor, and to analyze firms in various industries in terms of both labor and capital. This is a major challenge for the future.
References
Aoki M, Yoshikawa H (2007) Reconstructing macroeconomics - a perspecrive from statistical physics and conbinatiorial stochastic processes. Cambridge University Press, Cambrige
Aoyama H, Iyetomi H, Ikeda Y, Souma W, Fujiwara Y (2008) Econophysics. Kyoritsu Shuppan CO., LTD, Tokyo
Aoyama H, Yoshikawa H, Iyetomi H, Fujiwara Y (2009) Labour productivity superstatistics. Prog Theor Phys Suppl No 179:80–92
Aoyama H, Fujiwara Y, Ikeda Y, Iyetomi H, Souma W (2009) Superstatistics of labour productivity in manufacturing and nonmanufacturing sectors. Econ Open Access Open Assess E J 3:2009–22
Aoyama H, Yoshikawa H, Iyetomi H, Fujiwara Y (2010) Productivity dispersion: facts, theory, and implications. J Econ Interact Coord 5:27–54
Aoyama H, Iyetomi H, Yoshikawa H (2015) Equilibrium distribution of labor productivity. J Econ Interact Coord 10:57–66
Aoyama H, Fujiwara Y, Ikeda Y, Iyetomi H, Souma W, Yoshikawa H (2017) Macro-econophysics: new studies on economic networks and synchronization (physics of society: econophysics and sociophysics). Cambridge University Press, Cambrige
Arrow KJ, Chenery HB, Minhas BS, Solow RM (1961) Capital-labor substitution and economic efficiency. Rev Econ Stat 43:225–250
Bureau van Dijk (2020) http://www.bvdinfo.com/Home.aspx/
Christensen LR, Jorgenson DW, Lau LJ (1973) Transcendental logarithmic production function. Rev Econ Stat 55:28–45
Clauset A, Shalizi CR, Newman MEJ (2009) Power-law distributions in empirical data. SIAM Rev 51:661–703
Cobb CW, Douglass PH (1928) A theory of production. Am Econ Rev 18:139–165
Dosi G, Grazzi M, Marrengo L, Settepanella S (2016) Production theory: accounting for firm heterogeneity and technical change. J Ind Econ 64(4):875–907
Fujiwara Y, Souma W, Aoyama H, Kaizoji T, Aoki M (2003) Growth and fluctuations of personal income. Phys A 321:598–604
Fujiwara Y, Guilmi CD, Aoyama H, Gallegati M, Souma W (2004) Do Pareto–Zipf and Gibrat laws hold true? An analysis with European firms. Phys A 335:197–216
Gibrat R (1932) Les Inégalités Économiques. Sirey, Paris
Hildenbrnad W (1981) Short-run production functions based on microdata. Econometrica 49(5):1095–1125
Houthakker HS (1955) The Pareto distribution and the cobb-douglas production function in activity analysis. Rev Econ Stud 23:27–31
Ishikawa A (2006) Annual change of Pareto index dynamically deduced from the law of detailed quasi-balance. Phys A 371:525–535
Ishikawa A (2009) Quasistatically varying log-normal distribution in the middle scale region of Japanese land prices. Prog Theor Phys Suppl 179:103–113
Ishikawa A, Fujimoto S, Watanabe T, Mizuno T (2013) The emergence of different tail exponents in the distributions of firm size variables. Phys A 392:2104–2113
Ishikawa A, Fujimoto S, Mizuno T, Watanabe T (2014) Analytical derivation of power laws in firm size variables from Gibrat’s law and quasi-inversion symmetry: a geomorphological approach. J Phys Soc Jpn 83:034802
Iyetomi H (2012) Labor productivity distributions with negative tempreature. Prog Theor Phys Suppl No 194:135–143
Mizuno T, Ishikawa A, Fujimoto S, Watanabe T (2012) Power laws in firm productivity. Prog Theor Phys Suppl 194:122–134
Newman MEJ (2005) Power laws, Pareto distributions and Zipf’s law. Contemp Phys 46:323–351
Pareto V (1897) Cours d’Économie Politique. Macmillan, London
Prima ODA, Echigo A, Yokoyama R, Yoshida T (2006) Supervised landform classification of Northeast Honshu from DEM-derived thematic maps. Geomorphology 78:373–386
Solow RM (1956) A contribution to the theory of economic growth. Q J Econ 70:65–94
Souma W (2007) Networks of firms and the ridge in the production space. In: Chatterjee A, Chakrabarti BK (eds) Econophysics of markets and business networks. Springer Italia, Milan, pp 149–158
Souma W, Ikeda Y, Iyetomi H, Fujiwara Y (2009) Distribution of labour productivity in Japan over the period 1996–2006. Econ Open Access Open Assess E J 3:2009–14
Statistics Bureau (2020) Ministry of Internal Affairs and Communications. https://www.stat.go.jp/english/index.html
Sutton J (1997) Gibrat’s legacy. J Econ Lit 35:40–59
Yokoyama R, Sirasawa M, Kikuchi Y (1999) Representation of topographical fearures by openness. J Jpn Soc Photogramm Remote Sens 38:26–34
Yokoyama R, Sirasawa M, Pike R (2002) Visualizing topography by openness: a new application of image processing to digital elevation models. Photogramm Eng Remote Sens 68:257–265
Acknowledgements
The authors would like to thank Dr. Tsutomu Watanabe with whom much of this work was discussed. This study was supported by JSPS KAKENHI Grant Numbers 17K01277, 16H05904, 18H05217, ROIS NII Open Collaborative Research, Research Organization of Information and Systems 2019-FS01, the Obayashi Foundation, and The Okawa Foundation for Information and Telecommunications.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there is no conflict of interest regarding the publication of this article.
Ethical approval
This article does not contain any studies with human participants performed by any of the authors.
Informed consent
Informed consent was obtained from all individual participants included in the study.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix 1
Here, we show that under the inverse symmetry (4), according to Fujiwara et al. (2003, 2004), the power laws of \(x _ 1\) and \(x _ 2\) with the same Pareto index are derived from Gibrat’s law (5). Using growth rate \(R=x_2/x_1\), (4) is rewritten by variables \(x_1, R\) as follows:
Using conditional PDF \(Q (R | x_1) = P_{12} (x_1, R) / P (x_1)\) and Gibrat’s law (5), this can be reduced:
Since the system has inverse symmetry, Gibrat’s law (5) is also established under the inverse transformation of \(x_1 \leftrightarrow R x_1 ~ (= x_ {2})\). The right side of Eq. (34) is only a function of R, and so, we signify it by G(R) and expand Eq. (34) by \(R = 1 + \epsilon ~(\epsilon \ll 1)\) in \(\epsilon\). The zeroth order term of \(\epsilon\) is trivial, and the first-order term yields the following differential equation:
Here, \(G '(\cdot )\) means the R differentiation of \(G (\cdot )\). No more useful information can be obtained from the second-order and higher order terms of \(\epsilon\). The solution to this equation is uniquely given:
This solution satisfies Eq. (34) even if R is not near \(R = 1\), when \(Q (R) = R ^ {-G '(1) -1} ~Q (R)\) holds (this is called a reflection law). Reflection law has been confirmed with various actual data (Fujiwara et al. 2004). Finally, if we set \(G '(1) = \mu + 1\), Eq. (36) matches the power laws (1), (2), and (3) described by variables at 1 year \(x_1\). Since this system has symmetry \(x_1 \leftrightarrow x_ {2}\), the power laws holds for the same Pareto index \(\mu\) even at time 2.
Appendix 2
Here, we show that according to Ishikawa (2006, 2009), Mizuno et al. (2012), and Ishikawa et al. (2013, 2014), under the quasi-inverse symmetry (6), the power laws of \(x _ 1\) and \(x _ 2\) with different Pareto indices are derived from Gibrat’s law (5). Using extended growth rate \(R = x_{T+1}/{a {x_T}^{\theta }}\), (6) is rewritten by variables \(x_1, R\) as follows:
Equation (37) is reduced to Eq. (33) at \(\theta = 1\). Using the conditional PDF \(Q (R | x_T)\) and Gibrat’s law (5), this is reduced to:
Here, we assume that Gibrat’s law (5) holds under a transformation: \(x_1 \leftrightarrow R^{1/\theta } x_1 ~(=(x_{2}/a)^{1/\theta })\). This is valid in a system that has quasi-inverse symmetry. Since the last term in Eq. (38) is only a function of R, we signify it by \(G _ {\theta } (R)\) and expand Eq. (38) to R near 1 as \(R = 1 + \epsilon ~(\epsilon \ll 1)\). The zeroth order of \(\epsilon\) is trivial, and the first-order term yields the following differential equation:
Here, \({G_{\theta }}'(\cdot )\) denotes the R differentiation of \(G_{\theta }(\cdot )\). No more useful information can be obtained from the second-order and higher order terms of \(\epsilon\). The solution to this equation is uniquely given:
Similar to Appendix 1, this solution satisfies Eq. (38) even if R is not near \(R = 1\), when \(Q(R) = R^{-{G_{\theta }}'(1)-1}~Q(R)\) holds.
Next, in quasi-static system \((x_1, x_ {2})\), we identify distribution \(P (x_ {2})\). Actually, we should write \(P_ {x_1} (x_1)\), \(P_ {x_ {2}} (x_ {2})\); however, because function forms are complicated, they are collectively written as P. From Eq. (40) and \(P(x_1) dx_1 = P(x_{2}) dx_{2}\), \(P (x_ {2})\) can be expressed:
Here, we signify Pareto indices at year 1, 2 by \(\mu _{1}\), \(\mu _{2}\) and represent \(P(x_1)\), \(P(x_{2})\) as follows:
Comparing Eqs. (40) and (41) to Eq. (42), we obtain \(\theta {G_{\theta }}'(1) = \mu _1 + 1\), \({G_{\theta }}'(1) - 1/\theta + 1 = \mu _{2} + 1\) and conclude the relation among \(\mu _1\), \(\mu _{2}\), and \(\theta\) as follows:
About this article
Cite this article
Ishikawa, A., Fujimoto, S. & Mizuno, T. Why does production function take the Cobb–Douglas form?. Evolut Inst Econ Rev 18, 79–102 (2021). https://doi.org/10.1007/s40844-020-00180-3
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40844-020-00180-3