1 Introduction

In economics, firms are regarded as economic entities that produce goods \(\left( Y \right)\) using capital, labor, and other resources, which are called production factors \(\left( x_1,x_2,\ldots ,x_n \right)\). The production activity of firms is modeled as a function that inputs production factors and outputs the total production: \(Y=F\left( x_1,x_2,\ldots ,x_n\right)\). This is called a production function. As a simplified model, economists have proposed various two-variable production functions in which the production factors are capital \(\left( K\right)\) and labor \(\left( L\right)\).

A typical example is the Cobb–Douglas production function: \(Y\left( K,L\right) =A K^{\alpha } L^{\beta }\) (Cobb and Douglass 1928). Here, A is the total factor productivity, which is interpreted as a firm’s production efficiency or technological capability that cannot be measured by labor or capital. \(\alpha\) and \(\beta\) are called the capital share (capital elasticity) and labor share (labor elasticity), and represent production’s capital or labor dependency. The constant elasticity of substitution (CES) type production function, \(Y\left( K,L\right) =A \left\{ \delta K^{- \rho }+(1-\delta ) L^{-\rho } \right\} ^{-\frac{1}{\rho }}\), is an extension of the Cobb–Douglas production function (Solow 1956; Arrow et al. 1961). Here, \(\delta\) and \(\rho\) are the distribution and substitution parameters. The CES production function matches the Cobb–Douglas production function at \(\alpha + \beta = 1\) in the \(\rho \rightarrow 0\) limit as well as the Leontief production function: \(Y = A \min \{K, L\}\) in the limit of \(\rho \rightarrow + \infty\). Furthermore, economists proposed a translog-type production function, \(\log Y\left( K,L\right) =\log A + \alpha \log K+ \beta \log L + \gamma _1 \log ^2 K + \gamma _2 \log ^2 L + \gamma _3 \log K \log L\), as an extended version that explicitly includes the Cobb–Douglas production function (Christensen et al. 1973).

In economics, various forms of production functions have been proposed and widely used by researchers in microeconomics and macroeconomics. Total factor productivity has been applied to industries around the world since the late 1950s as an indicator of productivity, and many empirical studies have addressed production functions. However, the conventional research is limited to comparisons of the superiority of the production function based on the regression’s good fit to the shape of several production functions described above. Note that there is only a limited argument concerning why a production function takes a functional form, such as the Cobb–Douglas type. Among these studies, Houthakker focused on the similarity between the Cobb–Douglas production function and the power-law distribution, followed by its variables from the microeconomic foundation (Houthakker 1955). Unfortunately, however, his argument fails to reach the constant returns to scale discussed below.

In this context, as in Houthakker’s work, where the Cobb–Douglas production function is a power-law function of variables \(\left( K, L, Y \right)\), we argue that the Cobb–Douglas production function can be interpreted as an inverse symmetric plane and the residual from it in the 3D space of variables \(\left( K, L, Y \right)\) (Mizuno et al. 2012; Ishikawa et al. 2013, 2014). Here, we observed the quasi-inverse symmetry in the joint probability density function (PDF) \(P_{KLY} \left( K,L,Y\right)\) under variable exchange: \(Y \leftrightarrow a K^{\alpha } L^{\beta }\), which is deeply related to the power-law distribution in the large-scale range of three variables \(\left( K,L,Y\right)\) and the log-normal distribution in the mid-scale range (Gibrat 1932; Sutton 1997). Importantly, the observed fact that each variable (KLY) of the production function follows a power-law distribution is uniquely concluded from the form of the Cobb–Douglas production function. In previous studies, however, the validation of the above analytical arguments with empirical data was indirect. We have not confirmed whether our quasi-inverse symmetric plane is consistent with the Cobb–Douglas production function, which is treated in economics. The possibility that capital share \((\alpha )\) and labor share \((\beta )\) are different between the quasi-inverse symmetric plane and the multiple regression plane cannot be denied.

In this study, we directly observed a Cobb–Douglas symmetric plane using the index of surface openness, which is used in geography, and successfully identified it. Based on this observation, we measured the capital share (capital elasticity) and labor share (labor elasticity), compared our results with the results of multiple regression analysis used in economics, and confirmed consistent agreement in seven countries: Japan, Germany, France, Spain, Italy, the UK, and The Netherlands. Thus, we showed that the Cobb–Douglas production function can be clearly captured in empirical data as a geometric entity with a quasi-inverse symmetry of variables. Our previous research was limited to indirect observation of the power-law region of the variables, but this current research is characterized by direct observation of the Cobb–Douglas symmetric plane in the whole region other than the power-law one. Based on the above discussion, we theoretically clarified why the Cobb–Douglas production function is better fitted to empirical data in economics.

The rest of this paper is structured as follows. Section 2 describes the data that we used in our analysis. Section 3 briefly reviews our previous work. In Sect. 4, we explain the method to measure the Cobb–Douglas symmetric plane using an index called surface openness. In Sect. 5, we identify the Cobb–Douglas symmetric plane by identifying a ridge in Sect. 4’s analytical preparation and measuring the capital share (capital elasticity) and labor share (labor elasticity). In addition, we measure the capital share (capital elasticity) and labor share (labor elasticity) by multiple regression analysis and confirm that both shares are consistent in seven countries: Japan, Germany, France, Spain, Italy, the UK, and The Netherlands. Finally, in Sect. 6, we summarize the main points of this paper and present future perspectives.

2 Analysis data

We used the 2016 version of the ORBIS database provided by Bureau van Dijk (2020). It is the world’s largest corporate financial database with approximately 200 million listed and unlisted firms collected worldwide from more than 120 local credit bureaus and information vendors, including from Asia, the Americas, Europe, the Middle East, and Africa. This database is characterized not only by the huge amount of data but also by the fact that its data are arranged in a standardized format and can be compared internationally. It includes the corporate financial data for 1,831,481 firms in Japan (JP), 3,458,922 firms in Italy (IT), 1,953,140 firms in France (FR), 1,204,584 firms in Germany (DE), 5,070,698 firms in the UK (GB), 1, 604, 553 firms in Spain (ES), and 3,213,808 firms in The Netherlands (NL). This study focuses on firms in these countries that have sufficient data for statistical analysis. Although the number of firms collected in the US is sufficient, we did not include it in this analysis because of an unnatural bias that may be attributable to the method of data collection by local credit bureaus or information vendors. Since there are about 1–2 million active firms in Japan (Statistics Bureau 2020), the data analyzed here are highly comprehensive. This situation is probably the same in other countries.

To make the most of the features of this database, we use sales as Y, assets as K, and number of employees as L. When economists consider a production function, they generally use Y for added value, K for fixed assets in manufacturing, K for liquid assets in non-manufacturing, and L for total labor hours or wages. On average, however, only around 1/10 of the firms in our database have detailed data to calculate added value, etc. If we use the same amount as economists, we lose the completeness of the data, and have to filter out firms that have detailed data to calculate value added, etc. What is most important in this study is that K, L, and Y follow power-law distributions in the large-scale ranges. This property is common to both the (KLY) used in economics and the simple (KLY) we use (Aoyama et al. 2008). If the argument of directly observing a quasi-inversion symmetric plane is self-consistent, either quantity may be used. Therefore, to ensure a statistical advantage with a sufficient amount of data, this study will proceed with the analysis using the quantities mentioned above.

3 Previous research

Such variables as sales \(\left( Y \right)\), assets \(\left( K \right)\), and number of employees \(\left( L \right)\), all of which represent a firm’s size, follow a power-law distribution in the large-scale range (Pareto 1897; Newman 2005; Clauset et al. 2009):

$$\begin{aligned}&P_K \left( K \right) \propto K^{-\mu _K-1} , \end{aligned}$$
(1)
$$\begin{aligned}&P_L \left( L\right) \propto L^{-\mu _L-1} , \end{aligned}$$
(2)
$$\begin{aligned}&P_Y \left( Y\right) \propto Y^{-\mu _Y-1}. \end{aligned}$$
(3)

Here, P is a PDF, and power-law exponent \(\mu\) is also called a Pareto’s index.

Fujiwara et al. (2003, 2004) showed that these power distributions (1)–(3) can be derived from the inverse symmetry observed between the variables of 1 year \(\left( x_1\right)\) and those of the next \(\left( x_2\right)\):

$$\begin{aligned} P_{12} \left( x_1,x_2 \right) dx_1 dx_2= P_{12} \left( x_2,x_1\right) dx_2 dx_1, \end{aligned}$$
(4)

and Gibrat’s law, where growth rate distribution \(Q\left( R|x_1 \right)\) conditioned by first-year size \(\left( x_1\right)\) does not depend on the first-year variables (Gibrat 1932; Sutton 1997):

$$\begin{aligned} Q\left( R|x_1 \right) =Q\left( R\right) . \end{aligned}$$
(5)

Here, \(x_1\) represents \(\left( K, L, Y\right)\) in 1 year, and \(x_2\) represents \(\left( K, L, Y\right)\) in the next year. \(P _ {12}\) is the joint PDF, and \(R = x_2/x_1\) is the growth rate. In inversion symmetry, note that joint PDF \(P_{12}\) has the same shape on both sides of Eq. (4) under the permutation of variables \(x_1 \leftrightarrow x_2\). The derivation by Fujiwara et al. is effective when the power-law index does not change. This derivation is described in Appendix 1.

We extended the argument by introducing an extended growth rate of \(R = x_2/{a x _ 1 ^ { \theta }}\) in quasi-inverse symmetry under the permutation of \(a x _ 1 ^ { \theta } \leftrightarrow x _ 2\) variables:

$$\begin{aligned} P_{12} \left( x_1,x_2 \right) dx_1 dx_2 = P_{12} \left( \left( \frac{x_2}{a}\right) ^{\frac{1}{\theta }}, a x_1^{\theta } \right) d \left( \left( \frac{x_2}{a}\right) ^{\frac{1}{\theta }} \right) d \left( a x_1^{\theta } \right) . \end{aligned}$$
(6)

We also showed that the power-law distribution can be derived from quasi-inverse symmetry (6) and Gibrat’s law (5) even when the power-law index varies quasi-statically. Here, a and \(\theta\) are parameters of quasi-inverse symmetry and are related with power exponents \(\left( \mu _1, \mu _2 \right)\) of variable \(\left( x_1, x_2 \right)\) as follows:

$$\begin{aligned} \theta = \frac{\mu _1}{\mu _2}. \end{aligned}$$
(7)

This derivation is described in Appendix 2.

First, we confirmed this relation by the temporal change of power-law index \(\mu\), which can be observed in Japanese land prices (Ishikawa 2006, 2009). We then showed that this quasi-inverse symmetry can be similarly observed among such different firm size variables as assets, number of employees, and sales \(\left( K, L, Y \right)\) at the same time and confirmed the relation (7) among their power-law indices (Mizuno et al. 2012; Ishikawa et al. 2013, 2014). In this case, we consider \(\left( x _ 1, x _ 2 \right)\) to be \(\left( K, L \right) , \left( K, Y \right)\), and \(\left( L, Y \right)\).

We extended the concept of quasi-inverse symmetry and the growth rate to three simultaneous variables \(\left( K, L, Y \right)\) and introduced growth rate \(R = Y/{a K ^ { \alpha } L ^ { \beta }}\) into a system with quasi-inverse symmetry under the variable substitution of \(a K ^ { \alpha } L ^ { \beta } \leftrightarrow Y\):

$$\begin{aligned}&P_{KLY} \left( K,L,Y \right) dK dL dY \nonumber \\&\quad = P_{KLY} \left( \left( \frac{Y}{a L^{\beta }}\right) ^{\frac{1}{\alpha }}, \left( \frac{Y}{a K^{\beta }}\right) ^{\frac{1}{\beta }}, a K^{\alpha } L^{\beta } \right) \nonumber \\&\qquad d \left( \left( \frac{Y}{a L^{\beta }}\right) ^{\frac{1}{\alpha }} \right) d \left( \left( \frac{Y}{a K^{\alpha }}\right) ^{\frac{1}{\beta }} \right) d \left( a K^{\alpha } L^{\beta } \right) . \end{aligned}$$
(8)

The quasi-inverse symmetry (8) of the three variables and Gibrat’s law of the three variables:

$$\begin{aligned} Q\left( R|K,L\right) =Q\left( R\right) \end{aligned}$$
(9)

lead to power-law distributions of K, L, and Y, as in the case of two variables. The derivation of this conclusion is described below.

Note here that from the definition of the three-variable growth rate, if total factor productivity A is residual aR from the following quasi-inverse symmetric plane:

$$\begin{aligned} \log Y = \alpha \log K+ \beta \log L + \log a , \end{aligned}$$
(10)

then we can interpret the Cobb–Douglas production function as the quasi-inverse symmetry in the three-variable space \(\left( K, L, Y \right)\). In the course of its derivation, we discussed the following in our previous work (Ishikawa et al. 2014).

Three-variable quasi-inverse symmetry (8) is rewritten in three variables \(\left( K, L, R \right)\) as follows:

$$\begin{aligned}&P_{KLR} \left( K,L,R\right) dK dL dR \nonumber \\&= P_{KLR} \left( R^{\frac{1}{\alpha }} K, R^{\frac{1}{\beta }} L, R^{-1} \right) d \left( R^{\frac{1}{\alpha }} K \right) d \left( R^{\frac{1}{\beta }} L \right) d ( R^{-1} ). \end{aligned}$$
(11)

This idea can be expressed as follows:

$$\begin{aligned} P_{KLR} \left( K,L,R\right) = R^{\frac{1}{\alpha }+\frac{1}{\beta }-2} P_{KLR} \left( R^\frac{1}{\alpha } K, R^\frac{1}{\beta } L, R^{-1} \right) . \end{aligned}$$
(12)

From the definition of a conditional PDF, \(Q\left( R|K,L\right) =P_{KLR} \left( K,L,R\right) /P_{KL} \left( K,L\right)\), this equation can be modified:

$$\begin{aligned} \frac{P_{KL} \left( K,L\right) }{P_{KL} \left( R^\frac{1}{\alpha } K, R^\frac{1}{\beta } L \right) }= & {} R^{\frac{1}{\alpha }+\frac{1}{\beta }-2} \frac{Q\left( R^{-1}| R^\frac{1}{\alpha } K, R^\frac{1}{\beta } L\right) }{Q\left( R|K,L\right) } \nonumber \\= & {} R^{\frac{1}{\alpha }+\frac{1}{\beta }-2} \frac{Q\left( R^{-1}\right) }{Q\left( R\right) }. \end{aligned}$$
(13)

In the modification from the 2nd to the 3rd equation, we used a three-variable Gibrat’s law (9). Since the right-hand side of Eq. (13) is only a function of R, if we express it as \(G \left( R \right)\), the expression (13) can be written:

$$\begin{aligned} P_{KL} \left( K,L\right) =G\left( R\right) P_{KL} \left( R^\frac{1}{\alpha } K, R^\frac{1}{\beta } L \right) . \end{aligned}$$
(14)

Expanding the right-hand side of Eq. (14) by \(R = 1 + \epsilon ~ ~ \left( 0 < \epsilon \ll 1 \right)\), the zeroth order of \(\epsilon\) is an obvious expression, and the first order becomes the following differential equation:

$$\begin{aligned} \left[ G^{'} \left( 1\right) + \frac{K}{\alpha } \frac{\partial }{\partial K} + \frac{L}{\beta } \frac{\partial }{\partial L} \right] P_{KL} \left( K,L\right) =0. \end{aligned}$$
(15)

Here, \(G ^ {'} \left( \cdot \right)\) represents the differential of \(G \left( \cdot \right)\) by R. Also, no more useful information is available than the 2nd-order of \(\epsilon\).

The two variables \(\left( K, L \right)\) are strongly correlated and are not independent. Therefore, the differential equation (15) cannot be solved as it is. In our previous study (Ishikawa et al. 2014), we converted variables \(\left( K, L \right)\) into normalized variables \(\left( k, l \right)\):

$$\begin{aligned} \log k= \frac{\log K- m_K}{\sigma _K} ,\quad \log l= \frac{\log L- m_L}{\sigma _L}, \end{aligned}$$
(16)

and rotated them as \(- \pi /4\) and finally obtained orthogonal independent variables \(\left( z _ 1, z _ 2 \right)\):

$$\begin{aligned} \log z_1 = \frac{1}{\sqrt{2}} \left( \log k + \log l \right) , \quad \log z_2 = \frac{1}{\sqrt{2}} \left( - \log k + \log l \right) . \end{aligned}$$
(17)

Here, \(\left( m_K,m_L \right)\) and \(\left( \sigma _K,\sigma _L \right)\) are the mean and standard deviation of \(\left( \log K, \log L\right)\) in the power-law region of variables \(\left( K,L\right)\). In Eq. (16), the distribution’s width is normalized by dividing it by \(\left( \sigma _ K, \sigma _ L \right)\) after subtracting the average \(\left( m _ K, m _ L \right)\) from \(\left( \log K, \log L \right)\). In our previous study (Ishikawa et al. 2014), we numerically showed that this operation yields orthogonal independent variables \(\left( z _ 1, z _ 2 \right)\). We rewrote the differential equation (15) with the independent variables \(\left( z _ 1, z _ 2 \right)\) and solved it to analytically show that variable \(\left( z _ 1, z _ 2 \right)\) obeys the power-law distribution and numerically confirmed the result in empirical data.

Finally, we analytically derived the distribution function followed by variable \(\left( K, L \right)\) by the above inverse transformation and confirmed it numerically. This discussion is based on the fact that the Cobb–Douglas production function is a quasi-inverse symmetric plane in \(\left( K, L, Y \right)\) space. Therefore, the above numerical consistency indirectly confirms our interpretation of the Cobb–Douglas production function.

4 Analytical preparation for direct observation of the Cobb–Douglas quasi-inverse symmetric plane

In the previous section, variables \(\left( K, L \right)\) were normalized by dividing them by logarithmic standard deviations, which are indicators of the width of the distribution in the large-scale range where the distribution follows the power law. On the other hand, the width of the distribution of the variables following the power-law distribution of exponent \(\mu\) is also given as \(1/\mu\) on a logarithmic scale. Thus, the normalization of \(\left( \log K, \log L \right)\) divided by distribution width \(\left( \sigma _ K, \sigma _ L \right)\) in Eq. (16) is equivalent to the operation of dividing by \(\left( 1/\mu _ K, 1/\mu _ L \right)\) or multiplying by \(\left( \mu _ K, \mu _ L \right)\):

$$\begin{aligned} \log k= \mu _K \left( \log K - \log K_{\mathrm{up}} \right) ,\quad \log l= \mu _L \left( \log L - \log L_{\mathrm{up}} \right) . \end{aligned}$$
(18)

Here, the origin is shifted by subtracting the logarithm of the power-law ranges’ upper limits \((\log K_{\mathrm{up}}, \log L_{\mathrm{up}})\).

We first consider the quasi-inverse symmetry of two variables \(\left( K, L \right)\). Because the normalized widths of the k and l distributions are equal, a two-variable quasi-inverse symmetric line, \(\log L = \theta \log K + \log a\) (where \(x _ 1 = K, x _ 2 = L\)), is converted to a slope 1 inverse symmetric line: \(\log l = \log k + \log a '\). This is consistent with Eq. (7) where the relationship between slope \(\theta\) of the quasi-inverse symmetric line and exponents \(\mu _K\) and \(\mu _L\) is given:

$$\begin{aligned} \frac{1}{\mu _L} = \theta \frac{1}{\mu _K}. \end{aligned}$$
(19)

Here \(\mu _ 1 = \mu _ K\), and \(\mu _ 2 = \mu _ L\).

Extending this concept to three variables \(\left( K, L, Y \right)\), the following relationship is established between the capital and labor share (\(\alpha\), \(\beta\)) of the three-variable quasi-inverse symmetric plane (10) and the power-law exponents \(\left( \mu _ K, \mu _ L, \mu _ Y \right)\):

$$\begin{aligned} \frac{1}{\mu _Y} = \alpha \frac{1}{\mu _K} + \beta \frac{1}{\mu _L}. \end{aligned}$$
(20)

This is an extension of the constant returns to scale:

$$\begin{aligned} 1 = \alpha + \beta . \end{aligned}$$
(21)

Using the following normalized notation:

$$\begin{aligned} \alpha ' = \alpha \frac{\mu _Y}{\mu _K}, \quad \beta ' = \beta \frac{\mu _Y}{\mu _L}. \end{aligned}$$
(22)

Equation (20) is simply expressed as:

$$\begin{aligned} 1 = \alpha ' + \beta '. \end{aligned}$$
(23)

In Fig. 1, we compared the accuracy of Eqs. (21) and (23) using \({ \alpha } _ M\), \({ \beta } _ M\), \({ \alpha '} _ M\), and \({ \beta '} _ M\) evaluated by multiple regression analysis used in economics in Japanese firms from 2010 to 2014. The database that we used was created using a collection method that reduced the number of firms in the past. Therefore, Fig. 1 shows data from 2014 to the previous five years. Data for 2015 were not included, because they were still being collected when we obtained the database. From Fig. 1, we can confirm that the accuracy of the constant returns to scale, normalized by the power-law index (23), and is higher than the conventional one (21). Results similar to those of Japanese firms can be confirmed for firms in the countries that we analyzed.

Fig. 1
figure 1

Comparison of constant returns to scale (\(\alpha + \beta = 1\)) of Japanese firms from 2014 to 2000 and standardized one (\(\alpha '+ \beta ' = 1\))

More interestingly, the normalized capital share (capital elasticity) \(\alpha '\) and the labor share (labor elasticity) \(\beta '\) can be directly estimated by observing the Cobb–Douglas symmetric plane, as described below.

In the previous section’s discussion, we rewrote the quasi-inverse symmetry of the \(\left( K, L, Y \right)\) space to the quasi-inverse one of the \(\left( K, L, R \right)\) space and limited the discussion to the power-law region of \(\left( K, L \right)\) where Gibrat’s law holds, and derived a differential equation for \(\left( K, L \right)\) by expanding growth rate R around “1”. In this discussion, we only investigate the neighborhood of the quasi-inverse symmetric plane. To directly observe the Cobb–Douglas symmetric plane, the entire space of the three variables must be observed without adopting this method.

Thus, in addition to Eq. (18), we consider the standardization of Y:

$$\begin{aligned} \log y= \mu _Y ( \log Y - \log Y_{\mathrm{up}} ). \end{aligned}$$
(24)

For the two variables \(\left( K, L \right)\), the direction vector of the quasi-inverse symmetric axis is \(\left( 1/\mu _K, 1/\mu _L \right)\), and the normalized direction vector is \(\left( 1,1 \right)\) in two-variable space (kl). Using Eq. (17), we rotated it to be \(\left( \sqrt{2}, 0 \right)\), resulting in independent variables \(\left( z _ 1, z _ 2 \right)\). For the three variables \(\left( K, L, Y \right)\), the densest direction vector in the quasi-inverse symmetric plane is \(\left( 1/\mu _K, 1/\mu _L, 1/\mu _Y \right)\). The rotation of normalized direction vector \(\left( 1,1,1 \right)\) in three-variable space (kly) to \(\left( \sqrt{3}, 0,0 \right)\) is given by:

$$\begin{aligned}&\log z_1 = \frac{1}{\sqrt{3}} \left( \log k + \log l + \log y \right) , \end{aligned}$$
(25)
$$\begin{aligned}&\log z_2 = \frac{1}{\sqrt{2}} \left( - \log k + \log l \right) , \end{aligned}$$
(26)
$$\begin{aligned}&\log z_3 = \frac{1}{\sqrt{6}} \left( - \log k - \log l + 2 \log y \right) . \end{aligned}$$
(27)

This rotation caused the densest directional vector in the Cobb–Douglas quasi-inverse symmetric plane to overlap the \(z _ 1\) axis, as shown in Fig. 2.

Fig. 2
figure 2

The rotation (25)–(27) causes the densest directional vector in the Cobb–Douglas quasi-inverse symmetric plane to overlap the \(z _ 1\)-axis

Fig. 3
figure 3

The rotation (28), (29) aligns a Cobb–Douglas quasi-inverse symmetric plane with the \(w _ 3 =\) Const. plane. For clarity, the figure is written as Const. \(= 0\)

To align a Cobb–Douglas quasi-inverse symmetric plane with the \(w _ 3 = Const.\) plane, we need to rotate it further about the \(z _ 1\) axis. The rotation, expressed using parameter \(\psi\), is as follows (see Fig. 3):

$$\begin{aligned}&\log w_2 = \frac{1}{\sqrt{\psi ^2 + 1}} \left( \log z_2 + \psi \log z_3 \right) , \end{aligned}$$
(28)
$$\begin{aligned}&\log w_3 = \frac{1}{\sqrt{\psi ^2 + 1}} \left( - \psi \log z_2 + \log z_3 \right) . \end{aligned}$$
(29)

Due to the above rotations in Eqs. (25)–(29), the Cobb–Douglas quasi-inverse symmetric plane overlaps the \(w _ 3 =\) Const. plane. Here, if the \(w _ 3 =\) Const. plane is expressed by the quasi-inverse symmetric plane in the \(\left( k, l, y \right)\) space by inverse transformation, it can be expressed:

$$\begin{aligned} \log y = \frac{1 - \sqrt{3} \psi }{2} \log k + \frac{1 + \sqrt{3} \psi }{2} \log l + {\text {Const.}} \end{aligned}$$
(30)

From the above equation, using parameter \(\psi\) that represents the slope of the Cobb–Douglas quasi-inverse symmetric plane, normalized capital share (capital elasticity) \(\alpha '\) and labor share (labor elasticity) \(\beta '\) can be expressed:

$$\begin{aligned} \alpha ' = \frac{1 - \sqrt{3} \psi }{2}, \quad \beta ' = \frac{1 + \sqrt{3} \psi }{2}. \end{aligned}$$
(31)

5 Direct observation of the Cobb–Douglas quasi-inverse symmetric plane

In the previous section, we analytically discussed the following procedures to simplify observing the Cobb–Douglas quasi-inverse symmetric plane in three-variable space \(\left( K, L, Y \right)\). First, we converted three variables \(\left( K, L, Y \right)\) into variables \(\left( k, l, y \right)\), which are normalized, such as the values of the power-law exponents in the power-law region to be “1”. Next, we consider a rotation (25)–(27) that converts the densest direction vector \(\left( 1,1,1 \right)\) in a Cobb–Douglas quasi-inverse symmetric plane into a vector of only the first component and obtain variables \(\left( z _ 1, z _ 2, z _ 3 \right)\).

If 3D data are projected onto the \(z _ 1z _ 2\) plane in the \(z _ 3\) direction, then the densest direction vector should overlap the \(z _ 1\) axis. If 3D data are projected onto the \(z _ 1z _ 3\) plane in the \(z _ 2\) direction, the densest direction vector should still overlap the \(z _ 1\) axis. When 3D data are then projected onto the \(z _ 2z _ 3\) plane in the \(z _ 1\) direction, the densest direction vector is observed, and its slope is given as \(\psi\). In this section, we first confirm the above analytical discussions by directly observing empirical data. In direct observation, identifying the densest directional vector in a 2D plane is critical. We observed a dense directional vector as a ridge using the geographical index of surface openness, as in previous studies (Ishikawa et al. 2014). The method is briefly described below.

\(z_1\) and \(z_2\) data points are scattered in the \(z_1 z_2\)-plane. For example, Fig. 4 is a scatter plot of Japanese firms in 2014. To clearly comprehend the density, we divided it into logarithmically equal-sized cells and expressed the amount of data points in the cells by different shades. For example, Fig. 5 is a diagram of Japanese firms in 2014 created in this way. The logarithm of a cell’s density is its height. Then, as stated above, the ridge must be observed horizontally in the \(z_1 z_2\) plane. As the steepest-ascent line in the profit space, a ridge was previously discussed (Souma 2007; Aoyama et al. 2008). In our previous study (Ishikawa et al. 2014), we determined the cells that constitute the ridge using the surface openness defined as follows (Yokoyama et al. 1999, 2002; Prima et al. 2006).

Figure 6 depicts the grid linked by the center points of the cells. From grid point A in Fig. 6, we counterclockwisely represent each azimuth as \(D=1, 2, \ldots , 8\). As shown in Fig. 7, the minimum zenith and nadir angles at grid point A within distance L in azimuth D are represented by \(_D \phi _L\) and \(_D \psi _L\). Positive openness \(\varPhi _L\) is defined by the mean value of \(_D \phi _L\) along the eight azimuths, and negative openness \(\varPsi _L\) is the corresponding mean of \(_D \psi _L\). The surface openness is defined by the following difference:

$$\begin{aligned} \varPhi _L - \varPsi _L = \frac{1}{8} \sum ^8_{D=1} {}_D \phi _L - \frac{1}{8} \sum ^8_{D=1} {}_D \psi _L. \end{aligned}$$
(32)

The surface openness takes a negative value at the depressions and the valleys, zero at the level surface, the saddle point, and the uniform slope, and positive values at the ridge (see Fig. 8) and the summit (see Fig. 9). The shading in Fig. 5 shows only absolutely high and low points. However, using the surface openness, points that are relatively higher than the circumference (a ridge or a summit) or points that are relatively lower than the circumference (a valley or a depression) can be easily extracted. This is the advantage of the surface openness.

In this analysis, by setting \(L=5\), we estimate the surface openness for each cell and extract the cells of the openness that exceed 0.9, the value of which appropriately determines the ridge in this data set. In Fig. 5, the cells are expressed by black dots. As expected, the series of densest cells observed as a ridge overlap the \(z _ 1\) axis.

Fig. 4
figure 4

Scatter plot of \(\left( z _ 1, z _ 2 \right)\) with Japanese firm data for 2014 projected on \(z _ 1z _ 2\)-plane in \(z _ 3\) direction

Fig. 5
figure 5

Data points in scatter plot of Fig. 2 are placed in cells separated at equal logarithmic intervals, and amount of data in each cell is represented by shading. In addition, cell’s center, judged to form a ridge by surface openness, is indicated by black circles

Fig. 6
figure 6

Grid linked by center points of cells: From a grid point, we counterclockwisely represent each azimuth as \(D=1, 2, \ldots , 8\)

Fig. 7
figure 7

Dots represent height of cells within distance L in azimuth D. From grid point A within distance L, we estimate zenith angles and denote minimum as \({}_D \phi _L\). Similarly, minimum nadir angle is expressed by \({}_D \psi _L\)

Fig. 8
figure 8

A schematic illustration of a ridge. In the ridge indicated by the thick lines, the surface openness is positive, because the surrounding area is often lower than the height of the ridge

Fig. 9
figure 9

A schematic illustration of a summit. At the summit, the surface openness is positive, because all the surroundings are lower than the summit

A scatter plot in which 3D data are projected onto the \(z _ 1z _ 3\) plane in the \(z _ 2\) direction is similarly processed to obtain Fig. 10, which also shows that, as expected, the series of densest cells observed as a ridge overlap the \(z _ 1\)-axis.

Fig. 10
figure 10

Japanese firms’ data for 2014 in database are projected on \(z _ 1z _ 3\) plane in \(z _ 2\)-direction. \(\left( z _ 1, z _ 3 \right)\) data points are placed in logarithmically evenly spaced cells, and amount of data is represented by shading. In addition, cell’s center, judged to form a ridge by surface openness, is indicated by black circles

A scatter plot in which 3D data are projected onto the \(z _ 2z _ 3\) plane in the \(z _ 1\) direction is similarly processed to obtain Fig. 11, which shows that the ridge, which is a series of the densest cells, is located from the upper right to the lower left. The angle between this ridge and the \(z _ 2\) axis is \(\psi\), which we measured, assuming that a ridge is a straight line that regresses the center of cells with a surface openness greater than 0.9, which was appropriately determined to be a ridge cell. For example, for Japanese firms in 2014, \(\psi = 0.39 \pm 0.04\). In this case, using Eq. (31), we can estimate \({ \alpha '} _ R = 0.16 \pm 0.03\) and \({ \beta '} _ R = 0.84 \pm 0.16\). On the other hand, with the same data and multiple regression analysis, we also estimate \({ \alpha '} _ M = 0.16 \pm 0.0\) and \({ \beta '} _ M = 0.84 \pm 0.0\); both values are in good agreement. Figure 12 shows measurements taken by Japanese firms from 2014 to 2010 and by firms in Germany, France, Spain, Italy, and The Netherlands during the 5-year period for which data were available (from 2013 to 2009). Among the measurements, those in the year with the largest number of firms in each country are plotted in Fig. 13.

From Figs. 12 and 13, we confirmed that capital elasticity \({ \alpha _ R} '\), measured by the ridge specified by the surface openness and capital elasticity \({ \alpha _ M}'\) measured by the multiple regression analysis used in economics, are in good agreement with the 5-year firm data of the seven countries.

Fig. 11
figure 11

Japanese firms’ data for 2014 in database are projected on \(z_2 z_3\) plane in \(z_1\)-direction. \(\left( z_2, z_3 \right)\) data points are placed in logarithmically evenly spaced cells, and data amount is represented by shading. In addition, cell’s center, which is judged to form a ridge by surface openness, is indicated by black circles

Fig. 12
figure 12

Normalized capital elasticity (\({ \alpha _ M} '\)), calculated by multiple regression analysis for firms of each country in database of Japan, Italy, the UK, and The Netherlands from 2010 to 2014 and France, Germany, and Spain from 2009 to 2013, is compared with normalized capital elasticity (\({ \alpha _ R} '\)) calculated from ridge judged by surface openness

Fig. 13
figure 13

Comparison between \({ \alpha _ M} '\) and \({ \alpha _ R}'\)in Fig. 8 for Japan, Italy, the UK, and The Netherlands in 2014 and France, Germany, and Spain in 2013

6 Conclusion and discussion

Our previous study argued that the Cobb–Douglas production function, which is the core concept of economics, uniquely derived power-law distributions of its variables. In this study, we show that the Cobb–Douglas production function can be directly observed using empirical data. If firms’ assets, labor, and production are expressed as a set of points in a 3D space, the Cobb–Douglas production function can be interpreted as a quasi-inverse symmetric plane in 3D space and a residual from it. From this viewpoint, we rotated the Cobb–Douglas quasi-inverse symmetric plane in 3D space to simplify observation and projected the data in 3D space onto the 2D plane. With the geographic index of surface openness, we identified the densest axis in the plane as the ridge.

Identifying this axis is difficult when the dispersion is large in the 2D plane. For example, by dividing the data on a 2D plane into equally spaced bins in a certain direction and calculating the average value or the logarithmic average value in the vertical direction for each bin, we can combine them and find the axis. However, since the average value or the logarithmic average value varies depending on the selection method in a certain direction, the axis obtained by combining the average value or the logarithmic average value is not fixed to one axis. What is not affected by such arbitrariness is the advantage of the index of surface openness adopted in this paper. Using it, we successfully and directly observed the Cobb–Douglas production function as a quasi-inverse symmetric plane.

The approach proposed by Hildenbrnad (1981) and developed by Dosi et al. (2016) shares our arguments and perspectives on firms’ productivity, but uses a very different approach to measuring it. They theoretically showed that measurement of productivity is possible without depending on the shape of the production function by the volume of a polyhedron composed of points in 3D space and the angle formed with the axis by the main diagonal, and confirmed it by actual data. Their approach to measuring productivity is very innovative. The advantage of our study, on the other hand, is that it geometrically identifies the form of the production function that they avoided. This makes it possible to discuss specifically how production relates to assets and labor as follows.

Interestingly, the capital and labor elasticities of the Cobb–Douglas production function can be expressed using the slope of a quasi-inverse symmetric plane in 3D space. By comparing the measured values of the capital elasticity observed in this way with those by the multiple regression analysis conventionally used in economics, we showed that the accuracy of these measured values agrees with the accuracy using the data for 5 years of seven countries with sufficient data quantity, including Japan.

It is not obvious that the planes of the multiple regression analysis and the quasi-inverse symmetry become identical. If all the data in the 3D space exist on a plane, they coincide. In practice, however, the data are widely dispersed. If they are spread vertically about the plane based on a log-normal distribution and are not concentrated on the symmetry plane, clearly identifying the ridge is difficult where the data are concentrated, even if with quasi-inverse symmetry. This paper clarified that such a ridge exists in empirical data using the index of surface openness in 2D distribution in which 3D distribution is projected.

The fact that these observations are consistent in the seven countries with sufficient data means that the 3D space composed of capital, labor, and production has a ridge where the data are concentrated as a distinct entity. This idea can probably be applied to countries other than the seven countries surveyed in this paper. Therefore, if sufficient data exist, the results in this paper can be reproduced. As a result, we clarified from empirical data that the Cobb–Douglas production function, which has been discussed in economics, can be interpreted as the quasi-inverse symmetry of three variables and the residual from it. This study theoretically clarified why the Cobb–Douglas production function fits the empirical data better than a simple comparison of candidate production functions due to the goodness of fit of the data.

In the process, we also carried out the following new consideration on the constant returns to scale, which is a feature of the Cobb–Douglas production function. In economics, when the variables of the production function (K and L in this case) are multiplied by \(\lambda\) and production \(\left( Y\right)\) is also multiplied by \(\lambda\) such as \(\alpha + \beta =1\), the production function is called the constant returns to scale. The increasing returns if it exceeds \(\lambda\) times such as \(\alpha + \beta >1\), and the diminishing returns if it is less than \(\lambda\) times, such as \(\alpha + \beta <1\). These three cases of returns to scale are easy to understand by considering the following simple examples. When the number of factories is multiplied by \(\lambda\), the assets and labor of a firm are each multiplied by \(\lambda\) if the increased factories are the same size. The case where there is no change in the production efficiency and the production quantity becomes \(\lambda\) times is called the constant returns to scale; the case where the production efficiency improves and the production quantity becomes larger than \(\lambda\) times is called the increasing returns; the case where the production efficiency deteriorates and the production quantity becomes smaller than \(\lambda\) times is called the diminishing returns. Economic arguments often assume constant returns to scale in the production function, and in fact, this is often observed approximately in various empirical data analyses.

These observations indicate that, on average, a constant return to scale is achieved in firm production. In this study, we showed by direct observation that the Cobb–Douglas production function can be understood as a quasi-inverse symmetric plane in the space created by three variables \(\left( K, L, Y \right)\) and the residual from it. We concluded that the constant returns to scale is strictly satisfied with \(\alpha '\) and \(\beta '\) normalized by the power-law indices. Since the values of power-law exponents \(\mu _ K\), \(\mu _ L\), and \(\mu _ Y\) are all close to “1”, the constant returns to scale was approximately observed even in the non-standardized capital share (capital elasticity) \(\alpha\) and labor share (labor elasticity) \(\beta\).

In this study, as in our previous study, we did not limit the data area in which the Cobb–Douglas production function was observed to the large-scale region where the power law holds. As our previous study showed, there is also quasi-inverse symmetry in the mid-scale region. The collapse of the power-law distribution observed in the large-scale region reflects that Gibrat’s law does not hold in the mid-scale region. This concept is independent of quasi-inverse symmetry. On the other hand, serious questions remain about the completeness of the data in the small scale. The current study did not exclude small data. If it can be implemented in a reliable way, the numerical consistency between our proposed method and multiple regression analysis may be higher. This is a future issue.

In our method, a firm’s assets (K),  number of employees (L),  and production (Y) are regarded as one point in three-variable space (KLY),  and we focus on how those several million groups are distributed. This is an approach to microfoundations that attempts to derive the macroscopic properties of their distribution from the microscopic objects of individual points. This is an important issue that has been studied in economics for many years. Finally, we describe the relationship between our discussion and Aoki et al., a pioneering study as a physical approach to this problem.

Aoki and Yoshikawa hypothesized a multisystem in which the total number of employees and the total amount of production of a company are fixed, and found that the randomness (entropy) is greatest in a society in equilibrium. In that case, they used physics methodologies to claim that the average number of employees in firms follows the Boltzmann distribution (Aoki and Yoshikawa 2007). Aoyama et al. noticed that there is actually a large fluctuation in the total amount of production and incorporated the effect by superimposing the Boltzmann distribution. As a result, it was theoretically shown that labor productivity of (Y/L) follows the power-law distribution in the large-scale range and the Boltzmann distribution with negative temperature in the middle- and low-scale ranges, and this was confirmed by the empirical data (Aoyama et al. 2009, 2009, 2010, 2015, 2017; Souma et al. 2009; Iyetomi 2012).

This can be described geometrically as follows. First, consider the 2D plane of the logarithmic axes of labor productivity (Y/L) on the horizontal axis and number of employees (L) on the vertical axis. The horizontal axis is divided into logarithmically equal-sized bins, and the vertical average of the data in the bins is plotted. The distribution followed by these points can be described by superposition of the Boltzmann distribution.

In this paper, we show the following. Total factor productivity \((Y/{K ^ { \alpha } L ^ { \beta }})\) is divided into logarithmically equal-sized bins, and the distribution of the number of data in the bins is observed. We can determine \(( \alpha , \beta , a)\), so that the distribution is symmetric with respect to a constant a. Thus, our observations differ from those of Aoki et al. To unite the two, we need to extend the discussion of Aoyama et al. to total factor productivity, including capital (K) . If this can be carried out, it will be possible to discuss the distribution of productivity that firms’ capital as well as labor, and to analyze firms in various industries in terms of both labor and capital. This is a major challenge for the future.